We can get a computer to tag the birds, answer questions about them, and generate new pictures of them.
Well research teams have been working on it 10 years
they did good
edit: tbh honest i think it was around 5 years ago i started being able to identify things with google lens on my phone. they worked fast!
Goodtoknow ( @Goodtoknow@lemmy.ca ) 6•2 years agoAnd crazy how not only can tools recognize birds but generate novel new images of them.
zepplenzap ( @zepplenzap@lemmy.sdf.org ) 4•2 years agoExactly!
GamesRevolution ( @GamesRevolution@programming.dev ) 27•2 years agoIt’s actually even more correct because it underestimated the time needed by 5 years
modulojs ( @modulojs@programming.dev ) 24•2 years agoI remember this one. It seems as spot on now as it was then, IMO. It’s not trying to say that object detection is magic or impossible, since it was totally possible then as well. It just requires a dedicated team + time + money to pay them, which is what this comic was trying to express. It is true there are more off-the-shelf software available for newer programmers now than there was before, so dev time is shorter, but that’s more just degrees of comfort / budget as opposed to anything fundamentally different.
tvbusy ( @tvbusy@lemmy.dbzer0.com ) English4•2 years agoIt could have been the other way around if global positioning systems were either not developed or used only by the military. In that case, detecting scenery of a park could be easier than trying to figure out the position on the map.
Or it could just be that maps data are not shared. You’ll need to hire boats and hire people to go and draw the map.
transigence ( @transigence@kbin.social ) 24•2 years agoComputer vision was just popping off five years after that, so I would say that it is prescient.
obosob ( @obosob@feddit.uk ) English22•2 years agoEven with AI models that can identify that there are birds in the picture. Having it decide with accuracy that the picture is of a bird is still a hard problem.
Zetaphor ( @Zetaphor@zemmy.cc ) English20•2 years agoI have a book on learning Pytorch, this XKCD is in the first chapter and implementing this is the first code practice. It’s amazing how things progress.
oldfart ( @oldfart@lemm.ee ) 8•2 years agoDo you recommend that book? Title?
Zetaphor ( @Zetaphor@zemmy.cc ) English5•2 years agoYes I do! It’s a pretty great overview that isn’t extremely math heavy
The book is “Deep Learning for Coders with Fastai and PyTorch: AI Applications Without a PhD”
oldfart ( @oldfart@lemm.ee ) 1•2 years agoThanks! Not math heavy is good.
voxel ( @vox@sopuli.xyz ) 13•2 years ago… it’s still true…
vzq ( @vzq@lemmy.blahaj.zone ) 6•2 years agoI used to put this in my object detection presentations 5 years ago and it never failed to draw chuckles from the audience.
Shit has been going really really fast.
predmijat ( @predmijat@programming.dev ) 4•2 years agoOne of the coolest things I’ve seen this month: https://www.lerf.io/
TechNom (nobody) ( @technom@programming.dev ) English3•2 years agoEpilogue: She got what she asked for and completed the work 5 years ago.
Poik ( @Poik@pawb.social ) 2•2 years agoNot only is this not obsolete, it’s close to biographical as it closely references the first and second Artificial Intelligence Winters. The first being in the 60s. We’ve been working on these for a long time, so 5 years is short. It took until GPGPU to kick into full gear and some clever insights to get Deep Learning up and running (somewhat attributed to work published in 2011) to start reliably on this problem, and even that is an oversimplification of the timeline and the scope.
Others have mentioned oddities like the difficulty of subject matter (picture contains a bird vs picture of a bird) but there are a lot harder problems that are trivial to humans and counterintuitively incredibly hard for computers.
JackbyDev ( @JackbyDev@programming.dev ) English1•2 years agoI talk about this a lot. It’s a conspiracy theory of mine that this comic spurred the AI image tech we have today.