We can get a computer to tag the birds, answer questions about them, and generate new pictures of them.

  • @DannyMac@lemmy.world
    link
    fedilink
    9310 months ago

    Did you miss the part that said “I’ll need a research team and 5 years?” The XKCD character did it! SUCCESS!

  • @Zetaphor@zemmy.cc
    link
    fedilink
    English
    6110 months ago

    I have a book on learning Pytorch, this XKCD is in the first chapter and implementing this is the first code practice. It’s amazing how things progress.

  • @Zeth0s@lemmy.world
    link
    fedilink
    5610 months ago

    More than 5 years then. The comic was right, with the difference that it took more than 5 people to solve it

  • obosob
    link
    fedilink
    English
    3610 months ago

    Even with AI models that can identify that there are birds in the picture. Having it decide with accuracy that the picture is of a bird is still a hard problem.

  • @modulojs@programming.dev
    link
    fedilink
    34
    edit-2
    10 months ago

    I remember this one. It seems as spot on now as it was then, IMO. It’s not trying to say that object detection is magic or impossible, since it was totally possible then as well. It just requires a dedicated team + time + money to pay them, which is what this comic was trying to express. It is true there are more off-the-shelf software available for newer programmers now than there was before, so dev time is shorter, but that’s more just degrees of comfort / budget as opposed to anything fundamentally different.

    • @tvbusy@lemmy.dbzer0.com
      link
      fedilink
      English
      610 months ago

      It could have been the other way around if global positioning systems were either not developed or used only by the military. In that case, detecting scenery of a park could be easier than trying to figure out the position on the map.

      Or it could just be that maps data are not shared. You’ll need to hire boats and hire people to go and draw the map.

    • @satrunalia44@lemmy.world
      link
      fedilink
      510 months ago

      That’s true, even if the specific example doesn’t hold, the core concept does. If I needed to implement a bird detector today, I’d make an API call to AWS Rekognition or an equivalent service. It would take me a day or two to learn the API and then maybe 4 hours to actually implement. But if you asked me to implement a bird species detector, I’m pretty sure there is no off the shelf API capable of that, and I would indeed need months or years.

  • GamesRevolution
    link
    fedilink
    3210 months ago

    It’s actually even more correct because it underestimated the time needed by 5 years

  • @uskok@lemmy.world
    link
    fedilink
    2510 months ago

    Why do you think it’s obsolete? I suppose nowadays we can use AI generative models to explain the difference between the easy and the virtually impossible, but it still can be hard.

  • @vzq
    link
    2210 months ago

    I used to put this in my object detection presentations 5 years ago and it never failed to draw chuckles from the audience.

    Shit has been going really really fast.

  • @Poik@pawb.social
    link
    fedilink
    29 months ago

    Not only is this not obsolete, it’s close to biographical as it closely references the first and second Artificial Intelligence Winters. The first being in the 60s. We’ve been working on these for a long time, so 5 years is short. It took until GPGPU to kick into full gear and some clever insights to get Deep Learning up and running (somewhat attributed to work published in 2011) to start reliably on this problem, and even that is an oversimplification of the timeline and the scope.

    Others have mentioned oddities like the difficulty of subject matter (picture contains a bird vs picture of a bird) but there are a lot harder problems that are trivial to humans and counterintuitively incredibly hard for computers.