•  online   ( @online@lemmy.ml ) 
      English
      14 months ago

      Look at the issues and you will notice it only works on comments visible from the profile page, and not all comments are visible there. It appears that someone wrote a Python script to solve this problem, but you need an API key to use it.

  • This is the best summary I could come up with:


    Reddit will let “an unnamed large AI company” have access to its user-generated content platform in a new licensing deal, according to Bloomberg yesterday.

    The deal, “worth about $60 million on an annualized basis,” the outlet writes, could still change as the company’s plans to go public are still in the works.

    The news also follows an October story that Reddit had threatened to cut off Google and Bing’s search crawlers if it couldn’t make a training data deal with AI companies.

    Last year, it successfully stonewalled its way out of the biggest protest in its history after changes to its third-party API access pricing caused developers of the most popular Reddit apps to shut down.

    As Bloomberg writes, Reddit’s year-over-year revenue was up by 20 percent by the end of 2023, but it was still $200 million shy of a $1 billion target it had set two years prior.

    The company was reportedly advised to seek a $5 billion valuation when it opens up for public investment, which is expected to happen in March.


    The original article contains 346 words, the summary contains 175 words. Saved 49%. I’m a bot and I’m open source!

        •  LWD   ( @LWD@lemm.ee ) 
          34 months ago

          In theory, yes, but instances don’t ship with the ability to do that. There would need to be a change to the Lemmy code base if such a thing were to be seriously implemented.

          I’m no federation expert, so I can’t really comment on whether doing something like requiring API keys would be feasible, unfortunately.

  •  zeluko   ( @zeluko@kbin.social ) 
    3
    edit-2
    4 months ago

    I don’t see why someone would need this deal anyway… most of it is already available, and most of the new stuff probably is too, even without API access.
    I also expect the fediverse to be crawled and used for training. That’s just the thing about publicly available stuff: it gets used, whether we like it or not…

  • Ah, more glue on pizza incoming. Personally, I don’t understand taking Reddit posts as a source for LLM training. It’s like they never visited Reddit and think that all posts and comments are true, or even useful. Depending on the sub, sarcasm can account for anywhere from 5% to 100% of the content.