Reddit has struck a $60m deal with Google that lets the search giant train AI models on its posts

mesamune@lemmy.world · 7 months ago

Reddit has struck a $60m deal with Google that lets the search giant train AI models on its posts

bobburger@fedia.io · 7 months ago

To be fair it’s a pretty terrible dataset. The AI is just going to say “this” to every question you ask

rarkgrames@lemmy.world · 7 months ago

This.

Altima NEO@lemmy.zip · 7 months ago

and “and my axe”

Shimon@slrpnk.net · 7 months ago

and “rock and stone”

AwkwardLookMonkeyPuppet@lemmy.world · 7 months ago

"Reject humanity. Return to monke.

InFerNo@lemmy.ml · 7 months ago

& Knuckles, featuring Dante from the Devil May Cry series

lol@discuss.tchncs.de · edit-2 7 months ago

You’re exaggerating of course, but I don’t think it’s terrible at all; the opposite really. It’s likely incredibly useful for creating LLMs with specific knowledge or behavior.

The categorization into subreddits alone opens up so many possible applications. Imagine for example training a conversational AI with data from specific subreddits like science, askscience, biology, physics, astronomy,… or posts by users that frequent such subreddits in order to create sort of an academic AI.

You could do the same for all sorts of topics: Want a sports commentator AI, use sports related subreddits; an AI that supports you in writing a novel, use creative writing subreddits etc. Don’t want your AI to spew political opinions, exclude political subreddits from your data; don’t want it to use offensive language, only use well-moderated subreddits etc.

Adderbox76@lemmy.ca · 7 months ago

This presumes that Reddit is populated by so-called experts answering questions and posting in those subs.

But the vast overwhelming truth is that most people pretending to be experts are just regurgitating the answers they heard from another reddit post, and so on, and so on.

You might as well just train your AI on the “confidently incorrect” sub and call it a day.

MBM@lemmings.world · 7 months ago

It’s always an eye-opener when you look at an ELI5 thread where you’re actually knowledgeable about the topic

AwkwardLookMonkeyPuppet@lemmy.world · 7 months ago

Or “just Google it”.

GBU_28@lemm.ee · 7 months ago

Ai:

😭 I’m trying

captainlezbian@lemmy.world · 7 months ago

My heel turn as a mod back in the day was having automod remove lmgtfy links

brygphilomena@lemmy.world · 7 months ago

It was a weird day when I recently went to teach someone about lmgtfy and found the website dead. There are clones, but the original was so simple and great.

jkrtn@lemmy.ml · 7 months ago

Hey, now, be fair. There are some Top 40s song lyrics in there too.

OfCourseNot@fedia.io · 7 months ago

Yeah and Google already has everything scrapped and indexed