A new study shows that LLMs fed too much LLM-generated content eventually collapse. Essentially, AI-generated text is poison if it makes its way into an LLM's training data. If the model eats too much of this poison, the model dies. By replacing your Reddit comments with AI-generated text, you can...
As we are on the eve of rexxit: is there a "best" way to sabotage our posts?
I suppose I see two ways of achieving this: 1) a single AI response that we overwrite all our posts with; or 2) actually using an AI to "reply", i.e. a different AI-generated text for each post, emulating the answer a human would have given.
IMO, route 2 would be more time-consuming, but harder for Reddit to 'prevent' from degrading the dataset?
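To illustrate why route 1 might be the easier one to undo, here is a rough sketch. It assumes whoever prepares the dataset runs some kind of exact-duplicate filter; I don't know what Reddit or any LLM firm actually does, and the names `normalize`, `fingerprint` and `drop_repeated` are made up for this example.

```python
import hashlib
from collections import Counter


def normalize(text: str) -> str:
    # Lowercase and collapse whitespace so trivial variations hash the same.
    return " ".join(text.lower().split())


def fingerprint(text: str) -> str:
    # Hash of the normalized text; identical comments share a fingerprint.
    return hashlib.sha256(normalize(text).encode("utf-8")).hexdigest()


def drop_repeated(comments: list[str], max_copies: int = 1) -> list[str]:
    # Hypothetical filter: discard any text that appears more than max_copies times.
    counts = Counter(fingerprint(c) for c in comments)
    return [c for c in comments if counts[fingerprint(c)] <= max_copies]


# Route 1: every overwritten comment carries the same text.
route_1 = ["This comment was replaced by AI text."] * 4 + ["An original human comment."]

# Route 2: every overwritten comment carries a different generated text.
route_2 = [
    "Try reseating the RAM first.",
    "I had the same issue; a driver update fixed it.",
    "An original human comment.",
]

kept_1 = drop_repeated(route_1)  # the repeated replacement text is filtered out
kept_2 = drop_repeated(route_2)  # nothing repeats, so everything is kept
```

Under that assumption, a single shared replacement text disappears in one pass, while per-comment generated replies look like ordinary unique comments to this kind of filter.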
Overwriting Comments w/ AI Output Is the Quickest Way to Make Reddit's Data Useless to LLM Firms (arxiv.org)