Activity - How many times do you think the same data appears after a model has as many...

j4k3, 1 year ago (edited 1 year ago)

How many times do you think the same data appears once a model is trained on as many datasets as OpenAI is using now? Even if it is unintentional, some overlap is inevitable. I expect something like data about OpenAI researchers to recur many times. If nothing else, redundant content repeated across foreign languages could cause overtraining. Most of the data is likely machine curated at best.
