- Sibbo ( @Sibbo@sopuli.xyz ) English23•1 year ago
“Chatbot trained on conversations of real people acts like real people” Sounds like a very successful project.
Well, cynicism aside, the difficulty in creating language models etc. does not seem to lay in getting enough data anymore, but rather in getting the right data, together with identifying what is wrong.
- 🐝bownage [they/he] ( @bownage@beehaw.org ) English4•1 year ago
Yup it’s a common sentiment since the GPT era. Trash in = trash out still applies and we should be focusing our efforts on collecting quality data. Unfortunately that’s not what funders are interested in. Grants generally just go to people who promise better metrics.
- SSUPII ( @SSUPII@sopuli.xyz ) English15•1 year ago
Don’t train models on user data without curation, folks!
- BioDriver ( @BioDriver@beehaw.org ) English8•1 year ago
Garbage in, garbage out. This is more a reflection on the devs and the training data they curated than anything else