
Have you considered that selection of material contributes to specialization and efficiency? This is meant to be a model with a small number of weights.


It's also apparently a well-known result that filtering NSFW content IMPROVES scores:

https://x.com/swyx/status/1661359483447316480


Or perhaps removing the curly brackets improved it by more than losing the NSFW content damaged it.

Or perhaps the measurement of improvement was biased. If a model doesn't understand the word 'gay', there would certainly be people who would find the model substandard in real-world use.

Did the assessment of what counts as improvement come from the same community that decided that excluding things with 'gay' was cleaning the data?


The word "gay" mentioned in your link isn't NSFW content, though.


LLMs get distracted by porn too!?



