Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

This is interesting, but I found it moderately disturbing that they spend a LOT of effort up front talking about how they don’t have any access to the prompts or responses. And then they reveal that they did actually have access to the text and they spend 80% of the rest of the paper analyzing the content.




>And then they reveal that they did actually have access to the text

I'm not seeing that. All I'm seeing is them analyzing metadata.


>All I'm seeing is them analyzing metadata Read the section about how they achieve classifications for prompts (hint: They read the prompts)

From what I see the researchers aren't running a classifier on prompts they've acquired.

>The classifier is deployed within OpenRouter's infrastructure, ensuring that classifications remain anonymous and are not linked to individual customers.

OpenRouter has to have access to your prompts in order to route it somewhere else. The researchers don't get access to these prompts. They only get access to the metadata being generated from routing a prompt.


I didn’t read the paper but I know OR has an option to opt-in to reading/training off the prompts for a discount. Some free models also log, but I’m not sure if that is just the provider, or OR too



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: