Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I think this is a pretty big limitation of the architecture (STT->LLM->TTS) they've chosen. The intonation around struggling to speak or difficulty with certain phrases is totally lost when the text is transcribed.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: