The user you originally replied to specifically mentioned > without going to tex... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		ludwigschubert 3 months ago \| parent \| context \| favorite \| on: Qwen3-Omni: Native Omni AI model for text, image a... The user you originally replied to specifically mentioned > without going to text first

adastra22 3 months ago [–]

Yeah, and that's my understanding. Nothing goes video -> text, or audio -> text, or even text -> text without first going through state space. That's where the core of the transformer architecture is.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact