Makes sense, and each model has a max context length, so they could charge per t...

		daxfohl 6 months ago \| parent \| context \| favorite \| on: Breaking Quadratic Barriers: A Non-Attention LLM f... Makes sense, and each model has a max context length, so they could charge per token assuming full context by model if they wanted to assume worst case.