Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The v2 models are much smaller (15mb), because neural networks. The parsing, NER and tagging are mostly okay with the 50mb model. There are only word vectors for the top 5k words though, which can be a problem.

The v2 English models are more accurate, and can assign vectors to any word, including unknown words using the context and the word shape. Overall it's much better -- but it's still in alpha. The docs are already better, though.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: