I don't want to be dismissive, it's a fun project, but this has been done a lot ...

tildef · on May 19, 2024

There's literally an image of Anya pointing at Karpathy on this GitHub page.

fifilura · on May 19, 2024

I think they learned a lot doing this? And they tried hard explaining each step!

_giorgio_ · on May 20, 2024

What are your favourite implementations of a GPT? I like a lot the video series by Karpathy.

Anyway, I'll take a look at this too, not sure if it has inference and training. Having just inference would be a disappointment.

rvz · on May 19, 2024

Well given the fast pace of AI, it should not be a surprise that this is similar to llama2 and that we’re seeing the n + 1 toy implementations and likely has bugs or leaks in the background.

You might as well look at llama.cpp for a serious and production grade implementation to learn from. Otherwise, nothing to see here.

> Is there something particularly different about this one?

Other than the immature lowercase, anime BS, etc, then…

No.