Looks like it’s just scrambling each individual word. Seems straightforward to programmatically look for groups of things that aren’t legitimate words on a page.
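Something like this rough C sketch is what I had in mind: flag runs of tokens that aren't in a word list (the tiny dictionary and the run threshold here are placeholders, not anything from the article):

    /* Sketch: flag runs of tokens that don't appear in a word list,
     * which is how per-word scrambling would stand out on a page.
     * The dictionary here is a tiny stand-in for a real word list. */
    #include <ctype.h>
    #include <stdio.h>
    #include <string.h>

    static const char *dict[] = {"the", "quick", "brown", "fox", "jumps"};

    static int in_dict(const char *w) {
        for (size_t i = 0; i < sizeof dict / sizeof *dict; i++)
            if (strcmp(w, dict[i]) == 0) return 1;
        return 0;
    }

    int main(void) {
        char text[] = "the qiuck bworn fox jmups";
        int run = 0;  /* consecutive non-dictionary tokens */
        for (char *w = strtok(text, " "); w; w = strtok(NULL, " ")) {
            for (char *p = w; *p; p++) *p = (char)tolower((unsigned char)*p);
            if (!in_dict(w)) {
                if (++run >= 2) printf("suspicious run ending at: %s\n", w);
            } else {
                run = 0;
            }
        }
        return 0;
    }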
Indeed. I think part of the reason they are not discussed openly may be that much of the data used is copyrighted, which introduces some legal ambiguities.
IANAL, but hiding something doesn't make someone legally immune. Any company could sue the LLM companies, and they couldn't hide the data during the case; e.g. there is already a similar case against OpenAI.
Yes, but it at the very least delays any findings while you rake in the cash and try to create a favorable environment. OpenAI even stated that they think using copyrighted texts is necessary and should be covered by fair use.
These models are compatible with llama.cpp out of the box. We (GigaML - https://gigaml.com) are planning to train a small model (3-4B, 1-bit, open source) on the latest stack-v2 dataset released today. Let me know if anyone is interested in collaborating with us.
I'm interested in collaborating. For example, from the comments it occurred to me that a 128-bit SIMD register can contain 64 2-bit values. It seems straightforward that SIMD bitwise logical operations could be used in training such models.
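To make that concrete, here is a minimal C sketch of the idea (the 2-bit encoding and the mask are my own assumptions, just to show that a single SSE2 bitwise op touches all 64 packed codes at once):

    /* Sketch: 64 two-bit weight codes packed into one 128-bit SSE2 register,
     * processed with a single bitwise AND. Encoding is an assumption:
     * 0b00 = 0, 0b01 = +1, 0b10 = -1. */
    #include <emmintrin.h>
    #include <stdint.h>
    #include <stdio.h>

    int main(void) {
        uint8_t packed[16];                 /* 16 bytes * 4 codes = 64 codes */
        for (int i = 0; i < 16; i++)
            packed[i] = 0x66;               /* four codes per byte: 01 10 01 10 */

        __m128i w = _mm_loadu_si128((const __m128i *)packed);

        /* One AND isolates the low bit of every 2-bit code simultaneously. */
        __m128i low_mask = _mm_set1_epi8(0x55);      /* 01010101 per byte */
        __m128i low_bits = _mm_and_si128(w, low_mask);

        uint8_t out[16];
        _mm_storeu_si128((__m128i *)out, low_bits);
        printf("low bits of first byte: 0x%02x\n", out[0]);
        return 0;
    }

Whether bitwise tricks like this help during training (rather than just inference) is a separate question, but the packing itself is trivial.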
Highly interested in collaborating – I've got a bunch of proprietary legal data already pre-sorted and labeled for various scenarios. I've already benchmarked legal use-cases (e.g. legal specialty, a few logic-based questions, and specific document creation) with various LLMs – so I'd love to see what benchmarks this can produce compared to early Mistral or Llama.
I’ve been working on a project [1] to do just that from within a Chrome extension. The idea was that as an extension, it could make use of the context menu and feel more like a native feature of the browser. I’m always hesitant to link to my things from comments but in this case I think it’s a perfect fit for what you’re describing.