Hacker Newsnew | past | comments | ask | show | jobs | submit | amarble's submissionslogin
1.I paid $170 and all I got was this demo (marble.onl)
3 points by amarble 13 days ago | past | 1 comment
2.I paid $170 and all I got was this stupid demo (marble.onl)
1 point by amarble 14 days ago | past
3.Task-free intelligence testing of LLMs (marble.onl)
69 points by amarble 45 days ago | past | 22 comments
4.Task-free intelligence testing of LLMs (marble.onl)
1 point by amarble 46 days ago | past
5.Intelligence is not just about task completion (marble.onl)
1 point by amarble 49 days ago | past
6.If You Meet ET in Space, Kill Him (2024) (nautil.us)
3 points by amarble 51 days ago | past
7.Intelligence is not just about task completion (marble.onl)
3 points by amarble 52 days ago | past
8.Show HN: Gen AI Writing Showdown (writing-showdown.com)
2 points by amarble 62 days ago | past
9.Ifrro member Kopinor signs agreement on newspaper content for AI in Norway (ifrro.org)
2 points by amarble 62 days ago | past
10.Comparing language model performance on creative writing transformations (writing-showdown.com)
2 points by amarble 63 days ago | past
11.Eminembench (marble.onl)
1 point by amarble 4 months ago | past
12.Promptware Attacks Against LLM-Powered Assistants in Production (sites.google.com)
1 point by amarble 5 months ago | past
13.Managing LLM application performance through code standards (marble.onl)
2 points by amarble 10 months ago | past
14.Catching Claude Cheating (marble.onl)
1 point by amarble 11 months ago | past
15.Catching Claude Cheating (marble.onl)
1 point by amarble 11 months ago | past | 1 comment
16.Scanning AI application code for vulnerabilities and performance issues (marble.onl)
3 points by amarble 11 months ago | past
17.Show HN: A static scanner for LLM app code (github.com/kereva-dev)
6 points by amarble 11 months ago | past | 1 comment
18.Scanning AI application code for vulnerabilities and performance issues (marble.onl)
2 points by amarble 11 months ago | past
19.The Model Trust Score: The Framework for Strategic Enterprise AI Model Selection (credo.ai)
2 points by amarble 11 months ago | past
20.Evals are not all you need (marble.onl)
58 points by amarble 11 months ago | past | 12 comments
21.An AI Cyber Incident in Plain Sight (marble.onl)
2 points by amarble on Nov 24, 2024 | past
22.AI agent using Anthropic's tool calling and the Pandas Python library (github.com/rbitr)
2 points by amarble on Nov 13, 2024 | past
23.Following LLM Manufacturer's Instructions (armilla.ai)
3 points by amarble on Oct 22, 2024 | past | 1 comment
24.AI Cybersecurity Lessons from GenAI (marble.onl)
1 point by amarble on June 22, 2024 | past

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: