amarble's submissions

1.		I paid $170 and all I got was this demo (marble.onl)
		3 points by amarble 13 days ago \| past \| 1 comment
2.		I paid $170 and all I got was this stupid demo (marble.onl)
		1 point by amarble 14 days ago \| past
3.		Task-free intelligence testing of LLMs (marble.onl)
		69 points by amarble 45 days ago \| past \| 22 comments
4.		Task-free intelligence testing of LLMs (marble.onl)
		1 point by amarble 46 days ago \| past
5.		Intelligence is not just about task completion (marble.onl)
		1 point by amarble 49 days ago \| past
6.		If You Meet ET in Space, Kill Him (2024) (nautil.us)
		3 points by amarble 51 days ago \| past
7.		Intelligence is not just about task completion (marble.onl)
		3 points by amarble 52 days ago \| past
8.		Show HN: Gen AI Writing Showdown (writing-showdown.com)
		2 points by amarble 62 days ago \| past
9.		Ifrro member Kopinor signs agreement on newspaper content for AI in Norway (ifrro.org)
		2 points by amarble 62 days ago \| past
10.		Comparing language model performance on creative writing transformations (writing-showdown.com)
		2 points by amarble 63 days ago \| past
11.		Eminembench (marble.onl)
		1 point by amarble 4 months ago \| past
12.		Promptware Attacks Against LLM-Powered Assistants in Production (sites.google.com)
		1 point by amarble 5 months ago \| past
13.		Managing LLM application performance through code standards (marble.onl)
		2 points by amarble 10 months ago \| past
14.		Catching Claude Cheating (marble.onl)
		1 point by amarble 11 months ago \| past
15.		Catching Claude Cheating (marble.onl)
		1 point by amarble 11 months ago \| past \| 1 comment
16.		Scanning AI application code for vulnerabilities and performance issues (marble.onl)
		3 points by amarble 11 months ago \| past
17.		Show HN: A static scanner for LLM app code (github.com/kereva-dev)
		6 points by amarble 11 months ago \| past \| 1 comment
18.		Scanning AI application code for vulnerabilities and performance issues (marble.onl)
		2 points by amarble 11 months ago \| past
19.		The Model Trust Score: The Framework for Strategic Enterprise AI Model Selection (credo.ai)
		2 points by amarble 11 months ago \| past
20.		Evals are not all you need (marble.onl)
		58 points by amarble 11 months ago \| past \| 12 comments
21.		An AI Cyber Incident in Plain Sight (marble.onl)
		2 points by amarble on Nov 24, 2024 \| past
22.		AI agent using Anthropic's tool calling and the Pandas Python library (github.com/rbitr)
		2 points by amarble on Nov 13, 2024 \| past
23.		Following LLM Manufacturer's Instructions (armilla.ai)
		3 points by amarble on Oct 22, 2024 \| past \| 1 comment
24.		AI Cybersecurity Lessons from GenAI (marble.onl)
		1 point by amarble on June 22, 2024 \| past