convexstrictly's submissions

1.		Gemini Flash 2.0 Thinking Experimental (github.com/googleapis)
		4 points by convexstrictly on Dec 19, 2024 \| past \| 3 comments
2.		What Questions Are in the Chinese College Entrance Exam? (cherylwu.substack.com)
		1 point by convexstrictly on June 14, 2024 \| past
3.		Generative AI Is Not Going to Build Your Engineering Team for You (stackoverflow.blog)
		1 point by convexstrictly on June 14, 2024 \| past
4.		Building GPT2o – Part 1: Audio (medium.com/nivibilla)
		3 points by convexstrictly on June 14, 2024 \| past \| 1 comment
5.		The Geometry of Categorical and Hierarchical Concepts in Large Language Models (arxiv.org)
		7 points by convexstrictly on June 8, 2024 \| past
6.		OpenAI says it has begun training a new flagship A.I. model (nytimes.com)
		8 points by convexstrictly on May 28, 2024 \| past
7.		California residents: call your legislators about AI bill SB 1047 (twitter.com/chrislengerich)
		22 points by convexstrictly on May 20, 2024 \| past \| 11 comments
8.		LISA: Layerwise Importance Sampling for Memory-Efficient LLM Fine-Tuning (arxiv.org)
		3 points by convexstrictly on March 27, 2024 \| past \| 1 comment
9.		NTIA AI Open Model Weights RFC (regulations.gov)
		1 point by convexstrictly on March 27, 2024 \| past \| 1 comment
10.		Mechanics of Next Token Prediction with Self-Attention (arxiv.org)
		1 point by convexstrictly on March 19, 2024 \| past
11.		Dive Deeper into Yi-9B (huggingface.co)
		1 point by convexstrictly on March 18, 2024 \| past
12.		You can now train a 70B language model at home (answer.ai)
		4 points by convexstrictly on March 6, 2024 \| past \| 1 comment
13.		Shape Suffixes – Good Coding Style (2024) (medium.com/noamshazeer)
		2 points by convexstrictly on Feb 28, 2024 \| past
14.		Star Trek prompt optimal for grade school math on Llama-70B (twitter.com/emollick)
		2 points by convexstrictly on Feb 26, 2024 \| past \| 1 comment
15.		(US Dept of Commerce) NTIA Solicits Comments on Open-Weight AI Models (commerce.gov)
		1 point by convexstrictly on Feb 23, 2024 \| past
16.		BitDelta: Your Fine-Tune May Only Be Worth One Bit (arxiv.org)
		2 points by convexstrictly on Feb 16, 2024 \| past \| 2 comments
17.		Time is encoded in the weights of finetuned language models (arxiv.org)
		124 points by convexstrictly on Dec 24, 2023 \| past \| 55 comments
18.		Zoology 1: Measuring and Improving Recall in Efficient Language Models (stanford.edu)
		2 points by convexstrictly on Dec 22, 2023 \| past \| 1 comment
19.		TinyGSM: Achieving >80% on GSM8k with small language models (arxiv.org)
		2 points by convexstrictly on Dec 15, 2023 \| past \| 1 comment
20.		Androids built to meet the labor demands (1x.tech)
		1 point by convexstrictly on Dec 1, 2023 \| past \| 1 comment
21.		Sam Altman likely to start company with researchers from OpenAI: Bloomberg (twitter.com/emilychangtv)
		4 points by convexstrictly on Nov 18, 2023 \| past \| 6 comments
22.		Three senior researchers have resigned from OpenAI
		879 points by convexstrictly on Nov 18, 2023 \| past \| 672 comments
23.		Ron Conway strongly disapproves of Sam Altman's firing (twitter.com/ronconway)
		1 point by convexstrictly on Nov 18, 2023 \| past
24.		Sutskever: OpenAI board doing its mission to build AGI that benefits all (twitter.com/garymarcus)
		121 points by convexstrictly on Nov 18, 2023 \| past \| 172 comments
25.		Kara Swisher: OpenAI dev day and store were "pushing too fast" (twitter.com/karaswisher)
		2 points by convexstrictly on Nov 18, 2023 \| past
26.		GPT4 coding regression claims misleading (twitter.com/si_boehm)
		4 points by convexstrictly on July 22, 2023 \| past
27.		Model 4 bit inference 4.2x faster than 16 bit with full HF support (twitter.com/tim_dettmers)
		2 points by convexstrictly on July 11, 2023 \| past \| 1 comment
28.		SqueezeLLM: Dense-and-Sparse Quantization (arxiv.org)
		5 points by convexstrictly on June 16, 2023 \| past \| 1 comment
29.		Inference-Time Intervention: Eliciting Truthful Answers from a Language Model (arxiv.org)
		4 points by convexstrictly on June 8, 2023 \| past \| 1 comment
30.		Orca: Progressive Learning from Complex Explanation Traces of GPT-4 (arxiv.org)
		4 points by convexstrictly on June 6, 2023 \| past \| 1 comment