Hacker News | convexstrictly's submissions
1.Gemini Flash 2.0 Thinking Experimental (github.com/googleapis)
4 points by convexstrictly on Dec 19, 2024 | past | 3 comments
2.What Questions Are in the Chinese College Entrance Exam? (cherylwu.substack.com)
1 point by convexstrictly on June 14, 2024 | past
3.Generative AI Is Not Going to Build Your Engineering Team for You (stackoverflow.blog)
1 point by convexstrictly on June 14, 2024 | past
4.Building GPT2o – Part 1: Audio (medium.com/nivibilla)
3 points by convexstrictly on June 14, 2024 | past | 1 comment
5.The Geometry of Categorical and Hierarchical Concepts in Large Language Models (arxiv.org)
7 points by convexstrictly on June 8, 2024 | past
6.OpenAI says it has begun training a new flagship A.I. model (nytimes.com)
8 points by convexstrictly on May 28, 2024 | past
7.California residents: call your legislators about AI bill SB 1047 (twitter.com/chrislengerich)
22 points by convexstrictly on May 20, 2024 | past | 11 comments
8.LISA: Layerwise Importance Sampling for Memory-Efficient LLM Fine-Tuning (arxiv.org)
3 points by convexstrictly on March 27, 2024 | past | 1 comment
9.NTIA AI Open Model Weights RFC (regulations.gov)
1 point by convexstrictly on March 27, 2024 | past | 1 comment
10.Mechanics of Next Token Prediction with Self-Attention (arxiv.org)
1 point by convexstrictly on March 19, 2024 | past
11.Dive Deeper into Yi-9B (huggingface.co)
1 point by convexstrictly on March 18, 2024 | past
12.You can now train a 70B language model at home (answer.ai)
4 points by convexstrictly on March 6, 2024 | past | 1 comment
13.Shape Suffixes – Good Coding Style (2024) (medium.com/noamshazeer)
2 points by convexstrictly on Feb 28, 2024 | past
14.Star Trek prompt optimal for grade school math on Llama-70B (twitter.com/emollick)
2 points by convexstrictly on Feb 26, 2024 | past | 1 comment
15.(US Dept of Commerce) NTIA Solicits Comments on Open-Weight AI Models (commerce.gov)
1 point by convexstrictly on Feb 23, 2024 | past
16.BitDelta: Your Fine-Tune May Only Be Worth One Bit (arxiv.org)
2 points by convexstrictly on Feb 16, 2024 | past | 2 comments
17.Time is encoded in the weights of finetuned language models (arxiv.org)
124 points by convexstrictly on Dec 24, 2023 | past | 55 comments
18.Zoology 1: Measuring and Improving Recall in Efficient Language Models (stanford.edu)
2 points by convexstrictly on Dec 22, 2023 | past | 1 comment
19.TinyGSM: Achieving >80% on GSM8k with small language models (arxiv.org)
2 points by convexstrictly on Dec 15, 2023 | past | 1 comment
20.Androids built to meet the labor demands (1x.tech)
1 point by convexstrictly on Dec 1, 2023 | past | 1 comment
21.Sam Altman likely to start company with researchers from OpenAI: Bloomberg (twitter.com/emilychangtv)
4 points by convexstrictly on Nov 18, 2023 | past | 6 comments
22.Three senior researchers have resigned from OpenAI
879 points by convexstrictly on Nov 18, 2023 | past | 672 comments
23.Ron Conway strongly disapproves of Sam Altman's firing (twitter.com/ronconway)
1 point by convexstrictly on Nov 18, 2023 | past
24.Sutskever: OpenAI board doing its mission to build AGI that benefits all (twitter.com/garymarcus)
121 points by convexstrictly on Nov 18, 2023 | past | 172 comments
25.Kara Swisher: OpenAI dev day and store were "pushing too fast" (twitter.com/karaswisher)
2 points by convexstrictly on Nov 18, 2023 | past
26.GPT4 coding regression claims misleading (twitter.com/si_boehm)
4 points by convexstrictly on July 22, 2023 | past
27.Model 4 bit inference 4.2x faster than 16 bit with full HF support (twitter.com/tim_dettmers)
2 points by convexstrictly on July 11, 2023 | past | 1 comment
28.SqueezeLLM: Dense-and-Sparse Quantization (arxiv.org)
5 points by convexstrictly on June 16, 2023 | past | 1 comment
29.Inference-Time Intervention: Eliciting Truthful Answers from a Language Model (arxiv.org)
4 points by convexstrictly on June 8, 2023 | past | 1 comment
30.Orca: Progressive Learning from Complex Explanation Traces of GPT-4 (arxiv.org)
4 points by convexstrictly on June 6, 2023 | past | 1 comment