| 1. | | Gemini Flash 2.0 Thinking Experimental (github.com/googleapis) |
| 4 points by convexstrictly on Dec 19, 2024 | past | 3 comments |
|
| 2. | | What Questions Are in the Chinese College Entrance Exam? (cherylwu.substack.com) |
| 1 point by convexstrictly on June 14, 2024 | past |
|
| 3. | | Generative AI Is Not Going to Build Your Engineering Team for You (stackoverflow.blog) |
| 1 point by convexstrictly on June 14, 2024 | past |
|
| 4. | | Building GPT2o – Part 1: Audio (medium.com/nivibilla) |
| 3 points by convexstrictly on June 14, 2024 | past | 1 comment |
|
| 5. | | The Geometry of Categorical and Hierarchical Concepts in Large Language Models (arxiv.org) |
| 7 points by convexstrictly on June 8, 2024 | past |
|
| 6. | | OpenAI says it has begun training a new flagship A.I. model (nytimes.com) |
| 8 points by convexstrictly on May 28, 2024 | past |
|
| 7. | | California residents: call your legislators about AI bill SB 1047 (twitter.com/chrislengerich) |
| 22 points by convexstrictly on May 20, 2024 | past | 11 comments |
|
| 8. | | LISA: Layerwise Importance Sampling for Memory-Efficient LLM Fine-Tuning (arxiv.org) |
| 3 points by convexstrictly on March 27, 2024 | past | 1 comment |
|
| 9. | | NTIA AI Open Model Weights RFC (regulations.gov) |
| 1 point by convexstrictly on March 27, 2024 | past | 1 comment |
|
| 10. | | Mechanics of Next Token Prediction with Self-Attention (arxiv.org) |
| 1 point by convexstrictly on March 19, 2024 | past |
|
| 11. | | Dive Deeper into Yi-9B (huggingface.co) |
| 1 point by convexstrictly on March 18, 2024 | past |
|
| 12. | | You can now train a 70B language model at home (answer.ai) |
| 4 points by convexstrictly on March 6, 2024 | past | 1 comment |
|
| 13. | | Shape Suffixes – Good Coding Style (2024) (medium.com/noamshazeer) |
| 2 points by convexstrictly on Feb 28, 2024 | past |
|
| 14. | | Star Trek prompt optimal for grade school math on Llama-70B (twitter.com/emollick) |
| 2 points by convexstrictly on Feb 26, 2024 | past | 1 comment |
|
| 15. | | (US Dept of Commerce) NTIA Solicits Comments on Open-Weight AI Models (commerce.gov) |
| 1 point by convexstrictly on Feb 23, 2024 | past |
|
| 16. | | BitDelta: Your Fine-Tune May Only Be Worth One Bit (arxiv.org) |
| 2 points by convexstrictly on Feb 16, 2024 | past | 2 comments |
|
| 17. | | Time is encoded in the weights of finetuned language models (arxiv.org) |
| 124 points by convexstrictly on Dec 24, 2023 | past | 55 comments |
|
| 18. | | Zoology 1: Measuring and Improving Recall in Efficient Language Models (stanford.edu) |
| 2 points by convexstrictly on Dec 22, 2023 | past | 1 comment |
|
| 19. | | TinyGSM: Achieving >80% on GSM8k with small language models (arxiv.org) |
| 2 points by convexstrictly on Dec 15, 2023 | past | 1 comment |
|
| 20. | | Androids built to meet the labor demands (1x.tech) |
| 1 point by convexstrictly on Dec 1, 2023 | past | 1 comment |
|
| 21. | | Sam Altman likely to start company with researchers from OpenAI: Bloomberg (twitter.com/emilychangtv) |
| 4 points by convexstrictly on Nov 18, 2023 | past | 6 comments |
|
| 22. | | Three senior researchers have resigned from OpenAI |
| 879 points by convexstrictly on Nov 18, 2023 | past | 672 comments |
|
| 23. | | Ron Conway strongly disapproves of Sam Altman's firing (twitter.com/ronconway) |
| 1 point by convexstrictly on Nov 18, 2023 | past |
|
| 24. | | Sutskever: OpenAI board doing its mission to build AGI that benefits all (twitter.com/garymarcus) |
| 121 points by convexstrictly on Nov 18, 2023 | past | 172 comments |
|
| 25. | | Kara Swisher: OpenAI dev day and store were "pushing too fast" (twitter.com/karaswisher) |
| 2 points by convexstrictly on Nov 18, 2023 | past |
|
| 26. | | GPT4 coding regression claims misleading (twitter.com/si_boehm) |
| 4 points by convexstrictly on July 22, 2023 | past |
|
| 27. | | Model 4 bit inference 4.2x faster than 16 bit with full HF support (twitter.com/tim_dettmers) |
| 2 points by convexstrictly on July 11, 2023 | past | 1 comment |
|
| 28. | | SqueezeLLM: Dense-and-Sparse Quantization (arxiv.org) |
| 5 points by convexstrictly on June 16, 2023 | past | 1 comment |
|
| 29. | | Inference-Time Intervention: Eliciting Truthful Answers from a Language Model (arxiv.org) |
| 4 points by convexstrictly on June 8, 2023 | past | 1 comment |
|
| 30. | | Orca: Progressive Learning from Complex Explanation Traces of GPT-4 (arxiv.org) |
| 4 points by convexstrictly on June 6, 2023 | past | 1 comment |
|