Running • Featured • 49 • Porting nanochat to Transformers: an AI modeling history lesson • Learn about ML and Transformers through nanochat
Running on CPU Upgrade • Featured • 3.19k • The Smol Training Playbook • The secrets to building world-class LLMs
Running • 81 • Maintain the unmaintainable • Explore the complex relationships between 400+ machine learning models
Running • Agents • 80 • Transformers Timeline • Interactive timeline to explore the 🤗 Transformers models
deepseek-ai/DeepSeek-R1-0528 • Text Generation • 685B • Updated May 29, 2025 • 3.69M • 2.45k
Running on Zero • Agents • Featured • 843 • Florence 2 • Generate captions, detections, and segmentations for any image