Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens Paper • 2503.01710 • Published 11 days ago • 3
More Documents, Same Length: Isolating the Challenge of Multiple Documents in RAG Paper • 2503.04388 • Published 8 days ago • 13
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published Jan 8 • 263
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published Jan 14 • 276
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 346
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 3 days ago • 239
Babel: Open Multilingual Large Language Models Serving Over 90% of Global Speakers Paper • 2503.00865 • Published 12 days ago • 58
MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning Paper • 2503.07365 • Published 4 days ago • 53
SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models Paper • 2503.07605 • Published 4 days ago • 63
Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders Paper • 2503.03601 • Published 9 days ago • 207
Unified Reward Model for Multimodal Understanding and Generation Paper • 2503.05236 • Published 7 days ago • 104
EuroBERT: Scaling Multilingual Encoders for European Languages Paper • 2503.05500 • Published 7 days ago • 72
view article Article Manus AI: The Best Autonomous AI Agent Redefining Automation and Productivity By LLMhacker • 9 days ago • 136
Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs Paper • 2503.01743 • Published 11 days ago • 72
view article Article Hugging Face and JFrog partner to make AI Security more transparent 11 days ago • 20
view article Article A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality 11 days ago • 65