- DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
  Paper • 2501.12948 • Published • 346
- LightThinker: Thinking Step-by-Step Compression
  Paper • 2502.15589 • Published • 26
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
  Paper • 2405.04434 • Published • 18
- Model Compression and Efficient Inference for Large Language Models: A Survey
  Paper • 2402.09748 • Published • 1
Nvar Char
zombieofCrypto
AI & ML interests
machine learning to become more zombie-like
Recent Activity
Updated a collection about 12 hours ago: llm_improvement_research
Organizations
Collections: 4
Spaces: 5
Datasets: None public yet