Perusha Moodley
moodlep
AI & ML interests
RL, DRL, Decision Transformers, Auxiliary signals, self-supervised methods
Recent Activity
upvoted
an
article
14 days ago
SmolLM - blazingly fast and remarkably powerful
liked
a Space
15 days ago
nanotron/ultrascale-playbook
liked
a dataset
about 2 months ago
Anthropic/hh-rlhf
Organizations
Collections
1
models
9
moodlep/smollm2-17b-dpo-cai-v1
Updated
•
5
moodlep/smollm2-1.7b-instr-sft-cai-v1
Updated
moodlep/smollm2-1.7b-instr-sft-cai
Updated
•
3
moodlep/mistral-7b-sft-constitutional-ai
Updated
•
5
moodlep/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
•
Updated
moodlep/output
Updated
moodlep/a2c-AntBulletEnv-v0
Reinforcement Learning
•
Updated
moodlep/ppo-Huggy
Reinforcement Learning
•
Updated
•
43
moodlep/ppo-LunarLander-v2
Reinforcement Learning
•
Updated