Reda alami
RedaAlami
8 followers · 3 following
AI & ML interests
Reinforcement Learning
Recent Activity
Updated a dataset 10 days ago: RedaAlami/OpenR1-Math-split-v2
Published a dataset 10 days ago: RedaAlami/OpenR1-Math-split-v2
Published a model 12 days ago: RedaAlami/Falcon3-7B-Instruct-OpenR1-Math
Organizations
Spaces (1)
TestRecommenderSystem • Sleeping
Models (15), sorted by recently updated
RedaAlami/Falcon3-7B-Instruct-OpenR1-Math • Text Generation • Updated 12 days ago • 56
RedaAlami/Qwen-2.5-7B-Simple-RL • Updated 27 days ago
RedaAlami/Falcon3-7B-Instruct-Distill-DS-v1 • Text Generation • Updated about 1 month ago • 283
RedaAlami/Qwen2-0.5B-GRPO-test • Updated Feb 10
RedaAlami/zephyr-7b-dpo-qlora • Updated Oct 4, 2024 • 48
RedaAlami/zephyr-7b-dpo-full • Updated Aug 29, 2024
RedaAlami/merged-dataset0-dataset1 • Updated Aug 28, 2024
RedaAlami/zephyr-7b-gemma-dpo • Updated Jul 31, 2024 • 5
RedaAlami/ultrafeedback_binarized_custom2 • Updated Jul 17, 2024
RedaAlami/ultrafeedback_binarized_custom • Updated Jul 17, 2024
Datasets (145), sorted by recently updated
RedaAlami/OpenR1-Math-split-v2 • Viewer • Updated 10 days ago • 93.7k • 115
RedaAlami/OpenR1-Math-split-v1 • Viewer • Updated 17 days ago • 93.7k • 118
RedaAlami/OpenR1-Math-split-modified • Viewer • Updated 17 days ago • 93.7k • 76
RedaAlami/OpenR1-Math-split • Viewer • Updated 18 days ago • 93.7k • 117
RedaAlami/OpenR1-Math-220k-default-50percent • Viewer • Updated 20 days ago • 46.9k • 86
RedaAlami/OpenR1-Math-220k-default • Viewer • Updated 21 days ago • 93.7k • 129
RedaAlami/merged-dpo-safety • Viewer • Updated Feb 3 • 3.95k • 47
RedaAlami/eng-batch-3-dpo-safety_test • Viewer • Updated Feb 3 • 36 • 45
RedaAlami/eng-batch-4-dpo-safety_test • Viewer • Updated Feb 3 • 53 • 51
RedaAlami/eng-batch-5-dpo-safety_test • Viewer • Updated Feb 3 • 63 • 56