Reda alami's picture

1

Reda alami

RedaAlami

·

AI & ML interests

Reinforcement Learning

Recent Activity

updated a dataset 10 days ago

RedaAlami/OpenR1-Math-split-v2

published a dataset 10 days ago

RedaAlami/OpenR1-Math-split-v2

published a model 12 days ago

RedaAlami/Falcon3-7B-Instruct-OpenR1-Math

View all activity

Organizations

spaces 1

TestRecommenderSystem

models 15

RedaAlami/Falcon3-7B-Instruct-OpenR1-Math

Text Generation • Updated 12 days ago • 56

RedaAlami/Qwen-2.5-7B-Simple-RL

Updated 27 days ago

RedaAlami/Falcon3-7B-Instruct-Distill-DS-v1

Text Generation • Updated about 1 month ago • 283

RedaAlami/Qwen2-0.5B-GRPO-test

RedaAlami/zephyr-7b-dpo-qlora

Updated Oct 4, 2024 • 48

RedaAlami/zephyr-7b-dpo-full

Updated Aug 29, 2024

RedaAlami/merged-dataset0-dataset1

Updated Aug 28, 2024

RedaAlami/zephyr-7b-gemma-dpo

Updated Jul 31, 2024 • 5

RedaAlami/ultrafeedback_binarized_custom2

Updated Jul 17, 2024

RedaAlami/ultrafeedback_binarized_custom

Updated Jul 17, 2024

datasets 145

RedaAlami/OpenR1-Math-split-v2

Viewer • Updated 10 days ago • 93.7k • 115

RedaAlami/OpenR1-Math-split-v1

Viewer • Updated 17 days ago • 93.7k • 118

RedaAlami/OpenR1-Math-split-modified

Viewer • Updated 17 days ago • 93.7k • 76

RedaAlami/OpenR1-Math-split

Viewer • Updated 18 days ago • 93.7k • 117

RedaAlami/OpenR1-Math-220k-default-50percent

Viewer • Updated 20 days ago • 46.9k • 86

RedaAlami/OpenR1-Math-220k-default

Viewer • Updated 21 days ago • 93.7k • 129

RedaAlami/merged-dpo-safety

Viewer • Updated Feb 3 • 3.95k • 47

RedaAlami/eng-batch-3-dpo-safety_test

Viewer • Updated Feb 3 • 36 • 45

RedaAlami/eng-batch-4-dpo-safety_test

Viewer • Updated Feb 3 • 53 • 51

RedaAlami/eng-batch-5-dpo-safety_test

Viewer • Updated Feb 3 • 63 • 56