Edit Models filters

Inference Providers

Nebius AI Studio

HF Inference API

Misc

Inference Endpoints

AutoTrain Compatible

text-generation-inference

4-bit precision

Misc with no match

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

53

Full-text search

Active filters: cpo, trl

NBA55/Experiment_with_trained_model_Final_CPO_for_all_3_issues-epoch-2

Updated May 12, 2024

smohammadi/llama2-lora-aligned-cpo

Updated Jul 20, 2024 • 9

NBA55/Final_Experiment_with_trained_model_Final_CPO_for_all_3_issues-epoch-2

Updated Aug 24, 2024

jbjeong91/llama3.1-cpo-full

Text Generation • Updated Sep 5, 2024 • 12

jbjeong91/llama3.1-cpo_j-full-0911

Text Generation • Updated Sep 11, 2024 • 9

jbjeong91/llama3.1-cpo-full-0911

Text Generation • Updated Sep 12, 2024 • 9

jbjeong91/llama3.1-cpo_j-full-0912

Text Generation • Updated Sep 12, 2024 • 7

jbjeong91/llama3.1-cpo-full-0912

Text Generation • Updated Sep 12, 2024 • 13

jbjeong91/llama3.1-cpo-full-0913

Text Generation • Updated Sep 13, 2024 • 13

Siddartha10/outputs_cpo

Text Generation • Updated Sep 14, 2024 • 47

ravithejads/test_model_sft

Text Generation • Updated Sep 15, 2024

maxmyn/c4ai-takehome-model-simpo

Text Generation • Updated Sep 15, 2024 • 41

twigs/smolm-cposimpo

Text Generation • Updated Sep 16, 2024 • 75

sarthakrw/cpo_model

Text Generation • Updated Sep 16, 2024 • 42

CharlesLi/OpenELM-1_1B-SimPO

Text Generation • Updated Sep 20, 2024 • 23

CharlesLi/OpenELM-1_1B-CPO

Text Generation • Updated Sep 20, 2024 • 26

NBA55/CPO_with_baseline_modalh

Text Generation • Updated Oct 1, 2024 • 11

NBA55/CPO_with_trained_model_for_all_3_issues-epoch-2

Updated Oct 1, 2024

rawsh/mirrorqwen2.5-0.5b-SimPO

Text Generation • Updated Nov 10, 2024 • 16

rawsh/simpo-math-model

Text Generation • Updated Nov 10, 2024 • 45

rawsh/mirrorqwen2.5-0.5b-SimPO-0

Text Generation • Updated Nov 10, 2024 • 42

mradermacher/mirrorqwen2.5-0.5b-SimPO-GGUF

Updated Nov 10, 2024 • 116

mradermacher/mirrorqwen2.5-0.5b-SimPO-0-GGUF

Updated Nov 10, 2024 • 103

rawsh/mirrorqwen2.5-0.5b-SimPO-1

Text Generation • Updated Nov 11, 2024 • 36

rawsh/mirrorqwen2.5-0.5b-SimPO-2

Text Generation • Updated Nov 11, 2024 • 35

rawsh/mirrorqwen2.5-0.5b-SimPO-3

Text Generation • Updated Nov 11, 2024 • 34

mradermacher/mirrorqwen2.5-0.5b-SimPO-1-GGUF

Updated Nov 12, 2024 • 148

mradermacher/mirrorqwen2.5-0.5b-SimPO-2-GGUF

Updated Nov 12, 2024 • 115

mradermacher/mirrorqwen2.5-0.5b-SimPO-3-GGUF

Updated Nov 12, 2024 • 73

botways/llama-CPO

Updated Nov 26, 2024