Models and datasets for Elastic Reset (NeurIPS 2023), code at https://github.com/mnoukhov/elastic-reset
Michael N
mnoukhov
AI & ML interests
Representation learning for functional language
Recent Activity
published
a model
18 days ago
mnoukhov/test
updated
a model
2 months ago
mnoukhov/SmolLM2-135M-tldr-sft
updated
a model
4 months ago
mnoukhov/SmolLM2-360M-tldr-sft
Organizations
Collections
2
models
39
mnoukhov/test
Updated
mnoukhov/SmolLM2-135M-tldr-sft
Text Generation
•
Updated
•
25
mnoukhov/SmolLM2-360M-tldr-sft
Text Generation
•
Updated
•
55
mnoukhov/SmolLM2-135M-Instruct_tldr-sft
Text Generation
•
Updated
•
26
mnoukhov/SmolLM2-135M-Instruct_tldr-rm
Text Classification
•
Updated
•
24
mnoukhov/pythia2.8b-rm-tldr6.9b
Text Classification
•
Updated
•
36
mnoukhov/pythia2.8b-sft-tldr
Text Generation
•
Updated
•
132
mnoukhov/pythia160m-sft-tldr
Text Generation
•
Updated
•
85
mnoukhov/pythia160m-rm-tldr6.9b
Text Classification
•
Updated
•
28
mnoukhov/pythia1b-rm-tldr6.9b
Text Classification
•
Updated
•
30
datasets
49
mnoukhov/summarize_from_feedback_oai_preprocessing_1706381144_relabel_pythia6.9b
Viewer
•
Updated
•
177k
•
88
mnoukhov/summarize_from_feedback_oai_preprocessing_1706381144_relabel2_llama8b
Viewer
•
Updated
•
92.1k
•
54
mnoukhov/summarize_from_feedback_oai_preprocessing_1706381144_relabel_llama8b
Viewer
•
Updated
•
176k
•
69
mnoukhov/summarize_from_feedback_tldr3_unlabelled_vllm_pythia410m-dpo-tldr_relabel_pythia1b
Viewer
•
Updated
•
107k
•
55
mnoukhov/summarize_from_feedback_tldr3_unlabelled_vllm_pythia410m-dpo-tldr
Viewer
•
Updated
•
107k
•
99
mnoukhov/summarize_from_feedback_oai_preprocessing_1706381144_relabel_pythia1b
Viewer
•
Updated
•
177k
•
901
mnoukhov/summarize_from_feedback_oai_preprocessing_1706381144
Viewer
•
Updated
•
179k
•
1.67k
mnoukhov/summarize_from_feedback_tldr3_unlabelled_vllm_pythia410m-dpo-tldr-step873_relabel_pythia1b
Viewer
•
Updated
•
20k
•
53
mnoukhov/summarize_from_feedback_tldr3_unlabelled_vllm_pythia410m-dpo-tldr-step873
Viewer
•
Updated
•
20k
•
59
mnoukhov/summarize_from_feedback_tldr3_unlabelled_vllm_dpo_costa_2.8b_bf16.yml_6e799_new
Viewer
•
Updated
•
20k
•
69