some high-quality Chinese corpus you can find
Leon Lee
Leon-Leee
AI & ML interests
LLMs, code generation, chatbot, workflows
Recent Activity
liked
a dataset
about 6 hours ago
EricLu/SCP-116K
commented on
an
article
about 7 hours ago
Open R1: Update #3
updated
a collection
2 days ago
Useful Pretrain-Datasets
Organizations
models
None public yet
datasets
5
Leon-Leee/unofficial-pyedu
Viewer
•
Updated
•
7.68M
•
162
Leon-Leee/OSS_Instruct_Python_zh_GPT35
Viewer
•
Updated
•
73.5k
•
55
Leon-Leee/Wizardlm_Evol_Instruct_v2_196K_backuped
Viewer
•
Updated
•
143k
•
58
•
1
Leon-Leee/WizardLM_evol_instruct_V2_only_code
Viewer
•
Updated
•
30.2k
•
62
Leon-Leee/Code-Feedback-decontamination
Viewer
•
Updated
•
66.4k
•
79