kaeru39 PRO
ryota39
AI & ML interests
LLM × RL
Recent Activity
liked a dataset 3 days ago
LiquidAI/ifstruct-v1.0 liked a dataset 3 days ago
Qwen/PolyMath liked a model 22 days ago
google/gemma-4-12B-itOrganizations
models 19
ryota39/Qwen3-8B-math-RL-ja
8B • Updated • 3
ryota39/Qwen3-8B-math-RL-en
Text Generation • 8B • Updated • 11
ryota39/gemma-2-2b-jpn-it-q8
3B • Updated • 7
ryota39/Tora-12B
Text Generation • 12B • Updated • 2 • 1
ryota39/Tora-7B-v0.1
Text Generation • Updated • 4 • 2
ryota39/mluke-large-lite-reward
Text Classification • 0.6B • Updated • 3
ryota39/retriva-bert-preference-classifier
Text Classification • 1B • Updated • 3
ryota39/Tora-7B-v0.2
Text Generation • 7B • Updated • 7 • 1
ryota39/llm-jp-1b-sft-100k-LoRA-dpo-12k
Text Generation • 1B • Updated • 4
ryota39/Phi-3-mini-4k-instruct-dpo
Text Generation • 4B • Updated • 5 • 3
datasets 34
ryota39/gsm8k-ja
Viewer • Updated • 8.79k • 10
ryota39/llmjp-chatbot-arena-v2
Viewer • Updated • 594 • 5
ryota39/aya-ja-evol-inst
Viewer • Updated • 29.1k • 12
ryota39/llm-jp-chatbot-arena-conversations-reformatted
Viewer • Updated • 990 • 15 • 1
ryota39/reviews_and_summaries2
Viewer • Updated • 50 • 8
ryota39/reviews_and_summaries
Viewer • Updated • 50 • 7
ryota39/movie_reviews_local
Viewer • Updated • 50 • 10
ryota39/movie_reviews
Viewer • Updated • 50 • 8
ryota39/wild_chat_ja
Viewer • Updated • 3.49k • 6
ryota39/aya-evol-instruct
Viewer • Updated • 29.2k • 19