HerrHruby/MR_midtrain_9B_v3
Text Generation
Safetensors
English
qwen3_5
meta-reasoning
math
proofs
theorem-proving
conversational
License: apache-2.0
main
MR_midtrain_9B_v3
19.3 GB
1 contributor
History: 3 commits
HerrHruby: Publish condgen arch (Qwen3_5ForConditionalGeneration) of step-4500 v3 weights; loads directly in vLLM + verl Megatron RL (text-only export recoverable at 072c7a3) | 1af5dbe | verified | 3 days ago
.gitattributes (Safe) | 1.57 kB | Upload MR_midtrain_9B_v3 (v3 SFT, global_step_4500) | 4 days ago
README.md | 1.93 kB | Publish condgen arch (Qwen3_5ForConditionalGeneration) of step-4500 v3 weights; loads directly in vLLM + verl Megatron RL (text-only export recoverable at 072c7a3) | 3 days ago
chat_template.jinja (Safe) | 7.76 kB | Upload MR_midtrain_9B_v3 (v3 SFT, global_step_4500) | 4 days ago
config.json | 2.69 kB | Publish condgen arch (Qwen3_5ForConditionalGeneration) of step-4500 v3 weights; loads directly in vLLM + verl Megatron RL (text-only export recoverable at 072c7a3) | 3 days ago
generation_config.json | 136 Bytes | Publish condgen arch (Qwen3_5ForConditionalGeneration) of step-4500 v3 weights; loads directly in vLLM + verl Megatron RL (text-only export recoverable at 072c7a3) | 3 days ago
model.safetensors | 19.3 GB (xet) | Publish condgen arch (Qwen3_5ForConditionalGeneration) of step-4500 v3 weights; loads directly in vLLM + verl Megatron RL (text-only export recoverable at 072c7a3) | 3 days ago
preprocessor_config.json (Safe) | 390 Bytes | Publish condgen arch (Qwen3_5ForConditionalGeneration) of step-4500 v3 weights; loads directly in vLLM + verl Megatron RL (text-only export recoverable at 072c7a3) | 3 days ago
tokenizer.json | 20 MB (xet) | Upload MR_midtrain_9B_v3 (v3 SFT, global_step_4500) | 4 days ago
tokenizer_config.json (Safe) | 1.1 kB | Upload MR_midtrain_9B_v3 (v3 SFT, global_step_4500) | 4 days ago
video_preprocessor_config.json (Safe) | 385 Bytes | Publish condgen arch (Qwen3_5ForConditionalGeneration) of step-4500 v3 weights; loads directly in vLLM + verl Megatron RL (text-only export recoverable at 072c7a3) | 3 days ago