Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
CEIA Reinforcement Learning
university
Activity Feed
Follow
9
AI & ML interests
None defined yet.
Recent Activity
luanagbmartins
published
a dataset
about 5 hours ago
CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_qwen3-4b-dw-lr-dpo-offline-energy-GRPO_v3
luanagbmartins
updated
a dataset
about 6 hours ago
CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_energy-exp1-dpo-offline_v3
luanagbmartins
updated
a dataset
about 6 hours ago
CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_qwen3-4b-dw-lr-dpo-offline-energy_v3
View all activity
Team members
7
CEIA-RL
's models
13
Sort: Recently updated
CEIA-RL/energyv2-dpo-offline-GRPO
4B
•
Updated
about 24 hours ago
•
8
CEIA-RL/qwen3-4b-dw-lr-SLERP
Text Generation
•
4B
•
Updated
14 days ago
•
22
CEIA-RL/qwen3-4b-dw-lr-GRPO-mix-preference
Updated
14 days ago
•
11
CEIA-RL/qwen3-4b-dw-lr-GRPO
Updated
14 days ago
•
109
CEIA-RL/energy-exp1-dpo-offline
Text Generation
•
4B
•
Updated
17 days ago
•
119
CEIA-RL/energyv2-dpo-offline
Text Generation
•
4B
•
Updated
18 days ago
•
279
CEIA-RL/qwen3-4b-dw-lr-dpo-offline-energy-GRPO
Text Generation
•
4B
•
Updated
24 days ago
•
197
CEIA-RL/qwen3-4b-dw-lr-dpo-offline-energy
Text Generation
•
4B
•
Updated
May 6
•
106
CEIA-RL/Qwen3-4B-Instruct-2507
Text Generation
•
4B
•
Updated
May 4
•
5
CEIA-RL/qwen3-4b-dw-lr-dpo
Text Generation
•
4B
•
Updated
May 1
•
159
CEIA-RL/qwen3-4b-dw-lr-hf-dpo
Text Generation
•
4B
•
Updated
Apr 21
•
8
CEIA-RL/Qwen3-4B-Instruct-2507-GRPO-GPT-OSS-120B-BUG
Updated
Apr 21
•
58
CEIA-RL/qwen3-4b-dw-lr-dpo-offline
Text Generation
•
4B
•
Updated
Apr 9
•
52