👋
Open to Work
Laxmi Tiwari
laxuu
3 followers · 6 following
TiwariLaxuu
laxmi-tiwari
AI & ML interests
Agentic AI, RL, MARL
Recent Activity
reacted to their post with 👍 3 days ago
Hot take: Wednesday 🔥

For years, AI progress has often looked like this: "Need a smarter model?"
➡️ Add more parameters.
➡️ Add more GPUs.
➡️ Hope your budget survives.

RobotxR1 explores a different idea: what if intelligence comes from experience rather than scale? Instead of relying solely on massive pretraining, it combines LLMs with reinforcement learning, allowing models to learn through interaction, feedback, and mistakes.

As someone interested in Recurrent RL and autonomous systems, I find this raises an exciting question: are we entering the era where experience becomes more valuable than parameters?

The next breakthrough AI might not be the biggest model. It might be the one that learns continuously.

📄 Paper: https://arxiv.org/pdf/2505.03238
💻 Code: https://github.com/ForzaETH/LLMxRobot/tree/main

#ReinforcementLearning #LLM #Robotics #AI #smallmodelhackathonhuggingface
reacted to their post with 🧠 3 days ago
reacted to their post with 🚀 3 days ago
laxuu's models (9), sorted by recently updated
laxuu/transformer_vqa • Updated May 19, 2025
laxuu/Florence-2-vqa_final • Image-Text-to-Text • 0.3B • Updated May 19, 2025
laxuu/Florence-2-vqa_demo • Image-Text-to-Text • 0.3B • Updated May 19, 2025
laxuu/Florence-2-vqa • Image-Text-to-Text • 0.3B • Updated May 19, 2025
laxuu/Taxi-v3 • Reinforcement Learning • Updated Jan 1, 2024
laxuu/q-FrozenLake-v1-4x4-noSlippery • Reinforcement Learning • Updated Jan 1, 2024
laxuu/ppo-LunarLander-v2-1 • Reinforcement Learning • Updated Feb 15, 2023
laxuu/ppo_model • Updated Feb 15, 2023
laxuu/ppo-LunarLander-v2 • Reinforcement Learning • Updated Feb 15, 2023