Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
30.5
TFLOPS
19
45
216
Marc Kovka
GPT007
Follow
tahamajs's profile picture
bokesyo's profile picture
kenshinn's profile picture
7 followers
ยท
39 following
AI & ML interests
None yet
Recent Activity
liked
a dataset
1 day ago
GPT007/Emojis_HQ
liked
a model
over 1 year ago
gokaygokay/Flux-Watercolor-Strokes-LoRA
reacted
to
lewtun
's
post
with ๐ฅ
over 1 year ago
We outperform Llama 70B with Llama 3B on hard math by scaling test-time compute ๐ฅ How? By combining step-wise reward models with tree search algorithms :) We show that smol models can match or exceed the performance of their much larger siblings when given enough "time to think" We're open sourcing the full recipe and sharing a detailed blog post. In our blog post we cover: ๐ Compute-optimal scaling: How we implemented DeepMind's recipe to boost the mathematical capabilities of open models at test-time. ๐ Diverse Verifier Tree Search (DVTS): An unpublished extension we developed to the verifier-guided tree search technique. This simple yet effective method improves diversity and delivers better performance, particularly at large test-time compute budgets. ๐งญ Search and Learn: A lightweight toolkit for implementing search strategies with LLMs and built for speed with vLLM Here's the links: - Blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute - Code: https://github.com/huggingface/search-and-learn Enjoy!
View all activity
Organizations
None yet
GPT007
's models
2
Sort:ย Recently updated
GPT007/Emojis_SDXL_lora
Updated
Jun 18, 2024
GPT007/PrateritumGPT
Updated
Jun 1, 2024
โข
1