ginigen-ai
PRO
4
13
140
44 followers · 125 following
AI & ML interests
None yet
Recent Activity
upvoted an article about 7 hours ago
Adding a GPU Without Building One
reacted to SeaWolf-AI's post with ❤️ about 7 hours ago
Adding a GPU without building one

AI is usually framed as "how smart is the model / how many GPUs did you buy." The real bottleneck is elsewhere: how efficiently you use the GPUs you already have. Training happens once; inference runs the entire time users use your product. So a service's economics come down to cost per token.

Inference acceleration uses software to pull several times more out of the same GPU, the effect of plugging in one more "virtual GPU."

VIDRAFT's VKAE, measured (B200, same-harness, no quality loss):
Qwen3.5-35B-A3B (MoE): 25.7 → 601 tok/s (23.4×)
Darwin-36B-Opus (in-house MoE): 25.0 → 280.8 (11.2×)
10,000+ tok/s peak aggregate under concurrency

The key: it's reproducible. Model + serving are shipped as one container.

docker pull vidraft/qwen35-vkae:601

Don't take our word for it; run it yourself. The mechanism will be released as a paper.

Leaderboard & demo: https://huggingface.co/spaces/VIDraft/vkae
Articles: https://huggingface.co/blog/FINAL-Bench/vkae-leaderboard
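The speedup figures quoted in the post, and the "cost per token" framing, can be checked with a few lines of arithmetic. The sketch below is only an illustration: the throughput numbers are copied from the post, while the GPU hourly price and all function names are hypothetical placeholders, not anything published by VIDRAFT.

```python
# Illustrative arithmetic only. Throughput figures are taken from the post;
# the hourly GPU price below is a made-up placeholder, not a quoted B200 rate.

HYPOTHETICAL_GPU_HOURLY_USD = 5.00  # assumption, for illustration only

# (baseline tok/s, accelerated tok/s) as reported in the post
MEASUREMENTS = {
    "Qwen3.5-35B-A3B": (25.7, 601.0),
    "Darwin-36B-Opus": (25.0, 280.8),
}


def speedup(baseline_tok_s: float, accelerated_tok_s: float) -> float:
    """Ratio of accelerated throughput to baseline throughput."""
    return accelerated_tok_s / baseline_tok_s


def cost_per_million_tokens(gpu_hourly_usd: float, tok_s: float) -> float:
    """Cost of generating one million tokens on one GPU at a steady tok/s."""
    tokens_per_hour = tok_s * 3600
    return gpu_hourly_usd / tokens_per_hour * 1_000_000


if __name__ == "__main__":
    for name, (before, after) in MEASUREMENTS.items():
        ratio = speedup(before, after)
        cost_before = cost_per_million_tokens(HYPOTHETICAL_GPU_HOURLY_USD, before)
        cost_after = cost_per_million_tokens(HYPOTHETICAL_GPU_HOURLY_USD, after)
        print(f"{name}: {ratio:.1f}x, "
              f"${cost_before:.2f} -> ${cost_after:.2f} per 1M tokens")
```

At a fixed hourly price, cost per token falls by the same factor as throughput rises, which is the sense in which the post describes acceleration as a "virtual GPU."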
liked a model about 8 hours ago
FINAL-Bench/Darwin-36B-Opus-VKAE
Organizations
None yet
ginigen-ai's models (3)
ginigen-ai/Rogue-27B-KR
Text Generation • 26B • Updated 25 days ago • 26 • 13
ginigen-ai/Rogue-28B-MIX
Image-Text-to-Text • 28B • Updated 25 days ago • 20 • 10
ginigen-ai/Rogue-31B-KO
Text Generation • Updated May 27