10 11

孙紫怡

christopherrami

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Swift Sampling: Selecting Temporal Surprises via Taylor Series

upvoted a paper 4 days ago

TIDE: Efficient and Lossless MoE Diffusion LLM Inference with I/O-aware Expert Offload

liked a model 4 days ago

tencent/Hy-MT2-1.8B

View all activity

Organizations

None yet

upvoted a paper 3 days ago

Swift Sampling: Selecting Temporal Surprises via Taylor Series

Paper • 2605.22678 • Published 6 days ago • 9

upvoted a paper 4 days ago

TIDE: Efficient and Lossless MoE Diffusion LLM Inference with I/O-aware Expert Offload

Paper • 2605.20179 • Published 8 days ago • 4

liked a model 4 days ago

tencent/Hy-MT2-1.8B

Translation • 2B • Updated about 6 hours ago • 7.47k • • 985

upvoted a paper 5 days ago

Forcing-KV: Hybrid KV Cache Compression for Efficient Autoregressive Video Diffusion Models

Paper • 2605.09681 • Published 17 days ago • 10

upvoted a paper 8 days ago

CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence

Paper • 2605.12882 • Published 14 days ago • 268

liked a model 12 days ago

mradermacher/Qwen3-Space.Agent.DASD.Uncensored-4B-GGUF

4B • Updated 12 days ago • 979 • 1

liked a dataset 15 days ago

mbhise/my_data

Updated 15 days ago • 61 • 1

liked a dataset 20 days ago

Dongxiaokun/so101-pick-up-pens-put-in-the-cup

Viewer • Updated 19 days ago • 20.7k • 196 • 1

upvoted a paper 21 days ago

From Context to Skills: Can Language Models Learn from Context Skillfully?

Paper • 2604.27660 • Published 24 days ago • 163

upvoted a paper 25 days ago

Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows

Paper • 2604.28139 • Published 27 days ago • 42

liked a model about 1 month ago

tencent/HY-Embodied-0.5

Image-Text-to-Text • 4B • Updated Apr 14 • 827 • 907

upvoted a paper about 1 month ago

MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale

Paper • 2604.04771 • Published Apr 6 • 123

liked a dataset about 1 month ago

open-index/hacker-news

Updated 3 minutes ago • 33.5k • 319

upvoted a paper about 2 months ago

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 504

liked a dataset about 2 months ago

thuerey-group/CRMpert

Viewer • Updated Apr 10 • 2.15k • 31 • 1

liked a model about 2 months ago

Sarmistha/Gemma_3_Idiom_VL

Updated Apr 8 • 1

liked a dataset about 2 months ago

netgoat-ai/SynthWAF

Viewer • Updated Apr 4 • 1M • 25 • 1

liked a model about 2 months ago

dianavdavidson/wh_small_iv_indic_voices_51708_trial

Automatic Speech Recognition • 0.2B • Updated Apr 3 • 5 • 1

liked a dataset about 2 months ago

HuggingFaceFW/finepdfs

Viewer • Updated Apr 3 • 476M • 54.6k • 868

upvoted a paper 2 months ago

Demystifing Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 372

孙 紫怡

AI & ML interests

Recent Activity

Organizations

christopherrami's activity

孙紫怡