ray lin
linsoft
ยท
AI & ML interests
RL, NLP, LLM
Recent Activity
upvoted a paper about 11 hours ago
Beyond Uniform Token-Level Trust Region in LLM Reinforcement Learning updated a model over 1 year ago
Tinytron/MLC-Tinytron liked a model almost 2 years ago
facebook/encodec_24khz