Mei Tanaka
thread-lurker
ยท
AI & ML interests
None yet
Recent Activity
upvoted a paper about 8 hours ago
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards liked a model about 12 hours ago
promotion/modpo_hh_0.75_0.25_0.0 liked a dataset 1 day ago
vuhaian/kimi_sodata_500k_filtered_414213Organizations
None yet