13 2

Isadora White

izzcw

https://icwhite.github.io/website/

AI & ML interests

LLMs, Reinforcement Learning, agents, embodiment, multi-agent collaboration

Recent Activity

upvoted a paper 3 days ago

SoCRATES: Towards Reliable Automated Evaluation of Proactive LLM Mediation across Domains and Socio-cognitive Variations

updated a model 9 days ago

izzcw/fingerprint_raw_conversations

published a model 9 days ago

izzcw/fingerprint_raw_conversations

View all activity

Organizations

upvoted a paper 3 days ago

SoCRATES: Towards Reliable Automated Evaluation of Proactive LLM Mediation across Domains and Socio-cognitive Variations

Paper • 2606.05563 • Published 8 days ago • 49

updated a model 9 days ago

izzcw/fingerprint_raw_conversations

Updated 9 days ago

published a model 9 days ago

izzcw/fingerprint_raw_conversations

Updated 9 days ago

upvoted a paper about 1 month ago

Efficient RL Training for LLMs with Experience Replay

Paper • 2604.08706 • Published Apr 9 • 22

liked a model about 1 month ago

microsoft/FrogBoss-32B-2510

Text Generation • Updated Jan 22 • 6.34k • • 30

upvoted 2 papers 5 months ago

BugPilot: Complex Bug Generation for Efficient Learning of SWE Skills

Paper • 2510.19898 • Published Oct 22, 2025 • 3

Evolving Programmatic Skill Networks

Paper • 2601.03509 • Published Jan 7 • 88

upvoted a paper 6 months ago

RefineBench: Evaluating Refinement Capability of Language Models via Checklists

Paper • 2511.22173 • Published Nov 27, 2025 • 15

upvoted a paper 8 months ago

Steering Autoregressive Music Generation with Recursive Feature Machines

Paper • 2510.19127 • Published Oct 21, 2025 • 8

upvoted a paper 11 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 320

published a model about 1 year ago

izzcw/dpo_model_3.1_8k

Updated Jun 4, 2025

updated a model about 1 year ago

izzcw/qwen_large_crafting_sft_success

Text Generation • 2B • Updated Jun 1, 2025 • 4

published a model about 1 year ago

izzcw/qwen_large_crafting_sft_success

Text Generation • 2B • Updated Jun 1, 2025 • 4

updated a model about 1 year ago

izzcw/large_crafting_sft_success

Text Generation • 2B • Updated Jun 1, 2025 • 7

updated a dataset about 1 year ago

izzcw/trajectory_crafting_dpo_pairs

Viewer • Updated Jun 1, 2025 • 244 • 7

updated a model about 1 year ago

izzcw/trajectory_crafting_dpo_pairs

Updated Jun 1, 2025

published a model about 1 year ago

izzcw/trajectory_crafting_dpo_pairs

Updated Jun 1, 2025

updated a model about 1 year ago

izzcw/trajectory_crafting_dpo_pairs.json

Updated Jun 1, 2025

published a model about 1 year ago

izzcw/trajectory_crafting_dpo_pairs.json

Updated Jun 1, 2025

published a dataset about 1 year ago

izzcw/trajectory_crafting_dpo_pairs

Viewer • Updated Jun 1, 2025 • 244 • 7

Isadora White

AI & ML interests

Recent Activity

Organizations

izzcw's activity