Healthcare, Life Science, AI Agent, Compliance Testing, Model Development
CHI-Bench: Can AI Agents Automate End-to-End, Long-Horizon, Policy-Rich Healthcare Workflows?