University of Toronto CSSLab

university
https://csslab.cs.toronto.edu/

AI & ML interests

None defined yet.

Recent Activity

lilvjosephtang authored a paper 4 days ago
LLM Safety From Within: Detecting Harmful Content with Internal Representations
lilvjosephtang authored a paper 4 days ago
Maia-2: A Unified Model for Human-AI Alignment in Chess
lilvjosephtang authored a paper 4 days ago
Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning

Papers

LLM Safety From Within: Detecting Harmful Content with Internal Representations

ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement


Ashton Anderson, Joseph Tang, Difan Jiao

UofTCSSLab's datasets

None public yet