Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
XiuyingWei's picture
2

XiuyingWei

barpitf
euclaise's profile picture
·

AI & ML interests

None yet

Organizations

AIMO-EPFL's profile picture

authored 2 papers 2 months ago

RAT: Bridging RNN Efficiency and Attention Accuracy in Language Modeling

Paper • 2507.04416 • Published Jul 6, 2025 • 1

RAT+: Train Dense, Infer Sparse -- Recurrence Augmented Attention for Dilated Inference

Paper • 2602.18196 • Published Feb 20 • 1
authored 4 papers 10 months ago

QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models

Paper • 2310.08041 • Published Oct 12, 2023 • 1

Lossy and Lossless (L$^2$) Post-training Model Size Compression

Paper • 2308.04269 • Published Aug 8, 2023

From Markov to Laplace: How Mamba In-Context Learns Markov Chains

Paper • 2502.10178 • Published Feb 14, 2025

Building on Efficient Foundations: Effectively Training LLMs with Structured Feedforward Layers

Paper • 2406.16450 • Published Jun 24, 2024
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs