-
UCSC-VLAA/ClinSeek-35B-A3B
Text Generation • 35B • Updated • 57 -
UCSC-VLAA/ClinSeek-Bench
Viewer • Updated • 2.79k • 78 • 1 -
UCSC-VLAA/ClinSeek-Evaluation-Results
Preview • Updated • 1.28k -
ClinSeekAgent: Automating Multimodal Evidence Seeking for Agentic Clinical Reasoning
Paper • 2605.20176 • Published • 10
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
ClinSeekAgent: Automating Multimodal Evidence Seeking for Agentic Clinical Reasoning
VLAA-GUI: Knowing When to Stop, Recover, and Search, A Modular Framework for GUI Automation
A Family of Unified Visual Encoder with Unified Visual Representation.
-
UCSC-VLAA/gpt-image-edit-training
Image-to-Image • Updated • 26 -
UCSC-VLAA/GPT-Image-Edit-1.5M
Viewer • Updated • 2.78M • 3.71k • 78 -
GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset
Paper • 2507.21033 • Published • 23 -
UCSC-VLAA/gpt-image-edit-benchmark-results
Viewer • Updated • 1.21k • 86 • 1
-
UCSC-VLAA/MedVLThinker-3B-SFT_m23k
Image-Text-to-Text • 4B • Updated • 8 -
UCSC-VLAA/MedVLThinker-3B-SFT_PMC
Image-Text-to-Text • 4B • Updated • 7 -
UCSC-VLAA/MedVLThinker-7B-SFT_m23k
Image-Text-to-Text • 8B • Updated • 6 -
UCSC-VLAA/MedVLThinker-3B-SFT_m23k-RL_PMC
Image-Text-to-Text • 4B • Updated • 4 • 1
-
UCSC-VLAA/VLAA-Thinker-Qwen2.5VL-3B
Image-Text-to-Text • 4B • Updated • 436 • 5 -
UCSC-VLAA/VLAA-Thinker-Qwen2.5VL-7B
Image-Text-to-Text • 8B • Updated • 142 • 2 -
UCSC-VLAA/VLAA-Thinker-Qwen2VL-2B
Image-Text-to-Text • 2B • Updated • 19 • 1 -
UCSC-VLAA/VLAA-Thinker-Qwen2VL-7B
Image-Text-to-Text • 8B • Updated • 3
-
UCSC-VLAA/m1-7B-1K
Question Answering • 8B • Updated • 17 • 1 -
m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning with Large Language Models
Paper • 2504.00869 • Published • 10 -
UCSC-VLAA/m1-32B-1K
Question Answering • 33B • Updated • 10 -
UCSC-VLAA/m1-7B-23K
Question Answering • 8B • Updated • 52
CLIPS
Staged post-training along the perception → reasoning capability axis. Models, datasets, paper. ICML 2026.
-
UCSC-VLAA/VLM-CapCurriculum-Qwen3-VL-8B-Staged
Image-Text-to-Text • 9B • Updated • 49 -
UCSC-VLAA/VLM-CapCurriculum-Qwen2.5-VL-7B-Staged
Image-Text-to-Text • 8B • Updated • 48 -
UCSC-VLAA/VLM-CapCurriculum-InternVL3-8B-Staged
Image-Text-to-Text • 8B • Updated • 50 -
UCSC-VLAA/VLM-CapCurriculum-InternVL3.5-8B-Staged
Image-Text-to-Text • 9B • Updated • 42
-
UCSC-VLAA/openvision-vit-tiny-patch16-224
Image Feature Extraction • Updated • 33 -
UCSC-VLAA/openvision-vit-tiny-patch8-224
Image Feature Extraction • Updated • 6 -
UCSC-VLAA/openvision-vit-tiny-patch16-384
Image Feature Extraction • Updated • 3 -
UCSC-VLAA/openvision-vit-tiny-patch8-160
Image Feature Extraction • Updated
-
UCSC-VLAA/MedReason-8B
Question Answering • 8B • Updated • 1.66k • 15 -
UCSC-VLAA/MedReason-Mistral
Question Answering • 266k • Updated • 9 -
UCSC-VLAA/MedReason
Viewer • Updated • 32.7k • 746 • 84 -
MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs
Paper • 2504.00993 • Published • 3
-
UCSC-VLAA/ViT-bigG-14-CLIPA-datacomp1B
Zero-Shot Image Classification • Updated • 55 • 4 -
UCSC-VLAA/ViT-bigG-14-CLIPA-336-datacomp1B
Zero-Shot Image Classification • Updated • 96 • 4 -
UCSC-VLAA/ViT-L-14-CLIPA-336-datacomp1B
Zero-Shot Image Classification • Updated • 300 • 2 -
UCSC-VLAA/ViT-L-14-CLIPA-datacomp1B
Zero-Shot Image Classification • Updated • 415 • 3
-
UCSC-VLAA/ClinSeek-35B-A3B
Text Generation • 35B • Updated • 57 -
UCSC-VLAA/ClinSeek-Bench
Viewer • Updated • 2.79k • 78 • 1 -
UCSC-VLAA/ClinSeek-Evaluation-Results
Preview • Updated • 1.28k -
ClinSeekAgent: Automating Multimodal Evidence Seeking for Agentic Clinical Reasoning
Paper • 2605.20176 • Published • 10
Staged post-training along the perception → reasoning capability axis. Models, datasets, paper. ICML 2026.
-
UCSC-VLAA/VLM-CapCurriculum-Qwen3-VL-8B-Staged
Image-Text-to-Text • 9B • Updated • 49 -
UCSC-VLAA/VLM-CapCurriculum-Qwen2.5-VL-7B-Staged
Image-Text-to-Text • 8B • Updated • 48 -
UCSC-VLAA/VLM-CapCurriculum-InternVL3-8B-Staged
Image-Text-to-Text • 8B • Updated • 50 -
UCSC-VLAA/VLM-CapCurriculum-InternVL3.5-8B-Staged
Image-Text-to-Text • 9B • Updated • 42
A Family of Unified Visual Encoder with Unified Visual Representation.
-
UCSC-VLAA/gpt-image-edit-training
Image-to-Image • Updated • 26 -
UCSC-VLAA/GPT-Image-Edit-1.5M
Viewer • Updated • 2.78M • 3.71k • 78 -
GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset
Paper • 2507.21033 • Published • 23 -
UCSC-VLAA/gpt-image-edit-benchmark-results
Viewer • Updated • 1.21k • 86 • 1
-
UCSC-VLAA/openvision-vit-tiny-patch16-224
Image Feature Extraction • Updated • 33 -
UCSC-VLAA/openvision-vit-tiny-patch8-224
Image Feature Extraction • Updated • 6 -
UCSC-VLAA/openvision-vit-tiny-patch16-384
Image Feature Extraction • Updated • 3 -
UCSC-VLAA/openvision-vit-tiny-patch8-160
Image Feature Extraction • Updated
-
UCSC-VLAA/MedVLThinker-3B-SFT_m23k
Image-Text-to-Text • 4B • Updated • 8 -
UCSC-VLAA/MedVLThinker-3B-SFT_PMC
Image-Text-to-Text • 4B • Updated • 7 -
UCSC-VLAA/MedVLThinker-7B-SFT_m23k
Image-Text-to-Text • 8B • Updated • 6 -
UCSC-VLAA/MedVLThinker-3B-SFT_m23k-RL_PMC
Image-Text-to-Text • 4B • Updated • 4 • 1
-
UCSC-VLAA/VLAA-Thinker-Qwen2.5VL-3B
Image-Text-to-Text • 4B • Updated • 436 • 5 -
UCSC-VLAA/VLAA-Thinker-Qwen2.5VL-7B
Image-Text-to-Text • 8B • Updated • 142 • 2 -
UCSC-VLAA/VLAA-Thinker-Qwen2VL-2B
Image-Text-to-Text • 2B • Updated • 19 • 1 -
UCSC-VLAA/VLAA-Thinker-Qwen2VL-7B
Image-Text-to-Text • 8B • Updated • 3
-
UCSC-VLAA/MedReason-8B
Question Answering • 8B • Updated • 1.66k • 15 -
UCSC-VLAA/MedReason-Mistral
Question Answering • 266k • Updated • 9 -
UCSC-VLAA/MedReason
Viewer • Updated • 32.7k • 746 • 84 -
MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs
Paper • 2504.00993 • Published • 3
-
UCSC-VLAA/m1-7B-1K
Question Answering • 8B • Updated • 17 • 1 -
m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning with Large Language Models
Paper • 2504.00869 • Published • 10 -
UCSC-VLAA/m1-32B-1K
Question Answering • 33B • Updated • 10 -
UCSC-VLAA/m1-7B-23K
Question Answering • 8B • Updated • 52
CLIPS
-
UCSC-VLAA/ViT-bigG-14-CLIPA-datacomp1B
Zero-Shot Image Classification • Updated • 55 • 4 -
UCSC-VLAA/ViT-bigG-14-CLIPA-336-datacomp1B
Zero-Shot Image Classification • Updated • 96 • 4 -
UCSC-VLAA/ViT-L-14-CLIPA-336-datacomp1B
Zero-Shot Image Classification • Updated • 300 • 2 -
UCSC-VLAA/ViT-L-14-CLIPA-datacomp1B
Zero-Shot Image Classification • Updated • 415 • 3