Junfei Wu

Hyperwjf

AI & ML interests

None yet

Recent Activity

upvoted a paper about 15 hours ago

The Verification Horizon: No Silver Bullet for Coding Agent Rewards

Paper • 2606.26300 • Published 3 days ago • 30

updated 2 models 4 months ago

AntResearchNLP/ViLaVT-cold-start

9B • Updated Feb 20 • 3

AntResearchNLP/ViLaVT

9B • Updated Feb 20 • 4

updated a collection 4 months ago

ViLaVT

Collection

3 items • Updated Feb 13

published 2 models 4 months ago

AntResearchNLP/ViLaVT-cold-start

9B • Updated Feb 20 • 3

AntResearchNLP/ViLaVT

9B • Updated Feb 20 • 4

upvoted a paper 5 months ago

How Well Do Models Follow Visual Instructions? VIBE: A Systematic Benchmark for Visual Instruction-Driven Image Editing

Paper • 2602.01851 • Published Feb 2 • 16

updated a dataset 5 months ago

AntResearchNLP/ViLaVT-Data-HF

Preview • Updated Jan 24 • 96

published a dataset 5 months ago

AntResearchNLP/ViLaVT-Data-HF

Preview • Updated Jan 24 • 96

upvoted a paper 5 months ago

MegaFlow: Large-Scale Distributed Orchestration System for the Agentic Era

Paper • 2601.07526 • Published Jan 12 • 23

upvoted a paper 6 months ago

ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding

Paper • 2512.13586 • Published Dec 15, 2025 • 93

upvoted a paper 7 months ago

Test-Time Spectrum-Aware Latent Steering for Zero-Shot Generalization in Vision-Language Models

Paper • 2511.09809 • Published Nov 12, 2025 • 5

upvoted a paper 8 months ago

Latent Sketchpad: Sketching Visual Thoughts to Elicit Multimodal Reasoning in MLLMs

Paper • 2510.24514 • Published Oct 28, 2025 • 22

upvoted 2 papers 9 months ago

AVoCaDO: An Audiovisual Video Captioner Driven by Temporal Orchestration

Paper • 2510.10395 • Published Oct 12, 2025 • 32

PromptCoT 2.0: Scaling Prompt Synthesis for Large Language Model Reasoning

Paper • 2509.19894 • Published Sep 24, 2025 • 34

liked a dataset 10 months ago

AntResearchNLP/ViLaSR-data

Updated Jun 24, 2025 • 67 • 5

New activity in prithivMLmods/Multimodal-VLM-v1.0 10 months ago

Friendly note: ViLaSR relies on thinking with images, but the demo uses text-only reasoning

#4 opened 10 months ago by Hyperwjf

liked a Space 10 months ago

Multimodal VLM v1.0

⚡

OCR, VQA, Thinking and Object Detection.
