rp-yu's Collections
VPT Models
Updated Feb 20, 2025
Qwen2-VL models with Visual Perception Tokens, or used in the training process.
rp-yu/Qwen2-VL-2b-VPT-Seg • Image-Text-to-Text • 3B • Updated Jul 14, 2025 • 4 • 1
rp-yu/Qwen2-VL-2b-VPT-CLIP • Image-Text-to-Text • Updated Mar 11, 2025 • 6 • 1
rp-yu/Qwen2-VL-2b-VPT-Seg-Alignment • Image-Text-to-Text • Updated Mar 11, 2025 • 2
rp-yu/Qwen2-VL-2b-VPT-Det-Alignment • Image-Text-to-Text • Updated Mar 11, 2025 • 8
rp-yu/Qwen2-VL-2b-VPT-Det • Image-Text-to-Text • Updated Mar 11, 2025 • 2
rp-yu/Qwen2-VL-7b-VPT-CLIP • Image-Text-to-Text • 8B • Updated Jul 7, 2025 • 5 • 1
rp-yu/Qwen2-VL-2b-VPT-Det-NoPrompt • Image-Text-to-Text • Updated Mar 11, 2025 • 3