arxiv:2505.04410
Junjie Wang
xiaomoguhzz
AI & ML interests
computer vision, Vision-Language Models, Multimodal Large Language Models
Recent Activity
updated a dataset about 5 hours ago
xiaomoguhzz/codex-ppt-temp-visual-encoder-assets published a dataset about 5 hours ago
xiaomoguhzz/codex-ppt-temp-visual-encoder-assets