Om AI Lab
Open Multimodal AGI Research
Repositories
- VL-CheckList Public
Evaluating Vision & Language Pretraining Models with Objects, Attributes and Relations. [EMNLP 2022]
- GroundVLP Public
GroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection (AAAI 2024)
- VLM-FO1 Public
VLM-FO1: Bridging the Gap Between High-Level Reasoning and Fine-Grained Perception in VLMs
- ZoomEye Public
[EMNLP-2025 Oral] ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration