Change the repository type filter
All
Repositories list
95 repositories
VP-VLA
PublicDreamOmni2
PublicThis project is the official implementation of 'DreamOmni2: Multimodal Instruction-based Editing and Generation (CVPR2026 Highlight)''MGM-Omni
PublicMGM-Omni: Scaling Omni LLMs to Personalized Long-Horizon SpeechRePlan
PublicRePlan: Reasoning-Guided Region Planning for Complex Instruction-Based Image EditingVisionDirector
PublicVisionReasoner
PublicVisionReasoner: Unified Reasoning-Integrated Visual Perception via Reinforcement LearningScaf-GRPO
PublicSearchGym
PublicTraveLLaMA
PublicSeg-Zero
PublicProject Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"DreamOmni3
PublicSmartSwitch
PublicVisionThink
PublicLSDBench
PublicA benchmark that focuses on the sampling dilemma in long-video tasks. Through well-designed tasks, it evaluates the sampling efficiency of long-video VLMs. (ICC…Jenga
PublicVisionZip
PublicOfficial repository for VisionZip (CVPR 2025)- Parametric Contrastive Learning (ICCV2021) & GPaCo (TPAMI 2023)
TGDPO
Public[ICML 2025] TGDPO: Harnessing Token-Level Reward Guidance for Enhancing Direct Preference OptimizationVideo-P2P
PublicVideo-P2P: Video Editing with Cross-attention ControlRL-GPT
PublicMagicMirror
PublicLogits-Based-Finetuning
PublicLLMGA
PublicThis project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant', ECCV2024 OralARPO
PublicMoTCoder
PublicThis is the official code repository of MoTCoder: Elevating Large Language Models with Modular of Thought for Challenging Programming Tasks.Open-Code-Zero
PublicLISA
PublicProject Page for "LISA: Reasoning Segmentation via Large Language Model"Step-DPO
Public
ProTip! When viewing an organization's repositories, you can use the
props. filter to filter by custom property.