Change the repository type filter
All
Repositories list
13 repositories
OmAgent
PublicA Language and Multimodal Agents Framework for Smart Device and MoreOmDet
PublicReal-time and accurate open-vocabulary end-to-end object detection- ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration
OmAgentDocs
Public- RS5M: a large-scale vision language dataset for remote sensing [TGRS]
VL-CheckList
PublicEvaluating Vision & Language Pretraining Models with Objects, Attributes and Relations. [EMNLP 2022]OmChat
PublicOmModel
Public- A Comprehensive Evaluation Benchmark for Open-Vocabulary Detection (AAAI 2024)
GroundVLP
PublicGroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection (AAAI 2024)habitat-lab
Public