Skip to content
@om-ai-lab

Om AI Lab

Open Multimodal AGI Research

Popular repositories Loading

  1. OmAgent OmAgent Public

    Build multimodal language agents for fast prototype and production

    Python 1.3k 107

  2. OmDet OmDet Public

    Real-time and accurate open-vocabulary end-to-end object detection

    Python 1.2k 142

  3. RS5M RS5M Public

    RS5M: a large-scale vision language dataset for remote sensing [TGRS]

    Python 231 10

  4. awesome-RSVLM awesome-RSVLM Public

    Collection of Remote Sensing Vision-Language Models

    127 4

  5. VL-CheckList VL-CheckList Public

    Evaluating Vision & Language Pretraining Models with Objects, Attributes and Relations. [EMNLP 2022]

    Python 123 4

  6. GroundVLP GroundVLP Public

    GroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection (AAAI 2024)

    Jupyter Notebook 63 5

Repositories

Showing 10 of 14 repositories
  • OmAgent Public

    Build multimodal language agents for fast prototype and production

    om-ai-lab/OmAgent’s past year of commit activity
    Python 1,264 Apache-2.0 107 6 8 Updated Jan 20, 2025
  • open-agent-leaderboard Public

    Reproducible Agent Research

    om-ai-lab/open-agent-leaderboard’s past year of commit activity
    Python 6 0 0 0 Updated Jan 16, 2025
  • OmChat Public

    A suite of multimodal language models that are powerful and efficient

    om-ai-lab/OmChat’s past year of commit activity
    Python 17 Apache-2.0 2 0 0 Updated Jan 13, 2025
  • OmAgentDocs Public
    om-ai-lab/OmAgentDocs’s past year of commit activity
    HTML 3 1 0 0 Updated Jan 8, 2025
  • ZoomEye Public

    ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration

    om-ai-lab/ZoomEye’s past year of commit activity
    Python 22 2 0 0 Updated Jan 1, 2025
  • OmDet Public

    Real-time and accurate open-vocabulary end-to-end object detection

    om-ai-lab/OmDet’s past year of commit activity
    Python 1,151 Apache-2.0 142 4 0 Updated Dec 18, 2024
  • RS5M Public

    RS5M: a large-scale vision language dataset for remote sensing [TGRS]

    om-ai-lab/RS5M’s past year of commit activity
    Python 231 MIT 10 2 0 Updated Sep 29, 2024
  • VL-CheckList Public

    Evaluating Vision & Language Pretraining Models with Objects, Attributes and Relations. [EMNLP 2022]

    om-ai-lab/VL-CheckList’s past year of commit activity
    Python 123 4 8 0 Updated Sep 29, 2024
  • OmModel Public

    A collection of strong multimodal models for building multimodal AGI agents

    om-ai-lab/OmModel’s past year of commit activity
    38 Apache-2.0 1 1 0 Updated Jul 9, 2024
  • awesome-RSVLM Public

    Collection of Remote Sensing Vision-Language Models

    om-ai-lab/awesome-RSVLM’s past year of commit activity
    127 MIT 4 0 1 Updated May 13, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Most used topics

Loading…