Skip to content
@CASIA-IVA-Lab

CASIA-IVA-Lab

Popular repositories Loading

  1. DANet DANet Public

    Dual Attention Network for Scene Segmentation (CVPR2019)

    Python 2.5k 484

  2. VALOR VALOR Public

    [TPAMI2024] Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset

    Python 307 18

  3. VAST VAST Public

    [NIPS2023] Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset

    Jupyter Notebook 298 18

  4. MRES MRES Public

    This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentation", accepted by CVPR 2024.

    72

  5. VideoNIAH VideoNIAH Public

    VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs

    Python 55

  6. PrefixGrouper PrefixGrouper Public

    An efficient GRPO training util.

    Python 55 3

Repositories

Showing 10 of 16 repositories
  • S1-MMAlign Public

    S1-MMAlign: 科学多模态数据集(入口页,数据托管于Hugging Face)

    CASIA-IVA-Lab/S1-MMAlign’s past year of commit activity
    0 0 0 0 Updated Mar 19, 2026
  • UrbanNav Public

    [AAAI 2026] Official implementation of paper "UrbanNav: Learning Language-Guided Embodied Urban Navigation from Web-Scale Human Trajectories"

    CASIA-IVA-Lab/UrbanNav’s past year of commit activity
    Python 46 MIT 3 3 0 Updated Jan 30, 2026
  • ChatSearch Public

    ChatSearch: a Dataset and a Generative Retrieval Model for General Conversational Image Retrieval

    CASIA-IVA-Lab/ChatSearch’s past year of commit activity
    Python 9 Apache-2.0 0 1 0 Updated Jan 6, 2026
  • VRoPE Public

    [EMNLP 2025 Main] Official implementation of VRoPE: Rotary Position Embedding for Video Large Language Models.

    CASIA-IVA-Lab/VRoPE’s past year of commit activity
    Python 27 0 0 0 Updated Nov 18, 2025
  • PrefixGrouper Public

    An efficient GRPO training util.

    CASIA-IVA-Lab/PrefixGrouper’s past year of commit activity
    Python 55 MIT 3 0 0 Updated Jun 13, 2025
  • VideoNIAH Public

    VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs

    CASIA-IVA-Lab/VideoNIAH’s past year of commit activity
    Python 55 0 4 0 Updated Mar 9, 2025
  • COSA Public

    [ICLR2024] Codes and Models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model

    CASIA-IVA-Lab/COSA’s past year of commit activity
    Python 43 MIT 3 3 0 Updated Dec 25, 2024
  • VALOR Public

    [TPAMI2024] Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset

    CASIA-IVA-Lab/VALOR’s past year of commit activity
    Python 307 MIT 18 7 0 Updated Dec 25, 2024
  • DANet Public

    Dual Attention Network for Scene Segmentation (CVPR2019)

    CASIA-IVA-Lab/DANet’s past year of commit activity
    Python 2,456 MIT 484 61 1 Updated Dec 23, 2024
  • MRES Public

    This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentation", accepted by CVPR 2024.

    CASIA-IVA-Lab/MRES’s past year of commit activity
    72 Apache-2.0 0 5 0 Updated Jun 3, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Most used topics

Loading…