GitHub - NTU-AI4X/LaMoFCBench: Benchmark for Large Model Feature Coding

LaMoFCBench is a benchmark and evaluation toolkit for universal large model feature coding across multiple modalities.

Project Overview

This repository currently covers four task groups:

Common Vision Understanding (CVU), model family: DINOv3-ViT7B
Common Language Understanding (CLU), model families: Qwen3-8B, FalconMamba-7B
Common Audio Understanding (CAU), model family: KimiAudio-7B
Controllable Text-to-Image (CTTI), model family: StableDiffusion3.5 + ControlNet

Core directories:

coding/: feature coding pipeline (feature_coding.py) and batch launcher (feature_coding.sh)
machine/: downstream task evaluation scripts
lmfc_utils/handlers/: feature parsers/packers/unpackers
lmfc_utils/custom_codecs/: wrappers around the learned codecs implemented in CompressAI, used by the feature coding pipeline
lmfc_utils/transform_mapping/: quantization mapping files

Data Resources

All hosted resources are under: https://www.modelscope.cn/collections/yooweey/LaMoFCBench

Main datasets:

Raw datasets: https://www.modelscope.cn/datasets/yooweey/FeatureCoding-RawDatasets
Raw extracted features:
- DINOv3: https://www.modelscope.cn/datasets/yooweey/FeatureCoding-DINOv3
- Qwen3/FalconMamba: https://www.modelscope.cn/datasets/yooweey/FeatureCoding-LargeLanguageModel
- KimiAudio: https://www.modelscope.cn/datasets/yooweey/FeatureCoding-KimiAudio
- SD3.5 + ControlNet: https://www.modelscope.cn/datasets/yooweey/FeatureCoding-StableDiffusion3.5Large
Post-coding features:

Quick Start

Environment

Recommended baseline:

Python 3.10+
PyTorch + CUDA (for GPU runs)
compressai, einops, zstandard, tabulate
task-specific dependencies used by scripts under machine/
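
Before running the pipeline, it can help to confirm that the baseline packages are importable. The following is a minimal sketch and not part of the repository: the helper names check_environment and python_ok are hypothetical, and the import name torch for PyTorch is assumed. It only tests importability; it does not verify versions or CUDA availability.

```python
import importlib.util
import sys

# Import names for the baseline listed above ("torch" is the import name of PyTorch).
REQUIRED = ["torch", "compressai", "einops", "zstandard", "tabulate"]


def check_environment(packages=REQUIRED):
    """Return a dict mapping each top-level package name to True if it can be found."""
    return {name: importlib.util.find_spec(name) is not None for name in packages}


def python_ok(min_version=(3, 10)):
    """Return True if the running interpreter meets the recommended minimum version."""
    return sys.version_info >= min_version


if __name__ == "__main__":
    print("Python 3.10+:", python_ok())
    for name, found in check_environment().items():
        print(f"{name}: {'found' if found else 'MISSING'}")
```

Task-specific dependencies under machine/ are not covered by this check and should be installed according to the scripts being run.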

Feature Coding

In the coding/ folder:

download the pre-trained codec weights with the shell script download_codec_weights.sh;
download the pre-extracted large model features from the links listed under Data Resources;
update the paths to these features in feature_coding.sh;
run feature_coding.sh to encode the pre-extracted large model features.

Notes for feature_coding.sh:

valid --handler values come from lmfc_utils/handlers/__init__.py
default mapping config is lmfc_utils/transform_mapping/10samples-8bits/mapping.json
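
The directory name 10samples-8bits suggests that features are mapped to 8-bit integers before coding. The format of mapping.json is not described here, so the following is only an illustrative sketch of generic uniform 8-bit quantization, not the repository's actual mapping. The functions quantize and dequantize and the range parameters lo and hi are hypothetical.

```python
def quantize(values, lo, hi, bits=8):
    """Uniformly map floats in [lo, hi] to integer codes in [0, 2**bits - 1].

    Values outside the range are clipped. lo and hi are assumed inputs here;
    how the real mapping file derives or stores them is not known.
    """
    if hi <= lo:
        raise ValueError("hi must be greater than lo")
    levels = (1 << bits) - 1
    scale = (hi - lo) / levels
    codes = []
    for v in values:
        clipped = min(max(v, lo), hi)
        codes.append(round((clipped - lo) / scale))
    return codes


def dequantize(codes, lo, hi, bits=8):
    """Map integer codes back to approximate float values in [lo, hi]."""
    if hi <= lo:
        raise ValueError("hi must be greater than lo")
    levels = (1 << bits) - 1
    scale = (hi - lo) / levels
    return [lo + c * scale for c in codes]


if __name__ == "__main__":
    features = [-1.0, -0.25, 0.3, 1.0, 5.0]  # made-up feature values
    codes = quantize(features, lo=-1.0, hi=1.0)
    print(codes)
    print(dequantize(codes, lo=-1.0, hi=1.0))
```

With this scheme the reconstruction error for in-range values is at most half a quantization step, (hi - lo) / (2 * 255) for 8 bits; out-of-range values are clipped to the nearest bound.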

Downstream Evaluation

The shell scripts in the machine/ folder load reconstructed features from --load_root, inject them into task inference, and report task-specific metrics.
