M2C: Mask-to-Concept

Quan Zhou^1* · Shaoqing Zhai^1* · Qiang Hu^2,† · Jia Chen³ · Qiang Li² · Zhiwei Wang^2,†
¹WuHan University of Technology ²Huazhong University of Science and Technology
³Changzhou United Imaging Surgical Co., Ltd.
*co-first author †corresponding author

This work presents Mask-to-Concept (M2C), a efficient fine-tuning strategy for SAM3.

Overview of M2C-based human-in-the-loop annotation system.

🌟 Features

Pixel-space diffusion generation (operating directly in image space, without VAE or latent representations), capable of producing flying-pixel-free point clouds from estimated depth maps.
Our model integrates the discriminative representation (ViT) into generative modeling (DiT), fully leveraging the strengths of both paradigms.
Our network architecture is purely transformer-based, containing no convolutional layers.
Although our model is trained at a fixed resolution of 1024×768, it can flexibly support various input resolutions and aspect ratios during inference.

News

2026-03: code, models, and demo are all released.

Usage

Environment Configuration

Please refer to the official environment configuration of SAM3 to set up your Python environment.
Download the SAM3 official weights and put them in the checkpoint/ directory.

Dataset Preparation

Download the following datasets and organize them as follows:

datasets/
├── Kvasir-SEG/
├── ISIC-2017/

Few-shot Testing

To perform few-shot evaluation (e.g., 1-shot), split the Kvasir-SEG dataset into a Support Set and a Query Set (Ratio 1:9). Place them in:

datasets/Kavsir-seg/support/
datasets/Kavsir-seg/query/

Step 1: Run the Controller

python controller.py --pool_root "datasets/Kavsir-seg/support" --n_shot 1 --few_shot

Step 2: Run Evaluation

python test.py --test_pool_root "datasets/Kavsir-seg/query"

Simulated Annotation Process

To simulate the full annotation workflow using the entire dataset (e.g., 5-shot setup), put the full dataset in datasets/Kavsir-seg/ and run:

python controller.py --pool_root "datasets/Kavsir-seg" --n_shot 5

Acknowledgement

We are grateful to the Segment Anything Model 3 (SAM3) team for their code and model release.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.github/workflows		.github/workflows
assets		assets
examples		examples
figs		figs
sam3		sam3
scripts		scripts
.gitignore		.gitignore
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
controller.py		controller.py
pyproject.toml		pyproject.toml
test.py		test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

M2C: Mask-to-Concept

🌟 Features

News

Usage

Environment Configuration

Dataset Preparation

Few-shot Testing

Simulated Annotation Process

Acknowledgement

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

M2C: Mask-to-Concept

🌟 Features

News

Usage

Environment Configuration

Dataset Preparation

Few-shot Testing

Simulated Annotation Process

Acknowledgement

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages