This is the official PyTorch implementation of our paper: FoodSAM: Any Food Segmentation.
The Segment Anything Model (SAM) demonstrates strong performance on various segmentation benchmarks, showcasing impressive zero-shot transfer capabilities on 23 diverse segmentation datasets. However, SAM lacks class-specific information for each mask. To address this limitation and explore SAM's zero-shot capability for food image segmentation, we propose a novel framework called FoodSAM. This approach integrates the coarse semantic mask with SAM-generated masks to enhance semantic segmentation quality. Besides, it can perform instance segmentation on food images. Furthermore, FoodSAM extends its zero-shot capability to panoptic segmentation by incorporating an object detector, which enables FoodSAM to effectively capture non-food object information. Remarkably, this pioneering framework is the first work to achieve instance, panoptic, and promptable segmentation on food images.
FoodSAM contains three basic models: SAM, a semantic segmenter, and an object detector. SAM generates many class-agnostic binary masks, the semantic segmenter provides food category labels via mask-category matching, and the object detector provides non-food classes for the background masks. FoodSAM then enhances the semantic mask via a merge strategy and produces instance and panoptic results (a simplified sketch of the mask-category match and merge is given below). Moreover, a seamless prompt-prior selection is integrated into the object detector to achieve promptable segmentation.
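To make the mask-category match and merge step concrete, here is a minimal NumPy sketch of the idea. The function name, argument names, and the vote-ratio threshold are illustrative assumptions, not the repository's actual API: each class-agnostic SAM mask takes the majority category found under it in the coarse semantic mask, and sufficiently confident regions overwrite the original prediction.

```python
import numpy as np

def merge_sam_masks_with_semantic(sam_masks, semantic_mask, min_vote_ratio=0.5):
    """Illustrative sketch (hypothetical names/threshold): label each
    class-agnostic SAM mask by majority vote over the coarse semantic mask,
    then merge the labeled masks into an enhanced semantic prediction."""
    enhanced = semantic_mask.copy()
    for mask in sam_masks:                        # each mask: HxW boolean array
        labels, counts = np.unique(semantic_mask[mask], return_counts=True)
        if counts.size == 0:                      # empty mask, nothing to vote on
            continue
        top_label = labels[np.argmax(counts)]     # majority category under the mask
        if counts.max() / counts.sum() >= min_vote_ratio:
            enhanced[mask] = top_label            # overwrite only confident regions
    return enhanced
```

In this sketch, each labeled SAM mask can also be kept as a separate instance, which is how the framework produces instance and panoptic outputs on top of the enhanced semantic mask.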
Please follow our installation.md to set up the environment.
You can run semantic and panoptic segmentation with a few commands.
```bash
# semantic segmentation for a single image
python FoodSAM/semantic.py --img_path <path/to/img> --output <path/to/output>

# semantic segmentation for a folder
python FoodSAM/semantic.py --data_root <path/to/folder> --output <path/to/output>

# panoptic segmentation for a single image
python FoodSAM/panoptic.py --img_path <path/to/img> --output <path/to/output>

# panoptic segmentation for a folder
python FoodSAM/panoptic.py --data_root <path/to/folder> --output <path/to/output>
```
Furthermore, by passing the --eval flag (which sets args.eval to true), the model also saves the semantic masks and evaluates the metrics.
Here are examples of semantic segmentation and panoptic segmentation on the FoodSeg103 dataset:
```bash
python FoodSAM/semantic.py --data_root dataset/FoodSeg103/Images --output Output/Semantic_Results --eval
python FoodSAM/panoptic.py --data_root dataset/FoodSeg103/Images --output Output/Panoptic_Results
```
Semantic segmentation results on FoodSeg103:

| Method | mIoU | aAcc | mAcc |
|---|---|---|---|
| SETR_MLA (baseline) | 45.10 | 83.53 | 57.44 |
| FoodSAM | 46.42 | 84.10 | 58.27 |
Semantic segmentation results on UECFoodPix Complete:

| Method | mIoU | aAcc | mAcc |
|---|---|---|---|
| deeplabV3+ (baseline) | 65.61 | 88.20 | 77.56 |
| FoodSAM | 66.14 | 88.47 | 78.01 |
A large part of the code is borrowed from the following wonderful works:
The model is licensed under the Apache 2.0 license.
If you find our work useful, please cite it as follows:
```bibtex
@ARTICLE{10306316,
  author={Lan, Xing and Lyu, Jiayi and Jiang, Hanyu and Dong, Kun and Niu, Zehai and Zhang, Yi and Xue, Jian},
  journal={IEEE Transactions on Multimedia},
  title={FoodSAM: Any Food Segmentation},
  year={2023},
  volume={},
  number={},
  pages={1-14},
  doi={10.1109/TMM.2023.3330047}
}

@misc{lan2023foodsam,
  title={FoodSAM: Any Food Segmentation},
  author={Xing Lan and Jiayi Lyu and Hanyu Jiang and Kun Dong and Zehai Niu and Yi Zhang and Jian Xue},
  year={2023},
  eprint={2308.05938},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
}
```