There has been substantial work on using pre-trained 2D vision models within 3D frameworks. We chose to build upon Lexicon3D (Man et al., 2024), whose ablation study demonstrated improved performance on the semantic segmentation task by creating an ensemble of models involving LSeg, Stable Diffusion, and Swin3D. We investigated feature-based fusion strategies, including additive and interleaved approaches, to refine the Mixture of Features paradigm. By extending the concept to encompass semantic segmentation, our goal is to identify optimal fusion techniques that outperform existing benchmarks, leveraging diverse pre-trained embeddings to improve understanding of 3D spaces.
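As a rough illustration of the two fusion strategies, here is a minimal PyTorch sketch. It assumes per-point features from two frozen encoders have already been projected to a common dimension; the function names and shapes are illustrative only, not the exact implementation from the paper:

```python
import torch

def additive_fusion(feat_a: torch.Tensor, feat_b: torch.Tensor) -> torch.Tensor:
    """Element-wise sum of aligned (N, D) embeddings; output stays (N, D)."""
    return feat_a + feat_b

def interleaved_fusion(feat_a: torch.Tensor, feat_b: torch.Tensor) -> torch.Tensor:
    """Alternate channels from each encoder; output is (N, 2 * D)."""
    n, d = feat_a.shape
    fused = torch.empty(n, 2 * d, dtype=feat_a.dtype, device=feat_a.device)
    fused[:, 0::2] = feat_a  # even channels from encoder A (e.g., LSeg)
    fused[:, 1::2] = feat_b  # odd channels from encoder B (e.g., Stable Diffusion)
    return fused
```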
The paper describing this work is MoF-Paper.pdf.
Please install the required packages and dependencies according to the requirements.txt file:
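pip install -r requirements.txt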
In addition,
- To use the LSeg model, please follow this repo to install the necessary dependencies.
- To use the Swin3D model, please follow this repo and this repo to install the necessary dependencies.
Finally, please download the ScanNet dataset from the official website and follow the instructions here to preprocess it, producing RGB video frames and point clouds for each ScanNet scene.
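As a quick sanity check after preprocessing, you can load a scene's point cloud. This is a minimal sketch assuming the OpenScene-style .pth layout; the file name and the (coords, colors, labels) tuple are assumptions, so adjust to your preprocessing output:

```python
import torch

# Hypothetical scene file; the path and tuple layout follow the OpenScene
# convention and may differ in your setup.
coords, colors, labels = torch.load("dataset/ScanNet/openscene/scene0000_00.pth")
print(coords.shape, colors.shape, labels.shape)  # e.g., (N, 3), (N, 3), (N,)
```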
To extract features from the foundation models, please run the corresponding scripts in the lexicon3d folder. For example, to extract features from the CLIP model, please run the following command:
python fusion_scannet_clip.py --data_dir dataset/ScanNet/openscene/ --output_dir dataset/lexicon3d/clip/ --split train --prefix clip
This script extracts CLIP features for the ScanNet dataset. The extracted features are saved in the output_dir folder as per-scene files containing the feature embeddings, points, and voxel grids.
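To use the extracted features downstream, load them with torch.load. This is a minimal sketch; the file name below is hypothetical, so check the fusion script for the exact output format before indexing into the result:

```python
import torch

# Hypothetical output file; the name and stored structure are assumptions
# based on the description above (embeddings, points, and voxel grids).
scene_feats = torch.load("dataset/lexicon3d/clip/scene0000_00.pt")
print(type(scene_feats))  # inspect the stored structure before using it
```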
This repo is built on the fantastic work of Lexicon3D and OpenScene.