FusionSense

Integrates the vision, touch, and common-sense information of foundation models, customized to the agent's perceptual needs.

Usage

  1. Select frames:

    Run delete.py to select the frames you want (or select them manually). You will get a folder of selected frames and a transforms.json.

    Make sure transforms.json is in the right format; a minimal sketch is shown below.
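
    A minimal sketch of a transforms.json in the expected (nerfstudio-style) layout, written from Python; the intrinsics and pose values here are placeholder assumptions, so substitute your own camera parameters.

    import json

    # Illustrative example only: shared pinhole intrinsics plus one frame with a
    # 4x4 camera-to-world matrix. Replace every value with your capture's data.
    transforms = {
        "fl_x": 600.0, "fl_y": 600.0,   # focal lengths in pixels (assumed values)
        "cx": 320.0, "cy": 240.0,       # principal point (assumed values)
        "w": 640, "h": 480,             # image size
        "frames": [
            {
                "file_path": "images/rgb_1.png",
                "transform_matrix": [
                    [1.0, 0.0, 0.0, 0.0],
                    [0.0, 1.0, 0.0, 0.0],
                    [0.0, 0.0, 1.0, 0.0],
                    [0.0, 0.0, 0.0, 1.0],
                ],
            }
        ],
    }

    with open("transforms.json", "w") as f:
        json.dump(transforms, f, indent=4)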

  2. Generate mask images with Grounded-SAM-2:

    Set your scene path and a prompt text that ends with '.',
    e.g. 'transparent white statue.'

    python grounded_sam2_hf_model_imgs_MaskExtract.py   

    Run the script to extract the masks.

    If num_no_detection is not 0, you need to reselect those frames. You will then find the masks in path/masks, and you can check the annotated frames under path/ to see the results more directly.
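
    As a quick sanity check, the sketch below counts masks that came out completely empty, which mirrors the num_no_detection idea; the masks/ location and PNG naming are assumptions based on the output described above.

    import glob

    import numpy as np
    from PIL import Image

    mask_paths = sorted(glob.glob("your-path/masks/*.png"))  # assumed output folder
    empty = [p for p in mask_paths
             if np.asarray(Image.open(p).convert("L")).max() == 0]

    print(f"{len(empty)} of {len(mask_paths)} masks are empty")
    for p in empty:
        print("no detection:", p)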

  3. Generate the visual hull from the masks and transforms.json:

    Run VisualHull.py to generate the visual hull.

    python VisualHull.py --path your-path  

    You will get a point cloud file foreground_pcd.ply and a screenshot voxels.png for checking whether the generated visual hull is correct.
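
    Conceptually, the visual hull comes from space carving: a voxel is kept only if it projects inside the object mask in every view. The sketch below is a minimal, unoptimized version of that idea, not the repository's VisualHull.py; the mask naming, OpenCV-style camera-to-world poses, and grid bounds are all assumptions.

    import json

    import numpy as np
    import open3d as o3d
    from PIL import Image

    root = "your-path"
    meta = json.load(open(f"{root}/transforms.json"))
    K = np.array([[meta["fl_x"], 0.0, meta["cx"]],
                  [0.0, meta["fl_y"], meta["cy"]],
                  [0.0, 0.0, 1.0]])

    # Coarse voxel grid around the object; bounds and resolution are guesses.
    axis = np.linspace(-0.5, 0.5, 128)
    grid = np.stack(np.meshgrid(axis, axis, axis, indexing="ij"), -1).reshape(-1, 3)
    keep = np.ones(len(grid), dtype=bool)

    for frame in meta["frames"]:
        name = frame["file_path"].split("/")[-1]
        mask = np.asarray(Image.open(f"{root}/masks/{name}").convert("L")) > 0  # assumed naming
        w2c = np.linalg.inv(np.array(frame["transform_matrix"]))  # camera-to-world -> world-to-camera
        cam = grid @ w2c[:3, :3].T + w2c[:3, 3]                   # OpenCV convention assumed
        uv = cam @ K.T
        z = uv[:, 2]
        u = np.round(uv[:, 0] / np.clip(z, 1e-6, None)).astype(int)
        v = np.round(uv[:, 1] / np.clip(z, 1e-6, None)).astype(int)
        h, w = mask.shape
        inside = (z > 0) & (u >= 0) & (u < w) & (v >= 0) & (v < h)
        hit = np.zeros(len(grid), dtype=bool)
        hit[inside] = mask[v[inside], u[inside]]
        keep &= hit  # carve away voxels that miss this view's mask

    pcd = o3d.geometry.PointCloud(o3d.utility.Vector3dVector(grid[keep]))
    o3d.io.write_point_cloud(f"{root}/foreground_pcd.ply", pcd)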

  4. RealSense depth & Metric3Dv2 depth:

    Export the RealSense depth maps from your camera and place them in the realsense_depth folder.

    Use your RGB images to generate predicted depth with Metric3Dv2.

    python run_metric3d_depth.py --root_dir your-path

    Remember to set your camera intrinsics and image size in that script.
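
    In practice this means editing a few constants for your RealSense camera; the names and values below are purely illustrative assumptions (not the script's actual variables), so copy in your own calibration.

    # Hypothetical example of the kind of values run_metric3d_depth.py needs.
    fx, fy = 600.0, 600.0       # focal lengths in pixels
    cx, cy = 320.0, 240.0       # principal point in pixels
    width, height = 640, 480    # RGB image resolution
    intrinsic = [fx, fy, cx, cy]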

  5. Generate the initial sparse points for the GS model:

    Run the script to generate the initial sparse points, using the visual hull point cloud as the foreground and the Metric3Dv2 depth as the background.

    python generate_pcd.py --path your-path   

    The initial points will be saved to path/merged_pcd.ply.
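
    Conceptually, this step back-projects the predicted depth into a background point cloud and concatenates it with the visual-hull foreground. The sketch below illustrates that merge under the same assumptions as before (it is not the repository's generate_pcd.py); the depth folder name and millimetre scaling are guesses.

    import json

    import numpy as np
    import open3d as o3d
    from PIL import Image

    root = "your-path"
    meta = json.load(open(f"{root}/transforms.json"))
    fx, fy, cx, cy = meta["fl_x"], meta["fl_y"], meta["cx"], meta["cy"]

    foreground = o3d.io.read_point_cloud(f"{root}/foreground_pcd.ply")

    background_pts = []
    for frame in meta["frames"]:
        name = frame["file_path"].split("/")[-1]
        depth = np.asarray(Image.open(f"{root}/metric3d_depth/{name}"), np.float32) / 1000.0  # assumed folder and mm scale
        v, u = np.nonzero(depth > 0)
        z = depth[v, u]
        pts_cam = np.stack([(u - cx) * z / fx, (v - cy) * z / fy, z], axis=-1)
        c2w = np.array(frame["transform_matrix"])
        background_pts.append(pts_cam @ c2w[:3, :3].T + c2w[:3, 3])  # camera -> world

    background = o3d.geometry.PointCloud(
        o3d.utility.Vector3dVector(np.concatenate(background_pts)))
    merged = foreground + background.voxel_down_sample(voxel_size=0.01)
    o3d.io.write_point_cloud(f"{root}/merged_pcd.ply", merged)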

  6. Generate normals with DSINE:

    Set the path to your RGB images to generate the normals.

    python dn_splatter/scripts/normals_from_pretrain.py --data-dir [PATH_TO_DATA] --model-type dsine  

  7. Set transforms and configs:

    To use RealSense depth, set "depth_file_path" (e.g. "realsense_depth/depth_0.png") in each frame.

    To use the initial points, set "ply_file_path": "merged_pcd.ply".

    To use the Visual Hull prune supervision, set "object_pc_path": "object.ply".
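
    One way to add those keys to an existing transforms.json programmatically is sketched below; the per-frame depth file naming is an assumption, so match it to your realsense_depth files.

    import json

    root = "your-path"
    with open(f"{root}/transforms.json") as f:
        meta = json.load(f)

    for i, frame in enumerate(meta["frames"]):
        frame["depth_file_path"] = f"realsense_depth/depth_{i}.png"  # assumed naming

    meta["ply_file_path"] = "merged_pcd.ply"   # initial points
    meta["object_pc_path"] = "object.ply"      # visual-hull prune supervision

    with open(f"{root}/transforms.json", "w") as f:
        json.dump(meta, f, indent=4)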

  8. Train:

    Select your method and configs.

    ns-train dn-splatter --pipeline.model.use-depth-loss True \
                        --pipeline.model.normal-lambda 0.4 \
                        --pipeline.model.sensor-depth-lambda 0.2 \
                        --pipeline.model.use-depth-smooth-loss True \
                        --pipeline.model.use-normal-loss True \
                        --pipeline.model.normal-supervision mono \
                        --pipeline.model.random_init False normal-nerfstudio \
                        --data your-path \
                        --load-pcd-normals True --load-3D-points True --normal-format opencv

  9. Mesh Extraction:

    gs-mesh {dn, tsdf, sugar-coarse, gaussians, marching} --load-config [PATH] --output-dir [PATH]
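
    For example, extracting a mesh from a finished dn-splatter run could look like the following; the config and output paths are placeholders for your own training run.

    gs-mesh dn --load-config outputs/your-run/config.yml --output-dir your-path/mesh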

Dataset Format

tr-rabbit/
│
├── transforms.json
│
├── images/
│   ├── rgb_1.png
│   └── rgb_2.png
│
├── normals_from_pretrain/
│   ├── rgb_1.png
│   └── rgb_2.png
│
├── realsense_depth/
│   ├── depth_1.png
│   └── depth_2.png
│
├── object.ply
└── merged_pcd.ply