Depth Estimation From Stereo Images

Introduction:

bgnet.mp4

(Note: The upper part shows the disparity map; the lower part shows object detection with the estimated depth (z) of each object.)

Please check my Medium Blog for more information.

The full video output is shared at Link.

In a stereo setup, depth estimation depends on the disparity map: for a rectified image pair, depth is inversely proportional to disparity.
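For a rectified stereo pair, the standard relation is Z = f * B / d, where f is the focal length in pixels, B is the baseline in metres, and d is the disparity in pixels. The sketch below illustrates that relation only; the function name and the numeric values are assumptions chosen for illustration and are not taken from this repository.

```python
def disparity_to_depth(disparity_px, focal_px, baseline_m):
    """Return depth in metres for one disparity value (rectified stereo)."""
    if disparity_px <= 0:
        # Zero or negative disparity means no valid match (point at infinity).
        return float("inf")
    return focal_px * baseline_m / disparity_px


# Illustrative values only: f = 720 px, B = 0.54 m, d = 36 px.
depth_m = disparity_to_depth(36.0, 720.0, 0.54)
print(f"depth: {depth_m:.2f} m")
```

Because depth scales with 1/d, a one-pixel disparity error matters far more for distant objects (small d) than for nearby ones.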

[PointCloud Output]

point_cloud_output.mp4
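A point cloud can be obtained from a depth map by back-projecting each pixel through the pinhole camera model: X = (u - cx) * Z / f and Y = (v - cy) * Z / f. The sketch below is a plain-Python illustration under the assumption of equal focal lengths in x and y; `depth_to_points` and its parameters are hypothetical names, and this is not necessarily how this repository builds its point clouds.

```python
import math


def depth_to_points(depth, focal_px, cx, cy):
    """Back-project a depth map (list of rows, metres) to a list of (X, Y, Z)."""
    points = []
    for v, row in enumerate(depth):
        for u, z in enumerate(row):
            if not math.isfinite(z) or z <= 0:
                continue  # skip pixels without a valid depth
            x = (u - cx) * z / focal_px
            y = (v - cy) * z / focal_px
            points.append((x, y, z))
    return points


# Tiny 2x2 depth map; two of the four pixels are invalid and get skipped.
cloud = depth_to_points([[2.0, 0.0], [4.0, float("inf")]], 2.0, 0.0, 0.0)
print(cloud)
```

A real pipeline would vectorize this with an array library; the loop form is kept here only to make the geometry explicit.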

Dependency

Download the pre-trained models, which I shared at Download Link.

Place them inside the root folder and update the paths in config.py.

RAFT_STEREO_MODEL_PATH = "pretrained_models/raft_stereo/raft-stereo_20000.pth"
FASTACV_MODEL_PATH = "pretrained_models/fast_acvnet/kitti_2015.ckpt"
...

Download YOLO for object detection. I shared it at Download Link.
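The demo labels each detected object with a depth value (z). How this repository combines the detector output with the depth map is not described here; one common approach, sketched below as an assumption, is to take the median of the valid depth values inside each bounding box. `box_depth` and the box layout (x1, y1, x2, y2) are hypothetical.

```python
from statistics import median


def box_depth(depth, box):
    """Median depth (metres) inside box = (x1, y1, x2, y2); x2 and y2 are exclusive."""
    x1, y1, x2, y2 = box
    values = [
        z
        for row in depth[y1:y2]
        for z in row[x1:x2]
        if 0 < z < float("inf")  # drops zeros, infinities, and NaNs
    ]
    return median(values) if values else None


depth_map = [
    [5.0, 6.0, 0.0],
    [7.0, float("inf"), 1.0],
    [1.0, 1.0, 1.0],
]
print(box_depth(depth_map, (0, 0, 2, 2)))
```

The median is less sensitive than the mean to background pixels that fall inside the box.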

Setting up DataSet

Download the KITTI dataset from Download Link:

Download Left/Right Images: Download stereo 2015/flow 2015/scene flow 2015 data set (2 GB)
Download Calibration files: Download calibration files (1 MB)

Store these files in a directory of your choice and update the paths in config.py:

[config.py]
KITTI_CALIB_FILES_PATH=".../kitti_stereo_2015/data_scene_flow_calib/testing/calib_cam_to_cam/*.txt"
KITTI_LEFT_IMAGES_PATH=".../kitti_stereo_2015/testing/image_2/*.png"
KITTI_RIGHT_IMAGES_PATH=".../kitti_stereo_2015/testing/image_3/*.png"
...
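The calibration files supply the focal length and baseline needed to turn disparity into metric depth. The sketch below assumes the usual KITTI `calib_cam_to_cam` layout of `key: value value ...` lines, and that `P_rect_02` and `P_rect_03` are the rectified 3x4 projection matrices of the left and right colour cameras (image_2 and image_3). The function names are hypothetical, and the numbers in the usage example are made up for illustration.

```python
def parse_kitti_calib(text):
    """Parse 'key: v1 v2 ...' lines into {key: [floats]}; skip non-numeric lines."""
    calib = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if not sep:
            continue
        try:
            calib[key.strip()] = [float(x) for x in value.split()]
        except ValueError:
            continue  # e.g. a calib_time line holding a date string
    return calib


def focal_and_baseline(calib, left="P_rect_02", right="P_rect_03"):
    """Focal length (px) and baseline (m) from two rectified 3x4 projection matrices."""
    p_left, p_right = calib[left], calib[right]
    focal = p_left[0]  # fx is element (0, 0) in row-major order
    # Element (0, 3) is -fx * b_x, with b_x the offset from the reference camera.
    bx_left = -p_left[3] / p_left[0]
    bx_right = -p_right[3] / p_right[0]
    return focal, abs(bx_right - bx_left)


# Made-up numbers in the assumed file layout, for illustration only.
sample = """calib_time: 09-Jan-2012 14:00:15
P_rect_02: 700 0 600 35 0 700 180 0 0 0 1 0
P_rect_03: 700 0 600 -343 0 700 180 0 0 0 1 0"""
focal_px, baseline_m = focal_and_baseline(parse_kitti_calib(sample))
print(f"f = {focal_px} px, B = {baseline_m:.2f} m")
```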

How to use

Run "python3 demo.py". To run a different architecture, such as BGNet, CreStereo, or RAFT-Stereo, change the configuration in config.py:

KITTI_CALIB_FILES_PATH=".../kitti_stereo_2015/data_scene_flow_calib/testing/calib_cam_to_cam/*.txt"
KITTI_LEFT_IMAGES_PATH=".../kitti_stereo_2015/testing/image_2/*.png"
KITTI_RIGHT_IMAGES_PATH=".../kitti_stereo_2015/testing/image_3/*.png"

RAFT_STEREO_MODEL_PATH = "pretrained_models/raft_stereo/raft-stereo_20000.pth"
FASTACV_MODEL_PATH = "pretrained_models/fast_acvnet/kitti_2015.ckpt"
DEVICE = "cuda"

# raft-stereo=0, fastacv-plus=1, bgnet=2, gwcnet=3, pasmnet=4, crestereo=5, hitnet=6, psmnet=7
ARCHITECTURE_LIST = ["raft-stereo", "fastacv-plus", "bgnet", "gwcnet", "pasmnet", "crestereo", "hitnet", "psmnet"]
ARCHITECTURE = ARCHITECTURE_LIST[1]
SAVE_POINT_CLOUD = 0
SHOW_DISPARITY_OUTPUT = 1
SHOW_3D_PROJECTION = 0
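Selecting an architecture by list index is easy to get wrong when the list changes. As an optional alternative, the sketch below selects by name and fails early on a typo; `select_architecture` is a hypothetical helper, not part of this repository's config.py.

```python
ARCHITECTURE_LIST = ["raft-stereo", "fastacv-plus", "bgnet", "gwcnet",
                     "pasmnet", "crestereo", "hitnet", "psmnet"]


def select_architecture(name):
    """Return name if it is a known architecture, otherwise raise ValueError."""
    if name not in ARCHITECTURE_LIST:
        raise ValueError(
            f"Unknown architecture {name!r}; choose one of {ARCHITECTURE_LIST}"
        )
    return name


# Equivalent to ARCHITECTURE = ARCHITECTURE_LIST[2]
ARCHITECTURE = select_architecture("bgnet")
print(ARCHITECTURE)
```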

Evaluation

Several state-of-the-art (SOTA) deep-learning-based architectures have been proposed for disparity estimation; they are given below:

Here is the profiling data:

Here is the inference time on an NVIDIA RTX 2080 Ti:
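How the timings were measured is not stated here. A typical approach, sketched below as an assumption, is to run a few warm-up iterations and then average over several timed runs. `mean_inference_time` is a hypothetical helper; on a GPU, an explicit synchronization call (for example `torch.cuda.synchronize` in PyTorch) can be passed as `sync` so that asynchronous kernels are included in the measurement.

```python
import time


def mean_inference_time(run_once, warmup=3, repeats=10, sync=None):
    """Average wall-clock seconds per call of run_once, after warm-up runs."""
    for _ in range(warmup):
        run_once()  # warm-up: caches, lazy initialization, etc.
    if sync is not None:
        sync()  # wait for pending asynchronous work before starting the clock
    start = time.perf_counter()
    for _ in range(repeats):
        run_once()
    if sync is not None:
        sync()  # make sure all timed work has actually finished
    return (time.perf_counter() - start) / repeats


# Stand-in workload; replace the lambda with a real model forward pass.
seconds = mean_inference_time(lambda: sum(range(10_000)))
print(f"{seconds * 1000:.3f} ms per run")
```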

Issue with HitNet Implementation.

Acknowledgements

Thanks to the authors of fastacv-plus, bgnet, gwcnet, pasmnet, crestereo, hitnet, psmnet, and raft-stereo for their open-source code.

Reach me @

LinkedIn GitHub Medium
