This is my copy of::
Python3 / PyTorch implementation of FSRE-Depth, as described in the following paper:
Fine-grained Semantics-aware Representation Enhancement for Self-supervisedMonocular Depth Estimation Hyunyoung Jung, Eunhyeok Park and Sungjoo Yoo
ICCV 2021 (oral)
The code was implemented based on Monodepth2.
This code was implemented under torch==1.3.0 and torchvision==0.4.1, using two NVIDIA TITAN Xp gpus with distrutibted training. Different version may produce different results.
pip install -r requirements.txt
KITTI Raw Data and pre-computed segmentation images are required for training.
KITTI/
├── 2011_09_26/
├── 2011_09_28/
├── 2011_09_29/
├── 2011_09_30/
├── 2011_10_03/
└── segmentation/ # download and unzip "segmentation.zip"
For training the full model, run the command as below:
CUDA_VISIBLE_DEVICES=0,1 python -m torch.distributed.launch --nproc_per_node 2 --master_port YOUR_PORT_NUMBER train_ddp.py --dapa_path YOUR_KITTI_DATA_PATH
will be uploaded soon
Please use the following citation when referencing our work:
@InProceedings{Jung_2021_ICCV,
author = {Jung, Hyunyoung and Park, Eunhyeok and Yoo, Sungjoo},
title = {Fine-Grained Semantics-Aware Representation Enhancement for Self-Supervised Monocular Depth Estimation},
booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
month = {October},
year = {2021},
pages = {12642-12652}
}