10000 GitHub - PardisTaghavi/SwinMTL: [IROS24]A Shared Architecture for Simultaneous Depth Estimation and Semantic Segmentation from Monocular Camera Images
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

PardisTaghavi/SwinMTL

Repository files navigation

[IROS24] SwinMTL: Multi-Task Learning with Swin Transformer

A multi-task learning framework designed for simultaneous depth estimation and semantic segmentation using the Swin Transformer architecture.

Project Page Paper Papers with Code

News

  • [30th June] Paper Accepted at the IROS 2024 Conference 🔥🔥🔥
Qualitative Results

Installation

To get started, follow these steps:

  1. Only for ROS installation (otherwise skip this part)

    cd catkin_ws/src
    catkin_create_pkg SwinMTL_ROS std_msgs rospy
    cd ..
    catkin_make
    source devel/setup.bash
    cd src/SwinMTL_ROS/src
    git clone https://github.com/PardisTaghavi/SwinMTL.git
    chmod +x inference_ros.py
    mv ./launch/ ./..  
  2. Clone the repository:

    git clone https://github.com/PardisTaghavi/SwinMTL.git
    cd SwinMTL
  3. Create a conda environment and activate it:

    conda env create --file environment.yml
    conda activate prc

Testing

To run the testing for the project, follow the below steps:

  1. Download Pretrained Models:

    • here access the pretrained models.
    • Download the pretrained models you need.
  2. Move Pretrained Models:

    • Create a new folder named model_zoo and move the pretrained models into the model_zoo folder you created in the project directory.
    • Refer to testLive.ipynb for testing.

ROS Launch

roslaunch SwinMTL_ROS swinmtl_launch.launch
Zero-shot Results

3D Mapping

3D Mapping Results

Zero-shot Results on the Kitti Dataset

Zero-shot Results

Citation

If you find our project useful, please consider citing:

@inproceedings{taghavi2024swinmtl,
  title={SwinMTL: A shared architecture for simultaneous depth estimation and semantic segmentation from monocular camera images},
  author={Taghavi, Pardis and Langari, Reza and Pandey, Gaurav},
  booktitle={2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)},
  pages={4957--4964},
  year={2024},
  organization={IEEE}
}

About

[IROS24]A Shared Architecture for Simultaneous Depth Estimation and Semantic Segmentation from Monocular Camera Images

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •  
0