
feat(autoware_tensorrt_rtmdet): add tensorrt rtmdet model #8165

Status: Open. Wants to merge 86 commits into base: main.

Conversation

@StepTurtle StepTurtle commented Jul 23, 2024

Description

This PR adds an instance segmentation method, RTMDet, to the perception pipeline.

Test Video: https://youtu.be/dBJdZtc4BC8

Process Times

Process        Average Time (ms)  Standard Deviation (ms)
Preprocess     1.61101            0.470295
Inference      1.71506            0.757723
Postprocess    13.4203            0.628055
Visualization  23.4338            6.98011

The following table shows the total time for the preprocess, inference, and postprocess stages. It does not include visualization.

Stage  Average Time (ms)  Standard Deviation (ms)
Total  15.791             1.58901

Process Times for 8 Cameras on a Single GPU

Node    Average Time (ms)  Standard Deviation (ms)  Min (ms)  Max (ms)
node-0  43.5821            17.0037                  16.2328   102.521
node-1  42.7286            15.8802                  22.3733   98.7796
node-2  41.4004            15.0862                  15.0169   87.0995
node-3  42.3738            15.7069                  21.1178   105.505
node-4  36.4766            13.2308                  20.3997   81.181
node-5  42.2531            16.1687                  16.265    91.1761
node-6  35.9258            13.5001                  19.7352   76.6217
node-7  36.4776            15.4815                  21.2165   80.9634

Computer Specifications

Device  Model
GPU     GeForce RTX 3090 (24 GB VRAM)
CPU     AMD Ryzen 7 2700X eight-core processor × 16
Memory  32 GB

Related links

Parent Issue:

PR for message type which is used in this PR:

Repository for Plugin:

How was this PR tested?

Testers can follow these steps to test the PR.

1) Download the pre-trained model:

gdown https://drive.google.com/drive/folders/1P1JyMrstJIN9Y4F947nUkbKxQ2JDntgF?usp=drive_link -O ~/autoware_data/ --folder

2) For autoware_internal_msgs, use the following PR:

3) For trt_batched_nms use https://github.com/autowarefoundation/trt_batched_nms:

  • Clone the repository into universe/external

4) Build the package and its dependencies

cd autoware
colcon build --symlink-install --cmake-args -DCMAKE_BUILD_TYPE=Release --packages-up-to tensorrt_rtmdet

5) Update the topic parameters in launch/rtmdet.launch.xml and run the launch file:

ros2 launch autoware_tensorrt_rtmdet rtmdet.launch.xml

Notes for reviewers

🟨 Plugin Loading

To load the TensorRT plugin, I used the same logic that already exists in Autoware:

for (const auto & plugin_path : plugin_paths) {
  const int32_t flags{RTLD_LAZY};
  // plugin_path is e.g. '/path/to/plugin.so'
  void * handle = dlopen(plugin_path.c_str(), flags);
  if (!handle) {
    logger_.log(nvinfer1::ILogger::Severity::kERROR, "Could not load plugin library");
  }
}

After compilation, a file with the '.so' extension is created. This file is stored in the build folder, and its path must be passed to the dlopen() function.

Is there any way to handle this in CMake? If not, how can I provide the path to the file located inside the 'build' folder?

I was able to load the plugin using the file paths below:

  • ./build/tensorrt_rtmdet/libtensorrt_rtmdet_plugin.so (relative path from workspace)
  • /home/user/projects/workspace/build/tensorrt_rtmdet/libtensorrt_rtmdet_plugin.so (absolute path)

🟨 TRTBatchedNMS Codebase

The RTMDet model uses the TRTBatchedNMS plugin (a modified version of the original TRTBatchedNMS from TensorRT). I put the plugin's entire code base into src/trt_batched_nms and include/trt_batched_nms, but I am not sure whether that is suitable.

🟨 int8 Precision Option

There are three precision options (fp16, fp32, and int8), and one of them (int8) did not work. In the current state it runs, but the results are not entirely correct. Watch the video to see the problem:

Video Link https://youtu.be/3YlY3a9Xnpk

🟨 Message Type

Interface changes

None.

Effects on system behavior

None.

@StepTurtle StepTurtle added the component:perception Advanced sensor data processing and environment understanding. (auto-assigned) label Jul 23, 2024
@StepTurtle StepTurtle self-assigned this Jul 23, 2024
@github-actions github-actions bot added the type:documentation Creating or refining documentation. (auto-assigned) label Jul 23, 2024

github-actions bot commented Jul 23, 2024

Thank you for contributing to the Autoware project!

🚧 If your pull request is in progress, switch it to draft mode.

Please ensure:

@xmfcx xmfcx added the tag:run-build-and-test-differential Mark to enable build-and-test-differential workflow. (used-by-ci) label Jul 23, 2024
@StepTurtle StepTurtle force-pushed the feat/add_tensorrt_rtmdet branch 2 times, most recently from 2fb9f5c to d1149a4 Compare August 21, 2024 13:07
@StepTurtle StepTurtle changed the title feat(perception): add tensorrt rtmdet model feat(autoware_tensorrt_rtmdet): add tensorrt rtmdet model Aug 21, 2024
@StepTurtle StepTurtle added the run:deploy-docs Mark for deploy-docs action generation. (used-by-ci) label Aug 22, 2024
@StepTurtle (Contributor Author):

Hi, @mitsudome-r

This PR looks ready to review.

@Owen-Liuyuxuan (Contributor):

Thank you for your PR. I am considering whether tensorrt_batched_nms should be a standalone package or be added to tensorrt_common. Any ideas?


StepTurtle commented Sep 2, 2024

Thank you for your PR. I am considering whether tensorrt_batched_nms should be a standalone package or be added to tensorrt_common. Any ideas?

Hi @Owen-Liuyuxuan, thanks for your thoughts,

I am not sure if it would be beneficial to use it as a separate package. When we organize it as a separate package, I would expect it to be used by multiple packages or models. Normally, the batchedNMSPlugin is a plugin published within the TensorRT library, and the version we are using has been modified by mmdeploy. Therefore, I don't think it will be used by another model or package in the future. If this plugin is used, it will most likely be in its original form and I guess we can directly reach the original form from TensorRT headers.

Apart from that, if we want to gather all currently used and future plugins within tensorrt_common, that seems reasonable to me, but it is equally reasonable to keep it within the package. So both options are OK for me (keep the current form or move it to tensorrt_common).

@Owen-Liuyuxuan (Contributor):

@StepTurtle
Your reasoning is solid. I agree to keep it in its current form.


@Shin-kyoto Shin-kyoto left a comment


@StepTurtle
As an honest opinion from a reviewer, reviewing a PR with more than 5000 lines is quite challenging.

May I suggest one of the following actions? 🙏

  • Add unit tests and ensure that they pass in CI.
    • The reviewer will then verify whether the tests are appropriate.
  • Split the PR into smaller parts.

cc: @mitsudome-r @kminoda @mojomex
Please give your comments.


mojomex commented Sep 7, 2024

@StepTurtle
Thank you for your PR! Thank you especially for the easy-to-follow PR description and documentation.

@Shin-kyoto

  • Add unit tests and ensure that they pass in CI.
    The reviewer will then verify whether the tests are appropriate.
  • Split the PR into smaller parts.

cc: @mitsudome-r @kminoda @mojomex
Please give your comments.

  • I agree that this PR is probably better split up into parts. I would suggest making trt_batched_nms and related code its own PR, as was also proposed by @Owen-Liuyuxuan.
  • In calibrator.hpp, remove things like Int8LegacyCalibrator if there is no strong technical reason why multiple calibrators are needed.
  • I agree that unit tests, or a module test with small sample rosbags as ground truths, should be included.

After compilation, a file with the '.so' extension is created. This file is stored in the build folder, and its path must be passed to the dlopen() function.

Is there any way to handle this in CMake? If not, how can I provide the path to the file located inside the 'build' folder?

This is best handled in the launch stage. ROS 2 provides substitutions like $(find-pkg-prefix <pkg_name>) which can be used to get the package's location. There is also $(exec-in-package <exec-name> <package-name>) which looks like it just might work for shared libraries as well (I haven't verified though).
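The suggestion above could look roughly like this in a launch file. This is a sketch only: the executable name and the plugin_paths parameter are hypothetical, not the package's actual interface, and only $(find-pkg-prefix ...) is taken from the comment above.

```xml
<!-- Sketch: resolve the installed plugin location with a launch substitution.
     exec name and plugin_paths parameter are illustrative assumptions. -->
<launch>
  <arg name="plugin_path"
       default="$(find-pkg-prefix tensorrt_rtmdet)/lib/libtensorrt_rtmdet_plugin.so"/>
  <node pkg="autoware_tensorrt_rtmdet" exec="tensorrt_rtmdet_node" output="screen">
    <param name="plugin_paths" value="$(var plugin_path)"/>
  </node>
</launch>
```

Note that $(find-pkg-prefix ...) resolves to the package's install prefix, so this only works once the plugin is installed rather than left in the build space.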

Loading plugins should also definitely be covered by unit tests to ensure they work reliably during runtime.

Thank you for your consideration.

@StepTurtle StepTurtle marked this pull request as draft September 7, 2024 11:34
@StepTurtle StepTurtle force-pushed the feat/add_tensorrt_rtmdet branch from f2fd0db to d86d2c3 Compare September 7, 2024 15:13

StepTurtle commented Sep 9, 2024

Hey, @mojomex @Shin-kyoto thanks for reviews.

At first, I thought we wouldn’t focus much on the plugin since it was developed by others, but it's nice that you reviewed the plugin code.

If we want to add the plugin as a separate package, should we put it under the external folder or somewhere in the perception folder? I can create a repository for the plugin under the leo-drive organization, and we can clone it using the autoware.repos file.

I am working on the other topics you mentioned like calibrator.hpp code, unit tests, and so on. If I encounter any issues, I will let you know.

NOTE: Since there are a lot of TODOs, I switched the PR status to draft.


amadeuszsz commented Nov 29, 2024

@StepTurtle
Regarding the word libtensorrt, please make a PR for the local dictionary here: https://github.com/autowarefoundation/autoware.universe/blob/main/.cspell.json#L7

@StepTurtle (Contributor Author):

@amadeuszsz

spell-check-differential passed somehow, but I don't know how. Do you think we should still add the word libtensorrt to the .cspell.json file?

@amadeuszsz (Contributor):

@amadeuszsz

spell-check-differential passed somehow, but I don't know how. Do you think we should still add the word libtensorrt to the .cspell.json file?

The test passed because something is wrong with this workflow at the moment. It will fail after a branch update or a merge to main. Please add libtensorrt to the local dictionary.

@StepTurtle (Contributor Author):

The test passed because something is wrong with this workflow at the moment. It will fail after a branch update or a merge to main. Please add libtensorrt to the local dictionary.

Related PR: #9550

@amadeuszsz (Contributor):

@StepTurtle
We were a bit unlucky with the repository transition; could you add rtmdet to our new repository via a casual PR? After that, you can remove all cSpell ignores for rtmdet in your code. This is needed because JSON files can't store comments on the same line as the desired word, so the spell check fails now.

@StepTurtle (Contributor Author):

@amadeuszsz

The PR that adds rtmdet has been merged, and the spell check looks okay now.

@amadeuszsz (Contributor):

@StepTurtle
I think we are close to finishing the review here. The only remaining item is to address the message change.

@StepTurtle (Contributor Author):

@StepTurtle I think we are close to finishing the review here. The only remaining item is to address the message change.

@amadeuszsz msg type is updated.

StepTurtle and others added 3 commits December 9, 2024 18:38
@StepTurtle (Contributor Author):

Fatih merged the msg type: autowarefoundation/autoware_internal_msgs#29

@amadeuszsz Code is updated with last msg type.


@amadeuszsz amadeuszsz left a comment


LGTM!

After some time I noticed there is a lot of boilerplate code that overlaps with other Autoware perception modules. This makes code maintenance more challenging. Secondly, a reviewer's comments don't make sense if, after a code owner change, there is still a piece of code with the same issues in a different package. All of this suggests we should store common features (CUDA kernels) in a common package. I will try to address this idea later.

@StepTurtle I would like to ask you to wait with the merge until we complete all of the subtasks. Also, just before merging, check any universe package.xml <version> and assign the appropriate value in your package.xml.

Great work! Thank you for your contribution!

@Shin-kyoto Shin-kyoto self-requested a review December 16, 2024 02:05
Labels
component:perception Advanced sensor data processing and environment understanding. (auto-assigned) run:deploy-docs Mark for deploy-docs action generation. (used-by-ci) tag:require-cuda-build-and-test tag:run-build-and-test-differential Mark to enable build-and-test-differential workflow. (used-by-ci) type:documentation Creating or refining documentation. (auto-assigned)
Projects
Status: In Progress
8 participants