
Optimizing Model Performance: Exploring ONNX Export and Engine Integration with TensorRT and OpenVINO #68

Open
AntonioConsiglio opened this issue Oct 10, 2023 · 4 comments

Comments

@AntonioConsiglio

Hi, have you explored evaluating the architecture by exporting it to ONNX format and running it with different inference engines like TensorRT or OpenVINO?

@z-x-yang
Collaborator

Thanks for your interest. Currently, we haven't explored exporting the models to other formats.

@AntonioConsiglio
Author

AntonioConsiglio commented Oct 22, 2023

> Thanks for your interest. Currently, we haven't explored exporting the models to other formats.

I did some tests, unifying all the attention blocks and building the model with TensorRT v8.5.
The only improvement I've noticed is in memory consumption: with a long-term memory of at most 5 frames, the reserved memory (input size 1280×720) drops from 20 GB to 10 GB.

Despite this memory improvement, on a Jetson platform the runtime of the built TRT engine is slower than pure PyTorch, while on an NVIDIA RTX card it stays the same (I'm running the engine through the Python API).
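An engine build and benchmark like the one described above is commonly done with the `trtexec` tool that ships with TensorRT; a sketch, where the file names and the `image` tensor name are placeholders (it requires an NVIDIA GPU and the TensorRT toolkit):

```shell
# Build a TensorRT engine from the exported ONNX model.
# --fp16 enables half precision, which often reduces memory use.
trtexec --onnx=model.onnx --saveEngine=model.plan --fp16 \
        --shapes=image:1x3x720x1280

# Benchmark the built engine and report latency statistics,
# which makes the Jetson-vs-RTX comparison above reproducible.
trtexec --loadEngine=model.plan --shapes=image:1x3x720x1280
```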

What do you think about this approach (https://github.com/hkchengrex/Cutie)? Is your object memory version similar?

@bhack

bhack commented Oct 22, 2023

There are some `torch.compile` issues with these models:
pytorch/pytorch#103716

@SuyueLiu

Could you please share a sample script to convert the model to ONNX?
