Pass high_precision_dtype="fp32" and op_types_to_quantize=["Conv"] to the quantize() function. Compilation with trtexec should then work.
However, trtexec --onnx=LightStereo-S-KITTI.preprocessed.onnx --best gives a best runtime of 4.1084 ms, while the ModelOpt-quantized ONNX produced with the above parameters runs in 5.04126 ms. We are working on closing this gap.
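For reference, here is a minimal sketch of what that call might look like from Python. Only high_precision_dtype and op_types_to_quantize come from this thread. The import path modelopt.onnx.quantization, the onnx_path and output_path keyword names, the helper quantize_conv_only, and the output file name are assumptions for illustration, so please check them against your installed ModelOpt version.

```python
from pathlib import Path

# The two settings suggested in this thread.
QUANTIZE_KWARGS = {
    "high_precision_dtype": "fp32",
    "op_types_to_quantize": ["Conv"],
}


def quantize_conv_only(onnx_path, output_path):
    """Quantize only Conv nodes, keeping the remaining ops in FP32.

    Hypothetical helper: the import path and the onnx_path/output_path
    keyword names are assumptions about the ModelOpt API.
    """
    # Imported lazily so this file can be loaded without ModelOpt installed.
    from modelopt.onnx.quantization import quantize

    quantize(
        onnx_path=str(onnx_path),
        output_path=str(output_path),
        **QUANTIZE_KWARGS,
    )
    return Path(output_path)


if __name__ == "__main__":
    src = Path("LightStereo-S-KITTI.preprocessed.onnx")
    # Hypothetical output name, chosen only for this sketch.
    dst = src.with_name("LightStereo-S-KITTI.quant.onnx")
    if src.exists():
        quantize_conv_only(src, dst)
```

The quantized file can then be passed to trtexec with --onnx in the same way as the preprocessed model, to compare its runtime against the --best baseline.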
error info:
model structure:
model onnx:
https://drive.google.com/file/d/1gP568tWTZXISpwbB7r76xXwudR61_z0k/view?usp=sharing
I used the recommended function with default parameters for PTQ.