How can I correctly use a CPU to perform inference with a quantized model? #1374
Unanswered
neavo asked this question in CATCH-ALL: alpha testing the `multi-backend-refactor`
Replies: 0 comments
I made some attempts, such as:
However, inference is very slow. Is there a correct code snippet I could use as an example?