Bug: [CANN] inference running result is garbled in debug running model for LM models who's type is Q4_0 class #9979

leo-pony · 2024-10-21T11:35:28Z

What happened?

For CANN backend: inference running result is garbled in debug running model for LM models who's type is Q4_0 class

Name and Version

b3948

What operating system are you seeing the problem on?

No response

Relevant log output

No response

leo-pony added bug-unconfirmed medium severity Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable) labels Oct 21, 2024

leo-pony mentioned this issue Oct 21, 2024

[CANN] Adapt to dynamically loadable backends mechanism #9970

Merged

4 tasks

hipudding assigned leo-pony Oct 21, 2024

hipudding added Ascend NPU issues specific to Ascend NPUs and removed bug-unconfirmed labels Oct 21, 2024

hipudding linked a pull request Oct 22, 2024 that will close this issue

[CANN] Adapt to dynamically loadable backends mechanism #9970

Merged

4 tasks

hipudding closed this as completed in #9970 Oct 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bug: [CANN] inference running result is garbled in debug running model for LM models who's type is Q4_0 class #9979

Bug: [CANN] inference running result is garbled in debug running model for LM models who's type is Q4_0 class #9979

leo-pony commented Oct 21, 2024

Bug: [CANN] inference running result is garbled in debug running model for LM models who's type is Q4_0 class #9979

Bug: [CANN] inference running result is garbled in debug running model for LM models who's type is Q4_0 class #9979

Comments

leo-pony commented Oct 21, 2024

What happened?

Name and Version

What operating system are you seeing the problem on?

Relevant log output