Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Requesting Advice / Guidance on installing and running on Azure Databricks #1404

Closed
bennydean opened this issue Jun 27, 2024 · 2 comments
Closed

Comments

@bennydean
Copy link

bennydean commented Jun 27, 2024

Hello,

I am attempting to import and run the presidio-analyzer and presidio-anonymizer in an Azure databricks environment. However, there seems to be dependency mismatch issues with the python packages that are part of the databricks runtime. For example, I am getting the error below which indicates a numpy mismatch. Can anyone offer some advice on getting up and running on databricks given the mismatch of python package versions between the Presidio libraries and the databricks runtime? The basic setup in the docs doesn't seem to work and I haven't been able to find any additional information on overcoming these issues.

One other thing I wanted to mention is that tooling for package conflicts in python seems limited. For example, the PyPi repo for presidio-analyzer doesn't list any dependency requirements other than the supported python versions,

Thanks!

Command:

from presidio_analyzer import AnalyzerEngine
from presidio_anonymizer import AnonymizerEngine

Error:
ValueError: numpy.ndarray size changed, may indicate binary incompatibility. Expected 96 from C header, got 88 from PyObject

numpy version in databricks runtime:
numpy==1.20.1

EDIT: I am considering upgrading the databricks runtime version. Currently using 10.4 LST which seems somewhat outdated. However, would still love to hear any feedback / suggestions from others on other options.

@bennydean
Copy link
Author

Closing - Upgraded my DB runtime and everything is working.

@omri374
Copy link
Contributor

omri374 commented Jun 28, 2024

Possible duplicate of #1400

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants