-
Notifications
You must be signed in to change notification settings - Fork 64
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feature request: configure all-purpose cluster libraries through DAB #1860
Comments
Hi @rsayn ! Thanks for reporting the issue. Just to confirm: when you run a workflow with this cluster, the library is not installed as well? |
Hey @andrewnester! If I define jobs to run on this cluster I can include libraries from the job / task definition. My request comes from the fact that you can specify cluster-scoped libraries from the Databricks UI, the SDK or via a cluster policy, but not via DABs. |
@rsayn thanks for clarifying, it makes sense. My expectation was that in the configuration like you have libraries will be installed when the cluster is started (when corresponding job is started). If that's not the case, this has to be fixed on our side and I'll look into this |
All right, thanks a lot! To further clarify: I think (please confirm) all-purpose clusters can still be used for jobs. In that case, I'd expect any library configured on the job's tasks to override the default cluster libraries (which I think is the current behaviour if you attach libraries to a cluster policy) 🤔 |
I think I might have misunderstood original issue. In any case, even if you use interactive cluster, you can use it in the job tasks. But for libraries to be installed, you need to specify them at
|
Exactly. In my case I don't have any jobs attached to the cluster, so I can't use the setup you provided |
Hello @andrewnester, any news about this? 🙏 LMK if I can help in any way! |
This issue has not received a response in a while. If you want to keep this issue open, please leave a comment below and auto-close will be canceled. |
I have a similar use case, wherein I want to boot up a cluster and run notebooks on the go for some rapid playing around. Also, I would want the libraries to be pre-installed on the cluster rather than wait for a job to be triggered to kick start the cluster and the subsequent library installation ! |
Describe the issue
Since
0.229.0
all-purpose (interactive) clusters can be created via DAB.With Job clusters, it's pretty straightforward to install a DAB wheel artifact by specifying the
libraries
for a task executed on that cluster.With All-purpose clusters this is currently not possible, and the only solution is to perform post-operations with the SDK or APIs to add a library programmatically.
Configuration
Expected Behavior
There should be a way to specify the deployed bundle wheel as a dependency.
Actual Behavior
There's currently no way to specify this behaviour.
The wheel needs to be post-attached to the cluster via the SDK by:
Note that both steps would greatly benefit from the substitution happening inside DABs - without it, the cluster name and library path have to be inferred somehow.
OS and CLI version
Is this a regression?
No, this is a new feature request
Debug Logs
N/A
The text was updated successfully, but these errors were encountered: