Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG][SPARK-49839][SQL] SPJ: Skip shuffles if possible for sorts #11935

Open
revans2 opened this issue Jan 7, 2025 · 0 comments
Open

[BUG][SPARK-49839][SQL] SPJ: Skip shuffles if possible for sorts #11935

revans2 opened this issue Jan 7, 2025 · 0 comments
Labels
audit_4.0.0 Audit related tasks for 4.0.0 bug Something isn't working performance A performance related task/issue

Comments

@revans2
Copy link
Collaborator

revans2 commented Jan 7, 2025

Describe the bug
Spark is trying to improve iceberg/delta lake query performance where the data may have already been partitioned properly that a shuffle is not needed for an order by operation. I am not sure if this is going to impact us directly or not. I don't think we support iceberg on spark 4.0.0 yet. But we do support delta lake so it might be related in some ways. Hard to tell.

apache/spark@d2965ae99aa

@revans2 revans2 added ? - Needs Triage Need team to review and classify audit_4.0.0 Audit related tasks for 4.0.0 bug Something isn't working performance A performance related task/issue labels Jan 7, 2025
@mattahrens mattahrens removed the ? - Needs Triage Need team to review and classify label Jan 9, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
audit_4.0.0 Audit related tasks for 4.0.0 bug Something isn't working performance A performance related task/issue
Projects
None yet
Development

No branches or pull requests

2 participants