test: Enable Spark 4.0 tests #537

Merged Jun 19, 2024 (83 commits)

kazuyukitanimura
Contributor

@kazuyukitanimura kazuyukitanimura commented Jun 6, 2024

Which issue does this PR close?

Part of #372

Rationale for this change

To be ready for Spark 4.0

What changes are included in this PR?

This PR enables the Spark 4.0 tests with Comet enabled, except for those listed in #551.

How are these changes tested?

ANSI is enabled for Spark-4.0
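
For readers unfamiliar with what ANSI mode changes, here is a minimal plain-Scala sketch of two well-known differences: integer overflow and invalid string-to-int casts raise errors under ANSI semantics instead of wrapping or yielding a missing value. This is only an emulation. The object `AnsiSketch` and its helper names are hypothetical and nothing here calls Spark or Comet.

```scala
import scala.util.Try

// Plain-Scala emulation only; hypothetical helper names, no Spark involved.
object AnsiSketch {
  // Non-ANSI style: Int addition silently wraps around on overflow.
  def addLegacy(a: Int, b: Int): Int = a + b

  // ANSI style: overflow raises an ArithmeticException instead of wrapping.
  def addAnsi(a: Int, b: Int): Int = Math.addExact(a, b)

  // Non-ANSI style: an unparseable string becomes a missing value.
  def castLegacy(s: String): Option[Int] = Try(s.trim.toInt).toOption

  // ANSI style: an unparseable string raises a NumberFormatException.
  def castAnsi(s: String): Int = s.trim.toInt
}
```

Tests that assert on the wrapped or missing results would be expected to behave differently once the stricter semantics are on, which is one plausible reason some suites need to stay disabled for now.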

@codecov-commenter

Codecov Report

Attention: Patch coverage is 33.33333% with 6 lines in your changes missing coverage. Please review.

Project coverage is 54.70%. Comparing base (fd596ed) to head (024a422).
Report is 11 commits behind head on main.

| Files | Patch % | Lines |
|---|---|---|
| ...c/main/java/org/apache/comet/parquet/TypeUtil.java | 0.00% | 3 Missing ⚠️ |
| .../scala/org/apache/comet/serde/QueryPlanSerde.scala | 50.00% | 2 Missing ⚠️ |
| ...ain/java/org/apache/comet/parquet/BatchReader.java | 0.00% | 1 Missing ⚠️ |
Additional details and impacted files
@@             Coverage Diff              @@
##               main     #537      +/-   ##
============================================
+ Coverage     54.68%   54.70%   +0.01%     
  Complexity      795      795              
============================================
  Files           102      103       +1     
  Lines          9688     9707      +19     
  Branches       1845     1849       +4     
============================================
+ Hits           5298     5310      +12     
- Misses         3433     3444      +11     
+ Partials        957      953       -4     


@@ -88,7 +87,7 @@ object QueryPlanSerde extends Logging with ShimQueryPlanSerde with CometExprShim
  case _: BinaryType => 8
  case _: TimestampType => 9
  case _: DecimalType => 10
- case dt if dt.typeName == "timestamp_ntz" => 11
+ case dt if isTimestampNTZType(dt) => 11
Member

Nice improvement
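
To illustrate why replacing the string comparison with the `isTimestampNTZType(dt)` guard can be seen as an improvement, here is a minimal self-contained sketch. It assumes the predicate is supplied by a version-specific shim, which the source does not confirm. The type hierarchy, `TypeShim`, `DefaultShim`, and `TypeIds` are hypothetical stand-ins, not Spark's or Comet's real classes. Only the ids 8 to 11 are taken from the hunk above.

```scala
// Stand-in type hierarchy; hypothetical, NOT Spark's org.apache.spark.sql.types classes.
sealed trait DataType { def typeName: String }
case object BinaryType extends DataType { val typeName = "binary" }
case object TimestampType extends DataType { val typeName = "timestamp" }
case object TimestampNTZType extends DataType { val typeName = "timestamp_ntz" }
final case class DecimalType(precision: Int, scale: Int) extends DataType {
  val typeName = s"decimal($precision,$scale)"
}

// Hypothetical shim: each version-specific build would supply its own implementation.
trait TypeShim {
  def isTimestampNTZType(dt: DataType): Boolean
}

object DefaultShim extends TypeShim {
  // Matches on the type itself rather than comparing a string name.
  def isTimestampNTZType(dt: DataType): Boolean = dt match {
    case TimestampNTZType => true
    case _                => false
  }
}

object TypeIds {
  // Covers only the ids visible in the diff hunk (8 to 11); anything else is None.
  def typeId(dt: DataType, shim: TypeShim = DefaultShim): Option[Int] = dt match {
    case BinaryType                              => Some(8)
    case TimestampType                           => Some(9)
    case _: DecimalType                          => Some(10)
    case other if shim.isTimestampNTZType(other) => Some(11)
    case _                                       => None
  }
}
```

Keeping the check behind a single predicate means the serializer does not depend on a string literal, and a build for a different Spark version could swap in its own `TypeShim` without touching the match.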

@andygrove
Member

I didn't review the Spark diff, but the other changes LGTM from a first pass. I will review more carefully tomorrow.

@kazuyukitanimura
Contributor Author

@andygrove just checking in to see if you have more feedback

@kazuyukitanimura
Contributor Author

Friendly ping @andygrove @comphead @huaxingao @parthchandra @viirya

Member

@andygrove andygrove left a comment

LGTM. Thanks @kazuyukitanimura

@kazuyukitanimura
Contributor Author

Waiting for #581 to be merged first.

Contributor

@comphead comphead left a comment

lgtm, thanks @kazuyukitanimura 👍

- java-version: [11]
- spark-version: [{short: '3.4', full: '3.4.2'}]
+ java-version: [17]
+ spark-version: [{short: '4.0', full: '4.0.0-preview1'}]
Member

Is this specific to the ANSI test? Hmm, is there any difference between 3.4 and 4.0?

Contributor Author

This has been disabled. The failing tests are listed in #551.

Member

Hmm, before we can enable it, can we still use 3.4 to run these tests?

Contributor Author

I have not tried running this with 3.4, but based on the comment, it likely fails.

Member

Ah, I forgot we don't run the ANSI tests in CI.

@kazuyukitanimura kazuyukitanimura merged commit a4b968e into apache:main Jun 19, 2024
48 checks passed
@kazuyukitanimura
Contributor Author

Merged, thank you @andygrove @comphead @viirya

himadripal pushed a commit to himadripal/datafusion-comet that referenced this pull request Sep 7, 2024