RangePartitioner
sortByKey
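Regarding the explanation of sortByKey in:
https://github.com/JerryLead/SparkInternals/blob/master/markdown/2-JobLogicalPlan.md
sortByKey actually uses RangePartitioner. Sorting within each partition is not enough: the partitions themselves must also be ordered relative to one another before the final sorted result can be output.
Code:
https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/rdd/OrderedRDDFunctions.scala#L62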
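To illustrate why the ordering between partitions matters, here is a minimal plain-Python sketch. It is not Spark's implementation: the helper names (`range_bounds`, `range_partition`, `hash_partition`, `sort_and_concat`) and the sampling parameters are hypothetical and chosen only for this example. It assumes split points are picked from a sample of the keys, so that every key in partition i is no greater than any key in partition i + 1.

```python
import bisect
import random


def range_bounds(keys, num_partitions, sample_size=20, seed=0):
    """Pick num_partitions - 1 ascending split points from a sample of keys.

    Hypothetical helper: sample size and seed are illustrative assumptions.
    """
    rng = random.Random(seed)
    sample = sorted(rng.sample(keys, min(sample_size, len(keys))))
    if not sample:
        return []
    # Evenly spaced positions in the sorted sample become the split points.
    return [sample[len(sample) * i // num_partitions]
            for i in range(1, num_partitions)]


def range_partition(keys, bounds):
    """Send each key to the partition whose key range contains it."""
    parts = [[] for _ in range(len(bounds) + 1)]
    for k in keys:
        # bisect_left finds the first split point that is >= k.
        parts[bisect.bisect_left(bounds, k)].append(k)
    return parts


def hash_partition(keys, num_partitions):
    """Send each key to a partition by hash; partitions carry no order."""
    parts = [[] for _ in range(num_partitions)]
    for k in keys:
        parts[hash(k) % num_partitions].append(k)
    return parts


def sort_and_concat(parts):
    """Sort every partition locally, then read partitions in index order."""
    return [k for part in parts for k in sorted(part)]


keys = list(range(100))
random.Random(42).shuffle(keys)

by_range = sort_and_concat(range_partition(keys, range_bounds(keys, 4)))
by_hash = sort_and_concat(hash_partition(keys, 4))

print(by_range == sorted(keys))
print(by_hash == sorted(keys))
```

With range partitioning, sorting each partition locally and reading the partitions in index order gives a globally sorted sequence. With hash partitioning, each partition is sorted on its own, but their concatenation is not.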
Thanks for pointing this out. I have been busy with a paper recently; I will fix it during the next review.