This extension module lets Apache Sedona work seamlessly with Apache Iceberg: the UDT and serializer for geometry values are unified, and Apache Sedona's spatial predicates are pushed down to Iceberg tables for partition pruning and data skipping.
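For example, with the extension enabled, the spatial filter in a query like the following sketch can be pushed down to Iceberg. The table name `db.city_points` and column `geom` are illustrative only, and `spark` is assumed to be a `SparkSession` configured as described below:

```scala
// Hypothetical table and column names; `spark` is a SparkSession
// configured with the extension as shown later in this document.
val result = spark.sql("""
  SELECT id, geom
  FROM db.city_points
  WHERE ST_Contains(ST_GeomFromText('POLYGON ((0 0, 10 0, 10 10, 0 10, 0 0))'), geom)
""")
result.show()
```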
Add the sedona-iceberg extension jar to the `--jars` argument of the `spark-submit` command, and append `org.apache.iceberg.spark.extensions.SedonaIcebergExtensions` to the `spark.sql.extensions` config property.
A typical Spark job submission script looks like this:
```bash
spark-submit \
  --jars /path/to/iceberg-spark-runtime-jar,/path/to/sedona-iceberg-extension-jar,/path/to/geotools-wrapper-geotools-jar \
  --conf spark.serializer=org.apache.spark.serializer.KryoSerializer \
  --conf spark.kryo.registrator=org.apache.sedona.core.serde.SedonaKryoRegistrator \
  --conf spark.sql.extensions=org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions,org.apache.iceberg.spark.extensions.SedonaIcebergExtensions \
  --conf spark.sql.catalog.spark_catalog=org.apache.iceberg.spark.SparkSessionCatalog \
  --conf spark.sql.catalog.spark_catalog.type=hive \
  ...
```
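If you construct the `SparkSession` yourself rather than passing `--conf` flags to `spark-submit`, the same settings can be applied programmatically. The following is a minimal sketch using the config keys from the script above; the app name is arbitrary:

```scala
import org.apache.spark.sql.SparkSession

// Apply the same configuration programmatically. Note that
// spark.sql.extensions only takes effect when set before the
// session is created, as done here via the builder.
val spark = SparkSession.builder()
  .appName("sedona-iceberg-example") // arbitrary name
  .config("spark.serializer", "org.apache.spark.serializer.KryoSerializer")
  .config("spark.kryo.registrator", "org.apache.sedona.core.serde.SedonaKryoRegistrator")
  .config("spark.sql.extensions",
    "org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions," +
      "org.apache.iceberg.spark.extensions.SedonaIcebergExtensions")
  .config("spark.sql.catalog.spark_catalog", "org.apache.iceberg.spark.SparkSessionCatalog")
  .config("spark.sql.catalog.spark_catalog.type", "hive")
  .getOrCreate()
```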
Notes:
- Don't forget to register the Kryo serializers provided by Apache Sedona; otherwise you'll suffer from poor performance and high memory usage.
- Since GeoTools is published under the LGPL license, we cannot bundle it into our extension jar. You need to obtain and add the GeoTools jar yourself. Please refer to the Sedona documentation on GeoTools for details.
The `example` directory contains an example Spark job that processes geometries stored in Iceberg tables using Apache Sedona. Please refer to `example/launch.sh` for the `spark-submit` command used to launch the job.
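For orientation, here is a minimal sketch of what such a job can look like. This is not the code from the `example` directory: the `SedonaIcebergDemo` object, the table name `demo_points`, and the column names are all illustrative, and it assumes the session was launched with the configuration shown earlier.

```scala
import org.apache.spark.sql.SparkSession
import org.apache.sedona.sql.utils.SedonaSQLRegistrator

object SedonaIcebergDemo {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder().appName("sedona-iceberg-demo").getOrCreate()

    // Register Sedona's SQL functions (ST_Point, ST_Within, ...) in case
    // the extension does not already register them.
    SedonaSQLRegistrator.registerAll(spark)

    // Write a few points into an Iceberg table. The unified geometry UDT
    // and serializer are what allow the geometry column to be stored.
    spark.sql("SELECT id, ST_Point(CAST(id AS DOUBLE), CAST(id AS DOUBLE)) AS geom FROM range(10)")
      .writeTo("demo_points").using("iceberg").createOrReplace()

    // Query with a spatial predicate; with the extension enabled, the
    // filter can be pushed down to Iceberg for data skipping.
    spark.sql(
      """SELECT id, geom FROM demo_points
        |WHERE ST_Within(geom, ST_GeomFromText('POLYGON ((0 0, 4 0, 4 4, 0 4, 0 0))'))""".stripMargin
    ).show()

    spark.stop()
  }
}
```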
You can build the extension jar for a specific Spark version yourself with the following command:

```bash
./gradlew -DsparkVersion=3.1 build
```
Currently, only Spark 3.1, 3.2, and 3.3 are supported.