Skip to content

v0.0.1

Compare
Choose a tag to compare
@Nikoletos-K Nikoletos-K released this 22 Jul 16:57
· 259 commits to main since this release
d1da0d2

First pyJedAI release: This release presents the basic structure of the well-known JedAI toolkit into the python environment. Contains:

  • Data reading techniques: RDF/OWL, SPARKQL, CSV, JSON, DB
  • Block building: Standard Blocking, QGrams & Extended, SuffixArray & Extended
  • Block cleaning: Block purging, Block filtering
  • Comparison cleaning: Weighted edge/node pruning, Cardinality edge/node pruning, BLAST, etc
  • Entity matching: strsimpy
  • Entity clustering: Connected component clustering
  • Similarity Joins: SchemaAgnosticΕJoin, TopKSchemaAgnosticJoin
  • Evaluation through Jupyter notebook