forked from thammegowda/autoextractor
-
Notifications
You must be signed in to change notification settings - Fork 11
Home
Thamme Gowda edited this page May 7, 2016
·
7 revisions
Welcome to the Auto-Extractor wiki!
Here you will find information related to Auto Extractor.
- Clustering the web pages based on style and structure
- Scalable on Apache Spark
- Work in progress - Visualization of clusters
- Auto extraction of content
- Integrate to Apache Tika and Apache Nutch
- Thamme Gowda N.
- Chirs Mattmann