Clojure library integrating and extending language processing tools available in Java.
It includes:
- parsing tools,
- stopwords/articles filtering,
- stemming tools,
- language detection,
- encoding detection,
- n-grams generation,
- …
To use langlab, add the the following dependency to your project.clj
You need Java 7 and lein 2 to compile and run this project.
Currently, this is work in progress. API is subject to change in subsequent releases.
API docs are available here.
Great thanks for Damian Doyle and ranks.nl for making their stopwords lists available for the project.
Distributed under the Eclipse Public License, the same as Clojure.