Skip to content

Latest commit

 

History

History
19 lines (13 loc) · 655 Bytes

README.md

File metadata and controls

19 lines (13 loc) · 655 Bytes

This code implements the summarization of text documents using Latent Semantic Analysis. For a good starting point to the LSA models in summarization, check this paper and this one.

Running this code

Firstly, It is necessary to download 'punkts' and 'stopwords' from nltk data. For that, run the code:

import nltk
nltk.download("punkt", quiet=True)
nltk.download("stopwords", quiet=True)

Further, run python summarization.py

Requirements

  • Python 3.x
  • numpy
  • NLTK