justinlittman/article-text

article-text

Enhances a CSV with article text retrieved using Newspaper3k

Newspaper3k does all of the heavy lifting.

Setup

pip install newspaper3k

Usage

python enhance_text.py <source file> <destination file>

The source file must be a CSV with a url column that holds the URL of each article.

enhance_text.py retrieves each article, extracts its text, and writes it to a new column called text. All other columns are preserved.
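
The flow described above can be sketched roughly as follows. This is not the actual contents of enhance_text.py, only a minimal illustration under stated assumptions: the function names fetch_text, enhance, and main are hypothetical, and leaving the text column empty when a download fails is an assumption rather than documented behavior. The Newspaper3k calls used are Article(url), download(), parse(), and the text attribute.

```python
import csv
import sys


def fetch_text(url):
    """Download one article and return its extracted text (requires newspaper3k)."""
    # Imported here so the CSV handling below can be exercised without the library.
    from newspaper import Article

    article = Article(url)
    article.download()
    article.parse()
    return article.text


def enhance(src, dst, fetch=fetch_text):
    """Copy rows from src to dst, adding a text column filled by fetch(url)."""
    reader = csv.DictReader(src)
    fieldnames = list(reader.fieldnames or [])
    if "url" not in fieldnames:
        raise ValueError("source CSV must contain a url column")
    if "text" not in fieldnames:
        fieldnames.append("text")  # new column goes last; existing columns keep their order

    writer = csv.DictWriter(dst, fieldnames=fieldnames)
    writer.writeheader()
    for row in reader:
        try:
            row["text"] = fetch(row["url"])
        except Exception as exc:
            # Assumption: report the failure and leave the cell empty instead of aborting.
            print(f"could not fetch {row['url']}: {exc}", file=sys.stderr)
            row["text"] = ""
        writer.writerow(row)


def main(argv):
    """Hypothetical entry point: argv[1] is the source file, argv[2] the destination file."""
    with open(argv[1], newline="", encoding="utf-8") as src, \
            open(argv[2], "w", newline="", encoding="utf-8") as dst:
        enhance(src, dst)
```

Calling main(sys.argv) from a script would mirror the command line shown under Usage. Passing the fetch function as a parameter keeps the CSV handling separate from the network access, so the column logic can be checked with a stand-in fetcher.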
