nltk-sentiment-analysis

A probabilistic natural language processing model for sentiment analysis of restaurant reviews. We train a Naive Bayes classifier to automatically label unseen reviews as positive or negative.

feature_vector.py

feature_vector.py is the main program.
It requires a filename argument naming the file whose feature vectors should be built. If no file is given, the program prompts the user for correct input and then exits.

USAGE: $ feature_vector.py [filename_in_directory_to_be_processed]
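
The argument check described above might look roughly like the sketch below. This is not the actual code in feature_vector.py: the helper name `get_input_path`, the `main` wrapper, and the exact checks (exactly one argument, and the file must exist on disk) are assumptions made for illustration.

```python
import os

USAGE = "USAGE: $ feature_vector.py [filename_in_directory_to_be_processed]"


def get_input_path(argv):
    """Return the filename argument, or None if it is missing or not a file.

    Hypothetical helper: the real script may validate its input differently.
    """
    if len(argv) != 2:
        return None
    path = argv[1]
    return path if os.path.isfile(path) else None


def main(argv):
    path = get_input_path(argv)
    if path is None:
        # Tell the user how to call the program, then report failure.
        print(USAGE)
        return 1
    # Progress message so a long run does not look frozen.
    print("processing", path)
    return 0
```

A script entry point would pass `sys.argv` to `main` and hand the return value to `sys.exit`.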

feature_vector.py calls data_helper.py to parse the data and extract each review's overall score and text.

Lastly, the print statements are there to let you know the program hasn't frozen. On the server I have been running it on, it was stalling for about 20 minutes. I doubt you'll have these problems; nonetheless, the statements are nice to have.

note on usage

The call that writes the output files uses slicing. If you give feature_vector.py a filename that is not one of the three we were given, make sure it has at least 10 characters before the dataset name and 5 characters after it. This is because it slices: restaurant- (10) dataset .data (5).

If you are getting an error with a new file, this may be the cause.
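
To make the slicing constraint concrete, here is a minimal sketch. It assumes the slice is `filename[10:-5]`, which is a guess based on the counts above and not taken from the source code. The helper names `dataset_tag` and `output_name` are hypothetical, and the naming scheme is modeled on output files in this repo such as word_features-development-features.txt.

```python
def dataset_tag(filename):
    """Drop the first 10 and the last 5 characters of a filename.

    Assumption: this mirrors a slice like filename[10:-5]; the real code may differ.
    """
    return filename[10:-5]


def output_name(feature_set, filename):
    # Hypothetical naming scheme: feature set + sliced dataset name + fixed suffix.
    return feature_set + dataset_tag(filename) + "-features.txt"


print(output_name("word_features", "restaurant-development.data"))
# word_features-development-features.txt

print(repr(dataset_tag("short.data")))
# '' (nothing is left when the name is too short)
```

Under this assumption the hyphen after `restaurant` stays in the sliced name, and a filename with fewer than 15 characters leaves no dataset name at all.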

Naive_Bayes.py

Pretty broken. I couldn't figure out how to format the file so that I could call the classifier. I believe I had a list of tuples, but I'm not sure. It really shouldn't work, but you can still check the output or the code if you like.
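
For reference, NLTK's `NaiveBayesClassifier.train` expects a list of `(featureset, label)` tuples, where each featureset is a dict mapping feature names to values. The sketch below shows one way to build that structure. The helper names, the word-presence features, and the sample reviews are made up for illustration and are not this repo's code.

```python
def word_features(text):
    """Turn review text into a featureset dict of word-presence features."""
    return {word: True for word in text.lower().split()}


def to_labeled_featuresets(reviews):
    """Convert (text, label) pairs into the list of (dict, label) tuples NLTK expects."""
    return [(word_features(text), label) for text, label in reviews]


def train_classifier(labeled_featuresets):
    # Imported here so the formatting helpers above work without NLTK installed.
    from nltk import NaiveBayesClassifier
    return NaiveBayesClassifier.train(labeled_featuresets)


# Made-up sample data, not taken from the restaurant datasets.
reviews = [
    ("great food and friendly service", "positive"),
    ("cold food and rude service", "negative"),
]
train_set = to_labeled_featuresets(reviews)
```

With NLTK installed, `train_classifier(train_set)` returns a classifier whose `classify` method takes a featureset dict, for example `word_features("friendly service")`, and returns one of the labels.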
