
-
Language: Python​
-
Libraries: NumPy, Pandas, Matplotlib, Sklearn, Re, NLTK, Graphviz
-
Platform: Jupyter Notebook​
Movie Review Sentiment Analysis
Purpose
Automatically determine the sentiment or emotional tone expressed (positive or negative) in user-generated movie reviews on the Internet Movie Database (IMDB) based on the text content of the reviews.
Tasks Done
Data Preprocessing
Tokenization
One hot encoding
Stemming
Feature Vector Creation
Train Test Split
Model Training
Model Testing & Evaluation
Dataset
IMDB movie review dataset with 50000 movie reviews collected.
Models
Logistic Regression Classifier
Naive Bayes Classifier
Applications
Determining the overall review polarity of specific movies. Filtering out particular user's favorite genre of the movies and give personalized recommendations