Sentiment-Analysis

Data Mining, Project 1:

Preprocessing(clean tweets), create model to predict sentiment-label(positive, negative or neutral) for tweets and model accuracy checking.

Contributors

System requirements

Python version 3.6
NLTK
Numpy

Run commands

jupyter notebook
pip install --user -U nltk
pip install --user -U numpy
pip install vaderSentiment

Implementation

Cleaning the data(process tweets from train.tsv and test.tsv using: Tokenization, StopWord filtering, Stemming)
Make workclouds and matplots for the data
Vectorization(using: BAG-OF-WORDS & TF-IDF)
TSNE model(Word2vec)
Classification: KNN , SVM
Check the accuracy from label predictions

Use f1_score to calculate the success rate (classification labels with the official labels of test tweets SemEval2017_task4_subtaskA_test_english_gold.txt)

Model Accuracy: 0.59 success

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.ipynb_checkpoints		.ipynb_checkpoints
__pycache__		__pycache__
lexica		lexica
twitter_data		twitter_data
Analyze_tweets.ipynb		Analyze_tweets.ipynb
README.md		README.md
preprocess.py		preprocess.py
twitter_logo.png		twitter_logo.png
word2vec.model		word2vec.model

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sentiment-Analysis

Data Mining, Project 1:

Contributors

System requirements

Run commands

Implementation

About

Releases

Packages

Contributors 2

Languages

VasiaKoum/Twitter-Sentiment-Analysis

Folders and files

Latest commit

History

Repository files navigation

Sentiment-Analysis

Data Mining, Project 1:

Contributors

System requirements

Run commands

Implementation

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages