Deep Twitter - Tweet Extraction Application

Tweet extraction application with Twitter API. Tweets will be using for Deep Learning with TensorFlow. Tweets should be clear for learning.

Designed and Implemented By:

About

This application has three components, below these three components will be clarified. Each application has own purpose, that's why there are three components.

Crawler

This application connects Twitter thanks to authorization keys provided by Twitter for developers. Needed tweets are Turkish tweets, that's why application is tracking tr language. However, one can change filter option. Collected tweets from Twitter are inserted into database.
Filtering

Unfortunately, tweets are hard to read and interpreted by computers. That's why, this application is removing unnecessary elements, words in tweets (emojis, links, short words etc.).
Transfer

This application reads tweets from text file and transfers to the database. At the end, clear database will be using with Elastic Search, but this is not scope of this application.

Important Note: Insertion of tweets to database has not been done one by one. Scheduler is created and set to 1 minute. Each 1 minute, batch tweets are inserted and committed into database. This method boosts the application speed.

Getting Started

Twitter account with phone verification is needed.
From Twitter Developer website, 4 authorization keys should be obtained and integrated to application.
- Consumer Key
- Consumer Secret
- Access Token
- Access Token Secret
MySQL database configuration has been used.
Configure persistence.xml for database connection pool / username / password etc.

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
lib		lib
src/main		src/main
.gitignore		.gitignore
README.md		README.md
pom.xml		pom.xml
tweets.txt		tweets.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Deep Twitter - Tweet Extraction Application

About

Getting Started

About

Releases

Packages

Languages

BenKosSoft/deepTwitter

Folders and files

Latest commit

History

Repository files navigation

Deep Twitter - Tweet Extraction Application

About

Getting Started

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages