GitHub - sri-1007/CSCE-678-PROJECT

CSCE-678 : TWITTER DATA ANALYSIS USING APACHE SPARK AND KAFKA

The project goal is to perform the sentiment analysis on trending hashtags' tweets of a live sream

Project flow:
The model architecture is as below,

->Get the live stream of tweets(using twitter API and tweepy module) onto Kafka Producer\
->On the producer, filter the live stream based on selective topics\
->On the spark-installed consumer,
    
    ->Filter the tweets with hashtags
    ->Based on map reduce operations, obtain the top 10 trending hashtags
    ->Perform a sentiment analysis on all the tweets pertaining to trending hashtags
    ->Aggregate the analysis of all tweets for each ofthe trending hashtags and report the overall sentiment of hashtags

Project Execution:
->Create Twitter API account and get keys for fetching live stream of tweets
->Setup a kafka cluster with 3 brokers(producer on one broker and consumer on different one) and one Zookeeepr node
->Install spark on consumer node
->Start the zookeeper node : $bin/zkServer.sh start
->Start all the kafka nodes : $kafka-server-start.sh config/server.properties
->Start the producer : $python3 producer.py
->Start the consumer by Spark submit:- spark-submit --jars spark-streaming-kafka-0-8-assembly_2.11-2.4.2.jar,spark-core_2.11-1.5.2.logging.jar consumer.py
->Trending hashtags with the overallsentiment analysis will be displayed on the consumer console

Final source files:
Please check the below source files for the final working implementation : final_producer.py, final_consumer.py

Demo :
A complete demo of the project is available in the link : https://drive.google.com/open?id=17epWAJ_lpYV8rgE6l-JEbMr3S3e0QCML

Name		Name	Last commit message	Last commit date
Latest commit History 47 Commits
logs		logs
src		src
README.md		README.md
Twitter Data Analysis using Spark and Kafka Report.pdf		Twitter Data Analysis using Spark and Kafka Report.pdf
cluster configuration.md		cluster configuration.md
model_architecture.png		model_architecture.png
twitter-app-credentials.txt		twitter-app-credentials.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages