CourseProject

Introduction

As the final project for CS 410 Text Information Systems, we participated in the Text Classification Competition to detect sarcasm on Twitter. We were given both a Training and a Test dataset.

The Training dataset content

label: SARCASM or NOT_SARCASM
response: The Tweet to be classified
context: The conversation context of the response

example:
{"label": "SARCASM", "response": "@USER @USER @USER I don't get this .. obviously you do care or you would've moved right along .. instead you decided to care and troll her ..", "context": ["A minor child deserves privacy and should be kept out of politics . Pamela Karlan , you should be ashamed of your very angry and obviously biased public pandering , and using a child to do it .", "@USER If your child isn't named Barron ... #BeBest Melania couldn't care less . Fact . 💯"]}
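As a rough illustration of the record format above, the following sketch parses one record with Python's standard `json` module. It assumes each record is stored as one JSON object per line, which this README does not state. The helper name `parse_record` and the shortened sample record are hypothetical and not taken from the project code.

```python
import json


def parse_record(line):
    """Parse one JSON line into (context, response, label).

    `label` is None for records without a "label" field,
    such as Test records, which carry an "id" instead.
    """
    record = json.loads(line)
    return record["context"], record["response"], record.get("label")


# Hypothetical, shortened record in the Training format.
sample = (
    '{"label": "SARCASM", '
    '"response": "@USER Im so proud .", '
    '"context": ["first tweet", "@USER second tweet"]}'
)

context, response, label = parse_record(sample)
print(label, len(context), response)
```

The same helper can be applied to Test records, where the returned label is `None`.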

The Test dataset content

id: String identifier for the sample. This id is required for project submission and grading.
response: The Tweet to be classified
context: The conversation context of the response

example:
{"id": "twitter_1", "response": "@USER @USER @USER My 3 year old , that just finished reading Nietzsche and then asked me : " ayo papa why these people always trying to cancel someone on Twitter , trying to pretend like that makes them better themselves ? " . To which I replied " idk " , and he just " cuz hoes mad " . Im so proud . ", "context": ["Well now that \u2019 s problematic AF ", "@USER @USER My 5 year old ... asked me why they are making fun of Native Americans ..", "@USER @USER @USER I will take shit that didn't happen for $ 100", "@USER @USER @USER No .. he actually in the gifted program and reads on second grade level . ... and he knows Kansas City is in Missouri"]}

Dataset size statistics

Train   Test
5000    1800

Project Objective

Our objective is to learn from the Training dataset and predict the label (SARCASM or NOT_SARCASM) of each response in the Test dataset.
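The objective above is a binary text classification task. This README does not say which model the project uses, so the sketch below substitutes a common baseline: TF-IDF features fed to logistic regression, built with scikit-learn (one of the listed dependencies). The toy sentences and variable names are made up for illustration and are not drawn from the competition data.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Made-up examples standing in for Training responses and labels.
train_texts = [
    "oh great , another monday , just what i needed",
    "wow , i totally love waiting in line for hours",
    "the meeting starts at noon tomorrow",
    "thanks for sharing the article , it was helpful",
]
train_labels = ["SARCASM", "SARCASM", "NOT_SARCASM", "NOT_SARCASM"]

# Assumed baseline: TF-IDF bag-of-words features + logistic regression.
model = make_pipeline(TfidfVectorizer(lowercase=True), LogisticRegression())
model.fit(train_texts, train_labels)

# Made-up examples standing in for Test responses.
test_texts = ["oh great , i love mondays", "the article was helpful"]
predictions = model.predict(test_texts)
print(list(predictions))
```

With real data, `train_texts` would hold the `response` fields (optionally joined with their `context`), and each prediction would be paired with the `id` of its Test record.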

Setup and Usage Instructions

Software Dependencies:

Python==3.8.3
nltk==3.5
pandas==1.0.5
scikit_learn==0.23.2

Setup and Usage Instructions:

conda create -n "project_demo" python=3.8.3
conda activate project_demo
git clone https://github.com/subhasishb-coder/CourseProject.git
cd CourseProject
pip install nltk==3.5
pip install pandas==1.0.5
pip install scikit_learn==0.23.2
cd code
python TestClassficationCompetion_Sarcasm_Detection.py

Software Usage Tutorial Link:

https://mediaspace.illinois.edu/media/t/1_xwb0wmzt

