CS410 Course Project Team GOAT

Background

Reproducing a Paper. For our final project, we attempted to reproduce the results of the contextual text mining research paper listed below:

ChengXiang Zhai, Atulya Velivelli, and Bei Yu. 2004. A cross-collection mixture model for comparative text mining. In Proceedings of the 10th ACM SIGKDD international conference on knowledge discovery and data mining (KDD 2004). ACM, New York, NY, USA, 743-748. DOI=10.1145/1014052.1014150

The actual paper has been included in our repository for your own reference.

Setup/Dependencies

This repo assumes that the user has Python 3 and pip installed. If not, they can be found here.

Our project has two package dependencies, newsapi and numpy. Running the command "pip install -r requirements.txt" from the repository root should install the correct versions of both packages.
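
If the install step seems to have gone wrong, a quick check like the one below can confirm that both packages are visible to your Python interpreter. This is only a minimal sketch: the import names `numpy` and `newsapi` are assumed from the dependency list above and may differ from what requirements.txt actually installs, and `missing_packages` is a hypothetical helper that is not part of this repo.

```python
import importlib.util

# Import names assumed from the README's dependency list; the module
# names installed by requirements.txt may differ.
REQUIRED = ["numpy", "newsapi"]


def missing_packages(names):
    """Return the top-level module names that cannot be found."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All dependencies found.")
```

If anything is reported missing, rerunning the pip command above with the same interpreter is usually the first thing to try.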

At this point, cd into the directory /CourseProject/, run mixture.py, and follow the prompts to run our implementation of the cross-collection mixture model for comparative text mining.
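
For readers who have not seen the model before, the sketch below shows how the cross-collection mixture model scores a single word, as we understand the formulation in the paper: each theme mixes a common word distribution shared by all collections with a collection-specific one, and the result is mixed with a background distribution. This is an illustration only and is not taken from mixture.py; the function name, argument names, and the numbers in the usage example are all hypothetical.

```python
def word_probability(word, pi_d, common, specific, background, *, lambda_b, lambda_c):
    """Probability of `word` in document d of collection i (illustrative sketch).

    pi_d       -- theme weights for document d (should sum to 1)
    common     -- common[j][w]   = p(w | theta_j), shared across collections
    specific   -- specific[j][w] = p(w | theta_{j,i}), for collection i only
    background -- background[w]  = p(w | theta_B)
    lambda_b   -- weight of the background model
    lambda_c   -- weight of the common model within each theme
    """
    themes = sum(
        pi * (lambda_c * common[j].get(word, 0.0)
              + (1 - lambda_c) * specific[j].get(word, 0.0))
        for j, pi in enumerate(pi_d)
    )
    return (1 - lambda_b) * themes + lambda_b * background.get(word, 0.0)


# Made-up numbers for a single theme, chosen only to show the call.
p = word_probability(
    "war",
    pi_d=[1.0],
    common=[{"war": 0.5}],
    specific=[{"war": 0.1}],
    background={"war": 0.2},
    lambda_b=0.5,
    lambda_c=0.5,
)
```

Fitting the model means estimating the word distributions and theme weights (the paper uses EM); the function above only evaluates the mixture once those are known.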

Data

Our data is not fixed; it depends on the terms entered at the prompts. A demonstration video of an example run is available on YouTube here. To keep the data as similar as possible to the original paper, please use the terms "Iraq" and "Afghanistan" when prompted.