README.md

CS410 Course Project Team GOAT

Background

Reproducing a Paper. For our final project, we attempted to reproduce results from the (contexual text mining) research paper listed below:

ChengXiang Zhai, Atulya Velivelli, and Bei Yu. 2004. A cross-collection mixture model for comparative text mining. In Proceedings of the 10th ACM SIGKDD international conference on knowledge discovery and data mining (KDD 2004). ACM, New York, NY, USA, 743-748. DOI=10.1145/1014052.1014150

The actual paper has been included in our repository for your own reference.

Setup/Dependencies

This repo assumes that the user has Python3 as well as Pip. If not, they can be found here.

Our project has two package dependencies, newsapi and numpy. Using the command "pip install -r requirements.txt" the user should be able to install the correct versions of both packages.

At this point, cd into directory /CourseProject/, run mixture.py, and follow the prompts in order to run our implementation of the Collective Text Mining comparison model.

Data

Our data is rather fluid in this case. We have an example of our demonstration video in a youtube link here. In an attempt to keep the data as similar as possible to the original paper, please use terms "Iraq" and "Afghanistan" when prompted.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CS410 Course Project Team GOAT

Background

Setup/Dependencies

Data

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

CS410 Course Project Team GOAT

Background

Setup/Dependencies

Data