Inspiration

Online platforms are flooded with noisy reviews: ads disguised as feedback, irrelevant stories, and rants from people who never visited. This noise undermines trust and makes it hard for genuine experiences to stand out. We wanted to build a system that could automatically separate useful reviews from noise at scale.

What it does

ReviewShield is an AI-powered moderation pipeline for location-based reviews. It classifies each review into one of four categories:

Advertisement — promo codes, self-promotion, links

Irrelevant — off-topic content

No-visit rant — complaints from people who never visited

None — valid, on-topic review

This helps platforms automatically filter or flag problematic reviews, improving overall trustworthiness.

How we built it

Data ingestion & cleaning: Parsed ~500k Google Reviews (CSV + JSON) into a consistent format, yielding ~265k usable rows. For the hackathon demo, we downsampled to ~1,000 rows for faster iteration.
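
The cleaning step can be sketched roughly as below, using only the standard library. The field names (`text`, `rating`) and the helper functions are illustrative assumptions, not the project's actual schema:

```python
import csv
import io
import json

def normalize_record(raw):
    """Map one raw review (from CSV or JSON) onto a common schema.

    Returns None for unusable rows (e.g. missing review text),
    mirroring the cleaning pass that kept ~265k of ~500k rows.
    """
    text = (raw.get("text") or "").strip()
    if not text:
        return None  # drop reviews with no text
    return {"text": text, "rating": raw.get("rating")}

def load_reviews(csv_str, json_str):
    """Parse CSV and JSON sources into one list of normalized rows."""
    rows = [normalize_record(r) for r in csv.DictReader(io.StringIO(csv_str))]
    rows += [normalize_record(r) for r in json.loads(json_str)]
    return [r for r in rows if r is not None]
```

Downsampling for the demo is then just a random sample over the normalized rows.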

Policy definition: Wrote a clear policy.md describing the violation categories (Advertisement, Irrelevant, No-visit rant) plus “None.”

Pseudo-labeling: Used GPT-4o to generate ~5k pseudo-labeled examples, which served as training data for the baseline classifier.

Classical ML baseline: Built a TF-IDF + Logistic Regression classifier trained on pseudo-labels.
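
The baseline can be sketched with scikit-learn. The tiny inline dataset here is an illustrative stand-in for the ~5k GPT-4o pseudo-labels:

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Toy pseudo-labeled examples standing in for the GPT-4o training set.
texts = [
    "Use promo code SAVE20 at checkout, link in bio!",
    "Visit my channel for daily deals and discounts",
    "My cat learned a new trick yesterday",
    "This post is about my vacation, not this place",
    "Never been here but I heard the owner is rude",
    "I refuse to go, people say it is terrible",
    "Great pasta and friendly staff, will return",
    "Cozy atmosphere and quick service",
]
labels = [
    "Advertisement", "Advertisement",
    "Irrelevant", "Irrelevant",
    "No-visit rant", "No-visit rant",
    "None", "None",
]

# TF-IDF features feeding a logistic regression classifier.
clf = make_pipeline(
    TfidfVectorizer(ngram_range=(1, 2)),
    LogisticRegression(max_iter=1000),
)
clf.fit(texts, labels)

pred = clf.predict(["Discount code inside, follow the link"])[0]
```

The same `clf.predict` call scales to batches of thousands of reviews, which is what made this baseline fast enough for the full dataset.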

Zero-shot experiments: Tested Hugging Face zero-shot models (facebook/bart-large-mnli, MoritzLaurer/deberta-v3-large-zeroshot-v2.0) for direct classification without training. Useful as a baseline but too slow and noisy for large-scale runs.

Evaluation: Used scikit-learn to compute precision, recall, F1, and plot confusion matrices for validation.
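
The evaluation step can be sketched as below, on hypothetical validation labels and predictions (the real run additionally plotted the confusion matrix):

```python
from sklearn.metrics import confusion_matrix, precision_recall_fscore_support

CATEGORIES = ["Advertisement", "Irrelevant", "No-visit rant", "None"]

# Hypothetical validation labels vs. model predictions.
y_true = ["Advertisement", "Irrelevant", "None", "None", "No-visit rant", "None"]
y_pred = ["Advertisement", "None", "None", "None", "No-visit rant", "Irrelevant"]

precision, recall, f1, _ = precision_recall_fscore_support(
    y_true, y_pred, labels=CATEGORIES, average="macro", zero_division=0
)
cm = confusion_matrix(y_true, y_pred, labels=CATEGORIES)

print(f"macro P={precision:.2f} R={recall:.2f} F1={f1:.2f}")
print(cm)  # rows: true class, columns: predicted class
```

Passing `labels=CATEGORIES` keeps the confusion matrix rows/columns in a fixed order even when a class is missing from a batch.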

Challenges we ran into

Parsing large JSON files with inconsistent fields (some reviews missing text).

Running zero-shot classification at scale — 265k reviews took hours, and labels were inaccurate.

Prompt engineering: designing prompts and few-shot examples that the LLM interprets consistently.

Accomplishments that we're proud of

Built a full end-to-end NLP moderation pipeline in just a few days.

Successfully processed and cleaned hundreds of thousands of reviews.

Created a clear and reusable policy framework for review moderation.

Achieved working baselines and evaluation metrics without needing human-labeled data.

What we learned

How to combine classical ML + modern LLMs for complementary strengths.

The importance of data cleaning and preprocessing when dealing with real-world datasets.

Practical trade-offs between zero-shot, few-shot, and supervised learning in time-constrained settings.

That prompt design can dramatically affect results even with the same model.

What's next for ReviewShield

Fine-tune transformer models (e.g. DistilBERT) on pseudo-labels for higher accuracy.

Expand categories to detect spam, offensive language, or biased reviews.

Integrate into a dashboard or API for real-time review moderation.

Explore human-in-the-loop validation for critical cases.
