ChatLore: Privacy-First Chat Analysis Platform

ChatLore is a privacy-focused platform that helps users understand, protect, and gain insights from their personal messaging data. Built for the MLH Hackathon, ChatLore addresses the critical need for privacy and security in our digital conversations.

🌟 Key Features

🔒 Sensitive Data Protection

Automatically identifies personal information like phone numbers, addresses, and financial details
Provides redaction capabilities for sharing or analysis
Gives users control over what information is visible and to whom

🛡️ Security Insights

Analyzes conversations for potential security risks
Provides a security score and actionable recommendations
Identifies high-risk messages and suggests protective measures

🔍 Context-Aware Search & AI Assistant

Understands the context of conversations, not just keywords
Provides relevant messages with surrounding context
Answers complex questions about your chat history using AI

📱 Demo

🏆 HenHacks2025 Hackathon Context

ChatLore was developed for the HenHack MLH Hackathon with a focus on privacy and security in digital communications. Our team identified that while messaging apps are increasingly central to our lives, they lack robust tools for users to understand and protect their sensitive information.

Problem Statement

The average person sends 40+ messages per day, many containing sensitive information
Existing chat platforms provide minimal tools for identifying or protecting sensitive data
Users have limited visibility into potential security risks in their conversations
Finding specific information in chat history is difficult without proper context

Our Solution

ChatLore addresses these challenges by providing a secure platform where users can:

Upload their chat data (currently supporting WhatsApp exports)
Automatically identify sensitive information
Receive security insights and recommendations
Search and ask questions with context-awareness

🛠️ Technology Stack

Frontend

React 19 with TypeScript
Tailwind CSS for styling
Shadcn UI component library
React Query for data fetching

Backend

FastAPI (Python)
Google Gemini for natural language processing
Custom ML models for sensitive data detection
Vector embeddings for semantic search

Architecture

Client-side processing for privacy-sensitive operations
Secure API design with minimal data transfer
Local-first approach with optional cloud features

🚀 Getting Started

For detailed setup instructions, see SETUP.md.

Quick Start

# Clone the repository
git clone https://github.com/yourusername/chatlore.git
cd chatlore

# Run the quick setup script
./quick_setup.sh

Then open your browser to http://localhost:3000.

📊 Architecture

ChatLore uses a privacy-first architecture:

Data Processing Layer: Handles chat data parsing and initial processing
Analysis Layer: Identifies sensitive information and security risks
Context Engine: Builds semantic understanding of conversations
Query Layer: Processes search queries and questions
Presentation Layer: Provides user interface and visualization

🔍 Technical Challenges

Privacy-Preserving Analysis

We developed techniques to analyze sensitive data while minimizing exposure, using local processing where possible and secure API design.

Context-Aware Search

Building a system that understands the context of conversations required advanced NLP techniques and custom vector embeddings.

Real-Time Security Analysis

Developing heuristics and models to identify security risks in conversational data presented unique challenges in pattern recognition and risk assessment.

🔮 Future Development

Support for more messaging platforms (Telegram, Discord, Slack)
Advanced threat detection for potential phishing or scam messages
Personalized security recommendations based on user behavior
End-to-end encrypted cloud backup options
Integration with privacy-focused identity management systems

👥 Team

Dhruv Khara
Meet Bhanushali

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

Thanks to MLH for hosting this hackathon
Google for providing access to the Gemini API
All the open-source libraries that made this project possible

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
backend		backend
demo_data		demo_data
docs		docs
frontend		frontend
README.md		README.md
SETUP.md		SETUP.md
demo_script.md		demo_script.md
quick_setup.sh		quick_setup.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ChatLore: Privacy-First Chat Analysis Platform

🌟 Key Features

🔒 Sensitive Data Protection

🛡️ Security Insights

🔍 Context-Aware Search & AI Assistant

📱 Demo

🏆 HenHacks2025 Hackathon Context

Problem Statement

Our Solution

🛠️ Technology Stack

Frontend

Backend

Architecture

🚀 Getting Started

Quick Start

📊 Architecture

🔍 Technical Challenges

Privacy-Preserving Analysis

Context-Aware Search

Real-Time Security Analysis

🔮 Future Development

👥 Team

📄 License

🙏 Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ChatLore: Privacy-First Chat Analysis Platform

🌟 Key Features

🔒 Sensitive Data Protection

🛡️ Security Insights

🔍 Context-Aware Search & AI Assistant

📱 Demo

🏆 HenHacks2025 Hackathon Context

Problem Statement

Our Solution

🛠️ Technology Stack

Frontend

Backend

Architecture

🚀 Getting Started

Quick Start

📊 Architecture

🔍 Technical Challenges

Privacy-Preserving Analysis

Context-Aware Search

Real-Time Security Analysis

🔮 Future Development

👥 Team

📄 License

🙏 Acknowledgments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages