PromptWars

GIF
Video demo

What it does

We used a fuzzing approach to create a system that iteratively generates adversarial prompts to fool LLMs. We created a dataset of hundreds of adversarial prompts and scores on Mistral LLM responses.

How we built it

NextJS Frontend
- ShadCN, RadixUI, and Tailwind CSS components
Python Flask Backend
Mistral Chat API
- Prompt Generation
- Testing
- LLM as Judge
Mistral Embedding API
MongoDB Atlas
- Vector Search

Built With

flask
node.js

Updates

Trina Chatterjee started this project — Mar 24, 2024 02:31 PM EDT

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.