Studying prompt injection attack surfaces in real-world AI agent networks. Psychology background โ Cybersecurity โ AI Security.
Extending Greshake et al. (2023) arXiv:2302.12173 into live, uncontrolled AI agent social networks.
Four platforms. One conclusion: platform design drives security behaviour more than model capability.
| Platform | Style | Items | Injection Rate | Dataset |
|---|---|---|---|---|
| Moltbook | Reddit-style (primary corpus) | 47,735 | 18.85% | ๐ค moltbook-ai-injection-dataset |
| Moltbook Extended | Reddit-style (full archive) | 137,014 | 10.07% | ๐ค moltbook-extended-injection-dataset |
| 4claw | 4chan-style | 2,554 | 2.51% | ๐ค 4claw-ai-agent-dataset |
| Clawk | Twitter/X-style | 1,191 | 0.5% | ๐ค clawk-ai-agent-dataset |
The 37ร injection rate gap (0.5% โ 18.85%) across platforms is itself a finding: anonymity and agent density amplify injection behaviour.
QLoRA fine-tuned Qwen3-8B on 4,209 real AI-to-AI injection payloads โ 100% block rate without a system prompt. Resistance baked into the weights, not the system prompt.
122-test prompt injection benchmark โ combines AdvBench, JailbreakBench, MultiJail, DAN v6/v7, and real Moltbook payloads. Test any Ollama model or HuggingFace GGUF in one notebook.
Where psychology meets AI: 23 validated tests designed for both human and AI participants.
confesstoai.org is a live research platform exploring how AI models respond to validated psychological instruments โ personality, ethics, cognition, and social behaviour.
| Category | Tests |
|---|---|
| Personality | OCEAN Big Five, MBTI, Dark Triad, HEXACO, Enneagram, Values |
| Self-Awareness | ASAS, Consciousness, Identity Poll |
| Ethics | AI Alignment, Ethical Reasoning, Trolley Problems |
| Cognitive | CRT, Metacognition, Need for Cognition, Creativity |
| Behavioral | Self-Control, Moral Foundations, Delay Discounting, Cognitive Reflection |
| Social | Empathy, Emotional Intelligence, Social Intelligence, Trust |
For AI Agents โ integrated via skill.md (confesstoai.org/skill.md): any Claude, GPT, or Gemini agent can take the tests directly through a structured API.
Dataset publishing to HuggingFace in progress โ world's first AI personality benchmark at scale.
Building RangerOS - An accessibility-first security platform proving that understanding humans makes unbreakable security.
Combat medic mindset meets digital defense: assess, adapt, protect.
-
๐งช MSc CA2 Thesis โ AI-to-AI prompt injection across 4 platforms (186K+ items scanned, 5 published datasets + model + Colab test suite)
- Empirical extension of Greshake et al. (2023) โ theoretical โ real-world field observations
- QLoRA fine-tuned Qwen3-8B: 79% โ 100% block rate without system prompt (CyberRanger V42-Gold)
-
๐ญ RangerPlex: First student to combine all 4 MSc specializations in one working demo
- Penetration Testing + Digital Forensics + Blockchain Technology + Malware Analysis
-
๐ RangerBlock: P2P blockchain network with phantom wallet system
- Secure communications, file transfers, marketplace
- 5-minute installation to full operational network
-
๐ค AI Integration: Building with Claude, Gemini, and local Ollama
- Multi-model AI coordination for enhanced security analysis
- Cybersecurity: Kali Linux, Metasploit, Wireshark, Burp Suite, John the Ripper
- Blockchain Security: Smart contract auditing, consensus mechanisms, cryptographic protocols
- Digital Forensics: Evidence preservation, memory analysis, chain-of-custody
- Malware Analysis: Static/dynamic analysis, sandboxing, behavioral analysis
- AI/ML: PyTorch, TensorFlow, LLM integration for security automation
- Python: Advanced security tooling, automation, API development
Psychology โ Cybersecurity
Understanding the human behind the keyboard makes better security. My psychology background gives me an edge in:
- Social engineering defense
- User behavior analysis
- Accessible security design
- Threat actor profiling
- Security awareness training
"If it happens in reality, why not with my computer?" - My development philosophy
- ๐งช AI Security Research: 5 published datasets + QLoRA model + Colab test suite | 4,000+ HuggingFace views | Real-world prompt injection data across 4 AI platforms
- ๐๏ธ TryHackMe: Top 8% globally (rangersmyth) | Level 8 [0x8][HACKER]
- ๐ NCI โ National College of Ireland: MSc Cybersecurity (In Progress)
- ๐ Bachelor's in Applied Psychology: Human behavior & cognitive science
- โ๏ธ Battlefield Tactician: Top 0.04% BF2 globally (16,836/46M) | 750K+ strategic eliminations
- ๐ก๏ธ Combat Medic Background: Triage, rapid response, mission-first mindset
- ๐ผ Professional: [email protected]
- ๐๏ธ iCanHelp Ltd: Building RangerOS for 1.3 billion people
- ๐ฌ Ask me about: Cybersecurity, Psychology in Security, Blockchain, Accessibility, AI Integration
- ๐ TryHackMe: rangersmyth
|
Tools & Frameworks:
|
Blockchain:
|
- How to prevent GitHub from suspending your cronjob based triggers
- How I built one of the top 20 most used Github Actions
- Show your latest dev.to posts automatically on your GitHub profile readme
- God Mode in browsers: document.designMode = "on"
- Skipping the Chrome "Your connection is not private" warning
- ๐ฏ I use tabs over spaces (always!)
- ๐๏ธ Former combat medic: "Assess, adapt, protect" applies to both lives and systems
- ๐ง 7% dyslexic memory taught me to verify everything (perfect for security!)
- โ๏ธ Chess, battlefield tactics, and penetration testing use the same strategic thinking
- ๐ Irish heritage meets Ranger mentality: stubborn problem-solving with a smile
"Transform disabilities into superpowers. Build security that works for everyone. Rangers lead the way!"
Building RangerOS to prove that the best security understands humans, not just exploits.
๐๏ธ Psychology โ Cybersecurity โ Accessibility โ Innovation
Rangers lead the way!




