HeadlessX is a self-hosted scraping platform with a web dashboard, protected API, queue-backed workflows, and a remote MCP endpoint.
Current live surfaces:
- Website scraping: scrape, crawl, map, content extraction, screenshots
- Google AI Search
- Tavily
- Exa
- YouTube
- Queue jobs, logs, API keys, proxy management, and config management
- Remote MCP over `/mcp`
- Simplified the dashboard around one global browser/runtime model
- Added Tavily, Exa, and YouTube workspaces
- Added queued crawl and job flows with Redis + worker support
- Added remote MCP secured with normal dashboard-created API keys
- Added setup and API guides aligned with the current route tree
| Scraper | Description | Status |
|---|---|---|
| Google Maps | Extract business listings, reviews, categories, ratings, contact details, opening hours, and location metadata from Google Maps search results. | Planned |
| Twitter / X | Capture profiles, posts, engagement metrics, media, hashtags, and conversation threads from public X pages. | Planned |
| LinkedIn | Extract public company and profile data, role details, locations, website links, and business metadata from LinkedIn surfaces. | Planned |
| Instagram | Collect public profile data, captions, post metadata, media links, reels references, and engagement signals. | Planned |
| Amazon | Extract product listings, seller data, pricing, ratings, reviews, availability, and catalog metadata from Amazon pages. | Planned |
| Facebook | Capture public page data, posts, about fields, links, follower counts, and engagement metadata from Facebook pages. | Planned |
| Reddit | Extract subreddit, post, comment, author, score, flair, and discussion metadata from Reddit threads and listings. | Planned |
| ThomasNet Suppliers Real-Time Scraper | Extract 70+ ThomasNet supplier fields including emails, phone numbers, company data, products, locations, certifications, and more. | Planned |
| TLS Appointment Booker | Automate TLS appointment availability checks and booking workflows with support for high-frequency monitoring and retry-safe session handling. | Planned |
| GlobalSpec Suppliers Scraper | Extract 200,000+ industrial supplier profiles from GlobalSpec Engineering360 with contact data, business type, product catalogs, specs, and datasheets. | Planned |
| ImportYeti Scraper | Extract supplier profiles, shipment records, and trade data from ImportYeti with 60+ fields including HS codes, shipping lanes, carriers, bills of lading, trading partners, and contact info. | Planned |
| MakersRow Scraper | Extract 11,600+ US manufacturer profiles from MakersRow with email, phone, address, website, GPS coordinates, capabilities, ratings, gallery images, and business hours. | Planned |
| Surface | Description | Status |
|---|---|---|
| Web AI Agent (/web) | Interactive AI agent workspace inside the dashboard that can use all HeadlessX playground tools and scrapers, including Website, Google AI Search, Tavily, Exa, YouTube, and related workflow actions. | Planned |
You can add the HeadlessX CLI skill to AI coding agents such as Cursor, Claude Code, Warp, Windsurf, OpenCode, OpenClaw, Antigravity, and similar tools that support the skills installer flow.
```bash
npx skills add https://github.com/saifyxpro/HeadlessX --skill cli
```

This installs the HeadlessX CLI skill from this repository so the agent can use the published `headlessx` command and follow the packaged usage guidance.
- Node.js 22+
- pnpm 9+
- PostgreSQL
- Redis
- Python/uv for `yt-engine`
- Go for the HTML-to-Markdown sidecar
Recommended for most developers:
- PostgreSQL: Supabase or Docker
- Redis: Docker
- App runtime: `pnpm dev` or `mise run dev`
This keeps infrastructure simple while still running the app locally.
- Clone and install:

```bash
git clone https://github.com/saifyxpro/HeadlessX.git
cd HeadlessX
pnpm install
```

- Create root `.env` from the full example:

```bash
cp .env.example .env
```

Current root `.env.example`:
```bash
# HeadlessX v2.1.0
# Local development environment

# Database
DATABASE_URL="postgresql://postgres.xxxxx:[email protected]:5432/postgres"

# API server
PORT=8000
HOST=0.0.0.0
NODE_ENV=development

# Required security
# Used by the Next.js dashboard server to authenticate against the API.
DASHBOARD_INTERNAL_API_KEY=replace-with-a-long-random-string
# Used to encrypt stored credentials at rest.
CREDENTIAL_ENCRYPTION_KEY=replace-with-a-different-long-random-string

# Queue and Redis
REDIS_URL=redis://localhost:6379

# Search providers
TAVILY_API_KEY=
EXA_API_KEY=

# Local engines
YT_ENGINE_URL=http://localhost:8090
YT_ENGINE_PORT=8090
YT_ENGINE_TIMEOUT_MS=45000
YT_ENGINE_TEMP_DIR=./tmp/yt-engine
YT_ENGINE_JOB_TTL_HOURS=12
HTML_TO_MARKDOWN_SERVICE_URL=http://localhost:8081
HTML_TO_MARKDOWN_PORT=8081
HTML_TO_MARKDOWN_TIMEOUT_MS=60000

# Browser and stealth defaults are managed in the dashboard settings UI.

# Web dashboard
WEB_PORT=3000
NEXT_PUBLIC_API_URL=http://localhost:8000
# Set this for Docker or custom internal networking.
# INTERNAL_API_URL=http://localhost:8000
# Set this only when the dashboard is hosted on a custom origin.
# FRONTEND_URL=https://dashboard.example.com
```

If you are using Docker instead of local services, start from the complete Docker env too:
```bash
cp infra/docker/.env.example infra/docker/.env
```

- Prepare services:

```bash
pnpm db:push
pnpm camoufox:fetch
```

- Start the workspace:

```bash
pnpm dev
```

This starts:
- web
- api
- worker
- HTML-to-Markdown service
- yt-engine
Important:

- `pnpm dev` does not provision PostgreSQL or Redis
- Website Crawl requires both Redis and the worker
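As a quick preflight before `pnpm dev`, you can verify that the infrastructure variables from the `.env` example are set. This is a hypothetical helper, not part of HeadlessX; the variable names come from the root `.env.example` above:

```typescript
// Hypothetical preflight helper: report which required env vars are unset.
// The variable names are taken from the root .env example in this guide.
function missingEnvVars(env: Record<string, string | undefined>): string[] {
  const required = [
    "DATABASE_URL",
    "REDIS_URL",
    "DASHBOARD_INTERNAL_API_KEY",
    "CREDENTIAL_ENCRYPTION_KEY",
  ];
  return required.filter((name) => !env[name]);
}

const missing = missingEnvVars(process.env);
if (missing.length > 0) {
  console.error(`Missing before pnpm dev: ${missing.join(", ")}`);
}
```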
For the current Docker path:
```bash
cp infra/docker/.env.example infra/docker/.env
cd infra/docker
docker compose --profile all up --build -d
```

Important notes:

- use `--profile all`
- partial profile runs are not currently reliable because of `depends_on` relationships
- full Docker now includes `yt-engine`, so YouTube works inside the same compose stack
See docs/setup-guide.md for the full matrix:
- no-Docker setup
- mixed local setup
- full Docker setup
- MCP client configuration
All non-health backend routes are protected with the `x-api-key` header.
Core backend surfaces:
- `GET /api/health`
- `GET/PATCH /api/config`
- `GET /api/dashboard/stats`
- `GET /api/logs`
- `GET/POST/PATCH/DELETE /api/keys`
- proxy CRUD under `/api/proxies`
- website operator routes under `/api/operators/website/*`
- Google AI Search routes under `/api/operators/google/ai-search/*`
- Tavily routes under `/api/operators/tavily/*`
- Exa routes under `/api/operators/exa/*`
- YouTube routes under `/api/operators/youtube/*`
- queue job routes under `/api/jobs/*`
- remote MCP endpoint at `/mcp`
See the full route reference in docs/api-endpoints.md.
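A minimal sketch of how a client might attach the key when calling these routes. The `buildApiRequest` helper and the default base URL are assumptions for illustration; only the `x-api-key` header name and the route paths come from this guide:

```typescript
// Sketch: build an authenticated request for a protected HeadlessX route.
// buildApiRequest is a hypothetical helper; x-api-key is the documented
// auth header, and the default base URL matches the local setup above.
function buildApiRequest(
  path: string,
  apiKey: string,
  base: string = "http://localhost:8000",
): Request {
  return new Request(new URL(path, base), {
    headers: { "x-api-key": apiKey },
  });
}

// Usage (assuming the API is running locally):
//   const res = await fetch(buildApiRequest("/api/health", "hx_your_key"));
const req = buildApiRequest("/api/dashboard/stats", "hx_your_key");
```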
HeadlessX exposes a remote MCP endpoint from the API:
`http://localhost:8000/mcp`
Use a normal API key created from the dashboard API Keys page.
Do not use DASHBOARD_INTERNAL_API_KEY for MCP clients.
Example client config:
```json
{
  "mcpServers": {
    "headlessx": {
      "transport": "http",
      "url": "http://localhost:8000/mcp",
      "headers": {
        "x-api-key": "hx_your_dashboard_created_key"
      }
    }
  }
}
```

Repository layout:

```
apps/
  api/                    Express API + worker + MCP
  web/                    Next.js dashboard
  yt-engine/              Python YouTube engine
  go-html-to-md-service/  Go HTML-to-Markdown sidecar
docs/
  setup-guide.md
  api-endpoints.md
infra/docker/
```
| Package | Description | Status |
|---|---|---|
| @headlessx-cli/core | Published CLI package for HeadlessX operators, jobs, and search workflows. Command: `headlessx` | Available |
| HeadlessX Agent Skills | Installable agent skill pack from this repository for Cursor, Claude Code, Warp, Windsurf, OpenCode, OpenClaw, Antigravity, and similar tools. | Available |
| Package | Description | Status |
|---|---|---|
| headfox | HeadlessX-maintained Firefox-based anti-detect browser engine that will power the platform's next-generation browser runtime. | Planned |
| headfox-js | TypeScript package for launching, managing, and integrating Headfox in Node.js automation and scraping flows. | Planned |
- The dashboard uses the internal dashboard key for server-side internal requests
- MCP uses normal user-created API keys, not the dashboard internal key
- Queue-backed features degrade or report as unavailable when Redis is missing
- Docker support now covers the full runtime stack, including yt-engine
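The two required secrets in the root `.env` (`DASHBOARD_INTERNAL_API_KEY` and `CREDENTIAL_ENCRYPTION_KEY`) can be generated with Node's `crypto` module. This is a sketch under the assumption that any sufficiently long random string is acceptable; the guide only asks for "a long random string":

```typescript
// Generate values for the two required secrets in the root .env.
// Assumption: 32 random bytes (64 hex characters) is a reasonable default.
import { randomBytes } from "node:crypto";

const internalKey = randomBytes(32).toString("hex");
const encryptionKey = randomBytes(32).toString("hex");

console.log(`DASHBOARD_INTERNAL_API_KEY=${internalKey}`);
console.log(`CREDENTIAL_ENCRYPTION_KEY=${encryptionKey}`);
```

Remember that the two values must differ, per the placeholders in `.env.example`.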
See CONTRIBUTING.md for the current contribution workflow, local setup expectations, pull request guidance, and commit message conventions.
MIT