Skip to content
Change the repository type filter

All

    Repositories list

    • pypac

      Public
      Find and use proxy auto-config (PAC) files with Python and Requests.
      Python
      Apache License 2.0
      21000Updated Apr 26, 2026Apr 26, 2026
    • A set of spiders and scrapers to extract location information from places that post their location on the internet.
      Python
      Other
      2550069Updated Apr 25, 2026Apr 25, 2026
    • iplist

      Public
      IP Address Collection and Management Service with multiple output formats: mikrotik, json, text, ipset, nfset, clashx, keenetic, switchy, amnezia
      HTML
      MIT License
      50000Updated Apr 24, 2026Apr 24, 2026
    • Scrapling

      Public
      🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl!
      Python
      BSD 3-Clause "New" or "Revised" License
      3.4k000Updated Apr 24, 2026Apr 24, 2026
    • hero

      Public
      The web browser that’s nearly impossible for bot blockers to block
      TypeScript
      MIT License
      74000Updated Apr 24, 2026Apr 24, 2026
    • apify-js

      Public
      Apify SDK — The scalable web scraping and crawling library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) wit…
      TypeScript
      Apache License 2.0
      1.3k000Updated Apr 24, 2026Apr 24, 2026
    • Scrape data from websites using Open Graph, HTML metadata & fallbacks.
      HTML
      MIT License
      184005Updated Apr 24, 2026Apr 24, 2026
    • adblocker

      Public
      Efficient embeddable adblocker library
      TypeScript
      Mozilla Public License 2.0
      1180013Updated Apr 24, 2026Apr 24, 2026
    • Scrapy extension that gives you all the scraping monitoring, alerting, scheduling, and data validation you will need straight out of the box.
      Python
      BSD 3-Clause "New" or "Revised" License
      13000Updated Apr 23, 2026Apr 23, 2026
    • More routines for operating on iterables, beyond itertools
      Python
      MIT License
      316000Updated Apr 23, 2026Apr 23, 2026
    • Python client for Zyte Data API
      Python
      BSD 3-Clause "New" or "Revised" License
      6000Updated Apr 23, 2026Apr 23, 2026
    • HTTP client made for scraping based on got.
      TypeScript
      60000Updated Apr 22, 2026Apr 22, 2026
    • A browser driver on top of puppeteer, ready for production scenarios.
      JavaScript
      MIT License
      90000Updated Apr 22, 2026Apr 22, 2026
    • Zyte Data API integration for Scrapy
      Python
      BSD 3-Clause "New" or "Revised" License
      21000Updated Apr 22, 2026Apr 22, 2026
    • Fix Burp Suite's horrible TLS stack & spoof any browser fingerprint
      Java
      GNU General Public License v3.0
      116000Updated Apr 22, 2026Apr 22, 2026
    • JavaScript object that creates unique CSS selector for given object.
      TypeScript
      MIT License
      94001Updated Apr 22, 2026Apr 22, 2026
    • web-poet

      Public
      Web scraping Page Objects core library
      Python
      BSD 3-Clause "New" or "Revised" License
      19000Updated Apr 21, 2026Apr 21, 2026
    • Page Object pattern for Scrapy
      Python
      BSD 3-Clause "New" or "Revised" License
      29000Updated Apr 21, 2026Apr 21, 2026
    • spidermon

      Public
      Scrapy Extension for monitoring spiders execution.
      Python
      BSD 3-Clause "New" or "Revised" License
      103000Updated Apr 21, 2026Apr 21, 2026
    • Accurately separate the TLD from the registered domain and subdomains of a URL, using the Public Suffix List.
      Python
      BSD 3-Clause "New" or "Revised" License
      212000Updated Apr 21, 2026Apr 21, 2026
    • fpscanner

      Public
      TypeScript
      MIT License
      75000Updated Apr 21, 2026Apr 21, 2026
    • 43 MB Google Chrome to fit inside AWS Lambda Layer compressed with Brotli
      MIT License
      52000Updated Apr 21, 2026Apr 21, 2026
    • The New (auto rotate) Proxy [Finder | Checker | Server]. HTTP(S) & SOCKS :performing_arts:
      Python
      Apache License 2.0
      1.2k000Updated Apr 20, 2026Apr 20, 2026
    • A list of most common User Agent used on Internet.
      JavaScript
      MIT License
      21000Updated Apr 20, 2026Apr 20, 2026
    • Python type wrappers for Chrome DevTools Protocol (CDP)
      Python
      MIT License
      27000Updated Apr 18, 2026Apr 18, 2026
    • estela entrypoint for job runner 🕸
      Python
      2000Updated Apr 17, 2026Apr 17, 2026
    • An active fork of curl-impersonate with more versions and build targets. A series of patches that make curl requests look like Chrome and Firefox.
      Batchfile
      MIT License
      445000Updated Apr 17, 2026Apr 17, 2026
    • List of libraries, tools and APIs for web scraping and data processing.
      Makefile
      Other
      887000Updated Apr 17, 2026Apr 17, 2026
    • 🖱️ Generate human-like mouse movements with puppeteer or on any 2D plane
      TypeScript
      MIT License
      159000Updated Apr 15, 2026Apr 15, 2026
    • estela

      Public
      estela, an elastic web scraping cluster 🕸
      TypeScript
      MIT License
      18000Updated Apr 15, 2026Apr 15, 2026
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.