scry

Web scraping engines with Python and Scrapy

Implemented Engines

Engine: kickass

Automatically perform category / keyword searches at kickasstorrents.com and queue them for download with transmission bit torrent client. Setup a cron job to completely automate torrent searching and downloading.

Quick Start

git clone [email protected]:arapidhs/scry.git
cd /kickass
$ scrapy crawl kickass -a category=books -a keywords='python,java'
Setup a cron job to keep scraping at intervals

Example that runs the spider every morning at 09:00

00 09 * * * export DISPLAY=:0.0 && cd ~/scry/kickass && \ /usr/local/bin/scrapy crawl kickass -a category=books \ -a keywords='python,java,sclala topics' >> ~/scrapy.log 2>&1

Email an hourly updated list of downloading torrents and search results via cronjob

0 */1 * * * cd ~/development/scrapy/kickass && ~/development/scrapy/kickass/mail.sh

Installing Scrapy

sudo apt-key adv --keyserver hkp://keyserver.ubuntu.com:80 --recv 627220E7
echo 'deb http://archive.scrapy.org/ubuntu scrapy main' | sudo tee /etc/apt/sources.list.d/scrapy.list
sudo apt-get update && sudo apt-get install scrapy-0.24

Official installation instructions http://doc.scrapy.org/en/latest/intro/install.html#intro-install-platform-notes

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
kickass		kickass
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

scry

Implemented Engines

Engine: kickass

Quick Start

Installing Scrapy

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

scry

Implemented Engines

Engine: kickass

Quick Start

Installing Scrapy

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages