Skip to content

MajewskiMichal/Scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Scraper

REST application for parsing webpage and scraping text and images from it.

Running

When you are in docker-compose.yml directory, this repo can be run by typing:

$ docker-compose up

Scrape text

Definition

POST /api/persist_text

Response

  • "success": "Text parsed into /tmp{webpage name}.txt on success
    {
        "url": "https://en.wikipedia.org/wiki/Python"

    }

Scrape images

Definition

POST '/api/persist_image

Response

  • "success": "success": "Images parsed into tmp catalogue" on success
    {
        "url": "https://en.wikipedia.org/wiki/Python"

    }

To be implemented:

threading, db, unitests

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors