REST application for parsing webpage and scraping text and images from it.
When you are in docker-compose.yml directory, this repo can be run by typing:
$ docker-compose upDefinition
POST /api/persist_text
Response
"success": "Text parsed into /tmp{webpage name}.txton success
{
"url": "https://en.wikipedia.org/wiki/Python"
}Definition
POST '/api/persist_image
Response
"success": "success": "Images parsed into tmp catalogue"on success
{
"url": "https://en.wikipedia.org/wiki/Python"
}threading, db, unitests