Scrape (Swedish) news pages
Expected stdout
<news title> [<link>]
this is a title [https://omni.se/page/for/title]
- clone repo
git clone <repo>
- set up environment
python3 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
- run
python news_scrapper.py
example
$ python news_scrapper.py -h
usage: news_scrapper [-h] [--debug] {omni,nwt,dn} ...
Scrap news papper
optional arguments:
-h, --help show this help message and exit
--debug Enable debug
commands:
{omni,nwt,dn} sub-command
omni https://omni.se
nwt https://www.nwt.se/
dn https://www.dn.se/
example: sub-command
$ python news_scrapper.py omni -h
usage: news_scrapper omni [-h] [--section {tech,inrikes,utrikes}]
Scrap news from https://omni.se
optional arguments:
-h, --help show this help message and exit
--section {tech,inrikes,utrikes}
sub-section
supported sites
- omni
- nwt
- dn
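The help output above is consistent with an argparse parser that has one sub-command per site. The following is only a minimal sketch of such a layout, not the actual contents of news_scrapper.py: the names `SITES`, `OMNI_SECTIONS`, `build_parser` and the `site` attribute are assumptions made for illustration, and the help strings are copied from the output shown above.

```python
import argparse

# Assumed mapping of sub-command name to site URL, taken from the help text.
SITES = {
    "omni": "https://omni.se",
    "nwt": "https://www.nwt.se/",
    "dn": "https://www.dn.se/",
}

# Sections listed by `news_scrapper omni -h`.
OMNI_SECTIONS = ["tech", "inrikes", "utrikes"]


def build_parser() -> argparse.ArgumentParser:
    """Build a parser with a global --debug flag and one sub-command per site."""
    parser = argparse.ArgumentParser(
        prog="news_scrapper", description="Scrap news papper"
    )
    parser.add_argument("--debug", action="store_true", help="Enable debug")

    # `dest` stores the chosen sub-command name on the parsed namespace.
    subparsers = parser.add_subparsers(
        title="commands", dest="site", help="sub-command"
    )
    for name, url in SITES.items():
        sub = subparsers.add_parser(
            name, help=url, description=f"Scrap news from {url}"
        )
        if name == "omni":
            # Only omni documents a --section option in the help output.
            sub.add_argument(
                "--section", choices=OMNI_SECTIONS, help="sub-section"
            )
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(args)
```

With this layout, `--debug` goes before the sub-command and `--section` after it, for example `python news_scrapper.py --debug omni --section tech`.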
wish list
- aftonbladet
- expressen
- other?
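A scraper for any site on the wish list would presumably need to emit lines in the expected stdout format shown at the top. As a rough sketch only (the helper name `format_headline` is hypothetical and not taken from the repository), the formatting step could look like this:

```python
def format_headline(title: str, link: str) -> str:
    """Return one output line in the form '<news title> [<link>]'."""
    # Strip surrounding whitespace, which scraped text often carries.
    return f"{title.strip()} [{link.strip()}]"


if __name__ == "__main__":
    # Example values taken from the expected stdout section.
    print(format_headline("this is a title", "https://omni.se/page/for/title"))
```

Keeping the formatting in one place would mean each new site only has to supply (title, link) pairs.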