Skip to content

frki/news-scrapper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

News scrapper

Scrap (swedish) news pages

Expexted stdout

<news title [<link>]
this is a title [https://omni.se/page/for/title]

how to

  1. clone repo git clone <repo>
  2. setup environment
    python3 -m venv .venv
    source .venv/bin/activate
    pip install -r requirements.txt
  3. run python news_scrapper.py

example

$ python news_scrapper.py -h
usage: news_scrapper [-h] [--debug] {omni,nwt,dn} ...

Scrap news papper

optional arguments:
  -h, --help     show this help message and exit
  --debug        Enable debug

commands:
  {omni,nwt,dn}  sub-command
    omni         https://omni.se
    nwt          https://www.nwt.se/
    dn           https://www.dn.se/

example: sub-command

$ python news_scrapper.py omni -h
usage: news_scrapper omni [-h] [--section {tech,inrikes,utrikes}]

Scrap news from https://omni.se

optional arguments:
  -h, --help            show this help message and exit
  --section {tech,inrikes,utrikes}
                        sub-section

supported news pages

  • omni
  • nwt
  • dn

wish list

  • aftonbladet
  • expressen
  • other?

About

scrap news pages

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages