Skip to content

jennz0/Web_scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

CS410 ---web scraper

Modified scraper to fetch information from target URL

  • Driver functions are in scraper.py, bios.txt and bio_urls.txt are the output information
  • Modify scraper.py to fit specific tasks
  • Dependencies:
from bs4 import BeautifulSoup
from selenium import webdriver 
from selenium.webdriver.chrome.options import 

Structure

		├── README.md
		├── bio_urls.txt
		├── bios.txt
		├── livedatalab_config.json
		├── sample
		│   ├── bio_urls.txt
		│   ├── bios.txt
		│   └── scraper.ipynb
		└── scraper_code
		    └── scraper.py

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages