amyzhang/webCrawler
Folders and files
| Name | Name | Last commit date | ||
|---|---|---|---|---|
Repository files navigation
inURLs.txt: text file of URLs to be crawled getURL.py: reads inURLs.txt and prints at least 10 URLs with activities/events from the same root domain to outURLs.txt outURLs.txt: text file of URLs with activities found