GoogleScrape

GoogleScrape is a simple web application designed to fetch and parse organic search results from Google. The application provides a user-friendly interface with an input field where you can enter a search query, and it returns the results from the first page of Google's organic search in a structured format. This project demonstrates the ability to scrape web data without using the official Google API.

Features

  • User Input: Allows users to enter a search query through a web-based form.
  • Search Parsing: Extracts and displays the organic search results (title and URL) from the first page of Google search.
  • JSON Output: Returns the parsed results in a machine-readable JSON format for easy storage and processing.
  • Server-Side Processing: Uses a Python-based backend (Flask) to handle web scraping and data parsing.
  • No Google API: Data is extracted directly from the search results page rather than through Google's official API.
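
The parsing and JSON output features above can be illustrated with a minimal sketch. It is not the project's actual code: the project lists BeautifulSoup, while this example uses the standard library's `html.parser` as a stand-in so it runs without extra packages. The markup in `SAMPLE_HTML` (a link wrapping an `h3` title) is an assumption made for illustration, and the names `ResultParser` and `parse_results` are hypothetical. Google's real result markup differs and changes over time.

```python
import json
from html.parser import HTMLParser


class ResultParser(HTMLParser):
    """Collects title/URL pairs from <a href="..."><h3>title</h3></a> blocks."""

    def __init__(self):
        super().__init__()
        self.results = []
        self._href = None    # href of the <a> element currently open, if any
        self._in_h3 = False  # True while inside an <h3> nested in a link
        self._title = []     # text fragments of the current title

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
        elif tag == "h3" and self._href:
            self._in_h3 = True
            self._title = []

    def handle_data(self, data):
        if self._in_h3:
            self._title.append(data)

    def handle_endtag(self, tag):
        if tag == "h3" and self._in_h3:
            self._in_h3 = False
            title = "".join(self._title).strip()
            if title:
                self.results.append({"title": title, "url": self._href})
        elif tag == "a":
            self._href = None


def parse_results(html):
    """Return a list of {"title", "url"} dicts found in the given HTML."""
    parser = ResultParser()
    parser.feed(html)
    parser.close()
    return parser.results


# Assumed sample markup, not real Google output.
SAMPLE_HTML = """
<div><a href="https://example.com/one"><h3>First result</h3></a></div>
<div><a href="https://example.com/two"><h3>Second <b>result</b></h3></a></div>
<div><a href="https://example.com/nav">No heading here</a></div>
"""

if __name__ == "__main__":
    print(json.dumps(parse_results(SAMPLE_HTML), indent=2))
```

Links without a nested heading are skipped, which is one simple way to separate result entries from navigation links under the assumed markup.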

Tech Stack

  • Frontend: HTML, JavaScript (Fetch API)
  • Backend: Python (Flask, BeautifulSoup, Requests)
  • Output: JSON data format
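
As a rough sketch of how the pieces of this stack could fit together, the example below shows a Flask route that accepts a query and returns JSON. The route path `/search`, the query parameter `q`, and the helper `fetch_results` are assumptions for illustration and may not match the repository's `app.py`. The helper returns placeholder data instead of contacting Google.

```python
from flask import Flask, jsonify, request

app = Flask(__name__)


def fetch_results(query):
    """Hypothetical placeholder for the scraping step.

    A real implementation would request the results page and parse it;
    this stub returns fixed data so the sketch runs offline.
    """
    return [{"title": f"Placeholder result for {query}", "url": "https://example.com/"}]


@app.route("/search")
def search():
    # Read the search query from the URL, e.g. /search?q=flask
    query = request.args.get("q", "").strip()
    if not query:
        return jsonify({"error": "missing query parameter 'q'"}), 400
    return jsonify({"query": query, "results": fetch_results(query)})


if __name__ == "__main__":
    app.run()
```

Under these assumptions, the frontend could call the route with the Fetch API and render the returned JSON.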

Prerequisites

  • Python 3.x
  • Flask
  • BeautifulSoup4
  • Requests

How to Use

  1. Clone the repository.
  2. Install the required Python packages:
     pip install flask beautifulsoup4 requests
  3. Run the Flask server:
     python app.py
  4. Open a web browser and go to https://mustafos.pythonanywhere.com
  5. Enter a search query in the input field and click Search.
  6. The results from the first page of Google’s organic search will be displayed in JSON format.

Disclaimer

This project is intended for educational and personal use only. Scraping Google’s search results directly may violate their terms of service, and heavy usage can result in IP blocking. Use this application responsibly and at your own risk.
