Web-Scrapper

Flask Web Scraper with Multiple Libraries

This project is a Flask web application that allows users to scrape content from websites using three different Python web scraping libraries. It provides a simple UI to input a URL, select a scraping method, and view the scraped content on the same page.

Features

Scraping Methods: Supports three popular scraping libraries:
- Playwright: For scraping dynamic content. It’s ideal for pages that use JavaScript heavily for rendering, such as modern web applications that rely on client-side frameworks like React, Angular, or Vue.
- BeautifulSoup: For simple and fast HTML parsing. It works best for scraping pages where all relevant content is directly present in the initial HTML response.
- lxml: A fast XML and HTML parser using XPath. Like BeautifulSoup, it won’t capture content that’s dynamically loaded by JavaScript.
User Interface: Input a URL, choose the scraping method, and view the scraped content via the web interface.
Dynamic Content Support: Ability to scrape dynamic content (such as JavaScript-rendered pages) using Playwright.

Tech Stack

Backend: Flask (Python)
Frontend: HTML, CSS
Scraping Libraries:

Installation

Prerequisites

Python 3.x
A web browser

Clone the Repository:

git clone <repository-url>
cd your_project

Install Required Packages: Make sure you have Python installed (preferably Python 3). Install the required packages using pip:
```
pip install flask requests beautifulsoup4 lxml playwright
```
Playwright requires an additional setup to install browsers:
```
playwright install
```
Run the Application: Start the Flask application:
```
python app.py
```
Access the App: Open your web browser and navigate to http://127.0.0.1:500 to access the application.

Name		Name	Last commit message	Last commit date
parent directory ..
templates		templates
README.md		README.md
app.py		app.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

Flask Web Scraper with Multiple Libraries

Features

Tech Stack

Table of Contents

Installation

Prerequisites

FilesExpand file tree

Web-Scrapper

Directory actions

More options

Directory actions

More options

Latest commit

History

Web-Scrapper

Folders and files

parent directory

README.md

Flask Web Scraper with Multiple Libraries

Features

Tech Stack

Table of Contents

Installation

Prerequisites