detect_html_tables_to_csv

HTML Table Scraper

Introduction

This Python script allows you to extract all tables from a HTML file and export each one of them to a csv. The extraction is done for all the tables in the page and the csv export can be customized for filenames & destination.

Usage

Prerequisites

Before using this script, ensure you have the following:

Python installed on your system.
Required libraries: BeautifulSoup,csv,Path,requests,datetime,argparse.

Running the Script

Place the PDF file you want to extract images from in the same directory as this script.
Replace the input_file variable with the name of your PDF file.

python3 detect_html_tables_to_csv/detect_html_tables_to_csv.py --prefix exp_table

Output

The script will generate the csv files in the python-helper-modules/detect_html_tables_to_csv/tables/ directory.

License

This script is released under the MIT License. Feel free to use, modify, and distribute it as needed.

Name		Name	Last commit message	Last commit date
parent directory ..
assets		assets
tables		tables
README.md		README.md
__init__.py		__init__.py
detect_html_tables_to_csv.py		detect_html_tables_to_csv.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

HTML Table Scraper

Introduction

Usage

Prerequisites

Running the Script

Output

License

FilesExpand file tree

detect_html_tables_to_csv

Directory actions

More options

Directory actions

More options

Latest commit

History

detect_html_tables_to_csv

Folders and files

parent directory

README.md

HTML Table Scraper

Introduction

Usage

Prerequisites

Running the Script

Output

License