A Python script that converts PDF files to CSV format using the Tabula library.
- Python 3
- Tabula library
- Pandas library
- Clone this repository:
- Install the Tabula and Pandas libraries:
- pip install tabula-py pandas
- Put your PDF files in the `pdf` directory.
- Run the script with the following command:
- python convert.py
- The CSV files will be generated in the `csv` directory.
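
The steps above could be sketched roughly as follows. This is not the actual `convert.py`: the helper names `csv_path_for` and `convert_all` are hypothetical, and the sketch assumes the script relies on `tabula.convert_into` from the tabula-py package, which needs a Java runtime to run.

```python
from pathlib import Path


def csv_path_for(pdf_path, csv_dir="csv"):
    """Map e.g. pdf/report.pdf to csv/report.csv (hypothetical helper)."""
    return Path(csv_dir) / (Path(pdf_path).stem + ".csv")


def convert_all(pdf_dir="pdf", csv_dir="csv", pages="all"):
    """Convert every PDF in pdf_dir to a CSV file in csv_dir."""
    # Imported here so the path helper works even without tabula installed.
    import tabula

    Path(csv_dir).mkdir(parents=True, exist_ok=True)
    for pdf_path in sorted(Path(pdf_dir).glob("*.pdf")):
        out_path = csv_path_for(pdf_path, csv_dir)
        # convert_into writes the tables found on the given pages to out_path.
        tabula.convert_into(str(pdf_path), str(out_path),
                            output_format="csv", pages=pages)
```

Calling `convert_all()` with the defaults would read from `pdf` and write to `csv`, matching the directory layout described above.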
- `-f` or `--file`: specify a single file to convert (e.g. `--file example.pdf`).
- `-p` or `--pages`: specify which pages to convert (e.g. `--pages 1-3,5`).
- `-s` or `--stream`: stream output to stdout instead of writing to file.
- `-d` or `--debug`: enable debug mode.
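
The options above could be wired up with `argparse` along these lines. This is a minimal sketch and not necessarily how `convert.py` does it: the function name `build_parser` and the `"all"` default for pages are assumptions.

```python
import argparse


def build_parser():
    """Build a parser for the four documented options (hypothetical sketch)."""
    parser = argparse.ArgumentParser(description="Convert PDF files to CSV.")
    parser.add_argument("-f", "--file",
                        help="single PDF file to convert, e.g. example.pdf")
    # The "all" default is an assumption; the README does not state one.
    parser.add_argument("-p", "--pages", default="all",
                        help="pages to convert, e.g. 1-3,5")
    parser.add_argument("-s", "--stream", action="store_true",
                        help="stream output to stdout instead of writing to file")
    parser.add_argument("-d", "--debug", action="store_true",
                        help="enable debug mode")
    return parser


args = build_parser().parse_args(["--file", "example.pdf", "--pages", "1-3,5"])
print(args.file, args.pages, args.stream, args.debug)
# prints: example.pdf 1-3,5 False False
```

The page specification is kept as a plain string here, since tabula-py accepts ranges such as `"1-3,5"` directly in its `pages` argument.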
Contributions are welcome! If you would like to contribute to this project, please follow these steps:
- Fork this repository.
- Create a new branch: `git checkout -b feature-branch`
- Make your changes and commit them: `git commit -m "Add some feature"`
- Push to the branch: `git push origin feature-branch`
- Submit a pull request.
If you have any questions or comments, please contact me at [email protected]