Website Cloner

This project is a Node.js script that clones a website by downloading its HTML, CSS, JavaScript, and image resources. It's designed to create a local copy of a website for offline viewing or analysis.

Features

Downloads and saves the main HTML content of a specified URL
Retrieves and stores all linked CSS stylesheets
Captures and saves all referenced JavaScript files
Downloads all images found on the page
Maintains the original structure of resources in local directories
Generates unique filenames to avoid conflicts

Prerequisites

Before you begin, ensure you have met the following requirements:

Node.js installed on your local machine
npm (Node Package Manager) to install dependencies

Installation

Clone this repository to your local machine:

git clone https://github.com/yourusername/website-cloner.git

Navigate to the project directory:
```
cd website-cloner
```
Install the required dependencies:
```
npm install
```

Usage

Open the main.js file and modify the URL in the last line to the website you want to clone:
```
downloadWebpage('https://example.com')
```
Run the script:
```
node main.js
```
The cloned website will be saved in the websites directory, organized by domain name.

Project Structure

main.js: The main script that handles the website cloning process
websites/: Directory where cloned websites are stored
- [domain]/: Subdirectory for each cloned website (e.g., example.com/)
  - index.html: The main HTML file of the cloned website
  - css/: Directory containing downloaded CSS files
  - js/: Directory containing downloaded JavaScript files
  - img/: Directory containing downloaded image files

Dependencies

axios: Promise-based HTTP client for making requests
cheerio: Fast, flexible & lean implementation of core jQuery for parsing HTML

Limitations

This script captures the static content of a website. Dynamic content loaded via JavaScript may not be fully captured.
It does not follow links to other pages within the website.
Some websites may have measures in place to prevent scraping, which could affect the cloning process.

Contributing

Contributions to improve the Website Cloner are welcome. Please follow these steps:

Fork the repository
Create a new branch (git checkout -b feature/amazing-feature)
Make your changes
Commit your changes (git commit -m 'Add some amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

License

This project is open source and available under the MIT License.

Acknowledgments

This project uses Axios and Cheerio, which are fantastic libraries for HTTP requests and HTML parsing, respectively.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
websites		websites
.editorconfig		.editorconfig
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.js		main.js
package-lock.json		package-lock.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Website Cloner

Features

Prerequisites

Installation

Usage

Project Structure

Dependencies

Limitations

Contributing

License

Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Website Cloner

Features

Prerequisites

Installation

Usage

Project Structure

Dependencies

Limitations

Contributing

License

Acknowledgments

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages