Website Copier

Overview

This project is a website copier that allows users to download and save the complete structure of a website, including HTML, CSS, JavaScript, and media files. The project is built using Selenium and other web scraping tools.

Features

Download full website structure (HTML, CSS, JavaScript, and assets)
Handle dynamic content loaded via JavaScript
Save pages locally with original structure
Multi-threaded downloading for efficiency

Requirements

Python 3.x
Selenium
BeautifulSoup
Requests
ChromeDriver or Edge WebDriver (based on your browser)

Installation

Clone the repository:

git clone https://github.com/nematovN/website_copier.git
cd website_copier

Install dependencies:
```
pip install -r requirements.txt
```
Download and set up the appropriate WebDriver for your browser (Chrome or Edge).

Usage

Run the script with the target website URL:

python copier.py --url "https://example.com"

The copied website will be saved in the output/ directory.

Configuration

You can modify the config.json file to customize:

User-Agent headers
Output directory
Exclusion rules

Notes

Make sure the website you are copying allows scraping (check robots.txt)
Avoid excessive requests to prevent being blocked
Do not use this tool for unauthorized or unethical purposes

License

MIT License

Author

NematovN

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Website Copier

Overview

Features

Requirements

Installation

Usage

Configuration

Notes

License

Author

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

nematovN/website_copier

Folders and files

Latest commit

History

Repository files navigation

Website Copier

Overview

Features

Requirements

Installation

Usage

Configuration

Notes

License

Author

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages