Web Scraping Project

A Python-based web scraping tool designed to extract product data from e-commerce websites (like Amazon) and static pages. This project demonstrates how to handle dynamic content, bypass bot detection, and save data into structured formats.

Project Structure

selenium.py (or spy.py): The main script using **Selenium WebDriver to scrape dynamic websites (e.g., Amazon.in). It includes "Stealth Mode" features to bypass bot detection.
beautifulsoup.py: A script using **BeautifulSoup for scraping static HTML pages (lighter and faster).
visualization.py: (Optional) Script to visualize the scraped data (e.g., price comparisons).
database.py: Handles connections to SQLite for storing data.

Features

Dynamic Scraping: Uses Selenium to render JavaScript and interact with pages.
Stealth Mode: Includes custom User-Agents and disables automation flags to avoid "503 Service Unavailable" errors.
Data Export: Automatically saves scraped data to:
- CSV (products_output.csv)
- Excel (products_output.xlsx)
- SQLite Database (products_db.sqlite)
Error Handling: Robust checks for missing elements or empty data.

Technologies Used

Python 3.x
Selenium: For browser automation.
BeautifulSoup4: For parsing HTML.
Pandas: For data manipulation and saving files.
OpenPyXL: For writing Excel files.
SQLite3: For database storage.
WebDriver Manager: For automatic Chrome driver management.

Installation

Clone the repository:

git clone [https://github.com/Dharineesh007/Web_scrapping_python/-.git](https://github.com/Dharineesh007/Web_scrapping_python/-.git)
cd web-scrapping-

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
beautifulsoup.py		beautifulsoup.py
selenium.py		selenium.py
visualisation.py		visualisation.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Web Scraping Project

Project Structure

Features

Technologies Used

Installation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Web Scraping Project

Project Structure

Features

Technologies Used

Installation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages