Skip to content

Latest commit

 

History

History
56 lines (39 loc) · 1.56 KB

README.md

File metadata and controls

56 lines (39 loc) · 1.56 KB

Project Title: Price-Extractor

Description:

This project contains a Python script that extracts text content from web pages specified by a list of URLs. It utilizes BeautifulSoup and requests libraries for web scraping and parsing HTML content.

Features:

Extracts text from web pages based on specified HTML elements and class attributes. Simple and straightforward script for basic web scraping tasks. Easily customizable to handle different web page structures.

Getting Started

Prerequisites:

Python 3.x

Install required libraries:

requests, beautifulsoup4 Installation

Clone the repository:

  1. Clone the repository to your local machine:

    git clone https://github.com/SUDEEP-M-SHETTY/Price-Extractor.git
  2. Install dependencies:

    pip install -r requirements.txt
  3. Usage: Edit the url1 list in the script to include URLs you want to scrape.

  4. Run the script:

    python extract_price.py

Contributing

  1. Fork the repository.
  2. Create a new branch (git checkout -b feature/improvement).
  3. Make your changes.
  4. Commit your changes (git commit -am 'Add new feature').
  5. Push to the branch (git push origin feature/improvement).
  6. Create a new Pull Request.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

Inspired by online tutorials and community forums on web scraping with Python. Thanks to BeautifulSoup and requests library developers for their contributions.

© 2024 SUDEEP M SHETTY