Merge pull request DhanushNehru#289 from Charul00/update-readme

hasan-py · web-flow · commit d373a96d25eb · 2024-10-04T23:58:43.000+06:00
Added Web Scraper Script
diff --git a/README.md b/README.md
@@ -122,6 +122,8 @@ More information on contributing and the general code of conduct for discussion
 | Weather GUI                          | [Weather GUI](https://github.com/DhanushNehru/Python-Scripts/tree/master/Weather%20GUI)                                                       | Displays information on the weather.                                                                                |
 | Website Blocker                      | [Website Blocker](https://github.com/DhanushNehru/Python-Scripts/tree/master/Website%20Blocker)                                               | Downloads the website and loads it on your homepage in your local IP.                                               |
 | Website Cloner                       | [Website Cloner](https://github.com/DhanushNehru/Python-Scripts/tree/master/Website%20Cloner)                                                 | Clones any website and opens the site in your local IP.                                                             |
+| Web Scraper                         | [Web Scraper](https://github.com/Charul00/Python-Scripts/tree/main/Web%20Scraper)                     | A Python script that scrapes blog titles from Python.org and saves them to a file. |
+
 | Weight Converter                      | [Weight Converter](https://github.com/WatashiwaSid/Python-Scripts/tree/master/Weight%20Converter)                                             | Simple GUI script to convert weight in different measurement units.                                                 |
 | Wikipedia Data Extractor             | [Wikipedia Data Extractor](https://github.com/DhanushNehru/Python-Scripts/tree/master/Wikipedia%20Data%20Extractor)                           | A simple Wikipedia data extractor script to get output in your IDE.                                                 |
 | Word to PDF                          | [Word to PDF](https://github.com/DhanushNehru/Python-Scripts/tree/master/Word%20to%20PDF%20converter)                                         | A Python script to convert an MS Word file to a PDF file.                                                            |
diff --git a/Web Scraper/README.md b/Web Scraper/README.md
@@ -0,0 +1,8 @@
+In this script, we use the `requests` library to send a GET request to the Python.org blogs page. We then use the `BeautifulSoup` library to parse the HTML content of the page.
+
+We find all the blog titles on the page by searching for `h2` elements with the class `blog-title`. We then print each title found and save them to a file named `blog_titles.txt`.
+
+To run this script, first install the required libraries:
+
+```bash
+pip install requests beautifulsoup4
diff --git a/Web Scraper/Web_Scraper.py b/Web Scraper/Web_Scraper.py
@@ -0,0 +1,30 @@
+import requests
+from bs4 import BeautifulSoup
+
+# URL to scrape data from
+URL = "https://www.python.org/blogs/"
+
+# Send a GET request to the URL
+response = requests.get(URL)
+
+# Parse the webpage content using BeautifulSoup
+soup = BeautifulSoup(response.content, "html.parser")
+
+# Find all the blog titles on the page
+titles = soup.find_all('h2', class_='blog-title')
+
+# Print each title found
+print("Python.org Blog Titles:\n")
+for i, title in enumerate(titles, start=1):
+    print(f"{i}. {title.get_text(strip=True)}")
+
+# Save the titles to a file
+with open("blog_titles.txt", "w") as file:
+    for title in titles:
+        file.write(title.get_text(strip=True) + "\n")
+
+print("\nBlog titles saved to 'blog_titles.txt'.")
+     
+   
+     
+