This repository contains three web scrapers designed to fetch job listings from different career platforms: Greenhouse, Lever, and Workday. Each scraper is implemented in Python and uses libraries such as Requests, BeautifulSoup, Selenium, and Pandas to extract and save job data.
- Greenhouse Scraper: Fetches job listings from Greenhouse career pages.
- Lever Scraper: Fetches job listings from Lever career pages.
- Workday Scraper: Fetches job listings from Workday career pages.
- CSV Output: Saves job data into CSV files for each platform.
- Error Logging: Logs companies that encounter errors during scraping.
- Python 3.x
- Requests
- BeautifulSoup
- Selenium
- Pandas
- ChromeDriver
- Clone the repository: `git clone https://github.com/your-username/Web-Crawlers.git`
- Install dependencies: `pip install requests beautifulsoup4 selenium pandas`
- Download ChromeDriver: ensure ChromeDriver is installed and that its path is set correctly in the script (see the Selenium setup sketch after these steps). You can download ChromeDriver from the ChromeDriver website.
- Run the desired script:
  - Greenhouse scraper: `python greenhouse.py`
  - Lever scraper: `python lever.py`
  - Workday scraper: `python workday.py`
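Any script that drives a browser through Selenium needs a working ChromeDriver path before it will run. A minimal setup sketch, assuming Selenium 4; the path and URL are examples, not values from this repository:

```python
from selenium import webdriver
from selenium.webdriver.chrome.service import Service

# Example path only -- point this at your own ChromeDriver binary.
chrome_driver_path = "/usr/local/bin/chromedriver"

options = webdriver.ChromeOptions()
options.add_argument("--headless=new")  # scrape without opening a visible window

driver = webdriver.Chrome(service=Service(executable_path=chrome_driver_path),
                          options=options)
try:
    driver.get("https://www.example.com")  # placeholder URL
    print(driver.title)                    # quick check that the browser started
finally:
    driver.quit()
```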
Each script will:
- Read company names or URLs from the respective configuration files.
- Scrape job listings from the corresponding career pages.
- Save the job data into CSV files in the respective output directories.
- Log any companies that encountered errors during scraping.
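A minimal sketch of that end-to-end flow, assuming a plain-text `companies.txt` config file, a Greenhouse-style board URL, and illustrative CSS selectors (none of these names are taken from the actual scripts):

```python
import os
import requests
import pandas as pd
from bs4 import BeautifulSoup

def scrape_company(company, url):
    """Fetch one career page and pull out job rows (selectors are illustrative)."""
    soup = BeautifulSoup(requests.get(url, timeout=30).text, "html.parser")
    jobs = []
    for posting in soup.select("div.opening"):          # hypothetical posting container
        link = posting.select_one("a")
        location = posting.select_one("span.location")  # hypothetical selector
        jobs.append({
            "Job Title": link.get_text(strip=True),
            "Job Location": location.get_text(strip=True) if location else "",
            "Job Link": link["href"],
        })
    return jobs

os.makedirs("output", exist_ok=True)
failed = []
with open("companies.txt") as f:  # hypothetical config file, one company per line
    companies = [line.strip() for line in f if line.strip()]

for company in companies:
    try:
        jobs = scrape_company(company, f"https://boards.greenhouse.io/{company}")
        pd.DataFrame(jobs).to_csv(f"output/{company}_jobs.csv", index=False)
    except Exception as exc:
        failed.append(f"{company}: {exc}")

with open("output/errors.log", "w") as f:  # companies that failed, one per line
    f.write("\n".join(failed))
```

Collecting failures instead of raising lets one broken career page be logged and skipped rather than aborting the whole run, mirroring the error-logging behavior described above.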
Each script generates CSV files named `{company_name}_jobs.csv`. Each file contains the following columns:
- Job Title: the title of the job.
- Job Location: the location of the job.
- Job Link: the URL of the job listing.
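For instance, a generated file can be loaded back with Pandas for a quick sanity check (the file name here is hypothetical):

```python
import pandas as pd

df = pd.read_csv("acme_jobs.csv")  # hypothetical output file
print(df[["Job Title", "Job Location", "Job Link"]].head())
```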
- CSS Selectors: If job listings are not being scraped correctly, update the CSS selectors in css_selectors.json (an illustrative structure is shown below).
- ChromeDriver Path: Ensure the chrome_driver_path variable in the script points to the correct location of your ChromeDriver.
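The exact schema of css_selectors.json depends on the scripts, but a plausible layout maps each platform and field to a selector. The entries below are purely illustrative, not the file shipped with this repository:

```json
{
  "greenhouse": {
    "posting": "div.opening",
    "title": "a",
    "location": "span.location"
  },
  "lever": {
    "posting": "div.posting",
    "title": "h5[data-qa='posting-name']",
    "location": "span.location"
  }
}
```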