-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
2 changed files
with
72 additions
and
59 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,93 +1,106 @@ | ||
# id-jobs: Your One-Stop Shop for Indonesian Job Market Data | ||
# id-jobs: Your Ultimate Explosion of Indonesian Job Market Data! ๐ฅ๐งโโ๏ธ | ||
|
||
[![Daily update to Google Sheets](https://github.com/ceroberoz/id-jobs/actions/workflows/scrape.yml/badge.svg)](https://github.com/ceroberoz/id-jobs/actions/workflows/scrape.yml) | ||
[![Daily Explosion of Job Data](https://github.com/ceroberoz/id-jobs/actions/workflows/scrape.yml/badge.svg)](https://github.com/ceroberoz/id-jobs/actions/workflows/scrape.yml) | ||
[![License: GPL v3](https://img.shields.io/badge/License-GPLv3-blue.svg)](https://www.gnu.org/licenses/gpl-3.0) | ||
[![Python 3.12+](https://img.shields.io/badge/python-3.12+-blue.svg)](https://www.python.org/downloads/) | ||
![Made with Scrapy](https://img.shields.io/badge/Made%20with-Scrapy-green.svg) | ||
![Made with Playwright](https://img.shields.io/badge/Made%20with-Playwright-orange.svg) | ||
![Powered by Scrapy](https://img.shields.io/badge/Powered%20by-Scrapy-green.svg) | ||
![Enhanced by Playwright](https://img.shields.io/badge/Enhanced%20by-Playwright-orange.svg) | ||
|
||
## What's New? | ||
## ๐ What's New in Our Latest Explosion? ๐ | ||
|
||
- **Work Arrangement Feature**: Track whether jobs are Remote, Hybrid, or On-site. | ||
- **Job Apply End Date**: Automatically calculate application deadlines (30 days from last seen date). | ||
- **Enhanced Data Cleaning**: Improved pre-upload data sanitization process. | ||
- **Job Age Feature**: Track the age of job listings to identify the most recent opportunities. | ||
- **Expanded Job Portal Coverage**: Added new job portals including Blibli, Dealls, Evermos, GoTo, and Vidio. | ||
- **Flip Jobs Integration**: Unleashed the power to scrape job listings from Flip's career portal! | ||
- **Karir.com API Mastery**: Harnessed the arcane energies of Karir.com's API for maximum job data extraction! | ||
- **Explosive Pagination**: Implemented a pagination system that blasts through ALL available job opportunities! | ||
- **Work Arrangement Tracking**: Identify Remote, Hybrid, and On-site opportunities with pinpoint accuracy! | ||
- **Job Apply End Date Calculation**: Automatically determine application deadlines with magical precision! | ||
- **Enhanced Data Sanitization**: Improved our pre-upload cleansing rituals for purer, more potent data! | ||
- **Job Age Tracking**: Monitor the age of job listings to pinpoint the freshest opportunities! | ||
|
||
## Overview | ||
## ๐ Overview | ||
|
||
id-jobs uses advanced web scraping techniques to gather job listings from various Indonesian job portals and company websites, respecting each website's terms of service. | ||
id-jobs harnesses the explosive power of web scraping to gather job listings from a vast array of Indonesian job portals and company websites, always respecting each site's terms of service. It's like casting a wide-area Explosion spell on the job market! | ||
|
||
๐ **View Job Data:** [https://s.id/id-jobs-v2](https://s.id/id-jobs-v2) | ||
๐ **Witness the Explosion of Job Data:** [https://s.id/id-jobs-v2](https://s.id/id-jobs-v2) | ||
|
||
๐ฎ๐ฉ **Note:** id-jobs is specifically designed for the Indonesian job market. | ||
๐ฎ๐ฉ **Note:** id-jobs is specifically enchanted for the Indonesian job market. | ||
|
||
## Job Age Color Guidelines | ||
## ๐ฅ Job Age Color Codex | ||
|
||
To help users quickly identify the freshness of job listings, we use a color-coded system based on the job's age: | ||
Quickly identify the freshness of job listings with our color-coded system, inspired by the varying intensities of magical explosions: | ||
|
||
| Job Age Category | Time Range | Color | Description | | ||
|------------------|------------|-------|-------------| | ||
| New | <= 1 day | ![#B3E6B3](https://via.placeholder.com/15/B3E6B3/000000?text=+) Bright Light Green | Most recent and attractive opportunities | | ||
| Hot | 1 to 7 days | ![#FFCC66](https://via.placeholder.com/15/FFCC66/000000?text=+) Warm Light Orange | Very recent and appealing listings | | ||
| Recent | 8 to 15 days | ![#99CCFF](https://via.placeholder.com/15/99CCFF/000000?text=+) Light Blue | Still fresh and noteworthy opportunities | | ||
| Aging | 16 to 21 days | ![#F2F2F2](https://via.placeholder.com/15/F2F2F2/000000?text=+) Very Light Gray | Older listings, less priority | | ||
| Old | 22 to 30 days | ![#E6E6E6](https://via.placeholder.com/15/E6E6E6/000000?text=+) Light Gray | Significantly older listings, low priority | | ||
| Expired | > 30 days | ![#D9D9D9](https://via.placeholder.com/15/D9D9D9/000000?text=+) Medium Gray | Outdated listings, likely no longer active | | ||
| New | <= 1 day | ![#B3E6B3](https://via.placeholder.com/15/B3E6B3/000000?text=+) Bright Light Green | Fresh as a newly cast spell! | | ||
| Hot | 1 to 7 days | ![#FFCC66](https://via.placeholder.com/15/FFCC66/000000?text=+) Warm Light Orange | Still sizzling with opportunity! | | ||
| Recent | 8 to 15 days | ![#99CCFF](https://via.placeholder.com/15/99CCFF/000000?text=+) Light Blue | The magic lingers... | | ||
| Aging | 16 to 21 days | ![#F2F2F2](https://via.placeholder.com/15/F2F2F2/000000?text=+) Very Light Gray | The spell's power wanes... | | ||
| Old | 22 to 30 days | ![#E6E6E6](https://via.placeholder.com/15/E6E6E6/000000?text=+) Light Gray | Ancient arcana, approach with caution. | | ||
| Expired | > 30 days | ![#D9D9D9](https://via.placeholder.com/15/D9D9D9/000000?text=+) Medium Gray | The magic has dissipated. | | ||
|
||
This color scheme is designed to guide users towards the most recent job opportunities while de-emphasizing older listings. | ||
## ๐ฅ How It Works | ||
|
||
## How It Works | ||
id-jobs automatically casts its net wide, visiting Indonesian job websites with the precision of a perfectly aimed Explosion spell. It collects relevant information and organizes it into a single, powerful spreadsheet. The data undergoes rigorous magical cleansing and formatting before being uploaded, ensuring consistency and readability worthy of the finest spell books. | ||
|
||
id-jobs automatically visits Indonesian job websites, collects relevant information, and organizes it in a single spreadsheet. The data is cleaned and formatted before being uploaded to ensure consistency and readability. | ||
![The Explosive Scraping Process](how-scraper-works.gif) | ||
|
||
![How Scraper Works](how-scraper-works.gif) | ||
## ๐ฎ Preview | ||
|
||
## Preview | ||
Behold, a glimpse into the arcane power of id-jobs data: | ||
|
||
Here's a preview of the id-jobs data in action: | ||
![id-jobs in Action](screen-capture-dev.png) | ||
|
||
![id-jobs Preview](screen-capture-dev.png) | ||
## ๐ Why Harness the Power of id-jobs? | ||
|
||
## Why Use id-jobs? | ||
Navigating the labyrinth of job opportunities in Indonesia can be as challenging as mastering Explosion magic. id-jobs simplifies this quest by consolidating information from multiple realms (websites) into one central grimoire (spreadsheet), providing additional insights such as work arrangements and application deadlines that even Megumin would approve of! | ||
|
||
Finding the right job in Indonesia can be challenging. id-jobs simplifies this process by consolidating information from multiple websites into one place, providing additional insights such as work arrangements and application deadlines. | ||
## ๐ Tomes of Knowledge (Data Sources) | ||
|
||
## Data Sources | ||
We gather our arcane knowledge from a wide range of sources, each represented by a powerful spider in our magical arsenal: | ||
|
||
We collect data from a wide range of sources, including: | ||
- Jobstreet | ||
- Glints | ||
- Kalibrr | ||
- TopKarir | ||
- Indeed | ||
- Blibli | ||
- Dealls | ||
- Evermos | ||
- GoTo | ||
- Vidio | ||
- Various company career pages | ||
- Blibli ๐ | ||
- Dealls ๐ค | ||
- Evermos ๐ | ||
- Flip ๐ณ (New!) | ||
- GoTo ๐ | ||
- Jobstreet ๐ผ | ||
- Kalibrr ๐ | ||
- Karir.com ๐ (New!) | ||
- SoftwareOne ๐ป | ||
- Tiket โ๏ธ | ||
- Various company career portals ๐ข | ||
|
||
## Features | ||
Each of these sources is a realm of opportunity, waiting to be explored by our job-seeking wizards. Our spiders weave through these portals, extracting valuable job data with the precision and power of a well-cast Explosion spell! | ||
|
||
- **Daily Updates**: Automated daily updates through CI/CD pipelines. | ||
- **Work Arrangement Tracking**: Identify Remote, Hybrid, and On-site opportunities. | ||
- **Application Deadline Estimation**: Calculated end dates for job applications. | ||
- **Optimized Data Collection**: Improved accuracy and coverage of job listings. | ||
- **User-Friendly Interface**: Access job data through a Google Sheets interface. | ||
- **Comprehensive Information**: Data from multiple job boards and company websites. | ||
- **Job Age Tracking**: Identify the most recent job listings. | ||
๐ฎ Note: Our collection of magical spiders is ever-growing, as we continuously enhance our ability to scry the Indonesian job market. Keep an eye out for new additions to our arcane arsenal! | ||
|
||
## Getting Started | ||
## โจ Magical Features | ||
|
||
For a quick guide on how to use id-jobs, refer to our [Quickstart Guide](QUICKSTART.md). | ||
- **Daily Explosions of Updates**: Automated daily updates through CI/CD pipelines that would make any archmage jealous. | ||
- **Work Arrangement Scrying**: Identify Remote, Hybrid, and On-site opportunities with crystal-clear clarity. | ||
- **Application Deadline Divination**: Calculated end dates for job applications, because timing is everything in both magic and job hunting. | ||
- **Optimized Data Collection Rituals**: Improved accuracy and coverage of job listings, leaving no stone unturned. | ||
- **User-Friendly Spell Interface**: Access job data through a Google Sheets interface so intuitive, even a novice wizard could use it. | ||
- **Comprehensive Information Gathering**: Data from multiple job boards and company websites, all in one place. | ||
- **Job Age Tracking**: Identify the freshest job listings with the precision of a finely tuned magical sensor. | ||
|
||
## FAQ | ||
## ๐งโโ๏ธ Getting Started on Your Magical Journey | ||
|
||
Have questions? Check out our [FAQ](FAQ.md) for answers to common queries. | ||
For a quick guide on how to harness the power of id-jobs, consult our [Quickstart Grimoire](QUICKSTART.md). | ||
|
||
## Legal | ||
## ๐ Frequently Asked Arcane Questions | ||
|
||
id-jobs is open source under the GPL-3.0 license. You're free to use, modify, and share the code, as long as you keep it open source too. | ||
Have questions about our magical processes? Check out our [FAQ Scroll](FAQ.md) for answers to common queries from fellow wizards and job seekers. | ||
|
||
We always respect website terms of service when collecting data. | ||
## ๐ Legal Incantations | ||
|
||
id-jobs is open source under the GPL-3.0 license. You're free to use, modify, and share the code, as long as you keep it open source too. Think of it as sharing the secrets of Explosion magic with the world! | ||
|
||
We always respect website terms of service when collecting data, because even the most powerful wizards need to follow the rules of the realms they visit. | ||
|
||
Now go forth and explode your job search with the power of id-jobs! ๐ฅ๐ | ||
|
||
--- | ||
|
||
Note: I let "Megumin AI" write this readme. | ||
|
||
![alt text](megumin-pose-megumin-konosuba.mp4) |
Binary file not shown.