Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Job Scrapper Service #18

Open
deld opened this issue Jan 25, 2019 · 8 comments
Open

Job Scrapper Service #18

deld opened this issue Jan 25, 2019 · 8 comments
Labels
enhancement New feature or request

Comments

@deld
Copy link
Contributor

deld commented Jan 25, 2019

I want to get your input on the best way to do this, but instead of making employers have to manually add jobs to yet another place, I wonder if can make it so that employers can just register a link to an existing board.

Fro example

When an organization is registered one of the fields would be:

jobBoard or jobBoardURL:

So for example if my job board was https://www.indeed.com/jobs?q=software&l=Baltimore%2C+MD

These would be on there:

screenshot 2019-01-25 17 01 01

All of these would be ignored, but if you saw a posting with a name and also #baltimoretech

then suck it into the data file:

processed: Junior Software Developer #baltimoretech
ignored: Junior Software Developer

Thoughts, is that very difficult to do....all job board are different so the scraper couldn't be generic....so in that case maybe just the job title and link (and not the description)?

what are your thoughts @no13000 ?

@deld
Copy link
Contributor Author

deld commented Jan 26, 2019

I'm also fine is this is a future feature and not a part of the version 1 MVP

@tonyling94
Copy link
Contributor

Will look into this across the next few days

@tonyling94
Copy link
Contributor

I think generic support will be hard to achieve. What I can think of is to either

  1. Give every organization a "jobBoardUrl" field, so it link to the outside when clicked on "View jobs" from organization listing page. In this case, it's easily achievable but the downside is the jobs won't appear in "Job listing page" in BaltimoreTech unless manually adding the data.
  2. Handpick a few supported job boards (indeed.com, glassdoor, etc..), then allow organizations to add URLs. Scrapper will identify the supported URLs and scrape the data then automatically create data files in the GitHub repository.

@deld
Copy link
Contributor Author

deld commented Jan 31, 2019

ok - lets table this for now, #2 could be an option down the road. What would be the effort in hours to do those two (glassdoor and indeed) @no13000 ?

@tonyling94
Copy link
Contributor

I assume it'll take somewhere between 10 - 12 hours for me to complete this

@tonyling94
Copy link
Contributor

@deld should we create a new repository for this service

@deld
Copy link
Contributor Author

deld commented Feb 1, 2019

This isn't part of the MVP, so we won't work this right now. Just getting ideas as we may address this later.

@MarlonCopeland
Copy link
Collaborator

https://developer.careerbuilder.com/docs/v3jobid looks like careerbuilder has an api, can see if other job boards have an api, maybe this is one route..

@deld deld added the enhancement New feature or request label Feb 21, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants