Jobtimize
is a python package which collects, standardizes and completes information about job offers published on job search platforms. The package is mainly based on scraping and text classification to fill in missing data.
Release | Usage | Development |
---|---|---|
What’s new in the current version:
Dependencies
The data collected is mainly, if not entirely, obtained through the scraping of websites.
Missing information is filled in from the complete data using a word-processing algorithm (coming soon).
Depending on the installation method, the following packages are required and will also be downloaded and installed. Otherwise, they will have to be downloaded manually.
beautifulsoup4
jsonschema
lxml
pandas
requests
Installation
Conda
It is also possible to get the latest stable version with Anaconda Cloud
Git
The installation with git allows to have the latest version. However it can have some bugs.
How to use ?
As Jobtimize
is a package, in python you just have to import it. The main function (for now) is Jobtimize.jobscrap
.
from Jobtimize import jobscrap
df = jobscrap(["Data Scientist", "Data Analyst"],
["UK", "FR"],
prox = False
)
df.head()
The df
object is a dataframe pandas, so it inherits all its methods.
Contributing 🤝
🎊 Firstly, thank you for giving your time to contribute to Jobtimize
. 🎊
Contributions can only be made on the Github repository.
If you have a new feature to submit, don’t hesitate to open an issue (By checking “new feature” to make it easier to read) We can discuss it freely there.
Then you can make a “pull request” as explained in the contribution guidelines.
Same for all contributions, code improvement, documentation writing, translations… all ideas are welcome! Check out the guidelines to make it easier.
Jobtimize
gets better with contributions.