Trends in Data Science

The main goal of this project is to monitor the trends in the UK data science job market. Users can view all trends at this Shiny App https://apps.statcore.co.uk/trends-in-data-science/. For a write-up see this Medium article Python vs R: How to Analyse 4000 Job Advertisements Using Shiny & Machine Learning

I originally started this project in 2018 to help me decide whether to learn Python or not. I now use it as motivation to keep learning Python!

-- Project Status: Completed

Methods Used

Web Scraping
Data Visualization
Topic Modelling (LDA)
Web Application Development & Hosting
Task Scheduling

Technologies

R
Shiny
Selenium
Docker
Linux
Azure

Project Description

The data source for this project is the jobserve website. On a schedule (daily) we perform the following

Scrape all 'Data Scientist' jobs from jobserve
Pre-process data, produce visualisations and build topic models on the job description
Present output using an interactive web application

The three distinct tasks each have their own folder

Scraping
Analyse
Shiny

Each task has its own docker image, and is launched on a schedule using cron.

For the Shiny App we use Nginx as a reverse proxy and to encrypt all traffic using SSL. The Nginx folder contains the required config file.

Lastly there are a number of helper shell scripts in the root directory which automate some of the repetitive tasks (docker run, docker compose up etc).

Getting Started

Follow the setup instructions

Contact

tony@statcore.co.uk

Name		Name	Last commit message	Last commit date
Latest commit History 294 Commits
Analyse		Analyse
Docker		Docker
Nginx		Nginx
Scraping		Scraping
Shiny		Shiny
.gitignore		.gitignore
README.md		README.md
Setup.md		Setup.md
docker-compose-scraping-interactive.yml		docker-compose-scraping-interactive.yml
docker-compose-scraping.yml		docker-compose-scraping.yml
docker-compose-shiny-nginx.yml		docker-compose-shiny-nginx.yml
launch-analyse-interactive.sh		launch-analyse-interactive.sh
launch-analyse.sh		launch-analyse.sh
launch-scraping-interactive.sh		launch-scraping-interactive.sh
launch-scraping.sh		launch-scraping.sh
launch-shiny.sh		launch-shiny.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Trends in Data Science

-- Project Status: Completed

Methods Used

Technologies

Project Description

Getting Started

Contact

About

Releases

Packages

Languages

tonyjward/trends-in-data-science

Folders and files

Latest commit

History

Repository files navigation

Trends in Data Science

-- Project Status: Completed

Methods Used

Technologies

Project Description

Getting Started

Contact

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages