Skip to content

telescopeuser/S-SB-Workshop

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

43 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Forked from swirl courses

NICF- Statistics Bootcamp Workshops & References

This Statistics Bootcamp (SB) course is part of the Analytics and Intelligent Systems and Graduate Certificate in Business Analytics Practice series offered by NUS-ISS.

Co-Lecturer: GU Zhan (Sam)

zhan.gu@nus.edu.sg

Interactive R Workshops/Courses

This is a collection of interactive courses for use with the swirl R package. You'll find instructions for installing courses further down on this page. Some courses are still in development and we'd love to hear any suggestions you have as you work through them.

For more information regarding swirl, visit swirlstats.com or the swirl GitHub repository. If you'd like to write your own interactive content, please visit the Instructors page of our website.

Here are our current offerings, organized by level of difficulty:

Fundamental

  • R Programming: The basics of programming in R
  • Exploratory Data Analysis: Statistics and Data visualization in R

Intermediate

  • Open Intro: A review to statistics, data analysis, and data visualization Additional workshop
  • Mathematical Biostatistics Boot Camp: One- and two-sample t-tests, power, and sample size Additional workshop
  • Regression Models: The basics of regression modeling in R

Advanced

  • Statistical Inference: This intermediate to advanced level course closely follows the Statistical Inference course of the Johns Hopkins Data Science Specialization on Coursera. It introduces the student to basic concepts of statistical inference including probability, hypothesis testing, confidence intervals and p-values. It concludes with an initiation to topics of particular relevance to big data, issues of multiple testing and resampling.
  • Getting and Cleaning Data: dplyr, tidyr, lubridate, hardcore time consuming data pre-processing Additional workshop

Since our users come from a variety backgrounds, it's very hard to label material as Fundamental, Intermediate, or Advanced. If you find something that is labelled Fundamental to be challenging, please don't be discouraged. The first step of learning anything is to acknowledge that you are capable of understanding it. True understanding will come with time and practice.

Workshop Video Guides

  • Install R workshops

video


  • [Fundamental] R Programming

R Programming 1: Basic Building Blocks

video

R Programming 2: Workspace and Files

video

R Programming 3: Sequences of Numbers

video

R Programming 4: Vectors

video

R Programming 5: Missing Values

video

R Programming 6: Subsetting Vectors

video

R Programming 7: Matrices and Data Frames

video

R Programming 8: Logic

video

R Programming 9: Functions

Video To Be Announced

R Programming 10: lapply and sapply

video

R Programming 11: vapply and tapply

Video To Be Announced

R Programming 12: Looking at Data

video

R Programming 13: Simulation

video

R Programming 14: Dates and Times

Video To Be Announced

R Programming 15: Base Graphics

video


  • [Fundamental] Exploratory Data Analysis

Exploratory Data Analysis 1: Principles of Analytic Graphs

Video To Be Announced

Exploratory Data Analysis 2: Exploratory Graphs

video

Exploratory Data Analysis 3: Graphics Devices in R [PDF PNG SVG]

Video To Be Announced

Exploratory Data Analysis 4: Plotting Systems

video

Exploratory Data Analysis 5: Base Plotting System

video

Exploratory Data Analysis 6: Lattice Plotting System

Video To Be Announced

Exploratory Data Analysis 7: Working with Colors

Video To Be Announced

Exploratory Data Analysis 8: GGPlot2 Part1 [qplot]

video

Exploratory Data Analysis 9: GGPlot2 Part2 [ggplot]

video

Exploratory Data Analysis 10: GGPlot2 Extras [qplot]

Video To Be Announced

Exploratory Data Analysis 11: Hierarchical Clustering [Machine Learning]

Video To Be Announced

Exploratory Data Analysis 12: K Means Clustering [Machine Learning]

Video To Be Announced

Exploratory Data Analysis 13: Dimension Reduction [Machine Learning]

Video To Be Announced

Exploratory Data Analysis 14: Clustering Example [Machine Learning]

Video To Be Announced

Exploratory Data Analysis 15: CaseStudy [pm2.5 air pollution]

Video To Be Announced


  • [Intermediate] Open Intro

Open Intro 1: Overview of Statistics


  • [Intermediate] Mathematical Biostatistics Boot Camp

Mathematical Biostatistics Boot Camp 1: One Sample t-test [nitty-gritty]

Mathematical Biostatistics Boot Camp 2: Two Sample t-test [nitty-gritty]

Mathematical Biostatistics Boot Camp 3: Errors Power and Sample Size


  • [Intermediate] Regression Models

Regression Models 1: Introduction [regression to mean]

Regression Models 2: Residuals [var(data)=var(estmt)+var(resdls)]

Regression Models 3: Least Squares Estimation

Regression Models 4: Residual Variation [R^2=1-sRes/sTot=cor()^2]

Regression Models 5: Introduction to Multivariable Regression

Regression Models 6: MultiVar Examples

Regression Models 7: MultiVar Examples2

Regression Models 8: MultiVar Examples3 [MultipleLinearRegression]

Regression Models 9: Residuals Diagnostics and Variation

Regression Models 10: Variance Inflation Factors [VIF]

Regression Models 11: Overfitting and Underfitting [ANOVA, F-test]

Regression Models 12: Binary Outcomes

Regression Models 13: Count Outcomes


  • [Advanced] Statistical Inference

Statistical Inference 1: Introduction

video

Statistical Inference 2: Probability1 [dice, playing cards]

Video To Be Announced

Statistical Inference 3: Probability2 [PMF PDF CDF]

Video To Be Announced

Statistical Inference 4: ConditionalProbability [medic test]

Video To Be Announced

Statistical Inference 5: Expectations [E(X), CLT]

video

Statistical Inference 6: Variance [Var E(X^2)-(E(X))^2]

Video To Be Announced

Statistical Inference 7: CommonDistros [Bernoulli Normal Poisson]

Video To Be Announced

Statistical Inference 8: Asymptotics [central limit theorem, CI]

Video To Be Announced

Statistical Inference 9: T Confidence Intervals [nitty-gritty]

Video To Be Announced

Statistical Inference 10: Hypothesis Testing [t-test, z score]

video

Statistical Inference 11: P Values [nitty-gritty]

video

Statistical Inference 12: Power [false negative, Type II error]

Video To Be Announced

Statistical Inference 13: Multiple Testing [confusion matrix]

Video To Be Announced

Statistical Inference 14: Resampling [bootstrap]

Video To Be Announced


  • [Advanced] Getting and Cleaning Data

Getting and Cleaning Data 1: Manipulating Data with dplyr

Getting and Cleaning Data 2: Grouping and Chaining with dplyr

Getting and Cleaning Data 3: Tidying Data with tidyr

Getting and Cleaning Data 4: Dates and Times with lubridate


Install and run a workshop/course automatically

It automates the process by allowing you to do everything right from the R console.

  1. Make sure you have a recent version version of swirl:
install.packages("swirl")
  1. Enter the following from the R console to install:
library("swirl")

# Fundamental
install_course("R Programming E") # or: install_course("R Programming")
install_course("Exploratory Data Analysis")

# Intermediate
install_course("Regression Models")
install_course("The R Programming Environment") # Optional

# Advanced
install_course("Statistical Inference")
install_course("Getting and Cleaning Data")
install_course("Advanced R Programming") # Optional

swirl()

For example, install_course("R Programming") will install the R Programming course. Please note that course names are case sensitive!

Install and run additional workshop/course manually

If the automatic course installation method outlined above does not work for you, then there's a simple alternative.

  1. Click here to download the file S-SB-Workshop-master.zip.

  2. Install all all available courses:

library("swirl")
install_course_zip("~/Downloads/S-SB-Workshop-master.zip", multi=TRUE)
swirl()

Or, to install one course at a time, enter the following from the R console, substituting the correct file path to your downloaded file and the name of your desired course:

install_course_zip("path/to/file/here/S-SB-Workshop-master.zip", multi=TRUE, 
                   which_course="Course Name Here")

For example, if you download the zip file to ~/Downloads/S-SB-Workshop-master.zip, then the following command will install the R Programming course.

install_course_zip("~/Downloads/S-SB-Workshop-master.zip", multi=TRUE, which_course="R Programming")

Please note that course names are case sensitive!

Uninstall a course

If you'd like to remove a course at any time, you can use uninstall_course("Course Name Here"). If you'd like to remove all courses at any time, you can use uninstall_all_courses(force = FALSE).

Delete a user's progress

If you'd like to delete a user's workshop progress, you can use delete_progress("User Name Here").

Using swirl in the classroom

Instructors around the world are using swirl in their classrooms. We think this is awesome. If you're an instructor, please feel free to do the same -- free of charge. While your students may be paying to take your course or attend your institution, we simply ask that you don't charge people directly for the use of our software or instructional content.

If you are not sure about a particular use case, don't hesitate to send us an email at info@swirlstats.com.


Data Science Wars: R versus Python:

https://www.datasciencecentral.com/profiles/blogs/data-science-wars-r-versus-python


Data Science Road Map created by Swami Chandrasekaran:

http://nirvacana.com/thoughts/wp-content/uploads/2018/01/RoadToDataScientist1.png

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published