Audience | Computational Skills | Prerequisites | Duration |
---|---|---|---|
Biologists | Beginner/Intermediate | None | 12-day workshop (~78 hours of trainer-led time) |
This repository contains all course materials for the Fall 2018 version of HBC's In-depth NGS Data Analysis Course, a 12-day course run over 6 weeks.
These materials are developed for a trainer-led workshop, but they are also amenable to self-guided learning.
The course is aimed at bench biologists who are interested in learning about NGS-based genomic analysis. The topics covered in-depth during this course are analysis of RNA-Seq and ChIP-Seq data, with an optional Variant Calling session. The sessions will also include functional analysis downstream of sequence data processing. During this course, participants will gain skills in the areas of:
- UNIX and basic shell scripting
- high-performance compute clusters, and
- R for statistical analysis and data visualization.
At the end of this course, participants can expect to have the expertise to independently run data analysis for bulk RNA-seq, ChIP-seq and basic variant calling.
No prior programming experience or command-line training is required.
Click on the topics in the table below to get a detailed description of contents covered in each week of the course.
Topic | Duration |
---|---|
Introduction to bash, O2 and NGS Data Analysis | 2 Days - Week 1 |
RNA-Seq Part I: Differential Gene Expression Analysis | 2 Days - Week 2 |
RNA-Seq Part II: Differential Gene Expression Analysis | 2 Days - Week 3 |
RNA-Seq Part III: Isoform-level differential expression and other applications | 2 Days - Week 4 |
ChIP-seq | 2 Days - Week 5 |
Variant Calling, Version Control, Intermediate Shell | 2 Days - Week 6 |
NOTE: Additional materials are included in this repo but are not part of the main course.
Download and install the following programs:
NOTE: If you are going to be using a Mac laptop please use the instructions here to determine your OS X version. If you find that your Mac is running a version older than OS X 10.6, please email us.
- Filezilla
- Integrative Genomics Viewer (IGV). If you get an error when you try to open IGV, please download Java and try again.
- Rstudio (pick the appropriate file for your operating system)
- R (R-3.4.2 for Mac OS X 10.11 (El Capitan) and higher, R-3.3.3 for Mac OS X 10.9 (Mavericks) and R-3.1.3 for Mac OS X 10.6 (Snow Leopard), OS X 10.7 and OS X 10.8).
Mac OS specific download:
- Sublime Text or similar
Windows OS specific download:
- Git for Windows, we will be using the Git Bash tools from this suite.
- Notepad++ or similar
These materials have been developed by members of the teaching team at the Harvard Chan Bioinformatics Core (HBC). These are open access materials distributed under the terms of the Creative Commons Attribution license (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
- Some materials used in these lessons were derived from work that is Copyright © Data Carpentry (http://datacarpentry.org/). All Data Carpentry instructional material is made available under the Creative Commons Attribution license (CC BY 4.0).