Part 1 - Introduction to Machine Learning
Introduction, regression and classification problems
Linear regression
Evaluation, training and test sets, ROC curves
Decision Trees, Random Forests
Linear Classifiers, Support Vector Machines
Unsupervised Learning: Dimensionality reduction, Clustering
Python libraries: pandas, numpy, visualization libraries, sklearn
Part 2 - Data Science and Statistics
Density estimation, MLE, Bayes classification
Statistics for scientists: correlations, p-values, and multiple testing
Advanced statistical methods: non-parametric tests,
A mini project in analyzing high throughput data
Part 3 - Class Workshops
NGS pipelines and Data Analysis
Part 4 - Presentation of the mini project results