Part 1 - Introduction to Machine Learning
Introduction, linear regression
Python libraries: pandas, numpy, visualization libraries
Evaluation, training and test sets, ROC curves
Classification
Clustering, PCA
Part 2 - Data Science and Statistics
Density estimation, MLE, Bayes classification
Statistics for scientists: correlations, p-values, and multiple testing
Advanced statistical methods: non-parametric tests,
A mini project in analyzing high throughput data
Part 3 - Class Workshops
NGS Data Analysis
End-to-end data analysis and model building.
Part 4 - Presentation of the mini project results