DATA 520 - Data Mining and Multivariate Statistics
An exploration of techniques used for very large data sets, such as cluster analysis, principal components/factor analysis, and discriminant analysis/regression, with an emphasis on statistical reasoning that connects mathematical structures from matrix algebra to computational issues such as computational complexity and distributed computing.