An exploration of techniques used to find patterns in very large data sets, with an emphasis on the statistical structure of the approaches and practical uses of key tools. Recommended: Completion of, or concurrent enrollment in, PDAT 611G - Big Data Management.