2011-2012 University Catalog 
  
2011-2012 University Catalog

CS 757 - Mining Massive Datasets

Credits: 3 (NR)
Covers the techniques to mine large datasets, including Distributed File Systems and Map-Reduce, similarity search, and data stream processing.  Covers classic problems in data mining, such as clustering, association rule mining, and others from the point of view of scalability.  Includes a final project to exercise concepts covered in class.

Prerequisite(s): CS 750 or equivalent.

Hours of Lecture or Seminar per week: 2.5
When Offered: Fall