Mining Massive Data Sets
By Stanford
Certificate or Program credits available
This course focuses on Map Reduce as a tool for creating parallel algorithms that can process very large amounts of data.
Subject & Topics:
Engineering, Distributed File Systems, Hadoop, Map-Reduce, Pagerank, Topic-Sensitive Pagerank, Spam Detection, Hubs-And-Authorities, Similarity Search, Shingling, Minhashing, Random Hyperplanes, Locality-Sensitive Hashing, Analysis, Social Networks, Graphs, Association Rules, Dimensionality Reduction, UV, SVD, CUR Decompositions, Algorithms, Large-Scale Mining, Clustering, Nearest-Neighbor Search, Gradient Descent, Support-Vector Machines, Classification, Submodular Function, Optimization, Computer Science, Leskovec, CS 246.