Comprehensive guide to data mining, machine learning, and analysis of massive datasets, including techniques for similarity search, data-stream processing, and graph analysis.
Technical TutorialsData Science
Introduction
Mining of Massive Datasets by Jure Leskovec, Anand Rajaraman, Jeffrey D. Ullman is a comprehensive guide to data mining, machine learning, and analysis of massive datasets.
Highlights
Covers distributed file systems and map-reduce for parallel algorithms
Includes techniques for similarity search, data-stream processing, and search engine technology
Discusses frequent-itemset mining, clustering algorithms, and graph analysis
Covers dimensionality reduction and machine learning algorithms for large data
Recommendation
This book is recommended for students and professionals interested in data mining, machine learning, and analysis of large-scale datasets, especially those related to the web and social networks.