Comprehensive guide to data mining, machine learning, and analysis of massive datasets, including techniques for similarity search, data-stream processing, and graph analysis.
Technical TutorialsData Science
Introduction
Mining of Massive Datasets by Jure Leskovec, Anand Rajaraman, Jeffrey D. Ullman is a comprehensive guide to data mining, machine learning, and analysis of massive datasets.
Highlights
Covers distributed file systems and map-reduce for parallel algorithms
Includes techniques for similarity search, data-stream processing, and search engine technology
Discusses frequent-itemset mining, clustering algorithms, and graph analysis
Covers dimensionality reduction and machine learning algorithms for large data
Recommendation
This book is recommended for students and professionals interested in data mining, machine learning, and analysis of large-scale datasets, especially those related to the web and social networks.
How GetVM Works
Learn by Doing from Your Browser Sidebar
Access from Browser Sidebar
Simply install the browser extension and click to launch GetVM directly from your sidebar.
Select Your Playground
Choose your OS, IDE, or app from our playground library and launch it instantly.
Learn and Practice Side-by-Side
Practice within the VM while following tutorials or videos side-by-side. Save your work with Pro for easy continuity.