Data Mining | Machine Learning | Spark | Stanford University

Stanford University

Explore large-scale data mining and machine learning techniques using Spark. Gain practical skills for processing massive datasets at Stanford University.

University CoursesData ScienceMapReduceSpark

Introduction

The course will discuss data mining and machine learning algorithms for analyzing very large amounts of data. The emphasis will be on MapReduce and Spark as tools for creating parallel algorithms that can process very large amounts of data.

screenshot

Highlights

Covers topics such as Frequent itemsets and Association rules, Near Neighbor Search in High Dimensional Data, Locality Sensitive Hashing (LSH), Dimensionality reduction, Recommendation Systems, Clustering, Link Analysis, Large-scale Supervised Machine Learning, Data streams, Mining the Web for Structured Data, Web Advertising.
Provides hands-on experience with Spark through Colab notebooks.
Requires strong background in computer science, probability, linear algebra, and algorithmic analysis.

Recommendation

This course is recommended for students interested in large-scale data mining and machine learning, especially those with a strong computer science and quantitative background. It provides practical skills in using Spark for processing massive datasets.

How GetVM Works

Learn by Doing from Your Browser Sidebar

Access from Browser Sidebar

Access from Browser Sidebar

Simply install the browser extension and click to launch GetVM directly from your sidebar.

Select Your Playground

Select Your Playground

Choose your OS, IDE, or app from our playground library and launch it instantly.

Learn and Practice Side-by-Side

Learn and Practice Side-by-Side

Practice within the VM while following tutorials or videos side-by-side. Save your work with Pro for easy continuity.

Explore Similar Hands-on Tutorials

Getting Started with Artificial Intelligence , 2nd Edition

Technical TutorialsData ScienceMachine Learning

Comprehensive introduction to AI, covering machine learning and data science. Practical guide to building enterprise applications with real-world examples.

Machine Learning For Dummies, IBM Limited Edition

Technical TutorialsData ScienceMachine Learning

Comprehensive guide to machine learning and data science, suitable for beginners and experienced professionals. Authored by experts Daniel Kirsch and Judith Hurwitz.

A Programmers Guide to Data Mining

Technical TutorialsData SciencePython

Comprehensive guide to data mining techniques, including recommendation systems, classification, and clustering. Beginner-friendly introduction for programmers with hands-on exercises and Python code.

Data Mining Concepts and Techniques

Technical TutorialsData ScienceMachine Learning

Comprehensive coverage of data mining concepts and techniques, including data preprocessing, classification, clustering, and association rule mining. Essential resource for students, researchers, and professionals in data mining, machine learning, and data analysis.

Foundations of Data Science

Technical TutorialsComputer ScienceData Science

Dive into the core principles and techniques of data science with this comprehensive course by renowned experts. Gain a strong foundation in algorithms, machine learning, and more.

Fundamentals of Data Visualization

Technical TutorialsData AnalysisData ScienceData Visualization

Comprehensive guide to understanding the principles and techniques of data visualization, covering design, perception, and communication of visual data. Practical insights and tools for creating effective visualizations.

Hands-On Data Visualization

Technical TutorialsData ScienceJavaScript

Comprehensive guide to data visualization techniques and best practices. Learn to design interactive charts and customized maps for your website using free and easy-to-learn tools.

High-Dimensional Data Analysis with Low-Dimensional Models: Principles, Computation, and Applications

Technical TutorialsComputer ScienceData ScienceMathematics

Comprehensive exploration of high-dimensional data analysis, covering real-world applications in medical imaging, computer vision, and more. Valuable resource for researchers and practitioners.

Mining of Massive Datasets

Technical TutorialsData Science

Comprehensive guide to data mining, machine learning, and analysis of massive datasets, including techniques for similarity search, data-stream processing, and graph analysis.