Part#0: Introduction
Big Data Science Usecases
The Lifecycle of a Data Science Project
Part#1: Brief Revision of Analytics
Algorithms and Applications
Data Pre-processing
Techniques
Dimension Reduction
Algorithms
Principal Component Analysis
Singular Value Decomposition
Feature Selection and
Feature Extraction
Forward Selection Algorithms
Feature Ranking Algorithms
Finding Similarity in Data
Similarity Measures
K-means
Hierarchical Clustering
DBSCAN
MinHash
Bioinspired Algorithms
Finding Frequent Itemsets
Data Classification
Algorithms
Decision Trees
Neural Networks
KNN
Support Vector Machines
Ensemble Methods
Data Analytics Model Validation
Recommender Systems
Content Based Recommenders
Collaborative Filtering Recommenders
Trust Based Recommenders
Sentiment Analysis
Mining Data Streams
Relational Database
Management Systems
A brief history of Apache Hadoop
Analyzing Datasets with Hadoop
Java MapReduce
Hadoop Streaming
HDFS
Introduction to Yarn
Developing a MapReduce Application
Intro to Flume, Sqoop, Pig, Hive, Crunch HBase and ZooKeeper
Part#3: Introduction to Spark