CS8091 – Big Data Analytics – Regulation 2017 Syllabus

CS8091 – NOTES & QP

NOTES CLICK HERE
SEMESTER QP CLICK HERE

CS8091 – SYLLABUS

UNIT I INTRODUCTION TO BIG DATA 

Evolution of Big data — Best Practices for Big data Analytics — Big data characteristics — Validating — The Promotion of the Value of Big Data — Big Data Use Cases- Characteristics of Big Data Applications — Perception and Quantification of Value -Understanding Big Data Storage — A General Overview of High-Performance Architecture — HDFS — MapReduce and YARN — Map Reduce Programming Model

UNIT II CLUSTERING AND CLASSIFICATION 

Advanced Analytical Theory and Methods: Overview of Clustering — K-means — Use Cases — Overview of the Method — Determining the Number of Clusters — Diagnostics — Reasons to Choose and Cautions .- Classification: Decision Trees — Overview of a Decision Tree — The General Algorithm — Decision Tree Algorithms — Evaluating a Decision Tree — Decision Trees in R — Naïve Bayes — Bayes? Theorem — Naïve Bayes Classifier.

UNIT III ASSOCIATION AND RECOMMENDATION SYSTEM 

Advanced Analytical Theory and Methods: Association Rules — Overview — Apriori Algorithm — Evaluation of Candidate Rules — Applications of Association Rules — Finding Association& finding similarity — Recommendation System: Collaborative Recommendation- Content Based Recommendation — Knowledge Based Recommendation- Hybrid Recommendation Approaches.

UNIT IV STREAM MEMORY

Introduction to Streams Concepts — Stream Data Model and Architecture — Stream Computing,
Sampling Data in a Stream — Filtering Streams — Counting Distinct Elements in a Stream — Estimating
moments — Counting oneness in a Window — Decaying Window — Real time Analytics Platform(RTAP) applications — Case Studies — Real Time Sentiment Analysis, Stock Market Predictions. Using Graph Analytics for Big Data: Graph Analytics

UNIT V NOSQL DATA MANAGEMENT FOR BIG DATA AND VISUALIZATION 

NoSQL Databases : Schema-less Models?: Increasing Flexibility for Data Manipulation-Key Value Stores- Document Stores — Tabular Stores — Object Data Stores — Graph Databases Hive — Sharding —
Hbase — Analyzing big data with twitter — Big data for E-Commerce Big data for blogs — Review of Basic Data Analytic Methods using R.