Big Data Analytics

Big Data Analytics Course Details – BCS714D

BIG DATA ANALYTICS

Semester 7 | Course Code: BCS714D

CIE Marks
50
SEE Marks
50
Total Marks
100
Credits
03
Teaching Hours/Week (L:T:P:S)
3:0:0:0
Total Hours of Pedagogy
40
Exam Hours
3
Examination Nature (SEE)
Theory
BCS714D Big Data Analytics
MODULE-1

Big Data Analytics – Introduction & Environment: Classification of data, characteristics, evolution and definition of big data, traditional business intelligence vs big data, typical data warehouse and Hadoop environment, big data analytics types and importance, technologies in big data environments, key analytical tools, NoSQL, and Hadoop.

MODULE-2

Big Data Analytics – Hadoop & MapReduce: Hadoop motivation and overview, RDBMS vs Hadoop, HDFS concepts, processing data with Hadoop, resource and application management using YARN, and MapReduce programming concepts including mapper, reducer, combiner, partitioner, searching, sorting, and compression.

MODULE-3

Introduction to MongoDB: What is MongoDB, Why MongoDB, Terms used in RDBMS and MongoDB, Data Types in MongoDB, MongoDB Query Language. TB1: Ch 6: 6.1-6.5

MODULE-4

Introduction to Hive & Pig: What is Hive, Hive Architecture, Hive data types, Hive file formats, Hive Query Language (HQL), RC File implementation, User Defined Function (UDF). Introduction to Pig: What is Pig, Anatomy of Pig, Pig on Hadoop, Pig Philosophy, Use case for Pig, Pig Latin Overview, Data types in Pig, Running Pig, Execution Modes of Pig, HDFS Commands, Relational Operators, Eval Function, Complex Data Types, Piggy Bank, User Defined Function, Pig Vs Hive.

Scroll to Top