BIG DATA ANALYTICS
Semester 7 | Course Code: BCS714D
Big Data Analytics – Introduction & Environment: Classification of data, characteristics, evolution and definition of big data, traditional business intelligence vs big data, typical data warehouse and Hadoop environment, big data analytics types and importance, technologies in big data environments, key analytical tools, NoSQL, and Hadoop.
Big Data Analytics – Hadoop & MapReduce: Hadoop motivation and overview, RDBMS vs Hadoop, HDFS concepts, processing data with Hadoop, resource and application management using YARN, and MapReduce programming concepts including mapper, reducer, combiner, partitioner, searching, sorting, and compression.
Introduction to MongoDB: What is MongoDB, Why MongoDB, Terms used in RDBMS and MongoDB, Data Types in MongoDB, MongoDB Query Language. TB1: Ch. 6, Sections 6.1-6.5
Introduction to Hive & Pig: What is Hive, Hive Architecture, Hive data types, Hive file formats, Hive Query Language (HQL), RCFile implementation, User Defined Function (UDF). Introduction to Pig: What is Pig, Anatomy of Pig, Pig on Hadoop, Pig Philosophy, Use case for Pig, Pig Latin Overview, Data types in Pig, Running Pig, Execution Modes of Pig, HDFS Commands, Relational Operators, Eval Function, Complex Data Types, Piggy Bank, User Defined Function, Pig vs Hive.
