Course Code MT CyS-PEIS-20
Course Name Big Data Analytics
Credits 3
COURSE OBJECTIVES:
• Understand NoSQL big data management.
• Perform map-reduce analytics using Hadoop and related tools.
COURSE OUTCOMES:
• Identify Big Data and its business implications.
• Develop Big Data solutions using the Hadoop ecosystem.
• Analyze InfoSphere BigInsights Big Data recommendations.
• Apply machine learning techniques using R.
• Perform map-reduce analytics using Hadoop.
Syllabus Contents:
Unit 1:
What is big data, why big data, convergence of key trends, unstructured data, industry examples of
big data, web analytics, big data and marketing, fraud and big data, risk and big data, credit risk
management, big data and algorithmic trading, big data and healthcare, big data in medicine,
advertising and big data, big data technologies, introduction to Hadoop, open source technologies,
cloud and big data, mobile business intelligence, crowdsourcing analytics, inter and trans firewall
analytics.
Unit 2:
Introduction to NoSQL, aggregate data models, aggregates, key-value and document data models,
relationships, graph databases, schemaless databases, materialized views, distribution models,
sharding, master-slave replication, peer-to-peer replication, sharding and replication, consistency,
relaxing consistency, version stamps, map-reduce, partitioning and combining, composing
map-reduce calculations.
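
A minimal sketch of the map-reduce, combining, and partitioning topics listed in this unit, written in plain Python rather than on Hadoop. The function names (`map_phase`, `combine`, `partition`, `reduce_phase`, `run_job`), the word-count task, the sample records, and the choice of two reducers are all assumptions made for illustration; they do not come from the syllabus or from any Hadoop API.

```python
import zlib
from collections import defaultdict


def map_phase(record):
    """Map: emit a (word, 1) pair for every word in one input record."""
    return [(word.lower(), 1) for word in record.split()]


def combine(pairs):
    """Combine: pre-aggregate one mapper's output to reduce shuffle traffic."""
    totals = defaultdict(int)
    for key, value in pairs:
        totals[key] += value
    return list(totals.items())


def partition(key, num_reducers):
    """Partition: route a key to a reducer using a stable hash of the key."""
    return zlib.crc32(key.encode("utf-8")) % num_reducers


def reduce_phase(key, values):
    """Reduce: merge all partial counts that arrived for one key."""
    return key, sum(values)


def run_job(records, num_reducers=2):
    # One bucket per (hypothetical) reducer: key -> list of partial counts.
    buckets = [defaultdict(list) for _ in range(num_reducers)]
    for record in records:
        for key, value in combine(map_phase(record)):
            buckets[partition(key, num_reducers)][key].append(value)

    result = {}
    for bucket in buckets:
        for key in sorted(bucket):  # each reducer sees its keys in sorted order
            word, total = reduce_phase(key, bucket[key])
            result[word] = total
    return result


if __name__ == "__main__":
    sample = ["big data big analytics", "data data hadoop"]
    print(run_job(sample))
```

Because every occurrence of a key is routed to the same bucket, each reducer can total its keys independently of the others. The combiner is safe here only because addition is associative and commutative.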
Unit 3:
Data format, analyzing data with Hadoop, scaling out, Hadoop streaming, Hadoop pipes,
design of Hadoop distributed file system (HDFS), HDFS concepts, Java interface, data flow, Hadoop
I/O, data integrity, compression, serialization, Avro, file-based data structures.
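
Hadoop streaming, listed in this unit, lets the mapper and reducer be ordinary programs that read lines on standard input and write tab-separated key/value lines on standard output. The sketch below imitates that contract locally in Python, assuming a word-count task. The names `mapper`, `reducer`, and `simulate` are hypothetical, and `simulate` only stands in for the sort that the framework performs between the two stages; it is not a Hadoop call.

```python
from itertools import groupby


def mapper(lines):
    """Emit 'word<TAB>1' for each word, as a streaming mapper would print."""
    for line in lines:
        for word in line.split():
            yield f"{word.lower()}\t1"


def reducer(sorted_lines):
    """Sum the counts per key; the input must already be sorted by key."""
    pairs = (line.rstrip("\n").split("\t", 1) for line in sorted_lines)
    for key, group in groupby(pairs, key=lambda kv: kv[0]):
        yield f"{key}\t{sum(int(value) for _, value in group)}"


def simulate(lines):
    """Local stand-in for the pipeline: input | mapper | sort | reducer."""
    return list(reducer(sorted(mapper(lines))))


if __name__ == "__main__":
    for output_line in simulate(["big data", "Big Hadoop data"]):
        print(output_line)
```

On a real cluster the two functions would live in separate scripts that read `sys.stdin`; keeping them as generators over lines makes the same logic easy to test locally first.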
Unit 4:
MapReduce workflows, unit tests with MRUnit, test data and local tests, anatomy of a MapReduce job
run, classic MapReduce, YARN, failures in classic MapReduce and YARN, job scheduling, shuffle
and sort, task execution, MapReduce types, input formats, output formats.
Unit 5:
HBase, data model and implementations, HBase clients, HBase examples, praxis. Cassandra, Cassandra
data model, Cassandra examples, Cassandra clients, Hadoop integration.
Unit 6:
Pig, Grunt, Pig data model, Pig Latin, developing and testing Pig Latin scripts. Hive, data types and
file formats, HiveQL data definition, HiveQL data manipulation, HiveQL queries.
I. K. Gujral Punjab Technical University, Jalandhar

References:
Michael Minelli, Michelle Chambers, and Ambiga Dhiraj, "Big Data, Big Analytics:
Emerging Business Intelligence and Analytic Trends for Today's Businesses", Wiley, 2013.
P. J. Sadalage and M. Fowler, "NoSQL Distilled: A Brief Guide to the Emerging World of
Polyglot Persistence", Addison-Wesley Professional, 2012.
Tom White, "Hadoop: The Definitive Guide", Third Edition, O'Reilly, 2012.
Eric Sammer, "Hadoop Operations", O'Reilly, 2012.
E. Capriolo, D. Wampler, and J. Rutherglen, "Programming Hive", O'Reilly, 2012.
Lars George, "HBase: The Definitive Guide", O'Reilly, 2011.
Eben Hewitt, "Cassandra: The Definitive Guide", O'Reilly, 2010.
Alan Gates, "Programming Pig", O'Reilly, 2011.