COLLEGE OF TECHNOLOGY-UNIVERSITY OF BUEA
Course: Big Data Analytics
Course code: COT602
6 credits
Course Instructors:
- Dr. TCHAGNA Aurelle
- Dr. MIH Thomas
Objectives: The course is designed to enable students to:
- Understand the framework to analyze data
- Understand the principle of big data/ data science ad data analytics
- Expose students to real data analysis system in telecommunication, power system,
electronic, etc.
COURSE CONTENT
Chapter 1 : Introduction and Overview to Big Data Analytics
1. Big Data Concept & Definition
2. Big Data, Data Science and Data Analytic
3. MapReduce Paradigm
4. Big Data Framework
5. Machine Learning (ML)
6. NoSQL Data Base
7. Big Data Applications
Chapter 2 : Practical Tutorial on MapReduce ad Hadoop for Big Data Analysis
1. Cloudera Platform Installation
2. Linux
3. First Hadoop/ MapReduce Program
4. Practical Homework
Chapter 3 : Developing a Strategy for Integrating Big Data Analytics into the Enterprise
1. Deciding What, How, and When Big Data Technologies Are Right for You
2. The Strategic Plan for Technology Adoption
3. Standardize Practices for Soliciting Business User Expectations
Page | 1
4. Acceptability for Adoption: Clarify Go/No-Go Criteria
5. Prepare the Data Environment for Massive Scalability
6. Promote Data Reuse
7. Institute Proper Levels of Oversight and Governance
8. Provide a Governed Process for Mainstreaming Technology
9. Considerations for Enterprise Integration
10. Thought Exercises
Chapter 4 Big Data Analytics Tools and Techniques
1. Understanding Big Data Storage
2. A General Overview of High-Performance Architecture
3. HDFS
4. MapReduce and YARN
5. Expanding the Big Data Application Ecosystem
6. Zookeeper
7. HBase
8. Hive
9. Pig
10. Mahout
17. Considerations
18. Thought Exercises
Chapter 5: Developing Big Data Analytics Applications
1. Parallelism
2. The Mythe of Simple Scalability
3. The Application Development Framework
4. The MapReduce Programming Model
5. A Simple Example
6. More on Map Reduce
7. Other Big Data Development Frameworks
8. The Execution Model
9. Thought Exercises
Chapter 6: Hadoop and Spark Projects in Telecommunication, Electronic and Power System
Chapter 7: Mathematical modelling of Big Data analysis
Page | 2