Big Data Computing - Week-5

The document outlines the submission details and questions from Week 5 of the NPTEL Big Data Computing course assignment. It includes various questions related to distributed graph processing frameworks, data processing frameworks, and specific use cases for Big Data tools, with correct answers indicated for each question. The assignment was submitted on September 25, 2024, before the deadline.

Uploaded by

21102042.atharva.dalvi

Week 5: Assignment 5

The due date for submitting this assignment has passed.
Due on 2024-09-25, 23:59 IST.
Assignment submitted on 2024-09-25, 21:40 IST

1) What distributed graph processing framework operates on top of Spark? (1 point)

MLlib
GraphX
Spark Streaming
ALL

Yes, the answer is correct.
Score: 1
Accepted Answers:
GraphX

2) Which of the following frameworks is best suited for fast, in-memory data processing and supports advanced analytics such as machine learning and graph processing? (1 point)

Apache Hadoop MapReduce
Apache Flink
Apache Storm
Apache Spark

Yes, the answer is correct.
Score: 1
Accepted Answers:
Apache Spark

3) A financial institution needs to analyze historical stock market data to predict market trends and make investment decisions. Which Big Data processing framework is best suited for this scenario? (1 point)

Apache Spark
Apache Storm
Hadoop MapReduce
Apache Flume

Yes, the answer is correct.
Score: 1
Accepted Answers:
Apache Spark

4) A telecommunications company needs to process real-time call logs from millions of subscribers to detect network anomalies. Which combination of Big Data tools would be appropriate for this use case? (1 point)

Apache Hadoop and Apache Pig
Apache Kafka and Apache HBase
Apache Spark and Apache Hive
Apache Storm and Apache Pig

No, the answer is incorrect.
Score: 0
Accepted Answers:
Apache Kafka and Apache HBase

5) Many people use Kafka as a substitute for which type of solution? (1 point)

log aggregation
compaction
collection
all of the mentioned

Yes, the answer is correct.
Score: 1
Accepted Answers:
log aggregation

6) Which of the following features of Resilient Distributed Datasets (RDDs) in Apache Spark contributes to their fault tolerance? (1 point)

DAG (Directed Acyclic Graph)
In-memory computation
Lazy evaluation
Lineage information

Yes, the answer is correct.
Score: 1
Accepted Answers:
Lineage information
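
To illustrate why lineage information is the accepted answer to question 6, here is a minimal pure-Python sketch of the idea: a derived dataset remembers its parent and the transformation that produced it, so lost data can be recomputed rather than restored from a replica. The class `ToyRDD` and its methods (`map`, `filter`, `compute`, `lose_data`) are hypothetical names invented for this illustration. This is a toy model under those assumptions, not Spark's API or its actual recovery mechanism.

```python
# Toy model of lineage-based recovery. This is NOT Spark's API;
# every name below is an assumption made for illustration.
class ToyRDD:
    """A dataset that records how it was derived instead of replicating its data."""

    def __init__(self, source=None, parent=None, fn=None):
        self.source = source  # base data, standing in for a file in stable storage
        self.parent = parent  # lineage: the dataset this one was derived from
        self.fn = fn          # lineage: the transformation applied to the parent
        self.cache = None     # materialized data, which may be lost

    def map(self, fn):
        # Record the transformation; nothing is computed yet.
        return ToyRDD(parent=self, fn=lambda rows: [fn(r) for r in rows])

    def filter(self, pred):
        return ToyRDD(parent=self, fn=lambda rows: [r for r in rows if pred(r)])

    def compute(self):
        # If the data is missing, rebuild it by walking the lineage chain.
        if self.cache is None:
            if self.parent is None:
                self.cache = list(self.source)
            else:
                self.cache = self.fn(self.parent.compute())
        return self.cache

    def lose_data(self):
        # Simulate a node failure that wipes the materialized data.
        self.cache = None


base = ToyRDD(source=range(1, 11))
squares = base.map(lambda x: x * x)
even_squares = squares.filter(lambda x: x % 2 == 0)

first = even_squares.compute()

# Simulate losing the derived data, then recover it from lineage alone.
squares.lose_data()
even_squares.lose_data()
recovered = even_squares.compute()

print(first)
print(recovered == first)
```

In this sketch the recovered result matches the first computation because each derived dataset can replay its recorded transformation against its parent. The DAG, in-memory computation, and lazy evaluation options describe how work is planned and executed, whereas the recorded derivation is what makes recomputation after a loss possible.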

7) Point out the correct statement. (1 point)

Hadoop does need specialized hardware to process the data
Hadoop allows live stream processing of real-time data
In the Hadoop MapReduce programming framework, output files are divided into lines or records
None of the mentioned

Yes, the answer is correct.
Score: 1
Accepted Answers:
In the Hadoop MapReduce programming framework, output files are divided into lines or records

8) Which of the following statements about Apache Pig is true? (1 point)

Pig Latin scripts are compiled into HiveQL for execution.
Pig is primarily used for real-time stream processing.
Pig Latin provides a procedural data flow language for ETL tasks.
Pig uses a schema-on-write approach for data storage.

Yes, the answer is correct.
Score: 1
Accepted Answers:
Pig Latin provides a procedural data flow language for ETL tasks.

9) An educational institution wants to analyze student performance data stored in HDFS and generate personalized learning recommendations. Which Hadoop ecosystem components should be used? (1 point)

Apache HBase for storing student data and Apache Pig for processing.
Apache Kafka for data streaming and Apache Storm for real-time analytics.
Hadoop MapReduce for batch processing and Apache Hive for querying.
Apache Spark for data processing and Apache Hadoop for storage.

Yes, the answer is correct.
Score: 1
Accepted Answers:
Apache Spark for data processing and Apache Hadoop for storage.

10) A company is analyzing customer behavior across multiple channels (web, mobile app, social media) to personalize marketing campaigns. Which technology is best suited to handle this type of data processing? (1 point)

Hadoop MapReduce
Apache Kafka
Apache Spark
Apache Hive

Yes, the answer is correct.
Score: 1
Accepted Answers:
Apache Spark
