Big Data Cat Questions
CONTINUOUS ASSESSMENT TEST
Question Paper Code : 24NUS1F233
Course Code - Name : 20ITPC502 - BIG DATA ESSENTIALS
Degree & Program : B.Tech (IT)
Date of Exam : 19.8.24 Year / Semester : III / V
Course Outcomes
CO1 Illustrate various big data concepts and its use cases in various application domains.(K1)
CO2 Understand the Hadoop distributed file systems on different applications.(K2)
CO3 Infer the working of Hadoop architecture and Mapreduce Framework.(K2)
CO4 Articulate the different Hadoop ecosystem components.(K3)
CO5 Demonstrate the big data solutions using Spark Programming.(K3)
CO6 Solve the various distributed applications using the Big data technologies.(K3)
SAI RAM ENGINEERING COLLEGE
An Autonomous Institution | Affiliated to Anna University & Approved by AICTE, New Delhi
Accredited by NBA and NAAC 'A+' | BIS/EOMS ISO 21001 : 2018 and 9001 : 2015 Certified and NIRF ranked institution
Sai Leo Nagar, West Tambaram, Chennai - 600 044. www.sairam.edu.in
K1 CO3
A) Storing data in a centralized location B) Processing data close to where it is stored
C) Moving data to a different cluster D) Encrypting data for security
10 Which of the following types of failures can occur in a MapReduce job? K1 CO3
A) Task failure B) Node failure
C) Application failure D) All of the above
PART - B (10 x 2 = 20 Marks)
K-Level CO
11 Write the Hadoop Archives operations. K2 CO2
12 Define Codec. List out some of the compression algorithms. K2 CO2
13 What is serialization and deserialization? K2 CO2
14 What are the features of MapReduce? K2 CO3
15 What is the use of the org.apache.hadoop.io package? K2 CO3
16 What is YARN? K2 CO3
17 List out the differences between Fair scheduling and Capacity scheduling. K2 CO3
18 List 5 steps in submitting an application in YARN. K2 CO3
19 State one Map-Side tuning property and describe it. K2 CO3
20 What is Speculative execution? K2 CO3
21 a) Explain briefly about Data Ingest with Flume and Sqoop. 10 K2 CO2
(Or)
b) Explain in detail about Hadoop I/O - Compression. 10 K2 CO2
a) Explain in detail about the MapReduce features. 10 K2 CO3
(Or)
Distribution of COs (Percentage wise)
CO No. : CO1 CO2 CO3 CO4 CO5 CO6
% : 41.5 58.5
PART - B (10 x 2 = 20 Marks)
K-Level CO
11 Define PIG. K2 CO4
12 What are the major components in the Apache Pig framework? K2 CO4
13 Compare HBase with RDBMS. K2 CO4
14 Define NULL value in PIG LATIN. K2 CO4
15 Explain the difference between transformations and actions in Spark. K2 CO5
16 What is the use of 'spark-shell' in Spark? K2 CO5
17 What is DAG (Directed Acyclic Graph) in Spark? K2 CO5
18 How does Spark handle fault tolerance in RDDs? K2 CO5
19 What is the role of shared memory in CUDA? K2 CO6
20 What is the purpose of thread divergence in CUDA? K2 CO6
Course Outcomes
CO1 Illustrate various big data concepts and its use cases in various application domains.(K1)
CO2 Understand the Hadoop distributed file systems on different applications.(K2)
CO3 Infer the working of Hadoop architecture and Mapreduce Framework.(K2)
CO4 Articulate the different Hadoop ecosystem components.(K3)
CO5 Demonstrate the big data solutions using Spark Programming.(K3)
CO6 Solve the various distributed applications using the Big data technologies.(K3)
Distribution of COs (Percentage wise)
CO6
% : 35% 35% 30%
K1 - Remember; K2 - Understand; K3 - Apply; K4 - Analyze; K5 - Evaluate; K6 - Create