BDA Assignment


SAL Engineering and Technical Institute

Department of Computer Engineering

Subject Name (Code): Big Data Analytics (3170722)


Semester: 7th

ASSIGNMENT-1
1. What is Big Data? Explain the characteristics of Big Data.
2. Discuss the distributed file system in Big Data and its importance.
3. Explain the 4 Vs of Big Data and write down their use cases.
4. List various applications of Big Data, and explain Smart City and credit card fraud detection using Big
Data Analytics.
5. Explain the Hadoop ecosystem. What are the advantages of Hadoop? Explain the Hadoop architecture and its
components with a proper diagram.
6. Discuss the main configuration parameters that are specified in MapReduce.
7. Explain job scheduling in MapReduce. How is it done in the case of
(i) The Fair Scheduler
(ii) The Capacity Scheduler
8. What are the NameNode and DataNode in the Hadoop architecture?
9. What is MapReduce? Explain the working of the various phases of MapReduce with an appropriate
example and diagram.
10. Discuss Hadoop YARN in detail, along with failures in classic MapReduce.
11. How does HBase use ZooKeeper to build applications? Explain in detail.
12. Differentiate between Hive and HBase.
13. Illustrate a simple example of the working of MapReduce with all its phases.
14. Explain InputFormat, TextInputFormat, SequenceFileInputFormat, JobConf and
RecordReader in MapReduce.

ASSIGNMENT-2
1. Define NoSQL. Where is NoSQL used? List the different types of NoSQL databases.
2. Explain the difference between structured, unstructured, and semi-structured data.
3. Why is NoSQL necessary in the field of Big Data Analytics? Also list the advantages of NoSQL.
4. Differentiate between SQL and NoSQL (traditional DB vs. non-traditional DB).
5. Define NoSQL and state where it is used: (i) Document-oriented database (ii) Graph-based database.
6. Differentiate between the master-slave and peer-to-peer distribution models.
7. Define MongoDB. Describe why MongoDB is necessary to deal with Big Data applications.
8. List the key features of MongoDB and describe each one in detail.
9. Describe the following methods for MongoDB:
1. Insert 2. Save 3. Update 4. Remove 5. Find 6. Count 7. Limit 8. Sort 9. Skip 10. Pretty
10. Describe the data types available in MongoDB with examples.
11. What is an array? How can it be useful in MongoDB? Describe in detail.
12. Define List, Set, and Map in MongoDB. Describe each one with an example.

ASSIGNMENT-3
1. Explain the Metastore in Hive.
2. Differentiate: Apache Pig vs. MapReduce.
3. Explain the storage mechanism in HBase. Compare row-oriented and column-oriented database
structures.
4. What is ZooKeeper? List its benefits.
5. Discuss how the Pig data model helps in effective data flow.
6. What do you mean by HiveQL Data Definition Language? Explain any three HiveQL DDL
commands with their syntax and examples.

ASSIGNMENT-4
1. What is Spark? Explain the Spark components in detail. Also list the features of Spark.
2. Write a short note on the Spark Unified Stack.
3. What are the problems related to MapReduce data storage? How does Apache Spark solve them using
Resilient Distributed Datasets? Explain RDDs in detail.
4. What do you mean by HiveQL Data Definition Language? Explain any three HiveQL DDL
commands with their syntax and examples.
5. Explain any two of the following HDFS commands with syntax and at least one example
of each: (i) copyFromLocal (ii) setrep (iii) checksum (iv) get (v) cp (vi) chown
