Advanced Database Technology - Materials

The document provides an overview of advanced database technologies including MySQL, MongoDB, Cassandra, and Apache Hive. It highlights the features and functionalities of each database system, such as MySQL's support for both SQL and NoSQL, MongoDB's flexible document storage, Cassandra's high availability and scalability, and Hive's capabilities for large-scale data analytics. Additionally, it discusses the differences between SQL and document databases, emphasizing the advantages of NoSQL solutions for handling large and dynamic datasets.

Uploaded by

Arunmozhivarman

Advanced Database Technology:

Resource Materials for Lab Session:

MYSQL:
The MySQL Server provides an industry-standard SQL interface to the cluster,
enabling complex relational queries to be run and providing connectivity to all
of the standard MySQL connectors, including:
• Common web development languages and frameworks, e.g. PHP, Perl,
  Python, Django, Ruby, Ruby on Rails, etc.
• JDBC (for additional connectivity into ORMs including EclipseLink,
  Hibernate, etc.)
• .NET, ODBC, etc.
MySQL Document Store gives users flexibility in developing both traditional
SQL relational applications and NoSQL schema-free document database
applications. This eliminates the need for a separate NoSQL document
database. Developers can mix and match relational data and JSON documents
in the same database as well as in the same application. For example, both
data models can be queried in the same application, and results can be
returned in table, tabular, or JSON formats.

Specialized clients can perform SQL as well as CRUD operations on the
document database. These clients are MySQL Shell and the MySQL
Connectors. MySQL Shell is an interactive interface with JavaScript, Python,
and SQL modes. Connectors are used for developing applications in
programming languages such as Java, Node.js, Python, and C++.
The Document Store is one of the major new features in MySQL 8.0. With it,
you can store JSON documents in collections and manage them using CRUD
operations.
A NoSQL database has a dynamic schema for unstructured data. Data can be
stored in several ways: document-oriented, column-oriented, graph-based, or
organized as a key-value store. This flexibility means that documents can be
created without a predefined structure.
NoSQL databases are horizontally scalable. This means that you handle more
traffic by sharding, that is, by adding more servers to your NoSQL database
and spreading the data across them. Vertical scaling is like adding more floors
to the same building; horizontal scaling is like adding more buildings to the
neighbourhood. Because more servers can keep being added, a NoSQL database
can ultimately become larger and more powerful, which makes these databases
a common choice for large or ever-changing data sets.
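The idea of spreading data across servers can be sketched in a few lines. This is a minimal illustration, assuming simple hash-modulo sharding; the server names, the key format, and the helper functions `shard_for` and `distribution` are hypothetical and do not belong to any particular database.

```python
import hashlib
from collections import Counter


def shard_for(key, servers):
    """Map a key to one server using a stable hash (illustrative modulo sharding)."""
    digest = hashlib.md5(key.encode("utf-8")).hexdigest()
    return servers[int(digest, 16) % len(servers)]


def distribution(keys, servers):
    """Count how many keys land on each server."""
    return Counter(shard_for(k, servers) for k in keys)


# Hypothetical keys and server names, used only for illustration.
keys = [f"user:{i}" for i in range(1000)]
two = distribution(keys, ["node-a", "node-b"])
four = distribution(keys, ["node-a", "node-b", "node-c", "node-d"])

print(two)   # load per server with two servers
print(four)  # load per server after adding two more servers
```

With more servers, each one holds a smaller share of the keys. Plain modulo sharding moves most keys whenever the server count changes, so production systems usually rely on schemes such as consistent hashing instead.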
MONGODB:
MongoDB is a document database. It stores data in BSON, a binary-encoded
format modelled on JSON. MongoDB is an open-source non-relational database
management system that uses flexible documents instead of tables and rows to
process and store various forms of data. As a NoSQL database solution,
MongoDB does not require a relational database management system (RDBMS),
so it provides an elastic data storage model that lets users store and query
data of many different types with ease. This not only simplifies database
management for developers but also creates a highly scalable environment for
cross-platform applications and services.
A record in MongoDB is called a document. It is a data structure composed of
key-value pairs, similar in structure to a JSON object. The field values
may include numbers, strings, booleans, arrays, or even nested documents.
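The shape of a document can be illustrated with plain Python dictionaries. This is only a sketch of the data structure, not MongoDB driver code; the collection name `students` and every field except the conventional `_id` key are made up for the example.

```python
import json

# Two documents in the same hypothetical "students" collection.
# They share some fields but do not have to share all of them.
students = [
    {
        "_id": 1,
        "name": "Asha",
        "enrolled": True,                                 # boolean value
        "marks": [78, 85, 91],                            # array value
        "address": {"city": "Chennai", "pin": "600001"},  # nested document
    },
    {
        "_id": 2,
        "name": "Ravi",
        "enrolled": False,
        "hobbies": ["chess"],  # field that the first document does not have
    },
]


def field_names(doc):
    """Return the sorted list of top-level field names in a document."""
    return sorted(doc.keys())


for doc in students:
    print(json.dumps(doc))

# Nested documents are reached by following the keys.
city = students[0]["address"]["city"]
print(city)
```

The two documents have different field sets, which is what "flexible documents" means in practice: the structure is carried by each document rather than by a table definition.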
SQL vs Document Databases
SQL databases are considered relational databases. They store related data in
separate tables. When data is needed, it is queried from multiple tables to join
the data back together.
MongoDB is a document database which is often referred to as a non-relational
database. This does not mean that relational data cannot be stored in
document databases. It means that relational data is stored differently. A better
way to refer to it is as a non-tabular database.
MongoDB stores data in flexible documents. Instead of having multiple tables,
you can simply keep all of your related data together. Because the related
data can be fetched in one read without a join, reading it is typically fast.
You can still have multiple groups of data too. In MongoDB, instead of tables,
these are called collections.
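The difference between the two layouts can be sketched side by side. The example below uses Python's built-in `sqlite3` module as a stand-in for an SQL database and a plain dictionary as a stand-in for a document; the table names, field names, and sample data are assumptions made for illustration.

```python
import sqlite3

# Relational layout: related data lives in two tables and is joined on read.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, item TEXT);
    INSERT INTO customers VALUES (1, 'Meena');
    INSERT INTO orders VALUES (10, 1, 'keyboard'), (11, 1, 'monitor');
""")
rows = conn.execute("""
    SELECT c.name, o.item
    FROM customers AS c
    JOIN orders AS o ON o.customer_id = c.id
    ORDER BY o.id
""").fetchall()
conn.close()

# Document layout: the same data kept together in one nested structure.
customer_doc = {
    "_id": 1,
    "name": "Meena",
    "orders": [{"id": 10, "item": "keyboard"}, {"id": 11, "item": "monitor"}],
}
doc_rows = [(customer_doc["name"], o["item"]) for o in customer_doc["orders"]]

print(rows)
print(doc_rows)
```

Both approaches yield the same name and item pairs. The relational version reassembles them with a join, while the document version reads them from a single record.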
Cassandra:
Apache Cassandra is an open-source NoSQL distributed database trusted by
thousands of companies for scalability and high availability without
compromising performance. Linear scalability and proven fault tolerance on
commodity hardware or cloud infrastructure make it the perfect platform for
mission-critical data.

Cassandra is a free and open-source, distributed, wide-column store NoSQL
database management system designed to handle large amounts of data across
many commodity servers, providing high availability with no single point of
failure.
A keyspace in Cassandra is a namespace that defines how data is replicated
across nodes. Replication is therefore defined at the keyspace level.
[Example: keyspace creation, including a column family, in CQL 3.0]
The design goal of Cassandra is to handle big data workloads across multiple
nodes without any single point of failure. Cassandra uses a peer-to-peer
distributed architecture across its nodes, and data is distributed among all
the nodes in a cluster.
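How a replication factor turns into a set of nodes holding each piece of data can be sketched as follows. This is a simplified model, assuming one token per node, an MD5-based hash, and replicas placed on the next nodes clockwise around the ring; real Cassandra clusters use their own partitioner and typically many virtual nodes per server. The node names and the helpers `token` and `replicas_for` are hypothetical.

```python
import hashlib
from bisect import bisect_right


def token(value):
    """Hash a string to an integer position on the ring."""
    return int(hashlib.md5(value.encode("utf-8")).hexdigest(), 16)


def replicas_for(key, nodes, replication_factor):
    """Return the nodes that hold `key`: the first node clockwise from the
    key's token, followed by the next nodes until the factor is met."""
    ring = sorted((token(n), n) for n in nodes)
    tokens = [t for t, _ in ring]
    start = bisect_right(tokens, token(key)) % len(ring)
    count = min(replication_factor, len(ring))
    return [ring[(start + i) % len(ring)][1] for i in range(count)]


# Hypothetical four-node cluster with a replication factor of 3.
nodes = ["node-1", "node-2", "node-3", "node-4"]
owners = replicas_for("user:42", nodes, replication_factor=3)
print(owners)
```

With a replication factor of 3, the loss of any single node still leaves at least two copies of the key, which is the sense in which the cluster has no single point of failure.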
Data Replication in Cassandra
In Cassandra, one or more of the nodes in a cluster act as replicas for a given
piece of data. If, during a read, some of the nodes respond with an
out-of-date value, Cassandra returns the most recent value to the client. After
returning the most recent value, Cassandra performs a read repair in the
background to update the stale values.
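The read path described above can be modelled in a few lines. This is a toy simulation, assuming that each replica stores a (timestamp, value) pair per key and that the highest timestamp wins; the replica names, the sample data, and the function `read_with_repair` are hypothetical, and the repair here runs inline rather than in the background.

```python
def read_with_repair(replicas, key):
    """Return the newest value for `key` and overwrite stale replicas.

    `replicas` maps a node name to a dict of key -> (timestamp, value).
    """
    responses = [store[key] for store in replicas.values() if key in store]
    newest = max(responses)  # tuples compare by timestamp first
    repaired = []
    for name, store in replicas.items():
        if store.get(key) != newest:
            store[key] = newest  # bring the stale replica up to date
            repaired.append(name)
    return newest[1], repaired


# Hypothetical replica state: node-2 missed the latest write.
replicas = {
    "node-1": {"email": (2, "new@example.com")},
    "node-2": {"email": (1, "old@example.com")},
    "node-3": {"email": (2, "new@example.com")},
}
value, repaired = read_with_repair(replicas, "email")
print(value, repaired)
```

After the call, every replica holds the same newest version, so a later read from any of them returns the same value.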
[Figure: schematic view of how Cassandra uses data replication among the
nodes in a cluster to ensure no single point of failure.]
Note − Cassandra uses the Gossip Protocol in the background to allow the
nodes to communicate with each other and detect any faulty nodes in the
cluster.
Cassandra Query Language
Users can access Cassandra through its nodes using the Cassandra Query
Language (CQL). CQL treats the database (keyspace) as a container of tables.
Programmers work with CQL either through cqlsh, an interactive prompt, or
through separate application language drivers.
Clients can approach any of the nodes for their read and write operations. That
node (the coordinator) acts as a proxy between the client and the nodes holding
the data.
Apache HIVE:
Apache Hive is a distributed, fault-tolerant data warehouse system that enables
analytics at a massive scale. Hive is built on top of Apache Hadoop and
supports storage on S3, ADLS, GCS, etc. through the HDFS interface. Hive
allows users to read, write, and manage petabytes of data using SQL.
The Hive Metastore (HMS) is a central repository of metadata for Hive tables
and partitions, held in a relational database. It provides clients (including
Hive, Impala, and Spark) with access to this information through the metastore
service API. Because this metadata can easily be analysed to make informed,
data-driven decisions, HMS is a critical component of many data lake
architectures. It has become a building block for data lakes that utilize the
diverse world of open-source software, such as Apache Spark and Presto. In
fact, a whole ecosystem of tools, open-source and otherwise, is built around
the Hive Metastore.
[Figure: tools built around the Hive Metastore]
Hive provides full ACID support for ORC tables and insert-only support for all
other formats.
