0% found this document useful (0 votes)

24 views69 pages

Module 3 NOSQL

Uploaded by

sivefik636

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

24 views69 pages

Module 3 NOSQL

Uploaded by

sivefik636

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 69

3.

NoSQL
Relational Database NoSQL
Student id Name Location Gender College
{
“Student_id” : “24”
“Name” : “Sachin”
24 Sachin Vasai M VCET “Hobby” : “Reading Books”
“Branch” : “CSE DS”
25 Dinesh Virar M VIT }
{
26 Mayuri Palghar F SPIT
“Student_id” : “25”
“Name” : “Dinesh”
1. Follow structure while entering the data. “Location” : “Virar”
2. Blank Space not allowed(Mark Nil - “Hobby” : “Singing”
occupies space or space wasted).
}
3. Can’t insert additional data.
Need to change data according to the
structure or Need to mold Structure
according to the data.(time consuming &
requires efforts.)
SQL NoSQL

1) Databases are categorized as Relational Database NoSQL databases are categorized as Non-relational or
Management System (RDBMS). distributed database system.

2) SQL databases have ﬁxed or static or predeﬁned schema. NoSQL databases have dynamic schema.

3) SQL databases display data in form of tables so it is known as NoSQL databases display data as collection of key-value pair,
table-based database. documents, graph databases or wide-column stores.

4) SQL databases use a powerful language "Structured Query In NoSQL databases, collection of documents are used to query
Language" to deﬁne and manipulate the data. the data. It is also called unstructured query language. It varies
from database to database.

5) SQL databases are best suited for complex queries.(structured NoSQL databases are not so good for complex queries
Query Language) because these are not as powerful as SQL queries (Untructured
Query Language).

6) SQL databases are not best suited for hierarchical data NoSQL databases are best suited for hierarchical data storage
storage. (Based on ACID Property) (CAP Theorem).

7) MySQL, Oracle, Sqlite, PostgreSQL and MS-SQL etc. are the MongoDB, BigTable, Redis, RavenDB, Cassandra, Hbase, Neo4j,
example of SQL database. CouchDB etc. are the example of nosql database
Brief History of NoSQL Databases

1998- Carlo Strozzi use the term NoSQL for his lightweight, open-source relational
database
• 2000- Graph database Neo4j is launched
• 2004- Google BigTable is launched
• 2005- CouchDB is launched
• 2007- The research paper on Amazon Dynamo is released
• 2008- Facebooks open sources the Cassandra project
• 2009- The term NoSQL was reintroduced
Introduction to NoSQL
• Non-relational database (doesn't have features which are related to Relational Database)
• Doesn’t have predefined schema (Structure of the table will not be predefined, Total no. of
attributes & meaning of entire database will not be same at every point of time)
• Doesn’t store data in tables (stores unstructured data)
• Generally used to store big data and real-time data (data can be of different types for eg.
image, audio, video for storing this we can’t use traditional database format. It also stores
streaming data i.e. data which is recorded live this was not handled by the structured format
so NoSQL was introduced)
• Follows a CAP Theorem (Consistency, Availability, Partitioning) NoSQL can’t follow all 3
properties at a time, it can follow any of 2 properties.
What is NoSQL?
• NoSQL is next generation database which is completely different from the traditional database.
• NoSQL refers to a non-relational database management system designed to handle large volumes of
diverse, unstructured, and semi-structured data that traditional relational databases struggle with.
• NoSQL stands for Not only SQL. SQL as well as other query languages can be used with NoSQL databases.
• NoSQL is non-relational database and it is schema-free.
• NoSQL uses distributed architecture and works on multiple processors to give high performance.
• NoSQL databases are horizontally scalable.
• Many open-source NoSQL databases are available. Data file can be easily replicated.
• NoSQL uses simple API.
• NoSQL can manage huge amount of data.
• NoSQL can be implemented on commodity hardware which has separate RAM and disk
(shared nothing concept).
Features of NoSQL
1. Never follows Relational Model (Never provide Table with fixed Column
Records & it is Schema Free).
2. Provide share Nothing Environment (if data is distributed in Nodes, each
Node will receive different data & each node performs its task
independently).
3. Scalability (Scale up i.e. Vertical scaling [increase in Hardware configuration
which means RAM , HDD - Expensive] & Scale out i.e. horizontal scaling
[dividing the load or data into multiple nodes – low cost] )
4. Low-cost hardware as compared to relational database.
5. Faster Performance (because space is not wasted).
NoSQL Business Drivers
NoSQL Business Drivers
Volume:
• Data is getting generated exponentially which is increasing the volume, with this increase
in data there is a need for extending the storage capacity.
• NOSQL databases are designed to handle massive amounts of data, which is a core
characteristic of big data.
• Because of this Volume, there is a need to scale up.
• Not only for the storage purpose, there is a need to scale up the processing speed as well
as resources.
• If the data is large in amount, requirement of processing it is more & resources used to
fulfilling these requirements will also be more.
• Because of RDBMS it would take more amount of time to process this huge data, so
organization shifted from serial to parallel processing (entire data will be divided into
clusters , every cluster will be processed parallelly & finally output from these clusters will
be combined).
NoSQL Business Drivers
Velocity:
• Velocity means the rate at which data was been generated.
• Initially rate at which data was getting generated was very low, hence there was
very low traffic generated.
• The no. of request for accessing the data was very less.
• Due to velocity , there was Random Bursts in web Traffic, it was difficult for RDBMS
to respond to all these request in given amount of time, it resulted in slow response
time.
• To deal with this problem, there was a need for huge number of resources, which
was expensive for the organization & company.
NoSQL Business Drivers
Variability:
• Big data is not always structured and consistent. NoSQL databases excel at handling
diverse data types (structured, semi-structured, and unstructured) without requiring rigid
schemas.
• This flexibility is essential for accommodating the various data formats generated by social
media, IoT devices and other modern sources.(availability of data)

Agility:
• NoSQL databases offer greater agility in terms of development and deployment.
• They allow for faster iteration and adaptation to changing business needs due to their
flexible schema designs and ability to scale horizontally.
• This agility is particularly important in fast-paced environments where time to market is a
competitive advantage. (time taken to feed or retrieve data is high)
CAP Theorem
• The CAP theorem, originally introduced as the CAP principle, can be used to explain
some of the competing requirements in a distributed system with replication.
• It is a tool used to make system designers aware of the trade-offs while designing
networked shared-data systems.
• The three letters in CAP refer to three desirable properties of distributed systems
with replicated data: consistency (among replicated copies), availability (of the
system for read and write operations) and partition tolerance (in the face of the
nodes in the system being partitioned by a network fault).
• The CAP theorem states that it is not possible to guarantee all three of the desirable
properties – consistency, availability, and partition tolerance at the same time in a
distributed system with data replication.
• The theorem states that networked shared-data systems can only strongly support
two of the following three properties:
• Consistency: means that all clients see the same data at the same time, no matter
which node they connect to in a distributed system. To achieve consistency,
whenever data is written to one node, it must be instantly forwarded or replicated to
all the other nodes in the system before the write is deemed successful.
• Availability: means that every non-failing node returns a response for all read and
write requests in a reasonable amount of time, even if one or more nodes are down.
Another way to state this — all working nodes in the distributed system return a valid
response for any request, without failing or exception.
• Partition Tolerance: means that the system continues to operate despite arbitrary
message loss or failure of part of the system. In other words, even if there is a
network outage in the data center and some of the computers are unreachable, still
the system continues to perform. Distributed systems guaranteeing partition
tolerance can gracefully recover from partitions once the partition heals.
The CAP theorem categorizes systems into three categories:
• CP (Consistent and Partition Tolerant) database:
A CP database delivers consistency and partition tolerance at the expense of
availability.
When a partition occurs between any two nodes, the system has to shut down the
non-consistent node (i.e., make it unavailable) until the partition is resolved.
Partition refers to a communication break between nodes within a distributed
system. Meaning, if a node cannot receive any messages from another node in the
system, there is a partition between the two nodes.
Partition could have been because of network failure, server crash, or any other
reason.
• AP (Available and Partition Tolerant) database:
An AP database delivers availability and partition tolerance at the expense of
consistency.
When a partition occurs, all nodes remain available but those at the wrong end
of a partition might return an older version of data than others.
When the partition is resolved, the AP databases typically resync the nodes to
repair all inconsistencies in the system.
• CA (Consistent and Available) database:
A CA delivers consistency and availability in the absence of any network
partition.
Often a single node’s DB servers are categorized as CA systems.
Single node DB servers do not need to deal with partition tolerance and are thus
considered CA systems.
NOSQL Case study
1. Amazon DynamoDB
2. Google’s BigTable
• Google’s motivation for developing BigTable is driven by its need for
massive scalability, better performance characteristics and ability run on
commodity hardware.
• Each time when a new service or increase in load happens, its solution
BigTable would result in only a small incremental cost.
• Volume of Google’s data generally is in petabytes and is distributed over
100,000 nodes
3. MongoDB

• MongoDB was designed by Eliot Horowitz with his team in 10gen.

• MongoDB was built based on their experiences in building large scale, high
availability, robust systems.
• MongoDB was thought of changing the data model of MySql from relational to
document based, to achieve speed, manageability, agility, schema-less databases
and easier horizontal scalability (also JOIN free).
• Relational databases like MySql or Oracle work well with, say, indexes, dynamic
queries and updates.
• MongoDB works exactly the same way but has the option of indexing an
embedded field.
4. Neo4j
• Neo4j is an open-source (source code is available in github) sponsored by
Neo Technology.
• Its NoSQL graph database is implemented in Java.
• Its development started in 2003; it was made publicly available since
2007.
• Neo4j is used today by hundreds to thousands of enterprises.
• To name a few: scientific research, routing, matchmaking, network
management, recommendations, social networks, software analytics,
organizational and project management
Desirable features of NoSQL that drive business are listed below:
1. 24 × 7 Data availability
2. Location transparency
3. Schema-less data model
4. Modern day transaction analysis
5. Architecture that suits big data
6. Analytics and business intelligence
NoSQL Data Architectural Patterns

Types of NoSQL Data Stores

1. Key−value store.
2. Column store.
3. Document store.
4. Graph store.
1. Key Value Store Database
• Most basic data model.
• Stores the data in the form of key-value pairs.
• Key is the representative of data value.
• Key can be integer, string or any other data type but it must always be
unique.
• Value is a data, that is correlated to the key (JSON, BLOB(Binary large
Object), String, etc.)
• The key-value pair storage databases generally store data as a hash table
where each key is unique.
• This type of pattern is usually used in shopping websites or e-commerce
applications.
1. Key Value Store Database
Advantages:
• Can handle large amounts of data and heavy load
• Easy retrieval of data by keys.
Disadvantages:
• Complex queries may involve multiple key-value pairs which may delay performance.
• Data can involve many-to-many relationships which may collide.
Use:
• DynamoDB
• Berkeley DB
Examples of Key−Value Stores
• Redis, Amazon Dynamo, Azure Table Storage (ATS), Riak, Memcache,
etc. Uses of Key−Value Stores Dictionary, image store, lookup tables,
cache query, etc.
• A key−value store is similar to Dictionary where for a word (key) all
the associated words (noun/verb forms, plural, picture, phrase in
which the word is used, etc.) and meaning (values) are given.
• External websites are stored as key−value store in Google’s
database. Amazon S3 (simple storage service) makes use of
key−value store to save the digital media content like photos, music,
videos in the cloud. In a key−value store, static component is the
URL of the website and images. The dynamic component of the
website generated by scripts is not stored in key−value store.
2. Column Store Database
• Data storage is done in individual cells (can relate with RDBMS)
• Every column is handled differently (all columns coming under particular
column is functioning separately).
• Individual columns will contain several columns inside it.
2. Column Store Database
Advantages:
• Readily available data
• Aggregate queries can run readily on data(SUM, AVG, COUNT , etc ).
Disadvantages:
• Not efficient with online transactional processing.
Use:
• HBase
• Cassandra
3. Document Database
• Stores the data in key-value pair but here values are termed as documents.
• Documents can be any complex data structures.
• Documents can be arrays, strings, XML, JSON, etc.
• Documents can be nested too ( Multiple documents can be inside single
document) which can increase the complexity of Storage of data.
• For example, if the root is Employee, the path can be
Employee[id=‘2300’]/Address/street/BuildingName/text()
• Though the document store tree structure is complex the search API is simple.
• Document structure uses JSON (JavaScript Object Notation) format for deep
nesting of tree structures associated with serialized objects.
• But JSON does not support document attributes such as bold, hyperlinks, etc.
• Examples include: MongoDB, CouchBase, and CouchDB.
3. Document Database
Advantages:
• Useful for semi-structured data.
• Retrieval and management of data is easy.
Disadvantages:
• Aggregate operations may not work fine (as data is stored in the form of semi-structure data).
Use:
• CouchDB
• MongoDB
MangoDB
This scalable, high performance, open source NOSQL db features
document-oriented storage, full index support, replication and fast on-site updates.
This product is suitable for dynamic queries, dynamic data structures, written in
C/C++.

CouchDB
Also, an open-source database that focuses on the ease of data storage in a series
of JSON documents, each with its own definition of the schema. Eventual
consistency is enforced by ACID semantics that prevents locking db files during
writing.
4. Graph Database
• Stores the data in form of graphs.
• Graphs are basic data structures that states connection between objects.
• Objects that are connected are termed as nodes(In this case objects are called as
nodes and these nodes are connected to every other node).
• Relationships that define these objects are termed as edges (the connection that
is used to define the relationships between these objects or nodes are termed as
edges).
• Graph stores contain sequence of nodes and relations that form the graph. Both
nodes and relationships contain properties like follows, friend, family, etc.
• So, a graph store has three fields: nodes, relationships and properties.
• Examples include: Neo4j, AllegroGraph, TeradataAster.
Some of Neo4j features are listed below:
1. Neo4j has CQL, Cypher query language much like SQL.
2. Neo4j supports Indexes by using Apache Lucence.
3. It supports UNIQUE constraints.
4. Neo4j Data Browser is the UI to execute CQL Commands.
5. It supports ACID properties of RDBMS.
6. It uses Native Graph Processing Engine to store graphs.
7. It can export query data to JSON and XLS format.
8. It provides REST API to Java, Scala, etc
4. Graph Database
Advantages:
• Fast traversal and retrieval of data.
Disadvantages:
• Because nodes are connected to each other, can easily traverse to the entire graph which
increases the traversal rate. Now disadvantage is that, if incase wrong relationship is
established in any two node. The problem of infinite loop may occur.
Use:
• Neo4J
• FlockDB
Type Typical usage Examples

Key-value store—A simple data •Image stores •Berkeley DB • Memcache

storage system that uses a key •Key-based file systems •Redis • Riak •DynamoDB
to access a value •Object cache
•Systems designed to scale

Column family store—A sparse • Web crawler results • Apache HBase • Apache
matrix system that uses a row •Big data problems that can relax Cassandra •Hypertable •
and a column as keys consistency rules Apache Accumulo

Graph store—For relationship •Social networks •Neo4j • AllegroGraph •Bigdata

intensive problems • Fraud detection (RDF data store) • InfiniteGraph
•Relationship-heavy data (Objectivity)

Document store—Storing •High-variability data •Document • MongoDB (10Gen) •CouchDB

hierarchical data structures search • Integration hubs • Web •Couchbase • MarkLogic •
directly in the database content management • eXist-db •Berkeley DB XM
Publishing
Variation of NoSQL Architectural patterns
• Variations – different architectural patterns NoSQL database follow .
• Variation exist because different problems need different storage and access
models.
• The key−value store, column family store, document store and graph store
patterns can be modified based on different aspects of the system and its
implementation.
• Database architecture could be distributed (manages single database
distributed in multiple servers located at various sites) or federated (manages
independent and heterogeneous databases at multiple sites).
1. Customization for RAM or SSD stores
2. Distributed stores
3. Grouping Items
NoSql Case Study
Case study: LiveJournal’s Memcache
• Engineers working on the blogging system LiveJournal started
to look at how their systems were using their most precious
resource: the RAM in each web server.
• LiveJournal had a problem. Their website was so popular that
the number of visitors using the site continued to increase on a
daily basis. The only way they could keep up with demand was
to continue to add more web servers, each with its own
separate RAM.
Case study: Google’s MapReduce - use commodity hardware to
create search indexes
• One of the most influential case studies in the NoSQL
movement is the Google MapReduce system. In this paper,
Google shared their process for transforming large volumes of
web data content into search indexes using lowcost commodity
CPUs.
• Though sharing of this information was significant, the concepts
of map and reduce weren’t new. Map and reduce functions are
simply names for two stages of a data transformation as given
in figure
What is a Big Data NoSQL Solution?
• A decade ago, NoSQL was deployed in companies such as Google,
Amazon, Facebook and LinkedIn.
• Nowadays, most enterprises that are customer-centric and
revenue-driving applications that serve millions of consumers are
adopting this database.
• The move is motivated by the explosive growth of mobile devices, the IoT
and cloud infrastructure.
• The need of industries for scalability and performance requirements was
rising which the relational database technology was never designed to
address.
• Thus, enterprises are turning to NoSQL to overcome these limitations. A
few of the case studies which require NoSQL kind of databases are listed
in the following subsections.
1 Recommendation
2 User Profile
3 Real-Time Data Handling
4 Content Management
5 Catalog Management
6 360-Degree Customer View
7 Mobile Applications
8 Internet of Thing
9 Fraud Detection
Use Case Explanation Suitable NoSQL Type Examples

Suggests products, movies, or content based on user history and

1. Recommendation preferences (e.g., Amazon, Netflix). Needs relationship tracking Graph DB / Document DB Neo4j, MongoDB
between users and items.

Stores dynamic user info (name, preferences, activity logs). Data

2. User Profile Document DB MongoDB, Couchbase
structure varies from user to user.

High-speed applications (stock trading, gaming, chat apps, IoT Key-Value Store / Wide-Column
3. Real-Time Data Handling Redis, Cassandra
sensors). Requires very fast read/write. Store

Manages unstructured/semi-structured data (articles, videos,

4. Content Management Document DB MongoDB, CouchDB
blogs, metadata). Needs search & scalability.

E-commerce catalogs with flexible product attributes (clothes vs.

5. Catalog Management Document DB MongoDB, Couchbase
electronics). Schema-less is needed.

Combines CRM, transactions, social media, and customer

6. 360-Degree Customer View Graph DB + Document DB Neo4j + MongoDB
support data for unified view.

7. Mobile Applications Apps need offline sync, fast response, and flexible JSON storage. Document DB / Key-Value Firebase, Couchbase Mobile

Continuous streams of data from sensors (temperature, GPS,

8. Internet of Things (IoT) Time-Series DB / Wide-Column Cassandra, InfluxDB
devices). High write throughput required.

Real-time identification of abnormal transactions. Relies on

9. Fraud Detection Graph DB / Wide-Column Store Neo4j, Cassandra
pattern and relationship analysis.
Understanding Types of Big Data Problems
Big Data problems are categorized into two broad types based on how the data is accessed and
used:
A. Read-mostly Problems
• These problems involve data that is written once (or rarely updated) but read many times.
• Example: Logs, images, documents.
Subcategories:
1. Image - Large collections of images stored and retrieved (e.g., medical images, satellite images).
2. Event-log
• System or user activity logs.
• Two modes of processing:
• Real time → Data is processed as it arrives.
Example: Clickstream data from a website, IoT sensor data.
• Batch → Data is collected and processed later in bulk.
Example: Daily operational reports, server log analysis.
3. Documents
• Text-heavy data requiring indexing and search.
• Full-text search problems include:
• Simple text → keyword or phrase search.
• Annotations → metadata or tagging for better retrieval.
Big Data problems are categorized into two broad types based on how the data is accessed and
used:
4. Graph
• Data represented as nodes and edges.
• Example: Social networks, recommendation engines.
2. Read-write Problems
• These involve frequent updates as well as reads.
• Subcategories:
• High availability
• Data must always be available with minimal downtime.
• Example: Cloud databases for e-commerce, stock exchanges.
• Transactions
• Require strong consistency and atomic updates.
• Example: Banking systems, online payments.
Some ways you classify big data problems and see how NoSql systems
are changing the way organization use data.
1. Read mostly
2. Log events
3. Full text documents
Analyzing Big Data with a Shared Nothing Architecture
• In the distributed computing architecture, there are two ways of resource sharing
possible or share nothing.
• The RAM can be shared or disk can be shared (by CPUs); or no resources shared.
• The three of them can be considered as shared RAM, shared disk and
shared-nothing.
• Each of these architectures works with different types of data to solve big data
problems.
• In shared RAM, many CPUs access a single shared RAM over a high-speed bus.
• This system is ideal for large computation and also for graph stores.
• For graph traversals to be fast, the entire graph should be in main memory.
• The shared disk system, processors have independent RAM but shares disk space
using a storage area network (SAN).
• Big data uses commodity machines which shares nothing (shares no resources).
Choosing Distribution Models : Master-Slave Versus Peer-to- Peer
• NoSQL database makes distribution of data easier, since it has to move only aggregate data and not all
the related data that is used in aggregation.
• There are two styles of distributing data: Sharding and replication. A system may use either or both
techniques.
Like Riak database shards the data and also replicates it.
1. Sharding: Horizontal partitioning of a large database leads to partitioning of rows of the database. Each
partition forms part of a shard, meaning small part of the whole. Each part (shard) can be located on a
separate database server or any physical location.
2. Replication: Replication copies entire data across multiple servers. So the data is replicated and
available in multiple places.
Replication comes in two forms: master−slave and peer-to-peer.
• Master−slave replication: One node has the authoritative copy that handles writes. Slaves synchronize
with the master and handle reads.
• Peer-to-peer replication: This allows writes to any node; the nodes coordinate between themselves to
synchronize their copies of the data
Four Ways that NoSQL System Handles Big Data Problems
Every business needs to find the technology trends that have impact on its revenue.
Modern business not only needs data warehouse but also requires web/mobile
application generated data and social networking data to understand their customer’s
need.
NoSQL systems help to analyze such data.
IT executives select the right NoSQL systems and set up and configure them.
1. Manage Data Better by Moving Queries to the Data
▪ NoSQL system uses commodity hardware to store fragmented data on their
shared-nothing architecture except for graph databases which require specialized
processors.
▪ NoSQL databases improve performances drastically over RDBMS systems by moving
the query to each node for processing and not transfer the huge data to a single
processor.
2. Using Consistent Hashing Data on a Cluster
▪ A server in a distributed system is identified by a key to store or
retrieve data.
▪ The most challenging problem here is when servers become
unreachable through network partitions or when server fails.
▪ Suppose there are “n” servers to store or retrieve a value.
▪ Server is identified by hashing the value’s key modulo s.
▪ But when server fails, the server no longer fills the hash space.
▪ The only option is to invalidate the cache on all servers, renumber
them, and start once again.
▪ This solution is not feasible if the system has hundreds of servers
and one or the other server fails.
3. Using Replication to Scale Reads
▪ Replication improves read performance and database server
availability. Replication can be used as a scale-out solution where
you want to split up database queries across multiple database
servers.
▪ Replication works by distributing the load of one master to one or
more slaves.
▪ This works best in an environment where there are high number of
reads and low number of writes or updates.
▪ Most users browse the website for reading articles, posts or view
products.
▪ Writes occur only when making a purchase (during session
management) or when adding a comment or sending message to a
forum.
4. Letting the Database Distribute Queries Evenly to DataNodes
▪ The most important strategy of NoSQL data store is moving query to
the database server and not vice versa.
▪ Every node in the cluster in shared-nothing architecture is identical;
all the nodes are peers.
▪ Data is distributed evenly across all the nodes in a cluster using
extremely random hash function and so there are no bottlenecks.
▪ In ideal scale-out architecture “shared-nothing” concept is used.
▪ Since no resource is shared, there is no bottleneck and all the nodes
in this architecture act as peers.
▪ Data is evenly distributed among peers through a process called
sharding

Bda Module 3
No ratings yet
Bda Module 3
20 pages
NoSQL Databases
No ratings yet
NoSQL Databases
8 pages
Unit 4 Cap Mongodb
No ratings yet
Unit 4 Cap Mongodb
23 pages
Unit 4
No ratings yet
Unit 4
47 pages
Unit Ii - Nosql Databases
No ratings yet
Unit Ii - Nosql Databases
112 pages
Module 2
No ratings yet
Module 2
100 pages
2 - NoSQL
No ratings yet
2 - NoSQL
32 pages
NoSQL Notes
No ratings yet
NoSQL Notes
11 pages
No SQL
No ratings yet
No SQL
13 pages
Full Stack UNIT3
No ratings yet
Full Stack UNIT3
57 pages
RK NoSQL
No ratings yet
RK NoSQL
35 pages
2.1 Nosql
No ratings yet
2.1 Nosql
25 pages
Unit VI - 1
No ratings yet
Unit VI - 1
31 pages
No SQL
No ratings yet
No SQL
19 pages
Bda Mod 3
No ratings yet
Bda Mod 3
70 pages
Module 5 - NoSQL Databases
No ratings yet
Module 5 - NoSQL Databases
33 pages
Introduction To NoSQL
No ratings yet
Introduction To NoSQL
13 pages
Unit 4 BDA
No ratings yet
Unit 4 BDA
22 pages
NoSQL D
No ratings yet
NoSQL D
26 pages
Lecture 6 - NoSQL
No ratings yet
Lecture 6 - NoSQL
28 pages
Module 2
No ratings yet
Module 2
104 pages
CHP 4
No ratings yet
CHP 4
47 pages
No SQL - Types, CAP Theorem
No ratings yet
No SQL - Types, CAP Theorem
12 pages
No SQL Lecture Notes
No ratings yet
No SQL Lecture Notes
17 pages
NoSQL Databases: A Beginner's Guide
No ratings yet
NoSQL Databases: A Beginner's Guide
12 pages
NGD Unit 1-4
No ratings yet
NGD Unit 1-4
43 pages
No SQL
No ratings yet
No SQL
12 pages
NoSQL for Tech Professionals
No ratings yet
NoSQL for Tech Professionals
29 pages
Introduction To NoSQL
No ratings yet
Introduction To NoSQL
43 pages
P.prabu (29x61c) CCS334 BDA - Unit 2
No ratings yet
P.prabu (29x61c) CCS334 BDA - Unit 2
29 pages
NoSql 2024 Assign2
No ratings yet
NoSql 2024 Assign2
189 pages
Database Management Systems: UNIT-5: Nosql Databases
No ratings yet
Database Management Systems: UNIT-5: Nosql Databases
39 pages
Introduction To: Nosql
No ratings yet
Introduction To: Nosql
27 pages
NoSQL Databases Overview
No ratings yet
NoSQL Databases Overview
8 pages
Introduction To NoSQL
No ratings yet
Introduction To NoSQL
16 pages
Nosql Tricks
No ratings yet
Nosql Tricks
34 pages
BDS Session 10
No ratings yet
BDS Session 10
70 pages
CS3492-DBMS Unit-5
No ratings yet
CS3492-DBMS Unit-5
9 pages
No SQL
No ratings yet
No SQL
109 pages
BDA Unit-2
No ratings yet
BDA Unit-2
30 pages
1504846528session31 NoSQL
No ratings yet
1504846528session31 NoSQL
12 pages
Bda Module 3
No ratings yet
Bda Module 3
35 pages
Chap2 NoSQL
No ratings yet
Chap2 NoSQL
13 pages
3.1 Introduction To NoSQL
No ratings yet
3.1 Introduction To NoSQL
10 pages
Big Data Analysis
No ratings yet
Big Data Analysis
9 pages
DSA Notes Unit-03
No ratings yet
DSA Notes Unit-03
144 pages
CH 2 BDA
No ratings yet
CH 2 BDA
3 pages
Chapter24 Nosql Dbs
No ratings yet
Chapter24 Nosql Dbs
35 pages
NoSQL Databases Explained
No ratings yet
NoSQL Databases Explained
13 pages
MODULE7
No ratings yet
MODULE7
23 pages
5.1 BDA NoSQL
No ratings yet
5.1 BDA NoSQL
23 pages
Module 1 Introduction
No ratings yet
Module 1 Introduction
9 pages
Unit No 1
No ratings yet
Unit No 1
34 pages
Module 2.3
No ratings yet
Module 2.3
25 pages
Nosql What Does It Mean
No ratings yet
Nosql What Does It Mean
8 pages
41 NoSQL Introduction
No ratings yet
41 NoSQL Introduction
18 pages
UNIT II First Half Notes
No ratings yet
UNIT II First Half Notes
21 pages
Chapter - 4 - NoSQL - 1676181987
No ratings yet
Chapter - 4 - NoSQL - 1676181987
85 pages
Unit 3 - Bda
No ratings yet
Unit 3 - Bda
36 pages
SIMPLE SQL Begginers Guide To Master SQL and Boost Career
90% (10)
SIMPLE SQL Begginers Guide To Master SQL and Boost Career
425 pages
SQL Server Very Good Content
92% (12)
SQL Server Very Good Content
60 pages
Hackers Guide To Machine Learning With Python PDF
100% (16)
Hackers Guide To Machine Learning With Python PDF
272 pages
SQL PDF
100% (13)
SQL PDF
221 pages
PostgreSQL Admin for IT Pros
50% (2)
PostgreSQL Admin for IT Pros
109 pages
Postgresql Tutorial
100% (1)
Postgresql Tutorial
257 pages
Advanced SQL Tutorial for Oracle
100% (7)
Advanced SQL Tutorial for Oracle
37 pages
Collect, Transform and Combine Data Using Power BI and Power Query in Excel (Business Skills)
86% (14)
Collect, Transform and Combine Data Using Power BI and Power Query in Excel (Business Skills)
543 pages
Python in Excel (2024)
100% (14)
Python in Excel (2024)
607 pages
SQL - With Practice Exercises, Learn SQL Fast (PDFDrive) PDF
100% (3)
SQL - With Practice Exercises, Learn SQL Fast (PDFDrive) PDF
167 pages
Hadoop 2 Quick Start Guide PDF
100% (1)
Hadoop 2 Quick Start Guide PDF
736 pages
Learn Python in A Day
100% (14)
Learn Python in A Day
141 pages
Microsoft Power BI Cookbook by Greg Deckler
100% (20)
Microsoft Power BI Cookbook by Greg Deckler
655 pages
PostgreSQL For Beginners
100% (6)
PostgreSQL For Beginners
142 pages
MongoDB Administrator Training
100% (1)
MongoDB Administrator Training
216 pages
Full Course of Machine Learning
100% (17)
Full Course of Machine Learning
660 pages
SQL & NoSQL Data PDF
100% (9)
SQL & NoSQL Data PDF
238 pages
Big Data & Hadoop
100% (3)
Big Data & Hadoop
189 pages
Mastering PostgreSQL Administration
100% (10)
Mastering PostgreSQL Administration
99 pages
SQL
90% (10)
SQL
101 pages
Top 200 Data Engineer Interview Question PDF
100% (4)
Top 200 Data Engineer Interview Question PDF
482 pages
Docker Tutorial
100% (4)
Docker Tutorial
150 pages
SQL Cheat Sheet
50% (2)
SQL Cheat Sheet
2 pages
Apache Kafka Tutorial
100% (3)
Apache Kafka Tutorial
61 pages
Oracle SQL
100% (3)
Oracle SQL
110 pages
SQL Commands Cheat Sheet
80% (10)
SQL Commands Cheat Sheet
1 page
S. Haines - Modern Data Engineering With Apache Spark - A Hands-On Guide For Building Mission-Critical Streaming Applications (2022) - Libgen - Li
60% (5)
S. Haines - Modern Data Engineering With Apache Spark - A Hands-On Guide For Building Mission-Critical Streaming Applications (2022) - Libgen - Li
592 pages
Data Warehousing & Mining Guide
100% (6)
Data Warehousing & Mining Guide
143 pages
Python Basics for Beginners
100% (12)
Python Basics for Beginners
2 pages
Applied Microsoft Power BI Bring Your Data To Life
100% (14)
Applied Microsoft Power BI Bring Your Data To Life
592 pages
MongoDB Why Documents
No ratings yet
MongoDB Why Documents
15 pages
NoSQL - Database Revolution
No ratings yet
NoSQL - Database Revolution
10 pages
BDA Notes
No ratings yet
BDA Notes
96 pages
Monitoring MongoDB Performance Metrics (WiredTiger) - Datadog
No ratings yet
Monitoring MongoDB Performance Metrics (WiredTiger) - Datadog
29 pages
Abdms-Unit 2 and Unit 5 Notes
No ratings yet
Abdms-Unit 2 and Unit 5 Notes
10 pages
NOSQL
No ratings yet
NOSQL
64 pages
Final MCQ DT
No ratings yet
Final MCQ DT
176 pages
Assignment No 11 Study of Mongodb Command - 181021004
No ratings yet
Assignment No 11 Study of Mongodb Command - 181021004
20 pages
NoSQL Unit 3
No ratings yet
NoSQL Unit 3
65 pages
MongoDB Architecture Guide
No ratings yet
MongoDB Architecture Guide
18 pages
MongoDB Databases in Python With Advance Indexing
100% (4)
MongoDB Databases in Python With Advance Indexing
230 pages
Full Download Big Data Computing A Guide For Business and Technology Managers 1st Edition Vivek Kale PDF
100% (7)
Full Download Big Data Computing A Guide For Business and Technology Managers 1st Edition Vivek Kale PDF
63 pages
Lab 3
No ratings yet
Lab 3
10 pages
Spring Boot With MongoDB
No ratings yet
Spring Boot With MongoDB
16 pages
The CAP Theorem Overview
No ratings yet
The CAP Theorem Overview
16 pages
NOSQL
No ratings yet
NOSQL
55 pages
Chapter14 BigData&NoSQLDatabases
No ratings yet
Chapter14 BigData&NoSQLDatabases
39 pages
How To Create A Simple REST API in PHP - Step by Step Guide!
No ratings yet
How To Create A Simple REST API in PHP - Step by Step Guide!
96 pages
Ps C:/Users/Faiza C/Users/Faiza/C/Firstrepo
No ratings yet
Ps C:/Users/Faiza C/Users/Faiza/C/Firstrepo
25 pages
FeuersteinJSON and PLSQL - Match Made in Database
No ratings yet
FeuersteinJSON and PLSQL - Match Made in Database
21 pages
Unit Iii
No ratings yet
Unit Iii
20 pages
Mongo DB
No ratings yet
Mongo DB
21 pages
Fresco
No ratings yet
Fresco
29 pages
5 Documentdatabases
No ratings yet
5 Documentdatabases
25 pages
LoreWeaver Project Comprehensive Overview
No ratings yet
LoreWeaver Project Comprehensive Overview
56 pages
Notes - 4 Unit-Big Data
No ratings yet
Notes - 4 Unit-Big Data
38 pages
BDA Exp4
No ratings yet
BDA Exp4
7 pages
NoSQL Database Insights
No ratings yet
NoSQL Database Insights
14 pages
Slide Isp610
No ratings yet
Slide Isp610
64 pages
DP-203 Exam-PG-111-120 - ExamTopics - Passei Direto
No ratings yet
DP-203 Exam-PG-111-120 - ExamTopics - Passei Direto
10 pages

Module 3 NOSQL

Uploaded by

Module 3 NOSQL

Uploaded by

3.

• MongoDB was designed by Eliot Horowitz with his team in 10gen.

Types of NoSQL Data Stores

Key-value store—A simple data •Image stores •Berkeley DB • Memcache

Graph store—For relationship •Social networks •Neo4j • AllegroGraph •Bigdata

Document store—Storing •High-variability data •Document • MongoDB (10Gen) •CouchDB

Suggests products, movies, or content based on user history and

Stores dynamic user info (name, preferences, activity logs). Data

Manages unstructured/semi-structured data (articles, videos,

E-commerce catalogs with flexible product attributes (clothes vs.

Combines CRM, transactions, social media, and customer

Continuous streams of data from sensors (temperature, GPS,

Real-time identification of abnormal transactions. Relies on

You might also like