0% found this document useful (0 votes)

41 views28 pages

Lecture 04 - Cloud Storage

storage

Uploaded by

idc.cupons

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

41 views28 pages

Lecture 04 - Cloud Storage

storage

Uploaded by

idc.cupons

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 28

Cloud

Computing
Lecture 4

Storage – CAP
RDBMS

Dan Amiga
Amiga.dan@post.idc.ac.il
Stateless Instances

http://yourapp.cloudapp.net
Putting It All Together

Web role Worker role

Web role Worker role
Web role Worker role
LB

Storage
Stateless compute
+ Durable storage
-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐
= Scalable application
Scale-up And Scale-out

Volume
Volume

WWW
$10,000
machine DNS

$1000
machine

$500 $500 $500 $500 $500

machine machine machine machine machine

# Machines
Scale Up Scale Out
Scale Up vs Scale Out
• Scale Up
– Easier (?)
– Bounded
– Expensive and proprietary
– Sometimes a must (?)
• Scale Out
– Harder (?)
– Slower when you start…
– Maintain Session (sticky vs regular)
– Unbounded, Cheaper, Always a must
• Storage is key for scaling out
On Premise / Traditional Storage Choices

• SAN, NAS, DAS

• Databases
• Offline Archival
• RAID Architecture on top of the above
Application Design Patterns

• Scale out for capacity

• Scale out for redundancy
• Asynchronous communication
• Short time outs with retries
• Idempotent operations
• Stateless with durable external storage
RAID (redundant array of independent disks)

• Storage technology that combines multiple

disk drive components into a logical unit.
• Data is distributed across the drives in one of
several ways called "RAID levels", depending
on the level of redundancy and performance
required.
Storage

• Simple, essential storage abstractions:

– Large items of data: Blobs, file streams, …
– Service state: Simple tables, caches, …
– Service communication: Queues, locks, …
• With an emphasis on:
– Massive scale, availability and durability
– Geo-location and geo-replication
• This is not a relational database in the cloud
Durable Storage

Blobs Tables Queues

…

• Three replicas of everything

• REST API
Storage
Blobs
Queues
AWS Storage Options

• Ephemeral Storage
• Elastic Block Storage (EBS)
• S3
• SQS
• NoSQL – Simple / Dynamo
• Relational Database Storage
• Storage Gateway
http://www.slideshare.net/AmazonWebServic
es/aws-storage-options
Amazon S3

• https://www.dropbox.com/help/7
• http://aws.amazon.com/s3/
• 1kb to 5TB of unlimited number
• You can choose a Region to optimize for
latency, minimize costs, or address
regulatory requirements.
• http://aws.amazon.com/s3-sla/
CAP Theorem
• Consistency (Atomic data objects)
– any read operation that begins after a write operation
completes must return that value, or the result of a
later write operation.
– E.g. if A writes 1 then 2 to location X, client B cannot
read 2 followed by 1.
• Available Data Objects
– even when severe (network? storage?) failures occur,
every request must terminate + minimal latency.
– Easier – all operations return successfully
• Partition Tolerance
– No set of failures less than total network failure is
allowed to cause the system to respond incorrectly.
– Easier – if the network stop delivering messages
between two sets of servers, the system will still
continue to work.
Simplified Proof
CAP Transactional Analysis

• You want consistency?

– Give up availability
– Or give up partition tolerance
Tradeoff

• Consistency give up
– DNS; Inconsistency;
• Availability give up
– Bad idea… Use retries
• Partition Tolerance
– VLDB/Clusters; Synchronous 2-phase commit
CAP In the real world

• AP: You are guaranteed get back responses

promptly (even with network partitions), but
you aren’t guaranteed anything about the
value/contents of the response.
• CP: You are guaranteed that any response you
get (even with network partitions) has a
consistent result. But you might not get any
responses whatsoever.
• CA: If the network never fails (and nodes never
crash, as they postulated earlier), then,
unsurprisingly, life is good. But if messages
could be dropped, all guarantees are off.
Consistency Models

• In an ideal world there would only be one consistency model;

when an update is made all observers will see that update

• Tradeoff to get a consistency update:

– Time
– Partition Tolerance or Availability

• An important observation is that in larger distributed scale

systems, network partitions are a given and as such consistency
and availability cannot be achieved at the same time. This
means that one has two choices on what to drop; relaxing
consistency will allow the system to remain highly available
under the partition conditions and prioritizing consistency
means that under certain conditions the system will not be
available.

• http://www.allthingsdistributed.com/2007/12/eventually_consis
tent.html
Eventual Consistency

• Different nodes keep replicas and each update is

“eventually” propagated to each replica
– And eventually, there is agreement on which
update is the latest
• As the consistency achieved is eventual, the
system has to resolve conflicts.
– Read repair: The correction is done when a read
finds an inconsistency. This slows down the read
operation.
– Write repair: The correction takes place during a
write operation, if an inconsistency has been
found, slowing down the write operation.
– Asynchronous repair: The correction is not part of
a read or write operation.
AWS S3 and Azure Storage Consistency
• Amazon S3 buckets in the US West (Oregon), US West
(Northern California), EU (Ireland), Asia Pacific (Singapore), Asia
Pacific (Tokyo), Asia Pacific (Sydney) and South America (Sao
Paulo) Regions provide read-after-write consistency for PUTS of
new objects and eventual consistency for overwrite PUTS and
DELETES. Amazon S3 buckets in the US Standard Region
provide eventual consistency.
• Azure storage is Consistent, Available, and Partition Tolerance.
How?
Storage Usage Comparison in Azure

CC Unit 3
No ratings yet
CC Unit 3
19 pages
CAP Theorem Lect 2
No ratings yet
CAP Theorem Lect 2
77 pages
The CAP Theorem and The Design of Large Scale Distributed Systems: Part I
No ratings yet
The CAP Theorem and The Design of Large Scale Distributed Systems: Part I
44 pages
Amazon Aurora Storage Demystified DAT401
No ratings yet
Amazon Aurora Storage Demystified DAT401
30 pages
HBase & NoSQL Database Insights
No ratings yet
HBase & NoSQL Database Insights
4 pages
REPEAT 1 Designing For Failure Architecting Resilient Systems On AWS ARC335-R1
No ratings yet
REPEAT 1 Designing For Failure Architecting Resilient Systems On AWS ARC335-R1
91 pages
Big Data Management and Nosql Databases: Doc. Rndr. Irena Holubova, PH.D
No ratings yet
Big Data Management and Nosql Databases: Doc. Rndr. Irena Holubova, PH.D
27 pages
Strong vs. Eventual Consistency - by Ashish Pratap Singh
No ratings yet
Strong vs. Eventual Consistency - by Ashish Pratap Singh
14 pages
KT AWS
No ratings yet
KT AWS
16 pages
Lecture 5 Distributed Storage Systems
No ratings yet
Lecture 5 Distributed Storage Systems
26 pages
The CAP Theorem in DBMS - GeeksforGeeks
No ratings yet
The CAP Theorem in DBMS - GeeksforGeeks
6 pages
Unit-4 - Cloud Storage and Database Services
No ratings yet
Unit-4 - Cloud Storage and Database Services
88 pages
Cap Critique
No ratings yet
Cap Critique
14 pages
The New Age of Data-Intensive Applications
No ratings yet
The New Age of Data-Intensive Applications
7 pages
Lecture 7 Chapter 5 Part 3 Big Data Storage Concepts
No ratings yet
Lecture 7 Chapter 5 Part 3 Big Data Storage Concepts
11 pages
Lec 14
No ratings yet
Lec 14
13 pages
Chapter 4 1712934164766
No ratings yet
Chapter 4 1712934164766
28 pages
A Critique of The CAP Theorem-Martin Kleppmann
No ratings yet
A Critique of The CAP Theorem-Martin Kleppmann
14 pages
Big Data Analytics Lecture 3A
No ratings yet
Big Data Analytics Lecture 3A
27 pages
04.1 Fault Tolerance 2
No ratings yet
04.1 Fault Tolerance 2
24 pages
CAP Theorem in Blockchain
No ratings yet
CAP Theorem in Blockchain
4 pages
CAP Theorem: Soft-State Replication Solution
No ratings yet
CAP Theorem: Soft-State Replication Solution
7 pages
AWS Basics for Tech Professionals
No ratings yet
AWS Basics for Tech Professionals
35 pages
Unit 2.1.1 - AWS
No ratings yet
Unit 2.1.1 - AWS
20 pages
Cloud Mod2
No ratings yet
Cloud Mod2
22 pages
DSM - CAP Theorem
No ratings yet
DSM - CAP Theorem
7 pages
Serverless, FAAS and Event-Driven Architecture
100% (1)
Serverless, FAAS and Event-Driven Architecture
63 pages
Module 2.3
No ratings yet
Module 2.3
25 pages
Random Af
No ratings yet
Random Af
15 pages
Oup Accepted Manuscript 2018
No ratings yet
Oup Accepted Manuscript 2018
18 pages
Consistency Replication
No ratings yet
Consistency Replication
49 pages
Eventually Consistent - Revisited - All Things Distributed
No ratings yet
Eventually Consistent - Revisited - All Things Distributed
5 pages
Cloud Unit-4-2
No ratings yet
Cloud Unit-4-2
32 pages
Unit 5
No ratings yet
Unit 5
21 pages
UNIT4CC
No ratings yet
UNIT4CC
45 pages
NoSQL Sharding and Replication Guide
No ratings yet
NoSQL Sharding and Replication Guide
28 pages
Ccaws Unit 5
No ratings yet
Ccaws Unit 5
17 pages
Cloud Computing
No ratings yet
Cloud Computing
68 pages
Amazon Dynamo DB - Presentation
100% (1)
Amazon Dynamo DB - Presentation
30 pages
CAP Theorem in Blockchain Explained
No ratings yet
CAP Theorem in Blockchain Explained
6 pages
Designing
No ratings yet
Designing
161 pages
NoSQL - Unit 2
No ratings yet
NoSQL - Unit 2
11 pages
Slides
No ratings yet
Slides
31 pages
Durability & Availability: Durability Can Be Described As The Probability That You Will Eventually Be
No ratings yet
Durability & Availability: Durability Can Be Described As The Probability That You Will Eventually Be
12 pages
System Design Interview Guide
100% (2)
System Design Interview Guide
91 pages
AWS Certified Solutions Architect Associate Exam Prep Regions, Availability Zones, and Edge Locations
No ratings yet
AWS Certified Solutions Architect Associate Exam Prep Regions, Availability Zones, and Edge Locations
90 pages
AWS Cloud Practitioner Cheat Sheet
100% (7)
AWS Cloud Practitioner Cheat Sheet
12 pages
ch07 Consistency Replication
No ratings yet
ch07 Consistency Replication
30 pages
NoSQL Trends for IT Professionals
No ratings yet
NoSQL Trends for IT Professionals
26 pages
Aws Storage and Edge Processin 1748041998 180511161600
No ratings yet
Aws Storage and Edge Processin 1748041998 180511161600
45 pages
Consistency Models in Distributed Systems
No ratings yet
Consistency Models in Distributed Systems
1 page
Cloud Storage With Amazon S3
No ratings yet
Cloud Storage With Amazon S3
12 pages
Visual Guide To NoSQL Systems - Nathan Hurst's Blog
No ratings yet
Visual Guide To NoSQL Systems - Nathan Hurst's Blog
10 pages
Amazon Aurora: On Avoiding Distributed Consensus For I/Os, Commits, and Membership Changes
No ratings yet
Amazon Aurora: On Avoiding Distributed Consensus For I/Os, Commits, and Membership Changes
8 pages
System Design
No ratings yet
System Design
385 pages
Lecture 11 Cloud Systems
No ratings yet
Lecture 11 Cloud Systems
80 pages
Preamble: Intro To Cloud Computing: Presented By: Aater Suleman, PHD
No ratings yet
Preamble: Intro To Cloud Computing: Presented By: Aater Suleman, PHD
48 pages
Unit 1 ADBMS
No ratings yet
Unit 1 ADBMS
36 pages
Dbms Unit 1 ..
No ratings yet
Dbms Unit 1 ..
24 pages
07 Modeling Guidelines
No ratings yet
07 Modeling Guidelines
24 pages
04 1 Linux
No ratings yet
04 1 Linux
13 pages
Credential
No ratings yet
Credential
17 pages
Quantitative Trading Data by The Numbers WD
No ratings yet
Quantitative Trading Data by The Numbers WD
9 pages
Er Model
No ratings yet
Er Model
69 pages
Unit 5
No ratings yet
Unit 5
60 pages
Markov Chains
No ratings yet
Markov Chains
37 pages
Fundamentals of Data Science NEP
No ratings yet
Fundamentals of Data Science NEP
2 pages
2020 DBMS Mid
No ratings yet
2020 DBMS Mid
2 pages
Data Science - Wikipedia
No ratings yet
Data Science - Wikipedia
6 pages
SQL Practical File for BBA Students
No ratings yet
SQL Practical File for BBA Students
28 pages
Snowflake Cloud Data Warehouse Guide
0% (1)
Snowflake Cloud Data Warehouse Guide
15 pages
Zero Data Loss Recovery Appliance - Deep Dive-Presentation
No ratings yet
Zero Data Loss Recovery Appliance - Deep Dive-Presentation
63 pages
Database Answers To Review Questions
No ratings yet
Database Answers To Review Questions
46 pages
Hashing Techniques in Databases
No ratings yet
Hashing Techniques in Databases
4 pages
SQL DBA - Sai Kumar
No ratings yet
SQL DBA - Sai Kumar
7 pages
PL/SQL Cursor Tutorial Guide
No ratings yet
PL/SQL Cursor Tutorial Guide
8 pages
Mca 3 Sem Database Management Systems Mca503 Dec 2017
No ratings yet
Mca 3 Sem Database Management Systems Mca503 Dec 2017
2 pages
Organisation of Knowledge2
No ratings yet
Organisation of Knowledge2
6 pages
Introduction To Big Data Notes Btech Su
No ratings yet
Introduction To Big Data Notes Btech Su
140 pages
11i Cloning Using Rapid Clone - RAHUL
100% (4)
11i Cloning Using Rapid Clone - RAHUL
27 pages
THE ETL PROCESS - Abboub - Mohamed - El - Mehdi
100% (1)
THE ETL PROCESS - Abboub - Mohamed - El - Mehdi
14 pages
Introduction To Structured Query Language (SQL) : E. F. Codd
No ratings yet
Introduction To Structured Query Language (SQL) : E. F. Codd
32 pages
Adobe Scan 02-Dec-2022 - 221202 - 143343
No ratings yet
Adobe Scan 02-Dec-2022 - 221202 - 143343
13 pages
Bimtek Portal Sata PB Jatim
No ratings yet
Bimtek Portal Sata PB Jatim
20 pages
1904001-DBMS Notes 5 Units
100% (2)
1904001-DBMS Notes 5 Units
70 pages
A Definition of Data Warehousing Market Overview:: Biographical Information Bill Inmon
No ratings yet
A Definition of Data Warehousing Market Overview:: Biographical Information Bill Inmon
85 pages
Final 2 PLSQL
No ratings yet
Final 2 PLSQL
16 pages

Lecture 04 - Cloud Storage

Uploaded by

Lecture 04 - Cloud Storage

Uploaded by

Cloud

Web role Worker role

$500 $500 $500 $500 $500

• SAN, NAS, DAS

• Scale out for capacity

• Storage technology that combines multiple

• Simple, essential storage abstractions:

Blobs Tables Queues

• Three replicas of everything

• You want consistency?

• AP: You are guaranteed get back responses

• In an ideal world there would only be one consistency model;

• Tradeoff to get a consistency update:

• An important observation is that in larger distributed scale

• Different nodes keep replicas and each update is

You might also like