Module 7: Storage
Module overview
Topics Demos
• Amazon Elastic Block Store (Amazon EBS) • Amazon EBS console
• Amazon Simple Storage Service (Amazon • Amazon S3 console
S3) • Amazon EFS console
• Amazon Elastic File System (Amazon EFS) • Amazon S3 Glacier console
• Amazon Simple Storage Service Glacier
Lab
• Working with Amazon EBS
Activities
• Storage solution case study
Knowledge check
2
Module objectives
After completing this module, you should be able to:
• Identify the different types of storage
• Explain Amazon S3
• Identify the functionality in Amazon S3
• Explain Amazon EBS
• Identify the functionality in Amazon EBS
• Perform functions in Amazon EBS to build an Amazon EC2 storage solution
• Explain Amazon EFS
• Identify the functionality in Amazon EFS
• Explain Amazon S3 Glacier
• Identify the functionality in Amazon S3 Glacier
• Differentiate between Amazon EBS, Amazon S3, Amazon EFS, and Amazon S3 Glacier
3
Core AWS services
Amazon S3 Amazon
EBS
Amazon Amazon S3
EFS Glacier
Amazon Virtual Amazon Elastic AWS Identity and Access
Private Cloud Compute Cloud Storage Management (IAM)
(Amazon VPC) (Amazon EC2)
Amazon Relational Amazon
Database Service DynamoDB
Database 4
Module 7: Storage
Section 1: Amazon Elastic Block Store (Amazon EBS)
Storage
Amazon Elastic Block Store
(Amazon EBS)
6
AWS storage options: Block storage versus
object storage
What if you want to change one character in a 1-GB file?
Block storage Object storage
Change one block (piece of the file) Entire file must be updated
that contains the character
7
Amazon EBS
Amazon EBS enables you to create individual storage volumes and attach
them to an Amazon EC2 instance:
• Amazon EBS offers block-level storage.
• Volumes are automatically replicated within its Availability Zone.
• It can be backed up automatically to Amazon S3 through snapshots.
• Uses include –
• Boot volumes and storage for Amazon Elastic Compute Cloud (Amazon EC2)
instances
• Data storage with a file system
• Database hosts
• Enterprise applications
8
Amazon EBS volume types
9
Amazon EBS volume type use cases
10
Amazon EBS features
• Snapshots –
• Point-in-time snapshots
• Recreate a new volume at any time
• Encryption –
• Encrypted Amazon EBS volumes
• No additional cost
• Elasticity –
• Increase capacity
• Change to different types
11
Amazon EBS: Volumes, IOPS, and pricing
1. Volumes –
• Amazon EBS volumes persist independently from the instance.
• All volume types are charged by the amount that is provisioned per month.
2. IOPS –
• General Purpose SSD:
• Charged by the amount that you provision in GB per month until storage is released.
• Magnetic:
• Charged by the number of requests to the volume.
• Provisioned IOPS SSD:
• Charged by the amount that you provision in IOPS (multiplied by the percentage of days
that you provision for the month).
12
Amazon EBS: Snapshots and data transfer
3. Snapshots –
• Added cost of Amazon EBS snapshots to Amazon S3 is per GB-month
of data stored.
4. Data transfer –
• Inbound data transfer is free.
• Outbound data transfer across Regions incurs charges.
13
Amazon EBS features:
Section 1 key
• Persistent and customizable block
takeaways storage for Amazon EC2
• HDD and SSD types
• Replicated in the same Availability
Zone
• Easy and transparent encryption
• Elastic volumes
• Back up by using snapshots
14
Elastic Block Store demo
Recorded demo:
Amazon Elastic
Block Store
15
Lab 4:
Working with
Amazon EBS
16
Lab 4: Scenario
This lab is designed to show you how to create an Amazon EBS volume. After
you create the volume, you will attach the volume to an Amazon EC2
instance, configure the instance to use a virtual disk, create a snapshot and
then restore from the snapshot.
Attached Created
Amazon Amazon Snapshot
EC2 EBS
instance
17
Lab 4: Final product
Attached Created
Amazon Amazon Snapshot
EC2 EBS
instance
18
Lab 4: Working with Amazon EBS
~ 30 minutes
Begin Lab 4: Working
with Amazon EBS
19
Lab debrief:
Key takeaways
20
Module 7: Storage
Section 2: Amazon Simple Storage Service
(Amazon S3)
Storage
Amazon Simple Storage Service
(Amazon S3)
22
Amazon S3 overview
• Data is stored as objects in buckets
• Virtually unlimited storage
• Single object is limited to 5 TB
• Designed for 11 9s of durability
• Granular access to bucket and objects
23
Amazon S3 storage classes
Amazon S3 offers a range of object-level storage classes that are
designed for different use cases:
• Amazon S3 Standard
• Amazon S3 Intelligent-Tiering
• Amazon S3 Standard-Infrequent Access (Amazon S3 Standard-IA)
• Amazon S3 One Zone-Infrequent Access (Amazon S3 One Zone-IA)
• Amazon S3 Glacier
• Amazon S3 Glacier Deep Archive
24
Amazon S3 bucket URLs (two styles)
Amazon S3
To upload your data:
1. Create a bucket in an AWS Region.
2. Upload almost any number of objects to the bucket.
Bucket path-style URL endpoint:
[bucket name] https://s3.ap-northeast-1.amazonaws.com/bucket-name
Region code Bucket name
Bucket virtual hosted-style URL endpoint:
Preview2.mp4 https:// bucket-name.s3-ap-northeast-1.amazonaws.com
Tokyo Region (ap-
northeast-1) Bucket name Region code
25
Data is redundantly stored in the Region
media/welcome.mp4
Facility 1 Facility 2 Facility 3
my-bucket-name
Region
26
Designed for seamless scaling
media/welcome.mp4 prod2.mp4 prod3.mp4 prod4.mp4
prod5.mp4 prod6.mp4 prod7.mp4 prod8.mp4
my-bucket-name
prod9.mp4 prod10.mp4 prod11.mp4 prod12.mp4
27
Access the data anywhere
AWS Management
AWS Command Line SDK
Console
Interface
28
Common use cases
• Storing application assets
• Static web hosting
• Backup and disaster recovery (DR)
• Staging area for big data
• Many more….
29
Amazon S3 common scenarios
• Backup and storage
• Application hosting
• Media hosting Amazon S3 buckets
• Software delivery
Corporate
data center
Amazon
EC2
instances
30
Amazon S3 pricing
• Pay only for what you use, including –
• GBs per month
• Transfer OUT to other Regions
• PUT, COPY, POST, LIST, and GET requests
• You do not pay for –
• Transfers IN to Amazon S3
• Transfers OUT from Amazon S3 to Amazon CloudFront or Amazon EC2 in the same
Region
31
Amazon S3: Storage pricing
To estimate Amazon S3 costs, consider the following:
1. Storage class type –
• Standard storage is designed for:
• 11 9s of durability
• Four 9s of availability
• S3 Standard-Infrequent Access (S-IA) is designed for:
• 11 9s of durability
• Three 9s of availability
2. Amount of storage –
• The number and size of objects
32
Amazon S3: Storage pricing
3. Requests –
• The number and type of requests (GET, PUT, COPY)
• Type of requests:
• Different rates for GET requests than other requests.
4. Data transfer –
• Pricing is based on the amount of data that is transferred out of the
Amazon S3 Region
• Data transfer in is free, but you incur charges for data that is transferred out.
33
• Amazon S3 is a fully managed cloud
Section 2 key storage service.
takeaways • You can store a virtually unlimited
number of objects.
• You pay for only what you use.
• You can access Amazon S3 at any
time from anywhere through a URL.
• Amazon S3 offers rich security
controls.
34
Module 7: Storage
Section 3: Amazon Elastic File System (Amazon EFS)
Storage
Amazon Elastic File System
(Amazon EFS)
36
Amazon EFS features
• File storage in the AWS Cloud
• Works well for big data and analytics, media processing workflows,
content management, web serving, and home directories
• Petabyte-scale, low-latency file system
• Shared storage
• Elastic capacity
• Supports Network File System (NFS) versions 4.0 and 4.1 (NFSv4)
• Compatible with all Linux-based AMIs for Amazon EC2
37
Amazon EFS architecture
VPC
Availability Zone A Availability Zone B Availability Zone C
Private subnet Private subnet Private subnet
Network Network Network
Interface Interface Interface
Private subnet
Network Network
Network
Interface Interface
Interface
Mount target Mount target Mount target
Elastic File System
38
Amazon EFS implementation
1 Create your Amazon EC2 resources and launch your Amazon EC2 instance.
2 Create your Amazon EFS file system.
3 Create your mount targets in the appropriate subnets.
4 Connect your Amazon EC2 instances to the mount targets.
5 Verify the resources and protection of your AWS account.
39
Amazon EFS resources
File system
• Mount target
• Subnet ID
• Security groups
• One or more per file system
• Create in a VPC subnet
• One per Availability Zone
• Must be in the same VPC
• Tags
• Key-value pairs
40
• Amazon EFS provides file storage over a
Section 3 key network.
takeaways • Perfect for big data and analytics, media
processing workflows, content
management, web serving, and home
directories.
• Fully managed service that eliminates
storage administration tasks.
• Accessible from the console, an API, or the
CLI.
• Scales up or down as files are added or
removed and you pay for what you use.
41
Module 7: Storage
Section 4: Amazon S3 Glacier
Storage
Amazon S3 Glacier
43
Amazon S3 Glacier review
Amazon S3 Glacier is a data archiving service that is designed for security,
durability, and an extremely low cost.
• Amazon S3 Glacier is designed to provide 11 9s of durability for objects.
• It supports the encryption of data in transit and at rest through Secure Sockets Layer
(SSL) or Transport Layer Security (TLS).
• The Vault Lock feature enforces compliance through a policy.
• Extremely low-cost design works well for long-term archiving.
• Provides three options for access to archives—expedited, standard, and bulk—retrieval
times range from a few minutes to several hours.
44
Amazon S3 Glacier
• Storage service for low-cost data
archiving and long-term backup
• You can configure lifecycle archiving
Archive after Delete after 5
of Amazon S3 content to Amazon S3 30 days years
Glacier
• Retrieval options – Amazon S3 Amazon S3
Glacier
bucket
• Standard: 3–5 hours Archive
• Bulk: 5–12 hours
• Expedited: 1–5 minutes
45
Amazon S3 Glacier use cases
Media asset archiving
Healthcare information archiving
Regulatory and compliance archiving
Scientific data archiving
Digital preservation
Magnetic tape replacement
46
Using Amazon S3 Glacier
RESTful
web services
Java or .NET
SDKs
Amazon S3 with
lifecycle policies
47
Lifecycle policies
Amazon S3 lifecycle policies enable you to delete or move objects based on
age.
Amazon S3 Amazon S3 Amazon Delete
Standard Standard - S3 Glacier
Infrequent Access
30 days 60 days 365 days
Preview2.mp4 Preview2.mp4 Preview2.mp4
48
Storage comparison
Amazon S3 Amazon S3 Glacier
Data Volume No limit No limit
Average Latency ms minutes/hours
Item Size 5 TB maximum 40 TB maximum
Cost/GB per Month Higher cost Lower cost
PUT, COPY, POST,
Billed Requests UPLOAD and retrieval
LIST, and GET
¢ ¢¢
Retrieval Pricing
Per request Per request and per GB
49
Server-side encryption
Corporate AWS Cloud
data center
https Your Applications on
Amazon EC2 Amazon EC2
AWS Cloud
Data is encrypted Your application must
by default enable server-side encryption
Amazon S3 Glacier Amazon S3
50
Security with Amazon S3 Glacier
Control access with
IAM
Amazon S3 Glacier encrypts
your data with AES-256
Amazon S3
Glacier
Amazon S3 Glacier manages
your keys for you
51
• Amazon S3 Glacier is a data archiving
Section 4 key service that is designed for security,
takeaways durability, and an extremely low
cost.
• Amazon S3 Glacier pricing is based
on Region.
• Its extremely low-cost design works
well for long-term archiving.
• The service is designed to provide 11
9s of durability for objects.
52
Activity: Storage Case
Studies
Photo by Pixabay from Pexels.
Photo by panumas nikhomkhai from Pexels.
53
Storage case study activity
Case 1: A data analytics company for travel sites must store billions of customer events per day. They use
the data analytics services that are in the diagram. The following diagram illustrates their architecture.
Amazon API Gateway Amazon Kinesis AWS Lambda
Amazon Kinesis Data
Firehose
Amazon Elastic Container Amazon Kinesis
Service
Storage ??
54
Storage case study activity
Case 2: A collaboration software company processes email for enterprise customers. They have more than
250 enterprise customers and more than half a million users. They must store petabytes of data for their
customers. The following diagram illustrates their architecture.
Elastic Load Balancing
Corporate data
center
Storage ??
Amazon EC2 instances
55
Storage case study activity
Case 3: A financial data processing and consulting company must store large amounts of data for
compliance reasons. They use Amazon Kinesis for processing the data and Amazon Redshift for analysis. The
following diagram illustrates their architecture.
Amazon Kinesis Data
Firehose
Amazon Kinesis AWS Lambda
Amazon Redshift
Clients
Storage ??
56
Module 7: Storage
Module wrap-up
Module summary
In summary, in this module, you learned how to:
• Identify the different types of storage
• Explain Amazon S3
• Identify the functionality in Amazon S3
• Explain Amazon EBS
• Identify the functionality in Amazon EBS
• Perform functions in Amazon EBS to build an Amazon EC2 storage solution
• Explain Amazon EFS
• Identify the functionality in Amazon EFS
• Explain Amazon S3 Glacier
• Identify the functionality in Amazon S3 Glacier
• Differentiate between Amazon EBS, Amazon S3, Amazon EFS, and Amazon S3 Glacier
58
Complete the knowledge check
59
Sample exam question
A company wants to store data that is not frequently accessed. What is the best and cost-
effective solution that should be considered?
A. AWS Storage Gateway
B. Amazon Simple Storage Service Glacier
C. Amazon Elastic Block Store (Amazon EBS)
D. Amazon Simple Storage Service (Amazon S3)
60
Additional resources
• AWS Storage page
• Storage Overview
• Recovering files from an Amazon EBS volume backup
• Confused by AWS Storage Options? S3, EFS, EBS Explained
61