MODULE 1
INTRODUCTION TO
INFORMATION
STORAGE
EMC Proven Professional. Copyright 2012 EMC Corporation. All Rights Reserved.
Module 1: Introduction to Information Storage
Module 1: Introduction to Information
Storage
Upon completion of this module, you should be able to:
Define data and information
Describe types of data
Describe the evolution of storage architecture
Describe the core elements of a data center
List the key characteristics of data center
Provide an overview of virtualization and cloud computing
EMC Proven Professional. Copyright 2012 EMC Corporation. All Rights Reserved.
Module 1: Introduction to Information Storage
Why Information Storage and Management?
Information is the knowledge derived from data
Growth of digital information has resulted in information
explosion
We live in an on-command, on-demand world
We need information when and where required
Increasing dependency on fast and reliable access to information
Businesses seek to store, protect, optimize, and leverage the
information
To gain competitive advantage
To derive new business opportunity
EMC Proven Professional. Copyright 2012 EMC Corporation. All Rights Reserved.
Module 1: Introduction to Information Storage
What is Data?
Data
It is a collection of raw facts from which conclusions may be drawn.
Data is converted into more
convenient form digital data
Factors for digital data growth
are:
Digital Movie
Movie
Digital Photo
Photo
Increase in data-processing
capabilities
Lower cost of digital storage
Affordable and faster
communication technology
Proliferation of applications and
smart devices
EMC Proven Professional. Copyright 2012 EMC Corporation. All Rights Reserved.
e-Book
Book
email
Letter
10101011010
00010101011
01010101010
10101011010
00010101011
01010101010
10101010101
01010101010
10101010101
Digital Data
Module 1: Introduction to Information Storage
Types of Data
Data can be classified as:
Structured
Unstructured
PDFs
email Attachments
Unstructured (90%)
Majority of data being
X-rays
created is unstructured
Manuals
Images
Forms
Contracts
Instant Messages
Documents
Web Pages
Rich Media
Invoices
Structured (10%)
Audio, Video
Database
EMC Proven Professional. Copyright 2012 EMC Corporation. All Rights Reserved.
Module 1: Introduction to Information Storage
Big Data
Big Data
It refers to data sets whose sizes are beyond the ability of commonly used
software tools to capture, store, manage, and process within acceptable
time limits.
Includes both structured and unstructured data generated by
variety of sources
Big data analysis in real time requires new techniques and tools
that provide:
High performance
Massively parallel processing (MPP) data platforms
Advanced analytics
Big data analytics provide an opportunity to translate large
volumes of data into right decisions
EMC Proven Professional. Copyright 2012 EMC Corporation. All Rights Reserved.
Module 1: Introduction to Information Storage
Storage
Stores data created by individuals and organizations
Provides access to data for further processing
Examples of storage devices are:
Media card in a cell phone or digital camera
DVDs, CD-ROMs
Disk drives
Disk arrays
Tapes
EMC Proven Professional. Copyright 2012 EMC Corporation. All Rights Reserved.
Module 1: Introduction to Information Storage
Evolution of Storage Architecture
Department 1
Server
Department 1
Server
Department 2
Server
Department 2
Server
Department 3
Server
Department 3
Server
Storage
Network
Server-centric Storage Architecture
Storage Device
Information-centric Storage Architecture
EMC Proven Professional. Copyright 2012 EMC Corporation. All Rights Reserved.
Module 1: Introduction to Information Storage
Data Center
Data Center
It is a facility that contains storage, compute, network, and other IT
resources to provide centralized data-processing capabilities.
Core elements of a data center
Application
Database management system (DBMS)
Host or Compute
Network
Storage
These core elements work together to address data-processing
requirements
EMC Proven Professional. Copyright 2012 EMC Corporation. All Rights Reserved.
Module 1: Introduction to Information Storage
Data Center: Online Order Transaction System
Example
Storage Array
Host/
Compute
Client
Storage
Network
LAN/WAN
User
Interface
OS and DBMS
EMC Proven Professional. Copyright 2012 EMC Corporation. All Rights Reserved.
Module 1: Introduction to Information Storage 10
Key Characteristics of a Data Center
Availability
Security
Data Integrity
Manageability
Capacity
Performance
Scalability
EMC Proven Professional. Copyright 2012 EMC Corporation. All Rights Reserved.
Module 1: Introduction to Information Storage 11
Managing Data Center
Key management activities include
Monitoring
Continuous process of gathering information on various elements
and services running in a data center
Reporting
Details on resource performance, capacity, and utilization
Provisioning
Configuration and allocation of resources to meet the capacity,
availability, performance, and security requirements
Virtualization and cloud computing have changed the way data
center infrastructure resources are provisioned and managed
EMC Proven Professional. Copyright 2012 EMC Corporation. All Rights Reserved.
Module 1: Introduction to Information Storage 12
Virtualization: An Overview
Virtualization is a technique of abstracting physical resources and
making them appear as logical resources
For example partitioning of raw disks
Pools physical resources and provides an aggregated view of
physical resource capabilities
Virtual resources can be created from pooled physical resources
Improves utilization of physical IT resources
EMC Proven Professional. Copyright 2012 EMC Corporation. All Rights Reserved.
Module 1: Introduction to Information Storage 13
Cloud Computing: An Overview
Enables individuals and organizations to use IT resources as a
service over network
Enables self-service requesting and automates requestfulfillment process
Enables users to scale up or scale down the usage of computing
resources quickly
Enables consumption-based metering
Consumers pay only for the resources they use
Example: CPU hours used, amount of data transferred, and Gigabytes
of data stored
EMC Proven Professional. Copyright 2012 EMC Corporation. All Rights Reserved.
Module 1: Introduction to Information Storage 14
Module 1: Summary
Key points covered in this module:
Data and information
Types of data
Big data
Evolution of storage architecture
Core elements of data center
Key characteristics of data center
Virtualization and cloud computing
EMC Proven Professional. Copyright 2012 EMC Corporation. All Rights Reserved.
Module 1: Introduction to Information Storage 15