Big Data and the Information Management
System
Lesson Objectives
After completing this lesson, you should be able to:
• Define the term Big Data
• Identify the challenges and opportunities in implementing
Big Data
• Describe the Oracle Information Management Architecture
for Big Data
1-2
1
A Strategic IM Perspective
Information Management
Information Management is the means by which an
organization maximizes the efficiency and value with
which it plans, collects, organizes, uses, controls,
stores, and disseminates its Information.
1-3
OLTP vs OLAP
1-4
2
OLTP vs OLAP
1-5
ETL (Extract, Transform, Load)
1-6
3
OLTP vs OLAP
1-7
Third Normal Form (3NF)
Third normal form (3NF) is used in normalizing a database design to reduce
the duplication of data. 3NF data modeling was ideal for online transaction
processing (OLTP) applications.
1-8
4
Star Schema
star schema is the simplest style of data mart schema and is the
approach most widely used to develop data warehouses and dimensional
data marts. The star schema consists of one or more fact tables
referencing any number of dimension tables.
1-9
Evolution of Big Data
• More people interacting with data
– Smartphones
– Internet
• Greater volumes of data being generated (machine-to-
machine generation)
– Sensors
– General Packet Radio Services (GPRS)
1 - 10
5
Big Data
Big Data is a term often used to describe data sets whose size
is beyond the capability of commonly used software tools to
capture, manage, and process.
Big Data can be generated from many different sources,
including:
• Social networks
• Banking and financial services
• E-commerce services
• Web-centric services
• Internet search indexes
• Scientific and document searches
• Medical records
• Web logs
1 - 11
Characteristics of Big Data
Volume Velocity
Social Networks
RSS Feeds Microblogs
Variety Value
1 - 12
6
Importance of Big Data
6%
8% 18%
Extremely Important
Somewhat Important
Very Important
Not Important Today
Don't know, Unsure
38% 30%
1 - 13
Big Data Opportunities: Some Examples
Today’s Challenge New Data What's Possible?
Preventive care, reduced
Healthcare: Remote patient
hospitalization,
Expensive office visits monitoring
epidemiological studies
Manufacturing: Automated and predictive
Product sensors
In-person support diagnosis and support
Geo-advertising,
Location-based services:
Real-time location data personalized notifications
Based on home ZIP code
and search
Increased availability,
Utilities: Detailed consumption
reduced cost, tiered
Complex distribution grid statistics
metering plans
1 - 14
7
Big Data Challenges
Schematize? Analyze? Processing? Governance?
1 - 15
Extending the Boundaries of Information
Management
• How can Big Data technologies be applied to
create additional business value or reduce
the costs of delivering Information
Management?
• Bridge Big Data and traditional relational
database worlds by integrating structured,
semi-structured, and unstructured
information.
• Augment Big Data analysis techniques with a
Business Intelligence, and Data Warehousing
technologies.
1 - 16
8
Summary
In this lesson, you should have learned how to:
• Define the term Big Data
• Identify the challenges and opportunities in implementing
big data
• Describe the Information Management Architecture for Big
Data
1 - 17