Digital data/Types of Data/Classification of data
Data Classification:
Process of classifying data in relevant categories so that it can be used or applied more
efficiently. The classification of data makes it easy for the user to retrieve it. Data classification
holds its importance when comes to data security and compliance and also to meet different
types of business or personal objective. It is also of major requirement, as data must be easily
retrievable within a specific period of time.
Types of Data Classification:
Data can be broadly classified into 3 types.
1. Structured Data:
Structured data is created using a fixed schema and is maintained in tabular format. The elements
in structured data are addressable for effective analysis. It contains all the data which can be
stored in the SQL database in a tabular format. Today, most of the data is developed and
processed in the simplest way to manage information.
Examples –
Relational data, Geo-location, credit card numbers, addresses, etc.
Consider an example for Relational Data like you have to maintain a record of students for a
university like the name of the student, ID of a student, address, and Email of the student. To
store the record of students used the following relational schema and table for the same.
S_I S_Nam S_Addres
S_Email
D e s
A@gmail.co
1001 A Delhi
m
B@gmail.co
1002 B Mumbai
m
2. Unstructured Data:
It is defined as the data which does not follow a pre-defined standard or any organized format.
This kind of data is also not fit for the relational database because in the relational database you
will see a pre-defined manner or you can say organized way of data. Unstructured data is also
very important for the big data domain and To manage and store Unstructured data there are
many platforms to handle it like No-SQL Database.
Examples –
Word, PDF, text, media logs, etc.
3. Semi-Structured Data :
Semi-structured data is information that does not reside in a relational database but that have
some organizational properties that make it easier to analyze. With some process, you can store
them in a relational database but is very hard for some kind of semi-structured data, but semi-
structured exist to ease space.
Example –
XML data.
Sources of Data
Industry
Data
Social
Media Public
Data
BIG DATA
Transactions Sensor Data
Unit of Measure Approximate size Mathematical Representation
Kilobyte(KB) 103 210
Megabyte(MB) 106 220
Gigabyte(GB) 109 230
Terabyte(TB) 1012 240
Petabyte(PT) 1015 250
Exabyte(EB) 1018 260
Zetabyte(ZB) 1021 270
Yottabyte(YB) 1024 280