DATA ANALYTICS
Avinash Seekoli
WHAT IS DATA
Data refers to raw facts, figures, or details that can be processed or analyzed to
provide meaningful information. Data can come in various forms, such as
numbers, text, images, audio, or video, and it serves as the foundation for
generating insights and making decisions.
EXAMPLES OF DATA
1. Numerical Data:
○ Temperature readings: 25°C, 30°C, 28°C
○ Sales figures: $500, $1000, $1500
2. Textual Data:
○ Customer reviews: "The product is excellent," "Delivery
was slow"
○ Names: "John", "Jane", "Ali"
3. Categorical Data:
○ Gender: Male, Female, Non-binary
○ Colors: Red, Blue, Green
4. Time Series Data:
○ Stock prices over time: Day 1: $100, Day 2: $102, Day 3: $101
○ Monthly rainfall: Jan: 50mm, Feb: 70mm
5. Image Data:
○ A photograph of a cat
○ MRI scan images
6. Audio Data:
○ A recording of someone speaking
○ Music files (e.g., MP3)
TYPES OF DATA
Data can be measured on different scales depending on its nature. Here are the
primary categories and how they are measured:
Nominal data:
Nominal data is a basic data type that categorizes data by labeling or
naming values, such as gender, hair color, or types of animals. It has no
inherent order or hierarchy.
Ordinal data:
Ordinal data involves classifying data based on rank, such as social status
in categories like ‘wealthy’, ‘middle income’, or ‘poor’. However, there are
no set intervals between these categories.
Interval data:
Interval data has meaningful intervals between values, but there is no true zero point.
The difference between 20°C and 30°C is the same as between 30°C and 40°C (a 10-degree
difference), making intervals meaningful.
However, 0°C does not mean the absence of temperature—it’s just a point on the scale.
Ratio data:
Ratio data has both meaningful intervals between values and a true zero point.
A height of 0 cm means no height at all, making zero meaningful. It makes sense to say that a
person who is 180 cm tall is twice as tall as a person who is 90 cm, as the ratio is meaningful.
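As a quick sketch of how these four scales behave in code (Python with pandas; all values are invented for illustration):

# Sketch of the four measurement scales (all values made up).
import pandas as pd

# Nominal: labels with no order.
hair = pd.Categorical(["black", "brown", "red"], ordered=False)

# Ordinal: ranked categories, but the gaps between ranks are undefined.
status = pd.Categorical(["poor", "wealthy", "middle income"],
                        categories=["poor", "middle income", "wealthy"],
                        ordered=True)
print(status.min(), status.max())        # ordering is meaningful: poor wealthy

# Interval: differences are meaningful, but 0 is not "nothing".
temps_c = pd.Series([20, 30, 40])
print(temps_c.diff().tolist())           # [nan, 10.0, 10.0] -- equal intervals

# Ratio: differences AND ratios are meaningful because 0 means "none".
heights_cm = pd.Series([90, 180])
print(heights_cm.iloc[1] / heights_cm.iloc[0])   # 2.0 -- twice as tall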
WHAT IS DATA ANALYTICS
Data analytics is the process of drawing conclusions from raw data.
It is valuable to businesses because it helps a company make decisions
based on those conclusions.
In essence, data analytics converts large volumes of figures into plain
English, i.e., conclusions that decision-makers can understand and act on.
DATA SCIENCE
Data Science is a field that extracts meaningful information and insights by applying
algorithms, preprocessing, and scientific methods to structured and unstructured data.
The field is closely related to Artificial Intelligence.
Data Science is used in almost every industry today to predict customer behavior and
trends and to identify new opportunities.
Businesses can use it to make informed decisions about product development and marketing.
It is used as a tool to detect fraud and optimize processes.
Governments also use Data Science to improve efficiency in the delivery of public services.
TYPES OF DATA ANALYTICS
Descriptive Analytics tells you what happened in the past.
Diagnostic Analytics helps you understand why something happened in the past.
Predictive Analytics predicts what is most likely to happen in the future.
Prescriptive Analytics recommends actions you can take to affect those
outcomes.
Descriptive analysis. This step, also known as data mining, is the most common
method of data analysis: large sets of data are captured and analyzed for
patterns that help specialists gain deeper insight into business processes. This kind
of analysis answers key statistical questions, such as how much revenue the
business is generating, how many customers visit on average, and how much
profit the business is making.
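A minimal sketch of such a descriptive summary in Python with pandas; the table and its figures are invented for illustration:

# Descriptive analytics sketch: summarize what happened (made-up data).
import pandas as pd

sales = pd.DataFrame({
    "day":      ["Mon", "Tue", "Wed", "Thu", "Fri"],
    "revenue":  [500, 1000, 1500, 1200, 800],
    "visitors": [40, 75, 110, 90, 60],
})

print("Total revenue:   ", sales["revenue"].sum())     # what happened
print("Average visitors:", sales["visitors"].mean())   # typical footfall
print(sales.describe())                                # standard summary stats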
Diagnostic or inferential analysis. As the name suggests, diagnostic (or inferential)
analysis determines the root cause of current problems. It involves using data to find
out exactly how and why a business process failed.
Predictive analysis. By utilizing previous data, specialists can use this
process to estimate what will likely happen in the future. These predictions
are made on the basis of historical data and past consumer trends.
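A toy sketch of the idea in Python: fit a simple trend to invented historical figures, then extrapolate one step ahead:

# Predictive analytics sketch: fit a trend to past data, extrapolate (toy data).
import numpy as np

days  = np.array([1, 2, 3, 4, 5])
sales = np.array([100, 102, 101, 105, 107])

slope, intercept = np.polyfit(days, sales, 1)   # simple linear trend
forecast_day6 = slope * 6 + intercept
print(f"Forecast for day 6: {forecast_day6:.1f}")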
Prescriptive analysis. This helps specialists gain a statistical perspective on
an important business decision. Is it the right time to launch a new product?
Prescriptive analysis will answer that question. Can we afford to scale up
right now? This type of analysis will help you find out.
BIG DATA
● Big Data is the field of collecting large data sets from sources such as social media,
GPS, and sensors, analyzing them systematically, and extracting useful patterns with
specialized tools and techniques.
● Much of this data is generated by social media sites like Facebook, Instagram, Twitter,
etc.; other sources include e-business and e-commerce transactions and hospital, school,
and bank records.
● Such data is impossible to manage with traditional data storage techniques, so Big Data
emerged to handle data that is large and messy.
● Before the data can be analyzed, a data architect must design the data architecture.
DATA ARCHITECTURE DESIGN AND DATA MANAGEMENT
Data architecture design is a set of policies, rules, models, and standards that governs
what type of data is collected, where it is collected from, how the collected data is
arranged, and how it is stored, used, and secured in systems and data warehouses for
further analysis.
Data is one of the essential pillars of enterprise architecture, through which an
organization succeeds in executing its business strategy.
DATA ARCHITECTURE COMPONENTS
Data integration is the process of combining data from
different sources and providing users with a unified view.
This involves consolidating, transforming, and cleaning
data to make it consistent, reliable, and usable for
analysis, reporting, or business processes.
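A minimal sketch of data integration with pandas; the two sources and their fields are hypothetical:

# Data integration sketch: unify two hypothetical sources (made-up records).
import pandas as pd

crm    = pd.DataFrame({"customer_id": [1, 2], "name": ["Ali ", "jane"]})
orders = pd.DataFrame({"customer_id": [1, 2], "total": [500, 1500]})

# Clean and standardize before combining, then join on a shared key.
crm["name"] = crm["name"].str.strip().str.title()
unified = crm.merge(orders, on="customer_id", how="left")
print(unified)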
A data lake is a centralized repository that allows you to
store all your structured, semi-structured, and
unstructured data at any scale. Unlike a traditional data
warehouse, which stores processed and refined data, a
data lake stores raw data in its native format until it’s
needed for analytics or processing.
A data mart is a smaller, specialized subset of a
data warehouse designed for a specific department
or business function, such as sales or finance. It
allows users to quickly access relevant data without
needing to go through the entire data warehouse.
Metadata is the "data about data," offering key
information to help describe, manage, and organize
data across systems. It plays a crucial role in
enabling data discovery, management, and
security across various applications and
industries.
Metadata is information that describes other
data, providing context and details about its
characteristics. In real-time, it helps you
understand key aspects like:
• What the data is (e.g., file name, type, title).
• Who created or owns it (e.g., author, owner).
• When it was created or modified (e.g.,
timestamps).
• How it’s formatted or structured (e.g., file size,
format).
• Permissions (e.g., access rights, usage
restrictions).
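As a quick sketch, Python's standard library can surface exactly this kind of file-level metadata (the file path here is hypothetical):

# Reading file-level metadata with the standard library (hypothetical path).
from pathlib import Path
from datetime import datetime

p = Path("report.csv")                    # assumed to exist for this sketch
info = p.stat()
print("What:    ", p.name, p.suffix)                       # name and type
print("When:    ", datetime.fromtimestamp(info.st_mtime))  # last modified
print("How big: ", info.st_size, "bytes")                  # size
print("Perms:   ", oct(info.st_mode)[-3:])                 # access rights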
Two related concepts often appear alongside metadata:
• Master Data: Used to identify consistent
information (e.g., customer profiles in CRM).
• Reference Data: Provides fixed classifications
(e.g., currency code validation in transactions).
Both help ensure data consistency, quality, and
accuracy across systems in real-time operations.
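A tiny sketch of the reference-data idea, validating transactions against a fixed classification; the currency set below is illustrative, not exhaustive:

# Reference data sketch: validate against a fixed classification (partial list).
VALID_CURRENCIES = {"USD", "EUR", "GBP", "INR"}   # illustrative reference set

def validate_currency(code: str) -> bool:
    """Return True if the transaction's currency code is a known value."""
    return code.upper() in VALID_CURRENCIES

print(validate_currency("usd"))   # True
print(validate_currency("XYZ"))   # False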
Data quality refers to the accuracy, completeness, reliability, and relevance of data. High-
quality data is essential for making sound decisions, ensuring that the data is fit for its
intended purpose.
Data governance is the framework of rules, policies, and procedures that ensure data is
managed properly, securely, and used consistently across an organization. It includes defining
who has authority over data and how it should be handled.
Real-time example: A bank implements data governance to ensure customer information is
accurate and secure. When a customer updates their contact details, the bank’s system verifies
and standardizes the data before it is distributed to various departments, ensuring consistency
and compliance with privacy regulations in real time.
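A hedged sketch of that verify-and-standardize step in Python; the formatting rules below are invented for illustration:

# Governance sketch: verify and standardize a contact detail before it is
# distributed to other departments (validation rules are invented).
import re

def standardize_phone(raw: str) -> str:
    digits = re.sub(r"\D", "", raw)       # keep digits only
    if len(digits) != 10:
        raise ValueError(f"invalid phone number: {raw!r}")
    return f"({digits[:3]}) {digits[3:6]}-{digits[6:]}"

print(standardize_phone("555-123 4567"))  # -> (555) 123-4567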
Data privacy refers to the protection and proper
handling of personal or sensitive information,
ensuring it is collected, stored, and shared in ways
that safeguard individuals' rights and prevent
unauthorized access or misuse.
DATA QUALITY
Data quality refers to how reliable, accurate, and usable your data is for analysis.
Poor data quality can lead to incorrect conclusions, so it’s essential to address
common issues like noise, outliers, missing values, and duplicate data. Here’s a
breakdown in simple terms:
1. Noise
- Definition: Noisy data is meaningless or corrupted data that can’t be interpreted
by machines. It can be generated by faulty data collection, data entry errors, etc.
- Example: If a sensor reading shows random spikes due to interference, that’s
noise.
- Impact: It makes your data less accurate and harder to interpret.
- Solution: Use filtering techniques to remove or reduce noise (see the sketch below).
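A small sketch of one such filter: a rolling median that smooths an invented noisy sensor series:

# Noise reduction sketch: smooth sensor spikes with a rolling median (toy data).
import pandas as pd

readings = pd.Series([25, 26, 90, 25, 24, 26, 25])   # 90 is an interference spike
smoothed = readings.rolling(window=3, center=True).median()
print(smoothed.tolist())   # the spike is suppressed (ends stay NaN)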
2. Outliers
- Definition: Data points that are very different from the rest of the data.
- Example: In a group of people’s ages (20, 21, 22, 85), the age 85 might be
an outlier.
- Impact: Outliers can skew averages and distort models built on the data.
- Solution: Detect them, for example with the IQR rule sketched below, and
decide whether to remove, cap, or keep them.
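A minimal sketch of the IQR rule applied to the ages from the example:

# Outlier detection sketch using the IQR (interquartile range) rule.
import pandas as pd

ages = pd.Series([20, 21, 22, 85])
q1, q3 = ages.quantile(0.25), ages.quantile(0.75)
iqr = q3 - q1
outliers = ages[(ages < q1 - 1.5 * iqr) | (ages > q3 + 1.5 * iqr)]
print(outliers.tolist())   # [85]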
3. Missing Values
- Definition: When some data points are absent or not recorded.
- Example: In a survey, some respondents might skip answering certain
questions.
- Impact: Missing values can affect the accuracy of the results or analysis.
- Solution: You can either ignore missing data, fill in missing values (e.g., with
averages, as sketched below), or use algorithms that can handle missing data.
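A short sketch of the fill-with-average option in pandas (values invented):

# Missing values sketch: fill gaps with the column average (toy data).
import numpy as np
import pandas as pd

answers = pd.Series([4.0, np.nan, 5.0, 3.0])   # one respondent skipped
filled = answers.fillna(answers.mean())        # mean of 4, 5, 3 is 4.0
print(filled.tolist())                         # [4.0, 4.0, 5.0, 3.0]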
4. Duplicate Data
- Definition: When the same data is recorded more than once.
- Example: A customer may accidentally be registered twice in a database with
slight variations in their name.
- Impact: It can inflate results and cause incorrect conclusions.
- Solution: Detect and remove duplicates (see the sketch below) to keep data accurate.
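A minimal sketch of detecting and removing duplicates with pandas; the records are made up:

# Duplicate data sketch: normalize near-duplicates, then drop repeats.
import pandas as pd

customers = pd.DataFrame({
    "name":  ["John Smith", "john smith", "Jane Doe"],
    "email": ["john@x.com", "john@x.com", "jane@x.com"],
})

customers["name"] = customers["name"].str.title()      # normalize names
deduped = customers.drop_duplicates(subset=["email"])  # dedupe on a stable key
print(deduped)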
Addressing these issues ensures that your data is clean, consistent, and ready for
reliable analysis.