Lecture 2 Data Mining

Here's a 100-word document on data mining: *Data Mining:* Data mining is the process of discovering patterns, relationships, and insights from large datasets. It involves using statistical and mathematical techniques to analyze and extract valuable information from data. Data mining helps organizations make informed decisions, predict future trends, and improve business outcomes. *Key Steps:* 1. *Data Collection:* Gathering data from various sources. 2. *Data Preprocessing:* Cleaning and pre

Uploaded by

MUHAMMAD SHEHZAD

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views6 pages

Lecture 2 Data Mining

Uploaded by

MUHAMMAD SHEHZAD

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 6

Lecture 2: Data

Preprocessing
PRESENTED BY : HALIMA TAHIR
Data Preprocessing

 Before using data for analysis or building models, we need to prepare it

properly. Raw data is often messy, incomplete, or scattered. Data
preprocessing is like cleaning and organizing your room before starting
work — it makes data ready for use.
It mainly includes the following steps:
1. Data Representation
2. Data Summarization
3. Data Cleaning
4. Data Integration and Transformation
Data Representation

 This means how data is stored and shown.

 Example: Numbers, text, images, tables, graphs, etc.
 If the data is not in a useful form, we convert it into a standard format so computers
can understand.
 👉 Think of it as writing notes neatly in one notebook instead of random papers.
Data Summarization

 Data can be huge, so we make summaries to understand it better.

 Example: Instead of keeping marks of 1,000 students, we calculate
average marks, highest marks, and lowest marks.
 Helps to quickly see patterns without going through all data.
 👉 Like making short notes from a big chapter.
Data Cleaning

 Real-world data usually has mistakes or missing values.

 Example:
 Some entries are empty (missing age).
 Some are wrong (age written as 500).
 Some are duplicates (same person added twice).
 In cleaning, we fix errors, fill missing values, and remove duplicates.
 👉 It’s like washing vegetables before cooking.
Data Integration and Transformation

 Data often comes from many different sources (databases, Excel files,
websites).
 We combine (integrate) them into one dataset.
 Transformation means changing the data into a common format.
 Example: Changing all dates to the same style (DD/MM/YYYY).
 Scaling numbers (marks out of 100 converted to percentage).
 👉 It’s like collecting ingredients from different shops and then
cutting/adjusting them before cooking.

Lecture 2 DM
No ratings yet
Lecture 2 DM
11 pages
Data Science PPT Module 1
100% (1)
Data Science PPT Module 1
24 pages
Lec 9
No ratings yet
Lec 9
1 page
U1 - DA - Data Preprocessing
No ratings yet
U1 - DA - Data Preprocessing
6 pages
Data Handling and Visualization 3rd Unit
No ratings yet
Data Handling and Visualization 3rd Unit
4 pages
Unit II (DWDM)
No ratings yet
Unit II (DWDM)
19 pages
Data Cleaning and Preparation
No ratings yet
Data Cleaning and Preparation
20 pages
Data Munging for Data Scientists
No ratings yet
Data Munging for Data Scientists
54 pages
Unit 2 Data Gathering
No ratings yet
Unit 2 Data Gathering
14 pages
CS322 - Lec 3 - S25
No ratings yet
CS322 - Lec 3 - S25
42 pages
DM Unit 1
No ratings yet
DM Unit 1
18 pages
Data Preprocessing Techniques Guide
No ratings yet
Data Preprocessing Techniques Guide
32 pages
Chapter 3& 4
No ratings yet
Chapter 3& 4
60 pages
Introduction To Data Science: Data Science Methodology & Data Preparation DR Shuhaida Mohamed Shuhidan Jan 2025
No ratings yet
Introduction To Data Science: Data Science Methodology & Data Preparation DR Shuhaida Mohamed Shuhidan Jan 2025
34 pages
UNIT 2 Data Warehousing
No ratings yet
UNIT 2 Data Warehousing
45 pages
Introduction To Data Science 1-2-2025
No ratings yet
Introduction To Data Science 1-2-2025
14 pages
Unit 2 Preprocessing
No ratings yet
Unit 2 Preprocessing
39 pages
Data Cleaning Preprocessing
No ratings yet
Data Cleaning Preprocessing
28 pages
U2L1
No ratings yet
U2L1
11 pages
Data Migration Process Infographics by Slidesgo
No ratings yet
Data Migration Process Infographics by Slidesgo
9 pages
DM Unit 3
No ratings yet
DM Unit 3
15 pages
Foundations of Data Science
No ratings yet
Foundations of Data Science
139 pages
Data Pre-processing Guide
No ratings yet
Data Pre-processing Guide
8 pages
3 DSEngineering
No ratings yet
3 DSEngineering
64 pages
633777800398832500ata Minig Presentation
No ratings yet
633777800398832500ata Minig Presentation
20 pages
Data Preprocessing Techniques Guide
No ratings yet
Data Preprocessing Techniques Guide
20 pages
Data Preprocessing Essentials
No ratings yet
Data Preprocessing Essentials
9 pages
Data Preprocessing Essentials
No ratings yet
Data Preprocessing Essentials
41 pages
Pre Processing
No ratings yet
Pre Processing
43 pages
Data Mining - Lecture 2
No ratings yet
Data Mining - Lecture 2
23 pages
Ch03 DS-Unit-2 ABM Final
No ratings yet
Ch03 DS-Unit-2 ABM Final
143 pages
DS-Unit-2 ABM Final
No ratings yet
DS-Unit-2 ABM Final
134 pages
Cours Preprocessing
No ratings yet
Cours Preprocessing
23 pages
Lesson 7 Data Description and Diagnostics
No ratings yet
Lesson 7 Data Description and Diagnostics
14 pages
UNIT - 2 .DataScience 04.09.18
No ratings yet
UNIT - 2 .DataScience 04.09.18
53 pages
Data Preprocessing AND Data Cleansing: By-Ahtesham Ullah Khan 1604610013 CS-3 Yr
No ratings yet
Data Preprocessing AND Data Cleansing: By-Ahtesham Ullah Khan 1604610013 CS-3 Yr
12 pages
UNIT - Introduction - DataScience - New
No ratings yet
UNIT - Introduction - DataScience - New
55 pages
DM Chapter 3
No ratings yet
DM Chapter 3
60 pages
Pre Processing
No ratings yet
Pre Processing
68 pages
Data Preprocessing
No ratings yet
Data Preprocessing
4 pages
DWM
No ratings yet
DWM
14 pages
7.data Preprocessing
No ratings yet
7.data Preprocessing
12 pages
Unit 2
No ratings yet
Unit 2
16 pages
Intro To Data Analytics - Cleanup & Transformation
No ratings yet
Intro To Data Analytics - Cleanup & Transformation
30 pages
BI Unit 4 Final
No ratings yet
BI Unit 4 Final
2 pages
Session-2-CO3-Introduction To Data Preprocessing
No ratings yet
Session-2-CO3-Introduction To Data Preprocessing
39 pages
BA-Unit 2
No ratings yet
BA-Unit 2
31 pages
Data Preprocessing: Clean, Transform, Integrate
No ratings yet
Data Preprocessing: Clean, Transform, Integrate
6 pages
Lec 2
No ratings yet
Lec 2
14 pages
21BCAD5C01 IDA Module 2 Notes
No ratings yet
21BCAD5C01 IDA Module 2 Notes
16 pages
DS Unit 2
No ratings yet
DS Unit 2
23 pages
Data Mining for Tech Enthusiasts
No ratings yet
Data Mining for Tech Enthusiasts
61 pages
Data Preprocessing Part 1
No ratings yet
Data Preprocessing Part 1
14 pages
Data Processing
No ratings yet
Data Processing
14 pages
Lecture 3 Unit 1
No ratings yet
Lecture 3 Unit 1
61 pages
Chapter 2
No ratings yet
Chapter 2
22 pages
Data Mining
No ratings yet
Data Mining
22 pages

Lecture 2 Data Mining

Uploaded by

Lecture 2 Data Mining

Uploaded by

Lecture 2: Data

 Before using data for analysis or building models, we need to prepare it

 This means how data is stored and shown.

 Data can be huge, so we make summaries to understand it better.

 Real-world data usually has mistakes or missing values.

You might also like