KDD Vs Data Mining

KDD refers to the overall process of discovering useful knowledge from large amounts of data, and involves data cleaning, preprocessing, reduction, and modeling to identify patterns. Data mining is one step in the KDD process that applies specific algorithms to extract patterns from the data. Some examples of data mining algorithms include clustering, classification, regression, and association. While KDD and data mining are often used interchangeably, data mining is actually a subset within the broader KDD process.

Uploaded by

Girish Sahare

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

1K views2 pages

KDD Vs Data Mining

Uploaded by

Girish Sahare

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 2

KDD vs Data mining

KDD (Knowledge Discovery in Databases) is a field of computer science, which

includes the tools and theories to help humans in extracting useful and previously
unknown information (i.e. knowledge) from large collections of digitized data. KDD
consists of several steps, and Data Mining is one of them. Data Mining is application of
a specific algorithm in order to extract patterns from data. Nonetheless, KDD and Data
Mining are used interchangeably.
What is KDD?
As mentioned above, KDD is a field of computer science, which deals with extraction of
previously unknown and interesting information from raw data. KDD is the whole
process of trying to make sense of data by developing appropriate methods or
techniques. This process deal with the mapping of low-level data into other forms those
are more compact, abstract and useful. This is achieved by creating short reports,
modeling the process of generating data and developing predictive models that can
predict future cases. Due to the exponential growth of data, especially in areas such as
business, KDD has become a very important process to convert this large wealth of
data in to business intelligence, as manual extraction of patterns has become seemingly
impossible in the past few decades. For example, it is currently been used for various
applications such as social network analysis, fraud detection, science, investment,
manufacturing, telecommunications, data cleaning, sports, information retrieval and
largely for marketing. KDD is usually used to answer questions like what are the main
products that might help to obtain high profit next year in Wal-Mart?. This process has
several steps. It starts with developing an understanding of the application domain and
the goal and then creating a target dataset. This is followed by cleaning, preprocessing,
reduction and projection of data. Next step is using Data Mining (explained below) to
identify pattern. Finally, discovered knowledge is consolidates by visualizing and/or
interpreting.
What is Data Mining?

As mentioned above, Data Mining is only a step within the overall KDD process. There
are two major Data Mining goals as defined by the goal of the application, and they are
namely verification or discovery. Verification is verifying the users hypothesis about
data, while discovery is automatically finding interesting patterns. There are four major
data mining task: clustering, classification, regression, and association (summarization).
Clustering is identifying similar groups from unstructured data. Classification is learning
rules that can be applied to new data. Regression is finding functions with minimal error
to model data. And association is looking for relationships between variables. Then, the
specific data mining algorithm needs to be selected. Depending on the goal, different
algorithms like linear regression, logistic regression, decision trees and Nave Bayes
can be selected. Then patterns of interest in one or more representational forms are
searched. Finally, models are evaluated either using predictive accuracy or
understandability.
What is the difference between KDD and Data mining?
Although, the two terms KDD and Data Mining are heavily used interchangeably, they
refer to two related yet slightly different concepts. KDD is the overall process of
extracting knowledge from data while Data Mining is a step inside the KDD process,
which deals with identifying patterns in data. In other words, Data Mining is only the
application of a specific algorithm based on the overall goal of the KDD process.

Static and Dynamic Hashing
No ratings yet
Static and Dynamic Hashing
12 pages
Introduction To Emerging Trends
No ratings yet
Introduction To Emerging Trends
30 pages
ds4015 Big Data Analytics Vignesh K Notes
No ratings yet
ds4015 Big Data Analytics Vignesh K Notes
146 pages
BCA-404: Data Mining and Data Ware Housing
No ratings yet
BCA-404: Data Mining and Data Ware Housing
19 pages
AI Project Cycle PPT - Notes
No ratings yet
AI Project Cycle PPT - Notes
9 pages
Selective Tuning and Indexing
No ratings yet
Selective Tuning and Indexing
3 pages
Data Mining Introduction
No ratings yet
Data Mining Introduction
52 pages
Infosys Campus Registration Guide
No ratings yet
Infosys Campus Registration Guide
7 pages
MCA Syllabus BPUT
No ratings yet
MCA Syllabus BPUT
11 pages
CS1403 CASE Tools Lab Manual
100% (2)
CS1403 CASE Tools Lab Manual
67 pages
Digital Nurture 2.0 - Deep Skilling Stage - Handbook
No ratings yet
Digital Nurture 2.0 - Deep Skilling Stage - Handbook
11 pages
Rich Internet Applications (Rias) : Characteristics of Ria
No ratings yet
Rich Internet Applications (Rias) : Characteristics of Ria
24 pages
Unit01 03
No ratings yet
Unit01 03
147 pages
4.5 Issues in Code Generation
No ratings yet
4.5 Issues in Code Generation
7 pages
RT Why Agreements Matter
No ratings yet
RT Why Agreements Matter
176 pages
OS in 6 Hours
No ratings yet
OS in 6 Hours
73 pages
Introduction To AI & ML QUESTION BANK MODULEWISE
No ratings yet
Introduction To AI & ML QUESTION BANK MODULEWISE
3 pages
An XML File Which Will Display The Book Information and DTD
No ratings yet
An XML File Which Will Display The Book Information and DTD
7 pages
Evaluation of The E-Auction System of Coal India Limited: Summer Project Report ON
No ratings yet
Evaluation of The E-Auction System of Coal India Limited: Summer Project Report ON
91 pages
Data Science (UNIT 1)
No ratings yet
Data Science (UNIT 1)
31 pages
CS3492 DBMS-Important-2-Mark With Answer
No ratings yet
CS3492 DBMS-Important-2-Mark With Answer
16 pages
CS8392 - Oop - Unit - 3 - PPT - 3.4
No ratings yet
CS8392 - Oop - Unit - 3 - PPT - 3.4
23 pages
Data Mining and KDD
No ratings yet
Data Mining and KDD
15 pages
Integrity and Domain Constraints
No ratings yet
Integrity and Domain Constraints
25 pages
006 Practical List of DM-2023
No ratings yet
006 Practical List of DM-2023
1 page
Santa Barbara County Elections Office Contest Candidate List Aug. 16, 2018
No ratings yet
Santa Barbara County Elections Office Contest Candidate List Aug. 16, 2018
51 pages
Presentation On Industrial Training
No ratings yet
Presentation On Industrial Training
13 pages
M.SC Part II Syllabus
No ratings yet
M.SC Part II Syllabus
41 pages
Machine Learning Quantum
No ratings yet
Machine Learning Quantum
64 pages
MGBMC2012 02
100% (2)
MGBMC2012 02
7 pages
ZARUMA PROJECT Short
No ratings yet
ZARUMA PROJECT Short
80 pages
SIR Presentation
No ratings yet
SIR Presentation
31 pages
Royalty On MURUM
No ratings yet
Royalty On MURUM
5 pages
Projectors
No ratings yet
Projectors
9 pages
Msc. 3 Sem: Unit - 1
No ratings yet
Msc. 3 Sem: Unit - 1
57 pages
Data Science Notes
No ratings yet
Data Science Notes
10 pages
2021 Directory of Operating Mines and Quarries
No ratings yet
2021 Directory of Operating Mines and Quarries
29 pages
Database Management System
No ratings yet
Database Management System
32 pages
Ch1 Overview KDD - ML
No ratings yet
Ch1 Overview KDD - ML
23 pages
Peruvian & Brazilian Tin JLK Abril 2016
No ratings yet
Peruvian & Brazilian Tin JLK Abril 2016
23 pages
PKO Business
No ratings yet
PKO Business
35 pages
CH 6 Secondary Activities
No ratings yet
CH 6 Secondary Activities
19 pages
Btech 7th Cbcs
No ratings yet
Btech 7th Cbcs
14 pages
MINING: A Boon or A Bane????
No ratings yet
MINING: A Boon or A Bane????
3 pages
2 - PPT Multi Keyword Search in Cloud Data
No ratings yet
2 - PPT Multi Keyword Search in Cloud Data
13 pages
DM Module 1
No ratings yet
DM Module 1
11 pages
Point-to-Point Protocol: Semester 4, Chapter 4
No ratings yet
Point-to-Point Protocol: Semester 4, Chapter 4
51 pages
DM Practice
No ratings yet
DM Practice
15 pages
DBMS
100% (1)
DBMS
16 pages
LOI For PT. Kideco Jaya Gung
No ratings yet
LOI For PT. Kideco Jaya Gung
3 pages
Mineral Prospecting Work Programme
100% (1)
Mineral Prospecting Work Programme
18 pages
Subject Data Warehouse
No ratings yet
Subject Data Warehouse
42 pages
Perth Austmine Innovation Roadshow Program 100823
No ratings yet
Perth Austmine Innovation Roadshow Program 100823
7 pages
DDB - Presentation5data Mining Overview
No ratings yet
DDB - Presentation5data Mining Overview
19 pages
Ground Failure Investigation Over Abandoned Coal Mines - A Case Study (1988)
No ratings yet
Ground Failure Investigation Over Abandoned Coal Mines - A Case Study (1988)
5 pages
Topic:SLIP&PPP Submitted To:-Submitted By
No ratings yet
Topic:SLIP&PPP Submitted To:-Submitted By
13 pages
IIM Membership Upload
No ratings yet
IIM Membership Upload
8 pages
Data Mining Report
100% (1)
Data Mining Report
15 pages
A Generic Framework To Support The Implementation of Six Sigma Approach in SMEs
No ratings yet
A Generic Framework To Support The Implementation of Six Sigma Approach in SMEs
6 pages
Hindustan Zinc LTD
No ratings yet
Hindustan Zinc LTD
26 pages
Oriel Resources PLC - Voskhod Chrome Project - InvestEgate
No ratings yet
Oriel Resources PLC - Voskhod Chrome Project - InvestEgate
4 pages
Hadoop PDF
0% (1)
Hadoop PDF
4 pages
Maxwell An20170815a1 1
No ratings yet
Maxwell An20170815a1 1
3 pages
Vikass
No ratings yet
Vikass
5 pages
ASSIGNMENT 1 Questions BI
No ratings yet
ASSIGNMENT 1 Questions BI
1 page
Internship Opportunity at Movidu Technology-Hyderabad
No ratings yet
Internship Opportunity at Movidu Technology-Hyderabad
1 page
Chapter 2223
No ratings yet
Chapter 2223
35 pages
Counting Oneness in A Window
No ratings yet
Counting Oneness in A Window
12 pages
Java Notes Apna
No ratings yet
Java Notes Apna
14 pages
KDD
No ratings yet
KDD
3 pages
Note 23
No ratings yet
Note 23
1 page
DWDM Lab Manual - It - Iii-Ii - 2018-19 PDF
No ratings yet
DWDM Lab Manual - It - Iii-Ii - 2018-19 PDF
96 pages
Templates: Ookout
No ratings yet
Templates: Ookout
6 pages
Abhishek Tiwari's DBMS Interview Notes
No ratings yet
Abhishek Tiwari's DBMS Interview Notes
19 pages
SAIL IISCO Marketing Project PDF
No ratings yet
SAIL IISCO Marketing Project PDF
74 pages
File System Damage - Orphans and Lost+Found
No ratings yet
File System Damage - Orphans and Lost+Found
1 page
Data Analytics Important Questions
No ratings yet
Data Analytics Important Questions
11 pages
Crisp
No ratings yet
Crisp
14 pages
Lamp Load
100% (4)
Lamp Load
5 pages
Oss Unit III
No ratings yet
Oss Unit III
44 pages
XVXVXV
No ratings yet
XVXVXV
5 pages
Data Warehousing and Data Mining Important Question
No ratings yet
Data Warehousing and Data Mining Important Question
7 pages
Unit I DM
No ratings yet
Unit I DM
27 pages
Boe310 Lecture
No ratings yet
Boe310 Lecture
25 pages
Unit-1 Data Mining Metrics
No ratings yet
Unit-1 Data Mining Metrics
2 pages
Taxonomy
No ratings yet
Taxonomy
30 pages
Enhancement Lecture: Wasting Assets Lecture Notes
No ratings yet
Enhancement Lecture: Wasting Assets Lecture Notes
4 pages
L-2.9 Hmac Cmac
No ratings yet
L-2.9 Hmac Cmac
14 pages
APUSH Essay T. Roosevelt and Wilson
No ratings yet
APUSH Essay T. Roosevelt and Wilson
3 pages
JDBC Questions Bank 2020
No ratings yet
JDBC Questions Bank 2020
1 page
Assignment of ML
No ratings yet
Assignment of ML
5 pages
Ai Unit 4
No ratings yet
Ai Unit 4
23 pages
Presentation ON Neo4J
No ratings yet
Presentation ON Neo4J
5 pages
The Role of Algorithms in Computing
No ratings yet
The Role of Algorithms in Computing
9 pages
Bput Coa
No ratings yet
Bput Coa
2 pages
Mc-Unit I
No ratings yet
Mc-Unit I
16 pages
Introduction to Linux: Installation and Programming
From Everand
Introduction to Linux: Installation and Programming
N. B. Venkateswarlu
No ratings yet
Touchpad Plus Ver. 1.1 Class 7
From Everand
Touchpad Plus Ver. 1.1 Class 7
Nisha Batra
No ratings yet

KDD Vs Data Mining

Uploaded by

KDD Vs Data Mining

Uploaded by

KDD vs Data mining

KDD (Knowledge Discovery in Databases) is a field of computer science, which

You might also like