[go: up one dir, main page]

0% found this document useful (0 votes)
250 views3 pages

Data Warehousing & Mining Guide

A data warehouse consists of data from multiple sources organized for analysis. It differs from operational databases which store current data for transactions. A data warehouse uses a multidimensional schema and has components like an ETL process and OLAP tools. Data mining involves analyzing data to find patterns and derive useful information. It uses techniques like classification, clustering, association rule mining. Data mining helps businesses make better decisions through knowledge discovery from large datasets.

Uploaded by

Pulkit Saini
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
250 views3 pages

Data Warehousing & Mining Guide

A data warehouse consists of data from multiple sources organized for analysis. It differs from operational databases which store current data for transactions. A data warehouse uses a multidimensional schema and has components like an ETL process and OLAP tools. Data mining involves analyzing data to find patterns and derive useful information. It uses techniques like classification, clustering, association rule mining. Data mining helps businesses make better decisions through knowledge discovery from large datasets.

Uploaded by

Pulkit Saini
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
You are on page 1/ 3

DATA WAREHOUSING AND DATA MINING Unit 1 Q1: What is Data Warehouse? Discuss the components.

What are the differences between Operational Database and Data Warehouse? Q2: With a neat sketch, explain the architecture of a data warehouse. Compare two-tier architecture and three-tier client server architecture. Q3: What is OLAP? Discuss the typical OLAP operations with an example. Q4: What is Data Mart? Discuss different types of Data Marts. How data Marts are different from Data Warehouse? Q5: What is Meta Data? Why is it important? Discuss multidimensional data. Q6: What is Online Transaction Processing? Describe the evolution of OLTP. What are the critical features of OLTP systems? Q7: Compare and contrast two basic approaches to building a data warehouse Q8: What are the differences between the three main types of data warehouse usage: information processing, analytical processing and data mining? Discuss the motivation behind OLAP mining. Q9: A data mart is a subsection of a data warehouse that deals with specific information. List the advantages and disadvantages of using data mart over data warehouse Q10: Suppose that a data warehouse consists of three dimensions time, doctor and patient and the two measures count and charge, where the charge is the fee that a doctor charges a patient for a visit. Draw a schema diagram for the above data warehouse using Snowflake schema Q11: Describe the similarities and dissimilarities between Database and Data warehouse Q12: Give an example where application of data mining can be crucial to the success of business. Q13: What are the responsibilities of Data Warehouse Manager? Explain Q14: What do you understand by Data Granularity? Discuss with example Q15: What are Schemas? Discuss various Schemas used in Data Warehouse. Q16: What do you understand By Virtual Data Warehouse? Also, discuss the concept of Distributed Data Warehouse. Q17: Discuss OLAP, ROLAP, and MOLAP. Unit 2 Q1: What is Data Mining? Describe the steps involved in data mining when viewed as a process of knowledge discovery. Q2: Binning can be used for Data Reduction and Discretization. Support this statement with an example. Q3: What is Noise? What techniques are used to remove noise while building a data warehouse? Explain briefly. Q4: Write different approaches to Data Transformation. Q5: What are various goals of Data Mining? Explain the various tools and techniques of Data Mining.

Q6: List and explain the five primitives for specifying a data mining task. Q7: Describe the differences between the following approaches for the integration of a data mining system with a database: no coupling, loose coupling, semi tight coupling and tight coupling. Q8: Discuss the major issues in Data Mining. Q9: Discuss the architecture of a typical Data Mining system with diagram. Q10: Explain Discretization and Concept Hierarchy with example. Q11: What do you understand by Data Reduction? Explain the various strategies involved in it. Unit 3 Q1: Explain the algorithm for constructing a decision tree from training samples. Q2: What is Bayesian Classification? How it classifies the input data? Explain Bayes theorem. Q3: What are supervised and unsupervised Learning? Why Clustering is known as unsupervised Learning? Q4: Write and explain the algorithm for mining frequent item sets without candidate generation. Give relevant example. Q5: Discuss the approaches for mining multi level association rules from the transactional databases. Give relevant example. Q6: How does Clustering different from Classification? Q7: What are Classification Rules? How they are associated with Decision Tree? Q8: Define Association Rule Mining. How market basket analysis forms the association rule? Q9: Briefly outline how to compute the dissimilarity between objects described by following types of variables: (i) Numerical Variables (ii) Ratio-scaled variables (iii) Categorical Variables (iv) Asymmetric Variables Q10: What is Outlier Analysis? Why it is important? Q11: Explain categorization of major Clustering methods. Q12: Discuss K-MEANS and K-MEDOIDS ALGORITHMS. Q13: Numericals on Apriori Algorithm and Decision Tree Q14: Discuss the various categories of Clustering. Q15: Discuss the approaches for mining multidimensional association rule from data warehouse Q16: Write the Apriori algorithm for generating the frequent item sets. UNIT 4 Q1: What is a multimedia database? Explain the methods of mining multimedia database? Q2: Explain (i) Mining Time Series and Sequence data

(ii) Spatial Databases (iii) Descriptive Mining of Complex Data Objects (iv) Applications in Data Mining (v) Trends in Data Mining Q3: What kinds of Associations can be mined in Multimedia Data? Explain. Q4: Discuss the concept of Audio and Video Mining.

You might also like