Answer the following
1. Define Precision.
2. What is Outlier?
3. What is Data Science?
4. What are the main steps in the Data Science process?
5. What is the difference between supervised and unsupervised learning?
6. What is the role of a data scientist?
7. What are some common types of data?
8. What are some common Data Science tools and libraries?
9. What is a p-value in statistics?
10. What is cross-validation?
11. What is the difference between classification and regression?
12. What is the difference between population and sample in statistics?
13. What is descriptive statistics?
14. What is the difference between mean, median, and mode?
15. What is variance and standard deviation?
16. What is the normal distribution?
17. What are confidence intervals?
18. What is hypothesis testing?
19. What is a p-value?
20. What is Type I and Type II error?
21. What is the central limit theorem?
22. What is correlation, and how does it differ from causation?
23. What is a chi-square test?
24. What is linear regression?
25. What is multicollinearity in regression analysis?
26. What are outliers, and how can you detect them?
27. What is an ANOVA (Analysis of Variance)?
28. What is a t-test, and when is it used?
29. What is the difference between parametric and non-parametric tests?
30. What is the concept of skewness in statistics?
31. What is the difference between correlation and covariance?
32. What is data preprocessing?
33. Why is data preprocessing important in data science?
34. What are the main steps involved in data preprocessing?
35. What is missing data, and how can it be handled?
36. What is feature scaling, and why is it important?
37. What is one-hot encoding, and when is it used?
38. What is label encoding, and how does it differ from one-hot encoding?
39. What are outliers, and how can they be handled during preprocessing?
40. What is data imbalance, and how can it be handled?
41. What are some common data preprocessing techniques for text data?
42. What is the difference between normalization and standardization?
43. What is the role of data preprocessing in improving model performance?
44. What is data visualization?
45. Why is data visualization important?
46. What are some common types of data visualizations?
47. What is the difference between a bar chart and a histogram?
48. What is the purpose of a scatter plot?
49. What is a heatmap, and how is it useful?
50. What are box plots, and why are they useful?
51. What is the difference between a line chart and an area chart?
52. How do you handle missing data in visualizations?
53. What is the role of color in data visualizations?
54. What are some best practices for creating effective data visualizations?
55. What is the difference between a stacked bar chart and a grouped bar chart?
56. What is a pair plot?
57. What is the importance of choosing the right chart scale (logarithmic vs. linear)?
58. What is a funnel plot?
59. What are interactive visualizations, and why are they useful?
60. What tools and libraries are commonly used for data visualization?