Processing and
Analysis of Data
By- Sai R Patil
SGM College Karad
BBA II
Introduction to Data Processing and
Analysis
• Definition: Data processing is the collection and manipulation of data
to derive meaningful insights.
• The objective is to make data understandable and useful for decision-
making.
• Types:
Qualitative
Quantitative
Classification of Data
• Data classification refers to the process of grouping data into
categories.
• Types of Data Classification:
• Quantitative Data: Data that is measurable (e.g., income, age,
height).
• Qualitative Data: Data that describes qualities or characteristics (e.g.,
color, gender, brand).
Types of Classification
• Primary Classification: Nominal: Categories with no natural order
(e.g., gender, marital status).
• Ordinal: Categories with a natural order (e.g., education level: high
school, college, graduate).
• Secondary Classification: Discrete: Data that can only take specific
values (e.g., number of children).
• Continuous: Data that can take any value within a range (e.g.,
temperature, weight).
Tabulation of Data
• The process of arranging data in tables for easier interpretation.
• Types of Tables: Simple Table: Displays data for a single variable.
• Complex Table: Displays data for more than one variable.
• Example: Display a sample table with data on student marks in
different subjects.
Graphical Presentation of Data
• Graphs are a visual representation of data, making it easier to interpret.
• Types of Graphical Representations
Bar Diagram: Represents categorical data with rectangular bars.
Pie Chart: Displays the proportion of categories as slices of a circle.
Line Graph: Depicts trends over time.
Bar Diagram
• A bar chart represents data with rectangular bars.
• For comparing different categories or discrete data
• Example: Sales data for different products in a quarter.
• How to Interpret: The height of the bar corresponds to the value of
the data point.
Category 4
Category 3
Category 2
Category 1
0 1 2 3 4 5 6
Series 3 Series 2 Series 1
Pie Chart
• A circular graph divided into slices to illustrate numerical proportions.
• When to Use: To show the percentage or proportion of categories in a
whole.
• Example: Market share distribution among companies.
• How to Interpret: Each slice represents the percentage contribution
of each category. Sales
1st Qtr 2nd Qtr 3rd Qtr 4th Qtr
Curves (Line Graphs)
• Line graphs are used to show changes over a period of time.
• When to Use: For continuous data or trends.
• Example: Stock market trends over the past year.
• How to Interpret: Points connected by lines show the trend or change
in data over time. 14
12
10
8
6
4
2
0
Category 1 Category 2 Category 3 Category 4
Series 1 Series 2 Series 3
Descriptive Analysis
• Descriptive analysis involves summarizing the data using statistical
measures.
Mean (Average): Sum of all values divided by the number of values.
Mode: The most frequent value in the dataset.
Median: The middle value when the data is ordered.
Standard Deviation: Measures the spread of data from the mean.
Variance: The square of the standard deviation, indicating the data's spread.
Descriptive Analysis (Cont’d)
• Example Calculation
• Dataset: [5, 10, 15, 20, 25]
Mean: (5 + 10 + 15 + 20 + 25) / 5 = 15
Median: Middle value = 15
Mode: No mode (no repeating value)
Standard Deviation: Calculation process.
Variance: Square of the standard deviation.
Inferential Analysis
• Inferential analysis involves making predictions or inferences about a
population based on a sample of data.
T-Test: Compares the means of two groups to see if they are
significantly different.
Chi-Square Test: Tests if there is a significant association between
categorical variables.
T-Test
• A statistical test used to compare the means of two groups.
• Testing if the average exam scores of male and female students differ.
• Types:
Independent T-Test: Used when the two groups are independent.
Paired T-Test: Used when the two groups are related (e.g., pre-test vs
post-test scores).
Chi-Square Test
• A test used to determine whether there is a significant association
between categorical variables.
• Example: Testing if gender is associated with the choice of a major.
• Hypotheses:
• Null hypothesis (H0): No association between the variables.
• Alternative hypothesis (H1): There is an association between the
variables.
Conclusion
• Data classification and analysis help in making meaningful decisions.
• Graphical and statistical tools play a crucial role in data interpretation.
• Descriptive and inferential statistics are both essential for
understanding and making inferences from data.
• Importance for Business Analysis: Helps in understanding trends,
making predictions, and informed decision-making.
Thank you