Chapter 1
Introduction
24 January 2024
Statistics - Spring semester 2023-2024
Outline
1-1 Introduction
1-2 Variables and Types of Data
1-3 Data Collection
1-1 Introduction
What is Statistics?
• Statistics is the art of learning from data. It is
concerned with the collection of data, its subsequent
description, and its analysis, which often leads to the
drawing of conclusions.
• It is a branch of mathematics.
Descriptive and Inferential Statistics
Descriptive statistics: consists of the collection, organization,
summarization and presentation of data.
Items are counted or measured, and the results are combined in
various ways to give useful summaries.
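As a rough illustration of combining measurements into summaries, the sketch below computes a few common descriptive statistics with Python's standard `statistics` module. The list `measurements` is hypothetical data invented for the example, not taken from the course.

```python
import statistics

# Hypothetical measurements (for example, lengths in cm); invented for illustration.
measurements = [12, 15, 11, 15, 14, 13, 18]

summary = {
    "count": len(measurements),
    "mean": statistics.mean(measurements),
    "median": statistics.median(measurements),
    "mode": statistics.mode(measurements),
    "range": max(measurements) - min(measurements),
}

for name, value in summary.items():
    print(f"{name}: {value}")
```

Each entry describes the data that were actually collected; none of them makes a claim about items that were not measured.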
Inferential statistics: consists of making decisions or predictions
about populations based on sample information.
For example, it is often not practical to measure all the items
produced by a process. Instead, we very frequently take a
sample and measure the relevant quantity on each member of
the sample.
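The idea of measuring a sample instead of every item can be sketched as follows. The population here is simulated, and its size, mean and spread (10,000 items, 50.0, 2.0) are assumptions chosen only for illustration.

```python
import random
import statistics

random.seed(0)  # fixed seed so the sketch is reproducible

# Hypothetical population: one measured quantity for each of 10,000 produced items.
population = [random.gauss(50.0, 2.0) for _ in range(10_000)]

# Measure only a random sample of 100 items instead of the whole population.
sample = random.sample(population, 100)

sample_mean = statistics.mean(sample)
population_mean = statistics.mean(population)

print(f"sample mean:     {sample_mean:.2f}")
print(f"population mean: {population_mean:.2f}")
```

In practice the population mean is unknown; it is computed here only to show that the sample mean tends to land close to it.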
Populations and samples
In statistics, we are interested in obtaining information about a
total collection of elements, which we will refer to as the
population. The population is often too large for us to examine
each of its members. For instance, we might have all the
residents of a given state, or all the television sets produced in
the last year by a particular manufacturer. In such cases, we try
to learn about the population by choosing and then examining a
subgroup of its elements. This subgroup of a population is called
a sample.
A population consists of all the items we are interested in.
A sample is a chosen part of the population in question.
In practice, a given sample generally cannot be assumed to be
representative of a population unless that sample has been chosen in a
random manner. This is because any specific nonrandom rule for
selecting a sample often results in one that is inherently biased toward
some data values as opposed to others.
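A small sketch of this effect, under the assumption of a made-up population (the whole numbers 1 to 1000, stored in increasing order): a nonrandom rule such as "take the first 50 items" can only ever select small values, while a random rule gives every item the same chance.

```python
import random
import statistics

random.seed(1)  # fixed seed so the sketch is reproducible

# Hypothetical population: the values 1..1000, stored in increasing order.
population = list(range(1, 1001))

# Nonrandom rule: take the first 50 items. Only small values can be selected.
first_50 = population[:50]

# Random rule: every item has the same chance of being selected.
random_50 = random.sample(population, 50)

print(statistics.mean(population))  # 500.5
print(statistics.mean(first_50))    # 25.5
print(statistics.mean(random_50))   # depends on the seed
```

The mean of the nonrandom sample is far from the population mean no matter how often the rule is applied, which is what "inherently biased" means here.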
1-2 Variables and Types of Data
- Nominal - Lowest level of measurement
- Ordinal
- Interval
- Ratio - Highest level of measurement
A nominal scale is an unordered set of categories identified only by
name. Nominal measurements only permit you to determine
whether two individuals are the same or different. Notice that the
categories of a nominal scale are mutually exclusive.
Examples of Nominal data
An ordinal scale is an ordered set of categories. Ordinal
measurements tell you the direction of difference between two
individuals.
Examples of Ordinal data
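One possible way to represent an ordinal scale in code is to store the categories in order and compare their positions. The category names (`low`, `medium`, `high`) and the helper `is_higher` are hypothetical and chosen only for this sketch.

```python
# Hypothetical ordinal scale: the category names and their order are assumptions.
LEVELS = ["low", "medium", "high"]
RANK = {level: position for position, level in enumerate(LEVELS)}


def is_higher(a: str, b: str) -> bool:
    """Return True if category a ranks above category b."""
    return RANK[a] > RANK[b]


responses = ["medium", "high", "low", "medium"]

print(sorted(responses, key=RANK.get))  # ['low', 'medium', 'medium', 'high']
print(is_higher("high", "low"))         # True
```

The rank numbers only record order. They say nothing about how far apart two categories are, so subtracting them would not give a meaningful magnitude.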
An interval scale is an ordered series of equal-sized categories.
Interval measurements identify the direction and magnitude of a
difference. Interval scales are numeric scales in which we know
both the order and the exact differences between the values.
- Interval data have no "true zero". On the Celsius scale, for
example, 0 °C does not mean that there is no temperature.
- With interval data, we can add and subtract, but multiplying,
dividing or calculating ratios does not give meaningful results.
- Because there is no true zero, statistics that rely on ratios, such
as the coefficient of variation, cannot be meaningfully applied.
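An example of an interval scale is Celsius temperature,
because the difference between each pair of adjacent values is
the same. For example, the difference between 10 °C and 20 °C is
the same as the difference between 20 °C and 30 °C.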
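The following sketch illustrates why differences are meaningful on an interval scale while ratios are not. It converts three hypothetical Celsius readings to Fahrenheit with the standard conversion formula; the function name `celsius_to_fahrenheit` is introduced only for this example.

```python
def celsius_to_fahrenheit(c: float) -> float:
    """Convert a Celsius temperature to Fahrenheit."""
    return c * 9 / 5 + 32


# Hypothetical readings in degrees Celsius.
celsius = [10.0, 20.0, 30.0]
fahrenheit = [celsius_to_fahrenheit(c) for c in celsius]  # [50.0, 68.0, 86.0]

# Equal differences stay equal after changing the scale.
print(celsius[1] - celsius[0], celsius[2] - celsius[1])              # 10.0 10.0
print(fahrenheit[1] - fahrenheit[0], fahrenheit[2] - fahrenheit[1])  # 18.0 18.0

# Ratios depend on where the scale puts its zero, so they carry no meaning.
print(celsius[1] / celsius[0])        # 2.0
print(fahrenheit[1] / fahrenheit[0])  # 1.36
```

The same two temperatures give a ratio of 2.0 in Celsius and 1.36 in Fahrenheit, so "20 °C is twice as hot as 10 °C" is not a valid statement.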
A ratio scale is an interval scale where a value of zero indicates
none of the variable. Ratio measurements identify the direction
and magnitude of differences and allow ratio comparisons of
measurements.
Examples of ratio variables include height, mass, distance and
many more.
Mass (kg)    Length (cm)
0            0
5            10
10           20
15           30
20           40
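A brief sketch of a ratio comparison, using two masses from the scale above. Converting kilograms to grams only rescales the values and leaves zero at zero, so the ratio between two measurements is unchanged. The variable names are chosen for this example.

```python
# Two masses in kilograms, taken from the scale above.
mass_a_kg, mass_b_kg = 5.0, 10.0

# A change of unit only rescales the values; zero stays zero.
mass_a_g, mass_b_g = mass_a_kg * 1000, mass_b_kg * 1000

print(mass_b_kg / mass_a_kg)  # 2.0
print(mass_b_g / mass_a_g)    # 2.0
```

Because the ratio is the same in every unit, "10 kg is twice as heavy as 5 kg" is a valid statement.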
1- Categorical Data
Categorical data represent characteristics, such as a person's
gender or language. Categorical data can also be coded with
numerical values (example: 1 for female and 0 for male). Note that
those numbers are only labels and have no mathematical
meaning.
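To make the point about numeric codes concrete, the sketch below uses the coding from the text (1 for female, 0 for male) on a hypothetical list of coded responses. The codes are mapped back to labels and counted per category.

```python
from collections import Counter

# Coding from the text: 1 for female, 0 for male.
LABELS = {1: "female", 0: "male"}

# Hypothetical coded responses, invented for illustration.
coded = [1, 0, 0, 1, 1, 1, 0]

# The codes act only as labels, so the natural summary is a count per category.
counts = Counter(LABELS[code] for code in coded)
print(counts)  # Counter({'female': 4, 'male': 3})
```

Swapping the codes (0 for female, 1 for male) would describe exactly the same data, which is why the numbers themselves have no mathematical meaning.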
2- Numerical Data
2.1 Discrete Data
We speak of discrete data if the values are distinct and separate,
that is, if the data can only take on certain values. This type of
data cannot be measured, but it can be counted. An example is the
number of heads in 100 coin flips.
You can check whether you are dealing with discrete data by asking
two questions: Can you count it? Can it be divided up into smaller
and smaller parts? If you can count it but cannot divide it into
ever smaller parts, the data are discrete.
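The coin-flip example can be simulated as a quick sketch. The seed and the use of `random.choice` are choices made for this illustration; the point is only that the result is a count.

```python
import random

random.seed(42)  # fixed seed so the sketch is reproducible

# Simulate 100 flips of a fair coin.
flips = [random.choice(["H", "T"]) for _ in range(100)]

# The number of heads is obtained by counting, not by measuring.
heads = flips.count("H")
print(heads)
```

Whatever the seed, `heads` is a whole number between 0 and 100; a value such as 47.3 heads is impossible.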
2.2 Continuous Data
Continuous data represent measurements; their values cannot be
counted, but they can be measured. An example would be the
height of a person, which can take any value within an interval
on the real number line.
Summary of Data Classifications
1-3 Data Collection
What is Data Collection?
Data collection is a methodical process of gathering and
analyzing specific information in order to answer relevant
questions and evaluate the results. It focuses on finding out all
there is to know about a particular subject. The collected data
are then subjected to hypothesis testing, which seeks to explain a
phenomenon.
Types of Data Collection
1- INTERVIEW
An interview is a face-to-face conversation between two individuals
with the sole purpose of collecting relevant information to satisfy a
research purpose.
2- QUESTIONNAIRES
This is the process of collecting data through an instrument
consisting of a series of questions and prompts, which elicit
responses from the individuals to whom it is administered.
Questionnaires are designed to collect data from a group.
3- REPORTING
By definition, data reporting is the process of gathering and
submitting data to be further subjected to analysis. The key aspect
of data reporting is accuracy, because inaccurate data reporting
leads to uninformed decision making.
4- EXISTING DATA
This method introduces new investigative questions, in addition to
or other than the ones originally asked when the data were first
gathered. It involves adding measurement to an existing study or
piece of research. An example would be sourcing data from an archive.
5- OBSERVATION
This is a data collection method by which information on a
phenomenon is gathered through observation. The researcher can
act as a complete observer, an observer as a participant, a
participant as an observer, or a complete participant. This
method is a key basis for formulating a hypothesis.