0% found this document useful (0 votes)

117 views12 pages

Chemometrics (Basic Chemical Data Operation)

Chemometrics combines mathematical and statistical methods to optimize experiments and analyze chemical data, enhancing the extraction of valuable information. Statistics plays a crucial role in chemistry for data summarization, quality control, and method validation. Understanding concepts such as variables, errors, significant figures, and data sources is essential for accurate chemical analysis and interpretation.

Uploaded by

zunairarasheed24

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

117 views12 pages

Chemometrics (Basic Chemical Data Operation)

Uploaded by

zunairarasheed24

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 12

What are Chemometrics?

Chemometrics is the science of using mathematical and statistical methods to design optimal experimental
procedures and to extract maximum useful chemical information from data obtained in the laboratory or
from chemical processes.
Explanation: Think of it as a bridge between chemistry and mathematics. In your chemistry lab, you
generate a lot of data (weights, volumes, absorbance readings, pH values, etc.). Chemometrics provides
the tools to:
1. Plan your experiments efficiently so you get the best results with the least resources.
2. Analyze the complex data you collect to find patterns, relationships, and meanings that are not
obvious by just looking at the numbers.
3. Build models to predict properties (e.g., predicting the concentration of a pollutant in water from
its spectroscopic data).
• Example:
o Without Chemometrics: You measure the absorbance of 10 known concentrations of a protein to
create a calibration curve. You then measure an unknown sample and estimate its concentration by
manually reading from the graph.
o With Chemometrics: You use a technique like Principal Component Analysis (PCA) to analyze
the entire UV-Vis spectrum (hundreds of wavelengths) of multiple complex protein mixtures to
identify which wavelengths are most important for distinguishing between different types of
proteins. This is much more powerful.

Basic Concept and Scope of Statistics in Chemical Sciences

Statistics is the science of collecting, organizing, analyzing, interpreting, and presenting data. In
chemistry, it is the toolbox we use to handle the uncertainty and variability inherent in all chemical
measurements.
Scope & Explanation: Why does a chemistry student need statistics?
1. Data Summary: To summarize large sets of data into a few meaningful numbers (e.g., average
yield of a reaction, standard deviation of replicate measurements).
2. Quality Control: To ensure that your analytical methods are precise and accurate (e.g., in
pharmaceutical industries to check the purity of a drug batch).
3. Optimization: To find the best conditions for a chemical reaction (e.g., what temperature and pH
give the highest product yield?).
4. Validation of Methods: To prove that your new analytical method is reliable and comparable to
standard methods.
5. Decision Making: To determine if a small difference between two measurements is real or just due
to random error (e.g., Is the concentration of lead in this water sample significantly higher than the
safe limit?).

Descriptive and Inferential Statistics

Descriptive Statistics: Methods used to summarize, organize, and describe the main features of a
collection of data. It deals only with the data you have in hand.
Example: Calculating the average (mean) pH of 10 lake water samples, or plotting a histogram of the
melting points of 50 synthesized compounds.
Inferential Statistics: Methods that use data from a small group (a sample) to make estimates,
predictions, or generalizations about a larger group (a population). It involves drawing conclusions that
go beyond the immediate data.
Example: You analyze 50 tablets from a batch of 1 million tablets to estimate the average drug content
for the entire batch. You then use a statistical test (like a t-test) to conclude that "with 95% confidence,
the average drug content is 250 mg ± 5 mg."

Population, Sample, Observations, Data

Population: The entire, complete set of items or individuals that you are interested in studying. It is the
"big picture."
Example in Chemistry:
▪ All the aspirin tablets produced in a factory on a given day.
▪ The entire volume of water in a lake.
▪ All possible measurements of the boiling point of ethanol that could ever be made under specific
conditions.
Sample: A subset, or a small part, selected from the population. It is used because studying the entire
population is often impossible, too expensive, or destructive.
Example in Chemistry:
▪ 100 aspirin tablets taken from the day's production for quality testing.
▪ 10 one-liter bottles of water collected from different parts of the lake.
▪ 5 replicate measurements of the boiling point of ethanol you make in your lab.
• Observations:
o Definition: The individual measurements or recordings that are made on each item in the sample.
o Example: For each of the 100 aspirin tablets (the sample), you measure the weight. Each of these
100 weight measurements is an observation.
Data: The collection of all observations. Data is the raw, unorganized facts and figures.
Example: The list of 100 weight measurements from the aspirin tablets is your data set.

Discrete and Continuous Variables

Variable: A characteristic or attribute that can be measured or counted and can vary from one observation
to another.
Discrete Variable: A variable that can only take on specific, separate values. Often these are counts of
items. You cannot have a fraction of the value.
Example:
▪ The number of atoms in a molecule (e.g., you can have 2 hydrogen atoms, but not 2.5).
▪ The number of carbon atoms in an alkane series.
▪ The number of times a experiment is repeated.
Continuous Variable: A variable that can take on any value within a given range. They are measured,
not counted. There is always an infinite number of possible values between any two points.
Example:
▪ Temperature: It can be 25.0°C, 25.01°C, 25.001°C, etc.
▪ Mass: A compound can weigh 2.5 g, 2.5001 g, 2.500001 g, etc.
▪ pH, Concentration, Volume, Time. Almost all measurements in a chemistry lab are continuous
variables.

Errors of Measurement
Definition: The difference between the measured value and the true value. No measurement is ever
perfectly exact. Understanding error is crucial in chemistry.
Formula for Error:
o Absolute Error = Measured Value - True Value
o (A positive value indicates the measurement is too high; a negative value indicates it is too low).
Types of Errors:
1. Systematic Error (Determinate Error): Error that is consistent and reproducible. It causes the
measurement to be always too high or always too low by a fixed amount. It is due to a flaw in the
instrument, method, or personal habit.
▪ Cause & Example:
▪ Faulty Calibration: A balance that is not zeroed correctly will give a mass that is consistently
0.05g too high.
▪ Personal Bias: A person who always reads a meniscus from above instead of at eye level.
▪ Method Error: An impurity in a sample that reacts and gives a falsely high reading.
▪ Key Point: Systematic errors affect accuracy (closeness to the true value) and can be corrected
by identifying and eliminating the cause.
2. Random Error (Indeterminate Error): Error that is unpredictable and varies randomly from one
measurement to the next. It causes scatter in the data around the true value.
Cause & Example:
▪ Electrical Noise: Small fluctuations in the electrical current of an instrument.
▪ Limitations of Measurement: The inevitable uncertainty in reading the scale of a burette or a
measuring cylinder.
▪ Environmental Changes: Tiny, uncontrollable variations in temperature or pressure.
▪ Key Point: Random errors affect precision (reproducibility of the measurement). They cannot be
eliminated, but their effect can be reduced by taking more measurements and using statistics.
Accuracy vs. Precision:
o Accuracy: How close a measurement is to the true or accepted value. (Related to Systematic Error).
o Precision: How close repeated measurements are to each other. (Related to Random Error).
o Analogy: A target shooter.
▪ High Accuracy, Low Precision: Shots are scattered but their average is near the bullseye.
▪ Low Accuracy, High Precision: Shots are clustered tightly together, but away from the bullseye.
▪ High Accuracy, High Precision: Shots are clustered tightly on the bullseye.

Significant Figures
Definition: The digits in a measurement that are known with certainty plus the first uncertain digit. They
indicate the precision of the measurement.
Rules for Determining Significant Figures:
1. All non-zero digits are significant. (e.g., 123.45 has 5 significant figures).
2. Zeros between non-zero digits are significant. (e.g., 1002 has 4 significant figures).
3. Leading zeros (zeros before the first non-zero digit) are NOT significant. They just locate the decimal
point. (e.g., 0.0056 has 2 significant figures).
4. Trailing zeros (zeros after the last non-zero digit) ARE significant IF there is a decimal point in the
number. (e.g., 100.0 has 4 significant figures; 100 has 1 significant figure—this is ambiguous and
best written in scientific notation).
• Rules for Calculations:
o Multiplication/Division: The result should have the same number of significant figures as the
measurement with the fewest significant figures.
▪ Example: 5.10 × 2.5 = 12.75. The calculator gives 12.75, but since 2.5 has 2 sig figs, the answer must
be rounded to 13 (or 1.3 × 10^1).
o Addition/Subtraction: The result should have the same number of decimal places as the
measurement with the fewest decimal places.
▪ Example: 10.5 + 1.025 = 11.525. Since 10.5 has one decimal place, the answer must be rounded
to 11.5.

Rounding of Data (Numbers): The process of reducing the number of significant digits in a number
while trying to keep its value similar.
General Rule:
1. Identify the last digit to be kept.
2. Look at the digit immediately to the right (the first digit to be dropped).
▪ If this digit is less than 5, simply drop it and all following digits. (e.g., Rounding 4.234 to two
decimal places: Look at the third digit (4), which is less than 5, so it becomes 4.23).
▪ If this digit is 5 or greater, increase the last kept digit by one and drop the rest. (e.g., Rounding
4.237 to two decimal places: Look at the third digit (7), which is greater than 5, so it becomes 4.24).
• Special Case (Rounding 5): There are different conventions, but a common one is "round to the
nearest even number" to avoid systematic bias. (e.g., 2.5 rounds to 2; 3.5 rounds to 4). For this
course, following the general rule (round up if 5 or greater) is usually sufficient.

Sources of Data: The origin from which statistical data is obtained.

Classification:
1. Internal Sources: Data that is generated within the organization or individual's own records.
▪ Example in a Chemistry Context:
▪ Laboratory notebooks from previous experiments.
▪ Quality control records from a pharmaceutical company's past production.
▪ A professor's own research data from earlier projects.
2. External Sources: Data that is obtained from outside the organization or individual.
Example in a Chemistry Context:
▪ Published Journals: Data from papers in the Journal of the American Chemical
Society or Analytical Chemistry.
▪ Government Publications: Data from the Pakistan Council of Scientific and Industrial Research
(PCSIR) or environmental protection agencies on pollutant levels.
▪ Books and Databases: Standard reference books like the CRC Handbook of Chemistry and Physics.
▪ Commercial Databases: Chemical supplier catalogs with purity data.

Collection of Primary and Secondary Data

Primary Data: Data that is collected originally by the investigator for the first time, specifically for the
current research problem. It is fresh and unpublished.
Collection Methods (for a Chemist):
Experimentation: Conducting a lab experiment and recording the measurements yourself (e.g.,
synthesizing a compound and measuring its yield and melting point).
Surveys/Questionnaires: (Less common in pure chemistry, but used in chemical education or public
health research) Surveying people about their use of household chemicals.
o Advantage: High relevance and control over quality.
o Disadvantage: Time-consuming and expensive.
Secondary Data: Data that has already been collected, compiled, and published by someone else for
another purpose. You use it for your own analysis.
o Collection Methods (for a Chemist):
▪ Literature Review: Searching through scientific journals, books, and online databases to find
data related to your topic.
▪ Using Standard Tables: Using data from standard reference tables (e.g., standard reduction
potentials, pKa values, spectral libraries).
o Advantage: Quick, easy, and inexpensive to obtain.
o Disadvantage: May not be perfectly suited to your needs, and you must trust the accuracy of the
original collector.

Editing of Data: The process of scrutinizing the collected raw data to detect and correct errors and
omissions to ensure the data is accurate, consistent, and ready for analysis. This is a crucial step before
any calculation or interpretation.
Types of Editing Checks:
1. Accuracy Check: Verifying that the data is free from errors.
Example:
▪ Does the recorded melting point (e.g., 250°C) make sense for the compound you synthesized? If the
literature value is 80°C, there might be a transcription error (maybe it was 25.0°C).
▪ Isthe mass of a product greater than the mass of the starting material? If yes, this is physically
impossible and indicates a major error.
2. Consistency Check: Ensuring that the data does not contain contradictions.
Example:
▪ The sum of the percentages of all components in a mixture should be 100%. If it adds up to 115%,
there is an error. The formula for this check is: Sum of all % = 100%.
3. Uniformity Check: Making sure data is recorded in the same units and format throughout.
▪ Example:

▪ Some volumes are recorded in mL and others in L. Before analysis, all must be converted to a single
unit using conversion factors (e.g., 1 L = 1000 mL).
▪ Dates are written in different formats (DD/MM/YYYY vs. MM/DD/YYYY). They should be
standardized.
4. Completeness Check: Verifying that all required data points have been collected and there
are no missing values.
▪ Example: In a table of results, ensure that there is a pH value for every temperature recorded. A blank
cell needs to be addressed.

Summary & Key Takeaways

• Chemometrics is the application of math and stats to solve chemical problems.
• Statistics is essential in chemistry to handle uncertainty, summarize data, and make informed
decisions.
• We study a Sample to make inferences about a Population. Our data consists of Observations on
the sample.
• Variables can be Discrete (counted) or Continuous (measured).
• All measurements have Error. Absolute Error = Measured Value - True Value. Systematic
Error affects accuracy (bias), while Random Error affects precision (scatter).
• Significant Figures reflect the precision of a measurement. Follow specific rules for rounding and
calculations.
• Data can come from Primary sources (you collect it) or Secondary sources (someone else
collected it).
• Before analysis, always perform Data Editing to check for accuracy, consistency (e.g., Sum % =
100%), uniformity, and completeness.
Introduction to Descriptive Statistics
Measures of Central Tendency (Mean, Median, Mode): Measures of central tendency are single values
that represent the center point or typical value of a dataset. They tell us where the data tends to cluster.
The three main measures are the Mean, the Median, and the Mode.
Why are they important in Chemistry? To report a single, representative value for a set of replicate
measurements (e.g., the average yield of a reaction).
o To compare the results from different experiments or different conditions.
o To summarize the properties of a set of samples (e.g., the average pH of water samples
from a river).
The Mean (Arithmetic Average): The mean is the sum of all observations in a dataset divided by the
number of observations. It is the most common measure of central tendency.
Formula:
o Sample Mean (x̄): x̄ = (Σxᵢ) / n
o Population Mean (μ): μ = (Σxᵢ) / N
o Where:
▪ x̄ (x-bar) is the symbol for the sample mean.
▪ μ (mu) is the symbol for the population mean.
▪ Σ (capital sigma) means "the sum of."
▪ xᵢ (x-sub-i) represents each individual value in the dataset.
▪ n is the number of observations in the sample.
▪ N is the number of observations in the population.
Example:
o You measure the melting point of a compound 5 times and get: 155.0°C, 155.8°C, 154.5°C,
156.1°C, 154.9°C.
o Mean (x̄) = (155.0 + 155.8 + 154.5 + 156.1 + 154.9) / 5
o x̄ = 776.3 / 5 = 155.26°C
Explanation:
o Advantage: It uses every value in the data set.
o Disadvantage: It is highly influenced by outliers (extreme values that are much higher or
lower than the rest). A single outlier can skew the mean.
The Median: The median is the middle value in a dataset when the observations are arranged in ascending
or descending order. It splits the data into two equal halves.
Formula: There is no formula, but a procedure.
1. Arrange the data in order from smallest to largest.
2. If the number of observations (n) is odd, the median is the middle value. Its position
is (n+1)/2.
3. If n is even, the median is the average of the two middle values.
Example 1 (n is odd):
o Data: 154.5, 154.9, 155.0, 155.8, 156.1 (already ordered).
o n=5 (odd). The median position is (5+1)/2 = 3.
o The 3rd value is 155.0°C. So, the median is 155.0°C.
Example 2 (n is even):
o Data: 154.5, 154.9, 155.0, 155.8, 156.1, 157.0 (ordered).
o n=6 (even). The two middle values are the 3rd (155.0) and 4th (155.8) values.
o Median = (155.0 + 155.8) / 2 = 155.4°C.
Explanation:
o Advantage: It is not affected by outliers. It is a "robust" statistic. If the highest value was
200.0°C instead of 156.1°C, the median would remain the same.
o Disadvantage: It does not use all the information in the data (only the middle values).
Measures of Central Tendency (Continued) & Dispersion
The Mode: The mode is the value that appears most frequently in a dataset. A dataset can have one mode
(unimodal), two modes (bimodal), or more (multimodal). If no number repeats, there is no mode.
Formula: No formula. It is found by observation or by creating a frequency table.
Example 1 (Unimodal):
o Data from a quality control test on tablet weights (mg): 499, 501, 500, 502, 499, 502,
503, 502.
o The value 502 appears three times, more than any other. The mode is 502 mg.
Example 2 (Bimodal):
o Data: 1.0, 1.1, 1.1, 1.5, 2.0, 2.0, 2.0, 2.1, 2.1
o The value 1.1 appears twice and 2.0 appears three times. The modes are 1.1 and 2.0.
Explanation:
o Usefulness in Chemistry: Particularly useful for categorical data or when the most
common value is of interest (e.g., the most frequently observed crystal structure, the most
common impurity level). For continuous numerical data (like most measurements), the
mode is often not very informative.
Summary: When to Use Which Measure?
Measure Best Used When... Caution in Chemistry
Mean The data is symmetrical and has A single contaminated sample or a gross error can make
no outliers. the mean very misleading.
Median The data is skewed or has outliers. The preferred measure for reporting typical values when
data quality is uncertain.
Mode Identifying the most frequent Less useful for continuous measurement data.
category or value.
Measures of Dispersion (Range, Variance, Standard Deviation): Measures of dispersion (or spread)
describe how much the data varies or how spread out the data points are around the center (mean). Two
datasets can have the same mean but very different levels of spread.
Why are they important in Chemistry?
o Precision: They quantify the precision (reproducibility) of your measurements. A small
spread means high precision.
o Uncertainty: They are used to express the uncertainty in your final result.
o Quality: A low spread in product quality (e.g., tablet weight) indicates a well-controlled
manufacturing process.
The Range: The range is the difference between the largest (maximum) and smallest (minimum) values
in the dataset.
Formula: Range = Maximum Value - Minimum Value
Example:
o Melting point data: 154.5, 154.9, 155.0, 155.8, 156.1 °C.
o Maximum = 156.1 °C, Minimum = 154.5 °C.
o Range = 156.1 - 154.5 = 1.6 °C.
Explanation:
o Advantage: Very simple to calculate.
o Disadvantage: It depends only on the two most extreme values and ignores all the
others. It is highly sensitive to outliers. A single mistake can make the range look very
large.

Measures of Dispersion (Variance and Standard Deviation)

The Variance The variance is the average of the squared differences of each data point from the mean. It
measures how far the data points are spread out from their average value. Because the differences are
squared, the variance is always a positive number.
Formula:
o Sample Variance (s²): s² = Σ(xᵢ - x̄)² / (n - 1)
o Population Variance (σ²): σ² = Σ(xᵢ - μ)² / N
o Where:
▪ s² is the sample variance.
▪ σ² (sigma squared) is the population variance.
▪ (xᵢ - x̄) is the deviation of each value from the mean.
• Why (n-1) for sample variance? This is called Bessel's correction. Using (n-1) instead of (n)
gives a better, unbiased estimate of the population variance based on a sample. Think of it as losing
one "degree of freedom" because we used the sample data to calculate the mean (x̄) first.
• Example Calculation (Sample Variance s²): Let's use our melting point data: 155.0, 155.8, 154.5,
156.1, 154.9 °C. We already calculated the mean, x̄ = 155.26°C.
1. Calculate each deviation from the mean: (xᵢ - x̄)
▪ 155.0 - 155.26 = -0.26
▪ 155.8 - 155.26 = 0.54
▪ 154.5 - 155.26 = -0.76
▪ 156.1 - 155.26 = 0.84
▪ 154.9 - 155.26 = -0.36
2. Square each deviation: (xᵢ - x̄)²
▪ (-0.26)² = 0.0676
▪ (0.54)² = 0.2916
▪ (-0.76)² = 0.5776
▪ (0.84)² = 0.7056
▪ (-0.36)² = 0.1296
3. Sum the squared deviations: Σ(xᵢ - x̄)²
▪ 0.0676 + 0.2916 + 0.5776 + 0.7056 + 0.1296 = 1.772
4. Divide by (n - 1): n=5, so n-1=4.
▪ s² = 1.772 / 4 = 0.443 (°C)²
• Explanation: The problem with variance is its units. The unit of variance is the square of the
original unit (e.g., (°C)², (grams)²). This is not intuitively easy to understand. This is why we need
the standard deviation.

The Most Important Measure of Dispersion

The Standard Deviation: The standard deviation is the square root of the variance. It is the most
important and commonly used measure of spread. It tells us the typical distance of the data points from
the mean.
Formula:
o Sample Standard Deviation (s): s = √[Σ(xᵢ - x̄)² / (n - 1)] = √(s²)
o Population Standard Deviation (σ): σ = √[Σ(xᵢ - μ)² / N] = √(σ²)
Example Calculation:
o From the previous example, we calculated the sample variance, s² = 0.443 (°C)².
o The sample standard deviation is: s = √(0.443) ≈ 0.666 °C.
Interpretation:
o The standard deviation of 0.67°C means that, typically, the individual melting point
measurements are about 0.67°C away from the average melting point of 155.26°C.
o A small standard deviation indicates that the data points are clustered closely around the
mean (high precision).
o A large standard deviation indicates that the data points are spread out over a wide range
(low precision).

Relative Standard Deviation (RSD) and Coefficient of Variation (CV): The Relative Standard
Deviation (RSD) is a measure of precision expressed as a percentage of the mean. It is also known as the
Coefficient of Variation (CV). It allows you to compare the precision of datasets that have different units
or very different means.
Formula:
o %RSD or CV = (s / x̄) × 100%
Example:
o For our melting point data: s ≈ 0.666 °C, x̄ = 155.26 °C.
o %RSD = (0.666 / 155.26) × 100% ≈ 0.43%
Explanation:
Why is it useful? Imagine you are comparing the precision of two methods: one for measuring high
concentrations of salt (in grams/L) and one for measuring trace lead (in micrograms/L). Their standard
deviations will be very different because the magnitudes are different. The %RSD standardizes the
precision, allowing for a fair comparison. A lower %RSD means better relative precision.

Practical Calculation and Summary

Step-by-Step Calculation of Mean and Standard Deviation
Let's calculate everything for a new, simple dataset. You measure the concentration of an solution (in
mol/L) four times: 0.101, 0.103, 0.098, 0.106.
Step Description Calculation Result
1 List the data (xᵢ) 0.101, 0.103, 0.098, 0.106 n=4
2 Calculate the Mean (x̄) (0.101+0.103+0.098+0.106) / 4 x̄ = 0.102
3 Find deviations (xᵢ - x̄) 0.101-0.102 = -0.001 ...
0.103-0.102 = 0.001
0.098-0.102 = -0.004
0.106-0.102 = 0.004
4 Square deviations (xᵢ - (-0.001)² = 0.000001 ...
x̄)² (0.001)² = 0.000001
(-0.004)² = 0.000016
(0.004)² = 0.000016
5 Sum squared Σ(xᵢ - x̄)² = Σ = 0.000034
deviations 0.000001+0.000001+0.000016+0.000016
6 Calculate Variance (s²) s² = 0.000034 / (4 - 1) = 0.000034 / 3 s² ≈ 0.0000113
7 Calculate Std. Dev. (s) s = √(0.0000113) s ≈ 0.00336
8 Calculate %RSD %RSD = (0.00336 / 0.102) × 100% %RSD ≈
3.29%
Final Result: The concentration is 0.102 ± 0.003 mol/L (often reported as mean ± standard deviation),
with a relative precision of about 3.3%.

Summary of Key Formulas

This page is a quick reference sheet for all the formulas learned.
Measure Symbol Formula Use
Sample Mean x̄ x̄ = (Σxᵢ) / n The average value.
Population Mean μ μ = (Σxᵢ) / N The true average of the entire population.
Median - Middle value of Robust center, not affected by outliers.
ordered data.
Sample Variance s² s² = Σ(xᵢ - x̄)² / (n - Average squared deviation from the mean.
1)
Population Variance σ² σ² = Σ(xᵢ - μ)² / N True variance of the population.
Sample Standard s s = √[ Σ(xᵢ - x̄)² / (n- Typical deviation from the mean. Most
Deviation 1) ] important measure of spread.
Population Standard σ σ = √[ Σ(xᵢ - μ)² / N True standard deviation of the population.
Deviation ]
Relative Standard %RSD or %RSD = (s / x̄) × Compares precision across different
Deviation CV 100% datasets.

Key Takeaways and Looking Ahead

Descriptive Statistics are used to summarize and describe the main features of a dataset.
Central Tendency (Mean, Median, Mode) tells you where the center of your data is.
o Mean is best for symmetric data without outliers.
o Median is robust and best for skewed data or data with outliers.
Dispersion (Range, Variance, Standard Deviation) tells you how spread out the data is.
o Standard Deviation (s) is the most useful measure, expressing spread in the original data
units.
o %RSD/CV is used to compare the precision of different measurements.
Reporting Results: In chemistry, it is standard practice to report a result as Mean ± Standard
Deviation (e.g., Yield = 65.2% ± 1.5%) to communicate both the average value and its precision.
Looking Ahead: These tools help you describe one set of data. Next, we will move into Inferential
Statistics, where you will learn how to:
• Estimate Confidence Intervals: To say, "I am 95% confident that the true concentration lies
between 0.099 and 0.105 mol/L."
• Perform Hypothesis Tests: Like the t-test, to determine if the difference between the means of
two experiments is statistically significant or just due to random chance.

Analytical Chem
No ratings yet
Analytical Chem
188 pages
Sampling and Errors - Analytical Chemistry
No ratings yet
Sampling and Errors - Analytical Chemistry
37 pages
Errors in Chemical Analyses
No ratings yet
Errors in Chemical Analyses
6 pages
Unit 2 Notes 11th
No ratings yet
Unit 2 Notes 11th
6 pages
Chem352 Unit 1
No ratings yet
Chem352 Unit 1
32 pages
Answer of The Maths Homework
No ratings yet
Answer of The Maths Homework
11 pages
Lecture Note CHM 242 2021
No ratings yet
Lecture Note CHM 242 2021
33 pages
CHE 2101 Analytical Chemistry KAB
No ratings yet
CHE 2101 Analytical Chemistry KAB
32 pages
Analytical Chemistry Essentials
No ratings yet
Analytical Chemistry Essentials
10 pages
Analytical Chemistry Study Notes Extraction
No ratings yet
Analytical Chemistry Study Notes Extraction
19 pages
Analytical Notes
No ratings yet
Analytical Notes
25 pages
9701 Chemistry Paper 5
No ratings yet
9701 Chemistry Paper 5
10 pages
LECTURE Note 213
No ratings yet
LECTURE Note 213
57 pages
Tools of Analytical Chemistry
No ratings yet
Tools of Analytical Chemistry
31 pages
Statistics For Anal. Chem. - Lecture Notes - Xu Ly So Lieu
100% (3)
Statistics For Anal. Chem. - Lecture Notes - Xu Ly So Lieu
157 pages
Midterms Notes (Analytical Chemistry)
No ratings yet
Midterms Notes (Analytical Chemistry)
5 pages
APP601S Chapters 1 To 3 - 015524
No ratings yet
APP601S Chapters 1 To 3 - 015524
73 pages
Che2614 Learning Units 1-4
No ratings yet
Che2614 Learning Units 1-4
51 pages
C. Analytical Chemistry
No ratings yet
C. Analytical Chemistry
10 pages
Basic Terms/Concepts IN Analytical Chemistry
No ratings yet
Basic Terms/Concepts IN Analytical Chemistry
53 pages
Practice 3. Measurements and Errors
No ratings yet
Practice 3. Measurements and Errors
8 pages
CHE2626 Practical Manual
No ratings yet
CHE2626 Practical Manual
39 pages
Chemistry 3 Unit 1 Reviewer
No ratings yet
Chemistry 3 Unit 1 Reviewer
5 pages
Analytical Chemistry Basics
No ratings yet
Analytical Chemistry Basics
74 pages
Wold Chemometrics History
No ratings yet
Wold Chemometrics History
7 pages
Clinical Chem PDF
No ratings yet
Clinical Chem PDF
26 pages
Eksperimental Design
No ratings yet
Eksperimental Design
15 pages
01 Introduction
No ratings yet
01 Introduction
3 pages
MINITAB for Analytical Chemists
100% (3)
MINITAB for Analytical Chemists
104 pages
Introduction To Analytical Chemistry: Analysis of Errors
No ratings yet
Introduction To Analytical Chemistry: Analysis of Errors
131 pages
4 6003574465387038522 PDF
No ratings yet
4 6003574465387038522 PDF
14 pages
Ana Chem Reviewer
No ratings yet
Ana Chem Reviewer
12 pages
Accuracy and Precision L2
No ratings yet
Accuracy and Precision L2
2 pages
Quality Control
No ratings yet
Quality Control
4 pages
Major Unit-1 Preparation of Graph and Standard Curve
No ratings yet
Major Unit-1 Preparation of Graph and Standard Curve
21 pages
Module 1A Notes Introduction To Statistical Analysis For Chemistry Students
No ratings yet
Module 1A Notes Introduction To Statistical Analysis For Chemistry Students
5 pages
Cbio Lec
No ratings yet
Cbio Lec
11 pages
Analytical Chemistry1
No ratings yet
Analytical Chemistry1
23 pages
Anachem Lesson 1 5
No ratings yet
Anachem Lesson 1 5
8 pages
Analytical Chemistry Self Notes
No ratings yet
Analytical Chemistry Self Notes
7 pages
Assignment Page4
No ratings yet
Assignment Page4
1 page
Precision Is The Closeness of Results To Others Obtained in Exactly The Same Way
No ratings yet
Precision Is The Closeness of Results To Others Obtained in Exactly The Same Way
5 pages
Bchet 141 em GP
No ratings yet
Bchet 141 em GP
34 pages
The Distinction Between Mass and Weight
No ratings yet
The Distinction Between Mass and Weight
6 pages
Topic: What Is The Importance of Statistics in Chemistry
No ratings yet
Topic: What Is The Importance of Statistics in Chemistry
2 pages
Zirwa Lec 1
No ratings yet
Zirwa Lec 1
23 pages
Analytical Chemistry (Theory)
100% (1)
Analytical Chemistry (Theory)
10 pages
Topic 2 Evaluation of Experimental Data
No ratings yet
Topic 2 Evaluation of Experimental Data
64 pages
Analytical Chemistry I
100% (1)
Analytical Chemistry I
38 pages
Analytical Chemistry Essentials
No ratings yet
Analytical Chemistry Essentials
46 pages
Genetics
No ratings yet
Genetics
25 pages
Journal of Chemometrics - 2018 - Brereton - Introduction To Analysis of Variance
No ratings yet
Journal of Chemometrics - 2018 - Brereton - Introduction To Analysis of Variance
4 pages
AC Basic Chemical Analysis
No ratings yet
AC Basic Chemical Analysis
37 pages
Analytical Chemistry - Introduction
No ratings yet
Analytical Chemistry - Introduction
45 pages
Unit 2A Error Analysis and Statistics
No ratings yet
Unit 2A Error Analysis and Statistics
19 pages
Notes On Error's in Statistics
No ratings yet
Notes On Error's in Statistics
9 pages
1 Notes Chemomatrics
No ratings yet
1 Notes Chemomatrics
20 pages
0-Introduction Analytical Chemistry
No ratings yet
0-Introduction Analytical Chemistry
181 pages
Biopolymer
No ratings yet
Biopolymer
5 pages
Bs Chemistry 3rd Semester 24-28 (14!10!25)
No ratings yet
Bs Chemistry 3rd Semester 24-28 (14!10!25)
1 page
ESSAY
No ratings yet
ESSAY
6 pages
Fischer Projection
No ratings yet
Fischer Projection
29 pages
Microwave
No ratings yet
Microwave
6 pages
2nd Semester Organic Practicals
No ratings yet
2nd Semester Organic Practicals
17 pages
NOMENCLATURE
No ratings yet
NOMENCLATURE
11 pages
Aujero 2019 J. Phys. Conf. Ser. 1180 012003
No ratings yet
Aujero 2019 J. Phys. Conf. Ser. 1180 012003
10 pages
Indefinite & Definite Integration (XII)
No ratings yet
Indefinite & Definite Integration (XII)
31 pages
Neo-Riemannian Theory
No ratings yet
Neo-Riemannian Theory
3 pages
Under Her Spell Roberto Rossellini in India 1st Edition Dileep Padgaonkar Available Any Format
100% (2)
Under Her Spell Roberto Rossellini in India 1st Edition Dileep Padgaonkar Available Any Format
117 pages
John Gabriel-An Introduction To The Single Variable New Calculus 2018 PDF
No ratings yet
John Gabriel-An Introduction To The Single Variable New Calculus 2018 PDF
123 pages
Compass Surveying Basics
No ratings yet
Compass Surveying Basics
37 pages
Lec STARCCM FoundationTrainingV2.0
No ratings yet
Lec STARCCM FoundationTrainingV2.0
342 pages
3D Modeling Basics for Beginners
100% (1)
3D Modeling Basics for Beginners
31 pages
Aaron Brown: Six Degrees of Idiocy
No ratings yet
Aaron Brown: Six Degrees of Idiocy
4 pages
CE 508 Cable & Arch Influence Diagrams
No ratings yet
CE 508 Cable & Arch Influence Diagrams
5 pages
PDF Joiner
No ratings yet
PDF Joiner
24 pages
DateSheet Sessionals UG-I (Revised)
No ratings yet
DateSheet Sessionals UG-I (Revised)
1 page
Gat Eee Nba Psa-2 18eee71 Co 2021 22
No ratings yet
Gat Eee Nba Psa-2 18eee71 Co 2021 22
2 pages
Feature Selection in Machine Learning
No ratings yet
Feature Selection in Machine Learning
34 pages
Introduction To The Microscopical Study of Diatoms
100% (2)
Introduction To The Microscopical Study of Diatoms
532 pages
Scenes From Greek Drama Bruno Snell Updated 2025
No ratings yet
Scenes From Greek Drama Bruno Snell Updated 2025
162 pages
Cheat Sheet Template
No ratings yet
Cheat Sheet Template
3 pages
Mastering Chemistry
No ratings yet
Mastering Chemistry
40 pages
Mathematical Literacy P1 Nov 2016 MEMO
No ratings yet
Mathematical Literacy P1 Nov 2016 MEMO
15 pages
契约理论讲义9 拍卖
No ratings yet
契约理论讲义9 拍卖
5 pages
Grade 1 Timetable
No ratings yet
Grade 1 Timetable
2 pages
Giữa kỳ lớp 6-giải
No ratings yet
Giữa kỳ lớp 6-giải
35 pages
Theory of Architecture - Reviewer 1.1
100% (1)
Theory of Architecture - Reviewer 1.1
7 pages
Representations of Rational Functions
No ratings yet
Representations of Rational Functions
37 pages
Python 89 Codes
No ratings yet
Python 89 Codes
80 pages
Class 8 Ganit Pradnya Test Paper
No ratings yet
Class 8 Ganit Pradnya Test Paper
1 page
Business Statistics3
No ratings yet
Business Statistics3
18 pages
Binomial Distribution
100% (1)
Binomial Distribution
15 pages
Meaning and Nature of Mathematics
No ratings yet
Meaning and Nature of Mathematics
3 pages

Chemometrics (Basic Chemical Data Operation)

Uploaded by

Chemometrics (Basic Chemical Data Operation)

Uploaded by

What are Chemometrics?

Basic Concept and Scope of Statistics in Chemical Sciences

Descriptive and Inferential Statistics

Population, Sample, Observations, Data

Discrete and Continuous Variables

Sources of Data: The origin from which statistical data is obtained.

Collection of Primary and Secondary Data

Summary & Key Takeaways

Measures of Dispersion (Variance and Standard Deviation)

The Most Important Measure of Dispersion

Practical Calculation and Summary

Summary of Key Formulas

Key Takeaways and Looking Ahead

You might also like