0% found this document useful (0 votes)

624 views54 pages

Psychometric Testing Essentials

This document discusses various concepts related to norms, reliability, and interpreting psychological test scores. It covers: 1) How raw test scores are converted to standardized scores based on a norm group to allow comparison to the general population. 2) Key statistical concepts used in analyzing test scores like frequency distributions, measures of central tendency and variability, and the normal distribution. 3) Different methods of establishing reliability of test scores including test-retest, parallel forms, split-half, and internal consistency reliability. 4) Factors that can introduce errors and reduce reliability such as testing conditions, subjective scoring, and practice effects. Reliability coefficients indicate how much true score versus error is being measured.

Uploaded by

JAGATHESAN

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

624 views54 pages

Psychometric Testing Essentials

Uploaded by

JAGATHESAN

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 54

Norms and Reliability

Norm and Test Standardization

•To make sense out of individual test scores
•The raw scores are converted to some form of derived score based on
comparison to a standardization or norm group
•A norm group:
Representative of the population
Large, heterogeneous sampling
Raw Scores
•Most basic level of information provided by a psychological test
•To make the score meaningful, researcher will interpret the score by consulting
norm
•“Norm-referenced Test”
Essential Statistical Concepts
Approaches to organizing and summarizing quantitative data
Frequency distributions
Measures of central tendency
Measures of variability
The normal distribution
Skewness
Frequency Distributions
•Prepared by specifying a small number of usually equal-sized class interval and
tallying how much scores fall within each interval
•The sums of the frequencies for all intervals = N ( the total number of sample)
•Histogram – graphic representation of the same information contained in the
frequency distribution
•Frequency polygon – the frequency of the class intervals is represented by
single points rather than columns and joined by straight line
Measures of Central Tendency
•Mean – adding up all the scores and dividing by N
•Median – the middlemost score when all the scores have been ranked
•Mode – the most frequently occurring score
•For extreme example, if a distribution of scores is skewed, the median is a better
index of central tendency than the mean
Measures of Variability
•To describe the degree of dispersion
•Standard deviation (s or SD) – reflects the degree of dispersion in a group of
scores

Distribution A has larger

standard deviation
Normal Distribution
•The distribution of scores resemble a symmetrical, mathematical defined, bell-
shaped curve
Skewness
•Symmetrical or asymmetrical of a frequency distribution
•Skewed distributions usually signify that the test developer has included too few
easy items or too few hard items
Raw Scores Transformations
Transforming raw scores into more interpretable and useful forms of
information
Percentile and percentile ranks
Standard scores
T-scores and standardized scores
Normalizing standard scores
Percentile and Percentile Ranks
•A percentile expresses the percentage of persons in the standardization sample
who scored below a specific raw score.
•For example,
In an IQ test, an examinee was found out that he has an IQ of 130. IQ 130
corresponding to a percentile of 98 (P98) that means his IQ exceeds 98% of the
standardization sample.
Standard Scores
•Standard scores expresses the distance from the mean in standard deviation units.

Example,
For a normative sample, M = 50, SD = 8
A: raw score of 35
Z = = -1.88 (below average)
B: raw score of 50
Z = = 0 (exactly average)
C: raw score of 70
Z = = +2.50 (above average)
T-Score and Other Standardized Scores
•Identical to standard scores
•Expressed in positive whole numbers
•T-score has a mean = 50, standard deviation = 10
Normalizing Standard Scores
•Transmuting a nonnormal distribution into a normal distribution by conversing
percentiles to normalized standard scores
•The percentile of each raw score is used to determine its corresponding standard
score
•Normalized standard scores are nonlinear transformation so the mathematical
relationship may not hold true
•Test developers are advised to adjust the difficulty of test in order to produce a
normal distributions
Stanines, Sten and C Scale
•Stanine scale
all raw scores are converted to a single-digit system of scores ranging form 1 to 9
Mean = 5, standard deviation ≈ 2

•Sten scale
5 units above and 5 units below the mean

•C scale
Consists of 11 units
Selecting a Norm Group
•Random sampling
•Stratified random sampling
To ensure the smaller norm groups are truly representative of the population
Selecting a Norm Group (cont.)
•Age Norm – the level of test performance for each separate age group in the
normative sample
Facilitated same age comparisons

•Grade Norm – the level of test performance for each separate grade in the
normative sample
Useful in reporting school achievement levels in schoolchildren

•Local norms – derived from representative local examinees

•Subgroup norms – scores obtained from an identified subgroup
Expectancy Tables
•Portrays the established relationship between test scores and expected outcomes
on a relevant task
•Based on previous performance of a large and representative sample of examines
whose test performances and criterion outcomes reflected existing social
condition and institutional policies
Criterion-Referenced Tests
Dimension Criterion-referenced Tests Norm-referenced Tests
Purpose Compare examinees’ performance Compare examines’
to standard performance to one
another
Item content Narrow domain of skills with real- Broad domain of skills
world relevance with indirect relevance
Item selection Most items of similar difficulty Items vary widely in
level difficulty level
Interpretation of Scores usually expressed as a Scores usually expressed
scores percentage, with passing level as a standard score,
predetermined percentile, or grade
equivalent
CONCEPTS OF RELIABILITY

BY,
TIW SEOK LIAN
CINDY LOH
Definition of Reliability
•Attribute of consistency in measurement

Minimal consistency nearly perfect repeatability

Reaction time weight

Classical Test Theory
= Theory of True and Error Scores

X=T+e
X = obtained score
T = true score
e = errors of measurement
Sources of Measurement Errors
Question Wording

Item
Selecti
on
Environment
X = T + es + e u
Systematic Test
Important
Measurem Aministra Emotion
ent Error Sources tion

Test Examiner
Scoring Scorin
criteria g
Subjective
judgement
Measurement Error and Reliability
•Errors can reduce reliability
•Measurement errors are incredibly complex and varied
•measurement errors are random
•Mean error of measurement = 0
•True score and errors are uncorrelated, rTE = 0
•Errors on different tests are unrelated, r12 = 0
The Reliability Coefficient
•The ration of true score variance to the total variance of the test scores
•When the measurement error is very small, reliability coefficient, rxx approaches
1.0
•0 < rxx < 1.0
•rxx approaches 1.0, test captures minimal measurement errors and produce
consistent and reliable scores.
The Correlation Coefficient
•Express the degree of linear relationship, r between two sets of scores obtained
from the same person.

r ≈ 1.0 r ≈ -1.0

r ≈ 0.4 r ≈ -0.7

r≈0 r ≈ -0.4
Measure
Reliability

Temporal Internal
Stability Consistency
Approaches Approaches

Alternate- Split-half
Test-retest Coefficient Interscorer
forms Reliability
Reliability Alpha Reliability
Reliability

2 test administration 1 test administration

Same subjects Same subjects
Intervening time interval
Test-retest Reliability
•Straight forward method
•Conduct identical test 2 times on the same person
•For ability/ achievement test, practice, maturation or treatment effect makes the
second score higher.
•Misleading in measuring reliability of a variable that fluctuates rapidly like
mood
Alternate-Forms Reliability
•Two different forms of the same test with same specifications
•Source of error variance: item-sampling differences
•Higher cost – cost of publishing a test and put on the market
Split-half Reliability
Split-half Reliability
•Correlate the pairs of scores obtained from the equivalent halves of a test
•Higher reliability compared to test-retest method
•Major challenge: items ranked according to difficulty level. Compare odd items
versus even items.
•For measuring large questionnaire with same construct
•For shorter test, use The Spearman-Brown Formula instead of Pearson r
The Spearman-Brown formula
2r hh
r SB =
1+r hh

rSB = Estimate reliability of a full test

rhh = half test reliability
Coefficient Alpha
Coefficient Alpha
•The means of all possible split-half coefficients

Alpha = [n/(n - 1)] x [(Vart - ΣVari)/Vart]

n = number of items
Vart = variance of the whole test (standard deviation squared)
ΣVari = sum the variance for all n items

•It is an index of internal consistency = interrelatedness of individual items

•Cronbach (1951) derived this from KR20 (Kuder-Richardson formula 20)
Interscorer Reliability
•For projective tests – leave judgments to the examiner in the assignment of scores
•Two or more examiners score the sample independently, then the score are correlated
•Suitable or qualitative research
•Test manual defines appropriate training and experience required by the examiners
Item Response Theory (IRT)
•Also called Latent Traits Theory (LTT)
•It also has a collection of mathematical models and statistical tools
•Application :
analyzing items and scales,
developing homogeneous psychological measures,
measuring individual psychological constructs eg. intelligence
administering psychological tests by computer
Item Response Theory (IRT)
It includes 3 fundamental elements which is
Item Response Functions (IRFs) – mathematical functions
Item Information functions (IIFs)– reliability & measurement precision
Assumptions of invariance – 2 assumptions
Item Response Theory (IRT)
•IRT represents the field of psychometrics – provide precision over a breadth of
scales that are used to measure latent constructs, or underlying traits that are
not directly observable
•consists of a class of statistical procedures that are used to model the association
between an individual's responses to survey questions/items (in probabilistic
terms) and an underlying latent trait that is measured by the items.
•appropriate for variables such as subjective health status, treatment outcomes,
and quality of life.
Result of IRT
•The results of IRT analysis can be used to determine

whether scale items are appropriate for measuring a particular trait,

how well items in a scale "hang together"
characterize the continuum of the underlying construct,
how strongly each of the items is connected to the underlying construct.
Item Response Functions (IRFs)
•Also known as Item Characteristic Curve (ICC) – mathematical equation that
describes the relationship between the amount of latent traits an individual
possesses and the probability of giving a designated response (correct answer) to
a test item that designed to measure that construct.
•Latent traits is assumed to directly influenced the examinee’s responses to the
items on the test (design to measure the traits in questions)
ICC Curve
Information Functions (IIFs)
•Information reduces uncertainty. More info means the closer you will get to the
answer or result. Leads to more precise measurement.
•The capacity of a test item to differentiate among people.
•Certain items to differentiate among individual with low traits and certain items
to differentiate individual in high traits level.
•Item information functions can be derived from IRF.
•Item information functions can be added together to derive scale information
function.
IIFs Curve
Invariance In IRT
•Two separate but related ideas
•First, examinee position on a latent-trait scores can be estimated from the
responses to any set of test item with known IRFs.
•Second, IRFs do not depend on characteristics of a particular population. The
result of different samples might help to find-tune different parts of the IRF but
outcome should fall on the same curve. The scale of the traits exists
independently of any set of items and independently of any particular population.
N O I T CE L L O C
: ELM
PAXE
AT AD
ITEM 1 ITEM 2 ITEM 3 ITEM 4 ITEM 5 AVERAGE
PERSON 1 1 1 1 1 1 1
PERSON 2 1 1 1 1 0.8
PERSON 3 1 1 1 0.6
PERSON 4 1 1 0.4
PERSON 5 1 0.2
AVERAGE 0.8 0.6 0.4 0.2 0

ITEM 1 ITEM 2 ITEM 3 ITEM 4 ITEM 5 AVERAGE ITEM 6

PERSON 1 1 1 1 1 1 1 0
PERSON 2 1 1 1 1 0.8 0
PERSON 3 1 1 1 0.6 0
PERSON 4 1 1 0.4 0
PERSON 5 1 0.2 1
AVERAGE 0.8 0.6 0.4 0.2 0 0.8

PERSON 6 1 1 0 0 0 0.4
Calculate Probability
Probability =
1/ (1+exp( 1+exp(-(proficiency – difficulty )) )
ITEM 1 ITEM 2 ITEM 3 ITEM 4 ITEM 5 TSP
PERSON
1 0.55 0.6 0.65 0.69 0.73 1
PERSON TSP – tentative student
2 0.5 0.55 0.6 0.65 0.69 0.8 proficiency
PERSON
3 0.45 0.5 0.55 0.6 0.65 0.6
PERSON TID – tentative item
4 0.4 0.45 0.5 0.55 0.6 0.4 difficulty
PERSON
5 0.35 0.4 0.45 0.5 0.55 0.2
TID 0.8 0.6 0.4 0.2 0
Data Comparison
ITEM 1 ITEM 2 ITEM 3 ITEM 4 ITEM 5 TSP
PERSON 1 0.55 0.6 0.65 0.73 0.69 1
PERSON 2 0.5 0.55 0.6 0.69 0.65 0.8
PERSON 3 0.45 0.5 0.55 0.65 0.6 0.6
PERSON 4 0.4 0.45 0.5 0.6 0.55 0.4
PERSON 5 0.35 0.4 0.45 0.55 0.5 0.2
TID 0.8 0.6 0.4 0 0.2
ITEM 1 ITEM 2 ITEM 3 ITEM 4 ITEM 5 AVERAGE
PERSON 1 1 1 1 1 1 1
PERSON 2 1 1 1 1 0.8
PERSON 3 1 1 1 0.6
PERSON 4 1 1 0.4
PERSON 5 1 0.2
AVERAGE 0.8 0.6 0.4 0.2 0

Use information to further calibrate the estimation using

a computer program
Steps
•Present data in ICC curve or IRFs
•There is a mathematical way to compute how much information each ICC can
tell us. This method is called the Item Information Function (IIFs)
•Present data in IIF curve
•Form balancing by TIF curve (sum of all IIFs)
The New Rules Of Measurement
Old vs new
CTT IRT

Std error of measurement is Std error of measurement

assumed to be constant that become substantially larger at
applies to all examinee both extremes of ability

Longer test is more reliable than Shorter test can be more

shorter test reliable than longer test.
Special Circumstances In The Estimation
Of Reliability
1. Unstable characteristics
- Emotional reactivity as measured by electrodermal or galvanic skin response.
Fluctuates quickly in reaction to loud noises
Underlying thought process
Stressful environment

2. Speed and power test

 Speed test contains uniform and simple questions, reflect speed performance
Power test allow enough time but no test taker will obtain perfect score
CONT.
3. Restriction of Range
Test-retest reliability will be low if it is based on a sample of homogeneous
subjects
It will be inappropriate to estimate the reliability of an intelligence test by
administering it twice to a sample of college students.
4. Reliability of criterion-referenced tests
Test items are designed to identify specific skills.
Items tends to be of “pass/fail” variety
Variability of scores among examinees is quite minimal
Classification is important
Reliabilty Coefficients
• What is an acceptable level of reliability?
•Eg :
Individual differences in characteristics – .90
Standard tests with reliabilities of .70 can be useful
Test reliability lower than that can be useful in research
•On more practical level, acceptable standards of reliability depends on the
amount of measurement error the user can tolerate in the proposed application of
test
Thank You

Experimental Research: Factorial Design
No ratings yet
Experimental Research: Factorial Design
9 pages
APP - 79 Principles of Test Construction
No ratings yet
APP - 79 Principles of Test Construction
6 pages
Reliability in Research Methodology
No ratings yet
Reliability in Research Methodology
9 pages
Validity
No ratings yet
Validity
4 pages
NPC Catalogue 2025
No ratings yet
NPC Catalogue 2025
176 pages
Independent Groups Design - Part 1
No ratings yet
Independent Groups Design - Part 1
22 pages
Psychological-Bases-Of-Education (1)
No ratings yet
Psychological-Bases-Of-Education (1)
3 pages
Career Maturity Level Among Adolescents at Senior Secondary School Stage
100% (1)
Career Maturity Level Among Adolescents at Senior Secondary School Stage
10 pages
Parenting Scale for Adolescents
50% (2)
Parenting Scale for Adolescents
11 pages
Ncert Guidelines
No ratings yet
Ncert Guidelines
5 pages
Practical-1 Cultural Free Self Inventory-Third Edition
No ratings yet
Practical-1 Cultural Free Self Inventory-Third Edition
6 pages
Types of Tests
No ratings yet
Types of Tests
15 pages
Norms
No ratings yet
Norms
36 pages
Norms: Unit 1
No ratings yet
Norms: Unit 1
16 pages
APPSY Past PPR PDF
No ratings yet
APPSY Past PPR PDF
93 pages
Attitude Likert Scale Statements
No ratings yet
Attitude Likert Scale Statements
3 pages
Slide 6 - Test Construction and Adaptation
No ratings yet
Slide 6 - Test Construction and Adaptation
34 pages
Q.1 Write A Note On Criterion Validity, Concurrents Validity and Predictive Validity
No ratings yet
Q.1 Write A Note On Criterion Validity, Concurrents Validity and Predictive Validity
9 pages
Sikkim Tourism Policy
No ratings yet
Sikkim Tourism Policy
50 pages
Reliability, Validity & Norms
No ratings yet
Reliability, Validity & Norms
25 pages
What Is Reliability
No ratings yet
What Is Reliability
3 pages
7th APSPA Con Brochure 2025
No ratings yet
7th APSPA Con Brochure 2025
14 pages
Reliability and Validity
No ratings yet
Reliability and Validity
2 pages
Communications-People & Police Relationships-Zain Aman 27 Oct 2021
No ratings yet
Communications-People & Police Relationships-Zain Aman 27 Oct 2021
5 pages
Validity
100% (2)
Validity
17 pages
Theories of Personality According To Muslim Scholars
No ratings yet
Theories of Personality According To Muslim Scholars
13 pages
Mangal Emotional Intelligence Scale
No ratings yet
Mangal Emotional Intelligence Scale
7 pages
Scope of Developmental Psychology
No ratings yet
Scope of Developmental Psychology
8 pages
Test Construction
No ratings yet
Test Construction
13 pages
Questionnaire
No ratings yet
Questionnaire
15 pages
What Is Reliability and Its Types
No ratings yet
What Is Reliability and Its Types
6 pages
Research Methodology Validity Presentation
No ratings yet
Research Methodology Validity Presentation
22 pages
Characteristics of Learners and Their Implications 2
100% (1)
Characteristics of Learners and Their Implications 2
14 pages
Empirical Criterion Keying
100% (1)
Empirical Criterion Keying
14 pages
Urdu Translation Short Form Azka Scale 1
100% (1)
Urdu Translation Short Form Azka Scale 1
11 pages
Non Standard
100% (1)
Non Standard
13 pages
Test Development
No ratings yet
Test Development
17 pages
Rating Scales Validated For Sri Lankan Populations
No ratings yet
Rating Scales Validated For Sri Lankan Populations
9 pages
Chap 2 Psychological Testing Norms
No ratings yet
Chap 2 Psychological Testing Norms
35 pages
1.introduction To Research Methodology
No ratings yet
1.introduction To Research Methodology
79 pages
Language Testing and Assessment: Day 6 - Test Design Reliability and Validity
No ratings yet
Language Testing and Assessment: Day 6 - Test Design Reliability and Validity
45 pages
BS Course Outlines Session 2021 2025
No ratings yet
BS Course Outlines Session 2021 2025
119 pages
Principles of Good Research
No ratings yet
Principles of Good Research
2 pages
Psychology Honours: Sources of Stress: Environmental, Social, Physiological & Psychological
No ratings yet
Psychology Honours: Sources of Stress: Environmental, Social, Physiological & Psychological
15 pages
Internal and External Validity
100% (1)
Internal and External Validity
11 pages
Updated Syllabus 2014
No ratings yet
Updated Syllabus 2014
106 pages
Academic Stress
No ratings yet
Academic Stress
13 pages
Role of A Teacher in Student Personality Development at Secondary Level in District
No ratings yet
Role of A Teacher in Student Personality Development at Secondary Level in District
10 pages
India National Youth Policy 2003
No ratings yet
India National Youth Policy 2003
13 pages
Problems and Errors in Measurements by Pugazh
No ratings yet
Problems and Errors in Measurements by Pugazh
10 pages
Norms and Statistics in Testing
No ratings yet
Norms and Statistics in Testing
17 pages
Clinical Assessment Guide
No ratings yet
Clinical Assessment Guide
72 pages
Normal Curve: Key Characteristics
100% (1)
Normal Curve: Key Characteristics
4 pages
Intro to Psychology for Students
No ratings yet
Intro to Psychology for Students
13 pages
Unit III Pro-Social Behaviour
No ratings yet
Unit III Pro-Social Behaviour
9 pages
Self-Efficacy Questionnaire (General) Directions: Administer These Questions and Have Students Circle One of The Options
No ratings yet
Self-Efficacy Questionnaire (General) Directions: Administer These Questions and Have Students Circle One of The Options
1 page
Research About Self Esteem
No ratings yet
Research About Self Esteem
50 pages
Psychological Testing - Final
No ratings yet
Psychological Testing - Final
25 pages
CHAPTER 4 Norms and Reliability
No ratings yet
CHAPTER 4 Norms and Reliability
43 pages
Downloadfile 9
No ratings yet
Downloadfile 9
32 pages
Abstract BI BM
No ratings yet
Abstract BI BM
3 pages
Let's Talk. (Textbook Page: 59) : Daily Lesson Plan English Language Year 3
No ratings yet
Let's Talk. (Textbook Page: 59) : Daily Lesson Plan English Language Year 3
9 pages
ESM641 Assignment 0124
No ratings yet
ESM641 Assignment 0124
7 pages
Request To Add Name As Safety Locker Bearer - Key No 346
No ratings yet
Request To Add Name As Safety Locker Bearer - Key No 346
1 page
Soccer Nutrition Guide for Peak Performance
100% (1)
Soccer Nutrition Guide for Peak Performance
20 pages
Intelligence Theories & Tests
No ratings yet
Intelligence Theories & Tests
135 pages
Chap 1 - 5 Turnitin File
No ratings yet
Chap 1 - 5 Turnitin File
81 pages
Infant & Preschool Assessment Guide
No ratings yet
Infant & Preschool Assessment Guide
91 pages
Female Students Perform Better Than Male Students in Language Classes
No ratings yet
Female Students Perform Better Than Male Students in Language Classes
3 pages
AbstractforCairo Selnne2
No ratings yet
AbstractforCairo Selnne2
1 page
Temple Donation Appeal 2021/2023
No ratings yet
Temple Donation Appeal 2021/2023
2 pages
Gym Set
No ratings yet
Gym Set
6 pages
Mastery Learning for Educators
100% (1)
Mastery Learning for Educators
2 pages
Student Group Assignments
No ratings yet
Student Group Assignments
20 pages
Business Development Executive
No ratings yet
Business Development Executive
1 page
PETTLEP Imagery and Tennis Service Performance: An Applied Investigation
No ratings yet
PETTLEP Imagery and Tennis Service Performance: An Applied Investigation
9 pages
Assignment International Business MIB603 0519
No ratings yet
Assignment International Business MIB603 0519
12 pages
Assessment For Group Project
No ratings yet
Assessment For Group Project
1 page
It Programs
100% (1)
It Programs
40 pages
KC-20VS Service Manual
No ratings yet
KC-20VS Service Manual
8 pages
Hydraulic Cylinder
No ratings yet
Hydraulic Cylinder
30 pages
Chauvin CA6240
No ratings yet
Chauvin CA6240
2 pages
Rohith Final Internship
No ratings yet
Rohith Final Internship
22 pages
Conjestive Heart Failure
No ratings yet
Conjestive Heart Failure
61 pages
схема и сервис мануал на английском Sharp LC-39LE440U PDF
No ratings yet
схема и сервис мануал на английском Sharp LC-39LE440U PDF
88 pages
Draft Prospectus-Bdpl (26.12.19)
No ratings yet
Draft Prospectus-Bdpl (26.12.19)
300 pages
Investing in Hedge Funds
No ratings yet
Investing in Hedge Funds
4 pages
Academic Research Paper and IPR Solutions: Made by Tanmay and Akshay
100% (1)
Academic Research Paper and IPR Solutions: Made by Tanmay and Akshay
51 pages
Phil
No ratings yet
Phil
2 pages
Consumer Expectations of Services
No ratings yet
Consumer Expectations of Services
7 pages
Tle 2 Syllabus
100% (1)
Tle 2 Syllabus
5 pages
Labor Case: Fallado vs. Europcar Phils.
No ratings yet
Labor Case: Fallado vs. Europcar Phils.
7 pages
THC 8 Syllabus 2022 1
No ratings yet
THC 8 Syllabus 2022 1
14 pages
Conservation and Management of Tropical Rainforests An Integrated Approach To Sustainability 2nd Edition Bruenig
No ratings yet
Conservation and Management of Tropical Rainforests An Integrated Approach To Sustainability 2nd Edition Bruenig
63 pages
Appeal in Labor Cases
0% (1)
Appeal in Labor Cases
2 pages
Examining The Importance of STEM Education in Enhancing Student Outcomes From The Perspective of ACLC Teachers
No ratings yet
Examining The Importance of STEM Education in Enhancing Student Outcomes From The Perspective of ACLC Teachers
33 pages
List of High Schools, Ludhiana
No ratings yet
List of High Schools, Ludhiana
12 pages
Pak Qatar Invoice ME
No ratings yet
Pak Qatar Invoice ME
1 page
CP4291 IOT LAb MANUAL-1
No ratings yet
CP4291 IOT LAb MANUAL-1
37 pages
Msc-International-Business .. Ulster
No ratings yet
Msc-International-Business .. Ulster
9 pages
Basalte Brochure Basalte Home en
No ratings yet
Basalte Brochure Basalte Home en
40 pages
UL Listing July 2011
No ratings yet
UL Listing July 2011
2 pages
1 s2.0 S0278691508003001 Main
No ratings yet
1 s2.0 S0278691508003001 Main
5 pages
6 To 7 Artificial Intelligence Lesson Plan
No ratings yet
6 To 7 Artificial Intelligence Lesson Plan
5 pages
Unit-5 C QB With Ans
No ratings yet
Unit-5 C QB With Ans
33 pages
Legal Precedents in Transport Liability
No ratings yet
Legal Precedents in Transport Liability
33 pages
Teacher Performance Review Form
No ratings yet
Teacher Performance Review Form
13 pages
Consultez Le Discours de Xavier-Luc Duval Dans Son Intégralité
No ratings yet
Consultez Le Discours de Xavier-Luc Duval Dans Son Intégralité
69 pages

Psychometric Testing Essentials

Uploaded by

Psychometric Testing Essentials

Uploaded by

Norms and Reliability

Norm and Test Standardization

Distribution A has larger

•Local norms – derived from representative local examinees

Minimal consistency nearly perfect repeatability

Reaction time weight

2 test administration 1 test administration

rSB = Estimate reliability of a full test

Alpha = [n/(n - 1)] x [(Vart - ΣVari)/Vart]

•It is an index of internal consistency = interrelatedness of individual items

whether scale items are appropriate for measuring a particular trait,

ITEM 1 ITEM 2 ITEM 3 ITEM 4 ITEM 5 AVERAGE ITEM 6

Use information to further calibrate the estimation using

Std error of measurement is Std error of measurement

Longer test is more reliable than Shorter test can be more

2. Speed and power test

You might also like