Use of EpiData/Epi-Info
(questionnaire design and entry)
November, 2017
What is EpiData
• The initiative to create EpiData was taken
by Jens M. Lauritsen, a doctor in Funen
County, Denmark.
• It began as a tool for epidemiological research,
as part of the “Initiative for Accident
Prevention”.
What is special about EpiData
EpiData can be described in five points:
• Developed for use in research studies and oriented towards
the data management process
• User friendly
• Free of charge
• Runs on any computer, under any version of Microsoft
Windows or on Macintosh
• Exports data to many different statistical and database file
formats
Data cycle:
• Each questionnaire consists of a series of fields,
called variables
• Each completed questionnaire is called a case (or record)
• The set of records (i.e. questionnaires or cases)
is stored in one place on the computer, called the
database file
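These terms can be pictured with a small sketch. It is only an illustration of the vocabulary, not EpiData's internal file format, and the field names and values are made up for the example.

```python
# Illustrative only: field names and values are hypothetical.
# One record (case) corresponds to one completed questionnaire.
record_1 = {"ID": 1, "NAME": "ANNA", "AGE": 34}
record_2 = {"ID": 2, "NAME": "BEN", "AGE": 51}

# The data file keeps all records in one place.
data_file = [record_1, record_2]

# Each key is a variable (field); every record has the same variables.
variables = list(record_1.keys())
print(variables)       # ['ID', 'NAME', 'AGE']
print(len(data_file))  # 2
```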
EpiData provides many different
variable types:
‘Auto ID Number’ variables: They
increase by one automatically for every
new record that is entered
‘Text’ variables: they are used for storing
information consisting of text and/or
numbers
Example:
• Name: ________
• Address: ______
‘Numeric’ variables: they are used for
storing numerical information. Numeric
variables can be:
Categorical, like:
City number
Education
Marital status.
Or continuous, like:
Age
Weight
Height
‘Boolean, Logical, or Yes/No’ variables:
They are used for storing data with two
possible states such as whether a
respondent is male or female
‘Date’ variables: They are used to store
dates. Dates can be entered in the
American (i.e. mm/dd/yyyy) or European (i.e.
dd/mm/yyyy) format
‘Soundex’ variables: They are special text
variables that apply soundex coding rules
to text data as it is entered
Example: A-999
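To show what a soundex code looks like in practice, here is a minimal sketch of the standard American Soundex rules, formatted as a letter, a dash and three digits to match the A-999 pattern above. It is an assumption that EpiData follows these rules exactly; the function name `soundex` is chosen for this example only.

```python
# Standard American Soundex letter groups (assumed, not taken from EpiData).
CODES = {}
for letters, digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                       ("L", "4"), ("MN", "5"), ("R", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return a code such as 'R-500' for the given word."""
    letters = [c for c in word.upper() if c.isalpha()]
    if not letters:
        return ""
    first = letters[0]
    digits = []
    prev = CODES.get(first, "")
    for ch in letters[1:]:
        if ch in "HW":
            continue  # H and W do not separate letters with the same code
        code = CODES.get(ch, "")
        if code and code != prev:
            digits.append(code)
        prev = code  # a vowel resets prev, so a repeated code counts again
    return first + "-" + "".join(digits)[:3].ljust(3, "0")


print(soundex("Rome"), soundex("Roma"))  # R-500 R-500
```

Two spellings of the same name receive the same code, which is why soundex coding tolerates small spelling differences and hides the original text.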
How to download and install
EpiData
Go to EpiData homepage:
www.epidata.dk
Download and install EpiData
EpiData files
• .QES file
– Questionnaire
• .REC file
– Actual data
• .CHK file
– Any defined checks
• Other notes or log files
EpiData workflow
1. Define Data
2. Make Data File
3. Set up Checks
4. Enter Data
5. Document
6. Export Data
Creating Questionnaire
• Define Data
• Can either open .QES file or create one
New questionnaire
• Type in window
• Cut and paste from Word documents
• Preview questionnaire
– (click Make data file > preview data form)
Structure of questionnaire
Three sections:
– Field name (variable)
– Text describing field
– Input definition (number/ letters/ date)
Field name (variable)
• No more than 10 characters
• Begin with a letter
• No spaces or punctuation marks
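The three rules above can be checked mechanically. The following is a minimal sketch, assuming that only letters and digits are allowed after the first letter (the slide only says "no spaces or punctuation marks"); `is_valid_field_name` is a hypothetical helper, not part of EpiData.

```python
import re

# A letter first, then up to 9 more letters or digits (10 characters in total).
FIELD_NAME = re.compile(r"[A-Za-z][A-Za-z0-9]{0,9}")


def is_valid_field_name(name: str) -> bool:
    """Check a candidate field name against the three rules."""
    return FIELD_NAME.fullmatch(name) is not None


print(is_valid_field_name("AGE"))             # True
print(is_valid_field_name("2AGE"))            # False: begins with a digit
print(is_valid_field_name("MARITAL STATUS"))  # False: contains a space
print(is_valid_field_name("QUESTIONNAIRE"))   # False: more than 10 characters
```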
Field Name (II)
• First word
– Uses first word of line
• Automatic
– EpiData generates field
names based on
question
– Uses first 10 letters
Childquest Child questionnaire <Y>
Automatic field names
• Text in curly brackets { } is used in
preference to the rest of the question
• Common words are skipped (what, the, and,
etc.)
• If the question starts with a number, “N” is
inserted before the number
Automatic field names,
examples
Question:                   Field name:
Did you {eat ice cream}     EATICECREA
What is your name?          ISYOURNAME
2. Age                      N2AGE
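The naming rules can be approximated in a few lines. This is a rough sketch rather than EpiData's actual algorithm: the list of skipped words contains only the three words named on the slide, names are cut to the 10-character limit stated for field names, and `auto_field_name` is a hypothetical helper.

```python
import re

SKIPPED = {"WHAT", "THE", "AND"}  # the slide's "etc." is not spelled out
MAX_LEN = 10                      # field names have at most 10 characters


def auto_field_name(question: str) -> str:
    """Build a field name from a question line (approximation only)."""
    braced = re.findall(r"\{([^}]*)\}", question)
    # Text in curly brackets takes precedence over the rest of the line.
    source = " ".join(braced) if braced else question
    words = re.findall(r"[A-Za-z0-9]+", source.upper())
    if not braced:
        words = [w for w in words if w not in SKIPPED]
    name = "".join(words)
    if name and name[0].isdigit():
        name = "N" + name  # a name must begin with a letter
    return name[:MAX_LEN]


print(auto_field_name("What is your name?"))  # ISYOURNAME
print(auto_field_name("2. Age"))              # N2AGE
```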
Variable type
• Define variables using “Pick List” or “Code writer”
• Choose type of variable:
– Numeric
– Text
– Date
– Soundex
– Boolean (Yes/No)
– Autonumber
Text variables
• Hold information consisting of text and/or numbers
(e.g. names, addresses)
• Upper-case text variables
– Can only hold upper-case (capital) letters
– Lower-case input is automatically converted into
upper-case text (e.g. Egypt is converted into EGYPT)
• No mathematical operations
• Length (number of characters)
• <_>
Numeric variables
• Numerical information
• Hold integers (whole numbers) or numbers
with a decimal point
• Length (number of digits, and of decimals after the decimal point)
• <#>, <##.#>
Other variables
• Boolean variables (also called logical or Yes/No
variables)
– Only two possible answers: Yes or No
– <Y>
• Date variables:
– Hold information on dates
– American format: <MM/DD/YYYY>
– European format: <DD/MM/YYYY>
• Soundex:
– Coding of words (anonymous, e.g. A-123)
– The code limits the effect of spelling errors (e.g. Rome and Roma)
– <S >
System variables
• Values generated automatically
• Today's date: the date of data entry
– <Today-dmy>
– <Today-mdy>
• Auto identification number: Counts the
records entered
– <IDNUM>
Save Questionnaire
• Preview data
• Save questionnaire
– Creates .QES file
• Create Data file
– Click Make Data file button > Make Data file
– Creates .REC file
• Questionnaire and Data file ready
• But……
Errors in data entry
• Transposition (e.g. 39 becomes 93)
• Copying errors (e.g. 0 copied as the letter “o”)
• Consistency errors: two or more responses
are contradictory (e.g. sex = male, pregnancy = yes)
• Range errors: answers outside probable or
possible values (e.g. height = 3 metres)
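Range and consistency errors can be caught by simple rules, whereas transposition and copying errors are usually only caught when they happen to produce an impossible value. The sketch below illustrates the two rule-based cases; the field names, codes and plausibility limits are assumptions made for the example.

```python
def find_errors(record: dict) -> list[str]:
    """Return a description of each rule the record violates."""
    errors = []

    # Range error: value outside plausible limits (limits are assumed here).
    height = record.get("HEIGHT_M")
    if height is not None and not (0.3 <= height <= 2.5):
        errors.append("range: HEIGHT_M outside plausible limits")

    # Consistency error: two answers that contradict each other.
    if record.get("SEX") == "M" and record.get("PREGNANT") == "Y":
        errors.append("consistency: SEX=M with PREGNANT=Y")

    return errors


print(find_errors({"SEX": "M", "PREGNANT": "Y", "HEIGHT_M": 3.0}))
print(find_errors({"SEX": "F", "PREGNANT": "Y", "HEIGHT_M": 1.65}))  # []
```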
Preventing errors
• Standardised and previously tested questionnaire
• Training the interviewers and data entry clerks
• Checking and validating the paper forms of the
questionnaires
• Checking during data entry (Check module of EpiData)
• Validation: double entry of the data by different operators
• Checking after data entry (Analysis module of Epi-Info)
Checks (I)
• Reduce errors in input
• Checks help with data entry
• Many different types
• Examples:
– Limiting entry of numbers to a specific range
– Forcing an entry to be made in a field
– Conditional jumps
– Copying the data from the previous record
– Help messages
– Conditional operations (e.g. if ... then operations)
Checks (II)
• Once Data file is created:
• Click Checks button > choose .REC file
Checks (III)
• Range, legal:
– Numeric example: 1-3, 9
– First the range, then individual values
– Text example: Tallinn, Riga, Vilnius
– Fixing only a minimum or a maximum value
• -INF-5 (all numbers less than or equal to 5)
• 0-INF (all numbers greater than or equal to 0)
• Jumps:
– Field: AGEYEARS
– 0>AGEMONTHS
– 0>AGEMONTHS, 1>ADDRESS
– 1>END
• MustEnter:
– Data must be entered in Field
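The logic behind these checks can be sketched as follows. This is not EpiData's own implementation: it only mimics the notation shown above for numeric values (text values such as city names are not handled), and the helper names and the default next field are hypothetical.

```python
import math


def parse_legal(spec: str):
    """Turn a spec such as '1-3, 9' or '-INF-5' into ranges and single values."""
    ranges, singles = [], set()
    for part in (p.strip().upper() for p in spec.split(",")):
        if part.startswith("-INF-"):
            ranges.append((-math.inf, float(part[5:])))   # only a maximum
        elif part.endswith("-INF"):
            ranges.append((float(part[:-4]), math.inf))   # only a minimum
        elif "-" in part[1:]:
            # Skip the first character so a leading minus sign is not a separator.
            low, high = part[1:].split("-", 1)
            ranges.append((float(part[0] + low), float(high)))
        else:
            singles.add(float(part))
    return ranges, singles


def is_legal(value: float, spec: str) -> bool:
    """Range/legal check: the value is in a range or is a listed value."""
    ranges, singles = parse_legal(spec)
    return value in singles or any(lo <= value <= hi for lo, hi in ranges)


def is_entered(value) -> bool:
    """MustEnter check: the field may not be left empty."""
    return value is not None and str(value).strip() != ""


# Jumps for the field AGEYEARS: value entered -> field to jump to.
JUMPS = {0: "AGEMONTHS", 1: "ADDRESS"}


def next_field(value, default_next: str) -> str:
    """Return the jump target, or the normal next field if no jump applies."""
    return JUMPS.get(value, default_next)


print(is_legal(2, "1-3, 9"), is_legal(5, "1-3, 9"))  # True False
print(next_field(0, "SEX"))                          # AGEMONTHS
```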
Checks (IV)
• Repeat:
– Show data from previous record
• Value Label
– Add text to explain values
– Click + to add label
– Format as shown
– Press F9 during data entry
to see labels
Data Entry (I)
• Click Enter data button
> choose .REC file
• Dates:
– To enter 15/5/2006, type 150506 or 15/5/06
• Value Labels:
– Press F9 to view
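The date shortcuts above amount to accepting several spellings of the same European-order date. A minimal sketch of that idea follows; how EpiData itself expands two-digit years is not stated here, so mapping them to 2000-2099 is an assumption, and `parse_entry_date` is a hypothetical helper.

```python
from datetime import date


def parse_entry_date(text: str, century: int = 2000) -> date:
    """Parse European-order input such as '150506', '15/5/06' or '15/5/2006'."""
    if "/" in text:
        day, month, year = (int(p) for p in text.split("/"))
    elif len(text) == 6 and text.isdigit():
        day, month, year = int(text[:2]), int(text[2:4]), int(text[4:])
    else:
        raise ValueError(f"unrecognised date entry: {text!r}")
    if year < 100:
        year += century  # assumption: two-digit years belong to 2000-2099
    return date(year, month, day)


print(parse_entry_date("150506"))     # 2006-05-15
print(parse_entry_date("15/5/06"))    # 2006-05-15
print(parse_entry_date("15/5/2006"))  # 2006-05-15
```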
Data Entry (II)
• Record navigation:
• Delete records:
– Click cross to delete
– Record marked for deletion, but can be
recovered
Document Tools
• File Structure
• Data entry notes ( .NOT file)
– Used to write comments during data entry, e.g. handwriting that is
difficult to read
• View Data
• List Data
• Codebook
– Basic descriptive statistics on all variables
• Validate duplicate files
– Check consistency after double entry
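The idea behind validating double entry is to compare the two data files record by record and list every field where they disagree. The sketch below illustrates this on in-memory records; it does not read .REC files, and the key field name `ID` and the function name are assumptions.

```python
def compare_entries(first: list[dict], second: list[dict], key: str = "ID"):
    """Return (record key, field, value in first, value in second) per mismatch."""
    second_by_key = {rec[key]: rec for rec in second}
    differences = []
    for rec in first:
        other = second_by_key.get(rec[key])
        if other is None:
            differences.append((rec[key], None, "present", "missing"))
            continue
        for field, value in rec.items():
            if other.get(field) != value:
                differences.append((rec[key], field, value, other.get(field)))
    # Records that exist only in the second file.
    first_keys = {rec[key] for rec in first}
    for rec in second:
        if rec[key] not in first_keys:
            differences.append((rec[key], None, "missing", "present"))
    return differences


entry_a = [{"ID": 1, "AGE": 39}, {"ID": 2, "AGE": 40}]
entry_b = [{"ID": 1, "AGE": 93}, {"ID": 2, "AGE": 40}]
print(compare_entries(entry_a, entry_b))  # [(1, 'AGE', 39, 93)]
```

Each reported difference is then resolved by going back to the paper questionnaire.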
Export to other programs
• Click Export data button
• Choose program
• Including Excel, Stata, SPSS
• For Epi-Info, open the .REC file directly
References
• Lauritsen JM, Bruus M. EpiData (version 3.1). A
comprehensive tool for validated entry and
documentation of data. The EpiData Association,
Odense, Denmark, 2004.
• Lauritsen JM, Bruus M. EpiTour - An introduction to
validated data entry and documentation of data by use of
EpiData. The EpiData Association, Odense, Denmark,
2005.
www.epidata.dk