8000 Merge pull request #9 from kgalinsky/master · DReichLab/EIG@1d9d2a2 · GitHub
[go: up one dir, main page]

Skip to content

Commit

Permalink
Merge pull request #9 from kgalinsky/master
Browse files Browse the repository at this point in the history
Eigensoft 6.1.2
  • Loading branch information
kgalinsky authored Jun 27, 2016
2 parents 942791b + 1b1cc62 commit 1d9d2a2
Show file tree
Hide file tree
Showing 197 changed files with 36,341 additions and 119,695 deletions.
32 changes: 32 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
@@ -0,0 +1,32 @@
# Object files
*.o
*.ko
*.obj
*.elf

# Precompiled Headers
*.gch
*.pch

# Libraries
*.lib
*.a
*.la
*.lo

# Shared objects (inc. Windows DLLs)
*.dll
*.so
*.so.*
*.dylib

# Executables
*.exe
*.out
*.app
*.i*86
*.x86_64
*.hex

# Debug files
*.dSYM/
42 changes: 21 additions & 21 deletions EIGENSTRAT/README
10000
Original file line number Diff line number Diff line change
@@ -1,14 +1,14 @@
See ../README for high-level documentation of the entire EIGENSOFT package.

This file contains documentation of EIGENSTRAT programs:
smartpca.perl: run PCA on input genotype data (calls smartpca)
smarteigenstrat.perl: run EIGENSTRAT stratification correction. This program
smartpca.pl: run PCA on input genotype data (calls smartpca)
smarteigenstrat.pl: run EIGENSTRAT stratification correction. This program
supports all 5 file formats, and supports quantitative phenotypes.
gc.perl: apply Genomic Control (Devlin and Roeder, 1999) to the
gc.pl: apply Genomic Control (Devlin and Roeder, 1999) to the
association statistics computed by EIGENSTRAT.

We note that the programs eigenstrat and eigenstratQTL of EIGENSOFT version 2.0
have been replaced by smarteigenstrat.perl. However, we have retained the old
have been replaced by smarteigenstrat.pl. However, we have retained the old
programs for backwards compatibility (see below).

See ./example.perl and ./exampleQTL.perl for toy examples using our programs.
Expand All @@ -23,13 +23,13 @@ http://www.hsph.harvard.edu/faculty/alkes-price/files/eigensoftfaq.htm

------------------------------------------------------------------------

DOCUMENTATION of smartpca.perl program:
DOCUMENTATION of smartpca.pl program:

This program calls the smartpca program (see ../POPGEN/README).
For this to work, the bin directory containing smartpca MUST be in your path.
See ./example.perl for a toy example.

../bin/smartpca.perl
../bin/smartpca.pl
-i example.geno : genotype file in any format (see ../CONVERTF/README)
-a example.snp : snp file in any format (see ../CONVERTF/README)
-b example.ind : indiv file in any format (see ../CONVERTF/README)
Expand Down Expand Up @@ -77,15 +77,15 @@ large correlations with phenotype indicate highly mismatched cases vs. controls

------------------------------------------------------------------------

DOCUMENTATION of smarteigenstrat.perl program: [run smartpca.perl program first]
DOCUMENTATION of smarteigenstrat.pl program: [run smartpca.pl program first]

This program is a PERL wrapper which calls the C program smarteigenstrat.
Note: the bin directory containing smarteigenstrat MUST be in your path.
See ./example.perl for a toy example.
We recommend smarteigenstrat.perl for users who prefer command-line flags.
We recommend smarteigenstrat.pl for users who prefer command-line flags.
However, users who prefer parameter files can run smarteigenstrat instead.

../bin/smarteigenstrat.perl
../bin/smarteigenstrat.pl
-i example.geno : genotype file in any format (see ../CONVERTF/README)
-a example.snp : snp file in any format (see ../CONVERTF/README)
-b example.ind : individual file in any format (see ../CONVERTF/README).
Expand All @@ -97,7 +97,7 @@ However, users who prefer parameter files can run smarteigenstrat instead.
should be real numbers. The value -100.0 signifies "missing data".
If -q is set to NO, these values should be "Case" or "Control".
The default value for the -q parameter is NO.
-p example.pca : input file of principal components (output of smartpca.perl)
-p example.pca : input file of principal components (output of smartpca.pl)
-k 1 : (Default is 10) number of principal components along which to
correct for stratification. Note that l must be less than or equal to
the number of principal components reported in the file example.pca.
Expand All @@ -116,15 +116,15 @@ However, users who prefer parameter files can run smarteigenstrat instead.
consider to be appropriate.
-l example.log : standard output file

The running time of smarteigenstrat.perl is very fast compared to
the running time of smartpca.perl.
The running time of smarteigenstrat.pl is very fast compared to
the running time of smartpca.pl.

------------------------------------------------------------------------

DOCUMENTATION of smarteigenstrat program:

Users who prefer parameter files to command-line flags can run the
C program smarteigenstrat instead of the PERL wrapper smarteigenstrat.perl.
C program smarteigenstrat instead of the PERL wrapper smarteigenstrat.pl.
The syntax of smarteigenstrat is "../bin/smarteigenstrat -p parfile"

DESCRIPTION OF EACH PARAMETER in parameter file for smarteigenstrat:
Expand All @@ -137,7 +137,7 @@ outputname: name of output file of chisq association statistics
numeigs: number of principal components to correct for
qtmode: YES for quantitative phenotype, NO (default) otherwise

For details, see documentation of smarteigenstrat.perl above.
For details, see documentation of smarteigenstrat.pl above.

OPTIONAL PARAMETERS:

Expand All @@ -148,9 +148,9 @@ hashcheck: If set to YES and the input genotype file is in PACKEDANCESTRYMAP

------------------------------------------------------------------------

DOCUMENTATION of gc.perl: [run smartpca.perl & smarteigenstrat.perl first]
DOCUMENTATION of gc.pl: [run smartpca.pl & smarteigenstrat.pl first]

../bin/gc.perl infile outfile
../bin/gc.pl infile outfile
infile is input file of chisq statistics produced by eigenstrat program.
It contains both uncorrected and EIGENSTRAT statistics for each SNP.
outfile is output file. It lists
Expand All @@ -160,7 +160,7 @@ Computation of lambda is as described in Devlin and Roeder 1999.
A lambda above 1 indicates inflation in chisq statistics.
By definition, lambda is not allowed to be less than 1.

Running time of the gc.perl program is very fast.
Running time of the gc.pl program is very fast.

------------------------------------------------------------------------

Expand All @@ -179,7 +179,7 @@ our old program eigenstratQTL (for quantitative phenotypes) from that release,
which have now been replaced by our new program smarteigenstratQTL.perl.
See ./example.oldstyle.perl for an example involving the eigenstrat program.

Most users will want to use our new program smarteigenstrat.perl, which has
Most users will want to use our new program smarteigenstrat.pl, which has
added functionality. However, users wishing to understand or modify our
source code may find it advantageous to instead work with the simpler
eigenstrat programs.
Expand All @@ -190,10 +190,10 @@ BACKWARDS COMPATIBILITY with 07/23/06 EIGENSTRAT release: pca program

For backwards compatibility with the 07/23/06 EIGENSTRAT release, we have also
included our old program pca used in that release, which has now been replaced
by our new program smartpca.perl. See ./example.oldstyle.perl for an example
by our new program smartpca.pl. See ./example.oldstyle.perl for an example
involving the pca program.

Most users will want to use our new program smartpca.perl, which calls
Most users will want to use our new program smartpca.pl, which calls
the smartpca program and has added functionality. However, users wishing
to understand or modify our source code may find it advantageous to instead
work with the simpler pca program.
Expand Down Expand Up @@ -232,7 +232,7 @@ Thus, if running on much smaller data sets, it is necessary to exclude a
candidate marker from the set of markers used to infer principal components
used to correct for stratification at the candidate marker. In the case of
a data set which uses ancestry-informative markers to infer ancestry, a
good way to do this is to run smartpca.perl to infer principal components
good way to do this is to run smartpca.pl to infer principal components
*only* using ancestry-informative markers, excluding the candidate markers
(and excluding any ancestry-informative marker in LD with a candidate marker),
and then run eigenstrat on candidate markers.
Expand Down
8 changes: 8 additions & 0 deletions EIGENSTRAT/example.pcaselection.chisq
Original file line number Diff line number Diff line change
@@ -0,0 +1,8 @@
Chisq PCASELECTION
rs0000 0.65619 0.935652
rs1111 1.05351 4.2363
rs2222 0.698978 1.73221
rs3333 2.39363 0.295432
rs4444 1.6562 0.52183
rs5555 0.770901 0.140059
rs6666 1.57489 0.942783
5 changes: 5 additions & 0 deletions EIGENSTRAT/example.pcaselection.par
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
genotypename: example.geno
snpname: example.snp
indivname: example.ind
pcaname: example.pca.evec
outputname: example.pcaselection.chisq
Loading

0 comments on commit 1d9d2a2

Please sign in to comment.
0