Statistical Methods for Bioinformatics
II-3: Variable Selection
Today
Dimension reduction and factor analysis
Principal Component Regression
Partial Least Squares
Considerations in High Dimensions
High Dimensionality is associated with elevated Variance
Learning in high dimensions tends to produce a larger “variance”
component in the error. Solutions up to now:
Subset selection
Ridge/Lasso regularization.
From linear regression you may remember that the coefficients of
highly correlated variables have high standard errors.
Can we not combine correlated variables into a single variable?
The problem of co-linearity
Dimension Reduction Techniques and Factor Analysis
Dimension reduction and factor analysis try to describe the
variability in a dataset using a reduced set of dimensions:
correlated variables are mapped onto unobserved “factors”
A multitude of approaches: Principal Component Analysis,
Latent-variable models, Non-negative matrix factorization
Important for many fields e.g. computer vision, text mining,
psychology
Both exploratory and hypothesis driven analyses
Active field of research, new methodologies under
development.
Example: text mining
Imagine there are 10000 documents about the protein p53
You know the, say, 10000 distinct words that are used and the
frequency with which they are used
This gives a sparse 10000 × 10000 matrix representing the
literature about the gene
Using Non-negative matrix factorization the word
dimensionality is reduced by identifying words with similar
occurrence patterns.
The condensed variables can represent the topics discussed, and
can be used to:
summarize the literature
classify documents by topic
improve document retrieval (by using the topics)
find genes active in similar processes
Factor Analysis in Psychology
Factor: An underlying, but unobserved, construct or
phenomenon that ’shapes’ or ’explains’ the observed variables
(at least partially).
Used in psychology, for example, to define hypothetical
underlying components that explain observable traits,
e.g. personality traits
(Diagram: observed traits such as talkativeness and being sociable
and outgoing load onto the latent factor Extraversion)
The goal of factor analysis here is to explain the relations
between variables using an underlying component
On the following slide: a correlation matrix with data from Bernard
Malle; figure from the Psych253 Stanford online course, Statistical
Theory, Models, and Statistical Methodology
Finding underlying Personality factors
Reducing Dimensionality for Statistical Learning
1 Find correlated variables and map them to a smaller set of
new variables
2 Use the new variables in a regression
3 Use cross validation to find optimal number of variables.
Underlying ideas:
The dimension reduction may reduce the variance component
in the error
The variability in the data is assumed to be relevant for the
response
Principal Component Analysis
PCA reduces the dimensionality of a dataset of related
variables, while retaining as much of the variation as possible.
The set of variables is transformed to a new set of variables,
principal components, which are uncorrelated and sorted by
the variation they retain.
The first principal component of a set of features is the
normalized linear combination that has the largest variance
Z1 = φ11 X1 + φ21 X2 + . . . + φp1 Xp
For this we work with the covariance matrix C = X^T X/n, with n
observations of the p-variable vector Xi . The optimization problem
is to find max_φ φ^T C φ subject to Σ_{j=1}^p φj1² = 1 (normalized)
Principal Component Analysis
Procedural description:
1 Find the set of weights φj1 maximizing (1/n) Σi (Σj φj1 xi,j )²
(assuming all X are centered around 0, so the average of the
scaled X is also 0, this formula represents the variance of Z1 )
2 Repeat up to φjp , ensuring no correlation between the
weighting sets
Principal Component Analysis
Finding the principal components
You start with n observations of a p-variable vector
Xi = (Xi,1 , . . . , Xi,p )^T
Center all p variables around zero
Construct the covariance matrix C = X^T X/n. Entries in the
matrix are Cl,k = Cov(X·,l , X·,k ), which is estimated by
Cl,k = (1/n) Σi (xi,l − µl )(xi,k − µk ). µ represents the mean
(set to 0 here).
Decompose C to find its eigenvectors. It can be shown that C
can be decomposed as C = VDV^T , with V an orthonormal
matrix, i.e. all columns have length 1 and are orthogonal to
each other. The columns of V are the eigenvectors of C ; D is
a diagonal matrix with the eigenvalues.
The eigenvectors define the mapping vectors φ of the principal
components, the eigenvalues in D give the variance explained
by every component
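The steps above can be sketched in a few lines of numpy; the data here are toy values (an assumption for illustration, not from the lecture):

```python
import numpy as np

rng = np.random.default_rng(0)
n, p = 200, 3
X = rng.normal(size=(n, p))
X[:, 1] = X[:, 0] + 0.1 * rng.normal(size=n)   # make two variables correlated

X = X - X.mean(axis=0)            # 1. center all p variables
C = X.T @ X / n                   # 2. covariance matrix C = X^T X / n
D, V = np.linalg.eigh(C)          # 3. eigendecomposition C = V D V^T
order = np.argsort(D)[::-1]       # sort components by variance explained
D, V = D[order], V[:, order]

Z1 = X @ V[:, 0]                  # scores on the first principal component
# The variance of Z1 equals the largest eigenvalue d1:
print(np.isclose(Z1.var(), D[0]))
```

Note that `eigh` returns eigenvalues in ascending order, hence the explicit re-sorting so that component 1 carries the most variance.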
Principal Component Regression
After performing PCA, you choose a number of components M to
use in a regression. The fitted coefficients θ relate to the
non-reduced fit as: βj = Σ_{m=1}^M θm φj,m . This puts a constraint on
the coefficients. PCR is related in effect and form to ridge regression,
but with a discrete form for the penalty.
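A minimal numeric sketch of this relation (toy data; the choice M = 2 is arbitrary): regressing on the first M score vectors and mapping θ back through the loadings gives the same fitted values.

```python
import numpy as np

rng = np.random.default_rng(1)
n, p, M = 100, 5, 2
X = rng.normal(size=(n, p))
X = X - X.mean(axis=0)
y = X @ rng.normal(size=p) + rng.normal(size=n)

C = X.T @ X / n
D, V = np.linalg.eigh(C)
V = V[:, np.argsort(D)[::-1]]     # columns phi_1..phi_p, by variance

Phi = V[:, :M]                    # keep M components
Z = X @ Phi                       # reduced predictors Z_m
theta, *_ = np.linalg.lstsq(Z, y, rcond=None)
beta = Phi @ theta                # beta_j = sum_m theta_m phi_{j,m}
yhat = X @ beta                   # identical to the fit on Z
print(np.allclose(yhat, Z @ theta))
```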
Principal Component Regression
Principal Component Regression: Considerations
Summary
Predictors are mapped through a linear transformation to a
reduced set of predictors: Zm = Σ_{j=1}^p φj,m Xj
Normal regression with this smaller set of predictors.
The fitted coefficients relate to the non-reduced fit as
βj = Σ_{m=1}^M θm φj,m
1 PCR works best if the PCA transformation captures most of
the variance in few dimensions AND this variance is
associated with the response
2 It is a linear mapping approach, so strong non-linear relations
will not be captured well.
3 Because PCA combines variables, the scale of each variable
influences the outcome. If not directly comparable,
standardize the variables.
4 PCA works best on normally distributed variables; strong
departures can make PCA perform poorly
The connection between Ridge and PCR
If we take a N × p predictor matrix X , with zero-centered
variables, we can apply the Singular Value Decomposition
(SVD)
X = USV T
U and V are N × p and p × p orthogonal matrices, S is a diagonal
matrix with values si . Then the least squares fit for response y is:
ŷ = UU T y
Ridge is given by:
ŷ = U diag( si² / (si² + λ) ) U^T y
PCR by:
ŷ = Udiag {1, . . . , 1, 0, . . . , 0} U T y
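These identities can be checked numerically. The sketch below uses toy data and an arbitrary λ (both assumptions for illustration), and compares each expression with the usual closed-form fit:

```python
import numpy as np

rng = np.random.default_rng(2)
N, p, lam = 50, 4, 3.0
X = rng.normal(size=(N, p))
X = X - X.mean(axis=0)
y = rng.normal(size=N)

U, s, Vt = np.linalg.svd(X, full_matrices=False)   # X = U S V^T

# Least squares: yhat = U U^T y equals the fit from the normal equations
yhat_ls = U @ (U.T @ y)
beta_ls, *_ = np.linalg.lstsq(X, y, rcond=None)
print(np.allclose(yhat_ls, X @ beta_ls))

# Ridge: yhat = U diag(s_i^2/(s_i^2+lam)) U^T y equals the closed form
yhat_ridge = U @ np.diag(s**2 / (s**2 + lam)) @ U.T @ y
beta_ridge = np.linalg.solve(X.T @ X + lam * np.eye(p), X.T @ y)
print(np.allclose(yhat_ridge, X @ beta_ridge))
```

Ridge shrinks each component by the factor si²/(si² + λ), while PCR keeps the first components entirely (factor 1) and drops the rest (factor 0), which is the “discrete penalty” mentioned earlier.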
The connection between Ridge and PCR
ŷ = U diag( si² / (si² + λ) ) U^T y
What is s? Remember the PCA formulation:
C = X^T X/(n − 1) = VDV^T . Together with X = USV^T we can
derive that D = S²/(n − 1). Hence the singular values s are related
to the eigenvalues of the covariance matrix as follows:
di = si² / (n − 1)
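A quick numeric check of this relation between singular values and covariance eigenvalues (toy data, assumed for illustration):

```python
import numpy as np

rng = np.random.default_rng(3)
n, p = 60, 4
X = rng.normal(size=(n, p))
X = X - X.mean(axis=0)

s = np.linalg.svd(X, compute_uv=False)             # singular values of X
d = np.linalg.eigvalsh(X.T @ X / (n - 1))[::-1]    # eigenvalues of C, descending
print(np.allclose(d, s**2 / (n - 1)))              # d_i = s_i^2 / (n - 1)
```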
Partial Least Squares Regression
PLSR is another linear dimension reduction technique that
fulfills
Zm = Σ_{j=1}^p φj,m Xj
It differs from PCR in that not just structure in the
explanatory variables is captured, but also the relation
between the explanatory variables and the response variables.
The decomposition is such that most variation in Y is
extracted and explained by a latent structure of X
It can also work with a multivariate response matrix
The resulting Z1 . . . ZM are used with least squares to fit a
linear model
vs PCR & ridge: PLS can reduce bias, but may increase variance
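A minimal sketch of the first PLS direction (a NIPALS-style toy implementation, an assumption for illustration, not the lecture's code): for a single centered response, the first weight vector is proportional to X^T y, so the component is driven by covariance with the response, unlike the first principal component:

```python
import numpy as np

rng = np.random.default_rng(4)
n, p = 100, 5
X = rng.normal(size=(n, p))
X = X - X.mean(axis=0)
y = X[:, 0] + 0.1 * rng.normal(size=n)   # only variable 0 is informative
y = y - y.mean()

phi1 = X.T @ y
phi1 = phi1 / np.linalg.norm(phi1)   # normalized weights phi_{j,1}
Z1 = X @ phi1                        # first PLS component
theta1 = (Z1 @ y) / (Z1 @ Z1)        # least squares on the single component
# phi1 concentrates its weight on the informative variable
```

Further components would be built the same way after deflating X, but one component already shows the contrast with PCA, whose first direction ignores y entirely.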
High dimensionality
When p>n, a situation frequently encountered in modern
science
Least squares regression is not appropriate (no remaining
degrees of freedom)
Large danger of over-fitting
Cp , AIC and BIC are not appropriate (estimating the error
variance is not possible)
PLSR, PCR, forward step-wise regression, ridge and lasso are
appropriate
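A small illustration of the degrees-of-freedom problem (toy data, assumed for illustration): with p > n, least squares can fit even pure noise exactly, so the training error is zero and cannot be used to estimate the error variance.

```python
import numpy as np

rng = np.random.default_rng(5)
n, p = 10, 50                      # more features than observations
X = rng.normal(size=(n, p))
y = rng.normal(size=n)             # pure noise, unrelated to X

beta, *_ = np.linalg.lstsq(X, y, rcond=None)   # minimum-norm solution
print(np.allclose(X @ beta, y))                # perfect "fit" on training data
```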
Regressions in High Dimensions
The lasso with n = 100 observations and a varying number of
features p, of which 20 were associated with the response. Plots
show the test MSEs over the tuning parameter λ (reported as
degrees of freedom). The test MSE goes up as more features are
added. This is related to the “curse of dimensionality”.
Interpretation of regression in high dimensionality
In high-dimensional data sets many variables are highly
co-linear
Selected variables may not be the unique or even the best set
of predictors for the observed prediction performance
So even when we have a predictive model, we should not
overstate the results
The model found is not unique; it is one of many possible models
The problem of co-linearity
Co-linearity is a motivation for regularization
Even a small λ will stabilize the coefficient estimates in ridge
regression, also when p << n
When you have many co-linear variables, ridge and PCR will
use them all in a sensible way.
You might want “group” selection: select the whole predictive
set of correlated variables
Lasso will tend to do feature selection and select the variable
most strongly related to the response
Perhaps arbitrarily.
This can make it less robust.
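The contrast can be illustrated with ridge on two perfectly co-linear predictors (a toy sketch, assumed for illustration; the lasso would instead tend to pick just one of them):

```python
import numpy as np

rng = np.random.default_rng(6)
n, lam = 100, 1.0
x = rng.normal(size=n)
X = np.column_stack([x, x])        # two perfectly co-linear columns
y = 2 * x + 0.1 * rng.normal(size=n)

# Ridge has a unique solution despite the singular X^T X
beta = np.linalg.solve(X.T @ X + lam * np.eye(2), X.T @ y)
print(np.allclose(beta[0], beta[1]))   # the effect is split evenly
```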
High dimensionality in practice
What you should learn from this class
1 What is dimension reduction and what is its use.
2 What is the procedure and motivation for a PCR (and know
what PLSR is)
3 How does PCR compare to the LASSO and ridge
4 Considerations in high dimensions
To do:
Preparation for next week
Reading: chapter 7 through to 7.5
Send in any questions day before class
Exercises
Finish the labs of Chapter 6
and the exercise below.
Exercise
In this exercise we will analyze the gene expression data set
from Van de Vijver et al. (2002, N Engl J Med, 347). The
study analyzed gene expression in breast cancer tumors
genome-wide with DNA microarrays. The study compared the
gene expression signature of the tumors with the presence or
absence of distant metastasis (“DM” vs “NODM”). The idea
was to use the gene expression signature as a clinical tool to
decide if chemo- or hormone therapy would be beneficial.
For the exercises, load/install the glmnet library, with
library(glmnet) after install.packages(“glmnet”).
Exercise
1 Load the file “VIJVER.Rdata”.
Explore the dataset. How many variables? What do they
represent? How many samples? What do these samples
represent?
2 What challenges do you foresee in using gene expression for
the stated goal (predicting distant metastases)?
3 For a couple of genes, evaluate the association with the
phenotype. Do you see evidence of some predictive potential?
Test your intuition with a formal statistical test.
4 Demonstrate if co-linearity occurs between genes in this
dataset. Do you think this represents a challenge in the
analysis?
5 Use lasso, ridge and PCR methodology and make a predictor
based on the gene expression values. How many genes are
used for an optimal predictor? Evaluate the performance of
the predictors, and comment on what you find.
Pointers: For lasso/ridge use library(”glmnet”); in the glmnet functions alpha=1
corresponds to the lasso. Use the function predict to measure performance.