The Elements of Statistical Learning – Chapter 6
oxstar@SJTU
January 6, 2011
Ex. 6.2 Show that $\sum_{i=1}^N (x_i - x_0)\,l_i(x_0) = 0$ for local linear regression. Define $b_j(x_0) = \sum_{i=1}^N (x_i - x_0)^j l_i(x_0)$. Show that $b_0(x_0) = 1$ for local polynomial regression of any degree (including local constants). Show that $b_j(x_0) = 0$ for all $j \in \{1, 2, \ldots, k\}$ for local polynomial regression of degree $k$. What are the implications of this on the bias?
Proof For local linear regression the vector-valued function is $b(x)^T = (1, x)$, and $B$ is the $N \times 2$ regression matrix whose $i$th row is $b(x_i)^T$, i.e. $B = [\mathbf{1}, \mathbf{x}]$. Recall that $\hat f(x_0) = b(x_0)^T (B^T W(x_0) B)^{-1} B^T W(x_0)\, y = \sum_{i=1}^N l_i(x_0) y_i$, so $l(x_0)^T = b(x_0)^T (B^T W(x_0) B)^{-1} B^T W(x_0)$. Multiplying the identity $b(x_0)^T = b(x_0)^T (B^T W(x_0) B)^{-1} B^T W(x_0) B$ out column by column gives
$$(1, x_0) = b(x_0)^T (B^T W(x_0) B)^{-1} B^T W(x_0)\, [\mathbf{1}, \mathbf{x}],$$
i.e.
$$1 = b(x_0)^T (B^T W(x_0) B)^{-1} B^T W(x_0)\, \mathbf{1} = \sum_{i=1}^N l_i(x_0), \qquad x_0 = b(x_0)^T (B^T W(x_0) B)^{-1} B^T W(x_0)\, \mathbf{x} = \sum_{i=1}^N l_i(x_0)\, x_i. \tag{1}$$
Therefore
$$\sum_{i=1}^N (x_i - x_0)\, l_i(x_0) = \sum_{i=1}^N l_i(x_0)\, x_i - x_0 \sum_{i=1}^N l_i(x_0) = x_0 - x_0 \cdot 1 = 0.$$
From (1), we have
$$b_0(x_0) = \sum_{i=1}^N (x_i - x_0)^0\, l_i(x_0) = \sum_{i=1}^N l_i(x_0) = 1.$$
When $j \in \{1, 2, \ldots, k\}$ (local polynomial regression of degree $k$), the vector-valued function is $b(x)^T = (1, x, x^2, \ldots, x^k)$ and $B = [\mathbf{1}, \mathbf{x}, \mathbf{x}^2, \ldots, \mathbf{x}^k]$, where $\mathbf{x}^m$ denotes the vector with entries $x_i^m$.
From (1), we similarly have
$$x_0^j = \sum_{i=1}^N l_i(x_0)\, x_i^j. \tag{2}$$
Expanding $(x_i - x_0)^j$ without combining like terms, each of the $2^j$ resulting terms can be written as $(-1)^b x_i^a x_0^b$, where $a + b = j$. The number of positive terms equals the number of negative terms, i.e. $\sum_{n=1}^{2^j} (-1)^{b_n} = 0$ (this is just $(1-1)^j = 0$ for $j \geq 1$). So each term of $b_j(x_0)$ can be written as
$$\sum_{i=1}^N (-1)^b x_i^a x_0^b\, l_i(x_0) = (-1)^b x_0^b \sum_{i=1}^N l_i(x_0)\, x_i^a = (-1)^b x_0^b \cdot x_0^a = (-1)^b x_0^j \qquad \text{// (2)}$$
$$b_j(x_0) = \sum_{i=1}^N (x_i - x_0)^j\, l_i(x_0) = \sum_{n=1}^{2^j} (-1)^{b_n} x_0^j = x_0^j \sum_{n=1}^{2^j} (-1)^{b_n} = 0.$$
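As a quick numerical sanity check (not part of the original solution), the moment conditions $b_0(x_0) = 1$ and $b_j(x_0) = 0$ for $j = 1, \ldots, k$ can be verified by computing the equivalent-kernel weights $l(x_0) = W(x_0) B (B^T W(x_0) B)^{-1} b(x_0)$ directly. The sketch below assumes a Gaussian kernel and an arbitrary sample and bandwidth; up to floating-point error it prints 1, 0, 0, 0.

    import numpy as np

    def equivalent_kernel(x, x0, degree, bandwidth):
        """Weights l_i(x0) of a degree-`degree` local polynomial fit,
        l(x0) = W(x0) B (B^T W(x0) B)^{-1} b(x0), with a Gaussian kernel."""
        B = np.vander(x, degree + 1, increasing=True)          # i-th row is b(x_i)^T
        b0 = x0 ** np.arange(degree + 1)                       # b(x0)
        w = np.exp(-0.5 * ((x - x0) / bandwidth) ** 2)         # kernel weights, diagonal of W(x0)
        return w * (B @ np.linalg.solve(B.T @ (w[:, None] * B), b0))

    rng = np.random.default_rng(0)
    x = np.sort(rng.uniform(0.0, 1.0, 100))
    x0, k = 0.37, 3
    l = equivalent_kernel(x, x0, degree=k, bandwidth=0.15)
    for j in range(k + 1):
        print(j, np.sum((x - x0) ** j * l))                    # b_0 = 1, b_1 = ... = b_k = 0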
Hence, Taylor expanding $f(x_i)$ about $x_0$, the bias of a local polynomial fit of degree $k$ is
$$E\hat f(x_0) - f(x_0) = \sum_{i=1}^N l_i(x_0)\, f(x_i) - f(x_0)$$
$$= f(x_0) \sum_{i=1}^N l_i(x_0) - f(x_0) + f'(x_0) \sum_{i=1}^N (x_i - x_0)\, l_i(x_0) + c_2 f''(x_0) \sum_{i=1}^N (x_i - x_0)^2\, l_i(x_0) + \ldots + c_{k+1} f^{(k+1)}(x_0) \sum_{i=1}^N (x_i - x_0)^{k+1}\, l_i(x_0) + \ldots$$
$$= c_{k+1} f^{(k+1)}(x_0) \sum_{i=1}^N (x_i - x_0)^{k+1}\, l_i(x_0) + \ldots$$
where $c_n = 1/n!$ are the coefficients of the Taylor-series expansion. The zeroth-order terms cancel because $b_0(x_0) = 1$, and all terms of degree $1$ through $k$ vanish because $b_j(x_0) = 0$ for $j = 1, \ldots, k$.
Now we see that the bias depends only on the $(k+1)$st-degree and higher-order terms in the expansion of $f$.
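To illustrate the implication numerically (again outside the original solution), note that with noiseless responses $y_i = f(x_i)$ the fitted value satisfies $\hat f(x_0) = \sum_i l_i(x_0) f(x_i) = E\hat f(x_0)$, so $\hat f(x_0) - f(x_0)$ is exactly the bias. The sketch below uses an arbitrary smooth $f$, a Gaussian kernel, and a point near the boundary, where the bias typically shrinks markedly as the degree of the local polynomial grows.

    import numpy as np

    def local_poly_fit(x, y, x0, degree, h):
        """Degree-`degree` locally weighted polynomial fit evaluated at x0."""
        B = np.vander(x, degree + 1, increasing=True)
        w = np.exp(-0.5 * ((x - x0) / h) ** 2)                 # Gaussian kernel weights
        beta = np.linalg.solve(B.T @ (w[:, None] * B), B.T @ (w * y))
        return beta @ x0 ** np.arange(degree + 1)              # b(x0)^T beta

    f = lambda t: np.sin(4.0 * t)                              # an arbitrary smooth target
    rng = np.random.default_rng(1)
    x = np.sort(rng.uniform(0.0, 1.0, 200))
    x0 = 0.02                                                  # near the boundary, where bias is largest
    for d in range(3):                                         # local constant, linear, quadratic
        print(d, local_poly_fit(x, f(x), x0, d, h=0.2) - f(x0))   # exact bias (noiseless y)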
Ex. 6.3 Show that ||l(x)|| (Section 6.1.2) increases with the degree of the local polynomial.
Proof Preliminary: for a local polynomial of degree $d$, $B$ is the $N \times (d+1)$ regression matrix whose $j$th row is $b(x_j)^T = (1, x_j, x_j^2, \ldots, x_j^d)$; $B$ itself is not invertible, but $B^T B$ is (assuming $B$ has full column rank).
From
$$\hat f(x_j) = b(x_j)^T (B^T W(x_j) B)^{-1} B^T W(x_j)\, y = \sum_{i=1}^N l_i(x_j)\, y_i = l(x_j)^T y$$
we have
$$l(x_j)^T = b(x_j)^T (B^T W(x_j) B)^{-1} B^T W(x_j),$$
$$l(x_j) = W(x_j) B (B^T W(x_j) B)^{-1} b(x_j),$$
$$\|l(x_j)\|^2 = l(x_j)^T l(x_j) = b(x_j)^T (B^T W(x_j) B)^{-1} B^T W(x_j)^2 B (B^T W(x_j) B)^{-1} b(x_j).$$
To see how this quantity scales with the degree, consider the unweighted case $W(x_j) = I$; then
$$\|l(x_j)\|^2 = b(x_j)^T (B^T B)^{-1} B^T B (B^T B)^{-1} b(x_j) = b(x_j)^T (B^T B)^{-1} b(x_j),$$
and summing over the $N$ training points, using $\sum_{j=1}^N b(x_j) b(x_j)^T = B^T B$,
$$\|l(x)\|^2 := \sum_{j=1}^N \|l(x_j)\|^2 = \mathrm{trace}\big((B^T B)^{-1} B^T B\big) \qquad \text{// Prof. Zhang has proved it}$$
$$= \mathrm{trace}(I_{d+1}) = d + 1.$$
Hence $\|l(x)\| = \sqrt{d+1}$, which increases with the degree of the local polynomial.
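The same claim can be checked numerically at a single target point $x_0$ (the quantity Section 6.1.2 actually plots). Everything below is an arbitrary illustrative setup: a Gaussian kernel, a uniform sample, and a fixed bandwidth.

    import numpy as np

    def l_weights(x, x0, degree, h):
        """Equivalent-kernel weights l(x0) = W(x0) B (B^T W(x0) B)^{-1} b(x0)."""
        B = np.vander(x, degree + 1, increasing=True)
        b0 = x0 ** np.arange(degree + 1)
        w = np.exp(-0.5 * ((x - x0) / h) ** 2)
        return w * (B @ np.linalg.solve(B.T @ (w[:, None] * B), b0))

    rng = np.random.default_rng(2)
    x = np.sort(rng.uniform(0.0, 1.0, 150))
    x0 = 0.5
    for d in range(4):                                         # degrees 0..3
        print(d, np.linalg.norm(l_weights(x, x0, d, h=0.1)))   # ||l(x0)|| typically grows with d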
Ex. 6.4 Suppose that the $p$ predictors $X$ arise from sampling relatively smooth analog curves at $p$ uniformly spaced abscissa values. Denote by $\mathrm{Cov}(X|Y) = \Sigma$ the conditional covariance matrix of the predictors, and assume this does not change much with $Y$. Discuss the nature of the Mahalanobis choice $A = \Sigma^{-1}$ for the metric in (6.14). How does this compare with $A = I$? How might you construct a kernel $A$ that (a) downweights high-frequency components in the distance metric; (b) ignores them completely?
Answer $D = \sqrt{(x - x_0)^T \Sigma^{-1} (x - x_0)}$ is called the Mahalanobis distance of the point $x$ to $x_0$. It takes the correlations of the data set into account: if the predictors are highly correlated, the Mahalanobis distance is a much more accurate measure of closeness than the Euclidean distance.
If $A = I$, then $d = \sqrt{(x - x_0)^T (x - x_0)}$ is just the Euclidean distance from $x$ to $x_0$. Prior to smoothing, we should standardize each predictor, for example
$$x_i' = \frac{x_i - E(x_i)}{\sqrt{\mathrm{Var}(x_i)}}.$$
Comparing this with $\Sigma^{-1}$, for the standardized predictors we have
$$\mathrm{Cov}(x_i', x_j') = E[(x_i' - E(x_i'))(x_j' - E(x_j'))] = E(x_i' x_j') = E\!\left[\frac{x_i - E(x_i)}{\sqrt{\mathrm{Var}(x_i)}} \cdot \frac{x_j - E(x_j)}{\sqrt{\mathrm{Var}(x_j)}}\right] = \frac{\mathrm{Cov}(x_i, x_j)}{\sqrt{\mathrm{Var}(x_i)}\,\sqrt{\mathrm{Var}(x_j)}} = \rho(x_i, x_j).$$
Then the covariance matrix $\Sigma$ is replaced by its standardized version, the correlation matrix. Hence using $A = I$ amounts to assuming $\rho(x_i, x_j) = 0$ for all $i \neq j$, i.e., treating all dimensions of $x$ as uncorrelated.
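A small illustration with synthetic correlated predictors (everything below is an arbitrary example, not from the text): after standardizing, the sample covariance is indeed the correlation matrix, and the Mahalanobis metric $A = \Sigma^{-1}$ treats a displacement that fights the correlation as much larger than the Euclidean metric $A = I$ does.

    import numpy as np

    rng = np.random.default_rng(3)
    Sigma = np.array([[1.0, 0.9],
                      [0.9, 1.0]])                             # strongly correlated pair
    X = rng.multivariate_normal([0.0, 0.0], Sigma, size=5000)

    Xs = (X - X.mean(axis=0)) / X.std(axis=0)                  # standardized predictors
    print(np.cov(Xs, rowvar=False))                            # ~ correlation matrix of X
    print(np.corrcoef(X, rowvar=False))

    A = np.linalg.inv(np.cov(X, rowvar=False))                 # Mahalanobis choice

    def dist(u, v, A):
        d = u - v
        return np.sqrt(d @ A @ d)

    x0 = np.zeros(2)
    along = np.array([1.0, 1.0])                               # along the correlation
    against = np.array([1.0, -1.0])                            # against the correlation
    print(dist(along, x0, np.eye(2)), dist(against, x0, np.eye(2)))   # equal Euclidean distances
    print(dist(along, x0, A), dist(against, x0, A))                   # Mahalanobis: 'against' is much farther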
(a) To construct a kernel $A$ that downweights the high-frequency components in the distance metric, we can shrink the weight that $A$ places on those components (for instance, by decreasing the corresponding $\mathrm{Cov}(x_i, x_j)$ or $\rho(x_i, x_j)$ entries) in order to suppress their influence.
(b) To construct a kernel $A$ that ignores them completely, we set those weights to zero; one concrete construction is sketched below.
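One concrete way to build such an $A$, offered only as a sketch and not spelled out above: since the predictors sample a curve at $p$ uniform points, take a discrete cosine basis as the notion of "frequency", write $A = U^T \mathrm{diag}(w)\, U$ where row $k$ of $U$ is the frequency-$k$ basis vector, and (a) let $w_k$ decay with $k$ to downweight high frequencies, or (b) set $w_k = 0$ above a cutoff so the metric ignores them entirely (making $A$ rank-deficient). The basis choice, weights, and cutoff below are all illustrative assumptions.

    import numpy as np

    def dct_basis(p):
        """Orthonormal DCT-II basis; row k is the frequency-k pattern."""
        n = np.arange(p)
        U = np.cos(np.pi * (n[None, :] + 0.5) * n[:, None] / p)
        U[0] /= np.sqrt(2.0)
        return U * np.sqrt(2.0 / p)

    p = 64
    U = dct_basis(p)                                           # U @ U.T ~ I
    freq = np.arange(p)

    w_down = 1.0 / (1.0 + (freq / 10.0) ** 2)                  # (a) smoothly decaying weights
    A_down = U.T @ np.diag(w_down) @ U

    w_cut = (freq < 10).astype(float)                          # (b) zero weight above a cutoff
    A_cut = U.T @ np.diag(w_cut) @ U

    def dist(x, x0, A):
        v = x - x0
        return np.sqrt(v @ A @ v)

    x0 = np.zeros(p)
    hi = U[50]                                                 # a pure high-frequency direction (unit norm)
    print(dist(hi, x0, np.eye(p)))                             # 1.0 under A = I
    print(dist(hi, x0, A_down))                                # much smaller under (a)
    print(dist(hi, x0, A_cut))                                 # 0 under (b)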