Technical University of Denmark
02450 Introduction to Machine Learning
and Data Mining
Project 2
s191985 Julian Böhm (Regression, part a)
s196119 Emil Chrisander (Classification)
s192184 Jorge Montalvo Arvizu (Regression, part b)
November 2019
1 Regression, part a (s191985)
This section discusses the linear regression analysis of the South African heart disease data set. First, we have to define which variable is to be predicted. In the case of a logistic regression we would choose the binary variable chd (heart disease or no heart disease) as the dependent variable, because in the overall problem we are interested in how the other variables affect this state. However, the task requires us to solve the problem with a linear regression model. Therefore, we have to pick a continuous variable, and we decided on ldl, the low-density lipoprotein cholesterol (often referred to as the 'bad' cholesterol). We chose ldl because it showed the highest correlation with chd in our previous report (together with age, adiposity, and tobacco consumption). Moreover, we expect high ldl levels to be connected to obesity and adiposity. All of the remaining nine variables are included in the model as independent variables.
To execute the regression analysis, we had to apply some feature transformations to our data set. The variables chd and famhist (given as categories) had to be binarized; a one-out-of-K coding was not necessary for our data frame. Furthermore, the data was standardized by subtracting the mean and dividing by the standard deviation, which results in a mean of 0 and a standard deviation of 1 for each column.
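For concreteness, a minimal sketch of these transformations; the file name and the "Present" label for famhist are assumptions based on the common distribution of the data set, not necessarily our exact preprocessing script:

```python
# Sketch of the preprocessing described above (file name is hypothetical).
import pandas as pd

df = pd.read_csv("SAheart.csv")
df["famhist"] = (df["famhist"] == "Present").astype(int)  # binarize famhist

X = df.drop(columns=["ldl"]).to_numpy(dtype=float)  # nine predictors, incl. chd
y = df["ldl"].to_numpy(dtype=float)                 # target: ldl

# Standardize each column: subtract the mean and divide by the standard
# deviation, giving mean 0 and standard deviation 1 per column.
X = (X - X.mean(axis=0)) / X.std(axis=0)
```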
Linear regression model¹:

$y_i = f(x_i, w) = \tilde{x}_i^T w$  (1)

$E(w) = \|y - \tilde{X}w\|^2$
Based on these formulas, one can estimate the optimal weights $w^*$ by minimizing the error $E(w)$. Computationally, this is achieved with the least squares regression model (lm.LinearRegression()).
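A sketch of this step, reusing the standardized X and y from the previous snippet and the lm alias quoted above (sklearn.linear_model); the surrounding plotting code is omitted:

```python
# Ordinary least squares fit of ldl on the remaining attributes.
import sklearn.linear_model as lm

# X, y: standardized predictors and ldl target from the previous sketch.
model = lm.LinearRegression().fit(X, y)
y_est = model.predict(X)   # estimated ldl (compared to true ldl in Figure 1)
residuals = y - y_est      # plotted in the residual graph of Figure 1
```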
Figure 1 shows the true ldl and the estimated ldl. From visual inspection, it can be seen that the model does not capture the extreme values of the true ldl and 'compresses' the scale of the estimated ldl. Since these extreme values are only a few cases, we conclude that the model fits rather well. Additionally, the lower graph shows the residuals, which lie within a rather narrow range of approximately -4 to 3.
Following, we introduce a regularization parameter λ (equation 14.3) to the model:

$E_\lambda(w, w_0) = \|y - w_0\mathbf{1} - \tilde{X}w\|^2 + \lambda\|w\|^2, \quad \lambda \geq 0$  (2)
For the model, we let λ range from $10^{-5}$ to $10^{11}$ and observed an optimal λ of $10^2 = 100$. In Figure 2, it can be seen that the generalization error reaches its minimum at λ = 100 and increases for higher λ values.
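The sweep can be sketched as follows; ridge regression and a 17-point log-spaced grid are assumptions here, since the report's exact grid resolution and CV scheme are not reproduced:

```python
# Hedged sketch of the λ sweep: ridge regression evaluated over a log-spaced
# grid from 1e-5 to 1e11, scored with 10-fold cross-validation.
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.model_selection import cross_val_score

# X, y: standardized predictors and ldl target from the earlier sketch.
lambdas = np.logspace(-5, 11, 17)
gen_errors = []
for lam in lambdas:
    scores = cross_val_score(Ridge(alpha=lam), X, y,
                             scoring="neg_mean_squared_error", cv=10)
    gen_errors.append(-scores.mean())
best_lambda = lambdas[int(np.argmin(gen_errors))]  # ≈ 1e2 in our runs
```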
Table 1 lists the weights in the last fold. The highest values are obtained for the variables adiposity (0.46), chd (0.26), and obesity (0.23).
¹ Equations 8.3 and 14.1; Introduction to Machine Learning and Data Mining, 2019; Tue Herlau, Mikkel N. Schmidt, Morten Mørup.
Figure 1: Residuals
This supports our assumption that ldl is connected to adiposity and obesity. In addition, chd appears to have a considerable effect on the level of ldl.
Table 1: Overview of weights w* in the last fold (K = 10)

Variable            Weight
Offset              4.75
sbp                 0.00
tobacco             0.04
chd                 0.26
adiposity           0.46
typea               0.08
obesity             0.23
alcohol             -0.13
age                 0.13
famhist (present)   0.12
Figure 2: Mean coefficient values and squared errors over different regularization
factors
2 Regression, part b (s192184)
2.1 Parameters and Models
In this section, we selected the same target variable for the regression as in Section 1, i.e. the continuous variable ldl, and again used the attributes normalized by subtracting the mean and dividing by the standard deviation. We then focused on comparing the following three models: the regularized linear regression model from the previous section, an artificial neural network (ANN), and a baseline.
To compare these models, we implemented two-level cross-validation with K1 = K2 = 10; given the excessively long computing times required to tune the ANN over the number of hidden units h, we first tried with K1 = K2 = 5. We adjusted two parameters to keep computing times low while still obtaining a feasible ANN model: the maximum number of iterations and the number of hidden units. Initially, the maximum iteration parameter was set between 10,000 and 50,000, so that we could quickly compute the optimal number of hidden units, check the number of iterations necessary for convergence, and then tweak the model for additional robustness. During this initial test phase, we saw that 1) the number of iterations needed to converge was between 30,000 and 50,000, and 2) increasing the complexity of the model (by increasing the number of hidden units h) increased the error.
Therefore, after the initial test runs, we used the following range of h with a maximum iteration value of 50,000:
h ∈ [1 : 5] (3)
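A minimal PyTorch sketch of such a network with h hidden units in a single hidden layer; the activation function, learning rate, and optimizer are assumptions, not the exact settings we ran:

```python
# Sketch of the ANN regressor: one hidden layer with h units, trained for up
# to 50,000 iterations by full-batch gradient descent on the squared loss.
import torch

def train_ann(X_train, y_train, h, max_iter=50_000, lr=1e-3):
    net = torch.nn.Sequential(
        torch.nn.Linear(X_train.shape[1], h),  # input -> h hidden units
        torch.nn.Tanh(),
        torch.nn.Linear(h, 1),                 # hidden -> ldl estimate
    )
    loss_fn = torch.nn.MSELoss()
    optimizer = torch.optim.Adam(net.parameters(), lr=lr)
    X_t = torch.tensor(X_train, dtype=torch.float32)
    y_t = torch.tensor(y_train, dtype=torch.float32).reshape(-1, 1)
    for _ in range(max_iter):
        optimizer.zero_grad()
        loss = loss_fn(net(X_t), y_t)
        loss.backward()
        optimizer.step()
    return net
```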
For λ we used an interval close to $10^2$, as we expected the optimal value to be around $10^2$ based on the analysis in the previous section:

$\lambda \in [10 : 200]$  (4)
Furthermore, the error measure we used for the regression is the squared loss
per observation:
$E = \frac{1}{N^{\text{test}}} \sum_{i=1}^{N^{\text{test}}} (y_i - \hat{y}_i)^2$  (5)
Finally, as required by the project description, the baseline model was a simple linear regression with no features, i.e. the mean of y on the training data was used to predict y on the test data.
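As a sketch, the baseline reduces to:

```python
import numpy as np

def baseline_predict(y_train, X_test):
    # Featureless "regression": every test observation gets the training mean.
    return np.full(len(X_test), y_train.mean())
```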
2.2 Results and Comparison
After running the models with the parameters from the previous subsection, we obtained the results shown in Table 2. We also added a final row with the estimated generalization error of each model, calculated as the weighted average of equation (8) explained in Section 3.3.
Table 2: Summary of two-level 10-fold CV for regression

Outer fold           ANN               Linear regression     Baseline
i     data size      h*_i  E_i^test    λ*_i   E_i^test       E_i^test
1     47             2     403.2       80     383.1          450.4
2     47             2     532.0       80     435.6          527.1
3     46             1     449.3       100    373.3          385.7
4     46             1     351.9       100    311.6          385.9
5     46             1     345.8       100    290.3          399.1
6     46             1     295.5       100    267.0          357.8
7     46             1     280.3       100    263.4          353.2
8     46             1     576.3       100    499.7          572.4
9     46             1     375.0       100    387.5          498.1
10    46             1     271.4       100    206.7          355.7
Ê^test                     388.4              342.1          428.8
The results show that, in terms of estimated generalization error, the best-performing model was the regularized linear regression (342.1), followed by the ANN (388.4), with the baseline last (428.8). The baseline's last place was expected, since it is a very simple model that only computes the average of the training data. What is interesting is that the test errors of each model vary between folds, and that the optimal regularization parameters h and λ in outer folds 1 and 2 differ, with λ being less strict in those folds compared to the results of the previous section. A possible explanation for this difference in the linear regression model is that the observations in outer folds 1 and 2 have higher variance and lower bias, contrary to the slightly lower variance and slightly higher bias of the observations in the other outer folds. Therefore, a more flexible regularization parameter works better for the first two outer folds. However, as the results of the other eight folds show, the optimal regularization parameter for the whole model can be argued to be $10^2$, as in the previous section.
2.3 Statistical test
Given the "close" results of the three models, we are interested in answering the question: which of the three models is the best one, and how do they compare to each other? We attempted to answer this question by performing a statistical test (setup II) on the results of the previous subsection. We used the more robust setup II so that our statistical estimates account for variability from different training sets, i.e. we used the method laid out in section 11.4².
Table 3: Summary of setup II statistical test for regression

H0                                       p-value   Lower CI   Upper CI   Conclusion
E^test_baseline - E^test_linear = 0      0.001     0.491      1.241      H0 rejected
E^test_ANN - E^test_linear = 0           0.689     -0.070     0.101      H0 not rejected
E^test_baseline - E^test_ANN = 0         0.002     0.435      1.265      H0 rejected
For the statistical test, we used J = K, obtaining J train/test splits of the data set, and estimated the generalization error of the three models while accounting for the randomness of the splits. The results are shown in Table 3. We can see that the p-values for comparing the baseline model against both the ANN and the linear regression model are below 0.01-0.02, so we can conclude with statistical significance that the errors of these models are not the same. However, for the comparison between the ANN model and the linear regression model we cannot say the same.
In conclusion, if we were to use our models on new data, we would not use the baseline model, because both the linear regression model and the ANN model outperform it. Between these two, we cannot conclude that one is better than the other; we can only say that both perform better than the baseline. As a recommendation, we would look further into the tuning parameters of the ANN, since the available computing power was a restriction; the ANN might have outperformed the linear regression model if we could have tweaked other parameters and not only the number of hidden units h.
3 Classification (s196119)
3.1 Our setting
In this section we perform a classification of the variable chd. Recall that chd is the binary variable which indicates whether a person is diagnosed with a heart disease. Hence, our classification task is to predict whether a person has a heart disease conditional on the nine attributes. To avoid issues from differences in scale and variation, we normalize our nine attributes prior to the classification analysis. Famhist, a binary categorical variable, is transformed into an indicator variable using the one-hot encoding principle. Although our data set is imbalanced (we have more non-diagnosed than diagnosed persons), we do not address this problem, since we are not told explicitly how to deal with this issue in the project 2 task description.
² Introduction to Machine Learning and Data Mining, 2019; Tue Herlau, Mikkel N. Schmidt, Morten Mørup.
3.2 Choice of models
We decided to apply KNN as our second model. We did in fact implement an ANN in the two-level K-fold cross-validation, but sadly our laptops do not have a GPU, which resulted in infeasibly long computing times for the hyperparameter tuning task of the ANN³. Hence, for pragmatic reasons we chose KNN as our second model. After a series of trial runs we decided to let the trial space of $K_{KNN}$, the complexity-controlling parameter, be the interval:
$K_{KNN} \in [1 : 50]$  (6)
Our optimal $K_{KNN}$ was always inside this interval during the trial runs, and the test error grew large whenever $K_{KNN}$ exceeded 40. Hence, we do not expect the global minimum of the test error to lie above 50. We tried every K within the interval.
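A sketch of this inner-fold sweep, assuming scikit-learn's KNeighborsClassifier (the train/validation split comes from the inner CV loop):

```python
# Evaluate every K in 1..50 on the validation split; keep the lowest error.
import numpy as np
from sklearn.neighbors import KNeighborsClassifier

def select_k(X_train, y_train, X_val, y_val, k_max=50):
    errors = []
    for k in range(1, k_max + 1):
        knn = KNeighborsClassifier(n_neighbors=k).fit(X_train, y_train)
        errors.append(np.mean(knn.predict(X_val) != y_val))
    return int(np.argmin(errors)) + 1  # best K
```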
For λ, the regularization parameter of the logistic regression model, we decided to let the trial interval be:
$\lambda \in [10^{-1} : 10^3]$  (7)
We initially tried a very large interval in our trial CV run and realized that outside the interval stated above there were never any candidates for the global minimum. Therefore, we reduced the range to an interval that always included the candidate for the global minimum. Moreover, we decided to use 30 steps within the interval. Finally, we applied the L2 penalty term and the liblinear solver option.
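A sketch of the model construction; note that scikit-learn parameterizes regularization as C = 1/λ, so the 30-step grid over λ is inverted (the exact grid spacing we used is not reproduced here):

```python
# One logistic regression candidate per λ value, with L2 penalty and the
# liblinear solver as stated above.
import numpy as np
from sklearn.linear_model import LogisticRegression

lambdas = np.logspace(-1, 3, 30)
models = [LogisticRegression(penalty="l2", solver="liblinear", C=1 / lam)
          for lam in lambdas]
```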
The baseline model does not have a regularization parameter, as it always picks the majority class. We now proceed to the results of the classification task.
3.3 Results of classification
Table 4 shows the results of our two-level 10-fold cross-validation classification. We have added a final row with the estimated generalization error of the KNN, logistic regression, and baseline models. The estimated generalization error is calculated as:
$\hat{E}^{\text{gen}} = \sum_{i=1}^{10} \frac{|\mathcal{D}_i^{\text{test}}|}{N} E_i^{\text{test}}$  (8)
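As code, equation (8) is simply a weighted average of the outer-fold test errors, weighted by the relative size of each test split:

```python
import numpy as np

def gen_error(test_errors, test_sizes):
    # Weight each outer-fold error E_i^test by |D_i^test| / N.
    test_errors = np.asarray(test_errors, dtype=float)
    weights = np.asarray(test_sizes, dtype=float)
    return float((weights / weights.sum()) @ test_errors)

# e.g. gen_error(knn_errors, [47, 47, 46, ..., 46]) should reproduce the
# final row of Table 4.
```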
We applied the most frequently selected model to estimate the generalization error, i.e. 31 nearest neighbours for the KNN method. Our best-performing classification model is the logistic regression, with an estimated generalization error of 26.4 percent. Our second-best performer is the KNN model with a generalization error of 28.4 percent. Finally, and unsurprisingly, the baseline model is the worst performer with a generalization error of 34.6 percent.
³ Selecting the optimal number of hidden units in the first hidden layer.
Recall that 34.6 percent of the persons in the sample are diagnosed with a heart disease. Hence, it is not very surprising that our baseline model on average guesses incorrectly 34.6 percent of the time. Despite the fact that the logistic model has the lowest estimated generalization error, we cannot conclude that it is in fact a better model for our classification challenge; this is a consequence of fundamental statistical uncertainty. In the next subsection we perform a setup II statistical test of the hypothesis that the logistic regression is a better classifier within a reasonable statistical uncertainty. Before we do so, we would like to add a few more words on Table 4. As one can see from the table, the estimated test errors differ quite a bit between the outer folds. We expect this to be a direct consequence of the relatively small data size in each outer fold: the estimated test error is sensitive to incorrect predictions, so a few additional incorrect predictions have a large impact on the test error within each outer fold. Finally, we see that the KNN and the logistic regression are fairly consistent about their optimal choice of regularization parameter. This gives us confidence that the selected regularization parameter is in fact the globally optimal choice. We now proceed to the statistical test subsection.
Table 4: Summary of two-level 10-fold CV for classification (error rates in percent)

Outer fold           KNN               Logistic regression   Baseline
i     data size      K*_i  E_i^test    λ*_i   E_i^test       E_i^test
1     47             23    23.4        22.1   23.4           36.2
2     47             23    25.5        22.1   21.3           27.7
3     46             31    32.6        11.7   34.8           41.3
4     46             31    19.6        11.7   19.6           30.4
5     46             31    23.9        11.7   26.1           28.3
6     46             31    39.1        11.7   37.0           32.6
7     46             31    26.1        11.7   26.1           32.6
8     46             31    30.4        11.7   23.9           34.8
9     46             31    30.4        11.7   26.1           47.8
10    46             31    32.6        11.7   26.1           34.8
Ê^test                     28.4               26.4           34.6
3.4 Statistical test (Setup II)
We decided to perform a setup II test because we are interested in evaluating how well we can expect our models to perform on an unknown data set generated from the same population. In other words: should we expect the logistic model to outperform the KNN model on a new data set on heart disease from South African villages?
To compute the statistical test we follow the approach outlined in method box 11.4.1 (correlated t-test for cross-validation) in the textbook. We compute a CV split separate from the CV that led to our estimates in Table 4, because we want to avoid performing a statistical test on the same data that we used for model selection. By computing a new random CV we ensure that the CV split for the statistical setup II is independent of the CV used for model selection. We use the most frequently occurring regularization parameter for each model, and applied J = 10 outer folds for the statistical test.
The results of the statistical test are stated in Table 5. As one can see from the table, we reject (at a five percent significance level) that the baseline model has the same generalization error as the KNN and the logistic regression. However, we cannot reject that the generalization errors of the KNN and the logistic regression are the same in the population. Thus, our conclusion is that if we were given the task of predicting heart disease on a new data set from South African villages, it would be a better approach to use KNN or a logistic regression than a simple baseline model.
Table 5: Summary of setup II statistical test for classification

H0                                       p-value   Lower CI   Upper CI   Conclusion
E^test_baseline - E^test_logistic = 0    0.004     0.033      0.123      H0 rejected
E^test_KNN - E^test_logistic = 0         0.559     -0.041     0.071      H0 not rejected
E^test_baseline - E^test_KNN = 0         0.031     0.007      0.118      H0 rejected
4 Discussion
4.1 What did we learn?
From the regression part, we learned that the regularization parameter λ is important for the behaviour of the model: by keeping the complexity of the model at an optimal point, the error we obtained was lower. This can also be seen in the ANN model, where the optimal number of hidden units was always only 1 or 2; by keeping the model simple we obtained good results. From the statistical tests, we learned that it is important to carry them out, since it would be easy to look at the generalization error alone and argue that one model is better than another. In reality, we would like to further test our models on new, previously unseen data and keep testing and improving them. On a side note, an important constraint during this project was the lack of a GPU and processing power to quickly train the ANN model; the time constraint was always an issue, and we had to be very selective about the tests we wanted to run when choosing the final parameters of our models.
4.2 How can we relate to research performed on the same data set?
Recall that the original paper by Rousseauw et al. (1984) makes use of a much more detailed data set than ours. It primarily investigates the relationship between chest pain, gender, and chd. We do not have access to the chest pain or gender attributes, which makes it difficult to compare our results with their work. Nevertheless, we argue that our classification results show that it would be possible to perform a fairly reliable screening for heart disease based on our nine attributes. However, more work would have to be done before it could be implemented as a health policy tool. Firstly, more data from new persons would have to be collected to investigate whether the accuracy could be improved by leveraging more data. Secondly, a wider range of models would have to be applied and tested to explore potential accuracy improvements. Thirdly, we would have to address the class imbalance problem. Finally, we would also have to look at other score metrics, such as precision, recall, and AUC, to better understand where our classification is good (and less good); for a screening device we would ideally want to eliminate false negatives.