CSE203 Computer Architecture
13. Additional Materials for
Final Exam (incl. HW2 Sol)
Prof. Seong Tae Kim (st.kim@khu.ac.kr)
Augmented Intelligence Lab. (ailab.khu.ac.kr)
School of Computing, Kyung Hee University
Assignment 1
• Suppose we have a processor with a base CPI of 1.0, assuming all references hit in the primary cache, and a clock rate of 1 GHz. Please assume a main memory access time of 100 ns, including all the miss handling.
a) Suppose the miss rate per instruction at the primary cache is 4%. Please calculate the effective CPI with one level of caching.
The miss penalty to main memory is
100 ns / (1 ns per clock cycle) = 100 clock cycles
The effective CPI with one level of caching is given by
Total CPI = Base CPI + Memory-stall cycles per instruction
= 1.0 + 4% × 100 = 5
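As a quick cross-check, a minimal Python sketch of the same calculation (variable names are illustrative only):

```python
# Effective CPI with one level of caching.
clock_ns = 1.0                       # 1 GHz clock -> 1 ns per cycle
mem_access_ns = 100.0                # main memory access time, incl. miss handling

base_cpi = 1.0
misses_per_instr = 0.04              # 4% misses per instruction in the primary cache
miss_penalty = mem_access_ns / clock_ns   # 100 clock cycles

total_cpi = base_cpi + misses_per_instr * miss_penalty
print(total_cpi)                     # 5.0
```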
Assignment 1
• Suppose we have a processor with a base CPI of 1.0, assuming all references hit in the primary cache, and a clock rate of 1 GHz. Please assume a main memory access time of 100 ns, including all the miss handling.
b) To make the processor faster, we add a secondary cache that has a 10 ns access time for either a hit or a miss and is large enough to reduce the miss rate to main memory to 1%. Please calculate the effective CPI with two levels of caching:
With two levels of caching, a miss in the primary cache can be satisfied either by the secondary cache
or by main memory. The miss penalty for an access to the second-level cache is
10 ns / (1 ns per clock cycle) = 10 clock cycles
Total CPI = Base CPI + Primary stalls per instruction + Secondary stalls per instruction
= 1.0 + 4% × 10 + 1% × 100 = 2.4
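The two-level case follows the same pattern; a minimal Python sketch under the same assumptions (illustrative names):

```python
# Effective CPI with two levels of caching.
clock_ns = 1.0                       # 1 GHz clock -> 1 ns per cycle
l2_access_ns = 10.0                  # secondary cache access time (hit or miss)
mem_access_ns = 100.0                # main memory access time

base_cpi = 1.0
l1_misses_per_instr = 0.04           # primary-cache misses per instruction
mem_misses_per_instr = 0.01          # misses per instruction that reach memory

l2_penalty = l2_access_ns / clock_ns     # 10 clock cycles
mem_penalty = mem_access_ns / clock_ns   # 100 clock cycles

total_cpi = base_cpi + l1_misses_per_instr * l2_penalty + mem_misses_per_instr * mem_penalty
print(total_cpi)                     # ≈ 2.4
```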
Assignment 2
• Assume a pipelined processor (4-stage pipeline: instruction fetch / decode / execution / write back) that has two functional units (i.e., a 3-cycle adder and a 5-cycle multiplier). There is an instruction sequence as follows:
ADD R3 ← R1, R2
ADD R5 ← R3, R4
MUL R7 ← R2, R6
ADD R10 ← R8, R9
MUL R11 ← R7, R10
MUL R5 ← R5, R11
a) Please calculate the number of cycles to complete the instruction sequence in a non-pipelined machine: 3 × 6 + 3 × 8 = 42 cycles
b) Please calculate the number of cycles to complete the instruction sequence with in-order dispatch without forwarding: 29 cycles
c) Please calculate the number of cycles to complete the instruction sequence with in-order dispatch with forwarding: 22 cycles
d) Please calculate the number of cycles to complete the instruction sequence with out-of-order dispatch with forwarding: 20 cycles
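For part a), a minimal Python sketch of the count, assuming each instruction in the non-pipelined machine occupies fetch, decode, execute (3 cycles for ADD, 5 for MUL), and write back strictly one after another:

```python
# Non-pipelined cycle count for the six-instruction sequence (part a).
# Per instruction: 1 (fetch) + 1 (decode) + execute latency + 1 (write back).
EXEC_LATENCY = {"ADD": 3, "MUL": 5}
sequence = ["ADD", "ADD", "MUL", "ADD", "MUL", "MUL"]

total_cycles = sum(1 + 1 + EXEC_LATENCY[op] + 1 for op in sequence)
print(total_cycles)   # 3*6 + 3*8 = 42
```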
Assignment 2
Pipeline timing diagrams (cycles 1-29) for cases b) in-order dispatch without forwarding, c) in-order dispatch with forwarding, and d) out-of-order dispatch with forwarding: each row traces one instruction through F (fetch), D (decode/dispatch, repeated while the instruction stalls), the numbered execute cycles of the 3-cycle adder or 5-cycle multiplier, and W (write back).
Assignment 3
Consider a 16-way set-associative cache:
- Data words are 1 byte long
- Byte addressing (one address defines a byte)
- The cache holds 256 KB of data
- Each block holds 32 data words
- Physical addresses are 32 bits long
How many bits of tag, index, and offset are needed to support references to this cache?
Block offset: 5 bits (because there are 32 bytes in each block)
# of blocks in the cache: 256 KB / 32 B = 8K
# of sets: 8K blocks / 16 ways = 512
Index: 9 bits (2^9 = 512)
Tag: 32 - 5 - 9 = 18 bits
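A minimal Python sketch of the bit-width calculation (parameter names are illustrative):

```python
# Tag/index/offset widths for a 256 KB, 16-way set-associative cache
# with 32-byte blocks and 32-bit physical addresses.
from math import log2

addr_bits   = 32
cache_bytes = 256 * 1024        # 256 KB of data
block_bytes = 32                # 32 one-byte words per block
ways        = 16

offset_bits = int(log2(block_bytes))                 # 5
num_blocks  = cache_bytes // block_bytes             # 8K blocks
num_sets    = num_blocks // ways                     # 512 sets
index_bits  = int(log2(num_sets))                    # 9
tag_bits    = addr_bits - index_bits - offset_bits   # 18

print(offset_bits, index_bits, tag_bits)             # 5 9 18
```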
Assignment 4
Assume a superscalar processor with the following specifications:
• Issue width: 4 instructions per cycle
• Functional units: 2 ALUs, 1 Load/Store unit, 1 Branch unit
You are given the following instruction stream:
I1: R1 = R2 + R3 (ALU)
I2: R4 = MEM[R1] (Load)
I3: R5 = R1 + R6 (ALU)
I4: if (R5 == 0) jump (Branch)
I5: R7 = R8 + R9 (ALU)
(a) Assuming no data hazards and ideal conditions, how many cycles will it take to issue all 5 instructions?
• Cycle 1: Issue I1 (ALU), I2 (Load), I4 (Branch), I5 (ALU) → 4 instructions issued
• Cycle 2: Issue I3 (ALU)
• Total: 2 cycles needed to issue all 5 instructions.
(b) If true data dependencies and structural hazards are considered, what kind of scheduling or reordering might help improve performance?
• I3 depends on the result of I1 (R1), creating a data hazard.
• I4 depends on I3, adding another hazard.
• I2 and I3 both use R1, so a load-use hazard may delay I3.
✓ Out-of-order execution can help issue I5 earlier while waiting on dependencies.
✓ Dynamic scheduling (e.g., Tomasulo's algorithm) can resolve hazards at runtime.
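A minimal Python sketch of part (a): a greedy issue loop under the stated issue width and functional-unit mix, ignoring data hazards as the question allows. The particular instructions grouped into each cycle may differ from the grouping shown above, but the cycle count is the same.

```python
# Count the cycles needed to issue all five instructions with issue width 4
# and a unit mix of 2 ALUs, 1 load/store unit, and 1 branch unit.
FU_COUNT = {"ALU": 2, "LS": 1, "BR": 1}
ISSUE_WIDTH = 4
program = ["ALU", "LS", "ALU", "BR", "ALU"]   # I1..I5 by required unit type

cycles = 0
remaining = list(range(len(program)))         # indices of not-yet-issued instructions
while remaining:
    cycles += 1
    free = dict(FU_COUNT)                     # functional units free this cycle
    issued = []
    for i in remaining:
        if len(issued) == ISSUE_WIDTH:
            break
        unit = program[i]
        if free[unit] > 0:                    # issue only if a matching unit is free
            free[unit] -= 1
            issued.append(i)
    remaining = [i for i in remaining if i not in issued]

print(cycles)                                 # 2
```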
*Example Assignment
• Please assume that you have 4 GB of main memory at your disposal.
• 512 MB of the 4 GB has been reserved for process page table storage.
• Each page table entry consists of:
  • A physical frame number
  • 1 valid bit
  • 1 dirty bit
  • 1 LRU status bit
• Virtual addresses are 32 bits
• Physical addresses are 29 bits
• The page size is 16 KB
How many process page tables can fit in the 512 MB space?

We can first calculate the number of page table entries associated with each process.
- 16 KB pages imply that the offset of a VA/PA is 14 bits (2^14 = 16 KB)
- Thus, the remaining 18 bits of the VA represent the VPN and serve as the index into the page table
- Thus, each page table has 2^18 entries
Next, each PA is 29 bits:
- 14 bits of the PA come from the offset; the other 15 come from a PT lookup
- Given that each PT entry consists of a PFN, a valid bit, a dirty bit, and an LRU bit, each PT entry will be 2 bytes (16 bits)
Therefore:
Size of one page table = 2^18 entries × 2 bytes = 2^19 bytes (512 KB)
Maximum # of page tables in 512 MB = 2^29 B / 2^19 B = 2^10 = 1,024
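A minimal Python sketch of the sizing arithmetic, following the slide's assumption that each page table entry is stored in 2 bytes:

```python
# Page-table sizing: how many per-process page tables fit in the reserved space.
page_bytes  = 16 * 1024                        # 16 KB pages
va_bits     = 32
offset_bits = page_bytes.bit_length() - 1      # 14 (2^14 = 16 KB)
vpn_bits    = va_bits - offset_bits            # 18 -> 2^18 page table entries

pte_bytes   = 2                                # PFN + valid + dirty + LRU, per the slide
pt_bytes    = (1 << vpn_bits) * pte_bytes      # 2^19 bytes = 512 KB per page table

reserved_bytes = 512 * 1024 * 1024             # 512 MB reserved for page tables
print(pt_bytes // 1024)                        # 512 (KB per page table)
print(reserved_bytes // pt_bytes)              # 1024 page tables
```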
*Example Assignment
• What is the average memory access time when you have the following memory hierarchy? Assume that (i) the cache uses physical addresses, (ii) the CPU stalls until the data is delivered, (iii) everything fits in memory (i.e., no page faults), and (iv) the hardware performs the page table access and updates the TLB on a TLB miss.
Unit                            | Additional Access Latency | Local Hit Rate
TLB                             | 1 cycle                   | 95%
L1                              | 1 cycle                   | 90%
L2                              | 10 cycles                 | 95%
L3                              | 50 cycles                 | 98%
Memory                          | 100 cycles                | 100%
Page table access & TLB update  | 200 cycles                | 100%
L3 miss penalty: 100 cycles
L2 miss penalty: 0.98 × 50 + 0.02 × 100 = 51 cycles
L1 miss penalty: 0.95 × 10 + 0.05 × 51 = 12.05 cycles
TLB/page table access: 0.95 × 1 + 0.05 × (1 + 200) = 11 cycles
Data memory access: 0.9 × 1 + 0.1 × 12.05 = 2.105 cycles
Total AMAT: 11 + 2.105 = 13.105 cycles
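A minimal Python sketch reproducing the calculation above, assuming (as the slide does) that a miss at one level costs the average access time of the next level, and that a TLB miss costs the TLB probe plus the 200-cycle page table walk:

```python
# Average memory access time for the TLB + L1/L2/L3 + memory hierarchy.
def level_cost(hit_rate, latency, miss_cost):
    """Average cost of an access that hits with hit_rate or otherwise pays miss_cost."""
    return hit_rate * latency + (1 - hit_rate) * miss_cost

l3_miss_penalty = 100                                    # memory behind L3
l2_miss_penalty = level_cost(0.98, 50, l3_miss_penalty)  # 51
l1_miss_penalty = level_cost(0.95, 10, l2_miss_penalty)  # 12.05
data_access     = level_cost(0.90, 1, l1_miss_penalty)   # 2.105
tlb_access      = level_cost(0.95, 1, 1 + 200)           # 11

print(tlb_access + data_access)                          # ≈ 13.105
```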