Tutorial 3

This document contains 6 tutorial problems about cache memory optimizations. The problems cover topics like calculating block size based on a memory address, determining when reducing miss rate or increasing hit latency improves average memory access time, calculating misses per 1000 instructions and memory stall cycles per miss, determining speedup from a perfect cache, and identifying which cache sets would be filled when executing a sequence of instructions and data accesses.

Uploaded by

Rama Devi


Multicore Computer Architecture - Storage and Interconnects

Tutorial 3
Cache Memory Optimizations

Dr. John Jose

Assistant Professor
Department of Computer Science & Engineering
Indian Institute of Technology Guwahati, Assam.
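Tutorial Problem-1
• The address of a word in a byte-addressable 16 MB physical memory is 0xAA0C2A. This word, upon being brought into the cache, is mapped to set 48. What is the block size of the cache memory?
• 16 MB = 2^24 bytes, so the address is 24 bits: 0xAA0C2A = 1010 1010 0000 1100 0010 1010
• Set 48 = 110000 in binary. This pattern occupies address bits 11 to 6, so the remaining low 6 bits (101010) are the block offset.
• Offset = 6 bits → block size = 2^6 = 64 bytes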
Tutorial Problem-2
• A cache has an access time (hit latency) of 10 ns and a miss rate of 5%. An optimization reduces the miss rate to 3% but increases the hit latency to 15 ns. Under what condition will this change result in better performance (lower average memory access time)?
• AMAT1 = HT1 + MR1 x MP, with HT1 = 10 ns, MR1 = 0.05
• AMAT2 = HT2 + MR2 x MP, with HT2 = 15 ns, MR2 = 0.03
• AMAT2 < AMAT1 requires 15 + 0.03 x MP < 10 + 0.05 x MP, i.e. MP > 250 ns
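The break-even miss penalty for Problem 2 follows from setting the two AMAT expressions equal. A small sketch of that arithmetic, with hypothetical function names `amat` and `break_even_penalty` chosen only for illustration:

```python
def amat(hit_time_ns: float, miss_rate: float, miss_penalty_ns: float) -> float:
    """Average memory access time = hit time + miss rate * miss penalty."""
    return hit_time_ns + miss_rate * miss_penalty_ns


def break_even_penalty(ht1: float, mr1: float, ht2: float, mr2: float) -> float:
    """Miss penalty at which both configurations have the same AMAT.
    Assumes mr1 > mr2 (the optimization lowers the miss rate)."""
    return (ht2 - ht1) / (mr1 - mr2)


threshold = break_even_penalty(10, 0.05, 15, 0.03)
print(f"optimization helps when miss penalty > {threshold:.0f} ns")

# Spot checks on either side of the threshold.
for mp in (200, 300):
    print(mp, round(amat(10, 0.05, mp), 2), round(amat(15, 0.03, mp), 2))
```

With a 200 ns penalty the original cache is faster; with a 300 ns penalty the optimized cache is faster.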
Tutorial Problem-3
• A cache has a hit rate of 90%, a 64-byte block, and a hit latency of 5 ns. Main memory takes 150 ns to return the first word (32 bits) of a block and 10 ns for each subsequent word.
(a) What is the miss latency of the cache?
(b) If doubling the cache block size reduces the miss rate to 3%, does it reduce the average memory access time?
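One way to work Problem 3 numerically is sketched below. It assumes that the miss latency is the time to fetch the whole block from main memory (no early restart or critical-word-first) and that the hit latency is unchanged when the block size doubles; neither assumption is stated in the slides, and the function names are hypothetical.

```python
WORD_BYTES = 4  # the problem states a 32-bit word


def block_fetch_time_ns(block_bytes: int, first_word_ns: float = 150,
                        next_word_ns: float = 10) -> float:
    """Time to bring a full block from memory: first word, then the rest."""
    words = block_bytes // WORD_BYTES
    return first_word_ns + (words - 1) * next_word_ns


def amat_ns(hit_ns: float, miss_rate: float, miss_penalty_ns: float) -> float:
    """Average memory access time = hit time + miss rate * miss penalty."""
    return hit_ns + miss_rate * miss_penalty_ns


penalty_64 = block_fetch_time_ns(64)    # 16 words
penalty_128 = block_fetch_time_ns(128)  # 32 words

amat_64 = amat_ns(5, 0.10, penalty_64)    # 90% hit rate -> 10% miss rate
amat_128 = amat_ns(5, 0.03, penalty_128)  # miss rate drops to 3%

print(f"(a) miss latency, 64 B block: {penalty_64:.0f} ns")
print(f"(b) AMAT 64 B: {amat_64:.1f} ns, AMAT 128 B: {amat_128:.1f} ns")
```

Under these assumptions the larger block costs more per miss but misses so much less often that the average access time drops.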
Tutorial Problem-4
• A cache has a miss rate of 3% and a miss penalty of 500 cycles. In a program, 50% of the instructions are memory accesses (load/store).
• (a) Find the misses per 1000 instructions (MPKI).
• (b) Find the memory stall cycles per miss.
• Miss rate = misses / memory access = (misses / instruction) / (memory accesses / instruction)
• MR = MPI / MAPI, so MPI = MR x MAPI, with MAPI = 1.5 (one instruction fetch plus 0.5 data accesses per instruction)
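The relation MPI = MR x MAPI can be evaluated directly for Problem 4. The sketch below uses hypothetical variable names and additionally reports the stall cycles per instruction, which is an extra quantity not asked for in the slides; the stall per miss is taken to be the miss penalty itself.

```python
MISS_RATE = 0.03           # misses per memory access
MISS_PENALTY_CYCLES = 500  # stall cycles incurred by each miss
DATA_ACCESS_FRACTION = 0.5 # loads/stores per instruction

# Every instruction is fetched once and may also access data.
mem_accesses_per_instr = 1 + DATA_ACCESS_FRACTION      # MAPI
misses_per_instr = MISS_RATE * mem_accesses_per_instr  # MPI
mpki = misses_per_instr * 1000

# Assumption: each miss stalls the processor for the full miss penalty.
stall_cycles_per_miss = MISS_PENALTY_CYCLES
stall_cycles_per_instr = misses_per_instr * MISS_PENALTY_CYCLES

print(f"MAPI = {mem_accesses_per_instr}")
print(f"MPKI = {mpki:.0f}")
print(f"stall cycles per miss = {stall_cycles_per_miss}")
print(f"stall cycles per instruction = {stall_cycles_per_instr:.1f}")
```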
Tutorial Problem-5
• Consider a cache system in which the I-cache miss rate is 2% and the D-cache miss rate is 4%. The processor has CPI = 2 without memory stalls, and the miss penalty is 100 cycles for all misses. Determine how much faster the processor would run with a perfect cache that never missed. Assume the frequency of all loads and stores is 36%.
• CPI real = Base CPI + Stall CPI; CPI ideal = Base CPI = 2
• Stall CPI = (% use of IC x stall of IC) + (% use of DC x stall of DC), where the stall of each cache is its miss rate x miss penalty
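The stall CPI formula for Problem 5 can be evaluated as follows. This is a sketch with hypothetical variable names; it assumes the I-cache is used by every instruction (100% use) and the D-cache only by loads and stores (36% use).

```python
BASE_CPI = 2.0
MISS_PENALTY = 100   # cycles, same for both caches

ICACHE_USE = 1.0     # every instruction is fetched through the I-cache
ICACHE_MISS_RATE = 0.02
DCACHE_USE = 0.36    # fraction of instructions that are loads/stores
DCACHE_MISS_RATE = 0.04

# Stall CPI = sum over caches of (use fraction * miss rate * miss penalty).
stall_cpi = (ICACHE_USE * ICACHE_MISS_RATE * MISS_PENALTY
             + DCACHE_USE * DCACHE_MISS_RATE * MISS_PENALTY)

cpi_real = BASE_CPI + stall_cpi
cpi_ideal = BASE_CPI
speedup = cpi_real / cpi_ideal

print(f"stall CPI = {stall_cpi:.2f}")
print(f"real CPI  = {cpi_real:.2f}")
print(f"speedup with a perfect cache = {speedup:.2f}x")
```

Because instruction count and clock rate are unchanged, the ratio of CPIs is the ratio of execution times.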
Tutorial Problem-6
• Consider a 32-bit processor with a 16 KB direct-mapped L1 cache that uses a block size of 4 words. It has a 256 KB L2 cache with 4-way associativity and a block size of 8 words. The system uses a byte-addressable 256 MB DRAM system. Upon running a program, 16 consecutive fixed-length instructions (each instruction is one word) starting at main memory address 0x8226620 are executed. These instructions operate on an array A of 8 words with starting address 0x42AF5F8. Assuming the caches are initially empty, indicate the non-empty sets in the L1 cache and L2 cache after the execution of the program.
• 32-bit processor: 1 word = 4 bytes; 256 MB DRAM → 28-bit address
• L1 cache: 16 KB, direct mapped, block size = 4 words (16 B) → 1024 sets (4 offset bits, 10 index bits)
• L2 cache: 256 KB, 4-way, block size = 8 words (32 B) → 2048 sets (5 offset bits, 11 index bits)
• Instructions: start at 0x8226620, 16 consecutive fixed-length instructions (each instruction is one word). Data: start at 0x42AF5F8, array of 8 words.
• Non-empty sets
• L1: sets 610, 611, 612, 613 (4 words x 4 blocks = 16 instructions); sets 863, 864, 865 (2 + 4 + 2 words of data array A)
• L2: sets 817, 818 (8 words x 2 blocks = 16 instructions); sets 1967, 1968 (2 + 6 words of data array A)
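The set numbers for Problem 6 can be reproduced by mapping each touched memory block to its set. The sketch below is illustrative only: the helper `touched_sets` is hypothetical, and it assumes every instruction and every array element is actually accessed and that each L1 miss also fills the corresponding L2 block.

```python
WORD_BYTES = 4  # 32-bit processor, one word = 4 bytes


def touched_sets(start_addr: int, num_words: int,
                 block_bytes: int, num_sets: int) -> list[int]:
    """Return the sorted set indices touched by num_words consecutive words."""
    first_block = start_addr // block_bytes
    last_block = (start_addr + num_words * WORD_BYTES - 1) // block_bytes
    # Set index = block number modulo the number of sets.
    return sorted({block % num_sets for block in range(first_block, last_block + 1)})


# L1: 16 KB, direct mapped, 16-byte blocks -> 1024 sets.
L1_BLOCK, L1_SETS = 16, (16 * 1024) // 16
# L2: 256 KB, 4-way, 32-byte blocks -> 2048 sets.
L2_BLOCK, L2_SETS = 32, (256 * 1024) // (32 * 4)

CODE_START, CODE_WORDS = 0x8226620, 16
DATA_START, DATA_WORDS = 0x42AF5F8, 8

l1_code = touched_sets(CODE_START, CODE_WORDS, L1_BLOCK, L1_SETS)
l1_data = touched_sets(DATA_START, DATA_WORDS, L1_BLOCK, L1_SETS)
l2_code = touched_sets(CODE_START, CODE_WORDS, L2_BLOCK, L2_SETS)
l2_data = touched_sets(DATA_START, DATA_WORDS, L2_BLOCK, L2_SETS)

print("L1 instruction sets:", l1_code)
print("L1 data sets:       ", l1_data)
print("L2 instruction sets:", l2_code)
print("L2 data sets:       ", l2_data)
```

The array spans three L1 blocks and two L2 blocks because its start address is not block-aligned in either cache.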
johnjose@iitg.ac.in
http://www.iitg.ac.in/johnjose/
