
INDIAN STATISTICAL INSTITUTE

Periodical Examination

M. Tech (CS) - I Year (Semester - II)

Computer Architecture

Date: 26.02.2009 Maximum Marks: 40 Duration: 2.5 Hours

Note: Be precise in your answers. This is a three page question paper.

Q 1(i): A computer system contains a special-purpose processor for doing floating-point operations.
You as the designer have determined that 70% of your computations can use the floating-point
processor. When a program uses the floating-point processor, it runs 45% faster than when it
does not. Find the overall speedup from using the floating-point processor.

Ans 1(i): Here, fraction enhanced fen = 0.7 and speedup enhanced sen = 1.45. So, overall
speedup = 1 / ((1 − 0.7) + 0.7/1.45) = 1.2775.

Q 1(ii): In order to improve the speedup, you are considering two options:

Option 1: The compiler is modified so that 80% of the computations can use the
floating-point processor. The cost of this option is Rs. 25 lakhs.
Option 2: The floating-point processor is modified so that it is 100% faster than when it
is not used. Assume in this case that 60% of the computations can use the
floating-point processor. The cost of this option is Rs. 30 lakhs.

Which option would you recommend? Justify your answer quantitatively. [2+4=6]

Ans 1(ii): The relative figures for the two options are:

Option 1: Here fen = 0.8 and sen = 1.45. So, overall speedup = 1 / ((1 − 0.8) + 0.8/1.45) = 1.33.
The cost-to-speedup ratio is 25 lakhs / 1.33 = 18.7969 lakhs.

Option 2: Here fen = 0.6 and sen = 2.0. So, overall speedup = 1 / ((1 − 0.6) + 0.6/2.0) = 1.4286.
The cost-to-speedup ratio is 30 lakhs / 1.4286 = 20.9995 lakhs.

As the cost-to-speedup ratio is smaller for Option 1, we would choose Option 1.

Q 2: Suppose we make an enhancement to a computer that improves a mode of execution by a factor
of 15. The enhanced mode is used 55% of the time, measured as a percentage of the execution
time when the enhanced mode is in use. Find out (a) what percentage of the original execution
time has been converted to fast mode, and (b) what is the speedup we have obtained from fast
mode? [2+4=6]

Ans 2: Let the old execution time be t and the new execution time be t'. Let t1 be the time span in
t for which the enhancement was used. Thus, fen = t1/t. Because of the use of the enhancement,
let the t1 portion of the old execution now finish in time t1' in the new execution. Thus, sen = t1/t1'.
In this problem, what we have been given instead is a new fraction f' = t1'/t' for the enhanced
mode. We have to deduce the speedup in terms of f' and sen. The new execution time t' is
composed of the enhanced time t1' and the unenhanced time (t − t1):

t' = t1' + (t − t1)
   = f' t' + (t − t1)
t'(1 − f') = t − t1 = t − sen t1' = t − sen f' t'
t'(1 − f' + sen f') = t
t/t' = 1 + f'(sen − 1)

So, speedup = t/t' = 1 + f'(sen − 1). Here, sen = 15 and f' = 0.55. So, speedup is equal to
1 + 0.55 × (15 − 1) = 8.7. We know, speedup = 1 / ((1 − fen) + fen/sen). Plugging in the values
of speedup (8.7) and sen = 15, we have fen = 0.948; that is, about 94.8% of the original
execution time has been converted to fast mode.

Q 3 (i): Describe in short the concept of memory hierarchy explaining the role of each level of
memory.

Ans 3(i): This was discussed in class.

Q 3 (ii): The clock cycles per instruction (CPI) of a computer system is 3.0 when all memory
accesses hit in the cache. The only data accesses are loads and stores, and these total 53%
of the instructions. If the miss penalty is 31 clock cycles and the miss rate is 3%, how much
faster would the machine be if all instructions were cache hits? [4+2=6]

Ans 3(ii): CPU execution time for the machine that always hits
= (CPU clock cycles + Memory stall cycles) × clock cycle (CC) = (IC × CPI + 0) × CC
= IC × 3.0 × CC.
Memory stall cycles for the machine with the real cache
= IC × memory references per instruction × miss rate × miss penalty
= IC × (1 + 0.53) × 0.03 × 31   (1 instruction access and 0.53 data accesses per instruction)
= IC × 1.4229.
CPU execution time for the machine with the real cache
= (IC × 3.0 + IC × 1.4229) × CC
= 4.4229 × IC × CC.
Performance ratio (inverse ratio of the execution times)
= CPU execution time with cache / CPU execution time without cache
= (4.4229 × IC × CC) / (3.0 × IC × CC)
= 1.4743.

Q 4: Consider the following piece of 'C' code.

for (i = 0; i <= 100; i++) {
    X[i] = Y[i] + Z;
}

Assume that X and Y are arrays of 32-bit integers and Z and i are 32-bit integers. Assume that
all data values and their addresses are kept in memory at addresses 0, 5000, 1500 and 2000 for
X, Y, Z and i respectively except when they are operated on. Assume that values in registers
are lost between iterations of the loop.
(a) Write the code for DLX. (b) How many memory-data references will be executed? (c)
What is the code size in bytes? [8+2+1=11]

Ans 4: Note the following two sentences in the question. Assume that all data values and their
addresses are kept in memory at addresses 0, 5000, 1500 and 2000 for X, Y, Z and i respectively
except when they are operated on. Assume that values in registers are lost between iterations
of the loop.
Because of this assumption, registers are not used to hold updated or intermediate values.
Values are stored to memory and reloaded when needed. Also, as all addresses fit inside 16
bits, we use immediate instructions.

ADD R1, R0, R0 ;R1 will store ’i’; initialize it to zero
SW 2000(R0), R1 ;store ’i’
loop: LW R1, 2000(R0) ;get value of ’i’
SLL R2, R1, #2 ;making the offset for Y
ADDI R3, R2, #5000 ;add base address and the offset for Y
LW R4, 0(R3) ;load Y[i]
LW R5, 1500(R0) ;load Z
ADD R6, R4, R5 ;Y[i]+Z
LW R1, 2000(R0) ;again get value of ’i’ as
;it cannot be stored in register
SLL R2, R1, #2 ;making the offset for X
ADDI R7, R2, #0 ;add base address and the offset for X
SW 0(R7), R6 ;X[i]=Y[i]+Z
LW R1, 2000(R0) ;get value of ’i’
ADDI R1, R1, #1 ;increment ’i’
SW 2000(R0), R1 ;store ’i’
LW R1, 2000(R0) ;get value of ’i’
ADDI R8, R1, #-101 ;is counter at 101?
BNEZ R8, loop ;loop instruction

The number of instructions executed (though not asked for) is the number of initialization
instructions plus the number of instructions in the loop times the number of iterations, which
is 2 + (16 × 101) = 1618.
The number of memory-data references executed is 1 + (8 × 101) = 809.
Each DLX instruction is 4 bytes wide, so the code size is 18 × 4 = 72 bytes.

Q 5: Show how the code sequence A × B − (A + C × B) will appear on the following architectures:
(a) stack, (b) accumulator, (c) register-memory, and (d) load-store. [1.5+1.5+1.5+1.5=6]

Ans 5: The code sequence for A × B − (A + C × B) in each of these architectures is shown below:

Q 6(i): Explain the effect of instruction pipelining on the bandwidth of the memory systems.

Ans 6(i): This was discussed in the class.

Stack            Accumulator      Register-Memory      Load-Store
push A;          load B;          load R1, C;          load R1, A;
push B;          mul C;           mul R1, B;           load R2, B;
mul;             add A;           add R1, A;           load R3, C;
push A;          store D;         store R1, D;         mul R4, R3, R2;
push C;          load A;          load R2, A;          add R5, R1, R4;
push B;          mul B;           mul R2, B;           mul R6, R1, R2;
mul;             sub D;           sub R2, D;           sub R7, R6, R5;
add;
sub;
Q 6(ii): Consider an unpipelined machine with five stages (Instruction Fetch, Instruction Decode/Register
Fetch, Execute/Address Calculation, Memory Access and Write Back). Assume that it has an
11-ns clock cycle. The machine uses four cycles for ALU operations and branches and five
cycles for memory operations. Assume that the relative frequencies of these operations are
45%, 15% and 40% respectively. Pipelining the machine adds 1 ns of overhead to the clock.
Find out how much speedup we will gain in the instruction execution rate. You can ignore any
latency impact. [3+2=5]

Ans 6(ii): The average instruction execution time on the unpipelined machine is

Average instruction execution time = Clock Cycle (CC) × Average CPI
= 11 ns × ((0.45 + 0.15) × 4 + 0.4 × 5)
= 11 ns × 4.4
= 48.4 ns

In the pipelined version, the clock will run at 11 + 1 = 12 ns. So, the average instruction
execution time is 12 ns. So, the speedup from pipelining is

(avg. instruction time, unpipelined) / (avg. instruction time, pipelined) = 48.4 ns / 12 ns = 4.033
