Computer Architecture
ILP
Today's Agenda
We'll consider techniques to increase instruction-level
parallelism
o What limits ILP; how much we can expect to extract
o How to best exploit the available ILP
Two main techniques
o Hardware
o Software
Pipeline Review
Pipeline CPI = Ideal pipeline CPI + Structural stalls + Data hazard stalls + Control stalls
Ideal pipeline CPI
o Maximum performance of the implementation
Structural Hazards
o H/w cannot support this combination of instructions
Data Hazards
o Instruction consumes a result not yet produced
Control Hazards
o Caused by time required for branch and jump resolution
ILP Example
Caravanning on a trip: the cars must stay in order so that no one gets lost
At the toll, everyone gets in the same lane to stay in order
This works... but it's slow: everyone has to wait for car D to get through the toll booth
Get two cars through at a time (in parallel)
ILP Basic Concept
Basic idea: overlapping the execution of unrelated instructions to
improve performance is known as instruction-level parallelism
Simple ILP recipe
o If instructions are independent, do them at the same time
o If not, do them one at a time
Two main techniques
1. Rely on hardware to help discover and exploit the parallelism dynamically (market winner: Intel Pentium series)
2. Rely on software technology to find parallelism statically at compile time (special niche markets: Intel Itanium)
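A minimal C sketch of the recipe (illustrative only; the variable names are made up): the first two statements are independent, so hardware or a compiler can overlap them, while the third depends on both and must wait.

int ilp_demo(int a, int b, int c, int d) {
    int x = a + b;   /* independent of the next statement         */
    int y = c * d;   /* can execute at the same time as x = a + b */
    int z = x - y;   /* depends on x and y: executed afterwards   */
    return z;
}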
Basic Instruction Block
A basic instruction block is a straight-line code sequence with no
branches in except at the entry point and no branches out except
at the exit point
o Example: the body of a loop (see the sketch below)
In typical integer code, the dynamic branch frequency is about 15%
(giving an average basic block size of about 7 instructions)
To obtain substantial performance enhancements, we must
exploit ILP across multiple basic blocks
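For a concrete (hypothetical) example in C, the body of the loop below is a basic block: control enters it only at the top and leaves it only at the bottom, with no branches in between.

/* The statements inside the loop body form one basic block. */
int sum_array(const int *a, int n) {
    int sum = 0;
    for (int i = 0; i < n; i++) {
        int v = a[i];  /* single entry point at the top of the body   */
        sum += v;      /* single exit at the bottom, back to the test */
    }
    return sum;
}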
Major ILP Techniques
Loop-Level Parallelism
Exploit parallelism among iterations of a loop
Vector execution is one way
o Graphics, DSP, media apps
o Execute the same instructions on multiple data simultaneously
If not vector, then either
o Dynamic exploitation via branch prediction
o Static exploitation via loop unrolling (unrolling sketch below)
Turn LLP into ILP
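A minimal sketch of the static approach, loop unrolling, in C (illustrative; assumes n is a multiple of 4): the four additions in each unrolled iteration are independent of one another, so the loop-level parallelism becomes instruction-level parallelism the hardware can overlap, and the loop overhead (increment, compare, branch) is paid once per four elements instead of once per element.

/* Original loop: one add per iteration plus loop overhead each time. */
void add_scalar(float *x, float s, int n) {
    for (int i = 0; i < n; i++)
        x[i] = x[i] + s;
}

/* Unrolled by 4 (assumes n % 4 == 0): the four adds are independent,
 * so they can be scheduled and executed in parallel.                 */
void add_scalar_unrolled(float *x, float s, int n) {
    for (int i = 0; i < n; i += 4) {
        x[i]     = x[i]     + s;
        x[i + 1] = x[i + 1] + s;
        x[i + 2] = x[i + 2] + s;
        x[i + 3] = x[i + 3] + s;
    }
}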
Parallel & Dependent Instructions
Instructions are parallel if they can execute
simultaneously, regardless of pipeline depth
Dependent instructions
o Are not parallel
o Must be executed in order
o But may still be partially overlapped
Three types of dependence
o Data dependence (true data dependence)
o Name dependence
o Control dependence
Dependence & Hazards
Hazards: conflicts that arise in the instruction stream in a pipeline
Dependences are a property of the program
o Dependence => potential for a hazard
Three types of dependence
o Data dependence (true data dependence)
o Name dependence
o Control dependence
Data Dependence
Inst J is data dependent on Inst I if
o J uses a result produced by I (J reads an operand that I writes), or
o J is data dependent on Inst K, which is data dependent on I
True dependence (compiler term)
o Can cause a Read After Write (RAW) hazard (example below)
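A small C illustration (made-up variable names): the second statement reads x, which the first statement produces, so J is truly data dependent on I; in a pipeline this pair can turn into a RAW hazard if J tries to read x before I has written it.

int raw_example(int a, int b) {
    int x = a + b;   /* I: produces x                          */
    int y = x * 2;   /* J: consumes x -> true (RAW) dependence */
    return y;
}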
Name Dependence: Anti-dependence
Name dependence
o Two instructions use the same register or memory location (the "name")
o No actual flow of data between the instructions
Anti-dependence
o J writes an operand that I reads (J must not write it before I reads it)
o Can cause a Write After Read (WAR) hazard (example below)
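A small C illustration of an anti-dependence (made-up names): J writes x after I reads it, so no data flows from I to J; they merely reuse the name x, and reordering or badly overlapping them would create a WAR hazard.

int war_example(int x, int a) {
    int y = x + 1;   /* I: reads the old value of x           */
    x = a * 2;       /* J: writes x -> anti-dependence (WAR)  */
    return y + x;
}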
Name Dependence: Output dependence
J writes an operand that I also writes (the write order must be preserved)
Can cause a Write After Write (WAW) hazard
In case of a name dependence: change the name, remove the
dependence!
o Register renaming for register name dependences (renaming sketch below)
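A small C illustration (made-up names): both I and J write x, an output dependence, and renaming J's destination to a fresh variable x2 removes the dependence entirely, which is what hardware register renaming does for registers.

int waw_example(int a, int b) {
    int x = a + 1;   /* I: writes x                             */
    int y = x * 2;   /* reads the first value of x              */
    x = b + 3;       /* J: writes x again -> output dep. (WAW)  */
    return y + x;
}

/* After renaming the second definition, the output dependence is gone: */
int waw_renamed(int a, int b) {
    int x  = a + 1;
    int y  = x * 2;
    int x2 = b + 3;  /* renamed destination: no conflict with x */
    return y + x2;
}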
Control Dependence
Every instruction (except those in the very first basic block) is control
dependent on some set of branches
In general, these control dependences must be preserved to
preserve program order
For example (see the sketch below):
o S1 is control dependent on p1
o S2 is control dependent on p2 but not on p1
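The bullets above refer to a code shape like the following C sketch (p1, p2, S1, S2 are placeholders, not code from the lecture): S1 executes only when p1 is true, so it is control dependent on p1, while S2 depends only on p2.

void control_dep_example(int p1, int p2, int *s1_out, int *s2_out) {
    if (p1) {
        *s1_out = 1;   /* S1: control dependent on p1            */
    }
    if (p2) {
        *s2_out = 2;   /* S2: control dependent on p2, not on p1 */
    }
}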
THE END