Lecture 36
COMPUTER ORGANIZATION AND ARCHITECTURE (COA)
EET 2211
4TH SEMESTER – CSE & CSIT
CHAPTER 18, LECTURE 36
By Ms. Arya Tripathy
MULTICORE COMPUTERS
vThe 1st hardware performance issue is INCREASE IN PARALLELISM AND COMPLEXITY
vThe organizational changes in processor design have primarily been focused on exploiting
ILP, so that more work is done in each clock cycle. These changes include, in chronological
order:
1. Pipelining: Individual instructions are executed through a pipeline of stages so that while
one instruction is executing in one stage of the pipeline, another instruction is executing in
another stage of the pipeline.
2. Superscalar: Multiple pipelines are constructed by replicating execution resources, so that
independent instructions can be executed in parallel pipelines.
3. Simultaneous multithreading (SMT): Register banks are replicated so that multiple threads
can share the use of the pipeline resources.
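As a rough illustration (not part of the slides), the benefit of pipelining can be quantified with the standard ideal-speedup formula: an unpipelined machine takes n·k cycles for n instructions on a k-stage pipeline's worth of work, while the pipeline takes k + (n − 1) cycles once filled. A minimal sketch, assuming no stalls or hazards:

```python
def pipeline_speedup(k: int, n: int) -> float:
    """Ideal speedup of a k-stage pipeline over n instructions:
    unpipelined time = n * k cycles; pipelined time = k + (n - 1) cycles."""
    return (n * k) / (k + n - 1)

# With many instructions, speedup approaches the stage count k.
print(round(pipeline_speedup(5, 1000), 2))  # ~4.98 for a 5-stage pipeline
```

This also shows why a single instruction (n = 1) gains nothing: the speedup is exactly 1.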
vWith each of these innovations, designers have over the years attempted to increase the performance
of the system by adding complexity.
vIn the case of pipelining, for example, simple three-stage pipelines were replaced by pipelines with
five stages.
vThere is a practical limit to how far this trend can be taken, because with more stages, there is the
need for more logic, more interconnections, and more control signals.
vSimilarly, with superscalar organization, increased performance can be achieved by increasing the
number of parallel pipelines.
vThe increase in complexity to deal with all of the logical issues related to very long pipelines,
multiple superscalar pipelines, and multiple SMT register banks means that increasing amounts of the
chip area are occupied with coordinating and signal transfer logic.
vThis increases the difficulty of designing, fabricating, and debugging the chips.
vIn general terms, the experience of recent decades has been encapsulated in a rule of thumb known
as Pollack's rule, which states that the performance increase is roughly proportional to the square
root of the increase in complexity.
vIn other words, if you double the logic in a processor core, then it delivers only about 40% more
performance.
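The arithmetic behind the "40%" figure can be checked directly: doubling complexity gives a predicted gain of √2 ≈ 1.41. A minimal sketch of Pollack's rule:

```python
import math

def pollack_speedup(complexity_ratio: float) -> float:
    """Performance gain predicted by Pollack's rule:
    proportional to the square root of the increase in complexity."""
    return math.sqrt(complexity_ratio)

# Doubling the logic in a core yields only ~41% more performance.
print(round(pollack_speedup(2.0), 2))  # 1.41
```

By contrast, quadrupling the logic would be needed just to double performance, which is why adding cores (near-linear scaling, for parallel software) became more attractive than growing a single core.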
vIn principle, the use of multiple cores has the potential to provide near-linear performance
improvement with the increase in the number of cores—but only for software that can take advantage.
üTo maintain the trend of higher performance as the number of transistors per chip rises, designers
have resorted to more elaborate processor designs (pipelining, superscalar, SMT) and to high clock
frequencies.
üUnfortunately, power requirements have grown exponentially as chip density and clock frequency
have risen.
üOne way to control power density is to use more of the chip area for cache memory.
üMemory transistors are smaller and have a power density an order of magnitude lower than that of
logic.
üPower considerations provide another motive for moving toward a multicore organization. Because
the chip has such a huge amount of cache memory, it becomes unlikely that any one thread of
execution can effectively use all that memory.
üEven with SMT, multithreading is done in a relatively limited fashion and cannot therefore fully
exploit a gigantic cache, whereas a number of relatively independent threads or processes has a greater
opportunity to take full advantage of the cache memory.
9 MULTICORE COMPUTERS 7/16/2021
SOFTWARE PERFORMANCE ISSUES
vThe potential performance benefits of a multicore organization depend on the ability
to effectively exploit the parallel resources available to the application.
vSpeedup = 1 / [(1 − f) + f/N], where f is the fraction of the code that can be parallelized
and N is the number of processors (Amdahl's law).
vBut as Figure (a) on the next slide shows, even a small amount of serial code has a
noticeable impact.
vIf only 10% of the code is inherently serial, running the program on a multicore
system with eight processors yields a performance gain of only a factor of 4.7.
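The factor of 4.7 quoted above follows directly from Amdahl's law. A minimal sketch verifying it:

```python
def amdahl_speedup(f: float, n: int) -> float:
    """Amdahl's law: f is the parallelizable fraction of the code,
    n is the number of processors."""
    return 1.0 / ((1.0 - f) + f / n)

# 10% of the code is inherently serial, so f = 0.9; run on 8 processors:
print(round(amdahl_speedup(0.9, 8), 2))  # 4.71, not 8
```

Even with f = 0.9, the serial 10% dominates as n grows: the speedup can never exceed 1/0.1 = 10 no matter how many cores are added.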
vMany kinds of servers can also effectively use the parallel multicore organization, because
servers typically handle numerous relatively independent transactions in parallel.