Parallel Computer Architecture
The End of the Road
Advantages of Multiprocessors
Able to create powerful computers by simply connecting multiple processors
More cost-effective than building a high-performance single processor
Obtain fault tolerance to carry on the tasks, albeit with degraded performance
4 Decades of Computing
Batch Era (1960s)
The IBM System/360 mainframe dominated the corporate computer centers (10 MB disk, 1 MB magnetic core memory)
A typical batch processing machine
No connection beyond the computer room
Time-Sharing Era (1970s)
Advances in solid-state memory & ICs spawned the minicomputer era
Small, fast, and inexpensive enough to be spread throughout the company at the divisional level
Still too expensive and difficult to use to hand over to end-users
Time-sharing computing existed in two kinds:
  centralized data processing mainframes
  time-sharing minicomputers
Desktop Era (1980s)
PCs were introduced in 1977
Many players: Altair, Tandy, Commodore, Apple, IBM, etc.
Became pervasive and changed the face of computing
Along came networked computers (LAN & WAN)
Network Era (1990s)
Advances in network technologies led to the network computing paradigm
Transition from a processor-centric view of computing to a network-centric view
A number of commercial parallel computers with multiple processors appeared:
  shared memory systems
  distributed memory systems
Four Decades of Computing
Feature       Batch (1960s)                Time-Sharing (1970s)   Desktop (1980s)          Network (1990s)
Location      Computer room                Terminal room          Desktop                  Mobile
Users         Experts                      Specialists            Individuals              Groups
Data          Alphanumeric                 Text, numbers          Fonts, graphs            Multimedia
Objective     Calculate                    Access                 Present                  Communicate
Interface     Punched card                 Kbd & CRT              See & point              Ask & tell
Operation     Process                      Edit                   Layout                   Orchestrate
Connectivity  None                         Peripheral cable       LAN                      Internet
Owners        Corporate computer centers   Divisional IS shops    Departmental end-users   Everyone
Current Trends
The substitution of expensive and specialized parallel machines by the more cost-effective clusters of workstations
A cluster is a collection of stand-alone computers connected using some interconnection network
The pervasiveness of the Internet created interest in network computing and, more recently, in grid computing
Grids are geographically distributed platforms of computation, providing dependable, consistent, pervasive, and less expensive access to high-performance computing (HPC) facilities
Flynn's Taxonomy of Computer Architecture
Based on the notion of a stream of information: an instruction stream and a data stream
[Figure: the CPU fetches instruction and data streams from memory; the execute stage manipulates the data as programmed.]
                       Single Data   Multiple Data
Single Instruction     SISD          SIMD
Multiple Instruction   MISD          MIMD
SIMD Architecture
Single Instruction, Multiple Data (SIMD)
time ->
P1: prev instruction | load A(1) | load B(1) | C(1)=A(1)*B(1) | store C(1) | next instruction
P2: prev instruction | load A(2) | load B(2) | C(2)=A(2)*B(2) | store C(2) | next instruction
...
Pn: prev instruction | load A(n) | load B(n) | C(n)=A(n)*B(n) | store C(n) | next instruction
(all processors execute the same instruction at the same time, each on its own data)
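A minimal C sketch of the diagram above (not from the slides; the omp simd pragma is only an optional hint and assumes an OpenMP-capable compiler): every iteration applies the same multiply to a different element, which is what the processing elements do in lock step.

```c
#include <stddef.h>

/* Element-wise multiply C = A * B.
 * On a SIMD machine every processing element applies the same multiply
 * to a different array element; on a modern CPU the same idea appears
 * as vectorization of this loop. Without OpenMP the pragma is ignored
 * and the plain loop is equivalent. */
void elementwise_multiply(const double *A, const double *B, double *C, size_t n)
{
    #pragma omp simd
    for (size_t i = 0; i < n; i++)
        C[i] = A[i] * B[i];          /* C(i) = A(i) * B(i) */
}
```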
MIMD Architecture
[Figure: control unit 1 through control unit n each issue their own instruction stream to their processor P1 ... Pn; each Pi exchanges a data stream with its own memory Mi.]
Multiple Instruction, Multiple Data (MIMD)
time ->
P1: prev instruction | load A(1) | load B(1) | C(1)=A(1)*B(1) | store C(1) | next instruction
P2: prev instruction | call funcD | x=y^z | sum=x^2 | call sub1(i,j) | next instruction
...
Pn: prev instruction | do 10 i=1,N | alpha=w**3 | zeta=C(i) | 10 continue | next instruction
(each processor executes its own instruction stream on its own data)
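A minimal sketch of the MIMD idea, assuming POSIX threads (the slides do not prescribe any API): two threads run completely different instruction streams at the same time, unlike SIMD where all processors run the same one.

```c
#include <pthread.h>
#include <stdio.h>

/* Each thread models one processor running its own instruction stream. */
static void *stream_multiply(void *arg)   /* e.g. C(i) = A(i) * B(i) */
{
    (void)arg;
    return NULL;
}

static void *stream_reduce(void *arg)     /* e.g. sum = x * x        */
{
    (void)arg;
    return NULL;
}

int main(void)
{
    pthread_t p1, p2;
    pthread_create(&p1, NULL, stream_multiply, NULL);
    pthread_create(&p2, NULL, stream_reduce, NULL);
    pthread_join(p1, NULL);
    pthread_join(p2, NULL);
    puts("independent instruction streams finished");
    return 0;
}
```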
SIMD Architecture Model
Consists of two parts:
  a front-end computer
  a processor array
Each processing element (PE) in the array is identical to the others and performs operations on different data in sync
The front-end can access the PEs' memory via the bus
SIMD Architecture Model
Lock-step synchronization: processors either do nothing or perform exactly the same operations simultaneously
In SIMD, parallelism is exploited by applying simultaneous operations across large sets of data
SIMD Configurations
Configuration 1: each PE has its own local memory
[Figure: a Control Unit drives processors P1 ... Pn, each with a local memory M1 ... Mn; the PEs communicate through an interconnection network.]
Configuration 2: PEs and memory modules communicate via the interconnection network
[Figure: a Control Unit drives processors P1 ... Pn, which reach shared memory modules M1 ... Mn through an interconnection network.]
ILLIAC IV
[Figure: ILLIAC IV organization; a Control Unit drives processors P1 ... Pn, each with its own memory M1 ... Mn, connected by an interconnection network.]
MIMD Architecture
[Figure 1.6: Shared memory versus message passing architecture. Left: processors P1 ... Pn share memory modules M1 ... Mn through an interconnection network. Right: each node pairs a processor with its own local memory, and nodes communicate over an interconnection network.]
Shared memory systems: information is exchanged through the central shared memory
Message passing systems: information is exchanged through the network
Shared Memory MIMD Architecture
Uses a bus/cache architecture
Called an SMP (symmetric multiprocessor) since every processor has an equal chance to read/write memory and equal memory access speed
Commercial examples of SMPs: Sequent Computer's Balance and Symmetry, Sun Microsystems' multiprocessor servers, and Silicon Graphics Inc.'s multiprocessor servers
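A minimal shared-memory sketch, assuming POSIX threads on an SMP (an illustration, not from the slides): the workers exchange information simply by writing to and reading from the same address space; no explicit messages are needed.

```c
#include <pthread.h>
#include <stdio.h>

static double partial[2];                 /* lives in the shared memory */

static void *worker(void *arg)
{
    long id = (long)arg;
    partial[id] = (id + 1) * 10.0;        /* write result into shared memory */
    return NULL;
}

int main(void)
{
    pthread_t t[2];
    for (long i = 0; i < 2; i++)
        pthread_create(&t[i], NULL, worker, (void *)i);
    for (int i = 0; i < 2; i++)
        pthread_join(t[i], NULL);
    /* the main thread reads the workers' results directly */
    printf("combined result read from shared memory: %g\n",
           partial[0] + partial[1]);
    return 0;
}
```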
Message Passing MIMD Architecture
Also known as distributed memory: each node of the interconnection network combines a processor with its own local memory
There is no global memory, so data must be moved from one local memory to another by means of message passing, typically a Send/Receive pair of commands written into the application software by the programmer
Programmers must therefore learn the message-passing paradigm, which involves data copying and dealing with consistency issues
Commercial examples c. 1990: the nCUBE, iPSC/2, and various Transputer-based systems; these eventually gave way to Internet-connected systems whose processor/memory nodes were Internet servers or clients
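A minimal Send/Receive sketch using MPI (one concrete message-passing library; the slides only name a generic Send/Receive pair): rank 0 copies a value from its local memory into rank 1's local memory with an explicit message.

```c
#include <mpi.h>
#include <stdio.h>

/* Run with two processes, e.g.: mpirun -np 2 ./a.out */
int main(int argc, char **argv)
{
    int rank;
    double value = 0.0;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    if (rank == 0) {
        value = 3.14;                       /* exists only in rank 0's local memory */
        MPI_Send(&value, 1, MPI_DOUBLE, 1, 0, MPI_COMM_WORLD);
    } else if (rank == 1) {
        MPI_Recv(&value, 1, MPI_DOUBLE, 0, 0, MPI_COMM_WORLD,
                 MPI_STATUS_IGNORE);        /* copied into rank 1's local memory */
        printf("rank 1 received %g\n", value);
    }

    MPI_Finalize();
    return 0;
}
```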
Shared memory: programming is easier
Message passing: provided scalability
DSM (distributed shared memory) is a hybrid between the two
DSM
Memory is physically distributed [as in message passing]
Memory can be addressed as one (logically shared) address space [as in shared memory]
Programming-wise, the architecture looks and behaves like a shared memory machine, but a message passing architecture lives underneath the software
Example: SGI Origin2000
SIMD
[Figure: the two SIMD configurations repeated; PEs with their own local memories, and PEs sharing memory modules M1 ... Mn through the interconnection network.]
Access control: determines which process accesses are possible to which resources
Synchronization: constraints that limit the times at which sharing processes may access shared resources
SIMD
[Figure: the two SIMD configurations repeated.]
Protection: a system feature that prevents processes from making arbitrary access to resources belonging to other processes
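The synchronization and access-control points above are usually enforced in software with locks. A minimal sketch assuming POSIX threads (an illustration, not from the slides): a mutex serializes access to a shared counter so concurrent increments do not interfere.

```c
#include <pthread.h>
#include <stdio.h>

static long counter = 0;                         /* shared resource */
static pthread_mutex_t lock = PTHREAD_MUTEX_INITIALIZER;

static void *increment(void *arg)
{
    (void)arg;
    for (int i = 0; i < 100000; i++) {
        pthread_mutex_lock(&lock);               /* gain exclusive access */
        counter++;
        pthread_mutex_unlock(&lock);             /* release the resource  */
    }
    return NULL;
}

int main(void)
{
    pthread_t t1, t2;
    pthread_create(&t1, NULL, increment, NULL);
    pthread_create(&t2, NULL, increment, NULL);
    pthread_join(t1, NULL);
    pthread_join(t2, NULL);
    printf("counter = %ld (expected 200000)\n", counter);
    return 0;
}
```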
MIMD
[Figure: shared memory versus message passing MIMD architecture (repeated).]
Nodes are typically able to simultaneously store messages in buffers and perform send/receive operations
Scalable: the number of processors can be increased without significant decrease in efficiency of operation
Interconnection Networks
Interconnection Networks (INs)
Can be classified based on:
  mode of operation
  control strategy
  switching techniques
  topology
Mode of Operation
Accordingly, INs are classified as:
Synchronous: a single global clock is used by all components, operating in a lock-step manner
Asynchronous: does not require a global clock; handshaking signals are used instead
Synchronous INs tend to be slower than asynchronous ones; however, they are race- and hazard-free.
Control Strategy
Accordingly, INs are classified as:
Centralized: a single central control unit is used to oversee and control the operation
Decentralized: the control function is distributed among different components
Control Strategy
The function and reliability of the central control unit can become the bottleneck in a centralized control system
While the crossbar is a centralized system, the multistage interconnection networks are decentralized
Switching Techniques
INs can be classified as:
Circuit switching: a complete path has to be established and remain in existence during the whole communication period
Packet switching: communication takes place via messages that are divided into smaller entities (packets); packets travel in a store-and-forward manner
While packet switching tends to use resources more efficiently, it suffers from variable packet delays
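A rough back-of-the-envelope comparison of the two techniques (the numbers below are made-up assumptions, and propagation and routing overheads are ignored): circuit switching pays a one-time set-up cost and then streams the whole message over the reserved path, while store-and-forward packet switching pays a per-hop transmission time that successive packets can pipeline over.

```c
#include <stdio.h>

int main(void)
{
    double bandwidth = 1e9;          /* bytes per second per link (assumed) */
    double msg       = 1e6;          /* message size in bytes (assumed)     */
    double pkt       = 1e3;          /* packet size in bytes (assumed)      */
    double setup     = 50e-6;        /* circuit set-up time in s (assumed)  */
    int    hops      = 4;            /* links on the path (assumed)         */

    double t_pkt     = pkt / bandwidth;        /* per-packet, per-hop time  */
    double k         = msg / pkt;              /* number of packets         */

    /* circuit switching: set up once, then stream the whole message */
    double t_circuit = setup + msg / bandwidth;
    /* store-and-forward: first packet crosses all hops, the rest pipeline */
    double t_sf      = (hops + k - 1) * t_pkt;

    printf("circuit switching : %.3f ms\n", t_circuit * 1e3);
    printf("store-and-forward : %.3f ms\n", t_sf * 1e3);
    return 0;
}
```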
Topology
Topology describes how to connect processors and memories to other processors and memories
Shared Memory INs
[Figure: bus-based organization, where processors P, each with a cache C, share a bus to a global memory; and switch-based organization, where processors with caches reach memory modules M through a switch.]
Message Passing INs
Static interconnection networks
Dynamic interconnection networks
Static INs
Linear Array
Ring
Mesh
Tree
Hypercube (see the hop-count sketch below)
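A small sketch for the hypercube entry above (illustration only, not from the slides): in a hypercube, nodes carry d-bit binary labels and two nodes are linked exactly when their labels differ in one bit, so the minimum number of hops between two nodes is the Hamming distance between their labels.

```c
#include <stdio.h>

/* Minimum hops between two hypercube nodes = Hamming distance of labels. */
static int hops(unsigned src, unsigned dst)
{
    unsigned diff = src ^ dst;   /* bits where the labels differ */
    int h = 0;
    while (diff) {               /* count the differing bits     */
        h += diff & 1u;
        diff >>= 1;
    }
    return h;
}

int main(void)
{
    /* 4-dimensional hypercube (16 nodes): node 0101 to node 1100 */
    printf("hops(0101 -> 1100) = %d\n", hops(0x5, 0xC));  /* prints 2 */
    return 0;
}
```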
Dynamic INs
Establish a connection between two or more nodes on the fly as messages are routed along the links
The number of hops in a path from source to destination node is equal to the number of point-to-point links a message must traverse to reach its destination
Single-stage
Multiple-stage
Crossbar switch