Chapter # 1
Parallel Computing
Lecture – Introduction
Background – Serial Computing
• Earlier, computer software was conventionally written for serial computing.
• Standard computing is also known as "serial computing".
• This means that to solve a problem, an algorithm divides the problem into smaller instructions.
• These discrete instructions are then executed on the Central Processing Unit of a computer one by one.
• Only after one instruction has finished does the next one start.
Background – Serial Computing
• A real-life example of this would be people standing in a queue waiting for a movie ticket, with only one cashier.
• The cashier gives tickets to the people one by one. The complexity of this situation increases when there are 2 queues and only one cashier.
• So, in short, Serial Computing is the following (a minimal code sketch follows this list):
1. A problem statement is broken into discrete instructions.
2. The instructions are then executed one by one.
3. Only one instruction is executed at any moment of time.
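A minimal C sketch of serial execution (illustrative only, not from the slides): the three statements below run one after another on a single CPU, and each starts only after the previous one has finished.

#include <stdio.h>

int main(void) {
    int a = 2 + 3;          /* instruction 1                    */
    int b = a * 4;          /* instruction 2 (must wait for a)  */
    printf("b = %d\n", b);  /* instruction 3 (must wait for b)  */
    return 0;
}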
Background – Serial Computing
• Look at point 3. This caused a huge problem in the computing industry, as only one instruction was executed at any moment of time. It was also a huge waste of hardware resources, since only one part of the hardware was active for a particular instruction at a time.
• As problem statements became heavier and bulkier, so did the time needed to execute them. Examples of such processors are the Pentium 3 and Pentium 4.
• Now let's come back to our real-life problem. We can definitely say that the complexity decreases when there are 2 queues and 2 cashiers giving tickets to 2 persons simultaneously. This is an example of Parallel Computing.
Parallel Computer
• Virtually all stand-alone computers today are parallel from a hardware
perspective:
- Multiple functional units (L1 cache, L2 cache, branch, prefetch, decode,
floating-point, graphics processing (GPU), integer, etc.)
- Multiple execution units/cores
- Multiple hardware threads
Parallel Computer
• Networks connect multiple stand-alone computers (nodes) to make larger
parallel computer clusters.
Parallel Computing
• A kind of computing architecture in which large problems are broken into independent, smaller, usually similar parts that can be processed simultaneously.
Parallel Computing
• Helps in faster application processing and task resolution by increasing the available computation power of systems.
• Most supercomputers operate on parallel computing principles.
• Parallel processing is commonly used in operational scenarios that demand massive processing power or computation.
Parallel Computing
• Typically, this infrastructure is housed in a server rack with multiple processors installed; an application server distributes computational requests into small chunks, which are then processed simultaneously on each server.
• The earliest computer software was written for serial computation, executing a single instruction at a time; parallel computing is different in that several processors execute an application or computation at the same time.
Parallel Computing – Why?
• The Real World is Massively Complex
- In the natural world, many complex, interrelated events are happening at the same time, yet within a temporal sequence.
Parallel Computing – Why?
• The Real World is Massively Complex
- Compared to serial computing, parallel computing is much better suited for modeling, simulating and understanding complex, real-world phenomena.
- For example, imagine modeling such phenomena serially.
Parallel Computing – Why?
• Save Time/Money
- In theory, throwing more resources at a task will shorten its time to
completion, with potential cost savings.
- Parallel computers can be built from cheap, commodity components.
Parallel Computing – Why?
• Solve Larger/Complex Problems
- Many problems are so large and/or complex that it is
impractical or impossible to solve them using a serial
program, especially given limited computer memory.
Parallel Computing – Why?
• Solve Larger/Complex Problems
- Example: "Grand Challenge Problems"
(en.wikipedia.org/wiki/Grand_Challenge) requiring petaflops and
petabytes of computing resources.
- Example: Web search engines/databases processing millions of
transactions every second
Petaflops = 10^15 (one quadrillion) floating-point operations per second
Petabytes = 10^15 (one quadrillion) bytes
Parallel Computing – Why?
• Provide Concurrency
- Single compute resource can only do one thing at a time.
Multiple compute resources can do many things simultaneously.
- Example: Collaborative Networks provide a global venue where
people from around the world can meet and conduct work
"virtually".
Parallel Computing – Why?
• Take Advantage of non-local Resources
- Using compute resources on a wide area network, or even the
Internet when local compute resources are scarce or insufficient.
- Example: SETI@home (setiathome.berkeley.edu) has over 1.7
million users in nearly every country in the world. (May, 2018).
Parallel Computing – Why?
• Better Use of Underlying Parallel Hardware
- Modern computers, even laptops, are parallel in architecture with multiple
processors/cores.
- Parallel software is specifically intended for parallel hardware with
multiple cores, threads, etc.
- In most cases, serial programs run on modern computers "waste"
potential computing power.
Parallel Computing – Who is using?
• Science and Engineering
- Historically, parallel computing has been considered to be "the high end of
computing", and has been used to model difficult problems in many areas
of science and engineering:
- Atmosphere, Earth, Environment
- Physics - applied, nuclear, particle, condensed matter, high pressure,
fusion, photonics
- Bioscience, Biotechnology, Genetics
- Chemistry, Molecular Sciences
Parallel Computing – Who is using?
• Science and Engineering
- Geology, Seismology
- Mechanical Engineering - from prosthetics to spacecraft
- Electrical Engineering, Circuit Design, Microelectronics
- Computer Science, Mathematics
- Defense, Weapons
Parallel Computing – Who is using?
• Industrial and Commercial
- Today, commercial applications provide an equal or greater driving force in the development of faster computers. These applications require the processing of large amounts of data in sophisticated ways. For example:
- "Big Data", data mining
- Artificial Intelligence (AI)
- Oil exploration
Parallel Computing – Who is using?
• Industrial and Commercial
- Web search engines, web based business services
- Medical imaging and diagnosis
- Pharmaceutical design
- Financial and economic modeling
- Management of national and multi-national corporations
- Advanced graphics and virtual reality, particularly in the entertainment
industry
- Networked video and multi-media technologies
- Collaborative work environments
Parallel Computing – Who is using?
• Global Applications
- Parallel computing is now being used extensively around the world, in a
wide variety of applications
Parallel Computing - Advantages
• It saves time and money as many resources working together will reduce the
time and cut potential costs.
• It can take advantage of non-local resources when the local resources are
finite.
Parallel Computing – Flynn’s Taxonomy
• M.J. Flynn proposed a classification for the organization of a computer system by the
number of instructions and data items that are manipulated simultaneously.
• Parallel processing may occur in the instruction stream, in the data stream, or both.
• Four combinations: SISD, SIMD, MISD, MIMD
Parallel Computing – Flynn’s Taxonomy
• SISD (Single Instruction, Single Data): instructions are executed sequentially, and the system may or may not have internal parallel processing capabilities.
• Most conventional computers have the SISD architecture, like the traditional von Neumann computers.
Parallel Computing – Flynn’s Taxonomy
• SIMD (Single Instruction, Multiple Data): instructions are decoded by a single Control Unit, which then sends the instructions to the processing units for execution.
Parallel Computing – Flynn’s Taxonomy
- An analogy: a commander barks out an order, and all the active soldiers execute that order synchronously.
Parallel Computing – Flynn’s Taxonomy
• Most modern computers, particularly those with graphics processing units (GPUs), employ
SIMD instructions and execution units.
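As an illustration (a sketch that assumes an x86 CPU with SSE support; not taken from the slides), the single SIMD add below operates on four pairs of floats at once:

#include <stdio.h>
#include <xmmintrin.h>                     /* SSE intrinsics */

int main(void) {
    __m128 a = _mm_set_ps(4.0f, 3.0f, 2.0f, 1.0f);
    __m128 b = _mm_set_ps(40.0f, 30.0f, 20.0f, 10.0f);
    __m128 c = _mm_add_ps(a, b);           /* one instruction, four additions */

    float out[4];
    _mm_storeu_ps(out, c);
    printf("%g %g %g %g\n", out[0], out[1], out[2], out[3]);   /* 11 22 33 44 */
    return 0;
}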
Parallel Computing – Flynn’s Taxonomy
• MIMD (Multiple Instruction, Multiple Data): processors are asynchronous, since they can independently execute different programs on different data sets.
• MIMDs have been considered by most researchers to include the most powerful, least restricted computers.
Parallel Computing – Parallel Computer Architectures
• Shared memory: All processors can access the same memory
Parallel Computing – Non-Uniform Memory Access (NUMA)
• Not all processors have equal access to all memories
Parallel Computing – Limitations
• It introduces challenges such as communication and synchronization between multiple sub-tasks and processes, which are difficult to achieve.
• The algorithms must be structured in such a way that they can be handled by a parallel mechanism.
• The algorithms or programs must have low coupling and high cohesion, but it is difficult to create such programs.
Parallel Computing – Types
• Bit-level parallelism
- Every task is dependent on the processor word size. For a task on large-sized data, a larger word size reduces the number of instructions the processor must execute.
- Otherwise, the operation must be split into a series of instructions. For example, suppose there is an 8-bit processor and you want to perform an operation (say, an addition) on two 16-bit numbers. The processor must first operate on the 8 lower-order bits and then on the 8 higher-order bits, so two instructions are needed to execute the operation. A 16-bit processor can perform the operation with a single instruction (a minimal code sketch follows below).
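A minimal C sketch of the idea (the function name and values are hypothetical): the 16-bit addition is carried out as two 8-bit additions with a carry, exactly the two-instruction case described above.

#include <stdint.h>
#include <stdio.h>

/* Add two 16-bit values using only 8-bit operations, as an 8-bit ALU would. */
uint16_t add16_on_8bit_alu(uint16_t x, uint16_t y) {
    uint8_t xl = x & 0xFF, xh = x >> 8;
    uint8_t yl = y & 0xFF, yh = y >> 8;

    uint16_t low  = (uint16_t)xl + yl;        /* first 8-bit add            */
    uint8_t carry = (low > 0xFF) ? 1 : 0;     /* carry out of the low byte  */
    uint8_t high  = xh + yh + carry;          /* second 8-bit add + carry   */

    return (uint16_t)((high << 8) | (low & 0xFF));
}

int main(void) {
    printf("%u\n", (unsigned)add16_on_8bit_alu(300, 500));   /* prints 800 */
    return 0;
}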
Parallel Computing – Types
• Instruction-level parallelism
- In a purely sequential design, a processor can issue at most one instruction per clock cycle. Instructions that are independent of each other can, however, be re-ordered and grouped, and then executed concurrently without affecting the result of the program. This is called instruction-level parallelism (a small illustration follows below).
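An illustrative C fragment (a sketch, not from the slides): the four statements below have no data dependences on one another, so a superscalar processor may re-order them and issue several in the same clock cycle without changing the result.

/* The compiler/processor is free to overlap these independent operations. */
void ilp_example(int out[4], int a, int b, int c, int d) {
    int w = a + b;   /* independent */
    int x = c + d;   /* independent */
    int y = a * c;   /* independent */
    int z = b * d;   /* independent */
    out[0] = w; out[1] = x; out[2] = y; out[3] = z;
}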
Parallel Computing – Parallel Programming
• Parallel programming (also, unfortunately, sometimes called concurrent
programming), is a computer programming technique that provides for the
execution of operations concurrently, either
- within a single parallel computer OR
- across a number of systems.
• In the latter case, the term distributed computing is used.
Parallel Computing – Parallel Programming Model
• Shared Memory Model
- Tasks share a common address space, which they read and write asynchronously.
- Various mechanisms such as locks/semaphores may be used to control access to the shared memory (see the sketch after this list).
- Advantage:
- No need to explicitly communicate data between tasks, which simplifies programming.
- Disadvantages:
- Care is needed when managing memory, to avoid synchronization conflicts.
- Harder to control data locality.
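A minimal shared-memory sketch using POSIX threads (illustrative; assumes a POSIX system, compile with -pthread): two threads update one shared counter, and a mutex (lock) serializes access so the updates do not conflict.

#include <pthread.h>
#include <stdio.h>

static long counter = 0;                              /* shared data in the common address space */
static pthread_mutex_t lock = PTHREAD_MUTEX_INITIALIZER;

static void *worker(void *arg) {
    (void)arg;
    for (int i = 0; i < 100000; i++) {
        pthread_mutex_lock(&lock);                    /* enter critical section */
        counter++;
        pthread_mutex_unlock(&lock);                  /* leave critical section */
    }
    return NULL;
}

int main(void) {
    pthread_t t1, t2;
    pthread_create(&t1, NULL, worker, NULL);
    pthread_create(&t2, NULL, worker, NULL);
    pthread_join(t1, NULL);
    pthread_join(t2, NULL);
    printf("counter = %ld\n", counter);               /* 200000 */
    return 0;
}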
Parallel Computing – Parallel Programming Model
• Threads
- A thread can be considered as a subroutine within the main program
- Threads communicate with each other through the global memory
- Commonly associated with shared-memory architectures and operating systems
- Implementations: POSIX Threads (pthreads) and OpenMP (see the sketch below)
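A minimal OpenMP sketch of the threads model (illustrative; compile with -fopenmp): the directive forks a team of threads that share the program's global memory, and each thread prints its own id.

#include <omp.h>
#include <stdio.h>

int main(void) {
    #pragma omp parallel                 /* fork a team of threads */
    {
        printf("Hello from thread %d of %d\n",
               omp_get_thread_num(), omp_get_num_threads());
    }                                    /* implicit join at the end of the region */
    return 0;
}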
Parallel Computing – Parallel Programming Model
• Message Passing
- A set of tasks that use their own local memory during computation.
- Data exchange through sending and receiving messages.
- Data transfer usually requires cooperative operations to be performed by
each process. For example, a send operation must have a matching receive
operation.
- MPI (released in 1994)
- MPI-2 (released in 1996)
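A minimal MPI sketch of the message-passing model (illustrative; assumes an MPI installation, run with e.g. mpirun -np 2): task 0 sends one integer from its local memory, and the matching receive in task 1 completes the transfer.

#include <mpi.h>
#include <stdio.h>

int main(int argc, char **argv) {
    int rank, value;
    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    if (rank == 0) {
        value = 42;
        MPI_Send(&value, 1, MPI_INT, 1, 0, MPI_COMM_WORLD);        /* send to task 1   */
    } else if (rank == 1) {
        MPI_Recv(&value, 1, MPI_INT, 0, 0, MPI_COMM_WORLD,
                 MPI_STATUS_IGNORE);                                /* matching receive */
        printf("Task 1 received %d from task 0\n", value);
    }

    MPI_Finalize();
    return 0;
}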
Parallel Computing – Parallel Programming Model
• Data Parallel Model
- On shared memory architectures, all tasks may have access to the data structure through
global memory. On distributed memory architectures, the data structure is split up and
resides as "chunks" in the local memory of each task.
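A minimal data-parallel sketch on shared memory (illustrative, using OpenMP; compile with -fopenmp): every thread applies the same operation to its own chunk of one global array.

#include <stdio.h>

#define N 1000000
static double a[N];                      /* the global data structure */

int main(void) {
    #pragma omp parallel for             /* iterations are split into chunks across threads */
    for (int i = 0; i < N; i++)
        a[i] = 2.0 * i;                  /* same operation on every element */

    printf("a[N-1] = %f\n", a[N - 1]);
    return 0;
}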
Parallel Computing – Parallel Programming Model
• Hybrid
- Combines various models, e.g. MPI + OpenMP (see the sketch below)
• Single Program Multiple Data (SPMD)
- A single program is executed by all tasks simultaneously
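A minimal hybrid SPMD sketch (illustrative; assumes MPI plus OpenMP, built with e.g. mpicc -fopenmp): the same program runs as several MPI tasks, and each task opens an OpenMP parallel region on its own cores.

#include <mpi.h>
#include <omp.h>
#include <stdio.h>

int main(int argc, char **argv) {
    int rank;
    MPI_Init(&argc, &argv);                    /* one MPI task per process/node        */
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    #pragma omp parallel                       /* one OpenMP thread team per MPI task  */
    printf("MPI task %d, OpenMP thread %d\n", rank, omp_get_thread_num());

    MPI_Finalize();
    return 0;
}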
von Neumann Computer
• Model for designing and building computers, based on the following
three characteristics:
1) The computer consists of four main sub-systems:
• Memory: stores data and the program
• ALU (Arithmetic/Logic Unit): performs the arithmetic/logic operations requested by the program
• Control Unit: executes the program
• Input/Output System (I/O): communicates with the "outside world", e.g. screen, keyboard, storage devices, ...
von Neumann Computer
• All computers are more or less based on the same basic design: the von Neumann architecture!
Personal Computer
Motivations for Parallel Computing
• Fundamental limits on single processor speed
• Heat dissipation from CPU chips
• Disparity between CPU & memory speeds
• Distributed data communications
• Need for very large scale computing platforms
Motivations for Parallel Computing
• Fundamental limits – Cycle Speed
- Intel 8080 2MHz 1974
- ARM 2 8MHz 1986
- Intel Pentium Pro 200MHz 1996
- AMD Athlon 1.2GHz 2000
- Intel QX6700 2.66GHz 2006
- Intel Core i7 3770k 3.9GHz 2013
- Speed of light: 30cm in 1ns
Moore’s Law
• Moore’s observation in 1965: the number of transistors per square
inch on integrated circuits had doubled every year since the
integrated circuit was invented
• Moore’s revised observation in 1975: the pace had slowed down a bit, but data density had doubled approximately every 18 months
• How about the future? (does the price of computing power fall by half every 18 months?)
Moore’s Law – Held for Now
Power Wall Effect in Computer Architecture
• Too many transistors in a given chip die area
• Tremendous increase in power density
• Increased chip temperature
• High temperature slows down the transistor switching rate and the overall
speed of the computer
• Chip may melt down if not cooled properly
• Efficient cooling systems are expensive
Cooling Computer Chips
• Some people have suggested putting computer chips in liquid nitrogen to cool them
Solutions
• Use multiple inexpensive processors
• A processor with multiple cores
A Multi-Core Processor
CPU and Memory Speed
• In 20 years, CPU speed (clock rate) has increased by a factor of 1000
• DRAM speed has increased only by a factor of less than 4
• How to feed data fast enough to keep the CPU busy?
• CPU cycle time: 1-2 ns
• DRAM access time: 50-60 ns
• Cache access time: ~10 ns
Memory Access and CPU Speed
CPU, Memory and Disk Speed
Possible Solutions
• A hierarchy of successively faster memory devices (multilevel caches)
• Locality of data reference (code)
• Efficient programming can be an issue
• Parallel systems may provide
1) larger aggregate cache
2) higher aggregate bandwidth to the memory system
Parallel Computing – Useful Terms
• Concurrent - Events or processes which seem to occur or progress at the
same time.
• Parallel - Events or processes which occur or progress at the same time.
• Parallel programming (also, unfortunately, sometimes called concurrent
programming), is a computer programming technique that provides for the
execution of operations concurrently, either
- within a single parallel computer OR
- across a number of systems.
• In the latter case, the term distributed computing is used.
Parallel Computing – Flynn’s Taxonomy
• Best known classification scheme for parallel computers.
• The classification depends on the parallelism a computer exhibits in its
- Instruction stream
- Data stream
• A sequence of instructions (the instruction stream) manipulates a sequence
of operands (the data stream)
• The instruction stream (I) and the data stream (D) can be either single (S) or
multiple (M)
Parallel Computing – Flynn’s Taxonomy
• Four combinations: SISD, SIMD, MISD, MIMD
Parallel Computing – Flynn’s Taxonomy
• SISD (Single Instruction, Single Data): Serial Computer
• Single-CPU systems
- i.e., uniprocessors
- Note: co-processors don’t count as more processors
Parallel Computing – Flynn’s Taxonomy
• On a memory access, all active processors must access the same location in
their local memory.
• The data items form an array and an instruction can act on the complete array
in one cycle.
Parallel Computing – Flynn’s Taxonomy
- An analogy: a commander barks out an order, and all the active soldiers execute that order synchronously.
Parallel Computing – Flynn’s Taxonomy
• MIMD (Multiple Instruction, Multiple Data): processors are asynchronous, since they can independently execute different programs on different data sets.
• MIMDs have been considered by most researchers to include the most powerful, least restricted computers.
Parallel Computing – Useful Terms
• All processors have access to all memory locations.
• Uniform memory access (UMA): similar to a uniprocessor, except that additional, identical CPUs are added to the bus.
• Each processor has equal access to memory and can do anything that any other processor can do.
• Also called a symmetric multiprocessor, or SMP.
• We will discuss this in greater detail later (e.g., text pg 43).
• SMPs and clusters of SMPs are currently very popular.
Parallel Computing – Useful Terms
• Core: a processing unit within a CPU; a CPU can have multiple cores.
• Each CPU core has its own L1 cache, but may share L2 and L3 caches.
Parallel Computing – Useful Terms
• Cache is a small amount of memory which is a part of the CPU - closer to the CPU than RAM. It is
used to temporarily hold instructions and data that the CPU is likely to reuse.
• The more cache there is, the more data can be stored closer to the CPU.
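An illustrative C sketch (not from the slides) of why cache matters: summing a matrix row by row reuses the data already brought into cache, while summing it column by column jumps around memory, misses cache far more often, and is typically much slower.

#include <stdio.h>
#include <time.h>

#define N 2048
static double m[N][N];

static double sum_rows(void) {               /* cache-friendly: follows memory layout */
    double s = 0.0;
    for (int i = 0; i < N; i++)
        for (int j = 0; j < N; j++)
            s += m[i][j];
    return s;
}

static double sum_cols(void) {               /* cache-unfriendly: strided accesses */
    double s = 0.0;
    for (int j = 0; j < N; j++)
        for (int i = 0; i < N; i++)
            s += m[i][j];
    return s;
}

int main(void) {
    for (int i = 0; i < N; i++)
        for (int j = 0; j < N; j++)
            m[i][j] = 1.0;

    clock_t t0 = clock(); double s1 = sum_rows(); clock_t t1 = clock();
    double s2 = sum_cols();                       clock_t t2 = clock();
    printf("rows: %.3fs  cols: %.3fs  (sums %.0f %.0f)\n",
           (double)(t1 - t0) / CLOCKS_PER_SEC,
           (double)(t2 - t1) / CLOCKS_PER_SEC, s1, s2);
    return 0;
}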
Parallel Computing – Useful Terms
• FLOPS
- Floating-point operations per second, or FLOPS, is the unit of
measurement that calculates the performance capability of a
supercomputer.
- Floating-point operations can only be executed on computers with
integrated floating-point registers.
- The average computer's processor performance is measured by its clock speed in megahertz (MHz). Since supercomputers are far more capable in terms of performance, the method by which their performance is measured must be on a considerably larger scale.
Parallel Computing – Useful Terms
• FLOPS
- One petaFLOPS is equal to 1,000,000,000,000,000 (one quadrillion) FLOPS, or one thousand teraFLOPS.
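A hedged, illustrative peak-performance estimate (the machine numbers below are hypothetical, not from the slides): theoretical peak FLOPS is roughly nodes x cores per node x clock rate x floating-point operations per cycle.

#include <stdio.h>

int main(void) {
    double nodes = 1000, cores = 64;           /* hypothetical cluster           */
    double clock_hz = 2.0e9;                   /* 2 GHz                          */
    double flops_per_cycle = 16;               /* e.g. wide SIMD + FMA, assumed  */
    double peak = nodes * cores * clock_hz * flops_per_cycle;
    printf("peak = %.3e FLOPS = %.2f petaFLOPS\n", peak, peak / 1e15);
    return 0;
}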
Parallel Computing – Useful Terms
• SETI@home
Parallel Computing
• Parallel computing also helps in faster application processing and task resolution by increasing the available computation power of systems.
• Most supercomputers operate on parallel computing principles.
• Parallel processing is commonly used in operational scenarios that demand massive processing power or computation.