Parallel and concurrent
programming
2. Programming Model
Michelle Kuttel
Overview: How to write parallel programs
To write a parallel program, you (the programmer) need
new primitives from a programming language or library,
that enable you to:
• run multiple operations at once
• share data between operations
• coordinate (a.k.a. synchronize) operations
How this works/is done depends on the
parallel programming model used in the
language/library.
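As a minimal sketch of what these primitives look like in Java (the class and variable names here are invented for illustration): Thread objects run operations at once, a shared field shares data, and synchronized blocks plus join() coordinate the work.

// Minimal illustrative sketch (names invented): the three primitives in Java.
public class PrimitivesSketch {
    static int counter = 0;                   // data shared between operations
    static final Object lock = new Object();  // used to coordinate access

    public static void main(String[] args) throws InterruptedException {
        Runnable work = () -> {
            for (int i = 0; i < 1000; i++) {
                synchronized (lock) {         // coordinate: one increment at a time
                    counter++;
                }
            }
        };
        Thread t1 = new Thread(work);         // run multiple operations at once
        Thread t2 = new Thread(work);
        t1.start();
        t2.start();
        t1.join();                            // coordinate: wait for both to finish
        t2.join();
        System.out.println(counter);          // always prints 2000
    }
}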
Java uses the Shared Memory
parallel programming model
The Shared Memory Model
All memory is placed into a single (physical) address space.
• Processors connected by some form of interconnection network.
• Single virtual address space across all of memory. Each processor can access all locations in memory.
[Figure: processors with local caches connected by a bus to shared memory]
from: Art of Multiprocessor Programming
Shared Memory
The ideal picture of shared memory:
[Figure: CPU0-CPU3 all read/write a single shared memory]
The actual architecture of shared memory systems:
[Figure: Symmetric Multi-Processor (SMP): CPU0-CPU3 each have a local cache; reads/writes of misses and cache invalidates go over a bus to shared memory. Distributed Shared Memory (DSM): CPU0-CPU3 each have a local memory module and are connected by a network.]
[Also have Non-uniform Memory Access Symmetric Multi-Processor (NUMA-SMP)]
An aside: cache (Architecture)
[Figure: CPU0-CPU3, each with a local cache, connected to shared memory]
A memory cache, also called a "CPU cache," is a memory bank that
bridges main memory and the processor.
• It has faster static RAM (SRAM) chips than the dynamic RAM
(DRAM) used for main memory.
• The cache allows instructions to be executed and data to be read
and written at higher speed.
Modern chips can have multiple levels of cache (L1, L2, L3).
• L1 is the fastest; each subsequent cache is slower and larger than
L1, and instructions and data are staged from main memory to L3 to
L2 to L1 to the processor.
• On multicore chips, the L3 cache is generally shared among all the
processing cores.
A Process (operating system)
The operating system's unit of resource allocation, both for CPU time and for memory.
• A process is represented by its code, its data and the state of the machine registers.
• The data of the process is divided into
• global variables and
• local variables, organized as a stack.
• Generally, each process in an operating system has its own address space
• processes are entirely separate entities.
It is hard to obtain parallelism or
concurrency with separate
processes… (why?)
… so operating systems created
threads.
Thread
A process is given internal concurrency with multiple lightweight processes, or threads:
• multiple threads of control
• a process with multiple (lightweight) threads of control has multiple stacks
• one for each thread
• but all threads also have access to the process's shared memory.
Processes versus threads
[Figure: process memory model versus thread memory model]
Graphic adapted from slides obtained from: www.Intel-Software-Academic-Program.com
What is a parallel program?
The shared memory model has multiple explicit threads
running concurrently.
Threads can:
• perform multiple computations in parallel;
• perform separate simultaneous activities;
• communicate easily and implicitly with each other
through shared memory.
(but this is dangerous if you don't protect your variables correctly)
* This is true for the shared memory model of parallel computing discussed in this module.
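To make that danger concrete, here is a hedged sketch (class and field names invented): two threads increment a shared counter with no protection, and updates are routinely lost.

// Illustrative sketch (names invented): an unprotected shared variable.
// Two threads increment the same counter; without synchronization, updates are
// lost and the printed total is usually less than 200000.
public class RaceSketch {
    static int counter = 0;   // shared, unprotected

    public static void main(String[] args) throws InterruptedException {
        Runnable work = () -> {
            for (int i = 0; i < 100_000; i++) {
                counter++;    // read-modify-write: not atomic
            }
        };
        Thread t1 = new Thread(work);
        Thread t2 = new Thread(work);
        t1.start(); t2.start();
        t1.join();  t2.join();
        System.out.println(counter);   // typically less than 200000
    }
}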
Programming Model: Sequential program state
A running serial program has
• One program counter (current statement executing)
• One call stack
• each stack frame holds the local variables for a method call that has started but
not yet finished.
• Calling a method pushes a new frame and returning from a method pops a frame.
• Call stacks are why recursion is not “magic.”
• Objects. Objects are created by calling new. We call the memory that holds all the objects the heap
(nothing to do with the data structure called a heap).
• Static fields of classes.
Slide adapted from: Sophomoric Parallelism and Concurrency, Lecture 1
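A small illustrative Java program (names invented for this sketch) touching each part of that state: a recursive call pushes and pops stack frames, new allocates an object on the heap, and a static field belongs to the class.

// Illustrative sketch (names invented): stack frames from a recursive call,
// a heap object created with new, and a static field of the class.
public class StateSketch {
    static int calls = 0;                 // static field: one copy for the whole class

    static int factorial(int n) {         // each call gets its own stack frame holding n
        calls++;
        if (n <= 1) return 1;             // base case: frames start popping here
        return n * factorial(n - 1);      // recursive call pushes another frame
    }

    public static void main(String[] args) {
        StringBuilder log = new StringBuilder();   // created with new: lives on the heap
        log.append("5! = ").append(factorial(5));
        log.append(", frames pushed: ").append(calls);
        System.out.println(log);          // prints "5! = 120, frames pushed: 5"
    }
}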
Programming Model: Shared memory
• Each thread has its own program counter, call stack and
local variables
• All threads share one collection of objects and static
fields
• Static fields of classes are also shared by all threads.
• Threads communicate through shared objects (implicit
communication)
• To communicate, write somewhere another
thread reads
Slide adapted from: Sophomoric Parallelism and Concurrency, Lecture 1
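A hedged sketch of that implicit communication (class and field names invented): one thread writes a shared field that another thread reads; the field is declared volatile so the reader is guaranteed to see the write.

// Illustrative sketch (names invented): threads communicate implicitly by writing
// to a shared field that another thread reads; volatile makes the write visible.
public class SharedFlagSketch {
    static volatile String message = null;   // the shared field is the "communication channel"

    public static void main(String[] args) throws InterruptedException {
        Thread reader = new Thread(() -> {
            while (message == null) { /* spin until the writer publishes a value */ }
            System.out.println("Reader saw: " + message);
        });
        reader.start();
        message = "hello from the main thread";  // write somewhere another thread reads
        reader.join();
    }
}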
Sequential Computation
[Figure: a single thread operating on objects in memory]
image from slide set: Art of Multiprocessor Programming, Maurice Herlihy
Concurrent Computation
Any object can be shared, but most are not.
[Figure: multiple threads operating on objects in shared memory]
image from slide set: Art of Multiprocessor Programming, Maurice Herlihy
How threads run
As the programmer, you create threads.
• The Operating System scheduler determines how
and when those threads are run on the available
processors
• Unless you are writing a scheduler, you don't have
control over this
• You don't know how many processors/cores your program will
use
(though you can guess).
• You don't know the order in which threads will execute
(you can't even guess).
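A small sketch (names invented) of that lack of control: the program only starts the threads, and the order of the printed lines typically changes from run to run.

// Illustrative sketch (names invented): the OS scheduler decides when each started
// thread runs, so the interleaving of the output differs between runs.
public class SchedulingSketch {
    public static void main(String[] args) throws InterruptedException {
        Thread[] threads = new Thread[4];
        for (int i = 0; i < threads.length; i++) {
            final int id = i;
            threads[i] = new Thread(() ->
                System.out.println("thread " + id + " ran"));
            threads[i].start();             // start() only requests execution; the scheduler decides when
        }
        for (Thread t : threads) t.join();  // wait for all; the order of the lines is unpredictable
    }
}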
Asynchrony
Threads are subject to sudden unpredictable delays:
• Cache misses (short)
• Page faults (long)
• Scheduling quantum finished (really long)
[Figure: Symmetric Multi-Processor (SMP): CPU0-CPU3 with local caches over shared memory; a page fault goes out to disk]
Other parallel programming models
We focus on shared memory, but several other models
exist. Common alternatives are:
• Message-passing:
• Explicit threads/processes, each with their own objects/data.
• Communication is via explicitly sending/receiving messages containing copies of the data (share nothing); a rough sketch using a queue between Java threads follows below.
• Most common model on HPC systems (though usually in a
hybrid with shared memory)
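Real message-passing systems (e.g. MPI) send copies of data between separate processes. As a rough analogy inside one Java program, the sketch below (names invented) uses a BlockingQueue as the message channel between a sender thread and a receiver thread; the threads still share memory, so this only illustrates the style.

import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.BlockingQueue;

// Rough analogy only (not real message passing across address spaces):
// a BlockingQueue stands in for a message channel between sender and receiver.
public class MessagePassingSketch {
    public static void main(String[] args) throws InterruptedException {
        BlockingQueue<String> channel = new ArrayBlockingQueue<>(10);

        Thread sender = new Thread(() -> {
            try {
                channel.put("result: 42");   // send a message (a copy of the data)
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });
        Thread receiver = new Thread(() -> {
            try {
                System.out.println("received " + channel.take());  // blocks until a message arrives
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });
        sender.start();
        receiver.start();
        sender.join();
        receiver.join();
    }
}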
Other parallel programming models
• Map reduce:
• Data parallelism concept from functional programming languages like
LISP
• Have primitives for things like “apply a function to every element of an array in parallel”; a sketch using Java parallel streams follows below.
• details of the underlying parallelization are hidden from the
programmer, provided you can express your program using the
available primitives
• MapReduce was developed by Google and the programming model
has since been adopted by many software frameworks, e.g. Apache’s
open-source Hadoop
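Hadoop itself is out of scope here, but Java's parallel streams provide the same flavour of primitive (map a function over every element in parallel, then reduce); a hedged sketch:

import java.util.stream.IntStream;

// Illustrative sketch (not Hadoop/MapReduce itself): Java parallel streams give a
// similar "map a function over every element in parallel, then reduce" primitive.
public class MapReduceSketch {
    public static void main(String[] args) {
        int sumOfSquares = IntStream.rangeClosed(1, 1_000)
                .parallel()               // the library decides how to split the work
                .map(x -> x * x)          // "map": apply a function to every element
                .sum();                   // "reduce": combine the results
        System.out.println(sumOfSquares); // prints 333833500
    }
}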
Other parallel programming models
Golang
• Go-routines
• the Go runtime maps these onto operating system threads
• Channels – used for communication between Go-routines
package main

import "fmt"

func main() {
	ch := make(chan string)               // channel: used for communication between go-routines
	go func() { ch <- "Hello, World!" }() // go-routine: runs concurrently with main
	fmt.Println(<-ch)                     // receive the message from the channel and print it
}