OpenMP 4


Index

• OpenMP
• Directives: parallel, for
• Clauses
• Schedule
  • Static
  • Dynamic
  • Guided
  • Runtime
• References
Course Outline

Course Plan: Theory:

Part A: Parallel Computer Architectures
Week 1,2,3: Introduction to parallel computer architecture: parallel computing, parallel architecture; bit-level, instruction-level, data-level and task-level parallelism. Instruction-level parallelism: pipelining (data and control instructions), scalar and superscalar processors, vector processors. Parallel computers and computation.
Week 4,5: Memory models: UMA, NUMA and COMA. Flynn's classification, cache coherence.
Week 6,7: Amdahl's law. Performance evaluation. Designing parallel algorithms: divide and conquer, load balancing, pipelining.
Week 8-11: Parallel programming techniques such as task parallelism using TBB, TL2, Cilk++ etc., and software transactional memory techniques.
Course Outline
Part B: OpenMP/MPI/CUDA
Week 1,2,3: Shared memory programming techniques: Introduction to OpenMP. Directives: parallel, for, sections, task, master, single, critical, barrier, taskwait, atomic. Clauses: private, shared, firstprivate, lastprivate, reduction, nowait, ordered, schedule, collapse, num_threads, if().
Week 4,5: Distributed memory programming techniques: MPI: blocking, non-blocking.
Week 6,7: CUDA: OpenCL, execution models, GPU memory, GPU libraries.
Week 10,11: Introduction to accelerator programming using CUDA/OpenCL and Xeon Phi. Concepts of heterogeneous programming techniques.
Practical:
Implementation of parallel programs using OpenMP/MPI/CUDA.
Assignment: Performance evaluation of parallel algorithms (in groups of 2 or 3 members).
1. OpenMP
FORK-JOIN Parallelism
• An OpenMP program begins as a single process: the master thread. The master thread executes sequentially until the first parallel region construct is encountered.
• When a parallel region is encountered, the master thread
  – creates a team of threads (FORK), and
  – becomes the master of this team, with thread id 0 within the team.
• The statements enclosed by the parallel region construct are then executed in parallel by these threads.
• JOIN: when the threads finish executing the statements in the parallel region construct, they synchronize and terminate, leaving only the master thread (see the minimal example below).
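A minimal sketch of the fork-join model (the thread count of 4 is chosen only for illustration):

#include <stdio.h>
#include <omp.h>

int main(void)
{
    printf("master thread executing sequentially\n");

    /* FORK: the master thread creates a team of threads; every thread
     * in the team executes the enclosed structured block. */
    #pragma omp parallel num_threads(4)
    {
        printf("hello from thread %d of %d\n",
               omp_get_thread_num(), omp_get_num_threads());
    }   /* JOIN: the threads synchronize and terminate here */

    printf("only the master thread continues\n");
    return 0;
}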
2. OpenMP Programming: Directives: parallel, for

#pragma omp parallel [clause[,] clause ...] new-line
    structured-block
Clauses: if(scalar-expression), num_threads(integer-expression), default(shared|none), private(list), firstprivate(list), shared(list), copyin(list), reduction(operator:list)

#pragma omp for [clause[,] clause ...] new-line
    for-loops
Clauses: private(list), firstprivate(list), lastprivate(list), reduction(operator:list), schedule(kind[,chunk_size]), collapse(n), ordered, nowait
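For illustration, a small sketch combining the two directives with a few of the clauses listed above (the variable names and values are assumptions, not from the slides):

#include <stdio.h>
#include <omp.h>

int main(void)
{
    int n = 8;
    int sum = 0;
    int square;                       /* per-thread temporary */

    /* parallel directive with if, num_threads and default clauses;
     * for directive with private and reduction clauses.            */
    #pragma omp parallel if(n > 1) num_threads(4) default(shared)
    {
        #pragma omp for private(square) reduction(+:sum)
        for (int i = 0; i < n; i++) {
            square = i * i;
            sum += square;
        }
    }

    printf("sum of squares below %d = %d\n", n, sum);   /* prints 140 */
    return 0;
}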
2. OpenMP Programming: Clauses: schedule

schedule(kind[,chunk_size]) clause of #pragma omp for

• The schedule clause specifies how the iterations of the loop are divided into contiguous non-empty subsets, called chunks, and how these chunks are assigned among the threads of the team.
• kind can be one of the following:
  • static
  • dynamic
  • guided
  • runtime
2. OpenMP Programming: schedule(static, chunk_size)

schedule(static, chunk_size) clause of #pragma omp for

• Iterations are divided into chunks of size chunk_size.
• Chunks are statically assigned to threads in round-robin fashion, in order of thread number.
• The last chunk assigned may have fewer iterations.
• When no chunk_size is specified, the iterations are divided into chunks of approximately iterations/threads.
• Example: 28 iterations, 4 threads, schedule(static, 5)

  thread0   thread1   thread2   thread3   thread0   thread1
  0-4       5-9       10-14     15-19     20-24     25-27
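A minimal sketch reproducing this example (the printed order interleaves, but with static scheduling the iteration-to-thread mapping matches the table above):

#include <stdio.h>
#include <omp.h>

int main(void)
{
    /* 28 iterations, 4 threads, chunks of 5: thread 0 gets 0-4,
     * thread 1 gets 5-9, ..., and thread 1 gets the short final
     * chunk 25-27.                                              */
    #pragma omp parallel for schedule(static, 5) num_threads(4)
    for (int i = 0; i < 28; i++)
        printf("iteration %2d -> thread %d\n", i, omp_get_thread_num());
    return 0;
}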
2. OpenMP Programming: schedule(dynamic, chunk_size)

schedule(dynamic, chunk_size) clause of #pragma omp for

• Iterations are assigned to threads in chunks of size chunk_size, as the threads request them.
• A thread executes its chunk of iterations and then requests another chunk, until all iterations are complete.
• Each chunk contains chunk_size iterations, except possibly the last chunk assigned.
• Example: 28 iterations, 4 threads, schedule(dynamic, 5)

  thread1   thread3   thread0   thread2   thread1   thread2
  0-4       5-9       10-14     15-19     20-24     25-27
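A sketch of the same loop with dynamic scheduling; the usleep call is an assumption added only to simulate uneven work per iteration (the situation where dynamic scheduling helps), and the chunk-to-thread mapping varies from run to run:

#include <stdio.h>
#include <unistd.h>
#include <omp.h>

int main(void)
{
    /* Whichever thread finishes its chunk first requests the next one. */
    #pragma omp parallel for schedule(dynamic, 5) num_threads(4)
    for (int i = 0; i < 28; i++) {
        usleep(1000 * (i % 7));       /* simulate uneven work per iteration */
        printf("iteration %2d -> thread %d\n", i, omp_get_thread_num());
    }
    return 0;
}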
2. OpenMP Programming: schedule(guided, chunk_size)

schedule(guided, chunk_size) clause of #pragma omp for

• Iterations are assigned to threads in chunks, as the threads request them.
• A thread executes its chunk of iterations and then requests another chunk, until all iterations are complete.
• Chunk size = remaining iterations / number of threads (rounded up).
• chunk_size determines the minimum size of a chunk, except for the last chunk. The default chunk_size is 1.
• Example: 28 iterations, 4 threads, schedule(guided, 3)
  • 28/4 = 7 [remaining = 28 - 7 = 21]
  • 21/4 = 5.25 => 6 [remaining = 21 - 6 = 15]
  • 15/4 = 3.75 => 4 [remaining = 15 - 4 = 11]
  • 11/4 = 2.75 => 3 [remaining = 11 - 3 = 8]
  • 8/4 = 2, below the minimum chunk size of 3, so assign 3 [remaining = 8 - 3 = 5]
  • 5/4 = 1.25, below the minimum of 3, so assign 3 [remaining = 2]
  • 2 <= 3, so the last chunk = 2

  thread2   thread1   thread0   thread3   thread2   thread2   thread2
  0-6 (7)   7-12 (6)  13-16 (4) 17-19 (3) 20-22 (3) 23-25 (3) 26-27 (2)
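The chunk sizes in this walkthrough can be reproduced with a small simulation; guided_chunks is a hypothetical helper written only to illustrate the arithmetic on this slide, not an OpenMP API:

#include <stdio.h>

/* Simulates the chunk sizes produced by schedule(guided, chunk_size):
 * chunk = ceil(remaining / nthreads), never smaller than chunk_size
 * except for the final chunk. */
static void guided_chunks(int iterations, int nthreads, int chunk_size)
{
    int remaining = iterations;
    while (remaining > 0) {
        int chunk = (remaining + nthreads - 1) / nthreads;  /* ceiling division */
        if (chunk < chunk_size)
            chunk = chunk_size;
        if (chunk > remaining)          /* the last chunk may be smaller */
            chunk = remaining;
        printf("%d ", chunk);
        remaining -= chunk;
    }
    printf("\n");
}

int main(void)
{
    guided_chunks(28, 4, 3);            /* prints: 7 6 4 3 3 3 2 */
    return 0;
}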
2. OpenMP Programming: schedule(runtime)

schedule(runtime) clause of #pragma omp for

• The scheduling decision is deferred until run time: the schedule kind and chunk size are taken from the run-sched-var internal control variable (typically set through the OMP_SCHEDULE environment variable).
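A minimal sketch of schedule(runtime); the kind and chunk size are then chosen at launch time, for example via OMP_SCHEDULE:

#include <stdio.h>
#include <omp.h>

int main(void)
{
    /* The schedule is taken from the run-sched-var ICV, typically set
     * through the OMP_SCHEDULE environment variable before running:
     *     export OMP_SCHEDULE="dynamic,5"
     *     ./a.out                                                    */
    #pragma omp parallel for schedule(runtime) num_threads(4)
    for (int i = 0; i < 28; i++)
        printf("iteration %2d -> thread %d\n", i, omp_get_thread_num());
    return 0;
}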
2. OpenMP Programming: schedule(static, chunk_size)

  thread0   thread1   thread2   thread3   thread0   thread1
  0-4       5-9       10-14     15-19     20-24     25-27
2. OpenMP Programming: schedule(dynamic, chunk_size)

  thread2   thread1   thread0   thread3   thread1   thread3
  0-4       5-9       10-14     15-19     20-24     25-27
2. OpenMP Programming: schedule(guided, chunk_size)

  thread2   thread1   thread0   thread3   thread2   thread2   thread2
  0-6 (7)   7-12 (6)  13-16 (4) 17-19 (3) 20-22 (3) 23-25 (3) 26-27 (2)
2. OpenMP Programming: schedule(runtime)

  thread1   thread0   thread3   thread2   thread0
  0         2         3         4-6       7-27
Index

• OpenMP
• Directives: parallel, for
• Clauses
• Schedule
  • Static
  • Dynamic
  • Guided
  • Runtime
• References
Reference
Text Books and/or Reference Books:
1. John Cheng, Max Grossman, Ty McKercher, "Professional CUDA C Programming", 2014
2. B. Wilkinson, M. Allen, "Parallel Programming: Techniques and Applications Using Networked Workstations and Parallel Computers", Pearson Education, 1999
3. I. Foster, "Designing and Building Parallel Programs", 2003
4. Michael J. Quinn, "Parallel Programming in C with MPI and OpenMP", 2004
5. Peter S. Pacheco, "An Introduction to Parallel Programming", Morgan Kaufmann Publishers, 2011
6. Dezso Sima, Terence Fountain, Peter Kacsuk, "Advanced Computer Architectures: A Design Space Approach", 2002
7. David E. Culler, Jaswinder Pal Singh, Anoop Gupta, "Parallel Computer Architecture: A Hardware/Software Approach", 2011
8. Ananth Grama, Anshul Gupta, George Karypis, Vipin Kumar, "Introduction to Parallel Computing", Pearson, 2011
Reference
Acknowledgements
1. Introduction to OpenMP, https://www3.nd.edu/~zxu2/acms60212-40212/Lec-12-OpenMP.pdf
2. Introduction to parallel programming for shared memory machines, https://www.youtube.com/watch?v=LL3TAHpxOig
3. OpenMP Application Program Interface, Version 2.5, May 2005
4. OpenMP Application Program Interface, Version 5.0, November 2018
