MAP55611-1
Instructions to candidates:
You may not start this examination until you are instructed to do so by the Invigilator.
Page 1 of 6
MAP55611-1
1. (a) A computational task comprises three parts. The first part initialises data and takes
10 seconds to complete on a single core. The second part evaluates a function
on the data and takes 200 seconds on one core. The final part performs
post-processing and takes 10 seconds to complete on a single core. Only the algorithm
for the second part of the problem can be parallelised.
i. State Amdahl’s law and explain how it applies to estimating the value of
executing this composite task on more than one compute core. [3]
ii. Using Amdahl’s law, estimate how much faster this task can be executed on
a system with 8 cores and another system with a very large number of cores.
[4]
iii. Estimate how many cores would be needed to achieve at least 95% of the
maximum possible parallel performance for this task. [3]
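As a hedged sketch of the arithmetic these parts call for (assuming the 10 s initialisation and 10 s post-processing form the serial fraction of the 220 s single-core total):

```latex
% Amdahl's law: speedup on N cores when a fraction P of the
% single-core runtime is parallelisable.
S(N) = \frac{1}{(1-P) + P/N}, \qquad P = \frac{200}{220}
% 8 cores:  T_8 = 20 + 200/8 = 45\,\text{s}, so S(8) = 220/45 \approx 4.9.
% N \to \infty:  T_\infty \to 20\,\text{s}, so S_{\max} = 220/20 = 11.
% 95\% of S_{\max} is 10.45, requiring T \le 220/10.45 = 4400/209\,\text{s},
% i.e. 200/N \le 220/209\,\text{s}, so N \ge 200 \cdot 209/220 = 190 cores.
```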
(b) Explain which one of these two operations on arrays of length n can be parallelised
efficiently, and write an OpenMP function to perform the case where a simple
parallel implementation is possible:
g_k = √f_k + g_{k−1}   or   g_k = √f_k + f_{k−1},   for k = 1, …, n − 1
[8]
(c) Execution times for six runs of a parallel code, performed with two different numbers
of compute cores, n, and three problem sizes, p, are given in Table 1 below.
Comment on the strong and weak scaling behaviour of this software. [7]
Table 1: Run times for different problem sizes, p and numbers of compute cores, n.
© TRINITY COLLEGE DUBLIN, THE UNIVERSITY OF DUBLIN 2022
2. (a) Describe the output of the following OpenMP code fragment: [7]
(b) A function to sum the elements in an array is written to use the OpenMP library.
Describe what is likely to fail with this code, and write a corrected version.
[6]
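The summation code itself is not reproduced here, but the usual failure in this exercise is a data race on the shared accumulator. A hedged sketch of a corrected version using OpenMP's reduction clause (assuming that was the bug; the function name is illustrative):

```c
/* Summing into a shared variable from many threads without
 * protection is a data race: updates can be lost.  The reduction
 * clause gives each thread a private partial sum and combines the
 * partial sums correctly when the loop ends. */
double array_sum(const double *a, int n)
{
    double sum = 0.0;
    #pragma omp parallel for reduction(+:sum)
    for (int i = 0; i < n; i++)
        sum += a[i];
    return sum;
}
```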
(c) Consider the serial loop

for (i = 0; i < 50; i++)
    do_task(i);

For each example given below, where a prediction can be made, state which thread
will execute the function for each value of the argument i = 0, 1, …, 49 when each
OpenMP directive is used to parallelise this loop. Explain your answer.
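The directives attached to each example are not reproduced here. As a hedged illustration: schedule(static) assigns iterations to threads in predictable contiguous blocks, so the executing thread can be stated for each i, while schedule(dynamic) hands chunks out first-come-first-served, so no prediction is possible. A small sketch that records which thread ran each iteration (do_task, run_static and the serial fallback are illustrative):

```c
#include <stdio.h>
#ifdef _OPENMP
#include <omp.h>
#else
/* Serial fallback so the sketch also builds without OpenMP. */
static int omp_get_thread_num(void) { return 0; }
#endif

#define N 50
int ran_by[N]; /* which thread executed do_task(i) */

static void do_task(int i) { ran_by[i] = omp_get_thread_num(); }

void run_static(void)
{
    /* schedule(static): the 50 iterations are split into contiguous
     * blocks, one per thread, so the thread-to-iteration mapping is
     * deterministic.  With schedule(dynamic) the mapping depends on
     * runtime timing and cannot be predicted. */
    #pragma omp parallel for schedule(static)
    for (int i = 0; i < N; i++)
        do_task(i);
}
```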
3. (a) Explain why the ordering of site visits matters when considering if this algorithm
can be parallelised. [5]
(b) Write a function that uses the OpenMP library to overwrite the array phi with the
next iteration of Gauss-Seidel. The function should return the modulus of the
difference in phi between the two iterations. [20]
4. (a) For a matched pair of MPI_Send() and MPI_Recv() on two processes in the same
MPI communicator, what information are the send and receive matched on? [3]
(b) Describe how using MPI_Send and MPI_Recv can lead to deadlock between
processes. Write a C snippet showing deadlock. Assume exactly two MPI processes.
[7]
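A sketch of the classic ordering deadlock this part asks about (the function name and buffers are illustrative; note that for small messages MPI's eager buffering may hide the hang, which is exactly why such code is dangerous):

```c
#include <mpi.h>

/* Both ranks call a blocking MPI_Send before either posts a
 * receive.  If the messages exceed MPI's internal buffering,
 * neither send can complete, no receive is ever posted, and both
 * ranks block forever: deadlock.  Exactly two processes assumed. */
void exchange_deadlock(double *out, double *in, int n, int rank)
{
    int other = 1 - rank;
    MPI_Send(out, n, MPI_DOUBLE, other, 0, MPI_COMM_WORLD);
    MPI_Recv(in, n, MPI_DOUBLE, other, 0, MPI_COMM_WORLD,
             MPI_STATUS_IGNORE);
}
```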
(c) Using the same assumptions as above, write a C snippet showing a version of this
code that will not deadlock under any circumstances. [5]
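One safe version (a sketch; non-blocking MPI_Isend/MPI_Irecv followed by MPI_Waitall would also work) pairs the two operations in a single MPI_Sendrecv call, which the library can progress without either rank waiting on the other's ordering:

```c
#include <mpi.h>

/* MPI_Sendrecv posts the send and the receive together, so the MPI
 * library can progress both at once and no ordering deadlock is
 * possible, regardless of message size.  Exactly two processes
 * assumed. */
void exchange_safe(double *out, double *in, int n, int rank)
{
    int other = 1 - rank;
    MPI_Sendrecv(out, n, MPI_DOUBLE, other, 0,
                 in,  n, MPI_DOUBLE, other, 0,
                 MPI_COMM_WORLD, MPI_STATUS_IGNORE);
}
```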
(d) Explain the operation of MPI_Gather. Write a function my_gather() implementing
the effect of MPI_Gather using at least some of the basic MPI functions
MPI_Comm_rank, MPI_Comm_size, MPI_Send, MPI_Recv and MPI_Barrier. The
function my_gather() should take the same arguments as MPI_Gather. [8]
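A sketch of one way to build this from point-to-point calls, assuming matching send and receive types and counts (no MPI_Barrier is needed, since the blocking receives already order the root's work):

```c
#include <mpi.h>
#include <string.h>

/* Sketch of MPI_Gather: every non-root rank sends its buffer to
 * root; root copies its own contribution into place and receives
 * one message from each other rank.  The datatype extent is
 * queried so the byte offsets into recvbuf are correct. */
int my_gather(const void *sendbuf, int sendcount, MPI_Datatype sendtype,
              void *recvbuf, int recvcount, MPI_Datatype recvtype,
              int root, MPI_Comm comm)
{
    int rank, size;
    MPI_Comm_rank(comm, &rank);
    MPI_Comm_size(comm, &size);

    if (rank != root)
        return MPI_Send(sendbuf, sendcount, sendtype, root, 99, comm);

    MPI_Aint lb, extent;
    MPI_Type_get_extent(recvtype, &lb, &extent);
    for (int src = 0; src < size; src++) {
        char *dst = (char *)recvbuf + (MPI_Aint)src * recvcount * extent;
        if (src == rank)
            memcpy(dst, sendbuf, (size_t)(sendcount * extent));
        else
            MPI_Recv(dst, recvcount, recvtype, src, 99, comm,
                     MPI_STATUS_IGNORE);
    }
    return MPI_SUCCESS;
}
```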
(e) What is the difference between MPI_Gather and MPI_Allgather? [2]