Name: Shahbaz Iftikhar
Reg No: BSCS-22S-0034
Semester: 07
Course: Parallel and Distributed Computing
Assignment: 03
Task 1:
Parallel Loop Optimization (CLO-1, CLO-3)
● Write a sequential C program that computes the sum of a large array (e.g., 1 million elements).
● Parallelize it using OpenMP (parallel for with reduction).
● Experiment with different scheduling policies (static, dynamic, guided) and compare execution times.
● Deliverables:
  o Source code (.c file).
  o A brief report (1 page) explaining the performance differences.
CODE:
#include <omp.h>
#include <stdio.h>
#include <stdlib.h>
#include <time.h>
#include <sys/time.h>

#define LENGTH 1000000

/* Wall-clock time in milliseconds. */
long long get_time_ms() {
    struct timeval t;
    gettimeofday(&t, NULL);
    return (t.tv_sec * 1000LL) + (t.tv_usec / 1000);
}

/* Fill the array with random values in the range [0, 99]. */
void fill_array(int *data, int count) {
    srand(time(NULL));
    for (int i = 0; i < count; i++) {
        data[i] = rand() % 100;
    }
}

/* Baseline: plain sequential sum. */
long long sum_sequential(int *data, int count) {
    long long total = 0;
    for (int i = 0; i < count; i++) {
        total += data[i];
    }
    return total;
}

/* Parallel sum with the default schedule; reduction(+:total) gives each
   thread a private copy of total and combines the copies at the end. */
long long sum_parallel(int *data, int count) {
    long long total = 0;
    #pragma omp parallel for reduction(+:total)
    for (int i = 0; i < count; i++) {
        total += data[i];
    }
    return total;
}

/* Static schedule: iterations are split into fixed chunks of 100 and
   assigned to threads round-robin before the loop runs. */
long long sum_static(int *data, int count) {
    long long total = 0;
    #pragma omp parallel for reduction(+:total) schedule(static, 100)
    for (int i = 0; i < count; i++) {
        total += data[i];
    }
    return total;
}

/* Dynamic schedule: each thread requests the next chunk of 100 iterations
   as soon as it finishes the previous one. */
long long sum_dynamic(int *data, int count) {
    long long total = 0;
    #pragma omp parallel for reduction(+:total) schedule(dynamic, 100)
    for (int i = 0; i < count; i++) {
        total += data[i];
    }
    return total;
}

/* Guided schedule: chunks start large and shrink, never below 100
   (except possibly the last chunk). */
long long sum_guided(int *data, int count) {
    long long total = 0;
    #pragma omp parallel for reduction(+:total) schedule(guided, 100)
    for (int i = 0; i < count; i++) {
        total += data[i];
    }
    return total;
}

int main() {
    int *arr = malloc(LENGTH * sizeof(int));
    if (!arr) {
        printf("No memory\n");
        return 1;
    }
    fill_array(arr, LENGTH);

    long long t0, t1;

    t0 = get_time_ms();
    long long seq = sum_sequential(arr, LENGTH);
    t1 = get_time_ms();
    printf("Sequential: %lld ms | Sum = %lld\n", t1 - t0, seq);

    t0 = get_time_ms();
    long long par = sum_parallel(arr, LENGTH);
    t1 = get_time_ms();
    printf("Parallel Default: %lld ms | Sum = %lld\n", t1 - t0, par);

    t0 = get_time_ms();
    par = sum_static(arr, LENGTH);
    t1 = get_time_ms();
    printf("Static: %lld ms | Sum = %lld\n", t1 - t0, par);

    t0 = get_time_ms();
    par = sum_dynamic(arr, LENGTH);
    t1 = get_time_ms();
    printf("Dynamic: %lld ms | Sum = %lld\n", t1 - t0, par);

    t0 = get_time_ms();
    par = sum_guided(arr, LENGTH);
    t1 = get_time_ms();
    printf("Guided: %lld ms | Sum = %lld\n", t1 - t0, par);

    free(arr);
    return 0;
}
OUTPUT:
Explanation:
This task focuses on computing the sum of a large array (1 million elements). Initially, the
program computes the sum sequentially using a simple loop. Then, OpenMP is used to
parallelize the same task with the #pragma omp parallel for directive and the
reduction clause to safely aggregate partial results from multiple threads.
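For context, the reduction clause is roughly equivalent to the manual pattern sketched below (sum_manual is a hypothetical helper, not part of the submitted code): each thread accumulates into its own private partial sum, and the partial sums are combined one thread at a time, so no two threads ever update total concurrently.

long long sum_manual(int *data, int count) {
    long long total = 0;
    #pragma omp parallel
    {
        long long local = 0;                 /* thread-private partial sum */
        #pragma omp for
        for (int i = 0; i < count; i++)
            local += data[i];
        #pragma omp critical                 /* combine partial sums one thread at a time */
        total += local;
    }
    return total;
}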
We implemented and compared three OpenMP scheduling strategies (a runtime-selectable variant is sketched after this list):
● Static scheduling: Divides the loop iterations equally among threads before execution.
● Dynamic scheduling: Threads grab chunks of iterations as they finish their previous
work. This helps with load balancing.
● Guided scheduling: Starts with large chunks that decrease over time, balancing load and
reducing overhead.
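To try the policies without recompiling, the loop could also use schedule(runtime) and select the policy through omp_set_schedule() or the OMP_SCHEDULE environment variable. A minimal sketch (sum_runtime is a hypothetical helper, not part of the submitted code):

long long sum_runtime(int *data, int count, omp_sched_t kind, int chunk) {
    omp_set_schedule(kind, chunk);           /* e.g. omp_sched_dynamic with chunk 100 */
    long long total = 0;
    #pragma omp parallel for reduction(+:total) schedule(runtime)
    for (int i = 0; i < count; i++)
        total += data[i];
    return total;
}

Called as sum_runtime(arr, LENGTH, omp_sched_static, 100), sum_runtime(arr, LENGTH, omp_sched_dynamic, 100), and so on, one timing loop can cover all three policies.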
Performance Results:
● Static scheduling performed best here because the workload is uniform (every array element takes the same time to add), so dividing the iterations up front carries almost no overhead.
● Dynamic and guided scheduling add run-time chunk-assignment overhead; they are better suited to uneven workloads, which this summation is not.
● Parallel reduction showed a significant speedup compared to sequential summation.
Task 2:
Task Parallelism with OpenMP (CLO-3)
● Implement a program that performs two independent computations (e.g., matrix multiplication and Fibonacci series) using OpenMP sections or task constructs.
● Measure speedup compared to sequential execution.
CODE:
#include <omp.h>
#include <stdio.h>
#include <stdlib.h>
#include <time.h>
#include <sys/time.h>

#define DIM 200
#define FIB_N 40

/* Wall-clock time in milliseconds. */
long long current_ms() {
    struct timeval tv;
    gettimeofday(&tv, NULL);
    return (tv.tv_sec * 1000LL) + (tv.tv_usec / 1000);
}

/* Task A: multiply two DIM x DIM matrices. */
void matrix_op() {
    int mat1[DIM][DIM], mat2[DIM][DIM], result[DIM][DIM];
    for (int i = 0; i < DIM; i++)
        for (int j = 0; j < DIM; j++) {
            mat1[i][j] = i + j;
            mat2[i][j] = i - j;
        }
    for (int i = 0; i < DIM; i++)
        for (int j = 0; j < DIM; j++) {
            result[i][j] = 0;
            for (int k = 0; k < DIM; k++)
                result[i][j] += mat1[i][k] * mat2[k][j];
        }
    /* Touch the result so the multiplication cannot be optimized away. */
    volatile int sink = result[DIM - 1][DIM - 1];
    (void)sink;
}

/* Naive exponential-time recursive Fibonacci. */
int calc_fib(int n) {
    if (n <= 1) return n;
    return calc_fib(n - 1) + calc_fib(n - 2);
}

/* Task B: compute the FIB_N-th Fibonacci number. */
void fib_task() {
    /* volatile keeps the call from being optimized out. */
    volatile int val = calc_fib(FIB_N);
    (void)val;
}

int main() {
    double t_serial, t_parallel;
    long long start, end;

    /* Sequential baseline: run both tasks one after the other. */
    start = current_ms();
    matrix_op();
    fib_task();
    end = current_ms();
    t_serial = (double)(end - start) / 1000.0;
    printf("Time Serial: %.2f s\n", t_serial);

    /* Parallel version: each task runs in its own section, on its own thread. */
    start = current_ms();
    #pragma omp parallel sections
    {
        #pragma omp section
        matrix_op();
        #pragma omp section
        fib_task();
    }
    end = current_ms();
    t_parallel = (double)(end - start) / 1000.0;
    printf("Time Parallel: %.2f s\n", t_parallel);

    printf("Speedup: %.2f\n", t_serial / t_parallel);
    return 0;
}
OUTPUT:
Explanation:
This task demonstrates task parallelism by executing two independent computations:
1. Matrix multiplication of two 200×200 matrices.
2. Recursive calculation of the 40th Fibonacci number.
First, both computations were executed sequentially to obtain a baseline time. Then OpenMP's #pragma omp parallel sections directive was used to run them in parallel, with each computation placed in its own section so that it executes on its own thread. An alternative formulation using task constructs is sketched below.
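The assignment also allows task constructs; a minimal sketch of the same pairing expressed with tasks (an alternative, not what the submitted code uses) is: one thread creates the two tasks and any idle thread in the team picks them up.

#pragma omp parallel
#pragma omp single
{
    #pragma omp task
    matrix_op();             /* task A: matrix multiplication */
    #pragma omp task
    fib_task();              /* task B: Fibonacci */
    #pragma omp taskwait     /* block until both tasks have finished */
}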
Observations:
● Parallel execution led to reduced total execution time.
● OpenMP sections are ideal when two or more independent tasks can be run simultaneously.
● Speedup was calculated by dividing the sequential time by the parallel time, showing a clear improvement through parallelism (see the note below).
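As a note on what to expect (a reasoning check, not a measured result): Speedup = T_serial / T_parallel. Because the two sections run concurrently but each computation is itself sequential, the parallel time can never fall below the longer of the two, so Speedup <= T_serial / max(T_matrix, T_fib) <= 2 for two independent tasks. How close the measured speedup comes to 2 depends on how evenly the matrix multiplication and the Fibonacci computation are balanced.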