A collection of 55 parallel computing implementations exploring GPU programming with CUDA, shared-memory parallelism with OpenMP, and distributed computing with MPI.
This repository contains practical implementations of parallel algorithms across three dominant paradigms: CUDA for NVIDIA GPU acceleration, OpenMP for multi-core CPU parallelism, and MPI for distributed systems.
- assignments: Coursework implementations of parallel algorithms
- labs: Experimental implementations and exercises
- project: Major parallel computing project
- utils: Shared utility functions and helpers
- CUDA: GPU kernel development, memory management, thread synchronization
- OpenMP: Directive-based parallelization, loop scheduling, reduction operations
- MPI: Point-to-point and collective communication, distributed algorithms
GPU kernels leverage massive parallelism for data-parallel workloads. Includes memory coalescing optimization, shared memory usage, and kernel launch configuration.
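As a minimal sketch of the kind of kernel these implementations cover (names and sizes here are illustrative, not taken from the repository): a vector-add kernel where consecutive threads access consecutive elements, so each warp's global-memory loads coalesce, plus the usual grid/block launch configuration.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Illustrative kernel: coalesced vector add. Thread i handles element i,
// so a warp's 32 loads fall in contiguous memory and coalesce.
__global__ void vecAdd(const float *a, const float *b, float *c, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) c[i] = a[i] + b[i];
}

int main(void) {
    const int n = 1 << 20;
    size_t bytes = n * sizeof(float);

    // Unified memory keeps the sketch short; real code may prefer
    // explicit cudaMalloc/cudaMemcpy for finer control.
    float *a, *b, *c;
    cudaMallocManaged(&a, bytes);
    cudaMallocManaged(&b, bytes);
    cudaMallocManaged(&c, bytes);
    for (int i = 0; i < n; ++i) { a[i] = 1.0f; b[i] = 2.0f; }

    // Launch configuration: round up so every element is covered.
    int threads = 256;
    int blocks = (n + threads - 1) / threads;
    vecAdd<<<blocks, threads>>>(a, b, c, n);
    cudaDeviceSynchronize();

    printf("c[0] = %f\n", c[0]);
    cudaFree(a); cudaFree(b); cudaFree(c);
    return 0;
}
```

Compile with `nvcc -O3 vec_add.cu -o vec_add` as in the build commands below.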
Multi-core CPU acceleration using pragma directives. Covers parallel for loops, sections, tasks, and synchronization constructs.
Distributed-memory implementations using message passing. Includes both blocking and non-blocking communication patterns.
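A minimal two-rank sketch contrasting the two styles (the payload and tag are illustrative): rank 0 uses a blocking `MPI_Send`, while rank 1 posts a non-blocking `MPI_Irecv`, could overlap computation, and then waits.

```c
#include <stdio.h>
#include <mpi.h>

int main(int argc, char **argv) {
    MPI_Init(&argc, &argv);
    int rank;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    int value = rank * 100, received = -1;

    if (rank == 0) {
        /* Blocking: returns once the send buffer is safe to reuse. */
        MPI_Send(&value, 1, MPI_INT, 1, 0, MPI_COMM_WORLD);
    } else if (rank == 1) {
        /* Non-blocking: post the receive, overlap work, then wait. */
        MPI_Request req;
        MPI_Irecv(&received, 1, MPI_INT, 0, 0, MPI_COMM_WORLD, &req);
        /* ...independent computation could overlap the transfer here... */
        MPI_Wait(&req, MPI_STATUS_IGNORE);
        printf("rank 1 received %d\n", received);
    }

    MPI_Finalize();
    return 0;
}
```

Build and run with `mpicc` and `mpirun -np 2` as shown in the build commands below.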
# CUDA implementations
nvcc -O3 cuda_program.cu -o cuda_program
# OpenMP implementations
gcc -fopenmp -O3 openmp_program.c -o openmp_program
# MPI implementations
mpicc -O3 mpi_program.c -o mpi_program
mpirun -np 4 ./mpi_program
- Add performance benchmarking suite
- Document speedup achievements for each implementation
- Add comparison charts between paradigms
- Include visualization of parallel execution
- Add regression tests for correctness