Parallel Matrix Multiplication in Julia
In this notebook, we will:
Parallelize a simple algorithm
Learn about the performance of different parallel strategies
Implement the parallelization in Julia
Problem Description
Asssumptions
All matrices, including A, B, and C, are initially stored on the master process.
The final result will overwrite the matrix C.
Steps
To implement paral ...
Posted on Wed, 20 May 2026 04:43:01 +0000 by reeferd
Ray: A Powerful Python Library for Distributed Computing
In the era of big data and artificial intelligence, the demand for distributed computing frameworks is increasingly urgent. Python Ray library was created to meet this demand. It is a high-performance, easy-to-use distributed application framework designed specifically for solving parallel and distributed computing problems in Python. This arti ...
Posted on Sun, 10 May 2026 01:42:47 +0000 by ccrevcypsys
CUDA C++ Programming Model Fundamentals
KernelsCUDA C++ extends the standard C++ language by allowing the definition of kernels. A kernel is a function that, when called, is executed N times in parallel by N different CUDA threads, as opposed to a regular C++ function which executes only once. Kernels are defined using the __global__ declaration specifier and are invoked using a spec ...
Posted on Sat, 09 May 2026 18:28:05 +0000 by ciber