Parallel Matrix Multiplication in Julia

In this notebook, we will: Parallelize a simple algorithm Learn about the performance of different parallel strategies Implement the parallelization in Julia Problem Description Asssumptions All matrices, including A, B, and C, are initially stored on the master process. The final result will overwrite the matrix C. Steps To implement paral ...

Posted on Wed, 20 May 2026 04:43:01 +0000 by reeferd

Ray: A Powerful Python Library for Distributed Computing

In the era of big data and artificial intelligence, the demand for distributed computing frameworks is increasingly urgent. Python Ray library was created to meet this demand. It is a high-performance, easy-to-use distributed application framework designed specifically for solving parallel and distributed computing problems in Python. This arti ...

Posted on Sun, 10 May 2026 01:42:47 +0000 by ccrevcypsys

CUDA C++ Programming Model Fundamentals

KernelsCUDA C++ extends the standard C++ language by allowing the definition of kernels. A kernel is a function that, when called, is executed N times in parallel by N different CUDA threads, as opposed to a regular C++ function which executes only once. Kernels are defined using the __global__ declaration specifier and are invoked using a spec ...

Posted on Sat, 09 May 2026 18:28:05 +0000 by ciber