Training on multiple GPUs using NCCL and PyTorch
NCCL is the standard communication backend for NVIDIA GPUs. We use NCCL to execute collective operations like all-reduce. NCCL works on a single machine or across multiple machines and can also take advantage of high-performance networks.

1️⃣ Similar to training on multiple CPUs, to train on multiple GPUs we need to initialize the process group with `nccl` as the backend: `dist.init_process_group(backend="nccl")`

2️⃣ We need to make sure that each process is assigned to exactly one GPU. To do this we can read the `LOCAL_RANK` environment variable (set by the launcher; on a single machine it equals `RANK`) and use it to set the `device` variable.

3️⃣ We can then use `torchrun` to launch the distributed training job. This way we can easily make our CUDA-based programs run on multiple GPUs. A minimal sketch of these three steps follows below.

#pytorch #deeplearning #distributedsystems
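Putting the three steps together, here is a minimal sketch of what such a script could look like. It assumes the job is launched with `torchrun`; the file name `train.py` and the tensor used in the all-reduce are illustrative choices, not part of the original post.

```python
# Minimal sketch: multi-GPU training setup with NCCL, launched via torchrun.
import os

import torch
import torch.distributed as dist


def main():
    # 1️⃣ initialize the process group with NCCL as the backend
    dist.init_process_group(backend="nccl")

    # 2️⃣ pin this process to one GPU using LOCAL_RANK (set by torchrun)
    local_rank = int(os.environ["LOCAL_RANK"])
    device = torch.device(f"cuda:{local_rank}")
    torch.cuda.set_device(device)

    # example collective: all-reduce a tensor across all GPUs
    t = torch.ones(1, device=device)
    dist.all_reduce(t, op=dist.ReduceOp.SUM)
    print(f"rank {dist.get_rank()}: {t.item()}")

    dist.destroy_process_group()


if __name__ == "__main__":
    main()

# 3️⃣ launch with torchrun, e.g. on one machine with 4 GPUs:
#   torchrun --nproc_per_node=4 train.py
```

With this setup, each process sees exactly one GPU, and the all-reduce prints the world size on every rank, which is a quick way to confirm the NCCL group is wired up correctly.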