keywords: ParallelComputing, CUDA, HIP, DPC++

Books

CUDA Books

Learn CUDA Programming, published by Packt
https://github.com/PacktPublishing/Learn-CUDA-Programming

Docs

Heterogeneous Computing

Heterogeneous computing
https://en.wikipedia.org/wiki/Heterogeneous_computing

OpenCL: A Parallel Programming Standard for Heterogeneous Computing Systems
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2964860/

A Heterogeneous Parallel Processor for High-Speed Vision Chip
https://www.researchgate.net/publication/310823555_A_Heterogeneous_Parallel_Processor_for_High-Speed_Vision_Chip

AMD Official Docs

AMD’s Performance Guide is a nice collection of tips on how to program the GCN and RDNA architectures efficiently.
https://gpuopen.com/performance/

ROCm Docs

AMD ROCm TensorFlow
https://rocmdocs.amd.com/en/latest/Deep_learning/Deep-learning.html

[First on the web] Guide to running PyTorch natively on AMD GPUs, no container (Docker) required
https://zhuanlan.zhihu.com/p/67940936

Building PyTorch on ROCm
https://lernapparat.de/pytorch-rocm/

DPC++ Docs

A Standards-Based, Cross-Architecture Language
https://software.intel.com/content/www/us/en/develop/tools/oneapi/components/dpc-compiler.html

Intel Data Parallel C++ Tutorial
https://github.com/jeffhammond/dpcpp-tutorial

SYCL Docs

Look ma, no CUDA! Programming GPUs with modern C++ and SYCL
https://nazavode.github.io/blog/sycl/

Accelerating your C++ on GPU with SYCL
https://blog.tartanllama.xyz/sycl/

Source

CUDA Source

Samples for CUDA developers that demonstrate features in the CUDA Toolkit.
https://github.com/NVIDIA/cuda-samples

Thin C++-flavored wrappers for the CUDA Runtime API
https://github.com/eyalroz/cuda-api-wrappers

HIP Source

HIP: C++ Heterogeneous-Compute Interface for Portability
https://github.com/ROCm-Developer-Tools/HIP

OpenCL Source

A C++ GPU Computing Library for OpenCL
https://github.com/boostorg/compute

SYCL Source (OpenCL Based)

TBB (CPU) Source

Official Threading Building Blocks (TBB) GitHub repository.
https://github.com/oneapi-src/oneTBB
For the commercial Intel® TBB distribution, see:
https://software.intel.com/en-us/tbb

Platform

ROCm Platform

ROCm Software Platform Repository
https://github.com/ROCmSoftwarePlatform

Tensors and Dynamic neural networks in Python with strong GPU acceleration
https://github.com/ROCmSoftwarePlatform/pytorch


'How often,' he said, 'does a man ruin his disciples by remaining always with them.' ― Romain Rolland, Life of Vivekananda and the Universal Gospel