This repo contains examples for general CUDA programming in C. Examples will be addeded periorically.
- Determine device coniguration
- Use information pertaining device configuration
- Run a function on GPU
- Create a function for GPU to add two vectors
- Measure kernel's execution time
- Benchnark a kernel's execution time
- CUDA program essentials - Synchronize all threads after kernel call
- Set up blocks configuration
- Use mamory management to allocate memory for kernel
- Use grid-stride loop to manipulate large or arbitrary array