Sample: simpleMultiGPU
Minimum spec: SM 3.5

This application demonstrates how to use the new CUDA 4.0 API for CUDA context management and multi-threaded access to run CUDA kernels on multiple-GPUs.

Key concepts:
Asynchronous Data Transfers
CUDA Streams and Events
Multithreading
Multi-GPU
