It is high time to reconsider the parallelization and improvement of OpenGL. Some extraordinary features implemented in new NVIDIA’s GK110 Kepler architecture pave the way for that. The three most important features for the purpose are:
Hyper-Q – enables multiple CPU cores to launch work on a single GPU simultaneously (GK110 allows 32 simultaneous hardware-managed connections);
Dynamic Parallelism – allows the GPU to generate new work for itself, synchronize on results, and control the scheduling of that work via dedicated, accelerated hardware paths, all without involving the CPU;
NVIDIA GPUDirect™ – enables GPUs within a single computer, or GPUs in different servers located across a network, to directly exchange data without needing to go to CPU/system memory. The RDMA feature in GPUDirect allows third party devices such as SSDs, NICs, and IB adapters to directly access memory on multiple GPUs within the same system.
All those features currently target CUDA, but it would be a shame not to significantly improve graphics API either.




