Citation :
"Highly irregular branching patterns (skip modes) and bit manipulation (quantization/entropy coding) don't suit present GPUs. IMO the only really good application at the moment are full search ME algorithms, in the end though accelerated full search is still slow even if it's faster than on the CPU." -- MfA Actually, basically everything can be reasonably done on the GPU except CABAC (which could be done, it just couldn't be parallelized). x264 CUDA will implement a fullpel and subpel ME algorithm initially; later on we could do something like RDO with a bit-cost approximation instead of CABAC. Because it has to do everything at single precision floating pointWrong, CUDA supports integer math. -- Dark Shikari
|