Jump to content

sefad

Members
  • Posts

    1
  • Joined

  • Last visited

Everything posted by sefad

  1. Hello, Using cuda-z and AIDA64 GPU benchmark I can see that "Device to Host" / "Memory Read" memory bandwidth is around 2.5 GiB/s on my eGPU Thunderbolt 3 GTX 1080. On an opengl app I am developing, I'm retrieving into CPU memory each frame/texture using http://spout.zeal.co/ My textures are 3840x2160 RGBA, right now it takes around 50 ms to retrieve the texture into client memory. If one compares with the benchmarks, I should be able to transfer the 31MB in less than 15 ms which is what I need at 60 fps. Spout uses an improved memcpy : https://github.com/leadedge/Spout2/blob/master/SpoutSDK/Source/SpoutCopy.cpp#L136 using the sse2 extension. After having setup a PBO and having mapped it to CPU memory : https://github.com/leadedge/Spout2/blob/master/SpoutSDK/Source/SpoutGLDXinterop.cpp#L2209 Cuda-z uses cudaMemcpy Would anyone have an idea of why I am not achieving the speed shown in the benchmarks? Would someone tell how the memory is copied from GPU to CPU in AIDA64? Is there any faster way to copy GPU memory in CPU memory than PBO/Mapping/SSE2 copy? Thanks in advance for any help. Cheers
×
×
  • Create New...