sefad

Members

Posts: 1
Joined: November 12, 2019
Last visited: November 12, 2019

Everything posted by sefad

eGPU Performance Device to Host

sefad posted a topic in Benchmarking, system performance

Hello,

Using CUDA-Z and the AIDA64 GPU benchmark, I can see that the "Device to Host" / "Memory Read" bandwidth is around 2.5 GiB/s on my eGPU (GTX 1080 over Thunderbolt 3).

In an OpenGL app I am developing, I retrieve each frame/texture into CPU memory using http://spout.zeal.co/. My textures are 3840x2160 RGBA, and right now it takes around 50 ms to retrieve one texture into client memory. Going by the benchmarks, I should be able to transfer the 31 MB in less than 15 ms, which is what I need at 60 fps.

Spout sets up a PBO and maps it into CPU memory: https://github.com/leadedge/Spout2/blob/master/SpoutSDK/Source/SpoutGLDXinterop.cpp#L2209
It then copies the data with an improved memcpy that uses the SSE2 extension: https://github.com/leadedge/Spout2/blob/master/SpoutSDK/Source/SpoutCopy.cpp#L136
CUDA-Z, on the other hand, uses cudaMemcpy.

Would anyone have an idea why I am not achieving the speed shown in the benchmarks? Could someone tell me how AIDA64 copies memory from the GPU to the CPU? Is there a faster way to copy GPU memory into CPU memory than PBO + mapping + SSE2 copy?

Thanks in advance for any help.

Cheers
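To make the numbers in the post easier to check, here is a minimal sketch of the arithmetic in Python. It assumes 4 bytes per pixel for RGBA and treats the 2.5 GiB/s benchmark figure as sustained throughput. The function names are made up for illustration and are not part of Spout, CUDA-Z, or AIDA64.

```python
GIB = 1024 ** 3  # bytes in one GiB
MIB = 1024 ** 2  # bytes in one MiB


def frame_bytes(width, height, bytes_per_pixel=4):
    """Size of one uncompressed frame (RGBA8 assumed: 4 bytes per pixel)."""
    return width * height * bytes_per_pixel


def transfer_ms(num_bytes, bandwidth_gib_s):
    """Ideal transfer time in milliseconds at a given sustained bandwidth."""
    return num_bytes / (bandwidth_gib_s * GIB) * 1000.0


def effective_gib_s(num_bytes, elapsed_ms):
    """Bandwidth actually achieved if num_bytes took elapsed_ms to copy."""
    return num_bytes / (elapsed_ms / 1000.0) / GIB


size = frame_bytes(3840, 2160)
print(f"frame size:          {size / MIB:.1f} MiB")
print(f"ideal at 2.5 GiB/s:  {transfer_ms(size, 2.5):.1f} ms")
print(f"achieved at 50 ms:   {effective_gib_s(size, 50.0):.2f} GiB/s")
```

Under these assumptions the frame is about 31.6 MiB, the ideal transfer time is a little over 12 ms, and a 50 ms readback works out to roughly 0.62 GiB/s, about a quarter of the benchmarked bandwidth.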
- November 12, 2019
- 1 reply
