I am doing some tests with the new ML volume upres node in H21, and oddly, choosing DirectML as the execution provider is a lot faster than CUDA. I verified that Houdini actually picks up CUDA Toolkit 12.8 by checking the environment from Houdini's Python shell:
import os

# Print the CUDA toolkit location and the search path as Houdini sees them
print(os.environ.get("CUDA_PATH"))
print(os.environ.get("PATH"))
My cuDNN version is 9.13 and it's also on my PATH environment variable. My understanding is that CUDA should be the fastest option, but in practice it's 3-5x slower than DirectML.
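In case it helps narrow this down, here is the kind of check I was planning to run next from the same Python shell (a minimal sketch, assuming the node runs inference through ONNX Runtime and that its onnxruntime module is importable from Houdini's Python; "model.onnx" is just a placeholder path, and the provider names are the standard onnxruntime ones):

import onnxruntime as ort

# Which execution providers this onnxruntime build can actually load
print(ort.get_available_providers())

# Ask for CUDA explicitly and see what the session really ends up using
sess = ort.InferenceSession(
    "model.onnx",  # placeholder: any ONNX model file on disk
    providers=["CUDAExecutionProvider", "CPUExecutionProvider"],
)
print(sess.get_providers())  # if CUDA failed to load, only CPUExecutionProvider shows up

If CUDAExecutionProvider doesn't show up, or the session silently falls back to CPU, that would at least tell me the CUDA provider isn't loading correctly.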
Any ideas what could be going wrong here?