[problem description]
The code previously ran without problems. After the dataset was enlarged, the GPU program that trains the deep learning model reported the error below, even though no "CUDA out of memory" error was shown.
RuntimeError: CUDA error: CUBLAS_STATUS_ALLOC_FAILED when calling `cublasCreate(handle)`
[solution 1]
Switch the program to run on the CPU. It then runs normally, but training is much slower and takes a long time.
--device cpu
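For readers whose training script does not yet accept such a flag, here is a minimal sketch of how a `--device` option could be wired up with `argparse`. The function name `parse_args` and the default value `cuda` are assumptions for illustration; the original script's actual options are not shown in this post.

```python
import argparse


def parse_args(argv=None):
    """Parse a hypothetical training script's command line."""
    parser = argparse.ArgumentParser(description="Hypothetical training script")
    # Assumed default: run on the GPU unless the user asks for the CPU.
    parser.add_argument(
        "--device",
        default="cuda",
        choices=["cuda", "cpu"],
        help="where to run training; 'cpu' avoids GPU allocation errors but is slow",
    )
    return parser.parse_args(argv)


# Equivalent to launching the script with: --device cpu
args = parse_args(["--device", "cpu"])
print(f"Training would run on: {args.device}")
```

In a PyTorch program the parsed value would typically be passed on to `torch.device(args.device)` and used when moving the model and tensors.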
[solution 2]
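Reduce the batch size used to train the model. With a smaller batch size the program runs normally on the GPU, which suggests the error was caused by the GPU running short of memory.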
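If the right batch size is not known in advance, the search can be automated. The sketch below halves the batch size each time a GPU allocation error is raised. The helper `run_with_smaller_batches`, the `train_fn` callback, and the stand-in `fake_train` are all hypothetical names used for illustration, and the 16-sample limit is made up so the example runs without a GPU.

```python
def run_with_smaller_batches(train_fn, batch_size, min_batch_size=1):
    """Call train_fn(batch_size), halving batch_size after GPU allocation errors.

    Returns the batch size that finally worked. Errors that do not look like
    allocation failures are re-raised unchanged.
    """
    markers = ("CUBLAS_STATUS_ALLOC_FAILED", "out of memory")
    while True:
        try:
            train_fn(batch_size)
            return batch_size
        except RuntimeError as err:
            is_alloc_error = any(marker in str(err) for marker in markers)
            if not is_alloc_error or batch_size <= min_batch_size:
                raise
            batch_size = max(min_batch_size, batch_size // 2)
            print(f"Allocation failed, retrying with batch size {batch_size}")


def fake_train(batch_size):
    """Stand-in for a real training run: pretends the GPU fits at most 16 samples."""
    if batch_size > 16:
        raise RuntimeError(
            "CUDA error: CUBLAS_STATUS_ALLOC_FAILED when calling `cublasCreate(handle)`"
        )


working_batch_size = run_with_smaller_batches(fake_train, 64)
print(f"Training succeeded with batch size {working_batch_size}")
```

In a real training run, a CUDA error can leave the process in a bad state, so it may be safer to restart the program with the smaller batch size than to retry inside the same process.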