GPU Compilation for Tracker Backward Pass through savevals #42

avik-pal · 2020-10-06T21:36:34Z

Backward Pass on GPUs currently fail due to

ERROR: InvalidIRError: compiling kernel broadcast_kernel(CUDA.CuKernelContext, CuDeviceArray{Float32,2,CUDA.AS.Global}, Base.Broadcast.Broadcasted{Nothing,Tuple{Base.OneTo{Int64},Base.OneTo{Int64}},typeof(Tracker.partial),Tuple{CUDA.CuRefValue{typeof(DiffEqBase.calculate_residuals)},Base.Broadcast.Extruded{CuDeviceArray{Float32,2,CUDA.AS.Global},Tuple{Bool,Bool},Tuple{Int64,Int64}},Int64,Base.Broadcast.Extruded{CuDeviceArray{Float32,2,CUDA.AS.Global},Tuple{Bool,Bool},Tuple{Int64,Int64}},Base.Broadcast.Extruded{CuDeviceArray{Float32,2,CUDA.AS.Global},Tuple{Bool,Bool},Tuple{Int64,Int64}},Base.Broadcast.Extruded{CuDeviceArray{Float32,2,CUDA.AS.Global},Tuple{Bool,Bool},Tuple{Int64,Int64}},Float32,Float32,CUDA.CuRefValue{typeof(DiffEqBase.ODE_DEFAULT_NORM)},Float32}}, Int64) resulted in invalid LLVM IR
Reason: unsupported dynamic function invocation (call to partial(f::F, Δ, i, args::Vararg{Any,N}) where {F, N} in Tracker at /mnt/research/Tracker/src/lib/array.jl:546)
Stacktrace:
 [1] _broadcast_getindex_evalf at broadcast.jl:648
 [2] _broadcast_getindex at broadcast.jl:621
 [3] getindex at broadcast.jl:575
 [4] broadcast_kernel at /home/avikpal/.julia/packages/GPUArrays/uaFZh/src/host/broadcast.jl:62

I was able to track that it happens when partial is called with f = DiffEqBase.calculate_residuals

The text was updated successfully, but these errors were encountered:

avik-pal · 2020-11-21T15:05:42Z

Using the patch in https://github.com/avik-pal/DiffEqBase.jl/tree/ap/fix_gpu_regnode GPU compilation is fixed. It has to do with inlining the functions.

ChrisRackauckas · 2020-11-21T17:54:17Z

Why would inlining matter?

avik-pal · 2020-11-22T13:42:24Z

I tried looking into it a bit more but can't figure out the exact reason (couldn't find any similar issue as well). avik-pal/DiffEqBase.jl@f1bf992 is the patch that is needed to make it work.

avik-pal mentioned this issue Oct 30, 2020

Performance Benchmarks #37

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GPU Compilation for Tracker Backward Pass through savevals #42

GPU Compilation for Tracker Backward Pass through savevals #42

avik-pal commented Oct 6, 2020

avik-pal commented Nov 21, 2020 •

edited

Loading

ChrisRackauckas commented Nov 21, 2020

avik-pal commented Nov 22, 2020

GPU Compilation for Tracker Backward Pass through savevals #42

GPU Compilation for Tracker Backward Pass through savevals #42

Comments

avik-pal commented Oct 6, 2020

avik-pal commented Nov 21, 2020 • edited Loading

ChrisRackauckas commented Nov 21, 2020

avik-pal commented Nov 22, 2020

avik-pal commented Nov 21, 2020 •

edited

Loading