
OneHotMatrix causes a 'scalar getindex disallowed' error on GPU #1006

Closed
tanhevg opened this issue Jan 27, 2020 · 9 comments


tanhevg commented Jan 27, 2020

I think this issue is causing many reported bugs that complain about "slowness" of OneHotMatrix

I suspect the underlying issue is that OneHotVector is not properly adapted for GPU storage, and OneHotMatrix stores a vector of OneHotVectors. A workaround would be to change OneHotMatrix to store a Vector{Int} and a size, as proposed by #578. It would then be easy to adapt that vector for GPU storage.
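A minimal sketch of that layout (hypothetical type and field names, not Flux code):

struct IndexedOneHotMatrix{I<:AbstractVector{<:Integer}} <: AbstractMatrix{Bool}
    height::Int   # number of rows, i.e. the label count
    indices::I    # indices[j] is the hot row of column j; may be a CuArray
end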

MWE:

using Flux, CuArrays
CuArrays.allowscalar(false)   # make scalar indexing of GPU arrays an error
using Flux: onehotbatch

ohb = onehotbatch(rand(1:10, 100), 1:10) |> gpu;   # 10×100 OneHotMatrix
dl = Dense(10, 5) |> gpu;

dl(ohb);   # triggers the scalar getindex below

ERROR: scalar getindex is disallowed
Stacktrace:
 [1] assertscalar(::String) at /data/.julia/packages/GPUArrays/1wgPO/src/indexing.jl:14
 [2] getindex at /data/.julia/packages/GPUArrays/1wgPO/src/indexing.jl:54 [inlined]
 [3] iterate at ./abstractarray.jl:914 [inlined]
 [4] iterate at ./abstractarray.jl:912 [inlined]
 [5] checkindex at ./abstractarray.jl:572 [inlined]
 [6] checkbounds_indices at ./abstractarray.jl:529 [inlined] (repeats 2 times)
 [7] checkbounds at ./abstractarray.jl:482 [inlined]
 [8] checkbounds at ./abstractarray.jl:503 [inlined]
 [9] _getindex at ./multidimensional.jl:669 [inlined]
 [10] getindex at ./abstractarray.jl:981 [inlined]
 [11] *(::CuArray{Float32,2,Nothing}, ::Flux.OneHotMatrix{CuArray{Flux.OneHotVector,1,Nothing}}) at /data/.julia/packages/Flux/2i5P1/src/onehot.jl:30
 [12] (::Dense{typeof(identity),CuArray{Float32,2,Nothing},CuArray{Float32,1,Nothing}})(::Flux.OneHotMatrix{CuArray{Flux.OneHotVector,1,Nothing}}) at /data/.julia/packages/Flux/2i5P1/src/layers/basic.jl:102
 [13] top-level scope at none:0
@DhairyaLGandhi (Member)

Could you try with #764?


tanhevg commented Jan 27, 2020

@dhairyagandhi96, #764 makes no difference; it changes onecold, which is not called in this example.


mrchaos commented Feb 3, 2020

julia> ohb = float.(onehotbatch(rand(1:10, 100), 1:10)) |> gpu
10×100 CuArray{Float32,2,Nothing}:
....
or
julia> ohb = cu.(onehotbatch(rand(1:10, 100), 1:10)) |> gpu
10×100 CuArray{Float32,2,Nothing}:
....

julia> dl(ohb)
5×100 CuArray{Float32,2,Nothing}:
....

@DhairyaLGandhi (Member)

Right, I misread that; apologies. Minimizing shouldn't be hard. It's probably in the matmul, but we'd need to cross-check.


tanhevg commented Feb 3, 2020

As far as I understand, the point of having OneHotMatrix as a separate type is to replace the expensive matmul with an inexpensive indexing operation. This optimisation is lost in the workaround proposed by @mrchaos.
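For reference, the specialisation at frame [11] of the stack trace reduces the product to column selection; a paraphrase (field name ix from the old OneHotVector, not the verbatim Flux source):

# Paraphrase of the Base.:* specialisation at onehot.jl:30:
Base.:*(A::AbstractMatrix, B::OneHotMatrix) = A[:, map(v -> v.ix, B.data)]
# On the GPU this indexing path hits checkbounds, which iterates the
# index collection element by element — the disallowed scalar getindex.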

It also does not currently work properly with CuArrays, as demonstrated in the OP. I suppose the following code from onehot.jl is meant to make it work, but evidently it does not.

adapt_structure(T, xs::OneHotMatrix) = OneHotMatrix(xs.height, adapt(T, xs.data))

import .CuArrays: CuArray, cudaconvert
import Base.Broadcast: BroadcastStyle, ArrayStyle
BroadcastStyle(::Type{<:OneHotMatrix{<:CuArray}}) = ArrayStyle{CuArray}()
cudaconvert(x::OneHotMatrix{<:CuArray}) = OneHotMatrix(x.height, cudaconvert(x.data))

I am not 100% sure of the correct way to use the Adapt.jl functions to achieve the desired behaviour (the docs are a bit scarce). One workaround I can think of: instead of holding a vector of OneHotVectors inside OneHotMatrix, hold an array of indices (Vector{Int}), as sketched above.
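Continuing the hypothetical IndexedOneHotMatrix sketch from earlier in the thread, the cheap matmul survives and Adapt only has to move one array:

using Adapt

Base.size(m::IndexedOneHotMatrix) = (m.height, length(m.indices))
Base.getindex(m::IndexedOneHotMatrix, i::Integer, j::Integer) = m.indices[j] == i

# W * X stays a column selection; CuArrays implements A[:, indices]
# with a gather kernel, so no scalar getindex is needed.
Base.:*(A::AbstractMatrix, B::IndexedOneHotMatrix) = A[:, B.indices]

# Adapt only converts the one index vector (e.g. Vector{Int} -> CuArray):
Adapt.adapt_structure(T, m::IndexedOneHotMatrix) =
    IndexedOneHotMatrix(m.height, adapt(T, m.indices))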

@CarloLucibello (Member)

In any case, we don't have this problem when computing crossentropies (which is the main reason why OneHotMatrix is there).

using Flux, CuArrays
CuArrays.allowscalar(false)
using Flux: onehotbatch

ohb = onehotbatch(rand(1:10, 100), 1:10) |> gpu;
ŷ = CuArrays.rand(size(ohb)...)
Flux.crossentropy(ŷ, ohb)
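(My reading of why this path is fine, not verified line by line: the loss only broadcasts against the one-hot mask, and the BroadcastStyle definition quoted above routes OneHotMatrix{<:CuArray} broadcasts to the CuArray style. Schematically:)

# Schematic of the shape of the computation, not Flux's exact code:
# the one-hot matrix only ever appears in broadcasts, never in getindex.
loss(ŷ, y) = -sum(y .* log.(ŷ)) / size(y, 2)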

I'm not sure whether OneHotMatrix was ever meant to be a fully-fledged AbstractMatrix to use as an input to a model.


tanhevg commented Mar 2, 2020

> I'm not sure whether OneHotMatrix was ever meant to be a fully-fledged AbstractMatrix to use as an input to a model.

One-hot encoding is a standard technique in language modelling and other deep-learning applications. When training such models on the GPU with Flux, one currently has to fall back to dense arrays of zeros and ones. This is not ideal: either OneHotMatrix should be fixed to support one-hot encoding on the GPU (it seems a natural fit), or a separate type should be provided.
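For concreteness, a small CPU example of the encoding in question, using Flux's onehotbatch/onecold API (vocabulary chosen arbitrarily):

using Flux: onehotbatch, onecold

vocab = ['h', 'e', 'l', 'o']
x = onehotbatch(collect("hello"), vocab)   # 4×5 OneHotMatrix
onecold(x, vocab)                          # recovers ['h', 'e', 'l', 'l', 'o']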


tanhevg commented Mar 12, 2020

#958


tanhevg commented Sep 24, 2020

Fixed by JuliaGPU/CUDA.jl#90. Make sure to use Julia 1.5, and CUDA.jl instead of CuArrays.

tanhevg closed this as completed Sep 24, 2020