add rand dispatch for tuple types #50251

adienes · 2023-06-21T18:45:12Z

adienes · 2023-06-21T21:01:45Z

I doubt those build failures are related to this change

rfourquet · 2023-06-22T10:45:01Z

I'm not a fan of using broadcast for the implementation. Performance-wise, we should try to not be too far-off from RandomExtensions.jl:

# this PR
julia> @btime rand(Tuple{Bool,Int})
  1.310 μs (20 allocations: 744 bytes)

# RandomExtensions
julia> @btime rand(Tuple{Bool,Int})
  7.722 ns (0 allocations: 0 bytes)

adienes · 2023-06-22T12:51:01Z

ok, should match performance now (at least locally for me)

adienes · 2023-06-26T14:14:37Z

g2g? is this functionality desired

stdlib/Random/src/generation.jl

stdlib/Random/test/runtests.jl

adienes · 2023-07-03T20:56:11Z

given that this won't be in 1.10 either way and can sit on master for a while, I added in support also for sampling from Rational type and updated the docstring of rand; would appreciate a review 🙂

rfourquet · 2023-07-04T11:49:37Z

I suggest removing the Rational part from this PR, which would almost surely prevent merging. See #25993 and #46060 for other attempts, if you want to pursue this idea, a separate PR is warranted.

adienes · 2023-07-04T14:01:33Z

I suggest removing the Rational part from this PR, which would almost surely prevent merging. See #25993 and #46060 for other attempts, if you want to pursue this idea, a separate PR is warranted.

fair enough. I should have guessed that would be contentious; I've removed the Rational piece

rfourquet · 2023-07-21T13:37:28Z

marking for triage as suggested here, both PR could probably be triaged at the same time

rfourquet · 2023-07-21T13:51:27Z

I'm in favor of merging this PR as is, although there is room for optimization for some niche cases. Essentially, this implementation doesn't use the 2-staged random generation, i.e. the fact that in a call to rand, first a sampler is created, and then rand is called on the sampler; this means that for cases (rare, in particular when the distribution is a type like here, a tuple type) where the sampler creation can factor out some non-negligible computation when a bunch a random values are about to be generated (like in array generation). For reference, consider the following made-up example:

struct ZZ{N}
    x::Char
end

function Random.Sampler(::Type{RNG}, ::Type{ZZ{N}}, nn::Random.Repetition) where {N, RNG<:AbstractRNG}
    Random.SamplerSimple(ZZ{N}('a'), Random.Sampler(RNG, String(N), nn))
    # note: passing `ZZ{N}('a')` is somewhat artificial here, using SamplerTag would be better but i believe it's undocumented
end

Random.rand(rng::AbstractRNG, sp::Random.SamplerSimple{ZZ{N}}) where {N} = ZZ{N}(rand(rng, sp.data))

Then creating an array of such values wrapped in a tuple has significant overhead compared to raw values, unlike the no-overhead approach of RandomExtensions:

julia> @btime rand(ZZ{:asd}, 1000);
  2.712 μs (3 allocations: 4.16 KiB)

julia> @btime rand(Tuple{ZZ{:asd}}, 1000); # This PR
  17.623 μs (1001 allocations: 27.50 KiB)

julia> @btime rand(Tuple{ZZ{:asd}}, 1000); # Using instead `RandomExtensions`
  2.678 μs (3 allocations: 4.16 KiB)

But I think this optimization can be added later if anyone finds the motivation.

JeffBezanson · 2023-08-03T18:30:43Z

Triage agrees with adding this 👍
But triage thinks we should not merge something that conflicts with a package that already has a better implementation. We should move the good implementation here, or else hold off and just tell people to use RandomExtensions.

The docs should also clarify what is supported, i.e. basically only concrete tuple types? Or maybe say fixed-length tuples of sample-able types?

LilithHafner · 2023-08-04T00:28:05Z

The possibility of rand(Tuple{1:4, Int}, 10) did not make triage smile, nor did traige have anything authoritative to say about that possibility.

adienes · 2023-08-04T00:52:18Z

thank you for the comments.

I am happy to give it a shot to get the performance of long arrays of custom struct types performant compared to RandomExtensions.jl, but I may end up needing help on that

also, I did not know Tuple{1:4, Int} was even allowed 😅, don't all the parameters have to be types?

LilithHafner · 2023-08-04T01:16:10Z

don't all the parameters have to be types?

Nope! Types and isbitstype values are also allowed. This is useful because it allows Array{Float, 2} to be a two-dimensional array with dimensionality known at compile time. A unit range would be a reasonable type parameter for a static offset array.

You can also do silly things like eltype(Vector{:blue}) === :blue

adienes · 2023-09-03T18:14:07Z

please let me know if the latest revision addresses all comments. the benchmarks I see are now

# This PR
@btime rand(Tuple{Bool,Int}); # ~ 3.958 ns (0 allocations: 0 bytes)
@btime rand(Tuple{Bool, Char, Vararg{Tuple{Int, Float64, Tuple{Char, Bool}}, 80}}); # ~ 2.509 μs (81 allocations: 4.50 KiB)
@btime rand(ZZ{:asd}, 1000); # ~ 3.250 μs (3 allocations: 4.16 KiB)
@btime rand(Tuple{ZZ{:asd}}, 1000); # ~ 3.250 μs (3 allocations: 4.16 KiB)


# RandomExtensions.jl
@btime rand(Tuple{Bool,Int}); # ~ 3.958 ns (0 allocations: 0 bytes)
@btime rand(Tuple{Bool, Char, Vararg{Tuple{Int, Float64, Tuple{Char, Bool}}, 80}}); # ~ 1.067 μs (0 allocations: 0 bytes)
@btime rand(ZZ{:asd}, 1000); # ~ 3.260 μs (3 allocations: 4.16 KiB)
@btime rand(Tuple{ZZ{:asd}}, 1000); # ~ 3.276 μs (3 allocations: 4.16 KiB)

so still slightly worse on the very wide tuple case, but I'm not sure how to avoid that without more involved changes. I believe this would be a pretty rare use case either way---the extra allocations occur past length 10 according to the implementation of ntuple, so the performance here is more reflective of wide tuples generally I think rather than anything specific to rand

On reflection, I believe Tuple{1:4, Int} shouldn't be allowed, since rand(T) should return a value of type T and Tuple{1:4, Int} is not instantiable

adienes · 2023-09-14T13:17:15Z

@rfourquet could you possibly review the latest updates?
in particular, let me know if the extra allocations in the case of the super-wide heterogenous tuple are acceptable. I suspect to remove those the implementation would have to grow significantly in complexity

stdlib/Random/src/Random.jl

stdlib/Random/src/generation.jl

rfourquet · 2023-09-14T13:52:01Z

let me know if the extra allocations in the case of the super-wide heterogenous tuple are acceptable

I think it would be fair to merge this so that things move forward, but it's up to triage...

If this version is merged, I will have to update RandomExtensions before the 1.11 release, and in the process I will probably be able to extract from it a reasonable (not too complex) implementation which could supersede this PR; then I hope there won't be much opposition to such an update.

An alternative would be for you or me to directly pull RandomExtensions's implementation.

adienes · 2023-09-14T14:03:30Z

do note that since this was last evaluated on triage, the update has now matched performance with RandomExtensions on all cases except for the extremely wide tuple case. while I am not an expert on either implementation, my understanding of the reasoning is that Base.ntuple basically rewrites/unrolls only up to length 10, while RandomExtensions does a lot more codegen in the @make macros

given that this is kind of a more fundamental limitation to the performance of Tuple in Base rather than anything to do with rand, I'm inclined to think that the difference in performance is acceptable. however, I would completely understand if its desired to leave no performance on the table in these wide cases --- that being said, I'm not sure I have the requisite familiarity to pull the code over. I'm happy to attempt with some guidance, and I don't mean to put more work on your plate, but going that route it will probably be best if I let you handle it :)

LilithHafner · 2023-09-14T17:17:07Z

Does this support rand(Tuple{Int, 1:4})? Does this support rand(Tuple{Int, Vararg{Int}}) and if so, what are the semantics? Whether or not these are supported, there should be tests for them.

adienes · 2023-09-14T17:29:27Z

intentionally not supported to both. I'll go ahead and add tests and make the docs more clear on that

adienes · 2023-09-15T12:27:16Z

green!

stdlib/Random/src/Random.jl

EDIT: this was also implemented in #50251, which is now merged, so merge this now for the added tests, and the added feature of `rand(Tuple{})`. This allows e.g `rand(NTuple{5,Int})` to sample a tuple of 5 `Int`s. The implementation simply assembles a tuple by calling `rand` on the corresponding type parameters of the tuple type. A generated function is used to ensure type stability.

This is a rebase of #35856, where we keep only the tests, as the functionality was added in #50251. This also adds the possibility to call `rand` on an empty tuple type: `rand(Tuple{})`. Co-authored-by: Stephan Hilb <stephan@ecshi.net>

rfourquet added the randomness Random number generation and the Random stdlib label Jun 22, 2023

stevengj reviewed Jun 27, 2023

View reviewed changes

stdlib/Random/src/generation.jl Show resolved Hide resolved

stevengj reviewed Jun 27, 2023

View reviewed changes

stdlib/Random/test/runtests.jl Outdated Show resolved Hide resolved

adienes force-pushed the rand_tuple_type branch from b6b984f to ef0ac5e Compare July 3, 2023 20:54

adienes changed the title ~~add rand dispatch for tuple types~~ add rand dispatch for tuple and rational types Jul 3, 2023

adienes changed the title ~~add rand dispatch for tuple and rational types~~ add rand dispatch for tuple types Jul 4, 2023

rfourquet mentioned this pull request Jul 20, 2023

add rand(::Type{<:Pair}) #28705

Merged

rfourquet added the triage This should be discussed on a triage call label Jul 21, 2023

adienes force-pushed the rand_tuple_type branch 2 times, most recently from 58c160e to 9e58e6e Compare September 7, 2023 21:52

rfourquet approved these changes Sep 14, 2023

View reviewed changes

stdlib/Random/src/Random.jl Outdated Show resolved Hide resolved

stdlib/Random/src/generation.jl Outdated Show resolved Hide resolved

adienes force-pushed the rand_tuple_type branch 3 times, most recently from ef15e41 to 9702348 Compare September 14, 2023 21:11

rfourquet reviewed Sep 15, 2023

View reviewed changes

stdlib/Random/src/Random.jl Outdated Show resolved Hide resolved

rfourquet closed this Sep 23, 2023

rfourquet reopened this Sep 23, 2023

adienes and others added 14 commits September 23, 2023 11:22

add rand dispatch for tuple types

5039668

remove allocating broadcast

c54a969

use opt. generated ntuple & add @inferred test

dd5bfe1

remove access to internal properties

57f45e7

add rational rand dispatch

59a07bb

add to news

ef602c9

remove rational rand

030cc16

faster sampler generation

41da382

get rid of implicit convert

07ffd9a

fix double backticks

b2ba3b1

use Base.tail

73dc45e

update docs, apply code review, add tests

cf5e026

Update Random.jl complex domain docstring

a19e2ca

Update Random.jl complex docstring

e89dd56

adienes force-pushed the rand_tuple_type branch from 9499d39 to e89dd56 Compare September 23, 2023 15:23

rfourquet added 2 commits September 29, 2023 10:49

Merge branch 'master' into rand_tuple_type

728029a

fix test caused by merge with master

6a192ad

rfourquet merged commit 8436f68 into JuliaLang:master Sep 29, 2023
1 check passed

This was referenced Oct 7, 2023

support random sampling of tuple types: add tests #51630

Merged

support random sampling of tuple types #35856

Closed

FR: implement rand() for tuple types. #50236

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add rand dispatch for tuple types #50251

add rand dispatch for tuple types #50251

adienes commented Jun 21, 2023 •

edited

Loading

adienes commented Jun 21, 2023

rfourquet commented Jun 22, 2023

adienes commented Jun 22, 2023

adienes commented Jun 26, 2023

adienes commented Jul 3, 2023

rfourquet commented Jul 4, 2023 •

edited

Loading

adienes commented Jul 4, 2023

rfourquet commented Jul 21, 2023 •

edited

Loading

rfourquet commented Jul 21, 2023 •

edited

Loading

JeffBezanson commented Aug 3, 2023

LilithHafner commented Aug 4, 2023

adienes commented Aug 4, 2023

LilithHafner commented Aug 4, 2023

adienes commented Sep 3, 2023 •

edited

Loading

adienes commented Sep 14, 2023

rfourquet commented Sep 14, 2023

adienes commented Sep 14, 2023 •

edited

Loading

LilithHafner commented Sep 14, 2023

adienes commented Sep 14, 2023

adienes commented Sep 15, 2023

add rand dispatch for tuple types #50251

add rand dispatch for tuple types #50251

Conversation

adienes commented Jun 21, 2023 • edited Loading

adienes commented Jun 21, 2023

rfourquet commented Jun 22, 2023

adienes commented Jun 22, 2023

adienes commented Jun 26, 2023

adienes commented Jul 3, 2023

rfourquet commented Jul 4, 2023 • edited Loading

adienes commented Jul 4, 2023

rfourquet commented Jul 21, 2023 • edited Loading

rfourquet commented Jul 21, 2023 • edited Loading

JeffBezanson commented Aug 3, 2023

LilithHafner commented Aug 4, 2023

adienes commented Aug 4, 2023

LilithHafner commented Aug 4, 2023

adienes commented Sep 3, 2023 • edited Loading

adienes commented Sep 14, 2023

rfourquet commented Sep 14, 2023

adienes commented Sep 14, 2023 • edited Loading

LilithHafner commented Sep 14, 2023

adienes commented Sep 14, 2023

adienes commented Sep 15, 2023

adienes commented Jun 21, 2023 •

edited

Loading

rfourquet commented Jul 4, 2023 •

edited

Loading

rfourquet commented Jul 21, 2023 •

edited

Loading

rfourquet commented Jul 21, 2023 •

edited

Loading

adienes commented Sep 3, 2023 •

edited

Loading

adienes commented Sep 14, 2023 •

edited

Loading