sprand as samples from sparse distribution #3

abraunst · 2019-04-03T21:53:55Z

I was thinking, in the spirit of this package, maybe rand(Normal(),SparseMatrixCSC,p,m,n) could be better expressed as rand(Bernoulli(p,Normal()),SparseMatrixCSC,m,n) where Bernoulli(p, Normal()) would be the "Gauss-Bernoulli" or "Spike-and-Slab" mixture distribution

P(x) = (1-p) delta(x)+ p Normal(x)

It seems to make things a bit more generic.

The text was updated successfully, but these errors were encountered:

rfourquet · 2019-04-07T09:36:26Z

Sounds like a very interesting idea. It would require to have the non-zero struture depend on the values (i.e. test each produced value for nullity), so I wonder whether possible multiple allocations would have negative performance impact. But definitely worth exploring. I may be even possible to support both API (for now I guess I prefer to not get rid of the current API, as it feels closer to the sprand API and makes it probably easier to switch).

abraunst · 2019-04-07T09:54:42Z

Sounds like a very interesting idea. It would require to have the non-zero struture depend on the values (i.e. test each produced value for nullity), so I wonder whether possible multiple allocations would have negative performance impact. But definitely worth exploring. I may be even possible to support both API (for now I guess I prefer to not get rid of the current API, as it feels closer to the sprand API and makes it probably easier to switch).

For sure, if the p value is small enough, the sampler should do what the current sprand does, i.e. extract the non-zero indices and then fill them. I confess that I tried to implement it in RandomExtensions but I am still a bit lost in the design 😅 .

In any case, even if RandomExtensions makes its way into stdlib (which I would love to see), I think that the current interface in stdlib should be left as convenience functions (without maybe the rfn param and other bells and whistles).

rfourquet · 2019-04-07T11:31:57Z

I tried to implement it in RandomExtensions

Cool!!

I tried to implement it in RandomExtensions but I am still a bit lost in the design

Sorry for that, the internals have quite evolved last time I worked on it, and didn't document yet. Feel free to open an issue to ask for help, and I will answer there or write documentation (but I will have very little time in the upcoming week).

I think that the current interface in stdlib should be left

I don't have a lot of hopes for sprand to go away. I agree that sprand is more convenient vs rand([T], SparseVector, p, n, m), that's why I initially (in the Base PR) added the short version rand([T], p, n, m) to give it a chance to compete favorably against sprand. But IIRC, someone had noted somewhere that it's not this short version is not very clear, so this is not an unanimous solution!

abraunst changed the title ~~sparse arrays as samples from sparse distribution~~ sprand as samples from sparse distribution Apr 4, 2019

abraunst mentioned this issue Apr 4, 2019

[DO NOT MERGE] sprand sanity with rfn argument JuliaLang/julia#30637

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

sprand as samples from sparse distribution #3

sprand as samples from sparse distribution #3

abraunst commented Apr 3, 2019

rfourquet commented Apr 7, 2019

abraunst commented Apr 7, 2019 •

edited

Loading

rfourquet commented Apr 7, 2019

sprand as samples from sparse distribution #3

sprand as samples from sparse distribution #3

Comments

abraunst commented Apr 3, 2019

rfourquet commented Apr 7, 2019

abraunst commented Apr 7, 2019 • edited Loading

rfourquet commented Apr 7, 2019

abraunst commented Apr 7, 2019 •

edited

Loading