Avoid returning NaN from Gamma::sample #46

mstoeckl · 2025-12-11T21:46:31Z

Added a CHANGELOG.md entry

Summary

Gamma::sample could return NaN due to multiplying 0.0 with something that had overflowed; this PR makes the sampler return 0.0 in such cases instead. This changes the order of some multiplications, which can slightly change the output of the sampler on typical inputs.

Motivation

This is another edge case that I found found with a parameter fuzzer (https://github.com/mstoeckl/rand_distr/commits/fuzz-params/), that turns out to be avoidable with only slight additional complexity.

Details

The specific NaN that I noticed came from multiplying u.powf(self.inv_shape), which for small inv_shape could be zero, with the output of GammaLargeShape::sample, which could overflow to +inf if scale was large. The general method to avoid this is to delay applying the distribution scale factor until the end of the calculation, and before that point try to operate on values which are close in magnitude to 1.

I also added a special case to Gamma::new to handle shape, scale = +inf in particular.

The parameter combinations which would previously produce Nan (shape very close to zero, scale very close to the max float value) will continue to produce somewhat inaccurate output (producing more zero output values than expected, because u.powf(self.inv_shape) is likely to underflow when shape is small) Fixing this particular edge case would likely require rewriting the GammaSmallShape sampler to use a different and possibly more expensive sampling method.

benjamin-lieser · 2025-12-14T13:01:25Z

I think our general approach to deal with extreme parameters should be to just return an Error from new if we cannot accurately sample from it. These extreme parameters will often be the result of bugs on the userside anyway.
If these parameters are needed and it is feasible to sample from them we can add this later.

mstoeckl · 2025-12-16T01:58:02Z

I think our general approach to deal with extreme parameters should be to just return an Error from new if we cannot accurately sample from it.

I'm not certain what a good threshold / measurement of accuracy would be for Gamma in particular, and for now have added a generic warning about inaccuracy to the documentation.

The issue depends on the precise floating point type -- underflow in u.powf(self.inv_shape) starts occurring as soon as shape < 1/7 or so for f32, while for f64 the threshold is, if my math is right, around 1/20. A majority of values underflowing to zero requires about 1/150 for f32, and 1/1000 for f64. The Kolmogorov-Smirnov test in distr_test , using f64, starts failing with (shape = 1/200, scale=1.0), around 3% underflow probability, although I am not sure why exactly.
What is "accurate" for floating point depends on what you use the distribution for -- some uses may work fine if on extreme inputs the mean and stddev are within additive epsilon, others if multiplicative, others may depend on tail behavior, or just that the value is nonnegative

dhardy

@benjamin-lieser handling edge cases like infinity and underflow-to-zero in a manner which is consistent with the general distribution is often simpler to deal with and good enough — so in my opinion this is the more sensible approach to take in general.

dhardy · 2025-12-22T11:48:58Z

src/gamma.rs

+/// When the shape (`k`) or scale (`θ`) parameters are close to the upper limits
+/// of the floating point type `F`, the implementation may overflow and
+/// produce `inf`; similarly, if either is sufficiently close to `0.0`,
+/// it may output `0.0`. Sampling may become inaccurate if `k` is close
+/// to zero and `θ` is very large.


Could this be a little more specific (e.g. as noted in your recent comment @mstoeckl)?

Yes, I've briefly explained the underflow behavior. On looking at the actual samples, I see the distribution is actually sampled pretty well with shape = 1/200, scale=1.0. The actual distribution is < the minimum positive f64 with notable (>0.02) probability, and the Gamma sampler correctly returns 0.0 with about the expected frequency. The distr_test failures result from the Kolmogorov-Smirnov implementation in distr_test assuming real values, not accounting for floating point discretization, and treating the jump from 0.0 to the min f64 as an actual discrepancy.

This changes the order of multiplications used to compute the result to avoid multiplying zero with an expression that can overflow to +inf. Note that the parameter combinations which could lead to this (shape very close to zero, scale very close to the max float value) continue to be inaccurately handled; the Gamma distribution sampler will now tend to return zero instead of NaN for them. The limit (shape 0, scale inf) is not well defined.

mstoeckl force-pushed the gamma-avoid-nan branch 2 times, most recently from 82ec7a8 to 4aca53d Compare December 11, 2025 21:48

mstoeckl force-pushed the gamma-avoid-nan branch from 4aca53d to cf3f46b Compare December 16, 2025 00:58

dhardy reviewed Dec 22, 2025

View reviewed changes

mstoeckl force-pushed the gamma-avoid-nan branch from cf3f46b to a3bb2b5 Compare December 23, 2025 13:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Avoid returning NaN from Gamma::sample #46

Avoid returning NaN from Gamma::sample #46

Uh oh!

mstoeckl commented Dec 11, 2025 •

edited

Loading

Uh oh!

benjamin-lieser commented Dec 14, 2025

Uh oh!

mstoeckl commented Dec 16, 2025 •

edited

Loading

Uh oh!

dhardy left a comment

Uh oh!

dhardy Dec 22, 2025

Uh oh!

mstoeckl Dec 23, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Avoid returning NaN from Gamma::sample #46

Are you sure you want to change the base?

Avoid returning NaN from Gamma::sample #46

Uh oh!

Conversation

mstoeckl commented Dec 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Motivation

Details

Uh oh!

benjamin-lieser commented Dec 14, 2025

Uh oh!

mstoeckl commented Dec 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dhardy left a comment

Choose a reason for hiding this comment

Uh oh!

dhardy Dec 22, 2025

Choose a reason for hiding this comment

Uh oh!

mstoeckl Dec 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

mstoeckl commented Dec 11, 2025 •

edited

Loading

mstoeckl commented Dec 16, 2025 •

edited

Loading

mstoeckl Dec 23, 2025 •

edited

Loading