ENH: np.random.default_gen() #13840

rkern · 2019-06-26T20:54:41Z

We achieved some consensus in the default BitGenerator thread that we want numpy to express its opinion about the default BitGenerator through a function. Here's that function.

Along the way, I renamed the seed_seq arguments to seed and cleaned up the descriptions of that parameter. I think it's clearer this way. It puts the more familiar concepts ("a seed") up front before the newer concepts ("a SeedSequence").

I removed the argumentless Generator() construction in favor of default_gen(). I did notice that np.random.Generator().<method>() was being referenced a lot in the docstrings, as a text-substitution for np.random.<method>(). I changed many of these to explicitly create rng = np.random.default_gen() and then use the rng.<method>().

But then I noticed that we do have a default global Generator instance in np.random.generator._random_generator with function-style aliases, e.g. np.random.generator.normal(). These don't appear to be documented anywhere, and I don't recall discussing if it's a thing that we actually want in the 1.17 release (as it will commit us to maintaining them and the underlying global, and may encourage bad habits like re-seeding the global instance in the middle of a library). It was a reasonable choice for randomgen, and it might be for numpy 1.18, but I question if that's the right choice for numpy 1.17.

That said, the check_random_state() idiom is a good one, and it relies on having a global default instance somewhere. It might be best to provide that in numpy.random rather than spread out in all of the different libraries. This also raises the question what should default_gen() and default_gen(None) do? Currently, I implemented it to just create a fresh PCG64 instance with OS entropy. But maybe default_gen() should be more like check_random_state():

default_gen()  # -> global Generator
default_gen(None)  # -> global Generator
default_gen(seed)  # -> Generator(PCG64(seed))
default_gen(bit_generator_instance)  # -> Generator(bit_gen_instance)
default_gen(generator_instance)  # -> generator_instance

What do you think? I'm neutral on maintaining and documenting the global instance, and -1 on the method aliases.

The most common use will certainly be to pass in an integer seed, so I want to make sure that we talk in terms of "seeds" first and foremost. I don't want to overload people with new concepts and terminology right out of the gate.

mattip · 2019-06-27T15:58:23Z

But then I noticed that we do have a default global Generator instance in np.random.generator._random_generator with function-style aliases, e.g. np.random.generator.normal()

I think this was left by mistake and should be removed.

That said, the check_random_state() idiom is a good one, and it relies on having a global default instance somewhere.

What is the motivation for returning a singleton instance? To save the cost of sampling system entropy in creating a new Bitgenerator? I don't think we should add a singleton, since it can be abused in multi-threaded contexts. We can always add it later.

rkern · 2019-06-27T17:08:33Z

In the cases in which you would get the global, it's fine to reuse it in different threads. It's only when you want to reproduce the results that sharing an instance across threads would be a problem. Since we don't allow reseeding in-place, this would not come up.

But we can go ahead and delete it for now. We'll see who howls.

Going through those examples suggests that default_gen() should also take a BitGenerator or Generator () so that it can be used as a general "take this value and give me the Generator for it". This will let it be be used similarly to check_random_state() without the global.

doc/source/reference/random/new-or-different.rst

mattip · 2019-06-27T23:53:30Z

numpy/random/__init__.py

@@ -167,7 +167,7 @@

 from . import mtrand
 from .mtrand import *
-from .generator import Generator
+from .generator import Generator, default_gen


Line 9 Generator().function -> default_gen().function

mattip · 2019-06-27T23:55:59Z

other than a couple of documentation nits, LGTM

mattip · 2019-06-28T03:07:31Z

needs a rebase

mattip · 2019-06-28T13:24:52Z

Thanks @rkern

rkern added 3 commits June 26, 2019 10:55

ENH: Rename seed_seq argument to seed.

73ebbbb

The most common use will certainly be to pass in an integer seed, so I want to make sure that we talk in terms of "seeds" first and foremost. I don't want to overload people with new concepts and terminology right out of the gate.

ENH: Add default_gen() function.

bbc52dd

DOC: Remove some Generator() calls in the documentation.

f8801a1

charris added 01 - Enhancement component: numpy.random labels Jun 26, 2019

charris added this to the 1.17.0 release milestone Jun 26, 2019

rkern mentioned this pull request Jun 27, 2019

DOC: np.random documentation cleanup and expansion. #13849

Merged

rkern added 2 commits June 27, 2019 10:13

BUG: Remove unintended global Generator and aliases.

35c88f8

ENH: Allow default_gen() to take BitGenerator and Generator.

2f7c3cd

mattip reviewed Jun 27, 2019

View reviewed changes

doc/source/reference/random/new-or-different.rst Outdated Show resolved Hide resolved

mattip reviewed Jun 27, 2019

View reviewed changes

rkern added 3 commits June 27, 2019 19:26

DOC: clarify method references.

251d290

DOC: clarify numpy.random docstring.

b369381

Merge branch 'master' into enh/default-gen

1f2d8b9

rkern added 2 commits June 27, 2019 20:26

BUG: Drive-by removal of unused attribute.

da89d6b

DOC: Add default_gen to the docs properly.

ab5cd80

mattip merged commit 0ec7f12 into numpy:master Jun 28, 2019

shoyer mentioned this pull request Jun 30, 2019

Decide on new PRNG BitGenerator default #13635

Closed

h-vetinari mentioned this pull request May 13, 2021

DOC: redirect notice to default_rng() is not linked #17140

Clo 6BA4 sed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

ENH: np.random.default_gen() #13840

ENH: np.random.default_gen() #13840

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ENH: np.random.default_gen() #13840

ENH: np.random.default_gen() #13840

Uh oh!

Conversation

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!