Python uses "closed open" intervals with `range(0, n)`, the reverse is then `ran...

mananaysiempre · on Nov 22, 2022

In Python, reversed(range(0, n)) (which is also a thing you can write—does that help?) is indeed written range(n-1, -1, -1), but I think I recently encountered a language where that was written as (the local syntax’s equivalent of) range(n, 0, -1), that is to say the rule was not “start inclusive, end exclusive” but “low inclusive, high exclusive”. I’m not sure what language it was and whether this option ultimately ends up more intuitive, but in any case it’s a valid option.

(One difference is that the start-end rule can work polymorphically with equality only, while the low-high rule requires an ordering.)
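A quick sketch of the "start inclusive, end exclusive" rule described above, using only built-ins (the variable names are illustrative):

```python
# Illustrative check: reversing a half-open range in Python.
n = 5

forward = range(0, n)            # 0, 1, 2, 3, 4
backward = range(n - 1, -1, -1)  # 4, 3, 2, 1, 0

# reversed() yields the same values as the hand-written descending range.
assert list(reversed(forward)) == list(backward)

# Slicing a range with a negative step gives back an equal range object.
assert forward[::-1] == backward
```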

chkas · on Nov 22, 2022

With the Knuth shuffle, it would be:

    for i in reversed(range(1, len(x))):

That makes it better, but not much.
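For context, a minimal sketch of the full downward loop that this header would belong to. The function name `knuth_shuffle` and the `rng` parameter are assumptions for illustration, not something from the thread:

```python
import random

def knuth_shuffle(x, rng=random):
    """Shuffle the list x in place, iterating from the last index down to 1."""
    for i in reversed(range(1, len(x))):
        r = rng.randrange(i + 1)   # 0 <= r <= i
        x[i], x[r] = x[r], x[i]    # may swap an element with itself
    return x

deck = list(range(10))
knuth_shuffle(deck)
```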

formerly_proven · on Nov 22, 2022

A potential issue with reversed(range(...)) is reversed() eagerly constructing the entire range.

mananaysiempre · on Nov 22, 2022

Nope: unlike, say, sorted(), reversed() will never force an iterator into a sequence—it will either call __reversed__ or fall back to __len__ and __getitem__. An iterator needs to know how to reverse itself, or it’s out of luck. (Which is quite annoying because it forces you to write a class rather than a generator function.) Ranges are in the former category, they know how to reverse themselves[1]. (Yet for some reason range(0, 42) and range(41, -1, -1) repr themselves as range(0, 42) and range(41, -1, -1) respectively while reversed(range(0, 42)) is instead <range_iterator object at ...>. Truly, I cannot touch anything without finding a bug or at least a suspiciously insectoid lifeform.)

[1] https://github.com/python/cpython/blob/7e3f09cad9b783d8968aa...
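A small sketch of the three cases described above. The classes `Countdown` and `Evens` and the generator `gen` are hypothetical names made up for this example:

```python
class Countdown:
    """Hypothetical sequence-like class: only __len__ and __getitem__."""
    def __init__(self, n):
        self.n = n
    def __len__(self):
        return self.n
    def __getitem__(self, i):
        if not 0 <= i < self.n:
            raise IndexError(i)
        return i * 10

class Evens:
    """Hypothetical iterable that knows how to reverse itself."""
    def __init__(self, n):
        self.n = n
    def __iter__(self):
        return iter(range(0, 2 * self.n, 2))
    def __reversed__(self):
        return iter(range(2 * self.n - 2, -2, -2))

def gen(n):
    """A plain generator: no __reversed__, no __len__/__getitem__."""
    yield from range(n)

fallback = list(reversed(Countdown(3)))  # goes through __len__ + __getitem__
direct = list(reversed(Evens(3)))        # goes through __reversed__

try:
    reversed(gen(3))
    generator_reversible = True
except TypeError:
    generator_reversible = False

# Reversing a huge range stays lazy: nothing is materialised up front.
first = next(reversed(range(10**12)))
```

Here `fallback` is `[20, 10, 0]`, `direct` is `[4, 2, 0]`, the generator raises `TypeError`, and `first` is `999999999999`.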

formerly_proven · on Nov 22, 2022

Thanks, TIL about the __reversed__ protocol despite having lived in chapter three of the Python reference for some time.

oittaa · on Nov 22, 2022

There's one pretty common exception that's "closed closed".

    random.randint(a, b)
    Return a random integer N such that a <= N <= b. Alias for randrange(a, b+1).

https://docs.python.org/3/library/random.html#random.randint

But in reality nowadays you almost always want to use the newer, simpler and more secure "secrets" module.
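A minimal sketch contrasting the two interval conventions, plus one way to get the same closed range from `secrets` (the variable names are illustrative; `secrets.randbelow(n)` returns an integer in the half-open range [0, n)):

```python
import random
import secrets

a, b = 1, 6

# randint is closed-closed: both a and b can come up.
closed = [random.randint(a, b) for _ in range(1000)]
assert all(a <= v <= b for v in closed)

# randrange follows range(): the upper bound is excluded.
half_open = [random.randrange(a, b) for _ in range(1000)]
assert all(a <= v < b for v in half_open)

# A die roll in [a, b] built on secrets.randbelow.
secure_roll = a + secrets.randbelow(b - a + 1)
assert a <= secure_roll <= b
```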

aftbit · on Nov 22, 2022

TIL about the "secrets" module. I have typically used os.urandom() when I needed secure random.

jwilk · on Nov 22, 2022

I know Knuth wrote it that way, but there's no good reason why the iteration should be downwards. That is, you could write:

  for i in range(1, len(x)):

and you would still get an unbiased shuffle algorithm.

The upwards iteration has a few advantages:

• You can start shuffling before you know how big the input is.

• Algorithm R for reservoir sampling can be seen as a specialized version of shuffling, in which you skip the work that wouldn't affect the first k items, or would only affect their order.
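Both points can be sketched in a few lines. The function names `streaming_shuffle` and `reservoir_sample` and the `rng` parameter are assumptions for illustration; the second function is a plain rendering of Algorithm R, not code from the thread:

```python
import random
from itertools import islice

def streaming_shuffle(iterable, rng=random):
    """Upward shuffle that consumes items as they arrive (length unknown)."""
    out = []
    for i, item in enumerate(iterable):
        out.append(item)
        r = rng.randrange(i + 1)         # 0 <= r <= i
        out[i], out[r] = out[r], out[i]
    return out

def reservoir_sample(iterable, k, rng=random):
    """Algorithm R: keep k items, each input item equally likely to survive."""
    it = iter(iterable)
    reservoir = list(islice(it, k))      # the first k items fill the reservoir
    for i, item in enumerate(it, start=k):
        r = rng.randrange(i + 1)         # same draw as in the shuffle
        if r < k:                        # only slots among the first k matter
            reservoir[r] = item
    return reservoir

shuffled = streaming_shuffle(iter("abcdef"))
sample = reservoir_sample(iter(range(1000)), 5)
```

The two loops make the same random draw; the sampler simply drops every swap that would land outside the first k positions.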

chkas · on Nov 22, 2022

If you want to do it the other way around, you have to start at 0:

    from random import randrange

    for i in range(len(x) - 1):
        r = randrange(i, len(x))  # pick from the not-yet-fixed tail, i..len(x)-1
        x[i], x[r] = x[r], x[i]

jwilk · on Nov 22, 2022

That probably works too, but it's even more awkward than Knuth's version, and doesn't have the nice properties I mentioned.

I meant just:

    from random import randrange

    for i in range(1, len(x)):
        r = randrange(i + 1)  # pick from the prefix, 0..i inclusive
        x[i], x[r] = x[r], x[i]
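One way to convince yourself that these loops are unbiased is to enumerate every possible sequence of random picks for a small n and count the resulting permutations. This is a sketch with made-up helper names (`apply_swaps`, `outcome_counts`); the "naive" variant, which swaps each position with any position, is added only as a well-known biased contrast:

```python
from collections import Counter
from itertools import product

def apply_swaps(n, swaps):
    """Apply (i, r) swaps to [0, 1, ..., n-1] and return the result."""
    x = list(range(n))
    for i, r in swaps:
        x[i], x[r] = x[r], x[i]
    return tuple(x)

def outcome_counts(n, index_range, choices_for):
    """Count how often each permutation arises over all equally likely picks."""
    idx = list(index_range)
    counts = Counter()
    for picks in product(*(choices_for(i) for i in idx)):
        counts[apply_swaps(n, zip(idx, picks))] += 1
    return counts

n = 4
upward = outcome_counts(n, range(1, n), lambda i: range(i + 1))   # r in 0..i
forward = outcome_counts(n, range(n - 1), lambda i: range(i, n))  # r in i..n-1
naive = outcome_counts(n, range(n), lambda i: range(n))           # r anywhere

# Unbiased means all 4! = 24 permutations occur equally often.
assert len(upward) == 24 and set(upward.values()) == {1}
assert len(forward) == 24 and set(forward.values()) == {1}
assert len(set(naive.values())) > 1   # 4**4 = 256 outcomes cannot split evenly over 24
```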

chkas · on Nov 22, 2022

Cool. Seems to work, but I don't think it's still the Knuth shuffle.

jwilk · on Nov 23, 2022

You can think of it as Knuth's unshuffle. :)