In interested in a variation on Pòlya's urn. My aim is mostly just to understand its behaviour, but I also want to know whether there's anything special about it, i.e. whether it is in some sense a natural generalisation of the original.
In Pòlya's urn we start with an urn containing $n_0$ black and $m_0$ white balls (the canonical default is $n_0=m_0=1$) and then we repeat the following steps:
- draw one ball from the urn
- duplicate the ball, so if we drew a black ball we now have two black balls, and if we drew white we now have two white balls
- put both balls back into the urn, so it now contains one more ball than it did previously
It's an elementary but fascinating result that, starting with $n_0=m_0=1$, after $T$ time steps, the probability of there being $n$ black balls in the urn is the same for all $1\le n\le T$. In general it converges to a beta distribution.
I'm interested in the following variation. Fix a natural number $k>1$. Then start with $n_0$ black and $m_0$ white balls in the urn where $n_0+m_0\ge k$. On each step,
- draw $k$ balls from the urn (without replacement)
- duplicate all $k$ balls, so if $k=2$ and we drew a black ball and a white ball, we now have two black balls and two white balls
- put all $2k$ balls back into the urn, so it now contains $k$ more balls than it did previously.
I would expect this to behave similarly to Pòlya's urn when the number of balls in the urn is large, because each step will be approximately the same as performing $k$ steps of Pòlya's urn, but when there are small numbers of balls in the urn it will be different.
Here is a histogram from simulating the case where $k=2$ and $m_0=n_0=1$, running for 100 time steps.
It looks qualitatively similar if I increase or decrease the number of steps. The main question is what distribution this is converging to (as a function of $n_0, m_0$ and $k$). In this case it could plausibly be a beta distribution with parameters at some non-integer value between 2 and 3:
but it could equally be something else.
More generally, I'm interested in whether or not this can be seen as a natural generalisation of Pòlya's urn. That's something of a soft question, but if someone has an intuition for whether we should or shouldn't expect this model to share some of the nice properties that Pòlya's urn has, for example in terms of exchangeability, that would be really useful.
(There are many previous questions about variations of Pòlya's urn but I don't think this specific one has been asked about before. Apologies if it has and I missed it.)


