[This is from Keeling's notes on radioactive decay here]
From the radioactive decay wiki page:
Radioactive decay is the process by which an unstable atomic nucleus loses energy by radiation.
Radioactive decay is a random process at the level of single atoms. According to quantum theory, it is impossible to predict when a particular atom will decay, regardless of how long the atom has existed. The half-lives of radioactive atoms have a huge range: from nearly instantaneous to far longer than the age of the universe.
Consider radioactive decay of an element, known to be memoryless in waiting time. We have waiting time random variable ${ T \sim \text{Exp}(\lambda) , }$ where the constant ${ \lambda > 0 }$ depends on the nature of the element.
Rare events like earthquakes also have memorylessness in waiting time, and the waiting time can be modelled by an exponential random variable.
Let random variable ${ N(\tau) }$ denote the number of radioactive decays that occur in the time interval ${ [0, \tau]. }$ Let the PMF of ${ N(\tau) }$ be given by
$${ p _{\tau} (k) = \mathbb{P}(N(\tau) = k) \quad \text{ for } k \in \mathbb{Z} _{\geq 0} . }$$
For there to be ${ k }$ events in the time interval ${ [0, \tau] , }$ there must be
- ${ k - 1 }$ events in an interval ${ [0, s ], }$ ${ s \in (0, \tau) }$ with probability
$${ \mathbb{P}(N(s) = k-1) = p _{s} (k-1) }$$
- ${ 1 }$ event in an interval ${ [s, s + ds] }$ with probability
$${ {\begin{aligned} &\, \mathbb{P}(N(s + ds) = k \, \vert \, N(s) = k - 1) \\ = &\, \mathbb{P}(T \in [s, s + ds] \, \vert \, T > s) \\ = &\, 1 - e ^{- \lambda ds } \\ = &\, \lambda ds + o(ds) \end{aligned}} }$$
- no events in an interval ${ [s + ds, \tau] }$ with probability
$${ {\begin{aligned} &\, \mathbb{P}(N(\tau) = k \, \vert \, N(s + ds) = k) \\ = &\, \mathbb{P}(T > \tau - (s + ds)) \\ = &\, e ^{- \lambda (\tau - (s + ds)) } \\ = &\, e ^{- \lambda (\tau - s)} + o(ds) . \end{aligned}} }$$
Since these events are independent, the probability of all three is the product of the three. Integrating this over all possible intermediate times ${ s }$ we have
$${ {\begin{aligned} p _{\tau} (k) = &\, \int _0 ^{\tau} p _s (k - 1) \, (\lambda ds) \, e ^{- \lambda (\tau - s)} \\ = &\, \lambda e ^{- \lambda \tau } \int _0 ^{\tau} p _s (k - 1) e ^{\lambda s} \, ds . \end{aligned}} }$$
Now differentiating this wrt ${ \tau }$ we have
$${ p _{\tau} ^{'} (k) = - \lambda ^2 e ^{- \lambda t } \int _0 ^{\tau} p _s (k - 1) e ^{\lambda s} \, ds + \lambda e ^{- \lambda \tau } p _{\tau} (k - 1) e ^{\lambda \tau } }$$
that is
$${ \boxed{p _{\tau} ^{'} (k) = - \lambda p _{\tau} (k) + \lambda p _{\tau} (k - 1)} . }$$
For ${ k = 0 }$:
$${ p _{\tau} ^{'} (0) = - \lambda p _{\tau} (0) }$$
that is
$${ p _{\tau} (0) = e ^{- \lambda \tau } . }$$
For ${ k = 1 }$:
$${ {\begin{aligned} p ^{'} _{\tau} (1) = &\, - \lambda p _{\tau} (1) + \lambda p _{\tau} (0) \\ = &\, - \lambda p _{\tau} (1) + \lambda e ^{- \lambda \tau} . \end{aligned}} }$$
Recall the differential equation ${ \frac{dy}{dx} + P(x) y = Q(x) }$ has an integrating factor ${ e ^{\int P(x) \, dx } . }$ Hence
$${ \frac{d}{d\tau} (e ^{\lambda \tau} p _{\tau} (1) ) = \lambda }$$
that is
$${ p _{\tau} (1) = e ^{-\lambda \tau} \lambda \tau . }$$
For ${ k = 2 }$:
$${ {\begin{aligned} p ^{'} _{\tau} (2) = &\, - \lambda p _{\tau} (2) + \lambda p _{\tau} (1) \\ = &\, - \lambda p _{\tau} (2) + \lambda e ^{-\lambda \tau} \lambda \tau \end{aligned}} }$$
that is
$${ \frac{d}{d \tau} (e ^{\lambda \tau } p _{\tau} (2) ) = \lambda ^2 \tau }$$
that is
$${ p _{\tau} (2) = e ^{- \lambda \tau} \frac{(\lambda \tau) ^2}{2} . }$$
Formally by induction
$${ \boxed{ p _{\tau} (k) = e ^{- \lambda \tau} \frac{(\lambda \tau) ^k}{k ! } \quad \text{ for } k \in \mathbb{Z} _{\geq 0} } }$$
as needed.
For ${ \tau = 1 }$ the above probability distribution is called the Poisson distribution with parameter ${ \lambda > 0 . }$
is my take on this question. I spent a lot of time on this page and the linked pages trying to get a visual / combinatorial intuition for what was going on. I ended up turning my solution into a blog post geared toward those without much of a formal math background
– David May 13 '24 at 19:55