I am curious why Ash et al. (1999) introduces the outer measure when the authors try to extend the measure to a large class of sets.
Here is the basic roadmap they take:
- Begin with $\mathscr{F}_0$, a field of subsets of a set $\Omega$. Let $P$ be a probability measure on $\mathscr{F}_0$. Consider increasing sequences of sets where the limiting sets may not belong to $\mathscr{F}_0$, but establish that if the two limiting sets $A\subset A'$, then $\lim_{m\rightarrow\infty}P(A_m)\leq\lim_{n\rightarrow\infty}P(A'_n)$. If both sequences increase to the same limit, it is equality.
- The authors produce a larger set, $\mathscr{G}$, which is the collection of all limits of increasing sequences of sets in $\mathscr{F}_0$, essentially the collection of all countable unions of sets in $\mathscr{F}_0$. Because in (1), we established how the probability measure $P$ behaves in the limit of increasing sequences, this first extension of $P$ to $\mathscr{G}$ is natural. This extended measure is denoted as $\mu$ on $\mathscr{G}$.
- Now, the authors extend $\mu$ to the class of all subsets of $\Omega$, which also seems natrual, but the authors comment that the extension will NOT be countably additive on all subsets, but only on a smaller $\sigma$-field. Then, they introduce the outer measure definition as. $\mu^*(A)=\inf\{\mu(G):G\in\mathscr{G},A\subset G\}.$
My Question:
Why do we introduce this $\mu^*$ in step 3 of the extension of measures? Also, is this $\mu^*$ pictorially putting smaller boxes (i.e. bunch of subsets of $\Omega$ over $A$) to cover $A$?
Reference: $\textit{Probability and Measure Theory}$ (Robert B. Ash and Catherine A. Doleans-Dade), Harcourt/Academic Press, 1999.