Can the sample standard deviation of a combined data set be lower than the sample standard deviation of each separate data set? At first thought, my answer was no: at the lowest, it could only equal that of the separate data sets. However, when I tried to show this, I assumed that the mean, sample standard deviation, and sample size of both data sets are equal.
Fooling around with an example: $$\text{data set 1} = \text{data set 2}:\ 1,2,3,4,5 \quad N=5,\ \mu=3,\ s\approx 1.58$$ $$\Rightarrow \text{combined data set}:\ 1,1,2,2,3,3,4,4,5,5 \quad N=10,\ \mu=3,\ s\approx 1.49$$ So I'm thinking that for any two sample data sets with equal sample sizes, equal sample standard deviations, and equal means, the sample standard deviation of the combined data set will be lower than the common value $s$, and will approach $s$ as $N$ goes to infinity.
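The example above can be reproduced with a quick numerical check. This is only a sketch in Python using the standard `statistics` module; the helper `sd_ratio` and the choice of the data set $1,2,\dots,n$ duplicated are my own illustrative assumptions, and it tests just one family of data sets, so it suggests the pattern rather than proving it.

```python
import statistics

def sd_ratio(n):
    """Ratio (combined sample SD) / (single sample SD) for 1..n duplicated.

    Hypothetical helper: both data sets are the same list 1, 2, ..., n,
    so they share size, mean, and sample standard deviation.
    """
    data = list(range(1, n + 1))
    return statistics.stdev(data + data) / statistics.stdev(data)

# The example from the question: 1..5 combined with a copy of itself.
s_single = statistics.stdev([1, 2, 3, 4, 5])
s_combined = statistics.stdev([1, 2, 3, 4, 5] * 2)
print(f"{s_single:.2f} {s_combined:.2f}")  # 1.58 1.49

# The ratio stays below 1 and creeps toward 1 as n grows.
ratios = [sd_ratio(n) for n in (5, 50, 500, 5000)]
print(ratios)
```

In this family the ratio is always below 1 and increases toward 1 with $n$, which matches the guess, but it says nothing yet about data sets with other shapes.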
How can I show this mathematically?