7

Consider $P:A\to\mathbb{R}$ where $P(x)> 0 \text{,} \ $ $A$ is an subset of $\mathbb{R}$ and

$$P(x)= \begin{cases} P_1(x) & x \in A_1\\ P_2(x) & x \in A_2\\ \end{cases}$$

where $A=A_1\cup A_2$ and $A_1,A_2$ are pairwise disjoint.

What is the "most intuitive" average of $P(x)$, for all $x \in A\cap[a,b]$, that is unique and where $a,b\in\mathbb{R}$?

Note “unique” means a system that produces a single average, instead of multiple averages, using reasoning/intuition.

Also note this question is subjective but there are areas of math (such as measure theory) that is based on intuitions/ intuitive properties. For example, we can pick any arbitrary measure to find the average of Dirichlet's Function in $[0,1]$, but instead, we use the Lebesgue measure.

Why is the Lebesgue measure intuitive? Because if real numbers in $[a,b]$ are written as infinite, randomly generated digits; there is zero probability they will represent a number in $[a,b]$. Moreover, there is zero probability that finite, randomly generated digits will repeat infinitely to give rational numbers; hence, these digits have a probability $1$ of being irrational. Say we multiplied these intuitive probabilities times $b-a$. This gives the results of the Lebesgue measure. Hence we can state the Lebesgue measure is intuitive for these cases?

However, if we are focusing on countable $A\cap[a,b]$, it should still have a measure of one and the average of $P$ should exist between the infimum and supermum of its range. The problem is most countably additive measures of $A\cap[a,b]$ is zero giving $P$ an average outside the range.

What measure should we use instead? What do you think are intuitive properties/examples that the average should follow?

Here are two examples, I believe our measure and average should match


If $A$ has a Lebesgue measure of one the average of $P$ should intuitively be

$$\int_{\mu(A_1\cap[a,b])}P_1(x)dx+\int_{\mu(A_2\cap[a,b])}P_2(x)dx$$

Where $\mu$ is a measure that equals the Lebesgue measure (when the Lebesgue measure of $A$ is one).

Even better, we could use the Gauge Integral, which is far more general and doesn't use a measure. However, even the Gauge integral gives the average of $P$ zero, when defined on a domain with zero Lebesgue measure.


Here are two cases that the average of $P$, where the Lebesgue measure of $A\cap[a,b]$ is zero, should intuitively follow.

If $A_1$ and $A_2$ are finite in $[a,b]$, the average of $P$ is

$$\frac{\sum\limits_{i\in A\cap[a,b]}P(i)}{\left|A\cap[a,b]\right|}=\frac{\left|A_1\cap[a,b]\right|\sum\limits_{i_1\in A_1\cap[a,b]}P(i_1)}{\left|A_1\cap[a,b]\right|\left|A\cap[a,b]\right|}+\frac{\left|A_2\cap[a,b]\right|\sum\limits_{i_2\in A_2\cap[a,b]}P(i_2)}{\left|A_2\cap[a,b]\right|\left|A\cap[a,b]\right|}=\frac{\left|A_1\cap[a,b]\right|}{\left|A\cap[a,b]\right|}\sum\limits_{i_1\in A_1\cap[a,b]}P(i_1)\frac{1}{\left|A_1\cap[a,b]\right|}+\frac{\left|A_2\cap[a,b]\right|}{\left|A\cap[a,b]\right|}\sum\limits_{i_2\in A_2\cap[a,b]}P(i_2)\frac{1}{\left|A_2\cap[a,b]\right|}$$

And probability measure $\mu(X\cap[a,b])$ for $X\subseteq A$ is

$$\mathbf{W_1}=\mu(A_1\cap[a,b])=\frac{\left|A_1\cap[a,b]\right|}{\left|A\cap[a,b]\right|}$$

$$\mathbf{W_2}=\mu(A_2\cap[a,b])=\frac{\left|A_2\cap[a,b]\right|}{\left|A\cap[a,b]\right|}$$

$$\mathbf{W}_1\sum\limits_{i_1\in A_1\cap[a,b]}P(i_1)\frac{1}{\left|A_1\cap[a,b]\right|}+\mathbf{W}_2\sum\limits_{i_2\in A_2\cap[a,b]}P(i_2)\frac{1}{\left|A_2\cap[a,b]\right|}$$


If $A_1$ is countable infinite in $[a,b]$ and $A_2$ is finite, since $A\cap[a,b]$ is countably infinite, if $\mathbf{J}_m$ contains all finite subsets of $A\cap[a,b]$ where $\max\limits_{m\in\mathbb{N}}|\mathbf{J}_m|=\left|\mathbf{I}_n\right|\le n$; $\mathbf{I}_n$ is arranged as $\left\{a_1,a_2,...,a_n\right\}$ where $a\le a_1<a_2<...<a_n\le b$, and

$$ \max_{i\le n,i\in\mathbb{N}}\left(\text{diff} (\mathbf{I_n})\right)=\max_{i\le n,i\in\mathbb{N}}\left\{a_2-a_1,a_3-a_2,...,a_n-a_{n-1}\right\}\le K(n)$$

such that $K:S\to\mathbb{R}$, $\mu(S)=1$ and $\lim\limits_{n\to\infty}K(n)=\inf\text{diff}(A)$; then the measures $W_1$ and $W_2$ are

$$\mathbf{W}_{1}=\mathbf{\mu}(A_1\cap[a,b])=\lim_{n\to\infty}\frac{\left|A_1\cap \mathbf{I}_n\cap[a,b]\right|}{\left|\mathbf{I}_{n}\cap[a,b]\right|}$$

$$\mathbf{W}_{2}=\mathbf{\mu}(A_2\cap[a,b])=\lim_{n\to\infty}\frac{\left|A_2\cap \mathbf{I}_n\cap[a,b]\right|}{\left|\mathbf{I}_{n}\cap[a,b]\right|}$$

And $\text{avg}(P)$ is

$$\mathbf{W}_{1}\lim_{n\to\infty}\sum_{i_1\in A_1 \cap I_n \cap [a,b]}P_1(i_1)\frac{1}{\left|A_1\cap I_n\cap[a,b]\right|}+\mathbf{W}_{2}\lim_{n\to\infty}\sum_{i_2\in A_2 \cap I_n \cap [a,b]}P_2(i_2)\frac{1}{\left|A_2\cap I_n\cap[a,b]\right|}$$

Any $\mathbf{I}_n$ gives the same answer but I'm not sure how to prove this.

Moreover, if $A$ is dense in $[a,b]$,

$$\text{avg}(P)=\frac{1}{b-a}\mathbf{W}_1\int_{a}^{b}P_1(x) \ dx+\frac{1}{b-a}\mathbf{W}_2\int_{a}^{b}P_2(x) \ dx=\frac{1}{b-a}(1)\int_{a}^{b}P_1(x) \ dx+\frac{1}{b-a}(0)\int_{a}^{b}P_2(x) \ dx$$

$$\frac{1}{b-a}\int_{a}^{b}P_1(x) \ dx$$

Here's the proof. Since $A$ is dense in $[a,b]$, it can approximate arbitrarily close to any point in $\mathbb{R}$. Hence it's possible for the limits outside the domain to exist. We extend $P:A\to\mathbb{R}$ to $P:A \cup C\to\mathbb{R}$ where for $c\in \mathbb{R}\setminus A$

$$C=\left\{P^{-1}\left(\lim_{\left\{ x\in A\right\}\to c}P(x)\right) \right\}$$

Since $A_1$ is dense in $\mathbb{R}$, $P:A\cup C\to\mathbb{R}$ can be split into $P_1:A_1\cup C\to\mathbb{R}$ and $P_2:A_2\to\mathbb{R}$.

Since $A_2$ is the discontinuties of $P:A\cup C \to \mathbb{R}$ and the Lebesgue measure of $A_2$ is zero, the Lebesgue criteria for reimman integration states the riemman integral for $P_1:A_1\cup C \to \mathbb{R}$ can exist. Hence we apply the mean value theorem of integration on $P_1$.


If $A_1$ and $A_2$ is dense in $[a,b]$ the average is unclear. For one, we could set $\mathbf{I}_n$ to equal subsets of $A_1$, $A_2$ or $A_1\cup A_2$. In this case, $\mathbf{W}_1$ or $\mathbf{W}_2$ could equal anything

I will post my intuition for this example. If you have a measure that matches these intuitions, post it below.

Arbuja
  • 53
  • This leaves us with the sequence that seperate the divisor two.

    $$F_n \cap [a,b]=\left{\frac{p}{2^k(2q+1)}:p,q\in\mathbb{Z}, k\in\mathbb{N},a<\frac{p}{2^k(2q+1)}<b,2^k\le n, 2q+1 \le n\right}\bigcup \left{\frac{jp}{2^k(2q+1)}:p,q\in\mathbb{Z}, k\in\mathbb{N},a<\frac{jp}{2^k(2q+1)}<b,2^k\le n, 2q+1 \le n\right}$$

    It is important to note any $h\in\mathbb{R}$ can be written as $jt$ such that $j\in\mathbb{R}\setminus\mathbb{Q}$ and $t\in\mathbb{Q}$. The hard part is turning any countable set into a Folner Sequence.

    – Arbuja Aug 21 '19 at 03:52
  • Is this reasonable? If not could we find the most "intuitive" Folner Sequence of any countable set in the form – Arbuja Aug 21 '19 at 03:53
  • 1
    Wait, so, we need to specify a group action to talk about a Følner sequence on $A$, right? Which action are you using? – Alexander Gruber Aug 21 '19 at 06:17
  • 10
    This question is really long and complicated so I only read the first part. You could clarify some things since the introduction is not totally clear. For one thing, $A$ is a countable subset of $\mathbb R$, right? And the cases should be $x\in A_1$ and $x\in A_2$ instead of $x=A_1$ and $x=A_2$. Finally, you could clarify that you want to define an average of $P(x)$ over all $x\in A\cup[a,b]$ -- at least, without realizing (from reading further in the question) that $A\subset\mathbb R$, I could only make sense of it as the average of all images $P(x)\in[a,b]$. –  Aug 21 '19 at 07:07
  • @AlexanderGruber Left group action. – Arbuja Aug 21 '19 at 14:40
  • @Rahul Made some edits. – Arbuja Aug 21 '19 at 15:04
  • of what group?${}$ – Alexander Gruber Aug 21 '19 at 17:56
  • @AlexanderGruber I assume we are taking the left group action of $(\mathbb{Z},+)$. Does this make sense? I’m not sure, I’m a first year (technically third year) undergraduate. – Arbuja Aug 21 '19 at 23:10
  • @AlexanderGruber It could be $\left(\mathbb{Z},+ \right)$ or $\left(\mathbb{Z},\times \right)$, I’m not sure. – Arbuja Aug 21 '19 at 23:21
  • @AlexanderGruber Nevermind, let us start with an example. If I want the Folner Sequence of $\mathbb{Q}$ to be $\left{\frac{p}{2^{k}(2q+1)}:p,q\in\mathbb{Z}, k\in\mathbb{N},2^{k} \le n, 2q+1 \le n \right}$ what action must we use? – Arbuja Aug 22 '19 at 01:24
  • wouldn't you just use Cesaro sums? (https://en.wikipedia.org/wiki/Cesàro_summation) – Zach Hunter Aug 22 '19 at 14:47
  • @ZacharyHunter We have enumerate any countable set into a sequence of elements? How do we do that? And if there are multiple sequences to enumerate a countable set which one is most intuitive? It would be pointless to say there is no average, between countable $A_1$ and $A_2$, both dense in $\mathbb{R}$. – Arbuja Aug 22 '19 at 16:05
  • I mean bijecting countable sets to the real numbers is achievable by their very definition. however I guess if you have infinite 1's and 2's, you can arrange them in such a way that you can get any average within the range [1,2]... I guess perhaps you would want to take an average of all "dense limit points" $x$ of $A$, where for any finite subset $S$, $x$ is still a limit point of $A \setminus S$. – Zach Hunter Aug 22 '19 at 16:19
  • if the set of dense limit points is finite, you'e done. if there's more than countably infinite of them, take an integral, else, rinse and repeat. – Zach Hunter Aug 22 '19 at 16:23
  • @ZacharyHunter I’m not taking the average of dense limit points. If $A$ is countably dense in $\mathbb{R}$ the output of $P(x)$ is $P_1(x)$ or $P_2(x)$ for $x\in\mathbb{R}$. The result wouldn’t be a function anymore. We need a different approach. – Arbuja Aug 22 '19 at 16:44
  • @ZacharyHunter If the average could be anything in the range then we should note that. However, could we narrow this down to “fewer averages”. – Arbuja Aug 22 '19 at 18:48
  • @JoséCarlosSantos Why is my post unclear? – Arbuja Aug 31 '19 at 12:46
  • @verret Why is my post unclear – Arbuja Aug 31 '19 at 12:47
  • @GEdgar Why is my post unclear? – Arbuja Aug 31 '19 at 12:50
  • @Shaun Why is my post unclear? – Arbuja Aug 31 '19 at 12:50
  • It's changed since I voted to close it as unclear. At the time I voted, it was a huge post that didn't have a clear question in it, at least at a cursory glance. – Shaun Aug 31 '19 at 12:59
  • @Shaun Could you reopen it? – Arbuja Aug 31 '19 at 12:59
  • 2
    If we had no intuition at all about the probability of repeating decimals, the irrationals would still have full Lebesgue measure, because we could prove it from the definitions. – Gerry Myerson Aug 31 '19 at 13:01
  • 1
    I voted to reopen, although What is the "most intuitive" average is unclear to me. I've removed the cryptic group-theoretic tags and included the [tag:intuition] tag. I'm not keen on reopening it to be honest, even now. – Shaun Aug 31 '19 at 13:03
  • @GerryMyerson If it’s intuitive a singularity and the rationals should have a measure of zero, and the irrationals should have a measure of one, then we can create a measure that meets these requirements. The lebesgue measure works for all these cases. However when we add the intuition that the average, using a measure, should be in the range of the function, then the Lebesgue measure does not work when the domain has a Lebesgue measure of zero. – Arbuja Aug 31 '19 at 13:18
  • I used to think that Lebesgue measure is intuitive, since it is translation invariance and has $m([0,1]) = 1$. That $\mathbb Q$ has measure zero is just a consequence. – Arctic Char Aug 31 '19 at 21:17

2 Answers2

6

There is no unique intuitive average of $P$ over an arbitrary countable subset $A$. This is for two reasons. First, viewing $A$ as just an abstract set of points to be averaged over is problematic. Any way to define an average over $A$ would have to use a specific ordering of the elements of $A$, and thus no unique or intuitive average could be achieved. You thus want to use the fact that $A$ is a subset of $\mathbb{R}$ (after all, why did you decide you wanted $A$ to be a subset of $\mathbb{R}$ and not just an abstract infinite set!). So we want to use some of the structure of $\mathbb{R}$; note that the Lebesgue average does this -- it uses intervals as the building blocks for the Lebesgue measure. Second, and related to the first, is that $P$ is an arbitrary function. If $P$ is an arbitrary function, then it doesn't care about the structure of $\mathbb{R}$; $A$ very well might then be an abstract infinite set on which $P$ is defined, and then there is no way to use that $A$ is a subset of the structured set $\mathbb{R}$. We therefore want to put some restrictions on $P$. One natural one is that $P$ is continuous.

With this context and these assumptions in mind, I'll try to provide a satisfactory answer to your (refined) question. We can assume $A \subseteq [a,b]$, since that's where everything is happening. For ease, I'll have $[a,b] = [0,1]$. Based on the above discussion, I'll just have $P$ be defined on all of $[0,1]$ and continuous. Let $E_1 = [0,1], E_2 = [0,\frac{1}{2}], E_3 = [\frac{1}{2},1], E_4 = [0,\frac{1}{4}], E_5 = [\frac{1}{4},\frac{1}{2}], E_6 = [\frac{1}{2},\frac{3}{4}]$, $E_7 = [\frac{3}{4},1], E_8 = [0,\frac{1}{8}]$, etc.

If $A$ is finite, it's obvious how to define the average of $P$ (just do $\frac{1}{|A|}\sum_{x \in A} P(x)$). So, assume $A$ is infinite. Consider the sets $A\cap E_1, A\cap E_2, \dots$. Let $x_1$ be a point in the first nonempty one of these sets. Let $x_2$ be a point in the second nonempty of these sets, etc. Look at the measures $\delta_{x_1}, \frac{\delta_{x_1}+\delta_{x_2}}{2}, \dots, \frac{\delta_{x_1}+\dots+\delta_{x_N}}{N},\dots$. Since $[0,1]$ is a compact metric space, there is some probability measure $\mu$ on $[0,1]$ that is a weak* limit of some subsequence of these measures, i.e. there is some $(N_k)_k$ with $\frac{1}{N_k}\sum_{j=1}^{N_k} f(x_j) \to \int_0^1 f d\mu$ for each $f \in C([0,1])$.

We then define the average of $P$ over $A$ to be $\int_0^1 Pd\mu$.

Benefits of this definition: (1) It coincides with the Lebesgue measure when $A = \mathbb{Q}$; in fact, it coincides with the Lebesgue measure whenever $A$ is dense in $[0,1]$. (2) It is localized to the right places (e.g. $A \subseteq [0,\frac{1}{2}]$ implies $\mu$ lives on $[0,\frac{1}{2}]$). (3) It is intuitively an average; we are sampling "randomly" from the interval $[0,1]$ and taking a limit of these empirical averages of samples.

Cons of this definition: (1) It is not unique (for two reasons: (a) the choice of the $x_i$'s is not unique; (b) there might be multiple weak* limits). However, I don't think this can be avoided -- I don't think one can get a unique, intuitive average over an arbitrary countably infinite set.

My answer to your question from 2 years ago (!) might be useful (good you're still studying this stuff).

I'll end with an interesting example. Consider $A = \{1,\frac{1}{2},\frac{1}{3},\frac{1}{4},\frac{1}{5},\dots\}$. We can have, for example, $x_1 = \frac{1}{3},x_2 = 1, x_3 = \frac{1}{5},x_4 = \frac{1}{2},\dots$. Note, for fun, that each element of $A$ will eventually be included in $(x_n)_n$ (to see this, note that any number $\frac{1}{l}$ is the only element of $A$ in an interval of the form $[\frac{m}{2^k},\frac{m+1}{2^k}]$ if $k$ is large enough). In any event, the measure $\mu$ obtained will just be $\delta_0$, the delta mass at $0$, so that $\int Pd\mu = P(0)$ is just the value of $P$ at $0$. This is intuitive, since the elements of $A$ are nearly all near $0$.

.

Added: Let $P: \mathbb{Q} \cap [0,1] \to \mathbb{R}$ be $P(x) = x^2$. Let $\tilde{P}: [0,1] \to \mathbb{R}$ be $\tilde{P}(x) = x^21_{\mathbb{Q}}(x)$, and let $T: [0,1] \to \mathbb{R}$ be $T(x) = x^2$. The Lebesgue integral of $\tilde{P}$ over $[0,1]$ is $0$, and the Lebesgue integral of $T$ over $[0,1]$ is $\frac{1}{3}$. It doesn't really make sense to say "the Lebesgue integral over $\mathbb{Q}$", but, for example, what one would mean when one says "the Lebesgue integral of $x^2$ over $\mathbb{Q}$ is $0$" is "the Lebesgue integral of $\tilde{P}$ over $[0,1]$ is $0$". The interval $[0,1]$ has the Lebesgue measure on it, so we can integrate functions over it. Since $\tilde{P}(x) = 0$ for almost every $x \in [0,1]$, it makes sense that the integral of $\tilde{P}$ is $0$. The Lebesgue integral is intuitive.

What you want, though, is to define a measure $\mu$ over $\mathbb{Q}$, so that "the average of $P$ over $\mathbb{Q}$" is simply $\int Pd\mu$. You want something different than the Lebesgue measure over $[0,1]$. (Note there is some confusing terminology here. The integral $\int Pd\mu$ is still called a "Lebesgue integral" even though the measure we are integrating over is not the Lebesgue measure). The way I defined that measure $\mu$ is described above. If $A = \mathbb{Q}$, or $\mathbb{Q}\cup \{\frac{\ln(m+\sqrt{3})}{100} : m \in \mathbb{N}\}$, or just any dense set, then what $\int Pd\mu$ turns out to be is $\int_{[0,1]} T dx$, the Lebesgue/Riemann integral of $T$ with respect to the Lebesgue measure, where $T$ is the continuous extension of $P$ to $[0,1]$. In particular, if $P = x^2$ and $A = \mathbb{Q}$, the "average of $P$ over $A$" given by my answer $\int Pd\mu$ is $\int_0^1 x^2dx = \frac{1}{3}$, the intuitive answer you sought.

mathworker21
  • 35,247
  • 1
  • 34
  • 88
  • Comments are not for extended discussion; this conversation has been moved to chat. – quid Sep 02 '19 at 23:16
  • @mathworker21 One more test. If $P(x)=x^2$ for $x\in[0,1/3]$, $P(x)=\sqrt{x}$ for $x\in[1/3,2/3]\cap\mathbb{Q}$, and $P(x)=3$ for $x\in[1/3,1]\cap\mathbb{R}\setminus\mathbb{Q}$, what is the average? – Arbuja Sep 03 '19 at 19:09
  • @Arbuja Wait, when you say $P(x) = 3$ for $x \in [1/3,1]\cap \mathbb{R}\setminus\mathbb{Q}$, is that $1/3$ supposed to be a $2/3$? I assumed it was in my above comment. – mathworker21 Sep 03 '19 at 19:14
  • @mathworker21 It's supposed to be $1/3$. I wanted to see what happend if we intersected the rationals with the irrationals. – Arbuja Sep 03 '19 at 19:21
  • @mathworker21 But seeing your answer, I'm completely sure this is perfect. – Arbuja Sep 03 '19 at 19:23
  • @Arbuja That's a good question. My average isn't unique, for the reasons described in my answer. And I think the example of intersecting rationals with irrationals is a big reason why we can't hope for uniqueness, since they are both dense. If you want to get uniqueness, you'd have to incorporate both measure theoretic and topological considerations about your set $A$. And these two can conflict with one another in certain situations. – mathworker21 Sep 03 '19 at 19:25
  • @mathworker21 Are you a professor. If not do you know any professors that can answer my question? Perhaps they have a better solution. – Arbuja Sep 04 '19 at 00:00
  • @Arbuja I've spent a long time on my answer and replying to all of your comments. I'd appreciate if you accepted my answer. – mathworker21 Sep 04 '19 at 00:12
  • Sorry if it seemed I insulted you and sorry for my carelessness confusion. I'm kind of depressed.....I hoped there was more to explore. – Arbuja Sep 04 '19 at 00:20
  • 1
    @Arbuja There is more to explore! There are tons of more questions to ask and different, more specific, setups (of $A$ and $P$) to think about; all I was saying is that, at this point, I think I have adequately answered the question you posed. I am happy to have helped out and clarified your confusion. You have asked great questions! – mathworker21 Sep 04 '19 at 00:26
  • @mathworker21 Does my answer give any more ideas? I know arbitrary countably infinite sets do not need structure but how else can we represent them? If sets do have a structure, I'm considering using specific Folner Sequences using intuition? Does this make sense? – Arbuja Sep 04 '19 at 16:24
  • @mathworker21 , what if you made your integral like a darboux integral? – Zach Hunter Sep 05 '19 at 16:32
  • @ZacharyHunter I don't see how my whole "weak$*$ limit of measures thing would work" – mathworker21 Sep 05 '19 at 17:07
  • @Arbuja I still have to read it. – mathworker21 Sep 05 '19 at 17:07
  • @mathwork21 If you downvoted my answer what’s is unclear about it? – Arbuja Sep 06 '19 at 13:26
  • @Arbuja I didn't downvote. I've been busy; I'll try to read it later. – mathworker21 Sep 07 '19 at 06:42
  • @mathworker21 I know you’re extremely busy but when will you have time to check my answer? – Arbuja Sep 12 '19 at 22:58
  • @Arbuja I'll try to get to it soon. Are you sure it's easy to read. Scrolling through it, the notation looks non-standard. – mathworker21 Sep 13 '19 at 00:08
  • @mathworker21 I'm not sure how else to describe it. I could do years of rigorous reading but this would take forever. – Arbuja Sep 13 '19 at 01:24
  • @mathworker21 How can we incorporated Density Rock answer this question. – Arbuja Oct 31 '19 at 21:31
  • @mathworker21 I think I proved you wrong. See this post. – Arbuja May 22 '20 at 14:46
  • @mathworker21 See this. A unique intuitive average can be determined. – Arbuja Jun 07 '20 at 18:12
  • @mathworker21 I haven't given up. I know I'm right. See this. A uniquely intuitive average can be determined. – Arbuja Jun 18 '20 at 17:56
2

The essence of this question moreso seems to be how to intuitively integrate on a countably infinite set. So, before we get into integrating with respect to some probability functions, lets see if we can integrate with respect to $f(x)=x$.In it's purest form, an integral is a bit like a fancy average, so let's try and think what behaviors we would want:

Let's say we have a set $A = \{C, 0,0,0,0 \dots \}$. Then treating it like a psuedo-Cesaro-sum, if we make $a_0=C$, we get: $$\lim_{n \to \infty} \frac{1}{n} \sum_{i=0}^n a_i = \lim_{n \to \infty}\frac{1}{n}(C+ \sum_1^n 0) = \lim_{n \to \infty}\frac{C}{n} = 0$$ This works out quite nicely. Generally, it should make sense that any finite number of points should not impact the average of $A$. This is a bit analogous to how in Lesbegue integration, any countably infinite amount of points doesn’t effect the integral. Explicitly:

Property 1: $\forall S= \{C_1,C_2,C_3\dots C_k\}, average(A)=average(A \setminus S)$ 
However, psuedo-Cesaro-sums aren’t perfect. Consider the set $A = \{0,1,0,1,0,1,0,1 \dots\}$ where $0$ and $1$ occur infinite times. There are many ways we can order $a_i$ such that both $0$ and $1$ appear infinite times, that still different results:

$$ 0+1+0+1 \dots = \frac{1}{2} $$ $$ 0+0+1+0+0+1 \dots = \frac{1}{3} $$ $$ 0+0+0+1+0+0+0+1 \dots = \frac{1}{4} $$ $$ 0+0+0+1+1+0+0+0+1+1 \dots = \frac{2}{5} $$

In fact, you can end up with any average in $[0,1]$ being your average. In fact, this is tangentially similar to the Riemann Series Theorem, which is about how you can rearrange the terms certain kinds of sums to get different values. So, how do we handle this? If we tried to take the “average” of psuedo-Cesaro-sums, that doesn’t really help, the whole reason we’re in this mess is because averaging with infinities is hard. Despite this being an adhoc proposal, let’s assert:

Property 2: if you have a finite set $S = \{a,b,c\dots z \}$, and $A$ has countably infinite copies of each element of $S$, $average(A) = average(S) = \sum_{x \in S} x/|S|$

I realize with some clever model of infinite permutations you could possibly rigorously prove that the average of a psuedo-Cesaro-sum for all permutations of $A$ will be this, but that sounds rather messy and I hope we can all intuitively accept property 2.

However, we are not done, in fact, here is where things get really iffy. Consider the sets $A_1 = \{0,.9,1.1,0,.99,1.01,0,.999,1.001\dots\}$ and $A_2 = \{0,.9,0,.99,0,.999\}$. For this I suggest the analysis concept of “limit points”. $x$ is a limit point of a set $A$ if $\forall \epsilon > 0, \exists y \in A \textrm{ s.t. } |x-y| < \epsilon$. We will say $x$ is a dense limit point of $A$ if $\forall S \subset A$ where $S$ is a finite set, $x$ is a limit point of $A \setminus S$. Now, let $x$ be dense limit point of $A$, and $y_0, y_1 \dots$ be a sequence of $y \in A$ s.t. $|x-y_i| < \epsilon_i$ with $\epsilon_0 > \epsilon_1 > \epsilon_2 \dots$ being a sequence with approaches zero. If we took the psuedo-Cesaro-sum with $a_i = y_i$, we would get that it approaches $x$, and it would stay that way even if we permitted the sequence, which also occurs if $a_i = x$ and we take the psuedo-Cesaro-sum. So, going out on a limb, let’s do the following:

Let $D(A) := \{x|\forall S \subset A,∀ \epsilon > 0, ∃y \in A \setminus S \textrm{ s.t. } |x-y| < \epsilon \}$, for finite sets $S$. If $A$ is a countably infinite set, let $average(A) = average(D(A))$. This satisfies properties 1 & 2. 
Now, if $D(A)$ is a finite set, we can rely on the elementary average where we sum all elements and divide by $|S|$. If $D(A)$ has non-zero measure, then we can simply get the average by integrating over $A$, and divide by it’s measure. Finally, we have the case where $D(A)$ is also countably infinite, where we recursively say $average(D(A)) = average(D(D(A))$. I don’t believe that you can have infinite countably infinite $D(A)$, but lack a proof for this. Neveretheless, let’s just classify $A$ with that property as being pathological, and out of our ability to average.

With these rules in line, we can average any non-pathological countably infinite set $A$ bounded by some interval $[a,b]$. By this is I mostly meant okay that we will not go from a countably infinite set $D_n(A)$ to an empty set $D_{n+1}(A)$, which occurs if $A = \mathbb{Z}$. As a rough proof of this, we can chop $[a,b]$ in half, and we get two shorter intervals, one of which still has countably infinite points. We can repeatedly halve all our intervals, and still one will have infinitely many points. As these intervals get smaller and smaller, we end up creating at least one dense limit point. Since it’s not pathalogical, eventually $D_n(A)$ will not be countably infinite, and it won’t immediately be empty too, thus it must be finite or have non-zero measure, allowing us to average it.

So, tada! I present a method of averaging countably infinite sets which intuitively makes sense (at least to me). Now what? What about probabilities and integrals? Typically integrals are done in respect to measure, they have scale and what not. However, countable sets lack this, so I think an averages are more akin to what actually works. So, with that in mine, let us define our “integral” off of this. First, let’s say:

$$ \int_{A} x = average(A) = \frac{\sum_{x \in D_n(A)} x}{|D_n(A)|} \textrm{ if } D_n(A) \textrm{ is finite or } \frac{\int_{D_n(A)} x dx}{\int_{D_n(A)} 1 dx} \textrm{ if } D_n(A) \textrm{ has nonzero measure}$$

Then, we can extend this for general $f$ like so: 
$$ \int_{A} f(x) = \frac{\sum_{x \in D_n(A)} f(x)}{|D_n(A)|} \textrm{ if } D_n(A) \textrm{ is finite or } \frac{\int_{D_n(A)} f(x) dx}{\int_{D_n(A)} 1 dx} \textrm{ if } D_n(A) \textrm{ has nonzero measure}$$

For continuous $f$, this should be consistent with psuedo-Cesaro-sums on dense limit points. So, this would be really interesting if the justification of property 2 is consistent with “random” psuedo-Cesaro-sums. Thus concludes integration. From there, hopefully we’re just a hop and a skip away from incorporating probabilities. Yet unfortunately, my attention has been drained writing this, and lack the energy to decipher what precise model of probability you desire to implement. Perhaps you can take it from here, basing things off of Bayesian priors, or convolutions. If you are confused, or have further thoughts and ideas, just let me know, and I’ll be excited to offer some more effort into this.

Zach Hunter
  • 1,858
  • for clarification: by psuedo-Cesaro-sum I mean $\lim_{n \to \infty} \frac{1}{n} \sum_{i=0}^n a_i$ – Zach Hunter Aug 23 '19 at 00:50
  • 1
    Thanks! I am hoping to create a measure that matches the results of your sum when $A\cap[a,b]$ is countable; and matches the results of the Lebesgue Integral when the Lebesgue measure of $A\cap[a,b]$ is 1. However if $A\cap[a,b]$ is uncountable and it’s Lebesgue measure is less than $1$, it can be divided into countable or null sub-intervals with the largest possible lengths, where we apply your sum, and uncountable sub-intervals where we apply the Lebesgue Integral. – Arbuja Aug 23 '19 at 02:44
  • yes, and by definition of this "integral", we revert to the Lebesgue integral when $A$, or I guess, $A \cap [a,b]$, has non-zero measure. – Zach Hunter Aug 23 '19 at 02:48
  • Also, does your intuition match my intuition? For example if $P(x)$ is defined for $x\in A\cap [0,1]$, where $A=\left{(\sqrt{m})/(\sqrt{4n+2}): m,n \in \mathbb{Z}\right}$ and $P(x)$ equals $x$ for $A_1=\left{(\sqrt{2m+1})/(\sqrt{4n+2})\right}$ and equals $1$ for $A_2=\left{(\sqrt{2m+1})/(\sqrt{2n+1})\right}$, then what is the average using your sum? (You don’t have to show the steps.) I get something like $\frac{2}{3}\int_{0}^{1} x \ + \frac{1}{3}\int_{0}^{1} 1= 2/3$. – Arbuja Aug 23 '19 at 03:16
  • If they are both dense on $[0,1]$, then I'd say their average is $\frac{\frac{\int_0^1 x dx}{1}+\frac{\int_0^1 1 dx}{1}}{2} = \frac{3}{4}$ – Zach Hunter Aug 23 '19 at 03:54
  • My intuition states if $A_1=\left{\frac{\sqrt{2m+1}}{\sqrt{12n+6}}:m,n\in\mathbb{Z}\right}$, $A_2=\left{\frac{\sqrt{2m+1}}{\sqrt{2n+1}}:m,n\in\mathbb{Z}\right}$ and $A=A_1 \cup A_2$, then $A_1$ has a "measure" of $2/3$ and $A_2$ has a "measure" of $1/3$. This is because when $x\in A_1$ is reduced it can only be written as $\left{\frac{\sqrt{2m+1}}{\sqrt{4n+2}}:m,n\in\mathbb{Z}, \gcd{(\sqrt{2m+1},\sqrt{4n+2})}=1\right}$ and $A_2$ can only be written as $\left{\frac{\sqrt{4m+2}}{\sqrt{4n+2}}:\gcd(\sqrt{4n+2},\sqrt{4n+2})=2\right}$.................................... – Arbuja Aug 23 '19 at 18:19
  • ........................Since the "denominators" of the functions in both sets are the same, we can apply something "similar to asymptotic density" on their numerators such that $d(A_1)=\frac{\left{2m+1\right}\cap\left(\left{2m+1\right}\cup\left{4n+2\right} \right)\cap[0,v]}{\left(\left{2m+1\right}\cup\left{4n+2\right} \right)\cap[0,v]}=2/3$ and $d(A_1)=\frac{\left{4m+2\right}\cap\left(\left{2m+1\right}\cup\left{4n+2\right} \right)\cap[0,v]}{\left(\left{2m+1\right}\cup\left{4n+2\right} \right)\cap[0,v]}=1/3$ – Arbuja Aug 23 '19 at 18:19
  • 2
    Currently I'm defining "measure" of $A$ as the measure of $D(A)$. If you can formalize your intuition of measure, then the getting the averages to work would be an easy extension. But from what I have seen you have not sufficiently defined measure. Could you try to formally list the properties you seek to capture with this "measure"? – Zach Hunter Aug 23 '19 at 19:15
  • I don't know how to? I could wait years to master the concepts but I wanted to know if I am on the right track. – Arbuja Aug 29 '19 at 19:58
  • I think mathworker21 solved the question? There is no unique intuitive average but there are special cases. – Arbuja Sep 01 '19 at 02:46
  • Mine will give you a unique average which is well defined. – Zach Hunter Sep 01 '19 at 02:48
  • might I ask what’s wrong with what I have now? my results match lebesgue measure when A is dense, matches mathworker21’s intuition in his last example, and it is always unique and well defined . – Zach Hunter Sep 01 '19 at 16:26
  • @ZacharyHunter my answer is unique when $D(D(\dots D(A)))$ eventually terminates, and my answer is defined when it doesn't. Also, what if $D(A)$ is countably infinite but measure $0$? How does your answer work then? – mathworker21 Sep 02 '19 at 00:27
  • If $D(A)$ is countable infinite then we look $D(D(A))$, but I admitted that I have nothing concrete if that doesn’t terminate. I assume we’d take a limit of $D(\dots D(A)\dots )$. However, are we sure this can occur in a finite segment? because $D(D(A)) \subsetneq D(A)$ if $D(A)$ is finite. – Zach Hunter Sep 02 '19 at 00:59
  • @ZacharyHunter I think looking at $D(D(A))$ if $D(A)$ is countably infinite is not intuitive. For example, take $A$ with $D(A) = {1,\frac{1}{2},\frac{1}{3},\frac{1}{4},\dots}\cup{0}$ (e.g. $A$ consists of ${1,\frac{1}{2},\frac{1}{3},\frac{1}{4},\dots}\cup{0}$ together with exponentially small countably infinite sets around each $\frac{1}{k}$, with $\frac{1}{k}$ being the only limit point in that exponentially small countably infinite set around $\frac{1}{k}$). Then your average would only output $P(0)$, since $D(D(A)) = {0}$. However, I think we really should be – mathworker21 Sep 02 '19 at 18:23
  • averaging over all the points $\frac{1]{k}$ and $0$, since $A$ genuinely contains those points. – mathworker21 Sep 02 '19 at 18:24
  • No I actually like that feature, and intentionally chose to reflect that, because to my intuition zero is infinitely more dense than other points – Zach Hunter Sep 02 '19 at 18:28
  • also, does the average of that sum reach zero in the limit? – Zach Hunter Sep 02 '19 at 18:43
  • @ZacharyHunter If $P(x)=x^2$ for $x\in[0,1/3]$, $P(x)=\sqrt{x}$ for $x\in[1/3,2/3]\cap\mathbb{Q}$ and $P(x)=3$ for $x\in[1/3,1]\cap\mathbb{R}\setminus\mathbb{Q}$, what is the average? – Arbuja Sep 03 '19 at 21:24
  • Since $\mathbb{r}$ is dense, it’s measure is greater than both other sets, thus it would be 3 – Zach Hunter Sep 03 '19 at 21:33
  • @ZacharyHunter Intuitively over $[0,1]$, the average of $P$ should be $\frac{1}{3}\int_{0}^{1} x^2 dx + \frac{2}{3}\int_{0}^{1} 3 dx= 19/9$. – Arbuja Sep 03 '19 at 22:22
  • Countable set have zero measure so they matter infinitely less in the average than anything with measure, such as ℝ – Zach Hunter Sep 03 '19 at 22:25
  • @ZacharyHunter Does my answer give any ideas? Does any of it make sense? – Arbuja Sep 04 '19 at 16:21
  • No it doesn’t really make sense how countable sets effect the average when there are sets with measure, you just get inconsistencies at this point – Zach Hunter Sep 04 '19 at 16:23
  • @ZacharyHunter I meant my answer to my original post. I was thinking about using Folner Sequences. – Arbuja Sep 05 '19 at 15:26
  • @ZacharyHunter Did you read my answer to my post. I was thinking about using Folner Sequences. What do you think? Does it make sense? – Arbuja Sep 10 '19 at 18:56
  • @ZacharyHunter Ok I guess it makes no sense. – Arbuja Sep 12 '19 at 23:14
  • @ZacharyHunter See the new answer to my question. Everything should make sense this time. – Arbuja Jun 07 '20 at 18:10
  • @ZacharyHunter I've made more progress. If I'm not correct I'm one step closer. See this answer. – Arbuja Jun 18 '20 at 17:59