10

Who first defined the norm in $L_p$ space as $$\left(\int{\lvert f(x) \rvert^p}\right)^{1/p}$$

Is there any reference for this? Is it just an simple extension from $L_2$?

$L_p$ space has some really nice properties. I think it relies on the definition of this norm, but this definition does not seem to come naturally or maybe I just can't see how it comes naturally. I understand this definition gives a norm. I just want to know the underlying thought of it.

Are those nice properties just by chance or does it actually have a relationship with this definition?

Semiclassical
  • 18,592

5 Answers5

20

In a certain sense, there are basically $4$ $L^p$ spaces with $p \geq 1$: $L^1,L^2,L^\infty$, and everything else. Specifically:

  • $L^2$ is the only "Hilbertable" $L^p$ space, which gives it a wealth of additional properties.
  • For "typical" measure spaces, all $L^p$ spaces except $L^\infty$ are separable.
  • For any measure space, $L^p$ is reflexive for all $1<p<\infty$. For "typical" measure spaces, $L^1$ and $L^\infty$ are both not reflexive.
  • $L^1$ loses some theorems held by $L^p$ spaces with $1<p<\infty$ because for "typical" measure spaces it is not isomorphic to the dual space of any Banach space. (It "would" be the dual of $L^\infty$ but it is usually too small.) This means that the theorem "the unit ball in a space which is the dual of something is weak-* compact" does not apply to $L^1$, and thus neither do many of its consequences.
  • $L^\infty$ also loses a lot of other theorems held by other $L^p$ spaces...and frankly I have some trouble giving a simple explanation for why. In PDE theory many of the relevant theorems fail because approximation by compactly supported smooth functions fails in $L^\infty$, but there is something more general that makes theorems in $L^\infty$ more elusive.

Here any time I say "typical" my statements include the Lebesgue measure, all measures which are mutually absolutely continuous with respect to it, and restrictions of such measures to positive measure subsets.

I know that one place where "unusual" values of $p$ (i.e. not $1,2$ or $\infty$) pop up is in Sobolev space theory, specifically with Sobolev embedding. Here a Sobolev space in sufficiently high dimension will embed into a Sobolev space with less regularity and more integrability. For example, in dimension $6$, $W^{1,2}$ embeds into $L^3$. In dimension $2$, $W^{1,2}$ embeds into $L^p$ for all $1 \leq p < \infty$.

Ian
  • 104,572
  • Small nitpick: if one allows $L^p$ spaces for $0<p<1$ (for whatever reason -- triangle inequalities are sometimes dispensable?), then there should be 5 types: $L^1,L^2,L^\infty$, pathological/non-metric ones, and everything else. – Clement C. Mar 08 '16 at 00:19
  • @ClementC. Valid point. Rather than actually inviting these into the discussion, I revised slightly to specifically exclude them from the discussion. – Ian Mar 08 '16 at 00:41
4

The cases $p=1,2$ are quite natural: the former is a very reasonable definition of "how large a function is" and the latter arises from Hilbert space theory: a well developed and useful theory. If you want to generalize these for arbitrary $p$, you are forced to consider $\int |f(x)|^p$. This is not a norm for arbitrary $p$ because $\int |a f(x)|^p = |a|^p \int |f(x)|^p$, and there is exactly one way to remedy this: by introducing the power $\frac{1}{p}.$ Miraculously (or maybe not?) the $L^p$ norm we constructed is actually a norm.

The most curious case is $p=\infty$. At a glance, there is no reason to call the essential supremum norm $L^\infty$ as it has nothing to do with integrals. The following result shows that under certain conditions the infinity norm is the limit case of the $p$ norm:

If $f\in L^p \cap L^\infty$, $p<q<\infty$ then $f \in L^q$ and $\lim_{q \to \infty} \|f\|_q = \|f\|_\infty$ So, naming $L^{\infty}$ like that is justified.

That being said, $L^{\infty}$ is often the worst space to work in: see @Ian's answer for details.

Svetoslav
  • 5,325
Milen Ivanov
  • 1,434
  • 7
  • 13
3

If p is close to 1, the p-norm gives more importance to the regions where the function is small and less importance to regions where it is large. If P is large, then the norm gives less importance to the regions where the function is small and more to those where it is large.

Thus varying to allows us to decide what we want to focus on.

  • That's basically true on an infinite measure space, but there is a distinct monotonicity in the finite measure space context. – Ian Mar 08 '16 at 01:34
  • The norms for different ps give different weight to the different regions independently of finiteness of the measure of the space. – Mariano Suárez-Álvarez Mar 08 '16 at 01:37
  • In terms of the actual value of the $L^p$ norm, yes, but in terms of "which functions are in which spaces", no. – Ian Mar 08 '16 at 01:38
  • But that is irrelevant. A norm is a way of measuring functions: it allows you to say "this function is larger than that one" and the choice of P fixes a criterion to do that, based on different priorities. – Mariano Suárez-Álvarez Mar 08 '16 at 01:47
  • Eh, that depends on how you think about it I guess. "Pure" real analysts often don't care what a number is as long as they know it's finite and doesn't depend on any parameters. I guess in regards to your point, what I am saying is that if the measure space is finite, it is not that small regions are weighted that much less as it is that large regions are weighted that much more, when $p$ is large. To be sure, small regions are indeed weighted considerably less in $| f |_p^p$ when $p$ is large, but if you remove that exponent that effect essentially goes away. – Ian Mar 08 '16 at 01:50
3

Lp spaces were first introduced by Frigyes Riesz (1910). (F&M Riesz both produced many important contributions to mathematics, and it is difficult to remember, which was which.)

The question in this link says that Riesz had worked on the moment problem in 1907. Later he generalized those results, and that required Minkowski and Hölder inequalities, so he developed the Lp spaces (1910). Where does the $L^p$ norm come from?

Lp spaces are named for Henri Lebesgue, although Riesz was first. Lebesgue introduced the Lebesgue integral in 1904.

Should Rogers and Hölder share the main credit for the "p" in Lp spaces and Lp norms with Riesz?

"Hölder's inequality was first found by Leonard James Rogers (1888). Inspired by Rogers' work, Hölder (1889) gave another proof as part of a work developing the concept of convex and concave functions and introducing Jensen's inequality,[1] which was in turn named for work of Johan Jensen building on Hölder's work.[2]" https://en.wikipedia.org/wiki/H%C3%B6lder%27s_inequality

Hermann Minkowski showed the Minkowski inequality for infinite sums in 1896. https://de.wikipedia.org/wiki/Minkowski-Ungleichung

It all stemmed from L2 results?

L2 Hölder inequality (i.e., Cauchy-Schwarz inequality) "for sums was published by Augustin-Louis Cauchy (1821). The corresponding inequality for integrals was published by Viktor Bunyakovsky (1859) and Hermann Schwarz (1888). Schwarz gave the modern proof of the integral version."

Hölder was German, Minkowski was German (& Polish & Jewish), Rogers was British, Riesz was Hungarian-Jewish, Lebesgue was French.

1

In a very rough and informal way, this is how I understand this definition (and I find it very natural)

Starting from finite dimension space, suppose a function $f$ has finite components, say $f_1, ...f_n$. You want to understand its certain behavior thru its components. One way of doing this is by averaging all its components in some way (here we choose $|f_i|^p$). You may choose "arithmetic average" $\displaystyle \frac {1}{p} \sum\limits_{i=0}^n |f_i|^p $, or "geometric average" $\displaystyle \big(\sum\limits_{i=0}^n |f_i|^p\big)^{1/p} $. The former is not so suitable because the usual average is not efficient to control the growth of exponential function, whereas the latter controls that much better, thus be a better candidate for a norm.

Now move to infinite dimension space. You would have "infinite" components in the norm, so you would end up with something like $||f||_p = \big(\sum\limits_{i=0}^\infty |f_i|^p\big)^{1/p}$. This makes you think of integral notation, because integral is essentially an infinite sum (think of how Riemann integral is constructed). Also you will have no more "components" but $f$ itself. Finally, you end up with something like the definition.

SiXUlm
  • 2,476
  • Neither of those is a mean (there are $n$ terms, not $p$ terms). – Ian Mar 10 '16 at 23:22
  • Yes, they are not. But I would say they "look" like mean. – SiXUlm Mar 10 '16 at 23:24
  • 2
    Not really: the $1/p$ is just the right exponent for homogeneity, that's all. – Ian Mar 10 '16 at 23:25
  • Yes. Using $p>1$ weights up the biggest elements in the sum (by $|f_i|^{p-1}$). Using $p<1$ weights down the biggest elements (by the same). Using p=1 is the neutral sum. Having $(\cdots)^{1/p}$ just produces homogeneity, i.e., $| \alpha f|_p=|\alpha||f|_p$. – user3810316 Sep 20 '22 at 17:34