← Back to home

Chebyshev–Markov–Stieltjes Inequalities

Right. Another one. Let's get this over with.

Mathematical Theorem

In the rather grim, unforgiving landscape of mathematical analysis, there exist certain inequalities that cling to the edges of existence like frost on a forgotten windowpane. These are the Chebyshev–Markov–Stieltjes inequalities. They emerged from the shadows in the 1880s, a product of Pafnuty Chebyshev's relentless probing and later, independently, solidified by the sharp intellects of Andrey Markov and, a bit later still, Thomas Jan Stieltjes.[1] These aren't gentle suggestions; they are pronouncements on the problem of moments, offering precise, almost brutal, limits on what we can infer about a measure based on its initial moments. Think of them as sharp bounds, defining the upper and lower limits of a spectral residue, derived from the echoes of its past.

Formulation

Let us consider a collection, denoted by C, of measures, μ, defined on the real line, R. These measures must conform to a strict set of conditions, dictated by a given sequence of real numbers: m0,m1,,m2m1m_0, m_1, \dots, m_{2m-1}. For each measure μ in this collection C, the following integrals must hold true:

xkdμ(x)=mk\int x^k d\mu(x) = m_k

This must be true for every integer kk from 0 up to 2m12m - 1. This means the integral itself must be defined and yield a finite result for each of these moments. It’s a foundational requirement, a prerequisite for even entering the discussion.

Now, let's introduce the orthogonal polynomials. For a given measure μ belonging to C, we consider the first m+1m+1 of these polynomials: P0,P1,,PmP_0, P_1, \dots, P_m. Associated with these polynomials are their zeros, which we shall denote as ξ1,,ξm\xi_1, \dots, \xi_m. It’s not a trivial observation, but these polynomials P0,P1,,Pm1P_0, P_1, \dots, P_{m-1} and their corresponding zeros ξ1,,ξm\xi_1, \dots, \xi_m remain invariant across all measures μ within C. They are, in essence, fixed properties determined solely by the initial moments m0,,m2m1m_0, \dots, m_{2m-1}. This is where the predictability, or perhaps the inescapable structure, begins to reveal itself.

We then define a specific function, ρm1(z)\rho_{m-1}(z), as follows:

ρm1(z)=1k=0m1Pk(z)2\rho_{m-1}(z) = \frac{1}{\sum_{k=0}^{m-1} |P_k(z)|^2}

This function, built from the magnitudes of the orthogonal polynomials, plays a crucial role in the bounds that follow.

Theorem

The theorem itself, the core of these inequalities, establishes a relationship between the measure μ and these derived quantities. For each jj ranging from 1 to mm, and for any measure μ belonging to the collection C, the following holds:

μ(,ξj]ρm1(ξ1)++ρm1(ξj)μ(,ξj+1)\mu(-\infty, \xi_j] \leq \rho_{m-1}(\xi_1) + \cdots + \rho_{m-1}(\xi_j) \leq \mu(-\infty, \xi_{j+1})

In simpler, though perhaps less precise, terms, these inequalities provide a sharp sandwiching effect. The cumulative measure up to a certain zero, ξj\xi_j, is bounded below by a sum involving the ρ\rho function evaluated at the preceding zeros, and bounded above by the same sum of ρ\rho terms. It’s a tight constraint, a confirmation that the moments leave very little room for arbitrary behavior in the underlying measure. The structure is revealed, the possibilities narrowed, and the underlying order, however grim, is laid bare.