Writing the article on Bourgain’s proof of the spherical maximal function theorem I suddenly recalled another interesting proof that uses a trick very similar to that of Bourgain – and apparently directly inspired by it. Recall that the “trick” consists of the following fact: if we consider only characteristic functions as our inputs, then we can split the operator in two, estimate these parts each in a different Lebesgue space, and at the end we can combine the estimates into an estimate in a single space by optimising in some parameter. The end result looks as if we had done “interpolation”, except that we are “interpolating” between distinct estimates for distinct operators!
The proof I am going to talk about today is a very simple proof given by Tony Carbery of the well-known Stein-Tomas restriction theorem. The reason I want to present it is that I think it is nice to see different incarnations of a single idea, especially if applied to very distinct situations. I will not spend much time discussing restriction because there is plenty of material available on the subject and I want to concentrate on the idea alone. If you are already familiar with the Stein-Tomas theorem you will certainly appreciate Carbery’s proof.
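As you might recall, the Stein-Tomas theorem says that if $R$ denotes the Fourier restriction operator of the sphere $\mathbb{S}^{d-1}$ (but of course everything that follows extends trivially to arbitrary positively-curved compact hypersurfaces), that is

$$ Rf := \widehat{f}\,\big|_{\mathbb{S}^{d-1}} $$

(defined initially on Schwartz functions), then

$$ \|Rf\|_{L^2(\mathbb{S}^{d-1}, d\sigma)} \lesssim_p \|f\|_{L^p(\mathbb{R}^d)} \qquad (1) $$

for all exponents $p$ such that $1 \leq p \leq \frac{2(d+1)}{d+3}$ (and this is sharp, by the Knapp example).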
There are a number of proofs of this statement; originally it was proven by Tomas for every exponent except the endpoint, and then Stein combined the proof of Tomas with his complex interpolation method to obtain the endpoint too (and this is still one of the finest examples of the power of the method around).
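Carbery’s proof obtains the restricted endpoint inequality directly, and therefore obtains inequality (1) for all exponents $1 \leq p < \frac{2(d+1)}{d+3}$ by interpolation of Lorentz spaces with the $p = 1$ case (which is a trivial consequence of the Hausdorff-Young inequality). In other words, Carbery proves that for every measurable set $E \subset \mathbb{R}^d$ of finite measure

$$ \|R\mathbf{1}_E\|_{L^2(\mathbb{S}^{d-1}, d\sigma)} \lesssim |E|^{\frac{d+3}{2(d+1)}}, \qquad (2) $$

where the RHS is clearly the $L^{2(d+1)/(d+3)}$ norm of the characteristic function $\mathbf{1}_E$. Notice that we could write the inequality equivalently as $\|\widehat{\mathbf{1}_E}\|_{L^2(\mathbb{S}^{d-1}, d\sigma)} \lesssim |E|^{\frac{d+3}{2(d+1)}}$.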
1. The proof
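We prove inequality (2) right away. The proof follows the $TT^\ast$ method to start: we have

$$ \|R\mathbf{1}_E\|_{L^2(d\sigma)}^2 = \langle R\mathbf{1}_E, R\mathbf{1}_E \rangle_{L^2(d\sigma)} = \langle R^\ast R\,\mathbf{1}_E, \mathbf{1}_E \rangle = \int_{\mathbb{R}^d} (\mathbf{1}_E \ast \check{\sigma})(x)\,\mathbf{1}_E(x)\,dx, $$

where in the last step we have used Plancherel (in the manipulations above I am being a bit liberal; you can check that everything can be justified); also, $\check{\sigma}$ is the inverse Fourier transform of the spherical measure $\sigma$. Notice the operator $f \mapsto f \ast \check{\sigma}$ is exactly $R^\ast R$, where $R^\ast$ is the (formal) adjoint of $R$.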
We will use Hölder’s inequality to bound the above, but FIRST we will split the spherical measure into two parts in terms of its frequencies, according to a cutoff that we will choose carefully at the end of the argument. This is exactly how Bourgain proceeds for the spherical maximal function!
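Let therefore $\chi$ denote a Schwartz function such that $\chi \equiv 1$ on the unit ball $B(0,1)$ and $\chi \equiv 0$ outside $B(0,2)$. For a certain positive parameter $\lambda > 0$ that will be chosen later (this is the threshold that separates low frequencies from high frequencies), we define $K_1, K_2$ to be given by

$$ K_1(x) := \check{\sigma}(x)\,\chi(x/\lambda), \qquad K_2(x) := \check{\sigma}(x)\,\big(1 - \chi(x/\lambda)\big); $$

thus $K_1$ is the low frequency part (frequencies $\lesssim \lambda$) and $K_2$ is the high frequency part (frequencies $\gtrsim \lambda$).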
[The terminology here is a bit misleading: the surface measure $\sigma$ already lives in the frequency space $\widehat{\mathbb{R}^d}$, so $\check{\sigma}$ actually lives in physical space $\mathbb{R}^d$; but the two are equivalent, and so we can retain this point of view.]
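Using these two functions we can split the operator $R^\ast R$ into the operators $T_1, T_2$:

$$ T_j f := f \ast K_j, \quad j = 1, 2, \qquad \text{so that} \qquad R^\ast R = T_1 + T_2. $$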
Precisely as in Bourgain’s case, it will not be necessary to estimate the operators $T_1, T_2$ in the same norms: we can estimate them separately in different spaces and then optimise in the parameter $\lambda$, taking advantage of the fact that $\|\mathbf{1}_E\|_{L^p} = |E|^{1/p}$ and therefore we can pass from one Lebesgue space to another by changing the exponent of $|E|$.
It turns out that it is extremely easy to estimate $T_1, T_2$, since the spaces that do the job are very convenient ones – in the end, we will reduce to a couple of $L^\infty$ estimates, namely for $K_2$ and $\widehat{K_1}$.
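1.1. $L^1 \to L^\infty$ estimate for $T_2$
Starting with $T_2$, which we will estimate in $L^1 \to L^\infty$, we argue trivially that

$$ |\langle T_2 \mathbf{1}_E, \mathbf{1}_E \rangle| \leq \|\mathbf{1}_E \ast K_2\|_{L^\infty} \|\mathbf{1}_E\|_{L^1} \leq \|K_2\|_{L^\infty} \|\mathbf{1}_E\|_{L^1}^2 \lesssim \lambda^{-\frac{d-1}{2}} |E|^2, $$

where the last step uses the decay $|\check{\sigma}(x)| \lesssim (1+|x|)^{-\frac{d-1}{2}}$ together with the fact that $K_2$ is supported in $\{|x| \geq \lambda\}$.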
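The only input in this step is the decay of the inverse Fourier transform of the spherical measure at rate $|x|^{-(d-1)/2}$. As a quick sanity check, here is a minimal Python sketch for $d = 3$, where (assuming the normalisation $e^{2\pi i x \cdot \xi}$ for the Fourier transform, which is my choice here) one has the closed form $\check{\sigma}(x) = 2\sin(2\pi|x|)/|x|$. The function names are hypothetical; the sketch compares the closed form with a direct quadrature and then checks that the supremum over $|x| \geq \lambda$ is comparable to $\lambda^{-1}$.

```python
import math

def sigma_check_closed_form(r):
    """Inverse Fourier transform of surface measure on S^2 at |x| = r (d = 3).

    Assumes the e^{2 pi i x.xi} normalisation of the Fourier transform.
    """
    if r == 0.0:
        return 4.0 * math.pi  # total surface area of S^2
    return 2.0 * math.sin(2.0 * math.pi * r) / r

def sigma_check_quadrature(r, n=20000):
    """Midpoint-rule value of 2*pi * int_{-1}^{1} cos(2*pi*r*t) dt.

    This is the same integral written in the height variable t = omega_3.
    """
    h = 2.0 / n
    total = 0.0
    for k in range(n):
        t = -1.0 + (k + 0.5) * h
        total += math.cos(2.0 * math.pi * r * t)
    return 2.0 * math.pi * total * h

def sup_beyond(lam, r_max=200.0, step=0.01):
    """Grid approximation of sup |sigma_check| over lam <= |x| <= r_max."""
    n = int((r_max - lam) / step)
    return max(abs(sigma_check_closed_form(lam + k * step)) for k in range(n + 1))

for lam in (1.0, 4.0, 16.0, 64.0):
    # In d = 3 the predicted rate is lam^{-(d-1)/2} = 1/lam,
    # so the rescaled supremum should stay bounded above and below.
    print(lam, sup_beyond(lam) * lam)
```

The rescaled supremum stays between fixed constants as $\lambda$ grows, which is what the bound on the high frequency kernel needs in this dimension.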
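1.2. $L^2 \to L^2$ estimate for $T_1$
Now we estimate $T_1$ instead – in $L^2 \to L^2$ it will be easy. Indeed, we have by Plancherel (twice)

$$ \|T_1 f\|_{L^2} = \|\widehat{f}\,\widehat{K_1}\|_{L^2} \leq \|\widehat{K_1}\|_{L^\infty} \|\widehat{f}\|_{L^2} = \|\widehat{K_1}\|_{L^\infty} \|f\|_{L^2}, $$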
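so it suffices to estimate $\|\widehat{K_1}\|_{L^\infty(d\xi)}$ (notice this is in frequency, unlike the estimate for $K_2$ – indeed, we are using $\xi$ to refer to the space of frequencies of $f$ for clarity). Now, technically we have by definition of $K_1$ that $\widehat{K_1} = \sigma \ast \varphi_\lambda$, where $\varphi_\lambda(\xi) := \lambda^d \widehat{\chi}(\lambda \xi)$. Let us use this information heuristically first, to guess the maximum height of $\widehat{K_1}$. We have that $\varphi_\lambda$ is essentially a bump function concentrated on the ball $B(0, 1/\lambda)$ and normalised so that it has integral $\chi(0) = 1$; therefore $\sigma \ast \varphi_\lambda$ is an average of $\sigma$ at scale $1/\lambda$, and it should be largest close to the unit sphere $\mathbb{S}^{d-1}$, where it is about $\lambda^d \cdot \lambda^{-(d-1)} = \lambda$ (the height of the bump times the $\sigma$-measure of a cap of radius $1/\lambda$). This is our guess.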
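It is not hard to turn the above heuristic into a rigorous estimate. Indeed, by splitting dyadically, we have

$$ |\sigma \ast \varphi_\lambda(\xi)| \leq \int_{|\xi - \omega| \leq 1/\lambda} |\varphi_\lambda(\xi - \omega)|\,d\sigma(\omega) + \sum_{j \geq 1} \int_{|\xi - \omega| \sim 2^j/\lambda} |\varphi_\lambda(\xi - \omega)|\,d\sigma(\omega), $$

and the first term is bounded by $\lambda^d\,\sigma(B(\xi, 1/\lambda)) \lesssim \lambda^d \lambda^{-(d-1)} = \lambda$, as desired. As for the other terms, since $\widehat{\chi}$ is a Schwartz function we have (taking note of the normalisation) $|\varphi_\lambda(\eta)| \lesssim_N \lambda^d (1 + \lambda|\eta|)^{-N}$ for a large $N > 0$; thus

$$ \int_{|\xi - \omega| \sim 2^j/\lambda} |\varphi_\lambda(\xi - \omega)|\,d\sigma(\omega) \lesssim_N \lambda^d\, 2^{-jN}\, \sigma(B(\xi, 2^{j+1}/\lambda)) \lesssim 2^{-j(N - d + 1)} \lambda, $$

and summing in $j$ we obtain another contribution of $O(\lambda)$ and the claim derived heuristically is proven to be correct.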
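To see the heuristic "an average of the surface measure at scale $1/\lambda$ has height about $\lambda$" in a concrete case, here is a small Python sketch for $d = 3$. It swaps in a Gaussian bump $\varphi(\xi) = e^{-\pi|\xi|^2}$ (integral $1$) in place of the actual mollifier from the argument; this is an assumption made purely so that a closed form is available. With $\varphi_\lambda(\xi) = \lambda^3 \varphi(\lambda\xi)$ and $|\xi| = \rho > 0$, integrating in the height variable over the sphere gives

$$ (\sigma \ast \varphi_\lambda)(\xi) = \frac{\lambda}{\rho}\Big(e^{-\pi\lambda^2(\rho - 1)^2} - e^{-\pi\lambda^2(\rho + 1)^2}\Big). $$

The function names below are hypothetical.

```python
import math

def mollified_sigma(rho, lam):
    """(sigma * phi_lam)(xi) for |xi| = rho > 0 in d = 3.

    Assumes a Gaussian bump phi(xi) = exp(-pi |xi|^2), which is NOT the
    mollifier used in the proof; it is chosen for its closed form.
    """
    near = math.exp(-math.pi * lam**2 * (rho - 1.0) ** 2)
    far = math.exp(-math.pi * lam**2 * (rho + 1.0) ** 2)
    return (lam / rho) * (near - far)

def sup_over_radii(lam, rho_max=3.0, n=30000):
    """Grid approximation of the supremum over 0 < |xi| <= rho_max."""
    step = rho_max / n
    return max(mollified_sigma((k + 1) * step, lam) for k in range(n))

for lam in (1.0, 4.0, 16.0, 64.0):
    # The ratio sup / lam should stay close to a fixed constant.
    print(lam, sup_over_radii(lam) / lam)
```

In this model case the supremum is attained near $\rho = 1$, that is close to the unit sphere, and its ratio to $\lambda$ stays bounded, in agreement with the guess.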
[In Bourgain’s proof the corresponding quantity to estimate was a more delicate one, and that required a little more attention; here we only need $\|\widehat{K_1}\|_{L^\infty}$ and this is much easier.]
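Thus we have shown (by Cauchy-Schwarz)

$$ |\langle T_1 \mathbf{1}_E, \mathbf{1}_E \rangle| \leq \|T_1 \mathbf{1}_E\|_{L^2} \|\mathbf{1}_E\|_{L^2} \lesssim \lambda\, \|\mathbf{1}_E\|_{L^2}^2 = \lambda |E|. $$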
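Remark: one could argue that $\|\widehat{K_1}\|_{L^\infty} \leq \|K_1\|_{L^1}$ by Hausdorff-Young and then use again the decay estimates for $\check{\sigma}$ to estimate the latter; but the bound thus obtained (of size about $\lambda^{(d+1)/2}$ for large $\lambda$, rather than $\lambda$) would be too large to be of any use. Indeed, Hausdorff-Young is only efficient for functions that look like gaussians, and $K_1$ is far from looking like one.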
1.3. Concluding the argument
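Putting the two estimates together we have shown

$$ \|R\mathbf{1}_E\|_{L^2(d\sigma)}^2 \leq |\langle T_1 \mathbf{1}_E, \mathbf{1}_E \rangle| + |\langle T_2 \mathbf{1}_E, \mathbf{1}_E \rangle| \lesssim \lambda |E| + \lambda^{-\frac{d-1}{2}} |E|^2; $$

to optimise this estimate we need to choose $\lambda$ so that $\lambda |E| = \lambda^{-\frac{d-1}{2}} |E|^2$, that is $\lambda = |E|^{\frac{2}{d+1}}$. With this value of $\lambda$ the RHS of the above inequality becomes (a constant multiple of) $|E|^{\frac{d+3}{d+1}}$, and this is precisely the RHS of (2) squared, as desired! The proof is concluded.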
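The final balancing step is pure arithmetic with exponents, and it is easy to double-check numerically. The following Python sketch (with hypothetical function names, and with all implicit constants set to $1$, which is an assumption made for illustration) takes the two contributions $\lambda|E|$ and $\lambda^{-(d-1)/2}|E|^2$, evaluates their sum at the balanced choice $\lambda = |E|^{2/(d+1)}$, and compares it with $|E|^{(d+3)/(d+1)}$, the square of $|E|^{1/p}$ at the endpoint $p = 2(d+1)/(d+3)$.

```python
def bound(lam, measure, d):
    """lam*|E| + lam^{-(d-1)/2}*|E|^2, with implicit constants set to 1."""
    return lam * measure + lam ** (-(d - 1) / 2.0) * measure**2

def balanced_lambda(measure, d):
    """The value of lam at which the two terms of `bound` are equal."""
    return measure ** (2.0 / (d + 1))

def target(measure, d):
    """|E|^{(d+3)/(d+1)}: the square of |E|^{1/p} for p = 2(d+1)/(d+3)."""
    return measure ** ((d + 3.0) / (d + 1.0))

for d in (2, 3, 5):
    for measure in (1e-3, 1.0, 1e3):
        lam = balanced_lambda(measure, d)
        # Both terms equal the target at the balanced lam, so the ratio is 2.
        print(d, measure, bound(lam, measure, d) / target(measure, d))
```

Balancing the two terms does not give the exact minimiser in every dimension, but since one term increases and the other decreases in $\lambda$, the balanced value is within a factor $2$ of the minimum, which is all that matters here.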