Up until now, we have avoided considering the wave nature of light which introduces diffraction from interference of light coming from different parts of the aperture. Because of diffraction, images of a point source will be slightly blurred. From simple geometric arguments, we can estimate the size of the blur introduced from diffraction:
To work out in detail the shape of the images formed from diffraction
involves understanding wave propagation. Basically, one integrates
over all of the source points in the aperture (or exit pupil for
an optical system), determining the contribution of each point at
each place in the image plane. The contributions are all summed taking
into account phase differences at each image point, which causes
reinforcment at some points and cancellation at others. The expression
which sums all of the individual source points is called the
diffraction integral. When the details are worked out, one finds
that the intensity in the image plane is related to the intensity
and phase at the exit pupil. In fact the wavefront is described at
any plane by the optical transfer function, which gives the
intensity and phase of the wave at all locations in that plane. The
OTF at the pupil plane and at the image plane are a Fourier transform
pair. Consequently, we can determine the light distribution in the
image plane by taking the Fourier transform of the pupil plane;
the light distribution, or point spread function, is just the
modulus-squared of the OTF at the image plane. Symbolically, we have
For the simple case of a plane wave with no phase errors, the
diffraction integral can be solved analytically. The result for
a circular aperture with a central obscuration, when the fractional
radius of the obscuration is given by , the expression
for the PSF is:
This expression gives the so-called Airy pattern which has a central disk surrounded by concentric dark and bright rings. One finds that the radius of the first dark ring is at the physical distance , or alternatively, the angular distance . This gives the size of the Airy disk.
For more complex cases, the diffraction integral is solved numerically by doing a Fourier transform. The pupil function is often more complex than a simple circle, because there are often additional items which block light in the pupil, such as the support structures for the secondary mirror.
This figure shows the Airy pattern, both without obscurations, and with a central obscuration and spiders in a setup typical of a telescope.
In addition, there may be phase errors in the exit pupil, because of the existence of any one of the sources of aberration discussed above. For general use, is often expressed as an series, where the expansion is over a set of orthogonal polynomials for the aperture which is being used. For circular apertures with (or without) a central obscuration (the case most often found in astronomy), the appropriate polynomials are called Zernike polynomials. The lowest order terms are just uniform slopes of phase across the pupil, called tilt, and simply correspond to motion in the image plane. The next terms correspond to the expressions for the OPD which we found above for focus, astigmatism, coma, and spherical aberration, generalized to allow any orientation of the phase errors in the pupil. Higher order terms correspond to higher order aberrations.
This figure shows the form of some of the low order Zernike terms: the first corresponds to focus aberration, the next two to astigmatism, the next two to coma, the next two to trefoil aberration, and the last to spherical aberration.
A wonderful example of the application of all of this stuff was in the diagnosis of spherical aberration in the Hubble Space Telescope, which has been corrected in subsequent instruments in the telescope, which introduce spherical aberration of the opposite sign. To perform this correction, however, required and accurate understanding of the amplitude of the aberration. This was derived from analysis of on-orbit images, as shown in this figure. Note that it is possible in some cases to try to recover the phase errors from analysis of images. This is called phase retrieval. There are several ways of trying to do this, some of which are complex, so we won't go into them, but it's good to know that it is possible. But an accurate amplitude of spherical aberration was derived from these images. This derived value was later found to correspond almost exactly to the error expected from an error which was made in the testing facility for the HST primary mirror, and the agreement of these two values allowed the construction of new corrective optics to proceed...