Because of the discrete nature of computers, we can only approximate the solution to a PDE at certain grid points on the plane. We label these points of approximation where and and employ the abbreviations and , where is the numerical approximation at the grid points of interest for , the actual PDE solution. We are interested, of course, in making as small as possible. To accomplish this we must consider the accuracy and the stability of our approximation scheme.
The accuracy of a scheme is really just a matter of minimizing the error term in the Taylor appoximation for the scheme. Because of its simplicity --- and its applicablity to eqns (7)-(9) --- we will use as an example the first order linear traveling wave equation
From Taylor's formula we have that
where and also that
where , so it follows that
since the and the terms cancel by applying eq (10). This suggests the following numerical algorithm to simulate eq (10):
where . This algorithm explicitly shows how to compute values for a new time, , given information at the current time, . To compute the value at a fixed time , the algorithm must be run through time steps. Since the error from each step is , the total error between the computed solution and the actual solution at is --- assuming that the errors from each time step can be added together (which is not the case when the algorithm is unstable as we will soon discuss). Due to the form of the total error (if the algorithm is stable), we refer to the algorithm as being ``first order accurate in x and first order accurate in t". If, instead of using the Taylor expansion in eq (11), we use
we end up with a similar numerical scheme that also is first order accurate in both x and t:
The scheme in eq (14) is called a downwind scheme; the scheme in eq (15) is called an upwind scheme. We could also subtract eq (15) from eq (11), which yields
which implies that the numerical scheme
is first order accurate in time and second order accurate in space since the error at a fixed time value is . Obviously, one wishes to employ schemes with higher order errors since these schemes require less grid points (and therefore less computational time) to obtain a computed solution that is within a given error of the actual solution.
There is, however, more to numerical simulation of PDEs than just Taylor expansions and accuracy questions. Consider eq (10) where the domain of x is infinite and the initial condition is specified:
The unique solution to this equation is simply
In other words the value of the solution is constant along each line x=at + C. These lines are called characteristic lines, and they occur when the domain of x is finite as well as infinite. In other words the value of the solution at a point depends only on which characteristic line crosses through that point. If we consider x to be the abscissa and t the ordinate, then each characteristic line has a positive slope of . The constant a is often referred to as the wave speed since it specifies how fast a disturbance at a point x propagates down to other values of x. Now consider what happens when we apply the downwind scheme in eq (14) in an attempt to solve eq (10). The downwind scheme only uses the points and to estimate the value of the point , but we know from the characteristic lines that the value at this point only depends on the value of the solution at if , which is outside the range of the two points being used. This causes the computer generated solution using the downwind scheme to go haywire and yield gibberish. One may wonder how can this happen since we know the scheme is accurate. The answer is that accuracy only considers local error, whereas the stability problem that is occuring in this case is a global issue.
The Taylor series analysis of error in determining accuracy assumes that the solution is correct at , and then finds the error that is created when the solution at is approximated. However, if the error accrued in a step is magnified by each subsequent step, we can no longer just add the individual errors produced at each step; in fact, when the errors are magnified by subsequent steps, we observe that the solution is unstable, that is, the global error quickly blows up. When a scheme attempts to approximate the solution at a point\ whose characteristic line crosses the line outside the range of points used by the scheme, then the scheme, no matter how accurate, will always magnify errors in each step and therefore be unstable. In other words the numerical range of dependence must always contain the theoretical range of dependence of the solution or instablity will occur. This principle is called the CFL (Courant-Friedrichs-Lewy) condition (sometimes referred to simply as the Courant condition) [7]. This guarantees that the downwind scheme will always be unstable and that the upwind scheme can only be stable if . Even if a scheme satisfies the CFL condition, it may still be unstable, but we will not be analyzing any schemes that possess this flaw in this paper.
Before analyzing the FPU nonlinear wave equations, we wish to apply the above analysis to the linear wave equation, , where we use to denote the constant for reasons that will soon be apparent. If we define where and then we have that the wave equation is satisfied, , and also that the mixed partials are equal, . This allows us to write the wave equation as the following first order partial differential system:
Eq (21), of course, is just the matrix form of the travelling wave equation from eq (10), and its solution contains many of the same properties as eq (10). As opposed to the solution depending upon one characteristic line, the solution, , now depends on two characteristic lines whose wave speeds are c and -c, the eigenvalues of . Therefore the slopes of the two characteristic lines are the reciprocals of the wave speeds, and . As before, we must satisfy the CFL stability condition, so we now require that .
All that remains is to find a sufficiently accurate scheme. To obtain an approximation for we add eq (11) to eq (15), yielding
which we can combine with the analogous expression from the Taylor expansions with respect to time to yield the numerical scheme
which we can reexpress as
We note that this scheme is 2nd order accurate with respect to both time and space.
We are now ready to consider the nonlinear FPU equations. We begin with the quadratic FPU equation
where and . (Note that since , like the spring constant k, is always positive in any realistic spring model.) As with the linear wave equation we can rewrite eq (24) in vector form, but now the matrix A will depend upon the value of :
The wave speeds of the two characteristics are still the eigenvalues of A, which are . (Note that we require to be positive since otherwise the wave speeds would be imaginery and the partial differential equation would no longer be hyperbolic, which leads to equations beyond our analysis here and causes the scheme to become unstable). Since the speeds are functions of , that means the speeds vary, which implies that the characteristics are now curves whose slopes vary as they progress in time as opposed to being straight lines as was the case in the linear wave equation. This leads to a stability problem as the CFL condition now requires that . If is small compared to , then will be positive and the variations in may not be severe enough to violate the CFL condition, but if the nonlinearity becomes more significant, which means the value of becomes larger, eventually the CFL condition will be violated (or will become nonpositive) and the scheme will become unstable. (Note: in the experiments in section 5, we will set and use values of that are slightly smaller than 1 which will cause the CFL condition to be violated before becomes nonpositive.) Since by definition , we see that if gets large not only does the numerical scheme become unstable, but also the underlying differential equation becomes less valid since higher order terms in the Taylor approximation given in eq (4) start to become significant.
The analysis for the cubic FPU equations is similar. We now have that
where , which can be expressed in matrix form as
The corresponding CFL condition is . Note that if is positive, the wave speeds will be real and we will always have a hyperbolic differential equation.
There are numerous accurate schemes that one can apply in the region where the algorithm is stable. To simulate the quadratic FPU equation, eq (8), we can combine eq (17) with eq (23) to form
which can be rewritten as a scheme which is second order accurate in time and space:
Eq (28) is a generalization of the FPU quadratic scheme (eq (ii)). After setting and , one can quickly establish that the right hand sides of eq (28) and eq (ii) are identical.
Similary, we can simulate the cubic FPU equation, eq (9), by again combining eq (17) with eq (23) to obtain the following second order accurate scheme:
The fact that the scheme in eq (30) is not the same as FPU's cubic scheme (eq (iii)) does not pose a problem. Either FPU's scheme or eq (30) can be used to simulate the cubic FPU equation as both schemes are second order accurate and will therefore produce essentially equally valid simulations. (The second order accuracy of the FPU's cubic scheme is quickly established using Taylor series.)
There is another phenomenon that is important in nonlinear hyperbolic partial differential equations. Because the characteristics are curves, characteristics within a family can intersect each other. (In the linear case, each family of characteristic lines has a constant slope: for one family, for the other family.) When the characteristics intersect, a shock, i.e., a discontinuity in the solution, occurs. While multiple weak solutions to the nonlinear equation are possible, one usually desires to simulate the unique observed solution that the phenomena being modeled exhibits. For most physical phenomena the solution of interest is the Lax entropy solution of the equation, which can be simulated using monotone schemes [8]. A particularly nice introduction to shock capturing monotone schemes is presented in [9].
We do not pursue these schemes here, however, as we are interested in other aspects of the transition to instability.