Section 5.1 Approximating Areas
Learning Objectives.
Use the sum of rectangular areas to approximate the area under a curve.
Use Riemann sums to approximate area.
Archimedes was fascinated with calculating the areas of various shapes—in other words, the amount of space enclosed by the shape. He used a process that has come to be known as the method of exhaustion, which used smaller and smaller shapes, the areas of which could be calculated exactly, to fill an irregular region and thereby obtain closer and closer approximations to the total area. In this process, an area bounded by curves is filled with rectangles, triangles, and shapes with exact area formulas. These areas are then summed to approximate the area of the curved region.
In this section, we develop techniques to approximate the area between a curve, defined by a function \(f(x),\) and the \(x\)-axis on a closed interval \([a,b].\) Like Archimedes, we first approximate the area under the curve using shapes of known area (namely, rectangles). By using smaller and smaller rectangles, we get closer and closer approximations to the area. Taking a limit allows us to calculate the exact area under the curve.
Let’s start by introducing some notation to make the calculations easier. We then consider the case when \(f(x)\) is continuous and nonnegative. Later in the chapter, we relax some of these restrictions and develop techniques that apply in more general cases.
Subsection 5.1.1 Sigma (Summation) Notation
As mentioned, we will use shapes of known area to approximate the area of an irregular region bounded by curves. This process often requires adding up long strings of numbers. To make it easier to write down these lengthy sums, we look at some new notation here, called sigma notation (also known as summation notation). The Greek capital letter \(\Sigma,\) sigma, is used to express long sums of values in a compact form. For example, if we want to add all the integers from 1 to 20 without sigma notation, we have to write
We could probably skip writing a couple of terms and write
which is better, but still cumbersome. With sigma notation, we write this sum as
which is much more compact.
Typically, sigma notation is presented in the form
where \(a_i\) describes the terms to be added, and the \(i\) is called the index. Each term is evaluated, then we sum all the values, beginning with the value when \(i=1\) and ending with the value when \(i=n.\) For example, an expression like \(\displaystyle{ \sum_{i=2}^{7} s_i}\) is interpreted as \(s_2+s_3+s_4+s_5+s_6+s_7.\) Note that the index is used only to keep track of the terms to be added; it does not factor into the calculation of the sum itself. The index is therefore called a dummy variable. We can use any letter we like for the index. Typically, mathematicians use \(i\text{,}\) \(j\text{,}\) \(k\text{,}\) \(m\text{,}\) and \(n\) for indices.
Theorem 5.2. Rule: Properties of Sigma Notation.
Let \(a_1,a_2 , \dots , a_n\) and \(b_1,b_2 , \dots , b_n\) represent two sequences of terms and let \(c\) be a constant. The following properties hold for all positive integers \(n\) and for integers \(m\text{,}\) with \(1\leq m\leq n.\)
- \begin{equation*} \displaystyle{ \sum_{i=1}^{n} c}=nc \end{equation*}
- \begin{equation*} \displaystyle{ \sum_{i=1}^{n} ca_i}=c\displaystyle{ \sum_{i=1}^{n} a_i} \end{equation*}
- \begin{equation*} \displaystyle{ \sum_{i=1}^{n} (a_i+b_i)}=\displaystyle{ \sum_{i=1}^{n} a_i}+\displaystyle{ \sum_{i=1}^{n} b_i} \end{equation*}
- \begin{equation*} \displaystyle{ \sum_{i=1}^{n} (a_i-b_i)}=\displaystyle{ \sum_{i=1}^{n} a_i}-\displaystyle{ \sum_{i=1}^{n} b_i} \end{equation*}
- \begin{equation*} \displaystyle{ \sum_{i=1}^{n} a_i}=\displaystyle{ \sum_{i=1}^{m} a_i}+\displaystyle{ \sum_{i=m+1}^n a_i} \end{equation*}
Subsection 5.1.2 Approximating Area
Now that we have the necessary notation, we return to the problem at hand: approximating the area under a curve. Let \(f(x)\) be a continuous, nonnegative function defined on the closed interval \([a,b].\) We want to approximate the area \(A\) bounded by \(f(x)\) above, the \(x\)-axis below, the line \(x=a\) on the left, and the line \(x=b\) on the right (Figure 5.3).
How do we approximate the area under this curve? The approach is a geometric one. By dividing a region into many small shapes that have known area formulas, we can sum these areas and obtain a reasonable estimate of the true area. We begin by dividing the interval \([a,b]\) into \(n\) subintervals of equal width, \(\frac{b-a}{n}.\) We do this by selecting equally spaced points \(x_0,x_1,x_2 , \dots , x_n\) with \(x_0=a,x_n=b,\) and
for \(i=1,2,3 , \dots , n.\)
We denote the width of each subinterval with the notation \(\Delta x \text{,}\) so \(\Delta x =\frac{b-a}{n}\) and
for \(i=1,2,3 , \dots , n.\) This notion of dividing an interval \([a,b]\) into subintervals by selecting points from within the interval is used quite often in approximating the area under a curve, so let’s define some relevant terminology.
Definition 5.4.
A set of points \(P={x_i}\) for \(i=0,1,2 , \dots , n\) with \(a=x_0\lt x_1\lt x_2\lt \cdots \lt x_n=b,\) which divides the interval \([a,b]\) into subintervals of the form \([x_0,x_1],[x_1,x_2] , \dots , [x_{n-1} ,x_n]\) is called a partition of \([a,b].\) If the subintervals all have the same width, the set of points forms a regular partition of the interval \([a,b].\)
We can use this regular partition as the basis of a method for estimating the area under the curve. We next examine two methods: the left-endpoint approximation and the right-endpoint approximation.
Note 5.5. Rule: Left-Endpoint Approximation.
On each subinterval \([x_{i-1} ,x_i]\) (for \(i=1,2,3 , \dots , n),\) construct a rectangle with width \(\Delta x \) and height equal to \(f(x_{i-1} ),\) which is the function value at the left endpoint of the subinterval. Then the area of this rectangle is \(f(x_{i-1} )\Delta x .\) Adding the areas of all these rectangles, we get an approximate value for \(A\) (Figure 5.6). We use the notation L_n to denote that this is a left-endpoint approximation of \(A\) using \(n\) subintervals.
The second method for approximating area under a curve is the right-endpoint approximation. It is almost the same as the left-endpoint approximation, but now the heights of the rectangles are determined by the function values at the right of each subinterval.
Note 5.7. Rule: Right-Endpoint Approximation.
Construct a rectangle on each subinterval \([x_{i-1} ,x_i],\) only this time the height of the rectangle is determined by the function value \(f(x_i)\) at the right endpoint of the subinterval. Then, the area of each rectangle is \(f(x_i)\Delta x \) and the approximation for \(A\) is given by
The notation \(R_n\) indicates this is a right-endpoint approximation for \(A\) (Figure 5.8).
The graphs in Figure 5.9 represent the curve \(f(x)=\frac{x^2}{2}.\) In graph (a) we divide the region represented by the interval \([0,3]\) into six subintervals, each of width 0.5. Thus, \(\Delta x =0.5.\) We then form six rectangles by drawing vertical lines perpendicular to \(x_{i-1} ,\) the left endpoint of each subinterval. We determine the height of each rectangle by calculating \(f(x_{i-1} )\) for \(i=1,2,3,4,5,6.\) The intervals are \([0,0.5],[0.5,1],[1,1.5],[1.5,2],[2,2.5],[2.5,3].\) We find the area of each rectangle by multiplying the height by the width. Then, the sum of the rectangular areas approximates the area between \(f(x)\) and the \(x\)-axis. When the left endpoints are used to calculate height, we have a left-endpoint approximation. Thus,
In Figure 5.9(b), we draw vertical lines perpendicular to \(x_i\) such that \(x_i\) is the right endpoint of each subinterval, and calculate \(f(x_i)\) for \(i=1,2,3,4,5,6.\) We multiply each \(f(x_i)\) by \(\Delta x \) to find the rectangular areas, and then add them. This is a right-endpoint approximation of the area under \(f(x).\) Thus,
Example 5.10. Approximating the Area Under a Curve.
Use both left-endpoint and right-endpoint approximations to approximate the area under the curve of \(f(x)=x^2\) on the interval \([0,2];\) use \(n=4.\)
First, divide the interval \([0,2]\) into \(n\) equal subintervals. Using \(n=4,\Delta x =\frac{(2-0)}{4}=0.5.\) This is the width of each rectangle. The intervals \([0,0.5],[0.5,1],[1,1.5],[1.5,2]\) are shown in Figure 5.11. Using a left-endpoint approximation, the heights are \(f(0)=0,f(0.5)=0.25,f(1)=1,f(1.5)=2.25.\) Then,
The right-endpoint approximation is shown in Figure 5.12. The intervals are the same, \(\Delta x =0.5,\) but now use the right endpoint to calculate the height of the rectangles. We have
The left-endpoint approximation is 1.75; the right-endpoint approximation is 3.75.
Checkpoint 5.13.
Sketch left-endpoint and right-endpoint approximations for \(f(x)=\frac{1}{x}\) on \([1,2];\) use \(n=4.\) Approximate the area using both methods.
Follow the solving strategy in Example 5.10 step-by-step.
The left-endpoint approximation is 0.7595. The right-endpoint approximation is 0.6345. See below.
Looking at Figure 5.9 and the graphs in Example 5.10, we can see that when we use a small number of intervals, neither the left-endpoint approximation nor the right-endpoint approximation is a particularly accurate estimate of the area under the curve. However, it seems logical that if we increase the number of points in our partition, our estimate of \(A\) will improve. We will have more rectangles, but each rectangle will be thinner, so we will be able to fit the rectangles to the curve more precisely.
We can demonstrate the improved approximation obtained through smaller intervals with an example. Let’s explore the idea of increasing \(n\text{,}\) first in a left-endpoint approximation with four rectangles, then eight rectangles, and finally 32 rectangles. Then, let’s do the same thing in a right-endpoint approximation, using the same sets of intervals, of the same curved region. Figure 5.14 shows the area of the region under the curve \(f(x)=(x-1)^3+4\) on the interval \([0,2]\) using a left-endpoint approximation where \(n=4.\) The width of each rectangle is
The area is approximated by the summed areas of the rectangles, or
Figure 5.15 shows the same curve divided into eight subintervals. Comparing the graph with four rectangles in Figure 5.14 with this graph with eight rectangles, we can see there appears to be less white space under the curve when \(n=8.\) This white space is area under the curve we are unable to include using our approximation. The area of the rectangles is
The graph in Figure 5.16 shows the same function with 32 rectangles inscribed under the curve. There appears to be little white space left. The area occupied by the rectangles is
We can carry out a similar process for the right-endpoint approximation method. A right-endpoint approximation of the same curve, using four rectangles (Figure 5.17), yields an area
Dividing the region over the interval \([0,2]\) into eight rectangles results in \(\Delta x =\frac{2-0}{8}=0.25.\) The graph is shown in Figure 5.18. The area is
Last, the right-endpoint approximation with \(n=32\) is close to the actual area (Figure 5.19). The area is approximately
Based on these figures and calculations, it appears we are on the right track; the rectangles appear to approximate the area under the curve better as \(n\) gets larger. Furthermore, as \(n\) increases, both the left-endpoint and right-endpoint approximations appear to approach an area of 8 square units. Table 5.20 shows a numerical comparison of the left- and right-endpoint methods. The idea that the approximations of the area under the curve get better and better as \(n\) gets larger and larger is very important, and we now explore this idea in more detail.
Values of \(n\) | Approximate Area L_n | Approximate Area Rn |
---|---|---|
\(n=4\) | 7.5 | 8.5 |
\(n=8\) | 7.75 | 8.25 |
\(n=32\) | 7.94 | 8.06 |
Subsection 5.1.3 Forming Riemann Sums
So far we have been using rectangles to approximate the area under a curve. The heights of these rectangles have been determined by evaluating the function at either the right or left endpoints of the subinterval \([x_{i-1} ,x_i].\) In reality, there is no reason to restrict evaluation of the function to one of these two points only. We could evaluate the function at any point x_i in the subinterval \([x_{i-1} ,x_i],\) and use \(f(x_i^* )\) as the height of our rectangle. This gives us an estimate for the area of the form
A sum of this form is called a Riemann sum, named for the 19th-century mathematician Bernhard Riemann, who developed the idea.
Definition 5.21.
Let \(f(x)\) be defined on a closed interval \([a,b]\) and let \(P\) be a regular partition of \([a,b].\) Let \(\Delta x \) be the width of each subinterval \([x_{i-1} ,x_i]\) and for each \(i\text{,}\) let \(x_i^* \) be any point in \([x_{i-1} ,x_i].\) A Riemann sum is defined for \(f(x)\) as
Recall that with the left- and right-endpoint approximations, the estimates seem to get better and better as \(n\) get larger and larger. The same thing happens with Riemann sums. Riemann sums give better approximations for larger values of \(n\text{.}\) We are now ready to define the area under a curve in terms of Riemann sums.
Definition 5.22.
Let \(f(x)\) be a continuous, nonnegative function on an interval \([a,b],\) and let \(\displaystyle{ \sum_{i=1}^{n} f(x_i^* )\Delta x }\) be a Riemann sum for \(f(x).\) Then, the area under the curve \(y=f(x)\) on \([a,b]\) is given by
Some subtleties here are worth discussing. First, note that taking the limit of a sum is a little different from taking the limit of a function \(f(x)\) as \(x\) goes to infinity. Limits of sums are discussed in detail in the chapter on Sequences and Series; however, for now we can assume that the computational techniques we used to compute limits of functions can also be used to calculate limits of sums.
Second, we must consider what to do if the expression converges to different limits for different choices of \({x_i^* }.\) Fortunately, this does not happen. Although the proof is beyond the scope of this text, it can be shown that if \(f(x)\) is continuous on the closed interval \([a,b],\) then \(\displaystyle{\lim_{n\to \infty } \sum_{i=1}^{n} f(x_i^* )\Delta x }\) exists and is unique (in other words, it does not depend on the choice of \({x_i^* }\text{ ). }\)
We look at some examples shortly. But, before we do, let’s take a moment and talk about some specific choices for \({x_i^* }.\) Although any choice for \({x_i^* }\) gives us an estimate of the area under the curve, we don’t necessarily know whether that estimate is too high (overestimate) or too low (underestimate). If it is important to know whether our estimate is high or low, we can select our value for \({x_i^* }\) to guarantee one result or the other.
If we want an overestimate, for example, we can choose \({x_i^* }\) such that for \(i=1,2,3 , \dots , n,f(x_i^* )\geq f(x)\) for all \(x∈[x_{i-1} ,x_i].\) In other words, we choose \({x_i^* }\) so that for \(i=1,2,3 , \dots , n,f(x_i^* )\) is the maximum function value on the interval \([x_{i-1} ,x_i].\) If we select \({x_i^* }\) in this way, then the Riemann sum \(\displaystyle{ \sum_{i=1}^{n} f(x_i^* )\Delta x }\) is called an upper sum. Similarly, if we want an underestimate, we can choose \({x_i^* }\) so that for \(i=1,2,3 , \dots , n,f(x_i^* )\) is the minimum function value on the interval \([x_{i-1} ,x_i].\) In this case, the associated Riemann sum is called a lower sum. Note that if \(f(x)\) is either increasing or decreasing throughout the interval \([a,b],\) then the maximum and minimum values of the function occur at the endpoints of the subintervals, so the upper and lower sums are just the same as the left- and right-endpoint approximations.
Example 5.23. Finding Lower and Upper Sums.
Find a lower sum for \(f(x)=10-x^2\) on \([1,2];\) let \(n=4\) subintervals.
With \(n=4\) over the interval \([1,2],\Delta x =\frac{1}{4}.\) We can list the intervals as \([1,1.25],[1.25,1.5],[1.5,1.75],[1.75,2].\) Because the function is decreasing over the interval \([1,2],\) Figure 5.24 shows that a lower sum is obtained by using the right endpoints.
The Riemann sum is
The area of 7.28 is a lower sum and an underestimate.
Checkpoint 5.25.
Find an upper sum for \(f(x)=10-x^2\) on \([1,2];\) let \(n=4.\)
Sketch the approximation.
Subsection 5.1.4 Key Concepts
The use of sigma (summation) notation of the form \(\displaystyle{ \sum_{i=1}^{n} a_i}\) is useful for expressing long sums of values in compact form.
For a continuous function defined over an interval \([a,b],\) the process of dividing the interval into \(n\) equal parts, extending a rectangle to the graph of the function, calculating the areas of the series of rectangles, and then summing the areas yields an approximation of the area of that region.
The width of each rectangle is \(\Delta x =\frac{b-a}{n}.\)
Riemann sums are expressions of the form \(\displaystyle{ \sum_{i=1}^{n} f(x_i^* )\Delta x },\) and can be used to estimate the area under the curve \(y=f(x).\) Left- and right-endpoint approximations are special kinds of Riemann sums where the values of \({x_i^* }\) are chosen to be the left or right endpoints of the subintervals, respectively.
Riemann sums allow for much flexibility in choosing the set of points \({x_i^* }\) at which the function is evaluated, often with an eye to obtaining a lower sum or an upper sum.
Subsection 5.1.5 Key Equations
Properties of Sigma Notation
\begin{gather*} \displaystyle{ \sum_{i=1}^{n} c}=nc\\ \displaystyle{ \sum_{i=1}^{n} ca_i}=c\displaystyle{ \sum_{i=1}^{n} a_i}\\ \displaystyle{ \sum_{i=1}^{n} (a_i+b_i)}=\displaystyle{ \sum_{i=1}^{n} a_i}+\displaystyle{ \sum_{i=1}^{n} b_i}\\ \displaystyle{ \sum_{i=1}^{n} (a_i-b_i)}=\displaystyle{ \sum_{i=1}^{n} a_i}-\displaystyle{ \sum_{i=1}^{n} b_i}\\ \displaystyle{ \sum_{i=1}^{n} a_i}=\displaystyle{ \sum_{i=1}^{m} a_i}+\displaystyle{ \sum_{i=m+1}^n a_i} \end{gather*}Left-Endpoint Approximation \(A\approx L_n=f(x_0)\Delta x +f(x_1)\Delta x + \cdots +f(x_{n-1} )\Delta x =\displaystyle{ \sum_{i=1}^{n} f(x_{i-1} )\Delta x }\)
Right-Endpoint Approximation \(A\approx R_n=f(x_1)\Delta x +f(x_2)\Delta x + \cdots +f(x_n)\Delta x =\displaystyle{ \sum_{i=1}^{n} f(x_i)\Delta x }\)