forked from APEXCalculus/APEXCalculus_Source
-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathproofs.tex
120 lines (97 loc) · 11.3 KB
/
proofs.tex
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
\chapter{Selected Proofs}
We now present some of the proofs of the theorems in the text. If you need a reminder about sequences, you should (re)read \autoref{sec:sequences}. The first thing we will do is state some important properties of the real numbers.
\begin{theorem}[Completeness of the Real Numbers]\label{thm:realscomplete}
\hspace{0pt}\\[-5ex]\begin{itemize}
\item Suppose that $A$ is a nonempty subset of the real numbers that is bounded above. Then there is a unique real number $a^*$ that is the least upper bound of $A$: $A$ is bounded above by $a^*$, and any real number less than $a^*$ has some number from $A$ that is above $a^*$.
\item Suppose that $a_n$ is a sequence of real numbers that converges to $a_\infty$. Then $a_\infty$ is also a real number.
\item These two properties are equivalent.
\end{itemize}
\end{theorem}
The first thing you should think is that these properties are obviously true for the real numbers. The next thing you should realize is that these properties do not hold for the rational numbers: the set $\{q\in\mathbb{Q}:q^2<2\}$ is nonempty and bounded above, but it does not have a least upper bound that is a rational number. Similarly, we can find a sequence of rational numbers that approaches a limiting value that isn't rational (for example, the rational numbers 3, 3.1, 3.14, 3.141, 3.1415, 3.14159, \dotsc\ approaches a limiting value that isn't rational). This means that any of our theorems that use this property could fail if we use the rational numbers instead of the real numbers. It turns out that proving this theorem is quite difficult, and depends on how you actually define the real numbers (most definitions are tailored so that one of the first two properties is obviously true). This theorem is usually proved in a more advanced course with an intimidating name like ``Real Analysis'' or ``Advanced Calculus'' and is far beyond anything we will present here.
\begin{proof}[Proof of the {\hyperref[thm:IVT]{Intermediate Value Theorem}}]
\label{pf:IVT}
Let $X$ be the subset of real numbers $x\in[a,b]$ so that $f(x)<y$. Then $a\in X$ and $b$ is an upper bound for $X$. Therefore, $X$ has a least upper bound, which we will show is the desired $c$.
Given $\epsilon>0$, let $\delta$ be given by the continuity of $f$ at $c$. Because $c-\delta$ is not an upper bound of $X$, there must be some $x\in(c-\delta,c]$ so that $f(x)<y$, so that $f(c)<f(x)+\epsilon<y+\epsilon$. On the other hand, $c+\delta/2\notin X$ so $f(c+\delta/2)\ge y$ and $f(c)>f(c+\delta/2)-\epsilon>y-\epsilon$. Therefore, $\abs{f(c)-y}<\epsilon$. But since $\epsilon$ is arbitrary, this means that $f(c)=y$.
\end{proof}
The theorem (and the proof) do not hold if we use the rational numbers in place of the real numbers. For example, $f(x)=x^2-2$ on $[0,2]$ gives a counterexample.
%\begin{proof}[Proof of {\hyperref[thm:chainRule]{The Chain Rule}}]
%\label{pf:chainRule}
%We'll start with an incorrect proof of the chain rule, see why it's wrong, and then back up to fix what we've written. By the definition of the derivative,
%\begin{align*}
% \yp
% &= \lim_{h\to 0}\frac{f(g(x+h))-f(g(x))}h \\
% &= \lim_{h\to 0}\frac{f(g(x+h))-f(g(x))}{g(x+h)-g(x)}\times\frac{g(x+h)-g(x)}h \\
% &= \lim_{h\to 0}\frac{f(g(x+h))-f(g(x))}{g(x+h)-g(x)}\times
% \lim_{h\to 0}\frac{g(x+h)-g(x)}h \\
% &= \fp(g(x))\gp(x).
%\end{align*}
%The problem is that $g(x+h)-g(x)$ could be zero, which would mean we're multiplying and dividing by 0 and invalidating everything. The solution is to rewrite everything so that we're no longer dividing. Instead of showing
%\begin{equation}\label{eq:chainRulePfOrg}
% \lim_{h\to 0}\frac{f(g(x+h))-f(g(x))}h=\fp(g(x))\gp(x),
%\end{equation}
%we will rearrange the equation and show that
%\[f(g(x+h))-f(g(x))-h\fp(g(x))\gp(x)=h\epsilon(h)\]
%where \autoeqref{eq:chainRulePfOrg} means that $\epsilon(h)\to0$ as $h\to0$. The proof continues\dotso.
%\end{proof}
Before proving our next theorem, we'll first define an extension of an idea we saw (or will see) in \autoref{sec:sequences}.
\begin{definition}[Subsequence]\label{def:subsequence}
Suppose that $\{x_n\}$ is a sequence of numbers. Then $\{y_n\}$ is a \textbf{subsequence} if $g:\mathbb{N}^+\to\mathbb{N}^+$ is a strictly increasing function and $y_n=x_{g(n)}$.
\end{definition}
The idea may best be explained by a diagram:
\[
\begin{array}{ c c c c c c c c c c c c c c c }
x_1 & x_2 & x_3 & x_4 & x_5 & x_6 & x_7 & x_8 & x_9 & x_{10} & x_{11} & x_{12} & x_{13} & x_{14} & x_{15} \\
& y_1 & y_2 & & y_3 & & y_4 & & & & y_5 & & y_6 & &
\end{array}
\]
In words, the subsequence is a thinning of the original sequence. We keep everything in order (and still have infinitely many terms), but may skip over a few terms of the original sequence.
We can take a subsequence of a subsequence, which is sometimes called a subsubsequence. We could also talk about subsubsubsequences, but that starts to get annoying.
If the original sequence converges, then so does any subsequence. On the other hand, if the original sequence diverges, then we may be able to find a subsequence that still converges. For example, $\{(-1)^n\}$ diverges, but if we take every other term, then we get a subsequence that converges. Our next theorem is a powerful tool that gives conditions to guarantee that we'll be able to find a convergent subsequence.
\begin{theorem}[The Bolzano-Weierstrass Theorem]\label{thm:bolzanoWeierstrass}
Suppose that $\{x_n\}$ is a bounded sequence or real numbers. Then $\{x_n\}$ has a subsequence that converges.
\end{theorem}
\begin{proof}
Because the sequence is bounded, it is contained in some segment $[-M,M]$.
Partition this segment into $2M$ subsegments that are each 1 unit in length. One of these segments must have infinitely many elements of the original sequence $\{x_n\}$. Taking these elements as our subsequence, we have a subsequence $\{x_n^{(0)}\}$ where all of the elements are in the same unit length interval. Now divide this interval in half. One of these halves must have infinitely many elements of $\{x_n^{(0)}\}$, and we call this subsequence $\{x_n^{(1)}\}$. We can continue this process of dividing the half segment and always taking the semi-segment with infinitely many points. In this way, we get an infinite sequence of sequences $\{x_n^{(k)}\}$ where each $\{x_n^{(k)}\}$ is contained in an interval of length $2^{-k}$. This means that $\{x_n^{(n)}\}$ is a subsequence that must converge.
\end{proof}
The temptation in the previous proof is to just take the ``last'' subsequence. But there are an infinite number of subsequences, so we can't talk about the ``last'' one. Instead, we take the first element from the first subsequence, the second element from the second subsequence, and so on.
By \autoref{thm:realscomplete}, this limit must be a real number.
% Let $f$ be a continuous function defined on a closed interval $I$. Then $f$ has both a maximum and minimum value on $I$.
\begin{proof}[Proof of {\hyperref[thm:extremeVal]{The Extreme Value Theorem}}]
\label{pf:extremeVal}
We will prove that $f$ has a maximum. The proof that $f$ has a minimum is identical. Let $M$ be the least upper bound of the range of $f$, and choose $x_n\in[a,b]$ so that $f(x_n)\to M$. By \autoref{thm:bolzanoWeierstrass}, we can find a subsequence $y_n$ that converges to a limit $y$. Because each $y_n\in[a,b]$, their limit $y\in[a,b]$. Then
\[
f(y)=f(\lim_{n\to\infty}y_n)=\lim_{n\to\infty} f(y_n)=M
\qedhere
\]
\end{proof}
This theorem (and the proof) do not hold if we have a function from rational numbers to rational numbers. For example, $f(x)=x^3-6x$ on $[-2,2]$ gives a counterexample.
% Let a function $f$ have a relative extrema at the point $(c,f(c))$. Then $c$ is a critical point of $f$.
\begin{proof}[Proof of {\hyperref[thm:criticalpts]{Relative Extreme and Critical Points}}]\label{pf:criticalpts}
If $\fp(c)$ does not exist, then $c$ is a critical point of $f$. Therefore, we may assume that $\fp(c)$ exists. Without loss of generality, we will assume that $f(c)$ is a relative maximum. This means that $\dfrac{f(c+h)-f(c)}h$ is positive when $h<0$ and negative when $h>0$. Since we know that the limit as $h\to0$ exists, it must be zero because it is a limit of both positive and negative numbers. Therefore, $\fp(c)=0$, and $c$ is a critical point of $f$.
\end{proof}
% theorem
%Let $f$ be continuous on the closed interval $[a,b]$ and let $S_L(n)$, $S_R(n)$ and $S_M(n)$ be defined as before. Then:
%\begin{enumerate}
%\item $\ds \lim_{n\to\infty} S_L(n) = \lim_{n\to\infty} S_R(n) = \lim_{n\to\infty} S_M(n) = \lim_{n\to\infty}\sum_{i=1}^n f(c_i)\Delta x$,
%\item $\ds \lim_{n\to\infty}\sum_{i=1}^n f(c_i)\Delta x = \int_a^b f(x)\dd x$, and %$\ds \lim_{n\to\infty} S_L(n) = \int_a^b f(x)\dd x$.
%\item $\ds \lim_{\norm{\Delta x}\to 0} \sum_{i=1}^n f(c_i)\Delta x_i = \int_a^b f(x)\dd x$.%, where the latter sum is any Riemann sum of $f$ on $[a,b]$, and
%\end{enumerate}
% proof
%\begin{proof}[Proof of {\hyperref[thm:riemannSum]{Definite Integrals and the Limit of Riemann Sums}}]\label{pf:riemannSum}
%On the $i^\text{th}$ subinterval $[x_i,x_{i+1}]$, let $m_i$ be the extreme minimum of $f$, and let $M_i$ be the extreme maximum. Then the signed area over $[x_i,x_{i+1}]$ is between $m_i\Delta x_i$ and $M_i\Delta x_i$. Therefore, every expression in the theorem is bounded below by $\sum_{i=1}^n m_i\Delta x_i$ and bounded above by $\sum_{i=1}^n M_i\Delta x_i$.
%
%Given $\epsilon>0$, we want to find a $\delta$ so that $\abs{x-y}<\delta$ implies $\abs{f(x)-f(y)}<\epsilon/(b-a)$. (This is stronger than our usual notion of continuity, since we want the same $\delta$ to work for all possible values of $x$ and $y$.) Because $f$ is continuous, for each $x$, we can find a $\delta$ that depends on $x$ so that $\abs{x-y}<\delta$ implies $\abs{f(x)-f(y)}<\frac\epsilon{b-a}$. Let $\delta(x)$ be the largest value of $\delta$ that makes this statement work for a particular $x$, so that there is some $y$ so that $\abs{x-y}<\delta$ but $\abs{f(x)-f(y)}\ge\frac\epsilon{b-a}$. We want to show that there is a $\delta>0$ that is smaller than all the $\delta(x)$.
%
%Let's suppose for this paragraph that there is no such $\delta$. This means that there is a sequence $\{x_n\}$ so that $\delta(x_n)\to0$. By \autoref{thm:bolzanoWeierstrass}, there is a subsequence that converges to a limit $x^*$. Now let $\delta^*=\delta(x^*)$ and take $x_n$ within $\frac{\delta^*}2$ of $x^*$ so that $\delta(x_n)<\delta^*$. Then for any $y$ within $\delta(x_n)$ of $x_n$, $\abs{x^*-y}<\abs{x^*-x_n}+\abs{x_n-y}<\frac{\delta^*}2+\delta(x_n)=\delta^*$ and $\abs{f(x_n)-f(y)}<\abs{f(x_n)-f(x^*)}+\abs{f(x^*)-f(y)}<\frac\epsilon{2(b-a)}+\frac\epsilon{2(b-a)}$. But we chose $\delta(x_n)$ so that there is a $y$ where this doesn't happen. This means that this entire paragraph is one giant contradiction. Therefore, our beginning assumption must be wrong, and we can find such a $\delta$. NOT YET.
%
%Then once $\abs{\Delta x}<\delta$, the $x$-coordinates of the extreme values will be within $\delta$, and $\abs{M_i-m_i}<\epsilon/(b-a)$. This will mean that
%\[
% \abs{\sum_{i=1}^n M_i\Delta x_i-\sum_{i=1}^n m_i\Delta x_i}
% =\abs{\sum_{i=1}^n(M_i-m_i)\Delta x_i}
% <\abs{\sum_{i=1}^n\frac{\epsilon\Delta x_i}{b-a}}
% =\epsilon.
%\]
%Therefore, as $\abs{\Delta x}\to 0$ (and $n\to\infty$), every summation approaches the same value, which must be equal to the signed area.
%\end{proof}
% ratio & root tests