MSU MATHEMATICS |
MTH 254H |
Spring 2016 |
I will not collect daily homework, though you may hand in problems marked Extra Credit within a week of the HW date. Each problem you give up on is a lost opportunity to learn: only look at the solution after a serious effort.
I will give 1 extra point to the first person reporting each significant typo or error on this page. Corrections and recent revisions are in red. Tentative future assignments are marked in gray.
1a. Vector AB = B − A = (3,0) − (1,3) = (2,−3); OC = (3,2). The dot product is: AB • OC = (2,−3) • (3,2) = (2)(3) + (−3)(2) = 0, meaning the vectors are orthogonal.
1b. l(t) = A + t (AB) = (1,3) + t (2,−3) = (1 + 2t, 3 − 3t) and m(s) = s (3,2) = (3s, 2s).
1c. The lines intersect when l(t) = m(s), which means: 1 + 2t = 3s and 3 − 3t = 2s. Solving this linear system, for example by substitution, gives: (t,s) = (7⁄13, 9⁄13), so l(7⁄13) = m(9⁄13) = (27⁄13, 18⁄13) is the intersection point.
1d. The signed area of the parallelogram is: det(OA,OB) = det(1,3; 3,0) = (1)(0) − (3)(3) = −9. Alternative computation: OA⊥ • OB = (−3,1) • (3,0) = −9. (The negative sign indicates a clockwise turn from OA to OB, rather than the standard counterclockwise orientation.) Thus, the (absolute) area of the triangle is ½ |−9| = 9⁄2.
1e. The area of ΔACB is
½ det(A−C ; B−C)
= ½ det(−2,1; 0,−2) = ½((−2)(−2) − (0)(1))
= 2.
Adding the areas of ΔAOB and ΔACB gives 9⁄2 + 2 = 13⁄2.
2a. We compute: u • v = (0, −10) • (3,2) = (0)(3) + (−10)(2) = −20, and |v| = √(32 + 22) = √13 . Then |p| = |u • v|⁄|v| = 20⁄√13.
2b. We have: 20⁄√13 = |p| = |s| |v| = s√13, so s = ±20⁄13 and p = s v = ±20⁄13 (3,2) = ±(60⁄13, 40⁄13). In fact, we need the − sign, p = −20⁄13 (3,2), since there is an obtuse angle between u and v, so the projection p points opposite to v.
2c. The result in (b) agrees with the general formula: p = (u • v⁄v • v) v = −20⁄13 (3,2).
2d. q = u − p = (0, −10) + 20⁄13 (3,2)
= (60⁄13,
−90⁄13).
Here q is indeed orthogonal to v, because:
2e. Here p is the component of acceleration along the plane v, so the acceleration is |p| = 20⁄√13 ≈ 5 m/sec2, about half the acceleration of free-fall. The force on the surface is mass times acceleration, so it depends on how heavy the object is. A 1 kg object (pressing about 10 Newtons on a flat surface) would exert a force of 1 |q| = 30⁄√13 ≈ 8 N.
3. See the whole solution written as a
model proof.
3a. The vector from u to the midpoint ½(v + w) is:
½(v + w) − u = ½v + ½w − u. The median line segment is:
(Here we expanded using the distributive property of scalar multiplication over vector addition.) Similarly for the other medians.
3b. m(2⁄3) = 1⁄3 u + 1⁄3 v + 1⁄3 w = c. Doing a similar computation for the other medians, we see that the same point c lies on all three lines.
1a. Clearly m = (1,3) has ℓ(v) = m • v = (1,3) • (x,y) = x + 3y.
1b,d. Contour lines ℓ(x,y) = c, with gradient vector field ∇ℓ(x,y) = (1,3), not to scale:
That is, the direction v − v' along the contour is orthogonal to the gradient vector m.
2a,b,c. Below are contour curves f(x,y) = c; the gradient vector field ∇f(x,y) = (2x, 1), not to scale; also the tangent line from part (d).
2d. The contour curve of f(x,y) passing through (1,1) is given by the equation f(x,y) = f(1,1) = 2. If we zoom in near (1,1), the contour curve looks approximately linear, indistinguishable from its tangent line. To put this in equations, consider the affine approximation:
ℓ(x−1, y−1) + f(1,1) = f(1,1) ⇔ ∇f(1,1) • (x,y) − ∇f(1,1) • (1,1) = 0 ⇔ ∇f(1,1) • (x,y) = ∇f(1,1) • (1,1).
2e. With vertical ribs, and with horizontal contour lines:
I got these images from Wolfram Alpha, which is a great resource for computations and pictures.
3. The contour plot consists of the axes (xy = 0), and hyperbolas y = c/x. The gradient is ∇f(x,y) = (y,x): we have ∂⁄∂x(xy) = y, since in the partial derivative with respect to x, the variable y is considered as a constant. In the graph z = xy, the vertical curve above y = x is the upward parabola z = x2, while above y = −x it is the downward parabola z = −x2, making a "saddle".
5. Letting x,y,z be the side-lengths of the rectangular bin, the volume is xyz = 1, and the surface area of the four sides is s = xy + yz + 2xz. Eliminating the variable z = 1⁄xy, we get the function
The gradient field is: ∇s(x,y) = (y − 1⁄x2 , x − 2⁄y2):
The only critical point, where ∇s(x,y) =
(y − 1⁄x2,
x − 2⁄y2) = (0,0),
is (x,y) = (2−1/3, 22/3) ≈ (0.8, 1.6).
The vector field plot shows this is a local minimum, since the uphill vectors all point outward from this point.
For completeness, we should also consider the critical points where
∇s is undefined, namely the boundary points
where x = 0 or y = 0; but these are
clearly vertical asymptotes where s(x,y) tends to infinity.
Thus, our final answer is: (x,y,z) = (2−1/3, 22/3, 2−1/3).
Hint: Split the integral into intervals, ∫02 = ∫01 + ∫12.
1a. f(3,5) =
∮ F(c) • dc
=
∫01 F(c(t)) • c'(t) dt
=
∫01 F(3t,5t) • (3t, 5t)' dt
=
∫01 (6t,1) • (3,5) dt
=
∫01 18t + 5 dt
=
9t2 + 5t |1t=0
=
14.
1b. We repeat part (a) with (3,5) replaced by (a,b), getting f(a,b) = ∮ F(c) • dc = ∫01 (2at,1) • (a,b) dt = ∫01 2a2t + b dt = a2 + b. Thus, f(x,y) = x2 + y, and ∇f(x,y) = (∂⁄∂x(x2 + y), ∂⁄∂y(x2 + y)) = (2x, 1).
1c.
Thus, f(x,y) = x2 + y agrees with in part (b), which is guaranteed by the Gradient Theorem.
2a. The vortex vector field F(x,y) = (−y, x):
2b. From the vector field plot, we can see that F could not have any potential function f. If F(x,y) were the gradient or uphill direction for a graph z = f(x,y), we could follow an uphill path all the way around a circle centered at the origin, which is obviously impossible.
2c. f(a,b) =
∫01 (−bt, at) • (a,b) dt
=
∫01 −bat + abt dt
=
0.
That is, the vector field is everywhere
orthogonal to the curve, and
f(x,y) = 0 is the zero constant function.
The gradient is ∇f(x,y) = (0,0) ≠ F(x,y),
so the method fails, which means F(x,y) could not have been
a gradient in the first place.
2d.
f(a,b) =
∫01 F(at, 0) • (at, 0)' dt
+
∫12 F(a, b(t−1)) • (a, b(t−1))' dt
=
∫01 (0,at) • (a, 0) dt
+
∫12 (−b(t−1), a) • (0, b) dt
=
ab.
Thus f(x,y) = xy, but this also fails to give a potential function, since ∇f(x,y) = (y,x) ≠ F(x,y).
If one path fails to give a correct potential function,
no other path will work.
3a. f(a,b) = ∮ F(c) • dc
= ∫10 F(c(t)) • c'(t) dt
= ∫10 F(at, bt) • (at, bt)' dt
= ∫10 (ebt, at ebt) • (a, b) dt
= ∫10 a ebt + abt ebt dt
= at ebt |1t=0 = a eb.
Therefore f(x,y) = x ey.
3b. We try: ∮ F(p) • dp
= ∫10 F(t, t2) • (t, t2)' dt
= ∫10 et2 + 2t2et2 dt.
But ∫ et2 is well known to have no
algebraic anti-derivative, and it is easy to deduce
that the above integral also has none. Thus, our algebraic methods
are stuck.
3c. By the Gradient Theorem, ∮ F(p) • dp = f(p(1)) − f(p(0)) = 1e1 − 0e0 = e = 2.718...
3d. Spreadsheet. We get ∮ F(p) • dp ≈ 3.0, which is about 10% over the exact value e = 2.718... from part (c). This is expected, since F(p(t)) is increasing in magnitude, so its average value within each interval is less than at the right endpoint F(pi).
3e. Spreadsheet. This time, the approximation ∮ F (p) • dp ≈ 2.714 is accurate to 2 decimal places, which is pretty impressive since Δt = 0.1 has only one decimal place.
In fact, F has a potential function with F = ∇f if and only if curl F = 0 everywhere.
First compute the inner integral ∫ h(x) g(x) f(x,y) dy with respect to the variable y, treating x as a constant. The result is a function of x which you integrate in the outer integral ∫b a dx.
1a. F(x,y) = (0, x)
A small paddlewheel placed at any point
is pushed vertically on one side more than on the other,
giving a counterclockwise rotational force,
which means curl F > 0.
In fact, curl F =
∂⁄∂x(x)
− ∂⁄∂y(0)
= 1 − 0 = 1.
Since the curl is non-zero, the field is not path independent or
irrotational, so it cannot have a potential function f.
1b. F(x,y) = (x, 0).
The equal flows on the top and bottom of a paddlewheel
cancel each other, so we should have curl F = 0.
In fact, curl F =
∂⁄∂x(0)
− ∂⁄∂y(x)
= 0 − 0 = 0.
The line integral along c(t) = (at, bt) is:
f(a,b) =
∫01 F(at, bt) • (at, bt)' dt
=
∫01 (at, 0) • (a, b) dt
=
½a2.
The potential function is thus f(x,y) = ½x2.
1c. F(x,y) = (−y, x).
A paddlewheel is pushed harder on the side away
from the origin than on the side facing the origin,
giving a counterclockwise rotation,
curl F > 0.
Using the other physical model,
a person sitting anywhere on a turntable
must turn his head at a constant rate to counter the table's rotation, so we expect curl F is constant.
In fact, curl F =
∂⁄∂x(x)
− ∂⁄∂y(−y)
= 1 − (−1) = 2.
There is no potential function.
1d. F(x,y) = 1⁄r2 (x,y). <--Mathematica VectorPlot[ {x/(x^2 + y^2), y/(x^2 + y^2)}, Element[{x, y}, RegionDifference[ Rectangle[{-1, -1}, {1, 1}], Rectangle[{-.2, -.2}, {.2, .2}] ] ], Axes -> True, Frame -> False] -->
A paddlewheel is pushed
with equal force on both sides of a
radial line, so we should have curl F = 0.
This holds for any radial field
F(x,y) = g(r) (x,y).
We compute: r = √(x2+y2),
∂r⁄∂x
= x⁄√(x2+y2),
and similarly for ∂r⁄∂y,
so the Chain Rule gives:
The potential function is f(x,y) = r G(r), where G(c) = ∫c0 g(r) dr.
2a. The magnitude of F(x,y) = 1⁄r2 (−y, x) is |F(x,y)| = 1⁄r2 |(−y, x)| = 1⁄r2 r = 1⁄r .
On the side away from the center, a paddlewheel is pushed counterclockwise by the flow; on the side facing the center, it is pushed clockwise along a shorter arc by a stronger flow. The two forces balance, and we should have curl F = 0.
2b. For F(x,y) = (−y⁄x2+y2 , x⁄x2+y2), we use the Quotient Rule to get:
That is, F has no rotation at any point where it is defined, though it has some singular behavior at the origin.
2c. To compute the potential function f(a,b), we take the line integral along a two-segment path: we define the constant s = √(a2+b2), and take the line segment c1(t) = (t + (1−t)s, 0) from (1,0) to (r,0); then the circular arc c2(t) = (s cos(t), s sin(t)) from (r,0) to (a,b), where t goes from 0 to θ = arctan(b/a), which is the angle from the x-axis to the radial vector (a,b). The potential function is given by:
In the graph z = f(x,y), the ray at angle θ from the x-axis is at height θ, for −π ≤ θ ≤ π. This leads to a discontinuity on the negative x-axis, where the angle can be either π or −π. Thus, we cannot define the potential function all the way around the origin, which reflects the fact that F does have a rotation around that point, though nowhere else.
3a. R is the region above the interval [0,1] on the x-axis, and below the line y = 1−x, so R = { (x,y) with 0 ≤ x ≤ 1 and 0 ≤ y ≤ 1−x }.
3b. First, a rough estimate: the plate has area ½, but
its density increases slowly from 0 on the left edge to 1 at the right corner. Thus, its weight should be much less than ½.
Now, ∫01−x x2 dy
= x2 y|y=0y=1−x
= x2(1−x).
Thus, the weight is:
Geometrically, if F represents a fluid flow,
the divergence measures the rate of net outflow (= outflow minus inflow) from a small region near (x,y), per area enclosed.
Problem: For each vector field below, estimate from its
picture whether it has positive, negative, or zero
divergence at each point: does the flow going into one side of a small region balance the flow out the opposite side (div = 0, called incompressible flow),
or is the fluid density thinning out at that point, so that there is faster flow out than in (div > 0)?
Confirm your estimate by computing div F with the above formula.
The fields are:
and for c(t) = (x(t), y(t)), we let dc⊥ = (x'(t), y'(t))⊥ dt -->
1a. Curl Theorem: The double integral of the rate of rotation of the vector field F over the region R is equal to the total rotation line integral of F around the boundary curve c of R:
Stating it another way: The total rotation of F around a closed curve is equal to the double integral of the rate of rotation inside the curve. (The curve must go counterclockwise to give the correct sign.)
1b. F(x,y) = (0, x2):
The line integral has non-zero dot product of the
field with the tangent vector only on the diagonal
of the triangle, with a value about 1⁄2
at the middle and a maximum of about 1 at the right end.
Thus, we may guess a total rotation about 1.
For positive x, a paddlewheel would be pushed more strongly
counterclockwise by the flow to its right than clockwise
by the weaker flow to its left. Thus, curl F is clearly
positive, and increases from zero for x = 0
to largish at x = 2;
the right half of triangle has area 1⁄2
so the above guess of 1
is reasonable for the double integral of curl F
over the triangle.
1c. curl F(x,y) = 2x, and R = {(x,y) s.t. 0 ≤ x ≤ 2 , 0 ≤ y ≤ 1−½x}. The double integral is:
in line with our guess.
1d. We have R = {(x,y) s.t. 0 ≤ y ≤ 1 , 0 ≤ x ≤ 2−2y}, and:
Any double integral can be done either way (dy dx or dx dy), whichever is more convenient.
1e. The boundary c has three segments: c1(t) = (2t,0) ; c2(t) = (2,0) + [(0,1)−(2,0)]t = (2−2t, t) ; c3(t) = (0, 1−t), where all the parameter ranges are 0 ≤ t ≤ 1. The total rotation is:
Behold the Curl Theorem!
2a. F(x,y) = (0, x)
Visually, we see that a small rectangular region has the same flow at the bottom and at the top, so net outflow is zero, the flow is incompressible, and div F = 0. Computing: div F = ∂⁄∂x(0) + ∂⁄∂y(x) = 0 .
2b. F(x,y) = (x, 0).
For a small rectangle on the right side, the inflow through its left boundary is smaller than the outflow through its right boundary, so the net outflow is positive: div F > 0. Computing: div F = ∂⁄∂x(x) + ∂⁄∂y(0) = 1.
2c. F(x,y) = (−y, x)
Consider a small curved box, a "polar coordinates box", having two sides which are short sectors of concentric circles, and the other two sides short segments of radii. Then clearly the inflow across one of the radius edges equals outflow across the other, and there is no flow across the circular edges. Thus, the net outflow is zero, incompressible flow: div F = 0. Computing: div F = ∂⁄∂x(−y) + ∂⁄∂y(x) = 0.
2d. The same resoning as in 2(c) shows that for F(x,y) = g(r) (−y, x), we have div F = 0 for any g(r). We compute div F = 0 by applying the Chain Rule as in 2(e) below.
2e. Let F(x,y) = g(r) (x,y) be any radial field. As in the previous part, consider a polar coordinate box: we must balance inflow of g(r) along a short curve facing the origin, with outflow of g(r+Δr) along a longer curve facing away from the origin. This can be positive, zero, or negative net outflow, depending on how quickly g(r) increases with r. Here is F(x,y) = 1⁄r2 (x,y):
We compute: r = √(x2+y2), ∂r⁄∂x = x⁄√(x2+y2), and similarly for ∂r⁄∂y, so the Chain Rule gives:
which can be positive, zero, or negative.
It is zero precisely when g(r) satisfies the
differential equation
r ∂g⁄∂r + 2g = 0,
which has solution g(r) = k⁄r2.
Thus, the above picture is essentially the only
incompressible radial field (div = 0).
Of course, there is singular behavior at the origin
where the field becomes infinite.
1. See solutions to HW 1/22 #2.
2a. |F(v)| = | v⁄|v|2 | = |v|⁄|v|2 = 1⁄|v| = 1⁄r .
2b. We showed in HW 1/20 #1(d) that any radial field F(v) = g(r) v is irrotational. We showed in HW 1/22 #2(e) that our F = 1⁄r2 v = v⁄|v|2 is the only incompressible radial field.
Computing directly:
= (x2+y2−2x2) ⁄(x2+y2)2 + (x2+y2−2y2) ⁄(x2+y2)2 = 0.
2c. We look visually at the flux of F across the polar box curve:
The flow across the radial segments is zero. The outer circular arc is twice as long as the inner arc, but the field on the outside is half as strong, so outflow balances inflow, and the net outflow is zero: ∮ F(b) • dn = 0.
Calculating the flux, we use the parametrized segments:
with 0 ≤ t ≤ 1. The flux across these four segments is: ∮ F(b) • dn = ∮ F(b) • (−db⊥) =
Computing the last term in detail, we start with
|(a cos(θ), a sin(θ))| = |a|, so that:
F(a cos(θ), a sin(θ)) = 1⁄|a| (cos(θ), sin(θ)). Then:
This confirms our expectation of inflow over the inner circle.
The Divergence Theorem computes the flux integral right away, as a double integral over the polar box B enclosed by the curve: as ∮ F(b) • dn = ∬B div F dA = 0, since div F(x,y) = 0 everywhere.
2d. Visually, the outflow across the unit circle is obviously positive. Calculating, we use c(t) = (cos(t), sin(t)) for 0 ≤ t ≤ 2π. Then:
2e. The result of part (d) seems to contradict the Divergence Theorem, applied to the unit disk region C and its boundary c:
But: the integrand of the double integral on the right is not defined everywhere: div F(0,0) = 1⁄02 (0,0), which does not exist. In fact, F(x,y) has a point-source of flow from (0,0), which suggests an infinite rate of outflow per unit area. This "infinite value" at a single point is enough to make the double integral non-zero. It is actually possible to make mathematical sense of this statement.
where c0,...,cn are sample points of the curve, and Δci = ci − ci−1 are short steps along the curve. We can write this symbolically as: L = ∮ |dc|.
1a.
= ∑ni=1 |c'(ti)| Δt ≈ ∫10 |c'(t)| dt = ∫10 √(x'(t)2 + y'(t)2) dt,
1b. The same reasoning and formulas apply to R3 (or Rn), except with more components. The length of c(t) = (x(t), y(t), z(t)) for 0 ≤ t ≤ 1 is:
1c. The helix curve c(t) = (sin(t), cos(t), t) for 0 ≤ t ≤ 2π goes around the unit circle in the xy-plane, and rises at the same time. It has derivative (velocity vector) c'(t) = (−cos(t), sin(t), 1); and length:
2a. Let p be the orthogonal projection of v on the direction u, meaning p is parallel to u and the vector q from p to v is orthogonal to u. That is, p, q are the legs of a right triangle with hypotenuse v, so by the definition of cosine: |p| = |v| cos θpv = |v| cos θuv if θuv is an acute angle, and |p| = |v| |cos θuv| in general. Now the basic formula u • v = |u| |v| cos θuv implies:
2b. Letting u⊥ = (−u2, u1), the counterclockwise orthogonal of u, take the role of u in the previous problem, we find that the projection of v to the direction u⊥ is: q = |u⊥ • v|⁄|u⊥| = |u⊥ • v|⁄|u| , since clearly |u⊥| = |u|.
2c. For the parallelogram spanned by plane vectors u, v, the base has length |u|, and the height (the length of the projection of v to the perpendicular direction of u, is |q| = |u⊥ • v|⁄|u| by the previous problem. Therefore:
= |(−u2, u1) • (v1, v2)| = |−u2v1 + u1v2| = |u1v2 − u2v1| .
We will see that this quantity is the absolute value of the 2 × 2 determinant.
2d. Projections work the same in 3 or higher dimensions as they do in 2 dimensions. However, a vector u ∈ R3 has no well-defined orthogonal vector u⊥: there is a whole plane of vectors orthogonal to u, and a circle of orthogonal vectors with the same length |u|. Thus, we cannot compute the height of the parallelogram spanned by u, v, unless we find a way to compute the orthogonal to u within the plane spanned by u, v. (We will accomplish this later using the cross product.)
3a. This time p is the projection of u on direction v. Since |p| = |u| |cos θuv| and u • v = |u| |v| cos θuv, we have:
Alternatively, recall our definition in Lect 1 that u • v = ± |p| |v|, again giving |p| = |u • v|⁄|v| .
3b. The multiple of v with the correct direction and length is:
3c. We have: q = u − p = 1⁄11 (−13,2,17). We can check that q is indeed orthogonal to v, because:
3d. The ratio of the projected force vector in direction p to the original force in direction u is |p|⁄|u| = 28⁄14√77 = 2⁄√77 . Thus, the projected force is 2⁄√77 10 ≈ 2.28 Newtons. (A Newton is the force needed to accelerate 1 kilogram by 1 meter/second in 1 second.)
1a. v−u = (1,0,1), w−u = (1,1,−1).
1b. p(s,t) = u + s(v−u) + t(w−u) = (s+t, 1+t, 1+s−t).
1c. d(s,t) = |r − p(s,t)|
=
|(s+t, 1+t, 1+s−t) − (3,4,5)|
=
√((s+t−3)2 + (t−3)2
+ (s−t−4)2)
=
√(2s2−14s+3t2−4t+34)
1d. The function D(s,t) = d(s,t)2 = 2s2−14s+3t2−4t+34 has its minimum at the same (s,t) as d(s,t). This must occur at a critical point, where:
Thus, the minimum distance is: d(7⁄2, 2⁄3) = 7⁄√6.
2. See solutions for HW 1/11
3a. The triple u = (1,1,0), v = (1,0,1), w = (1,1,2):
Notice that we draw the x,y,z axes as a right-handed triple, and v is in the xz-plane in the foreground, whereas w recedes into the picture. This is a left-handed system: model it with your left palm up, index finger horizontal along u, middle finger almost vertical along v, thumb pointing away from you along w.
We compute the determinant, the signed volume of the parallelepiped spanned by the three vectors. We write the rows of a matrix consecutively in one long row, and write v(i) for the vector v with its ith coordinate omitted.
= 1 det(0,1; 1,2) − 1 det(1,1; 1,2) + 0 det(1,0; 1,1) = −1 − 1 + 0 = −2.
The negative value verifies the left-handedness.
3b. The triple u = (1,0,1), v = (1,1,0), w = (1,1,2), with the first two vectors switched from part (a), is right-handed, with the opposite determinant: det(u;v;w) = 2.
3c. The triple u = (1,1,0), v = (1,0,1), w = (2,1,1) has w = u + v, which means the three vectors all lie in the same plane. Thus, the parallelepiped is flat, with volume zero, and we may verify det(u;v;w) = 0. The triple is degenerate, neither right- nor left-handed.
1a. The cross product of two vectors is orthogonal to them both. Thus:
1b. Every direction along the plane is orthogonal to q from part (a). Thus, the vector from u to an arbitrary point p = (x,y,z) in the plane is orthogonal to q:
1c. The vector r − u = (3,4,5) − (0,1,1) = (3,3,4) points at an angle from
the plane to r; the "height" of this vector
(i.e. the component orthogonal to the plane) is the distance from the plane to r. To get the length of the projection of r − u on the orthogonal direction q, we take
the dot product with the unit vector q⁄|q|:
which agrees with the result from HW 1/25 #2(d).
2a. Expanding the definition of the determinant gives:
Note that the subscripts of the terms uivjwk are (i,j,k) = (1,2,3), (1,3,2), (2,1,3), (2,3,1), (3,1,2), (3,2,1). These are all 3! = 3•2•1 = 6 ways of rearranging (1,2,3).
For a higher dimension n, there are n! ways of permuting n distinct objects,
which we encode as one-to-one functions
π from {1,...,n} to itself. Examples:
the identity mapping π(i) = i; or the
flip-over mapping π(i) = n+1−i.
Consider n vectors u1,..., un,
with coordinates ui = (ui1,...,uin).
Their determinant is:
2b. We compute:
=
( u1 det(v(1); w(1))
− u2 det(v(2); w(2))
+ u3 det(v(3); w(3)) )
=
det(u, v, w)
+
det(u', v, w)
Similarly for det(su, v, w)
=
s det(u, v, w).
2c. Switching u, v in the input of det, we get:
since on the third line, each term appears
with the opposite sign as in det(u, v, w).
The other formula is similar.
2d. We can check this formula directly from the expansion
as in part (c), but there is an easier way.
If we use the part (c) formula det(v, u, w) = −det(u, v, w), and
replace v with u, we get:
But a = −a means 2a = 0 and a = 0, and so det(u, u, w) = 0.
2e. Again, we can check this directly as in part (c),
but there is an easier way. If we use the formula which motivated the cross-product, det(u, v, w) = (u × v) • w, we find:
But if a × w = b × w for every vector
w, then a = b, so
v × u = − u × v.
where sgn(π) = +1 if π can be produced from the identity permutation with an even number of switches of adjacent elements, and −1 if an odd number.
+
( u'1 det(v(1); w(1))
− u'2 det(v(2); w(2))
+ u'3 det(v(3); w(3)) )
=
v1u2w3
− v1u3w2
− v2u1w3
+ v2u3w1
+ v3u1w2
− v3u2w1
=
u2v1w3
− u3v1w2
− u1v2w3
+ u3v2w1
+ u1v3w2
− u2v3w1
=
−det(u, v, w),
The same argument shows det(u, v, v) = 0
and det(u, v, u) = 0.
1a. f(x,y) is not continuous at (0,0), since there is no way
to guarantee f(x,y) is close to f(0,0) = 0,
no matter how close (x,y) is to (0,0).
For example, f(t,t) = 1 no matter how small t is.
Furthermore, g(t) = f(ta,tb) is continuous only if (a,b) = (a,0) or (0,b);
however, any g(t) = f(ta,tb) 2ab⁄√(a2+b2) is constant outside t = 0,
so it has a removable discontinuity,
meaning g(t) would be continuous if we changed g(0) to the constant value.
However, there is no way to change f(0,0) to make f(x,y) continuous.
Since f(x,y) is not continuous at (0,0), it cannot be differentiable either.
Also, g'(0) = 0 for all t ≠ 0.
1b. f(x,y) is continuous at (0,0), and hence at (0,0) on every line (x,y) = (ta,tb).
Also, every g(t) = f(ta,tb) = (a(a2−3b2)⁄√(a2+b2)) t is a linear function of t, and has a derivative at t = 0.
However, f(x,y) is not differentiable at (0,0) since the radial tangent lines wave up and down too much, and do not fit into a tangent plane.
1c. f(x,y) is not continuous at (0,0) since f(t, ½t²) = 1 ≠ f(0,0) = 0, no matter how small t > 0; and hence f(x,y) is not differentiable at (0,0) either. However, g(t) = f(ta,tb) = 0 for |t| ≤ b⁄a , so g(t) = 0 in a neighborhood of t = 0 for any (a,b), and g(t) is both continuous and differentiable at t = 0 for any (a,b).
Hint: Given that limx→c |f(x) − f(c) − Dfc(x−c)|⁄|x−c| = 0, and similarly for g(x), you must show that the proposed derivative function is close to f(x)g(x), relative to |x − c|.
1. See Notes §4.
2. See Notes §8. Actually, this is probably too hard for an exercise.
3a. By the definition of derivative, we have:
This is equivalent to:
Answers to odd-numbered problems are in M&T pp 502−506.
1a. f(x,y) = (exy − 1)⁄y = x (exy − 1)⁄xy = x (et − 1)⁄t , where t = xy. Since limt→0 (et − 1)⁄t = 1 by L'Hopital's Rule, we have lim(x,y)→(0,0) f(x,y) = (0)(1) = 0.
1b. Again taking t = xy and using L'Hopital twice gives lim(x,y)→(0,0) f(x,y) = (cos(t) − 1)⁄t2 = −½.
1c. Consider f(t,0) = 0, and f(t3, t) = t6⁄2t6 = ½. Thus no matter how close (x,y) is to (0,0), there are values of f(x,y) equal to 0, and others equal to ½, so f(x,y) does not approach a single limit value.
1d. We take f(t,t) = 0, and f(t, −t) = (2t)2⁄2t2 = 2, so again f(x,y) does not approach a single limit value as (x,y) → (0,0).
2a. For f(x,y) = ex−y, we have: ∂f⁄∂x|(x,y)=(1,1) = 1, and ∂f⁄∂y|(x,y)=(1,1) = −1, so the affine approximation is:
and the tangent plane is: z = 1 + x − y.
2b. The equation of P can be written as: −x + y + z = 1, i.e. (x,y,z) • (−1, 1, 1) = 1. Thus q = (−1, 1, 1) is orthogonal to the plane. Indeed, any two points v, v' on the plane satisfy v • q = v' • q = 1 so the vector v −v' between them, a vector parallel to the plane, has (v−v') • q = 1−1 = 0.
3. For (x,y,z) = (4.01, 3.98, 2.02) near (4,4,2), we have the affine linear approximation f(x,y,z) ≈ f(4,4,2) + ∇f(4,4,2) • (x−4, y−4, z−2) ≈ 6 + (2⁄3, 2⁄3, 1⁄3) • (x−4, y−4, z−2) = 6 + 2⁄3(x−4) + 2⁄3(y−4) + 1⁄3(z−4) = 6 + 2⁄3(0.01) + 2⁄3(−0.02) + 1⁄3(0.02).
4. The curve c(t) = et (cos(t), sin(t)) has derivative (velocity vector, tangent vector) given by the product rule:
and we have |c(t)| = et, and |c'(t)| = et √2. A constant angle between c(t) and c'(t) would mean a constant dot product between the corresponding unit vectors:
Thus θcc' = ±π⁄4 .
The moth would go inward along the path
c(−t) with tangent −c'(−t), and it would see the flame at the
origin in the direction −c(−t),
which would give the same constant angle
π⁄4 .
5. For a function T(x,y) and a curve c(t) = (x(t), y(t)), the Chain Rule says:
Plugging in the given functions to this formula gives the same result as first plugging in to get f(t) = T(c(t)), and then using the one-variable Chain Rule to compute f '(t).
6. In the graph, z = f(x,y), the gradient ∇f(a,b) in the xy-plane points in the uphill direction from (a,b), which is always orthogonal to the contour curve (level curve) C = {(x,y) with f(x,y) = f(a,b)}. Thus, the orthogonal vector ∇f(a,b)⊥ points along the C at (a,b). Note: ∇f(a,b) has a unique orthogonal direction because it is a vector in the plane R2. (A three-dimensional vector has a whole plane of orthogonal vectors.)
7a. The points (x,y,z) with x2 + y2 + z2 = d form the sphere with radius √d from the origin. Squeezing these points to (x, y⁄√2, z⁄√3) gives the solutions to f(x,y,z) = d, namely the level surface S. Thus S is an ellipsoid: a sphere flattened by a factor of 1⁄√2 in the y direction, and by 1⁄√3 in the z direction.
7b. The gradient ∇f(a,b,c) ∈ R3 points in the direction of fastest increase of f(x,y,z), starting from (a,b,c). This is orthogonal to the directions of no increase, namely the tangents to the level surface. (Thinking of f(x,y,z) as temperature, the gradient points in the hottest direction, which is orthogonal to the directions of constant temperature.)
The plane orthogonal to q and passing through the point p is given by v = (x,y,z) satisfying q • (v−p) = 0, or equivalently q • v = q • p. Thus, the equation of the tangent plane to S is given by:
2ax + 4by + 6cz = 2a2 + 4b2 + 6c2.
For (a,b,c) = (1,0,2), this is 2x + 12z = 26.
8. Theorem: If a curve c(t) = (x(t),y(t),z(t)) goes along a level surface of the function f(x,y,z), then the tangent c'(t) is orthogonal to the gradient ∇f(c(t)).
Proof: By hypothesis, we assume f(c(t)) = a for all t and some constant a. Differentiating the left side by the multivariable Chain Rule gives:
∇f(c(t)) • c'(t) = 0.
This confirms that the gradient is orthogonal to the tangent, as desired.
a. |
| b. |
| c. |
| d. |
| e. |
|
1a. Since ℓ : R2 → R1, its matrix [ℓ] has 1 row, 2 columns. We have ℓ(1,0) = 2, ℓ(0,1) = 3, so [ℓ] = [ℓ(i) | ℓ(j)] = [2,3]. To get back the function, we perform matrix multiplication: [ℓ(v)] = [ℓ]•[v] = [2,3]•[x,y]T = 2x + 3y, which is the same as dot product of the row (2,3) with the vector v = (x,y).
1b. A linear mapping ℓ : R1 → R2 has a matrix [ℓ] with 2 rows, 1 column. The most general such mapping, corresponding to a column matrix [a,b]T, is ℓ(x) = (ax, bx).
1c. A linear mapping ℓ : R → R is just ℓ(x) = ax, with matrix [a].
2. The matrix of ℓ : Rn → Rn has n rows, n columns, given by [ℓ(e1) | ... | ℓ(en)]. Since ℓ(ei) = aei , this means every entry of [ℓ] is zero, except the entry in the ith row, ith column is a. This is called a diagonal matrix.
3. Matrices (a), (b), (c) all give mappings ℓ : R2 → R2.
3a. ℓ switches the basis vectors i and j in the plane, and ℓ(x,y) = (y,x). Geometrically, this is a reflection across the line y = x.
3b. ℓ stretches each basis vector by a factor of 3, and ℓ(x,y) = (3x, 3y), so ℓ stretches every vector the same way: ℓ(v) = 3v. This is a dilation.
3c. ℓ(i) = i and ℓ(j) = (1,1) = i + j. The mapping squeezes the rectangular grid into the lopsided grid with lines parallel to i and i + j.
3d. A matrix with 3 rows, 2 columns gives ℓ : R2 → R3, and matrix multiplication gives ℓ(x,y) = x(1,0,1) + y(1,1,0). You can picture this as sending the grid in R2 to a plane in R3 with grid lines parallel to (1,0,1) and (1,1,0).
3e. We have ℓ : R3 → R3 with matrix [ℓ] = [j | k | i], so ℓ(i) = j , ℓ(j) = k , ℓ(k) = i , and matrix multiplication gives ℓ(x,y,z) = (z,x,y). Picturing what this does to the triangle with corners at i, j, k, we see this is a ⅓ rotation around the axis (1,1,1) going through the center of the triangle.
4a. See M&T pp 54−58, or this website.
4b. [ Ds(ρ,θ,φ) ] is:
sin(φ) cos(θ) | −ρ sin(φ) sin(θ) | ρ cos(φ) cos(θ) |
sin(φ) sin(θ) | ρ sin(φ) cos(θ) | ρ cos(φ) sin(θ) |
cos(φ) | 0 | −ρ sin(φ) |
Plugging in (ρ,θ,φ) = (2, π, ⅓π)
gives:
−√3⁄2 | 0 | −1 |
0 | −√3 | 0 |
1⁄2 | 0 | −√3 |
4c. Multiplying [ Ds(x,y,z) ] •
[ρ, θ, φ]T:
−√3⁄2 | 0 | −1 | ρ | −√3⁄2 ρ + (−1) φ | ||
0 | −√3 | 0 | • | θ | = | −√3 θ |
1⁄2 | 0 | −√3 | φ | 1⁄2 ρ + (−√3) φ |
Examine the graphs of a couple of the possible compositions, and try to make sense of them. Check the Chain Rule in by evaluating the composition directly and finding its derivative; then computing the product of Jacobian matrices, and comparing.
<--
H = |
| , |
-->
q(x,y) = [x,y] • |
| • [x,y]T = ax2 + 2bxy + cy2, |
1. See M&T p. 156, Example 6.
3. See p. 509. Correction for #19: For f(x,y) = 1⁄3 x3 + 1⁄3 y3 − 1⁄2 x2 − 5⁄2 y2 + 6y + 10, the critical points are classified as:
4. See p. 510.
5. See p. 175.
1a. The critical points are defined by ∇f(x,y) = (2x+y+1, x+2y+1) = (0,0). Solving this linear system of equations gives (x,y) = (−⅓, −⅓). The Hessian is:
Hf(x,y) = |
| , |
1b. We solve ∇f(x,y) = λ ∇g(x,y) and g(x,y) = 1. That is:
Clearly λ = 0 is not a solution, since this would mean ∇f(x,y) = (0,0),
giving the critical point we found in part (a),
which does not lie on x2 + y2 = 1.
Thus, we may divide the first two equations, giving
(2x+y+1)⁄x+2y+1
=
x⁄y .
Cross-multiplying gives y2 − x2 + y − x = 0,
which factors as (y−x)(y+x+1) = 0, giving solutions y = x or y = −x−1.
If y = x, the equation x2 + y2 = 1 gives
(x,y) =
±(1⁄√2, 1⁄√2) .
If y = −x−1, the equation x2 + y2 = 1 gives
(x,y) = (0, −1) or (−1, 0)
Evaluating f(x,y) on these 4 critical points gives:
1c. To find the critical points of:
1d. We show how minimizing f(x,y) over
the parametrized constraint curve c(t) = (cos(t), sin(t))
leads to the Lagrange Multiplier method,
in which we consider the curve as a contour
of g(x,y) = x2 + y2.
To minimize h(t) = f(c(t)), we solve
0 = h'(t) = ∇f(c(t)) • c'(t).
For a solution t = a with c(a) = (x0,y0),
we have ∇f(x0,y0) orthogonal
to c'(a), the tangent direction of
the circle S at (x0,y0).
On the other hand, the circle is defined by
g(x,y) = x2 + y2 = 1,
so we have g(c(t)) = 1. Differentiating
this and using the Chain Rule gives:
∇g(c(t)) • c'(t) for all t,
including t = a, so that ∇g(x0,y0)
is also orthogonal to the tangent direction c'(a).
Since both ∇f(x0,y0)
and ∇g(x0,y0)
are orthogonal to the same direction in R2,
they must be parallel.
1e. Evaluate f(x,y) both at the critical points found in part (a),
and at the constrained extrema found in part (b) or (c).
The largest/smallest of these gives the absolute max/min
of f(x,y) within the disk.
This is the two-dimensional
version of finding the max/min of a single-variable
function f(x) on an interval x ∈ [a,b]
by finding critical points of f(x) inside the interval,
then comparing to the endpoint values f(a), f(b).
Answer: the absolute minimum is at the interior
critical point (−⅓, −⅓);
the absolute maximum is at the boundary point
(1⁄√2, 1⁄√2).
2. Letting x,y,z be the side-lengths of the rectangular bin missing the top and front, the volume is g(x,y,z) = xyz = 1, with ∇g(x,y,z) = (yz, xz, xy). The surface area of the four sides is f(x,y,z) = xy + yz + 2xz, with ∇f(x,y,z) = (y+2z, x+z, y+2x). The Lagrange Multiplier method gives equations ∇f(x,y,z) = λ ∇g(x,y,z) and g(x,y,z) = 1, solved for variables x,y,z > 0, and λ:
1a. Subtracting any two points on the plane P gives a direction along (parallel to) the plane, for example c− a = (2,1,1) and c− b = (1,1,2). To get to any point on the plane, move in these two directions from a base point c:
1b. A cross product u×w has direction orthogonal
to u and w, and length
|u| |w| sin θuw.
Thus, we get a normal vector, orthogonal to P,
as: n = (2,1,1) × (1,1,2) = (1,−3,1).
Now, for any point v = (x,y,z) on P, the direction
from c to v is orthogonal to n:
1c. The area of a triangle with side lengths u,w enclosing angle θ is ½uw sin θ. Our triangle has side vectors u = c−a and w = c−b, so the area is half the length of the cross product n = u×w from in part (b): area(△abc) = ½|n| = |(1,−3,1)| = ½√11.
1d. To get a vector m from the origin directly to the plane P, take any vector from the origin to P, such as c, and project it to the normal direction n orthogonal to P. The length of this projection is c dotted with the unit normal vector n⁄|n|:
1e. We can think of the tetrahedron as a cone (or pyramid) whose base is △abc and whose altitude is the vector m. The volume of any cone is ⅓ the base area times the height, which we know from parts (c) and (d), so the volume of our tetrahedron is: 1⁄3(√11⁄2)(2⁄√11) = 1⁄3 .
1f. The signed volume of the parallelpiped with edge vectors a, b, c is the determinant of the matrix of these three vectors (written as columns or rows: det(0,1,1; 1,1,0; 2,2,2) = −2, so the absolute volume is 2. The volume of a tetrahedron is 1⁄6 of this, namely 1⁄3 as before.
2a. The vector F(x,y) = (2x,1) depends only on x, and gives the same vector everywhere on a vertical line y = c: a vector pointing upward and away from the y-axis. Wolfram Alpha gives:
First step. Assume F = ∇f for some potential function f. Gradient Theorem: The line integral of the rate of increase of f along a curve is the total increase along the curve: ∮ ∇f(c) • dc = f(c(1)) − f(c(0)). We can then compute f(a,b) by taking the line integral of ∇f(x,y) from a base point (0,0) to (a,b). This line integral depends only on the endpoints (0,0) and (a,b), and is independent of how c(t) travels between them.
Second step. Suppose F is path indepenent. Given two paths c1, c2 between two points, we can follow the first path out and the second one back to get a closed curve c. Since the line integrals along c1 and c2 are equal, their difference, the line integral around c, is zero. That is, the vector field is irrotational, having zero line integral around any closed curve.
Third step. Curl Theorem: The double integral of the rate of rotation of F over a region R is equal to the total rotation of F around the boundary of R. If F is irrotational, having zero total rotation around any closed curve, this means the rate of rotation at every point must be zero: curl F(x,y) = 0 everywhere.
2d. See HW 1/20 #1. For F(x,y) = (p(x,y), q(x,y)), the curl F(x,y) = ∂q⁄∂x − ∂p⁄∂y measures how strongly a small paddlewheel at (x,y) would be turned by a fluid flowing with velocity F. A left-to-right increase in the vertical component of F leads to a positive (couterclockwise) curl, hence the term ∂q⁄∂x; a bottom-to-top increase in the horizontal component of F leads to a negative (clockwise) curl, hence the term −∂p⁄∂y.
2e. curl F = ∂⁄∂x(1) − ∂⁄∂y(2x) = 0. Thus F is irrotational, and must have a potential function f.
2f. We compute the potential function f(x,y) at (x,y) = (a,b) as the line integral of F(x,y) = (2x, 1) along the straight-line path c(t) = (at, bt) for 0 ≤ t ≤ 1:
= ∮01 F(at, bt) • (at, bt)' dt = ∫01 (2at,1) • (a,b) dt = ∫01 2a2t + b dt = a2 + b.
3a. The quarter-turn ℓ0 takes i to j = (0,1), and j to −i = (−1,0). The corresponding matrix of column-vectors is:
[ ℓ0 ]
=
|
3b. The z-axis quarter-rotation takes i to j, j to −i, and k to itself. These output vectors are the column vectors of the matrix; and similarly for the x-axis quarter-rotation. (Note: we are using the right-handed rotation, in which the thumb points along the positive axis of rotation, and the fingers curl in the rotation direction.)
[ ℓz ]
=
| [ ℓx ]
=
|
3c. Multiplying the above gives the matrix of composite linear mapping ℓz(ℓx(v)):
[ ℓz ∘ ℓx ]
=
|
3d. The columns of the above matrix are the outputs of the coordinate vectors: the mapping takes i → j → k → i. That is, the equilateral triangle with corners at i,j,k is rotated around its center ⅓(i+j+k) = (⅓,⅓,⅓). This means the linear mapping is a 120° rotation around the axis (⅓,⅓,⅓), or equivalently the axis (1,1,1). Note that composing in the other order, ℓx ∘ ℓz, gives a 120° rotation around a different axis.
4a. The Jacobian at (x,y) = (½, 0) of:
[ Df(½, 0) ]
=
| = |
|
4b. The point (x,y) = (½, 0) is on the line between 0 and i. We can predict the first column of the Jacobian by seeing that moving along the x-axis will have exactly opposite effects on the distances to 0 and i, which accounts for the +1 and −1; while it increases the distance to j at a positive rate, accounting for the bottom entry. As for the second column, moving vertically will affect the distances to 0 and i only tangentially (quadratically), so the first two rows are zero; while the last row is negative, since the vertical motion decreases the distance to j.
4c. The derivative of g(x,y,z) = x+y+z is the row-vector gradient: ∇g(x,y,z) = [1 1 1]. Multiplying a 1×3 matrix by a 3×2 matrix gives a 1×2 matrix, whch is the gradient of g(f(x,y)).
1a. Considering a derivative operator like ∂⁄∂x as a "scalar", we can "multiply" it by a scalar function f(x,y,z) to give the partial derivative ∂⁄∂x f(x,y,z). Then taking the dot product of the "vector" ∇ = (∂⁄∂x , ∂⁄∂y , ∂⁄∂z) with a vector field F = (F1, F2, F3) gives the formula for the divergence:
1b. We have curl F = curl (F1(x,y), F2(x,y), 0) = (0, 0, ∂⁄∂x F2 − ∂⁄∂y F1) = (curl (F1, F2)) k. That is, the axis of rotation is vertical, and the strength of the rotation at (x,y,z) is the same as the strength of the two-dimensional vector field at (x,y).
2a. We can compute divergence this by taking partial derivatives of each component in:
2c. To find when div g(r) v = r g'(r) + 3 g(r) = 0, we must solve the separable differential equation: r dg⁄dr = −3g. That is, ∫ dg⁄g = −3 ∫ dr⁄r , which means log(g) = −3 log(r) + C, so that g(r) = exp(−3 log(r) + C) = c⁄r3 . Thus F(v) = c⁄r3 v, a radial vector field with magnitude proportional to the inverse-square of the radius.
2d. As in part (a), we can compute component by component, or use p. 255 #10 to get:
3. The formula div curl F = ∇ • (∇ × F) = 0 follows from the same algebra that proves u • (u × v) = 0 for any vectors u, v ∈ R3. You can check it by brute-force computation.
4. See p. 220−221.
5. The arclength is: ∫ |c'(t)| dt = ∫02π √(x'(t)2+ y'(t)2+z'(t)2) dt = ∫02π √(sin2(t)+ cos2(t)+1) dt = 2π√2.
1. See the identities on p. 255.
1a. curl grad f = (0,0,0) for any function, which you can check from the definitions, coordinate-by-coordinate.
1b. div curl F = 0.
1c. div grad f = ∂2f⁄∂x2 + ∂2f⁄∂y2 + ∂2f⁄∂z2 .
2a. See picture on p. 270 #7. The wedge is defined by x2 + y2 ≤ r and 0 ≤ z ≤ mx, where m = tan θ. Writing the xy-shadow in y-simple form: -r ≤ x ≤ r, √−(r2−x2) ≤ y ≤ (r2−x2); and also 0 ≤ z ≤ mx. The volume integral becomes:
The inner integral is the area of the rectangular vertical cross-section of the solid in the plane y = c, where c is a fixed value of x.
3. See notes on More Integral Applications, last page, from Math 132.
1. Repeating the definition of center of gravity as the average value of each coordinate x,y, weighted by mass = density × length along the curve c(t), we get:
where L = ∫ δ(t) |c'(t)| dt is the arclength.
2a. The center of gravity of the three corners is the average of their vectors: ⅓((0,0) + (a, 0) + (b1, b2)) = (⅓a + ⅓b1, ⅓b2).
2b. The triangular region is bounded by lines y = 0, y = b2⁄b1 x, y = b2⁄(b1−a) (x−a). This can be split into two adjacent y-simple triangles corresponding to the two different floor-lines (for b1 > a) or ceiling-lines (for b1 < a). However, since one side of the triangle is the x-axis, the whole triangle is an x-simple region: 0 ≤ y ≤ b2, b1⁄b2 y ≤ x ≤ (b1−a)⁄b2 y + a. The double integrals 1⁄A (∬R x dA, ∬R y dA), where the area is A = ½ab2, give the same answer as in part (a).
2c. Splitting the line integral over the three line segments gives:
which is definitely different from parts (a), (b).
| = [T] • |
|
1. See M&T p. 332.
2a. See p. 325.
2b. S has an edge-curve with z2 = r2, and r2 + z2 = 1, hence r = z = 1⁄√2. Thus S is defined by: −1⁄√2 ≤ x ≤ 1⁄√2, −√(1⁄2−x2) ≤ y ≤ √(1⁄2−x2), √(x2+y2) ≤ z ≤ √(1−x2−y2). Under spherical coordinates, this pulls back to the box S* defined by 0 ≤ ρ ≤ 1, 0 ≤ θ ≤ 2π, 0 ≤ φ ≤ π⁄4. Hence the volume is:
3a. [T] = |
|
3b. B* is the triangle with vertices (s,t) = (0,0), (−1,1), (1,1), given by 0 ≤ t ≤ 1, −t ≤ s ≤ t.
3c. (x,y) = (−½s + ½t , ½s + ½t). [U] = [T]−1 = |
|
This is indeed the inverse of [T], since: [U] • [T] = |
| • |
| = |
| . |
3d,e. We have U(B*) = B, one-to-one and onto. Since U is linear, it is its own derivative at every point: DU(s,t) = U, and |∂(x,y)⁄∂(s,t)| = det[U] = ½ . Also y−x = s, y+x = t. Thus:
5a. F = (−y, x, 1)
is the velocity field of a
right-handed rotation of space around the z-axis
(i.e. right thumb along the z-axis, fingers
curling toward the rotation).
The velocity grows in proportion to the xy-radius.
The divergence is the net outflow from
a small region around a given point,
per unit volume enclosed.
Taking the region to be a small
cylindrical-coordinate box, the inflow on one side
equals the outflow on the other, and div F = 0.
The curl is the right-handed
axis of the rotation induced on a small
paddlewheel floating in the flow of F
near a given point,
with magnitude equal to the force of rotation.
This is clearly along the z-direction, with
positive magnitude.
5b. The product f F = f(z)(−y, x, 0) scales the speed of rotation for each horizontal plane. Thus, if f(z) = z, we could picture f F as a tornado, rotating slowly close to the ground, and faster and faster high up. (Below the xy-plane, the rotation would be reversed.) This would still have zero divergence, but the curl would be complicated by the shearing forces between horizontal layers, which depend on ∂f⁄∂z.
5c.
curl F = ∇ × F = det
i | j | k |
∂⁄∂x | ∂⁄∂y | ∂⁄∂z |
−y | x | 0 |
These are what we guessed.
5d. Any product formula is of the form: Δ(f∗g) = (Δf)∗g + f∗(Δg), where each Δ is some type of derivative and each ∗ some type of product. In our cases, there is only one type of derivative and one type of product which will make sense at each place in the formula, keeping in mind that: gradient takes a scalar function to a vector field, div takes a vector field to a scalar function, and curl takes a vector field to a vector field:
curl f F
=
∇ × (f F)
=
∇f × F
+ (f) ∇×F
=
(0, 0, ∂f⁄∂z) × (−y, x, 0) +
f(z) (0, 0, 2)
=
(−x ∂f⁄∂z ,
−y ∂f⁄∂z ,
2 f(z)).
16-6 #13. The solid bounded by planes y + z = 4, y = 0, z = 0,
and the parabolic cylinder y = 4−x2.
First examine the boundary surfaces involving only x,y: the
back wall y = 0, and the curving front wall y = 4−x2.
Next consider the roughly horizontal surfaces involving z:
the floor z = 0 and the slanted ceiling z = 4−x.
16-6 #19. The solid bounded by planes x + y + z = 2 and x = 0,
and the parabolic cylinders y = z2 and z = y2.
First examine the boundary surfaces involving only y,z: the
parabolic cylinders cutting each other in a squashed cylinder.
Next consider the surfaces involving x:
x = 0 and x = 2 − y − z, which cut off the ends of
the squashed cylinder.
16-9 #12(a). The equation x2 + y2 = 2ax is equivalent to (x−a)2 + y2 = a2, a circle centered at (a,0) with radius a, so that the circle contains the origin. The first two circles have centers (1,0) and (3,0), and similarly the second two have centers (0,1) and (0,4).
12(b). The line u = c is parametrized by (c,t) for t ∈ R, which corresponds to (x,y) = T(c,t). It is easy to verify that x = 2c⁄(c2+t2) and y = 2t⁄(c2+t2) satisfy x2 + y2 = 2⁄c x, which is equivalent to the circle equation. Similarly for v = c.
12(c). From the picture, we see R = { (x,y) = T(u,v) for 1⁄3 ≤ u ≤ 1 , 1⁄4 ≤ v ≤ 1 }
12(d). You can do this with Wolfram Alpha by pasting partial results into a scratch file, then pasting back into W|A. Result: the Jacobian determinant (streching factor) is |−4⁄(u2+v2)2| = 4⁄(u2+v2)2 .
12(e). The integral reduces to ∫1⅓ ∫1¼ (u2+v2)2 4⁄(u2+v2)2 dv du = 2.
12(f). Use W|A to solve the system of equations
x = 2u⁄(u2+v2)
and y = 2v⁄(u2+v2)
for the variables u,v. Keep in mind that the equations
are symmetric upon switching u,v and simultaneously switching x,y,
so a solution for u can be switched into a solution for v.
Note: This is also evident from the polar description
of T as an inversion across a circle of radius √2.
Furthermore, we can write T as a conjugate inversion
in the plane of complex numbers.
That is, we let (x,y) correspond to
the complex variable z = x + iy
with conjugate
z = x − iy;
then T(x,y) corresponds to the mapping
z ↦ 2⁄z
=
2z⁄zz.
= R (cos(θ), sin(θ), 0) + r cos(φ) (cos(θ), sin(θ), 0) + (0, 0, r sin(φ)).
[DT] = |
| . |
1a. Consider the torus parametrization:
1b. The Jacobian derivative matrix is:
[DT] = |
| . |
1c. The tangent vectors ∂T⁄∂θ , ∂T⁄∂φ of T(θ,φ) are the same as the corresponding derivatives of T(r,θ,φ): the last two columns of the Jacobian matrix in part (b). The normal vector is:
1d. The area of the torus is:
2. Ram's horn surface:
3a. The graph z = f(x,y) is parametrized by T(x,y) = (x, y, f(x,y)), giving stretching factor:
3b. The surface of revolution z = g(√(x2+y2) is parametrized by the cylindrical coordinate mapping T(r,θ) = (r cos(θ), r sin(θ), g(r)), giving stretching factor:
3c. The right circlular cone with height a and base radius b has a vertex-to-rim radius of ρ = √(a2+b2), and a rim circumference C = 2πb. We can unroll the lateral surface into a sector of a circle of radius ρ, with circular arc C. The area A of a sector is proportional to its circular arc, so C⁄2πρ = A⁄πρ2 . That is: A = πb√(a2+b2).
The cone is parametrized by T(r,θ) = (r cos(θ), r sin(θ), a⁄b r) for 0 ≤ r ≤ b and 0 ≤ θ ≤ 2π. The formula of part (b) becomes:
4b. For P(u,v) = (u cos(v), u sin(v), v) at (u,v) = (1,0), we have:
If we changed the parametrization to Q(u,v) = P(v,u), switching the roles of (u,v), we would get the opposite cross product ∂Q⁄∂u × ∂Q⁄∂v = (0,1,1) × (1,0,0) = (0,1,−1) pointing downward.
4c. The vector field F(x,y,z) = (0,y,1) is independent of x, and is illustrated by its picture in the yz-plane:
4d. The flux integral is:
This is consistent with our geometric estimate.
5. Examples are worked out in the text. Answers for odd-numbered exercises are on p. 526.
1a. From the picture of F(x,y) = (x+1, xy), we can see that curl(F) > 0 on the whole region. Also, F • c' is positive on the left, bottom, and right boundaries of R, while it is moderately negative on the top boundary (oriented right-to-left); thus the total rotation around the boundary should be positive.
Computing the left side of the Curl Theorem, with curl(F) = y − 0:
∮ F • dc | = | ∫−11 F(t,0) • (t,0)' dt + ∫02 F(1,t) • (1,t)' dt |
+ ∫1−1 F(t,1+t2) • (t,1+t2)' dt + ∫20 F(−1,t) • (−1,t)' dt | ||
= | ∫−11 (t+1,0) • (1,0) dt + ∫02 (2,t) • (0,1) dt | |
+ ∫1−1 (t+1,t+t3) • (1,2t) dt + ∫20 (0,−t) • (0,1) dt | ||
= | 2 + 2 − 62⁄15 + 2 = 28⁄15. |
1b. Split the region along the y-axis so that each half is doubly simple, with no vertical or horizontal bays or dents:
R2 = {(x,y) with −1 ≤ x ≤ 0, 0 ≤ y ≤ 1+x2} = {(x,y) with 0 ≤ y ≤ 2, −1 ≤ x ≤ −b(y)},
1c. For the region R1, and F = (P,Q) = (x+1, xy), the curl integral is:
∬R1 curl(F) dy dx | = | ∬R1 (∂Q⁄∂x − ∂P⁄∂y) dy dx | ||||||||||
= | ∫
y=0
2
∫
x=b(y)
1
∂⁄∂x(xy) dx dy
−
∫
x=0
1
∫
y=0
x2+1
∂⁄∂y(x+1) dy dx
=
| ∫
y=0
2
[xy]x=b(y)
x=1
dy
−
∫
x=0
1
[x+1]y=0
x2+1
dx
| =
| ∫
y=0
2
1y − b(y)y
dy
−
∫
x=0
1
(x+1) − (x+1) dx.
| =
| ∫
y=0
2
Q(1,y) − Q(b(y),y)
dy
−
∫
x=0
1
P(x,x2+1) − P(x,0) dx.
| |
∮ (P(c1),0) • dc1+ ∮ (0,Q(c)) • dc1 | = | ∫01 P(t,0) (t)' + ∫02 P(1,t) (1)' + ∫10 P(t,t2+1) (t)' dt + ∫10 P(0,t) (0)' dt |
+ ∫01 Q(t,0) (0)' + ∫02 Q(1,t) (t)' + ∫20 Q(t,b(t)) (t)' dt. |
1a. Answer
a. |
| b. |
| c. |
|
where the transpose operation T flips a 1×2 row matrix over to a 2×1 column matrix.
1. Matrices of the given linear mappings.
a. |
| b. |
| c. |
|
d. |
|
|
e. Product of matrices:
| • |
| = |
|
e. |
|
Equating with the entries in (e) gives:
cos(α+β) = cos(α)cos(β) − sin(α)sin(β) sin(α+β) = sin(α)cos(β) + cos(α)sin(β). This is the "real reason" for the angle-sum trig identities. |
2a.
The matrix of ℓ : R2
consists of columns given by the outputs of the function on the standard basis vectors of the input space R2, namely
ℓ(i) = ℓ(1,0) = 2 and ℓ(j) = ℓ(0,1) = 3, so
[ℓ] = [2,3].
To check this is correct, we compute:
[ℓ(v)] = [ℓ] • [v] = [2,3] • [x,y]T
= 2x + 3y. Note that the matrix product (row • column) is the same as the dot product of vectors (2,3) • (x,y).
2b. A general linear mapping L : R → R2 is of the form L(x) = (ax, bx), with each component a linear mapping R → R. Its matrix is the column given by the output of the standard basis vector e of the input space R, which is just the number v = 1; that is, [L] = [L(e)] = [L(1)] = [a,b]T.
2c. A general linear mapping R → R is ℓ(x) = ax, with 1×1 matrix [ℓ] = [a].
3a. Computing [L(x,y)] = [L] • [x,y]T, we get L(x,y) = (y,x), which is a reflection across the diagonal y = x. (This is the mapping which flips a graph y = f(x) over to the graph of the inverse function y = f−1(x).)
3b. L(x,y) = (3x, 3y), or L(v) = 3v, stretches each vector by a factor of 3.
3c. The shear mapping L(x,y) = (x, x+y) slants the unit square into the rhombus (diamond) with consecutive corners (0,0), (1,0), (1,2), (1,1). It does the same slanting to whole grid of R2. Although this mapping stretches and squashes angles, it does not distort area since the determinant of its matrix is 1.
Turn in:
<--
where cΔxΔy(t) is the curve which traces around a rectangle from (x,y) to (x+Δx, y) to (x+Δx, y+Δx) to (x, y+Δy) and back to (x,y).
Generalizing even more, the ∗ operation may combine two different kinds of objects (such as a scalar times a vector), and may produce yet another kind of object (such as the dot product of vectors producing a scalar). Thus, we might have three sets A, B, C with a ∗ b = c for a ∈ A, b ∈ B, c ∈ C.
Problem: For each of the following operations, state whether it obeys each property (i)−(v), and prove your answer.
Hints:
<--
-->
where we split the curve by sample points ci = c(i⁄n) going from the start point c0 = c(0) to the endpoint cn = c(1), and Δci = ci − ci−1 .
where Δt = 1⁄n. Taking the limit as n → ∞ gives:
which is the ordinary integral of the scalar function F(c(t)) • c'(t) over the interval t = 0 to t = 1.
Gradient Theorem: Suppose F = ∇f is the gradient vector field of a scalar function f(x,y), and c(t) is a parametrized curve between points c(0) and c(1). Then the line integral of the gradient field is the total change of f from the start to the end point of the curve:
Thus, if we are given the initial value f(0,0), we take the line integral over a curve from c(0) = (0,0) to c(1) = (a,b) to find the potential function:
Note that we may use any curve c(t) from (0,0) to (a,b).
= ∫01 (3t, 2t) • (2,3) dt = ∫01 12t dt, = 6t2 |1t=0 = 6.
Thus, if F = ∇f is the gradient field of some function f(x,y) with f(0,0) = 0, we must have f(2,3) = f(0,0) + ∮ F(c) • dc = 6.
Doing the same computation over a line c(t) = (at, bt) from (0,0) to (a,b), we end up with: f(a,b) = ab, so that f(x,y) = xy. Indeed, we check: ∇f(x,y) = (∂⁄∂x(xy), ∂⁄∂y(xy)) = (y,x).
It is easy to see that a potential function F = ∇f implies that F is irrotational, since the Gradient Theorem says that ∮ F(c) dc is the total change in the height of f from the beginning to the end of c, which is zero if c is a loop. Further, irrotational is equivalent to path independent, since a pair of paths from d to e is equivalent to the loop c formed by going from d to e along c1, then back along c2, and ∮ F(c) dc = ∮ F(c1) dc1 − ∮ F(c2) dc2 . The only unclear question is why a path-independent line integral always gives a valid potential function.
Geometrically, curl F measures the rate of rotation of F around a point (x,y) = (a,b), relative to the area enclosed in a small region around (a,b):
where the limit is over smaller and smaller regions R containing (a,b), and c is the counterclockwise boundary curve of R.
A small paddlewheel placed at (x,y) = (1, 0) would feel a stronger upward
push on its left side than on its right, and would turn clockwise,
so the field has negative curl at this point. At (x,y) = (−1, 0), it would turn counterclockwise, which means positive curl.
We compute:
curl F(x,y) =
∂⁄∂x(1⁄(1+x2))
−
∂⁄∂x(0)
=
−2x⁄(1+x2)2) ,
and we verify that curl F(1, 0) = −½,
and F(−1, 0) = ½.
Of course, we can also switch the roles of x and y.
= ∫1x=0
( y − x2y − 1⁄3 y3 |1−x
y=0
) dx
= ∫1x=0
(1−x) − x2(1−x) −
1⁄3(1−x)3 dx
= 1⁄6 .
Here the closed curve c must not cross itself, and it must go counterclockwise enclosing R, a region with no holes or points where curl F is undefined.
The intuitive explanation is that the rotation line integral on the right side can be split over a grid of subregions whose rotations cancel each other along their common boundaries, except along the outside boundary of R. As the grid gets finer and finer, each small line integral near (x,y) is approximated by curl F(x,y) dA, and their sum approaches the double integral on the left side.
= ∫10 ( −2xy⁄(1+x2)2 |1 y=0 ) dx = ∫10 −2x⁄(1+x2)2 dx = −1⁄2 .
The boundary of R consists of the line segments c1(t) = (t,0), c2(t) = (1,t), c3(t) = (1−t,1), c4(t) = (0,1−t) for 0 ≤ t ≤ 1; and the total rotation is the line integral:
In detail, one of these is:
To see how strongly the field F is flowing across the curve c at a given point, we take the the dot product of F(c(t)) • n(t). The flux is the line integral of this quantity:
We can also define it (and compute it numerically) as a Rienmann sum:
where we take sample points c1,...,cn on the curve c, and Δci = ci−ci−1 .
Geometrically, div F(a,b) measures the rate of outflow of F from a small region near (x,y) = (a,b), relative to the area of the region:
where the limit is over smaller and smaller regions R containing (a,b), and dn(t) = −dc(t)⊥ is the outward normal of the counterclockwise boundary curve c(t) of R.
A small box placed at (x,y) = (−1, 0) would
have a weaker inflow across its left boundary
and a stronger outflow across its right boundary:
this means positive net outflow, positive divergence.
(This is compatible with the incompressibility
of water because the river gets shallower
where it flows faster.)
We compute:
div F(x,y) =
∂⁄∂x(1⁄(1+x
Here the closed curve c must not cross itself, and it must go counterclockwise enclosing R, a region with no holes or points where div F is undefined.
The intuitive explanation is that the flux line integral on the right side can be split over a grid of subregions whose flux integrals cancel each other along their common boundaries, except along the outside boundary of R. As the grid gets finer and finer, each small line integral near (x,y) is approximated by div F(x,y) dA, and their sum approaches the double integral on the left side.
= ∫10 ( −2xy⁄(1+x2)2 |1 y=0 ) dx = ∫10 −2x⁄(1+x2)2 dx = −1⁄2 .
The boundary of R consists of the line segments c1(t) = (t,0), c2(t) = (1,t), c3(t) = (1−t,1), c4(t) = (0,1−t) for 0 ≤ t ≤ 1; and the total flux is the line integral:
In detail, one of these is:
det(v, u, w) = −det(u, v, w) = det(u, w, v)
det(su + s'u', v, w) = s det(u, v, w) + s' det(u', v, w).
u × v is orthogonal to u, v
u, v, (u × v) form a right-handed frame.
x1 |
⋮ |
xn |
L = [ ℓ1 | . . . | ℓn ] = |
| . |
x1 |
⋮ |
xn |
L • [v] = |
| • |
| = |
| . |
It is determined by the m×n matrix of its outputs on the standard basis vectors of Rn:
[Dfc] = [ ∂fi(c)⁄∂xj ] = [ ∂f⁄∂x1 | . . . | ∂f⁄∂xn ] = |
| . |
That is, we multiply matrix [ k ] by each of the column-vectors of [ ℓ ].
That is, we multiply K by each column vector of L. To compute this most conveniently, we take all dot products between the rows of K and the columns of L:
K • L = |
| • |
| = |
| . |
In terms of Jacobian matrices, this means: