Rafael Harth comments on Rafael Harth’s Shortform

Rafael Harth 20 Oct 2020 8:17 UTC
10 points
Yesterday, I spent some time thinking about how, if you have a function $f : R^{2} \to R$ and some point $x \in R^{2}$ , the value of the directional derivative from $x$ could change as a function of the angle. I.e., what does the function $ϕ : [0, 2 π] \to R_{+}$ look like? I thought that any relationship was probably possible as long as it has the property that $ϕ (α) = - ϕ (2 π - α)$ . (The values of the derivative in two opposite directions need to be negatives of each other.)

Anyone reading this is hopefully better at Analysis than I am and realized that there is, in fact, no freedom at all because each directional derivative is entirely determined by the gradient through the equation $\nabla_{v} f (x) = ⟨ \nabla f (x), v_{N} ⟩$ (where $v_{N} = \frac{v}{| | v | |}$ ). This means that $ϕ$ has to be the cosine function scaled by $| | \nabla_{v} f (x) | |$ , it cannot be anything else.

I clearly failed to internalize what this equation means when I first heard it because I found it super surprising that the gradient determines the value of every directional derivative. Like, really? It’s impossible to have more than exactly two directions with equally large derivatives unless the function is constant? It’s impossible to turn 90 degree from the direction of the gradient and having anything but derivative 0 in that direction? I’m not asking that $ϕ$ be discontinuous, only that it not be precisely $| | \nabla f (α) | | cos (α)$ . But alas.

This also made me realize that $cos$ if viewed as a function of the circle is just the dot product with the standard vector, i.e.,

$cos : S^{2} \to [- 1, + 1] cos : x \mapsto ⟨ x, (1, 0) ⟩$

or even just $cos (x, y) = x$ . Similarly, $sin (x, y) = y$ .

I know what you’re thinking; you need $sin$ and $cos$ to map $[0, 2 π]$ to $S^{2}$ in the first place. But the circle seems like a good deal more fundamental than those two functions. Wouldn’t it make more sense to introduce trigonometry in terms of ‘how do we wrap $R$ around $S^{2}$ ?’. The function that does this is $γ (x) = (cos (x), sin (x))$ , and then you can study the properties that this function needs to have and eventually call the coordinates $cos$ and $sin$ . This feels like a way better motivation than putting a right triangle onto the unit circle for some reason, which is how I always see the topic introduced (and how I’ve introduced it myself).

Looking further at the analogy with the gradient, this also suggests that there is a natural extension of $cos$ to $S^{n}$ for all $n \in N$ . I.e., if we look at some point $x \in R^{n}$ , we can again ask about the function $ϕ$ that maps each angle to the value of the directional derivative on $x$ in that direction, and if we associate these angles with points of $S^{n - 1}$ , then this yields the function $ϕ : S^{n - 1} \to R$ , which is again just the dot product with $(1, . . ., 0)$ or the projection onto the first coordinate (scaled by $| | \nabla f (x) | |$ ). This can then be considered a higher-dimensional $cos$ function.

There’s also the 0-d case where $S^{0} = {1, - 1}$ . This describes how the direction changes the derivative for a function $f : R \to R$ .
- Zack_M_Davis 20 Oct 2020 20:04 UTC
  5 points
  Parent
  
  I found it super surprising that the gradient determines the value of every directional derivative. Like, really?
  
  When reading this comment, I was surprised for a moment, too, but now that you mention it—it’s because if the function is smooth at the point where you’re taking the directional derivative, then it has to locally resemble a plane, just like a how a differentiable function of a single variable is said to be “locally linear”. If the directional derivative varied in any other way, then the surface would have to have a “crinkle” at that point and it wouldn’t be differentiable. Right?
  - Rafael Harth 21 Oct 2020 15:54 UTC
    2 points
    Parent
    That’s probably right.
    
    I have since learned that there are functions which do have all partial derivatives at a point but are not smooth. Wikipedia’s example is $f (x, y) = \frac{y^{3}}{x^{2} + y^{2}}$ with $f (0, 0) = 0$ . And in this case, there is still a continuous function $ϕ : S^{2} \to R$ that maps each point to the value of the directional derivative, but it’s $ϕ (x, y) = y^{3}$ , so different from the regular case.
    
    So you can probably have all kinds of relationships between direction and {value of derivative in that direction}, but the class of smooth functions have a fixed relationship. It still feels surprising that ‘most’ functions we work with just happen to be smooth.