Um. If you want to convince a mathematician like Terry Tao to be interested in AI alignment, you will need to present yourself as a reasonably competent mathematician or related expert and actually formulate an AI problem in such a way that someone like Terry Tao would find it interesting. If you yourself are not interested in the problem, then Terry Tao will not be interested in it either.
Terry Tao is interested in random matrix theory (he wrote the book on it), and random matrix theory is somewhat related to my approach to AI interpretability and alignment. If you are going to send these problems to a mathematician, please inform me before you do so.
My approach to alignment: Given matrices $A_1,\dots,A_r;B_1,\dots,B_r$, define a superoperator $\Gamma(A_1,\dots,A_r;B_1,\dots,B_r)$ by setting
$\Gamma(A_1,\dots,A_r;B_1,\dots,B_r)(X)=\sum_{k=1}^{r}A_kXB_k^{*}$, and define $\Phi(A_1,\dots,A_r)=\Gamma(A_1,\dots,A_r;A_1,\dots,A_r)$. Define the $L_2$-spectral radius of $A_1,\dots,A_r$ as $\rho_2(A_1,\dots,A_r)=\rho(\Phi(A_1,\dots,A_r))^{1/2}$. Here, $\rho(A)=\lim_{n\to\infty}\|A^n\|^{1/n}$ is the usual spectral radius.
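As a sanity check, these definitions can be computed directly in NumPy by representing $\Phi(A_1,\dots,A_r)$ as a matrix acting on vectorized inputs. This is a minimal sketch (the helper names are my own, not standard):

```python
import numpy as np

def phi_matrix(As):
    # Matrix of Phi(A_1,...,A_r): X -> sum_k A_k X A_k^*, acting on vec(X),
    # via the identity vec(A X B*) = (conj(B) kron A) vec(X) (column-major vec).
    return sum(np.kron(A.conj(), A) for A in As)

def spectral_radius(M):
    # rho(M) = lim ||M^n||^(1/n) = largest eigenvalue magnitude
    return max(abs(np.linalg.eigvals(M)))

def rho_2(As):
    # L2-spectral radius: rho_2(A_1,...,A_r) = rho(Phi(A_1,...,A_r))^(1/2)
    return spectral_radius(phi_matrix(As)) ** 0.5
```

For a single matrix ($r=1$), $\rho_2$ reduces to the ordinary spectral radius, e.g. $\rho_2(\mathrm{diag}(3,1))=3$, which is a quick way to check the vectorization convention.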
Define $\rho_{2,d}^{K}(A_1,\dots,A_r)=\max\left\{\frac{\rho(\Gamma(A_1,\dots,A_r;X_1,\dots,X_r))}{\rho_2(X_1,\dots,X_r)}:X_1,\dots,X_r\in M_d(K)\right\}$. Here, $K$ is either the field of real numbers, the field of complex numbers, or the division ring of quaternions.
Given matrices $A_1,\dots,A_r;B_1,\dots,B_r$, define
$\|(A_1,\dots,A_r)\simeq(B_1,\dots,B_r)\|=\frac{\rho(\Gamma(A_1,\dots,A_r;B_1,\dots,B_r))}{\rho_2(A_1,\dots,A_r)\rho_2(B_1,\dots,B_r)}$. The value $\|(A_1,\dots,A_r)\simeq(B_1,\dots,B_r)\|$ is always a real number in the interval $[0,1]$ that measures how jointly similar the tuples $(A_1,\dots,A_r),(B_1,\dots,B_r)$ are. The motivation behind $\rho_{2,d}^{K}(A_1,\dots,A_r)$ is that the ratio $\frac{\rho_{2,d}^{K}(A_1,\dots,A_r)}{\rho_2(A_1,\dots,A_r)}$ is always a real number in $[0,1]$ (except when the denominator is zero) that measures how well $A_1,\dots,A_r$ can be approximated by $d\times d$-matrices. Informally, this ratio measures how random $A_1,\dots,A_r$ are: the more random the tuple, the harder it is to approximate in dimension $d$, so a lower value of the ratio indicates a higher degree of randomness.
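Using the same vectorization trick, the similarity $\|(A_1,\dots,A_r)\simeq(B_1,\dots,B_r)\|$ can be computed numerically. A minimal self-contained NumPy sketch (helper names are mine; the small helpers are restated so the snippet stands alone):

```python
import numpy as np

def gamma_matrix(As, Bs):
    # Matrix of Gamma(A_1..A_r; B_1..B_r) acting on vec(X), using
    # vec(A X B*) = (conj(B) kron A) vec(X) with column-major vectorization.
    return sum(np.kron(B.conj(), A) for A, B in zip(As, Bs))

def spectral_radius(M):
    return max(abs(np.linalg.eigvals(M)))

def rho_2(As):
    # rho_2(A_1..A_r) = rho(Gamma(A;A))^(1/2)
    return spectral_radius(gamma_matrix(As, As)) ** 0.5

def similarity(As, Bs):
    # ||(A_1..A_r) similar-to (B_1..B_r)|| = rho(Gamma(A;B)) / (rho_2(A) rho_2(B))
    return spectral_radius(gamma_matrix(As, Bs)) / (rho_2(As) * rho_2(Bs))
```

A quick consistency check: since $\Gamma(A_1,\dots,A_r;A_1,\dots,A_r)=\Phi(A_1,\dots,A_r)$, a tuple compared with itself should give similarity exactly $1$.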
A better theoretical understanding of $\rho_{2,d}^{K}(A_1,\dots,A_r)$ would be great. If $X_1,\dots,X_r\in M_d(K)$ and $\frac{\rho(\Gamma(A_1,\dots,A_r;X_1,\dots,X_r))}{\rho_2(X_1,\dots,X_r)}$ is locally maximized, then we say that $(X_1,\dots,X_r)$ is an LSRDR of $(A_1,\dots,A_r)$. Said differently, $(X_1,\dots,X_r)$ with $X_1,\dots,X_r\in M_d(K)$ is an LSRDR of $(A_1,\dots,A_r)$ if the similarity $\|(A_1,\dots,A_r)\simeq(X_1,\dots,X_r)\|$ is locally maximized.
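To make the optimization concrete, here is a brute-force sketch of the simplest case $K=\mathbb{R}$, $d=1$, $r=2$: the $X_k$ are then real scalars $x_k$, the superoperator $\Gamma(A_1,A_2;x_1,x_2)$ is just the matrix $x_1A_1+x_2A_2$, and $\rho_2(x_1,x_2)$ is the Euclidean norm of $(x_1,x_2)$, so by homogeneity it suffices to scan the unit circle. (The function name and grid size are my choices; realistic $d>1$ cases would need gradient ascent instead.)

```python
import numpy as np

def spectral_radius(M):
    return max(abs(np.linalg.eigvals(M)))

def rho_R_2_1(A1, A2, steps=4000):
    # Grid search for rho^R_{2,1}(A_1, A_2): maximize
    # rho(x_1 A_1 + x_2 A_2) / ||(x_1, x_2)||
    # over the unit circle x_1 = cos(t), x_2 = sin(t).  The maximizing pair
    # (x_1, x_2) is then a d = 1 LSRDR of (A_1, A_2).
    best = 0.0
    for t in np.linspace(0.0, 2.0 * np.pi, steps, endpoint=False):
        best = max(best, spectral_radius(np.cos(t) * A1 + np.sin(t) * A2))
    return best
```

For example, with $A_1=A_2=I$ the objective is $|x_1+x_2|$ on the unit circle, maximized at $x_1=x_2=1/\sqrt{2}$ with value $\sqrt{2}$.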
The notion of an LSRDR is a machine learning notion that seems to be much more interpretable and much less subject to noise than many other machine learning notions. But a solid mathematical theory behind LSRDRs would help us understand not just what LSRDRs do, but why they do it.
Problems in random matrix theory concerning LSRDRs:
Suppose that $U_1,\dots,U_r$ are random matrices (drawn according to some distribution). What are some bounds on $\rho_{2,d}^{K}(U_1,\dots,U_r)$?
Suppose that $U_1,\dots,U_r$ are random matrices and $A_1,\dots,A_r$ are non-random matrices. What can we say about the spectrum of $\Gamma(A_1,\dots,A_r;U_1,\dots,U_r)$? My computer experiments indicate that the eigenvalues obey a circular law, with the radius of the disc proportional to $\rho_2(A_1,\dots,A_r)$, but a proof of this circular law would be nice.
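The experiment described above can be reproduced in miniature: fix a tuple $A_1,A_2$, draw Gaussian $U_1,U_2$, and compute (or plot) the eigenvalues of the matrix representing $\Gamma(A_1,A_2;U_1,U_2)$. The dimensions, seed, and variance scaling below are arbitrary illustrative choices:

```python
import numpy as np

rng = np.random.default_rng(0)
n, r = 8, 2

# Fixed (non-random) tuple A_1, A_2 and i.i.d. Gaussian U_1, U_2.
As = [np.diag(np.linspace(0.5, 1.5, n)), np.eye(n)]
Us = [rng.normal(scale=1.0 / np.sqrt(n * r), size=(n, n)) for _ in range(r)]

# Matrix of Gamma(A_1, A_2; U_1, U_2) acting on vec(X), via
# vec(A X U*) = (conj(U) kron A) vec(X) with column-major vectorization.
M = sum(np.kron(U.conj(), A) for A, U in zip(As, Us))
eigs = np.linalg.eigvals(M)

# Conjecturally, these n^2 eigenvalues fill out a disc whose radius is
# proportional to rho_2(A_1, A_2); a scatter plot of eigs makes this visible.
print(len(eigs), float(np.abs(eigs).max()))
```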
Tensors can be naturally associated with collections of matrices. Suppose now that $U_1,\dots,U_r$ are the matrices associated with a random tensor. What are some bounds on $\rho_{2,d}^{K}(U_1,\dots,U_r)$?
P.S. By massively downvoting my posts where I talk about mathematics that is clearly applicable to AI interpretability and alignment, the people on this site are simply demonstrating that they need to do a lot of soul searching before they annoy people like Terry Tao with their lack of mathematical expertise.
P.P.S. Instead of trying to interest a high-profile mathematician like Terry Tao in these problems, it may be better to search for a mathematician who is an expert in a specific area related to AI alignment: a lower-profile mathematician may be easier to contact and may have more specific things to say and contribute. You are lucky that Terry Tao is interested in random matrix theory, but this does not mean that he is interested in everything in the intersection of alignment and random matrix theory. It may be better to search harder for mathematicians who are interested in your specific problems.
P.P.P.S. To get more mathematicians interested in alignment, it may be a good idea to develop AI systems that behave much more mathematically. Neural networks currently do not behave very mathematically, since they look like something an engineer would come up with rather than a mathematician.
P.P.P.P.S. I originally developed the notion of an LSRDR for cryptocurrency research, where I use it to evaluate the security of cryptographic functions.