component of why I’m not sure I agree with this: I claim stable diffusion has a utility function. does anyone disagree with this subclaim?
Do you mean the model’s policy as it works on a query, or its learning as it works on a dataset? Or something specific to stable diffusion? What is the sample space here, and what are the actions that decisions choose between?
score-based models, such as diffusion, work by modeling the derivative of the utility function (density function) over examples, I believe?
see, eg, https://lilianweng.github.io/posts/2021-07-11-diffusion-models/ or any of the other recommended posts at the top.
actions are denoising steps. sample space is output space, i.e. image space for stable diffusion.
You’re talking about the score function, right? Which is the derivative of the log probability density function. I dunno how to get from there to a utility function interpretation. Like, we don’t produce samples from the model by globally maximizing over the PDF (at worst, trying that might produce an adversarial example, and at best, that would sample the “most modal” image).
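A minimal numerical sketch of this point, using a toy 1-D Gaussian mixture rather than any actual diffusion model (the density, step size, and iteration count are all illustrative assumptions): following the score with noisy Langevin steps produces samples spread across the distribution, rather than climbing to a single "most modal" point the way globally maximizing the PDF would.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy density: mixture of two 1-D unit Gaussians centered at -2 and +2.
# (Stand-in for the data distribution a score-based model learns.)
def log_pdf(x):
    return np.log(0.5 * np.exp(-0.5 * (x - 2.0) ** 2)
                  + 0.5 * np.exp(-0.5 * (x + 2.0) ** 2))

# Score = d/dx log p(x), here computed by finite differences.
def score(x, eps=1e-4):
    return (log_pdf(x + eps) - log_pdf(x - eps)) / (2 * eps)

# Langevin dynamics: follow the score PLUS injected noise.
# This samples from p(x); it does not converge to the global mode.
x = rng.normal(size=1000) * 4.0  # broad initial distribution
step = 0.1
for _ in range(500):
    x = x + step * score(x) + np.sqrt(2 * step) * rng.normal(size=x.shape)

# Samples end up covering both modes (near -2 and +2) instead of
# collapsing onto one point of maximum density.
print((x > 0).mean())  # roughly half the samples land in each mode
```

Dropping the noise term turns this into plain gradient ascent on log p, which is the mode-seeking procedure the comment above is distinguishing from actual sampling.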
ah, okay. yup, you’re right, that’s what I was referring to. I am now convinced I was wrong in my original comment!
Lots of things “have a utility function” in the colloquial sense that they can be usefully modeled as having consistent preferences. But sure, I’ll be somewhat skeptical if you want to continue “taking the utility-function perspective on stable diffusion is in some way useful for thinking about its alignment properties.”
but diffusion specifically works by modeling the derivative of the utility function, yeah?
Ah, you’re talking about guidance? That makes sense, but you could also take the perspective that guidance isn’t really playing the role of a utility function, it’s just nudging around this big dynamical system by small amounts.
no, I’m talking about the basic diffusion model underneath. It models the derivative of the probability density function, which seems reasonable to call a utility function to me. see my other comment for link