Hmm. I got the meaning of the first section of the clip the first time I heard it. OTOH, that was probably because I looked at the URL first, and so I was primed to look at the content that way.
The first and last parts sounded exactly the same to me.
However, what “meaning” are you talking about? I got no meaning from the sound effects.
The recording is:
Squiggly noises
An English sentence
The same squiggly noises again
Before hearing the sentence, the squiggly noises just sound like squiggly noises. After hearing the sentence, the squiggly noises sound (to me and presumably most people) like a distorted version of the sentence. The only reason the squiggly noises are there twice is so you don’t have to replay the recording to hear the effect.
This blew me away the first time I heard it, and I already knew what pareidolia was.
This isn’t actually a case of pareidolia, as the squiggly noises (they call it “sine wave speech”) are in fact derived from the middle recording, using an effect that sounds, to me, most like an extremely low-bitrate MP3 encoding. Reading up on how they produce the effect, it is in fact a very similar process to MP3 encoding. (Perhaps inspired by it? I believe most general audio codecs work on very similar basic principles.)
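For the curious, here is a very rough sketch in Python of the kind of process involved. This is not the actual tool behind the linked demo, and it substitutes crude per-frame peak picking for the proper formant tracking that sine wave speech normally uses; the file names, number of tones, and frame size are placeholder values I made up. The idea is just to keep a handful of the strongest spectral components in each short frame of the recording and resynthesize them as pure sine tones, throwing the rest of the signal away.

```python
# Rough sine-wave-speech-style resynthesis (a sketch, not the real tool):
# keep the N strongest spectral peaks per short frame and replay them as
# pure sine tones. "sentence.wav" and the constants below are placeholders.
import numpy as np
from scipy.io import wavfile
from scipy.signal import stft

N_TONES = 4          # how many sine tones to keep per frame
FRAME = 512          # STFT frame length in samples

rate, audio = wavfile.read("sentence.wav")
audio = audio.astype(np.float64)
if audio.ndim > 1:
    audio = audio.mean(axis=1)          # mix to mono

freqs, times, spec = stft(audio, fs=rate, nperseg=FRAME)
mag = np.abs(spec)
peak = mag.max()
hop = FRAME // 2                        # scipy's default hop size

out = np.zeros(len(audio))
phase = np.zeros(N_TONES)               # running phase of each tone

for i in range(mag.shape[1]):
    start = i * hop
    seg_len = min(hop, len(out) - start)
    if seg_len <= 0:
        break
    # keep only the N strongest spectral bins in this frame
    bins = np.sort(np.argsort(mag[:, i])[-N_TONES:])
    for k, b in enumerate(bins):
        f = freqs[b]
        a = mag[b, i] / peak
        n = np.arange(seg_len)
        # accumulate the phase so each tone stays continuous across frames
        out[start:start + seg_len] += a * np.sin(phase[k] + 2 * np.pi * f * n / rate)
        phase[k] += 2 * np.pi * f * seg_len / rate

out /= np.max(np.abs(out)) + 1e-12      # normalize to avoid clipping
wavfile.write("sine_wave_speech.wav", rate, (out * 32767).astype(np.int16))
```

Even this crude version gives the characteristic whistly, “squiggly” quality; the real technique tracks the formant frequencies smoothly rather than re-picking peaks in every frame, which is why the published samples sound cleaner.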
So it’s the opposite of pareidolia. It’s actually meaningful sound, but it looks random at first. Maybe we should call it ailodierap.
True; I suppose it’s a demonstration of the thing that makes pareidolia possible—the should-be-obvious-but-isn’t fact that pattern recognition takes place in the mind.
I wish it were two recordings, so you could listen to the squiggly noises more than once before hearing the sentence.
I ran into a set of these once before, and while it didn’t let me listen to any one noise more than once before hearing the related speech, after about 4 or 5 noise+speech+noise sets I started being able to recognize the words in the noise the first time through. So it does seem to be learnable, if that’s what you were curious about.
I’m curious how much of the change is because you’ve heard the sentence in “plaintext”, and how much because you’re hearing the squiggly version a second time.
You didn’t hear the second part as a repeat of the speech? Are you not a native English speaker?
No, I didn’t. I am a native English speaker from the American Midwest. I listened to it multiple times without hearing any speech in either of the sound effects.
After reading your comment, I listened to the audio again and now both audio samples do sound like a repeat of the speech. At no point did the audio samples sound different from one another, though.
The woman does have an English rather than American accent. I’m from England originally, and the effect was quite dramatic the first time I listened to it: meaningless noise, then speech, then completely intelligible speech (the repeat of the original meaningless noise). The second time I listened, some time later (after reading your comment), I could understand the speech in the first sound, but it was clearer in the second. Listening again shortly afterwards, the first and last sounds both came across as speech and sounded much the same as each other. I wonder whether the accent is a factor?
That’s very interesting. Can you try some of the other samples from Matt Davis’ page and report on your experiences?
When I listened to some of those the first time I was, as luck would have it, in a slightly noisy environment, so that I couldn’t quite catch some bits of the English text the first time around; the corresponding parts of the “sine wave speech” remained obscure for me until I had listened again to the clear text.
So for me the effect seems to be stronger rather than weaker as a result of the speaker’s accent plus English being a second language. I’m really puzzled as to why the effect might be weaker for you. Any ideas? Are you cognitively atypical in any way?
One reason I wished it had been two samples rather than one is that I thought I heard speech in the noise the first time, and wanted to listen again to see if I could figure it out without being primed.
This is the question I tried to answer elsewhere: after training on 4 or 5 samples I was able to hear the words in the remainder of the coded sentences the first time I heard them, without being primed by the decoded version of those sentences.
reads in more detail indeed—thanks!
How about the other vocoded samples?
Thanks for the report anyway, that’s interesting to know.
For people wanting different recordings of the garbled/non-garbled pairs: they’re on the page just above the one Morendil linked to.
On the next sample, I only caught the last few words on the first play (of the garbled version only), and after five plays still got a word wrong. On the third, I only got two words the first time, and additional replays made no difference. On the fourth, I got half after one play, and most after two. On the fifth, I got the entire thing on the first play. (I’m not feeling as clear-headed today as I was the other day, but it didn’t feel like a learning effect.) On some of them, I don’t believe that even with a lot of practice I could ever get it all right, since some garbled words sound more like other plausible words than like the originals.
Thinking about it more, it’s a bit surprising that I did well. I generally have trouble making out speech in situations where other people don’t have quite as much trouble. I’ll often turn on subtitles in movies, even in my first language/dialect (American English). (In fact, I hate movies where the speech is occasionally muffled and there are no subtitles—two things that tend to go hand in hand with smaller production budgets.) OTOH, I have a good ear in general. I’ve had a lot of musical training, and I’ve worked with sound editing quite a bit.