Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Darkly)
  • No Skin
Collapse
Brand Logo
  1. Home
  2. Uncategorized
  3. Tried to extract my own glottal pulse to make the synth sound more human.

Tried to extract my own glottal pulse to make the synth sound more human.

Scheduled Pinned Locked Moved Uncategorized
38 Posts 8 Posters 39 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • T Tamas G

    @x0 @BorrisInABox Lol! The wild thing is... it would technically work? The formant filters don't care what you feed them. They just shape whatever harmonic-rich input they get. So we could get: WASHING MACHINE DEMON → formant filters → "h̷̰͝e̵̢͠l̷̨͘l̷͚̚o̵̱͝" The formants would still try to impose vowel shapes on the chaos. It would be cursed. 😄

    BorrisB This user is from outside of this forum
    BorrisB This user is from outside of this forum
    Borris
    wrote last edited by
    #12

    @Tamasg @x0 Oh boy. Now I want to try shaping some bad noises I made with a big metal fan about twenty years ago with formant filters. I could probably make a few single-cycle wave forms exist.

    x0X 1 Reply Last reply
    0
    • T Tamas G

      @x0 @BorrisInABox could give it a try for you and give you a speechplayer.dll for it. Would be nuts. Would sound broken. Never for production. But I have a version of the speechplayer with glottal table support, so I just use librosa to extract what I need and hand you a speechplayer.dll lol

      x0X This user is from outside of this forum
      x0X This user is from outside of this forum
      x0
      wrote last edited by
      #13

      @Tamasg @BorrisInABox Or it's like what softvoice did, all the different sources. Wait a minute! Is that the problem you're having with a female source? Do you need a real female glottal pulse to start from?

      T 1 Reply Last reply
      0
      • BorrisB Borris

        @Tamasg @x0 Oh boy. Now I want to try shaping some bad noises I made with a big metal fan about twenty years ago with formant filters. I could probably make a few single-cycle wave forms exist.

        x0X This user is from outside of this forum
        x0X This user is from outside of this forum
        x0
        wrote last edited by
        #14

        @BorrisInABox @Tamasg I got A.Liv on the Surge discord to kindly work with my source material and the results are now called Exocat's Metalodon in the 3rd-party wavetables folder of Surge's factory data.

        x0X 1 Reply Last reply
        0
        • T Tamas G

          Tried to extract my own glottal pulse to make the synth sound more human. Learned my voice is too gentle for radio. Sadness fills my soul. That's probably why I didn't stick with radio shows.
          I recorded sustained vowels and used IAIF (Iterative Adaptive Inverse Filtering) to extract my glottal waveform - the raw "buzz" before your throat shapes it into vowels.
          What I expected: Rich, characterful human excitation to replace the mathematical model.
          What I got: A softer, breathier sound than pure math! 😅
          The mathematical LF model with sharpness cranked to 10 actually produces MORE harmonics than my actual voice does. That "chest resonant radio announcer" sound? That's aggressive glottal snap that not everyone has.

          Andre LouisF This user is from outside of this forum
          Andre LouisF This user is from outside of this forum
          Andre Louis
          wrote last edited by
          #15

          @Tamasg I wonder how I would sound. Interesting.

          1 Reply Last reply
          0
          • T Tamas G

            Tried to extract my own glottal pulse to make the synth sound more human. Learned my voice is too gentle for radio. Sadness fills my soul. That's probably why I didn't stick with radio shows.
            I recorded sustained vowels and used IAIF (Iterative Adaptive Inverse Filtering) to extract my glottal waveform - the raw "buzz" before your throat shapes it into vowels.
            What I expected: Rich, characterful human excitation to replace the mathematical model.
            What I got: A softer, breathier sound than pure math! 😅
            The mathematical LF model with sharpness cranked to 10 actually produces MORE harmonics than my actual voice does. That "chest resonant radio announcer" sound? That's aggressive glottal snap that not everyone has.

            M This user is from outside of this forum
            M This user is from outside of this forum
            Martin
            wrote last edited by
            #16

            @Tamasg Ooo, audio please.

            T 1 Reply Last reply
            0
            • x0X x0

              @Tamasg @BorrisInABox Or it's like what softvoice did, all the different sources. Wait a minute! Is that the problem you're having with a female source? Do you need a real female glottal pulse to start from?

              T This user is from outside of this forum
              T This user is from outside of this forum
              Tamas G
              wrote last edited by
              #17

              @x0 @BorrisInABox sadly female I realized would require Formant frequency tuning for the phonemes. Right now if we just put a female glottal shape over that, at best it would just sound aliased on top of the deeper, male-characteristic voice. theoretically... a female voice with a sharper glottal closure would actually give us MORE harmonics to work with, not fewer! Would be genuinely interesting to compare though - extract a female glottal pulse and see if the shape is meaningfully different.

              1 Reply Last reply
              0
              • M Martin

                @Tamasg Ooo, audio please.

                T This user is from outside of this forum
                T This user is from outside of this forum
                Tamas G
                wrote last edited by
                #18

                @mcourcel lol sounds horrible!

                M J 2 Replies Last reply
                0
                • T Tamas G

                  @mcourcel lol sounds horrible!

                  M This user is from outside of this forum
                  M This user is from outside of this forum
                  Martin
                  wrote last edited by
                  #19

                  @Tamasg Hehehehehe lolol, sort of like E-Speak.

                  T 1 Reply Last reply
                  0
                  • T Tamas G

                    @mcourcel lol sounds horrible!

                    J This user is from outside of this forum
                    J This user is from outside of this forum
                    Joshua
                    wrote last edited by
                    #20

                    @Tamasg @mcourcel oh god that sounds so bad

                    Alex ChapmanA 1 Reply Last reply
                    0
                    • M Martin

                      @Tamasg Hehehehehe lolol, sort of like E-Speak.

                      T This user is from outside of this forum
                      T This user is from outside of this forum
                      Tamas G
                      wrote last edited by
                      #21

                      @mcourcel yep. This gave me some real good insight into what Espeak did to fuck up SpeechPlayer, mainly changing its glottal source a lot. Hahahaha good lesson-learning!

                      1 Reply Last reply
                      0
                      • x0X x0

                        @BorrisInABox @Tamasg I got A.Liv on the Surge discord to kindly work with my source material and the results are now called Exocat's Metalodon in the 3rd-party wavetables folder of Surge's factory data.

                        x0X This user is from outside of this forum
                        x0X This user is from outside of this forum
                        x0
                        wrote last edited by
                        #22

                        @BorrisInABox @Tamasg This is the raw source material, which I later trimmed and did some noise reduction on, and then A.Liv carefully turned it into something that was a consistent period to be turned into wavetables, I think at 2048 samples per frame.

                        x0X M S 3 Replies Last reply
                        0
                        • x0X x0

                          @BorrisInABox @Tamasg This is the raw source material, which I later trimmed and did some noise reduction on, and then A.Liv carefully turned it into something that was a consistent period to be turned into wavetables, I think at 2048 samples per frame.

                          x0X This user is from outside of this forum
                          x0X This user is from outside of this forum
                          x0
                          wrote last edited by
                          #23

                          @BorrisInABox @Tamasg SO if you actually wanted to do that for whatever reason, I can send you the wavetables which are already fixed length single-cycle waveforms, unless you already have surge.

                          T 1 Reply Last reply
                          0
                          • J Joshua

                            @Tamasg @mcourcel oh god that sounds so bad

                            Alex ChapmanA This user is from outside of this forum
                            Alex ChapmanA This user is from outside of this forum
                            Alex Chapman
                            wrote last edited by
                            #24

                            @J3317 @Tamasg @mcourcel Lmfao that should be an extra voice added for the lols

                            1 Reply Last reply
                            0
                            • x0X x0

                              @BorrisInABox @Tamasg This is the raw source material, which I later trimmed and did some noise reduction on, and then A.Liv carefully turned it into something that was a consistent period to be turned into wavetables, I think at 2048 samples per frame.

                              M This user is from outside of this forum
                              M This user is from outside of this forum
                              Martin
                              wrote last edited by
                              #25

                              @x0 @BorrisInABox @Tamasg Hehehe lololol! The spin sounds cool. Like a light saber.

                              1 Reply Last reply
                              0
                              • x0X x0

                                @BorrisInABox @Tamasg SO if you actually wanted to do that for whatever reason, I can send you the wavetables which are already fixed length single-cycle waveforms, unless you already have surge.

                                T This user is from outside of this forum
                                T This user is from outside of this forum
                                Tamas G
                                wrote last edited by
                                #26

                                @x0 @BorrisInABox lol! Can't even explain what it did, but it definitely introduces a metallic quality unlike any I've heard in speech synthesis before. Not even as tube-like as when I tried mine was, but boy is it bad. That grindyness really shows through.

                                x0X 1 Reply Last reply
                                0
                                • T Tamas G

                                  @x0 @BorrisInABox lol! Can't even explain what it did, but it definitely introduces a metallic quality unlike any I've heard in speech synthesis before. Not even as tube-like as when I tried mine was, but boy is it bad. That grindyness really shows through.

                                  x0X This user is from outside of this forum
                                  x0X This user is from outside of this forum
                                  x0
                                  wrote last edited by
                                  #27

                                  @Tamasg @BorrisInABox lmfaoooooo what, that's like the odd source of softvoice

                                  T 1 Reply Last reply
                                  0
                                  • x0X x0

                                    @Tamasg @BorrisInABox lmfaoooooo what, that's like the odd source of softvoice

                                    T This user is from outside of this forum
                                    T This user is from outside of this forum
                                    Tamas G
                                    wrote last edited by
                                    #28

                                    @x0 @BorrisInABox lol this thing is a trip to use. It's, just... So gritty, so metallic, nothin' quite like it. So I'm keeping it at https://eurpod.com/synths/speechPlayer-brokenmachine.dll - though clear proof that with the right matching glottal source it can sound less tubey and more natural, just gotta find the right radio announcer-type glottal source 😄

                                    1 Reply Last reply
                                    0
                                    • T Tamas G

                                      @BorrisInABox Oh cool! For the extraction I recorded 5 sounds:
                                      "ahh" sustained at normal pitch (~5 sec)
                                      2. "ahh" sustained at low pitch (~5 sec)
                                      3. "ahh" sustained at high pitch (~5 sec)
                                      4. "shhh" sustained fricative (~5 sec)
                                      5. "th" sustained unvoiced (~3 sec)
                                      The "ahh" vowels are for glottal pulse extraction at different F0s. The "sh" and "th" are for noise/frication characteristics.
                                      Recording tips:
                                      • Condenser or dynamic mic (I used a Blue Snowball, AT2005 was too noisy)
                                      • Peaks around -5 to -8 dB (NOT quiet - my first attempt at -30 dB was useless)
                                      • Steady volume, no vibrato
                                      • Quiet room
                                      • 44100 Hz, mono
                                      The key is getting a clean, loud, boring sustained vowel - no expression, just pure steady tone. The more monotone the better for extraction!

                                      T This user is from outside of this forum
                                      T This user is from outside of this forum
                                      Tamas G
                                      wrote last edited by
                                      #29

                                      @BorrisInABox Small add-on for the voice recording set: raw audio only, please — no noise suppression, auto gain, compressor/limiter, or EQ. The boring part matters here: keep the vowel steady with no vibrato, because I’m aligning and averaging glottal cycles and pitch wobble makes the final source less crisp. If you can, include ~10 seconds of room tone (silence) in a file, so I can calibrate noise and hum. And when you record “th”, make it the “think” version (/θ/). Optional but very helpful: a sustained “zzzz” (/z/) and “vvvv” (/v/) so I can capture voicing + turbulence together for better “edge” control later. Hope this helps too. LOL if this works out your voice would be forever partially captured into a synth. LOL.

                                      M BorrisB 2 Replies Last reply
                                      0
                                      • T Tamas G

                                        @BorrisInABox Small add-on for the voice recording set: raw audio only, please — no noise suppression, auto gain, compressor/limiter, or EQ. The boring part matters here: keep the vowel steady with no vibrato, because I’m aligning and averaging glottal cycles and pitch wobble makes the final source less crisp. If you can, include ~10 seconds of room tone (silence) in a file, so I can calibrate noise and hum. And when you record “th”, make it the “think” version (/θ/). Optional but very helpful: a sustained “zzzz” (/z/) and “vvvv” (/v/) so I can capture voicing + turbulence together for better “edge” control later. Hope this helps too. LOL if this works out your voice would be forever partially captured into a synth. LOL.

                                        M This user is from outside of this forum
                                        M This user is from outside of this forum
                                        Martin
                                        wrote last edited by
                                        #30

                                        @Tamasg @BorrisInABox Ooo, a Boris voice synth coming soon!

                                        BorrisB 1 Reply Last reply
                                        0
                                        • M Martin

                                          @Tamasg @BorrisInABox Ooo, a Boris voice synth coming soon!

                                          BorrisB This user is from outside of this forum
                                          BorrisB This user is from outside of this forum
                                          Borris
                                          wrote last edited by
                                          #31

                                          @mcourcel @Tamasg It's fake news.

                                          1 Reply Last reply
                                          0
                                          Reply
                                          • Reply as topic
                                          Log in to reply
                                          • Oldest to Newest
                                          • Newest to Oldest
                                          • Most Votes


                                          • Login

                                          • Don't have an account? Register

                                          • Login or register to search.
                                          Powered by NodeBB Contributors
                                          • First post
                                            Last post
                                          0
                                          • Categories
                                          • Recent
                                          • Tags
                                          • Popular
                                          • World
                                          • Users
                                          • Groups