Tried to extract my own glottal pulse to make the synth sound more human.

  • Tamas G

    Tried to extract my own glottal pulse to make the synth sound more human. Learned my voice is too gentle for radio. Sadness fills my soul. That's probably why I didn't stick with radio shows.
    I recorded sustained vowels and used IAIF (Iterative Adaptive Inverse Filtering) to extract my glottal waveform - the raw "buzz" before your throat shapes it into vowels.
    What I expected: Rich, characterful human excitation to replace the mathematical model.
    What I got: A softer, breathier sound than pure math! 😅
    The mathematical LF model with sharpness cranked to 10 actually produces MORE harmonics than my actual voice does. That "chest resonant radio announcer" sound? That's aggressive glottal snap that not everyone has.
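A rough sketch of why a sharper closure means more harmonics. This is not the LF model and not the synth's actual code: it uses a simple Rosenberg-style flow pulse as a stand-in, and `glottal_pulse`, `close_frac` and `high_harmonic_ratio` are made-up names for illustration. The only thing it tries to show is that shortening the closing phase pushes more energy into the upper harmonics.

```python
import math

def glottal_pulse(n, open_frac=0.6, close_frac=0.2):
    """One period of a Rosenberg-style glottal flow pulse (stand-in, not LF).

    open_frac: fraction of the period spent opening.
    close_frac: fraction spent closing; smaller means a sharper closure.
    """
    n_open = int(n * open_frac)
    n_close = max(1, int(n * close_frac))
    pulse = []
    for i in range(n):
        if i < n_open:
            # Smooth raised-cosine rise from 0 to 1.
            pulse.append(0.5 * (1 - math.cos(math.pi * i / n_open)))
        elif i < n_open + n_close:
            # Quarter-cosine fall from 1 to 0; ends with an abrupt slope change.
            pulse.append(math.cos(0.5 * math.pi * (i - n_open) / n_close))
        else:
            # Closed phase.
            pulse.append(0.0)
    return pulse

def harmonic_magnitudes(period, count):
    """Magnitudes of harmonics 1..count of a periodic signal (direct DFT)."""
    n = len(period)
    mags = []
    for k in range(1, count + 1):
        re = sum(x * math.cos(2 * math.pi * k * i / n) for i, x in enumerate(period))
        im = sum(x * math.sin(2 * math.pi * k * i / n) for i, x in enumerate(period))
        mags.append(math.hypot(re, im) / n)
    return mags

def high_harmonic_ratio(period, split=10, count=40):
    """Share of harmonic energy that sits above harmonic number `split`."""
    mags = harmonic_magnitudes(period, count)
    total = sum(m * m for m in mags)
    high = sum(m * m for m in mags[split:])
    return high / total

soft = glottal_pulse(512, close_frac=0.30)   # gentle, breathy-style closure
sharp = glottal_pulse(512, close_frac=0.05)  # aggressive "snap"
print("soft :", high_harmonic_ratio(soft))
print("sharp:", high_harmonic_ratio(sharp))
```

The sharp pulse should report a noticeably larger share of energy above the 10th harmonic than the soft one, which fits the observation that a gentle real voice can end up duller than the math model with sharpness cranked up.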

    Martin wrote (#16):

    @Tamasg Ooo, audio please.

    • x0

      @Tamasg @BorrisInABox Or it's like what softvoice did, all the different sources. Wait a minute! Is that the problem you're having with a female source? Do you need a real female glottal pulse to start from?

      Tamas G wrote (#17):

      @x0 @BorrisInABox Sadly, I realized a female voice would require formant frequency tuning for the phonemes. Right now, if we just put a female glottal shape over that, at best it would just sound aliased on top of the deeper, male-characteristic voice. Theoretically, a female voice with a sharper glottal closure would actually give us MORE harmonics to work with, not fewer! Would be genuinely interesting to compare though: extract a female glottal pulse and see if the shape is meaningfully different.

        Tamas G wrote (#18):

        @mcourcel lol sounds horrible!

          Martin wrote (#19):

          @Tamasg Hehehehehe lolol, sort of like E-Speak.

            Joshua wrote (#20):

            @Tamasg @mcourcel oh god that sounds so bad

              Tamas G wrote (#21):

              @mcourcel yep. This gave me some real good insight into what Espeak did to fuck up SpeechPlayer, mainly changing its glottal source a lot. Hahahaha good lesson-learning!

              • x0

                @BorrisInABox @Tamasg I got A.Liv on the Surge discord to kindly work with my source material and the results are now called Exocat's Metalodon in the 3rd-party wavetables folder of Surge's factory data.

                x0 wrote (#22):

                @BorrisInABox @Tamasg This is the raw source material, which I later trimmed and did some noise reduction on. A.Liv then carefully turned it into something with a consistent period so it could be made into wavetables, I think at 2048 samples per frame.
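
For anyone wondering what "a consistent period" means in practice, here is a minimal sketch of stretching one extracted cycle to a fixed frame length. It is not A.Liv's actual process (that isn't described in the thread); the 2048 figure is just the recalled frame size from the post, and `resample_cycle` is a hypothetical helper that uses plain linear interpolation.

```python
def resample_cycle(cycle, frame_len=2048):
    """Stretch or shrink one waveform cycle to frame_len samples.

    The cycle is treated as periodic, so the last sample interpolates
    toward the first one instead of running off the end.
    """
    n = len(cycle)
    out = []
    for j in range(frame_len):
        pos = j * n / frame_len      # fractional read position in the source
        i = int(pos)
        frac = pos - i
        a = cycle[i % n]
        b = cycle[(i + 1) % n]       # wraps around at the cycle boundary
        out.append(a + frac * (b - a))
    return out

# A tiny 4-sample "cycle" stretched to one 2048-sample wavetable frame.
frame = resample_cycle([0.0, 1.0, 0.0, -1.0])
print(len(frame), frame[0], frame[256], frame[512])
```

Linear interpolation is the simplest choice here; a real wavetable workflow would more likely use band-limited resampling to avoid dulling or aliasing the result.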

                  x0 wrote (#23):

                  @BorrisInABox @Tamasg So if you actually wanted to do that for whatever reason, I can send you the wavetables, which are already fixed-length single-cycle waveforms, unless you already have Surge.

                    Alex Chapman wrote (#24):

                    @J3317 @Tamasg @mcourcel Lmfao that should be an extra voice added for the lols

                      Martin wrote (#25):

                      @x0 @BorrisInABox @Tamasg Hehehe lololol! The spin sounds cool. Like a light saber.

                        Tamas G wrote (#26):

                        @x0 @BorrisInABox lol! Can't even explain what it did, but it definitely introduces a metallic quality unlike any I've heard in speech synthesis before. Not even as tube-like as mine was when I tried it, but boy is it bad. That grindiness really shows through.

                          x0 wrote (#27):

                          @Tamasg @BorrisInABox lmfaoooooo what, that's like the odd source of softvoice

                            Tamas G wrote (#28):

                            @x0 @BorrisInABox lol this thing is a trip to use. It's just... so gritty, so metallic, nothin' quite like it. So I'm keeping it at https://eurpod.com/synths/speechPlayer-brokenmachine.dll. It's clear proof, though, that with the right matching glottal source it can sound less tubey and more natural; just gotta find the right radio announcer-type glottal source 😄

                            • Tamas G

                              @BorrisInABox Oh cool! For the extraction I recorded 5 sounds:
                              1. "ahh" sustained at normal pitch (~5 sec)
                              2. "ahh" sustained at low pitch (~5 sec)
                              3. "ahh" sustained at high pitch (~5 sec)
                              4. "shhh" sustained fricative (~5 sec)
                              5. "th" sustained unvoiced (~3 sec)
                              The "ahh" vowels are for glottal pulse extraction at different F0s. The "sh" and "th" are for noise/frication characteristics.
                              Recording tips:
                              • Condenser or dynamic mic (I used a Blue Snowball, AT2005 was too noisy)
                              • Peaks around -5 to -8 dB (NOT quiet - my first attempt at -30 dB was useless)
                              • Steady volume, no vibrato
                              • Quiet room
                              • 44100 Hz, mono
                              The key is getting a clean, loud, boring sustained vowel - no expression, just pure steady tone. The more monotone the better for extraction!
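
A quick sanity check for the level tip above, as a sketch only. It assumes the "-5 to -8 dB" figures mean peak level relative to full scale (dBFS) and that samples are floats in [-1.0, 1.0]; `peak_dbfs` and `level_ok` are hypothetical helper names, not part of any tool mentioned in this thread.

```python
import math

def peak_dbfs(samples):
    """Peak level in dB relative to full scale, for floats in [-1.0, 1.0]."""
    peak = max(abs(s) for s in samples)
    if peak == 0:
        return float("-inf")  # pure digital silence
    return 20 * math.log10(peak)

def level_ok(samples, low=-8.0, high=-5.0):
    """True if the peak sits in the suggested -8 to -5 dB window."""
    return low <= peak_dbfs(samples) <= high

# One second of a steady 220 Hz tone at 44100 Hz, at two different gains.
rate = 44100
loud = [0.5 * math.sin(2 * math.pi * 220 * i / rate) for i in range(rate)]
quiet = [s * 0.06 for s in loud]

print(round(peak_dbfs(loud), 1), level_ok(loud))
print(round(peak_dbfs(quiet), 1), level_ok(quiet))
```

The quiet take lands around -30 dB, which is the kind of recording described above as useless for extraction.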

                              Tamas G wrote (#29):

                              @BorrisInABox Small add-on for the voice recording set: raw audio only, please — no noise suppression, auto gain, compressor/limiter, or EQ. The boring part matters here: keep the vowel steady with no vibrato, because I’m aligning and averaging glottal cycles and pitch wobble makes the final source less crisp. If you can, include ~10 seconds of room tone (silence) in a file, so I can calibrate noise and hum. And when you record “th”, make it the “think” version (/θ/). Optional but very helpful: a sustained “zzzz” (/z/) and “vvvv” (/v/) so I can capture voicing + turbulence together for better “edge” control later. Hope this helps too. LOL if this works out your voice would be forever partially captured into a synth. LOL.
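
To illustrate why pitch wobble hurts the averaging step, here is a toy sketch. It assumes the cycles have already been cut out at matching points (the actual segmentation is not shown and is the hard part), and it uses sine cycles of slightly different lengths as a stand-in for real glottal cycles. `stretch_to_length`, `average_cycles` and `naive_average` are illustrative names, not the real extraction code.

```python
import math

def stretch_to_length(cycle, length):
    """Linearly resample one periodic cycle to a common length."""
    n = len(cycle)
    out = []
    for j in range(length):
        pos = j * n / length
        i = int(pos)
        frac = pos - i
        a, b = cycle[i % n], cycle[(i + 1) % n]
        out.append(a + frac * (b - a))
    return out

def average_cycles(cycles, length=256):
    """Length-normalize every cycle, then average sample by sample."""
    stretched = [stretch_to_length(c, length) for c in cycles]
    return [sum(col) / len(col) for col in zip(*stretched)]

def naive_average(cycles):
    """Average without length normalization, truncating to the shortest cycle."""
    shortest = min(len(c) for c in cycles)
    return [sum(c[i] for c in cycles) / len(cycles) for i in range(shortest)]

def sine_cycle(n):
    return [math.sin(2 * math.pi * i / n) for i in range(n)]

def max_error(a, b):
    return max(abs(x - y) for x, y in zip(a, b))

# Three "cycles" whose period drifts, as it would with vibrato.
cycles = [sine_cycle(n) for n in (190, 200, 210)]

aligned_err = max_error(average_cycles(cycles), sine_cycle(256))
naive_err = max_error(naive_average(cycles), sine_cycle(190))
print("aligned error:", aligned_err)
print("naive error  :", naive_err)
```

With length normalization the average stays very close to the clean shape, while the naive average smears toward the end of the cycle. The steadier the recorded vowel, the less correction of this kind is needed in the first place.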

                                Martin wrote (#30):

                                @Tamasg @BorrisInABox Ooo, a Boris voice synth coming soon!

                                  Borris wrote (#31):

                                  @mcourcel @Tamasg It's fake news.

                                    Scott wrote (#32):

                                    @x0 LMAO you've got a dubstep washer! @BorrisInABox @Tamasg

                                      x0 wrote (#33):

                                      @Scott @BorrisInABox @Tamasg Yup, as soon as I heard that I thought of some Skrillex shit and had to get it put into a synth. It was recorded in 2019, and in 2022 it finally happened. This is the demo that A.Liv made with it, everything except the supersaw and drums are the resulting tables.

                                        Tamas G wrote (#34):

                                        @x0 @Scott @BorrisInABox ah no way that's really cool! You can totally hear the samples in there 😄

                                          x0 wrote (#35):

                                          @Tamasg @Scott @BorrisInABox Now feed that into a vocoder and have this gigantic radio voice going "search and destroy" and then a killer dubstep drop, it would be perfect.
