Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Darkly)
  • No Skin
Collapse
Brand Logo
  1. Home
  2. Uncategorized
  3. Tried to extract my own glottal pulse to make the synth sound more human.

Tried to extract my own glottal pulse to make the synth sound more human.

Scheduled Pinned Locked Moved Uncategorized
38 Posts 8 Posters 39 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • T Tamas G

    @mcourcel lol sounds horrible!

    J This user is from outside of this forum
    J This user is from outside of this forum
    Joshua
    wrote last edited by
    #20

    @Tamasg @mcourcel oh god that sounds so bad

    Alex ChapmanA 1 Reply Last reply
    0
    • M Martin

      @Tamasg Hehehehehe lolol, sort of like E-Speak.

      T This user is from outside of this forum
      T This user is from outside of this forum
      Tamas G
      wrote last edited by
      #21

      @mcourcel yep. This gave me some real good insight into what Espeak did to fuck up SpeechPlayer, mainly changing its glottal source a lot. Hahahaha good lesson-learning!

      1 Reply Last reply
      0
      • x0X x0

        @BorrisInABox @Tamasg I got A.Liv on the Surge discord to kindly work with my source material and the results are now called Exocat's Metalodon in the 3rd-party wavetables folder of Surge's factory data.

        x0X This user is from outside of this forum
        x0X This user is from outside of this forum
        x0
        wrote last edited by
        #22

        @BorrisInABox @Tamasg This is the raw source material, which I later trimmed and did some noise reduction on, and then A.Liv carefully turned it into something that was a consistent period to be turned into wavetables, I think at 2048 samples per frame.

        x0X M S 3 Replies Last reply
        0
        • x0X x0

          @BorrisInABox @Tamasg This is the raw source material, which I later trimmed and did some noise reduction on, and then A.Liv carefully turned it into something that was a consistent period to be turned into wavetables, I think at 2048 samples per frame.

          x0X This user is from outside of this forum
          x0X This user is from outside of this forum
          x0
          wrote last edited by
          #23

          @BorrisInABox @Tamasg SO if you actually wanted to do that for whatever reason, I can send you the wavetables which are already fixed length single-cycle waveforms, unless you already have surge.

          T 1 Reply Last reply
          0
          • J Joshua

            @Tamasg @mcourcel oh god that sounds so bad

            Alex ChapmanA This user is from outside of this forum
            Alex ChapmanA This user is from outside of this forum
            Alex Chapman
            wrote last edited by
            #24

            @J3317 @Tamasg @mcourcel Lmfao that should be an extra voice added for the lols

            1 Reply Last reply
            0
            • x0X x0

              @BorrisInABox @Tamasg This is the raw source material, which I later trimmed and did some noise reduction on, and then A.Liv carefully turned it into something that was a consistent period to be turned into wavetables, I think at 2048 samples per frame.

              M This user is from outside of this forum
              M This user is from outside of this forum
              Martin
              wrote last edited by
              #25

              @x0 @BorrisInABox @Tamasg Hehehe lololol! The spin sounds cool. Like a light saber.

              1 Reply Last reply
              0
              • x0X x0

                @BorrisInABox @Tamasg SO if you actually wanted to do that for whatever reason, I can send you the wavetables which are already fixed length single-cycle waveforms, unless you already have surge.

                T This user is from outside of this forum
                T This user is from outside of this forum
                Tamas G
                wrote last edited by
                #26

                @x0 @BorrisInABox lol! Can't even explain what it did, but it definitely introduces a metallic quality unlike any I've heard in speech synthesis before. Not even as tube-like as when I tried mine was, but boy is it bad. That grindyness really shows through.

                x0X 1 Reply Last reply
                0
                • T Tamas G

                  @x0 @BorrisInABox lol! Can't even explain what it did, but it definitely introduces a metallic quality unlike any I've heard in speech synthesis before. Not even as tube-like as when I tried mine was, but boy is it bad. That grindyness really shows through.

                  x0X This user is from outside of this forum
                  x0X This user is from outside of this forum
                  x0
                  wrote last edited by
                  #27

                  @Tamasg @BorrisInABox lmfaoooooo what, that's like the odd source of softvoice

                  T 1 Reply Last reply
                  0
                  • x0X x0

                    @Tamasg @BorrisInABox lmfaoooooo what, that's like the odd source of softvoice

                    T This user is from outside of this forum
                    T This user is from outside of this forum
                    Tamas G
                    wrote last edited by
                    #28

                    @x0 @BorrisInABox lol this thing is a trip to use. It's, just... So gritty, so metallic, nothin' quite like it. So I'm keeping it at https://eurpod.com/synths/speechPlayer-brokenmachine.dll - though clear proof that with the right matching glottal source it can sound less tubey and more natural, just gotta find the right radio announcer-type glottal source 😄

                    1 Reply Last reply
                    0
                    • T Tamas G

                      @BorrisInABox Oh cool! For the extraction I recorded 5 sounds:
                      "ahh" sustained at normal pitch (~5 sec)
                      2. "ahh" sustained at low pitch (~5 sec)
                      3. "ahh" sustained at high pitch (~5 sec)
                      4. "shhh" sustained fricative (~5 sec)
                      5. "th" sustained unvoiced (~3 sec)
                      The "ahh" vowels are for glottal pulse extraction at different F0s. The "sh" and "th" are for noise/frication characteristics.
                      Recording tips:
                      • Condenser or dynamic mic (I used a Blue Snowball, AT2005 was too noisy)
                      • Peaks around -5 to -8 dB (NOT quiet - my first attempt at -30 dB was useless)
                      • Steady volume, no vibrato
                      • Quiet room
                      • 44100 Hz, mono
                      The key is getting a clean, loud, boring sustained vowel - no expression, just pure steady tone. The more monotone the better for extraction!

                      T This user is from outside of this forum
                      T This user is from outside of this forum
                      Tamas G
                      wrote last edited by
                      #29

                      @BorrisInABox Small add-on for the voice recording set: raw audio only, please — no noise suppression, auto gain, compressor/limiter, or EQ. The boring part matters here: keep the vowel steady with no vibrato, because I’m aligning and averaging glottal cycles and pitch wobble makes the final source less crisp. If you can, include ~10 seconds of room tone (silence) in a file, so I can calibrate noise and hum. And when you record “th”, make it the “think” version (/θ/). Optional but very helpful: a sustained “zzzz” (/z/) and “vvvv” (/v/) so I can capture voicing + turbulence together for better “edge” control later. Hope this helps too. LOL if this works out your voice would be forever partially captured into a synth. LOL.

                      M BorrisB 2 Replies Last reply
                      0
                      • T Tamas G

                        @BorrisInABox Small add-on for the voice recording set: raw audio only, please — no noise suppression, auto gain, compressor/limiter, or EQ. The boring part matters here: keep the vowel steady with no vibrato, because I’m aligning and averaging glottal cycles and pitch wobble makes the final source less crisp. If you can, include ~10 seconds of room tone (silence) in a file, so I can calibrate noise and hum. And when you record “th”, make it the “think” version (/θ/). Optional but very helpful: a sustained “zzzz” (/z/) and “vvvv” (/v/) so I can capture voicing + turbulence together for better “edge” control later. Hope this helps too. LOL if this works out your voice would be forever partially captured into a synth. LOL.

                        M This user is from outside of this forum
                        M This user is from outside of this forum
                        Martin
                        wrote last edited by
                        #30

                        @Tamasg @BorrisInABox Ooo, a Boris voice synth coming soon!

                        BorrisB 1 Reply Last reply
                        0
                        • M Martin

                          @Tamasg @BorrisInABox Ooo, a Boris voice synth coming soon!

                          BorrisB This user is from outside of this forum
                          BorrisB This user is from outside of this forum
                          Borris
                          wrote last edited by
                          #31

                          @mcourcel @Tamasg It's fake news.

                          1 Reply Last reply
                          0
                          • x0X x0

                            @BorrisInABox @Tamasg This is the raw source material, which I later trimmed and did some noise reduction on, and then A.Liv carefully turned it into something that was a consistent period to be turned into wavetables, I think at 2048 samples per frame.

                            S This user is from outside of this forum
                            S This user is from outside of this forum
                            Scott
                            wrote last edited by
                            #32

                            @x0 LMAO you've got a dubstep washer! @BorrisInABox @Tamasg

                            x0X 1 Reply Last reply
                            0
                            • S Scott

                              @x0 LMAO you've got a dubstep washer! @BorrisInABox @Tamasg

                              x0X This user is from outside of this forum
                              x0X This user is from outside of this forum
                              x0
                              wrote last edited by
                              #33

                              @Scott @BorrisInABox @Tamasg Yup, as soon as I heard that I thought of some Skrillex shit and had to get it put into a synth. It was recorded in 2019, and in 2022 it finally happened. This is the demo that A.Liv made with it, everything except the supersaw and drums are the resulting tables.

                              T 1 Reply Last reply
                              0
                              • x0X x0

                                @Scott @BorrisInABox @Tamasg Yup, as soon as I heard that I thought of some Skrillex shit and had to get it put into a synth. It was recorded in 2019, and in 2022 it finally happened. This is the demo that A.Liv made with it, everything except the supersaw and drums are the resulting tables.

                                T This user is from outside of this forum
                                T This user is from outside of this forum
                                Tamas G
                                wrote last edited by
                                #34

                                @x0 @Scott @BorrisInABox ah no way that's really cool! You can totally hear the samples in there 😄

                                x0X 1 Reply Last reply
                                0
                                • T Tamas G

                                  @x0 @Scott @BorrisInABox ah no way that's really cool! You can totally hear the samples in there 😄

                                  x0X This user is from outside of this forum
                                  x0X This user is from outside of this forum
                                  x0
                                  wrote last edited by
                                  #35

                                  @Tamasg @Scott @BorrisInABox Now feed that into a vocoder and have this gigantic radio voice going "search and destroy" and then a killer dubstep drop, it would be perfect.

                                  1 Reply Last reply
                                  0
                                  • T Tamas G

                                    @BorrisInABox Small add-on for the voice recording set: raw audio only, please — no noise suppression, auto gain, compressor/limiter, or EQ. The boring part matters here: keep the vowel steady with no vibrato, because I’m aligning and averaging glottal cycles and pitch wobble makes the final source less crisp. If you can, include ~10 seconds of room tone (silence) in a file, so I can calibrate noise and hum. And when you record “th”, make it the “think” version (/θ/). Optional but very helpful: a sustained “zzzz” (/z/) and “vvvv” (/v/) so I can capture voicing + turbulence together for better “edge” control later. Hope this helps too. LOL if this works out your voice would be forever partially captured into a synth. LOL.

                                    BorrisB This user is from outside of this forum
                                    BorrisB This user is from outside of this forum
                                    Borris
                                    wrote last edited by
                                    #36

                                    @Tamasg Have a thing.
                                    https://www.dropbox.com/scl/fi/06xusmq45tjvddimav861/glottles.wav?rlkey=opxqxp3ruhb80qdgwva5eoyzl&dl=1

                                    T 2 Replies Last reply
                                    0
                                    • BorrisB Borris

                                      @Tamasg Have a thing.
                                      https://www.dropbox.com/scl/fi/06xusmq45tjvddimav861/glottles.wav?rlkey=opxqxp3ruhb80qdgwva5eoyzl&dl=1

                                      T This user is from outside of this forum
                                      T This user is from outside of this forum
                                      Tamas G
                                      wrote last edited by
                                      #37

                                      @BorrisInABox Excellent! These are CLEAN recordings! Look at those noise floors! down to -55.9 dB on ah_normal.wav! And the F0 stability is fantastic (±1.5 Hz on the normal pitch).
                                      Key observations:
                                      Recording
                                      F0
                                      Notes
                                      ah_normal.wav
                                      119.4 Hz (±1.5)
                                      Best candidate - great level, super stable, lowest noise floor
                                      ah_normal_take2.wav
                                      115.7 Hz (±1.5)
                                      Also excellent, slightly higher noise floor
                                      ah_lower.wav
                                      89.8 Hz (±8.5)
                                      More pitch variation - less stable
                                      ah_lower_take2.wav
                                      90.6 Hz (±1.7)
                                      Much more stable than take1!
                                      ah_higher.wav
                                      215.3 Hz (±3.1)
                                      Good for testing F0 invariance
                                      ah_higher_take2.wav
                                      224.5 Hz (±3.2)
                                      Hottest levels (-2.6 dB peak)
                                      I'll use ah_normal.wav as the primary source — it has the best combination of:
                                      • Stable F0 (±1.5 Hz)
                                      • Good level (-11.9 dB peak, plenty of headroom)
                                      • Lowest noise floor (-55.9 dB)
                                      • Nice male radio voice F0 (~120 Hz)
                                      Huge thanks for this. We'll see how it goes.

                                      1 Reply Last reply
                                      0
                                      • BorrisB Borris

                                        @Tamasg Have a thing.
                                        https://www.dropbox.com/scl/fi/06xusmq45tjvddimav861/glottles.wav?rlkey=opxqxp3ruhb80qdgwva5eoyzl&dl=1

                                        T This user is from outside of this forum
                                        T This user is from outside of this forum
                                        Tamas G
                                        wrote last edited by
                                        #38

                                        @BorrisInABox so here's the big difference. Existing: my voice drops from 1.0 to ~0.09 in ~28 samples, but then oscillates
                                        Your pulse is cleaner, single clear cycle without the multi-peak oscillations. This should give more predictable harmonic structure.

                                        1 Reply Last reply
                                        1
                                        0
                                        Reply
                                        • Reply as topic
                                        Log in to reply
                                        • Oldest to Newest
                                        • Newest to Oldest
                                        • Most Votes


                                        • Login

                                        • Don't have an account? Register

                                        • Login or register to search.
                                        Powered by NodeBB Contributors
                                        • First post
                                          Last post
                                        0
                                        • Categories
                                        • Recent
                                        • Tags
                                        • Popular
                                        • World
                                        • Users
                                        • Groups