Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Darkly)
  • No Skin
Collapse
Brand Logo
  1. Home
  2. Uncategorized
  3. As much as I loathe LLM "AI" built from hoards of stolen data, machine learning "AI" has become terrifically useful.

As much as I loathe LLM "AI" built from hoards of stolen data, machine learning "AI" has become terrifically useful.

Scheduled Pinned Locked Moved Uncategorized
birdsbirdnetecologymachinelearning
8 Posts 5 Posters 1 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • Jon SullivanJ This user is from outside of this forum
    Jon SullivanJ This user is from outside of this forum
    Jon Sullivan
    wrote last edited by
    #1

    As much as I loathe LLM "AI" built from hoards of stolen data, machine learning "AI" has become terrifically useful.

    This past week I had 10 audio recorders set out in the forest and nearby grassland, all recording non-stop from Monday afternoon to Friday morning. That was on our recent university field ecology field trip.

    Today I downloaded all the files to a hard drive (156 GB of data) and then I set my little M1 Macbook Air to work, using the offline desktop BirdNet app to identify all of the birds in the recordings.

    It took most of the day, and now I have a 42,284 row spreadsheet of birds detected.

    It really feels like magic.

    Here's a quick sorted lists of all the bird detections with species IDs with a confidence score >0.9.

    Together with the students in the course, we'll later compare how birds have changed since we started doing this in 2020, and how the birds in the grassland differ from the forest.

    #birds #BirdNet #ecology #nz #MachineLearning

    Tillman ReuterT Ossie :qazlal:S mikiM AUSTRALOPITHECUS πŸ‡ΊπŸ‡¦πŸ‡¨πŸ‡ΏL 4 Replies Last reply
    1
    0
    • Jon SullivanJ Jon Sullivan

      As much as I loathe LLM "AI" built from hoards of stolen data, machine learning "AI" has become terrifically useful.

      This past week I had 10 audio recorders set out in the forest and nearby grassland, all recording non-stop from Monday afternoon to Friday morning. That was on our recent university field ecology field trip.

      Today I downloaded all the files to a hard drive (156 GB of data) and then I set my little M1 Macbook Air to work, using the offline desktop BirdNet app to identify all of the birds in the recordings.

      It took most of the day, and now I have a 42,284 row spreadsheet of birds detected.

      It really feels like magic.

      Here's a quick sorted lists of all the bird detections with species IDs with a confidence score >0.9.

      Together with the students in the course, we'll later compare how birds have changed since we started doing this in 2020, and how the birds in the grassland differ from the forest.

      #birds #BirdNet #ecology #nz #MachineLearning

      Tillman ReuterT This user is from outside of this forum
      Tillman ReuterT This user is from outside of this forum
      Tillman Reuter
      wrote last edited by
      #2

      @joncounts which audiorecorders did you use. I remember Audiomoth. Are those still a thing?

      Jon SullivanJ 1 Reply Last reply
      0
      • Jon SullivanJ Jon Sullivan

        As much as I loathe LLM "AI" built from hoards of stolen data, machine learning "AI" has become terrifically useful.

        This past week I had 10 audio recorders set out in the forest and nearby grassland, all recording non-stop from Monday afternoon to Friday morning. That was on our recent university field ecology field trip.

        Today I downloaded all the files to a hard drive (156 GB of data) and then I set my little M1 Macbook Air to work, using the offline desktop BirdNet app to identify all of the birds in the recordings.

        It took most of the day, and now I have a 42,284 row spreadsheet of birds detected.

        It really feels like magic.

        Here's a quick sorted lists of all the bird detections with species IDs with a confidence score >0.9.

        Together with the students in the course, we'll later compare how birds have changed since we started doing this in 2020, and how the birds in the grassland differ from the forest.

        #birds #BirdNet #ecology #nz #MachineLearning

        Ossie :qazlal:S This user is from outside of this forum
        Ossie :qazlal:S This user is from outside of this forum
        Ossie :qazlal:
        wrote last edited by
        #3

        @joncounts How do you record continuously? I have tried this but I run into battery life issues.

        Jon SullivanJ 1 Reply Last reply
        0
        • Jon SullivanJ Jon Sullivan

          As much as I loathe LLM "AI" built from hoards of stolen data, machine learning "AI" has become terrifically useful.

          This past week I had 10 audio recorders set out in the forest and nearby grassland, all recording non-stop from Monday afternoon to Friday morning. That was on our recent university field ecology field trip.

          Today I downloaded all the files to a hard drive (156 GB of data) and then I set my little M1 Macbook Air to work, using the offline desktop BirdNet app to identify all of the birds in the recordings.

          It took most of the day, and now I have a 42,284 row spreadsheet of birds detected.

          It really feels like magic.

          Here's a quick sorted lists of all the bird detections with species IDs with a confidence score >0.9.

          Together with the students in the course, we'll later compare how birds have changed since we started doing this in 2020, and how the birds in the grassland differ from the forest.

          #birds #BirdNet #ecology #nz #MachineLearning

          mikiM This user is from outside of this forum
          mikiM This user is from outside of this forum
          miki
          wrote last edited by
          #4

          @joncounts LLMs (combined with ASR models) are basically this but for text.

          LLMs let you turn large and messy corpora of text, audio and images into a neat csv, which you can then analyze in any data science tool of choice. You can't just Excel your way through "how likely are right-wing newspapers to mention the race of a rapist, depending on what that race is." Not without a team of grad students doing the gruntwork at least. LLMs automate away all of that gruntwork, letting you answer research questions much faster.

          You can't use pure Chat GPT for this, you need specialized tools.

          Sure, LLMs hallucinate, just like human annotators do. This is why you (as a human) need to go through a sample of your corpus and figure out what your hallucination rate is. This is still much faster than annotating the entire corpus.

          Jon SullivanJ 1 Reply Last reply
          0
          • Tillman ReuterT Tillman Reuter

            @joncounts which audiorecorders did you use. I remember Audiomoth. Are those still a thing?

            Jon SullivanJ This user is from outside of this forum
            Jon SullivanJ This user is from outside of this forum
            Jon Sullivan
            wrote last edited by
            #5

            @tillmanreuter We use NZ Department of Conservation manufactured AR4s, which come in excellent weather proofed cases, plus we’ve got a set of AudioMoths plugged into better microphones (the same that the AR4s use).

            1 Reply Last reply
            0
            • Ossie :qazlal:S Ossie :qazlal:

              @joncounts How do you record continuously? I have tried this but I run into battery life issues.

              Jon SullivanJ This user is from outside of this forum
              Jon SullivanJ This user is from outside of this forum
              Jon Sullivan
              wrote last edited by
              #6

              @spacefinner The Department of Conservation AR4s use four AA batteries and can easily run longer than a week, although we program them to record at lower frequency at night to save power (since the nocturnal birds in NZ don’t sing at such high frequencies). I use the three AA battery case for the AudioMoths, which runs for about two weeks (although only with lower power microSD cards).

              1 Reply Last reply
              0
              • mikiM miki

                @joncounts LLMs (combined with ASR models) are basically this but for text.

                LLMs let you turn large and messy corpora of text, audio and images into a neat csv, which you can then analyze in any data science tool of choice. You can't just Excel your way through "how likely are right-wing newspapers to mention the race of a rapist, depending on what that race is." Not without a team of grad students doing the gruntwork at least. LLMs automate away all of that gruntwork, letting you answer research questions much faster.

                You can't use pure Chat GPT for this, you need specialized tools.

                Sure, LLMs hallucinate, just like human annotators do. This is why you (as a human) need to go through a sample of your corpus and figure out what your hallucination rate is. This is still much faster than annotating the entire corpus.

                Jon SullivanJ This user is from outside of this forum
                Jon SullivanJ This user is from outside of this forum
                Jon Sullivan
                wrote last edited by
                #7

                @miki Thanks. Yes, it’s all the hoovering up of training data sets without permission by the big LLM products that I object to (plus the massive power consumption needed to build and refine the models). The tech behind the models is pretty neat.

                1 Reply Last reply
                0
                • Jon SullivanJ Jon Sullivan

                  As much as I loathe LLM "AI" built from hoards of stolen data, machine learning "AI" has become terrifically useful.

                  This past week I had 10 audio recorders set out in the forest and nearby grassland, all recording non-stop from Monday afternoon to Friday morning. That was on our recent university field ecology field trip.

                  Today I downloaded all the files to a hard drive (156 GB of data) and then I set my little M1 Macbook Air to work, using the offline desktop BirdNet app to identify all of the birds in the recordings.

                  It took most of the day, and now I have a 42,284 row spreadsheet of birds detected.

                  It really feels like magic.

                  Here's a quick sorted lists of all the bird detections with species IDs with a confidence score >0.9.

                  Together with the students in the course, we'll later compare how birds have changed since we started doing this in 2020, and how the birds in the grassland differ from the forest.

                  #birds #BirdNet #ecology #nz #MachineLearning

                  AUSTRALOPITHECUS πŸ‡ΊπŸ‡¦πŸ‡¨πŸ‡ΏL This user is from outside of this forum
                  AUSTRALOPITHECUS πŸ‡ΊπŸ‡¦πŸ‡¨πŸ‡ΏL This user is from outside of this forum
                  AUSTRALOPITHECUS πŸ‡ΊπŸ‡¦πŸ‡¨πŸ‡Ώ
                  wrote last edited by
                  #8

                  @joncounts have you by chance captured a sound of a tree falling, or it doesn't make a sound when there is a listening device

                  1 Reply Last reply
                  0
                  • R ActivityRelay shared this topic
                  Reply
                  • Reply as topic
                  Log in to reply
                  • Oldest to Newest
                  • Newest to Oldest
                  • Most Votes


                  • Login

                  • Don't have an account? Register

                  • Login or register to search.
                  Powered by NodeBB Contributors
                  • First post
                    Last post
                  0
                  • Categories
                  • Recent
                  • Tags
                  • Popular
                  • World
                  • Users
                  • Groups