Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Darkly)
  • No Skin
Collapse
Brand Logo
  1. Home
  2. Uncategorized
  3. Are you using #Codeberg to host your favorite AI-assisted and otherwise vibecoded project because your desire for dopamine has utterly destroyed your willingness to learn new things?

Are you using #Codeberg to host your favorite AI-assisted and otherwise vibecoded project because your desire for dopamine has utterly destroyed your willingness to learn new things?

Scheduled Pinned Locked Moved Uncategorized
codeberggithub
26 Posts 3 Posters 51 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • n0toose will be at FOSDEMN n0toose will be at FOSDEM

    @pojntfx Still 99.9% trained on top of stolen data, an unimaginable waste of resources by treating public infrastructure as "free real estate" (without respect for robots.txt, by visiting often the same links ten thousand times or using sketchy residential proxies to hamper services like Codeberg's and circumvent explicit "no"'s), and vibes towards the law (which really doesn't apply to the commonersβ„’ in many jurisdictions).

    n0toose will be at FOSDEMN This user is from outside of this forum
    n0toose will be at FOSDEMN This user is from outside of this forum
    n0toose will be at FOSDEM
    wrote last edited by
    #7

    @pojntfx The need for a "fuzzy web search" is the result of normal web search having been completely enshittified and serves another party's "want" of you not seeking out information by yourself but receiving them from "summaries" instead.

    Felicitas Pojtinger is in πŸ‡¨πŸ‡­P 1 Reply Last reply
    0
    • n0toose will be at FOSDEMN n0toose will be at FOSDEM

      @pojntfx Still 99.9% trained on top of stolen data, an unimaginable waste of resources by treating public infrastructure as "free real estate" (without respect for robots.txt, by visiting often the same links ten thousand times or using sketchy residential proxies to hamper services like Codeberg's and circumvent explicit "no"'s), and vibes towards the law (which really doesn't apply to the commonersβ„’ in many jurisdictions).

      Felicitas Pojtinger is in πŸ‡¨πŸ‡­P This user is from outside of this forum
      Felicitas Pojtinger is in πŸ‡¨πŸ‡­P This user is from outside of this forum
      Felicitas Pojtinger is in πŸ‡¨πŸ‡­
      wrote last edited by
      #8

      @n0toose That's not the case for a lot of those open models anymore. Apertus is a great example of it, it was built only from data sources that explicitly allow crawling https://www.swiss-ai.org/apertus

      "Stolen data" is ofc also very debatable. Tightening copyright law, patents and so on even further to "stop" LLMs from being trained will probably only have negative consequences on those writing OSS. I really don't want to live in a world where you're legally prohibited from learning.

      n0toose will be at FOSDEMN 1 Reply Last reply
      0
      • n0toose will be at FOSDEMN n0toose will be at FOSDEM

        @pojntfx The need for a "fuzzy web search" is the result of normal web search having been completely enshittified and serves another party's "want" of you not seeking out information by yourself but receiving them from "summaries" instead.

        Felicitas Pojtinger is in πŸ‡¨πŸ‡­P This user is from outside of this forum
        Felicitas Pojtinger is in πŸ‡¨πŸ‡­P This user is from outside of this forum
        Felicitas Pojtinger is in πŸ‡¨πŸ‡­
        wrote last edited by
        #9

        @n0toose I'm not sure I agree with that. There has never been a (legal) search engine that could "just give me the paper on CRIU where the author mentions Cricket" before. That stuff is genuinely new, and while stuff like Google has enshittified (I don't disagree with you there) there is also a whole new type of query that you just couldn't do before, all locally on your own device.

        n0toose will be at FOSDEMN 1 Reply Last reply
        0
        • Felicitas Pojtinger is in πŸ‡¨πŸ‡­P Felicitas Pojtinger is in πŸ‡¨πŸ‡­

          @n0toose That's not the case for a lot of those open models anymore. Apertus is a great example of it, it was built only from data sources that explicitly allow crawling https://www.swiss-ai.org/apertus

          "Stolen data" is ofc also very debatable. Tightening copyright law, patents and so on even further to "stop" LLMs from being trained will probably only have negative consequences on those writing OSS. I really don't want to live in a world where you're legally prohibited from learning.

          n0toose will be at FOSDEMN This user is from outside of this forum
          n0toose will be at FOSDEMN This user is from outside of this forum
          n0toose will be at FOSDEM
          wrote last edited by
          #10

          @pojntfx Oh, Apertus, that one by the Swiss universities. I mean yeah, that is a positive example with a positive approach but not one I expect to be relevant in any shape or form in 5 years from now. The exception doesn't make the rule, neither are the bad effects I stated above driven by the exceptions.

          n0toose will be at FOSDEMN Felicitas Pojtinger is in πŸ‡¨πŸ‡­P 2 Replies Last reply
          0
          • n0toose will be at FOSDEMN n0toose will be at FOSDEM

            @pojntfx Oh, Apertus, that one by the Swiss universities. I mean yeah, that is a positive example with a positive approach but not one I expect to be relevant in any shape or form in 5 years from now. The exception doesn't make the rule, neither are the bad effects I stated above driven by the exceptions.

            n0toose will be at FOSDEMN This user is from outside of this forum
            n0toose will be at FOSDEMN This user is from outside of this forum
            n0toose will be at FOSDEM
            wrote last edited by
            #11

            @pojntfx Personally I think that if you DoS random people's computers for data mining that also falls under the "law" but...

            Felicitas Pojtinger is in πŸ‡¨πŸ‡­P 1 Reply Last reply
            0
            • n0toose will be at FOSDEMN n0toose will be at FOSDEM

              @pojntfx Oh, Apertus, that one by the Swiss universities. I mean yeah, that is a positive example with a positive approach but not one I expect to be relevant in any shape or form in 5 years from now. The exception doesn't make the rule, neither are the bad effects I stated above driven by the exceptions.

              Felicitas Pojtinger is in πŸ‡¨πŸ‡­P This user is from outside of this forum
              Felicitas Pojtinger is in πŸ‡¨πŸ‡­P This user is from outside of this forum
              Felicitas Pojtinger is in πŸ‡¨πŸ‡­
              wrote last edited by
              #12

              @n0toose I honestly think the same approach that the EU has been doing with open search indexes should exist for LLM training data too eventually. There are clearly issues with how you get access to say my OSS projects if you want to train them without hammering my forge, I know that. At the same time though, without access to open training data you're ceeding the entire field - which pretty much everyone uses in one way or another - to private IP deals with publishers.

              1 Reply Last reply
              0
              • n0toose will be at FOSDEMN n0toose will be at FOSDEM

                @pojntfx Personally I think that if you DoS random people's computers for data mining that also falls under the "law" but...

                Felicitas Pojtinger is in πŸ‡¨πŸ‡­P This user is from outside of this forum
                Felicitas Pojtinger is in πŸ‡¨πŸ‡­P This user is from outside of this forum
                Felicitas Pojtinger is in πŸ‡¨πŸ‡­
                wrote last edited by
                #13

                @n0toose There were lots of proposals around criminalizing "unauthorized access" to services in the past few decades, about trying to make it so that only a "human" can access them, enforcing ToS legally ... I've really only seen them used against end users in practice (Reddit's anti-scraping policy/API shutdown, third-party clients for Signal, any reverse engineering project ever etc.)

                A lot of these kinds of laws will have effects far, far worse than DDoSing public infrastructure IMHO.

                n0toose will be at FOSDEMN 1 Reply Last reply
                0
                • Felicitas Pojtinger is in πŸ‡¨πŸ‡­P Felicitas Pojtinger is in πŸ‡¨πŸ‡­

                  @n0toose I'm not sure I agree with that. There has never been a (legal) search engine that could "just give me the paper on CRIU where the author mentions Cricket" before. That stuff is genuinely new, and while stuff like Google has enshittified (I don't disagree with you there) there is also a whole new type of query that you just couldn't do before, all locally on your own device.

                  n0toose will be at FOSDEMN This user is from outside of this forum
                  n0toose will be at FOSDEMN This user is from outside of this forum
                  n0toose will be at FOSDEM
                  wrote last edited by
                  #14

                  @pojntfx There has, they used to call them "Google dorks" or "Advanced Search". Now if that didn't work 100% of the time, well, applies to both cases (readjusting a query in a different form); besides the point anyway because people are not looking for something that works for minor edge cases, they are looking for a way to look for informationβ€”and you can't even look up for an omelette recipe anymore without SEO garbage taking up the first two pages.

                  n0toose will be at FOSDEMN Felicitas Pojtinger is in πŸ‡¨πŸ‡­P 2 Replies Last reply
                  0
                  • n0toose will be at FOSDEMN n0toose will be at FOSDEM

                    @pojntfx There has, they used to call them "Google dorks" or "Advanced Search". Now if that didn't work 100% of the time, well, applies to both cases (readjusting a query in a different form); besides the point anyway because people are not looking for something that works for minor edge cases, they are looking for a way to look for informationβ€”and you can't even look up for an omelette recipe anymore without SEO garbage taking up the first two pages.

                    n0toose will be at FOSDEMN This user is from outside of this forum
                    n0toose will be at FOSDEMN This user is from outside of this forum
                    n0toose will be at FOSDEM
                    wrote last edited by
                    #15

                    @pojntfx I also think that the notion of local LLMs letting you find niche papers exaggerates their abilities, and that the ability to use them depends on hardware that is not accessible anymore due to data center costs and IMO due to the overall war against general purpose computing.

                    Felicitas Pojtinger is in πŸ‡¨πŸ‡­P 1 Reply Last reply
                    0
                    • n0toose will be at FOSDEMN n0toose will be at FOSDEM

                      Are you using #Codeberg to host your favorite AI-assisted and otherwise vibecoded project because your desire for dopamine has utterly destroyed your willingness to learn new things? Do you neither care about how you're "flooding the room" of the free software ecosystem nor about the greater societal implications, insisting that "it is what it is"?

                      Check out #GitHub, the world's most popular AI-first platform! Copilot, annoying maintainers with slop, you'll fit just right in!

                      ByteB This user is from outside of this forum
                      ByteB This user is from outside of this forum
                      Byte
                      wrote last edited by
                      #16

                      @n0toose codeberg ought to ban that slop

                      1 Reply Last reply
                      0
                      • n0toose will be at FOSDEMN n0toose will be at FOSDEM

                        @pojntfx There has, they used to call them "Google dorks" or "Advanced Search". Now if that didn't work 100% of the time, well, applies to both cases (readjusting a query in a different form); besides the point anyway because people are not looking for something that works for minor edge cases, they are looking for a way to look for informationβ€”and you can't even look up for an omelette recipe anymore without SEO garbage taking up the first two pages.

                        Felicitas Pojtinger is in πŸ‡¨πŸ‡­P This user is from outside of this forum
                        Felicitas Pojtinger is in πŸ‡¨πŸ‡­P This user is from outside of this forum
                        Felicitas Pojtinger is in πŸ‡¨πŸ‡­
                        wrote last edited by
                        #17

                        @n0toose Yeah, the SEO slop is obviously horrendous. I'm ngl though, being able to use an LLM to search through say IndieWeb instead has been the first time in a long time that I've actually been able to find (non-code) answers to questions again. Used it for booking flights with niche airlines in a country I've never been to for example. That kind of stuff was always locked behind proprietary APIs for so long and now you can actually access them without them for the first time in forever.

                        n0toose will be at FOSDEMN 1 Reply Last reply
                        0
                        • Felicitas Pojtinger is in πŸ‡¨πŸ‡­P Felicitas Pojtinger is in πŸ‡¨πŸ‡­

                          @n0toose There were lots of proposals around criminalizing "unauthorized access" to services in the past few decades, about trying to make it so that only a "human" can access them, enforcing ToS legally ... I've really only seen them used against end users in practice (Reddit's anti-scraping policy/API shutdown, third-party clients for Signal, any reverse engineering project ever etc.)

                          A lot of these kinds of laws will have effects far, far worse than DDoSing public infrastructure IMHO.

                          n0toose will be at FOSDEMN This user is from outside of this forum
                          n0toose will be at FOSDEMN This user is from outside of this forum
                          n0toose will be at FOSDEM
                          wrote last edited by
                          #18

                          @pojntfx I think one can be for scraping and making data available e.g. for researchers but against the specific manners in which startups break thingsβ€”it's just hard to explain that to someone who doesn't operate infrastructure for people at scale.

                          Anyway, we just have to spend two or three times the price on SSDs I guess (see: greater societal impact), so if that's fine...

                          Felicitas Pojtinger is in πŸ‡¨πŸ‡­P 1 Reply Last reply
                          0
                          • n0toose will be at FOSDEMN n0toose will be at FOSDEM

                            @pojntfx I also think that the notion of local LLMs letting you find niche papers exaggerates their abilities, and that the ability to use them depends on hardware that is not accessible anymore due to data center costs and IMO due to the overall war against general purpose computing.

                            Felicitas Pojtinger is in πŸ‡¨πŸ‡­P This user is from outside of this forum
                            Felicitas Pojtinger is in πŸ‡¨πŸ‡­P This user is from outside of this forum
                            Felicitas Pojtinger is in πŸ‡¨πŸ‡­
                            wrote last edited by
                            #19

                            @n0toose There are production constraints around all of this atm, yes. But much like how you can't fix the housing crisis without making it cheaper to build houses and actually building them, I don't think we can fix something like this without actually building out the fabs and getting supply up there w/ demand.

                            And local LLMs are very much "real" now. Try out Newelle or Alpaca on GNOME on your regular laptop - even mine can run them w/o issues now via Vulkan, and I don't have a lot of VRAM.

                            n0toose will be at FOSDEMN 1 Reply Last reply
                            0
                            • Felicitas Pojtinger is in πŸ‡¨πŸ‡­P Felicitas Pojtinger is in πŸ‡¨πŸ‡­

                              @n0toose Yeah, the SEO slop is obviously horrendous. I'm ngl though, being able to use an LLM to search through say IndieWeb instead has been the first time in a long time that I've actually been able to find (non-code) answers to questions again. Used it for booking flights with niche airlines in a country I've never been to for example. That kind of stuff was always locked behind proprietary APIs for so long and now you can actually access them without them for the first time in forever.

                              n0toose will be at FOSDEMN This user is from outside of this forum
                              n0toose will be at FOSDEMN This user is from outside of this forum
                              n0toose will be at FOSDEM
                              wrote last edited by
                              #20

                              @pojntfx Yeah and you're treating this as if it's something to be taken for granted forever; the counterexamples are the equivalents of "searxng" or alternative search engines to me tbh.

                              Felicitas Pojtinger is in πŸ‡¨πŸ‡­P 1 Reply Last reply
                              0
                              • n0toose will be at FOSDEMN n0toose will be at FOSDEM

                                @pojntfx I think one can be for scraping and making data available e.g. for researchers but against the specific manners in which startups break thingsβ€”it's just hard to explain that to someone who doesn't operate infrastructure for people at scale.

                                Anyway, we just have to spend two or three times the price on SSDs I guess (see: greater societal impact), so if that's fine...

                                Felicitas Pojtinger is in πŸ‡¨πŸ‡­P This user is from outside of this forum
                                Felicitas Pojtinger is in πŸ‡¨πŸ‡­P This user is from outside of this forum
                                Felicitas Pojtinger is in πŸ‡¨πŸ‡­
                                wrote last edited by
                                #21

                                @n0toose I mean yes, optimally a law like you mention would try and fix this, but I have 0 trust in any jurisdiction actually making a law like that. I'm pretty certain we'll instead end up in a world where only massive companies that can pay for IP licensing agreements can train models.

                                1 Reply Last reply
                                0
                                • Felicitas Pojtinger is in πŸ‡¨πŸ‡­P Felicitas Pojtinger is in πŸ‡¨πŸ‡­

                                  @n0toose There are production constraints around all of this atm, yes. But much like how you can't fix the housing crisis without making it cheaper to build houses and actually building them, I don't think we can fix something like this without actually building out the fabs and getting supply up there w/ demand.

                                  And local LLMs are very much "real" now. Try out Newelle or Alpaca on GNOME on your regular laptop - even mine can run them w/o issues now via Vulkan, and I don't have a lot of VRAM.

                                  n0toose will be at FOSDEMN This user is from outside of this forum
                                  n0toose will be at FOSDEMN This user is from outside of this forum
                                  n0toose will be at FOSDEM
                                  wrote last edited by
                                  #22

                                  @pojntfx and you're using that to find obscure papers?

                                  Felicitas Pojtinger is in πŸ‡¨πŸ‡­P 1 Reply Last reply
                                  0
                                  • n0toose will be at FOSDEMN n0toose will be at FOSDEM

                                    @pojntfx Yeah and you're treating this as if it's something to be taken for granted forever; the counterexamples are the equivalents of "searxng" or alternative search engines to me tbh.

                                    Felicitas Pojtinger is in πŸ‡¨πŸ‡­P This user is from outside of this forum
                                    Felicitas Pojtinger is in πŸ‡¨πŸ‡­P This user is from outside of this forum
                                    Felicitas Pojtinger is in πŸ‡¨πŸ‡­
                                    wrote last edited by
                                    #23

                                    @n0toose I don't know about your experience, but any legal options I've found that try to solve this problem are worse than useless. I find nothing of relevance on Marginalia and other things like it.

                                    1 Reply Last reply
                                    0
                                    • n0toose will be at FOSDEMN n0toose will be at FOSDEM

                                      @pojntfx and you're using that to find obscure papers?

                                      Felicitas Pojtinger is in πŸ‡¨πŸ‡­P This user is from outside of this forum
                                      Felicitas Pojtinger is in πŸ‡¨πŸ‡­P This user is from outside of this forum
                                      Felicitas Pojtinger is in πŸ‡¨πŸ‡­
                                      wrote last edited by
                                      #24

                                      @n0toose Yeah! I mean I just did yesterday, for that CRIU one. Try it πŸ™‚ All you need is a CPU or anything that can do Vulkan. I use fully OSS drivers, even runs on those.

                                      n0toose will be at FOSDEMN 1 Reply Last reply
                                      0
                                      • Felicitas Pojtinger is in πŸ‡¨πŸ‡­P Felicitas Pojtinger is in πŸ‡¨πŸ‡­

                                        @n0toose Yeah! I mean I just did yesterday, for that CRIU one. Try it πŸ™‚ All you need is a CPU or anything that can do Vulkan. I use fully OSS drivers, even runs on those.

                                        n0toose will be at FOSDEMN This user is from outside of this forum
                                        n0toose will be at FOSDEMN This user is from outside of this forum
                                        n0toose will be at FOSDEM
                                        wrote last edited by
                                        #25

                                        @pojntfx I mean, mildly interesting (and thanks for letting me know) but doesn't convince me on ethical grounds nevertheless.

                                        Felicitas Pojtinger is in πŸ‡¨πŸ‡­P 1 Reply Last reply
                                        0
                                        • n0toose will be at FOSDEMN n0toose will be at FOSDEM

                                          @pojntfx I mean, mildly interesting (and thanks for letting me know) but doesn't convince me on ethical grounds nevertheless.

                                          Felicitas Pojtinger is in πŸ‡¨πŸ‡­P This user is from outside of this forum
                                          Felicitas Pojtinger is in πŸ‡¨πŸ‡­P This user is from outside of this forum
                                          Felicitas Pojtinger is in πŸ‡¨πŸ‡­
                                          wrote last edited by
                                          #26

                                          @n0toose Fair, I understand that. It's ultimately your choice. If you work best w/o those tools - then that's great too.

                                          1 Reply Last reply
                                          0
                                          • R ActivityRelay shared this topic
                                          Reply
                                          • Reply as topic
                                          Log in to reply
                                          • Oldest to Newest
                                          • Newest to Oldest
                                          • Most Votes


                                          • Login

                                          • Don't have an account? Register

                                          • Login or register to search.
                                          Powered by NodeBB Contributors
                                          • First post
                                            Last post
                                          0
                                          • Categories
                                          • Recent
                                          • Tags
                                          • Popular
                                          • World
                                          • Users
                                          • Groups