• gressen@lemmy.zip
    link
    fedilink
    English
    arrow-up
    84
    arrow-down
    2
    ·
    8 days ago

    Write TOS that state that crawlers automatically accept a service fee and then send invoices to every crawler owner.

    • BodilessGaze@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      42
      ·
      8 days ago

      Huawei is Chinese. There’s literally zero chance a European company like Codeberg is going to successfully collect from a company in China over a TOS violation.

      • wischi@programming.dev
        link
        fedilink
        English
        arrow-up
        15
        ·
        8 days ago

        It’s not even a company. It’s a non-profit “eingetragener Verein”. They have very limited resources, especially money because they purely live on membership fees and donations.

        • BodilessGaze@sh.itjust.works
          link
          fedilink
          English
          arrow-up
          8
          ·
          edit-2
          8 days ago

          I really doubt it. Lawsuits are expensive, and proving responsibility is difficult, since plausible deniability is easy. All scrapers need to do is use shared IPs (e.g. cloud providers), preferably owned by a company in a different legal jurisdiction. That could be the case here: a European company could be using Huawei Cloud to mask the source of their traffic.

          • Venia Silente@lemmy.dbzer0.com
            link
            fedilink
            English
            arrow-up
            5
            ·
            8 days ago

            All scrapers need to do is use shared IPs (e.g. cloud providers),

            Simple: just charge the cloud provider.

            Once that gets strong enough they’ll start placing terms against scraping in their TOS.

            • wischi@programming.dev
              link
              fedilink
              English
              arrow-up
              5
              ·
              edit-2
              8 days ago

              And then they just throw it in the bin because there was never a contract between you and them. What to do then? Sue Microsoft, Amazon and Google

              I’m sure Codeberg, a German non-profit Verein, has time and money to do that 🤣.

              • Venia Silente@lemmy.dbzer0.com
                link
                fedilink
                English
                arrow-up
                1
                ·
                6 days ago

                Sure but that’s a whole different part of the system. Society as a whole has to change (some guillotines would help) and no matter how cool Codeberg is, they can’t do all that on their own.

                In the meantime, what the elites visibly respond to and that is more readily accessible is monetary costs. Make it costly (operationally or legally) to scrape sites, and they’ll stop if at least to whine.

    • wischi@programming.dev
      link
      fedilink
      English
      arrow-up
      39
      arrow-down
      1
      ·
      8 days ago

      They typically don’t include a billing address in the User Agent when crawling 🤣

      • gressen@lemmy.zip
        link
        fedilink
        English
        arrow-up
        11
        arrow-down
        2
        ·
        8 days ago

        That’s a technicality. The billing address can be discovered for a nominal fee as well.

        • wischi@programming.dev
          link
          fedilink
          English
          arrow-up
          7
          ·
          edit-2
          8 days ago

          I’m sure it can’t, especially for foreign IP addresses, VPNs, and a ton of other situations. Even if directly connect to the internet just via your ISP, many countries in Europe (don’t know about US) have laws that would require you to have very good reasons and a court order to get the info you need from the ISP - for a single(!) case.

          If it would be possible to simply get the address of all digital visitors, we wouldn’t have to develop all this anti scrape tech and just sue them.