• CarbonScored [any]@hexbear.net
    4 months ago

    The real beauty of this is that he’s released it as code you can deploy on your own sites. It’s not just a single website he owns that will quickly be blacklisted; it’s a tarpit you can put anywhere.

    I also liked these snippets from their site:

    “Lastly, optional Markov-babble can be added to the pages, to give the crawlers something to scrape up and train their LLMs on, hopefully accelerating model collapse.”

    “Let’s say you’ve got horsepower and bandwidth to burn, and just want to see these AI models burn. Nepenthes has what you need:

    “Don’t make any attempt to block crawlers with the IP stats. Put the delay times as low as you are comfortable with. Train a big Markov corpus and leave the Markov module enabled, set the maximum babble size to something big. In short, let them suck down as much bullshit as they have diskspace for and choke on it.”
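
For anyone wondering what "Markov-babble" with a maximum size looks like in practice, here is a rough Python sketch of the general idea. To be clear, this is not Nepenthes' actual code or configuration: the function names `train` and `babble`, the `order` parameter, and the `max_words` cap are all made up for illustration. It just builds a word-level Markov chain from a corpus and then random-walks it to produce plausible-looking nonsense up to a size limit.

```python
import random
from collections import defaultdict


def train(corpus: str, order: int = 2) -> dict:
    """Map each run of `order` consecutive words to the words seen right after it."""
    words = corpus.split()
    model = defaultdict(list)
    for i in range(len(words) - order):
        key = tuple(words[i:i + order])
        model[key].append(words[i + order])
    return dict(model)


def babble(model: dict, max_words: int = 50, seed=None) -> str:
    """Random-walk the chain and return at most `max_words` words of babble."""
    if not model:
        return ""
    rng = random.Random(seed)
    keys = list(model.keys())
    state = rng.choice(keys)
    out = list(state)
    while len(out) < max_words:
        followers = model.get(state)
        if not followers:
            # Dead end (this state was only seen at the end of the corpus):
            # jump to a random known state and keep going.
            state = rng.choice(keys)
            out.extend(state)
            continue
        nxt = rng.choice(followers)
        out.append(nxt)
        state = state[1:] + (nxt,)
    # Trim in case a restart pushed us past the cap.
    return " ".join(out[:max_words])


if __name__ == "__main__":
    corpus = (
        "the crawler follows the link and the link leads to another page "
        "and the page leads to another link and the crawler never stops"
    )
    model = train(corpus, order=2)
    print(babble(model, max_words=30, seed=1))
```

A bigger corpus and a bigger `max_words` give you longer, more varied garbage, which is roughly the trade-off the quoted advice is pointing at: more babble per page costs you more CPU and bandwidth too.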