• 2 Posts
  • 797 Comments
Joined 3 years ago
cake
Cake day: July 5th, 2023

help-circle
  • The Fediverse is designed specifically to publish its data for others to use in an open manner.

    Sure, and if the AI companies want to configure their crawlers to actually use APIs and ActivityPub to efficiently scrape that data, great. Problem is that there’s been crawlers that have done things very inefficiently (whether by malice, ignorance, or misconfiguration) and scrape the HTML of sites repeatedly, driving up some hosting costs and effectively DOSing some of the sites.

    If you put Honeypot URLs in the mix and keep out polite bots with robots.txt and keep out humans by hiding those links, you can serve poisoned responses only to the URLs that nobody should be visiting and not worry too much about collateral damage to legitimate visitors.




  • If I am reading this correctly, anyone who wants to use this service can just configure their HTTP server to act as the man in the middle of the request, so that the crawler sees your URL but is retrieving poison fountain content from the poison fountain service.

    If so, that means the crawlers wouldn’t be able to filter by URL because the actual handler that responds to the HTTP request doesn’t ever see the canonical URL of the poison fountain.

    In other words, the handler is “self hosted” at its own URL while the stream itself comes from the same URL that the crawler never sees.



  • The hot concept around the late 2000’s and early 2010’s was crowdsourcing: leveraging the expertise of volunteers to build consensus. Quora, Stack Overflow, Reddit, and similar sites came up in that time frame where people would freely lend their expertise on a platform because that platform had a pretty good rule set for encouraging that kind of collaboration and consensus building.

    Monetizing that goodwill didn’t just ruin the look and feel of the sites: it permanently altered people’s willingness to participate in those communities. Some, of course, don’t mind contributing. But many do choose to sit things out when they see the whole arrangement as enriching an undeserving middleman.








  • Its a bad idea because ai doesnt “know” in the same way humans do.

    Does that matter? From the user’s perspective, it’s a black box that takes inputs and produces outputs. The epistemology of what knowledge actually means is kinda irrelevant to the decisions of how to design that interface and decide what types of input are favored and which are disfavored.

    It’s a big ol matrix with millions of parameters, some of which are directly controlled by the people who design and maintain the model. Yes, those parameters can be manipulated to be more or less agreeable.

    I’d argue that the current state of these models is way too deferential to the user, where it places too much weight on agreement with the user input, even when that input contradicts a bunch of the other parameters.

    Internal to the model is still a method of combining things it has seen to identify a consensus among what it has seen before, tying together certain tokens that actually do correspond to real words that carry real semantic meaning. It’s just that current models obey the user a bit too much to overcome a real consensus, or will manufacture consensus where none exists.

    I don’t see why someone designing an LLM can’t manipulate the parameters to be less deferential to the claims, or even the instructions, given by the user.


  • Apple supports its devices for a lot longer than most OEMs after release (minimum 5 years since being available for sale from Apple, which might be 2 years of sales), but the impact of dropped support is much more pronounced, as you note. Apple usually announces obsolescence 2 years after support ends, too, and stop selling parts and repair manuals, except a few batteries supported to the 10 year mark. On the software/OS side, that usually means OS upgrades for 5-7 years, then 2 more years of security updates, for a total of 7-9 years of keeping a device reasonably up to date.

    So if you’re holding onto a 5-year-old laptop, Apple support tends to be much better than a 5-year-old laptop from a Windows OEM (especially with Windows 11 upgrade requirements failing to support some devices that were on sale at the time of Windows 11’s release).

    But if you’ve got a 10-year-old Apple laptop, it’s harder to use normally than a 10-year-old Windows laptop.

    Also, don’t use the Apple store for software on your laptop. Use a reasonable package manager like homebrew that doesn’t have the problems you describe. Or go find a mirror that hosts old MacOS packages and install it yourself.


  • Most Costco-specific products, sold under their Kirkland brand, are pretty good. They’re always a good value and they’re sometimes are among the best in class separate from cost.

    I think Apple’s products improved when they started designing their own silicon chips for phones, then tablets, then laptops and desktops. I have beef with their operating systems but there’s no question that they’re better able to squeeze battery life out of their hardware because of that tight control.

    In the restaurant world, there are plenty of examples of a restaurant having a better product because they make something in house: sauces, breads, butchery, pickling, desserts, etc. There are counterexamples, too, but sometimes that kind of vertical integration can result in a better end product.





  • This write-up is really, really good. I think about these concepts whenever people discuss astrophotography or other computation-heavy photography as being fake software generated images, when the reality of translating the sensor data with a graphical representation for the human eye (and all the quirks of human vision, especially around brightness and color) needs conscious decisions on how those charges or voltages on a sensor should be translated into a pixel on digital file.


  • my general computing as a subscription to a server.

    You say this, but I think most of us have offloaded formerly local computing to a server of some kind:

    • Email organization, including folders and attachments, has mostly shifted from a desktop client saving offline copies retrieved and then deleted from the server, to web and app and even IMAP interfaces to the canonical cloud server organization.
    • A huge chunk of users have shifted their productivity tasks (word processing, spreadsheets, presentations, image editing and design) to web-based software.
    • A lot of math functionality is honestly just easier to plug into web-based calculators for finance, accounting, and even the higher level math that Wolfram Alpha excels at.
    • Lots of media organization, from photos to videos to music, are now in cloud-based searchable albums and playlists.

    All these things used to be local uses of computing, and can now be accessed from low powered smartphones. Things like Chromebooks give a user access to between 50-100% of what they’d be doing on a full fledged high powered desktop, depending on the individual needs and use cases.