• 4 Posts
  • 227 Comments
Joined 2 months ago
cake
Cake day: December 9th, 2024

help-circle










  • Agreed with @Breve@pawb.social’s comment: not a good article.

    First off, this article is just a link and a short summary of a much longer blog post of the Qualys AI security company. So this isn’t a standard industry or academic benchmark, this is something Qualys is using as part of their sales strategy for AI consulting services.

    Which, frankly, I would avoid if this article is indicative of how they work.

    They tested 1 model. One. Insane. DeepSeek released about a half-dozen distillation of llama and qwep2: 3B, 7B, 8B, 14B, 32B, and the full 671B model. As an AI-expert security company, why would you test one of the weakest distillations only? Was it really so expensive to rent some time on the cloud to test out the other models? Not even the 32B which should be well within most hobbyist ability and budget to run on cloud and test out.

    The full article is such obvious SEO bait, I’d be surprised if AI didn’t help write it. Just goes on and on and on talking about why AI security is important, why these tests are important, why its important each single test that it failed. But ultimately, no matter how many words they write, the point that they only test 1 very weak version of the model makes the whole thing pointless except as a way to tie the name Qualys to DeepSeek and thus lure in more gullible rubes. Like did they only test 8B because that’s cheap enough for them to run as a service for their business?

    Anyway, I’ve previously written here on Lemmy about how Deepseek distilations are so easy to break than I almost think they’re buggy and not just insufficient in this regard. I’d really want to see this kind of analysis done on the full model, and obviously also on the other big LLMs they could test to compare against. I wouldn’t even trust the 8B models for unsupervised work. It’s certainly not reliable and non-hallucenatory enough for things like content filtering or expert system usage, no 8B-level model I’ve tested is. So very concerning that it seems like that’s the level of service they’re pushing.



  • I’m noticing a trend of scientific-sounding announcements about physics results that turn out to be theoretical explorations of simulations. The whole “false vacuum” idea isn’t really even a hypothesis, just a what-if. We have no indication that the ground vacuum state isn’t the lowest energy configuration. I think people just find a non-zero minimum unintuitive.

    Anyway, the key figure in all these theoretical simulation articles is the multi-billion dollar quantum super computers running these simulations. Wouldn’t it be funny if tech investors with a lot of money staked on quantum devices pushed for low-quality science that required their machines to be done, thus expanding the market and value of their otherwise pointless supercomputers? This article ends on a very optimistic “these computers have so many uses in cryptography and science” which seems a little out of place when discussing physics results.




  • imho, BLM is a textbook example of recuperation. To start, you have a revolutionary movement with a revolutionary message/goal (abolish the police). Over time, the movement centralized leadership and softened message (defund/reform the police). And now today the movement is a brand that is owned by a company and the leadership is too timid to be effective or promote anything other than slogans and t-shirts.

    The Harris campaign ran and died on exactly the same playbook. Come out strong and rally the base with revolutionary talk, then walk back demands to reformist ones that are palatable to the donor class you’ve become dependent on, then become irrelevant and fail.

    History on this is clear to me. America is in terrible state, and everyone living here knows it, which is why revolutionary ideas are so immediately popular with the masses. But those ideas are too dangerous for Capital so they get watered down and defanged every. single. time. Until we build a decentralized mass movement that isn’t susceptible to leadership ideological capture, we’re going to keep losing.