• 6 Posts
  • 482 Comments
Joined 3 years ago
cake
Cake day: August 29th, 2023

help-circle






  • Not when what they want contradicts the basic limits of reality and logistics!

    Ed Zitron has done a breakdown on building normal sized data centers vs. the current target size of AI data centers, and on the bigger end normal data centers are 10s of MWs up to 100s of MWs. After 2 years Stargate Abilene has only turned on its first 200-300 MW. So I think even if regulators roll over on using twice the power of the entire state this project would take 2-3 years just to turn on the first few hundred megawatts then stall out.


  • Every author named as writing a paper bears full responsibility for the paper.

    This has the nice added bonus that it will likely catch PI’s that put their name on their grad students paper without actually doing the mentoring they were supposed to. It will also catch professors that coast (or at least inflate their citation index) by getting their name on papers they barely contributed to.

    I am quite convinced that, under these arxive guidelines, every single major PI in the field will be banned within a few years.

    Catching a lot of PIs that have allowed and even encouraged slop submission is a good thing in my book.



  • he just posted an entirely unnecessary amount of words

    taking a quick look at it… it’s actually short by Scott’s standards, but still overly long, given that the only point he makes is claiming Lindy’s Law is applicable to predicting AI progress in absence of other information.

    you-can’t-really-knows

    Yeah, he straw-mans AI critics/skeptics as trying to make an argument from ignorance, then tries to argue against that strawman using Lindy’s Law (which assumes ignorance and a pareto distribution). He completely ignores that AI critics are actually making detailed arguments about LLM companies consuming all the good and novel training data, hitting the limits on what compute costs they can afford, running into problems of the long lead time for building datacenters, etc. Which is pretty ironic given his AI 2027 makes a nominal claim to accounting for all that stuff (in actuality it basically all rests on METR’s task horizons, and distorts even that already questionable dataset).




  • Even Scott’s fantasy dream scenario for what prediction markets could be like and what questions they could answer feels… …deliberately naive? …like libertarian brainrot? …disconnected from reality?

    Ask yourself: what are the big future-prediction questions that important disagreements pivot around? When I try this exercise, I get things like:

    Will the AI bubble pop? Will scaling get us all the way to AGI? Will AI be misaligned?

    Huge amounts of money are being dumped into a bubble based on hype, so to hope a predict market would or could make better predictions than the existing business-idiot VCs funding this bubble feels hopelessly naive in a libertarian kind of way. There is already a method of aggregating the wisdom of the crowd and it is failing to incredibly lazy hype and PR.

    Will Trump turn America into a dictatorship? Make it great again? Somewhere in between?

    Again, there is already a mechanism for aggregating wisdom of the crowds, its called an election, and its also failed to get a answer predicated on reality or truth, so again, it seems incredibly naive to expect prediction markets to do better!

    Will YIMBY policies lower rents? How much?

    I mean, the councils and communities making these decision already ignore or overlook longer-term broader predictions of economic impact in favor of immediate home-owner value, I don’t see why Scott would expect prediction markets to help decision making go better here.

    Overall, it feels like Scott is overlooking the way decision making often already ignores science and experts. Society doesn’t have a problem making decent predictions compared to the problems it has communicating expert opinions to the public effectively and crafting policy aligned with the public interest.






  • You’ve described the problem with generalization yes. Well, you could maybe sort of train it not to generate ā€œall men are catsā€, but then that might also prevent it from making the more correct generalization ā€œall cats are mortalā€ or even completely valid generalizations like combing ā€œall men are mortalā€ and ā€œSocrates is manā€ to get ā€œSocrates is mortalā€.

    The problem with monofacts is a bit more subtle. Let’s say the fact that ā€œJohn Smith was born in Seattle in 1982, earned his PhD from Stanford in 2008, and now leads AI research at Tech Corp,ā€ appears only once in the training data set. Some of the other words the model will have seen multiple times and be able to generate tokens in the right way for. Like Seattle as a location in the US, Stanford as a college, 2008 as a date, etc. But the combination describing a fact about John Smith appearing uniquely trains the model to try to generate facts that are unique combinations of data. So the model might try to make up a fact like ā€œJane Doe was born in Omaha in 1984, earned her master from Caltech in 2006, and is now CEO of Tech Corpā€ because it fits the pattern of a unique fact that was in its training data set.



  • Theoretically if the people responsible for that training and reinforcement did their jobs well then those patterns should only include true statements but if it was that easy then you wouldn’t have [insert the entire intellectual history of the human species].

    I’m chiming in to agree with Architeuthis and mention a citation explaining more. LLMs have a hard minimum rate of hallucinations based on the rate of ā€œmonofactsā€ in their training data (https://arxiv.org/html/2502.08666v1). Basically, facts that appear independently and only once in the training data cause the LLM to ā€œlearnā€ that you can have a certain rate of disconnected ā€œfactsā€ that appear nowhere else, and cause it to in turn generate output similar to that, which in practice is basically random and thus basically guaranteed to be false.

    And as Architeuthis says, the ability of LLMs to ā€œgeneralizeā€ basically means they compose true information together in ways that is sometimes false. So to the extent you want your LLM to ever ā€œgeneralizeā€, you also get an unavoidable minimum of hallucinations that way.

    So yeah, even given an even more absurdly big training data source that was also magically perfectly curated you wouldn’t be able to iron out the intrinsic flaws of LLMs.