• 4 Posts
  • 997 Comments
Joined 3 years ago
cake
Cake day: July 5th, 2023

help-circle

  • Yes. But major differences:

    The dot com buildout of physical communications infrastructure involved basically 3 things:

    1. Switches/routers at the nodes for sending signals down the right route.
    2. Fiber optic cables connecting the nodes.
    3. Legal rights of way and easements for the legal right to keep the physical assets in that physical place, and to maintain/replace the stuff as needed.

    Category number 1? That stuff went obsolete quickly, and wasn’t really reused after the crash.

    Category number 2 was better. Turns out, fiber optics can carry signals on a lot more channels than those fibers were originally designed for. And they’re designed for useful lives measured in decades. So even if they sat dark from being unused for 5-10 years, eventually they could be used again.

    Category 3 is super important. That legal right is basically permanent, and so long as communications equipment needs to physically go from one place to another, having that legal right can be built on and profited on (including the ability to sell or lease those rights).

    What’s that gonna look like for the AI infrastructure? The servers full of GPUs are the bulk of the cost, and the GPUs are replaced with a new generation every 1-2 years, seem to require all new power and cooling infrastructure every 1-2 generations or so.

    Plus the AI buildout looks to be several trillion dollars. Even adjusting for inflation, that’s so much more than the tens of billions that each telecom company built out that infrastructure.

    And it’s hard to see how the servers themselves will be useful for regular businesses, much less consumers. A Blackwell 72-GPU server is $3 million and takes 130 kW to run. A residential electrical line maxes out at about 48kW. The newest Vera Rubin servers are projected to be up to 600kW, with all the power and cooling management that comes with that, plus all the ultra high end networking stuff built into that rack. Even deep pocketed businesses will have trouble finding a use for that server rack worth millions, requiring a ton of supporting infrastructure that not even normal pre-2025 data centers have.


  • I don’t think government funding can actually offset the crash in consumer and business demand being insufficient to cover the cost of the most expensive models on the most expensive GPUs. But if you look through my comment history I’ve made the comparison to supersonic flight, because I genuinely believe there’s a possibility that governments fund the expensive branch of this technology for their own military or surveillance or law enforcement purposes without the benefits necessarily actually spilling out into normal commercial applications.

    We’ve hit the point where training a model (both pre training and post training) isn’t the expensive part, and the expensive part is actual inference, which makes it hard to scale the most expensive models to where it’s useful for a lot of people. So it might be that the companies and governments that can afford to operate an expensive model might be the only ones to do it. And they’ll be able to, without necessarily the public being able to have access to the same tech.





  • There’s just no way to pay for the cost of these services, though.

    When someone constructs a 100 MW data center (now considered a smaller one for new construction), that’s about $2 billion in total costs to outfit the whole operation. And then once it’s on, we’re talking something like $10-20 million/month in electricity alone, and a few million in other costs. How many $20 subscriptions do you need to sell just to break even with your operating expenses? How many $100/month subscriptions do you need to sell to make a dent on your interest payments on the construction? Will there be a market for $1000/month subscriptions from millions of customers? If not, how’s this all going to be paid for?




  • Driver facing camera systems can be consistent with privacy, as long as they don’t record or transmit any data other than a single dimensional metric of how distracted or drowsy a driver is (or even discrete binary state of yes/no) and timestamps when that state was detected.

    A closed loop system that merely keeps that data for the current drive and maintains it solely in the vehicle’s own systems can be consistent with privacy principles that nobody else should know anything about how a car is being used, except what can be observed from the outside.



  • But you’re seeing a screenshot of an unmatched order that no driver has claimed yet. I’m saying that unless an actual match is accepted, that’s not really evidence that people in a place don’t tip well, just that some people don’t get their orders filled.

    If you never give less than $5, then any order you’re involved in will involve at least a $5 tip. That may not be representative of the orders you’re not involved with.





  • AI companies are bankrupting themselves with training costs that they need to recoup back by selling inference.

    I think they hit a wall in actual returns on performance with pretraining, years ago. Then they started scaling up on post-training/reinforcement learning to continue improvement, but that might be hitting a plateau as well. More recently it looks like they’re relying more heavily on scaling up on inference, which is a significant problem for their long term business models.

    If they’re not able to cheaply deliver inference (and charge at a premium), how will they be able to sustain their businesses?

    It seems that the most recent, largest models are using a lot more tokens to accomplish the same tasks, so even as token cost drops the actual cost of using the latest models seems to be going up with time (even as performance improves).




  • The only solution is to make sure they can’t read data you don’t want shared.

    Isn’t that the appropriate guardrail, then? LLM chats and agents and whatever need to be contained with external permissions settings that the LLMs simply do not and can never have the power to override.

    In a normal customer service setting with human agents, there are still plenty of examples of what a human agent simply doesn’t have the power to do. Often, they’ll need to escalate to a manager to do things like process refunds not just because they weren’t given social permission to do so, but because they weren’t given technical permissions to do so. LLM agents need to be contained in the same way. Any decent use of agents, human or software, requires carefully designed processes and permissions extrinsic to that agent’s own decisionmaking abilities to make sure that agents don’t do something bad for the company.