Astro - Hacker News

54 comments

ChrisArchitect an hour ago

[dupe] https://news.ycombinator.com/item?id=48282814
faitswulff 2 hours ago

I wonder how much of DeepSeek and Xiaomi's pricing cuts can be traced back to cheap energy in China.
[-]
- onlyrealcuzzo 2 hours ago
  
  Energy is like 10-20% of the cost of AI.
  The rest is mostly hardware depreciation.
  [-]
  - cmiles8 an hour ago
    
    Correct. There are challenges getting enough energy to new data center builds but the cost of the energy is low relative to other costs.
  - stingraycharles 2 hours ago
    
    So you think they’re running the same types of state of the art Nvidia deployments?
    
    [-]
    
    onlyrealcuzzo an hour ago
    
    It's supposed to be even MORE expensive:
    Nvidia H100: Typically priced around $25,000–$30,000 (global MSRP).
    Huawei Ascend 910C: Reported to cost roughly $28,000, yet it delivers only 60% of the inference performance of the Nvidia H100.
    Google's TPUs are significantly cheaper for Google for inference. That's pretty much it.
    There's a reason nVidia has an 80% margin right now.
    
    [-]
    
    thrownthatway an hour ago
    
    MSRP is irrelevant in this context.
- maxdo 2 hours ago
  
  it's not that huge of a deal if you compare commercial costs in china and cheapest us states, and electricity is only one of the factors.
  The real reason: anthropic + openai just cut the reasoning output to prevent distill, and hence you see the rise of chinese models to establish contracts globally .
  [-]
  - stingraycharles 2 hours ago
    
    “and hence you see the rise of chinese models to establish contracts globally”
    how will that help them working around the distill issue?
    
    [-]
    
    gessha an hour ago
    
    Collecting user data directly by competing on price. The next step would be figuring out how that data can bring them closer to SOTA.
    
    [-]
    
    stingraycharles an hour ago
    
    Yes ok but that doesn’t give them the thinking tokens, how to reason about the prompt, which is precisely what’s most important.
- pianopatrick 2 hours ago
  
  I've heard on podcasts that AI data centers in the US are powered by natural gas. Apparently there is currently a glut of natural gas. So the energy costs are actually pretty low in the US.
  [-]
  - esseph an hour ago
    
    We extract more than we can export. Currently sitting on something like at least 3,500 trillion cubic feet of ng. We consume 30-32tcf per year.
- colechristensen 2 hours ago
  
  In China the state and corporations can blend so it's difficult to tell the difference between the two. It is known for government sponsored dumping to meet some state goal or another.
  [-]
  - faitswulff 2 hours ago
    
    This runs counter to the last 50 years of American propaganda espousing the inefficiency of government. If the Chinese government can just throw money at industries and have them flourish, why can't other governments?
    
    [-]
    
    nradov an hour ago
    
    Government central planning and industrial policy is always less efficient than free markets. But government can sometimes be more effective in accomplishing critical strategic goals when those are more important than efficiency.
    
    foxygen 2 hours ago
    
    I believe it is more complicated than simply “throwing money at industries”. It seems to me that in China, the Government actually runs the country, while in the US, private capital does.
    
    jameson 2 hours ago
    
    Other governments do, but not as much as China does.
    Healthcare in South Korea for example is government managed and it is one of the best healthcare in the world.
    I believe utility companies are also government owned.
    Also some of the well known companies now were practically government owned during the Park dictatorship in the 70s.
    I wouldn't use the term "Flourish" as what you hear and see is strictly controlled
    
    isityettime an hour ago
    
    > If the Chinese government can just throw money at industries and have them flourish, why can't other governments?
    One possibility that seems likely to me: it takes longer than a single election cycle for an investment like that to bear fruit. And you have to be willing to admit that some bets the state places will lose. This is harder in the kind of democracy and political climate that the US currently has. China's government has more continuity of leadership and a strong emphasis on stability that seem hard to achieve in the US without a lot more political cohesion and more nuanced opposition than the two-party system currently affords.
    If we could achieve it, though, it'd be awesome. Some "best of both worlds" stuff.
    
    rapsey 2 hours ago
    
    As if the west does not use tariffs and subsidies. China is simply much smarter about it and has much more functional institutions.
    
    colechristensen an hour ago
    
    The government paying for your output which has nowhere to go is not a flourishing industry.
    If my kid starts a lemonade stand and I pay him $500 to dump 20 gallons of lemonade into the sewer, did he run a successful business for a day?
    Look into the Chinese ghost cities or US and EU actions against Chinese metals dumping.
    https://en.wikipedia.org/wiki/Underoccupied_developments_in_...
    
    slopinthebag 26 minutes ago
    
    The US government could throw tons of money at everything and get some good results, that doesn't mean it would be efficient. And their system is fundamentally different, I think most westerners would appreciate less efficient AI companies in return for democracy and human rights.
    
    lazide 2 hours ago
    
    They made their domestic steel industry ‘flourish’ by getting every peasant to make their own steel mills too, and mostly crashed their economies.
    When things line up and the decisions are decent, top down can be really good.
    When the decisions are bad, it is exceptionally dramatic failures too. Tofu dregs, etc.
    Right now, no one has to liquidate so it’s easy to hide the damage though.
    
    idiotsecant 2 hours ago
    
    The Chinese economy is deeply weird from a western perspective. Culture and economics are not orthogonal.
    
    [-]
    
    Forgeties79 2 hours ago
    
    Highly recommend everyone check out Breakneck. Felt like that gave me my first real insight into the relationship of the government and business in China.
    
    nonethewiser 2 hours ago
    
    Not really. Dumping != flourishing
    
    golgappa 2 hours ago
    
    [dead]
    
    adrianN 2 hours ago
    
    Any government can and does regularly throw money at industries to make them flourish. The American propaganda claims that this is less efficient than letting market forces decide which companies win.
    
    [-]
    
    nradov an hour ago
    
    And it turns out that the American propaganda is almost always correct.
- 2 hours ago
  
  [deleted]
sea-gold 5 hours ago

Other thread (with many more comments): https://news.ycombinator.com/item?id=48282814
genxy 5 hours ago

This is because the users are training the product. They need training data, so they sell inference at the price of power.
[-]
- gruez 2 hours ago
  
  ?
  >API Services . If you use the API services, we will collect your IP address and the content (text, audio, video, picture) you submit to analyze the relevant instructions based on the model you select and to generate the returned content. Xiaomi will not use the content you provide for model training or any other purposes.
  https://privacy.mi.com/XiaomiMiMoPlatform/en_GB/
  [-]
  - koteelok 2 hours ago
    
    Chinese corporation would never lie
  - colechristensen 2 hours ago
    
    And what legal recourse do you have if they don't follow those rules?
    
    [-]
    
    windexh8er 2 hours ago
    
    You have no recourse in the US, either. Trust no one is the only path given all of the training data is stolen in the first place.
    It will come to light that one or many of the Frontier providers held the data, changed ToS and trained later minimally. But I think they just don't care and will train regardless. None of them abide by any level of ethics that would actually prevent them from leveraging an opportunity.
- Tiberium 2 hours ago
  
  ChatGPT (the setting is shared with Codex) and Claude (shared with Claude Code) also have sharing enabled by default, so why aren't they cheaper?
  [-]
  - Springtime 2 hours ago
    
    There's evidence various third-party models (including Deepseek) used distilling in training, based on models from those leading services. So they have more flexibility with pricing.
    
    [-]
    
    malnourish an hour ago
    
    Is that fundamentally any different than what e.g., Meta and OpenAI have done?
    Besides, hasn't SCotUS ruled that raw LLM output isn't subject to copyright? So these companies would be breaking a ToS at worst.
    
    behnamoh an hour ago
    
    So? And Anthropic/OpenAI literally stole copyrighted content to train their models.
    
    [-]
    
    Springtime an hour ago
    
    The point was that distilling based on others' models for training means they're not spending the same amount on R&D and/or training, giving them headroom in other ways (responding to the parent's point). It wasn't a comment reflecting on copyright/fair use.
    
    [-]
    
    behnamoh an hour ago
    
    In the same fashion, Anthropic/OpenAI also reduced their training cost by not purchasing the license to copyrighted work and stealing it instead.
  - koteelok 2 hours ago
    
    They are? They give away thousands of dollars via subs.
- camelmel 2 hours ago
  
  Is this training data even valuable? Usually AI data annotators get paid to write LLM responses, but here all they'd be getting is a bunch of user queries.
  [-]
  - VerTiGo_Etrex 2 hours ago
    
    1. Feed the same queries into Claude 2. Train on the Claude responses 3. ??? 4. Profit
    This has been the strategy for months now
mariopt 5 hours ago

MiMo-V2.5 Series
Input (Cache Hit) Input (Cache Miss) Output mimo-v2.5-pro $0.0036 $0.435 $0.87
mimo-v2.5 $0.0028 $0.14 $0.28
[-]
- discordance 2 hours ago
  
  For reference (input cache hit, input cache miss, output):
  Deepseek V4 Flash: $0.0028, $0.14, $0.28
  Deepseek V4 Pro: $0.145, $1.74, $3.48
  GPT 5.5: $0.5, $5, 430
  GPT 5.5 Pro: $0.5, $30, $180
  Claude Sonnet 4.6: $0.30, $3, $15
  Claude 4.7 Opus: $0.5, $5, $25
  [-]
  - aftbit an hour ago
    
    $ / 1 million tokens
  - mynegation 2 hours ago
    
    GPT 5.5: 430 or $30?
    
    [-]
    
    sgc an hour ago
    
    They missed the shift key for the $ sign.
infocollector 2 hours ago

Does anyone know what cards these guys are running it?
dlev_pika 2 hours ago

Business Model 2026
1. Dump product to corner the market
2. Kill competition
3. Raise prices, enshitify things
4. Profit
[-]
- onlyrealcuzzo 2 hours ago
  
  With the latest GRAM architecture just announced, I won't be surprised if there's a model than can run on a MacBook pro M5 that outperforms the best frontier model at implementation in 1 year, and in 2 years, a MacBook Neo.
  The frontier models are going to need to REALLY up their game if they can justify $200/mo for pretty awful experiences.
theNewDevTalks an hour ago

[flagged]