• @Mojave
    link
    English
    392 days ago

    DeepSeek claimed the model training took 2,788 thousand H800 GPU hours, which, at a cost of $2/GPU hour, comes out to a mere $5.576 million.

    That seems impossibly low.

    DeepSeek is clear that these costs are only for the final training run, and exclude all other expenses