I wonder what his first clue was.

  • @KingRandomGuy
    13 hours ago

    TBH the paper is a bit light on the details, at least compared to the standards of top ML conferences. A lot of DeepSeek’s innovations on the engineering front aren’t super well documented in their papers (at least not well enough that I could confidently reproduce them).