• @datelmd5sum
    link
    22 hours ago

    IIRC you need double the compute for 10% improvement in a model and they’ve already computed quite a bit.