NVIDIA's Eos supercomputer can train a 175 billion parameter GPT-3 model in under four minutes

Lugh · 1 year ago

NVIDIA's Eos supercomputer can train a 175 billion parameter GPT-3 model in under four minutes

@blackfire · 1 year ago

So it was a perf test of a 1b token size model not the full 3.7T that get3 is trained with. I mean great. They are showing improvement but this is just a headline grabber they haven’t done anything actually useful here.

@[email protected] · 1 year ago

Just checking in to say they are still there - so many rascals showing off rigs these days

NVIDIA's Eos supercomputer can train a 175 billion parameter GPT-3 model in under four minutes

NVIDIA's Eos supercomputer can train a 175 billion parameter GPT-3 model in under four minutes

NVIDIA's Eos supercomputer just broke its own AI training benchmark record