- cross-posted to:
- [email protected]
The underlying research story is interesting, but the way it’s written up actively makes it worse.
The researchers based s1 on Qwen2.5, an open-source model from Alibaba Cloud.
Watch me create a racing car for less than $50. Step 1: start with a Mercedes F1 racer…
Highly misleading. They finetuned an existing model using a different existing model in a process called distillation.
The article is effectively saying “our model only cost $50 to make, plus whatever tens or hundreds of millions of dollars the models we stole from cost.”
While absolutely true, the same can be said about the dehumidifier I bought from a Chinese company with a nonsense name for 1/4 of the cost of an American brand-name one.
That would only be a valid comparison if the American brand dehumidifier, as a complete product, was a part of the Chinese one’s bill of materials. This is closer to the cartoon meme image where MacGyver builds a megaphone out of a squirrel, twigs, and a megaphone.
It’s pretty much how the global economy has worked for a few decades, right? Advanced countries design, research and run things, developing countries build them.
I’m not really sure what it has to do with OP, though.
Those are rookie numbers. I trained one in 1 minute with $1!
Trained it to do very basic arithmetic tasks, not to rival OpenAI.
How long before Congress bans this one too?