Summary

Alibaba has launched Qwen 2.5-Max, an AI model it claims outperforms DeepSeek-V3, OpenAI’s GPT-4o, and Meta’s Llama-3.1-405B.

The release, coinciding with Lunar New Year, reflects mounting competition in China’s AI sector after DeepSeek’s rapid rise.

DeepSeek’s recent advancements have pressured Chinese rivals like ByteDance and Baidu to upgrade their models and cut prices.

DeepSeek’s founder downplays price wars, focusing on artificial general intelligence (AGI). The company’s lean, research-focused structure contrasts with China’s tech giants, which face challenges in AI innovation.

  • @[email protected]
    68 points · 1 day ago

    Well, the models start comin’ and they don’t stop comin’…

    The US tech sector has just been completely disrupted. Turns out decades of slashing public education and demonizing “liberal” colleges are starting to catch up. Even Elmo himself said that H-1B visas are critical because the US simply isn’t producing enough talent, but he and the other tech billionaires didn’t realize that money can’t buy everything, as they’re now learning, caught with their pants down.

    • @[email protected]
      20 points · 1 day ago

      Well, the models start comin’ and they don’t stop comin’…

      Got my RTX, gonna hit the ground runnin’…

      • @pHr34kY
        5 points · 20 hours ago

        Didn’t make sense just to train for fun.

        • @locahosr443
          3 points · 20 hours ago

          Gonna steal some data, it’s free to learn

    • @mlg
      7 points · 1 day ago

      I read this entire comment synced to Smash Mouth lmao

  • Em Adespoton
    38 points · 2 days ago

    DeepSeek’s “big change” isn’t the performance of its model though; it’s that it is fully open and operates on a fraction of the resources.

    Is Alibaba’s model also open weights, open reasoning, free for anyone to run, and runnable (and trainable) on consumer hardware?

    • trevor
      37 points · 1 day ago

      Call it “open weight” if you want, but it’s not “fully open”. The training data is still proprietary, and the model can’t be accurately reproduced. It’s proprietary in the same way that Llama is proprietary.

      • @[email protected]
        9 points · 1 day ago (edited)

        But I could use it as a starting point for training and build from it with my own data. I could fork it. I couldn’t fork Llama; I don’t have the weights.

        • trevor
          10 points · 1 day ago

          You can also fork proprietary code that is source available (depending on the specific terms of that particular proprietary license), but that doesn’t make it open source.

          Fair point about Llama not having open weights though. So it’s not as proprietary as Llama. It still shouldn’t be called open source if the training data it needs to function is proprietary.

  • r00ty
    29 points · 2 days ago

    Oh, good. Maybe they will stop trying to scrape my websites at some ridiculous rate using spoofed real-browser UAs. I just blocked their whole ASN (AS45102) in the end.

  • NielsBohron
    21 points · 2 days ago

    I thought for sure this was an Onion article

    • ThePowerOfGeek
      20 points · 2 days ago

      I already have the Temu AI pseudocode. Here you go:

      10 PRINT “Hi, how can I help?”

      20 INPUT A$

      30 PRINT “That’s awesome! What else?”

      40 GOTO 20

    • ms.lane
      13 points · 2 days ago

      2 Reese’s Cups and a pack of ramen. Alibaba are efficient!

  • @[email protected]
    4 points · 2 days ago

    Oh cool, I was worried my 401k had almost sort of recovered from the last bombshell earlier this week…

  • @A_A
    -3 points · 2 days ago

    DeepSeek_R1 outperform or equalzz GPT-1o is major newZ, but : 4o is much better than 1o. Now, Qwen-2.5Max outperforms GPT-4o … watever the investment involved, this is even more important ( ! ).

      • @A_A
        -3 points · 1 day ago

        😋 yes, why ? becauzzze of the zzZ ?

        • @ebolapie
          8 points · 1 day ago

          Among other things, yes.