

Engage your safety squints!
Is it a war or a cartel, though?
Not sure if OP/bot knows what this community is about… Massive shitpost, nevertheless.
Thanks for clarification!
So… as far as I understand from this thread, it’s basically a finished model (llama or qwen) that’s then fine-tuned on an unknown dataset? That would explain the claimed $6M training cost, hiding the fact that the heavy lifting was done by others (the US of A’s Meta, in this case). Nothing revolutionary to see here, I guess. Small improvements are nice to have, though. I wonder how their smallest models perform; are they any better than llama3.2:8b?
why are you so heavily and openly advertising Deepseek?
That article was written by DeepSeek R1, wasn’t it?
I sure hope everyone wiggles their fuel hose, because otherwise the golden liquid spills all over the place, posing a serious biohazard. No one likes using a station with spilled liquid.
Could be the headline of an Onion article.