A comparison of OpenAI's o3, o4-mini, and GPT-4.1; Aaron Levie says o3 nailed a multi-step financial modeling task; Scale AI CEO says o3 is “a big breakthrough” (every.to)
MazdakMA to Technology - Lemmy.org · 4 days ago
Pennomi@lemmy.world · 4 days ago
I’m not sure I’d trust an AI with finances (okay, I’m sure that I wouldn’t trust it). But this is a big step forward if they can get it consistent. Multi-step anything has been the primary limitation of AI for a long time.