Anthropic CPO Mike Krieger: Dario Amodei predicted the coding benchmark (SWE-bench) would reach 90% by the end of the year. I’ve started taking AI timelines more seriously after seeing the progress. "mid-2025 now feels much closer than 2027"

46,067 views • 1 year ago • via X (Twitter)

9 comments

John Bridges • 1 year ago

It will only reach that for people that don’t write threatening software. Currently my Claude Code might as well be my grandmother, it’s soooo bad. @anthropic writes their caches into the var directory on Mac and Linux - horrible practice

SecBriefs | Making Cybersecurity Simple • 1 year ago

🤖 What to Expect in Cybersecurity in 2025: From AI-driven threats to Zero Trust adoption, the landscape is evolving fast. Are you ready? Stay prepared with CYBERSECURITY DICTIONARY For Everyone, on Amazon: 🛒

BowtiedWhitebat + Read Pinned Tweet or NGMI • 1 year ago

bigger the watermark ser

Frieren • 1 year ago

interesting

Freedom_Aint_Free • 1 year ago

My hunch is that even after the frontier models totally saturate those tests (a solid 100% across the board), they will still suck in many important fields. The tests should be research-level problems; if the models can crack decades-old unsolved science problems, then they would be on another level

Luxe Clouds • 1 year ago

50t tokens of premium data plus 50t tokens of deduplicated premium synthetic data will push beyond this, then interactive fine-tuning & optimizations. It's already possible to hit that mark & go further

Jeremy Mcnabb • 1 year ago

We have a *tendency* to *notice* when we have actively been involved. Knowledge, plans slipping out of our control seems *exciting* *Smiling fun* (AI roller coaster wasn’t designed funny… designed to fix the rides… and in doing so… change riders, management, owners, all of it)

Omar وديع • 1 year ago

AI progress is faster than lightning! By achieving the 90% benchmark sooner, it’s igniting thrilling possibilities for fintech’s future. Can’t wait to see its impact!

Tom Nicholson • 1 year ago

Why do American corporates always look like AI avatars?
