
Chris
@Chrisgpt • 44,777 subscribers
Agi 2029 - AI Insider / Reporter as featured in The Information • NYT • Techcrunch
Shorts
Videos

Sam Altman on GPT 6 • The GDPv eval will shift some of the way we do post training • overall we are keeping the same strategy going into GPT 6 Sam Altman on AGI: Sam Altman about AGI: "we are finally in the moment": "As the closer we get to it, the fuzzier the Agi concept becomes. But the one thing that I care the most about and, to my great surprise, we're finally in the moment; where it's starting to happen is, when it can do novel discovery, when It can expand the total human knowledge base.“ I have seen some examples from credible professors, that novel math problems have been solved. Such as GPT 5 Pro reportedly solving Yu Tsumura’s 554th problem, an IMO level group theory challenge that an August preprint said no LLM had solved, so the new claim is being actively contested. Sébastien Bubeck also said GPT 5 Pro produced a new proof that tightened a bound in smooth convex optimization.
Chris221,708 görüntüleme • 8 ay önce

🚨 ANTHROPIC JUST REVEALED CLAUDE MYTHOS ABILITIES Anthropic just formally announced "Claude Mythos Preview" and launched "Project Glasswing" to deploy it for cybersecurity defense. The models are unlocking completely new, autonomous behaviors. This isn't about slightly better benchmark scores. This is about what the model can do. Here are the direct quotes from Anthropic’s research team (including Dario) on exactly what Mythos is capable of: • Chaining Exploits: "It has the ability to chain together vulnerabilities... this model is able to create exploits out of three, four, sometimes five vulnerabilities that in sequence give you some kind of very sophisticated end outcome." • The Professional Standard: "The model that we're experimenting with is, by and large, as good as a professional human at identifying bugs." • Unprecedented Autonomy: "It's just generally better at pursuing really long-range tasks that are kind of like the tasks that a human security researcher would do throughout the course of an entire day." The Reality Check: Dario Amodei flat out said: "There's a kind of accelerating exponential... Claude Mythos Preview is a particularly big jump along that point." Because this model has become so capable at identifying zero-days, they are restricting its release to top tech partners to try to patch the world's software before these capabilities leak out. The autonomous researcher era has officially arrived. It’s over 💀
Chris46,010 görüntüleme • 1 ay önce

I can confirm 5.1 is rolled to some users out either the model itself or the system prompt Cc: @figuret20
Chris89,595 görüntüleme • 6 ay önce
Daha fazla içerik yok.