Loading video...
Video Failed to Load
Non-robustness hints at paradigm failures. Reasoning can improve robustness. Alexander Wei explores reasoning-based defenses that let models ‘think’ before responding, helping counter adversarial attacks and strengthen AI safety."
508,419 views • 1 year ago •via X (Twitter)
2 Comments

FAR.AI1 year ago
Follow us for AI safety insights And watch the full video

Hooman Malekmohammadi1 year ago
@alexwei_ آیا هوش مصنوعی میتواند به مرور زمان تبدیل به یک دشمن شود ؟


