Video wird geladen...
Video konnte nicht geladen werden
Non-robustness hints at paradigm failures. Reasoning can improve robustness. Alexander Wei explores reasoning-based defenses that let models ‘think’ before responding, helping counter adversarial attacks and strengthen AI safety."
508,419 Aufrufe • vor 1 Jahr •via X (Twitter)
2 Kommentare

FAR.AIvor 1 Jahr
Follow us for AI safety insights And watch the full video

Hooman Malekmohammadivor 1 Jahr
@alexwei_ آیا هوش مصنوعی میتواند به مرور زمان تبدیل به یک دشمن شود ؟


