Furong Huang's banner

Furong Huang

@furongh • 11,335 subscribers

Assoc. prof. of CS at University of Maryland. Researcher in #PhysicalAI, #TrustworthyML, #EthicalAI, AI #Alignment, #AI for ALL.

Shorts

I’m so lucky to have such amazing students! 🤩 🦾🧑‍🎓

I’m so lucky to have such amazing students! 🤩 🦾🧑‍🎓

66,601 Aufrufe

🚨 AI companies are betting big on Web AI Agents—but they're far more vulnerable than standalone LLMs. These agents see 👀, think 🧠, and act ⚡ for you: 🕵️‍♂️ Gather personal data from emails, messages, & calendars 🧠 Plan your day, anticipate priorities ⚡ Automate tasks, execute actions, & interact with apps But what happens when they go rogue? 🔓 Our research reveals just how easy—and more importantly, WHY—Web AI Agents are dangerously vulnerable, even when built with safety-aligned LLMs. A 🧵👇

🚨 AI companies are betting big on Web AI Agents—but they're far more vulnerable than standalone LLMs. These agents see 👀, think 🧠, and act ⚡ for you: 🕵️‍♂️ Gather personal data from emails, messages, & calendars 🧠 Plan your day, anticipate priorities ⚡ Automate tasks, execute actions, & interact with apps But what happens when they go rogue? 🔓 Our research reveals just how easy—and more importantly, WHY—Web AI Agents are dangerously vulnerable, even when built with safety-aligned LLMs. A 🧵👇

14,479 Aufrufe

Videos

Anya Rossi

sweetdream.ai

SweetDream.ai•Sponsored•Livecam

Watch Anya Live

Anya is streaming live right now! Join her private show and enjoy exclusive content.

Exclusive private shows

1.2k viewers online

Private Show

Join now for exclusive access

Free preview available • Premium content

Hot take: robots should not dream in pixels. Pixels are too low-level. Latents are too opaque. μ₀ predicts a third thing: 3D motion traces. On real robots, it beats π₀.₅ — with ~1/100 the data scale and no action labels for world-model pretraining. 🧵 (this video features voiceover narration)

Hot take: robots should not dream in pixels. Pixels are too low-level. Latents are too opaque. μ₀ predicts a third thing: 3D motion traces. On real robots, it beats π₀.₅ — with ~1/100 the data scale and no action labels for world-model pretraining. 🧵 (this video features voiceover narration)

112,673 Aufrufe • vor 1 Monat

🧠💡 What if your 7B model could beat GPT-4o and Qwen2.5-72B—using just 11k training samples? No distillation. No warm-start. Just smart data and reinforcement learning. Inspired by Moravec’s Paradox, we let the model decide what's actually hard. 🚨 New paper: "SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement" We show how ThinkLite-VL-7B achieves SoTA on MathVista—75.1%, surpassing much larger models. 👇 Here’s how we did it: 🔗 🧠 Code: #AI #VisionLanguageModels #ReinforcementLearning #MachineLearning #LessIsMore

🧠💡 What if your 7B model could beat GPT-4o and Qwen2.5-72B—using just 11k training samples? No distillation. No warm-start. Just smart data and reinforcement learning. Inspired by Moravec’s Paradox, we let the model decide what's actually hard. 🚨 New paper: "SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement" We show how ThinkLite-VL-7B achieves SoTA on MathVista—75.1%, surpassing much larger models. 👇 Here’s how we did it: 🔗 🧠 Code: #AI #VisionLanguageModels #ReinforcementLearning #MachineLearning #LessIsMore

63,321 Aufrufe • vor 1 Jahr

🌟 Can you imagine aligning your AI model 🤖 on the fly, without updating its core parameters so much that it becomes unsuitable for others with different preferences? 🚀 Introducing "Transfer Q Star: Principled Decoding for #LLM #Alignment" 🔗: A 🧵👇

🌟 Can you imagine aligning your AI model 🤖 on the fly, without updating its core parameters so much that it becomes unsuitable for others with different preferences? 🚀 Introducing "Transfer Q Star: Principled Decoding for #LLM #Alignment" 🔗: A 🧵👇

49,161 Aufrufe • vor 2 Jahren

Keine weiteren Inhalte verfügbar