🚨How do LLMs acquire human values?🤔 We often point to preference optimization. However, in our new work, we trace how and when model values shift during post-training and uncover surprising dynamics. We ask: How do data, algorithms, and their interaction shape model values?🧵

Mehar Bhatia

1,357 subscribers

41,426 views • 8 months ago • via X (Twitter)

Related Videos

Diffusions are excellent in creating fantastic images and videos 🔎 We cooked a *diffusion* model to synthesize structured data #ICLR2025 🔥 Introducing TabDiff, a mixed-type diffusion model for generating synthetic tabular data, imputing missing values, and beyond! 🧵 1/n

Minkai Xu

50,434 views • 1 year ago

Ever wondered how training dynamics differ between LLMs 🖋️ and Vision 👁️ models? We explore this and close the gap between VMs and LLMs in our #NeurIPS2024 paper "TrAct: Making First-layer Pre-Activations Trainable". Paper📜 Video🎥

Felix Petersen

20,931 views • 1 year ago

Grasps are one of the primary ways in which we interact with and shape our environments. How can we faithfully capture human grasps with details such as hand/object shape and contact points? At #CVPR2026, we present MANUS, a method to accurately reconstruct grasps and contacts. 🧵

Srinath Sridhar

10,779 views • 2 years ago

Living in harmony and respecting each other are good behaviors and values💞👍👍👍🙏 Do you agree with me🤔 How we can call this behavior from this 👇👇 movie 🎥🤔

JP@Nshuti

18,900 views • 3 months ago

Republicans are attacking our democracy. This is how we chose to respond. California will not sit idle as they shred our democracy. We hope other states join us in defending our democratic values and the voices of voters all across this country.

Governor Newsom Press Office

67,194 views • 11 months ago

How robust can model predictive control be if we can solve each trajectory optimization to global optimality? On the contact-rich push-T problem, we show that model-based global optimization is so robust that it never fails, even if the model is not even correct! We achieve global optimality via sparse Moment and SOS relaxations. -- Yes, we managed to solve SDPs online on a robot. Amazing work by Shucheng Kang and Guorui Liu.

Heng Yang

28,718 views • 1 year ago

Coarse2Real (C2R) transfers simple 3D renderings into realistic style video. Check our paper and project page to learn how to hedge small amount of synthetic paired data with real non-pair data for training the C2R model. We will release the model soon!

Yi Zhou

12,406 views • 2 months ago

We built an AI model to simulate how a fruit fly walks, flies and behaves – in partnership with HHMI | Janelia. 🪰 Our computerized insect replicates realistic motion, and can even use its eyes to control its actions. Here’s how we developed it – and what it means for science. 🧵

Google DeepMind

1,057,039 views • 1 year ago

In 2017, during Tesla’s 'production hell' while scaling up Model 3 production, Elon Musk camped on the roof of the Nevada Gigafactory, singing Johnny Cash’s 'Ring of Fire' and roasting marshmallows, showing how he is truly dedicated to Tesla and values Tesla investors.

SMX 🇺🇸

326,767 views • 10 months ago

His Highness محمد بن زايد “I call on everyone to abide by values of peace and stability and cooperation” This is our leader… this is how we operate… these values have been instilled into our very fabric as a nation. Now it’s time for the rest of the world to take the UAE as THE blueprint to a successful nation locally and globally.

Rauda Altenaiji

31,482 views • 10 months ago

When I look at Jimmy Carter, I see a man not only for our times, but for all times. A man who embodied the most fundamental human values we can never let slip away. And while we may never see his likes again, we would all do well to try to be a little more like Jimmy Carter.

President Biden Archived

1,878,457 views • 1 year ago

New work: The Value Axis 🎯 How do LLMs choose which path to take mid-task? We find they internally track the chance of reaching their goal along a linear axis, akin to a value function in RL. We show it modulates confidence in math & coding and can be reshaped with DPO and SFT.

Nick Jiang

28,039 views • 1 month ago

President Trump on Iran: “We killed all their leadership. And then they met to choose new leaders, and we killed all of them. And now we have a new group and we can easily do that. But let’s see how they turn out.” Most based POTUS in history.

Paul A. Szypula 🇺🇸

79,791 views • 4 months ago

We trained our Large Sensor Model (LSM) on over 40 million hours of de-identified multimodal sensor data from 165K users to demonstrate how it could improve performance in wearable tasks like exercise and activity recognition. Here’s what we found →

Google AI

63,629 views • 1 year ago

Disinformation isn't new; it's part of human history. Our tendency to believe what we want to believe is now amplified by technology. How do we combat this growing threat to democracy and society? Learn more: #wef25

UN Development

14,026 views • 1 year ago

"How do we define "good and evil", or rather, what are their measures? Do their definitions change over time? What is the root cause of these phenomena and how did they form in our world?" - Zafar Mirzo

Zafar Mirzo | Quotes

2,043,927 views • 1 year ago

XTRA Roadmap 2023 🚀 Our vision is to empower people to create XtraOrdinary Digital Identities across games and metaverses How will we do it? A Thread🧵...

XTRA

31,384 views • 3 years ago

Most of you know the work we do in GBV, we also do work in women empowerment. The two work in tandem. You know if you invite me to a conference, I stay behind and collect unused pens and notebooks for further use. I mean reduce, reuse and recycle. Lol. But most importantly I pick what we can use for our EMPAWA Mama program on Saturdays. I ask women who can share their skills to come to Soweto and speak to us. This Saturday we celebrated IWD and we did what we do best. Our Chama women met and were trained by one of the best Chama trainers, CHAMA Champions, who visited and gave great insights on how we can continue to grow. VW Babes came to Soweto to participate and support our work and heh we have never seen these many VWs. The ladies stayed and also built our capacity. We had a partnership with chiraz International at Jacaranda Hotel. As usual we went with our Soweto mamas and there were topics like burnout, toxic relationships led by our own psychologist Njoki Maina. We also learnt first aid and self defense. And Separ will be coming to Soweto to train our ladies in first aid as well as self defense. That is how we Accelerate Action #Usikimye

Njeri Wa Migwi™

16,218 views • 1 year ago

When you train it allows you to realise how much you don’t know. You need to be open to new ideas and willing to learn, which by default means you’ll do things wrong, so this then requires discipline and motivation to overcome criticism or any self-doubt. All these values are characteristics that will help you in your life outside training. And enjoy it 👊

Frederick

36,627 views • 10 months ago

There’s one thing which has stood out in these 500 days of aggression: the resilience of the brave Ukrainians protecting their homeland & fighting for our shared values. We stood with you from day one and we will continue to do so. For peace with justice, dignity & freedom 🇪🇺🇺🇦

Roberta Metsola

228,308 views • 3 years ago