In flow matching, a coupling determines how noise and data samples are paired during training. The choice of coupling is important because it influences the geometry of trajectories at inference time. The simplest choice is the independent coupling, where noise and data points are paired arbitrarily. This can lead...

Alec Helbling

10,795 subscribers

65,060 views • 1 month ago • via X (Twitter)

Education, Science & Technology
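
To make the idea of a coupling from the description above concrete, here is a minimal sketch in plain Python. It is not taken from the video. It assumes 1-D samples, standard normal noise, and a hypothetical two-mode data distribution. The independent coupling pairs samples in random order. For contrast, the sketch also pairs them in sorted order, which in one dimension minimizes the total squared distance between partners. The straight-line path `x_t = (1 - t) * x0 + t * x1` with target velocity `x1 - x0` is the common linear flow matching path, used here as an assumption. All function names are illustrative.

```python
import random

def independent_coupling(noise, data, rng):
    """Pair each noise sample with a data sample taken in random order."""
    shuffled = list(data)
    rng.shuffle(shuffled)
    return list(zip(noise, shuffled))

def sorted_coupling(noise, data):
    """Pair the i-th smallest noise sample with the i-th smallest data sample.

    In 1-D this pairing minimizes the total squared distance between partners.
    """
    return list(zip(sorted(noise), sorted(data)))

def transport_cost(pairs):
    """Mean squared distance between paired samples."""
    return sum((x1 - x0) ** 2 for x0, x1 in pairs) / len(pairs)

def training_example(x0, x1, t):
    """Point on the straight path from x0 to x1, plus the velocity target."""
    x_t = (1.0 - t) * x0 + t * x1
    velocity = x1 - x0
    return x_t, velocity

rng = random.Random(0)
noise = [rng.gauss(0.0, 1.0) for _ in range(256)]
# Hypothetical data distribution: two narrow modes at -2 and +2.
data = [rng.gauss(rng.choice([-2.0, 2.0]), 0.2) for _ in range(256)]

independent = independent_coupling(noise, data, rng)
ordered = sorted_coupling(noise, data)

print("independent cost:", transport_cost(independent))
print("sorted cost:     ", transport_cost(ordered))
```

In this toy setting the sorted pairing never costs more than the random one, and its straight paths do not cross. That is one way to see how the choice of coupling can shape trajectories.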

Similar Videos

Flow matching produces smooth, deterministic trajectories. In contrast, the sampling process of a diffusion model is chaotic, resembling the random motion of gas particles.

Alec Helbling

428,834 views • 1 year ago

Flow-based generative models trained with flow matching tend to learn curved trajectories, which are challenging to approximate in a few steps. Rectified flows aim to learn straight trajectories, which are easier to simulate with less computation.

Alec Helbling

34,778 views • 5 months ago

System identification (sysid) is the process of finding the physical parameters that make a simulation match reality. If you're training an RL locomotion policy in simulation, the accuracy of your motor model directly affects how well the policy transfers to the real robot. A recent git commit by Kevin Zakka added a sysid toolbox to MuJoCo which automates this process: you provide recorded motor data and a MuJoCo model, and it optimizes the model parameters to minimize the difference between simulated and real trajectories. For my RobStride Dynamics RS02 QDD motors (17 Nm peak, 7.75:1 gear), I built a Rust tool that sends multi-sine torque excitation at 1 kHz and records position/velocity feedback. I then feed this data into MuJoCo's sysid optimizer.

David Bar

48,135 views • 2 months ago

Holy sh!t ! OpenAI will have their custom inference chips ready in just a few months and deployed at scale by the end of the year! 🤯 Training chip = The heavy lifters that require massive amounts of data and power to build and teach the AI models from scratch. Inference chip = The specialized, highly efficient chips that actually run the AI and generate the answers in real-time when you use it. This is going to help OpenAI drastically cut down their massive compute costs, speed up model reasoning times, and finally break free from relying entirely on Nvidia to scale their operations.

Chris

60,278 views • 3 months ago

🦊 “yeonjun, between the title track and the coupling song, which one suits you better?” the one that suits me the most is the coupling song but the one that i have a lot of affection towards is the title track

💬

21,966 views • 10 months ago

This animation shows how a uniform grid of points is deformed by a flow matching model. The visualization highlights how the learned flow warps the underlying space—stretching, compressing, and bending it—to transform one probability distribution into another.

Alec Helbling

42,948 views • 1 year ago

We’re proud to share that Tamarind Bio has been selected to build, host, and operate the inference infrastructure layer for TuneLab2.0, the next evolution of the platform. Eli Lilly and Company TuneLab is a first-of-its-kind, collaborative AI/ML drug discovery platform, bringing models trained on over $1B worth of Lilly proprietary data to the biotech ecosystem. Tamarind will power TuneLab’s scalable drug discovery workflows and model inference.

Deniz Kavi

41,959 views • 10 days ago

Amazon’s machine learning model collects 300 million data points per season and can now predict which players are likely to blitz before the snap. This is a look into the future of broadcasting.

Joe Pompliano

688,455 views • 2 years ago

The tokenization market is expected to reach $30T within the next ten years. And we are at the perfect spot to take the maximum advantage of it! DevvExchange assures full regulatory compliance - so we are the optimal choice to onboard all governments and institutions to web3.

DevvExchange

14,494 views • 1 year ago

What if you kept asking an LLM to "make it better"? In some recent work at FAIR, we investigate how we can efficiently use RL to fine-tune LLMs to iteratively self-improve on their previous solutions at inference-time. Training for iterated self-improvement can be costly. The naive approach to training for K self-improvement steps leads to K times the number of rollout steps per episode. We introduce Exploratory Iteration (ExIt), an RL-based automatic curriculum method that bootstraps diverse training distributions of self-improvement tasks by upcycling the LLM's own responses at previous turns as the starting points for both self-improvement and *self-divergence.* In order to decide what task to train on next, the curriculum prioritizes sampling of partial turn histories that led to higher return variance in its GRPO group (a learnability score that comes for free). This automatic curriculum over the bootstrapped task space teaches the model how to perform iterated self-improvement while only ever training the model on single-step self-improvement tasks. We look at ExIt's impact in both single-turn (contest math problems) and multi-turn (BFCLv3 multi-turn tasks), as well as MLE-bench, where the LLM is run in a search scaffold to produce solutions to real Kaggle competitions. Across these eval settings, we find ExIt produces models with greater capacity for inference-time self-improvement compared to GRPO. Notably, ExIt models can self-improve on test tasks for many more steps than the typical solution depth encountered during training, including a 22% improvement in MLE-bench performance compared to GRPO.

Minqi Jiang

41,066 views • 9 months ago

🚀New paper out - We present Video-MSG (Multimodal Sketch Guidance), a novel planning-based training-free guidance method for T2V models, improving control of spatial layout and object trajectories. 🔧 Key idea: • Generate a Video Sketch — a spatio-temporal plan with background, foreground, and motion in the pixel space. • Encode this structure directly into the latent space of the diffusion model during generation, which does not require fine-tuning or additional memory during inference. 🧵

Jialu Li

35,060 views • 1 year ago

OpenAI's Deep Research is getting a run for its money. Deep Lake was just released, and it's a different take on an AI system that can do deep research on your own data. You can use Deep Lake to build AI search with reasoning on your private and public data. (Look at the attached videos to get an idea of how it works.) If you want to research proprietary and sensitive data, Deep Research won't help you because it's limited to public data. Deep Lake, however, will allow you to use your private data. On top of that, Deep Lake supports multi-modal retrieval from the ground up. It uses vision language models for data ingestion and retrieval so that you can connect any data (PDFs, images, videos, structured data, etc.) You can even use mixed-data queries! Deep Lake can search your data from S3, Dropbox, and GCP. It learns from your queries over time, making the results as relevant to your work as possible!

Santiago

171,340 views • 1 year ago

BIG NEWS. The Blockworks website has evolved: yesterday we were the home of news, today we are the home of onchain data. Head over to the site to see for yourself, but here's a little snippet of what you can expect: 1. Sector leaderboards (chains, DEXes, borrow lend, DATs, etc...) 2. Comprehensive data dashboards protocols 3. The ability to compare pricing and onchain data easily (coming soon) We're doing this because the industry still has a gigantic data problem. As investors get more sophisticated and fundamentals driven, basic high level facts are no longer sufficient. Investors need to be able to trust the data they are seeing and go much deeper than the surface level info that's available today. Additionally, because many data providers allow companies to essentially self report, you can't trust what you are seeing. This site is our contribution to fixing that problem and to ensuring clear, accurate data for investors. Blockworks is fully dedicated to becoming the most comprehensive data company in crypto in 2026. This is the first of many, many announcements like this this year, stay tuned.

Mippo 🟪

91,704 views • 5 months ago

You believe: Earth’s rotation ≈ 1,000 mph Earth orbit of the Sun ≈ 67,000 mph. Both move ≈ 500,000 mph through the Galaxy. The Galaxy is moving through space at ≈ 1.3 million mph The universe is expanding at ≈ 152,112 miles per hour per megaparsec. Most of these motions are curved trajectories which by definition cause acceleration! Then we have this!

Ted Logan

42,232 views • 11 months ago

This is the level of noise in Bibu! Eileen on the gate has a unique charm and screams like a wookie! Our vets are checking Klaas as he had a fever this morning. It's very important to act on time because they can get infections quickly due to their open belly button! #アザラシ幼稚園

Panka

46,860 views • 1 year ago

Fancy winning a pair of #EFL tickets for a home game of your choice? 😍 Reply with the answer to the following question and use #Super6Christmas for a chance to win 🎟️ How many points are Leicester City on in the league this season? 🦊 Winner announced at 4pm, T&Cs apply ⌚️

Super 6

53,490 views • 2 years ago

Excited to share our NeurIPS 2024 Oral, Convolutional Differentiable Logic Gate Networks, leading to a range of inference efficiency records, including inference in only 4 nanoseconds 🏎️. We reduce model sizes by factors of 29x-61x over the SOTA. Paper:

Felix Petersen

157,435 views • 1 year ago

At Dreamforce, the exploration of the possibilities of AI is incredible. At the same time, it's important that we understand that policies never move at the same speed as technology. So having a grasp over what is happening and how the negative aspects of AI don't proliferate in a big way, to minimize collateral damage is most important. I hope the companies and governments across the world wake up to this and ensure the best of AI possibilities will reach the people, as this is a phenomenal enabler for humanity like never before. -Sg Marc Benioff Dreamforce Salesforce #DF24

Sadhguru

68,281 views • 1 year ago

Depth Any Video with Scalable Synthetic Data AI physicists and chemists continue to make strides in depth estimation from video. Check out this new paper featuring some impressive examples. See the thread for more details (unfortunately no code yet). Abstract: Video depth estimation has long been hindered by the scarcity of consistent and scalable ground truth data, leading to inconsistent and unreliable results. In this paper, we introduce Depth Any Video, a model that tackles the challenge through two key innovations. First, we develop a scalable synthetic data pipeline, capturing real-time video depth data from diverse game environments, yielding 40,000 video clips of 5-second duration, each with precise depth annotations. Second, we leverage the powerful priors of generative video diffusion models to handle real-world videos effectively, integrating advanced techniques such as rotary position encoding and flow matching to further enhance flexibility and efficiency. Unlike previous models, which are limited to fixed-length video sequences, our approach introduces a novel mixed-duration training strategy that handles videos of varying lengths and performs robustly across different frame rates, even on single frames. At inference, we propose a depth interpolation method that enables our model to infer high-resolution video depth across sequences of up to 150 frames. Our model outperforms all previous generative depth models in terms of spatial accuracy and temporal consistency.

MrNeRF

27,428 views • 1 year ago