Data curation is crucial for post-training recipes. But how do we curate? Curation is usually manual & tedious. And, it's hard to tell if a strategy in the data will be reliable! We introduce an automatic way to curate, informed by the robot's policy learning.

19,910 views • 1 year ago • via X (Twitter)

3 Comments

Chelsea Finn • 1 year ago

Key idea: When training on all data, policy success is indicative of whether the strategy it took is good! Paper: Led by @_anniechen_ and @AlecLessing, with @liu_yuejiang @StanfordAILab

Ada Wan • 1 year ago

No data "cleaning"!!! That's the point. Make it work WITHOUT hacking through data / "data curation". If an algorithm works with any data, without any "data cleaning/curation", that indicates there is some good generalization power in the algo. If the algo only works (well) 1/
