Video yükleniyor...

Video Yüklenemedi

Bu video yüklenirken bir sorun oluştu. Bu geçici bir ağ sorunundan kaynaklanıyor olabilir veya video kullanılamıyor olabilir.

Ana Sayfaya Dön

Is it possible to build end-to-end autonomous discovery systems using Large Generative Models (LGMs)? 🧬 In this position paper, we argue: 🧵 (1/n) Ai2 Aristo Team at Ai2 Harshit Surana UMass Amherst University of Utah

Bodhisattwa Majumder

2,618 subscribers

30,781 görüntüleme • 2 yıl önce •via X (Twitter)

Eğitim Bilim & Teknoloji Haberler & Politika

Anya Rossi• Live Now

Private livecam show

11 Yorum

Bodhisattwa Majumder profil fotoğrafı

Bodhisattwa Majumder2 yıl önce

(2/n) 📊 We present a practical first step toward the goal of end-to-end automation of the scientific process focusing on observational or experimental data for two reasons: (1) an abundance of large-scale datasets that would benefit highly from automated discovery; 📈 (2) the practicality of automated verification enabled by data without the need for additional data collection. ⚗️

Bodhisattwa Majumder profil fotoğrafı

Bodhisattwa Majumder2 yıl önce

(3/n) A blueprint flow for data-driven discovery includes the following scenarios: 1. The user asks an explicit question around a particular line of inquiry or hypothesis. 🎯 2. The user can also ask a broad and partially defined high-level question, where the system must figure out the appropriate datasets, data transformations, variables, a list of possible hypotheses, and their verification. 📒 3. The user can provide follow-up feedback at any time, and the "continual learner" will continually evolve while providing updated experiments and results. 🤖

Bodhisattwa Majumder profil fotoğrafı

Bodhisattwa Majumder2 yıl önce

(4/n) We posit that: 1. LGMs present an incredible potential, such as knowledge-driven hypothesis search or tool usage to verify hypotheses—creating new avenues for ongoing efforts in the ML community on code generation, planning, and program synthesis. 🛠️ 2. LGMs are not all we need. Interfacing with fail-proof tools and inference-time functions, catering to domains and long-tail with user moderation, is required to have an accurate, reliable, and robust data-driven discovery. 👩‍👩‍👧

Bodhisattwa Majumder profil fotoğrafı

Bodhisattwa Majumder2 yıl önce

(5/n) We outline a set of desired properties for a data-driven discovery system. 🟩 1. Comprehensive Data Understanding 2. Hypothesis Generation 3. Planning and Orchestrating Research Pathways 4. Hypothesis Evaluation 5. Measurement of Progress 6. Knowledge Integration 7. Research Ethics and Fairness -- indicate high-level desiderata with several sub-properties delineated in the paper. Our survey across several existing automated and semi-automated data analysis and discovery systems reveals that these only partially cover the desired functionalities. 🔻

Bodhisattwa Majumder profil fotoğrafı

Bodhisattwa Majumder2 yıl önce

(6/n) As a proof of concept, we build DataVoyager—a system powered by GPT-4 that can semantically understand a dataset, programmatically explore verifiable hypotheses using the available data, run basic statistical tests (e.g., correlation and regression analyses) by invoking pre-defined functions or generating code snippets, and finally analyze the output with detailed analyses.

Bodhisattwa Majumder profil fotoğrafı

Bodhisattwa Majumder2 yıl önce

(7/n) Planning DataVoyager presents a strong base case for planning with decomposition, data transformation, and symbolic reasoning. However, LGM-based planners prefer direct, goal-oriented variables, which can lead to a lack of diversity in search, impacting the novelty of the outcome.

Bodhisattwa Majumder profil fotoğrafı

Bodhisattwa Majumder2 yıl önce

(8/n) Experimentation & Verification DataVoyager can use tools and insight-specific code generation to reasonably verify hypotheses. But LGMs are memoryless. They cannot automatically recover from past errors in execution and verification. We argue that how LGMs adapt to novel tools and code at inference time is still an open question.

Bodhisattwa Majumder profil fotoğrafı

Bodhisattwa Majumder2 yıl önce

(9/n) Knowledge Integration DataVoyager can partially achieve interdisciplinary knowledge integration. E.g., it could connect the role of economic pressure on health outcomes with cultural anthropology, psychological factors, public health intervention, and urban planning. Additionally, knowledge frontiers represent cutting-edge scientific exploration. DataVoyager shows promise in generating novel analysis in an experimental scientific frontier.

Bodhisattwa Majumder profil fotoğrafı

Bodhisattwa Majumder2 yıl önce

(10/n) 🚨 We also point out possible limitations of such automated systems, such as: 1. Hallucinations in LGMs undermining scientific rigor 2. Cost at scale in high-throughput fields 3. Data dredging resulting in sub-optimal policies 4. Autonomous discovery leading to legal implications 5. Potential percolation of bias originating from dual sources--the underlying dataset and the LGMs.

Bodhisattwa Majumder profil fotoğrafı

Bodhisattwa Majumder2 yıl önce

(11/n) 🌌 We hope our timely position can increase interest and efforts in developing, debating, and enhancing the vision for an accurate, reliable, and robust system for data-driven discovery. These systems can transform domains overwhelmed with vast amounts of data, including but not limited to observational social sciences, medicine, astronomy, biology, climate science, and consumer science. It can initiate a Cambrian explosion of discovery while promoting speed, reproducibility, and collaboration.

Bodhisattwa Majumder profil fotoğrafı

Bodhisattwa Majumder2 yıl önce

(n/n) This was a collaborative effort by @mbodhisattwa @surana_h, @dhruvagarwal17, @hsanchaita, @Ashish_S_AI, and Peter Clark from @allen_ai @ai2_aristo @UMassAmherst @UMass_NLP @UUtah

Benzer Videolar

Can LLMs help accelerate the discovery of data-driven scientific hypotheses? 🧬📊 We benchmark this in DiscoveryBench: 264 discovery tasks from 6 scientific domains, from humanities to biology: Ai2 Aristo Team at Ai2 Harshit Surana UMass Amherst

Can LLMs help accelerate the discovery of data-driven scientific hypotheses? 🧬📊 We benchmark this in DiscoveryBench: 264 discovery tasks from 6 scientific domains, from humanities to biology: Ai2 Aristo Team at Ai2 Harshit Surana UMass Amherst

Bodhisattwa Majumder

23,908 görüntüleme • 2 yıl önce

We are excited to announce CLIN 🤖: The first continually learning language agent that excels in both task adaptation and generalization to unseen tasks and environments in a pure zero-shot setup. Aristo Team at Ai2 Ai2 Website: Let's dive in 🧵 (1/n)

We are excited to announce CLIN 🤖: The first continually learning language agent that excels in both task adaptation and generalization to unseen tasks and environments in a pure zero-shot setup. Aristo Team at Ai2 Ai2 Website: Let's dive in 🧵 (1/n)

Bodhisattwa Majumder

35,052 görüntüleme • 2 yıl önce

Excited to share: Figure 01 completing real-world tasks This is end-to-end autonomous We have made advances in our autonomous navigation, learned perception models, manipulation robust to pose variation, & generalizable systems for future applications

Excited to share: Figure 01 completing real-world tasks This is end-to-end autonomous We have made advances in our autonomous navigation, learned perception models, manipulation robust to pose variation, & generalizable systems for future applications

Brett Adcock

232,949 görüntüleme • 2 yıl önce

Super excited to share the last paper of my PhD: "Hallucination in World Models is Predictable and Preventable"✨ We train a 350M-param generative world model on a large dataset w/ 210 tasks and show that we can predict *when* hallucination happens and use that to fix it! 🧵1/n

Super excited to share the last paper of my PhD: "Hallucination in World Models is Predictable and Preventable"✨ We train a 350M-param generative world model on a large dataset w/ 210 tasks and show that we can predict when hallucination happens and use that to fix it! 🧵1/n

Nicklas Hansen

54,573 görüntüleme • 1 ay önce

1/ Introducing ᴏᴘᴇɴꜱᴄʜᴏʟᴀʀ: a retrieval-augmented LM to help scientists synthesize knowledge 📚 UW NLP Ai2 With open models & 45M-paper datastores, it outperforms proprietary systems & match human experts. Try out our demo! We also introduce ꜱᴄʜᴏʟᴀʀQᴀʙᴇɴᴄʜ, a new large-scale multi-domain benchmark for scientific research synthesis, covering CS, Bio and Physics.

1/ Introducing ᴏᴘᴇɴꜱᴄʜᴏʟᴀʀ: a retrieval-augmented LM to help scientists synthesize knowledge 📚 UW NLP Ai2 With open models & 45M-paper datastores, it outperforms proprietary systems & match human experts. Try out our demo! We also introduce ꜱᴄʜᴏʟᴀʀQᴀʙᴇɴᴄʜ, a new large-scale multi-domain benchmark for scientific research synthesis, covering CS, Bio and Physics.

Akari Asai

249,287 görüntüleme • 1 yıl önce

"Everybody who is building these chatbots and Generative AI, when you are ready to run it, you need an AI factory and nobody is better at building end-end systems of very large scale for the enterprise than Dell Technologies . Any company and every company needs to build AI factory.."

"Everybody who is building these chatbots and Generative AI, when you are ready to run it, you need an AI factory and nobody is better at building end-end systems of very large scale for the enterprise than Dell Technologies . Any company and every company needs to build AI factory.."

Itzik Reich

12,941 görüntüleme • 2 yıl önce

In 30 min, we deployed MolmoAct2 from Ai2 at @GRASPLab and it worksout-of-the-box. One cool t hing it shows: self-recovery. The model reasons the cups get knocked off, and lifts it up, even when not asked to. Here is a thread of what stood out today 🧵

In 30 min, we deployed MolmoAct2 from Ai2 at @GRASPLab and it worksout-of-the-box. One cool t hing it shows: self-recovery. The model reasons the cups get knocked off, and lifts it up, even when not asked to. Here is a thread of what stood out today 🧵

Jie Wang

13,491 görüntüleme • 2 ay önce

Just published in nature, our new paper with Graeme Day's group, "Porous isoreticular non-metal organic frameworks" (N-MOFs). Congratulations to Megan OShaughnessy who led this work during her PhD at University of Liverpool 🎉 🧵1/10

Just published in nature, our new paper with Graeme Day's group, "Porous isoreticular non-metal organic frameworks" (N-MOFs). Congratulations to Megan OShaughnessy who led this work during her PhD at University of Liverpool 🎉 🧵1/10

Andy Cooper

56,641 görüntüleme • 2 yıl önce

Su Qing, Vice President and Chief Architect at Horizon — and former head of intelligent driving products at Huawei — said that Tesla’s FSD is generationally ahead of its Chinese competitors in certain respects. He noted that domestic systems, including Horizon’s, are in fact hybrid architectures combining end-to-end models with rule-based components, even though they are marketed as fully end-to-end solutions. According to Su, the key constraints facing Chinese autonomous driving systems — relative to FSD — include limited large-scale training compute, frequent and unpredictable infrastructure changes, and highly irregular driving behaviors. These uniquely complex road conditions have led Chinese developers to adopt a more programmatic, hybrid approach to autonomy rather than a purely end-to-end system. He said the FSD team’s level of expertise and spirit of innovation both deserve real admiration.

Su Qing, Vice President and Chief Architect at Horizon — and former head of intelligent driving products at Huawei — said that Tesla’s FSD is generationally ahead of its Chinese competitors in certain respects. He noted that domestic systems, including Horizon’s, are in fact hybrid architectures combining end-to-end models with rule-based components, even though they are marketed as fully end-to-end solutions. According to Su, the key constraints facing Chinese autonomous driving systems — relative to FSD — include limited large-scale training compute, frequent and unpredictable infrastructure changes, and highly irregular driving behaviors. These uniquely complex road conditions have led Chinese developers to adopt a more programmatic, hybrid approach to autonomy rather than a purely end-to-end system. He said the FSD team’s level of expertise and spirit of innovation both deserve real admiration.

Ray

83,127 görüntüleme • 5 ay önce

Tune Studio is an end-to-end platform for developing applications using Large Language Models. So far, I haven't seen any other platform like this one. You can do everything here: 1. You can curate your data. 2. Use the playground to play with different models and try your ideas. 3. Fine-tune an open-source model on your data. 4. Deploy the model when you are done. This is awesome for anyone building generative AI applications. You can use Tune Studio to work with any of the open-source models out there. They were one of the few companies to host Llama 2 and Llama 3 before anyone else. Here is a link to check it out: One of their main selling points is that Tune Studio scales! You don't have to worry about serving your model to lots of users. They also have built-in user management, authentication, on-prem support, user context management, and pretty much everything you need to build generative AI applications. Thanks to the Tune team for collaborating with me on this post. We are living through the best years of development tools for AI developers. The field is unstoppable.

Tune Studio is an end-to-end platform for developing applications using Large Language Models. So far, I haven't seen any other platform like this one. You can do everything here: 1. You can curate your data. 2. Use the playground to play with different models and try your ideas. 3. Fine-tune an open-source model on your data. 4. Deploy the model when you are done. This is awesome for anyone building generative AI applications. You can use Tune Studio to work with any of the open-source models out there. They were one of the few companies to host Llama 2 and Llama 3 before anyone else. Here is a link to check it out: One of their main selling points is that Tune Studio scales! You don't have to worry about serving your model to lots of users. They also have built-in user management, authentication, on-prem support, user context management, and pretty much everything you need to build generative AI applications. Thanks to the Tune team for collaborating with me on this post. We are living through the best years of development tools for AI developers. The field is unstoppable.

Santiago

39,101 görüntüleme • 2 yıl önce

This 1-hour tutorial on building a Football AI is the foundation traders use to build their own sports prediction systems one of them made $504K in 1 month using this exact approach Python + YOLO + OpenCV Detection, tracking, team classification from scratch Same stack powers the prediction systems in the article below Bookmark this & give it 1 hour today

This 1-hour tutorial on building a Football AI is the foundation traders use to build their own sports prediction systems one of them made $504K in 1 month using this exact approach Python + YOLO + OpenCV Detection, tracking, team classification from scratch Same stack powers the prediction systems in the article below Bookmark this & give it 1 hour today

Paone

200,600 görüntüleme • 3 ay önce

Tune in to ITV and ITVX at 9pm tonight for the first episode of ‘Prince William: We Can End Homelessness’. The two-part documentary follows the first year of Homewards, our ambitious five-year programme aiming to show that it is possible to end homelessness.

Tune in to ITV and ITVX at 9pm tonight for the first episode of ‘Prince William: We Can End Homelessness’. The two-part documentary follows the first year of Homewards, our ambitious five-year programme aiming to show that it is possible to end homelessness.

The Prince and Princess of Wales

273,343 görüntüleme • 1 yıl önce

Large-scale 3D Scene Generation (all scenes are real-time rendered)!! Physically-grounded generative data without hallucinations is the missing link for robot learning and testing at scale. We introduce a method that directly generates large-scale 3D driving scenes with accurate geometry, allowing for causal view synthesis and generation with object permanence and explicit 3D geometry. This also allows for extreme trajectory extrapolation without failure! We also show that we can build fully data-driven simulators for end-to-end learning with this approach. Project: with the amazing team of Julian Ost, Amogh Joshi , Andrea Ramazzina, Maximilian Bömer, Mario Bijelic.

Large-scale 3D Scene Generation (all scenes are real-time rendered)!! Physically-grounded generative data without hallucinations is the missing link for robot learning and testing at scale. We introduce a method that directly generates large-scale 3D driving scenes with accurate geometry, allowing for causal view synthesis and generation with object permanence and explicit 3D geometry. This also allows for extreme trajectory extrapolation without failure! We also show that we can build fully data-driven simulators for end-to-end learning with this approach. Project: with the amazing team of Julian Ost, Amogh Joshi , Andrea Ramazzina, Maximilian Bömer, Mario Bijelic.

Felix Heide

27,779 görüntüleme • 10 ay önce

"When I talk about [winning four rings], I am at a loss for words, because how did we end up in that position?...But we know what it takes to get it done." Draymond Green tells Malika Andrews ring No. 5 is not out of reach for this Warriors team 🏆

"When I talk about [winning four rings], I am at a loss for words, because how did we end up in that position?...But we know what it takes to get it done." Draymond Green tells Malika Andrews ring No. 5 is not out of reach for this Warriors team 🏆

NBA on ESPN

205,952 görüntüleme • 9 ay önce

Wow. AI agents are here I've been using Manus AI the last week and it actually is insane While at dinner I prompted it to build me a full app. By the end of dinner the app was done In this video I walk through Manus and show you how to build incredible apps (ya, bookmark this)

Wow. AI agents are here I've been using Manus AI the last week and it actually is insane While at dinner I prompted it to build me a full app. By the end of dinner the app was done In this video I walk through Manus and show you how to build incredible apps (ya, bookmark this)

Alex Finn

231,980 görüntüleme • 1 yıl önce

We have to fix what Democrats have broken. The alternative is that they keep blocking and delaying the president’s team from getting in place, leaving the administration short-staffed by hundreds of positions at the end of President Trump’s term. I’m taking steps to end it.

We have to fix what Democrats have broken. The alternative is that they keep blocking and delaying the president’s team from getting in place, leaving the administration short-staffed by hundreds of positions at the end of President Trump’s term. I’m taking steps to end it.

Leader John Thune

138,543 görüntüleme • 10 ay önce

This is huge! A UCLA team managed to build an optical generative model that runs on light instead of GPUs. In their demo, a shallow encoder maps noise into phase patterns, which a free-space optical decoder then transforms into images—digits, fashion, butterflies, faces, even Van Gogh–style art—without any computation during synthesis. ⚡ The results rival digital diffusion models, pointing to ultra-fast, energy-efficient AI powered by photonics. Optical generative models | Nature Paper:

This is huge! A UCLA team managed to build an optical generative model that runs on light instead of GPUs. In their demo, a shallow encoder maps noise into phase patterns, which a free-space optical decoder then transforms into images—digits, fashion, butterflies, faces, even Van Gogh–style art—without any computation during synthesis. ⚡ The results rival digital diffusion models, pointing to ultra-fast, energy-efficient AI powered by photonics. Optical generative models | Nature Paper:

机器之心 JIQIZHIXIN

173,604 görüntüleme • 10 ay önce