正在加载视频...

视频加载失败

加载此视频时出现问题。这可能是由于临时网络问题，或视频可能不可用。

Three years ago, I started working on an easy-to-use tool for interpretable machine learning in science. I wanted it to do for symbolic regression what Theano did for deep learning. Today, I am beyond excited to share with you the paper describing it! 1.

Miles Cranmer

14,003 subscribers

240,342 次观看 • 3 年前 •via X (Twitter)

教育新闻政治科学技术

Anya Rossi• Live Now

Private livecam show

30 条评论

Miles Cranmer 的头像

Miles Cranmer3 年前

Symbolic Regression (SR) is a supervised learning task where the space of potential models is spanned by analytic expressions. Often, the goal is to find simple yet accurate expressions that lend themselves to interpretation🔍. 2.

Miles Cranmer 的头像

Miles Cranmer3 年前

Throughout history, scientists have performed SR "manually," using a mix of intuition and trial-and-error. Empirically-discovered expressions can lead to new theory developments (e.g., Kepler’s law=>Newton's gravity; Planck’s law=>Quantum). 3.

Miles Cranmer 的头像

Miles Cranmer3 年前

But, with much of ML used in science relying on blackbox models, I worry we often miss out on this crucial step of *understanding* the world. After all, that is the ultimate goal of science! The Latin word scientia literally means “to know.” 4.

Miles Cranmer 的头像

Miles Cranmer3 年前

To understand a concept, you need to first represent it in your language (however abstract that language is). I think SR is attractive since it grounds ML models in the language of science: symbolic expressions! Just look at any physics cheat sheet: 5.

Miles Cranmer 的头像

Miles Cranmer3 年前

That isn’t to argue that we avoid deep learning; one can actually use SR as a distillation tool for such blackbox models! 6.

Miles Cranmer 的头像

Miles Cranmer3 年前

However, when I started my thesis, available SR codes were either: - Easy to use but slow ⏳ - Fast but hard to use 🤔 The only fast and easy-to-use tool was Eureqa, a proprietary and closed-source tool, which meant no customization or embedding into an analysis pipeline. 7.

Miles Cranmer 的头像

Miles Cranmer3 年前

Enter PySR: fast, easy-to-use, and open-source🎉. Today, PySR has even more features than proprietary alternatives! 8.

Miles Cranmer 的头像

Miles Cranmer3 年前

A driver of deep learning's accelerated innovation is the strong open-source tooling – we need similar tooling for SR too. This is also why I have also split up the evaluation code of SymbolicRegression.jl into a separate library: DynamicExpressions.jl. 9.

Miles Cranmer 的头像

Miles Cranmer3 年前

This package makes it easy for others to create new symbolic regression libraries with new ideas, built on a strong foundation of highly optimized kernels used in PySR. Here’s a deep learning analogy: 10.

Miles Cranmer 的头像

Miles Cranmer3 年前

Okay, so how does PySR work? It’s a fairly traditional approach: a multi-population evolutionary algorithm. Expressions are represented as binary trees, and evolve via a series of mutations and crossovers applied to the “fittest” members of each subpopulation: 11.

Miles Cranmer 的头像

Miles Cranmer3 年前

But there are many other tricks: BFGS for constant optimization, algebraic simplification, simulated annealing, age-regularized tournament selection, and an adaptive complexity penalty. It’s a bit too much to describe precisely here, so please see the paper if curious 🙂 12.

Miles Cranmer 的头像

Miles Cranmer3 年前

PySR also works seamlessly across 1000s of cores. Each population evolves independently, and will asynchronously "migrate" between these independent populations to share updates. 13.

Miles Cranmer 的头像

Miles Cranmer3 年前

A motif in PySR's design is flexibility – while also being extremely high-performance. PySR ought to be a tool that can solve model discovery problems all throughout science, without needing hacks. Here's a comparison: (includes links so you can check these others out!) 14.

Miles Cranmer 的头像

Miles Cranmer3 年前

In the paper, I demo a benchmark based on historical discoveries, and see whether codes can re-discover these with little prior information. Where possible, I include original datasets! (for Leavitt’s law I had to manually read off data from a 1912 plot…) 15.

Miles Cranmer 的头像

Miles Cranmer3 年前

To really emulate the problem of discovering an unknown model, I use the same hyperparameters as each author submitted to the SRBench competition (as well as PySR), and let every code search for 1 hour on 8 cores. The rediscovery results (scored: yes/no) - 16.

Miles Cranmer 的头像

Miles Cranmer3 年前

All methods seem to struggle with Planck’s law and Rydberg formula, likely due to the unusual scaling. Pure deep learning methods (EQL + SR-Transformer) seem to have difficulty on a range of problems. 17.

Miles Cranmer 的头像

Miles Cranmer3 年前

We can see EQL experiencing numerical instabilities, and SR Transformer (pre-trained on synthetic expressions in various levels of noise) seems to generate overly complex expressions in every test. 18.

Miles Cranmer 的头像

Miles Cranmer3 年前

While it is important to note some of these are tuned for accuracy alone, it is very interesting that pure deep learning methods still really struggle here. Perhaps it is a testament to the difficulty of learning representations in the space of symbolic expressions. 19.

Miles Cranmer 的头像

Miles Cranmer3 年前

Regardless of this, DL methods still perform well on synthetic benchmarks, which is what they are tuned for, so I see hybrid approaches as very much worth pursuing! 20.

Miles Cranmer 的头像

Miles Cranmer3 年前

Today, PySR has a growing community across academia and industry, with users working in a variety of fields from economics to astronomy. I am looking forward to seeing it continue to grow! I would like to thank: 21.

Miles Cranmer 的头像

Miles Cranmer3 年前

for providing resources for pursuing this research; @cosmo_shirley and @DavidSpergel for countless insightful discussions about PySR, feedback on this manuscript, promotion of it as a tool in the sciences, and for their support of this project; 22.

Miles Cranmer 的头像

Miles Cranmer3 年前

my research collaborators who provided feedback throughout the development of PySR, including @PabloLemosP @PeterWBattaglia @eigensteve @JayWadekar1 @paco_astro @physicskaze Elaine Cui @CDKreisch Nathan Kutz @DrumBushField Keaton Burns @dkochkov1 23.

Miles Cranmer 的头像

Miles Cranmer3 年前

Alvaro Sanchez-Gonzalez @AstroCKragh @PatrickKidger @KyleCranmer @Niall_Jeffrey Ana Maria Delgado @AstroKeming Pierre-Alexandre Kamienny, Michael Douglas, @f_charton; all the wonderful open-source code contributors, including @markkitti, T Coxon, Dhananjay Ashok, 24.

Miles Cranmer 的头像

Miles Cranmer3 年前

Johan Blåbäck, Julius Martensen, GitHub user ngam, @ChrisRackauckas @l_II_llI, Charles Fox @johannbrehmer @cosmic_mar, GitHub user Coba, Pietro Monticone, Mateusz Kubica, GitHub user Jgmedina95, Michael Abbott, Oscar Smith, and several others; 25.

Miles Cranmer 的头像

Miles Cranmer3 年前

for extremely helpful comments on a draft of this paper, as well as general feedback throughout the project; @w_la_cava for insight throughout the project as for spearheading the SRBench initiative, along with the rest of the SRBench organizers; 26.

Miles Cranmer 的头像

Miles Cranmer3 年前

Brenden Petersen for feedback on PySR as well as providing insightful discussions about the SR landscape; and so many others (am likely forgetting some) who have provided support to the project through email, Twitter, GitHub issues, and in-person! 27.

Miles Cranmer 的头像

Miles Cranmer3 年前

I would like to give a huge thanks to the SRBench team as well. I think part of deep learning's continued success is the proliferation of well-tested benchmarks, and the SRBench team is doing this for symbolic regression! 28.

Miles Cranmer 的头像

Miles Cranmer3 年前

FAQ 1: What about concepts we can't represent with existing operators? A: Interpreting something requires representing it in our language (whether that language be mathematical, programmatical, conceptual, etc.). 29.

Miles Cranmer 的头像

Miles Cranmer3 年前

Sometimes those representations are hierarchical, and sometimes those representations are also fuzzy. But for each new concept we define and add to our language, we have to ground it in our existing language. 30.

Miles Cranmer 的头像

Miles Cranmer3 年前

In a symbolic distillation context, this could entail a "feature learning" network, followed by another network that uses those features. You would then distill both networks to expressions in your existing language. 31.

相关视频

Here is what I'd do if I wanted to start learning Machine Learning today.

Here is what I'd do if I wanted to start learning Machine Learning today.

Santiago

169,689 次观看 • 2 年前

Meet Ace! 3 years ago, I built my first Deep Learning Rig, something I wanted to do for a long time but procrastinated due to self doubt and random fear of failure. But the machine continues to run smoothly and it is indeed surreal to work on it everyday! Love you, Ace💙

Meet Ace! 3 years ago, I built my first Deep Learning Rig, something I wanted to do for a long time but procrastinated due to self doubt and random fear of failure. But the machine continues to run smoothly and it is indeed surreal to work on it everyday! Love you, Ace💙

Akanksha

43,916 次观看 • 2 年前

Five years ago I created TabNine, the first commercial code completion tool to use deep learning. Today I'm releasing Supermaven, the first code completion tool with a context window exceeding 100,000 tokens.

Five years ago I created TabNine, the first commercial code completion tool to use deep learning. Today I'm releasing Supermaven, the first code completion tool with a context window exceeding 100,000 tokens.

Jacob Jackson

496,066 次观看 • 2 年前

Kai Cenat explains the meaning behind his VIVET journal “This journal was gifted to me in October of last year when I started to learn how clothes are made. I decided to document everything I’ve been learning by writing and putting Polaroids in it. I’ve carried this with me ever since and what I have started doing is scanning the pages and putting them online for anyone else learning about clothing. I’m grateful for who I’ve met and what I’m continuously learning. I feel like I need to share it with you guys.”

Kai Cenat explains the meaning behind his VIVET journal “This journal was gifted to me in October of last year when I started to learn how clothes are made. I decided to document everything I’ve been learning by writing and putting Polaroids in it. I’ve carried this with me ever since and what I have started doing is scanning the pages and putting them online for anyone else learning about clothing. I’m grateful for who I’ve met and what I’m continuously learning. I feel like I need to share it with you guys.”

FearBuck

85,762 次观看 • 5 个月前

I never really got to explain this journal clearly so I’m going to start being transparent, this journal was gifted to me in October of last year when I started to learn how clothes are made. I decided to document everything I’ve been learning by writing and putting Polaroids in it. I’ve carried this with me ever since and what I have started doing is scanning the pages and putting them online for anyone else learning about clothing. I’m grateful for who I’ve met and what I’m continuously learning. I feel like I need to share it with you guys.

I never really got to explain this journal clearly so I’m going to start being transparent, this journal was gifted to me in October of last year when I started to learn how clothes are made. I decided to document everything I’ve been learning by writing and putting Polaroids in it. I’ve carried this with me ever since and what I have started doing is scanning the pages and putting them online for anyone else learning about clothing. I’m grateful for who I’ve met and what I’m continuously learning. I feel like I need to share it with you guys.

Kai Cenat

4,383,098 次观看 • 5 个月前

My new record is out today and I am so excited to share it with you. I am excited for you to hear and feel the love and intention of all the brilliant, committed people involved in making this album. Thank you for listening.

My new record is out today and I am so excited to share it with you. I am excited for you to hear and feel the love and intention of all the brilliant, committed people involved in making this album. Thank you for listening.

Neko Case

13,431 次观看 • 10 个月前

Whole Earth AI is a project-based learning tool. I built it to explore two questions: 1. what new ux patterns do LLMs make possible? 2. what might the montessori method applied to software for adults look like? here's what I learned,

Whole Earth AI is a project-based learning tool. I built it to explore two questions: 1. what new ux patterns do LLMs make possible? 2. what might the montessori method applied to software for adults look like? here's what I learned,

kasey

50,106 次观看 • 1 年前

I didn't study a lick of AI in college. I had studied computer science at MIT, but because I had been interning and working in quant trading, it was much more statistically focused than anything related to machine learning. For anyone else looking to get started, Cursor is an incredible first step for going from something you are more familiar with, like the traditional IDE, to understanding how AI can accelerate your current work. #sponsored #cursorpartner Cursor

I didn't study a lick of AI in college. I had studied computer science at MIT, but because I had been interning and working in quant trading, it was much more statistically focused than anything related to machine learning. For anyone else looking to get started, Cursor is an incredible first step for going from something you are more familiar with, like the traditional IDE, to understanding how AI can accelerate your current work. #sponsored #cursorpartner Cursor

Sarah Chieng

52,761 次观看 • 1 个月前

| Igbo Alphabet | I have always enjoyed creating personalized alphabets for our kids, but this time, I wanted to do something bigger—something that others could benefit from, too. That is why I am really excited to introduce my very own Igbo alphabet poster - available exclusively on my website, with options in both white and blue backgrounds. From my own experience, this can make learning the alphabet fun and easy. Whether you are teaching your kids or learning yourself, it is a great tool to connect with the Igbo language. Why not give it a try? I will be sharing updates soon on Nyereugo’s progress with it, and I cannot wait to see how it helps others! Take a look and order yours here:

| Igbo Alphabet | I have always enjoyed creating personalized alphabets for our kids, but this time, I wanted to do something bigger—something that others could benefit from, too. That is why I am really excited to introduce my very own Igbo alphabet poster - available exclusively on my website, with options in both white and blue backgrounds. From my own experience, this can make learning the alphabet fun and easy. Whether you are teaching your kids or learning yourself, it is a great tool to connect with the Igbo language. Why not give it a try? I will be sharing updates soon on Nyereugo’s progress with it, and I cannot wait to see how it helps others! Take a look and order yours here:

Nwanyi Ocha

46,678 次观看 • 1 年前

I wanted to share some news, after 20 years representing Wales I am retiring from international netball. It has been an amazing journey and I am excited for my next chapter! I am incredibly grateful for the experiences, memories, people and travels along the way 🏴󠁧󠁢󠁷󠁬󠁳󠁿🫶🏻🏐😊

I wanted to share some news, after 20 years representing Wales I am retiring from international netball. It has been an amazing journey and I am excited for my next chapter! I am incredibly grateful for the experiences, memories, people and travels along the way 🏴󠁧󠁢󠁷󠁬󠁳󠁿🫶🏻🏐😊

Suzy Drane

169,721 次观看 • 3 年前

I made this video, what do you guys think of it? Very easy tool to use

I made this video, what do you guys think of it? Very easy tool to use

MOXXIE ♣️

76,266 次观看 • 6 个月前

Wanna see a magic trick? I built a screenshot tool for the Lua Learning Tutor. This was not easy (to put it lightly) but it will be super useful for getting help on specific things without needing to spell it out. #RobloxDev

Wanna see a magic trick? I built a screenshot tool for the Lua Learning Tutor. This was not easy (to put it lightly) but it will be super useful for getting help on specific things without needing to spell it out. #RobloxDev

Zack Williams

11,211 次观看 • 2 年前

Excited to launch FieldDay (FieldDay) today. It’s the culmination of years of work in working on approachable tools for machine learning with a data-centric approach, and a first step towards enabling SMEs and enthusiasts to build intelligent apps end to end.

Excited to launch FieldDay (FieldDay) today. It’s the culmination of years of work in working on approachable tools for machine learning with a data-centric approach, and a first step towards enabling SMEs and enthusiasts to build intelligent apps end to end.

Aaron Abentheuer

20,910 次观看 • 3 年前

10 years ago today, I apologized in advance to my Dad for what I was about to do...

10 years ago today, I apologized in advance to my Dad for what I was about to do...

GJake

56,055 次观看 • 8 个月前

#Oliverscampaign I am exhausted today I have worked trying to get signatures for Oliver McGowan Mandatory Training on Learning Disability & Autism for Education staff whilst you slept We now have 23500 Please keep signing I cant do it without your help

#Oliverscampaign I am exhausted today I have worked trying to get signatures for Oliver McGowan Mandatory Training on Learning Disability & Autism for Education staff whilst you slept We now have 23500 Please keep signing I cant do it without your help

Paula McGowan OBE

36,338 次观看 • 3 年前

Three months ago I started walking bc I was jealous of the people I saw walking on my way to work and I decided that was what I wanted to do with my mornings. This wasn’t just about being fit for me, it was me proving to myself that I could have discipline and be consistent with

Three months ago I started walking bc I was jealous of the people I saw walking on my way to work and I decided that was what I wanted to do with my mornings. This wasn’t just about being fit for me, it was me proving to myself that I could have discipline and be consistent with

Reni

100,638 次观看 • 1 年前

I worked with Ifihan 👩🏾‍🍳🛠️ to make a portfolio for her that highlights her technical writing experience 😅 lots of learning but, it was an amazing journey. After Effects is actually fun to work with👉🏿👈🏿 thanks to the amazing tutors on awwwards. for everything I am learning about AE❤️

I worked with Ifihan 👩🏾‍🍳🛠️ to make a portfolio for her that highlights her technical writing experience 😅 lots of learning but, it was an amazing journey. After Effects is actually fun to work with👉🏿👈🏿 thanks to the amazing tutors on awwwards. for everything I am learning about AE❤️

Zeus

17,792 次观看 • 3 年前

Today, I am launching Paper Breakdown. - PBD gets you academic paper recommendations and lets you study CS/ML/AI research with LLM agents. - It highlights relevant sections directly in the actual PDF - generates flowcharts/illustrations too - we provide an in-build screenshot tool to send images to the agent directly from the paper. - we also got agentic paper search that allows you to search our database of 70,000+ CS and ML Arxiv papers in seconds using natural language. I have been building PBD for almost half a year - it all started as a means for me to keep up with research and use AI to produce visuals and scripts for my own YouTube videos. I have developed it enough to confidently recommend it to you. Visit our landing page to learn more.

Today, I am launching Paper Breakdown. - PBD gets you academic paper recommendations and lets you study CS/ML/AI research with LLM agents. - It highlights relevant sections directly in the actual PDF - generates flowcharts/illustrations too - we provide an in-build screenshot tool to send images to the agent directly from the paper. - we also got agentic paper search that allows you to search our database of 70,000+ CS and ML Arxiv papers in seconds using natural language. I have been building PBD for almost half a year - it all started as a means for me to keep up with research and use AI to produce visuals and scripts for my own YouTube videos. I have developed it enough to confidently recommend it to you. Visit our landing page to learn more.

AVB

14,579 次观看 • 7 个月前