Three years ago, I started working on an easy-to-use tool for interpretable machine learning in science. I wanted it to do for symbolic regression what Theano did for deep learning. Today, I am beyond excited to share with you the paper describing it! 1.

Miles Cranmer

14,003 subscribers

240,342 views • 3 years ago • via X (Twitter)

Education • News & Politics • Science & Technology

30 comments

Miles Cranmer • 3 years ago

Symbolic Regression (SR) is a supervised learning task where the space of potential models is spanned by analytic expressions. Often, the goal is to find simple yet accurate expressions that lend themselves to interpretation🔍. 2.
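One common way to make "simple yet accurate" concrete is to keep only the expressions that are not beaten on both complexity and loss at once (a Pareto front). The snippet below is a minimal sketch of that idea in plain Python. The candidate expressions, their complexity counts, and their losses are invented for illustration and are not results from the paper.

```python
def pareto_front(candidates):
    """Return the candidates that are not dominated in (complexity, loss).

    Each candidate is a tuple (expression_string, complexity, loss).
    """
    front = []
    best_loss = float("inf")
    # Walk from simplest to most complex; keep an expression only if it is
    # strictly more accurate than every simpler expression kept so far.
    for expr, complexity, loss in sorted(candidates, key=lambda c: (c[1], c[2])):
        if loss < best_loss:
            front.append((expr, complexity, loss))
            best_loss = loss
    return front


# Hypothetical candidates, invented for illustration only.
candidates = [
    ("x", 1, 4.0),
    ("x + 1", 3, 2.5),
    ("x * x", 3, 0.9),
    ("x * x + x", 5, 0.9),
    ("x * x + 0.5", 5, 0.1),
]
front = pareto_front(candidates)
for expr, complexity, loss in front:
    print(f"{complexity:>2}  {loss:.2f}  {expr}")
```

Choosing one expression from such a front is then a judgment about how much extra complexity a given gain in accuracy is worth.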

Throughout history, scientists have performed SR "manually," using a mix of intuition and trial-and-error. Empirically discovered expressions can lead to new theory developments (e.g., Kepler’s laws => Newton's gravity; Planck’s law => quantum theory). 3.

But, with much of ML used in science relying on blackbox models, I worry we often miss out on this crucial step of *understanding* the world. After all, that is the ultimate goal of science! The Latin word scientia literally means “knowledge.” 4.

To understand a concept, you need to first represent it in your language (however abstract that language is). I think SR is attractive since it grounds ML models in the language of science: symbolic expressions! Just look at any physics cheat sheet: 5.

That isn’t to argue that we avoid deep learning; one can actually use SR as a distillation tool for such blackbox models! 6.

However, when I started my thesis, available SR codes were either:
- Easy to use but slow ⏳
- Fast but hard to use 🤔

The only fast and easy-to-use tool was Eureqa, a proprietary and closed-source tool, which meant no customization or embedding into an analysis pipeline. 7.

Enter PySR: fast, easy-to-use, and open-source🎉. Today, PySR has even more features than proprietary alternatives! 8.

A driver of deep learning's accelerated innovation is the strong open-source tooling – we need similar tooling for SR too. This is why I have also split up the evaluation code of SymbolicRegression.jl into a separate library: DynamicExpressions.jl. 9.

This package makes it easy for others to create new symbolic regression libraries with new ideas, built on a strong foundation of highly optimized kernels used in PySR. Here’s a deep learning analogy: 10.

Okay, so how does PySR work? It’s a fairly traditional approach: a multi-population evolutionary algorithm. Expressions are represented as binary trees, and evolve via a series of mutations and crossovers applied to the “fittest” members of each subpopulation: 11.
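To make the tree representation concrete, here is a minimal sketch in plain Python of expressions as binary trees with one mutation and one crossover operator. It is not PySR's implementation (the actual search runs in SymbolicRegression.jl); the `Node` class, the operator set, and the specific mutation choices are assumptions made for illustration.

```python
import random

# Hypothetical operator set, chosen only for this illustration.
BINARY_OPS = {
    "+": lambda a, b: a + b,
    "-": lambda a, b: a - b,
    "*": lambda a, b: a * b,
}


class Node:
    """Binary expression tree node: an operator, a feature index, or a constant."""

    def __init__(self, op=None, left=None, right=None, feature=None, value=None):
        self.op, self.left, self.right = op, left, right
        self.feature, self.value = feature, value

    def evaluate(self, x):
        if self.op is not None:
            return BINARY_OPS[self.op](self.left.evaluate(x), self.right.evaluate(x))
        return x[self.feature] if self.feature is not None else self.value

    def nodes(self):
        yield self
        if self.op is not None:
            yield from self.left.nodes()
            yield from self.right.nodes()

    def copy(self):
        if self.op is None:
            return Node(feature=self.feature, value=self.value)
        return Node(self.op, self.left.copy(), self.right.copy())


def mutate(tree, rng):
    """Return a copy of `tree` with one randomly chosen node changed."""
    child = tree.copy()
    node = rng.choice(list(child.nodes()))
    if node.op is not None:
        node.op = rng.choice(sorted(BINARY_OPS))  # resample the operator
    elif node.value is not None:
        node.value *= 1.0 + rng.uniform(-0.5, 0.5)  # perturb the constant
    else:
        # Turn a feature leaf into a constant leaf.
        node.feature, node.value = None, rng.uniform(-1.0, 1.0)
    return child


def crossover(parent_a, parent_b, rng):
    """Return a copy of `parent_a` with one subtree replaced by a subtree of `parent_b`."""
    child = parent_a.copy()
    target = rng.choice(list(child.nodes()))
    donor = rng.choice(list(parent_b.nodes())).copy()
    target.op, target.left, target.right = donor.op, donor.left, donor.right
    target.feature, target.value = donor.feature, donor.value
    return child


# The expression x0 * x0 + 2.0 as a tree.
tree = Node("+", Node("*", Node(feature=0), Node(feature=0)), Node(value=2.0))
rng = random.Random(0)
mutant = mutate(tree, rng)
offspring = crossover(tree, mutant, rng)
print(tree.evaluate([3.0]))
```

Both operators work on copies, so a parent stays in its population unchanged while its offspring is evaluated.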

But there are many other tricks: BFGS for constant optimization, algebraic simplification, simulated annealing, age-regularized tournament selection, and an adaptive complexity penalty. It’s a bit too much to describe precisely here, so please see the paper if curious 🙂 12.
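Two of these tricks are easy to sketch together: tournament selection with age-regularized replacement (the oldest member is evicted, not the worst) and a simulated-annealing acceptance test. The code below is a generic illustration under stated assumptions, not the paper's exact procedure. The "model" is just a number fitted to a toy target, and the tournament size, cooling schedule, and loss are all hypothetical. BFGS constant optimization, simplification, and the adaptive complexity penalty are not shown.

```python
import math
import random


def tournament(population, rng, size=3):
    """Pick the lowest-loss member from a random subset of the population."""
    return min(rng.sample(population, size), key=lambda m: m["loss"])


def anneal_accept(old_loss, new_loss, temperature, rng):
    """Always accept improvements; accept regressions with Boltzmann probability."""
    if new_loss <= old_loss:
        return True
    if temperature <= 0.0:
        return False
    return rng.random() < math.exp(-(new_loss - old_loss) / temperature)


def evolve_step(population, step, temperature, loss_fn, mutate_fn, rng):
    """One step: select a parent, mutate it, and maybe insert the result."""
    parent = tournament(population, rng)
    candidate = mutate_fn(parent["model"], rng)
    new_loss = loss_fn(candidate)
    if anneal_accept(parent["loss"], new_loss, temperature, rng):
        # Age regularization: replace the oldest member, not the worst one.
        oldest = min(range(len(population)), key=lambda i: population[i]["birth"])
        population[oldest] = {"model": candidate, "loss": new_loss, "birth": step}


# Toy problem (hypothetical): the "model" is a single number, and the target is 3.
def loss_fn(model):
    return (model - 3.0) ** 2


def mutate_fn(model, rng):
    return model + rng.gauss(0.0, 0.5)


rng = random.Random(0)
population = [
    {"model": m, "loss": loss_fn(m), "birth": 0}
    for m in (rng.uniform(-5.0, 5.0) for _ in range(8))
]
n_steps = 200
for step in range(1, n_steps + 1):
    temperature = 1.0 - step / n_steps  # linear cooling towards zero
    evolve_step(population, step, temperature, loss_fn, mutate_fn, rng)
```

Evicting by age rather than by loss means even a very good member eventually leaves the population, which is one way to keep the search from stalling on an early winner.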

PySR also works seamlessly across 1000s of cores. Each population evolves independently, and expressions asynchronously "migrate" between these populations to share updates. 13.
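The migration idea can be illustrated with a synchronous, single-process sketch: each population ("island") periodically receives copies of the best members found elsewhere. This is a simplification for illustration only. PySR's migration is asynchronous and distributed, and the `migrate` function, the replacement fraction, and the example members below are assumptions.

```python
import random


def migrate(islands, rng, fraction=0.25):
    """Copy the best member of every other island into random slots of each island.

    Each island is a list of (expression_string, loss) tuples.
    """
    # Record each island's best member before anything is overwritten.
    best = [min(island, key=lambda m: m[1]) for island in islands]
    for i, island in enumerate(islands):
        donors = [b for j, b in enumerate(best) if j != i]
        n_replace = max(1, int(fraction * len(island)))
        for slot in rng.sample(range(len(island)), n_replace):
            island[slot] = rng.choice(donors)


# Hypothetical populations, invented for illustration only.
island_a = [("x", 4.0), ("x + 1", 2.5), ("x * x", 0.1), ("1", 9.0)]
island_b = [("sin(x)", 3.0), ("x * 2", 5.0), ("x - 1", 6.0), ("2", 8.0)]
islands = [island_a, island_b]
migrate(islands, random.Random(0))
```

Between migrations each island can be evolved on its own core without any communication, which is what makes this style of search straightforward to parallelize.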

A motif in PySR's design is flexibility – while also being extremely high-performance. PySR ought to be a tool that can solve model discovery problems all throughout science, without needing hacks. Here's a comparison: (includes links so you can check these others out!) 14.

In the paper, I demo a benchmark based on historical discoveries, and see whether codes can re-discover these with little prior information. Where possible, I include original datasets! (for Leavitt’s law I had to manually read off data from a 1912 plot…) 15.

To really emulate the problem of discovering an unknown model, I use the same hyperparameters as each author submitted to the SRBench competition (as well as PySR), and let every code search for 1 hour on 8 cores. The rediscovery results (scored: yes/no) - 16.

All methods seem to struggle with Planck’s law and the Rydberg formula, likely due to the unusual scaling. Pure deep learning methods (EQL + SR-Transformer) seem to have difficulty on a range of problems. 17.

We can see EQL experiencing numerical instabilities, and SR-Transformer (pre-trained on synthetic expressions at various noise levels) seems to generate overly complex expressions in every test. 18.

While it is important to note some of these are tuned for accuracy alone, it is very interesting that pure deep learning methods still really struggle here. Perhaps it is a testament to the difficulty of learning representations in the space of symbolic expressions. 19.

Regardless of this, DL methods still perform well on synthetic benchmarks, which is what they are tuned for, so I see hybrid approaches as very much worth pursuing! 20.

Today, PySR has a growing community across academia and industry, with users working in a variety of fields from economics to astronomy. I am looking forward to seeing it continue to grow! I would like to thank: 21.

for providing resources for pursuing this research; @cosmo_shirley and @DavidSpergel for countless insightful discussions about PySR, feedback on this manuscript, promotion of it as a tool in the sciences, and for their support of this project; 22.

my research collaborators who provided feedback throughout the development of PySR, including @PabloLemosP @PeterWBattaglia @eigensteve @JayWadekar1 @paco_astro @physicskaze Elaine Cui @CDKreisch Nathan Kutz @DrumBushField Keaton Burns @dkochkov1 23.

Alvaro Sanchez-Gonzalez @AstroCKragh @PatrickKidger @KyleCranmer @Niall_Jeffrey Ana Maria Delgado @AstroKeming Pierre-Alexandre Kamienny, Michael Douglas, @f_charton; all the wonderful open-source code contributors, including @markkitti, T Coxon, Dhananjay Ashok, 24.

Johan Blåbäck, Julius Martensen, GitHub user ngam, @ChrisRackauckas @l_II_llI, Charles Fox @johannbrehmer @cosmic_mar, GitHub user Coba, Pietro Monticone, Mateusz Kubica, GitHub user Jgmedina95, Michael Abbott, Oscar Smith, and several others; 25.

for extremely helpful comments on a draft of this paper, as well as general feedback throughout the project; @w_la_cava for insight throughout the project as well as for spearheading the SRBench initiative, along with the rest of the SRBench organizers; 26.

Brenden Petersen for feedback on PySR as well as providing insightful discussions about the SR landscape; and so many others (am likely forgetting some) who have provided support to the project through email, Twitter, GitHub issues, and in-person! 27.

I would like to give a huge thanks to the SRBench team as well. I think part of deep learning's continued success is the proliferation of well-tested benchmarks, and the SRBench team is doing this for symbolic regression! 28.

FAQ 1: What about concepts we can't represent with existing operators? A: Interpreting something requires representing it in our language (whether that language be mathematical, programmatic, conceptual, etc.). 29.

Sometimes those representations are hierarchical, and sometimes those representations are also fuzzy. But for each new concept we define and add to our language, we have to ground it in our existing language. 30.

In a symbolic distillation context, this could entail a "feature learning" network, followed by another network that uses those features. You would then distill both networks to expressions in your existing language. 31.
