Yacine Mahdid's banner

Yacine Mahdid

@yacinelearning • 27,385 subscribers

(neuro/ai) I make technical deep learning tutorials 👺

Shorts

there is a rich and under-tapped paradigm where LLMs help you learn fundamentals topics in computer science which I think the folks at deep-ml have integrated beautifully for ML in a very engaging way this learning corner of the internet has became one of my favorite

there is a rich and under-tapped paradigm where LLMs help you learn fundamentals topics in computer science which I think the folks at deep-ml have integrated beautifully for ML in a very engaging way this learning corner of the internet has became one of my favorite

227,055 次观看

every time I see the primeagen in the bottom right corner I zone out and all I can think about is this masterpiece

every time I see the primeagen in the bottom right corner I zone out and all I can think about is this masterpiece

181,846 次观看

Videos

Anya Rossi

sweetdream.ai

SweetDream.ai•Sponsored•Livecam

Watch Anya Live

Anya is streaming live right now! Join her private show and enjoy exclusive content.

Exclusive private shows

1.2k viewers online

Private Show

Join now for exclusive access

Free preview available • Premium content

accurate representation of the best researchers I know going about their day

accurate representation of the best researchers I know going about their day

376,694 次观看 • 7 个月前

I had a fantastic time discussing with the learning legend Justin Skycak from Math Academy about learning math in the modern age. we've talked about his quite impressive self-learning journey (3000h of math in high school) all the way to how he hand curated the initial knowledge graph for math academy to make that process more efficient. great lively 3h discussion here are the chapters: 0:00:00 - intro: 0:02:10 - justin background 0:05:45 - 3000h math self study in high school 0:11:45 - what a day looked like for that 3000h stretch 0:16:10 - meta-learning vs pure math learning 0:21:50 - when did you get into cognitive neuro? 0:29:55 - how did the fundamental math helped in your research projects 0:43:10 - what does the math academy learning system looks like 0:47:34 - how did you guys build the 2000 topic knowledge graph 1:01:15 - would LLM be useful as an interface to that knowledge graph for the students? 1:10:46 - how does the FIRe spaced repetition algorithm works? 1:17:34 - does the same knowledge graph structure would work for physics? or other topic?: 1:34:05 - how do you understand the subject vs the curiculum 1:35:50 - is there a connection between studying math and learning a sport? 1:42:00 - do you think in math doing and teaching requires different skills? 1:56:25 - could you get understanding without automaticy? 2:05:35 - do you see any upside of confusion in learning? 2:14:11 - learning math as an adult? 2:19:20 - how to fill the motivation gap after learning the fundamental? 2:24:10 - how should teaching math for kids and adults balance fundamentals and creativity? 2:33:55 - is it ever too late to learn math seriously? 2:46:00 - mastery learning vs ultra learning 2:51:30 - top-down vs bottom-up 2:53:40 - mastery learning for domain without a structured hierarchical structure? 2:56:30 - neurodivergence / adhd for structured math learning? 3:06:20 - amateur mathematician augmented with technology will be able to contribute to research? 3:14:37 - what are you most excited about right now in term of learning enjoy!

I had a fantastic time discussing with the learning legend Justin Skycak from Math Academy about learning math in the modern age. we've talked about his quite impressive self-learning journey (3000h of math in high school) all the way to how he hand curated the initial knowledge graph for math academy to make that process more efficient. great lively 3h discussion here are the chapters: 0:00:00 - intro: 0:02:10 - justin background 0:05:45 - 3000h math self study in high school 0:11:45 - what a day looked like for that 3000h stretch 0:16:10 - meta-learning vs pure math learning 0:21:50 - when did you get into cognitive neuro? 0:29:55 - how did the fundamental math helped in your research projects 0:43:10 - what does the math academy learning system looks like 0:47:34 - how did you guys build the 2000 topic knowledge graph 1:01:15 - would LLM be useful as an interface to that knowledge graph for the students? 1:10:46 - how does the FIRe spaced repetition algorithm works? 1:17:34 - does the same knowledge graph structure would work for physics? or other topic?: 1:34:05 - how do you understand the subject vs the curiculum 1:35:50 - is there a connection between studying math and learning a sport? 1:42:00 - do you think in math doing and teaching requires different skills? 1:56:25 - could you get understanding without automaticy? 2:05:35 - do you see any upside of confusion in learning? 2:14:11 - learning math as an adult? 2:19:20 - how to fill the motivation gap after learning the fundamental? 2:24:10 - how should teaching math for kids and adults balance fundamentals and creativity? 2:33:55 - is it ever too late to learn math seriously? 2:46:00 - mastery learning vs ultra learning 2:51:30 - top-down vs bottom-up 2:53:40 - mastery learning for domain without a structured hierarchical structure? 2:56:30 - neurodivergence / adhd for structured math learning? 3:06:20 - amateur mathematician augmented with technology will be able to contribute to research? 3:14:37 - what are you most excited about right now in term of learning enjoy!

57,320 次观看 • 3 个月前

auto-research is starting to gain traction as a very viable paradigm for creating useful research discovery. now, that paradigm is still in its infancy and the infrastructure to hold all that trail of context as the agents blaze through experiments isn't well defined (to say the least). on that topic, I had the chance to chat with my boys francesco and giulio from paradigma about what underlying infra is needed to make this paradigm work. the paradigma's paradigm, which involves copious amount of DAGs, make this auto-research paradigm a paradigmatic case of essential infrastructure. here's the full video in full: - 0:00 - what is missing from auto-research? - 2:02 - giulio and francesco ai journey - 8:10 - research infra is the bottleneck? - 10:18 - paradigma vision of autonomous research - 13:17 - “important discovery per joules” - 17:15 - why is DAG the unit of research for auto-research? - 20:40 - is paradigma trying to replace the research publication? - 24:50 - how does knowledge is shared between experiments in the DAG? - 27:34 - what is even auto-research lol? - 33:53 - the value of the human mind in this auto-research future. - 37:00 - how do you reconcile hallucination in this auto-research paradigm? - 41:33 - the adoption of auto-research across varied fields? - 47:30 - ✨ introduction to the auto-research infrastructure. ✨ - 56:55 - where is the code? - 59:10 - full IDE next? - 1:03:20 - the place of the human in this DAG / code quality? manual node? token spent? - 1:16:02 - who’s the user for auto-research? - 1:18:13 - how to validate bad DAG? - 1:20:18 - ✨ auto-research agent results ✨ - 1:22:53 - ✨ how a big research DAG looks like? ✨ - 1:25:10 - how to get the canonical DAG for the final result? - 1:27:50 - the auto-research DAG being the new pre-print? - 1:30:05 - what’s next for paradigma and the auto-research infra? - 1:35:00 - what are they excited about research wise? enjoyyyyy my guys 🌹

auto-research is starting to gain traction as a very viable paradigm for creating useful research discovery. now, that paradigm is still in its infancy and the infrastructure to hold all that trail of context as the agents blaze through experiments isn't well defined (to say the least). on that topic, I had the chance to chat with my boys francesco and giulio from paradigma about what underlying infra is needed to make this paradigm work. the paradigma's paradigm, which involves copious amount of DAGs, make this auto-research paradigm a paradigmatic case of essential infrastructure. here's the full video in full: - 0:00 - what is missing from auto-research? - 2:02 - giulio and francesco ai journey - 8:10 - research infra is the bottleneck? - 10:18 - paradigma vision of autonomous research - 13:17 - “important discovery per joules” - 17:15 - why is DAG the unit of research for auto-research? - 20:40 - is paradigma trying to replace the research publication? - 24:50 - how does knowledge is shared between experiments in the DAG? - 27:34 - what is even auto-research lol? - 33:53 - the value of the human mind in this auto-research future. - 37:00 - how do you reconcile hallucination in this auto-research paradigm? - 41:33 - the adoption of auto-research across varied fields? - 47:30 - ✨ introduction to the auto-research infrastructure. ✨ - 56:55 - where is the code? - 59:10 - full IDE next? - 1:03:20 - the place of the human in this DAG / code quality? manual node? token spent? - 1:16:02 - who’s the user for auto-research? - 1:18:13 - how to validate bad DAG? - 1:20:18 - ✨ auto-research agent results ✨ - 1:22:53 - ✨ how a big research DAG looks like? ✨ - 1:25:10 - how to get the canonical DAG for the final result? - 1:27:50 - the auto-research DAG being the new pre-print? - 1:30:05 - what’s next for paradigma and the auto-research infra? - 1:35:00 - what are they excited about research wise? enjoyyyyy my guys 🌹

12,091 次观看 • 1 个月前

I had an awesome time interviewing idan shenfeld and Jonas Hübotter from MIT and ETH Zurich about self-distillation. this very promising post-training paradigm where the model acts as its own teacher by conditioning on environment feedback or demonstrations. we cover the SDPO algo for reinforcement learning with rich feedback and SDFT for continual learning without forgetting along with many applications. we dig into how it works, why it's simpler and faster than GRPO, and where this is already showing up in production systems. table of content: 0:00 - what is self distillation 2:50 - idan (MIT) and jonas (ETH Zurich) introduction and motivation 18:40 - different perspective of on-policy self-distillation (presentation) 36:00 - metacognition and specificity in self-distillation 37:24 - very long hard task and self-distillation 42:00 - continual learning with self-distillation (presentation) 1:16:50 - what is next in this research direction? 1:20:00 - is there any experience with subjective feedbacks? 1:22:50 - quality vs number of feedbacks? 1:26:40 - what setting would self-distillation struggle vs GRPO? the slides were super crisp really cool of them to share! enjoy my guys 🌹

I had an awesome time interviewing idan shenfeld and Jonas Hübotter from MIT and ETH Zurich about self-distillation. this very promising post-training paradigm where the model acts as its own teacher by conditioning on environment feedback or demonstrations. we cover the SDPO algo for reinforcement learning with rich feedback and SDFT for continual learning without forgetting along with many applications. we dig into how it works, why it's simpler and faster than GRPO, and where this is already showing up in production systems. table of content: 0:00 - what is self distillation 2:50 - idan (MIT) and jonas (ETH Zurich) introduction and motivation 18:40 - different perspective of on-policy self-distillation (presentation) 36:00 - metacognition and specificity in self-distillation 37:24 - very long hard task and self-distillation 42:00 - continual learning with self-distillation (presentation) 1:16:50 - what is next in this research direction? 1:20:00 - is there any experience with subjective feedbacks? 1:22:50 - quality vs number of feedbacks? 1:26:40 - what setting would self-distillation struggle vs GRPO? the slides were super crisp really cool of them to share! enjoy my guys 🌹

13,023 次观看 • 2 个月前

没有更多内容可加载