Загрузка видео...

Не удалось загрузить видео

Возникла проблема при загрузке этого видео. Это может быть связано с временными проблемами сети или видео может быть недоступно.

На главную

Functions are vectors! This perspective lets us apply the tools of linear algebra to computational problems from image and geometry processing to machine learning and light transport—and provides a natural explanation for Fourier series. Let's explore:

Max Slater

2,957 subscribers

30,424 просмотров • 2 лет назад •via X (Twitter)

Anya Rossi• Live Now

Private livecam show

Комментарии: 0

Нет доступных комментариев

Здесь появятся комментарии из оригинального поста

Похожие видео

Light doesn’t fake geometry, and perspective doesn’t lie, but the explanation never seems to stay the same.

Light doesn’t fake geometry, and perspective doesn’t lie, but the explanation never seems to stay the same.

Flat Earth Zone

18,600 просмотров • 2 месяцев назад

the first free interactive linear algebra textbook: - vectors and matrices - dot product and vector product - determinants - matrix ranks - eigenvalues and eigenvectors

the first free interactive linear algebra textbook: - vectors and matrices - dot product and vector product - determinants - matrix ranks - eigenvalues and eigenvectors

ℏεsam

63,641 просмотров • 1 год назад

[CLIP] by Hand ✍️ The CLIP (Contrastive Language–Image Pre-training) model, a groundbreaking work by OpenAI, redefines the intersection of computer vision and natural language processing. It is the basis of all the multi-modal foundation models we see today. How does CLIP work? Goal: 🟨 Learn a shared embedding space for text and image [1] Given ↳ A mini batch of 3 text-image pairs ↳ OpenAI used 400 million text-image pairs to train its original CLIP model. Process 1st pair: "big table" [2] 🟪 Text → 2 Vectors (3D) ↳ Look up word embedding vectors using word2vec. [3] 🟩 Image → 2 Vectors (4D) ↳ Divide the image into two patches. ↳ Flatten each patch [4] Process other pairs ↳ Repeat [2]-[3] [5] 🟪 Text Encoder & 🟩 Image Encoder ↳ Encode input vectors into feature vectors ↳ Here, both encoders are simple one layer perceptron (linear + ReLU) ↳ In practice, the encoders are usually transformer models. [6] 🟪 🟩 Mean Pooling: 2 → 1 vector ↳ Average 2 feature vectors into a single vector by averaging across the columns ↳ The goal is to have one vector to represent each image or text [7] 🟪 🟩 -> 🟨 Projection ↳ Note that the text and image feature vectors from the encoders have different dimensions (3D vs. 4D). ↳ Use a linear layer to project image and text vectors to a 2D shared embedding space. 🏋️ Contrastive Pre-training 🏋️ [8] Prepare for MatMul ↳ Copy text vectors (T1,T2,T3) ↳ Copy the transpose of image vectors (I1,I2,I3) ↳ They are all in the 2D shared embedding space. [9] 🟦 MatMul ↳ Multiply T and I matrices. ↳ This is equivalent to taking dot product between every pair of image and text vectors. ↳ The purpose is to use dot product to estimate the similarity between a pair of image-text. [10] 🟦 Softmax: e^x ↳ Raise e to the power of the number in each cell ↳ To simplify hand calculation, we approximate e^□ with 3^□. [11] 🟦 Softmax: ∑ ↳ Sum each row for 🟩 image→🟪 text ↳ Sum each column for 🟪 text→ 🟩 image [12] 🟦 Softmax: 1 / sum ↳ Divide each element by the column sum to obtain a similarity matrix for 🟪 text→🟩 image ↳ Divide each element by the row sum to obtain a similarity matrix for 🟩 image→🟪 text [13] 🟥 Loss Gradients ↳ The "Targets" for the similarity matrices are Identity Matrices. ↳ Why? If I and T come from the same pair (i=j), we want the highest value, which is 1, and 0 otherwise. ↳ Apply the simple equation of [Similarity - Target] to compute gradients of for both directions. ↳ Why so simple? Because when Softmax and Cross-Entropy Loss are used together, the math magically works out that way. ↳ These gradients kick off the backpropagation process to update weights and biases of the encoders and projection layers (red borders).

[CLIP] by Hand ✍️ The CLIP (Contrastive Language–Image Pre-training) model, a groundbreaking work by OpenAI, redefines the intersection of computer vision and natural language processing. It is the basis of all the multi-modal foundation models we see today. How does CLIP work? Goal: 🟨 Learn a shared embedding space for text and image [1] Given ↳ A mini batch of 3 text-image pairs ↳ OpenAI used 400 million text-image pairs to train its original CLIP model. Process 1st pair: "big table" [2] 🟪 Text → 2 Vectors (3D) ↳ Look up word embedding vectors using word2vec. [3] 🟩 Image → 2 Vectors (4D) ↳ Divide the image into two patches. ↳ Flatten each patch [4] Process other pairs ↳ Repeat [2]-[3] [5] 🟪 Text Encoder & 🟩 Image Encoder ↳ Encode input vectors into feature vectors ↳ Here, both encoders are simple one layer perceptron (linear + ReLU) ↳ In practice, the encoders are usually transformer models. [6] 🟪 🟩 Mean Pooling: 2 → 1 vector ↳ Average 2 feature vectors into a single vector by averaging across the columns ↳ The goal is to have one vector to represent each image or text [7] 🟪 🟩 -> 🟨 Projection ↳ Note that the text and image feature vectors from the encoders have different dimensions (3D vs. 4D). ↳ Use a linear layer to project image and text vectors to a 2D shared embedding space. 🏋️ Contrastive Pre-training 🏋️ [8] Prepare for MatMul ↳ Copy text vectors (T1,T2,T3) ↳ Copy the transpose of image vectors (I1,I2,I3) ↳ They are all in the 2D shared embedding space. [9] 🟦 MatMul ↳ Multiply T and I matrices. ↳ This is equivalent to taking dot product between every pair of image and text vectors. ↳ The purpose is to use dot product to estimate the similarity between a pair of image-text. [10] 🟦 Softmax: e^x ↳ Raise e to the power of the number in each cell ↳ To simplify hand calculation, we approximate e^□ with 3^□. [11] 🟦 Softmax: ∑ ↳ Sum each row for 🟩 image→🟪 text ↳ Sum each column for 🟪 text→ 🟩 image [12] 🟦 Softmax: 1 / sum ↳ Divide each element by the column sum to obtain a similarity matrix for 🟪 text→🟩 image ↳ Divide each element by the row sum to obtain a similarity matrix for 🟩 image→🟪 text [13] 🟥 Loss Gradients ↳ The "Targets" for the similarity matrices are Identity Matrices. ↳ Why? If I and T come from the same pair (i=j), we want the highest value, which is 1, and 0 otherwise. ↳ Apply the simple equation of [Similarity - Target] to compute gradients of for both directions. ↳ Why so simple? Because when Softmax and Cross-Entropy Loss are used together, the math magically works out that way. ↳ These gradients kick off the backpropagation process to update weights and biases of the encoders and projection layers (red borders).

Tom Yeh

67,790 просмотров • 2 лет назад

This textbook provides a comprehensive treatment of circuit elements, fundamental laws, and efficient analysis techniques for linear circuits, including algebraic graph theory, state-variable methods, Fourier transforms, and problem sets. Quote WSTWTR35 to enjoy 35% off today!

This textbook provides a comprehensive treatment of circuit elements, fundamental laws, and efficient analysis techniques for linear circuits, including algebraic graph theory, state-variable methods, Fourier transforms, and problem sets. Quote WSTWTR35 to enjoy 35% off today!

World Scientific

115,209 просмотров • 11 месяцев назад

We've released 10+ Linear Algebra concepts for Machine Learning! Here's one of the visualization how PCA identifies direction of maximum variance.

We've released 10+ Linear Algebra concepts for Machine Learning! Here's one of the visualization how PCA identifies direction of maximum variance.

TensorTonic

48,303 просмотров • 5 месяцев назад

Watch our co-founder, , explain how the future of machine learning will lead us to a continuously updated digital representation of the world on the internet, and why this requires shifts in the way we access and allocate computational resources.

Watch our co-founder, , explain how the future of machine learning will lead us to a continuously updated digital representation of the world on the internet, and why this requires shifts in the way we access and allocate computational resources.

gensyn

29,885 просмотров • 1 год назад

Dr. Robert Snellman's eight-hour course: The Fundamentals of Mathematics: Algebra, is available now. Enroll now for immediate access at Peterson Academy. Accompanying the course are practice questions as well! In this course, Dr. Snellman offers an accelerated and comprehensive introduction to algebra. He begins with basic variable manipulation and linear equations, then progresses to polynomials, quadratic equations, and exponential functions. Guided by Dr. Snellman, we build a strong mathematical foundation through a systematic study of functions, graphs, and solution methods, with a consistent emphasis on real-world applications in physics, finance, and cryptography. The course introduces increasingly sophisticated mathematical concepts and culminates in systems of linear equations and matrix operations, demonstrating how algebraic thinking can model and solve meaningful, practical problems.

Dr. Robert Snellman's eight-hour course: The Fundamentals of Mathematics: Algebra, is available now. Enroll now for immediate access at Peterson Academy. Accompanying the course are practice questions as well! In this course, Dr. Snellman offers an accelerated and comprehensive introduction to algebra. He begins with basic variable manipulation and linear equations, then progresses to polynomials, quadratic equations, and exponential functions. Guided by Dr. Snellman, we build a strong mathematical foundation through a systematic study of functions, graphs, and solution methods, with a consistent emphasis on real-world applications in physics, finance, and cryptography. The course introduces increasingly sophisticated mathematical concepts and culminates in systems of linear equations and matrix operations, demonstrating how algebraic thinking can model and solve meaningful, practical problems.

Peterson Academy

59,142 просмотров • 1 год назад

The Fourier series, modeling periodic functions with cosines and sines, and computing Fourier coefficients as projections onto orthogonal bases in Python. source: Steve Brunton

The Fourier series, modeling periodic functions with cosines and sines, and computing Fourier coefficients as projections onto orthogonal bases in Python. source: Steve Brunton

tetsuo

80,704 просмотров • 8 месяцев назад

Fourier Transform - Perfect and Final Version When you plot 225 functions, each constructed by up to 1000 complex exponential functions (i.e., rotating vectors) you get the following drawing. Math is Beautiful Math is Art

Fourier Transform - Perfect and Final Version When you plot 225 functions, each constructed by up to 1000 complex exponential functions (i.e., rotating vectors) you get the following drawing. Math is Beautiful Math is Art

Carl Friedrich Gauss

33,028 просмотров • 5 месяцев назад

Data preparation! It's crucial for machine learning, and we all hate it. Tools and techniques to reduce this burden? A quick summary of 10 years of R&D on this, from cheap tricks to LLMs and graph neural networks 1/9

Data preparation! It's crucial for machine learning, and we all hate it. Tools and techniques to reduce this burden? A quick summary of 10 years of R&D on this, from cheap tricks to LLMs and graph neural networks 1/9

Gael Varoquaux 🦋

13,752 просмотров • 1 год назад

AI Cinema is rushing towards us at light speed. These images are generated with #stablediffusion #sdxl and used as image prompts in Runway #gen2 to create the animations. Music = Principles of Geometry

AI Cinema is rushing towards us at light speed. These images are generated with #stablediffusion #sdxl and used as image prompts in Runway #gen2 to create the animations. Music = Principles of Geometry

-=HACKMANS=-

146,238 просмотров • 2 лет назад

a playlist of 30 youtube videos to learn machine learning fundamentals from scratch if you're struggling on where to start learning ML, this list goes this "Machine Learning: Teach by Doing" is a solid choice to learn both theory and code. (1) Introduction to Machine Learning Teach by Doing: (2) What is Machine Learning? History of Machine Learning: (3) Types of ML Models: (4) 6 steps of any ML project: (5) Install Python and VSCode and run your first code: (6) Linear Classifiers Part 1: (7) Linear Classifiers Part 2: (8) Jupyter Notebook, Numpy and Scikit-Learn: (9) Running the Random Linear Classifier Algorithm in Python: (10) The oldest ML model - Perceptron: (11) Coding the Perceptron: (12) Perceptron Convergence Theorem: (13) Magic of features in Machine Learning: (14) One hot encoding: (15) Logistic Regression Part 1: (16) Cross Entropy Loss: (17) How gradient descent works: (18) Logistic Regression from scratch in Python: (19) Introduction to Regularization: (20) Implementing Regularization in Python: (21) Linear Regression Introduction: (22) Ordinary Least Squares step by step implementation: (23) Ridge regression fundamentals and intuition: (24) Regression recap for interviews: (25) Neural network architecture in 30 minutes: (26) Backpropagation intuition: (27) Neural network activation functions: (28) Momentum in gradient descent: (29) Hands on neural network training in Python: (30) Introduction to Convolutional Neural Networks (CNNs):

a playlist of 30 youtube videos to learn machine learning fundamentals from scratch if you're struggling on where to start learning ML, this list goes this "Machine Learning: Teach by Doing" is a solid choice to learn both theory and code. (1) Introduction to Machine Learning Teach by Doing: (2) What is Machine Learning? History of Machine Learning: (3) Types of ML Models: (4) 6 steps of any ML project: (5) Install Python and VSCode and run your first code: (6) Linear Classifiers Part 1: (7) Linear Classifiers Part 2: (8) Jupyter Notebook, Numpy and Scikit-Learn: (9) Running the Random Linear Classifier Algorithm in Python: (10) The oldest ML model - Perceptron: (11) Coding the Perceptron: (12) Perceptron Convergence Theorem: (13) Magic of features in Machine Learning: (14) One hot encoding: (15) Logistic Regression Part 1: (16) Cross Entropy Loss: (17) How gradient descent works: (18) Logistic Regression from scratch in Python: (19) Introduction to Regularization: (20) Implementing Regularization in Python: (21) Linear Regression Introduction: (22) Ordinary Least Squares step by step implementation: (23) Ridge regression fundamentals and intuition: (24) Regression recap for interviews: (25) Neural network architecture in 30 minutes: (26) Backpropagation intuition: (27) Neural network activation functions: (28) Momentum in gradient descent: (29) Hands on neural network training in Python: (30) Introduction to Convolutional Neural Networks (CNNs):

ℏεsam

117,570 просмотров • 1 год назад

$TAO is tackling the core challenges in machine learning, from optimizing data collection to faster inference, aiming to outperform giants like OpenAI and Microsoft. With a focus on computational efficiency and robust models, the race is on.

$TAO is tackling the core challenges in machine learning, from optimizing data collection to faster inference, aiming to outperform giants like OpenAI and Microsoft. With a focus on computational efficiency and robust models, the race is on.

Grayscale

88,595 просмотров • 1 год назад

New Book & Video Series!!! (late 2025) Optimization Bootcamp: Applications in Machine Learning, Control, and Inverse Problems Comment for a sneak peak to help proofread and I'll DM (proof reading, typos, HW problems, all get acknowledgment in book!)

New Book & Video Series!!! (late 2025) Optimization Bootcamp: Applications in Machine Learning, Control, and Inverse Problems Comment for a sneak peak to help proofread and I'll DM (proof reading, typos, HW problems, all get acknowledgment in book!)

Steven Brunton

102,952 просмотров • 11 месяцев назад

Early careers roles at Williams are still live. We are on a long-term mission to return to the front of the grid and are looking for the brightest and best talent to help us get there. Visit our careers page to explore opportunities and apply ⬇️

Early careers roles at Williams are still live. We are on a long-term mission to return to the front of the grid and are looking for the brightest and best talent to help us get there. Visit our careers page to explore opportunities and apply ⬇️

Atlassian Williams Racing

220,441 просмотров • 1 год назад

Today we’re introducing SceneScript, a novel method for reconstructing environments and representing the layout of physical spaces from Reality Labs at Meta Research. Details ➡️ SceneScript is able to directly infer a room’s geometry using end-to-end machine learning and represent it using language. Compared to previous approaches, this results in representations of physical scenes that are compact, complete, interpretable and extensible.

Today we’re introducing SceneScript, a novel method for reconstructing environments and representing the layout of physical spaces from Reality Labs at Meta Research. Details ➡️ SceneScript is able to directly infer a room’s geometry using end-to-end machine learning and represent it using language. Compared to previous approaches, this results in representations of physical scenes that are compact, complete, interpretable and extensible.

AI at Meta

334,377 просмотров • 2 лет назад

if you're struggling on where to start learning ML, here’s a playlist of 30 youtube videos to learn machine learning fundamentals from scratch "Machine Learning: Teach by Doing" is a solid choice to learn both theory and code. (1) Introduction to Machine Learning Teach by Doing: (2) What is Machine Learning? History of Machine Learning: (3) Types of ML Models: (4) 6 steps of any ML project: (5) Install Python and VSCode and run your first code: (6) Linear Classifiers Part 1: (7) Linear Classifiers Part 2: (8) Jupyter Notebook, Numpy and Scikit-Learn: (9) Running the Random Linear Classifier Algorithm in Python: (10) The oldest ML model - Perceptron: (11) Coding the Perceptron: (12) Perceptron Convergence Theorem: (13) Magic of features in Machine Learning: (14) One hot encoding: (15) Logistic Regression Part 1: (16) Cross Entropy Loss: (17) How gradient descent works: (18) Logistic Regression from scratch in Python: (19) Introduction to Regularization: (20) Implementing Regularization in Python: (21) Linear Regression Introduction: (22) Ordinary Least Squares step by step implementation: (23) Ridge regression fundamentals and intuition: (24) Regression recap for interviews: (25) Neural network architecture in 30 minutes: (26) Backpropagation intuition: (27) Neural network activation functions: (28) Momentum in gradient descent: (29) Hands on neural network training in Python: (30) Introduction to Convolutional Neural Networks (CNNs):

if you're struggling on where to start learning ML, here’s a playlist of 30 youtube videos to learn machine learning fundamentals from scratch "Machine Learning: Teach by Doing" is a solid choice to learn both theory and code. (1) Introduction to Machine Learning Teach by Doing: (2) What is Machine Learning? History of Machine Learning: (3) Types of ML Models: (4) 6 steps of any ML project: (5) Install Python and VSCode and run your first code: (6) Linear Classifiers Part 1: (7) Linear Classifiers Part 2: (8) Jupyter Notebook, Numpy and Scikit-Learn: (9) Running the Random Linear Classifier Algorithm in Python: (10) The oldest ML model - Perceptron: (11) Coding the Perceptron: (12) Perceptron Convergence Theorem: (13) Magic of features in Machine Learning: (14) One hot encoding: (15) Logistic Regression Part 1: (16) Cross Entropy Loss: (17) How gradient descent works: (18) Logistic Regression from scratch in Python: (19) Introduction to Regularization: (20) Implementing Regularization in Python: (21) Linear Regression Introduction: (22) Ordinary Least Squares step by step implementation: (23) Ridge regression fundamentals and intuition: (24) Regression recap for interviews: (25) Neural network architecture in 30 minutes: (26) Backpropagation intuition: (27) Neural network activation functions: (28) Momentum in gradient descent: (29) Hands on neural network training in Python: (30) Introduction to Convolutional Neural Networks (CNNs):

ℏεsam

108,861 просмотров • 1 год назад

The theory of higher order topological dynamics, which combines multilevel interactions between discrete topology and nonlinear dynamics, has the potential to enhance our understanding of complex systems such as the functions of the nervous system, the development of next-generation machine learning and the creation of advanced nodal processing algorithms. An important and unexpected collective behavior of signal processing in multilevel nodal networks has been observed to lead to a synchronization and diffusion of the irrotational and the solenoidal components of the systems revealing a deep relation of these mechanisms with the complexity of discrete topology. The perspective of the preliminary study linked here offers insights into how topology morphs dynamics, how dynamics stem from topology and how topology evolves dynamically. 🔗

The theory of higher order topological dynamics, which combines multilevel interactions between discrete topology and nonlinear dynamics, has the potential to enhance our understanding of complex systems such as the functions of the nervous system, the development of next-generation machine learning and the creation of advanced nodal processing algorithms. An important and unexpected collective behavior of signal processing in multilevel nodal networks has been observed to lead to a synchronization and diffusion of the irrotational and the solenoidal components of the systems revealing a deep relation of these mechanisms with the complexity of discrete topology. The perspective of the preliminary study linked here offers insights into how topology morphs dynamics, how dynamics stem from topology and how topology evolves dynamically. 🔗

Maurizio Iβλἄ

40,749 просмотров • 1 год назад

Neural networks explained from first principles: how simple linear pieces combine into richer models, and why layered representations make machine learning more powerful. MIT OpenCourseWare, 6.036 Introduction to Machine Learning, Fall 2020.

Neural networks explained from first principles: how simple linear pieces combine into richer models, and why layered representations make machine learning more powerful. MIT OpenCourseWare, 6.036 Introduction to Machine Learning, Fall 2020.

tetsuo

10,720 просмотров • 16 дней назад

Quick demo of the voice to voice translation between English and Twi that you can try today on our web app. This one is from the perspective of a tourist visiting Ghana and needing to get around. Brought to you by the amazing team at Ghana Natural Language Processing (NLP) Algorine Research

Quick demo of the voice to voice translation between English and Twi that you can try today on our web app. This one is from the perspective of a tourist visiting Ghana and needing to get around. Brought to you by the amazing team at Ghana Natural Language Processing (NLP) Algorine Research

Paul Azunre

165,075 просмотров • 2 лет назад