Tech with Mak's banner

Tech with Mak

@techNmak • 40,769 subscribers

AI, coding, software, and whatever’s on my mind. | 📩 for Collaborations

Shorts

Software Engineers when they hear "you will be replaced by AI"

Software Engineers when they hear "you will be replaced by AI"

167,608 次观看

Web Scraping is dead. Web Agenting is here. Writing selectors (div > .class > span) breaks every time a site updates. Building custom bots for every new target is a waste of engineering hours. TinyFish turns the entire real-time web into a single API. Input: Natural Language ("Find availability for X"). Target: 1 or 100 URLs. Output: Structured JSON. This isn't a simulation. It visits the Real-Time Web. 1./ One API, Many Sites - Same contract whether you hit 1 URL or 50. You focus on the Goal (Business Logic). TinyFish handles the How (Navigation, Clicks, Inputs). 2./ Real Automation - It doesn't just "read." It interacts. It fills forms, navigates multi-step flows, and handles dynamic JS content. 3./ Production Ready - This is the same infrastructure used by large enterprises, now exposed as a clean developer primitive. Logs, error handling, and structured data are built-in. The web is finally a proper API.

Web Scraping is dead. Web Agenting is here. Writing selectors (div > .class > span) breaks every time a site updates. Building custom bots for every new target is a waste of engineering hours. TinyFish turns the entire real-time web into a single API. Input: Natural Language ("Find availability for X"). Target: 1 or 100 URLs. Output: Structured JSON. This isn't a simulation. It visits the Real-Time Web. 1./ One API, Many Sites - Same contract whether you hit 1 URL or 50. You focus on the Goal (Business Logic). TinyFish handles the How (Navigation, Clicks, Inputs). 2./ Real Automation - It doesn't just "read." It interacts. It fills forms, navigates multi-step flows, and handles dynamic JS content. 3./ Production Ready - This is the same infrastructure used by large enterprises, now exposed as a clean developer primitive. Logs, error handling, and structured data are built-in. The web is finally a proper API.

119,568 次观看

🚨 This is the best way to learn how LLMs work. Interactive. 3D. Step-by-step. Covers: → Embedding → Layer Norm → Self-Attention → MLP → Transformer layers → Softmax → Output Stop reading papers. Start seeing. Link in comments. Save this immediately.

🚨 This is the best way to learn how LLMs work. Interactive. 3D. Step-by-step. Covers: → Embedding → Layer Norm → Self-Attention → MLP → Transformer layers → Softmax → Output Stop reading papers. Start seeing. Link in comments. Save this immediately.

82,205 次观看

Microsoft made 100B parameter models run on a single CPU. bitnet.cpp: The official inference framework for 1-bit LLMs. The math behind 1-bit LLMs is what makes them revolutionary. Traditional LLMs use 16-bit floating point weights. Every parameter is a number like 0.0023847 or -1.4729. When you run inference, you multiply these floats together. Billions of times. That's why you need GPUs, they're optimized for floating point matrix multiplication. BitNet b1.58 uses ternary weights: {-1, 0, 1}. That's not a simplification. That's a fundamental change in the math. When your weights are only -1, 0, or 1: → Multiply by 1 = keep the value → Multiply by -1 = flip the sign → Multiply by 0 = skip entirely Matrix multiplication becomes addition and subtraction. No floating point operations. No GPU required. This is why bitnet.cpp achieves: → 2.37x to 6.17x speedup on x86 CPUs → 1.37x to 5.07x speedup on ARM CPUs → 71.9% to 82.2% energy reduction on x86 → 55.4% to 70.0% energy reduction on ARM The speedups scale with model size. Larger models see bigger gains because there are more operations to simplify. A 100B parameter model running at human reading speed (5-7 tokens/second) on a single CPU. That's not optimization. That's a different paradigm. Why 1.58 bits? Because log₂(3) ≈ 1.58. Three possible values = 1.58 bits of information per weight. The key insight: These models aren't quantized after training. They're trained from scratch with ternary weights. The model learns to work within the constraint. No precision loss. No quality tradeoff.

Microsoft made 100B parameter models run on a single CPU. bitnet.cpp: The official inference framework for 1-bit LLMs. The math behind 1-bit LLMs is what makes them revolutionary. Traditional LLMs use 16-bit floating point weights. Every parameter is a number like 0.0023847 or -1.4729. When you run inference, you multiply these floats together. Billions of times. That's why you need GPUs, they're optimized for floating point matrix multiplication. BitNet b1.58 uses ternary weights: {-1, 0, 1}. That's not a simplification. That's a fundamental change in the math. When your weights are only -1, 0, or 1: → Multiply by 1 = keep the value → Multiply by -1 = flip the sign → Multiply by 0 = skip entirely Matrix multiplication becomes addition and subtraction. No floating point operations. No GPU required. This is why bitnet.cpp achieves: → 2.37x to 6.17x speedup on x86 CPUs → 1.37x to 5.07x speedup on ARM CPUs → 71.9% to 82.2% energy reduction on x86 → 55.4% to 70.0% energy reduction on ARM The speedups scale with model size. Larger models see bigger gains because there are more operations to simplify. A 100B parameter model running at human reading speed (5-7 tokens/second) on a single CPU. That's not optimization. That's a different paradigm. Why 1.58 bits? Because log₂(3) ≈ 1.58. Three possible values = 1.58 bits of information per weight. The key insight: These models aren't quantized after training. They're trained from scratch with ternary weights. The model learns to work within the constraint. No precision loss. No quality tradeoff.

23,036 次观看

Videos

Anya Rossi

sweetdream.ai

SweetDream.ai•Sponsored•Livecam

Watch Anya Live

Anya is streaming live right now! Join her private show and enjoy exclusive content.

Exclusive private shows

1.2k viewers online

Private Show

Join now for exclusive access

Free preview available • Premium content

The math behind getting 10% better every week. Nothing happens for a while. Then everyone calls you lucky.

The math behind getting 10% better every week. Nothing happens for a while. Then everyone calls you lucky.

250,539 次观看 • 12 天前

Most people first see Euler’s Formula as a strange equation in a textbook. Then years later, they realize it quietly powers the modern world. Leonhard Euler discovered that these seemingly unrelated mathematical ideas: → exponential growth → imaginary numbers → sine waves → cosine waves → rotation … are all deeply connected through one identity: e^(iθ) = cos(θ) + i·sin(θ) At first glance it looks impossible. How can an exponential function suddenly produce circles and waves? The key insight is that multiplying by a complex exponential creates rotation. As the angle θ changes: - the cosine term tracks horizontal motion - the sine term tracks vertical motion - together they trace a perfect circle in the complex plane Euler showed that waves and rotation are mathematically the same phenomenon viewed differently. That single idea changed science and engineering forever. Today, Euler’s Formula sits underneath: → Fourier Transforms → signal processing → wireless communication → MRI scanners → quantum mechanics → electrical engineering → audio compression → radar systems → GPS → neural network frequency analysis Even modern AI systems indirectly rely on mathematics built on top of these foundations. The famous special case is Euler’s Identity: e^(iπ) + 1 = 0 Richard Feynman reportedly called it: "our jewel." Because it revealed that mathematics is not a collection of separate topics. It is one connected language describing reality itself.

Most people first see Euler’s Formula as a strange equation in a textbook. Then years later, they realize it quietly powers the modern world. Leonhard Euler discovered that these seemingly unrelated mathematical ideas: → exponential growth → imaginary numbers → sine waves → cosine waves → rotation … are all deeply connected through one identity: e^(iθ) = cos(θ) + i·sin(θ) At first glance it looks impossible. How can an exponential function suddenly produce circles and waves? The key insight is that multiplying by a complex exponential creates rotation. As the angle θ changes: - the cosine term tracks horizontal motion - the sine term tracks vertical motion - together they trace a perfect circle in the complex plane Euler showed that waves and rotation are mathematically the same phenomenon viewed differently. That single idea changed science and engineering forever. Today, Euler’s Formula sits underneath: → Fourier Transforms → signal processing → wireless communication → MRI scanners → quantum mechanics → electrical engineering → audio compression → radar systems → GPS → neural network frequency analysis Even modern AI systems indirectly rely on mathematics built on top of these foundations. The famous special case is Euler’s Identity: e^(iπ) + 1 = 0 Richard Feynman reportedly called it: "our jewel." Because it revealed that mathematics is not a collection of separate topics. It is one connected language describing reality itself.

25,383 次观看 • 2 个月前

Unpopular opinion: VS Code is holding back AI coding. I’ve been spinning up multiple Cline instances across tmux panes and terminal tabs. Each agent has its own isolated state and can run a different task, branch, or idea in parallel. They just keep going while I focus on something else, no context collisions, no babysitting a single chat thread. You can pipe input, chain output, or run them headless in CI/CD. It fits into the workflow I already use in the terminal. Getting started literally took me 30 seconds: npm install -g cline AI is finally just another process in my workflow, instead of a panel or plugin.

Unpopular opinion: VS Code is holding back AI coding. I’ve been spinning up multiple Cline instances across tmux panes and terminal tabs. Each agent has its own isolated state and can run a different task, branch, or idea in parallel. They just keep going while I focus on something else, no context collisions, no babysitting a single chat thread. You can pipe input, chain output, or run them headless in CI/CD. It fits into the workflow I already use in the terminal. Getting started literally took me 30 seconds: npm install -g cline AI is finally just another process in my workflow, instead of a panel or plugin.

43,090 次观看 • 5 个月前

没有更多内容可加载