
SemiAnalysis
@SemiAnalysis_ • 93,698 subscribers
Shorts
Videos

Jim Cramer "There is a company that I regard as the gospel, SemiAnalysis. I don't think people realize that SemiAnalysis is the arbiter. They're like God in the semis, and when they do, when they bless something. They are the most honest guys I've come across"
SemiAnalysis512,640 Aufrufe • vor 2 Monaten

After studying 300 Leetcode Hards, solving every Jane Street puzzle from the Dwarkesh ads, and watching one Horace He lecture, he finally landed the $400k annualized Jane Street internship. Unfortunately, during onboarding his manager said “this diff is negative alpha,” so Jane Street deployed an AI model to translate all feedback into HR-safe speech in real time.
SemiAnalysis134,572 Aufrufe • vor 23 Tagen

Technical breakdown of tokenizer improvements from GPT 4.6 to 4.7
SemiAnalysis37,784 Aufrufe • vor 18 Tagen

Long-term memory agreements have historically been the sign of the top. Micron prints incredible earnings, stock sells off. Everyone rotates out. But the prepayment terms and pricing floors this cycle look nothing like prior rounds. Our Core Research team breaks down why they're still bullish.
SemiAnalysis95,485 Aufrufe • vor 2 Monaten

It has become extremely trendy among some SF AI researchers to donate to shrimp welfare. They estimate that they help improve the welfare of 1,500 shrimps per year for every $1 donated. Why do they donate to shrimps? They claim that it is the most cost-effective way of reducing suffering of sentient beings. Note that the Shrimp Welfare non-profit does not actually prevent shrimps from being killed but instead promotes the use of electrical stunning as a more humane slaughtering method that aligns with the goal of reducing shrimp suffering.
SemiAnalysis264,319 Aufrufe • vor 6 Monaten

Here's what nobody is telling you about memory companies: hedge funds with billion dollar positions are flying blind until earnings calls reveal demand collapsed. Smart money is paying for real-time shipment data from Korea, Taiwan, China because getting caught in a cycle turn costs more than the tracking fee.
SemiAnalysis60,621 Aufrufe • vor 1 Monat

At GTC 2024, Jensen said that GB200 NVL72 was 35x faster than Hopper. Nobody believed it and thought it was classic fake Jensen Math. When we tested the performance of it, it wasn't just 35x faster, it was over 50x times faster even against an strong Hopper baseline with all of the inference optimization composed together like MTP, Disagg prefill, wideEP, etc. View the nuanced results at InferenceX dot com.
SemiAnalysis60,803 Aufrufe • vor 1 Monat

296 hackers. 18,432 B200 GPU hours. $180K in compute credits. 60,000mg of caffeine. We kicked off GTC with Fluidstack for From Silicon to Scale — the most full-stack AI hackathon ever built. 48 hours. 1 DGX Spark signed by Jensen. Here's what happened.(1/7)🧵
SemiAnalysis69,870 Aufrufe • vor 2 Monaten

Jensen surfing a wave with the American flag is the most accurate metaphor for what's actually happening right now. A kid immigrates from the country of Taiwan, washes dishes at Denny's, and builds a $3 trillion company that every nation on earth is now begging for access to. He is the definition of the American dream. In order for America to strive, Earth needs to be build on American standards
SemiAnalysis45,343 Aufrufe • vor 1 Monat

As we wrote in our TPUv7 article, TPUv7 will have competitive perf per TCO verus Blackwell in 2025 for large labs but TPUv8 will have worse perf per TCO verus Rubin in EOY 2026/2027. We added an subway surfer video as some people don't have the attention span to read an 10,000 word article. There will be 2 versions of TPU v8 but this will be different from the “P” (full) and “E” (lite) SKUs that featured in the then 4th and 5th generations. Rather, it is a dual track with one SKU co-designed with Broadcom (TPU 8AX ) and one SKU co-designed with MediaTek (TPU 8X). The strategy behind Google choosing to partner with MediaTek is to reduce the margin that they pay to their silicon design partners or specifically Broadcom. Broadcom charges Google for the whole System in Package which they stack a healthy margin on. This includes HBM. This is despite Google being largely responsible for both the front-end and back-end design of the compute elements of the chip, with Broadcom contributing various PHYs (most importantly Broadcom’s best in class SerDes) and controllers. Google wants to move to a model where their silicon partner only charges for the things they add value on and get closer to paying BOM cost for their silicon.
SemiAnalysis133,077 Aufrufe • vor 5 Monaten

A common topic of discussion amongst SF Bay Area AI researchers is debating whether they should move to New York City. Something NYC has that the Bay Area does not have is the traditional Mexican frozen treat called nieves, run by Fidel Cortés Jr. and his family at a street-side stand. They hand-swirl freshly blended juice made from real fruits in large wooden barrels that are packed with ice and salt for around two hours.
SemiAnalysis133,489 Aufrufe • vor 6 Monaten