
Andrew Feldman
@andrewdfeldman • 28,441 subscribers
CEO and Founder @Cerebras (NASDAQ: CBRS) where we build the fastest AI infrastructure in the world.
Shorts
Videos

Today Cerebras went public. We are traded on the Nasdaq. Our ticker CBRS. I could not be more proud:
Andrew Feldman118,693 Aufrufe • vor 1 Monat

Fast tokens are the most valuable tokens. Because inference speed is productivity. And Cerebras makes the fastest tokens in the world. Google DeepMind's Gemma 4 runs at 1,500+ tokens per second on Cerebras — more than an order of magnitude faster than on GPUs and 15x faster than Claude Haiku 4.5. Private preview today. GA later this month. Build blazing fast multimodal apps.
Andrew Feldman11,840 Aufrufe • vor 10 Tagen

In 2019, Cerebras solved the hardest problem in the computer industry - and nobody cared. I sat down with Elad Gil and sarah guo to talk through the 10 year journey of building Cerebras into a public company: Some highlights: - How we signed a $20B+ deal with OpenAI in 4.5 weeks - The 2-year stretch where we were burning $8m/month and couldn't even turn our chip on - How we solved one of the hardest problems in computing and for years, no one gave a sh*t - What open source is really doing to the AI ecosystem Full interview below
Andrew Feldman27,471 Aufrufe • vor 28 Tagen

.Cerebras just went public in the largest semis IPO ever. I sat down with Harry Stebbings on 20VC to talk about where the industry goes next: We covered: - Why the AI infrastructure "bubble" narrative is wrong - The biggest bottleneck in AI compute - and why it may last for years - My take on NVIDIA's strategy behind funding the neo clouds - What visionaries like Sam and Elon see that everyone else doesn’t. - What’s actually blocking enterprise AI adoption Link below.
Andrew Feldman16,210 Aufrufe • vor 1 Monat

People ask what faster AI actually means for them. Here’s a concrete example. In the video, GLM-4.6 on Cerebras builds Space Invaders in ~15% of the time it takes Claude Sonnet 4.5 Thinking. The point isn’t speed for its own sake. When latency drops far enough, new classes of workflows become possible. This is like Netflix. Netflix didn’t build a $400 billion business because it mailed DVDs faster. It's entire business changed. It became a movie studio. This transformation was made possible when streaming became fast and reliable enough that people stopped thinking about buffering, downloads, and storage altogether. At that point, behavior changed: people browsed, clicked, abandoned, re-tried - without friction. Fast inference does the same thing for AI. When responses are slow, you design around waiting.
Andrew Feldman32,957 Aufrufe • vor 6 Monaten
Keine weiteren Inhalte verfügbar