Andrew Feldman's banner

Andrew Feldman

@andrewdfeldman • 28,441 subscribers

CEO and Founder @Cerebras (NASDAQ: CBRS) where we build the fastest AI infrastructure in the world.

Shorts

Today, we reported Cerebras's financial results for Q1 FY 2026. This was our first quarterly earnings report as a public company. We remain laser focused on execution and on delivering the speed and performance our customers need. Read the report:

Today, we reported Cerebras's financial results for Q1 FY 2026. This was our first quarterly earnings report as a public company. We remain laser focused on execution and on delivering the speed and performance our customers need. Read the report:

14,927 Aufrufe

Videos

Anya Rossi

sweetdream.ai

SweetDream.ai•Sponsored•Livecam

Watch Anya Live

Anya is streaming live right now! Join her private show and enjoy exclusive content.

Exclusive private shows

1.2k viewers online

Private Show

Join now for exclusive access

Free preview available • Premium content

Today Cerebras went public. We are traded on the Nasdaq. Our ticker CBRS. I could not be more proud:

Today Cerebras went public. We are traded on the Nasdaq. Our ticker CBRS. I could not be more proud:

118,693 Aufrufe • vor 1 Monat

Today, we reported Cerebras's financial results for Q1 FY 2026. This was our first quarterly earnings report as a public company. We remain laser focused on execution and on delivering the speed and performance our customers need. Read the report:

Today, we reported Cerebras's financial results for Q1 FY 2026. This was our first quarterly earnings report as a public company. We remain laser focused on execution and on delivering the speed and performance our customers need. Read the report:

14,927 Aufrufe • vor 5 Tagen

Fast tokens are the most valuable tokens. Because inference speed is productivity. And Cerebras makes the fastest tokens in the world. Google DeepMind's Gemma 4 runs at 1,500+ tokens per second on Cerebras — more than an order of magnitude faster than on GPUs and 15x faster than Claude Haiku 4.5. Private preview today. GA later this month. Build blazing fast multimodal apps.

Fast tokens are the most valuable tokens. Because inference speed is productivity. And Cerebras makes the fastest tokens in the world. Google DeepMind's Gemma 4 runs at 1,500+ tokens per second on Cerebras — more than an order of magnitude faster than on GPUs and 15x faster than Claude Haiku 4.5. Private preview today. GA later this month. Build blazing fast multimodal apps.

11,840 Aufrufe • vor 10 Tagen

In 2019, Cerebras solved the hardest problem in the computer industry - and nobody cared. I sat down with Elad Gil and sarah guo to talk through the 10 year journey of building Cerebras into a public company: Some highlights: - How we signed a $20B+ deal with OpenAI in 4.5 weeks - The 2-year stretch where we were burning $8m/month and couldn't even turn our chip on - How we solved one of the hardest problems in computing and for years, no one gave a sh*t - What open source is really doing to the AI ecosystem Full interview below

In 2019, Cerebras solved the hardest problem in the computer industry - and nobody cared. I sat down with Elad Gil and sarah guo to talk through the 10 year journey of building Cerebras into a public company: Some highlights: - How we signed a $20B+ deal with OpenAI in 4.5 weeks - The 2-year stretch where we were burning $8m/month and couldn't even turn our chip on - How we solved one of the hardest problems in computing and for years, no one gave a sh*t - What open source is really doing to the AI ecosystem Full interview below

27,471 Aufrufe • vor 28 Tagen

Just a little bit faster than everyone else on OpenAI GPT-OSS-120B. > 10 times faster than the leading NVIDIA GPU provider

Just a little bit faster than everyone else on OpenAI GPT-OSS-120B. > 10 times faster than the leading NVIDIA GPU provider

124,175 Aufrufe • vor 10 Monaten

.Cerebras just went public in the largest semis IPO ever. I sat down with Harry Stebbings on 20VC to talk about where the industry goes next: We covered: - Why the AI infrastructure "bubble" narrative is wrong - The biggest bottleneck in AI compute - and why it may last for years - My take on NVIDIA's strategy behind funding the neo clouds - What visionaries like Sam and Elon see that everyone else doesn’t. - What’s actually blocking enterprise AI adoption Link below.

.Cerebras just went public in the largest semis IPO ever. I sat down with Harry Stebbings on 20VC to talk about where the industry goes next: We covered: - Why the AI infrastructure "bubble" narrative is wrong - The biggest bottleneck in AI compute - and why it may last for years - My take on NVIDIA's strategy behind funding the neo clouds - What visionaries like Sam and Elon see that everyone else doesn’t. - What’s actually blocking enterprise AI adoption Link below.

16,210 Aufrufe • vor 1 Monat

.Cerebras is pretty fast at Llama 4... 3.5X faster than the next nearest competitor and 20X faster than NVIDIA GPU based solutions.

.Cerebras is pretty fast at Llama 4... 3.5X faster than the next nearest competitor and 20X faster than NVIDIA GPU based solutions.

89,186 Aufrufe • vor 1 Jahr

People ask what faster AI actually means for them. Here’s a concrete example. In the video, GLM-4.6 on Cerebras builds Space Invaders in ~15% of the time it takes Claude Sonnet 4.5 Thinking. The point isn’t speed for its own sake. When latency drops far enough, new classes of workflows become possible. This is like Netflix. Netflix didn’t build a $400 billion business because it mailed DVDs faster. It's entire business changed. It became a movie studio. This transformation was made possible when streaming became fast and reliable enough that people stopped thinking about buffering, downloads, and storage altogether. At that point, behavior changed: people browsed, clicked, abandoned, re-tried - without friction. Fast inference does the same thing for AI. When responses are slow, you design around waiting.

People ask what faster AI actually means for them. Here’s a concrete example. In the video, GLM-4.6 on Cerebras builds Space Invaders in ~15% of the time it takes Claude Sonnet 4.5 Thinking. The point isn’t speed for its own sake. When latency drops far enough, new classes of workflows become possible. This is like Netflix. Netflix didn’t build a $400 billion business because it mailed DVDs faster. It's entire business changed. It became a movie studio. This transformation was made possible when streaming became fast and reliable enough that people stopped thinking about buffering, downloads, and storage altogether. At that point, behavior changed: people browsed, clicked, abandoned, re-tried - without friction. Fast inference does the same thing for AI. When responses are slow, you design around waiting.

32,957 Aufrufe • vor 6 Monaten

We are just a little bit faster than@nvidia GPUs on Qwen 235B...18X faster. inference is blazing fast...come build cool stuff on Cerebras inference

We are just a little bit faster than@nvidia GPUs on Qwen 235B...18X faster. inference is blazing fast...come build cool stuff on Cerebras inference

44,760 Aufrufe • vor 11 Monaten

We are pretty fast on AI at Meta's Llama 4 Maverick model. More than twice as fast as NVIDIA recently published Blackwell Performance according third party benchmarking firm Artificial Analysis.

We are pretty fast on AI at Meta's Llama 4 Maverick model. More than twice as fast as NVIDIA recently published Blackwell Performance according third party benchmarking firm Artificial Analysis.

30,203 Aufrufe • vor 1 Jahr

Keine weiteren Inhalte verfügbar