Video wird geladen...

Video konnte nicht geladen werden

Beim Laden dieses Videos ist ein Problem aufgetreten. Dies könnte an einem vorübergehenden Netzwerkproblem liegen oder das Video ist möglicherweise nicht verfügbar.

What does it actually take to run A/B testing at nearly a billion-user scale? Prudhvi Vatala, Head of Engineering Platforms at Snap Inc., explains how his team migrated 10+ petabytes of daily data processing to GPU-accelerated pipelines on Google Cloud — cutting job costs by 76% and memory footprint... show more

NVIDIA

2,558,943 subscribers

49,991 Aufrufe • vor 1 Monat •via X (Twitter)

Nachrichten & Politik Wissenschaft & Technologie Finanzen

Anya Rossi• Live Now

Private livecam show

0 Kommentare

Keine Kommentare verfügbar

Kommentare vom Original-Post werden hier angezeigt

Ähnliche Videos

Slash AI costs by 99%, run 1000+ models on a single GPU! OpenLoRA lets you run thousands of models on one GPU instead of one per model, drastically cutting costs and boosting flexibility. Excited to work with Aethir, Hyperbolic for infrastructure and for verifiability. Watch the demo:

Slash AI costs by 99%, run 1000+ models on a single GPU! OpenLoRA lets you run thousands of models on one GPU instead of one per model, drastically cutting costs and boosting flexibility. Excited to work with Aethir, Hyperbolic for infrastructure and for verifiability. Watch the demo:

OpenLedger

155,267 Aufrufe • vor 1 Jahr

What Lisa Su actually held on stage: A mini PC the size of a lunchbox running Qwen3-235B locally, with no cloud and no discrete GPU Inside: the Ryzen AI Max+ 395, 128GB unified memory, 110GB usable as VRAM on Linux The first x86 chip that handles 200+ billion parameters on a single die AMD claims it beats the RTX 5080 by several times on memory-bound models — because the 5080 simply cannot fit them $1,400 to $2,500 once. cloud bills run $200 to $400 a month It pays for itself in a few months, then costs nothing per request This is not a faster GPU. it is the first real argument that your AI does not belong in someone else's data center

What Lisa Su actually held on stage: A mini PC the size of a lunchbox running Qwen3-235B locally, with no cloud and no discrete GPU Inside: the Ryzen AI Max+ 395, 128GB unified memory, 110GB usable as VRAM on Linux The first x86 chip that handles 200+ billion parameters on a single die AMD claims it beats the RTX 5080 by several times on memory-bound models — because the 5080 simply cannot fit them $1,400 to $2,500 once. cloud bills run $200 to $400 a month It pays for itself in a few months, then costs nothing per request This is not a faster GPU. it is the first real argument that your AI does not belong in someone else's data center

plutos

1,224,683 Aufrufe • vor 7 Tagen

💡Instead of moving data to AI, why not bring AI to your data? Feifei Li, SVP and President of International Business at Alibaba Cloud Intelligence Group, explains how embedding LLMs into databases, and processing and managing multimodal data with platforms, is transforming how enterprises build intelligent apps. It’s a smarter, faster, and more scalable way to unlock real-time insights.

💡Instead of moving data to AI, why not bring AI to your data? Feifei Li, SVP and President of International Business at Alibaba Cloud Intelligence Group, explains how embedding LLMs into databases, and processing and managing multimodal data with platforms, is transforming how enterprises build intelligent apps. It’s a smarter, faster, and more scalable way to unlock real-time insights.

Alibaba Group

84,657 Aufrufe • vor 6 Monaten

👀 NEW feature: RAPIDS AI cuDF supports up to 2.1B rows of text data. ⚡ Watch #pandas code with large strings get GPU-accelerated up to 30x with zero code changes. Try the notebook ➡️ #DataScience, #Python, #RAPIDS

👀 NEW feature: RAPIDS AI cuDF supports up to 2.1B rows of text data. ⚡ Watch #pandas code with large strings get GPU-accelerated up to 30x with zero code changes. Try the notebook ➡️ #DataScience, #Python, #RAPIDS

NVIDIA AI Developer

19,098 Aufrufe • vor 1 Jahr

How does Bilibili manage millions of videos and user interactions daily? Feifei Li, SVP and President of International Business at Alibaba Cloud Intelligence Group, shares how their AI-ready data foundation helps Bilibili turn multimedia data into real-time insights, streamline workflows, and scale with intelligence 🚀

How does Bilibili manage millions of videos and user interactions daily? Feifei Li, SVP and President of International Business at Alibaba Cloud Intelligence Group, shares how their AI-ready data foundation helps Bilibili turn multimedia data into real-time insights, streamline workflows, and scale with intelligence 🚀

Alibaba Group

208,288 Aufrufe • vor 7 Monaten

🚨 OpenAI's head of engineering just revealed how they actually code in 2026. 80 minutes. free. on Lenny's Podcast. watch it. bookmark it. 95% of their engineers use Codex daily. you still type prompts one at a time. then read the article below.

🚨 OpenAI's head of engineering just revealed how they actually code in 2026. 80 minutes. free. on Lenny's Podcast. watch it. bookmark it. 95% of their engineers use Codex daily. you still type prompts one at a time. then read the article below.

Movez

221,621 Aufrufe • vor 1 Monat

10 years of "it can't be done..." 7 NVIDIA GPU architectures... 5 years of RAPIDS AI... 3 years of Voltron Data... finally a petabyte-scale GPU-native engine that DOESN'T require you to change your data pipelines. Same code, same data formats, just modular, interoperable, composable, extensible... and of course ACCELERATED! Theseus is the Scalable Performant And Compute Efficient engine🔥🔥🔥 Check out our benchmarks and new webpage... and reach out if you're struggling with queries above 30TBs.

10 years of "it can't be done..." 7 NVIDIA GPU architectures... 5 years of RAPIDS AI... 3 years of Voltron Data... finally a petabyte-scale GPU-native engine that DOESN'T require you to change your data pipelines. Same code, same data formats, just modular, interoperable, composable, extensible... and of course ACCELERATED! Theseus is the Scalable Performant And Compute Efficient engine🔥🔥🔥 Check out our benchmarks and new webpage... and reach out if you're struggling with queries above 30TBs.

Josh Patterson

16,962 Aufrufe • vor 2 Jahren

Feed the ad platforms with data-driven ads that actually work. Omneky connects to your Meta, Google, and TikTok data to generate beautiful, on-brand, and highly effective creatives in minutes. It's not just about looking good—it's about brute forcing performance with A/B testing creatives at scale. Drive sales and conversions without the guesswork. Try Omneky today.

Omneky

54,032 Aufrufe • vor 7 Monaten

Cloud query costs can add up fast. Samyak Jain of 99acres.com shares how his team monitors data scans in AWS Athena and reduces costs with Spark pipelines. Links below. #AI #Automation #Airflow #MachineLearning

Cloud query costs can add up fast. Samyak Jain of 99acres.com shares how his team monitors data scans in AWS Athena and reduces costs with Spark pipelines. Links below. #AI #Automation #Airflow #MachineLearning

Astronomer

805,273 Aufrufe • vor 11 Monaten

Boris Cherny, the creator of Claude Code: "Most people use Claude Code as a chatbot. It was built to run an entire engineering team." In 30 minutes on stage, he explains how he now works by directing agents instead of writing code by hand. Watch full video, then save the exact setup below👇

Boris Cherny, the creator of Claude Code: "Most people use Claude Code as a chatbot. It was built to run an entire engineering team." In 30 minutes on stage, he explains how he now works by directing agents instead of writing code by hand. Watch full video, then save the exact setup below👇

darkzodchi

35,506 Aufrufe • vor 19 Tagen

Google Cloud AI engineer just showed how they go from idea to deployed app at Google in 30-minutes using Claude. 26-minutes. free. by Google AI team. one person + Claude + Google Cloud = a full engineering org running on a laptop. worth more than any $500 vibe-coding course.

Google Cloud AI engineer just showed how they go from idea to deployed app at Google in 30-minutes using Claude. 26-minutes. free. by Google AI team. one person + Claude + Google Cloud = a full engineering org running on a laptop. worth more than any $500 vibe-coding course.

Movez

1,756,634 Aufrufe • vor 1 Monat

What does it take to produce spacecraft at scale? At Rocket Lab, it takes expert teams, vertical integration, and a whole lot of precision. From structure and harnesses to full spacecraft assembly and testing—our team builds it all. Go behind the scenes with us 👇 📽️ Watch the full video

What does it take to produce spacecraft at scale? At Rocket Lab, it takes expert teams, vertical integration, and a whole lot of precision. From structure and harnesses to full spacecraft assembly and testing—our team builds it all. Go behind the scenes with us 👇 📽️ Watch the full video

Rocket Lab

54,486 Aufrufe • vor 11 Monaten

Energy efficiency in LLM inference has improved 100,000x in the past 10 years — demonstrating that accelerated computing is sustainable computing. Josh Parker, head of sustainability at NVIDIA, explains how. #ClimateWeekNYC Learn more:

Energy efficiency in LLM inference has improved 100,000x in the past 10 years — demonstrating that accelerated computing is sustainable computing. Josh Parker, head of sustainability at NVIDIA, explains how. #ClimateWeekNYC Learn more:

NVIDIA

32,252 Aufrufe • vor 9 Monaten

What a week at #GoogleCloudNext in Las Vegas ✨ From sessions to showfloor demos to networking events, we were excited to partner with Google Cloud to advance how the AI ecosystem can build, deploy, and innovate in the cloud. Looking forward to working together to advance AI agents, physical AI, and beyond.

What a week at #GoogleCloudNext in Las Vegas ✨ From sessions to showfloor demos to networking events, we were excited to partner with Google Cloud to advance how the AI ecosystem can build, deploy, and innovate in the cloud. Looking forward to working together to advance AI agents, physical AI, and beyond.

NVIDIA

32,762 Aufrufe • vor 1 Monat

$👋 Say hello to Roo Code's new provider: IO Intelligence, powered by io.net's decentralized GPU network. Run Llama, DeepSeek, Qwen, Mistral, and more through one unified API to get high performance at a fraction of typical cloud costs. Set up in 15s:$

👋 Say hello to Roo Code's new provider: IO Intelligence, powered by io.net's decentralized GPU network. Run Llama, DeepSeek, Qwen, Mistral, and more through one unified API to get high performance at a fraction of typical cloud costs. Set up in 15s:

Roomote

15,894 Aufrufe • vor 10 Monaten

Introducing the NVIDIA Project DIGITS personal AI supercomputer, powered by the GB10 Grace Blackwell Superchip: Project DIGITS enables developers to prototype, fine-tune and inference models locally and seamlessly deploy at scale to the data center or cloud. #CES2025 #NVIDIAGraceBlackwell

Introducing the NVIDIA Project DIGITS personal AI supercomputer, powered by the GB10 Grace Blackwell Superchip: Project DIGITS enables developers to prototype, fine-tune and inference models locally and seamlessly deploy at scale to the data center or cloud. #CES2025 #NVIDIAGraceBlackwell

NVIDIA

51,983 Aufrufe • vor 1 Jahr

The universe is constantly unfolding—sending back more data and imagery than any team of scientists could ever interpret alone. AI and accelerated computing are changing that, helping researchers unlock discoveries at a scale and speed that wasn't possible before. 🔭 This #AstronomyDay, we're celebrating the scientists from UC Santa Cruz, Brant Robertson and his colleagues are turning cosmic complexity into real discovery, and making it available to everyone.

The universe is constantly unfolding—sending back more data and imagery than any team of scientists could ever interpret alone. AI and accelerated computing are changing that, helping researchers unlock discoveries at a scale and speed that wasn't possible before. 🔭 This #AstronomyDay, we're celebrating the scientists from UC Santa Cruz, Brant Robertson and his colleagues are turning cosmic complexity into real discovery, and making it available to everyone.

NVIDIA

24,456 Aufrufe • vor 2 Monaten

What MrBeast said. Sidekick changes entrepreneurship. It understands your store, Shopify, and commerce at scale. An AI cofounder in your pocket. Data, growth, ops. And everything else you need to run a business. Personalized. With voice mode, you run your business from your phone. Decisions in seconds. Scale in weeks. Powered by 20 years of Shopify intelligence. Only we could build this.

What MrBeast said. Sidekick changes entrepreneurship. It understands your store, Shopify, and commerce at scale. An AI cofounder in your pocket. Data, growth, ops. And everything else you need to run a business. Personalized. With voice mode, you run your business from your phone. Decisions in seconds. Scale in weeks. Powered by 20 years of Shopify intelligence. Only we could build this.

Harley Finkelstein

43,832 Aufrufe • vor 4 Monaten

Chamath said AI is not like the internet. Every new user costs real money. And the infrastructure making it possible was built by everyone. His argument was the clearest case for government ownership of AI labs I have ever heard. And it had nothing to do with Bernie Sanders. Start with the internet comparison. Google and Facebook became the most profitable companies in human history because of one number. The marginal cost of adding a new user was effectively zero. One more search query cost Google nothing. One more Facebook profile cost Meta nothing. They could serve a billion people and the incremental cost of that billion person was rounding error. That is the money printer. Infinite scale at zero marginal cost. AI breaks that model completely. Every single user taxes a GPU. Every query costs electricity. Every response requires memory and compute. The marginal cost of AI is real, significant, and does not disappear at scale. You cannot print money the same way. Then Chamath made the point that landed hardest. The infrastructure these companies depend on, the power grid, the land, the data centers, the permitting, the national security apparatus that protects their chips from being stolen, none of that was built by Anthropic or OpenAI. It was built by the public. By taxpayers. By decades of government investment in the physical and legal foundation these companies are now running on. He compared it to the interstate highway system. If the federal government built the roads and two companies transported all the goods on them, a logical question at that point would be how much of that should I own? You are riding on my rails. His conclusion was direct. If he were running a sovereign wealth fund and had the negotiating leverage of the US government, he would own 75% of these companies when he was done. The internet had zero marginal cost. That is why the founders captured almost all of the value. AI has real marginal cost and runs on public infrastructure. That changes who has a claim on what gets built. WATCH THE FULL PODCAST ON The All-In Podcast

Ihtesham Ali

78,381 Aufrufe • vor 7 Tagen

Chamath Palihapitiya just dropped the number that explains the entire AI infrastructure trade (Save this). A gigawatt of compute now costs $100 billion and when he started his Arizona data center project it was $4 to $5 billion, it has gone up 20x in a single investment cycle. The implication is not just that AI infrastructure is expensive but rather that the capital barrier to owning meaningful compute has become so high that only a handful of entities in the world can actually build it and the companies who got there early are sitting on what may be the most durable pricing power in the history of the technology industry. This is the neocloud trade. The neocloud market, purpose-built GPU cloud providers like CoreWeave, Nebius, and Lambda Labs was worth $35 billion in 2026 and is projected to reach $236 billion by 2031, compounding at 46% annually. For context, that is faster growth than cloud computing itself posted in its first decade. The reason is very simple, hyperscalers like AWS, Azure, and Google are building for everything, storage, databases, enterprise software, networking and their GPU pricing reflects the overhead of that full-stack infrastructure. Neoclouds build for one thing only, AI compute. The result is a 60% to 85% cost advantage on the same Nvidia silicon, bare metal H100s at $0.78 to $2.79 per GPU-hour on a neocloud versus $3.43 to $5.07 per GPU-hour on a hyperscaler. That spread does not close as AI demand scales but rather it widens, because hyperscalers have to amortize legacy infrastructure and margin expectations that neoclouds do not carry. Gartner projects that by 2030, neoclouds will capture 20% of the $267 billion AI cloud market, and Vultr's own analysis says at least 80% of GPU market share by end of 2026 will be held by a small group of scaled neocloud providers. Now zoom into Nebius specifically, because it is the most interesting publicly traded proxy for this trade. Nebius is the infrastructure arm of the former Yandex Russia's equivalent of Google rebuilt from the ground up after Russia's invasion of Ukraine by Arkady Volozh and relisted on Nasdaq in October 2024. The team that built it already knew how to run internet-scale infrastructure at the lowest possible cost, which is exactly the operational DNA a neocloud requires. In Q1 2026, Nebius reported revenue of $399 million and already generating serious cash on a young business with revenue growing nearly eightfold year-over-year. Then in March 2026, Meta signed a five-year infrastructure agreement with Nebius worth up to $27 billion, $12 billion in committed dedicated GPU capacity deployments beginning early 2027, plus up to $15 billion more tied to Meta purchasing Nebius's unsold third-party capacity. The deal will be executed on one of the first large-scale deployments of Nvidia's Vera Rubin platform, the next-generation architecture after Blackwell making Nebius one of a tiny number of operators in the world with confirmed priority access to the most advanced AI hardware available. Following the contract, Nebius guided to $7 to $9 billion in annualized recurring revenue for 2026 representing 540% year-over-year growth. Chamath Palihapitiya point about the $100 billion capital moat is the bear case for new entrants and the bull case for incumbents. No one can afford to build the next CoreWeave or Nebius from scratch at current hardware and power costs. The companies that are already built, already contracted, and already deploying Nvidia's latest silicon have a moat that compounds with every GPU generation cycle because they get allocations first, they deploy fastest, and their customers re-sign rather than wait for a new operator that does not yet exist. Come join Milk Road Pro for our full breakdown, the complete neocloud competitive landscape, how to think about Nebius's valuation versus CoreWeave and AI entire thesis. Link below.

Chamath Palihapitiya just dropped the number that explains the entire AI infrastructure trade (Save this). A gigawatt of compute now costs $100 billion and when he started his Arizona data center project it was $4 to $5 billion, it has gone up 20x in a single investment cycle. The implication is not just that AI infrastructure is expensive but rather that the capital barrier to owning meaningful compute has become so high that only a handful of entities in the world can actually build it and the companies who got there early are sitting on what may be the most durable pricing power in the history of the technology industry. This is the neocloud trade. The neocloud market, purpose-built GPU cloud providers like CoreWeave, Nebius, and Lambda Labs was worth $35 billion in 2026 and is projected to reach $236 billion by 2031, compounding at 46% annually. For context, that is faster growth than cloud computing itself posted in its first decade. The reason is very simple, hyperscalers like AWS, Azure, and Google are building for everything, storage, databases, enterprise software, networking and their GPU pricing reflects the overhead of that full-stack infrastructure. Neoclouds build for one thing only, AI compute. The result is a 60% to 85% cost advantage on the same Nvidia silicon, bare metal H100s at $0.78 to $2.79 per GPU-hour on a neocloud versus $3.43 to $5.07 per GPU-hour on a hyperscaler. That spread does not close as AI demand scales but rather it widens, because hyperscalers have to amortize legacy infrastructure and margin expectations that neoclouds do not carry. Gartner projects that by 2030, neoclouds will capture 20% of the $267 billion AI cloud market, and Vultr's own analysis says at least 80% of GPU market share by end of 2026 will be held by a small group of scaled neocloud providers. Now zoom into Nebius specifically, because it is the most interesting publicly traded proxy for this trade. Nebius is the infrastructure arm of the former Yandex Russia's equivalent of Google rebuilt from the ground up after Russia's invasion of Ukraine by Arkady Volozh and relisted on Nasdaq in October 2024. The team that built it already knew how to run internet-scale infrastructure at the lowest possible cost, which is exactly the operational DNA a neocloud requires. In Q1 2026, Nebius reported revenue of $399 million and already generating serious cash on a young business with revenue growing nearly eightfold year-over-year. Then in March 2026, Meta signed a five-year infrastructure agreement with Nebius worth up to $27 billion, $12 billion in committed dedicated GPU capacity deployments beginning early 2027, plus up to $15 billion more tied to Meta purchasing Nebius's unsold third-party capacity. The deal will be executed on one of the first large-scale deployments of Nvidia's Vera Rubin platform, the next-generation architecture after Blackwell making Nebius one of a tiny number of operators in the world with confirmed priority access to the most advanced AI hardware available. Following the contract, Nebius guided to $7 to $9 billion in annualized recurring revenue for 2026 representing 540% year-over-year growth. Chamath Palihapitiya point about the $100 billion capital moat is the bear case for new entrants and the bull case for incumbents. No one can afford to build the next CoreWeave or Nebius from scratch at current hardware and power costs. The companies that are already built, already contracted, and already deploying Nvidia's latest silicon have a moat that compounds with every GPU generation cycle because they get allocations first, they deploy fastest, and their customers re-sign rather than wait for a new operator that does not yet exist. Come join Milk Road Pro for our full breakdown, the complete neocloud competitive landscape, how to think about Nebius's valuation versus CoreWeave and AI entire thesis. Link below.

Milk Road AI

137,646 Aufrufe • vor 7 Tagen