Daily Dose of Data Science's banner
Daily Dose of Data Science's profile picture

Daily Dose of Data Science

@DailyDoseOfDS_48,029 subscribers

Delivering daily insights in DS, ML, RAGs, Agents & AI Engineering. Trusted by over 100k+ readers!

Shorts

K-Means is simple. Making it fast on GPU isn't. Flash-KMeans is an IO-aware implementation of exact k-means that rethinks the algorithm around modern GPU bottlenecks. By attacking the memory bottlenecks directly, Flash-KMeans achieves: - 30x speedup over cuML - 200x speedup over FAISS Using the same exact algorithm, just engineered for today’s hardware. At the million-scale, Flash-KMeans can complete a k-means iteration in milliseconds. Here's why this matters today: K-means has always been an offline primitive. Something you run once to preprocess data and move on. These speedups change that. ↳ Vector databases like FAISS use k-means to build search indices. Faster k-means means you can re-index dynamically as data changes, not batch it overnight. ↳ LLM quantization methods need k-means to find optimal weight codebooks, per layer, repeatedly. What takes hours could now take minutes. ↳ MoE models need fast token routing at inference time. Millisecond k-means makes it viable to run this inside the inference loop, not just in preprocessing. The 200x over FAISS is the number to internalize. FAISS is the industry standard. Most production vector search systems sit on top of it. Link to the paper and code in next tweet!

K-Means is simple. Making it fast on GPU isn't. Flash-KMeans is an IO-aware implementation of exact k-means that rethinks the algorithm around modern GPU bottlenecks. By attacking the memory bottlenecks directly, Flash-KMeans achieves: - 30x speedup over cuML - 200x speedup over FAISS Using the same exact algorithm, just engineered for today’s hardware. At the million-scale, Flash-KMeans can complete a k-means iteration in milliseconds. Here's why this matters today: K-means has always been an offline primitive. Something you run once to preprocess data and move on. These speedups change that. ↳ Vector databases like FAISS use k-means to build search indices. Faster k-means means you can re-index dynamically as data changes, not batch it overnight. ↳ LLM quantization methods need k-means to find optimal weight codebooks, per layer, repeatedly. What takes hours could now take minutes. ↳ MoE models need fast token routing at inference time. Millisecond k-means makes it viable to run this inside the inference loop, not just in preprocessing. The 200x over FAISS is the number to internalize. FAISS is the industry standard. Most production vector search systems sit on top of it. Link to the paper and code in next tweet!

23,748 görüntüleme

NN-SVG: Create neural network architecture drawings parametrically! Export them to SVG files and use them in your work!

NN-SVG: Create neural network architecture drawings parametrically! Export them to SVG files and use them in your work!

109,688 görüntüleme

NN-SVG: Create neural network architecture drawings parametrically! Export them to SVG files and use them in your work!

NN-SVG: Create neural network architecture drawings parametrically! Export them to SVG files and use them in your work!

70,805 görüntüleme

Visualizing complex data in Python just got easier! Meet Cosmograph for Python 🪐: The widget brings GPU-accelerated, interactive layout graph rendering right inside your Jupyter notebooks. Here’s why it’s a game-changer: ⚡ GPU-accelerated performance ⛓️ Interactive network exploration with pan, zoom, hover & selection ⚙️ Rich configuration APIs for layout, color, size & more 📦 Seamless notebook integration & easy Python installation But that’s not all: ✅ Force-directed simulations for dynamic layouts ✅ Smooth handling of large-scale networks ✅ Minimal setup—just pip install cosmograph Link to the repo in next tweet! ______ Follow us → Daily Dose of Data Science ✔️ For more insights & tutorials on AI and Machine Learning.

Visualizing complex data in Python just got easier! Meet Cosmograph for Python 🪐: The widget brings GPU-accelerated, interactive layout graph rendering right inside your Jupyter notebooks. Here’s why it’s a game-changer: ⚡ GPU-accelerated performance ⛓️ Interactive network exploration with pan, zoom, hover & selection ⚙️ Rich configuration APIs for layout, color, size & more 📦 Seamless notebook integration & easy Python installation But that’s not all: ✅ Force-directed simulations for dynamic layouts ✅ Smooth handling of large-scale networks ✅ Minimal setup—just pip install cosmograph Link to the repo in next tweet! ______ Follow us → Daily Dose of Data Science ✔️ For more insights & tutorials on AI and Machine Learning.

27,283 görüntüleme

Videos

Daha fazla içerik yok.