
Leland McInnes
@leland_mcinnes • 6,589 subscribers
A mathematician dabbling in the world of data science. Researcher at the Tutte Institute for Mathematics and Computing. UMAP, HDBSCAN, PyNNDescent, DataMapPlot.
Shorts
Videos

EVoC is a library designed specifically for fast clustering of high dimensional embedding vectors. It can produce high quality clusters extremely efficiently, and requires little to no hyperparameter tuning. Better clustering than UMAP + HDBSCAN; faster clustering than KMeans.
Leland McInnes25,761 просмотров • 2 месяцев назад

Explore Wikipedia through a data map. Pages are grouped by semantic similarity, for topic clusters. Hover to see details, zoom to explore fine-grained topics, click to go to a page. Search page names to find interesting starting points for exploration.🧵
Leland McInnes55,703 просмотров • 11 месяцев назад
Больше нет контента для загрузки