Vaibhav (VB) Srivastav's banner
Vaibhav (VB) Srivastav's profile picture

Vaibhav (VB) Srivastav

@reach_vb47,727 subscribers

Bringing Codex to developers @OpenAI | ex @huggingface | F1 fan | Here for @at_sofdog’s wisdom | *opinions my own

Shorts

Let’s fucking goo!! DeepSeek R1 1.5B running FULLY LOCALLY in your browser at 60 tok/ sec powered by WebGPU🔥 Intelligence truly is too cheap to meter! ⚡️

Let’s fucking goo!! DeepSeek R1 1.5B running FULLY LOCALLY in your browser at 60 tok/ sec powered by WebGPU🔥 Intelligence truly is too cheap to meter! ⚡️

973,002 görüntüleme

HOLY SHITT, Sesame Labs just dropped CSM (Conversational Speech Model) - Apache 2.0 licensed! 💥 > Trained on 1 MILLION hours of data 🤯 > Contextually aware, emotionally intelligent speech > Voice cloning & watermarking > Ultra fast, real-time synthesis > Based on llama architecture & Mimi like decoder > Apache 2.0 licensed > Weights on the Hub So cool to see such a strong Speech backbone out in the wild! Kudos Sesame team! 🤗

Sensitive content

HOLY SHITT, Sesame Labs just dropped CSM (Conversational Speech Model) - Apache 2.0 licensed! 💥 > Trained on 1 MILLION hours of data 🤯 > Contextually aware, emotionally intelligent speech > Voice cloning & watermarking > Ultra fast, real-time synthesis > Based on llama architecture & Mimi like decoder > Apache 2.0 licensed > Weights on the Hub So cool to see such a strong Speech backbone out in the wild! Kudos Sesame team! 🤗

684,874 görüntüleme

LMAO Qwen 2.5 VL can perform Computer Use, out of the box, taking on OpenAI Operator HEAD ON! 🐐

LMAO Qwen 2.5 VL can perform Computer Use, out of the box, taking on OpenAI Operator HEAD ON! 🐐

192,921 görüntüleme

HOLY SHIT - generate 3D mesh from a single image in LESS THAN A SECOND 🤯

HOLY SHIT - generate 3D mesh from a single image in LESS THAN A SECOND 🤯

154,108 görüntüleme

Fuck yeah! MaskGCT - New open SoTA Text to Speech model! 🔥 > Zero-shot voice cloning > Emotional TTS > Trained on 100K hours of data > Long form synthesis > Variable speed synthesis > Bilingual - Chinese & English > Available on Hugging Face Fully non-autoregressive architecture: > Stage 1: Predicts semantic tokens from text, using tokens extracted from a speech self-supervised learning (SSL) model > Stage 2: Predicts acoustic tokens conditioned on the semantic tokens. Synthesised: "Would you guys personally like to have a fake fireplace, an electric one, in your house? Or would you rather have a real fireplace? Let me know down below. Okay everybody, that's all for today's video and I hope you guys learned a bunch of furniture vocabulary!" TTS scene keeps getting lit! 🐐

Fuck yeah! MaskGCT - New open SoTA Text to Speech model! 🔥 > Zero-shot voice cloning > Emotional TTS > Trained on 100K hours of data > Long form synthesis > Variable speed synthesis > Bilingual - Chinese & English > Available on Hugging Face Fully non-autoregressive architecture: > Stage 1: Predicts semantic tokens from text, using tokens extracted from a speech self-supervised learning (SSL) model > Stage 2: Predicts acoustic tokens conditioned on the semantic tokens. Synthesised: "Would you guys personally like to have a fake fireplace, an electric one, in your house? Or would you rather have a real fireplace? Let me know down below. Okay everybody, that's all for today's video and I hope you guys learned a bunch of furniture vocabulary!" TTS scene keeps getting lit! 🐐

139,061 görüntüleme

That's an ElevenLabs-level TTS, fully open-source, running on consumer devices!

That's an ElevenLabs-level TTS, fully open-source, running on consumer devices!

142,488 görüntüleme

Google released an app that allows you to run LLMs from Hugging Face, fully privately and 100% local 🔥 > Generate code on-the-fly > Chat with images > Supports multi-turn conversations > Choose any model from Hugging Face > Based on LiteRT 🔥 > Sign in with HF Support for iOS coming soon! - exciting times for LiteRT and LocalLlama community! 💥

Google released an app that allows you to run LLMs from Hugging Face, fully privately and 100% local 🔥 > Generate code on-the-fly > Chat with images > Supports multi-turn conversations > Choose any model from Hugging Face > Based on LiteRT 🔥 > Sign in with HF Support for iOS coming soon! - exciting times for LiteRT and LocalLlama community! 💥

64,514 görüntüleme

Introducing Distil-Whisper v3 ⚡ > ~50% less parameters and 6x faster than Large-v3. > More accurate than large-v3 on long-form synthesis. Available with 🦀 WebGPU, Whisper.cpp, Transformers, Faster-Whisper and Transformers.js support! Drop in; no changes are required! 🔥

Introducing Distil-Whisper v3 ⚡ > ~50% less parameters and 6x faster than Large-v3. > More accurate than large-v3 on long-form synthesis. Available with 🦀 WebGPU, Whisper.cpp, Transformers, Faster-Whisper and Transformers.js support! Drop in; no changes are required! 🔥

90,651 görüntüleme

This is RICULOUSLY good, TRELLIS 3D Generation model by Microsoft! 🔥 Generate high-quality 3D assets from text or image prompts. Supports various formats like Radiance Fields, 3D Gaussians, and meshes Available for FREE on Hugging Face!

This is RICULOUSLY good, TRELLIS 3D Generation model by Microsoft! 🔥 Generate high-quality 3D assets from text or image prompts. Supports various formats like Radiance Fields, 3D Gaussians, and meshes Available for FREE on Hugging Face!

62,433 görüntüleme

Whisper running on WatchOS! 🔥 > Powered by WhisperKit by @argmaxinc > Supports up to Whisper base > Leverages Neural Engine ⚡ > Three lines of code ;) > Works real-time! > MIT license Quite amazed by the speed with which Argmax is shipping. Possibly the fastest & reliable way to run Whisper on Apple devices!

Whisper running on WatchOS! 🔥 > Powered by WhisperKit by @argmaxinc > Supports up to Whisper base > Leverages Neural Engine ⚡ > Three lines of code ;) > Works real-time! > MIT license Quite amazed by the speed with which Argmax is shipping. Possibly the fastest & reliable way to run Whisper on Apple devices!

85,318 görüntüleme

BOOOOM! you can now use the latest DeepSeek Prover V2 directly on the model page powered by Novita AI 🔥 Open Source FTW! 💥

BOOOOM! you can now use the latest DeepSeek Prover V2 directly on the model page powered by Novita AI 🔥 Open Source FTW! 💥

31,260 görüntüleme

PaliGemma 2 running 100% local, on-device, powered by MLX 🔥

PaliGemma 2 running 100% local, on-device, powered by MLX 🔥

30,370 görüntüleme

Let’s fucking goooo, starting today you can directly try out AI models on FREE Colab notebooks from Hugging Face 🔥 Continuing with our mission to make AI accessible to the masses - we’re excited to support Colaboratory for fast exploration and rapid prototyping! BONUS: you can put a custom “notebook.ipynb” in your model repo and we’ll serve that directly!

Let’s fucking goooo, starting today you can directly try out AI models on FREE Colab notebooks from Hugging Face 🔥 Continuing with our mission to make AI accessible to the masses - we’re excited to support Colaboratory for fast exploration and rapid prototyping! BONUS: you can put a custom “notebook.ipynb” in your model repo and we’ll serve that directly!

21,171 görüntüleme

NEW: You can now use Dia 1.6B SoTA Text-to-Speech model directly on Hugging Face via fal 🔥 You can get up-to 25 generations for less than a dollar 🤗 Run it 5 lines of code too: import requests API_URL = " co/fal-ai/fal-ai/dia-tts" headers = { "Authorization": f"Bearer ", } def query(payload): response = headers=headers, json=payload) url = response.json()["audio"]["url"] return requests.get(url).content audio = query( { "text": "[S1] hey hey, whats up", }) That's it try it out! 🤗

NEW: You can now use Dia 1.6B SoTA Text-to-Speech model directly on Hugging Face via fal 🔥 You can get up-to 25 generations for less than a dollar 🤗 Run it 5 lines of code too: import requests API_URL = " co/fal-ai/fal-ai/dia-tts" headers = { "Authorization": f"Bearer ", } def query(payload): response = headers=headers, json=payload) url = response.json()["audio"]["url"] return requests.get(url).content audio = query( { "text": "[S1] hey hey, whats up", }) That's it try it out! 🤗

19,912 görüntüleme

Videos

reach_vb's profile picture

new pet, who dis?

Vaibhav (VB) Srivastav

20,398 görüntüleme • 16 gün önce