Lior Alexander's banner
Lior Alexander's profile picture

Lior Alexander

@LiorOnAI115,524 subscribers

Founder @AlphaSignalAI → the Intelligence layer of AI (300k users) • MIT Lecturer • ex-MILA researcher • In ML since GANs

Shorts

With only one line of code, you can get access to Google Open Buildings, the largest building dataset, for any country.

With only one line of code, you can get access to Google Open Buildings, the largest building dataset, for any country.

500,709 Aufrufe

A team just made OpenAI Whisper 6x faster, 49% smaller, while keeping 99% of the accuracy. The model is already available on the HuggingFace Transformers library: model_id = "distil-whisper/distil-large-v2" You can also use their web UI to transcribe from URLs, files, or audio recordings. Model: Demo: Paper: Sasha Rush

A team just made OpenAI Whisper 6x faster, 49% smaller, while keeping 99% of the accuracy. The model is already available on the HuggingFace Transformers library: model_id = "distil-whisper/distil-large-v2" You can also use their web UI to transcribe from URLs, files, or audio recordings. Model: Demo: Paper: Sasha Rush

500,670 Aufrufe

NVIDIA finally released Neuralangelo's source code! The model can turn videos from any device into detailed 3D structures, fully replicating buildings, sculptures, or other real aworld objects or spaces virtually. Here's how it works: A model utilizes a 2D video with multiple angles of an object or scene. I selects frames from different viewpoints to understand depth, size, and shape. The AI creates an initial 3D representation, similar to a sculptor shaping a subject. The render is optimized to enhance details, like a sculptor refining texture. The outcome is a 3D object or scene suitable for virtual reality, digital twins, or robotics.

NVIDIA finally released Neuralangelo's source code! The model can turn videos from any device into detailed 3D structures, fully replicating buildings, sculptures, or other real aworld objects or spaces virtually. Here's how it works: A model utilizes a 2D video with multiple angles of an object or scene. I selects frames from different viewpoints to understand depth, size, and shape. The AI creates an initial 3D representation, similar to a sculptor shaping a subject. The render is optimized to enhance details, like a sculptor refining texture. The outcome is a 3D object or scene suitable for virtual reality, digital twins, or robotics.

478,001 Aufrufe

Anthropic might've just solved Prompt Engineering. Their new "Prompt Generator" tool can turn simple descriptions into advanced prompts optimized for LLMs.

Anthropic might've just solved Prompt Engineering. Their new "Prompt Generator" tool can turn simple descriptions into advanced prompts optimized for LLMs.

261,587 Aufrufe

This is a sneak peak into the future of medicine.. GlassAI launched an LLM-based tool capable of generating a diagnosis or clinical plan based on symptoms. Also, ChatGPT recently passed the US Medical Licensing Exam. Demo: Glass Health

This is a sneak peak into the future of medicine.. GlassAI launched an LLM-based tool capable of generating a diagnosis or clinical plan based on symptoms. Also, ChatGPT recently passed the US Medical Licensing Exam. Demo: Glass Health

256,200 Aufrufe

NVIDIA just made Pandas 150x faster with zero code changes. All you have to do is: %load_ext cudf.pandas import pandas as pd Their RAPIDS library will automatically know if you're running on GPU or CPU and speed up your processing. You can try it here: Repo:

NVIDIA just made Pandas 150x faster with zero code changes. All you have to do is: %load_ext cudf.pandas import pandas as pd Their RAPIDS library will automatically know if you're running on GPU or CPU and speed up your processing. You can try it here: Repo:

194,195 Aufrufe

Meta just announced that Code Llama was now free for both research and commercial. This might the strongest competitor to ChatGPT: ▸ Can generate, explain, and debug your code ▸ Handles input 100,000 tokens ▸ Free for research + commercial use ▸ Outperforms most open models ▸ Comes in 7B, 13B, and 34B ▸ Supports Python, C++, Java, PHP, Typescript (Javascript), C#, and Bash Available in: ▸ Foundation base models (Code Llama) ▸ Python specializations (Code Llama - Python), ▸ Instruction-following models (Code Llama - Instruct)

Meta just announced that Code Llama was now free for both research and commercial. This might the strongest competitor to ChatGPT: ▸ Can generate, explain, and debug your code ▸ Handles input 100,000 tokens ▸ Free for research + commercial use ▸ Outperforms most open models ▸ Comes in 7B, 13B, and 34B ▸ Supports Python, C++, Java, PHP, Typescript (Javascript), C#, and Bash Available in: ▸ Foundation base models (Code Llama) ▸ Python specializations (Code Llama - Python), ▸ Instruction-following models (Code Llama - Instruct)

196,380 Aufrufe

You can run full browser automations for AI agents without worrying about Chrome, Puppeteer, or infrastructure. Steel is an open-source browser API that wraps Chrome, manages sessions, handles proxies, and exposes everything via a REST API or SDKs.

You can run full browser automations for AI agents without worrying about Chrome, Puppeteer, or infrastructure. Steel is an open-source browser API that wraps Chrome, manages sessions, handles proxies, and exposes everything via a REST API or SDKs.

67,928 Aufrufe

NVIDIA just released a very impressive text-to-video paper. Video Latent Diffusion Models (Video LDMs) use a diffusion model in a compressed latent space to generate high-resolution videos. Here's a brief overview of how it works: 1. Pre-train image LDM on a dataset of images. 2. Turn the image LDM into a Video LDM by adding temporal layers to model video frames. 3. Fine-tune the Video LDM on encoded video sequences to create a video generator. 4. Temporally align diffusion model upsamplers to generate high-resolution videos. 5. Validate Video LDM on real driving videos of 512x1024 resolution, achieving state-of-the-art performance. 6. Apply the approach in creative content creation with text-to-video modeling. Paper: Project:

NVIDIA just released a very impressive text-to-video paper. Video Latent Diffusion Models (Video LDMs) use a diffusion model in a compressed latent space to generate high-resolution videos. Here's a brief overview of how it works: 1. Pre-train image LDM on a dataset of images. 2. Turn the image LDM into a Video LDM by adding temporal layers to model video frames. 3. Fine-tune the Video LDM on encoded video sequences to create a video generator. 4. Temporally align diffusion model upsamplers to generate high-resolution videos. 5. Validate Video LDM on real driving videos of 512x1024 resolution, achieving state-of-the-art performance. 6. Apply the approach in creative content creation with text-to-video modeling. Paper: Project:

158,539 Aufrufe

JUST IN: Bard, Google's ChatGPT, is now available in the US and UK, with more countries to come. Waitlist:

JUST IN: Bard, Google's ChatGPT, is now available in the US and UK, with more countries to come. Waitlist:

101,806 Aufrufe

Quick tip, you can use pip-chill instead of pip freeze to get the packages you are actually using. I always wondered why my requirements.txt was so long. 𝚙𝚒𝚙 𝚒𝚗𝚜𝚝𝚊𝚕𝚕 𝚙𝚒𝚙-𝚌𝚑𝚒𝚕𝚕 𝚙𝚒𝚙-𝚌𝚑𝚒𝚕𝚕 >> 𝚛𝚎𝚚𝚞𝚒𝚛𝚎𝚖𝚎𝚗𝚝𝚜.𝚝𝚡𝚝

Quick tip, you can use pip-chill instead of pip freeze to get the packages you are actually using. I always wondered why my requirements.txt was so long. 𝚙𝚒𝚙 𝚒𝚗𝚜𝚝𝚊𝚕𝚕 𝚙𝚒𝚙-𝚌𝚑𝚒𝚕𝚕 𝚙𝚒𝚙-𝚌𝚑𝚒𝚕𝚕 >> 𝚛𝚎𝚚𝚞𝚒𝚛𝚎𝚖𝚎𝚗𝚝𝚜.𝚝𝚡𝚝

97,946 Aufrufe

Videos