
Fully local Code Assistant running on an NVIDIA GPU! In this tutorial, I'll show you how to run Llama3 using TensorRT and NVIDIA's Triton Inference Server, and how to use it as a Code Assistant in VSCode. In this thread 🧵, I'll walk you through the integration process, explaining each step simply...

42,152 views • 2 years ago • via X (Twitter)

11 Comments

Daniel San • 2 years ago

To get started, we need an @nvidia GPU 🤩. In this case, we will use the following hardware 💻

Daniel San • 2 years ago

We need Docker and CUDA installed. Follow the guides below to install both tools.
Docker:
CUDA:
Then run the following commands to confirm everything is set up correctly.
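The verification commands from the original post are not available here. As a rough stand-in, the sketch below checks that the usual command-line tools are on the PATH and exit cleanly. The tool list (`docker`, `nvidia-smi`, `nvcc`) is an assumption about what was run, not a transcription, and `check_tools` is a hypothetical helper name.

```python
import shutil
import subprocess

# Assumed commands; the exact ones from the thread are not available.
VERSION_COMMANDS = {
    "docker": ["docker", "--version"],      # Docker CLI
    "nvidia-smi": ["nvidia-smi"],           # NVIDIA driver and visible GPUs
    "nvcc": ["nvcc", "--version"],          # CUDA toolkit compiler
}


def check_tools(commands=VERSION_COMMANDS):
    """Return {tool: bool}: True if the tool is on PATH and exits with code 0."""
    results = {}
    for name, argv in commands.items():
        if shutil.which(argv[0]) is None:
            results[name] = False
            continue
        try:
            proc = subprocess.run(argv, capture_output=True, text=True, timeout=30)
        except (OSError, subprocess.TimeoutExpired):
            results[name] = False
            continue
        results[name] = proc.returncode == 0
    return results
```

Calling `check_tools()` returns a dictionary such as `{"docker": True, ...}`. A `False` entry means that tool is either not installed or not working, which is worth fixing before moving on.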

Daniel San • 2 years ago

Download the llama3-8B model from @huggingface
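One way to fetch the weights is the `huggingface_hub` Python package. The sketch below is an assumption about this step, not the author's command. The repository id `meta-llama/Meta-Llama-3-8B-Instruct` is a guess at which 8B variant was used, and the `models` target folder and helper names are hypothetical. Meta's Llama 3 repositories are gated, so the download needs an access token from an account that has accepted the license.

```python
import os
from pathlib import Path

# Assumption: the thread does not say which 8B variant was downloaded.
REPO_ID = "meta-llama/Meta-Llama-3-8B-Instruct"


def local_model_dir(repo_id, base_dir="models"):
    """Map a repo id such as 'org/name' to a local folder like models/name."""
    return Path(base_dir) / repo_id.split("/")[-1]


def download_model(repo_id=REPO_ID, base_dir="models", token=None):
    """Download every file of the repo into a local folder and return its path."""
    # Imported lazily so the path helper works without the package installed.
    from huggingface_hub import snapshot_download

    target = local_model_dir(repo_id, base_dir)
    snapshot_download(
        repo_id=repo_id,
        local_dir=str(target),
        # Falls back to the HF_TOKEN environment variable if no token is passed.
        token=token or os.environ.get("HF_TOKEN"),
    )
    return target
```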

Daniel San • 2 years ago

Now run TensorRT to compile the model using the Docker container. Clone the TensorRT repository and move the model folder into it.

Daniel San • 2 years ago

You should now be able to test the compiled model

Daniel San • 2 years ago

Perfect! We have the model now. Let's deploy it on Triton Inference Server.
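The launch commands from the original post are not reproduced here. Once the server has been started, a quick way to confirm it is accepting requests is Triton's HTTP readiness endpoint, `/v2/health/ready`. The sketch below assumes the server listens on `http://localhost:8000` (Triton's default HTTP port); adjust the base URL if your container maps a different port. The helper names are hypothetical.

```python
import urllib.error
import urllib.request


def health_url(base_url="http://localhost:8000"):
    """Build the readiness URL; the default host and port are assumptions."""
    return base_url.rstrip("/") + "/v2/health/ready"


def server_is_ready(base_url="http://localhost:8000", timeout=5.0):
    """Return True if the server answers the readiness probe with HTTP 200."""
    try:
        with urllib.request.urlopen(health_url(base_url), timeout=timeout) as resp:
            return resp.status == 200
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, or a non-2xx status all count as not ready.
        return False
```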

Daniel San • 2 years ago

The server is up and ready to connect with CodeGPT via the custom connection. Open CodeGPT in VSCode, select Custom as the provider, and enter "ensemble" for the model.
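Before wiring up the editor, it can help to send one request to the "ensemble" model by hand. The sketch below posts to Triton's HTTP generate endpoint. The base URL is an assumed default. The field names (`text_input`, `max_tokens`, `bad_words`, `stop_words`, `text_output`) follow the example ensemble shipped with the TensorRT-LLM backend, so treat them as assumptions if your model repository was configured differently. The function names are hypothetical.

```python
import json
import urllib.request


def build_generate_request(prompt, max_tokens=64,
                           base_url="http://localhost:8000", model="ensemble"):
    """Return (url, payload) for a single generate call."""
    url = f"{base_url.rstrip('/')}/v2/models/{model}/generate"
    payload = {
        "text_input": prompt,
        "max_tokens": max_tokens,
        "bad_words": "",
        "stop_words": "",
    }
    return url, payload


def generate(prompt, **kwargs):
    """Send the prompt to the server and return the generated text."""
    url, payload = build_generate_request(prompt, **kwargs)
    request = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=60) as resp:
        body = json.loads(resp.read().decode("utf-8"))
    # The output field name is assumed from the example ensemble config.
    return body.get("text_output", "")
```

With the server running, `generate("def fibonacci(n):", max_tokens=32)` should return a completion. If it does, the same host, port, and model name are what the Custom provider settings need.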

Daniel San • 2 years ago

That's all! I'm sharing the link to the full article with all the details of the tutorial

Alexander Mia • 1 year ago

INTRODUCING: Agentic Security - LLM Security Scanner! 🔍 🔑 Features: Scans for prompt injections, jailbreaking & more. Provides detailed reports & options to customize attack rules. 🔗 Access the GitHub link ↓

₣rancisco Trillo • 2 years ago

Or just use Continue and Ollama with whatever brand of GPU 🤷‍♂️ That's open source

Daniel San • 2 years ago

You can also use CodeGPT with Ollama. Check this link:
