Загрузка видео...

Не удалось загрузить видео

На главную

Don't do RAG - One counter intuitive thing I learnt past few weeks - Load whole knowledge base into prompt is actually fast, cheap & more accurate than normal RAG pipeline I was building an MCP for reading external doc, I was able to generate right code example by...

82,818 просмотров • 1 год назад •via X (Twitter)

Комментарии: 10

Фото профиля Mark Fer
Mark Fer1 год назад

What if you want to dive into thousands of sales calls, marketing content, etc?

Фото профиля Breadcrumb
Breadcrumb1 год назад

Looking to automate reporting? Use AI agents to turn spreadsheets to reports in minutes without any coding.

Фото профиля Imiël Visser
Imiël Visser1 год назад

Gemini has 1M context window vs Claude 200k... 50 pages is nothing, but what is the cost vs RAG for volume processing?

Фото профиля Brandon Culver
Brandon Culver1 год назад

No offense, but RAG isn't for 50 pages, it's for 500k pages

Фото профиля josh
josh1 год назад

Have you tried using Figma MCP? I’m curious if it’s possible to load prebuilt components that the AI can use during the build.

Фото профиля AJ
AJ1 год назад

this is a good stuff. I am thinking of is cursor is able to read npm packages well or not because with not so known packages its' did a really bad job manybe we can connect something like this to a package wdyt?

Фото профиля Ajay Jagota 🔑🏃‍♂️
Ajay Jagota 🔑🏃‍♂️1 год назад

This is brilliant TY 🙏 Love how context drives the performance and you can track it in helicone 👏

Фото профиля Sam Sung
Sam Sung1 год назад

Do you think this is the way to go for giving “memory” to AI? I’m working on a chatbot that I want to give memory, but worried that as conversations get longer (and users have more conversations) that the cost of processing it all will be too much (currently used Claude 3.5 sonnet with caching, but still it gets expensive - like 50 cents for a single, long conversation). I’ve been wondering how I can figure out this memory issue - RAG? Create summaries with cheaper models?

Фото профиля Kris
Kris1 год назад

We are working on big 1GB+ pdf DB. I think OCR once. Summarize to 50 pages. Send that plus relevant files to Gemini to answer in super context? Wanna join our Discord?

Фото профиля Boring traveler
Boring traveler1 год назад

How about in local?

Похожие видео

New course: MCP: Build Rich-Context AI Apps with Anthropic. Learn to build AI apps that access tools, data, and prompts using the Model Context Protocol in this short course, created in partnership with Anthropic Anthropic and taught by Elie Schoppik Elie Schoppik, its Head of Technical Education. Connecting AI applications to external systems that bring rich context to LLM-based applications has often meant writing custom integrations for each use case. MCP is an open protocol that standardizes how LLMs access tools, data, and prompts from external sources, and simplifies how you provide context to your LLM-based applications. For example, you can provide context via third-party tools that let your LLM make API calls to search the web, access data from local docs, retrieve code from a GitHub repo, and so on. MCP, developed by Anthropic, is based on a client-server architecture that defines the communication details between an MCP client, hosted inside the AI application, and an MCP server that exposes tools, resources, and prompt templates. The server can be a subprocess launched by the client that runs locally or an independent process running remotely. In this hands-on course, you'll learn the core architecture behind MCP. You’ll create an MCP-compatible chatbot, build and deploy an MCP server, and connect the chatbot to your MCP server and other open-source servers. Here’s what you’ll do: - Understand why MCP makes AI development less fragmented and standardizes connections between AI applications and external data sources - Learn the core components of the client-server architecture of MCP and the underlying communication mechanism - Build a chatbot with custom tools for searching academic papers, and transform it into an MCP-compatible application - Build a local MCP server that exposes tools, resources, and prompt templates using FastMCP, and test it using MCP Inspector - Create an MCP client inside your chatbot to dynamically connect to your server - Connect your chatbot to reference servers built by Anthropic’s MCP team, such as filesystem, which implements filesystem operations, and fetch, which extracts contents from the web as markdown - Configure Claude Desktop to connect to your server and others, and explore how it abstracts away the low-level logic of MCP clients - Deploy your MCP server remotely and test it with the Inspector or other MCP-compatible applications - Learn about the roadmap for future MCP development, such as multi-agent architecture, MCP registry API, server discovery, authorization, and authentication MCP is an exciting and important technology that lets you build rich-context AI applications that connect to a growing ecosystem of MCP servers, with minimal integration work. Please sign up here!

Andrew Ng

142,010 просмотров • 1 год назад

I built an agent that answers machine-learning questions. It's autonomous, and the best part is that I built the whole thing without writing a single line of Python code. Here is what I did and how I did it: Over a year ago, a friend and I built a site that publishes multi-choice questions. You get a new one every day. I decided to have GPT-3.5 answer questions. Here is what I needed to build: 1. Connect to the site's API to retrieve today's question 2. Extract the question and the potential choices 3. Connect to OpenAI's API and ask GPT-3.5 to answer the question 4. Parse the answer from the model 5. Submit the answer back to the API to get the score Not difficult. Likely several hours of work. But I didn't have to write any code. I built the whole thing by dragging and dropping components using Vellum is a YC-backed platform for developers to build LLM applications. They are the only ones I've seen offering this functionality. They sponsored this post, and their team helped me with all my questions while I built this. I created a workflow. The platform supports several node types to build whatever you have in mind. I show how I put the whole thing together in the attached video. The only code I had to write was a few lines of Jinja to parse and transform the API and the LLM results. There are three lessons I want to share from this experience: First, the best possible code is the one you didn't write. I'm a big fan of no-code tools because they help me materialize my ideas fast. They help product people, designers, and no coders collaborate on the solution. Second, Large Language Models are sensitive to how you prompt them. Small changes to prompts can make a big difference in results. This is more pronounced when you are building a multi-step workflow. Third, automated testing and evaluation for prompts is critical. There aren't many companies thinking about this. They'll have a hard time moving from a demo phase. The attached video will show you what I did.

Santiago

309,825 просмотров • 2 лет назад