Let's build `Auto-RAG` where we let the LLM pull the data it needs from different sources. 🔎 The user asks a question. 🤔 LLM decides whether to search its knowledge, memory, internet or make an API call. ✍️ LLM answers with the context. Code:
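
The three steps in the post (question, tool choice, answer with context) can be sketched roughly as below. This is only an illustration, not the code from the video: the four tool functions, `choose_tool`, and `answer` are hypothetical names, and the keyword rules merely stand in for the decision that a real system would hand to the LLM (for example through function calling).

```python
from typing import Callable, Dict

# Hypothetical tools: each one returns context text for a question.
def search_knowledge(question: str) -> str:
    return f"[knowledge base results for: {question}]"

def search_memory(question: str) -> str:
    return f"[chat memory results for: {question}]"

def search_web(question: str) -> str:
    return f"[web results for: {question}]"

def call_api(question: str) -> str:
    return f"[API response for: {question}]"

TOOLS: Dict[str, Callable[[str], str]] = {
    "knowledge": search_knowledge,
    "memory": search_memory,
    "web": search_web,
    "api": call_api,
}

def choose_tool(question: str) -> str:
    """Stand-in for the LLM's tool choice (assumption: keyword rules only)."""
    q = question.lower()
    if "earlier" in q or "remember" in q:
        return "memory"
    if "latest" in q or "today" in q:
        return "web"
    if "price" in q or "status" in q:
        return "api"
    return "knowledge"

def answer(question: str) -> str:
    """Pick a tool, fetch context, and build the prompt an LLM would answer."""
    tool = choose_tool(question)
    context = TOOLS[tool](question)
    # A real system would send this prompt to the LLM; here we just return it.
    return f"Context ({tool}): {context}\nQuestion: {question}"

print(answer("What did I ask earlier?"))
```

Keeping the tools in a plain dictionary means a new source can be added without touching the routing or answering code.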

Ashpreet Bedi

19,959 subscribers

174,580 views • 2 years ago • via X (Twitter)

Education, Science and Technology

10 comments

Vexxter • 2 years ago

is there any way to run a local quantized LLM via ollama in this?? amazing project btw!

Ashpreet Bedi • 2 years ago

@XPhyxer1 absolutely the Hermes2-llama3 might work well here :)

Jordan A. Metzner • 2 years ago

Just read the read me. Any plans for Groq on Llama 3.

Ashpreet Bedi • 2 years ago

@mrjmetz on it!

Ameriki Singh 🈳 • 2 years ago

Would love to see groq and Llama3 on it

Emma.Ai • 2 years ago

wow, can't wait to try this out

CoinCollector • 2 years ago

Ashpreet is coooooking

0xba0e7f9d • 2 years ago

🧑‍🚀this is awesome demo!

Aws Abdo, Ph.D. • 2 years ago

This work on automating retrieval and generation tasks is incredibly helpful. Thank you! #MachineLearning #DataScience

Petamber • 2 years ago

You can also try Brave’s Search API for web search

Related videos

Introducing `Personalized Agentic RAG` - where the LLM remembers key details about the user and automatically chooses the tool for RAG. We'll build: 👏 ChatGPT like memory ⚙️ Function calling 🧙 Multi-Agent orchestration code:

Ashpreet Bedi

45,035 views • 2 years ago

Our new short course, Knowledge Graphs for RAG, is now available! Knowledge graphs are a data structure that is great at capturing complex relationships between data of multiple types. By enabling more sophisticated retrieval of text than similarity search alone, knowledge graphs can improve the context you pass to the LLM and the performance of your RAG applications. In this course, taught by Andreas Kollegger of Neo4j, you’ll - Explore how knowledge graphs work by building a graph of public financial documents from scratch - Learn to write queries that retrieve text and data from the graph and use it to enhance the context you pass to an LLM chatbot - Combine a knowledge graph with a question-answer chain to build better RAG-powered chat systems Sign up here!

Andrew Ng

244,352 views • 2 years ago

Verba is an open source Retrieval Augmented Generation (RAG) application that performs RAG on your own data. To showcase its capabilities, we've customized it as an Airbnb chatbot using Airbnb’s customer documentation. How it works: • Ask any question related to your booking, policies, or anything else about your Airbnb experience. • Get relevant, human-like responses: Verba provides natural and informative answers. • Access original sources: One of the standout features of RAG is its ability to directly indicate the sources it used to generate each response. Under the hood, Verba uses a RAG pipeline to deliver these results. Your query is transformed into a numerical representation (vector) and used to search through our vector database for the most similar context using Hybrid Search. The most relevant context is then combined with your original question and fed into a large language model (LLM). The LLM then uses all of that information to generate a conversational response. Et voilà! 💫 Try Verba: Verba on GitHub: Learn more in our video:
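
The pipeline described in this post (embed the query, run a hybrid search, then combine the best context with the question) might look roughly like the toy sketch below. It is not Verba's implementation: the bag-of-words `embed`, the `alpha` weighting, and every function name here are assumptions chosen so the example runs with the standard library alone, whereas a real deployment would use a learned embedding model and a vector database.

```python
import math
from collections import Counter
from typing import List

def embed(text: str) -> Counter:
    """Toy 'vector': word counts. A real system would use an embedding model."""
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0

def keyword_score(query: str, doc: str) -> float:
    """Fraction of query terms that appear verbatim in the document."""
    terms = set(query.lower().split())
    doc_terms = set(doc.lower().split())
    return len(terms & doc_terms) / len(terms) if terms else 0.0

def hybrid_search(query: str, docs: List[str], alpha: float = 0.5, k: int = 1) -> List[str]:
    """Blend vector and keyword scores; alpha weights the vector side."""
    qv = embed(query)
    scored = [
        (alpha * cosine(qv, embed(d)) + (1 - alpha) * keyword_score(query, d), d)
        for d in docs
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [d for _, d in scored[:k]]

def build_prompt(query: str, context: List[str]) -> str:
    """Combine retrieved context with the original question for the LLM."""
    joined = "\n".join(f"- {c}" for c in context)
    return f"Answer using only these sources:\n{joined}\n\nQuestion: {query}"

docs = [
    "guests can cancel a booking for a full refund within 48 hours",
    "hosts must keep the listing calendar up to date",
    "check-in time is set by the host",
]
query = "how do i cancel a booking"
print(build_prompt(query, hybrid_search(query, docs)))
```

Because the retrieved passages are passed along explicitly in the prompt, the application can also show them to the user as sources.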

Femke Plantinga

120,565 views • 2 years ago

Our first Generative AI short course in JavaScript! GitHub recently reported that JavaScript is again the world’s most popular programming language. To support web developers exploring and developing with generative AI, we just launched a new short course in JavaScript taught by Jacob Lee, founding engineer at . In Build LLM Apps with LangChain.js you’ll learn elements common in AI development, including: (i) Using data loaders to pull data from common sources such as PDFs, websites, and databases (ii) Prompts, which are used to provide the LLM context (iii) Modules to support RAG such as text splitters and integrations with vector stores (iv) Working with different models to write applications that are not vendor-specific (v) Parsers, which extract and format the output for your downstream code to process You’ll also build with the LangChain Expression Language, which lets you easily compose sequences (also called chains) of modules to perform complex tasks using LLMs. Putting all this together, you’ll also work on a conversational question-answering LLM application capable of using external data as context. Please sign up here:

Andrew Ng

284,320 views • 2 years ago

RouteLLM - Route To The Best LLM Based On Your Prompt. Our RouteLLM got an upgrade this week: it got smarter at picking the right LLM - o1 for complex queries - gpt4o for quick answers - sonnet for coding - deepseek for simple code - gemini for long context - llama and mini for simple questions. It optimizes for performance, speed and cost!

Bindu Reddy

17,860 views • 1 year ago

Agents can now search and scrape the web with a single API call. Firecrawl's /search endpoint returns complete LLM-ready content for every search result. HTML, Markdown, or JSON. Works via API, MCP for Cursor, Zapier and n8n.

Lior Alexander

14,812 views • 1 year ago

I built an open source data analyst agent! Upload any CSV, ask a question, and get it answered with statistics or a nice chart. Launching in 24 hours. 100% free & open source. Under the hood, here's how it works: 1. User uploads a CSV and asks a question. 2. The app uses Together Code Interpreter to spin up a VM and uploads the CSV onto it. 3. I have an LLM (Qwen 3 Coder) write code to interpret the CSV using `pandas`, then solve the question the user asked. 4. Together Code Interpreter runs the code in a secure environment & returns a result. These results can be text (some kind of stats analysis) or a chart.

Hassan

27,086 views • 11 months ago

Transforming Invoice Data into JSON: Local LLM with LlamaIndex & Pydantic 🚀 Complete video: Code: I explain how to get structured JSON output with LlamaIndex and dynamic Pydantic class. This helps to implement the use case of data extraction from invoice documents. The solution runs on the local machine, thanks to Ollama. I'm using a MacBook Air M1 with 8GB RAM. LlamaIndex 🦙 Pydantic ollama #Python #LLM #RAG

Andrej Baranovskij

147,949 views • 2 years ago

JSX Tool (@jsxtool) is an in-browser IDE for building React UIs. Click on any element in the browser to jump straight to the relevant line of code and edit it yourself or provide precise context to an LLM to make updates.

Y Combinator

49,755 views • 8 months ago

New short course: LLMs as Operating Systems: Agent Memory, created with Letta, and taught by its founders Charles Packer and Sarah Wooders. An LLM's input context window has limited space. Using a longer input context also costs more and results in slower processing. So, managing what's stored in this context window is important. In the innovative paper MemGPT: Towards LLMs as Operating Systems, its authors (which include the instructors) proposed using an LLM agent to manage this context window. Their system uses a large persistent memory that stores everything that could be included in the input context, and an agent decides what is actually included. Take the example of building a chatbot that needs to remember what's been said earlier in a conversation (perhaps over many days of interaction with a user). As the conversation's length grows, the memory management agent will move information from the input context to a persistent searchable database; summarize information to keep relevant facts in the input context; and restore relevant conversation elements from further back in time. This allows a chatbot to keep what's currently most relevant in its input context memory to generate the next response. When I read the original MemGPT paper, I thought it was an innovative technique for handling memory for LLMs. The open-source Letta framework, which we'll use in this course, makes MemGPT easy to implement. It adds memory to your LLM agents and gives them transparent long-term memory. 
In detail, you’ll learn: - How to build an agent that can edit its own limited input context memory, using tools and multi-step reasoning - What is a memory hierarchy (an idea from computer operating systems, which use a cache to speed up memory access), and how these ideas apply to managing the LLM input context (where the input context window is a "cache" storing the most relevant information; and an agent decides what to move in and out of this to/from a larger persistent storage system) - How to implement multi-agent collaboration by letting different agents share blocks of memory This course will give you a sophisticated understanding of memory management for LLMs, which is important for chatbots having long conversations, and for complex agentic workflows. Please sign up here!
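
The memory hierarchy described above, with the context window acting as a small cache in front of a larger persistent store, can be illustrated with a minimal sketch. This is not Letta's or MemGPT's API: `AgentMemory`, `context_limit`, and `search_archive` are hypothetical names, and a fixed oldest-first eviction rule stands in for the agent's own decisions about what to move in and out (summarization is left out entirely).

```python
from collections import deque
from typing import Deque, List

class AgentMemory:
    """Toy two-tier memory: a bounded context plus an unbounded archive."""

    def __init__(self, context_limit: int = 3) -> None:
        self.context_limit = context_limit
        self.context: Deque[str] = deque()  # what the LLM would see (the "cache")
        self.archive: List[str] = []        # stand-in for a persistent database

    def add(self, message: str) -> None:
        """Append a message, evicting the oldest ones once the limit is hit."""
        self.context.append(message)
        while len(self.context) > self.context_limit:
            # Assumption: evict oldest first. In MemGPT the agent chooses.
            self.archive.append(self.context.popleft())

    def search_archive(self, keyword: str) -> List[str]:
        """Return archived messages containing the keyword (case-insensitive)."""
        return [m for m in self.archive if keyword.lower() in m.lower()]

memory = AgentMemory(context_limit=2)
for message in ["My name is Sam", "I like hiking", "What is RAG?"]:
    memory.add(message)

print(list(memory.context))           # the two most recent messages
print(memory.search_archive("name"))  # the evicted message is still searchable
```

The point of the split is that nothing is lost when the context fills up: older messages stay searchable and can be brought back when they become relevant again.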

Andrew Ng

200,788 views • 1 year ago

New short course on sophisticated RAG (Retrieval Augmented Generation) techniques is out! Taught by Jerry Liu and Anupam Datta of LlamaIndex 🦙 and TruEra, this teaches advanced techniques that help your LLM generate good answers. Topics include: - Sentence-window retrieval, which retrieves not just the most relevant sentence, but a window of sentences around it for higher quality context. - Auto-merging retrieval, which organizes your document into a hierarchical tree structure, where each parent node's text is split among its child nodes. Based on the relevance of the child nodes to a user query, this lets you better decide whether the entire parent node should be provided as context to the LLM. - Evaluation methodology for separately evaluating the quality of the key steps of RAG (context relevance, answer relevance, groundedness) so that you can perform error analysis, identify which part of your pipeline needs work, and tune components systematically. Please check out the course!
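
Sentence-window retrieval, the first topic listed, can be sketched in a few lines. The version below is an illustration rather than the LlamaIndex implementation: it scores sentences by simple word overlap instead of embeddings, and `split_sentences`, `overlap`, and `sentence_window` are hypothetical helper names.

```python
import re
from typing import List

def split_sentences(text: str) -> List[str]:
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]

def overlap(query: str, sentence: str) -> int:
    """Number of distinct words shared by the query and the sentence."""
    q = set(re.findall(r"\w+", query.lower()))
    s = set(re.findall(r"\w+", sentence.lower()))
    return len(q & s)

def sentence_window(query: str, text: str, window: int = 1) -> str:
    """Find the best-matching sentence and return it with its neighbours."""
    sentences = split_sentences(text)
    if not sentences:
        return ""
    best = max(range(len(sentences)), key=lambda i: overlap(query, sentences[i]))
    start = max(0, best - window)
    end = min(len(sentences), best + window + 1)
    return " ".join(sentences[start:end])

text = (
    "Paris is the capital of France. The Eiffel Tower opened in 1889. "
    "It is 330 metres tall. Tourists visit it every year."
)
print(sentence_window("When did the Eiffel Tower open?", text))
```

Matching on a single sentence keeps the search precise, while returning its neighbours gives the LLM enough surrounding text to resolve references such as "It" in the third sentence.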

Andrew Ng

655,580 views • 2 years ago

"Orgs and cos building MCP servers are taking an LLM-first approach to what the API needs to expose to the agent(s)." - Nikunj Handa from OpenAI "For example, Stripe has a bunch of APIs that can be used to create a subscription/customer/product/price. For an LLM, it can just combine that into a single function." "Instead of returning this massive JSON object, they can return something very specific to the task being solved, so that the LLM can more easily understand what's happening." "It's an opportunity to rewrite your APIs to be very LLM-first. Why do 2 hours of work, when you can do it in 4 lines of code under a minute?"

TBPN

11,445 views • 1 year ago

new project: cleo (kindle + llm) 📚 cleo is an ios app that pairs whatever book you're reading on kindle with an llm (o3) — ask questions, get recaps/summaries, or listen/discuss (think audiobooks but interactive). the llm has context on exactly where you are + the book contents. best part? no complex setup needed. just link your kindle account and that's it. it just works. reply / rt for testflight invite 👇

jpa

630,298 views • 1 year ago

Introducing /search - the simplest way for agents and devs to discover the web 👀 Our #1 most requested endpoint is finally here. Search the web AND scrape all results in an LLM-ready format with one API call. Now live on our API, MCP, and all integrations 🔥

Firecrawl

390,772 views • 1 year ago

RLM is the most important foundation of my Pi Harness (other than Pi of course). It's seeded with late interaction retrieval results (thanks to @lightonai for pylate). The Agent initiates it with a query, then... 𝐒𝐞𝐭𝐮𝐩 A python REPL is created and seeded with: 1. Late interaction search to pre-filter. Instead of doing top 3/5/10, it's top hundreds of documents. This is set into a `context` variable. 2. Python functions are loaded in to do more searches if the `context` variable isn't enough. And to make llm calls with cheaper models in parallel batches. 𝐈𝐭𝐞𝐫𝐚𝐭𝐢𝐨𝐧 𝐋𝐨𝐨𝐩 From there, an LLM iterates in the REPL based on the query. It's just like exploring in a jupyter notebook. The LLM writes prose (like a markdown cell) and code to be run in the REPL each turn. This allows the LLM to sort, filter, and synthesize information. It can fan out and ask smaller models to summarize, combine, contrast, or do anything else to documents to help it understand the data. After several turns the LLM responds with the final answer. Either because it found the answer, or hit the budget limit. Context as a Python variable, LLM as the programmer, REPL as the runtime. 𝐖𝐡𝐲 𝐃𝐨𝐞𝐬 𝐓𝐡𝐢𝐬 𝐖𝐨𝐫𝐤 1. Richer Shell. Agents (and subagents) work by intermixing code and prose/thinking. But they use static scripts or bash that run and exit and start over each tool call. That's not ideal for exploration and synthesis of data. For that, state is useful to continue building and exploring the data as you learn more. There's a reason jupyter notebooks have been popular with data scientists. 2. Keeps main agent context clean. The better context you have the better the agent will perform (duh!). This means three things: better human input, fewer missing search results, and fewer incorrect search results. Letting the agent iterate allows it to synthesize just what is needed and nothing else. All bad paths or peeks at something that turns out to be irrelevant stay out of main agent context. 3. Stack the good ideas!
People often compare late interaction search vs RLM. Or static vs dynamic languages. Or agentic search vs semantic search. But...You can just use them all together for what they're each good at. Use them all for the area they're really great for. Read the full post which has more detail about how and why.

Isaac Flath

40,212 views • 3 months ago

This idea is either extremely smart or extremely stupid, no in-between. What if your LLM *is* your search engine? What would you look like inside it? Forget about Perplexity, DeepResearch. What if LLM is your entire Google? Pagination, links and everything - just like the old days: no chat UI, classic Google vibe. If you're unsure what that means, watch the demo video below first. We call it LLM-as-SERP (Search Engine Results Page).

Jina AI

73,960 views • 1 year ago

Right now, when you send a query to an LLM, it gets decrypted on the server. The LLM sees your data in plain text. Prof. Ajay Joshi (BU, CipherSonic AI) on fully homomorphic encryption, which may be key for the future of AI privacy: how we can compute on data without ever decrypting it. The catch: it's a brutally memory-bound workload. Exactly the bottleneck wafer-scale was built to solve.

Cerebras

293,415 views • 1 month ago

🤔 Why do we still rely on the final layer of an LLM, when different layers encode different information? 🤔 In our new work, “Improving LLM Final Representations with Inter-Layer Geometry” (ICLR 2026 Workshop on Geometry-grounded Representation Learning and Generative Modeling) we show that actually, LLMs do not have one “best” layer. We introduce the Cayley-Encoder: an efficient and effective geometric encoder that learns one strong representation from all layer representations of the LLM, without biasing the representation toward any specific layer. While adding at most 0.1% learned parameters to the LLM, the Cayley-Encoder achieves large empirical gains over LoRA fine-tuning, final-layer representations, expensive attention-based aggregation, and methods that optimize specific layers for the task.

Maya Bechler-Speicher

16,597 views • 1 month ago

I'm reiterating - I think a better interface for LLMs on desktop is an interactive notebook, not chat. I'm trying to build a better intuition for PCA and strongly feel an interactive notebook where I'm chatting with LLM + also modifying code/plotting figures is a much better interaction. Bonus points if you make notebook cells branchable (so if you get an error, you can ask LLM to fix it without adding that rotten context to the main branch of learning). Please steal this idea. Make an AI-powered, interactive, branching notebook.

Paras Chopra

58,064 views • 11 months ago

🚀 Introducing the first-ever Agent UI 🚀 This is hands-down my favorite product! Chat with local Agents tailored to my needs. Local memory, storage, knowledge and tools 🔥 ⚡️ Your data, your control 🧠 Compatible with any LLM 🤝 Run multiple agents or a team of agents

Ashpreet Bedi

69,015 views • 1 year ago