Let's talk parsing tables. Two days ago we launched ParseBench, the first document OCR benchmark built for AI agents. This deep dive breaks down TableRecordMatch (GTRM), our metric for evaluating complex tables the way your pipeline actually consumes them: as records keyed by column headers.

LlamaIndex 🦙

112,693 subscribers

25,999 views • 3 months ago • via X (Twitter)
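
To make "records keyed by column headers" concrete, here is a minimal sketch of the idea. It is an illustration only and not the ParseBench implementation of TableRecordMatch: the function names `table_to_records` and `record_match_score`, the whitespace and case normalization, and the scoring rule (fraction of expected records found in the prediction) are all assumptions made for this example.

```python
from collections import Counter


def table_to_records(header, rows):
    """Turn a table into records: one dict per row, keyed by column header."""
    return [dict(zip(header, row)) for row in rows]


def _record_key(record):
    # Hypothetical normalization: trim whitespace, lowercase header names.
    # A frozenset of (header, value) pairs ignores column order.
    return frozenset(
        (str(k).strip().lower(), str(v).strip()) for k, v in record.items()
    )


def record_match_score(predicted, expected):
    """Fraction of expected records that also appear in the prediction.

    Row order and column order do not matter; a record only counts if
    every value sits under the right header.
    """
    if not expected:
        return 1.0
    available = Counter(_record_key(r) for r in predicted)
    matched = 0
    for record in expected:
        key = _record_key(record)
        if available[key] > 0:
            available[key] -= 1  # each predicted record can match only once
            matched += 1
    return matched / len(expected)


expected = table_to_records(
    ["Region", "Q1", "Q2"],
    [["North", "10", "12"], ["South", "7", "9"]],
)

# Same data with columns in a different order: still the same records.
reordered = table_to_records(
    ["Q1", "Region", "Q2"],
    [["10", "North", "12"], ["7", "South", "9"]],
)

# Same cell text, but the Q1/Q2 headers are swapped: values land under
# the wrong keys, so no record matches.
misaligned = table_to_records(
    ["Region", "Q2", "Q1"],
    [["North", "10", "12"], ["South", "7", "9"]],
)

score_reordered = record_match_score(reordered, expected)
score_misaligned = record_match_score(misaligned, expected)
```

In this sketch the reordered table scores 1.0 and the misaligned one scores 0.0, even though both contain exactly the same cell text. That is the kind of difference a text-similarity metric can miss and a record-based comparison is meant to surface.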

Related Videos

Let's talk parsing charts 📊📈. Last week we released ParseBench, the first document OCR benchmark for AI agents. New in ParseBench: ChartDataPointMatch. Most document parsers look at a chart and OCR the caption. Agents need the actual numbers. That's the gap between "OCR'd the text around the chart" and "actually read the chart." More about ParseBench, the GitHub code, Hugging Face dataset, and scientific paper→

LlamaIndex 🦙

13,987 views • 3 months ago

We've spent years building LlamaParse into the most accurate document parser for production AI. Along the way, we learned a lot about what fast, lightweight parsing actually looks like under the hood. Today, we're open-sourcing a lightweight core of that tech as LiteParse 🦙
It's a CLI + TS-native library for layout-aware text parsing from PDFs, Office docs, and images. Local, zero Python dependencies, and built specifically for agents and LLM pipelines. Think of it as our way of giving the community a solid starting point for document parsing:
npm i -g @llamaindex/liteparse
lit parse anything.pdf
- preserves spatial layout (columns, tables, alignment)
- built-in local OCR, or bring your own server
- screenshots for multimodal LLMs
- handles PDFs, Office docs, images
Blog: Repo:

LlamaIndex 🦙

581,431 views • 4 months ago

OCR can process characters but it doesn’t understand pixels. OCR has no way to reason about the headers, totals, or checkboxes found in tables, invoices, or forms. In our course with LandingAI, "Document AI: From OCR to Agentic Doc Extraction," we build agents to address these failure modes by breaking documents into pieces, applying the right tools, and mapping information to expected formats. Learn more and enroll today:

DeepLearning.AI

21,554 views • 6 months ago

India is accelerating its sovereign AI mission with homegrown models built for local languages and governance needs. Bengaluru-based Sarvam AI has launched Sarvam Vision, an OCR system for complex paper records, and Bulbul, a text-to-speech model spanning 11 Indian languages. Sarvam claims its OCR beat Google Gemini on a benchmark. Backed by government funding, the programme aims to cut dependence on foreign AI systems. Watch this report. 🔽

BJP

34,446 views • 5 months ago

We’re open sourcing the first document OCR benchmark for the agentic era, ParseBench. Document parsing is the foundation of every AI agent that works with real-world files. ParseBench is a benchmark that measures parsing quality specifically for agent knowledge work:
✅ It optimizes for semantic correctness (instead of exact similarity)
✅ It has the most comprehensive distribution of real-world enterprise documents
It contains ~2,000 human-verified enterprise document pages with 167,000+ test rules across five dimensions that matter most: tables, charts, content faithfulness, semantic formatting, and visual grounding. We benchmarked 14 known document parsers on ParseBench, from frontier/OSS VLMs to specialized parsers to LlamaParse. Here are some of our findings:
💡 Increasing compute budget yields diminishing returns: Gemini/gpt-5-mini/haiku gain 3-5 points from minimal to high thinking, at 4x the cost.
💡 Charts are the most polarizing dimension for evaluation. Most specialized parsers score below 6%, while some VLM-based parsers do a bit better.
💡 VLMs are great at visual understanding but terrible at layout extraction. GPT-5-mini/haiku score below 10% on our visual grounding task; all specialized parsers do much better.
💡 No method crushes all 5 dimensions at once, but LlamaParse achieves the highest overall score at 84.9%, and is the leader in 4 out of the 5 dimensions.
This is by far the deepest technical work that we’ve published as a company. I would encourage you to start with our blog and explore our links to Hugging Face and GitHub. All the details are in our full 35-page (!!) ArXiv whitepaper.
🌐: Blog: 📄 Paper: 💻 Code: 📊 Dataset: 🎥 YouTube:

Jerry Liu

108,011 views • 3 months ago

We built LobsterX 🦞, an OpenClaw🦞 specialized for document work on your computer. It uses high-accuracy document parsing, extraction, classification through LlamaCloud, meaning it can comb through complicated PDFs (with scans, tables, diagrams) and extract out 100% accurate context! It can run as a Telegram bot and is built on top of agentfs (Turso) as a file system. Big shoutout to Clelia Bertelli (🦙/acc). This is a fun project inspired by OpenClaw🦞’s success, and besides being a fun tool to use, it can be a great reference for building your own generalized coding agents! Readme: LlamaCloud:

Jerry Liu

27,071 views • 5 months ago

Baidu Inc. just dropped PaddleOCR-VL-1.5, and it’s not just another OCR model. This is a fully open-source OCR model with just 0.9B parameters, built specifically for production-grade document understanding. And the results speak loudly 👇 On OmniDocBench V1.5, the most authoritative global document parsing benchmark, PaddleOCR-VL-1.5 ranks #1 overall, hitting 94.5% overall accuracy, outperforming models like DeepSeek-OCR2. As the first OCR model natively supporting irregular document layout positioning, it perfectly solves OCR pain points in production for complex structured business documents such as financial reports and insurance forms. While a lot of the industry is racing toward bigger models, PaddleOCR is betting on something smarter: small parameters + high performance + open source + real usability. That combination is exactly what scalable document intelligence needs right now. Demo / API → Open source → Model → Curious to see how teams start building on top of this.

Bishal Nandi

35,318 views • 6 months ago

Introducing LiteParse - the best model-free document parsing tool for AI agents 💫
✅ It’s completely open-source and free.
✅ No GPU required, will process ~500 pages in 2 seconds on commodity hardware
✅ More accurate than PyPDF, PyMuPDF, Markdown. Also way more readable - see below for how we parse tables!!
✅ Supports 50+ file formats, from PDFs to Office docs to images
✅ Is designed to plug and play with Claude Code, OpenClaw, and any other AI agent with a one-line skills install. Supports native screenshotting capabilities.
We spent years building up LlamaParse by orchestrating state-of-the-art VLMs over the most complex documents. Along the way we realized that you could get quite far on most docs through fast and cheap text parsing. Take a look at the video below. For really complex tables within PDFs, we output them in a spatial grid that’s both AI and human-interpretable. Any other free/light parser like PyPDF will destroy the representation of this table and output a sequential list. This is not a replacement for a VLM-based OCR tool (it requires 0 GPUs and doesn’t use models), but it is shocking how good it is at parsing most documents. Huge shoutout to Logan Markewich and Clelia Bertelli (🦙/acc) for all the work here. Come check it out: Repo:

Jerry Liu

256,748 views • 4 months ago

We’re excited to officially launch LlamaParse, the first genAI-native document parsing solution. Not only is it better at parsing out images/tables/charts 📊📈 than virtually every other parser, it is now steerable through natural language instructions - output the document in whatever format you desire! It is also the only parsing solution that seamlessly allows you to build accurate RAG over complex documents, free of hallucinations 🔥 We launched it in private preview a few weeks ago and hit 2k users, 1M total PDF pages parsed. And now it’s better than ever. LlamaParse contains the following killer features: ✅ SOTA table/chart extraction ✅ Seamless integration with LlamaIndex 🦙 advanced RAG/agents ✅✨ Natural language Parsing Instructions ✅✨JSON mode and image extraction ✅✨Support for ~10 document types (.pdf, .pptx, .docx, .xml) and more Our pricing is simple: 1k free per day, and additional pages at 0.3c a page, or $3 for 1k pages. If you want advanced document RAG and/or private deployments, come get in touch with us to chat about LlamaCloud. Check out our full blog post here: LlamaParse client repo: Signup at 🦙☁️: Come talk to us:

LlamaIndex 🦙

143,136 views • 2 years ago

An underrated issue with document parsing for RAG / agent use cases is dealing with multi-page tables - sometimes a big table spills over into multiple pages. This breaks chunking algorithms that generally operate at the page-level or smaller, and causes LLMs to lose the full view of the data. With LlamaParse Continuous Mode (in beta), you can now parse a document with multi-page tables and join them into a single table! This means you can now: 💡 Do contiguous chunking for RAG use cases OR 💡 Parse the table for text-to-SQL Check out our blog post highlighting this feature. Huge shoutout to Pierre-Loic Doulcet and Sacha Bron : Signup here: It's in beta, let us know your feedback!

Jerry Liu

24,245 views • 1 year ago

Super excited to share 🧠MLGym 🦾 – the first Gym environment for AI Research Agents 🤖🔬 We introduce MLGym and MLGym-Bench, a new framework and benchmark for evaluating and developing LLM agents on AI research tasks. The key contributions of our work are: 🕹️ Enables the exploration of different training algorithms for AI Research Agents such as RL 🛠️ Provides a flexible evaluation framework that can accommodate different artifacts such as models, algorithms, or predictions 🤖 Allows researchers to evaluate any model without the need to develop a custom agentic harness 🎯 Introduces 13 diverse open-ended AI Research tasks for evaluating AI Research Agents on a wide range of domains such as computer vision, natural language processing, reinforcement learning, game theory, and logical reasoning. 📈 Proposes a new evaluation metric for AI Research Agents MLGym makes it easy to: 1) Add new tasks 2) Evaluate new models 3) Integrate new agents Check out a video of the MLGym Agent to see how it performs the full pipeline of idea generation💡, implementation 👩‍💻, experimentation 👩‍🔬, and iteration 🔄 to improve on ML tasks. Huge thanks to the exceptionally talented Deepak Nathani who led this work and to all the other amazing collaborators who made this possible 🙏🫶🚀

Roberta Raileanu

105,052 views • 1 year ago

The paradigm shift is here: We are moving from a "Human-First" internet to an "Agent-First" internet. 🤖 In the latest episode of the AI on Air podcast, our CTO Scott Shi - e/acc dives deep into the future of the Agent-to-Agent (A2A) economy. Imagine a world where your AI assistant doesn't just search the web, but autonomously hires and pays other specialized agents to complete complex tasks for you. The infrastructure for this autonomous machine economy is being built right now. Are you ready for the future of AI payments? Watch the full episode to learn more⬇️

KITE AI

25,038 views • 4 months ago

🌟Our latest LangChain Academy course – Deep Agents with LangGraph – is now live!🌟 Many agents today follow the same simple pattern: run in a loop, call tools. That architecture works well enough, but it breaks down as tasks get more complex. Today, companies of all sizes – from startups to large enterprises – are building their own Deep Agents. These agents dive deeper. They’re able to plan complex tasks and carry them out over longer time horizons. There are four key features that set Deep Agents apart from regular agents: 1. Planning – keeps agents on track 2. File system – allows agents to offload context 3. Sub-agents – act as focused specialists 4. Prompting – provides agents with detailed instructions Our latest LangChain Academy course, Deep Agents with LangGraph, shows you how to combine these pieces with LangGraph to orchestrate long-running, multi-agent workflows. Big thanks to community member Dmitry Labazkin for helping us shape this course with his contributions! Enroll for free ➡️

LangChain

63,349 views • 10 months ago

SilkAI is live. Try it now: 🔥 The first version of our intelligent web experience is ready for you to explore, directly in your browser, no downloads needed. You can: • Chat with intelligent AI agents • Create and train your own characters • Explore a new layer of interactive AI Built from scratch with love, obsession, and way too many late nights, this is your moment to dive in. And behind the scenes? We’re already working on something wilder: real-world robotics. Talk. Create. Go deep. Try it free. Let SilkAI surprise you.

NetVRk

28,908 views • 1 year ago

"We put out this $20 million bounty, by the way, when Osama bin laden took down the twin towers, I think the U.S. government put out a $25 million bounty for information on him. So this was like a pretty substantial bounty right?""And this was a great moment where I was like, let's turn the tables on them and try to catch these these b*stards. So that's exactly what we did." Brian Armstrong Coinbase 🛡️

Shawn Ryan

121,236 views • 1 year ago

EXPOSED by Rightanglenews - and we need your help in supporting some federal agents! Remember Kirk Bangstad, the owner of the Wisconsin brewery who offered free beer for President Trump’s ass*ssination? Well he just decided to dox the federal agents who visited him over his threat - and now he's pushing his followers to harass them as well. So it's time to turn the tables. On the screen you'll find the number for the Wisconsin Division of Alcohol - feel free to call them and demand the removal of his license. REPOST this EVERYWHERE! #thinblueline #lawenforcement

Blue Lives Matter

44,900 views • 3 months ago

✅ Announcing our partnership with Manta Network (🔱,🔱) to accelerate on-chain #AI content and on-chain AI agents
Orbofi AI, the most adopted AI engine in web3, is thrilled to announce its partnership with Manta Network (🔱,🔱), the modular and fastest-growing L2, as we jointly empower consumers and developers to create AI assets and AI agents using the Orbofi engine and tokenize them on Manta in a few clicks.
Orbofi AI is acting as a factory engine for on-chain AI assets and AI agents for the Manta community and the overall web3 ecosystem, powered by open-source and distributed AI models, setting a new benchmark for on-chain AI applications.
🪙 Join the Creative campaign, create your first AI NFT on Manta, and earn rewards:
📙 Learn more about the partnership:

Orbofi

99,567 views • 2 years ago

Presenting: $PALM AI Applications System Hardware 📹 Watch the Reveal Teaser from below ✅ The first Web3 related microcontroller-based device that combines offline voice recognition with in-built AI profiles, performing on electronic circuits straight from the breadboard it was incepted from. 1️⃣ The next steps are to document the use cases of our system, which can be integrated into existing devices and physical services in real life. We will be sure to document and publish processes as the device goes into factory production all the way to shipping them out and showcasing the real-world adoption of our products. 2️⃣ Our hardware is built for flexibility and to innovate on, and we will be working with the top experts in the field to make the best out of it and is built to allow the device structure to be embedded as the smart component of any and all devices. ⌛️Stay tuned for the first use-case reveal of our AI Applications System inside an AI-enhanced, by default offline hardware wallet, fully built from scratch by our talented team. 🙏 Finally, thank you for the continued support. We will keep working as hard as ever. Enjoy the reveal.

PaLM AI - $PALM

166,227 views • 2 years ago

What if you could train AI agents on a laptop as easily as on a GPU cluster? Researchers from UIUC's U Lab, led by Prof. Jiaxuan You, just open-sourced OpenTinker. It's a new "Reinforcement-Learning-as-a-Service" (RLaaS) system that decouples the complex training pipeline into simple, distributed services with friendly APIs. The result? It breaks down the major engineering barriers to RL, outperforming traditional frameworks in accessibility and ease of deployment, finally making agent training viable for more developers and teams. Project: Code: U Lab: Our report: 📬 #PapersAccepted by Jiqizhixin

机器之心 JIQIZHIXIN

15,893 views • 6 months ago