What Postgres can’t do alone, it can tackle with DuckDB. The pg_duckdb extension shows how two databases can work hand in hand by querying analytical data with DuckDB and joining it with transactional data in Postgres. The extension embeds DuckDB directly into Postgres. DuckDB literally runs inside a Postgres...

Denis Magda

5,025 subscribers

28,678 views • 6 months ago • via X (Twitter)

Similar Videos

Big moment for Postgres! Search has always been Postgres' weak spot, and everyone just accepted it. If you needed real relevance-ranked keyword search, the default answer was to spin up Elasticsearch or add Algolia and deal with the data sync headaches forever.
The problem isn't that Postgres can't do text search. It can. But the built-in `ts_rank` function uses a basic term frequency algorithm that doesn't come close to what modern search engines deliver. So teams end up:
- Running a separate Elasticsearch cluster just for search
- Building sync pipelines that inevitably drift out of consistency
- Paying for managed search services that charge per query
- Accepting mediocre search relevance because "good enough" ships faster
But this is actually a solvable problem. You can realistically bring industry-standard search ranking directly into Postgres, which eliminates the need for external infra entirely.
This exact solution is now available with the newly open-sourced pg_textsearch by Tiger Data (creators of TimescaleDB), a Postgres extension that brings true BM25 relevance ranking into the database. BM25 is the algorithm behind Elasticsearch, Lucene, and most modern search engines. Now it runs natively in Postgres.
Here's what pg_textsearch enables:
- True BM25 ranking with configurable parameters (the same algorithm powering production search systems)
- Simple SQL syntax: `ORDER BY content <@> 'search terms'`
- Works with Postgres text search configurations for multiple languages
- Pairs naturally with pgvector for hybrid keyword + semantic search
That last point matters a lot for RAG apps. The video below shows this in action, and I worked with the team to put this together. You can now do hybrid retrieval (combining keyword matching with vector similarity) in a single database, without stitching together multiple systems. The syntax is clean enough that you can add relevance-ranked search to existing queries in minutes.
pg_textsearch is fully open-source under the PostgreSQL license. You can find a link to their GitHub repo in the next tweet.
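
To make the contrast between plain term frequency and BM25 concrete, here is a minimal sketch of the standard BM25 scoring formula in Python. It is not pg_textsearch's implementation: the function name `bm25_scores`, the whitespace tokenizer, the Lucene-style IDF variant, and the defaults `k1=1.2` and `b=0.75` are all assumptions chosen for illustration.

```python
import math
from collections import Counter


def bm25_scores(query, docs, k1=1.2, b=0.75):
    """Score each document against the query with the standard BM25 formula.

    Illustrative only: tokenization is a lowercase whitespace split, and the
    IDF is the Lucene-style ln(1 + (N - n + 0.5) / (n + 0.5)).
    """
    if not docs:
        return []
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    avg_len = sum(len(tokens) for tokens in tokenized) / n_docs

    # Document frequency: in how many documents each term appears.
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))

    scores = []
    for tokens in tokenized:
        term_freq = Counter(tokens)
        score = 0.0
        for term in query.lower().split():
            tf = term_freq[term]
            if tf == 0:
                continue
            idf = math.log(1 + (n_docs - doc_freq[term] + 0.5) / (doc_freq[term] + 0.5))
            # Length normalization: long documents are penalized, and repeated
            # terms give diminishing returns (controlled by k1 and b).
            norm = tf + k1 * (1 - b + b * len(tokens) / avg_len)
            score += idf * tf * (k1 + 1) / norm
        scores.append(score)
    return scores


docs = [
    "postgres full text search",
    "postgres postgres postgres postgres is a database with many many many other unrelated words here",
    "duckdb analytics",
]
scores = bm25_scores("postgres search", docs)
```

In this toy corpus the short document that matches both query terms outranks the long document that merely repeats "postgres", which is the saturation and length-normalization behavior a raw term-frequency count lacks.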

Akshay 🚀

215,344 views • 6 months ago

First try with Vercel's new Postgres database! Was so quick to set up 🤯 here I've got a collaborative form built with Liveblocks, and I'm having it automatically sync data to the database. More info 🧵

Chris Nicholas

114,244 views • 3 years ago

Operational databases have long relied on tightly coupled compute and storage. This architecture creates resource contention and pushes teams to manage infrastructure rather than build. As applications become more real-time and automated, the transactional layer needs to adapt.
Databricks Lakebase is built for that evolution:
• Familiar Postgres semantics for app developers
• Compute separated from durable state
• Operational data running directly on the lakehouse
• Serverless autoscaling (including scale to zero), branching, and recovery to match agent-driven workloads
Now generally available:

Databricks

18,611 views • 5 months ago

Talking with someone the other day who estimated they had about 25,000 idle connections to Postgres. My actual response to them: "holy shit". They double-checked; it was only about 12,000. Same day I had a conversation with someone saying they didn't need pgbouncer because of ActiveRecord's connection pooling.
Let's dig into connection pooling in Postgres. Prior to Postgres 14, every connection to the database consumed memory, roughly 10MB. It may be slightly less, but it still wasn't free. Even beyond Postgres 14 there is still contention that happens when Postgres starts to use a connection.
An application pooler maintains a set of connections and hands them out when needed on the application side. These are real, idle connections against the database, and they do impact performance negatively. In contrast, pgbouncer speaks the wire protocol, waits for the begin part of the transaction, and only then uses a connection. It manages the number of idle connections centrally, rather than per web server you're running.
Until recently, pgbouncer really needed to be run in transaction mode (which meant disabling prepared statements in your application framework). Prepared statement support was added to pgbouncer recently, and now you don't have to disable them. Even when running an older version of pgbouncer with prepared statements disabled, you'd still see a big performance gain.
A quick check to know if you'd benefit from pgbouncer is to run this query:
SELECT count(*), state FROM pg_stat_activity GROUP BY 2;
If your idle count is high (yes, this is dependent on your view, but to me if it's above the 25-30 range, and especially if active is under half that) then you'd already start to benefit from pgbouncer. If it's at 10,000, then post haste get pgbouncer in place.
Finally, you don't have to stop using a framework pooler. They're fine, but don't think one replaces a native Postgres connection pooler.
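
The rule of thumb in the post can be sketched as a small Python helper that interprets the rows returned by the `pg_stat_activity` query. This is only an illustration under stated assumptions: the function names `rows_to_counts` and `pooler_advice`, the advice labels, and the exact cut-offs (25 idle connections, active under half of idle, 10,000 as urgent) are hypothetical encodings of the post's informal guidance, and no database connection is made here.

```python
# The query quoted in the post, kept verbatim for use with any Postgres client.
STATE_COUNT_QUERY = "SELECT count(*), state FROM pg_stat_activity GROUP BY 2;"


def rows_to_counts(rows):
    """Convert (count, state) rows into a {state: count} dict.

    Background processes report a NULL state, which is stored as "unknown".
    """
    return {(state if state is not None else "unknown"): count for count, state in rows}


def pooler_advice(state_counts, idle_threshold=25, urgent_threshold=10_000):
    """Apply the post's heuristic to idle and active connection counts.

    The thresholds are assumptions taken from the post's wording, not from
    Postgres or pgbouncer documentation.
    """
    idle = state_counts.get("idle", 0)
    active = state_counts.get("active", 0)
    if idle >= urgent_threshold:
        return "urgent"
    if idle > idle_threshold and active < idle / 2:
        return "likely benefit"
    if idle > idle_threshold:
        return "may benefit"
    return "probably fine"


# Example rows shaped like the query's output: (count, state).
example_rows = [(40, "idle"), (10, "active"), (3, None)]
advice = pooler_advice(rows_to_counts(example_rows))
```

With a real client, the rows would come from executing `STATE_COUNT_QUERY`; the helper only classifies the counts and does not replace looking at the workload itself.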

Craig Kerstiens

12,357 views • 1 year ago

Starting a new project today: building an end-to-end system to forecast flight traffic across cities (starting with Mumbai).
The idea is to implement:
> ingestion service with Kafka
> data ETL with Polars
> Feast for feature store // MLflow for model registry
> batch inferencing // dashboard
> S3 // Postgres for data storage
All this is orchestrated across multiple DAGs built with Airflow.
This year has just been Agents and LLMs all along. Not a bad idea to keep revisiting the traditional format :)
Will be posting more of this in the coming days, stay tuned

Aarno

19,481 views • 7 months ago

OpenAI's Deep Research is getting a run for its money. Deep Lake was just released, and it's a different take on an AI system that can do deep research on your own data. You can use Deep Lake to build AI search with reasoning on your private and public data. (Look at the attached videos to get an idea of how it works.) If you want to research proprietary and sensitive data, Deep Research won't help you because it's limited to public data. Deep Lake, however, will allow you to use your private data. On top of that, Deep Lake supports multi-modal retrieval from the ground up. It uses vision language models for data ingestion and retrieval so that you can connect any data (PDFs, images, videos, structured data, etc.) You can even use mixed-data queries! Deep Lake can search your data from S3, Dropbox, and GCP. It learns from your queries over time, making the results as relevant to your work as possible!

Santiago

171,340 views • 1 year ago

The Dawn of a New Era on $SUI (9)
Still in the festive spirit, let’s look at Tusky. Tusky is a storage service that's not controlled by one company. It uses something called WalrusProtocol to keep your data safe. Your data is encrypted from start to finish, so only you can see it. Instead of one place, your data goes to many different spots. This setup makes your data less likely to be lost or stolen. It helps keep your information safe and always available.
Tusky gives you control over your files. You can easily manage them with the tools provided. You decide who gets to see your data. This makes it great for personal storage or working with others. It works with SuiNetwork for even more privacy. You can log in without sharing personal info, keeping everything more secure. This means only you can get to your data, with no third party involved.
It is growing fast, with 100,000 uploads already. This indicates its increasing acceptance in the tech community focused on data sovereignty. It's good at managing lots of data safely. In tech, where you want to own your data, TuskyTools is popular. It gives users control over their information. This platform helps keep your data secure and gives you freedom.

Kaboom.sui🪖

19,698 views • 1 year ago

The global industry-standard oracle platform, Chainlink, is now live on the Injective Mainnet. Time to market for dApp developers is critical, and now with Injective’s EVM deployment, the iBuild AI dApp creator, onchain financial modules, and Chainlink Data Streams, developers on Injective can experience one of the fastest times to market anywhere in the industry.
Chainlink brings real-time Data Streams with sub-second latency to the only blockchain purpose-built for finance. This now gives Injective developers full access to Chainlink’s battle-tested data stream markets across the Injective ecosystem. Helix 🧬 will be the first Injective dApp to integrate Chainlink Data Streams, set to power its crypto and RWA markets.
Benefits of using Chainlink Data Streams on Injective:
✅ Low-latency market data: up to sub-second delivery
✅ Institutional reliability: collaborating with institutional data giants like ICE and FTSE Russell, An LSEG Business
✅ Programmability: custom-tailor formats, cadence, and fields to match a dApp's exact needs.
Injective markets now powered by Chainlink Data Streams. Ninjas are just getting started. 🥷

Injective 🥷

116,813 views • 8 months ago

A new look for the new era of data movement Over the past 12+ months we've been focused on building out mump2p, our Ethereum data propagation product, onboarding top validators to test it with, and educating our peers on the merits of decentralized coding. Now with the launch of mump2p on Ethereum mainnet firmly in our sights, it feels like the right time to give Optimum's visual identity a refresh-- something to match the sleek, powerful data acceleration network we're deploying. The advent of RLNC powered data movement frees blockchains from the networking bottleneck, so they can perform at the pace demanded by our ever-expanding digital economy. The new era begins soon with data propagation on Ethereum, more chains and use cases to follow. Time to raise the ceiling for blockchains, and do it in style.

Optimum

10,636 views • 3 months ago

What if crypto research was as easy as chatting with ChatGPT, but powered by real market data👑 Introducing CMC AI, a powerful new tool from CoinMarketCap that combines the speed of AI with the depth of live crypto data. It delivers fast, data-backed answers to your questions: ✅ Want to know why Bitcoin's price is rising? ✅ Curious about the latest news on your favorite cryptocurrency? ✅ Need sentiment analysis? It pulls real-time data and explains it in seconds. But it goes far beyond basic Q&A. In the future, you’ll be able to ask anything! For example, you could ask it to: – Discover undervalued tokens based on volume, MC, and sentiment. – Compare Layer 1s or L2s by adoption, speed, and dev activity. – Detect rug-pull risk via wallet distribution and tokenomics red flags. – Break down your portfolio by risk, correlation, and potential return. – Explore new use cases in DeFi, AI, RWA and DePIN And much more! 🔗Try it here: CMC AI changes how you learn, think, and act in Web3🧠

Alaoui Capital

34,900 views • 1 year ago

I remember spending weeks digging through government sites, tracking procurements, and building databases of China’s provincial leaders in my previous life as a journalist. The grind was real: slow and frustrating. I want to offer my friends in media and research free structured web data with AgentQL. Need to track gov policies, procurement data, or economic trends? We just went live, and I’m excited to see how this can support the important work journalists do every day. Reach out to me:

Keith Zhai

59,162 views • 1 year ago

Korea's NHN KCP runs a 2-second stablecoin payment pilot on Avalanche
NHN KCP confirmed today that it is running a commercial-feasibility pilot for stablecoin-based payments linked to its Payco easy-payment service. The test covers online gift certificate purchases inside the Payco app and offline payments at the cafe and cafeteria in the company's Seoul headquarters, with about 700 employees participating.
The system runs on a payments-focused mainnet built in cooperation with Avalanche (Avalanche🔺). NHN KCP says it logged a 2-second processing time from QR scan to approval, and built what it calls the industry's first stablecoin payment admin page so merchants can track blockchain settlement data in real time without crypto expertise.
NHN KCP plans to share the pilot data with financial-sector partners and large merchants to push toward commercialization. Another Korean payment major is moving stablecoins from product to infrastructure.

BSCN

35,165 views • 2 months ago

What if a network could deliver AI results closer to where data is created, instead of sending it to far‑off data centers? In collaboration with NVIDIA and Decart, we’re making that a reality by bringing GPU‑powered computing to the network edge — closer to homes and businesses — so AI applications respond in real-time. This is how we're laying the foundation for the next generation of AI‑driven services:

Comcast

15,708 views • 4 months ago

Aim for the stars with Pyth Data 💫
Vega Protocol enables anyone to create and trade derivatives products like dated futures or perpetual markets on a fully decentralized network. Learn more about our integration below:
ℹ️ About Vega
With Vega, everything from the order book to market creation and maintenance, liquidity provision, prices, management of margin, and how a position eventually settles happens on chain as part of the Vega network, and all of it is managed and governed by the community.
Interacting with Vega is gasless and uses a different fee structure that charges fees from trades continuously. Vega offers sub-second latency together with circuit breakers and auctions in low-liquidity regimes to discover true market prices for assets. Additionally, Vega's cross-margining and portfolio risk evaluation innovations significantly lower capital costs, opening up hedging instruments to a far greater range of people.
🔮 Reach the stars with Pyth Data
Vega requires an oracle to provide an asset price to settle a market or to terminate trading at a market's expiry. With millions in trading already processed, it is paramount for Vega to access up-to-date and accurate price data. Vega perpetual markets already support main crypto assets like $BTC, $ETH, $INJ, $SNX, and $LDO.

Pyth Network 🔮

25,762 views • 2 years ago

Trading 212 now has a voice, and it's pretty smart 🤖 Stay up to date with our brand-new AI analysis tool. This latest feature uses OpenAI to analyse raw market data in real time and transforms it into a smart summary, available instantly to read or listen 🎧 The data is updated multiple times a day and takes various factors into consideration to provide you with a summary of the current sentiment, market trends, and fundamentals. Currently available only for selected stocks. The data is generated by an AI automated tool. It does not reflect human opinions and should not be considered professional advice. AI analysis is experimental, and Trading 212 does not guarantee its accuracy. Not investment, legal, or tax advice. Do your own research or consult a qualified advisor.

Trading 212

14,605 views • 1 year ago

An interesting issue with Tesla Robotaxi where it took us to a Starbucks, but the Google data had the incorrect location. At drop off we were 0.2 miles from the Starbucks, so we had a short walk. Is there a way the Tesla AI team could add functionality in the app so riders can update map info to correct errors or inaccuracies on the underlying map data and then this propagates to the fleet? Being able to do this with a pin drop on the map instead of having to use an address might make this very easy and user friendly! Or, as a bigger ask, would it be possible for the car to be able to use visual images on its own to look for a Starbucks sign and on the fly, get us closer and update the map data on its own?

Joe Tegtmeyer 🚀 🤠🛸😎

80,355 views • 1 year ago

Turn complex docs into clean, LLM-ready data! Every AI company I've talked to is solving the same problem: how do you build systems that don't hallucinate and back up every answer with proper citations? Tensorlake is a tool that extracts custom-defined structured data from any unstructured document in 3 steps: ↳ Define your schema ↳ Enable citations ↳ Extract You get RAG-ready data with precise citations and bounding boxes. Feed this to your LLM, and you'll generate responses that are citation-backed and fully auditable. This is the difference between a demo and a production system. When your AI can show exactly where it got its information, you move from proof-of-concept to something people can actually trust and deploy. I've shared the Tensorlake GitHub repo in the replies!

Akshay 🚀

58,152 views • 8 months ago