正在加载视频...

视频加载失败

Building a RAG system that works with real-life documents is crazy hard. Why is nobody talking about this? (I'll show you what a complex document looks like in the attached video. Good luck with the 10-line code demos if you want to deal with this.) All I see online...

75,780 次观看 • 1 年前 •via X (Twitter)

10 条评论

Δημήτρης 的头像
Δημήτρης1 年前

have you seen looks similar and promising?

Santiago 的头像
Santiago1 年前

I haven't seen it, no. Thanks for sharing it!

Daniel Davis 的头像
Daniel Davis1 年前

We agree. That's why we build @trustgraphAI. TrustGraph is a full model agnostic AI Engine with native GraphRAG. No need to build anything. Deploys with Docker or Kubernetes.

Erik 的头像
Erik1 年前

I thought that building RAG systems is easy until I tried to build one. If you want it to work well it is really hard.

Santiago 的头像
Santiago1 年前

Yeah, building a demo is very simple. You can understand the basic components in a few minutes. But doing anything serious with it is really complex.

Shane 的头像
Shane1 年前

Really amazing! How can one find the documention to install it in your own Kubernetes cluster and run it locally? I've only been able to find the SaaS option...

AK⚡ 的头像
AK⚡1 年前

The problem is, in the most of the enterprise use cases, where the data is highly regulated, they do not approve open source frameworks.

Santiago 的头像
Santiago1 年前

Actually, I've seen two different approaches: 1. Enterprises that don't approve closed-systems running outside their premises. They would rather use open-source. 2. Enterprises that don't approve open-source software that doesn't come with certain guarantees or is too complex to set up.

Philip Collins 的头像
Philip Collins1 年前

Or just pass your pdf's to gemini files API. Is working very well for me with 15 documents, with 10 to 165 pages each, all in a single prompt. Gemini finds and extracts multiple small details and builds and returns a compliant JSON response.

Santiago 的头像
Santiago1 年前

If this is working for you, that's awesome. But in many cases, relying on the output of the model is not enough to build a reliable retriever. Notice that here, the process is not just about "understanding the document," but augmenting it with contextual information as well.

相关视频

Here is how you can install an open-source, enterprise-grade RAG system on your server (with the best document understanding I've seen.) First, something obvious to anyone trying to sell RAG in the market: You are crazy if you think companies will let their data travel to a hosted model. No one wants to send their data anywhere (those who do haven't found an alternative.) Every single company would rather have an air-gapped system with no internet access. GroundX is an open-source RAG system that you can run on your servers (or any cloud provider, as long as you have access to GPUs) and works without a network. (If the military wants to do RAG, this is precisely what they will be looking for.) I installed GroundX on my AWS account and recorded a video to show you how to use it. There are two services you can use: 1. Ingest: This service uses a pretrained vision model to ingest and understand your knowledge base. 2. Search: This service combines text and vector search with a fine-tuned re-ranker model to retrieve information from your knowledge base. A quick note about the Ingest service: 99% of people think they need better "retrieval" mechanisms. I think they need better "ingestion." That's where this service comes in! Ingest "understands" your documents in a way I haven't seen before. After you try it, you'll realize why showing your LLM your raw documents is a bad idea. In the video, I use a free tool called X-Ray to test a document and understand how the Ingest service breaks it down. You can access this tool by signing up for a free GroundX cloud account and uploading your documents. You'll see a bit more about this in the video.

Santiago

89,624 次观看 • 1 年前

Culture is genetic because behavior is genetic. This beaver never saw a dam in its life. No beavers or anything else ever taught it to build a dam. It wants to build a dam because it is a beaver. Many beavers together build a big dam. That is beaver culture. Humans are not different. Nothing is different. This is what life is. This is how life works. Your body is your mind. A caterpillar wants to build a chrysalis. A bee wants to build a hive. A lion wants to build a pride. You are not special. You are not above your nature. you are INSIDE of it. The thoughts that we think are genetic thoughts. The crimes we commit are genetic crimes. The art we create is genetic art. Just like this beaver, you can give the animal different sticks and it will build a different dam, but it will always build a dam. And you can give humans different "education," but the human will always use it to do what its genes tell it to do. This is the first big answer that you need. This is the biggest piece of the puzzle. This is how to understand people 90% of the way. You just... notice what they do, and get out of the way, and watch them do it. And if they need sticks, you give them sticks. And if you don't like what they do, you have to get away from them. You cannot train dam-building into them or out of them any more than you can with a beaver. A beaver wants to build a dam because it is a beaver. Whatever you see people build, that's what they wanted to build from the sticks they got in the river they were in. Stop pretending you can change it.

hoe_math = PsychoMath

1,189,466 次观看 • 10 个月前

"You can either produce excellence or you can avoid criticism. But you cannot do both of those. The reason that you don't have certain excellence that you want is because you are afraid of getting criticized. You are afraid of the judgment that comes with it. You are afraid of standing out. You are afraid of being alone. You are afraid of people looking at you. You are worried about what people think of you. There are 2 categories of things in this world: 1) Things that are up to you 2) Things that are not up to you Which category does your reputation sit in? Your reputation is not up to you. I'm the one who associates your reputation with something, not you. You just do things. What's up to you? How you act. Your decisions. Your actions. That is up to you. Your reputation is not up to you. Here's how I know that: You all have a reputation about me and it's not in my control. I get to say and do whatever I say and do up here. I am in control of saying it. I am in control of doing it. The moment words leave my lips, who has control over what is done with those words? You! You are in control of what you think of me. And there's no way everybody in this room is going to think the exact same thing about me. No way. When it comes to exceptional, what we've got to understand is you can spend your whole life trying to avoid criticism and earn reputation, and it still won't be in your control. We can waste a lot of time missing out on excellence we could have been producing if we were just simply LESS trying to engineer what we wanted other people to think about us."

Brian Kight

308,812 次观看 • 1 年前

Vallée and the Closed System: Are We Prisoners? 🧠👽 Vallée: "Are We Being Taken Over by a Species from Somewhere in Space That's Vastly More Intelligent Than We Are?" 👽🧠 "..the simulation...was a new concept that I initially rejected." ~Vallée "Is it looking for us to try to interact with it as equals or with parity?" ~Scafish "Nobody says that to Congress, and I think Congress should hear it." ~Vallée If, "it's a closed system, we're like prisoners and something is going to happen to us, and there is very little we can do." ~Vallée Turn the thermostat dial. "If the temperature doesn't change, then I know I'm inside a control system. So, we can do the same thing with UFOs, but we have to react. We have to, number one, acknowledge that it exists, and number two, we have to react to it." ~Vallée ~~~ I've been wanting to share this one for a while... Vallée: "So, he said, the question you have to ask about UFOs is, number one, is it a natural system or an artificial system...control system. And if it is a control system, is it open or closed? In other words, are we being taken over by a species from somewhere in space that's vastly more intelligent than we are?" (I've never heard him even suggest that possibility.) Vallée: "You know, as Dr. Garry P. Nolan says, you know, people who have had ten, you know, scientific revolutions, or a hundred or a thousand, and come here with superior science to do something... And in which case, you know, it's a closed system, we're like prisoners and something is going to happen to us, and there is very little we can do. Or, is it an open system where we can, in fact, communicate with it. And if we can communicate with it, then the question for me as an information scientist is, what are the modalities of the interaction, you know? It's not just can we learn their language? And they say, you know, 'We come in peace to save mankind,' or something. Or 'We will give you the cure for cancer' or something. I don't think it's at that level." (Will we ever be able to get answers to these extremely important questions? If it's an open system, how do we communicate with it? How do we provoke it to react? We know it reacts to anything nuclear but we still don't know why. This is why we need the USG (and other governments) to present evidence that shows the masses this is real and extremely important for our species to investigate. If that evidence exists and is shown, we'll have an easier time getting the world's best minds to join the effort in figuring out the best way to answer these questions. We still may fail but we should at least try.) Vallée: "I think it's a meta-system. It's not a system. And that's my fear...if we can circle back to your earlier question about, you know, about NIDS and about BAASS, what we did for the government and what we did for the Defense Intelligence Agency. Half of the budget was spent developing, you know, a super database. And we don't know where it went. I mean, I'm not cleared to know where it went." (On the contractor (BAASS) side, Bigelow should have all copies of what AAWSAP produced. And on the DIA side, Lacatksi said he put all of the digital files in a specific place that he didn't name. As long as someone didn't delete it all, it should still be there. Vallée has said that the Capella database has about 250,000 cases from around the world.) Vallée: "But that would be a very interesting question, because the people who are getting [the database] are getting raw data, which we have very well organized, all in English. So they have the luxury of, you know, we had five translators from French, English, Portuguese, Russian, you know, everything was translated in a single structure across fourteen databases. "That's what we need to answer the question about the control system, and it's not being done. And we hired a whole team that we had trained to work on it. So to rebuild that will take the next ten or fifteen years. And nobody says that to Congress, and I think Congress should hear it, because it's our money." (As long as names and personal details are scrubbed from that database, there is no reason NOT to release it to the public. This way, we can take it and use AI to help decipher patterns and maybe answer some of these questions. Can Congress help us get access to that database?) Vallée: "When you ask, is it a control system? That's a big question." Peter Scafish Peter Skafish - "You asked, at one point, whether the system is open or closed, and you said, additionally, I believe, if it's open, that it would be possible to communicate back to it. And it sounds to me like that's the key question for you. Is, if you can understand what what the system of symbols is, or the modalities of communication, then you can understand enough to engage in some kind of communication, or at least give some kind of response to show that you understand." Vallée: "Yes." Scafish: "So then the question, and we have a member named Jacqueline, who has asked this. Could the system be stimulating us - provided there is such a system - to interact with it, more as subjects or agents than as something like animals or objects? Is it looking for us to try to interact with it as equals or with parity?" (When people report getting injured or sick from being in close proximity to UAP, it suggests to me that the phenomenon won't go out of its way to avoid affecting us in a negative manner if we get in their its/way (assuming it even knows that close encounters with UAP are not good for humans). Kind of like how we treat lower lifeforms. If we encounter a wild rabbit crossing the road, many of us will do our best to avoid it, but not if it means damaging our car or ourselves in an accident. It may ruin our day if we hit it, but it won't stop us from driving again in the future. Do NHI have bad days if their tech injures us or makes us sick? I have no clue.) Vallée: "Well, what I saw in the notes you gave me, is she was also asking: Is it a control system because we think it is? And that's a very interesting question. Because we react to the UFO phenomenon, or the UAP phenomenon. And, you know, at this point when I think about what I'm going to do next in this research, if I'm given the the opportunity to live a little longer, I'm not going to go back and write any more computer programs. There are better people to do that now, they have the data, and we're in a different phase now. We're in a whole different system. I have the luxury of doing some experiments I wanted to do for a long time." (Would have liked Scafish to ask him: What types of experiments?) Vallée: "So, if you think you are inside the control system, there are things that you can do. Or, if you think you're inside the simulation, you know, which was a new concept that I initially rejected, and then, you know, Ray's (Kurzweil?) work and others have brought it back to the forefront. And we have to ask that at the same time. Can we test it? How would you test it? Well, if, you know, I'm here in my apartment, and the temperature is constant in this apartment. But outside, I can see it's cold, or I can see the sun is out and it's warm, and how come it's constant here? So this would lead me to think that there is a control system, namely a thermostat, that is somewhere. "So I can start looking around the walls, and if I see dial, I can turn it, or I could start a fire and see what happens, see if the temperature changes. If the temperature doesn't change, then I know I'm inside a control system. So, we can do the same thing with UFOs, but we have to react. We have to, number one, acknowledge that it exists, and number two, we have to react to it."

Joe Murgia

27,613 次观看 • 6 个月前

I hear so often from the Dommes I work with that they struggle with people online fetichizing them and simply seeing them for how sexy and beautiful they are. They project their fantasies and their desires onto you. That stops immediately once you move the attention from you to them. From 'look at me' to 'I see you'. What does that look like? When you create content, think of them and what this scene or that narrative is evoking. What will they learn from you? What they want is not to passively watch how sexy you are, but for you to train them, to give them instructions, to teach them, to guide them, to be in charge, to command them. This is not being an object but the main subject. The Authority figure. How is your content already doing that. The sexy photos can still be there, they are important to already capture des attention. But what you do with that attention once you have it, is where the power dynamic is established. Positioning yourself as more than a stunning Goddess, but actually a woman who has a voice, opinions, perspective, a philosophy, a way to doing things, teaching them what you like, how you like it, why you like it, already makes them want to be that for you. You hold the attention, you hold the power, so you direct it. And for that, you want them to know you get them and you know what lives within them... that creates the desire for you to be the one exposing it. You instantly build trust. Not because you demanded it, but because you earned it: you showed them you know what you are doing. You have experience, you understand them. They are not told to come see you, they are seduced into it. They desire it. And they will work for it. This will attract better clients (real subs) and instead of you trying to get their attention, they will work to earn yours. If you want to learn more about power dynamics, building a brand as a Pro or the psychology behind BDSM, you can now access all my trainings and classes in one place for a fraction of the cost of The Dominatrix Academy. And you can reinvest the total amount towards the Program. Message me [SECRET] for the details. This offer is not available on my website.

Ms. Malissia

14,790 次观看 • 2 个月前

Garry Nolan says there are more groups doing what skywatcher is doing right now “we know how to call them” “Skywatcher is one group of several that I'm aware of that are doing it independently.” Source -Sol Foundation 🔗 in comments Garry -“The, the information's out there, we, you know, it's already pretty well understood. I mean, look, there's been enough whistleblower types where the information of how to do this has leaked out. You know, we know how to call them. Whether you believe in psionics or not, it seems to be part of the process. So we know how to call them. The question is not can you video them? Skywatcher has already shown that you can video them and there'll be more of that kind of stuff, I think coming in the future, you know, so Skywatcher is one group of several that I'm aware of that are doing it independently. So that's citizen science. I mean, I think the answer is you don't wait for the government to do it, for you. Don't wait for daddy or mommy to tell you what's going on. You just do it yourself. Because as long as you're not going out there with, with guns or energy, weapons, trying to pull something down and, you know, get yourself in a bad situation, there's no reason people can't do it themselves and organize. So, you know, that's, that I think is the threat in a way that one needs to use against the governmental authorities who think that they hold all the, all the, all the marbles at this point, they don't anymore because the people who've been in the program, like Jake and others who've, you know, made that statement publicly, have basically made their knowledge and ability public. So do it.”

neandrewthal

74,654 次观看 • 1 年前

The People Of America Have Been Deceived. They’ve Been Cheated & As A Result Our Country & World Is Now In Turmoil. It all Started With A Fake Barack Obama Birth Certificate EXPERTS CONFIRM Obama’s Certificate Was A Fraud “Today you're going to hear lots of information that some of you are going to understand and going to be able to tell the true story. In fact, please know that this is a very technical, but the evidence is clear if you'll pay attention. Please note you're going to hear about two separate experts. These experts are two separate continents with no knowledge of each other and they draw similar conclusions. Again, that said, I know some of you are going to get this story and are going to tell the story the way it was.” —- “We and anyone else who dared to question the document have been the line falsely labeled grossly criticized in the bulk of the media on certain internet sources for years. Today we're going to set the record straight. I believe you will be shocked by what you hear and see today.” —- “Like the sheriff just told you, when you conduct criminal investigations, you have to let the evidence lead you. You never lead the evidence. And in doing this, my motive was to clear the document. Because to be quite honest with you, I didn't believe it. I didn't believe this was possible. I didn't think this would ever happen in this nation. I didn't believe it.” — “Back in 2012, I told you about Reed Hayes, a document examiner. Let me tell you about Reed Hayes, a man with 40 years, since 1974, 40 plus years of experience in examining forensic document, handwriting, a man who's well respected in his expertise, a court recognized expert, a document examiner. He is the man you go to when somebody gives you a bad check with a bad signature. This is the guy you run to. Law firms use him all the time. He's been maligned. And let me tell you something about Mr. Hayes. When I contacted Mr. Hayes, Mr. Hayes told me right off the bat, I'm an Obama supporter. I voted for him twice. He goes, and I will never do anything to hurt the President of the United States. What I had said to him was, Reed, I am not asking you to hurt the President of the United States. I'm asking you to take a look at this document and clear it and tell me there's nothing wrong with it. Would you at least do that? And he took a look at it. And when he called me back, he told me, Mike.” “I can't clear this, there's something wrong with it." And I asked him, I said, Reed, would you continue? I said, I know your position, but would you continue? And his answer to me was, this is what I do. I'll look at it, I'll do it. That's a man of integrity, respecting what his ability is to get to the truth. Because for Sheriff Apoyo and myself, this was never about Barack Obama. This is about a document. You take that document and you remove the name, Barack Hussein Obama, and put your name on there. If it was your document and it was brought to us, we would do the same thing with this document.” They COULD NOT clear the document. Much more info in this video if you watch the whole thing

Wall Street Apes

1,140,768 次观看 • 2 年前

THIS ONGOING DEMOLITION IN DELTA STATE SHOULD TELL YOU THAT IF YOU BUILD ANYWHERE IN NIGERIA WITHOUT HAVING TITLE DOCUMENTS, YOU ARE WASTING YOUR MONEY. When it comes to real estate and demolition of properties, only a heartless person wouldn't sympathize over such a huge loss. However, let's learn to do things how it's done so that it can be how it should be. Regardless of wherever you are in Nigeria, don't buy land without title documents. They won't ask you if you are Igbo or Yoruba. They will ask for your documents. Receipts, survey plans, and contracts of land sale aren't title documents. In Europe, America, South Africa, or anywhere the principle of "Quicquid plantatur solo, solo cedit" (whatever is attached to the land belongs to the landowner) is applicable, you can't just build on land that you don't have its title document. It will end in tears. See, the law is the law. If you have title documents and your property is earmarked for demolition because the government needs it, you will be adequately compensated. But if the land isn't yours, no one will compensate you. And that's why when buying lands, you must make sure you get the title documents: either excision, C of O, or Governor's consent. With registered or provisional survey, receipts, deeds of assignments, court judgments, or gazettes that prove the seller has the right to sell what they are selling and what they are selling is in good standing. Real estate is an investment. You make better investment decisions when you have knowledge of the investment. Lastly, real estate, like all investment, is risky. You don't negate risk; you mitigate it.

🏘️IamEri'Oluwa🇳🇬

16,501 次观看 • 2 年前

🛑 Burkina Faso 🇧🇫 - Captain with school boys and girls! The young Captain was having a conversation with the pupils, and here is what he saying, “I was telling you a while ago, in school they were telling us that we couldn’t do it here. They lied to us. We grow wheat here, and it works well, and we will develop it. Some people have started, this year, I was able to see people who did it, as part of the presidential initiative, and I was told that in the past, some were able to do it and they produced it well. Currently, we are sowing wheat in some farmlands as part do the presidential initiative. What you eat must be produced here. So, this is why I say that we will teach you many things, and we will review the curricula they teach you. For those who drink coffee, they told us that your coffee, chocolate, it is only in the countries with abundant rainfalls, that here is only savanna, desert, it does not rain, we cannot farm. Again, they lied to us. It’s not true! Coffee grows well here, cocoa grows well too. There are people here who have the farms here, even in Ouagadougou here, there are people who have cocoa trees in their yards. This means that, chocolate that children envy those from well to do familes can be manufactured here in Burkina and all the children can eat chocolate in Burkina. We found out it is possible. As for milk, why do we have to import it? We can do it. I just want to tell you that there are many things that they never told us the truth about. You guys are lucky, we are now teaching you, and we promise you that we will do all we can so that you can eat your fill. As we say, you will eat well in the morning before you go to school, you will go to school for free, you will eat lunch, you will have fun, and in the afternoon, when you return home, you will have fun in the neighborhood, then in the evening, you will learn and review your homework and sleep. This is the dream we have. As long as the children in Burkina are not in these conditions, our fight will not stop. Ok? (Claps). So, we know these are your aspirations and it is right and legal. Any parent is fighting for this. Even those who do not have children fight in the hope of having children and to take care of them, so that they can live in better conditions, and be better than them. This is the fight of everyone, this is the fight of every generation. We are lucky God gave us everything. Do you know that everywhere in Burkina we can farm? Everywhere! In the Sahel where they tell you it is the desert, it is only sand, we can farm. As for us, we have been lied to so much, it is the brainwashing of the colonizer. He did that so that we may not think 💭. But we finally found out that everything was a lie ( damn lie, emphasis is mine). If God left many lakes in that desert, He knows why. We can farm everything in Burkina, we can do everything, the land is fertile. And there are so many natural things in Burkina that we never planted but they were here, isn’t it ? Have you ever learned how to plant a shea tree in Burkina? You were born and found them already here right? It is there in the wild in nature. You know it is a gift from God. There are many things in the shea fruit. You have the shea butter, that is oil; do you know that there is chocolate in it? There are seven derivatives in the shea fruit. You also have the Parkia biglobosa (also known as the African locust bean) which is a natural fruit. We have many things, it is not only the minerals in the soil. Even with the soil, we were told that it’s ferralitic soil, that it is not fertile, everything is a lie. You see that today there is so much gold in Burkina. But it is just poorly managed. Our mission is to well manage these resources, and to take good care of you, so that you can be in your basic rights, to lead a good life, to go to school, and that we may protect you. And also that you may fulfill your duties, because your duties are very important, aren’t they?…

Sy Marcus Herve Traore

94,967 次观看 • 2 年前