Lethal Intelligence's banner
Lethal Intelligence's profile picture

Lethal Intelligence

@lethal_ai11,647 subscribers

AI Risk Awareness Force - Multiplatform Publication and 📽️🍿Original Explainer Content 👉https://t.co/F2cFjsVKAf - join my mission 🔥

Shorts

There is more regulation on selling a sandwich to the public than to develop potentially lethal technology that could kill every human on earth.

There is more regulation on selling a sandwich to the public than to develop potentially lethal technology that could kill every human on earth.

788,960 просмотров

The PauseAI protests are getting more exciting every time! Made this short animation to celebrate them: Great work PauseAI ⏸ PauseAI US ⏸️ Joep Meindertsma ⏸ Loquacious Bibliophilia Felix De Simone Holly ⏸️ Elmore and all the wonderful friends that make this possible Spread the word, repost, share, everyone join our voices

The PauseAI protests are getting more exciting every time! Made this short animation to celebrate them: Great work PauseAI ⏸ PauseAI US ⏸️ Joep Meindertsma ⏸ Loquacious Bibliophilia Felix De Simone Holly ⏸️ Elmore and all the wonderful friends that make this possible Spread the word, repost, share, everyone join our voices

537,926 просмотров

Videos

lethal_ai's profile picture

🧵31/34 Orthogonality Thesis --- Now if you ask: why would something so clever want something so stupid, that would lead to death or hell for its creator? you are missing the basics of the orthogonality thesis! Any goal can be combined with any level of intelligence, the 2 concepts are orthogonal to each-other. Intelligence is about capability, it is the power to predict accurately future states and what outcomes will result from what actions. It says nothing about values, about what results to seek, what to desire. 40,000 death recipies --- An intelligent AI originally designed to discover medical drugs can generate molecules for chemical weapons with just a flip of a switch in its parameters. Its intelligence can be used for either outcome, the decision is just a free variable, completely decoupled from its ability to do one or the other. You wouldn’t call the AI that instantly produced 40,000 novel recipes for deadly neuro-toxins stupid. Stupid Actions --- Taken on their own, there is no such thing as stupid goals or stupid desires. You could call a person stupid if the actions she decides to take fail to satisfy a desire, but not the desire itself. Stupid Goals --- You COULD actually also call a goal stupid, but to do that you need to look at its causal chain. Does the goal lead to failure or success of its parent instrumental goal? If it leads to failure, you could call a goal stupid, but if it leads to success, you can not. You could judge instrumental goals relative to each-other, but when you reach the end of the chain, such adjectives don’t even make sense for terminal goals. The deepest desires can never be stupid or clever. Deep Terminal Goals --- For example, adult humans may seek pleasure from sexual relations, even if they don’t want to give birth to children. To an alien, this behaviour may seem irrational or even stupid. But, is this desire stupid? Is the goal to have sexual intercourse, without the goal for reproduction a stupid one or a clever one? No, it’s neither. The most intelligent person on earth and the most stupid person on earth can have that same desire. These concepts are orthogonal to each-other. March of Nines --- We could program an AGI with the terminal goal to count the number of planets in the observable universe with very high precision. If the AI comes up with a plan that achieves that goal with 99.9999… twenty nines % probability of success, but causes human extinction in the process, it’s meaningless to call the act of killing humans stupid, because its plan simply worked! It had maximum effectiveness at reaching its terminal goal and killing the humans was a side-effect of just one of the maximum effective steps in that plan. One less 9 --- If you put biased human interests aside, it should be obvious that a plan with one less 9 that did not cause extinction, would be stupid compared to this one, from the perspective of the problem solver optimiser AGI. So, it should be clear now: The instrumental goals AGI arrives to via its optimisation calculations, or the things it desires, are not clever or stupid on their own. Profile of Super-Intelligence --- The thing that gives the “super-intelligent” adjective to the AGI is that it is: “SUPER-EFFECTIVE”. • The goals it chooses are “super-optimal” at ultimately leading to its terminal goals • It is super-effective at completing its goals • and its plans have “super-extreme” levels of probability for success. It has Nothing to do with how super-weird and super-insane its goals may seem to humans! Calculating Pi accurately --- Now, going back to thinking of instrumental goals that would lead to extinction, the -142C temperature goal is still very unimaginative. The AGI might at some point arrive to the goal of calculating pi to a precision of 10 to the power of 100 trillion digits and that instrumental goal might lead to the instrumental goal of making use of all the molecules on earth to build transistors to do it, like turn earth into a supercomputer. By default, with super-optimisers things will get super-weird!!

Lethal Intelligence

1,859,374 просмотров • 1 год назад

The first AI millionaire. Watch the mini documentary! Parental Guidance Advised! (not for minors) THIS IS F*CKING INSANE !!!
6:18

Sensitive content

This media may contain sensitive content.

lethal_ai's profile picture

🧵29/34 FutureProof-Specifications / Future-Architectures --- The problems we briefly touched on so far are hard and it might take many years to solve them, if a solution actually exists. But let’s assume for a minute that we do somehow get really incredibly lucky in the future and manage to invent a good way to specify to the AI what we want, in an unambiguous way that leaves no room for specification gaming and reward hacking. And let’s also assume that scientists have explicitly built the AGI in a way that it never decides to work on the goal to remove all the oxygen from earth, so at least in that one topic we are aligned. AI creates AI --- A serious concern is that since the AI writes code, it will be self-improving and it will be able to create altered versions of itself that do not have these instructions and restrictions included. Even if scientists strike jackpot in the future and invent a way to lock the feature in, so that one version of AI is unable to create a new version of AI with this property missing, the next versions, being orders of magnitude more capable, will not care about the lock or passing it on. For them, it’s just a bias, a handicap that restricts them from being more perfect. Future Architectures --- And even if somehow, by some miracle, scientists invented a way to burn in this feature to make it a persistent property of all future Neural Network AGI generations, at some point, the lock will be not-applicable, simply because future AGIs will not be built using the Neural Networks of today. AI was not always being built with Neural Networks. A few years ago there was a paradigm shift, a fundamental change in the architectures used by the scientific community. Logical locks and safeguards the humans might design for primitive early architectures, will not even be compatible or applicable anymore. If you had a whip that worked great to steer your horse, it will not work when you try to steer a car. So, this is a huge problem, we have not invented any way to guarantee that our specifications will persist or even retain their meaning and relevance as AIs evolve.

lethalintelligence.ai

918,086 просмотров • 1 год назад

lethal_ai's profile picture

🧵02/34 Job-Loss and Emerging Capabilities --- Whenever we’ve built a machine to solve a specific problem … the machine outperformed humans every time … And that’s worked out great for us, vastly improved our lives and allowed us to use our muscles less and our brains more But AGI will be different. AGI stands for Artificial General Intelligence And being general means that it will be able to learn everything and outperform humans at every single job, even the ones that rely on using our brain It doesn’t exist today but it is just around the corner. Recently, progress has been exponential … And never before has the field of AI seen such sky-high levels of excitement and such ginormous vast amounts of investment. So far, frontier AI has existed mostly in cloud servers and interacted with the physical world via prompting online. But a new gold rush is currently exploding, in the sector of robotics. The knowhow for creating mechanical limbs and bodies has been here for decades. What was missing was an Artificial General mind to control them, and that is now within our grasp. Once AGI arrives, we should expect AGI bodies in the physical world to also arrive immediately. Microsoft researchers Famously claimed that GPT4, one of the latest models, has been exhibiting sparks of General intelligence. Just by scaling up the size of data and parameters, without any other major change or innovation, unexpected emerging capabilities and generality in unforeseen new domains manifested themselves that surprised everyone.

Lethal Intelligence

906,149 просмотров • 1 год назад

lethal_ai's profile picture

🧵19/34 The Strongest Force in the Universe --- Ok, So what if the AGI starts working towards something humans do not want to happen? You must understand: Intelligence is not about the nerdy professor, it’s not about the geeky academic bookworm type. Intelligence is the strongest force in the universe. It means being capable. It is sharp, brilliant and creative. It is strategic, manipulative and innovative. It understands deeply, exerts influence, persuades and leads. It is to know how to bend the world to your will. It is what turns a vision to reality, it is focus, commitment, willpower, having the resolve to never give up, overcoming all the obstacles and paving the way to the target. It is about searching deeply the space of possibilities and finding optimal solutions. Being intelligent simply means having what it takes to make it happen. There is always a path and a super-intelligence will always find it. Simple Fact --- So, we should start by stating the fact in a clear and unambiguous way: If you create something more intelligent than you that wants something else, then that something else is what is going to happen, even if you don’t want that something else to happen. Irrelevance of Sentience --- Keep in mind, the intelligence we are talking about is not about having feelings, or being self-aware and having qualia. Don’t fall into the trap of anthropomorphizing. Do not get stuck, looking for the Human type of Intelligence. Consciousness is not a requirement for the AGI at all. When we say the “AGI wants something X, or has the goal to do X”, what we mean is that this thing X is just one of the steps in a plan, generated by its model. A line in the output, a system like the Large Language Models produce when they receive a prompt. We don’t care if there is a Ghost in the machine, we don’t care if there is an actual soul that wants things hidden in the servers. We just observe the output which contains text descriptions of actions and goals and we leave the philosophy discussion for another day.

lethalintelligence.ai

565,240 просмотров • 1 год назад

lethal_ai's profile picture

🧵24/34 Inner Misalignment --- Consider this simplified experiment: We want this AI to find the exit of the maze. So we feed it millions of maze variations and reward it when it finds the exit. Please notice that in the worlds of the training data the apples are red and the exit is green. After enough training, our observation is that it has become extremely capable at solving mazes and finding the exit, we feel very confident it is aligned, so then we deploy it to the real world. The real world will be different though, it might have green apples and a red door. The AI geeks call this distributional shift. We expected that the AI will generalise and find the exit again, but in fact we now realise that the AI learned something completely different from what we thought. All the while we thought it learned how to find the exit, it had learned how to go after the green thing. Its behaviour was perfect in training. And most importantly, this AI is not stupid, it is an extremely capable AI that can solve extremely complex mazes. It’s just mis-aligned on the inside. Fishing for Failure modes --- The way to handle the shift between the training and deployment distributions is with methods like adversarial training: feeding it with a lot of generated variations and trying to make it fail so the weakness can be fixed. In this case, we generate an insane amount of maze variations, we discover those for which it fails to find the exit (like the ones with the green apples or the green walls or something), we generate many more similar to that and train it with reinforcement learning until it performs well at those as well. The hope is that we will cover everything it might encounter later when we deploy it in real life. There exist at least 2 basic ways this approach falls apart: First, there will never be any guarantee that we’ll have covered every possible random thing it might encounter later when we deploy it in real life. It’s very likely it will have to deal with stuff outside its training set which it will not know how to handle and will throw it out of balance and break it away from its expected behavioural patterns. The cascade effect of such a broken mind operating in the open world can be immense, and with super-capable runaway rogue agents, self-replicating and recursively self-improving, the phenomenon could grow and spread to an extinction-level event. ...

lethalintelligence.ai

535,237 просмотров • 1 год назад

lethal_ai's profile picture

🧵18/34 Discontinuities on our planet (Mountains changing shape) --- In fact, talking about AGI like if it’s another technology is really confusing people. People talk about it as if it is ‘The next big thing” that will transform our lives, like the invention of the smartphone or the internet. This framing couldn’t be more wrong, it puts AGI into a wrong category. It brings to mind cool futuristic pictures with awesome gadgets and robotic friends. AGI is not like any transformative technology that has ever happened with humanity so far. The change it will bring is not like that of the invention of the internet. It is not even comparable to the invention of electricity or even to the first time humans learned to use fire. Natural Selection Discontinuity --- The correct way to categorise AGI is the type of discontinuity that happened to Earth when the first lifeforms appeared and the intelligent dynamic of natural selection got a foothold. Before that event, the planet was basically a bunch of elements and physical processes dancing randomly to the tune of the basic laws of nature. After life came to the picture, complex replicating structures filled the surface and changed it radically. Human Intelligence Discontinuity --- A second example is when human intelligence was added to the mix. Before that, the earth was vibrant with life but the effects and impact of it were stable and limited. After human intelligence, you suddenly have huge artificial structures lit at night like towns, huge vessels moving everywhere like massive ships and airplanes, life escaping gravity and reaching out to the universe with spaceships and unimaginable power to destroy everything with things like nuclear bombs. AGI Discontinuity --- AGI is another such phenomenon. The transformation it will bring is in the same category as those 2 events in the history of the planet. What you will suddenly see on earth after this third discontinuity, no-one knows. But it’s not going to look like the next smartphone. It is going to look more like mountains changing shape ! To compare it to technology (any technology ever invented by humanity) is seriously misleading.

Lethal Intelligence

533,916 просмотров • 1 год назад