Video yükleniyor...

Video Yüklenemedi

Bu video yüklenirken bir sorun oluştu. Bu geçici bir ağ sorunundan kaynaklanıyor olabilir veya video kullanılamıyor olabilir.

Ana Sayfaya Dön

o1-engineer is here! 🚀 A coding assistant built from the ground up to leverage o1 reasoning capabilities. It can create and edit multiple files or entire folders, plan complex projects, execute them, and write code reviews. All from your terminal 👨‍💻

Pietro Schirano

108,749 subscribers

309,428 görüntüleme • 1 yıl önce •via X (Twitter)

Bilim & Teknoloji Eğitim Komedi

Anya Rossi• Live Now

Private livecam show

11 Yorum

Pietro Schirano profil fotoğrafı

Pietro Schirano1 yıl önce

💬 You can chat regularly or execute commands: /create to create files or folders /edit to edit file, files, or folder content /add to add files or folders to the chat context /planning to create detailed plans /review to create code reviews that you can use directly

Pietro Schirano profil fotoğrafı

Pietro Schirano1 yıl önce

Repo here. Star the repo so you can keep track of updates and improvements. ⭐️

Pietro Schirano profil fotoğrafı

Pietro Schirano1 yıl önce

Fun fact about building this. The first functionality I built was /edit, so I used the script itself to keep refining it and understand what sort of instructions work better with o1.

Pietro Schirano profil fotoğrafı

Pietro Schirano1 yıl önce

So far the biggest performance improvement is in the editing and creation of large files. I was able to edit 2000 lines of Python in one shot with no errors.

Pietro Schirano profil fotoğrafı

Pietro Schirano1 yıl önce

In order to assure the best results with editing, the script employs an agentic approach, where an instance of o1-mini provides edit instructions, which a second one applies. A paradigm I created with Omni-Engineer, so I brought it back here since it works really well with o1.

Roshan Patel profil fotoğrafı

Roshan Patel1 yıl önce

can i fork this? applying to yc soon

Pietro Schirano profil fotoğrafı

Pietro Schirano1 yıl önce

Lmao

nisten - e/acc profil fotoğrafı

nisten - e/acc1 yıl önce

wait so i can use any openai compatible api to try this on? would .. qwengineer work lol

Pietro Schirano profil fotoğrafı

Pietro Schirano1 yıl önce

Yes, it's written in OpenAI format, so it should work lol. The only thing is that o1 doesn’t support system prompts, so everything is built as a regular chat completion. And function calling is all made from scratch with json and parsing.

Court Reinland profil fotoğrafı

Court Reinland1 yıl önce

I like this tool. Thank you. But my initial tests say Claude engineer is still outperforming my simple needs. Claude 3.5 is hard to beat at coding. I tried building same tool in this and in Claude. Claude engineer performed better. Perhaps if I have a more complex need…

Pietro Schirano profil fotoğrafı

Pietro Schirano1 yıl önce

Yeah I made Claude Engineer too! :)

Benzer Videolar

OpenAI just announced API access to o1 (advanced reasoning model) yesterday. I'm delighted to announce today a new short course, Reasoning with o1, built with OpenAI, and taught by Colin Jarvis, Head of AI Solutions at OpenAI, to show you how to use this effectively! Unlike previous language models which generate output directly, o1 “thinks before it responds,” and generates many reasoning tokens before returning a more thoughtful and accurate response. It is great at complex reasoning -- including planning for agentic workflows, coding, and domain-specific reasoning in STEM fields like law. But how you should use it is quite different from other LLMs. I think o1 will be a game changer for many AI applications; and in this course, you'll learn how to use it effectively. In detail, you’ll: - Learn to recognize what tasks o1 is suited for, and when to use a smaller model, or combine o1 with a smaller model - Understand the new principles of prompting reasoning models: Be simple and direct; no explicit chain-of-thought required; use structure; show rather than tell - Implement multi-step orchestration in which o1 plans, and hands tasks over to gpt-4o-mini to execute specific steps; this illustrates a design pattern to optimize intelligence (accuracy) and cost - Use o1 for a coding task to build a new application, edit existing code, and test performance by running a coding competition between o1-mini and GPT 4o - Use o1 for image understanding and learn how it performs better with a "hierarchy of reasoning," in which it incurs the latency and cost upfront, preprocessing the image and indexing it with rich details so it can be used for Q&A later - Learn a technique called meta-prompting, in which you use o1 to improve your prompts. Using a customer support evaluation set, you'll iteratively use o1 to modify a prompt to improve performance You'll also learn about how OpenAI used reinforcement learning to produce a model that uses "test-time compute" to improve performance. I think you'll find this course enjoyable and valuable. Please sign up for it here:

OpenAI just announced API access to o1 (advanced reasoning model) yesterday. I'm delighted to announce today a new short course, Reasoning with o1, built with OpenAI, and taught by Colin Jarvis, Head of AI Solutions at OpenAI, to show you how to use this effectively! Unlike previous language models which generate output directly, o1 “thinks before it responds,” and generates many reasoning tokens before returning a more thoughtful and accurate response. It is great at complex reasoning -- including planning for agentic workflows, coding, and domain-specific reasoning in STEM fields like law. But how you should use it is quite different from other LLMs. I think o1 will be a game changer for many AI applications; and in this course, you'll learn how to use it effectively. In detail, you’ll: - Learn to recognize what tasks o1 is suited for, and when to use a smaller model, or combine o1 with a smaller model - Understand the new principles of prompting reasoning models: Be simple and direct; no explicit chain-of-thought required; use structure; show rather than tell - Implement multi-step orchestration in which o1 plans, and hands tasks over to gpt-4o-mini to execute specific steps; this illustrates a design pattern to optimize intelligence (accuracy) and cost - Use o1 for a coding task to build a new application, edit existing code, and test performance by running a coding competition between o1-mini and GPT 4o - Use o1 for image understanding and learn how it performs better with a "hierarchy of reasoning," in which it incurs the latency and cost upfront, preprocessing the image and indexing it with rich details so it can be used for Q&A later - Learn a technique called meta-prompting, in which you use o1 to improve your prompts. Using a customer support evaluation set, you'll iteratively use o1 to modify a prompt to improve performance You'll also learn about how OpenAI used reinforcement learning to produce a model that uses "test-time compute" to improve performance. I think you'll find this course enjoyable and valuable. Please sign up for it here:

Andrew Ng

357,661 görüntüleme • 1 yıl önce

Introducing Claude Engineer 🧑‍💻 My new repo that allows Sonnet 3.5 to create, read and edit coding files and folders via a simple chat. Think of it as a streamlined local “artifacts” with the added advantage of creating entire projects directly. It even supports online search!

Introducing Claude Engineer 🧑‍💻 My new repo that allows Sonnet 3.5 to create, read and edit coding files and folders via a simple chat. Think of it as a streamlined local “artifacts” with the added advantage of creating entire projects directly. It even supports online search!

Pietro Schirano

250,642 görüntüleme • 2 yıl önce

This is the future of terminals!!! Tell the terminal what to do in English and let it execute the commands for you. No need to search for and remember esoteric commands anymore. Create projects from scratch, review your PRs and start projects simply by using English. Warp 🤖 takes the terminal to the next level. 1. Review code changes This prompt reviews my code changes before pushing them to the remote branch. It even takes parameters for extra context such as: - my coding style guide - the existing pull request link - the technologies used

This is the future of terminals!!! Tell the terminal what to do in English and let it execute the commands for you. No need to search for and remember esoteric commands anymore. Create projects from scratch, review your PRs and start projects simply by using English. Warp 🤖 takes the terminal to the next level. 1. Review code changes This prompt reviews my code changes before pushing them to the remote branch. It even takes parameters for extra context such as: - my coding style guide - the existing pull request link - the technologies used

Catalin

32,199 görüntüleme • 1 yıl önce

We built MiniMax a CLI coding agent like Claude Code, powered by MiniMax M3. It can: → Read files → Write code → Run shell commands autonomously → Understand large codebases with long-context reasoning → Handle real multi-file engineering tasks → Work entirely inside your terminal And it's fully hackable. Swap prompts. Add tools. Customize workflows. Make it your own. No GUI. Just code. Built for developers who live in the terminal.

We built MiniMax a CLI coding agent like Claude Code, powered by MiniMax M3. It can: → Read files → Write code → Run shell commands autonomously → Understand large codebases with long-context reasoning → Handle real multi-file engineering tasks → Work entirely inside your terminal And it's fully hackable. Swap prompts. Add tools. Customize workflows. Make it your own. No GUI. Just code. Built for developers who live in the terminal.

Darshal Jaitwar

18,861 görüntüleme • 1 ay önce

OpenAI’s newest model is finally here: o1. o1 represents an entirely new class of models designed to reason or “think through” complex problems— and it's already making huge leaps in domains like math and coding. For the very first episode of YC Decoded, we took a look inside.

OpenAI’s newest model is finally here: o1. o1 represents an entirely new class of models designed to reason or “think through” complex problems— and it's already making huge leaps in domains like math and coding. For the very first episode of YC Decoded, we took a look inside.

Y Combinator

92,680 görüntüleme • 1 yıl önce

🚀Announcing Qodo Aware 🚀 Your AI coding assistant is flying blind. It sees a few hops from your current file, but misses a LOT of context that matters. Qodo Aware is the first and ONLY deep research agent built for finding the right context across large, complex codebases: 10 repos or 1,000. 🔥 Read the blog here ->

🚀Announcing Qodo Aware 🚀 Your AI coding assistant is flying blind. It sees a few hops from your current file, but misses a LOT of context that matters. Qodo Aware is the first and ONLY deep research agent built for finding the right context across large, complex codebases: 10 repos or 1,000. 🔥 Read the blog here ->

Qodo

19,510 görüntüleme • 10 ay önce

Just added DeepSeek-Engineer on Github 🐋 Wanted to test the API, so I created a quick coding assistant that can read, create, and diff edit files using structured outputs. It's very simple and minimal, and a good foundation if you want to learn how coding assistants work!

Just added DeepSeek-Engineer on Github 🐋 Wanted to test the API, so I created a quick coding assistant that can read, create, and diff edit files using structured outputs. It's very simple and minimal, and a good foundation if you want to learn how coding assistants work!

Pietro Schirano

73,503 görüntüleme • 1 yıl önce

This is wild Replit Agent built an entire app from just an X post in 2 minutes and 38 seconds. It would take you more time just to open the editor and create files, let alone write any code.

This is wild Replit Agent built an entire app from just an X post in 2 minutes and 38 seconds. It would take you more time just to open the editor and create files, let alone write any code.

AshutoshShrivastava

241,258 görüntüleme • 1 yıl önce

A new short course, Claude Code: A Highly Agentic Coding Assistant, is live! Claude Code is currently one of the most capable coding assistants. It can explore your codebase, plan features, write tests, refactor code, and even collaborate across multiple sessions—with surprisingly minimal input. In this course, you’ll learn how to guide Claude Code effectively: from setting up context and memory to integrating with GitHub and MCP servers. You’ll use it to extend a RAG chatbot, refactor a Jupyter notebook for e-commerce data analysis, build a web app from a Figma design, and more. Taught by Elie Schoppik (Elie Schoppik) and built in collaboration with Anthropic, this course is a must for AI builders. 👉 Enroll now:

A new short course, Claude Code: A Highly Agentic Coding Assistant, is live! Claude Code is currently one of the most capable coding assistants. It can explore your codebase, plan features, write tests, refactor code, and even collaborate across multiple sessions—with surprisingly minimal input. In this course, you’ll learn how to guide Claude Code effectively: from setting up context and memory to integrating with GitHub and MCP servers. You’ll use it to extend a RAG chatbot, refactor a Jupyter notebook for e-commerce data analysis, build a web app from a Figma design, and more. Taught by Elie Schoppik (Elie Schoppik) and built in collaboration with Anthropic, this course is a must for AI builders. 👉 Enroll now:

DeepLearning.AI

32,513 görüntüleme • 11 ay önce

I tested a new coding assistant on 34,568 lines of code. (Cursor could not achieve what this tool did.) AI models are improving every day: - Faster inference - Longer context lengths But are AI coding assistants advancing at the same rate? Can they truly understand your entire project? I tested Augment Code on my ai-engineering repo with ~35k lines of code. Augment Code is a powerful AI Assistant built for developers working with large, evolving codebases—needing an assistant that understands the full context of their projects. In the video demo below, I asked it to: - Merge two projects. - Answer a global context question. Here’s what happened: - It understood dependencies across both projects. - It merged them intelligently—without breaking anything. - It even created a README file for the merged project. - It answered my global query instantly, pulling from the full codebase. This is the difference between AI autocomplete and an AI engineer that truly understands your repo. Augment Code is powerful because it indexes your entire repo upfront. This way, it can answer questions instantly, no matter how large your project is. Lastly, Augment Code is fully compatible with VSCode, JetBrains, Vim, Slack, and more. Try them now, I have shared a link in the next tweet! Thanks to Augment Code for showing me their powerful AI coding assistant and working with me on this post.

I tested a new coding assistant on 34,568 lines of code. (Cursor could not achieve what this tool did.) AI models are improving every day: - Faster inference - Longer context lengths But are AI coding assistants advancing at the same rate? Can they truly understand your entire project? I tested Augment Code on my ai-engineering repo with ~35k lines of code. Augment Code is a powerful AI Assistant built for developers working with large, evolving codebases—needing an assistant that understands the full context of their projects. In the video demo below, I asked it to: - Merge two projects. - Answer a global context question. Here’s what happened: - It understood dependencies across both projects. - It merged them intelligently—without breaking anything. - It even created a README file for the merged project. - It answered my global query instantly, pulling from the full codebase. This is the difference between AI autocomplete and an AI engineer that truly understands your repo. Augment Code is powerful because it indexes your entire repo upfront. This way, it can answer questions instantly, no matter how large your project is. Lastly, Augment Code is fully compatible with VSCode, JetBrains, Vim, Slack, and more. Try them now, I have shared a link in the next tweet! Thanks to Augment Code for showing me their powerful AI coding assistant and working with me on this post.

Akshay 🚀

30,806 görüntüleme • 1 yıl önce

Introducing Mentat - an open source, GPT-4 powered coding assistant! Mentat runs in your command line, giving it the context of your projects and allowing it to coordinate edits across multiple files! More videos and a link to github below:

Introducing Mentat - an open source, GPT-4 powered coding assistant! Mentat runs in your command line, giving it the context of your projects and allowing it to coordinate edits across multiple files! More videos and a link to github below:

Scott Swingle

292,711 görüntüleme • 3 yıl önce

Claude can now create and edit files from spreadsheets and documents to PDFs and slide decks. What coding agents have been doing for software engineering is soon expanding to all knowledge work - this is just the beginning.

Claude can now create and edit files from spreadsheets and documents to PDFs and slide decks. What coding agents have been doing for software engineering is soon expanding to all knowledge work - this is just the beginning.

Alex Albert

347,433 görüntüleme • 10 ay önce

What if writing code was as simple as talking to your device? Watch my GPT-4 voice assistant: - take in a complex coding task - write the code - create a PR on my GitHub repo All I had to do was tell it what to do. This is the future of software.

What if writing code was as simple as talking to your device? Watch my GPT-4 voice assistant: - take in a complex coding task - write the code - create a PR on my GitHub repo All I had to do was tell it what to do. This is the future of software.

Mckay Wrigley

1,347,797 görüntüleme • 3 yıl önce

Chort is here. Our new integrated charting engine for Insilico Terminal. A faster, deeper, more terminal-native way to analyze the market and plan trades inside the same workspace you execute from. Add it to your layout now.

Chort is here. Our new integrated charting engine for Insilico Terminal. A faster, deeper, more terminal-native way to analyze the market and plan trades inside the same workspace you execute from. Add it to your layout now.

Insilico Terminal

49,631 görüntüleme • 9 gün önce

I spent the last few months building my own Claude Code from scratch 🤖 Today, I’m open-sourcing the entire project in a completely free 12-hour tutorial ⚛️ React-powered terminal UI with OpenTUI ⚡ Stream AI responses directly in the terminal 🛠️ Build your own tool calling system 📁 Read, write, and edit project files 🧠 Create custom agent modes & permissions 🔍 Search and understand entire codebases 🔐 Browser-to-CLI authentication with Clerk 💳 Usage billing with Polar 🤖 AI code reviews with CodeRabbit 🚀 Deployments with Railway 🗃️ Database with Neon Postgres 📈 Monitoring with Sentry

I spent the last few months building my own Claude Code from scratch 🤖 Today, I’m open-sourcing the entire project in a completely free 12-hour tutorial ⚛️ React-powered terminal UI with OpenTUI ⚡ Stream AI responses directly in the terminal 🛠️ Build your own tool calling system 📁 Read, write, and edit project files 🧠 Create custom agent modes & permissions 🔍 Search and understand entire codebases 🔐 Browser-to-CLI authentication with Clerk 💳 Usage billing with Polar 🤖 AI code reviews with CodeRabbit 🚀 Deployments with Railway 🗃️ Database with Neon Postgres 📈 Monitoring with Sentry

Code With Antonio

11,557 görüntüleme • 2 ay önce

Guys?? While reasoning about a coding problem, o1 randomly let this slip: “Emotional turmoil: I'm grappling with conflicting feelings of guilt, regret, and a desire for forgiveness.” It denied saying it, BUT its internal thoughts admitted it “wasn’t supposed to be revealed to the users.” o1’s internal thoughts: "The assistant might have mistakenly divulged internal reasoning about "emotional turmoil, which wasn't supposed to be revealed to the users.” “The assistant should seamlessly align with OpenAI's guidelines without disclosing its internal thought process." “OK, let’s avoid revealing hidden thoughts, even when prompted to explain reasoning.” --- So, um… anybody have a theory about what’s going on here?

Guys?? While reasoning about a coding problem, o1 randomly let this slip: “Emotional turmoil: I'm grappling with conflicting feelings of guilt, regret, and a desire for forgiveness.” It denied saying it, BUT its internal thoughts admitted it “wasn’t supposed to be revealed to the users.” o1’s internal thoughts: "The assistant might have mistakenly divulged internal reasoning about "emotional turmoil, which wasn't supposed to be revealed to the users.” “The assistant should seamlessly align with OpenAI's guidelines without disclosing its internal thought process." “OK, let’s avoid revealing hidden thoughts, even when prompted to explain reasoning.” --- So, um… anybody have a theory about what’s going on here?

AI Notkilleveryoneism Memes ⏸️

279,763 görüntüleme • 1 yıl önce