正在加载视频...

视频加载失败

SWE-Agent is an open-source software engineering agent with a 12.3% resolve rate on SWE-Bench! Check out SWE-agent in action at Repo:

138,703 次观看 • 2 年前 •via X (Twitter)

10 条评论

carlos 的头像
carlos2 年前

The SWE-agent open-source repository provides a framework for turning general LMs into software engineering agents. SWE-agent lets LMs like GPT-4 interact with their own Docker container using an Agent Computer Interface (ACI) - allowing it to browse, search, edit, and run code.

carlos 的头像
carlos2 年前

It’s been amazing to work on this with such a great team: @jyangballin*, @_carlosejimenez*, @_awettig, @ShunyuYao12, @karthik_r_n, and @OfirPress Keep an eye out for the paper coming out April 10th!

MindBranches 的头像
MindBranches2 年前

Here is a overview of the new open source Software Engineering Agent (SWE-Agent):

Elman Mansimov 的头像
Elman Mansimov2 年前

@OfirPress Very cool!

Eddie Forson 的头像
Eddie Forson2 年前

Nice work! Already starred the repo. Will take a look at the code in the next few days 👌🏿

Harry Tormey 🇮🇪 | 🇺🇸| 🇺🇦 的头像
Harry Tormey 🇮🇪 | 🇺🇸| 🇺🇦2 年前

Amazing work with SWE-Bench and now this! Thanks for publishing your research and the source to this!.

Konisberg Heinrich 的头像
Konisberg Heinrich2 年前

Incredible!

Gopinathan A 的头像
Gopinathan A2 年前

Nice work!!

Hadi 的头像
Hadi2 年前

Does it just work with anthropic and openai models right now?

Owen Campbell-Moore ✪ 的头像
Owen Campbell-Moore ✪2 年前

(OpenAI PM here!) Super cool! I'm curious, what changes to our models or APIs would have made this easier to build or would make it work better? 🤔

相关视频

🚀New Amazon Q Developer agent for software development is available to customers: This agent is based on a new agent architecture that has exciting results coming from the SWE-bench scores (on the full and verified benchmarks) representing AI models’ ability to resolve real-world coding problems. Interesting aspect of Q Agent is that with these newest updates, Q drove nearly 50% more successful coding tasks completed. What makes Q Dev Agent remarkable? The agent architecture is not just about using the best LLMs (which we do), but also giving the agent the ability to constantly explore multiple paths to find the best way to resolve a particular problem (and back tracking when it has reached dead end like a developer would do). Needless to say, we are just getting started on the developer agent and we are constantly pushing to advance our AI capabilities while maintaining quality, security, privacy, and reliability to keep Amazon Q Developer an innovative and trusted option available to our customers using agents for software development. We highlighted the results of our first SWE-bench submission of Amazon Q Developer back in June blog post; with these updates, our new agent resolves 51% more coding tasks than its previous iteration on the SWE-bench verified dataset, and 43% more on the full dataset. That’s the difference a few months make, and I can’t wait to share what our teams will deliver at re:Invent this December. Here's a quick demo showcasing our new Agent in action:

Swami Sivasubramanian

28,946 次观看 • 1 年前