Video wird geladen...

Video konnte nicht geladen werden

Zur Startseite

SWE-Agent is an open-source software engineering agent with a 12.3% resolve rate on SWE-Bench! Check out SWE-agent in action at Repo:

138,703 Aufrufe • vor 2 Jahren •via X (Twitter)

10 Kommentare

Profilbild von carlos
carlosvor 2 Jahren

The SWE-agent open-source repository provides a framework for turning general LMs into software engineering agents. SWE-agent lets LMs like GPT-4 interact with their own Docker container using an Agent Computer Interface (ACI) - allowing it to browse, search, edit, and run code.

Profilbild von carlos
carlosvor 2 Jahren

It’s been amazing to work on this with such a great team: @jyangballin*, @_carlosejimenez*, @_awettig, @ShunyuYao12, @karthik_r_n, and @OfirPress Keep an eye out for the paper coming out April 10th!

Profilbild von MindBranches
MindBranchesvor 2 Jahren

Here is a overview of the new open source Software Engineering Agent (SWE-Agent):

Profilbild von Elman Mansimov
Elman Mansimovvor 2 Jahren

@OfirPress Very cool!

Profilbild von Eddie Forson
Eddie Forsonvor 2 Jahren

Nice work! Already starred the repo. Will take a look at the code in the next few days 👌🏿

Profilbild von Harry Tormey 🇮🇪 | 🇺🇸| 🇺🇦
Harry Tormey 🇮🇪 | 🇺🇸| 🇺🇦vor 2 Jahren

Amazing work with SWE-Bench and now this! Thanks for publishing your research and the source to this!.

Profilbild von Konisberg Heinrich
Konisberg Heinrichvor 2 Jahren

Incredible!

Profilbild von Gopinathan A
Gopinathan Avor 2 Jahren

Nice work!!

Profilbild von Hadi
Hadivor 2 Jahren

Does it just work with anthropic and openai models right now?

Profilbild von Owen Campbell-Moore ✪
Owen Campbell-Moore ✪vor 2 Jahren

(OpenAI PM here!) Super cool! I'm curious, what changes to our models or APIs would have made this easier to build or would make it work better? 🤔

Ähnliche Videos

🚀New Amazon Q Developer agent for software development is available to customers: This agent is based on a new agent architecture that has exciting results coming from the SWE-bench scores (on the full and verified benchmarks) representing AI models’ ability to resolve real-world coding problems. Interesting aspect of Q Agent is that with these newest updates, Q drove nearly 50% more successful coding tasks completed. What makes Q Dev Agent remarkable? The agent architecture is not just about using the best LLMs (which we do), but also giving the agent the ability to constantly explore multiple paths to find the best way to resolve a particular problem (and back tracking when it has reached dead end like a developer would do). Needless to say, we are just getting started on the developer agent and we are constantly pushing to advance our AI capabilities while maintaining quality, security, privacy, and reliability to keep Amazon Q Developer an innovative and trusted option available to our customers using agents for software development. We highlighted the results of our first SWE-bench submission of Amazon Q Developer back in June blog post; with these updates, our new agent resolves 51% more coding tasks than its previous iteration on the SWE-bench verified dataset, and 43% more on the full dataset. That’s the difference a few months make, and I can’t wait to share what our teams will deliver at re:Invent this December. Here's a quick demo showcasing our new Agent in action:

Swami Sivasubramanian

28,946 Aufrufe • vor 1 Jahr