Video yükleniyor...

Video Yüklenemedi

Ana Sayfaya Dön

SWE-Agent is an open-source software engineering agent with a 12.3% resolve rate on SWE-Bench! Check out SWE-agent in action at Repo:

138,703 görüntüleme • 2 yıl önce •via X (Twitter)

10 Yorum

carlos profil fotoğrafı
carlos2 yıl önce

The SWE-agent open-source repository provides a framework for turning general LMs into software engineering agents. SWE-agent lets LMs like GPT-4 interact with their own Docker container using an Agent Computer Interface (ACI) - allowing it to browse, search, edit, and run code.

carlos profil fotoğrafı
carlos2 yıl önce

It’s been amazing to work on this with such a great team: @jyangballin*, @_carlosejimenez*, @_awettig, @ShunyuYao12, @karthik_r_n, and @OfirPress Keep an eye out for the paper coming out April 10th!

MindBranches profil fotoğrafı
MindBranches2 yıl önce

Here is a overview of the new open source Software Engineering Agent (SWE-Agent):

Elman Mansimov profil fotoğrafı
Elman Mansimov2 yıl önce

@OfirPress Very cool!

Eddie Forson profil fotoğrafı
Eddie Forson2 yıl önce

Nice work! Already starred the repo. Will take a look at the code in the next few days 👌🏿

Harry Tormey 🇮🇪 | 🇺🇸| 🇺🇦 profil fotoğrafı
Harry Tormey 🇮🇪 | 🇺🇸| 🇺🇦2 yıl önce

Amazing work with SWE-Bench and now this! Thanks for publishing your research and the source to this!.

Konisberg Heinrich profil fotoğrafı
Konisberg Heinrich2 yıl önce

Incredible!

Gopinathan A profil fotoğrafı
Gopinathan A2 yıl önce

Nice work!!

Hadi profil fotoğrafı
Hadi2 yıl önce

Does it just work with anthropic and openai models right now?

Owen Campbell-Moore ✪ profil fotoğrafı
Owen Campbell-Moore ✪2 yıl önce

(OpenAI PM here!) Super cool! I'm curious, what changes to our models or APIs would have made this easier to build or would make it work better? 🤔

Benzer Videolar

🚀New Amazon Q Developer agent for software development is available to customers: This agent is based on a new agent architecture that has exciting results coming from the SWE-bench scores (on the full and verified benchmarks) representing AI models’ ability to resolve real-world coding problems. Interesting aspect of Q Agent is that with these newest updates, Q drove nearly 50% more successful coding tasks completed. What makes Q Dev Agent remarkable? The agent architecture is not just about using the best LLMs (which we do), but also giving the agent the ability to constantly explore multiple paths to find the best way to resolve a particular problem (and back tracking when it has reached dead end like a developer would do). Needless to say, we are just getting started on the developer agent and we are constantly pushing to advance our AI capabilities while maintaining quality, security, privacy, and reliability to keep Amazon Q Developer an innovative and trusted option available to our customers using agents for software development. We highlighted the results of our first SWE-bench submission of Amazon Q Developer back in June blog post; with these updates, our new agent resolves 51% more coding tasks than its previous iteration on the SWE-bench verified dataset, and 43% more on the full dataset. That’s the difference a few months make, and I can’t wait to share what our teams will deliver at re:Invent this December. Here's a quick demo showcasing our new Agent in action:

Swami Sivasubramanian

28,946 görüntüleme • 1 yıl önce