Video wird geladen...

Video konnte nicht geladen werden

Zur Startseite

We've reached a small but exciting milestone for OpenDevin, the open source AI software engineer -- OpenDevin sends a pull request to the OpenDevin repo. You can see the PR here:

53,069 Aufrufe • vor 2 Jahren •via X (Twitter)

11 Kommentare

Profilbild von Graham Neubig
Graham Neubigvor 2 Jahren

To be clear on the level of autonomy, I sent a prompt with the URL of the issue to resolve. OpenDevin browsed to the issue, cloned the repo, fixed the issue, and pushed to GitHub. Then I copy-pasted the PR URL, checked the work, wrote the PR description, and hit submit.

Profilbild von Graham Neubig
Graham Neubigvor 2 Jahren

This is exciting because it demonstrates the potential for an open source agent to do end-to-end software dev tasks, but we still have a lot of work to do: we are almost done setting up benchmarking, and based on this we will bring agents implemented in our agent hub up to SOTA.

Profilbild von Graham Neubig
Graham Neubigvor 2 Jahren

I strongly believe that AI agents are going to be the future, and the future should be built by everyone! If that resonates with you please come join us on github, slack, or discord:

Profilbild von Graham Neubig
Graham Neubigvor 2 Jahren

We are trying to build a community where AI researchers, software developers, and even users without as much development experience would be able to join and contribute!

Profilbild von Graham Neubig
Graham Neubigvor 2 Jahren

And if you would like to see some things we have planned for the future, we have our open road map here: If there is anything else you would like to see on there just send us a message!

Profilbild von Rainmaker
Rainmakervor 2 Jahren

Here I share an XGBoost model that delivers a 25% CAGR with minimal drawdown on Visa stock. In this free Substack post I share code and commentary for a powerful Machine Learning strategy that delivers powerful returns.

Profilbild von DF
DFvor 2 Jahren

Did you double checked the changes it did to the code? Because I can see this as the starting plot of a dystopian movie.

Profilbild von Graham Neubig
Graham Neubigvor 2 Jahren

Yes! If you see the last part of the video, and also my explanation in the following tweet, I did check the code and press submit myself. In addition, we're doing standard peer review for OpenDevin, so another developer needs to check as well.

Profilbild von Binfeng Xu
Binfeng Xuvor 2 Jahren

The mysterious gpt2-chatbot can really replace software developers at large scale. Petrified after testing

Profilbild von nlp research
nlp researchvor 2 Jahren

I'm wondering why you have not benchmarked SWE Bench? Is Devin not similar to SWE Agent?

Profilbild von Graham Neubig
Graham Neubigvor 2 Jahren

So the main reason is that we had some trouble getting swe-bench to work properly and it took us a while to set up. But as of today we're mostly finished and ready to benchmark!

Ähnliche Videos