Loading video...

Video Failed to Load

Go Home

1/4 Devin can learn how to use unfamiliar technologies.

741,966 views • 2 years ago •via X (Twitter)

7 Comments

Cognition's profile picture
Cognition2 years ago

Today we're excited to introduce Devin, the first AI software engineer. Devin is the new state-of-the-art on the SWE-Bench coding benchmark, has successfully passed practical engineering interviews from leading AI companies, and has even completed real jobs on Upwork. Devin is an autonomous agent that solves engineering tasks through the use of its own shell, code editor, and web browser. When evaluated on the SWE-Bench benchmark, which asks an AI to resolve GitHub issues found in real-world open-source projects, Devin correctly resolves 13.86% of the issues unassisted, far exceeding the previous state-of-the-art model performance of 1.96% unassisted and 4.80% assisted. Check out what Devin can do in the thread below.

Cognition's profile picture
Cognition2 years ago

2/4 Devin can contribute to mature production repositories.

Cognition's profile picture
Cognition2 years ago

3/4 Devin can train and fine tune its own AI models.

Cognition's profile picture
Cognition2 years ago

4/4 We even tried giving Devin real jobs on Upwork and it could do those too!

Cognition's profile picture
Cognition2 years ago

For more details on Devin, check out our blog post here: See Devin in action If you have any project ideas, drop them below and we'll forward them to Devin.

Cognition's profile picture
Cognition2 years ago

We'd like to thank all our supporters who have helped us get to where we are today, including @patrickc, @collision, @eladgil, @saranormous, Chris Re, @eglyman, @karimatiyeh, @bernhardsson, @t_xu, @FEhrsam, @foundersfund, and many more. If you’re excited to solve some of the world’s biggest problems and build AI that can reason, learn more about our team and apply to join us here.

swax's profile picture
swax2 years ago

Cool stuff. Just tried out the same test -

Related Videos