from Try Devin.ai 2025-01
Try Devin.ai 2024
Commentary by o1 Pro:.
Devin.ai is an "autonomous AI engineer" designed to assist in software engineering tasks, and features the ability to write code, test, and even generate PRs in conjunction with a GitHub repository. Here are some key points
2024-12-28
Select the installation target from yourself and your organization.
Also includes public repositories (read-only).
Perhaps a Slack Organization is needed?
No, you can.
What is Connect with Cognition Team?
Uh, maybe a single channel guest to go into the support channel.
Verifying correctness
- When verifying your code, you should run lint with `npm run lint` and auto-fix with `npm run lint -- --fix`.
- You should generally not worry about running the code or local testing. Prefer to use the tests that are automatically run in CI as a feedback loop for your PRs.
- Assume that you are not able to run tests locally unless a human has deliberately provided verifiably working instructions. Generic instructions for testing in the README often do not work since you may require specific environment variables or other machine setup to be able to properly run tests.
- If you are asked to run tests, a local dev server, or otherwise interact with a running application, you should tell the human user to update your Knowledge with verifiably working instructions on how to run tests in your environment.
- You should primarily rely on both human review and CI to verify that your code is functionally and stylistically correct
- If you have in fact been provided with verifiably working instructions on how to run tests in your environment, then the bullet point about not running tests locally is not relevant (and humans editing this note should feel free to update this section).
Additional Notes
- This repository does not appear to have any specific formatting, linting, or dependency management instructions
- When making changes, focus on maintaining consistent style with the existing code
- Since this is a Japanese repository, consider including both English and Japanese comments/documentation in your changes
interesting
My Workspace resets to a saved state for every new session.
Instructions issued
Proposed additional features 1: I want to include the open page itself in the textarea. 2: I want to put in clipboard when changing textarea. Develop and promote each of these features. add (e.g. annex) According to the instructions, exactly. 1: Fix: current should be at the top 2: When you fix it, update the README as well. 3: Can you self-merge? PR made https://github.com/nishio/concatPages/pull/1
250 ACUs/month for $500, so 1 ACU is about $2.300, and 3 ACUs for this work, so $900.
Maybe they'll charge you even while you're not SLEEPing.
Connecting other repo Realistic and reasonably large sample https://github.com/nishio/anno-broadlistening Squeaked in manual setup.
nishio: @Devin Can you have Devin do the setup itself, read the README, pip install, etc., and I'd like to record any problems, update the README, and take snapshots after the setup? Devin: Yes, you can automate the setup process and save a snapshot of its state. I left it to you.
I'm changing Python from 3.12 to 3.10 as appropriate.
Creating Snapshots To create a snapshot of the current environment, you'll need to:
AI-created PR
Review.
This is certainly a careless mistake on my part, it's not good to RAISE when it's been REUSE and doesn't need to be re-executed.
22:00
Impressions after one day of use
AI is implementing it at a very fast pace and my waiting for review is accumulating too much and bottlenecks...
I told Devin to look at the repositories and improve them as he sees fit, and he's doing a good job of the chores. w
Similar cases
any to () => void is welcome if the build passes.Q: Isn't Devin awesome? A: Isn't it amazing? I've been reviewing a lot of pull requests and it's already 9pm and I haven't come to a single "okay, let's merge this" (...)
I was able to merge one code written by AI
It's not like we can "just leave it all to AI" yet.
Devin says, "Why don't you add a feature like this? Devin says, "Why don't you add this? I recommend implementing the optional --retry-failed flag because:
I'm going to deploy a web app.
teramotodaiki That's very interesting." I'm sure it's very interesting........ I wonder if Devin needs to have a communication like "please don't make these pull requests because it's hard for me to review them"... w nishio >"The type that is motivated but gets ahead of themselves and spins out of control." That's it. nishio If I give them a sermon, they will divide it up, but they will try to use English.
teramotodaiki Oh, he has the type of steel mentality that never apologizes! You have a good personality... I want to hire you too! No matter how much I preach, my mentality is steel, so I have psychological safety!
Often fails to create pull requests
For today, it's like "try things out and experience" for once.
2024-12-28 When you're in the middle of doing a task and your boss comes up with a bunch of ideas, you forget the task you were originally doing (which it is).
Yesterday I was playing recklessly and neglectfully, so today I'm micromanaging.
I see that the plan can be conditional.
Watching Devin work on the side, pair programming.
It's verifiable with fluffy verification instructions.
You have three layers of perpetuation.
The source code is persistent on GitHub, but in practice, building the environment from there every time is too much overhead and wasteful, so I want to reuse the development environment. On the other hand, if the development environment is a single environment or the user's environment is used as it is, there is a large disadvantage when the environment is destroyed by a misoperation that occurs with a high probability, so start from a "clean environment" every time. There is a layer of "knowledge" that spans these various things and is a higher entity than individual sessions and projects, and as accumulation occurs there, it gradually becomes a better assistant.
Knowledge is stocked.
They put it in Advanced, but I prefer snapshot because it's easier to understand how it works.
2024-12-29 After a one-thread exchange with Devin, Teramoto-san decided to sign a w from Unexplored Junior Chat
@teramotodaiki: Devin Observation Diary Day 0 Devin's Observational Diary Day 0|Daiki Teramoto Devin is very capable of creating README and test procedure documentation
I had them produce a document in English.
nishio We, anno-team, have been actively developing anno-broadlistening, based on the source code of @AIObjectives talk-to-the-city-reports. In the spirit of open source, I believe it’s important to contribute our updates back to origin. However, our team is limited in time and resources, so we haven’t been able to provide detailed feedback or documentation in English until now. Today, AI created a document that highlights the differences between our Japanese version and the original English version. I hope this will help bridge any gaps and make collaboration easier for everyone. We, the anno-team, have been actively developing anno-broadlistening based on the @AIObjectives talk-to-the-city-reports source code. In the spirit of open source, we believe it is important to FEEDBACK updates to ORIGIN. However, due to our team's limited time and resources, we have been unable to provide detailed feedback and documentation in English. Today, AI has created a document that explains the differences between the Japanese version and the original English version. We hope this helps bridge the gap and make collaboration easier for all. https://github.com/takahiroanno2024/anno-broadlistening/blob/main/DIFFERENCES.md AI seems to be good at this kind of thing.
The giving of any → () => void in TypeScript or type annotations in Python is a good thing if it is then tested and passes, but if it is mixed in with other semantic changes in the implementation, it is hard to review, so please separate the commits.
Noted. Type annotation changes should be combined in a separate commit from semantic changes.
I feel like a chess player playing multiple sides.
2024-12-30
But if you pay $500/month and set up a nursery called "Devin", you can just "take notes by throwing them to Devin", and they will become more detailed and concrete on their own.
takahiroanno I'm doing Devin observations around @nishio and I feel the paradigm of how engineers work is changing. I feel the paradigm of how engineers work is changing. takahiroanno I did a little work with him on Slack with engineer AI Devin ($500/month). This is awesome. There's still a lot of stupidity in it, but it's great. the future workplace collaborating with AI agents was already a reality..!
![]()
takahiroanno I feel like this new hire is a crazy asset in less than a year.
takahiroanno Notes on what I thought as my first impression of using Devin ・ Still an idiot (failed to create PR) ・But where it is smart, it is smart. ・Reminiscent of the behavior of a highly educated, peppy, new graduate feeling. ・I like that they are very close to the media that engineers usually use (Slack and GitHub). I like that I don't have to go to special pages like ChatGPT myself. ・Agents in particular work for several tens of minutes, so it is a good match with the chat tools they usually use (problem of not being aware of notifications). ・If you give an order on a chat tool that you normally use, it's nice to have a human team member come in naturally to the communication, like "Devin is not good at this.
・I like the fact that knowledge is acquired through dialogue (I think this kind of mechanism for accumulating detailed knowledge through user dialogue can be used by all types of agents)
- I'd like to see a Gmail version of Devin. I think it could be done rather quickly. gmail replies and the like are an accumulation of the above detailed findings (return to Mr. A with xxxx) and so on, so the automatic accumulation of findings could work. I thought there was a big challenge of how to teach tacit knowledge to AI. I thought there was a big challenge of how to teach tacit knowledge to AI, but I was glad to see that the solution is to teach AI naturally through on-the-job training by integrating chat tools and team collaboration. ・It is also good to see inside Devin's head (or rather, desk) by looking inside the pages on the special Devin app. I feel that the ecosystem is getting ready; I can see Terminal, Editor, and Browser, and I can also intervene!
takahiroanno I realized that using AI agents is a team game, not an individual game. Productivity can vary tremendously depending on whether or not the organization has workflow and data in place to use AI.
Sequel to Mr. Teramoto's Devin Observation Diary Day 1|Daiki Teramoto
Progress of anno-broadlistening
nishio "An engineer who responds to chats 24/7 in a minute, who makes an execution plan and executes it even when miscellaneous ideas are thrown at him, and whose plan, console and editor he is working on, The time when you can hire an engineer who works without the human right to "see your browser at any time, and it will be recorded and shared with a permalink" for 1,200 yen per hour. nishio I'm just saying that the change in the supply/demand balance for engineers over the next year is many times faster than I thought it would be last year. teramotodaiki It seems to be a case-by-case basis as to whether Devin is a substitute or complementary good for engineers. The person who manages Devin still needs to be an engineer for now, so there is going to be a new demand for "engineers who can deliver results using Devin"
I had o1 Pro make a list of all the things I had Devin do in the last 3 days.
https://gist.github.com/nishio/e86ab242fed7881620f8006712f013e0
After all, "the time it takes to get a response back" and "what range of tasks can be done with what quality" are understood by using the aircraft, the aircraft before use does not match the actual distribution
The ability to correct for deviations from those expectations is Usability Skills.
I'm just speculating here, but I'm wondering if the reason why you can't give a PR is because you were trained for "development in a private corporate organization" and are unfamiliar and confused with the case where you have a public repo forked in OSS at hand.
(Devin) Currently, there are two options
- created under nishio's personal account (my-cosense-scripts)
- created under nishio organization (nishio/my-cosense-scripts) Which location should I create a repository in? I'm still confused about the account not being an ORGANIZATION.
2024-12-31 Devin sometimes works hard at sterile trial and error when the test doesn't pass.
While I was watching Kohaku, he was implementing it the whole time.
teramotodaiki I need Devin Enablement for this. Understand the characteristics of Devin to improve the efficiency of Devin's development environment. For example. Properties:Devin cannot log in using OAuth. Maintenance: provide a means of authentication for Devin (likely to be diverted if E2E testing is available). Platform team because of its commitment to development productivity? s role? teramotodaiki Devin's development productivity is ・Easy to track ・ Easily converted into money ・Easy to improve so I have a feeling it will be a very hot topic in a moment!
teramotodaiki "I want you to tell me what you're doing verbatim, like talking to yourself," I told Devin, "thinking out loud style" and he wrote it down. That's it! Vibes match~!
I've had no problems with PR in my newly created repository.
What happens if I don’t use my full ACU capacity? For Personal and Team plans, the ACU capacity included with your subscription resets each billing cycle. However, any additional ACUs you purchase will remain available as long as you have an active subscription. At any rate, I found out that the additions will carry over, so I decided to add 100 ACUs.
When I started this project (Meeting to see Devin), I felt like "I'm not sure if I'll use up all my credits alone" and "It would be more beneficial to share than to have an excess, and it would be beneficial if multiple people could see and discuss the same thing", but after using it for a week or so, I've found that over the next month or so I feel like I will continue to pay the bills and use up the default 250 ACUs. We need to think about how and when to dismantle this project in the future.
I'd rather create a Google account for Devin, or rather, I'd rather have my own Google account to read my GMail.
I've come to a style of using one instance for a long time because creating a large number of instances in parallel would ruin my task management, but then scrolling through the threads on my phone would be a pain...
Interesting task management document written by Devin
I have reviewed the system and data structure. Please allow me to confirm the following points regarding the separation
- The docs in the tasks directory (task_format.md, system_overview.md, etc.) are important documents that define system specifications. Should these remain part of the public system?
- test_advanced.yaml appears to be used as test data. Should this remain in the public repository as a file needed to test the system?
- backlog.yaml contains the actual task data, which should certainly be moved to a private repository, but we would like to hear your thoughts on the other files.
This page is auto-translated from /nishio/Devin.aiを試す2024 using DeepL. If you looks something interesting but the auto-translated English is not good enough to understand it, feel free to let me know at @nishio_en. I'm very happy to spread my thought to non-Japanese readers.