Hear your agent suffer through your code

(github.com)

54 points | by AndrewVos 2 hours ago

7 comments

fredley 1 hour ago
I need a version of this which swears loudly when an assumption it made turns out to be wrong, with the volume/passion/verbosity correlated with how many tokens it's burned on the incorrect approach.
[-]
- shivaniShimpi_ 58 minutes ago
  i didnt realize i needed the volume scaling with tokens burned as much as i do now xD imagine the screaming when it confidently refactors something for 40k tokens and then finds out the thing it deleted was load bearing
  [-]
  - vasco 1 minute ago
    I have general reviewer named Feynman with his personality that shits on anything other agents do and sends it back before it hits me and it sounds perfect to include some sound bites from YouTube clips. Great idea!!
  - ben30 8 minutes ago
    I have in my agents file “Chesterton’s fence” as pointer to think carefully before you remove something
AndreVitorio 1 hour ago
This desperately needs a demo video in the repo.
[-]
- shivaniShimpi_ 57 minutes ago
  hear hear!!!
8-prime 1 hour ago
Does this actually relate to the code quality being observed by the agent? The readme isn't very clear on that IMO. I have some projects I'd love to try this out on, but only if I am to get an accurate representation of the LLMs suffering.
[-]
- CWwdcdk7h 1 hour ago
  https://github.com/AndrewVos/endless-toil/blob/main/plugins/...
  So it is left up to agent to decide.
  [-]
  - Ghoelian 1 hour ago
    The agent is instructed to execute this Python script: https://github.com/AndrewVos/endless-toil/blob/main/plugins/...
    So looks like it's mainly looking for FIXME/TODO etc comments, deep nesting, large files, broad catches, stuff like that.
- AndrewVos 1 hour ago
  I'm very open to suggestions, but currently it's a very simple scan of the code. Check the python scripts.
esperent 1 hour ago
I tried it but all I hear is a choir of angels, is it broken?
[-]
- medwezys 1 hour ago
  I guess you’re working on a greenfield project?
- AndrewVos 1 hour ago
  Actually, that's not a bad idea!
greg_dc 1 hour ago
Honestly, I don't care about Opus 4.7. This is the true evolution of agentic coding.
coldcity_again 1 hour ago
I really want this! Any chance of a Cursor version?
[-]
- AndrewVos 1 hour ago
  I just added a cursor plugin to the repo, let me know how it goes!
AndrewVos 1 hour ago
[dead]