Show HN: Workflow Use – Deterministic, self-healing browser automation (RPA 2.0)

(github.com)

61 points | by gregpr07 16 hours ago

13 comments

casslin 3 hours ago
If any one wanna try it in browser can try implement it on our chrome extension code base in typescript: https://github.com/nanobrowser/nanobrowser, we support browser use and re wrote its code in typescript, not yet supporting workflow use and love to hear how it works from community!
deepdarkforest 14 hours ago
Very cool. 1) How do you deal with timings? If a step includes clicking on a link or something that needs loading, then if you just fire off the generated playwright code at at once, some steps might fail because the xpath is not there yet. So to be safe, i'm guessing you would have to wait using the difference in the timestamps in the json. 2. For self healing, we worked on something similar, and we found it's very easy to get off the rails if one step fails because if your assertions of if the fix was correct are off, then the next and next steps will also fail etc. The most stable way was to just regenerate all steps if a step fails in 2 attempts (2 runs of the flow) consecutively. If the xpath is broken for a step, very likely the subsequent ones won't be worth healing individually.
[-]
- gregpr07 14 hours ago
  1) we made this sick function in browser use library which analyses when there are no more requests going through - so we just reuse that!
  2) yeah good question. The end goal is to completely regenerate the flow if it breaks (let browser use explore the “new” website and update the original flow). But let’s see, soo much could be done here!
  What did you work on btw?
  [-]
  - deepdarkforest 14 hours ago
    1. Oh yes right. I remember trying it out thinking it was going to be brittle because of analytics etc but it filters for those surprisingly well.
    2. We are working on https://www.launchskylight.com/ , agentic QA. For the self onboarding version we are using pure CUA without caching. (We wanted to avoid playwright to make it more flexible for canvas+iframe based apps,where we found HTML based approaches like browser-use limited, and to support desktop apps in the future).
    We are betaing caching internally for customers, and releasing it for the self-onboarding soon. We use CUA actions for caching instead of playwright. Caching with pixel native models is def a bit more brittle for clicks and we focus on purely vision based analysis to decide to proceed or not. I think for scaling though you are 100% right, screenshots every step for validating are okay/worth it, but running an agent non-deterministically for actions is def an overkill for enterprise, that was what we found as well.
    Geminis video understanding is also an interesting way to analyze what went wrong in more interactive apps. Apart from that i think we share quite a bit of the core thinking, would be interested to chat, will DM!
Stiopa 7 hours ago
Haha was just thinking last week there should be a tool called “show don’t tell” that infers a routine from recording, great minds think alike :)) Awesome feature guys, looking forward to playing around!
vasusen 12 hours ago
Very cool evolution!
Really great to see the fallback to the agentic run when the automation breaks. For our e2e testing browser automation at Donobu, we independently arrived at the same pattern and have been impressed with how well it works. Automatic self-healed PR example here: https://github.com/donobu-inc/playwright-flows/pull/6/files
edit: typo
ListeningPie 6 hours ago
We use selenium for RPAs. Saving this as an alternative to explore.
petethomas 13 hours ago
It's not mentioned here but Kapwork contributed the beginnings of this work in a PR a couple weeks ago: https://github.com/browser-use/browser-use/pull/1437. Thank you Gregor & Magnus for the great tech and all you're doing for the community.
cdolan 16 hours ago
This is amazing. We've been using BrowserUser to try and create deterministic playwright scripts for months with mixed results.
So, so, so excited to see this
crazymoka 13 hours ago
So I can use this and it will pull new data from a database I can use to have it fill out a form? And I can trigger it when I need it to run with the updated form information?
ProofHouse 10 hours ago
Can utilizing Chrome Extensions be added? It seems no one has and would be a critical bridge to many browser tasks
[-]
- casslin 3 hours ago
  we built a chrome extension supporting browser use: https://github.com/nanobrowser/nanobrowser, feel free to implement workflow use on top of our code base in typescript. Love to learn how it works!
- nico 10 hours ago
  Yes. Also, would love something that can run directly in my browser with my sessions
  There’s a lot of websites that are super hostile to automation and make it really hard to do simple, small, but repetitive stuff with things like playwright, selenium, chromedriver
  [-]
  - casslin 3 hours ago
    we built a chrome extension supporting browser use and can run locally in ur browser: https://github.com/nanobrowser/nanobrowser, feel free to implement workflow use on top of our code base in typescript. Love to learn how it works!
rammy1234 12 hours ago
seems similar to selenium plugin for firefox, minus the scripting it generates.
pzullo 16 hours ago
Cool stuff!
Richardson143 6 hours ago
[dead]
gitroom 14 hours ago
[dead]