Deeds675 hours ago
To be honest, the official superpowers/brainstorming skill already does TDD so well, I don't see that much of a need for this. TDD is definitely the way to go with agentic development.
synergy2015 minutes ago
how?i saw superpowers/brainstorming but never saw tdd code produced
shruubi7 hours ago
Two questions
1) Do you not feel self-conscious or weird about calling this "EvanFlow"? Seems like a lot of people these days are naming their AI tools/skills/whatever after themselves which seems self-absorbed. Either that or they hope that if their thing takes off like OpenClaw did then they'll grab the fame that comes along with it.
2) Why does your TDD flow miss the refactor step of TDD?
toyg4 hours ago
I initially thought it was a pun on Pearl Jam's classic "Even Flow", then I read your comment and noticed the username... Sad.
wenc6 hours ago
I feel like 1 is a self correcting problem. If this goes nowhere it will soon be forgotten.
I can think of one example that did go somewhere: Linux.
EvanKnowles2 hours ago
Feels like a bonus to me.
globular-toastan hour ago
Linus did not name it Linux himself: https://en.wikipedia.org/wiki/Linux#Naming
normie30006 hours ago
Ref 1, he should have called it Daughter.
reitzensteinm6 hours ago
No Code, surely?
ButlerianJihad3 hours ago
"Evenflo is a hundred year old infant feeding brand." Probably named to market its baby bottles and accessories.
Everybody who grew up to listen to Pearl Jam had seen or used an Evenflo pacifier, baby bottle, or car seat. That's one reason the song already sounded so familiar.
s20n7 hours ago
EvanFlow - thoughts arrive like butterflies?
sbseitz7 hours ago
Oh, he don't know, so he chases them away
jamesbfb7 hours ago
Oooohhhh
ge965 hours ago
Seeeethinnggg tests failing not complete... again
__mharrison__5 hours ago
Someday soon he'll begin his life again
[deleted]7 hours agocollapsed
[deleted]11 minutes agocollapsed
dmitry_dvan hour ago
The refactor step is the silent casualty in AI-assisted TDD. Once the test is green, Claude optimizes for moving to the next test, not for cleaning up the impl that just passed. An "iterate-until-clean" pass at the end is a different thing: you're refactoring cold code, not refactoring with a freshly-written test as the safety net.
evanklem2004op9 hours ago
Built this as an opinionated Claude Code development flow based on evidence based practices and what has been working for me while developing professional code.
EvanFlow is a single TDD-driven loop. Say "let's evanflow this" and it walks brainstorm → plan → execute → tdd → iterate → STOP. Real checkpoints at design and plan approval. Never auto-commits, never auto-stages, never proposes integration - every git op is your call.
The three things that actually changed how I work:
1. Vertical-slice TDD. One failing test → minimal impl → next test. Watch each test fail before writing the impl that passes it. (Sounds obvious. Almost no agent does it by default. ~62% of LLM-generated test assertions are wrong per HumanEval research, so testing TDD discipline matters more than the impl discipline.)
2. Embedded grilling at decision points. Before locking a plan: what breaks if a user does X? What's the rollback? What's explicitly out of scope? Catches design flaws while they're still cheap.
3. Iterate-until-clean (hard cap of 5 rounds). Re-read the diff against dead code, naming, the deletion test, assertion correctness, and a Five Failure Modes pass (hallucinated actions, scope creep, cascading errors, context loss, tool misuse). For UI: screenshot via headless Chromium.
For bigger plans with 3+ independent units sharing types, it forks into a parallel coder/overseer orchestration. Integration tests at touchpoints ARE the cohesion contract.
Three install paths: Claude Code plugin marketplace, npx skills add, manual copy. MIT.
girvo5 hours ago
Please don’t post AI generated comments :(
Just write it yourself. I promise it’s worth it
deauxan hour ago
He's even being cheeky by intentionally replacing the em-dash by a regular dash, haha
dpark5 hours ago
I’ve thought of going down the TDD model for LLMs as a way of providing constraints on their behavior. I would think that “vertical slice” TDD would encourage the LLM to start tailoring the tests to the implementation rather than establishing the invariants up front, though. I was considering “horizontal” TDD to force the agent to implement constraints before coding to them.
alex1sa2 hours ago
[dead]
lukewrites5 hours ago
Curious, In the repo you mention
> Several rules come from 2025-2026 industry research on agentic coding failure modes
What are some of the papers you read?
esperent4 hours ago
With no disrespect intended because this is also how I would do it (but I wouldn't publish and name it after myself!) - they didn't read the research. They had the AI that actually created this do that for them.
esperent4 hours ago
> execute → tdd
How are these separate steps?
TDD is how you execute, not something you tack on afterwards.
nghnam3 hours ago
superpowers/brainstorming is doing TDD as well.
here2learnstuff5 hours ago
Not bad, but also, forgive how mean this is going to come across: not using a product from someone who just started their undergrad.
xaxfixho2 hours ago
i'm new around here, how do i *DOWN VOTE* stuff?
fragmede4 hours ago
Linus started Linux when he was 21, an undergrad at the University of Helsinki. You're entirely welcome to use whatever filtering function for products you use, but it doesn't seem like soley using this particular product's creator's age as a disqualifier comes from a place of sound reasoning, to me.
avyjit4 hours ago
This is such a BS take. If you feel the product is immature or not great - that's valid criticism. This is not
sdevonoes3 hours ago
TDD in 2026? Besides, TDDs main benefit is to come up with a decent architecture for your system… LLMs can already do that if instructed. I don’t see the point of TDD
jtfrench8 hours ago
How does this handle “dumb zone” evasion while looping?
cratermoon6 hours ago
enesz2 hours ago
[dead]
tommy29tmar4 hours ago
[dead]
youwangd4 hours ago
[dead]
jonahs1976 hours ago
[dead]
marsven_4226 hours ago
[dead]