The Tragedy of Given-When-Then

119 points by wheresvic1, about 6 years ago

22 comments

pjc50, about 6 years ago

Whereas:

> The reduction of the tester to an expert translator/typist is a tragedy

> The Given-When-Then detracts from understanding and readability but provides the much prized automation through tools like Cucumber

> Instead of abstract domain models in Rational Rose

Now I see how we got into this situation. All this misery comes from the people who understand the code and the people who understand the business being different, non-intersecting groups of people.

Since the BAs can't read the code, there is a desire to construct a technical representation that they can read, representing some kind of formal specification, and then derive as much as possible of the program from that. This was the promise of Rational Rose and the whole UML project. Had it been successful, it would have reduced programmers to stenographers, "coders" in a very limited sense. As it is, in a "Rational" system of this kind, coders end up producing increasingly elaborate polyfills between the machine-generated parts of the system and the rest.

The target of this is always the "5GL" dream:

    - specify program in English
    - ???
    - automated tools transform to code

... without programmers involved. Unfortunately we've not managed to reduce the irreducible complexity of the step in the middle.

GWT does this de-skilling to testers instead; it's effectively TDD with tests in this quasi-human-readable format that gets unimaginatively translated into programs by the testers.

The proposed solution is to do more communication through spreadsheet-prototypes. Since these are effectively programs in Excel, with all the real programming capabilities thereof, you get a real working model of the system that actually behaves like a program.

Many businesses simplify this process by then just deploying the spreadsheet to production.

(I'm not familiar with Cucumber, but it looks like a thing for constructing toy "human readable" DSLs for tests? I wonder if we could get BAs to learn INFORM?)

joshwa, about 6 years ago

The article seems to assume that it is, in fact, possible to completely and correctly specify a system, and not only that, do so ahead of development.

This is a pipe dream in all but a minuscule slice of software projects.

GWT/cucumber is as decent a tool as any for creating automated tests of system actions (and especially interactions) in a mostly-human-readable format that is likely to be understood by BAs, testers, and devs, even if not all of them are expected to be able to write them.

I've used them across many projects with great success, with the understanding that it's not intended to replace unit tests for calculations, nor stories for initial specification.

As endian says downthread [0]: "G/W/T isn't the solution to domain understanding, conversations are. You should be continually communicating between all stakeholders to maintain a current domain understanding."

[0] https://news.ycombinator.com/item?id=19674013

pytester, about 6 years ago

> Although Given-When-Then is a fantastic way to describe interactions, state and behaviour, it is a lousy way to describe data and calculations.

I follow a slightly different technique that I think makes given/when/then also a useful tool for describing data and calculations.

1) Write the test as follows, using deliberately simplified but still realistic data (this is crucial):

> Given <a set of market data and trades in a table>
> When <arbitrary event such as calculation is performed>
> Then <leave blank>

2) Write the code that outputs the data/calculation.

3) Run the test in a "rewrite" mode that fills in the results of Then based upon actual output (this process is somewhat similar to golden master).

4) You now have a passing test and generated data which you can eyeball to see if it is in line with what you would expect. This test with recorded output can then be shown to the PO (or whoever) and committed to source control and used for regression testing.

This obviously isn't possible with cucumber or other gherkiny tools and it does require your processes to be fully deterministic (a laudable goal anyway), but it works pretty well IMHO.
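
A minimal sketch of the "rewrite" mode workflow described in this comment, in Python. Everything named here is an assumption made for illustration: the scenario file layout (a JSON object with `given`, `when`, and `then` keys), the `net_positions` calculation, and the `check_scenario` helper are hypothetical and not taken from any particular tool.

```python
import json
import tempfile
from pathlib import Path


def net_positions(trades):
    """Hypothetical calculation under test: net quantity per symbol."""
    positions = {}
    for trade in trades:
        positions[trade["symbol"]] = positions.get(trade["symbol"], 0) + trade["qty"]
    return positions


def check_scenario(path, calculate, rewrite=False):
    """Run one scenario file and return True if it passes.

    rewrite=True records the actual output as the new "then" section.
    rewrite=False compares the recorded "then" with the actual output.
    """
    path = Path(path)
    scenario = json.loads(path.read_text())
    actual = calculate(scenario["given"])
    if rewrite:
        scenario["then"] = actual
        path.write_text(json.dumps(scenario, indent=2, sort_keys=True))
        return True
    return scenario.get("then") == actual


if __name__ == "__main__":
    scenario = {
        "given": [
            {"symbol": "ABC", "qty": 100},
            {"symbol": "ABC", "qty": -40},
            {"symbol": "XYZ", "qty": 5},
        ],
        "when": "net positions are calculated",
        "then": None,  # left blank on purpose, as in step 1
    }
    with tempfile.TemporaryDirectory() as tmp:
        path = Path(tmp) / "net_positions.json"
        path.write_text(json.dumps(scenario))
        print(check_scenario(path, net_positions))                # blank "then" does not match
        print(check_scenario(path, net_positions, rewrite=True))  # records the actual output
        print(check_scenario(path, net_positions))                # recorded output now matches
        print(path.read_text())                                   # the data to eyeball and commit
```

The sorted, indented JSON is a design choice rather than a requirement: it keeps the recorded file stable between runs, so a regression shows up as a small diff in source control. As the comment notes, this only works if the calculation is deterministic.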

endiangroup, about 6 years ago

Act 1 - G/W/T can be used to express iteratively aspects of an algorithm such that you can derive it without knowing or fully understanding it (algorithm triangulation: think GPS where each satellite is a constraint and you derive through iteration of each passing scenario the general algorithm).

Act 2 - is really about process rather than G/W/T (which is really just AAA: arrange, act & assert).

Act 3 - again process, G/W/T isn't the solution to domain understanding, conversations are. You should be continually communicating between all stakeholders to maintain a current domain understanding.

We wrote an article recently on the limits of BDD [1]. G/W/T didn't really come up; there are other more glaring issues with BDD when it comes to systems that intersect mismatched understandings of the real world between experts and users. Unrealistic wants and goals are killer. Additionally, we started writing a tool called SpecStack [2] to attach metadata to scenarios (G/W/T) so you can capture technical details about things.

[1] https://endian.io/articles/limits-of-bdd/
[2] https://github.com/endiangroup/specstack

ryanmarsh, about 6 years ago

G/W/T is great at capturing the context/action/outcome of a test scenario in English. It isn't great for all cases though. Furthermore, Gherkin could be updated to allow more flexibility.

Having a structured format for tests/requirements, in English (et al.), can be incredibly helpful. I would love to see some innovation around helping programmers and non-programmers reach a shared understanding of what the system should do and the cases we will use to verify it.

I don't think unstructured conversation, unstructured English, or Excel tables are the solution. This is still an unsolved problem in our industry.

60sec, about 6 years ago

The main problem with cucumber / GWT is that in most implementations it serves as an opaque abstraction layer which is an incomplete/incorrect model abstraction of the system itself.

Been doing a lot of API testing recently with Karate DSL and writing cucumber tests that include JSON expressions with some syntactic sugar for validation. The tests serve as a specification for the system which is actually quite a bit more precise than even Swagger, since you can even go back in time and compare the deltas on request/response between test executions to troubleshoot regressions.

Agree that GWT can't help the business understand the inherent complexity of a state machine, but individual tests can be used effectively to model state transitions, especially at the API level.

Rooster61, about 6 years ago

I think a lot of the issue is the misconception that G/W/T feature files can essentially replace specifications/requirements. They can and absolutely should REFLECT the requirements, but they are ill suited to act as the actual specifications themselves.

Feature files to me are most effective when they act as a roadmap of the steps one needs to take to effectively test a given set of use cases, NOT a 1-1 carbon copy of the requirements. A feature file should tell a non-programmer what the test is doing without having to dive into the code, while being a scaffold into which a programmer can build their test logic. If one needs to look at the requirements, one should do just that: read the requirements document, or in the developer's case, read the requirements set forth in the user story. Scenarios are guides to how to navigate what the test is doing, not what the application itself should be doing.

Also, I often see programmers attempt to write a BDD test and run into a case that doesn't quite fit flush into G/W/T, then ask the community how they might go about writing that test. Instead of understanding and flexibility, they are met with an abrupt "that's not BDD, you are doing it wrong, if you did it the BDD way everything would work out". That's discouraging, frustrating, and destructive. G/W/T is not gospel, and it doesn't fit all test cases. I see nothing wrong with fudging some tests to not follow Gherkin to the letter if it better facilitates a test while still remaining clear what the test is doing in plain English within the feature file's scenario.

neves, about 6 years ago

I can't agree more. I have yet to see a testing tool that can be used for specification and used by end users. Today everything is developer-centric. Maybe there's no escape from this. The solution is really to make your developers understand the business.

RHSeeger, about 6 years ago

> We will realise that describing data and calculations using the Given-When-Then format leads to tragedy, and will create and popularise tools and approaches using Excel to document examples.

Given all the examples given, it sounds more like Cucumber is the problem. Having specifications/tests written in the form of given-when-then isn't shown to have any issues. Rather, taking those specs/requirements and disconnecting them from the people who need them is the issue.

rgoulter, about 6 years ago

> A common anti-pattern with Given-When-Then has the Three Amigos collaborating on scenarios that are stored as acceptance criteria in user stories.

I agree with this. Writing Cucumber/Gherkin scenarios is extra effort if the Cucumber files themselves aren't used/read elsewhere. It'd be simpler to embed "Given/When/Then" statements within test code (like RSpec).

I'd emphasise that Gojko Adzic's "Specification By Example" suggests discussing examples before refining to a specification; that may get around the author's complaint that non-table formats don't allow for important cases.

That said, "Given/When/Then" is hardly magical, so it doesn't deserve much praise/criticism itself. Any test involves "do an action, check the result" (with "set up the system" and "clean up" being implied). This is sometimes called "Assemble, Act, Assert". "G/W/T" is just a neat, consistent format for describing behaviour in English. A table of values specifies some computation; the column titles help to describe that behaviour.
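
To make the last point concrete, here is a rough sketch in Python's standard `unittest` of a table of values whose column titles describe the behaviour, with Given/When/Then kept as comments in the test code. The `order_total_cents` function and the example rows are invented for illustration and do not come from the article or the thread.

```python
import unittest


def order_total_cents(unit_price_cents, quantity, discount_pct):
    """Hypothetical calculation under test, using integer cents."""
    gross = unit_price_cents * quantity
    return gross * (100 - discount_pct) // 100


# The column titles carry the description; each row is one example.
EXAMPLES = [
    # (unit_price_cents, quantity, discount_pct, expected_total_cents)
    (1000, 1, 0, 1000),
    (1000, 3, 0, 3000),
    (1000, 3, 50, 1500),
    (1999, 2, 10, 3598),
]


class OrderTotalTest(unittest.TestCase):
    def test_examples(self):
        for unit_price, quantity, discount, expected in EXAMPLES:
            with self.subTest(unit_price=unit_price, quantity=quantity, discount=discount):
                # Given an order line and a discount (the row's input columns)
                # When the total is calculated
                actual = order_total_cents(unit_price, quantity, discount)
                # Then it equals the row's expected column
                self.assertEqual(actual, expected)
```

Saved as a module, this can be run with `python -m unittest`. Using `subTest` means one failing row is reported on its own without hiding the others, which is roughly what a reader of a spreadsheet of examples would expect.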

dmitryminkovsky, about 6 years ago

This is why I like Spock [0]. You can go G/W/T, W/T, T, or Expect [1]. It's billed as "multi-paradigm" which really just means you can do whatever feels right for a given case. Also its data table feature is wonderful [2][3].

[0]: http://spockframework.org/
[1]: http://spockframework.org/spock/docs/1.3/spock_primer.html#_blocks
[2]: http://spockframework.org/spock/docs/1.3/data_driven_testing.html
[3]: https://twitter.com/dminkovsky/status/1116727735399976966

raldi, about 6 years ago

This article would have been better if at some point it explained what Given-When-Then is.

verisimilitudes, about 6 years ago

> Before the internet, user experience was considered of little value because most users of systems were internal employees of companies.

I'm skeptical of this claim. It would help if there were a year, considering it's not clear if the author means the modern Internet, the ARPANET, or when the modern Internet became widely available to a larger group of people. Based on the mentions of Excel, I'm inclined to believe it's the last option I listed.

The MIT AI lab and other research areas come to mind as places that cared about how the programs were operated and whatnot, and these weren't exclusively used by employees. SHRDLU comes to mind. While I'm thinking about it, the Apple Macintosh also does.

I don't believe I was familiar with this Given-When-Then model beforehand, but I also think that's because it's somewhat natural, or at least seems natural. The author has failed to convince me why this is a bad thing. I suppose I can see why these larger business practices are poor, but that has me failing to see why this particular practice is singled out.

exelius, about 6 years ago

Yeah; I suspect the core of the problem is that too many software developers fancy themselves business analysts while too many business analysts start to cower in fear whenever you suggest they check something in to GitHub.

projektfu, about 6 years ago

You might want to check out Fit as a testing framework that may be more appropriate for your use case than Cucumber. Not sure if it's still well maintained, but it could be brought up to speed without much effort.

noveltyaccount, about 6 years ago

TL;DR: Given-When-Then can sometimes obscure requirements rather than illuminate them. Use Excel or other tools to document such scenarios, as everyone in the software design process can understand Excel formulas.

jrochkind1, about 6 years ago

I think there's a lot of people down on cucumber after experience with it, in several different contexts... what do you think, what has been your experience?

philipodonnell, about 6 years ago

> The discussion helped me realise that Given-When-Then is as much of a hindrance in some contexts as it is a help in other contexts.

I like when authors express a strong viewpoint but then also include descriptions of circumstances where their viewpoint may not be applicable. This seems to be alluded to in the above quote, but are there specific contexts where Given-When-Then _is_ helpful and the appropriate mechanism to document requirements?

xchip, about 6 years ago

That is pretty much the core of software engineering, I'd call it Data-If-Then and I fail to see why this is a tragedy.

barbecue_sauce, about 6 years ago

What ever happened to Systems Analysts?

jodrellblank, about 6 years ago

arcfide has claimed on several occasions that non-programmers take to APL quite easily, and can read and collaborate on it with a programmer, be talked through it directly, in a way they can't/won't do for mainstream languages.

I find this such an unlikely-sounding claim that I want to reject it without consideration, or at least assume that he's only talking to a very restricted subset of engineering non-programmers.

APL is decades old and very much a business language by origin in IBM System 360, so there ought to be decades of people's experience with this on both sides, programmer and non-programmer, to back it up or refute it. Is there?

macca321, about 6 years ago

It's a fallacy that GWT has to be at the browser automation level. And it's a lot easier to understand the point of a test if it's got GWT comments interspersed with the code.
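
As a rough illustration of this point, the sketch below puts Given/When/Then comments directly in a plain unit-level test, with no browser or Gherkin tooling involved. The `Order` class and its states are hypothetical, invented only to give the comments something to describe.

```python
import unittest


class Order:
    """Hypothetical domain object with a small state machine."""

    def __init__(self):
        self.state = "new"

    def pay(self):
        if self.state != "new":
            raise ValueError(f"cannot pay an order in state {self.state!r}")
        self.state = "paid"

    def ship(self):
        if self.state != "paid":
            raise ValueError(f"cannot ship an order in state {self.state!r}")
        self.state = "shipped"


class OrderShippingTest(unittest.TestCase):
    def test_paid_order_can_be_shipped(self):
        # Given a paid order
        order = Order()
        order.pay()
        # When it is shipped
        order.ship()
        # Then its state is "shipped"
        self.assertEqual(order.state, "shipped")

    def test_unpaid_order_cannot_be_shipped(self):
        # Given a new, unpaid order
        order = Order()
        # When shipping is attempted
        # Then the transition is rejected and the state is unchanged
        with self.assertRaises(ValueError):
            order.ship()
        self.assertEqual(order.state, "new")
```

Each test covers a single state transition, which also fits the suggestion elsewhere in the thread that individual tests can model transitions even when the whole state machine is hard to convey.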