Rantings of a Selenium Contributor: WebDriver: Y U NO HAVE HTTP Status Codes?!

Wednesday, July 25, 2012

WebDriver: Y U NO HAVE HTTP Status Codes?!

There's a long-standing issue in the Selenium issue tracker dealing with the fact that the WebDriver API does not expose HTTP status codes to the user. For those of you who like to keep score at home, it's issue #141 in the tracker. The issue was opened in February, 2009, and was closed as "Won't Fix" in December of that year. Despite the finality of many project contributors and the main project architect saying that this feature will not be made available, it has continued to garner comments, many vehement, for reconsideration. I'm going to try to spend a little bit making what I hope is a reasoned, rational argument why this decision was made, and why the feature isn't needed in the project.

How Did We Get Here?

What follows is an oversimplified brief recap of some history. In the beginning was Selenium RC, which was an API that grew organically during its existence, with no rhyme or reason for how methods were tacked onto its single object. Over time, it became a brilliant example of the God object anti-pattern, violating all kinds of object-oriented programming principles. One of the more obscure methods engendered by the organic growth of the project was called captureNetworkTraffic(). This method purported to capture all of the network traffic between the browser and the site being automated, and was made possible because in some browser configurations, Selenium acted as a proxy between the browser and the site being automated, thus all browser traffic passed through Selenium, and could be captured or manipulated.

And so it came to pass that Selenium RC was found wanting, and lo, there was much weeping and wailing and gnashing of teeth in the browser automation community. And thus it was that the WebDriver project was born, and eventually merged into the Selenium project to become Selenium WebDriver. WebDriver was a completely different approach to browser automation, preferring to act more like a user, which solved the fundamental problems inherent in the Selenium RC approach. However, since much of the actual driving of the browser would now be done external to the browser itself, and with no proxy in between the browser and the site being automated, it made creating a method similar to captureNetworkTraffic() difficult at best, and impossible at worst.

Where Are We Now?

This brings us to the current state of affairs, with HTTP status codes being unavailable in the WebDriver API. An architectural decision was made during the creation and development of the WebDriver API that this feature would not be implemented, and that it would be declared "out of scope" for Selenium WebDriver. This has caused some consternation among users, especially since the issue has been closed, and it's been said that the feature won't be implemented, and been said so in extremely plain, even blunt, terms. There are some valid technical reasons why this decision was made. Here are just a few of them:

What about redirects? Do you just return the last code after the redirect, or the code that indicates a redirect? What do you say to those who disagree with your answer to that question?
Some browsers make it impossible to get the status code. Do you really want an API that works on some browsers, but not others?
WebDriver is concerned with driving the browser, not necessarily just web application testing. Does returning HTTP status codes really fit this mission?
WebDriver is concerned with driving the browser as a user would. Does a browser show HTTP status codes to the user, or just rendered pages?

Nevertheless, that hasn't stopped people from arguing that it should be included. Let's take a few minutes to examine some of those arguments, and then we'll take a look at what options there are for solving this issue.

"HTTP status codes are an important part of website testing."

Yes, I can see the argument that HTTP status codes could be an important part of testing your website. However, while web application testing is an important use case of the Selenium project, it's far from the only one. A frequent response to this is, "But your own web pages and literature say it's for web application testing!" That's a fair critique about documentation and public perception of the project, but that's really a separate discussion. HTTP status codes are not a required part of automating the browser.

"I don't care that it doesn't work on all browsers, I need those status codes."

One of the major advantages of the WebDriver API over what has come before is its elegance and purity. Implementing a solution that works in only some browsers is vastly inferior to one that works for all browsers, especially when the solution isn't in the core competency of the library. There are solutions that do work for all browsers without polluting the API, albeit they require integration with other tools that <gasp> are not Selenium. Tacking on this feature for some browsers in a suboptimal way that might miss important edge cases because it's not a core competency is akin to driving a wood screw into a board with a hammer because that's the only tool you have. Yes, it'll work, but it's not going to be pretty, and is likely to fail you somewhere down the line.

"Other parts of the WebDriver API let you do things a user can't do, why not expose status codes?"

Proponents of this argument usually cite manipulation of cookies, or finding of elements by anything other than visual inspection, or viewing a page's HTML source, or any number of other items. All of those items are germane to interacting with the page itself, with what is displayed. The HTTP status code is not directly concerned with what is displayed on the page.

"But so many people want the feature, you should really add it."

Ah, yes, the old, "But I want a pony," argument. Or it's corollary, "20 million New Kids On The Block fans can't be wrong!" This argument is occasionally followed by a sometimes hostile, "Well, your lack of this feature makes your library completely useless to me," or even, "You'd better add it or else I'll stop using it!" This latter response is the equivalent of, "I'm going to hold my breath until you give me exactly what I want!" Just because something's popular doesn't make it a good idea.

Where Do We Go From Here?

Just as proponents of wanting to see HTTP status codes in the WebDriver API are convinced that the arguments against including it don't hold water, the members of the project team are equally convinced that adding it is a bad idea. So if you feel like you absolutely need to have them, what can you do? Well, you have a couple of options.

Remember how I said earlier that Selenium RC was able to give you this information because it acted as a proxy? You can recreate that exact same environment with WebDriver! Of course, you have to use a dedicated proxy to do it, but guess what? There are lots of software proxies around that make this really easy. Additionally, you're using the proxy to capture the traffic, not something half-baked that's been shoehorned into the WebDriver API, which is a little thing I like to call using the right tool for the job. As an example, the BrowserMob proxy is one that's being used successfully by lots of people. It's open-source, and it's written in Java, so if you're using Java, you can even control it directly from your existing code. If you're not using Java, fear not, as there are wrapper libraries written for many different languages, including Python, .NET, Ruby, and PHP, to name a few. The WebDriver library even allows you to set the browser you're automating to use the proxy.

"But I don't want to use a proxy! I only want to have to manage Selenium as a dependency," I hear you say. I've got a little secret for you: WebDriver is Open Source Software. It's even very liberally licensed. If you don't like a decision made by the project team, fork it, create a patch, and share with the world. I personally love innovation in the Open Source world. Don't tell me how you want to see it done, show me, with working code. Unless you can demonstrate your solution working for all browsers though, I'll probably use a proxy if I need this functionality, and that's my choice.

22 comments:

YaciAugust 3, 2012 at 5:52 AM
The problem with BrowserMob is that it's still in beta phase and it's not very stable. Although I've used it successfully in many project it's been always causing some problems sometimes to the extent that I had to abandon it. Alternatively I use Firefox+Firebug+NetExport but this solution is slow and has it's drawbacks too. If anyone knows about a stable proxy I could use with Java I would appreciate if they share their knowledge.
ReplyDelete
Replies
asbjornuNovember 15, 2012 at 7:56 AM
Not exposing HTTP Status codes was the single most important reason I stopped using Selenium WebDriver. It's just a way too important detail about HTTP to leave out of an HTTP testing framework.

The only argument you make that holds water (imho) is that introducing HTTP status codes won't make it coherent across all implementations. The solution to this is to throw exception in browsers that don't expose it. People are used to special-case their for different browsers already, so this wouldn't cause much confusion (if the exception thrown has an intuitive message, at least).

So, I didn't understand this decision before reading this blog post and still don't.
ReplyDelete
Replies
LarsNovember 15, 2012 at 8:47 AM
Yes, I can see the argument that HTTP status codes could be an important part of testing your website.

It's encouraging to see that you are listening to these important points in favor of exposing HTTP status codes.

while web application testing is an important use case of the Selenium project, it's far from the only one

That's a bit like saying that going forward is an important use case for a car. The Seleniumhq web site says it is the *primary* use case: "Primarily it is for automating web applications for testing purposes." And it's very difficult to reconcile this with your conclusion that "the feature isn't needed in the project".

That's a fair critique about documentation and public perception of the project, but that's really a separate discussion.

You can try to separate it, but it still needs to be addressed, because this whole issue turns on the question (it didn't use to be a question at all) of whether Selenium is for testing web applications. Until the conflict between public promises of web application testing, and statements that WebDriver is only for browser automation, are addressed, you will continue to have users who feel like they've been "baited and switched."

HTTP status codes are not a required part of automating the browser.

This just begs the above question.

I hear you acknowledging that the feature is important, but that it would be very difficult to implement in a way that works across all browsers. That's a fair answer. Asking others to help/implement the feature is a fair answer, if the project committers would accept contributions. What's not fair is claiming that web application testing features are not needed in Selenium/WebDriver, despite years of marketing Selenium as a web application testing tool.
ReplyDelete
Replies
Jack of all tradesMay 7, 2013 at 9:33 AM
This is probably too late for anyone to read.. but just in case.. I am using Webdriver and a site I test, upon login, redirects me to another page. The response on login is 302. When I do this manually the site works fine. When I record a script using Selenium IDE and play it back, it works fine (uses clickAndWait). For some reason, despite putting wait code in my java webdriver code, the redirect never occurs upon login and the main login screen shows up again. It's as if the browser that is being driven just doesn't issue the redirect for some reason. I've no idea why this is. I would love to have access to the status code and if it's a 302, grab the location and use webdriver to navigate to that location, but obviously that's impossible. I'd be fine without the codes if the webdriver driven browser would just redirect while my code "waits" for an element to show up.

If any of you know why this happens.. why selenium ide works, manual works, but webdriver prevents the browser from redirecting.. I'd appreciate it.
ReplyDelete
Replies
InnovapathJuly 26, 2013 at 2:07 PM
This comment has been removed by a blog administrator.
ReplyDelete
Replies
UnknownJanuary 28, 2017 at 2:23 AM
Hi,
Well explained about Rantings of a Selenium Contributor. can you explain about What is the difference between Selenium core extensions and Selenium IDE extensions?
Thanks,
David,
Selenium Developer
ReplyDelete
Replies
Severity OneFebruary 9, 2018 at 2:02 AM
While I appreciate that getting the response codes is not always possible, the argument that Selenium mimics a real browser user doesn't hold much water.

Why? Because a computer program is not a person.

When I browse to my favourite site, to read the news or to watch a funny cat video, I do so with the specific purpose of getting information, or entertainment, or whatever. A computer program going to the same site does so for very different reasons: either to test the site, or to scrape information from it. A computer program doesn't care about what happened in the world, or cat antics.

A computer program needs to get information from the responses to the requests it makes. And some of the most valuable sources of information.

It's not a problem that not all browsers offer this information. In fact, it's totally irrelevant. If I write a computer program that needs this information, then it's up to me to choose a browser that does. It's not up to Selenium to decide that I can't have this information, because some browsers that I don't care about don't offer it.
ReplyDelete
Replies
ppsspppspMarch 27, 2019 at 11:43 PM
This comment has been removed by the author.
ReplyDelete
Replies
ManipriyanJuly 16, 2019 at 11:27 PM
This comment has been removed by the author.
ReplyDelete
Replies

Add comment