RSS icon [ Subscribe to Alex Horstmann's RSS feed ]

 BlobFisk.com

The online home of user experience manager Alex Horstmann

Usability testing in times of reduced budget

There is no better way of informing decisions around design, usability and user interaction with a web page than testing it on real users. The results that you get from testing are second to none, and have made a huge difference to the latest versions of the Channel 4 website, which I’ve had the pleasure of being involved with.

Unfortunately, Formal testing is expensive and time consuming. You need to pay for the use of a testing lab; pay for recruitment; spend time preparing the recruitment screener and test script; spend 2 days testing (with, ideally, 2 people taking notes); and write up your results and recommendations. The results are great but the cost is high.

There is an alternative though! Guerilla testing.

The Formal Way

The process I use when conducting formal, lab based user testing sessions is this:

1.  Prepare the recruitment screener based on the site’s Personas,
2.  Write the test script (this can take a bit of time!),
3.  Test the script internally to see how long the test takes, tweak questions etc.,
4.  Conduct the tests. These are usually run over 2 days, using 10 participants. The test lasts an hour and we take the users through a number of tasks on the site, getting their thoughts and feedback throughout. 1 or 2 people are in the observation room taking detailed notes.
5.  Review the notes, with the recording of the sessions and summarise, with recommendations.

This is quite a time consuming process, and can take up to 2 weeks. In projects with a tight deadline and even tighter budgets this sort of testing can be too much.

The Guerilla Way

Jakob Neilson tells us that testing with 5 people will find 85% of usability issues, and testing with 15 will find close on 100% of issues (Alertbox, 2000: Why You Only Need to Test With 5 Users). This is very interesting. It means that if you can quickly and easily test your site/application with 5 users you will find the vast majority of issues, and you if you do this early enough, you can fix them.

The next test, again with 5 users, you are testing the fixes you made on the back of the first test and find the majority of the remaining issues. Interestingly, Neilson says that this second test allows you to:

probe deeper into the usability of the fundamental structure of the site, assessing issues like information architecture, task flow, and match with user needs. These important issues are often obscured in initial studies where the users are stumped by stupid surface-level usability problems that prevent them from really digging into the site.

This methodology is possible using formal, lab based testing. However, this is expensive. Instead, you can conduct guerilla testing. Ad-hoc, on the fly testing. You still need to recruit participants that match your Personas, but you carry out the test in a meeting room, canteen, coffee shop… How though?

The Silverback Gorilla

The Silverback Gorilla

I’ve been looking at Silverback, a guerilla testing application from Clearleft. It’s lightweight and very cost effective ($50). The downside is that it only runs on a Mac, but buying the hardware and software is still cheaper that a formal lab session, and it’s reusable.

It is slightly harder if you want note takers in guerilla testing, as they need to be within earshot of the participant. However, Silverback uses the built in camera (or external web cam) to record the user, captures what the user is seeing on the screen and shows where the user has clicked. The person running the test can insert mark points to act as a way point for questions or when something interesting has happened.

I really like the guerilla testing method. It’s fast and effective, and allows you to get very quick results. I don’t think that it will become a replacement for formal lab testing, but I see it becoming a more prevalent asset in the User Experience and Usability professional’s toolbox.

There are other, remote, testing solutions coming to the fore, like Userfly and Webnographer, which have the potential to provide great insight. However, nothing beats the act of watching someone interact with your site/software – that’s when you get the real insight into what is good and bad and what needs changing.

Bookmark and Share

Tags: , ,

This entry was posted on Wednesday, January 21st, 2009 at 14:01 and is filed under Usability. You can follow any responses to this entry through the RSS 2.0 feed. You can leave a response, or trackback from your own site.

3 Responses to “Usability testing in times of reduced budget”

  1. January 21st, 2009 at 17:58

    James Page says:

    Thanks for the plug.

    I do have an issue with Neilson 5 user claim, is that it is based on very flawed maths. It assumes that every problem has the same chance of being discovered. Jared Spool and co did a study that showed you sometimes needed over 40 users to discover 85% of usability issues. With just 5 users you margin of error will be region of +/-28%. You need to estimate the number of issues that you will discover, and the severity of them to know the number of participants needed.

    Neilson’s flawed maths was one of the reasons that we came up with the Webnographer idea.

    I would add to this that the configuration of users machine has major impact. Screen Size, version of Flash, and Operating System. So Silverback would be fantastic if everybody used Macs. Sadly they don’t.

    After saying all that we use Guerilla testing to see if we are on the right track or not, with remote as a means of verification.

  2. January 22nd, 2009 at 13:05

    Alex says:

    Hi James,

    Thanks for your comment. I see your point about the probability of finding issues with just 5 users, but I feel that you need to conduct a number of guerrilla tests, iteratively, with 5 users. That way, the probability of finding all the issues increases with each iteration.

    Do you have a URL for the Spool study? I’d be really interested to see it – is it for qualitative or quantitative studies?

    You make a good point about the test machine configuration – but you can get around this using virtual machines (I know, it’s a bit of a pain!).

  3. January 22nd, 2009 at 15:01

    James Page says:

    I will try finding the academic paper, but the write up to the study is here.

    http://www.uie.com/articles/eight_is_not_enough

    By the time you have tested with 90 users (which is Spool’s findings to find 80% of the issues), with 5 users every week that will take 18 weeks! The challenge with this method is that is gives you no prioritisation. Will the first 5 users identify the issues that you should fix first.

    Using remote would let you find most of the issues within one week, and be able to prioritise the issues that are important.

Leave a Reply