|
SIMS 213 Assignment 8Pilot Usability Study
IntroductionUsing the second interactive prototype, we developed a pilot usability study involving four tasks (three search tasks and task of submitting a brief). We tested the prototype with six testers and learned that the third interactive prototype needs to include more metadata for briefs. In addition, we discovered we would need to modify our method for recording task completion times to improve accuracy and we developed a hypothesis that providing additional brief metadata will make it easier for users to determine whether a brief satisfies a search. MethodParticipants
We selected tester 1 because we wanted to make sure our usability study was designed properly. We included tester 1 in our results because his results were consistent with the other testers. We selected testers 2 - 6 because they have significant legal research experience. ApparatusWe conducted tests 1 - 5 in the SIMS lab. We conducted test 6 in the tester's office. We used our second interactive prototype for all of the tests. Tasks
Procedure
Test MeasuresWe tested the prototype using observations of the testers' actions & comments as they attempted the tasks and a written post-test evaluation that the testers completed at the conclusion of the four tasks. ObservationsDuring our observations, we focused on the following:
Post-test evaluationAt the conclusion of the tasks, testers stated their degree of agreement or disagreement with statements regarding the prototype. These statements concerned the ease or difficulty of completing the tasks. Additionally, testers provided comments regarding the easiest and most difficult aspects of the prototype. ResultsObservationsPage Flow: All of the testers grasped the page flow for the search and submit a brief functions. Several testers asked whether the textfields for describing a submitted brief could hold complete descriptions. One tester questioned the purpose of the "Other Court" textfield for submitting a brief. Labels: For the most part, testers were comfortable with the prototype's labels and vocabulary. One tester was not sure whether the term "Brief Title" referred to the title of a brief or an abbreviated name for the case. Relevance of retrieval: Several testers commented that they would like to have more metadata for each legal brief. Currently, the prototype provides extensive metadata for the cases, but it only provides a title for each brief. Task completion time: We attempted to collect task completion times for testers 4, 5, and 6. In general, the times appeared to be acceptable, and testers needed less time to complete task 3 than task 1 (two similar searches). In addition, we observed that testers needed significantly more time to complete task 2 than either task 1 or task 3. The completion times for task 2 were longer because this task required testers to review the contents of briefs to determine whether a brief satisfied the task requirements. Obtaining accurate times was difficult because testers discussed their approaches to evaluating legal briefs in addition to commenting on the tasks. Consequently, it was not always clear when a tester believed a task was completed. Post-test evaluation scores (Statements evaluated on a scale of 1 [Disagree] to 5 [Agree])
Post-test commentsWhat was easiest? All of the testers commented on the ease of searching. What was most difficult? None of the testers raised the same issue. One tester mentioned that the label "brief title" was confusing. Another tester mentioned the inability to select several courts when searching. Did you notice you could not search by subject? Only three of the testers noticed they could not search by subject. Those who noticed the lack of subject search did not consider it to be a significant problem. DiscussionChanges to interfaceAccording to our observations and the testers post-test feedback, the most important change is the inclusion of additional metadata for the briefs. In particular, when displaying briefs we intend to identify which party a brief supports and whether a brief is an amicus brief or not. To facilitate the gathering of this information, we will also ask individuals to provide this information when describing a brief that is going to be submitted to BriefBank. Following are other potential changes
Changes to "real" experimentIn conducting a "real" experiment we would change our approach to recording the task completion times. To increase time accuracy, we would ask testers to indicate when a task is complete and to hold their comments until after a task is completed. We would phrase this request carefully because we do not want to discourage testers from providing comments. Formal Experiment DesignHypothesis: Our hypothesis is that providing additional metadata for briefs will make it easier for users to determine whether a brief satisfies a search. Factors and levels: The factor is the amount of metadata provided for briefs. The four levels are T (title of brief only), A (title of brief and whether brief is an amicus brief or not), S (title of brief and which party the brief supports), and Z (title of brief, whether brief is an amicus brief or not, and which party the brief supports). The response variable is the time to complete a search. Search completion time is an appropriate response variable because testers should complete searches more quickly if additional brief metadata allows them to include (or exclude) a brief. Blocking and repetitions: The experiment will be based on a within groups design and involve 16 third year law students. Each law student will complete five searches. The first search will not be included in the test results because we do not want the completion times to be influenced by a learning affect. Searches 2 - 5 would involve one search for each level. Appendices
|