Validity and reliability of naturalistic driving scene categorization judgments from crowdsourcing

Christopher D. Cabrall
Zhenji Lu
Miltos Kyriakidis
Laura Manca
Chris Dijksterhuis
Riender Happee
Joost de Winter

A common challenge with processing naturalistic driving data for many different possible driving research interests or applications is that humans may need to categorize great volumes of recorded visual information until automated algorithms might be trained to do so alone.

This study, by means of the online platform CrowdFlower, investigated the potential of crowdsourcing to provide content identification categorizations of driving scene features (e.g., presence of another vehicle, straight road segments, etc.) at greater scale than a single person or a small team of researchers would be capable of. The validity and reliability of CrowdFlower results were examined, both with and without employing a set of randomly embedded controlled questions (Gold Test Questions) intermixed with experimental questions (Work Mode). In total, 200 workers from 46 countries participated in this study, and the collection of data lasted one and a half days.

By employing Gold Test Questions, we found significantly more accurate and consistent responses from external workers at both a smaller and larger scale of video segment categorizations for the identification of common driving scene elements (e.g., position and behavior of other vehicles, road and signage characteristics, etc.). In terms of validity and at the small scale, an average accuracy of 91% on paired items was found with the controlled questions compared to 78% without. A difference in bias was found where without Gold Test Questions external workers returned more false positives than false negatives whereas the opposite was found true of the condition with Gold Test Questions. At the large scale (making use of the controlled questions), a random subset of categorizations returned similar levels of accuracy (95%) and a similar pattern of error bias. In terms of reliability and at the small scale, where segments were rated in triplicate redundancy, the percentage of unanimous agreement was found significantly higher when using controlled questions (90%) than without them (65%). Across the small scale of internally validated answers, more than two-thirds of any correct categorization were unanimously returned and 86% or more of any correct categorization was returned by a majority vote. Where it would be infeasible to validate every response for accuracy, similar voting reliability results were found to exist across the responses of the large scale.

Overall results support compelling evidence for CrowdFlower as being able to yield valid and reliable crowdsourced categorizations of naturalistic driving scene contents in a short period of time and thus a potentially powerful and as-of-yet under-utilized resource in the toolbox of driving research and driving automation development.

MEET US


25-27
Aug

ICTTP 2020

ICTTP, International Conference on Traffic and Transport Psychology, is held in Gothenburg, Sweden.

LATEST NEWS


2018-11-19

Report regarding government commission on the costs of traffic to society has been submitted

Since 2013, the Swedish National Road and Transport Research Institute (VTI) has had several government commissions to produce documentation on the costs to society caused by traffic. On 1 November 2018, the agency reported its latest commission, Samkost 3....


2018-10-29

International standardisation efforts have many advantages

VTI participates in several international standardisation committees. The work is important because it helps to ensure that standards can be adapted to Swedish conditions and it also provides access to valuable contacts and networks.


2018-10-24

China wants to work with the best

Through the CTS cooperation, VTI is gaining valuable research contacts with China. The country is facing major challenges in the field of road safety but also has enormous potential.


2018-10-23

VTI participated in conference on electric roads

Systems with electrified roads are a relatively new concept and many projects have been launched in recent years. To stimulate the transfer of knowledge and collaboration, the Research and Innovation Platform for Electric Roads arranged its second...


2018-10-18

ADAS&ME is tackling the interaction between people and technology

ADAS&ME is a major EU project focused on automation, the human condition and the human environment. The budget is EUR 9.6 million and VTI is the coordinator.


2018-10-05

Users contribute to the development of train simulators

Apart from advanced driving simulators, VTI has developed several variations of train simulators which are used for training, education and research. In recent years, interest has increased drastically among major actors in the railway sector, and VTI has...