Lean Multiclass Crowdsourcing
Research output: Contribution to journal › Conference article › Research › peer-review
We introduce a method for efficiently crowdsourcing multiclass annotations in challenging, real-world image datasets. Our method is designed to minimize the number of human annotations that are necessary to achieve a desired level of confidence on class labels. It is based on combining models of worker behavior with computer vision. Our method is general: it can handle a large number of classes, worker labels that come from a taxonomy rather than a flat list, and it can model the dependence of labels when workers can see a history of previous annotations. Our method may be used as a drop-in replacement for the majority vote algorithms used in online crowdsourcing services that aggregate multiple human annotations into a final consolidated label. In experiments conducted on two real-life applications, we find that our method can reduce the number of required annotations by as much as a factor of 5.4 and can reduce the residual annotation error by up to 90% when compared with majority voting. Furthermore, the online risk estimates of the models may be used to sort the annotated collection and minimize subsequent expert review effort.
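The aggregation idea in the abstract can be sketched with a classic worker-confusion-matrix model (Dawid-Skene-style EM). This is not the paper's exact method (which additionally incorporates computer vision, taxonomies, and annotation-history dependence), but it illustrates how modeling worker behavior contrasts with plain majority voting; all names and the toy setup below are illustrative.

```python
# Sketch: majority vote vs. Dawid-Skene-style aggregation, where each worker
# gets an estimated confusion matrix learned via EM. Illustrative only; not
# the paper's model, which also conditions on computer-vision evidence.
from collections import Counter
import numpy as np

def majority_vote(labels_per_item):
    """Return the most frequent label for each item (ties broken arbitrarily)."""
    return [Counter(lbls).most_common(1)[0][0] for lbls in labels_per_item]

def dawid_skene(labels, n_items, n_workers, n_classes, n_iter=20):
    """labels: list of (item, worker, label) triples.
    Returns a posterior distribution over classes for each item."""
    # Initialize item posteriors from per-item label counts (soft majority vote).
    post = np.full((n_items, n_classes), 1e-6)
    for i, _, l in labels:
        post[i, l] += 1.0
    post /= post.sum(axis=1, keepdims=True)
    for _ in range(n_iter):
        # M-step: estimate each worker's confusion matrix and the class prior.
        conf = np.full((n_workers, n_classes, n_classes), 1e-6)  # smoothing
        for i, w, l in labels:
            conf[w, :, l] += post[i]  # conf[w, true_class, observed_label]
        conf /= conf.sum(axis=2, keepdims=True)
        prior = post.mean(axis=0)
        # E-step: recompute item posteriors from the worker reliabilities.
        log_post = np.tile(np.log(prior), (n_items, 1))
        for i, w, l in labels:
            log_post[i] += np.log(conf[w, :, l])
        post = np.exp(log_post - log_post.max(axis=1, keepdims=True))
        post /= post.sum(axis=1, keepdims=True)
    return post
```

A per-item risk estimate such as `1 - post.max(axis=1)` can then sort the collection so that the least confident items are reviewed first, in the spirit of the expert-review prioritization the abstract describes.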
Original language | English
---|---
Journal | Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
Pages (from-to) | 2714-2723
Number of pages | 10
ISSN | 1063-6919
DOIs |
Publication status | Published - 14 Dec 2018
Externally published | Yes
Event | 31st Meeting of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2018 - Salt Lake City, United States (Duration: 18 Jun 2018 → 22 Jun 2018)
Conference
Conference | 31st Meeting of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2018
---|---
Country | United States
City | Salt Lake City
Period | 18/06/2018 → 22/06/2018
Bibliographical note
Publisher Copyright:
© 2018 IEEE.