Abstract
The evaluation of large amounts of digital image data is of growing importance for biology, including for the exploration and monitoring of marine habitats. However, only a tiny percentage of the image data collected is evaluated by marine biologists who manually interpret and annotate the image contents, which can be slow and laborious. In order to overcome the bottleneck in image annotation, two strategies are increasingly proposed: "citizen science" and "machine learning". In this study, we investigated how the combination of citizen science, to detect objects, and machine learning, to classify megafauna, could be used to automate annotation of underwater images. For this purpose, multiple large data sets of citizen science annotations with different degrees of common errors and inaccuracies observed in citizen science data were simulated by modifying "gold standard" annotations done by an experienced marine biologist. The parameters of the simulation were determined on the basis of two citizen science experiments. It allowed us to analyze the relationship between the outcome of a citizen science study and the quality of the classifications of a deep learning megafauna classifier. The results show great potential for combining citizen science with machine learning, provided that the participants are informed precisely about the annotation protocol. Inaccuracies in the position of the annotation had the most substantial influence on the classification accuracy, whereas the size of the marking and false positive detections had a smaller influence.
Publication types
-
Research Support, Non-U.S. Gov't
MeSH terms
-
Animals
-
Aquatic Organisms
-
Arthropods / anatomy & histology
-
Arthropods / classification
-
Citizen Science / methods*
-
Cnidaria / anatomy & histology
-
Cnidaria / classification
-
Deep Learning*
-
Echinodermata / anatomy & histology
-
Echinodermata / classification
-
Humans
-
Image Processing, Computer-Assisted / statistics & numerical data*
-
Imaging, Three-Dimensional
-
Marine Biology / instrumentation
-
Marine Biology / methods*
-
Mollusca / anatomy & histology
-
Mollusca / classification
-
Porifera / anatomy & histology
-
Porifera / classification
Grants and funding
We thank NVIDIA Corporation (
www.nvidia.com) for donating the GPU used in this project to TWN. The funder provided no support in the form of salaries for any author and did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. DL has received funding by Projektträger Jülich (grant no 03F0707C), as well as ESL, DOBJ under the framework of JPI Oceans. ESL, DOBJ, BH have received funding from the European Union Seventh Framework Programme (FP7/2007-2013) under the MIDAS (Managing Impacts of DeepseA reSource exploitation) project, grant agreement 603418. Funding was also provided from the UK Natural Environment Research Council through National Capability funding to NOC. The funder provided support in the form of salaries for authors DL, ESL, DOBJ, BH, but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific roles of these authors are articulated in the ‘author contributions’ section”.