In this paper we report on an evaluation of unsupervised labeling of audiovisual content using collateral text data sources to investigate how such an approach can provide acceptable results given requirements with respect to archival quality, authority and service levels to external users. We conclude that with parameter settings that are optimized using a rigorous evaluation of precision and accuracy, the quality of automatic term-suggestion are sufficiently high. Having implemented the procedure in our production work-flow allows us to gradually develop the system further and also assess the effect of the transformation from manual to automatic from an end-user perspective. Additional future work will be on deploying different information sources including annotations based on multimodal video analysis such as speaker recognition and computer vision.

, , , ,
S. Kapidakis (Sarantos) , C. Mazurek (Cezary) , M. Werla (Marcin)
Springer International Publishing
Springer International Publishing
doi.org/10.1007/978-3-319-24592-8_4
Access

de Boer, V., Ordelman, R., & Schuurman, J. (2015). Practice-Oriented Evaluation of Unsupervised Labeling of Audiovisual Content in an Archive Production Environment. In S. Kapidakis, C. Mazurek, & M. Werla (Eds.), Research and Advanced Technology for Digital Libraries (pp. 43–55). doi:10.1007/978-3-319-24592-8_4


Additional Files
External-Link-Default-Image.png Cover Image , 170kb