Publikation

Style Modeling for Tagging Personal Photo Collections

M. Duan; Adrian Ulges; Thomas Breuel; X.-Q. Wu

In: Proceedings of the International Conference on Image and Video Retrieval. ACM International Conference on Image and Video Retrieval (CIVR-09), July 8-10, Santorini, Greece, ACM, New York, US, 7/2009.

Zusammenfassung

While current image annotation methods treat each input image individually, users in practice tend to take multiple pictures at the same location, with the same setup, or over the same trip, such that the images to be labeled come in groups sharing a coherent "style". We present an approach for annotating such style-consistent batches of pictures. The method is inspired by previous work in handwriting recognition and models style as a latent random variable. For each style, a separate image annotation model is learned. When annotating a batch of images, style is inferred using maximum likelihood over the whole batch, and the style-specific model is used for an accurate tagging. In quantitative experiments on the COREL dataset and real-world photo stock downloaded from Flickr, we demonstrate that %style consistency helps image annotation to disambiguate and improves the overall tagging performance. - by making use of the additional information that images come in style-consistent groups - our approach outperforms several baselines that tag images individually. Relative performance improvements of up to $80$\% are achieved, and on the COREL-5K benchmark the proposed method gives a mean recall/precision of 39%/25%, which is the best result reported to date.

Projekte

MOONVID - Statistische Modellierung des Inhaltes von Online Videos zur automatisierten Detektion semantischer Konzepte in Videos

styletagging-civr.pdf (pdf, 1 MB )