ImageCLEF Social Media User Data Awareness

The dataset consists of 1000 user profiles, each with 100 photos, annotated via crowdsourcing with an appeal score for a series of real-life situations. Participants in the experiment were asked to provide a global rating of each profile in each modeled situation, using a 7-point Likert scale ranging from strongly unappealing to strongly appealing. An averaged and normalized appeal score was used to create a ground truth composed of ranked users in each modeled situation. User profiles were created by repurposing a subset of the YFCC100M dataset. In accordance with GDPR, data minimization is applied, keeping only the information necessary to carry out the research, in anonymized form. Resources include (i) anonymized visual concept ratings for each modeled situation and (ii) automatically extracted predictions for the images that compose the profiles.
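As a rough illustration of how an averaged and normalized appeal score can be turned into a per-situation ranking, the sketch below may help. It is not the organizers' actual procedure: the numeric coding of the Likert scale (1 = strongly unappealing, 7 = strongly appealing), the choice of min-max normalization, and the function name `rank_profiles` are all assumptions made for this example.

```python
from statistics import mean


def rank_profiles(ratings):
    """Average, normalize, and rank profiles for ONE modeled situation.

    `ratings` maps a (hypothetical) profile id to the list of crowdsourced
    Likert ratings it received, coded 1..7 (assumed coding).
    """
    # Step 1: average the ratings each profile received.
    averaged = {pid: mean(vals) for pid, vals in ratings.items()}

    # Step 2: min-max normalize to [0, 1] (assumed normalization scheme).
    lo, hi = min(averaged.values()), max(averaged.values())
    span = hi - lo
    normalized = {
        pid: (avg - lo) / span if span else 0.0
        for pid, avg in averaged.items()
    }

    # Step 3: rank profiles from most to least appealing.
    ranking = sorted(normalized, key=lambda pid: normalized[pid], reverse=True)
    return normalized, ranking


# Toy data, invented for illustration only.
example = {"u1": [7, 6, 7], "u2": [1, 2, 1], "u3": [4, 4, 4]}
scores, order = rank_profiles(example)
print(order)
```

Running the same function once per situation would yield one ranked list of users per situation, which matches the shape of the ground truth described above.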

The dataset was validated during the 2021 and 2022 editions of the ImageCLEF aware task.

For more details see:
  1. A. Popescu, J. Deshayes-Chossart, H. Schindler, B. Ionescu, "Overview of the ImageCLEF 2022 Aware Task", ImageCLEF 2022, CLEF Conference and Labs of the Evaluation Forum, ISSN: 1613-0073, September 5–8, 2022, Bologna, Italy.

This dataset was conceived using the data gathered during the ImageCLEFaware task. We therefore acknowledge the valuable contribution of the task organizers: Adrian Popescu, Jérôme Deshayes-Chossart and Bogdan Ionescu. This work was supported by the AI4Media project, A European Excellence Centre for Media, Society and Democracy, H2020 ICT-48-2020, grant #951911.