Extracting mise-en-scene and emotional metadata from video content

Presenter(s): Paolo Cremonesi (Contentwise)

In this talk, Paolo Cremonesi from Contentwise describes a multi-modal content-based recommender system that replaces traditional metadata with emotional descriptors automatically extracted from the visual and audio channels of a video. He also presents the results of several user studies that compare the quality of recommendations based on emotional descriptors against metadata-based baselines.