Data

EBU activities related to Data, AI, and Machine Learning

Overview

PSMs are facing the need to raise the quantity and the quality of content they create to address linear and non-linear channels. To do so, they are adopting agile technologies to accelerate the production and the management of massive amounts of data. The three pillars of these evolutions are data management, computing architectures, and machine learning. EBU groups are addressing these three pillars by providing:

  • Development of open-source Tools - Machine Learning - Metadata 
  • Defining best practices and standards
  • Outreach and dissemination

Groups

AIM 

AIM is the umbrella group that oversees the DATA, METADATA and AI domain works, maximizing knowledge sharing and collaboration. This group meets monthly.

AI Benchmarking

The EBU and several Members have created and continue to develop a tool for benchmarking machine learning-based applications. After designing and publishing a tool to benchmark Speech-To-Text (STT) systems, a dataset to evaluate face recognition application, the group is now working on LLM, RAG and agents.

Metadata Modelling

The main activity of this group is to support EBU Members in the area of metadata. To achieve this, the group develops specifications and promotes innovation, notably through the development of EBUCorePlus, an open source ontology for media enterprises.

AI and Automatic Metadata Extraction (suspended)

The group has been suspended, and the presentations are still available in the workspace. Work addressed in this group includes metadata schemas, the capabilities and performance of automatic metadata extraction tools, and the development of machine learning algorithms and related tools.

Event

Data Technology Seminar - DTS is your ticket to staying ahead in the ever-evolving media landscape. Learn from the best in the industry, and actively contribute to the future of media with AI at its core.

 

Currently, the following deliverables are planned (green indicates the deliverable has already been delivered). Note that deliverables are dependent on enough participation in the work and that the planning is subject to change. New deliverables are added regularly.

2025

  • status_med_12px.png Launch of the AI Sandbox - a collaborative platform where EBU members can showcase and evaluate custom AI models designed for media applications. 

  • status_med_12px.png Knowledge sharing on metadata/data/AI technology - AIM monthly meetings

  • status_med_12px.png Organise the DataTech Seminar 2025

  • status_med_12px.png AI benchmarking group: Develop a Proof of Concept on Retrieval-Augmented Generation (RAG) to explore state-of-the-art applications in news production. Write report of findings and recommendations for the News and Technical Committees

  • status_med_12px.png Release of EBUCorePlus 2.0

  • status_med_12px.png  SMPTE - Update Engineering Report on AI for Media - Write a standard proposal on AI model registration

  •  status_med_12px.png TEMS (EU project ) - design and maintenance of the TEMSCore - the Media Data Space Ontology

  • status_med_12px.png VeraAI (EU project) - Design of ML algorithms for news authorship attribution and audience profile prediction

2024

  • status_done_12px.png Knowledge sharing on metadata/data/AI technology - AIM monthly meetings

  • status_done_12px.png DataTech Seminar 2024

  • status_done_12px.png Development and deployment of the meta-radio application on the AI HUB

  • status_done_12px.png SMPTE task force on AI - One report published - 3 standard proposals

  • status_done_12px.png Development of the cloud-hosted AI-HUB to evaluate, expose, and exchange AI applications

  • status_done_12px.png Development and deployment of a face recognition system for TV programme on the AI HUB

  • status_done_12px.pngDevelop a demo for IBC - Technical Paper accepted

  • status_done_12px.png Research project with the EPFL - One master's thesis successfully completed

  • status_done_12px.png Specification of the Metadata model forTEMS - first version published

  • status_done_12px.png MCMA - update the Libraries and publish a SMPTE standard - ST 2126

  • status_med_12px.png AI Benchmarking Group: POC on RAG/LLM/Agents  for News - deliverable in 2025

  • status_med_12px.png Update of the EBUCorePlus - v2.0

  • status_med_12px.png Development of AI models for analysing editorial content for VeraAI

2023

  • status_done_12px.png Development of datasets to evaluate facial recognition systems: the biggest for AV content!

  • status_done_12px.png Publication of the first release of the EBUCorePlus : the EBU ontology for media

  • status_done_12px.png DataTech Seminar 2023

  • status_done_12px.png ETC/SMPTE task force on AI:  engineering report AI for Media  

  • status_med_12px.png Development of the cloud-hosted AIM platform to evaluate, expose, and exchange AI applications

  • status_done_12px.png Development of an open-source Facial Recognition Framework for Video 

  • status_done_12px.png Knowledge sharing on metadata/data/AI technology 

  • status_done_12px.png Maintenance of  the MCMA libraries  and MAM

  • status_done_12px.png Advanced Studies on Machine Learning for Members 

2022

  • status_done_12px.png Open source demo MAM based on serverless MCMA framework (Q2 2022) 

  • status_done_12px.png MDN Workshop 2022 ( Q2 2022)

  • status_done_12px.png Fake News detector for English text: API open to Members (Q2 2022)

  • status_done_12px.png EBUCore+ Demonstrator Kit: a cloud-hosted demo (Q3 2022)

  • status_done_12px.png Cloud-hosted Metadata Exchange Platform for archive (Q3 2022)

  • status_done_12px.png Knowledge Sharing on MetaData and AI (Q4 2022) 

  • status_done_12px.png Organisation of DataTech Seminar 2023

  • status_med_12px.png Development of an open-source Facial Recognition Framework for Video (Q1 2023)