Efficient and precise annotation of local structures in data

Event date: 2025-04-24

Event location:

Welcome to this week’s Learning Machines seminar.

This seminar is a collaboration between RISE and Climate AI Nordics – climateainordics.com.

Title: Efficient and precise annotation of local structures in data

Speaker: John Martinsson, RISE and Lund University

Abstract: Machine learning models now help scientists analyze vast datasets across every branch of science. These models typically improve with more data and larger architectures, mainly through supervised learning. Both training and evaluation therefore rely on labeled datasets. A main challenge is scaling the data labeling effort to the volumes required, because it is costly and label quality can vary. Methods that deliver inexpensive yet accurate labels are therefore essential.

This talk examines how to lower annotation cost and increase label quality when labeling local structures in data—for example, a local structure can be a sound event in an audio recording. By detecting the boundaries of such structures automatically, we let annotators focus on supplying concise textual descriptions for the content within those boundaries. In this setting we analyze a widely used labeling method for audio where fixed and equal length audio segments are labeled with presence or absence of an event class. We benchmark it against an oracle method that defines an upper bound, and propose adaptive labeling techniques that achieve higher‑quality labels for the studied datasets at a lower cost.

About the speaker: John Martinsson is a researcher specializing in machine listening-using machine learning to analyze audio data-and its application to bioacoustics and biodiversity monitoring. A key focus of his research is on streamlining the often labor-intensive process of labeling bioacoustic data, and on creating robust models that perform effectively even with limited labeled data. More information on his website: johnmartinsson.org.

Location: This is an online seminar. Connect using Zoom.

Date: 2025-04-24 15:00

Upcoming seminars:

  • 2025-05-08: Ghjulia Sialelli, ETH Zurich
  • 2025-05-15: Peter Dueben, European Centre for Medium-Range Weather Forecasts
  • All seminars are 15:00 CET.

More information and coming seminars: https://ri.se/lm-sem

– The Learning Machines Team

Published:

Categories: