Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
SemEval-2025 Task 9: The Food Hazard Detection Challenge
Stockholm University, Faculty of Social Sciences, Department of Computer and Systems Sciences.ORCID iD: 0000-0002-7938-2747
Stockholm University, Faculty of Social Sciences, Department of Computer and Systems Sciences. Athens University of Economics and Business, Greece; Archimedes, Athena Research Center, Greece.ORCID iD: 0000-0001-9188-7425
Stockholm University, Faculty of Social Sciences, Department of Computer and Systems Sciences.ORCID iD: 0000-0001-9731-1048
Stockholm University, Faculty of Social Sciences, Department of Computer and Systems Sciences.ORCID iD: 0000-0001-7713-1381
Show others and affiliations
Number of Authors: 52025 (English)In: Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025) / [ed] Sara Rosenthal; Aiala Rosá; Debanjan Ghosh; Marcos Zampieri, Association for Computational Linguistics , 2025, p. 2523-2534Conference paper, Published paper (Refereed)
Abstract [en]

In this challenge, we explored text-based food hazard prediction with long tail distributed classes. The task was divided into two subtasks: (1) predicting whether a web text implies one of ten food-hazard categories and identifying the associated food category, and (2) providing a more fine-grained classification by assigning a specific label to both the hazard and the product. Our findings highlight that large language model-generated synthetic data can be highly effective for oversampling long-tail distributions. Furthermore, we find that fine-tuned encoder-only, encoder-decoder, and decoder-only systems achieve comparable maximum performance across both subtasks. During this challenge, we gradually released (under CC BY-NC-SA 4.0) a novel set of 6,644 manually labeled food-incident reports.

Place, publisher, year, edition, pages
Association for Computational Linguistics , 2025. p. 2523-2534
National Category
Computer Sciences
Research subject
Computer and Systems Sciences
Identifiers
URN: urn:nbn:se:su:diva-247395ISBN: 979-8-89176-273-2 (electronic)OAI: oai:DiVA.org:su-247395DiVA, id: diva2:2000581
Conference
The 19th International Workshop on Semantic Evaluation, July 2025, Vienna, Austria.
Available from: 2025-09-24 Created: 2025-09-24 Last updated: 2025-09-24Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Länk till publikationen

Authority records

Randl, Korbinian RobertPavlopoulos, IoannisHenriksson, AronLindgren, Tony

Search in DiVA

By author/editor
Randl, Korbinian RobertPavlopoulos, IoannisHenriksson, AronLindgren, Tony
By organisation
Department of Computer and Systems Sciences
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 28 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf