6 papers to be presented at LREC 2022
Six papers from DFKI-NLP authors have been accepted for publication at LREC 2022, the 13th Language Resources and Evaluation Conference. The conference is planned to be a hybrid meeting and will take place in Marseille, France, from June 20th through June 25th, 2022. The paper by Dehio et al. is on claim extraction and matching in COVID-19-related Legislation, the one by Raither et al. presents a novel corpus for German-language Adverse Drug Reaction (ADR) detection in patient-generated content. The paper by Gabryszak et al. also presents a corpus, in this case of German-language tweets annotated with their relevance for public transportation, and with sentiment towards aspects related to barrier-free travel. The fourth paper by Macketanz et al. presents a fine-grained machine translation test suite for the language pair German-English. The test suite is based on a number of linguistically motivated categories and phenomena and the semi-automatic evaluation is carried out with regular expressions. The fSeiffe et al.’s paper presents a work on how to model subjective text complexity, by constructing and analyzing a German text corpus labeled with expert and non-expert complexity ratings. The final paper by Calizzano et al. introduces a new dataset called WikinewsSum for English, German, French, Spanish, Portuguese, Polish, and Italian summarisation tailored for extended summaries of approx. 11 sentences, and compares three multilingual transformer models on the extractive summarisation task and three training scenarios on which we fine-tune mT5 to perform abstractive summarisation.