dScience – Centre for Computational and Data Science la ut dette på nytt
Over the coming days, LTG will be presenting 8 fresh papers at the Nordic–Baltic NLP conference in Tallinn – NoDaLiDa/Baltic-HLT 2025. 🔥 Several of these represent collaborations with colleagues from UiB, NTNU, and the National Library. 🤝 📄 A Collection of Question Answering Datasets for Norwegian, by Mikhailov et al. 📄 The Impact of Copyrighted Material on Large Language Models: A Norwegian Perspective, by de la Rosa et al. 📄 Large Language Models for Small Languages: A Study of Continual Pretraining on Languages of Norway, by Samuel et al. 📄 Benchmarking Abstractive Summarisation: A Dataset of Human-authored Summaries of Norwegian News Articles, by Touileb et al. 📄 NorEventGen: generative event extraction from Norwegian news, by You et al. 📄 Mixed Feelings: Cross-Domain Sentiment Classification of Patient Feedback, by Rønningstad et al. 📄 Multi-label Scandinavian Language Identification (SLIDE), by Fedorova et al. 📄 Interactive maps for corpus-based dialectology, by Scherrer et al. All papers are available in the proceedings: https://lnkd.in/e7TSi_Nt David Samuel, Vladislav Mikhailov, Huiling You, Samia Touileb, Marie Ingeborg Kroka, Tita Enstad, Petter Mæhlum, Lilja Charlotte Storset, Egil Rønningstad, Victoria Ovedie Chruickshank Langø, Javier de la Rosa, Lemei Zhang, Aslak Sira Myhre, Jon Atle Gulla, Freddy Wetjen, Peng Liu, Rolv-Arild Braaten, Magnus Breder Birkenes, Svein Arne Brygfjeld, Wilfred Østgulen, Lucas C., Olli Kuparinen, Stephan Oepen, Yves Scherrer, Andrey Kutuzov, Lilja Øvrelid, Erik Velldal