Publications

2026

Jenny Kunz. 2026. Preferences for Idiomatic Language are Acquired Slowly – and Forgotten Quickly: A Case Study on Swedish. arXiv:2602.03484.
Jenny Kunz. 2026. A Diagnostic Benchmark for Sweden-Related Factual Knowledge. arXiv:2510.21360.

2025

Kevin Glocker, Kätriin Kukk, Romina Oji, Marcel Bollmann, Marco Kuhlmann, and Jenny Kunz. 2025. Grow Up and Merge: Scaling Strategies for Efficient Language Adaptation. arXiv:2512.10772.
Jenny Kunz, Iben Nyholm Debess, and Annika Simonsen. 2025. Family Matters: Language Transfer and Merging for Adapting Small LLMs to Faroese. arXiv:2510.00810.
Frank Drewes, Marco Kuhlmann, and Olle Torstensson. 2025. Dynamically Weighted Tree Transducers. In Giuseppa Castiglione and Sabrina Mantaci, editors, Implementation and Application of Automata, pages 115–128, Cham. Springer Nature Switzerland.
Denitsa Saynova, Lovisa Hagström, Moa Johansson, Richard Johansson, and Marco Kuhlmann. 2025. Fact Recall, Heuristics or Pure Guesswork? Precise Interpretations of Language Models for Fact Completion. In Wanxiang Che, Joyce Nabende, Ekaterina Shutova, and Mohammad Taher Pilehvar, editors, Findings of the Association for Computational Linguistics: ACL 2025, pages 18322–18349, Vienna, Austria. Association for Computational Linguistics.
Julian Schlenker, Jenny Kunz, Tatiana Anikina, Günter Neumann, and Simon Ostermann. 2025. Only for the Unseen Languages, Say the Llamas: On the Efficacy of Language Adapters for Cross-lingual Transfer in English-centric LLMs. In Jin Zhao, Mingyang Wang, and Zhu Liu, editors, Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 4: Student Research Workshop), pages 849–871, Vienna, Austria. Association for Computational Linguistics.
Jenny Kunz. 2025. Train More Parameters But Mind Their Placement: Insights into Language Adaptation with PEFT. In Proceedings of the Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies (NoDaLiDa/Baltic-HLT 2025), Tallinn, Estonia.
Romina Oji and Jenny Kunz. 2025. How to Tune a Multilingual Encoder Model for Germanic Languages: A Study of PEFT, Full Fine-Tuning, and Language Adapters. In Proceedings of the Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies (NoDaLiDa/Baltic-HLT 2025), Tallinn, Estonia.
Kätriin Kukk, Danila Petrelli, Judit Casademont, Eric J. W. Orlowski, Michał Dzieliński, and Maria Jacobson. 2025. BiaSWE: An Expert Annotated Dataset for Misogyny Detection in Swedish. In Proceedings of the Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies (NoDaLiDa/Baltic-HLT 2025), Tallinn, Estonia.
Olle Torstensson and Oskar Holmström. 2025. A Grammar-Based Method for Instilling Empirical Dependency Structure in LLMs. In Trond Trosterud, Linda Wiechetek, and Flammie Pirinen, editors, Proceedings of the 9th Workshop on Constraint Grammar and Finite State NLP, pages 45–49, Tallinn, Estonia. University of Tartu Library.
Ehsan Doostmohammadi and Marco Kuhlmann. 2025. Studying the Role of Input-Neighbor Overlap in Retrieval-Augmented Language Models Training Efficiency. arXiv:2505.14309.

2024

Jenny Kunz and Marco Kuhlmann. 2024. Properties and Challenges of LLM-Generated Explanations. In Su Lin Blodgett, Amanda Cercas Curry, Sunipa Dev, Michael Madaio, Ani Nenkova, Diyi Yang, and Ziang Xiao, editors, Proceedings of the Third Workshop on Bridging Human–Computer Interaction and Natural Language Processing, pages 13–27, Mexico City, Mexico. Association for Computational Linguistics.
Niklas Wretblad, Oskar Holmström, Erik Larsson, Axel Wiksäter, Hjalmar Öhman, Oscar Söderlund, Ture Pontén, Martin Forsberg, Martin Sörme, and Fredrik Heintz. 2024. Synthetic SQL Column Descriptions and Their Impact on Text-to-SQL Performance. In NeurIPS 2024 Third Table Representation Learning Workshop.
Niklas Wretblad, Fredrik Riseby, Rahul Biswas, Amin Ahmadi, and Oskar Holmström. 2024. Understanding the Effects of Noise in Text-to-SQL: An Examination of the BIRD-Bench Benchmark. In Lun-Wei Ku, Andre Martins, and Vivek Srikumar, editors, Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pages 356–369, Bangkok, Thailand. Association for Computational Linguistics.
Ehsan Doostmohammadi, Oskar Holmström, and Marco Kuhlmann. 2024. How Reliable Are Automatic Evaluation Methods for Instruction-Tuned LLMs?. In Yaser Al-Onaizan, Mohit Bansal, and Yun-Nung Chen, editors, Findings of the Association for Computational Linguistics: EMNLP 2024, pages 6321–6336, Miami, Florida, USA. Association for Computational Linguistics.
Kushal Tatariya, Artur Kulmizev, Wessel Poelman, Esther Ploeger, Marcel Bollmann, Johannes Bjerva, Jiaming Luo, Heather Lent, and Miryam de Lhoneux. 2024. How Good is Your Wikipedia?. arXiv:2411.05527.
Heather Lent, Kushal Tatariya, Raj Dabre, Yiyi Chen, Marcell Fekete, Esther Ploeger, Li Zhou, Ruth-Ann Armstrong, Abee Eijansantos, Catriona Malau, Hans Erik Heje, Ernests Lavrinovics, Diptesh Kanojia, Paul Belony, Marcel Bollmann, Loïc Grobol, Miryam de Lhoneux, Daniel Hershcovich, Michel DeGraff, Anders Søgaard, and Johannes Bjerva. 2024. CreoleVal: Multilingual Multitask Benchmarks for Creoles. Transactions of the Association for Computational Linguistics, 12:950–978.
Marc Braun and Jenny Kunz. 2024. A Hypothesis-Driven Framework for the Analysis of Self-Rationalising Models. In Neele Falk, Sara Papi, and Mike Zhang, editors, Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics: Student Research Workshop, pages 148–161, St. Julian's, Malta. Association for Computational Linguistics.
Jenny Kunz and Oskar Holmström. 2024. The Impact of Language Adapters in Cross-Lingual Transfer for NLU. In Raúl Vázquez, Timothee Mickus, Jörg Tiedemann, Ivan Vulić, and Ahmet Üstün, editors, Proceedings of the 1st Workshop on Modular and Open Multilingual NLP (MOOMIN 2024), pages 24–43, St Julians, Malta. Association for Computational Linguistics.
Elliot Gestrin, Marco Kuhlmann, and Jendrik Seipp. 2024. NL2Plan: Robust LLM-Driven Planning from Minimal Text Descriptions. In ICAPS 2024 Workshop on Human-Aware and Explainable Planning (HAXP), Banff, Alberta, Canada.

2023

Marcel Bollmann, Nathan Schneider, Arne Köhn, and Matt Post. 2023. Two Decades of the ACL Anthology: Development, Impact, and Open Challenges. In Liling Tan, Dmitrijs Milajevs, Geeticka Chauhan, Jeremy Gwinnup, and Elijah Rippeth, editors, Proceedings of the 3rd Workshop for Natural Language Processing Open Source Software (NLP-OSS 2023), pages 83–94, Singapore. Association for Computational Linguistics.
Olle Torstensson and Tjark Weber. 2023. Hammering Floating-Point Arithmetic. In Uli Sattler and Martin Suda, editors, Frontiers of Combining Systems, pages 217–235, Cham. Springer Nature Switzerland.
Ehsan Doostmohammadi, Tobias Norlund, Marco Kuhlmann, and Richard Johansson. 2023. Surface-Based Retrieval Reduces Perplexity of Retrieval-Augmented Language Models. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pages 521–529, Toronto, Canada. Association for Computational Linguistics.
Oskar Holmström and Ehsan Doostmohammadi. 2023. Making Instruction Finetuning Accessible to Non-English Languages: A Case Study on Swedish Models. In Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa), pages 634–642, Tórshavn, Faroe Islands. University of Tartu Library.
Oskar Holmström, Jenny Kunz, and Marco Kuhlmann. 2023. Bridging the Resource Gap: Exploring the Efficacy of English and Multilingual LLMs for Swedish. In Proceedings of the Second Workshop on Resources and Representations for Under-Resourced Languages and Domains (RESOURCEFUL-2023), pages 92–110, Tórshavn, the Faroe Islands. Association for Computational Linguistics.
Tobias Norlund, Ehsan Doostmohammadi, Richard Johansson, and Marco Kuhlmann. 2023. On the Generalization Ability of Retrieval-Enhanced Transformers. In Findings of the Association for Computational Linguistics: EACL 2023, pages 1485–1493, Dubrovnik, Croatia. Association for Computational Linguistics.
Emanuel Sanchez Aimar, Arvi Jonnarth, Michael Felsberg, and Marco Kuhlmann. 2023. Balanced Product of Calibrated Experts for Long-Tailed Recognition. In 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 19967–19977.
Emanuel Sanchez Aimar, Hannah Helgesen, Michael Felsberg, and Marco Kuhlmann. 2023. Align, Distill, and Augment Everything All at Once for Imbalanced Semi-Supervised Learning. arXiv:2306.04621.

2022

Jenny Kunz, Martin Jirenius, Oskar Holmström, and Marco Kuhlmann. 2022. Human Ratings Do Not Reflect Downstream Utility: A Study of Free-Text Explanations for Model Predictions. In Proceedings of the Fifth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, pages 164–177, Abu Dhabi, United Arab Emirates (Hybrid). Association for Computational Linguistics.
Jenny Kunz and Marco Kuhlmann. 2022. Where Does Linguistic Information Emerge in Neural Language Models? Measuring Gains and Contributions across Layers. In Proceedings of the 29th International Conference on Computational Linguistics, pages 4664–4676, Gyeongju, Republic of Korea. International Committee on Computational Linguistics.
Ehsan Doostmohammadi and Marco Kuhlmann. 2022. On the Effects of Video Grounding on Language Models. In Proceedings of the First Workshop on Performance and Interpretability Evaluations of Multimodal, Multipurpose, Massive-Scale Models, pages 1–6, Virtual. International Conference on Computational Linguistics.
Lena Katharina Schiffer, Marco Kuhlmann, and Giorgio Satta. 2022. Tractable Parsing for CCGs of Bounded Degree. Computational Linguistics, 48(3):593–633.
Marco Kuhlmann, Lena Katharina Schiffer, and Andreas Maletti. 2022. The Tree-Generative Capacity of Combinatory Categorial Grammars. Journal of Computer and System Sciences, 124(March):214–233.
Joakim Nivre, Ali Basirat, Luise Dürlich, and Adam Moss. 2022. Nucleus Composition in Transition-based Dependency Parsing. Computational Linguistics, 48(4):849–886.
Michael Boyden, Ali Basirat, and Karl Berglund. 2022. Digital Conceptual History and the Emergence of a Globalized Climate Imaginary. Contributions to the History of Concepts, 17(2):95–122.

2021

Jenny Kunz and Marco Kuhlmann. 2021. Test Harder than You Train: Probing with Extrapolation Splits. In Proceedings of the Fourth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, pages 15–25, Punta Cana, Dominican Republic. Association for Computational Linguistics.
Ali Basirat, Marc Allassonnière-Tang, and Aleksandrs Berdicevskis. 2021. An empirical study on the contribution of formal and semantic features to the grammatical gender of nouns. Linguistics Vanguard, 7(1):20200048.

2020

Jenny Kunz and Marco Kuhlmann. 2020. Classifier Probes May Just Learn from Linear Context Features. In Proceedings of the 28th International Conference on Computational Linguistics, pages 5136–5146, Barcelona, Spain (Online). International Committee on Computational Linguistics.
Robin Kurtz, Stephan Oepen, and Marco Kuhlmann. 2020. End-to-End Negation Resolution as Graph Parsing. In Proceedings of the 16th International Conference on Parsing Technologies and the IWPT 2020 Shared Task on Parsing into Enhanced Universal Dependencies, pages 14–24, Online. Association for Computational Linguistics.
Riley Capshaw, Marco Kuhlmann, and Eva Blomqvist. 2020. Probing a Semantic Dependency Parser for Translational Relation Embeddings. In Proceedings of the Workshop on Deep Learning for Knowledge Graphs (DL4KG2020) Co-located with the 17th Extended Semantic Web Conference 2020 (ESWC 2020), Heraklion, Greece – moved online.
Fredrik Sand Aronsson, Marco Kuhlmann, Vesna Jelić, and Per Östberg. 2020. Is Cognitive Impairment Associated with Reduced Syntactic Complexity in Writing? Evidence from Automated Text Analysis. Aphasiology, 35(7):900–913.

2019

Stephan Oepen, Omri Abend, Jan Hajic, Daniel Hershcovich, Marco Kuhlmann, Tim O'Gorman, and Nianwen Xue, editors. 2019. Proceedings of the Shared Task on Cross-Framework Meaning Representation Parsing at the 2019 Conference on Natural Language Learning. Association for Computational Linguistics, Hong Kong.
Stephan Oepen, Omri Abend, Jan Hajic, Daniel Hershcovich, Marco Kuhlmann, Tim O'Gorman, Nianwen Xue, Jayeol Chun, Milan Straka, and Zdenka Uresova. 2019. MRP 2019: Cross-Framework Meaning Representation Parsing. In Proceedings of the Shared Task on Cross-Framework Meaning Representation Parsing at the 2019 Conference on Natural Language Learning, pages 1–27, Hong Kong. Association for Computational Linguistics.
Marco Kuhlmann, Andreas Maletti, and Lena Katharina Schiffer. 2019. The Tree-Generative Capacity of Combinatory Categorial Grammars. In Arkadev Chattopadhyay and Paul Gastin, editors, 39th IARCS Annual Conference on Foundations of Software Technology and Theoretical Computer Science (FSTTCS 2019), volume 150, pages 44:1–44:14, Dagstuhl, Germany. Schloss Dagstuhl–Leibniz-Zentrum fuer Informatik.
Robin Kurtz and Marco Kuhlmann. 2019. The Interplay Between Loss Functions and Structural Constraints in Dependency Parsing. Northern European Journal of Language Technology, 6: Special Issue of Selected Contributions from the Seventh Swedish Language Technology Conference (SLTC 2018):43–66.
Robin Kurtz, Daniel Roxbo, and Marco Kuhlmann. 2019. Improving Semantic Dependency Parsing with Syntactic Features. In Proceedings of the First NLPL Workshop on Deep Learning for Natural Language Processing, pages 12–21, Turku, Finland. Linköping University Electronic Press.
Jenny Kunz and Christian Hardmeier. 2019. Entity Decisions in Neural Language Modelling: Approaches and Problems. In Proceedings of the Second Workshop on Computational Models of Reference, Anaphora and Coreference, pages 15–19, Minneapolis, USA. Association for Computational Linguistics.

2018

Marco Kuhlmann, Giorgio Satta, and Peter Jonsson. 2018. On the Complexity of CCG Parsing. Computational Linguistics, 44(3):447–482.

2017

Robin Kurtz and Marco Kuhlmann. 2017. Exploiting Structure in Parsing to 1-Endpoint-Crossing Graphs. In Proceedings of the 15th International Conference on Parsing Technologies, pages 78–87, Pisa, Italy. Association for Computational Linguistics.
Marco Kuhlmann and Tatjana Scheffler, editors. 2017. Proceedings of the 13th International Workshop on Tree Adjoining Grammars and Related Formalisms. Association for Computational Linguistics, Umeå, Sweden.
Marco Kuhlmann and Christian Wurm. 2017. Finite-State Methods and Mathematics of Language, Introduction to the Special Issue. Journal of Language Modelling, 5(1):1–2.

2016

Per Fallgren, Jesper Segeblad, and Marco Kuhlmann. 2016. Towards a Standard Dataset of Swedish Word Vectors. In Proceedings of the Sixth Swedish Language Technology Conference (SLTC), Umeå, Sweden.
Marco Kuhlmann and Stephan Oepen. 2016. Towards a Catalogue of Linguistic Graph Banks. Computational Linguistics, 42(4):819–827.
Stephan Oepen, Marco Kuhlmann, Yusuke Miyao, Daniel Zeman, Silvie Cinková, Dan Flickinger, Jan Hajič, Angelina Ivanova, and Zdeňka Urešová. 2016. Towards Comparability of Linguistic Graph Banks for Semantic Parsing. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 3991–3995, Portorož, Slovenia. European Language Resources Association (ELRA).

2015

Marco Kuhlmann and Peter Jonsson. 2015. Parsing to Noncrossing Dependency Graphs. Transactions of the Association for Computational Linguistics, 3:559–570.
Stephan Oepen, Marco Kuhlmann, Yusuke Miyao, Daniel Zeman, Silvie Cinková, Dan Flickinger, Jan Hajič, and Zdeňka Urešová. 2015. SemEval 2015 Task 18: Broad-Coverage Semantic Dependency Parsing. In Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015), pages 915–926, Denver, Colorado. Association for Computational Linguistics.
Marco Kuhlmann, Makoto Kanazawa, and Gregory M. Kobele, editors. 2015. Proceedings of the 14th Meeting on the Mathematics of Language (MoL 2015). Association for Computational Linguistics, Chicago, USA.
Marco Kuhlmann, Alexander Koller, and Giorgio Satta. 2015. Lexicalization and Generative Power in CCG. Computational Linguistics, 41(2):187–219.
Frank Drewes, Kevin Knight, and Marco Kuhlmann. 2015. Formal Models of Graph Transformation in Natural Language Processing (Dagstuhl Seminar 15122). Dagstuhl Reports, 5(3):143–161.

2014

Stephan Oepen, Marco Kuhlmann, Yusuke Miyao, Daniel Zeman, Dan Flickinger, Jan Hajič, Angelina Ivanova, and Yi Zhang. 2014. SemEval 2014 Task 8: Broad-Coverage Semantic Dependency Parsing. In Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014), pages 63–72, Dublin, Ireland. Association for Computational Linguistics.
Marco Kuhlmann. 2014. Linköping: Cubic-Time Graph Parsing with a Simple Scoring Scheme. In Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014), pages 395–399, Dublin, Ireland. Association for Computational Linguistics.
Marco Kuhlmann and Giorgio Satta. 2014. A New Parsing Algorithm for Combinatory Categorial Grammar. Transactions of the Association for Computational Linguistics, 2:405–418.