Publications
2024
- Kushal Tatariya, Artur Kulmizev, Wessel Poelman, Esther Ploeger, Marcel Bollmann, Johannes Bjerva, Jiaming Luo, Heather Lent, and Miryam de Lhoneux. 2024. How Good is Your Wikipedia?. arXiv:2411.05527.
- Heather Lent, Kushal Tatariya, Raj Dabre, Yiyi Chen, Marcell Fekete, Esther Ploeger, Li Zhou, Ruth-Ann Armstrong, Abee Eijansantos, Catriona Malau, Hans Erik Heje, Ernests Lavrinovics, Diptesh Kanojia, Paul Belony, Marcel Bollmann, Loïc Grobol, Miryam de Lhoneux, Daniel Hershcovich, Michel DeGraff, Anders Søgaard, and Johannes Bjerva. 2024. CreoleVal: Multilingual Multitask Benchmarks for Creoles. Transactions of the Association for Computational Linguistics, 12:950–978.
- Ehsan Doostmohammadi, Oskar Holmström, and Marco Kuhlmann. 2024. How Reliable Are Automatic Evaluation Methods for Instruction-Tuned LLMs?. arXiv:2402.10770.
- Niklas Wretblad, Fredrik Gordh Riseby, Rahul Biswas, Amin Ahmadi, and Oskar Holmström. 2024. Understanding the Effects of Noise in Text-to-SQL: An Examination of the BIRD-Bench Benchmark. arXiv:2402.12243.
- Jenny Kunz and Oskar Holmström. 2024. The Impact of Language Adapters in Cross-Lingual Transfer for NLU. In Raúl Vázquez, Timothee Mickus, Jörg Tiedemann, Ivan Vulić, and Ahmet Üstün, editors, Proceedings of the 1st Workshop on Modular and Open Multilingual NLP (MOOMIN 2024), pages 24–43, St Julians, Malta. Association for Computational Linguistics.
2023
- Marcel Bollmann, Nathan Schneider, Arne Köhn, and Matt Post. 2023. Two Decades of the ACL Anthology: Development, Impact, and Open Challenges. In Liling Tan, Dmitrijs Milajevs, Geeticka Chauhan, Jeremy Gwinnup, and Elijah Rippeth, editors, Proceedings of the 3rd Workshop for Natural Language Processing Open Source Software (NLP-OSS 2023), pages 83–94, Singapore. Association for Computational Linguistics.
- Olle Torstensson and Tjark Weber. 2023. Hammering Floating-Point Arithmetic. In Uli Sattler and Martin Suda, editors, Frontiers of Combining Systems, pages 217–235, Cham. Springer Nature Switzerland.
- Ehsan Doostmohammadi, Tobias Norlund, Marco Kuhlmann, and Richard Johansson. 2023. Surface-Based Retrieval Reduces Perplexity of Retrieval-Augmented Language Models. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pages 521–529, Toronto, Canada. Association for Computational Linguistics.
- Oskar Holmström and Ehsan Doostmohammadi. 2023. Making Instruction Finetuning Accessible to Non-English Languages: A Case Study on Swedish Models. In Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa), pages 634–642, Tórshavn, Faroe Islands. University of Tartu Library.
- Oskar Holmström, Jenny Kunz, and Marco Kuhlmann. 2023. Bridging the Resource Gap: Exploring the Efficacy of English and Multilingual LLMs for Swedish. In Proceedings of the Second Workshop on Resources and Representations for Under-Resourced Languages and Domains (RESOURCEFUL-2023), pages 92–110, Tórshavn, the Faroe Islands. Association for Computational Linguistics.
- Tobias Norlund, Ehsan Doostmohammadi, Richard Johansson, and Marco Kuhlmann. 2023. On the Generalization Ability of Retrieval-Enhanced Transformers. In Findings of the Association for Computational Linguistics: EACL 2023, pages 1485–1493, Dubrovnik, Croatia. Association for Computational Linguistics.
- Emanuel Sanchez Aimar, Arvi Jonnarth, Michael Felsberg, and Marco Kuhlmann. 2023. Balanced Product of Calibrated Experts for Long-Tailed Recognition. In 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 19967–19977.
- Emanuel Sanchez Aimar, Hannah Helgesen, Michael Felsberg, and Marco Kuhlmann. 2023. Align, Distill, and Augment Everything All at Once for Imbalanced Semi-Supervised Learning. arXiv:2306.04621.
2022
- Jenny Kunz, Martin Jirenius, Oskar Holmström, and Marco Kuhlmann. 2022. Human Ratings Do Not Reflect Downstream Utility: A Study of Free-Text Explanations for Model Predictions. In Proceedings of the Fifth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, pages 164–177, Abu Dhabi, United Arab Emirates (Hybrid). Association for Computational Linguistics.
- Jenny Kunz and Marco Kuhlmann. 2022. Where Does Linguistic Information Emerge in Neural Language Models? Measuring Gains and Contributions across Layers. In Proceedings of the 29th International Conference on Computational Linguistics, pages 4664–4676, Gyeongju, Republic of Korea. International Committee on Computational Linguistics.
- Ehsan Doostmohammadi and Marco Kuhlmann. 2022. On the Effects of Video Grounding on Language Models. In Proceedings of the First Workshop on Performance and Interpretability Evaluations of Multimodal, Multipurpose, Massive-Scale Models, pages 1–6, Virtual. International Conference on Computational Linguistics.
- Lena Katharina Schiffer, Marco Kuhlmann, and Giorgio Satta. 2022. Tractable Parsing for CCGs of Bounded Degree. Computational Linguistics, 48(3):593–633.
- Marco Kuhlmann, Lena Katharina Schiffer, and Andreas Maletti. 2022. The Tree-Generative Capacity of Combinatory Categorial Grammars. Journal of Computer and System Sciences, 124(March):214–233.
- Joakim Nivre, Ali Basirat, Luise Dürlich, and Adam Moss. 2022. Nucleus Composition in Transition-based Dependency Parsing. Computational Linguistics, 48(4):849–886.
- Michael Boyden, Ali Basirat, and Karl Berglund. 2022. Digital Conceptual History and the Emergence of a Globalized Climate Imaginary. Contributions to the History of Concepts, 17(2):95–122.
2021
- Jenny Kunz and Marco Kuhlmann. 2021. Test Harder than You Train: Probing with Extrapolation Splits. In Proceedings of the Fourth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, pages 15–25, Punta Cana, Dominican Republic. Association for Computational Linguistics.
- Ali Basirat, Marc Allassonnière-Tang, and Aleksandrs Berdicevskis. 2021. An empirical study on the contribution of formal and semantic features to the grammatical gender of nouns. Linguistics Vanguard, 7(1):20200048.
2020
- Jenny Kunz and Marco Kuhlmann. 2020. Classifier Probes May Just Learn from Linear Context Features. In Proceedings of the 28th International Conference on Computational Linguistics, pages 5136–5146, Barcelona, Spain (Online). International Committee on Computational Linguistics.
- Robin Kurtz, Stephan Oepen, and Marco Kuhlmann. 2020. End-to-End Negation Resolution as Graph Parsing. In Proceedings of the 16th International Conference on Parsing Technologies and the IWPT 2020 Shared Task on Parsing into Enhanced Universal Dependencies, pages 14–24, Online. Association for Computational Linguistics.
- Riley Capshaw, Marco Kuhlmann, and Eva Blomqvist. 2020. Probing a Semantic Dependency Parser for Translational Relation Embeddings. In Proceedings of the Workshop on Deep Learning for Knowledge Graphs (DL4KG2020) Co-located with the 17th Extended Semantic Web Conference 2020 (ESWC 2020), Heraklion, Greece – moved online.
- Fredrik Sand Aronsson, Marco Kuhlmann, Vesna Jelić, and Per Östberg. 2020. Is Cognitive Impairment Associated with Reduced Syntactic Complexity in Writing? Evidence from Automated Text Analysis. Aphasiology, 35(7):900–913.
2019
- Stephan Oepen, Omri Abend, Jan Hajic, Daniel Hershcovich, Marco Kuhlmann, Tim O'Gorman, and Nianwen Xue, editors. 2019. Proceedings of the Shared Task on Cross-Framework Meaning Representation Parsing at the 2019 Conference on Natural Language Learning. Association for Computational Linguistics, Hong Kong.
- Stephan Oepen, Omri Abend, Jan Hajic, Daniel Hershcovich, Marco Kuhlmann, Tim O'Gorman, Nianwen Xue, Jayeol Chun, Milan Straka, and Zdenka Uresova. 2019. MRP 2019: Cross-Framework Meaning Representation Parsing. In Proceedings of the Shared Task on Cross-Framework Meaning Representation Parsing at the 2019 Conference on Natural Language Learning, pages 1–27, Hong Kong. Association for Computational Linguistics.
- Marco Kuhlmann, Andreas Maletti, and Lena Katharina Schiffer. 2019. The Tree-Generative Capacity of Combinatory Categorial Grammars. In Arkadev Chattopadhyay and Paul Gastin, editors, 39th IARCS Annual Conference on Foundations of Software Technology and Theoretical Computer Science (FSTTCS 2019), volume 150, pages 44:1–44:14, Dagstuhl, Germany. Schloss Dagstuhl–Leibniz-Zentrum fuer Informatik.
- Robin Kurtz and Marco Kuhlmann. 2019. The Interplay Between Loss Functions and Structural Constraints in Dependency Parsing. Northern European Journal of Language Technology, 6: Special Issue of Selected Contributions from the Seventh Swedish Language Technology Conference (SLTC 2018):43–66.
- Robin Kurtz, Daniel Roxbo, and Marco Kuhlmann. 2019. Improving Semantic Dependency Parsing with Syntactic Features. In Proceedings of the First NLPL Workshop on Deep Learning for Natural Language Processing, pages 12–21, Turku, Finland. Linköping University Electronic Press.
- Jenny Kunz and Christian Hardmeier. 2019. Entity Decisions in Neural Language Modelling: Approaches and Problems. In Proceedings of the Second Workshop on Computational Models of Reference, Anaphora and Coreference, pages 15–19, Minneapolis, USA. Association for Computational Linguistics.
2018
- Marco Kuhlmann, Giorgio Satta, and Peter Jonsson. 2018. On the Complexity of CCG Parsing. Computational Linguistics, 44(3):447–482.
2017
- Robin Kurtz and Marco Kuhlmann. 2017. Exploiting Structure in Parsing to 1-Endpoint-Crossing Graphs. In Proceedings of the 15th International Conference on Parsing Technologies, pages 78–87, Pisa, Italy. Association for Computational Linguistics.
- Marco Kuhlmann and Tatjana Scheffler, editors. 2017. Proceedings of the 13th International Workshop on Tree Adjoining Grammars and Related Formalisms. Association for Computational Linguistics, Umeå, Sweden.
- Marco Kuhlmann and Christian Wurm. 2017. Finite-State Methods and Mathematics of Language, Introduction to the Special Issue. Journal of Language Modelling, 5(1):1–2.
2016
- Per Fallgren, Jesper Segeblad, and Marco Kuhlmann. 2016. Towards a Standard Dataset of Swedish Word Vectors. In Proceedings of the Sixth Swedish Language Technology Conference (SLTC), Umeå, Sweden.
- Marco Kuhlmann and Stephan Oepen. 2016. Towards a Catalogue of Linguistic Graph Banks. Computational Linguistics, 42(4):819–827.
- Stephan Oepen, Marco Kuhlmann, Yusuke Miyao, Daniel Zeman, Silvie Cinková, Dan Flickinger, Jan Hajič, Angelina Ivanova, and Zdeňka Urešová. 2016. Towards Comparability of Linguistic Graph Banks for Semantic Parsing. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 3991–3995, Portorož, Slovenia. European Language Resources Association (ELRA).
2015
- Marco Kuhlmann and Peter Jonsson. 2015. Parsing to Noncrossing Dependency Graphs. Transactions of the Association for Computational Linguistics, 3:559–570.
- Stephan Oepen, Marco Kuhlmann, Yusuke Miyao, Daniel Zeman, Silvie Cinková, Dan Flickinger, Jan Hajič, and Zdeňka Urešová. 2015. SemEval 2015 Task 18: Broad-Coverage Semantic Dependency Parsing. In Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015), pages 915–926, Denver, Colorado. Association for Computational Linguistics.
- Marco Kuhlmann, Makoto Kanazawa, and Gregory M. Kobele, editors. 2015. Proceedings of the 14th Meeting on the Mathematics of Language (MoL 2015). Association for Computational Linguistics, Chicago, USA.
- Marco Kuhlmann, Alexander Koller, and Giorgio Satta. 2015. Lexicalization and Generative Power in CCG. Computational Linguistics, 41(2):187–219.
- Frank Drewes, Kevin Knight, and Marco Kuhlmann. 2015. Formal Models of Graph Transformation in Natural Language Processing (Dagstuhl Seminar 15122). Dagstuhl Reports, 5(3):143–161.
2014
- Stephan Oepen, Marco Kuhlmann, Yusuke Miyao, Daniel Zeman, Dan Flickinger, Jan Hajič, Angelina Ivanova, and Yi Zhang. 2014. SemEval 2014 Task 8: Broad-Coverage Semantic Dependency Parsing. In Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014), pages 63–72, Dublin, Ireland. Association for Computational Linguistics.
- Marco Kuhlmann. 2014. Linköping: Cubic-Time Graph Parsing with a Simple Scoring Scheme. In Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014), pages 395–399, Dublin, Ireland. Association for Computational Linguistics.
- Marco Kuhlmann and Giorgio Satta. 2014. A New Parsing Algorithm for Combinatory Categorial Grammar. Transactions of the Association for Computational Linguistics, 2:405–418.