LiU NLP

LiU NLP group picture as of April 2024

We develop and analyse computational models of human language. Our work ranges from basic research on algorithms and machine learning to applied research in language technology and computational social science.

Research

Our current focus is on analysing and enhancing neural language models. Specifically, we are working on methods for improving model efficiency, trustworthiness, and usefulness for lesser-resourced languages. We also have a long-standing interest in work on the intersection of natural language processing and theoretical computer science.

We are participating in several national and international research collaborations, including the Wallenberg AI, Autonomous Systems and Software Program (WASP), the EU-funded project TrustLLM – Democratize Trustworthy and Efficient Large Language Model Technology for Europe, and the Swedish Excellence Centre for Computational Social Science (SweCSS).

Current Projects

Adapting Large Language Models for Swedish News Media

Newsrooms are rapidly adopting large language models (LLMs) into their operations, but stringent requirements on text quality risk hampering the technology’s usefulness. This project, a collaboration between Linköping University and Bonnier News, aims to address this challenge by developing methods for adapting LLMs to the domain of Swedish news media, focusing on the truthfulness and stylistics of the generated text. The methods developed in the project will be validated through real-world editorial applications and emphasise scalability and resource efficiency to guarantee industrial relevance.

Funded by WASP in collaboration with Bonnier News

Efficient Modularisation and Adaptation of Large Language Models

The latest generation of large language models (LLMs) is extremely resource-hungry, to the point where training and even deploying own models has become infeasible for most enterprises and public actors. We propose to tackle this problem by dissecting LLMs into modules that encapsulate functional capabilities such as languages and knowledge retrieval, and can be trained and updated independently of other model parts. We will develop a framework for the targeted adaptation of these modules and evaluate its effectiveness in settings where training data is relatively scarce, focusing on NLP for medium-resource languages such as Swedish.

Funded by WASP

TrustLLM: Democratize Trustworthy and Efficient Large Language Model Technology for Europe

TrustLLM is a European project involving 11 partner organizations. Its main objective is to develop open, trustworthy, and sustainable large language models (LLMs) initially targeting the Germanic languages. This will create the foundation for an advanced open ecosystem for next generation modular and extensible European trustworthy, sustainable, and democratized large language models. Within this project, LiU NLP leads the work package on multilingual training and transfer learning for low-resource languages. Here, we are researching methods to improve LLMs for low-resource languages through e.g. training efficient language adapters, better handling of tokenization, or exploiting typological relatedness for cross-lingual transfer.

Funded by the European Union | Project website

STING: Synthesis and analysis with Transducers and Invertible Neural Generators

Human communication is multimodal and occurs through speech, language, gesture, facial expression, and similar signals. To enable natural interactions with human beings, artificial agents must be capable of analysing and producing these rich and interdependent signals and connecting them to their semantic implications. STING aims to unify synthesis and analysis through transducers and invertible neural models, connecting concrete, continuous-valued sensory data such as images, sound, and motion, with high-level, predominantly discrete representations of meaning. The project has the potential to endow synthesis output with human-understandable high-level explanations while simultaneously improving the ability to attach probabilities to semantic representations.

Funded by WASP

A Practical Theory of Computation for Modern Neural Network Architectures

Today, neural networks are the most important computational paradigm for AI and machine learning. While their computational power has been studied for decades, the theoretical development has not been able to keep up with the recent upshot of new neural architectures, such as the Transformer. Also, existing theoretical results are of limited practical value because they often apply to monolithic architectures, whereas applications typically use networks composed of re-usable components. Moreover, theoretical results tend to make idealising assumptions such as infinite precision or unlimited training time rather than acknowledging that such commodities are in short supply and should thus be treated as parameters of expressive power. In this project, we want to address these shortcomings and develop a practical theory of computation for modern neural network architectures. The project combines methods from theoretical computer science – especially the theory of computation and formal languages – with empirical validation in natural language processing (NLP).

Funded by WASP

Interpreting and Grounding Pre-Trained Representations for Natural Language Processing

Building computers that understand human language is one of the central goals of artificial intelligence. A recent breakthrough on the way towards this goal is the development of neural models that learn deep contextualized representations of language. However, while these models have substantially advanced the state of the art in natural language processing (NLP), our understanding of the learned representations and our repertoire of techniques for integrating them with other knowledge representations and reasoning facilities remain severely limited. To address these gaps, we will develop new methods for interpreting, grounding, and integrating deep contextualized representations of language and evaluate the usefulness of these methods in the context of threat intelligence applications together with our industrial partner.

Funded by WASP

Selected Recent Publications

Kevin Glocker, Kätriin Kukk, Romina Oji, Marcel Bollmann, Marco Kuhlmann, and Jenny Kunz. 2025. Grow Up and Merge: Scaling Strategies for Efficient Language Adaptation. arXiv:2512.10772.
Jenny Kunz, Iben Nyholm Debess, and Annika Simonsen. 2025. Family Matters: Language Transfer and Merging for Adapting Small LLMs to Faroese. arXiv:2510.00810.
Denitsa Saynova, Lovisa Hagström, Moa Johansson, Richard Johansson, and Marco Kuhlmann. 2025. Fact Recall, Heuristics or Pure Guesswork? Precise Interpretations of Language Models for Fact Completion. In Wanxiang Che, Joyce Nabende, Ekaterina Shutova, and Mohammad Taher Pilehvar, editors, Findings of the Association for Computational Linguistics: ACL 2025, pages 18322–18349, Vienna, Austria. Association for Computational Linguistics.
Kätriin Kukk, Danila Petrelli, Judit Casademont, Eric J. W. Orlowski, Michał Dzieliński, and Maria Jacobson. 2025. BiaSWE: An Expert Annotated Dataset for Misogyny Detection in Swedish. In Proceedings of the Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies (NoDaLiDa/Baltic-HLT 2025), Tallinn, Estonia.
Romina Oji and Jenny Kunz. 2025. How to Tune a Multilingual Encoder Model for Germanic Languages: A Study of PEFT, Full Fine-Tuning, and Language Adapters. In Proceedings of the Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies (NoDaLiDa/Baltic-HLT 2025), Tallinn, Estonia.

See all publications

Teaching

Our teaching portfolio comprises courses and degree projects in natural language processing and text mining at the basic, advanced, and doctoral levels.

Current Courses

AI for Natural Language (Bachelor)

Writing computer programs that understand and produce human language is a central goal of artificial intelligence. This course aims to provide an understanding of how far we are from this goal: what is and what is not possible today. Students will learn about current methods in natural language processing, about what resources are required to build language technology systems, and about how one can assess the quality of these methods and systems.

Course website ETE335

Language & Computers (Bachelor)

This course aims to provide an introduction to basic linguistic concepts and to teach techniques for collecting, analysing and validating linguistic data sets using computers. The focus of the course is on linguistic data in text form.

Course website 729G49 (only in Swedish)

Language Technology (Bachelor)

Language technology – technology for the analysis and interpretation of natural language – forms a key component of smart search engines, personal digital assistants, and many other innovative applications. The goal of this course is to give an introduction to language technology as an application area, as well as to its basic methods. The course focuses on methods that process text.

Course website 729G86/TDP030

Natural Language Processing (Master)

Natural Language Processing (NLP) develops methods for making human language accessible to computers. This course aims to provide students with a theoretical understanding of and practical experience with the advanced algorithms that power modern NLP. The course focuses on methods based on deep neural networks.

Course website TDDE09

Text Mining (Master)

Text Mining is about distilling actionable insights from text. The overall aim of this course is to provide students with practical experience of the main steps of text mining: information retrieval, processing of text data, modelling, analysis of experimental results. The course ends with an individual project where students work on a self-defined problem.

Course website 732A81/TDDE16

Deep Learning for NLP (PhD)

Natural Language Processing (NLP) develops methods for making human language accessible to computers. The goal of this course is to provide students with a theoretical understanding of and practical experience with the advanced algorithms that power modern NLP. The course focuses on methods based on deep neural networks.

This course is given in the context of the WASP Graduate School.

Course website

Open Master’s Thesis Proposals

Automated Verification of References in Texts and PDFs

Generative AI systems are gaining popularity in academic contexts, particularly among students. Today, these systems not only generate text, but also suggest relevant scientific references. Unfortunately, we frequently see citations that appear plausible but do not actually exist, contain numerous incorrect details, or are riddled with inconsistencies in the generated bibliography. This project aims to develop methods and tools for assisting verification of references, thereby increasing reliability and efficiency in academic workflows. The specific project goals are:

to develop an automated process to verify references extracted from PDFs
to explore approaches for generating correct references to replace hallucinated ones
to investigate existing literature and systems in the field of automated reference validation
to evaluate how LLMs can be trained or prompted to verify references
to prototype one or more verification tools that integrate into academic workflows

Contact: Johannes Fichte

Cross-Lingual Factual Consistency in LLMs

LLMs store extensive amounts of factual knowledge in their parameters. The answers to factual questions depend on exact formulations in prompts, but also on the language the question is asked in. Recent works report this inconsistency, highlight differences in transfer directions, and offer potential solutions. In this project, you will explore the extent of the problem, how proposed technical solutions mitigate it and what the remaining limitations are. We will narrow down the project based on your research interests.

You will join an ongoing research project in our group, working on modularity and multilinguality of LLMs.

Contact: Jenny Kunz

Fine-grained Language Identification

Identifying which language a text is written in sounds like a solved problem: most translation tools, for example, can automatically detect the language of the source text. However, language identification methods only tend to work reliably on longer inputs, such as documents or full sentences. There are many situations in which we might want to identify languages on a more fine-grained level, for example:

In code-switching, bilingual speakers mix expressions from two languages within a single utterance.
In datasets gathered from websites like Wikipedia, sentences often contain titles or names in other languages.

In this project, you will apply modern algorithms and machine learning techniques to try to improve language identification on a word or phrase level in mixed-language data. There are several ways you can approach this project:

You could focus on disambiguating a specific pair of languages in code-switched text.
You could focus on identifying spans in a common, high-resource language (such as English or Spanish) within texts of other, lower-resourced languages (such as Jamaican Creole or Northern Sámi).
You could focus on disambiguating closely related languages (such as Danish and Norwegian) or dialects, which is particularly challenging as they will also tend to look very similar.

Contact: Marcel Bollmann

Language Adaption of LLMs

Parameter-efficient fine-tuning techniques are increasingly popular as LLMs become larger and more powerful. They not only make the fine-tuning more cost-efficient and climate-friendly but also help preserve the models’ general capabilities and avoid catastrophic forgetting. In this project, you implement and compare different approaches to adapting a model to specific languages and evaluate them on multilingual datasets for different tasks and domains. You will join an ongoing project in our group and help us to improve our results and/or get new insights on where such methods are (not) useful.

You are welcome to contact us to discuss specific contribution. It could be along these lines:

Finding the optimal adaption setup for a specific language or task.
An in-depth evaluation of a language adaption method, including testing modifications from the literature or proposed by us or you.

Contact: Jenny Kunz