Marek Rei

I am a researcher in Machine Learning and Natural Language Processing. My work investigates new architectures and optimization methods for language modelling and multimodal representations.

I am an Associate Professor of Machine Learning at Imperial College London and a Visiting Researcher at the University of Cambridge. I am an AI Advisor for Locai Labs and Esgrid Technologies, and I provide consultancy services through Perception Labs.

Previously, I did a post-doc in Cambridge and worked in the Research team at SwiftKey, where we developed experimental technologies for language modelling and natural language processing. One of my main projects was the neural network language model for text prediction. SwiftKey has since been acquired by Microsoft.

I received my PhD from the University of Cambridge, at Churchill College, with my thesis "Minimally supervised dependency-based methods for natural language processing", under the supervision of Professor Ted Briscoe. Before that, in 2008-2009, I did an MPhil course in the Computer Lab called Computer Speech, Text and Internet Technology. The topic of my dissertation was "Adaptive Interactive Information Extraction".

I also studied for three years at Tallinn University of Technology, where I received my bachelor's degree with the thesis "Creating a Model for Audiovisual Speech in Estonian".

Download: My CV

Research interests

My main areas of interest currently include:

planning and reasoning with LLMs
explainability and uncertainty of ML models
modelling of medical and healthcare data
extraction and structuring of information from unstructured text
grammatical error correction and essay scoring

I provide consultancy services in the areas of machine learning and natural language processing. If you are interested, feel free to get in touch.

Contact

E-mail: marek@marekrei.com

Publications

No Need for Explanations: LLMs can implicitly learn from mistakes in-context [arxiv] Lisa Alazraki, Maximilian Mozes, Jon Ander Campos, Yi Chern Tan, Marek Rei, Max Bartolo In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP 2025) Suzhou, China, 2025

Predictive Multimodal Modeling of Diagnoses and Treatments in EHR [arxiv] Cindy Shih-Ting Huang, Clarence Boon Liang Ng, Marek Rei In Proceedings of the MLLM in Clinical Practice Workshop at MICCAI2025 (MLLMCP 2025) Daejeon, Republic of Korea, 2025

StateAct: Enhancing LLM Base Agents via Self-prompting and State-tracking [arxiv] Nikolai Rozanov, Marek Rei In Proceedings of the Workshop for Research on Agent Language Models (REALM 2025) Vienna, Austria, 2025

DiffuseDef: Improved Robustness to Adversarial Attacks [arxiv] Zhenhao Li, Huichi Zhou, Marek Rei, Lucia Specia In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025) Vienna, Austria, 2025

Enhancing LLM Robustness to Perturbed Instructions: An Empirical Study [arxiv] Aryan Agrawal, Lisa Alazraki, Shahin Honarvar, Marek Rei ICLR 2025 Workshop - Building Trust in LLMs and LLM Applications: From Guardrails to Explainability to Regulation Singapore, 2025

Assessing the effectiveness of interdependent corporate sustainability choices [link] Simone Cenci, Matteo Burato, Marek Rei, Maurizio Zollo npj Climate Action (Nature) 2025

Meta-Reasoning Improves Tool Use in Large Language Models [arxiv] Lisa Alazraki, Marek Rei Findings of the Association for Computational Linguistics: NAACL 2025 Albuquerque, New Mexico, 2025

SoK: Membership Inference Attacks on LLMs are Rushing Nowhere (and How to Fix It) [arxiv] Matthieu Meeus, Igor Shilov, Shubham Jain, Manuel Faysse, Marek Rei, Yves-Alexandre de Montjoye IEEE Conference on Secure and Trustworthy Machine Learning (SaTML 2025) *Best paper award* Copenhagen, Denmark, 2025

Atomic Inference for NLI with Generated Facts as Atoms [arxiv] [link] [blog] [pdf] Joe Stacey, Pasquale Minervini, Haim Dubossarsky, Oana-Maria Camburu, Marek Rei In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024) Miami, USA, 2024

Distilling Robustness into Natural Language Inference Models with Domain-Targeted Augmentation [arxiv] [link] [pdf] [video] Joe Stacey, Marek Rei In Findings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024) Bangkok, Thailand, 2024

Continuous Predictive Modeling of Clinical Notes and ICD Codes in Patient Health Records [arxiv] [link] [pdf] Mireia Hernandez Caralt, Clarence Boon Liang Ng, Marek Rei In Proceedings of the Biomedical Natural Language Processing Workshop (BioNLP 2024) Bangkok, Thailand, 2024

Predicting cell type-specific epigenomic profiles accounting for distal genetic effects [link] [biorxiv] Alan E Murphy, William Beardall, Marek Rei, Mike Phuycharoen, Nathan G Skene Nature Communications 2024

Prompting open-source and commercial language models for grammatical error correction of English learner text [arxiv] [link] [pdf] Christopher Davis, Andrew Caines, Øistein Andersen, Shiva Taslimipoor, Helen Yannakoudakis, Zheng Yuan, Christopher Bryant, Marek Rei, Paula Buttery In Findings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024) Bangkok, Thailand, 2024

Did the Neurons Read your Book? Document-level Membership Inference for Large Language Models [arxiv] Matthieu Meeus, Shubham Jain, Marek Rei, Yves-Alexandre de Montjoye The 33rd USENIX Security Symposium (2024) Philadelphia, PA, USA, 2024

The alignment of companies' sustainability behavior and emissions with global climate targets [link] Simone Cenci, Matteo Burato, Marek Rei, Maurizio Zollo Nature Communications, 2023

When and Why Does Bias Mitigation Work? [link] [pdf] Abhilasha Ravichander, Joe Stacey, Marek Rei In Findings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023) Singapore, 2023

Competitive Pressure and Emission Reduction: Unravelling the Link [link] Simone Cenci, Hossein Asgharian, Lu Liu, Marek Rei, Maurizio Zollo SSRN, 2023

Climbing empirical fitness landscapes requires companies to manage interconnected sustainability choices [link] Simone Cenci, Matteo Burato, Marek Rei, Maurizio Zollo SSRN 2023

Finding the Needle in a Haystack: Unsupervised Rationale Extraction from Long Text Classifiers [arxiv] Kamil Bujel, Andrew Caines, Helen Yannakoudakis, Marek Rei ArXiv, 2023

Does competitive pressure drive effective corporate environmental actions? [link] Simone Cenci, Hossein Asgharian, Lu Liu, Marek Rei, Maurizio Zollo SSRN 2023

On the application of Large Language Models for language teaching and assessment technology [arxiv] Andrew Caines, Luca Benedetto, Shiva Taslimipoor, Christopher Davis, Yuan Gao, Øistein Andersen, Zheng Yuan, Mark Elliott, Russell Moore, Christopher Bryant, Marek Rei, Helen Yannakoudakis, Andrew Mullooly, Diane Nicholls, Paula Buttery In Proceedings of the AIED 2023 Workshop on Empowering Education with LLMs (AIED LLM 2023) Tokyo, Japan, 2023

Modelling Temporal Document Sequences for Clinical ICD Coding [pdf] Clarence Ng, Diogo Santos, Marek Rei In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2023) Dubrovnik, Croatia, 2023

An Extended Sequence Tagging Vocabulary for Grammatical Error Correction [pdf] Stuart Mesham, Christopher Bryant, Marek Rei, Zheng Yuan In Findings of the 17th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2023) Dubrovnik, Croatia, 2023

Multimodal Conversation Modelling for Topic Derailment Detection [pdf] Zhenhao Li, Marek Rei, Lucia Specia In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022) Abu Dhabi, United Arab Emirates, 2022

Logical Reasoning with Span-Level Predictions for Interpretable and Robust NLI Models [arxiv] [blog] [pdf] Joe Stacey, Pasquale Minervini, Haim Dubossarsky, Marek Rei In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022) Abu Dhabi, United Arab Emirates, 2022

Control Prefixes for Parameter-Efficient Text Generation [arxiv] [pdf] Jordan Clive, Kris Cao, Marek Rei In Proceedings of the Second workshop on Generation, Evaluation & Metrics (GEM 2022) Abu Dhabi, United Arab Emirates, 2022

An Analysis of Corporate Sustainability Behaviour Through the Lens of Empirical Fitness Landscapes [pre-print] Simone Cenci, Marek Rei, Maurizio Zollo SSRN pre-print under review, 2022

Supervising Model Attention with Human Explanations for Robust Natural Language Inference [arxiv] [poster] Joe Stacey, Yonatan Belinkov, Marek Rei In Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI 2022) *Acceptance rate: 15%* Virtual Conference, 2022

Business sustainability behaviour and alignment with climate targets [pre-print] Simone Cenci, Matteo Burato, Marek Rei, Maurizio Zollo Research Square pre-print under review, 2022

Memorisation versus Generalisation in Pre-trained Language Models [arxiv] [pdf] [video] [poster] Michael Tänzer, Sebastian Ruder, Marek Rei In Proceedings of the 60th annual meeting of the Association for Computational Linguistics (ACL 2022) Dublin, Ireland, 2022

Probing for targeted syntactic knowledge through grammatical error detection [pdf] Christopher Davis, Christopher Bryant, Andrew Caines, Marek Rei, Paula Buttery In Proceedings of the 26th Conference on Computational Natural Language Learning (CoNLL 2022) Abu Dhabi, United Arab Emirates, 2022

Guiding Visual Question Generation [arxiv] [pdf] [video] Nihir Vedd, Zixu Wang, Marek Rei, Yishu Miao, Lucia Specia In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL-HLT 2022) Seattle, Washington, USA, 2022

Visual Cues and Error Correction for Translation Robustness [arxiv] [pdf] [video] [code] Zhenhao Li, Marek Rei, Lucia Specia In Findings of the Association for Computational Linguistics: EMNLP 2021

GiBERT: Enhancing BERT with Linguistic Information using a Lightweight Gated Injection Method [arxiv] [pdf] [video] [code] Nicole Peinelt, Marek Rei, Maria Liakata In Findings of the Association for Computational Linguistics: EMNLP 2021

Contextual Sentence Classification: Detecting Sustainability Initiatives in Company Reports [arxiv] Dan Hirlea, Christopher Bryant, Maurizio Zollo, Marek Rei ArXiv, 2021

Zero-shot Sequence Labeling for Transformer-based Sentence Classifiers [arxiv] [pdf] [video] [code] Kamil Bujel, Helen Yannakoudakis, Marek Rei In Proceedings of the 6th Workshop on Representation Learning for NLP (RepL4NLP 2021) Virtual Conference, 2021

How Metaphors Impact Political Discourse: A Large-Scale Topic-Agnostic Study Using Neural Metaphor Detection [arxiv] [pdf] Vinodkumar Prabhakaran, Marek Rei, Ekaterina Shutova In Proceedings of the 15th AAAI International Conference on Web and Social Media (ICWSM 2021) *Acceptance rate: 21.4%* Atlanta, USA, 2021

Grammatical error detection in transcriptions of spoken English [pdf] [dataset] Andrew Caines, Christian Bentz, Kate Knill, Marek Rei, Paula Buttery In Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020) Virtual Conference, 2020

Verbal Multiword Expressions for Identification of Metaphor [pdf] [video] Omid Rohanian, Marek Rei, Shiva Taslimipoor, Le An Ha In Proceedings of the 58th annual meeting of the Association for Computational Linguistics (ACL 2020) *Acceptance rate: 25.2%* Seattle, USA, 2020

Grammatical Error Correction in Low Error Density Domains: A New Benchmark and Analyses [arxiv] [pdf] [video] [dataset] Simon Flachs, Ophélie Lacroix, Helen Yannakoudakis, Marek Rei, Anders Søgaard In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020) *Acceptance rate: 22.4%* Virtual Conference, 2020

Seeing Both the Forest and the Trees: Multi-head Attention for Joint Classification on Different Compositional Levels [arxiv] [pdf] [code] Miruna Pislar, Marek Rei In Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020) Virtual Conference, 2020

Multidirectional Associative Optimization of Function-Specific Word Representations [arxiv] [pdf] [video] [code] Daniela Gerz, Ivan Vulić, Marek Rei, Roi Reichart, Anna Korhonen In Proceedings of the 58th annual meeting of the Association for Computational Linguistics (ACL 2020) *Acceptance rate: 25.2%* Seattle, USA, 2020

Semi-Supervised Bootstrapping of Dialogue State Trackers for Task-Oriented Modelling [pdf] Bo-Hsiang Tseng, Marek Rei, Paweł Budzianowski, Richard Turner, Bill Byrne, Anna Korhonen In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP 2019) Hong Kong, China, 2019

Modelling the interplay of metaphor and emotion through multitask learning [pdf] Verna Dankers, Marek Rei, Martha Lewis, Ekaterina Shutova In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP 2019) Hong Kong, China, 2019

Neural and FST-based approaches to grammatical error correction [pdf] Zheng Yuan, Felix Stahlberg, Marek Rei, Bill Byrne, Helen Yannakoudakis In Proceedings of the 14th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2019) Florence, Italy, 2019

Jointly Learning to Label Sentences and Tokens [arxiv] [pdf] [poster] [code] [slides] Marek Rei, Anders Søgaard In Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence (AAAI 2019) *Acceptance rate: 16.2%* Honolulu, USA, 2019

CAMsterdam at SemEval-2019 Task 6: Neural and graph-based feature extraction for the identification of offensive tweets [pdf] Guy Aglionby, Christopher Davis, Pushkar Mishra, Andrew Caines, Helen Yannakoudakis, Marek Rei, Ekaterina Shutova, Paula Buttery In Proceedings of the International Workshop on Semantic Evaluation 2019 (SemEval 2019) Minneapolis, USA, 2019

A Simple and Robust Approach to Detecting Subject-Verb Agreement Errors [pdf] Simon Flachs, Ophélie Lacroix, Marek Rei, Helen Yannakoudakis, Anders Søgaard In Proceedings of the 17th Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2019) Minneapolis, USA, 2019

Bad Form: Comparing Context-Based and Form-Based Few-Shot Learning in Distributional Semantic Models [arxiv] [pdf] Jeroen Van Hautte, Guy Emerson, Marek Rei In Proceedings of the Second Workshop on Deep Learning for Low-Resource NLP (DeepLo 2019) Hong Kong, China, 2019

Context is Key: Grammatical Error Detection with Contextual Word Representations [arxiv] [pdf] [code] Samuel Bell, Helen Yannakoudakis, Marek Rei In Proceedings of the 14th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2019) Florence, Italy, 2019

Variable Typing: Assigning Meaning to Variables in Mathematical Text [pdf] [video] [slides] Yiannos Stathopoulos, Simon Baker, Marek Rei, Simone Teufel In Proceedings of the 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL 2018) New Orleans, United States, 2018

Scoring Lexical Entailment with a Supervised Directional Similarity Network [arxiv] [pdf] [video] [code] [slides] Marek Rei, Daniela Gerz, Ivan Vulić In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL 2018) *Acceptance rate: 24.9%* Melbourne, Australia, 2018

Zero-shot Sequence Labeling: Transferring Knowledge from Sentences to Tokens [arxiv] [pdf] [video] [code] [slides] Marek Rei, Anders Søgaard In Proceedings of the 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL 2018) New Orleans, United States, 2018

Neural Multi-task Learning in Automated Assessment [arxiv] Ronan Cummins, Marek Rei arXiv:1801.06830, 2018

Advance Prediction of Ventricular Tachyarrhythmias using Patient Metadata and Multi-Task Networks [arxiv] [poster] Marek Rei, Josh Oppenheimer, Marek Sirendi In Proceedings of the NeurIPS Workshop on Machine Learning for Health (ML4H 2018) Montreal, Canada, 2018

Sequence classification with human attention [pdf] [code] Maria Barrett, Joachim Bingel, Nora Hollenstein, Marek Rei, Anders Søgaard In Proceedings of the SIGNLL Conference on Computational Natural Language Learning (CoNLL 2018) *Special award for the best paper on research inspired by human language learning and processing* Brussels, Belgium, 2018

Grasping the Finer Point: A Supervised Similarity Network for Metaphor Detection [arxiv] [pdf] [video] [code] [slides] Marek Rei, Luana Bulat, Douwe Kiela, Ekaterina Shutova In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP-2017) *Acceptance rate: 26%* Copenhagen, Denmark, 2017

An Error-Oriented Approach to Word Embedding Pre-Training [arxiv] [pdf] Youmna Farag, Marek Rei, Ted Briscoe In Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications (BEA-2017) Copenhagen, Denmark, 2017

Detecting Off-topic Responses to Visual Prompts [arxiv] [pdf] [poster] Marek Rei In Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications (BEA-2017) Copenhagen, Denmark, 2017

Auxiliary Objectives for Neural Error Detection Models [arxiv] [pdf] [slides] Marek Rei, Helen Yannakoudakis In Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications (BEA-2017) Copenhagen, Denmark, 2017

Artificial Error Generation with Machine Translation and Syntactic Patterns [arxiv] [pdf] Marek Rei, Mariano Felice, Zheng Yuan, Ted Briscoe In Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications (BEA-2017) Copenhagen, Denmark, 2017

Semi-supervised Multitask Learning for Sequence Labeling [arxiv] [pdf] [poster] [code] Marek Rei In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL-2017) Vancouver, Canada, 2017

Neural Sequence-Labelling Models for Grammatical Error Correction [pdf] Helen Yannakoudakis, Marek Rei, Øistein E. Andersen, Zheng Yuan In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP-2017) *Acceptance rate: 26%* Copenhagen, Denmark, 2017

Attending to characters in neural sequence labeling models [arxiv] [pdf] [poster] [code] Marek Rei, Gamal K.O. Crichton, Sampo Pyysalo In Proceedings of the 26th International Conference on Computational Linguistics (COLING-2016) Osaka, Japan, 2016

Compositional Sequence Labeling Models for Error Detection in Learner Writing [arxiv] [pdf] [poster] [code] Marek Rei, Helen Yannakoudakis In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL-2016) Berlin, Germany, 2016

Automatic Text Scoring Using Neural Networks [arxiv] [pdf] [poster] Dimitrios Alikaniotis, Helen Yannakoudakis, Marek Rei In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL-2016) Berlin, Germany, 2016

A Joint Model for Word Embedding and Word Morphology [arxiv] [pdf] Kris Cao, Marek Rei In Proceedings of the 1st Workshop on Representation Learning for NLP (RepL4NLP-2016) Berlin, Germany, 2016

Sentence Similarity Measures for Fine-Grained Estimation of Topical Relevance in Learner Essays [arxiv] [pdf] [code] [slides] [weights] Marek Rei, Ronan Cummins In Proceedings of the 11th Workshop on Innovative Use of NLP for Building Educational Applications (BEA) San Diego, United States, 2016

Online Representation Learning in Recurrent Neural Language Models [pdf] [poster] Marek Rei In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP) Lisbon, Portugal, 2015

Looking for hyponyms in vector space [pdf] [poster] [dataset] [slides] [vectorsets] Marek Rei, Ted Briscoe In Proceedings of the Eighteenth Conference on Computational Natural Language Learning (CoNLL-14) Baltimore, Maryland, United States, 2014

Minimally supervised dependency-based methods for natural language processing [pdf] Marek Rei PhD thesis, University of Cambridge Cambridge, United Kingdom, 2013

Parser lexicalisation through self-learning [pdf] [poster] Marek Rei, Ted Briscoe In Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2013). Atlanta, United States, 2013

Unsupervised Entailment Detection between Dependency Graph Fragments [pdf] [dataset] Marek Rei, Ted Briscoe In Proceedings of the 2011 Workshop on Biomedical Natural Language Processing (BioNLP-11). Portland, United States, 2011

Intelligent Information Access from Scientific Papers [link] Ted Briscoe, Karl Harrison, Andrew Naish-Guzman, Andy Parker, Marek Rei, Advaith Siddharthan, David Sinclair, Mark Slater, Rebecca Watson Current Challenges in Patent Information Retrieval, edited by Mihai Lupu, Katja Mayer, John Tait and Anthony J. Trippe. Springer, Dordrecht, 2011

Combining Manual Rules and Supervised Learning for Hedge Cue and Scope Detection [pdf] Marek Rei, Ted Briscoe The 14th Conference on Natural Language Learning (CoNLL-10). Uppsala, Sweden, 2010

Adaptive Interactive Information Extraction [pdf] Marek Rei MPhil thesis Computer Laboratory, University of Cambridge, 2009