Iñigo Alonso

NLP Research Scientist

Researching multimodal document understanding methods for reasoning over documents that combine text, tables, and images. Building large-scale multimodal ML systems.

University of Edinburgh
GitHub Scholar Twitter

About

I'm a postdoctoral researcher at the University of Edinburgh, working with Prof. Mirella Lapata on how vision-language models reason over real-world documents that combine text, tables, figures, and images. Tables have been my long-standing focus, but my work now spans documents as a whole. I have 14 years of combined experience across academia and industry.


Experience

Postdoctoral Research Associate
Jan. 2025 - Present
EdinburghNLP, University of Edinburgh

Research on multimodal document understanding in Prof. Mirella Lapata's group, focusing on visually-represented language, table understanding, and reasoning over multimodal documents.

  • TABLET (ICLR 2026): a 4M-example visual table understanding dataset across 21 tasks, where fine-tuned models improve robustness when exposed to real-world visually rich tables.
  • Improving numerical reasoning of VLMs over long-context hybrid documents. Exploring agentic thinking and latent reasoning beyond chain-of-thought.
Research Assistant
Jan. 2023 - Dec. 2024
HiTZ Center, University of the Basque Country UPV/EHU

Research across two projects: Luminous (EU project on multimodal dialogue for mixed reality headsets) and Antidote (retrieval-augmented medical question answering in LLMs).

  • MATE (ACL 2025): a benchmark to test cross-modal entity correlation in vision-language models, showing state-of-the-art models fall significantly short of human performance.
  • MedExpQA (Artificial Intelligence in Medicine, 2024): the first multilingual medical QA benchmark with gold explanations from medical doctors. Later adopted by Google as a training dataset for MedGemma.
Machine Learning Engineer
May 2018 - Sep. 2021
Sherpa.ai

Developed T5-based data-to-text generation for a widely-used commercial smart assistant. Earlier, developed LSTM-based models for text summarisation and slot filling, deployed in production on AWS.

Research Intern, Research Assistant
Feb. 2012 - Jan. 2015 · Jan. 2017 - Sep. 2017
Deustotech Computing

Early-career research roles applying machine learning to data science problems, including anomaly detection in web logs and retail customer segmentation.


Publications

TABLET: A Large-Scale Dataset for Robust Visual Table Understanding
Iñigo Alonso, Imanol Miranda, Eneko Agirre, Mirella Lapata
ICLR 2026
Accepted
MATE: A Cross-modal Entity Linking Benchmark for Vision Language Models
Iñigo Alonso, Ander Salaberria, Gorka Azkune, Oier Lopez de Lacalle
ACL 2025
Accepted at Findings
PixT3: Pixel-based Table-To-Text Generation
Iñigo Alonso, Eneko Agirre, Mirella Lapata
ACL 2024
Accepted at Main Conference
MedExpQA: Multilingual Benchmarking of Large Language Models for Medical Question Answering
Iñigo Alonso, Maite Oronoz, Rodrigo Agerri
Artificial Intelligence in Medicine
Accepted • 2024
Automatic Logical Forms improve fidelity in Table-to-Text generation
Iñigo Alonso, Eneko Agirre
Expert Systems with Applications
Accepted • 2023