Iñigo Alonso

NLP Research Scientist

Researching multimodal document understanding methods for reasoning over documents that combine text, tables, and images. Building large-scale multimodal ML systems.

University of Edinburgh

GitHub Scholar Twitter

About

I'm a postdoctoral researcher at the University of Edinburgh, working with Prof. Mirella Lapata on how vision-language models reason over real-world documents that combine text, tables, figures, and images. Tables have been my long-standing focus, but my work now spans documents as a whole. I have 14 years of combined experience across academia and industry.

Experience

Postdoctoral Research Associate

Jan. 2025 - Present

EdinburghNLP, University of Edinburgh

Research on multimodal document understanding in Prof. Mirella Lapata's group, focusing on visually-represented language, table understanding, and reasoning over multimodal documents.

TABLET (ICLR 2026): a 4M-example visual table understanding dataset across 21 tasks, where fine-tuned models improve robustness when exposed to real-world visually rich tables.
Improving numerical reasoning of VLMs over long-context hybrid documents. Exploring agentic thinking and latent reasoning beyond chain-of-thought.

Research Assistant

Jan. 2023 - Dec. 2024

HiTZ Center, University of the Basque Country UPV/EHU

Research across two projects: Luminous (EU project on multimodal dialogue for mixed reality headsets) and Antidote (retrieval-augmented medical question answering in LLMs).

MATE (ACL 2025): a benchmark to test cross-modal entity correlation in vision-language models, showing state-of-the-art models fall significantly short of human performance.
MedExpQA (Artificial Intelligence in Medicine, 2024): the first multilingual medical QA benchmark with gold explanations from medical doctors. Later adopted by Google as a training dataset for MedGemma.

Machine Learning Engineer

May 2018 - Sep. 2021

Sherpa.ai

Developed T5-based data-to-text generation for a widely-used commercial smart assistant. Earlier, developed LSTM-based models for text summarisation and slot filling, deployed in production on AWS.

Research Intern, Research Assistant

Feb. 2012 - Jan. 2015 · Jan. 2017 - Sep. 2017

Deustotech Computing

Early-career research roles applying machine learning to data science problems, including anomaly detection in web logs and retail customer segmentation.

Publications

TABLET: A Large-Scale Dataset for Robust Visual Table Understanding

Iñigo Alonso, Imanol Miranda, Eneko Agirre, Mirella Lapata

ICLR 2026

Accepted

MATE: A Cross-modal Entity Linking Benchmark for Vision Language Models

Iñigo Alonso, Ander Salaberria, Gorka Azkune, Oier Lopez de Lacalle

ACL 2025

Accepted at Findings

PixT3: Pixel-based Table-To-Text Generation

Iñigo Alonso, Eneko Agirre, Mirella Lapata

ACL 2024

Accepted at Main Conference

MedExpQA: Multilingual Benchmarking of Large Language Models for Medical Question Answering

Iñigo Alonso, Maite Oronoz, Rodrigo Agerri

Artificial Intelligence in Medicine

Accepted • 2024

Automatic Logical Forms improve fidelity in Table-to-Text generation

Iñigo Alonso, Eneko Agirre

Expert Systems with Applications

Accepted • 2023