Carsten Eickhoff, Ph.D.

Professor | Scientific Director | Founder | Board Member | Expert in Natural Language Processing and AI

Tübingen

Research Expertise

Natural Language Processing

Information Retrieval

Digital Health

Generative AI

Machine Learning

Technology Entrepreneurship

About

Carsten is a Professor at the University of Tübingen where his lab specializes in the development of interpretable natural language processing and AI techniques. Prior to joining Tübingen, he was the Manning Assistant Professor of Medical and Computer Science at Brown University. He received degrees from the University of Edinburgh and TU Delft, and was a postdoctoral fellow at ETH Zurich and Harvard University. Carsten has authored more than 150 articles in computer science conferences (e.g., ICLR, ACL, SIGIR, WWW, KDD) and clinical journals (e.g., Nature Digital Medicine, The Lancet - Respiratory Medicine, Radiology, European Heart Journal). His research has been supported by the Swiss National Science Foundation, NSF, NIH, DARPA, IARPA, Google, Amazon, Microsoft and others. Aside from his academic endeavors, he is a founder and board member of several deep technology startups.

Legacy Map

Full View

Publications

A Transformer-based Framework for Multivariate Time Series Representation Learning

Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining

2021

Machine learning for real-time prediction of complications in critical care: a retrospective study

The Lancet Respiratory Medicine

2018

Quality through flow and immersion

Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval

2012

Increasing cheat robustness of crowdsourcing tasks

Information Retrieval

2012

Cognitive Biases in Crowdsourcing

Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining

2018

Lessons from the journey

Proceedings of the 7th ACM international conference on Web search and data mining

2014

Managing the Quality of Large-Scale Crowdsourcing

Unknown Venue

2011

Probabilistic Bag-Of-Hyperlinks Model for Entity Linking

Proceedings of the 25th International Conference on World Wide Web

2016

GeAnn at the TREC 2011 Crowdsourcing Track

Unknown Venue

2011

Comment on the Paper Titled ’The Origin of Quantum Mechanical Statistics: Insights from Research on Human Language’ (arXiv preprint arXiv:2407.14924, 2024)

Unknown Venue

2024

Advancing health equity with artificial intelligence

Journal of Public Health Policy

2021

Multimodal attention-based deep learning for Alzheimer’s disease diagnosis

Journal of the American Medical Informatics Association

2022

Deep-learning-based real-time prediction of acute kidney injury outperforms human predictive performance

npj Digital Medicine

2020

The where in the tweet

Proceedings of the 20th ACM international conference on Information and knowledge management

2011

Talking Heads: Understanding Inter-Layer Communication in Transformer Language Models

Advances in Neural Information Processing Systems 37

2024

Overview of ImageCLEF 2018: Challenges, Datasets and Evaluation

Lecture Notes in Computer Science

2018

Detecting Large Vessel Occlusion at Multiphase CT Angiography by Using a Deep Convolutional Neural Network

Radiology

2020

Overview of ImageCLEF 2017: Information Extraction from Images

Lecture Notes in Computer Science

2017

Drug–drug interaction prediction with Wasserstein Adversarial Autoencoder-based knowledge graph embeddings

Briefings in Bioinformatics

2020

ArXiv preprint server plans multimillion-dollar overhaul

Nature

2016

Language Models Implement Simple Word2Vec-style Vector Arithmetic

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)

2024

TripClick: The Log Files of a Large Health Web Search Engine

Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval

2021

Web2Text: Deep Structured Boilerplate Removal

Lecture Notes in Computer Science

2018

An Eye-Tracking Study of Query Reformulation

Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval

2015

Personalizing atypical web search sessions

Proceedings of the sixth ACM international conference on Web search and data mining

2013

Mitigating Bias in Search Results Through Contextual Document Reranking and Neutrality Regularization

Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval

2022

On the Effect of Low-Frequency Terms on Neural-IR Models

Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

2019

COVID-19 mortality prediction in the intensive care unit with deep learning based on longitudinal chest X-rays and clinical data

European Radiology

2022

Not All Relevance Scores are Equal: Efficient Uncertainty and Calibration Modeling for Deep Retrieval Models

Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval

2021

A combined topical/non-topical approach to identifying web sites for children

Proceedings of the fourth ACM international conference on Web search and data mining

2011

Preprint site arXiv is banning computer-science reviews: here’s why

Nature

2025

IsoScore: Measuring the Uniformity of Embedding Space Utilization

Findings of the Association for Computational Linguistics: ACL 2022

2022

Unsupervised Learning of Parsimonious General-Purpose Embeddings for User and Location Modeling

ACM Transactions on Information Systems

2018

An automated COVID-19 triage pipeline using artificial intelligence based on chest radiographs and clinical data

npj Digital Medicine

2022

Web page classification on child suitability

Proceedings of the 19th ACM international conference on Information and knowledge management

2010

Unsupervised Multivariate Time-Series Transformers for Seizure Identification on EEG

2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)

2022

Parameter-efficient Modularised Bias Mitigation via AdapterFusion

Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics

2023

NEWTS: A Corpus for News Topic-Focused Summarization

Findings of the Association for Computational Linguistics: ACL 2022

2022

Supporting children's web search in school environments

Proceedings of the 4th Information Interaction in Context Symposium

2012

Introduction to the special issue on search as learning

Information Retrieval Journal

2017

Crowd-powered experts

Proceedings of the First International Workshop on Gamification for Information Retrieval

2014

Development of a Deep Learning Network to Classify Inferior Vena Cava Collapse to Predict Fluid Responsiveness

Journal of Ultrasound in Medicine

2020

Do “Undocumented Workers” == “Illegal Aliens”? Differentiating Denotation and Connotation in Vector Spaces

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)

2020

Dynamic compression schemes for graph coloring

Bioinformatics

2018

Modelling Term Dependence with Copulas

Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval

2015

SIMSUM: Document-level Text Simplification via Simultaneous Summarization

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

2023

Machine learning to predict hemorrhage and thrombosis during extracorporeal membrane oxygenation

Critical Care

2020

Enriching Word Embeddings for Patent Retrieval with Global Context

Lecture Notes in Computer Science

2019

A Modern Perspective on Query Likelihood with Deep Generative Retrieval Models

Proceedings of the 2021 ACM SIGIR International Conference on Theory of Information Retrieval

2021

Computing Web-scale Topic Models using an Asynchronous Parameter Server

Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval

2017

Copulas for information retrieval

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval

2013

Transcriptional profiles of pulmonary artery endothelial cells in pulmonary hypertension

Scientific Reports

2023

CATS: Customizable Abstractive Topic-based Summarization

ACM Transactions on Information Systems

2021

A Cross-Platform Collection of Social Network Profiles

Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval

2016

Exploiting User Comments for Audio-Visual Content Indexing and Retrieval

Lecture Notes in Computer Science

Search Result Explanations Improve Efficiency and Trust

Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval

2020

Want a coffee?

Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval

2012

CODER: An efficient framework for improving retrieval through COntextual Document Embedding Reranking

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing

2022

Exploiting Document Content for Efficient Aggregation of Crowdsourcing Votes

Proceedings of the 24th ACM International on Conference on Information and Knowledge Management

2015

Geo-spatial Domain Expertise in Microblogs

Lecture Notes in Computer Science

2014

Biomedical Question Answering via Weighted Neural Network Passage Retrieval

Lecture Notes in Computer Science

2018

Named Entity Recognition of traditional architectural text based on BERT

2021 International Conference on Culture-oriented Science & Technology (ICCST)

2021

Experimental IR Meets Multilinguality, Multimodality, and Interaction

Lecture Notes in Computer Science

2020

Implicit Negative Feedback in Clinical Information Retrieval

Swiss Medical Informatics

2016

Probabilistic Local Expert Retrieval

Lecture Notes in Computer Science

2016

Web Search Query Assistance Functionality for Young Audiences

Lecture Notes in Computer Science

2011

What Do VLMs NOTICE? A Mechanistic Interpretability Pipeline for Gaussian-Noise-free Text-Image Corruption and Evaluation

Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)

2025

The Scholarly Impact of CLEF 2010–2017

The Information Retrieval Series

2019

Crowdsourced user interface testing for multimedia applications

Proceedings of the ACM multimedia 2012 workshop on Crowdsourcing for multimedia

2012

Towards Objective Quantification of Hand Tremors and Bradykinesia Using Contactless Sensors: A Systematic Review

Frontiers in Aging Neuroscience

2021

EmSe

Proceedings of the 4th Information Interaction in Context Symposium

2012

Self-Supervised Neural Topic Modeling

Findings of the Association for Computational Linguistics: EMNLP 2021

2021

Brown University at TREC Deep Learning 2019

Unknown Venue

2019

Model sensitivity analysis on arxiv

Unknown Venue

2018

Artificial intelligence-assisted care in medicine: a revolution or yet another blunt weapon?

European Heart Journal

2019

Active Content-Based Crowdsourcing Task Selection

Proceedings of the 25th ACM International on Conference on Information and Knowledge Management

2016

Neural Summarization of Electronic Health Records (Preprint)

Unknown Venue

2023

Search as Learning (SAL) Workshop 2016

Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval

2016

The downside of markup

Proceedings of the 21st ACM international conference on Information and knowledge management

2012

Exploring Facilitators and Barriers for Personalized Dietary Incentives Among Online Shoppers at Cardiovascular Risk and Key Informants to Inform an Automated Shopping Platform

Journal of Nutrition Education and Behavior

2025

K-Paths: Reasoning over Graph Paths for Drug Repurposing and Drug Interaction Prediction

Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.2

2025

Towards Best Practices of Axiomatic Activation Patching in Information Retrieval

Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval

2025

Workshop on Explainability in Information Retrieval

Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval

2025

Retrieval Augmented Therapy Suggestion for Molecular Tumor Boards: Algorithmic Development and Validation Study

Journal of Medical Internet Research

2025

What’s Going On With Me and How Can I Better Manage My Health? The Potential of GPT-4 to Transform Discharge Letters Into Patient-Centered Letters to Enhance Patient Safety: Prospective, Exploratory Study

Journal of Medical Internet Research

2025

MechIR: A Mechanistic Interpretability Framework for Information Retrieval

Lecture Notes in Computer Science

2025

Unknown Venue

2024

A Language Model–Powered Simulated Patient With Automated Feedback for History Taking: Prospective Study

JMIR Medical Education

2024

Retrieval Augmented Zero-Shot Text Classification

Proceedings of the 2024 ACM SIGIR International Conference on Theory of Information Retrieval

2024

Retrieval Augmented Therapy Suggestion for Molecular Tumor Boards: Algorithmic Development and Validation Study (Preprint)

Unknown Venue

2024

Axiomatic Causal Interventions for Reverse Engineering Relevance Computation in Neural Retrieval Models

Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval

2024

Evaluating Search System Explainability with Psychometrics and Crowdsourcing

Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval

2024

Retrieval-Based Diagnostic Decision Support: Mixed Methods Study

JMIR Medical Informatics

2024

A Language Model–Powered Simulated Patient With Automated Feedback for History Taking: Prospective Study (Preprint)

Unknown Venue

2024

Learning to Make Rare and Complex Diagnoses With Generative AI Assistance: Qualitative Study of Popular Large Language Models

JMIR Medical Education

2024

Wasserstein adversarial learning based temporal knowledge graph embedding

Information Sciences

2024

Predicting Acute Brain Injury in Venoarterial Extracorporeal Membrane Oxygenation Patients with Tree-Based Machine Learning: Analysis of the Extracorporeal Life Support Organization Registry

Unknown Venue

2024

Utilizing Machine Learning to Predict Neurological Injury in Venovenous Extracorporeal Membrane Oxygenation Patients: An Extracorporeal Life Support Organization Registry Analysis

Unknown Venue

2023

Predictive Uncertainty-based Bias Mitigation in Ranking

Proceedings of the 32nd ACM International Conference on Information and Knowledge Management

2023

Learning to Make Rare and Complex Diagnoses With Generative AI Assistance: Qualitative Study of Popular Large Language Models (Preprint)

Unknown Venue

2023

Retrieval-Based Diagnostic Decision Support: Mixed Methods Study (Preprint)

Unknown Venue

2023

Weakly supervised pneumonia localization in chest X‐rays using generative adversarial networks

Medical Physics

2021

Categorization of free-text drug orders using character-level recurrent neural networks

International Journal of Medical Informatics

2019

Dynamic compression schemes for graph coloring

Unknown Venue

2017

The Accuracy And Clinical Relevance of Chat GPT-4 in Triple Negative Breast Cancer Research

Acta Informatica Medica

2025

Report from the 4th Strategic Workshop on Information Retrieval in Lorne (SWIRL 2025)

ACM SIGIR Forum

2025

The topology of molecular representations and its influence on machine learning performance

Journal of Cheminformatics

2025

Artificial intelligence based real-time prediction of imminent heart failure hospitalisation in patients undergoing non-invasive telemedicine

Frontiers in Cardiovascular Medicine

2024

Identifying momentary suicidal ideation using machine learning in patients at high-risk for suicide

Journal of Affective Disorders

2024

Künstliche Intelligenz in der Medizin: Wo stehen wir heute, und was liegt vor uns?

Zeitschrift für Herz-,Thorax- und Gefäßchirurgie

2024

Short-term vital parameter forecasting in the intensive care unit: A benchmark study leveraging data from patients after cardiothoracic surgery

PLOS Digital Health

2024

One Third of Alcohol Use Disorder Diagnoses are Missed by ICD Coding

Substance Use & Addiction Journal

2024

Pre-operative lung ablation prediction using deep learning

European Radiology

2024

Interpretable machine learning-based predictive modeling of patient outcomes following cardiac surgery

The Journal of Thoracic and Cardiovascular Surgery

2025

Editorial: The Potential of Machine-learning in Pharmacogenetics, Pharmacogenomics and Pharmacoepidemiology: Volume II

Frontiers in Pharmacology

2023

AI-Controlled Closed-Loop Electrical Stimulation Implants: A Feasibility Study

Unknown Venue

2022

Delirium detection using wearable sensors and machine learning in patients with intracerebral hemorrhage

Frontiers in Neurology

2023

Neural text generation in regulatory medical writing

Frontiers in Pharmacology

2023

Risk Factors for Pediatric Sepsis in the Emergency Department

Pediatric Emergency Care

2023

Editorial: The Potential of Machine Learning in Pharmacogenetics, Pharmacogenomics and Pharmacoepidemiology

Frontiers in Pharmacology

2022

Correction to: COVID-19 mortality prediction in the intensive care unit with deep learning based on longitudinal chest X-rays and clinical data

European Radiology

2022

On the Role of “Digital Actors” in Entertainment-Based Virtual Worlds

The Oxford Handbook of Virtuality

2013

Machine learning and deep learning-based approaches in epilepsy

Unknown Venue

Quantifying the Risks of LLM- and Tool-assisted Rephrasing to Linguistic Diversity

Findings of the Association for Computational Linguistics: EMNLP 2025

2025

Position: Benchmarking is Broken - Don't Let AI be its Own Judge

Unknown Venue

2025

Pixels Versus Priors: Controlling Knowledge Priors in Vision-Language Models through Visual Counterfacts

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing

2025

Pathway to Relevance: How Cross-Encoders Implement a Semantic Variant of BM25

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing

2025

Interpretability Analysis of Arithmetic In-Context Learning in Large Language Models

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing

2025

Paths Not Taken: Understanding and Mending the Multilingual Factual Recall Pipeline

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing

2025

Beyond Contrastive Learning: Synthetic Data Enables List-wise Training with Multiple Levels of Relevance

Findings of the Association for Computational Linguistics: EMNLP 2025

2025

Re-Evaluating Evaluation for Multilingual Summarization

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing

2024

Forgotten Polygons: Multimodal Large Language Models are Shape-Blind

Findings of the Association for Computational Linguistics: ACL 2025

2025

Evaluating Self-Generated Documents for Enhancing Retrieval-Augmented Generation with Large Language Models

Findings of the Association for Computational Linguistics: NAACL 2025

2025

One-Versus-Others Attention: Scalable Multimodal Integration for Biomedical Data

Biocomputing 2025

2024

Text Simplification via Adaptive Teaching

Findings of the Association for Computational Linguistics ACL 2024

2024

Stable On-Line Learning with Optimized Local Learning, But Minimal Change of the Global Output

2013 12th International Conference on Machine Learning and Applications

2013

Enhancing the Ranking Context of Dense Retrieval through Reciprocal Nearest Neighbors

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing

2023

Outlier Dimensions Encode Task Specific Knowledge

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing

2023

Pretraining on Interactions for Learning Grounded Affordance Representations

Proceedings of the 11th Joint Conference on Lexical and Computational Semantics

2022

American Medical Informatics Association (AMIA) 2007 Annual Symposium

Unknown Venue

2008

Inconsistent Ranking Assumptions in Medical Search and Their Downstream Consequences

Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval

2022

APA-RST: A Text Simplification Corpus with RST Annotations

Proceedings of the 4th Workshop on Computational Approaches to Discourse (CODI 2023)

2023

SOCCER: An Information-Sparse Discourse State Tracking Collection in the Sports Commentary Domain

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

2021

34th Annual Symposium on Biomedical and Health Informatics (AMIA 2010) conference report

ACM SIGHIT Record

2011

Web-Based Visualization of MeSH-Based PubMed/MEDLINE Statistics

Studies in Health Technology and Informatics

2019

Baxter Amia Automated Peritoneal Dialysis Cycler Set

Biomedical Safety & Standards

2024

Education

Technische Universiteit Delft

Ph.D. (Computer Science) / October, 2014

Delft

The University of Edinburgh

M.Sc. (Artificial Intelligence) / November, 2009

Edinburgh

FHDW Hannover

B.Sc. / 2008

Experience

University of Tübingen

Professor / 2022 — Present

Brown University

Manning Assistant Professor / 2018 — 2022

ETH Zurich

Postdoc / 2014 — 2018

University Hospital Tübingen

Scientific Director (Medical Data Integration Center) / 2022 — Present

Harvard University

Visiting Fellow / 2017 — 2017

codiag AG

Co-Founder & Chief Scientist / 2018 — 2022

CareCrowd

Co-Founder & CTO / 2024 — Present

Links & Social Media

Research Web Site

Personal Homepage

ORCID

Join Carsten on NotedSource!

Join Now

At NotedSource, we believe that professors, post-docs, scientists and other researchers have deep, untapped knowledge and expertise that can be leveraged to drive innovation within companies. NotedSource is committed to bridging the gap between academia and industry by providing a platform for collaboration with industry and networking with other researchers.

For industry, NotedSource identifies the right academic experts in 24 hours to help organizations build and grow. With a platform of thousands of knowledgeable PhDs, scientists, and industry experts, NotedSource makes connecting and collaborating easy.

For academic researchers such as professors, post-docs, and Ph.D.s, NotedSource provides tools to discover and connect to your colleagues with messaging and news feeds, in addition to the opportunity to be paid for your collaboration with vetted partners.