Carsten Eickhoff, Ph.D.

Professor | Scientific Director | Founder | Board Member | Expert in Natural Language Processing and AI

Tübingen

Research Expertise

Natural Language Processing
Information Retrieval
Digital Health
Generative AI
Machine Learning
Technology Entrepreneurship

About

Carsten is a Professor at the University of Tübingen where his lab specializes in the development of interpretable natural language processing and AI techniques. Prior to joining Tübingen, he was the Manning Assistant Professor of Medical and Computer Science at Brown University. He received degrees from the University of Edinburgh and TU Delft, and was a postdoctoral fellow at ETH Zurich and Harvard University. Carsten has authored more than 150 articles in computer science conferences (e.g., ICLR, ACL, SIGIR, WWW, KDD) and clinical journals (e.g., Nature Digital Medicine, The Lancet - Respiratory Medicine, Radiology, European Heart Journal). His research has been supported by the Swiss National Science Foundation, NSF, NIH, DARPA, IARPA, Google, Amazon, Microsoft and others. Aside from his academic endeavors, he is a founder and board member of several deep technology startups.

Legacy Map

Full View

Publications

A Transformer-based Framework for Multivariate Time Series Representation Learning
Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining
2021
Machine learning for real-time prediction of complications in critical care: a retrospective study
The Lancet Respiratory Medicine
2018
Quality through flow and immersion
Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
2012
Increasing cheat robustness of crowdsourcing tasks
Information Retrieval
2012
Cognitive Biases in Crowdsourcing
Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining
2018
Lessons from the journey
Proceedings of the 7th ACM international conference on Web search and data mining
2014
Managing the Quality of Large-Scale Crowdsourcing
Unknown Venue
2011
Probabilistic Bag-Of-Hyperlinks Model for Entity Linking
Proceedings of the 25th International Conference on World Wide Web
2016
GeAnn at the TREC 2011 Crowdsourcing Track
Unknown Venue
2011
Comment on the Paper Titled ’The Origin of Quantum Mechanical Statistics: Insights from Research on Human Language’ (arXiv preprint arXiv:2407.14924, 2024)
Unknown Venue
2024
Advancing health equity with artificial intelligence
Journal of Public Health Policy
2021
Multimodal attention-based deep learning for Alzheimer’s disease diagnosis
Journal of the American Medical Informatics Association
2022
Deep-learning-based real-time prediction of acute kidney injury outperforms human predictive performance
npj Digital Medicine
2020
The where in the tweet
Proceedings of the 20th ACM international conference on Information and knowledge management
2011
Talking Heads: Understanding Inter-Layer Communication in Transformer Language Models
Advances in Neural Information Processing Systems 37
2024
Overview of ImageCLEF 2018: Challenges, Datasets and Evaluation
Lecture Notes in Computer Science
2018
Detecting Large Vessel Occlusion at Multiphase CT Angiography by Using a Deep Convolutional Neural Network
Radiology
2020
Overview of ImageCLEF 2017: Information Extraction from Images
Lecture Notes in Computer Science
2017
Drug–drug interaction prediction with Wasserstein Adversarial Autoencoder-based knowledge graph embeddings
Briefings in Bioinformatics
2020
ArXiv preprint server plans multimillion-dollar overhaul
Nature
2016
Language Models Implement Simple Word2Vec-style Vector Arithmetic
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)
2024
TripClick: The Log Files of a Large Health Web Search Engine
Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval
2021
Web2Text: Deep Structured Boilerplate Removal
Lecture Notes in Computer Science
2018
An Eye-Tracking Study of Query Reformulation
Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval
2015
Personalizing atypical web search sessions
Proceedings of the sixth ACM international conference on Web search and data mining
2013
Mitigating Bias in Search Results Through Contextual Document Reranking and Neutrality Regularization
Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval
2022
On the Effect of Low-Frequency Terms on Neural-IR Models
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval
2019
COVID-19 mortality prediction in the intensive care unit with deep learning based on longitudinal chest X-rays and clinical data
European Radiology
2022
Not All Relevance Scores are Equal: Efficient Uncertainty and Calibration Modeling for Deep Retrieval Models
Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval
2021
A combined topical/non-topical approach to identifying web sites for children
Proceedings of the fourth ACM international conference on Web search and data mining
2011
Preprint site arXiv is banning computer-science reviews: here’s why
Nature
2025
IsoScore: Measuring the Uniformity of Embedding Space Utilization
Findings of the Association for Computational Linguistics: ACL 2022
2022
Unsupervised Learning of Parsimonious General-Purpose Embeddings for User and Location Modeling
ACM Transactions on Information Systems
2018
An automated COVID-19 triage pipeline using artificial intelligence based on chest radiographs and clinical data
npj Digital Medicine
2022
Web page classification on child suitability
Proceedings of the 19th ACM international conference on Information and knowledge management
2010
Unsupervised Multivariate Time-Series Transformers for Seizure Identification on EEG
2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)
2022
Parameter-efficient Modularised Bias Mitigation via AdapterFusion
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics
2023
NEWTS: A Corpus for News Topic-Focused Summarization
Findings of the Association for Computational Linguistics: ACL 2022
2022
Supporting children's web search in school environments
Proceedings of the 4th Information Interaction in Context Symposium
2012
Introduction to the special issue on search as learning
Information Retrieval Journal
2017
Crowd-powered experts
Proceedings of the First International Workshop on Gamification for Information Retrieval
2014
Development of a Deep Learning Network to Classify Inferior Vena Cava Collapse to Predict Fluid Responsiveness
Journal of Ultrasound in Medicine
2020
Do “Undocumented Workers” == “Illegal Aliens”? Differentiating Denotation and Connotation in Vector Spaces
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
2020
Dynamic compression schemes for graph coloring
Bioinformatics
2018
Modelling Term Dependence with Copulas
Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval
2015
SIMSUM: Document-level Text Simplification via Simultaneous Summarization
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
2023
Machine learning to predict hemorrhage and thrombosis during extracorporeal membrane oxygenation
Critical Care
2020
Enriching Word Embeddings for Patent Retrieval with Global Context
Lecture Notes in Computer Science
2019
A Modern Perspective on Query Likelihood with Deep Generative Retrieval Models
Proceedings of the 2021 ACM SIGIR International Conference on Theory of Information Retrieval
2021
Computing Web-scale Topic Models using an Asynchronous Parameter Server
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval
2017
Copulas for information retrieval
Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
2013
Transcriptional profiles of pulmonary artery endothelial cells in pulmonary hypertension
Scientific Reports
2023
CATS: Customizable Abstractive Topic-based Summarization
ACM Transactions on Information Systems
2021
A Cross-Platform Collection of Social Network Profiles
Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval
2016
Exploiting User Comments for Audio-Visual Content Indexing and Retrieval
Lecture Notes in Computer Science
2013
YouTube Videos
Watching YouTube
2010
Search Result Explanations Improve Efficiency and Trust
Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval
2020
Want a coffee?
Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
2012
CODER: An efficient framework for improving retrieval through COntextual Document Embedding Reranking
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
2022
Exploiting Document Content for Efficient Aggregation of Crowdsourcing Votes
Proceedings of the 24th ACM International on Conference on Information and Knowledge Management
2015
Geo-spatial Domain Expertise in Microblogs
Lecture Notes in Computer Science
2014
Biomedical Question Answering via Weighted Neural Network Passage Retrieval
Lecture Notes in Computer Science
2018
Named Entity Recognition of traditional architectural text based on BERT
2021 International Conference on Culture-oriented Science & Technology (ICCST)
2021
Experimental IR Meets Multilinguality, Multimodality, and Interaction
Lecture Notes in Computer Science
2020
Implicit Negative Feedback in Clinical Information Retrieval
Swiss Medical Informatics
2016
Probabilistic Local Expert Retrieval
Lecture Notes in Computer Science
2016
Web Search Query Assistance Functionality for Young Audiences
Lecture Notes in Computer Science
2011
What Do VLMs NOTICE? A Mechanistic Interpretability Pipeline for Gaussian-Noise-free Text-Image Corruption and Evaluation
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)
2025
The Scholarly Impact of CLEF 2010–2017
The Information Retrieval Series
2019
Crowdsourced user interface testing for multimedia applications
Proceedings of the ACM multimedia 2012 workshop on Crowdsourcing for multimedia
2012
Towards Objective Quantification of Hand Tremors and Bradykinesia Using Contactless Sensors: A Systematic Review
Frontiers in Aging Neuroscience
2021
EmSe
Proceedings of the 4th Information Interaction in Context Symposium
2012
Self-Supervised Neural Topic Modeling
Findings of the Association for Computational Linguistics: EMNLP 2021
2021
Brown University at TREC Deep Learning 2019
Unknown Venue
2019
Model sensitivity analysis on arxiv
Unknown Venue
2018
Artificial intelligence-assisted care in medicine: a revolution or yet another blunt weapon?
European Heart Journal
2019
Active Content-Based Crowdsourcing Task Selection
Proceedings of the 25th ACM International on Conference on Information and Knowledge Management
2016
Neural Summarization of Electronic Health Records (Preprint)
Unknown Venue
2023
Search as Learning (SAL) Workshop 2016
Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval
2016
The downside of markup
Proceedings of the 21st ACM international conference on Information and knowledge management
2012
Exploring Facilitators and Barriers for Personalized Dietary Incentives Among Online Shoppers at Cardiovascular Risk and Key Informants to Inform an Automated Shopping Platform
Journal of Nutrition Education and Behavior
2025
K-Paths: Reasoning over Graph Paths for Drug Repurposing and Drug Interaction Prediction
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.2
2025
Towards Best Practices of Axiomatic Activation Patching in Information Retrieval
Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval
2025
Workshop on Explainability in Information Retrieval
Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval
2025
Retrieval Augmented Therapy Suggestion for Molecular Tumor Boards: Algorithmic Development and Validation Study
Journal of Medical Internet Research
2025
What’s Going On With Me and How Can I Better Manage My Health? The Potential of GPT-4 to Transform Discharge Letters Into Patient-Centered Letters to Enhance Patient Safety: Prospective, Exploratory Study
Journal of Medical Internet Research
2025
MechIR: A Mechanistic Interpretability Framework for Information Retrieval
Lecture Notes in Computer Science
2025
What’s Going On With Me and How Can I Better Manage My Health? The Potential of GPT-4 to Transform Discharge Letters Into Patient-Centered Letters to Enhance Patient Safety: Prospective, Exploratory Study (Preprint)
Unknown Venue
2024
A Language Model–Powered Simulated Patient With Automated Feedback for History Taking: Prospective Study
JMIR Medical Education
2024
Retrieval Augmented Zero-Shot Text Classification
Proceedings of the 2024 ACM SIGIR International Conference on Theory of Information Retrieval
2024
Retrieval Augmented Therapy Suggestion for Molecular Tumor Boards: Algorithmic Development and Validation Study (Preprint)
Unknown Venue
2024
Axiomatic Causal Interventions for Reverse Engineering Relevance Computation in Neural Retrieval Models
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval
2024
Evaluating Search System Explainability with Psychometrics and Crowdsourcing
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval
2024
Retrieval-Based Diagnostic Decision Support: Mixed Methods Study
JMIR Medical Informatics
2024
A Language Model–Powered Simulated Patient With Automated Feedback for History Taking: Prospective Study (Preprint)
Unknown Venue
2024
Learning to Make Rare and Complex Diagnoses With Generative AI Assistance: Qualitative Study of Popular Large Language Models
JMIR Medical Education
2024
Wasserstein adversarial learning based temporal knowledge graph embedding
Information Sciences
2024
Predicting Acute Brain Injury in Venoarterial Extracorporeal Membrane Oxygenation Patients with Tree-Based Machine Learning: Analysis of the Extracorporeal Life Support Organization Registry
Unknown Venue
2024
Utilizing Machine Learning to Predict Neurological Injury in Venovenous Extracorporeal Membrane Oxygenation Patients: An Extracorporeal Life Support Organization Registry Analysis
Unknown Venue
2023
Predictive Uncertainty-based Bias Mitigation in Ranking
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management
2023
Learning to Make Rare and Complex Diagnoses With Generative AI Assistance: Qualitative Study of Popular Large Language Models (Preprint)
Unknown Venue
2023
Retrieval-Based Diagnostic Decision Support: Mixed Methods Study (Preprint)
Unknown Venue
2023
Weakly supervised pneumonia localization in chest X‐rays using generative adversarial networks
Medical Physics
2021
Categorization of free-text drug orders using character-level recurrent neural networks
International Journal of Medical Informatics
2019
Dynamic compression schemes for graph coloring
Unknown Venue
2017
The Accuracy And Clinical Relevance of Chat GPT-4 in Triple Negative Breast Cancer Research
Acta Informatica Medica
2025
Report from the 4th Strategic Workshop on Information Retrieval in Lorne (SWIRL 2025)
ACM SIGIR Forum
2025
The topology of molecular representations and its influence on machine learning performance
Journal of Cheminformatics
2025
Artificial intelligence based real-time prediction of imminent heart failure hospitalisation in patients undergoing non-invasive telemedicine
Frontiers in Cardiovascular Medicine
2024
Identifying momentary suicidal ideation using machine learning in patients at high-risk for suicide
Journal of Affective Disorders
2024
Künstliche Intelligenz in der Medizin: Wo stehen wir heute, und was liegt vor uns?
Zeitschrift für Herz-,Thorax- und Gefäßchirurgie
2024
Short-term vital parameter forecasting in the intensive care unit: A benchmark study leveraging data from patients after cardiothoracic surgery
PLOS Digital Health
2024
One Third of Alcohol Use Disorder Diagnoses are Missed by ICD Coding
Substance Use & Addiction Journal
2024
Pre-operative lung ablation prediction using deep learning
European Radiology
2024
Interpretable machine learning-based predictive modeling of patient outcomes following cardiac surgery
The Journal of Thoracic and Cardiovascular Surgery
2025
Editorial: The Potential of Machine-learning in Pharmacogenetics, Pharmacogenomics and Pharmacoepidemiology: Volume II
Frontiers in Pharmacology
2023
AI-Controlled Closed-Loop Electrical Stimulation Implants: A Feasibility Study
Unknown Venue
2022
Delirium detection using wearable sensors and machine learning in patients with intracerebral hemorrhage
Frontiers in Neurology
2023
Neural text generation in regulatory medical writing
Frontiers in Pharmacology
2023
Risk Factors for Pediatric Sepsis in the Emergency Department
Pediatric Emergency Care
2023
Editorial: The Potential of Machine Learning in Pharmacogenetics, Pharmacogenomics and Pharmacoepidemiology
Frontiers in Pharmacology
2022
Correction to: COVID-19 mortality prediction in the intensive care unit with deep learning based on longitudinal chest X-rays and clinical data
European Radiology
2022
On the Role of “Digital Actors” in Entertainment-Based Virtual Worlds
The Oxford Handbook of Virtuality
2013
Machine learning and deep learning-based approaches in epilepsy
Unknown Venue
Quantifying the Risks of LLM- and Tool-assisted Rephrasing to Linguistic Diversity
Findings of the Association for Computational Linguistics: EMNLP 2025
2025
Position: Benchmarking is Broken - Don't Let AI be its Own Judge
Unknown Venue
2025
Pixels Versus Priors: Controlling Knowledge Priors in Vision-Language Models through Visual Counterfacts
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
2025
Pathway to Relevance: How Cross-Encoders Implement a Semantic Variant of BM25
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
2025
Interpretability Analysis of Arithmetic In-Context Learning in Large Language Models
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
2025
Paths Not Taken: Understanding and Mending the Multilingual Factual Recall Pipeline
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
2025
Beyond Contrastive Learning: Synthetic Data Enables List-wise Training with Multiple Levels of Relevance
Findings of the Association for Computational Linguistics: EMNLP 2025
2025
Re-Evaluating Evaluation for Multilingual Summarization
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
2024
Forgotten Polygons: Multimodal Large Language Models are Shape-Blind
Findings of the Association for Computational Linguistics: ACL 2025
2025
Evaluating Self-Generated Documents for Enhancing Retrieval-Augmented Generation with Large Language Models
Findings of the Association for Computational Linguistics: NAACL 2025
2025
One-Versus-Others Attention: Scalable Multimodal Integration for Biomedical Data
Biocomputing 2025
2024
Text Simplification via Adaptive Teaching
Findings of the Association for Computational Linguistics ACL 2024
2024
Stable On-Line Learning with Optimized Local Learning, But Minimal Change of the Global Output
2013 12th International Conference on Machine Learning and Applications
2013
Enhancing the Ranking Context of Dense Retrieval through Reciprocal Nearest Neighbors
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
2023
Outlier Dimensions Encode Task Specific Knowledge
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
2023
Pretraining on Interactions for Learning Grounded Affordance Representations
Proceedings of the 11th Joint Conference on Lexical and Computational Semantics
2022
American Medical Informatics Association (AMIA) 2007 Annual Symposium
Unknown Venue
2008
Inconsistent Ranking Assumptions in Medical Search and Their Downstream Consequences
Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval
2022
APA-RST: A Text Simplification Corpus with RST Annotations
Proceedings of the 4th Workshop on Computational Approaches to Discourse (CODI 2023)
2023
SOCCER: An Information-Sparse Discourse State Tracking Collection in the Sports Commentary Domain
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
2021
34th Annual Symposium on Biomedical and Health Informatics (AMIA 2010) conference report
ACM SIGHIT Record
2011
Web-Based Visualization of MeSH-Based PubMed/MEDLINE Statistics
Studies in Health Technology and Informatics
2019
Baxter Amia Automated Peritoneal Dialysis Cycler Set
Biomedical Safety & Standards
2024

Education

Technische Universiteit Delft

Ph.D. (Computer Science) / October, 2014

Delft

The University of Edinburgh

M.Sc. (Artificial Intelligence) / November, 2009

Edinburgh

FHDW Hannover

B.Sc. / 2008

Experience

University of Tübingen

Professor / 2022Present

Brown University

Manning Assistant Professor / 20182022

ETH Zurich

Postdoc / 20142018

University Hospital Tübingen

Scientific Director (Medical Data Integration Center) / 2022Present

Harvard University

Visiting Fellow / 20172017

codiag AG

Co-Founder & Chief Scientist / 20182022

CareCrowd

Co-Founder & CTO / 2024Present

Join Carsten on NotedSource!
Join Now

At NotedSource, we believe that professors, post-docs, scientists and other researchers have deep, untapped knowledge and expertise that can be leveraged to drive innovation within companies. NotedSource is committed to bridging the gap between academia and industry by providing a platform for collaboration with industry and networking with other researchers.

For industry, NotedSource identifies the right academic experts in 24 hours to help organizations build and grow. With a platform of thousands of knowledgeable PhDs, scientists, and industry experts, NotedSource makes connecting and collaborating easy.

For academic researchers such as professors, post-docs, and Ph.D.s, NotedSource provides tools to discover and connect to your colleagues with messaging and news feeds, in addition to the opportunity to be paid for your collaboration with vetted partners.

Expert Institutions
NotedSource has experts from Stanford University
Expert institutions using NotedSource include Oxfort University
Experts from McGill have used NotedSource to share their expertise
University of Chicago experts have used NotedSource
MIT researchers have used NotedSource
Proudly trusted by
Microsoft uses NotedSource for academic partnerships
Johnson & Johnson academic research projects on NotedSource
ProQuest (Clarivate) uses NotedSource as their industry academia platform
Slamom consulting engages academics for research collaboration on NotedSource
Omnicom and OMG find academics on notedsource
Unilever research project have used NotedSource to engage academic experts

Connect with researchers and scientists like Carsten Eickhoff, Ph.D. on NotedSource to help your company with innovation, research, R&D, L&D, and more.