Large Language Models

Eliciting Nmuerical Predictive Distributions of LLMs without Autoregression

We train a set of simple probes to recover the predictive distributions of LLMs, with uncertainty estimates.

Julianna Piskorz, Kasia Kobalczyk, Mihaela van der Schaar

Jan 30, 2026

Preference Learning for AI Alignment: a Causal Perspective

We propose to adopt a causal framework for preference learning to define and address challenges like causal misidentification, preference heterogeneity, and crucially, confounding due to user-specific objectives.

Kasia Kobalczyk, Mihaela van der Schaar

May 1, 2025

Preference Learning for AI Alignment: a Causal Perspective

The Synergy of LLMs & RL Unlocks Offline Learning of Generalizable Language-Conditioned Policies with Low-fidelity Data

We train LLM agents as Language-conditioned policies without requiring expensive labeled data or online experimentation. The framework leverages LLMs to enable the use of unlabeled datasets and improve generalization to unseen goals and states.

Thomas Pouplin, Kasia Kobalczyk, Hao Sun, Mihaela van der Schaar

May 1, 2025

Active Task Disambiguation with LLMs

This paper formalizes task ambiguity in tasks specified in natural language and frames task disambiguation through Bayesian Experimental Design, leading to more effective strategies for LLMs to pose clarifying questions.

Kasia Kobalczyk, Nicolás Astorga, Tennison Liu, Mihaela van der Schaar

Jan 22, 2025