Katarzyna (Kasia) Kobalczyk
Large Language Models
Preference Learning for AI Alignment: a Causal Perspective
We propose a causal framework for preference learning that defines and addresses challenges such as causal misidentification, preference heterogeneity, and, crucially, confounding due to user-specific objectives.
Kasia Kobalczyk, Mihaela van der Schaar
May 1, 2025
The Synergy of LLMs & RL Unlocks Offline Learning of Generalizable Language-Conditioned Policies with Low-fidelity Data
We train LLM agents as language-conditioned policies without requiring expensive labeled data or online experimentation. The framework leverages LLMs to enable the use of unlabeled datasets and to improve generalization to unseen goals and states.
Thomas Pouplin, Kasia Kobalczyk, Hao Sun, Mihaela van der Schaar
May 1, 2025
Active Task Disambiguation with LLMs
This paper formalizes ambiguity in tasks specified in natural language and frames task disambiguation through Bayesian Experimental Design, leading to more effective strategies for LLMs to pose clarifying questions.
Kasia Kobalczyk, Nicolás Astorga, Tennison Liu, Mihaela van der Schaar
Jan 22, 2025