Adaptive Preference Aggregation

Authors

Benjamin Heymann

Abstract

AI alignment, the challenge of ensuring that AI systems act in accordance with human values, has emerged as a critical problem in the development of systems such as foundation models and recommender systems. Still, the currently dominant approach, reinforcement learning from human feedback (RLHF), faces known theoretical limitations in aggregating diverse human preferences. Social choice theory provides a framework for aggregating preferences, but it was not developed for the multidimensional applications typical of AI. Leveraging insights from a recently published urn process, this work introduces a preference aggregation strategy that adapts to the user's context and inherits the desirable properties of the maximal lottery, a Condorcet-consistent solution concept.
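
To make the solution concept concrete, here is a minimal sketch (not from the paper) of how a maximal lottery can be computed once pairwise majority margins are known: it is the maximin strategy of the symmetric zero-sum game whose payoff matrix is the margin matrix, obtainable with an off-the-shelf LP solver. The margin matrix `M` and the helper `maximal_lottery` are illustrative assumptions; the paper's contribution is an adaptive, urn-based aggregation strategy, not this batch LP computation.

```python
# Minimal sketch: a maximal lottery from a pairwise majority margin matrix,
# computed by linear programming. Illustrates the solution concept only;
# this is NOT the paper's adaptive urn-based method.
import numpy as np
from scipy.optimize import linprog

def maximal_lottery(margins: np.ndarray) -> np.ndarray:
    """Return a maximal lottery for a skew-symmetric margin matrix.

    margins[i, j] = (#voters preferring i to j) - (#voters preferring j to i).
    A maximal lottery is a distribution p with p @ margins >= 0 componentwise,
    i.e. a maximin strategy of the zero-sum game with payoff matrix `margins`.
    """
    n = margins.shape[0]
    # Variables x = (p_1, ..., p_n, v); maximize the game value v.
    c = np.zeros(n + 1)
    c[-1] = -1.0  # linprog minimizes, so minimize -v
    # For every pure response j: sum_i p_i * margins[i, j] >= v,
    # rewritten as -margins[:, j] @ p + v <= 0.
    A_ub = np.hstack([-margins.T, np.ones((n, 1))])
    b_ub = np.zeros(n)
    # Probabilities sum to one; v is free in sign.
    A_eq = np.append(np.ones(n), 0.0).reshape(1, -1)
    b_eq = np.array([1.0])
    bounds = [(0, None)] * n + [(None, None)]
    res = linprog(c, A_ub=A_ub, b_ub=b_ub, A_eq=A_eq, b_eq=b_eq,
                  bounds=bounds, method="highs")
    return res.x[:n]

# Condorcet cycle a > b > c > a: no deterministic Condorcet winner exists,
# yet the maximal lottery is well defined (uniform, 1/3 each).
M = np.array([[0, 1, -1],
              [-1, 0, 1],
              [1, -1, 0]], dtype=float)
print(maximal_lottery(M))  # approximately [1/3, 1/3, 1/3]
```

The cycle example shows why the concept is inherently randomized: with intransitive majority preferences no single alternative beats all others, but a lottery that no other lottery is majority-preferred to always exists.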
