Research
Notation: (α) authors listed alphabetically (r) equal contribution with randomized order among authors marked with †
How RLHF Amplifies Sycophancy
Robust AI Evaluation through Maximal Lotteries
Incentives in Federated Learning with Heterogeneous Agents
Generative Social Choice
Extended version of the paper published at EC 2024
Pairwise Calibrated Rewards for Pluralistic Alignment
SOAP: Improving and Stabilizing Shampoo using Adam
A New Perspective on Shampoo’s Preconditioner
Spotlight Top 2.1% of submissions
Axioms for AI Alignment from Human Feedback
Spotlight Top 2.1% of submissions
Learning Social Welfare Functions
Bias Detection Via Signaling
Optimal Engagement-Diversity Tradeoffs in Social Media
No publications match the current filters.
