Towards More Reliable Generative AI: Probing Failure Modes, Harnessing Test-Time Inference, and Interpreting Diffusion Models Abstract: In this talk, we explore strategies for making generative [...]
Learning From Aggregated Responses: Improving Model Utility Under Privacy Constraints Abstract: In many real-world scenarios, training data is aggregated before being shared with the learner [...]
Simulating Emergent LLM Social Behaviors in Multi-agent Systems Abstract: Large language model (LLM)–based agents are increasingly being deployed in multi-agent environments, introducing unprecedented risks of [...]