Trustworthy LLMs
Privacy auditing, leakage evaluation, and reliability under deployment constraints.
Shuying Cao
Benchmarks test what models can answer; my work studies how foundation models behave in real contexts - how they infer intent, protect private information, and make reliable choices among valid options.

Visiting Student Researcher, Stanford University Stanford, CA | shuyingc@stanford.edu
I am a M.S. student in Computer Science at the University of Southern California, advised by Prof. Sai Praneeth Karimireddy, and a Visiting Student Researcher at Stanford University, advised by Prof. Michael Zeineh.
My research studies foundation models beyond benchmark correctness, focusing on how they infer user intent, expose or protect private context, and choose among multiple valid responses in real-world settings. I develop empirical auditing and evaluation methods for trustworthy and human-centered AI, with applications in privacy-preserving in-context learning, LLM generation behavior, persona systems, and medical AI agents.
Previously, I received my B.Eng. in Geodesy and Geomatics Engineering from Wuhan University, advised by Prof. Zhenzhong Chen. I was also a visiting student at UC Berkeley, where I worked with Prof. Joseph E. Gonzalez and his former Ph.D. student Tianjun Zhang on visual-capable chatbot systems based on instruction-tuned language models.
Privacy auditing, leakage evaluation, and reliability under deployment constraints.
Understanding how models choose among multiple valid outputs, and why generation collapses.
Studying intent, persona, and medical AI agents in human-facing settings.
* = equal contribution.

arXiv preprint arXiv:2512.16059; ACL 2025 L2M2 Workshop
ContextLeak is an empirical auditing framework for measuring information leakage in private in-context learning methods using canary insertion and targeted adversarial queries.
An auditing framework for empirically measuring information leakage in private in-context learning methods.
project page →Studying how large language models select from multiple valid outputs and why diversity collapses in generation.
project page →A platform for creating, editing, and sharing persistent AI personas with memory and interaction boundaries.
project page →Visiting Student Researcher
M.S. Computer Science
Visiting student; visual-capable chatbot systems based on instruction-tuned language models.
M.S. in Computer Science
B.Eng. in Geodesy and Geomatics Engineering
Trustworthy AI, privacy auditing, LLM behavior, human-centered AI, medical AI agents.
Empirical auditing, canary insertion, targeted adversarial queries, model behavior analysis.
Large language models, instruction-tuned systems, visual-capable chatbots, AI persona platforms.