Research
My experience spans various areas within Machine Learning, with a recent focus on AI Evaluation and Trustworthy AI. I am broadly interested in drawing insights from established theories to advance AI development, while exploring the new life of traditional methods in the context of contemporary AI.
|
|
Reliable and Efficient Amortized Model-based Evaluation
Sang Truong,
Yuheng Tu,
Percy Liang,
Bo Li,
Sanmi Koyejo
Under review
PDF
AI Evaluation with statistical model and question generator.
Evaluate 184 LLMs across 25 datasets reliably and efficiently with Item Response Theory (IRT). Introduce amortized calibration to reduce the cost with minimal sacrifice of accuracy. Fine-Tune Llama-3-8B to generate questions conditioned on item parameters.
|
|
AIR-BENCH 2024: A Safety Benchmark Based on Risk Categories from Regulations and Policies
Yi Zeng*,
Yu Yang*,
Andy Zhou*,
Jeffrey Ziwei Tan*,
Yuheng Tu*,
Yifan Mai*,
Kevin Klyman,
Minzhou Pan,
Ruoxi Jia,
Dawn Song,
Percy Liang,
Bo Li
ICLR 2025 Spotlight
arXiv /
Code
Safety benchmark aligning with regulations and policies.
Generate 5,694 detailed and diverse instruction prompts across 314 risk categories and 3 language styles. Evaluate 22 leading LLMs with GPT-4o as a judge and category-specific system prompts.
|
|
Federated Learning Algorithms pursuing Gradient Compression
Guojun Chen,
Kaixuan Xie,
Yuheng Tu,
Tiecheng Song,
Yinfei Xu,
Jing Hu,
Lun Xin
IEEE Communications Letters (COMML)
PDF /
Code /
COMML
Gradient compression problem in Federated Learning.
Develop the NQFL algorithm which normalizes the gradients and quantizes them with the Lloyd-Max quantizer. Implement NQFL along with 3 comparative algorithms: QSGD, AdaQuantFL, and SLMQ in FedML framework.
|
|
Fast Design of Metasurface-based Microwave Absorber Using the Neuro-TF Approach
Yuheng Tu,
Jianan Liu,
Tian Qiu,
Yunlang Cai,
Jianan Zhang,
Jian Wei You,
Tie Jun Cui
Photonics & Electromagnetics Research Symposium (PIERS) 2023
PDF /
PIERS 2023
Model structure problem in Supervised Regression.
Develop the neuro-TF model (combines neural networks and pole-residue-based transfer function) which provides accurate and fast prediction of the EM behavior of a metasurface and thus greatly accelerate the design process.
|
|