Jerry (Zhi-Yang) He

Jerry Zhi-Yang He

hzyjerry at berkeley dot edu

I work on building AI systems that understand, adapt to, and stay aligned with the people they serve. Off the clock I enjoy coffee, music, running, basketball, and Jiu-Jitsu.

About

I am a Research Scientist at ByteDance Seed, working on post-training and personalization for large language models. I received my Ph.D. in EECS from UC Berkeley (graduated Dec 2024), advised by Prof. Anca Dragan in the InterACT Lab. Before Berkeley I was at Stanford (class of 2018), where I worked with Amir Zamir, Silvio Savarese, and Dorsa Sadigh at SVL and ILIAD.

I'm interested in making digital and physical agents more powerful, reliable, and safely aligned. Currently I'm focused on agentic systems, reasoning models, efficient large language models, and RL with verifiable rewards (RLVR). At ByteDance Seed I've contributed to Seed-OSS and the Seed-Thinking series (v1.5, v1.6, v2.0, and beyond).

During my Ph.D., I worked on the safety of human–robot systems from three angles: (1) causal confusion — how learned reward models latch onto spurious correlates of human preference, and how to diagnose it; (2) adversarial perturbations — quantifying robustness of assistive policies along a natural–adversarial frontier; and (3) steerability — making model behavior controllable and personalizable at inference time without retraining.

News

  • Aug 2025 Seed-OSS-36B released open-source under Apache-2.0.
  • Apr 2025 Seed1.5-Thinking tech report on arXiv.
  • Jan 2025 Context Steering accepted at ICLR 2025.
  • Aug 2023 Quantifying Assistive Robustness accepted at CoRL 2023.
  • Jan 2023 Causal Confusion in Preference-Based Reward Learning accepted at ICLR 2023.

Tech Reports

ByteDance Seed model releases I've contributed to.

Selected Publications

Full list on Google Scholar · * indicates equal contribution

Selected Projects

  • Gibson Environment Co-led the development of a simulation platform for real-world active perception. Recipient of the 2018 NVIDIA Pioneering Research Award.
  • ThingPedia Open-source platform for a personalized Internet of Things, with Prof. Monica Lam.