cv

Basics

Name Zeyu (Steven) Zhang
Label PhD Student
Email zeyuzhang2028@u.northwestern.edu
Url https://ZeyuZhang1901.github.io
Summary Third-year PhD student at Northwestern University researching trustworthy AI.

Work

  • 2025.06 - Present
    Graduate Researcher - Temporal Knowledge Leakage Detection
    Northwestern University
    Developing methods to detect temporal knowledge leakage in LLMs.
    • Formalized temporal knowledge leakage problem and introduced a four-phase detection pipeline
    • Computed Shapley values and developed Shapley-weighted Decision-Critical Leakage Rate metric
    • Proposed Temporal LLM Agent architecture with iterative claim verification to prevent leakage
    • Demonstrated significant leakage reduction across legal, salary, and stock ranking tasks
  • 2024.06 - 2025.01
    Graduate Researcher - Factual Memorization in LLMs
    Northwestern University
    Analysis of LLM memorization behaviors under SFT and DPO. Qualifying examination paper.
    • Analyzed LLM memorization behaviors under SFT and DPO fine-tuning paradigms
    • Reproduced and extended findings from Stanford's Finetunebench
    • Proposed formal distinction between passive and positive memorization
    • Designed temporal generalization experiments with system prompts to mitigate overfitting
  • 2024.01 - 2025.05
    Graduate Researcher - LLM Interpretability (LAMP)
    Northwestern University
    Developed LAMP for interpreting black-box LLMs. Accepted as Spotlight at AISTATS 2026.
    • Developed LAMP framework for interpreting black-box LLMs by fitting locally linear surrogate models
    • Designed perturbation-based method to probe model sensitivity without internal access
    • Enabled practical interpretability for proprietary LLMs through self-reported explanations
    • Conducted experiments across sentiment, controversial-topic detection, and harmful auditing
  • 2023.09 - 2024.05
    Graduate Researcher - Unified Off-Policy Learning to Rank
    Northwestern University
    Unified framework for off-policy Learning to Rank. Published at NeurIPS 2023.
    • Formulated unified framework modeling various click models in LTR as MDP
    • Proposed Click model-agnostic Unified Off-Policy Learning to Rank (CUOLR) algorithm
    • Demonstrated effective application of offline RL methods (DQN, SAC, BCQ, CQL) to LTR
    • Achieved state-of-the-art performance across heterogeneous click models

Education

  • 2023.09 - 2028.06

    Evanston, IL, USA

    Doctor of Philosophy
    Northwestern University
    Statistics and Data Science
  • 2020.09 - 2023.06

    Hefei, Anhui, P.R.China

    Certificate (Minor)
    University of Science and Technology of China (USTC)
    Artificial Intelligence
    • Talent Program in Artificial Intelligence
    • Outstanding Undergraduate Honorary Rank (top 5%)
  • 2019.09 - 2023.06

    Hefei, Anhui, P.R.China

    Bachelor of Engineering
    University of Science and Technology of China (USTC)
    Electronic Information Engineering
    • Wang Xiaomo Talent Program in Cyber Science and Technology
    • Talent Program in Information Science and Technology
    • Rank: 5/213 in School of Information Science and Technology

Awards

Skills

Programming Languages
Python
R
C
C++
Java
Verilog
Bash
LaTeX
Markdown
Software & Platforms
Visual Studio Code
Cursor
RStudio
HuggingFace
GitHub
Git
OpenAI Platform
Soft Skills
Teamwork
Event Management
Writing
Public Speaking
Time Management

Languages

Chinese
Native speaker
English
Fluent

Interests

Research
Large Language Models
Knowledge Leakage Detection
Reinforcement Learning
Model Interpretability
Trustworthy AI
Fitness & Sports
Basketball
Long-distance Running
Structured Workout Programs

Projects

  • 2021.04 - 2021.11
    Signal Distortion Measurement Device Design
    High-precision signal distortion measurement device achieving 0.5% error.
    • Applied Hanning window functions to reduce spectrum leakage effectively
    • Designed algorithm to detect center spectrum by aggregating energy from nearby lines
    • Developed LCD interface to visualize data and input analog signals