Portfolio

Please find my portfolio here: pdf link.

SimpleFSDP: Automatic Selective Unsharding for Efficient ML Systems & Distributed Training

Research Scientist Intern at Meta Platforms – Ads Training Infra
Mentor: Shuai Yang | May 2025 – Aug 2025 | Sunnyvale, CA Project Details Doc

Selective Unsharding improves FSDP throughput by strategically retaining parameters that reduce redundant communication. It leverages SimpleFSDP’s PT2-friendly graph structure to perform graph-level memory–communication co-optimization.

Meta SimpleFSDP Selective Unsharding Figure 1 Meta SimpleFSDP Selective Unsharding Figure 2

DiT Training Simulator: Large-Scale Video Diffusion / LLM Training Simulation

Research Scientist Intern at ByteDance – Seed (ML Systems Group)
Mentors: Zhihao Bai, Yanghua Peng | May 2024 – Aug 2024 | Bellevue, WA

A perf-accurate simulator for Diffusion Transformer (DiT) and other LLM models at 1–1000 GPU scale with multi-parallelism, timeline visualization, and memory modeling.


Out-of-GPU-Core LLM Training System (OOGC-LLM)

Lead Researcher (with Prof. Xuehai Qian & Yikang Yue & Yuxuan Liu) | May 2023 – Jan 2024
Purdue University, West Lafayette, IN

A holistic rethinking of ZeRO-Offload / ZeRO-3 limitations, using multi-layer prefetching, lazy all-gather, dynamic memory management, CPU/GPU pipelining, and MoE-aware computation placement.

OOGC-LLM Architecture Figure 1 OOGC-LLM Architecture Figure 2

CPU Execution Engines: Inference System for ML Models on CPU

Research Assistant (with Prof. Xuehai Qian & Gengyu Rao) | Aug 2022 – Dec 2022
Purdue University, West Lafayette, IN

A CPU-native depth-wise tensor execution model for efficient inference without GPU accelerators.

CPU Inference System Diagram


D-AirPatrol: Drone-based Traffic Monitoring

MobiCom’24 Best Poster Award
Author (with Jiaxin Du & Prof. Chunyi Peng) | Jan 2024 – May 2024
Purdue University, West Lafayette, IN

Designed a drone-to-edge system for real-time vehicle detection & speed estimation.

AirPatrol 1 AirPatrol 2

AirLab Platform: UAV Systems & Autonomous Drone Platforms

Full-Stack Developer (with Jiaxin Du & Prof. Chunyi Peng)
Purdue University, West Lafayette, IN

A complete drone experimentation ecosystem: flight logging, DB processing, dashboard management, and map-based visualization.