Zepu Wang 王泽普

CV

My Curriculum Vitae, updated in 11/13/2023

Education

Master of Science in Engineering in Data Science, University of Pennsylvania
08/2022 ~ 08/2024
GPA: 3.9/4.0
Relevant Courses: Machine Learning, Deep Learning, Graph Neural Networks, Natural Language Processing, Time Series for Business, Statistics for Data Science
Bachelor of Science in Data Science, Duke University/Duke Kunshan University 08/2018 ~ 05/2022
GPA: 3.95/4.0
Relevant Courses: Machine Learning, Deep Learning, Stochastic Process, Mathematical Modeling
Awards: Merit-based Undergraduate Official Scholarship, Summa Cum Laude (Undergraduate highest honor), Dean’s List (2018, 2019, 2020, 2021, 2022)

Research Experience

Online Research Intern, Massachusetts Institute of Technology (MIT)
02/2023 ~ 08/2023
Supervisor: Prof. Jinhua Zhao, Prof. Shenhao Wang
Project 1: Uncertainty Quantification in Traffic Data Imputation
• Utilized graph attention layers and bidirectional recurrent units to capture spatio-temporal traffic data patterns and predicted uncertainty of traffic data.
Project 2:Mixture Models for Uncertainty Quantification in Sparse Travel Demand Prediction
• Used mixture models (Laplace, Poisson, Gaussian) to approximate complex travel demand distribution, addressing high zero occurrences. link
Research Intern, Duke Kunshan University
07/2021 ~ present
Supervisor: Prof. Peng Sun, Ph.D.
Project 1: A Velocity-based Model in Traffic Flow Prediction
• Conducted evaluations and comparisons among popular traffic flow prediction models (Linear Regression, SVR, Decision Tree, Random Forest, LSTMs, and GRUs) in a single intersection based on accuracy (RMSE,MAE,R^2).
• Increased 2% accuracy (RMSE) on average by considering the vehicles’ speed in surrounding intersections to adjust the original results.
Project 2: A Hybrid Model in Traffic Flow Prediction!
• Combined Long Short-Term Memory (LSTM) neural networks, Wavelet Analysis and Spectral Analysis to design an accurate traffic flow forecasting algorithm.
Project 3: Traffic Flow Prediction using Auto-encoder
• Applied Auto-encoder as a dimension reduction technique for large road networks and increased the time efficiency by 27.4% with sacrifice of only 5% accuracy (MSE).
• Provided a comprehensive analysis of trade-off between original data information loss and noises’ filtration from the original data while applying dimension reduction technique.
Research Intern, Westlake University
07/2021 ~ 08/2021
Supervisor: Prof. Stan Z. Li
Project: Protein-Protein Interactions (PPIs)
• Predicted PPIs based on primary protein structures, using traditional natural language processing (NLP) methods (RNNs) and advanced NLP methods (Transformers).
• Increased accuracy by 2% (MSE) in RNNs methods by using the trick of pad sequences.