I am a postdoctoral researcher in the CAUSALab at the Harvard T.H. Chan School of Public Health, advised by Prof. Miguel Hernán.
My research lies at the intersection of causal inference and computer science, with a focus on advancing classical causal inference methods by integrating modern machine learning techniques. To bridge the gap between causal inference methodology and practical application, I also develop scientific computing tools that enable scalable and reproducible implementation of causal inference methods. In particular, I have been involved in open-source software development, leading and contributing to packages including pygformula, gformula, and gformulaICE, which support causal inference analyses for longitudinal observational data.
Previously, I received my Ph.D. in Computer Science from the School of Computer Science at Peking University, where I was advised by Prof. Yizhou Wang. Prior to my Ph.D., I obtained my bachelor's degree in Statistics from the School of Mathematics and Statistics at Wuhan University.
* equal contribution · Full list on Google Scholar →
View all repositories on GitHub →
The pygformula implements the parametric g-formula in Python. The parametric g-formula (Robins, 1986) uses longitudinal data with time-varying treatments and confounders to estimate the risk or mean of an outcome under hypothetical treatment strategies specified by the user.
The gfoRmula package implements the parametric g-formula in R. The parametric g-formula (Robins, 1986) uses longitudinal data with time-varying treatments and confounders to estimate the risk or mean of an outcome under hypothetical treatment strategies specified by the user.
The gformulaICE package implements parametric iterative conditional expectation (ICE) estimators of the plug-in g-formula in R.