Hi, I'm a researcher at Google Research. Before that, I was a Google AI Resident, where I was lucky to be advised by Yin Cui and Tsung-Yi Lin and worked on open-vocabulary visual recognition. Previously, I obtained my M.S. in Computer Science at Stanford University, and my B.E. in Computer Science at Zhejiang University. I spent half a year (9/2018-3/2019) and summer 2016 working happily with Prof. Yong Jae Lee at UC Davis. In summer 2018, I interned at TuSimple and worked on 3D point cloud scene flow estimation with
Currently, I'm interested in video generation and open-vocabulary visual recognition.
[paper]
[code]
[demo video]
Course project for CS224W: Machine Learning with Graphs.
Course project for CS348B: Image Synthesis Techniques.
A robust iterative license plate character segmentation algorithm, and a license plate detection system with robust skew and slant correction to improve character segmentation.
[character segmentation report]
[detection report]
Course project for CS231A: Computer Vision, From 3D Reconstruction to Recognition.