Ph.D. Candidate

Data Mining Research Group
Database and Information Systems Laboratory
Department of Computer Science
University of Illinois at Urbana-Champaign

My CV (last updated Sep. 2016)

Google Scholar | LinkedIn | GitHub

About Me

I'm a fifth-year Ph.D. student in CS@Illinois, a member of the Data Mining Research Group led by Professor Jiawei Han, and a Google PhD Fellow. I collaborate closely with Professor Heng Ji at RPI and Dr. Clare R. Voss at the US Army Research Lab. Before that, I received my Bachelor's degree from Zhejiang University in 2012.

My thesis research is on Data-Driven Text Structure Extraction. I develop methods that are domain-independent, language-independent, distantly supervised, and scalable, to mine various structures (phrases, entities, relations) from massive text data. This yields schema-rich databases and information networks, from which people can derive knowledge by applying different data mining functions (ranking, link prediction, classification, clustering).

In the past, I was lucky to work with Kuansan Wang and Yuanhua Lv at ISRC, and Surajit Chaudhuri and Tao Cheng (now at Pinterest) at DMX, in Microsoft Research. I spent the summer of 2016 at Pinterest. Dr. Cong Yu is my Fellowship mentor at Google Research. I am honored to have received the Outstanding Graduate Student Award from UIUC, the Yahoo!-DAIS Research Excellence Award, and the Microsoft Young Fellowship from Microsoft Research.

What's New

  • Oct. 2016 - Our paper on Comparative Document Analysis has been accepted to WSDM 2017.
  • Oct. 2016 - I will serve on the PC of WWW 2017 (Social Network Analysis Track).
  • Sep. 2016 - Listed as the 3rd KDD Rising Star by Microsoft Academic Search.
  • Sep. 2016 - I will serve on the PC of SDM 2017, Houston, Texas.
  • July 2016 - Three papers accepted to EMNLP 2016, CIKM 2016, and NAACL 2016.
  • May 2016 - Our latest embedding tool for Label Noise Reduction in Entity Typing is available on GitHub. The paper has been accepted to KDD 2016.
  • Apr. 2016 - Received the C. W. Gear Outstanding Graduate Student Award from CS@Illinois, the highest honor given to one graduate student every year.
  • Mar. 2016 - Thrilled to be one of the 52 Google Global PhD Fellows (25 in North America), and the sole winner in Structured Data. [CS@ILLINOIS]
  • Mar. 2016 - Our tutorial on "Automatic Entity Recognition and Typing in Massive Text Data" has been accepted to SIGMOD 2016. Joint work with Ahmed El-Kishky, Heng Ji, and Jiawei Han.

  • Last Modified: 10/21/2016