Deep Keyphrase Generation

Abstract

Keyphrase provides highly-summative information that can be effectively used for understanding, organizing and retrieving text content. Though previous studies have provided many workable solutions for automated keyphrase extraction, they commonly divided the to-be-summarized content into multiple text chunks, then ranked and selected the most meaningful ones. These approaches could neither identify keyphrases that do not appear in the text, nor capture the real semantic meaning behind the text. We propose a generative model for keyphrase prediction with an encoder-decoder framework, which can effectively overcome the above drawbacks. We name it as deep keyphrase generation since it attempts to capture the deep semantic meaning of the content with a deep learning method. Empirical analysis on six datasets demonstrates that our proposed model not only achieves a significant performance boost on extracting keyphrases that appear in the source text, but also can generate absent keyphrases based on the semantic meaning of the text.

Publication
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Daqing He
Daqing He
Co-Principal Investigators, Professor in Information Science

The goal of my research aims to advance people’s capabilities of accessing online information with the support of various cutting-edge intelligent and social information technologies.

Peter Brusilovsky
Peter Brusilovsky
Professor in Information Science

Peter has been working in the field of adaptive, user modeling, and intelligent user interfaces for over 30 years.

Yu Chi
Yu Chi
PhD student in Information Science

In general, I use both qualitative and quantitative methods to explores how web search could support users to achieve their information needs and promote fairness in education and healthcare.