SENECA

An Entity-Driven Framework for Abstractive Summarization

Abstractive summarization systems aim to produce more coherent and concise summaries than their extractive counterparts. Popular neural models have achieved impressive results for single-document summarization, yet their outputs are often incoherent and unfaithful to the input.

In this paper, we introduce SENECA, a novel System for ENtity-drivEn Coherent Abstractive summarization framework that leverages entity information to generate informative and coherent abstracts. Our framework takes a two-step approach: (1) an entity-aware content selection module first identifies salient sentences from input, then (2) an abstract generation module conducts cross-sentence information compression and abstraction to generate the final summary, which is trained with rewards to promote coherence, conciseness, and clarity. The two components are further connected using reinforcement learning.

Automatic evaluation shows that our model significantly out performs previous state-of-the-arts based on ROUGE and our proposed coherence measures on the New York Times corpus. Human judges further rate our system summaries as more informative and coherent than those by popular summarization models.

Publication(s)

Stay tuned for more updates!!!