TIMan Logo Text Information Management Group
Home | About| People | Projects| Publications | Downloads| Links| Wiki

Multi-Faceted Comparative Text Summarization
[ Team ] [ Results ] [ Publications ] [ Funding ]

1. Introduction

Text summarization is to generate a concise summary of documents to help a user quickly digest information and has become increasingly important due to the explosive growth of information. Most existing summarization methods can only generate an unstructured summary with a simple list of sentences. In many applications, however, we want to generate a (more structured) multi-faceted comparative summary, in which sentences are grouped into multiple facets and compared across different views. For example, a summary about laptop opinions may group sentences into facets such as "battery life" and "memory" and separate sentences with positive and negative opinions in each facet.

This project aims to systematically study this new summarization problem (called multi-faceted comparative summarization). We will develop general probabilistic approaches that can be applied to multiple instances of the problem in different domains. The basic idea is to use probabilistic mixture models to model and extract the multiple facets and multiple views of each facet in a set of text documents to be summarized. The extracted facets and views are then used to generate facet labels and select sentences for different facet-view combinations.

2. Team members

3. Major Research Results

4. Selected Publications (See all publications)

5. Funding Support

[ Team ] [ Results ] [ Publications ] [ Funding ]

TIMan Logo Text Information Management Group
Home | About| People | Projects| Publications | Downloads| Links| Wiki