Summarizing Microblogs During Emergency Events: A Comparison of Extractive Summarization Algorithms

Published in International Conference on Emerging Technologies in Data Mining and Information Security, 2019

Abstract: Microblogging sites, notably Twitter, have become important sources of real-time situational information during emergency events. Since hundreds to thousands of microblogs (tweets) are generally posted on Twitter during an emergency event, manually going through every tweet is not feasible. Hence, summarization of microblogs posted during emergency events has become an important problem in recent years. Several summarization algorithms have been proposed in the literature, both for general document summarization and specifically for summarization of microblogs. However, to our knowledge, there has not been any systematic analysis of which algorithms are more suitable for summarizing microblogs posted during disasters. In this work, we evaluate and compare the performance of 8 extractive summarization algorithms on the task of summarizing microblogs posted during emergency events. Apart from comparing the performance of the algorithms, we also find significant differences among the summaries produced by different algorithms over the same input data.

Paper PDF

Recommended Citation: Dutta, S., Chandra, V., Mehra, K., Ghatak, S., Das, A. & Ghosh, S. (2019). "Summarizing Microblogs during Emergency Events: A Comparison of Extractive Summarization Algorithms." International Conference on Emerging Technologies in Data Mining and Information Security (IEMIS), pp. 859-872. https://link.springer.com/chapter/10.1007%2F978-981-13-1498-8_76