INFORMATION TECHNOLOGY AND CONTROL, vol.43, no.4, pp.433-439, 2014 (SCI-Expanded)
We studied outlier document filtering (ODF) for extractive sentence summarization. On the DUC 2006 data, our results exceed the average of the participating systems. We then added extractive paragraph summarization to the same system. Surprisingly, the ROUGE results of the two approaches are nearly the same: extractive paragraph summarization achieves better precision, while extractive sentence summarization performs slightly better on recall and on the F-score, which is the harmonic mean of recall and precision. ODF is effective for both extractive sentence and extractive paragraph summarization. The similarity metric proposed in the article (match percent) prevents longer sentences or paragraphs from dominating shorter ones during selection. As a result, ODF offers the flexibility to extract paragraphs instead of sentences, for the sake of simplicity, readability, and a lower workload.
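
The relationship between precision, recall, and F-score mentioned above can be illustrated with a minimal sketch. It assumes a simplified unigram-overlap score in the spirit of ROUGE-1 (whitespace tokenization, lowercasing, no stemming or stop-word handling); it is not the official ROUGE toolkit and not the evaluation setup used in the article. The function name `unigram_prf` and the example strings are hypothetical.

```python
from collections import Counter
from typing import Tuple


def unigram_prf(candidate: str, reference: str) -> Tuple[float, float, float]:
    """Return (precision, recall, F-score) from clipped unigram overlap.

    Assumption: a simplified ROUGE-1-style score, for illustration only.
    """
    cand_counts = Counter(candidate.lower().split())
    ref_counts = Counter(reference.lower().split())

    # Counter intersection keeps the minimum count per token (clipped overlap).
    overlap = sum((cand_counts & ref_counts).values())

    cand_total = sum(cand_counts.values())
    ref_total = sum(ref_counts.values())

    precision = overlap / cand_total if cand_total else 0.0
    recall = overlap / ref_total if ref_total else 0.0

    # F-score: harmonic mean of precision and recall.
    if precision + recall == 0:
        return precision, recall, 0.0
    f_score = 2 * precision * recall / (precision + recall)
    return precision, recall, f_score


if __name__ == "__main__":
    p, r, f = unigram_prf("the cat sat", "the cat sat on the mat")
    print(f"precision={p:.3f} recall={r:.3f} f={f:.3f}")
```

In this toy example the short candidate is fully contained in the reference, so precision is high while recall is lower, and the harmonic mean sits between the two, closer to the smaller value.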