Details

Review on Opinion Data Summarization Using K-means Clustering and Latent Semantic Analysis

Renu

Department of Computer Science, Shri Ram college of Engineering & Management, Palwal

Neha

Department of Computer Science, Shri Ram college of Engineering & Management, Palwal

Kunal

Department of Computer Science, Shri Ram college of Engineering & Management, Palwal

12-23

Vol: 7, Issue: 2, 2017

Receiving Date: 2017-02-12 Acceptance Date:

2017-03-29

Publication Date:

2017-04-16

Download PDF

Abstract

Text Summarization is condensing the source text into a shorter version preserving its information content and overall meaning. It is very difficult for human beings to manually summarize large documents of text. Text Summarization methods can be classified into extractive and abstractive summarization. An extractive summarization method consists of selecting important sentences, paragraphs etc. from the original document and concatenating them into shorter form. The importance of sentences is decided based on statistical and linguistic features of sentences. An abstractive summarization method consists of understanding the original text and re-telling it in fewer words. It uses linguistic methods to examine and interpret the text and then to find the new concepts and expressions to best describe it by generating a new shorter text that conveys the most important information from the original text document. Usually, the flow of information in a given document is not uniform, which means that some parts are more important than others. The major challenge in summarization lies in distinguishing the more informative parts of a document from the less ones. Though there have been instances of research describing the automatic creation of abstracts, most work presented in the literature relies on verbatim extraction of sentences to address the problem of single-document summarization. In this scheme, we describe some eminent extractive techniques. First, we look at early work from the aspect of research on summarization. Second, we concentrate on approaches involving machine learning techniques. In this dissertation, ontology based document summarization is proposed that provide efficient and accurate summary than other approaches. The main motivation for summarization is to identifying summary from a large document, that it is a data is beneficial for us or not. It is identify weather a product is purchasable or not. This make difficult for a potential customer to read them to make an informed decision on whether to purchase the product. It also makes it difficult for the manufacturer of the product to keep track and to manage customer opinion. In the scheme we proposed enhanced algorithm vide latent semantic kernel for better results.

Keywords: Data or Text Summarization; Inverse Document Frequency; Document Clustering

References

  1. R. Baeza-Yates, C. Hurtado, and M. Mendoza, “Query Recommendation Using Query Logs in Search Engines,”Proc. Int‟l Conf. Current Trends in Database Technology (EDBT ‟04), pp. 588-596, 2004
  2. Harshada P. Bhambure, Mandar Mokashi, 'Inferring User Search Goals Using Feedback Session', International Journal of Science and Research (IJSR), www.ijsr.net, Volume 4 Issue 6, June 2015, 2880 - 2884 - See more at: http://www.ijsr.net/archive/v4i6/v4i6_03.php#sthash.1TMBdOKC.dpuf
  3. Joel larocca Neto, Alex A. Freitas and Celso A.A.Kaestner, 'Automatic Text Summarization using a Machine Learning Approach”, Book: Advances in Artificial Intelligence: Lecture Notes in computer science, Springer Berlin / Heidelberg, Vol 2507/2002, 205-215, 2002.
  4. Weiguo Fan, Linda Wallace, Stephanie Rich, and Zhongju Zhang, “Tapping into the Power of Text Mining”, Journal of ACM, Blacksburg, 2005.
  5. Fang Chen, Kesong Han and Guilin Chen, “An Approach to sentence selection based text summarization', Proceedings of IEEE TENCON02, 489-493, 2002
  6. Mohamed Abdel Fattah and Fuji Ren, 'Automatic Text Summarization', Proceedings of World Academy of Science, Engineering and Technology, Vol 27,ISSN 1307- 6884, 192-195, Feb 2008.
  7. H. P. Luhn, “The Automatic Creation of Literature Abstracts”, Presented at IRE National Convention, New York, 159-165, 1958.
  8. H. P. Edmundson.,” New methods in automaticextracting”, Journal of the ACM, 16(2):264-285, April 1969.
  9. J. Kupiec, J. Pedersen, and F. Chen, “A trainable document summarizer”, In Proceedings of the 18th ACMSIGIR Conference, pages 68-73, 1995
  10. Ronald Brandow, Karl Mitze, and Lisa F. Rau. “Automatic condensation of electronic publications by sentence selection. Information Processing and Management”, 31(5):675-685,1995
  11. E. Mittendorf and P. Schauble, “ Document and passage retrieval based on hidden markov models”, In Proceedings of the 17th ACM-SIGIR Conference, pages 318-327,1994.
  12. A. Bookstein, S. T. Klein, and T. Raita, “Detecting content-bearing words by serial clustering”, In Proceedings of the 18th ACM-SIGIR Conference, pages 319-327, 1995
  13. Madhavi K. Ganapathiraju, “Overview of summarization methods”, 11-742: Self-paced lab in Information Retrieval, November 26, 2002
  14. Klaus Zechner, “A Literature Survey on Information Extraction and Text Summarization”, Computational Linguistics Program, Carnegie Mellon University, April 14, 1997.
Back

Disclaimer: All papers published in IJRST will be indexed on Google Search Engine as per their policy.

We are one of the best in the field of watches and we take care of the needs of our customers and produce replica watches of very good quality as per their demands.