IJRST

Details

Review on Opinion Data Summarization Using K-means Clustering and Latent Semantic Analysis

Renu

Department of Computer Science, Shri Ram College of Engineering & Management, Palwal

Neha

Department of Computer Science, Shri Ram College of Engineering & Management, Palwal

Kunal

Department of Computer Science, Shri Ram College of Engineering & Management, Palwal

12-23

Vol: 7, Issue: 2, 2017

Receiving Date: 2017-02-12

Acceptance Date: 2017-03-29

Publication Date: 2017-04-16


Abstract

Text summarization condenses a source text into a shorter version while preserving its information content and overall meaning. Manually summarizing large documents is very difficult for human beings. Text summarization methods can be classified as extractive or abstractive. An extractive method selects important sentences, paragraphs, etc. from the original document and concatenates them into a shorter form; the importance of each sentence is decided from its statistical and linguistic features. An abstractive method attempts to understand the original text and retell it in fewer words. It uses linguistic methods to examine and interpret the text, then finds new concepts and expressions that best describe it, generating a new, shorter text that conveys the most important information from the original document. Usually, the flow of information in a document is not uniform: some parts are more important than others. The major challenge in summarization lies in distinguishing the more informative parts of a document from the less informative ones. Although some research describes the automatic creation of abstracts, most work in the literature relies on verbatim extraction of sentences to address single-document summarization. In this review, we describe some prominent extractive techniques. First, we look at early work on summarization. Second, we concentrate on approaches involving machine learning techniques. In this dissertation, ontology-based document summarization is proposed, which provides a more efficient and accurate summary than other approaches. The main motivation for summarization is to obtain a summary of a large document in order to judge whether the data is useful to us, for example whether a product is worth purchasing. A large volume of opinion data makes it difficult for a potential customer to read it all and make an informed decision on whether to purchase the product. It also makes it difficult for the manufacturer of the product to keep track of and manage customer opinion. In the proposed scheme, we use an enhanced algorithm based on a latent semantic kernel for better results.

Keywords: Data or Text Summarization; Inverse Document Frequency; Document Clustering
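
The extractive approach outlined in the abstract, in which sentences are scored by statistical features and the top-scoring ones are concatenated, can be illustrated with a minimal sketch. It is not the authors' algorithm: it assumes a naive regular-expression sentence splitter, treats each sentence as a "document" for a smoothed inverse document frequency weight, and normalizes each score by sentence length. The function names `split_sentences`, `tokenize`, and `summarize` are hypothetical and chosen only for illustration.

```python
import math
import re
from collections import Counter


def split_sentences(text):
    """Naive splitter (an assumption): break after ., ! or ? plus whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokenize(sentence):
    """Lowercase word tokens; no stemming or stop-word removal."""
    return re.findall(r"[a-z0-9']+", sentence.lower())


def summarize(text, k=2):
    """Return the k highest-scoring sentences, kept in their original order."""
    sentences = split_sentences(text)
    n = len(sentences)
    if n == 0:
        return []
    tokens = [tokenize(s) for s in sentences]

    # Sentence frequency: number of sentences in which each term occurs.
    sent_freq = Counter(term for toks in tokens for term in set(toks))

    def score(toks):
        if not toks:
            return 0.0
        tf = Counter(toks)
        # Smoothed IDF, with each sentence treated as one "document".
        total = sum(
            count * (math.log((1 + n) / (1 + sent_freq[term])) + 1.0)
            for term, count in tf.items()
        )
        # Length normalization so long sentences are not favored automatically.
        return total / len(toks)

    scores = [score(toks) for toks in tokens]
    # Pick the k best sentence indices, then restore document order.
    best = sorted(range(n), key=lambda i: scores[i], reverse=True)[:k]
    return [sentences[i] for i in sorted(best)]


if __name__ == "__main__":
    reviews = (
        "The battery lasts long. The screen is bright and sharp. "
        "I like it. The camera is poor in low light."
    )
    for sentence in summarize(reviews, k=2):
        print(sentence)
```

Restoring the selected sentences to document order keeps the extract readable. A fuller system along the lines suggested by the title would replace the scoring step with clustering or a latent semantic representation of the sentences.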



info@ijrst.com

+919555269393
