Rishit Garkhel
Download PDF
http://doi.org/10.37648/ijrst.v10i04.005
Identifying and disposing of the copied document is one of the serious issues in the wide space of information cleaning and information quality in the framework. Ordinarily, a similar sensible true element might have numerous portrayals in the information distribution centre. Copy disposal is hard because it is brought about by a few blunders like typographical mistakes and various pictures of similar consistent worth. Our primary aim of this study is to recognise specific and inaccurate representations by utilising copy description and end rules. This methodology is used to work on the proficiency of the information. The significance of information precision and quality has expanded with the blast of information size. In the copy disposal step, just one duplicate of accurate copied records or documents is held and dispensed with other copy records or documents. The end cycle is vital to delivering cleaning information. Before the end sequence, the similitude limit esteems are determined for every one of the records available in the informational collection. The closeness limit admires significant for the end communication.
Keywords: Duplicate record recognition; Duplication; information linkage
Disclaimer: All papers published in IJRST will be indexed on Google Search Engine as per their policy.