{"product_id":"proposing-field-matching-similarity-methods-von-solmaz-khatami","title":"Proposing Field Matching Similarity Methods","description":"\u003cp\u003eDuplicate records do not share a common key but refer to a single entity. Databases that contain such records often include errors that make matching duplicate records a complex problem. These errors include typing errors, incomplete information such as abbreviations, deviations from standard formats, or a combination of these factors. This book uses a database in which typing errors are more frequent than other errors. The database contains real estate information in four fields: name, surname, property address and property area. The goals of this book are: to review existing algorithms for identifying duplicate data in fields, namely Edit-distance, Smith-Waterman, Jaro, Jaro-Winkler, LCS and N-gram; to describe the proposed algorithms, a token-based algorithm and an algorithm based on typing errors, which aim to improve the efficiency and increase the precision of duplicate identification; and to compare the efficiency of these algorithms on a large Persian database.\u003c\/p\u003e\u003cdiv class=\"aw-variant-hidden-subtitle-div\" id=\"aw-variant-subtitle-9783659341304\"\u003e\u003ch3\u003eImplementation and Comparison of Field Similarity Metrics with Duplicate Entities Detection Purpose in Database\u003c\/h3\u003e\u003c\/div\u003e","brand":"Autorenwelt Shop","offers":[{"title":"Softcover - 9783659341304","offer_id":39485331243101,"sku":"9783659341304","price":49.0,"currency_code":"EUR","in_stock":true}],"thumbnail_url":"\/\/cdn.shopify.com\/s\/files\/1\/0940\/0622\/files\/8286802b-d3b4-4975-bbb0-3b36e978547a.jpg?v=1773381072","url":"https:\/\/shop.autorenwelt.de\/en\/products\/proposing-field-matching-similarity-methods-von-solmaz-khatami","provider":"Autorenwelt Shop","version":"1.0","type":"link"}