{"product_id":"statistical-and-semantic-similarity-between-english-sentences-von-anis-zaman","title":"Statistical and Semantic Similarity between English Sentences","description":"\u003cp\u003eThis book presents various algorithms to compute semantic similarities between english texts. I explored three different algorithms for computing English sentence similarity. The first algorithm, which is well-explored in the literature [Salton and Buckley, 1988, Wu and Salton, 1981], weights words in each sentence according to term frequency and inverse document frequency (tf-idf ) and uses no semantic information. The second algorithm uses measures of the semantic distance between words belonging to the same part of speech. The third algorithm combines the tf-idf scores and the semantic distance scores between words. I evaluated the performance of the second and third algorithms on two data sets: O¿Sheäs set of sentence pairs with human similarity judgements [Li et al., Aug, Rubenstein and Goodenough, 1965], and Microsoft Research¿s sentence-level paraphrase dataset [Rus et al., 2012]. On O¿Sheäs data set, the third algorithm more accurately matches human judgments than the second. On the Microsoft data set, there was not a significant difference between the two algorithms\u003c\/p\u003e\u003cdiv class=\"aw-variant-hidden-subtitle-div\" id=\"aw-variant-subtitle-9783659616389\"\u003e\u003ch3\u003e\u003c\/h3\u003e\u003c\/div\u003e","brand":"Libri","offers":[{"title":"Softcover - 9783659616389","offer_id":39449228017757,"sku":"9783659616389","price":39.9,"currency_code":"EUR","in_stock":true}],"thumbnail_url":"\/\/cdn.shopify.com\/s\/files\/1\/0940\/0622\/files\/1d2d6fa8-130c-4bbe-af38-1b898eace672.jpg?v=1751346682","url":"https:\/\/shop.autorenwelt.de\/en\/products\/statistical-and-semantic-similarity-between-english-sentences-von-anis-zaman","provider":"Autorenwelt Shop","version":"1.0","type":"link"}