✍️ 🧑‍🦱 💚 Autor:innen verdienen bei uns doppelt. Dank euch haben sie so schon 367.705 € mehr verdient. → Mehr erfahren 💪 📚 🙏

Statistical and Semantic Similarity between English Sentences

Statistical and Semantic Similarity between English Sentences

von Anis Zaman
Softcover - 9783659616389
39,90 €
  • Versandkostenfrei
Auf meine Merkliste
  • Hinweis: Print on Demand. Lieferbar in 2 Tagen.
  • Lieferzeit nach Versand: ca. 1-2 Tage
  • inkl. MwSt. & Versandkosten (innerhalb Deutschlands)

Autorenfreundlich Bücher kaufen?!

Beschreibung

This book presents various algorithms to compute semantic similarities between english texts. I explored three different algorithms for computing English sentence similarity. The first algorithm, which is well-explored in the literature [Salton and Buckley, 1988, Wu and Salton, 1981], weights words in each sentence according to term frequency and inverse document frequency (tf-idf ) and uses no semantic information. The second algorithm uses measures of the semantic distance between words belonging to the same part of speech. The third algorithm combines the tf-idf scores and the semantic distance scores between words. I evaluated the performance of the second and third algorithms on two data sets: O¿Sheäs set of sentence pairs with human similarity judgements [Li et al., Aug, Rubenstein and Goodenough, 1965], and Microsoft Research¿s sentence-level paraphrase dataset [Rus et al., 2012]. On O¿Sheäs data set, the third algorithm more accurately matches human judgments than the second. On the Microsoft data set, there was not a significant difference between the two algorithms

Details

Verlag LAP LAMBERT Academic Publishing
Ersterscheinung Oktober 2014
Maße 22 cm x 15 cm x 0.5 cm
Gewicht 125 Gramm
Format Softcover
ISBN-13 9783659616389
Seiten 72