{"product_id":"using-rogets-thesaurus-to-determine-the-similarity-of-texts-von-jeremy-ellman","title":"Using Roget's Thesaurus to Determine the Similarity of Texts","description":"\u003cp\u003eThis thesis addresses the problem of extracting a  representation of text''s meaning from its content.  The solution investigated is based on the use of  Roget''s thesaurus as an external knowledge source  and can be used to analyse texts of any length or  complexity. The resulting document representation  can then be compared to others, producing a new  method for text similarity assessment.  All coherent texts contain embedded sequences of  words that are related in meaning. These sequences  can be detected by identifying simple relationships  between the relevant thesaural entries in which the  words are found. The identification of initial  sequences drives the addition of further related  words into conceptually related ¿lexical chains¿.   Every coherent text contains many lexical chains of  different lengths and strengths. These may be used  to represent the broad subject matter of a text. By  identifying the key concept of each chain, and  relating this to its presence we may produce an  attribute value vector of concepts and their  strengths. This may then be used to identify other  texts as closer or further away in meaning.\u003c\/p\u003e\u003cdiv class=\"aw-variant-hidden-subtitle-div\" id=\"aw-variant-subtitle-9783838338408\"\u003e\u003ch3\u003eA Thesis in Computational Linguistics\u003c\/h3\u003e\u003c\/div\u003e","brand":"Autorenwelt Shop","offers":[{"title":"Softcover - 9783838338408","offer_id":39499026595933,"sku":"9783838338408","price":79.0,"currency_code":"EUR","in_stock":true}],"thumbnail_url":"\/\/cdn.shopify.com\/s\/files\/1\/0940\/0622\/files\/d9a8fcf6-d3e6-4951-baf1-5f0cfdb0b054.jpg?v=1757654918","url":"https:\/\/shop.autorenwelt.de\/products\/using-rogets-thesaurus-to-determine-the-similarity-of-texts-von-jeremy-ellman","provider":"Autorenwelt Shop","version":"1.0","type":"link"}