{"product_id":"statistical-properties-of-turkish-words-von-gokhan-dalkilic","title":"Statistical Properties of Turkish Words","description":"\u003cp\u003eFor speech recognition, OCR, etc.  determination of the structural properties of a  natural language is essential. These properties can  be analyzed under two different categories;  morphological and statistical analysis. For  statistical analysis, a corpus which is a  representative sample of the natural language is  needed. Word n-gram frequencies of that corpus can  be determined by using suitable algorithms and  missing n-grams can be estimated by using smoothing  techniques. In this study, in order to compare and  apply smoothing techniques to Turkish,  a corpus named TurCo was created. In  order to calculate word n-grams, different  algorithms were tested. After finding  n-gram word lists, their characteristics  were analyzed. For generalization, Zipf''s Law was  applied, and to increase the accuracy in Zipf''s Law,  Mandelbrot Law was applied by finding the  appropriate constants of Mandelbrot. As the corpus  could not be big enough to represent all of the  language, smoothing techniques were used to estimate  the unseen word n-grams. This study can help  professionals working on speech recognition,  cryptanalysis, and author recognition in Turkish.\u003c\/p\u003e\u003cdiv class=\"aw-variant-hidden-subtitle-div\" id=\"aw-variant-subtitle-9783838351582\"\u003e\u003ch3\u003eContemporary Printed Turkish Word Characteristics and Smoothing Techniques\u003c\/h3\u003e\u003c\/div\u003e","brand":"Autorenwelt Shop","offers":[{"title":"Softcover - 9783838351582","offer_id":39498962567261,"sku":"9783838351582","price":59.0,"currency_code":"EUR","in_stock":true}],"thumbnail_url":"\/\/cdn.shopify.com\/s\/files\/1\/0940\/0622\/files\/1850234c-f1bb-498e-be19-52e59e7cdb33.jpg?v=1773211416","url":"https:\/\/shop.autorenwelt.de\/products\/statistical-properties-of-turkish-words-von-gokhan-dalkilic","provider":"Autorenwelt Shop","version":"1.0","type":"link"}