Belaya T.I. 1
Pasechnik P.A. 1
1 St. Petersburg State University of Technology and Design
We analyze text processing based on the statistical estimation of sentences and individual terms. The main purpose of this article is to describe a method for evaluating terms without relying on thesaurus methods. The objects of consideration are terms introduced in a text for the first time, together with their accompanying definitions. We consider exclusively statistical tools for extracting concepts and highlight their advantages over dictionary-based methods. The work focuses on automatic summarization. Four key steps for solving the problem are identified; they are used in designing templates and in analyzing words and word combinations through the statistics of their occurrence in the text. A formula is selected for the probabilistic characteristics of terms and of the sentences that define them. An algorithm for text analysis is formulated, and recommendations are given on using it in the development of software tools. The resulting data can be used to automate the generation of educational tests, to estimate the coverage of scientific material, to translate Russian texts, to automate grammar correction, and for purposes of artificial intelligence theory.