| Developers: | Federal Research Center "Information Science and Management" of RAS, NTI Competence Center at MSU, Technologies of Systems Analysis |
| System premiere date: | 2020/04/14 |
| Technology: | Big Data, Data Mining |
2020: Development of a solution for intelligent analysis of large text collections
The NTI Competence Center at MSU, working in the area "Technologies for Big Data Storage and Analysis", together with the Institute of Artificial Intelligence Problems of the FRC "Information Science and Management" of RAS and Technologies of Systems Analysis LLC, has developed a text analytics platform based on intelligent systems for collecting and processing texts in Russian and English. The project is the first-ever industrial solution with the potential for cross-language analytics. RVC reported this on April 14, 2020.
The solution can analyze and process any type of information provided in text form in Russian, English, Belarusian, Kazakh and Tatar. An advantage of the project is cross-language text analytics: there is no need to repeat a search for the same data when it appears in documents in different languages and is described by terms from different languages. Implementing the solution will significantly simplify the work of specialists who analyze large volumes of text and will considerably increase the efficiency of patent and research searches.
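The idea of cross-language search can be illustrated with a minimal sketch. It is not the platform's implementation, which is not described in the source. It assumes a hypothetical hand-made dictionary (`TERM_TO_CONCEPT`) that maps terms from different languages to shared concept identifiers, and hypothetical helper functions `concepts_in`, `build_index` and `search`. Matching is done by plain substring lookup, so inflected word forms are not recognized; a real system would need morphological analysis at the very least.

```python
from collections import defaultdict

# Hypothetical multilingual dictionary: surface term -> language-neutral concept ID.
TERM_TO_CONCEPT = {
    "neural network": "C_NEURAL_NET",
    "нейронная сеть": "C_NEURAL_NET",
    "patent": "C_PATENT",
    "патент": "C_PATENT",
}


def concepts_in(text):
    """Return the set of concept IDs whose terms occur in the text (substring match)."""
    lowered = text.lower()
    return {concept for term, concept in TERM_TO_CONCEPT.items() if term in lowered}


def build_index(documents):
    """Build an inverted index: concept ID -> set of document IDs mentioning it."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        for concept in concepts_in(text):
            index[concept].add(doc_id)
    return index


def search(index, query):
    """Return IDs of documents that contain every concept found in the query."""
    query_concepts = concepts_in(query)
    if not query_concepts:
        return set()
    return set.intersection(*(index.get(c, set()) for c in query_concepts))


# Toy documents, invented for illustration only.
DOCUMENTS = {
    "doc_en": "A patent describing a neural network for image analysis.",
    "doc_ru": "Патент: нейронная сеть для анализа изображений.",
    "doc_other": "Quarterly financial report.",
}

if __name__ == "__main__":
    index = build_index(DOCUMENTS)
    # A single English query retrieves both the English and the Russian document.
    print(sorted(search(index, "neural network")))
```

Because documents and queries are both reduced to concept identifiers before matching, one query in either language reaches documents in both, which is the property the paragraph above describes.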
The solution makes it possible to analyze large volumes of scientific and technical information, to analyze social networks in order to identify materials on undesirable subjects and signs of deviant behavior, and to assess the psychological state of users and social tension among population groups. In addition, the solution can perform information extraction and analyze medical documents and technical procurement information in order to convert large semi-structured text collections into structured information.
In the future, analytical centers, scientific and scientific-educational organizations, enterprises providing intellectual property protection services, and state corporations may become consumers of the development. Legal reference systems and consulting bureaus are also potential consumers.
| The symbiosis of several scientific approaches proposed by the president of the Russian Academy of Social Sciences, Doctor of Philosophy, Professor G.V. Osipov, makes it possible to describe the subject matter of documents through phrases and to analyze the meaning of individual expressions in the form of heterogeneous semantic networks. Combined with modern methods of computational linguistics, distributional semantics and machine learning, the solution achieves higher precision and recall in text analytics tasks, |
As of April 2020, the solution had already passed the pilot implementation stage at the following organizations: INFRA-M, NCR of Rukont, NTIMI, the Directorate of Scientific and Technical Programs, the Ministry of Education and Science of the Russian Federation. This made it possible to assemble a base of customers' technology requests in the field of text analytics.
The cost of developing and deploying the ready-made solution ranges from 5 to 25 million rubles per case, depending on the customer's need for integration, customization and in-depth software configuration services.
