Samolet: Hybrid Data Management Platform

Product
Developers: Samolet Group of Companies
System launch date: 2024/12/11
Technology: Big Data, MDM (Master Data Management)


2024: Creating an Analytical Platform

The Samolet Group of Companies has created its own analytical platform for managing big data. It combines the advantages of a classic corporate data warehouse with the flexibility of a data lake, so it can serve both requests for ready-made data marts and work with high-quality, cleaned data in the lake. This significantly broadens the platform's business applications and makes it possible to optimize the loading, processing, cleaning and describing of data. The Group of Companies announced this on December 11, 2024.

With significant amounts of heterogeneous information processed daily, effective data management is key to a successful business. Samolet uses a data-driven approach in making strategic and operational decisions, which allows the company to improve the accuracy of forecasts, optimize processes and improve the quality of the services it provides.

The Samolet platform is a full-fledged big data solution built on a stack of open source technologies and the company's own in-house developments.

"For us, it was not just a project, but also a strategic challenge. Many companies prefer ready-made proprietary solutions, but we opted for independence and flexibility, which is especially important in a dynamically changing market. Our approach has made it possible to create a modern platform with a full data service cycle, which implements data governance processes, uses infrastructure as code and meets the highest requirements. The analytics platform provides cross-system data integration from more than 170 different master systems and sources. It should be noted that the solution landscape includes various tools that make it possible not only to accumulate data but also, much more importantly, to create a data management strategy and apply it effectively to the business. For example, an important role in our landscape is played by the reference data service, through which unified corporate directories are replicated," said Olga Svitneva, director of the Samolet group.

Samolet's in-house development team has built a data quality control system with a multi-level mechanism for validating and cleaning data. The company actively develops AI and machine learning projects and applies them in practice, so the stack includes components tailored to the needs of data science. The company focuses on secure development patterns and data access policies, with the aim of turning data into a managed asset that is accessible, sustainable and liquid.
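The article does not describe how the multi-level validation and cleaning mechanism is implemented. As a purely illustrative sketch, assuming records arrive as dictionaries and pass through a cleaning step followed by ordered validation levels, the idea might look like this in Python. All names here (`Level`, `Report`, `validate`, the `id` and `price` fields) are hypothetical and are not taken from the Samolet platform.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Optional

Record = dict[str, Any]


@dataclass
class Level:
    """One validation level: a name and a check returning an error or None."""
    name: str
    check: Callable[[Record], Optional[str]]


@dataclass
class Report:
    clean: list[Record] = field(default_factory=list)
    rejected: list[tuple[Record, str]] = field(default_factory=list)


def clean_record(rec: Record) -> Record:
    """Cleaning step (assumed): trim whitespace in string fields."""
    return {k: v.strip() if isinstance(v, str) else v for k, v in rec.items()}


def check_schema(rec: Record) -> Optional[str]:
    missing = {"id", "price"} - rec.keys()
    return f"missing fields: {sorted(missing)}" if missing else None


def check_types(rec: Record) -> Optional[str]:
    if not isinstance(rec["price"], (int, float)):
        return "price is not numeric"
    return None


def check_business(rec: Record) -> Optional[str]:
    return None if rec["price"] > 0 else "price must be positive"


# Levels run in order; later levels may rely on earlier ones having passed.
LEVELS = [
    Level("schema", check_schema),
    Level("types", check_types),
    Level("business", check_business),
]


def validate(records: list[Record], levels: list[Level]) -> Report:
    """Clean each record, then reject it at the first level that fails."""
    report = Report()
    for raw in records:
        rec = clean_record(raw)
        for level in levels:
            error = level.check(rec)
            if error is not None:
                report.rejected.append((rec, f"{level.name}: {error}"))
                break
        else:
            report.clean.append(rec)
    return report
```

Calling `validate(records, LEVELS)` returns a report that separates records that passed every level from those rejected, together with the level and reason for each rejection. Running cheap structural checks before business rules is one possible ordering; a real system may order or combine levels differently.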

The platform's technology stack includes many different components and services: Kubernetes, Kafka, Debezium, MinIO S3, ClickHouse, Airflow, PostgreSQL, DataHub, MLflow, JupyterHub and others. The platform architecture is designed to maximize resiliency and scalability given the rapid growth of data in the company. The entire platform is deployed and runs on server infrastructure in Samolet Group's own data center, which allows it to fully meet security and performance requirements.