RSS
Логотип
Баннер в шапке 1
Баннер в шапке 2

Arenadata Catalog

Product
Developers: DataCatalog
Date of the premiere of the system: 2022
Last Release Date: 2024/09/16
Branches: Information Technology
Technology: Big Data

Content

Main article: Big Data

Arenadata Catalog is a tool for organizing data management within Data Governance.

2026

Availability of Polymatica Dashboards

Users of the Arenadata Catalog Enterprise Data Management System (ADC) have access to a connector to Polymatica Dashboards - the update was included in the product release v. 0.8.5. DataCatalog reported this on February 5, 2026.

The connector is designed for automated scanning of the metadata structure Polymatica BI Systems and subsequent cataloging in the Arenadata Catalog. This helps organizations expand their reach data management Data Governance by BI streamlining analytics assets, increasing data visibility, and building a single information space for collaboration IT and business.

By integrating with the business glossary, each indicator in the report is clearly defined, which leads to the elimination of discrepancies and ensuring consistency between teams. Arenadata Catalog allows you to automate the quality check of the original reporting data, which, in turn, increases the reliability of the conclusions. The Data Lineage function gives complete transparency: you can trace the data path from the source to the final dashboard, see transformations and ETL processes. This speeds up error finding, simplifies auditing, and creates a culture of responsible data handling.

File:Aquote1.png
We see a steady market demand for "end-to-end" data manageability - from sources and storefronts to end dashboards. The introduction of the connector in Arenadata Catalog makes the sharing of Polymatica Dashboards and metadata management tools more effective for customers and partners, increasing confidence in analytics and speeding up work, "said Elvin Mustafayev, Director of the Polymatica Product Development Department.
File:Aquote2.png

File:Aquote1.png
Increasing the number of Arenadata Catalog integrations is primarily an extension of the metadata management layer. Integration with Polymatica Dashboards allows you to catalog key business assets - dashboards and metrics - and link them to physical data and a glossary. So we create a single trusted data environment that simplifies import substitution of the BI stack, making analytics processes controlled and transparent, "said Igor Moiseev, director of business development at DataCatalog (part of the Arenadata Group).
File:Aquote2.png

2025

Arenadata Catalog version 0.8.5

The DataCatalog command simultaneously with the release of version 0.8.5 of the Arenadata Catalog Enterprise Data Management System (ADC) released a connector to the freeware Greengage DBMS. DataCatalog announced this on December 24, 2025. Read more here.

At the heart of the IT platform for large industrial enterprises

Arenadata and ATOLLis Groups on September 23, 2025 announced the release of a joint IT platform for large industrial enterprises.

The solution was based on Arenadata products: an enterprise data management system within the Data Governance (Arenadata Catalog), a data quality control system (Arenadata Catalog Data Quality Framework), a master data management system (Arenadata Harmony MDM), a DBMS (Arenadata DB) and an integration data management platform ATOLLis (ATOLL Platform). Read more here .

Arenadata Catalog (ADC) 0.8.4 with Reference Book Registry module

Arenadata has released an updated release of the enterprise data management Arenadata Catalog (ADC) 0.8.4 system. The key change was the Reference Book Registry module, designed for centralized reference data management (RDM). This enhanced functionality allows for consistency, accuracy, and availability of critical reference information to all stakeholders, directly affecting process efficiency and analytics quality. Arenadata announced this on September 16, 2025.

This module solves one of the key tasks of organizations - unification of disparate data. The "Directory Register" supports both internal data (unique product codes, division classifiers) and external (generally accepted industry standards). Data can be imported or integrated from any source without volume restrictions.

The main value of the Reference Book Registry module is the ability to associate reference book values ​ ​ with business glossary objects. This creates a single semantic field of the company, where each term is enriched with specific, structured data.

The Directory Registry provides additional features for directory users. For example, for financial organizations, it allows data consistency not only within hierarchically complex organization terminology, but also in conjunction with normalized reference values, through links, and the ability to automate the loading of reference and terminology records allows you to maintain consistency and relevance of data. This speeds up processes and eliminates errors due to possible misinterpretations, such as internal codes.

For working with external data, the reference book can serve as a reliable source of static information, for example, currency codes, which ensures the reliability of data when going through workflows.

The module extends the internal capabilities of the system, in particular the business glossary section. Unlike the simple Select From List attribute, the Directory Registry provides advanced features such as assigning people responsible for reference values, specifying the source of information, and the extensible attribute composition of the directories themselves - which increases trust in the data and simplifies its support.

File:Aquote1.png
The implementation of the "Directory Registry" in Arenadata Catalog is our response to one of the key challenges of companies: the need to maintain the integrity and relevance of reference information in the face of rapidly changing business processes. This module not only adds new functionality, but creates the basis for the formation of a single data language within the organization, which is especially important in the context of digital transformation and working with big data, "said Igor Moiseev, director of business development at DataCatalog (part of the Arenadata Group).
File:Aquote2.png

In addition to the "Directory Registry," the release includes more than 120 targeted improvements aimed at improving the convenience, transparency and stability of the system:

  • Improved business glossary: links to related objects have become clickable, and color legend customization has been added to visualize links.
  • · Updated interfaces: completely redesigned the "Settings" and "Classifiers" sections. You can now assign owners to classifier sets.
  • Increasing DQ transparency: The Data Quality section provides the ability to subscribe to alerts about test results.
  • Security enhancement: The role model has a function to hide entire system partitions for users who do not have access to them.
  • Large-scale optimization: improved processes for importing metadata from various sources (Oracle, Hive, Greenplum, etc.), as well as updated the Camunda framework to improve stability.

Obtaining a Level 4 FSTEC Certificate

Arenadata Catalog received FSTEC a level 4 certificate. This was DataCatalog announced on June 9, 2025.

File:Aquote1.png
The question of trust in data begins with confidence in the reliability of the tools that control this data. We are pleased that Arenadata Catalog is the first product in its class to receive a high degree of trust from FSTEC. This is an important step both for us and for the entire Russian data management market, "said Ivan Novosyolov, CEO of DataCatalog (part of the Arenadata Group).
File:Aquote2.png

Certification was conducted within established procedures, including analysis, source code safety architectures, and development processes. Obtaining FSTEC on level 4 of trust means that Arenadata Catalog has built-in mechanisms for protecting against unauthorized access to non-secret data, state provides functions identifications and authentications access control, as well as recording security events.

The product is included in the state register of the certification system for information protection tools according to information security requirements on May 28, 2025. The certificate is valid until May 28, 2030.

Compatibility with Luxms BI

The Luxms BI business intelligence platform has been tested for compatibility with the Arenadata Group solution stack, including the Arenadata Catalog. Arenadata announced this on March 6, 2025. Read more here.

2024

Developing a connector for integration with Picodata DBMS

The DataCatalog team (part of the Arenadata Group) has completed the development of a connector that provides compatibility between the Arenadata Catalog (ADC) product and the Picodata DBMS. Arenadata announced this on November 26, 2024. Read more here.

Compatibility with "MDM Harmony"

and the company Navicon Datakatalog"" (part of the Group) Arenadata on August 28, 2024 announced the completion of the testings Arenadata ON Catalog (ADC) and regulatory reference information management system. " MDM Harmony Integration two solutions will allow Russian business customers to use these products as part of the construction of complex IT systems for. data management

The integration of Arenadata Catalog with the NSI and Master Control System data - Harmony MDM was tested as part of joint tests carried out on a specially deployed stand. Arenadata Catalog users can now be assured of the cleanliness, relevance and consistency of the company's master data. This will ensure high speed and accuracy of analytics and decisions based on it.

File:Aquote1.png
Metadata and master data management are traditionally closely intertwined, providing an integrated approach to data organization and quality. "MDM Harmony" focuses on providing uniform, accurate and up-to-date data such as customer, product and supplier information. Integration with the Arenadata Catalog metadata management system allows you to track and manage data at the metadata level, such as the origin of data, its structure and relationships. This helps identify and resolve inconsistencies, duplicate records, and other issues, which ultimately improves overall data quality and consistency. With a unified and integrated approach to data management, management and analysts receive more complete and up-to-date information for decision-making. This contributes to effective planning, strategic analysis and prompt response to changes in the business environment, - said Ivan Novoselov, CEO of DataCatalog ("DataCatalog").
File:Aquote2.png

File:Aquote1.png
Arenadata Catalog is a popular product among large Russian business customers, the demand for which is constantly growing. The compatibility of our solutions will open up new prospects for market participants and make it possible to more effectively solve problems related to managing large data flows, "commented Maria Averina, Director of Strategic Development at Navicon.
File:Aquote2.png

Apache Impala Compatibility

On May 16, 2024, Arenadata announced that DataCatalog (part of the Arenadata Group) had tested a connector that provides compatibility between the Arenadata Catalog (ADC) product and the Apache Impala service, which is part of the Arenadata Hadoop (ADH) enterprise distribution.

DateCatalog has tested a connector that provides compatibility between Arenadata Catalog (ADC) and Apache Impala

According to the company, the connector allows you to import Impala object descriptions into the catalog, profile data, and configure custom data quality checks in Impala. This is not the first module to provide integration with the Hadoop ecosystem, previously customers were presented with a connector for the Hive service.

The Hadoop ecosystem is a de facto standard in business scenarios related to the storage, processing and analysis of large amounts of arbitrary data types. The steady demand for systems of this class is supported by the trend for digitalization and the growth of unstructured data and the number of related projects.

Responding to customers' need for high-performance analysis of big data stored on systems deployed on Arenadata Hadoop, Arenadata included Apache Impala, a distributed SQL query execution service, in the next update. It is designed for massively parallel processing (MPR) of ultra-large amounts of data.

Impala is designed as a faster and more efficient mechanism for executing SQL queries compared to traditional SQL-on-Hadoop (Hive, Spark SQL) components. Service support optimized product performance for a number of business scenarios, including the so-called data sandboxes for ad hoc processing by information analysts.

File:Aquote1.png
A number of Arenadata customers took advantage of the ability to speed up SQL processing and data analysis by using Impala instead of Hive in the data lakes. Unfortunately, the lack of support for this service in Arenadata Catalog deterred some of them from switching the load to Impala in the industrial circuit. The operational development and delivery of the metadata connector ensured the continuity of metadata tracking in systems and eliminated this obstacle.

counts Alexander Timchur, Head of Arenadata Sales Support
File:Aquote2.png

The metadata of the objects of the integrated systems is the basis of the data catalog. The integration of Impala object metadata allows Arenadata Catalog users to get an up-to-date and complete view of service objects to include in the lineage graph, explore links to objects of other source systems, and link to the business entities of the organization involved. The Arenadata Catalog administrator can supplement the automatically collected Impala metadata with an extended description, accompanied by custom attributes. Just like the rest of the objects in Arenadata Catalog, Impala service objects can have an owner and be classified according to the level of business criticality.

File:Aquote1.png
The technological landscape data stores the Russian of enterprises is characterized by complexity and fragmentation. In the past, corporate products of foreign vendors were widely used to build QCD; as of May 2024, solutions based on open source are being developed and implemented. In the software long term domestically produced software , it will take preferential positions. It is for this reason that Arenadata Catalog regularly expands the list of connectors to popular data sources and platforms, regardless of their type, developing them independently.

noted Ivan Novosyolov, CEO of DataCatalog
File:Aquote2.png

User quality checks and automatic collection of data profiling metrics are configured for Impala data integrated into the catalog. For example, you can test for duplicate values in a database table or a non-zero value in a column. Based on the results of inspections, a final report on the quality of data is generated. For Apache Impala, it is possible to form a visual origin (Data Lineage) between tables and views, including a generational lineage. Now, looking at the analytical report, you can track the path of data transformation between systems: which attributes of which tables of which database transmitted the information, how in turn they received it, which other information systems are involved.

Arenadata Hadoop (ADH) is an Apache Hadoop-based enterprise distribution for storing and processing semi-structured and unstructured data.

Tasks to be solved:

  • Storage and processing of large volumes of semi-structured and unstructured data of any type (document and content management systems, event storage and recording, sensor data, product catalogs, backup of other DBMS).
  • Distributed information processing.
  • Construction of lakes and data factories (a single center for all company data, quick deployment and folding of sandboxes for pilot projects and testing statistical hypotheses, working with analytical tools in a single environment).
  • Machine learning and artificial intelligence.
  • Data source for QCD.
  • Import substitution of Western systems.

Arenadata Hadoop has received a certificate of state registration of the computer program. The product is included in the unified register of Russian programs for electronic computers and databases.

2023

Arenadata Catalog 0.3 Release with Enhanced Glossary Capabilities

DateCatalog announced on June 20, 2023 the release of Arenadata Catalog 0.3, the next version of the data management tool. The Arenadata Catalog software is intended for organizations wishing to implement Data Governance practices, and allows you to solve the problems of managing the company's information assets and maintaining the corporate business glossary in a single interface. The most significant improvements in this version relate to the Glossary module. The updated functionality will allow users to expand the list of term types, develop an attribute register and perform full-text searches.

Illustration: hevodata.com

In Arenadata Catalog 0.3, the developers significantly expanded the capabilities of the Glossary and added predefined "boxed" types of terms: "business term," "entity," "data attribute," "calculated data attribute" and "indicator." Each type of term has its own set of attributes that users can extend. For the types "entity," "attribute," "calculated attribute" and "indicator," there are special types of "entity-attribute" relationship.

Thanks to the innovations, users will be able to add their own types of terms, and a special constructor will help manage the set of attributes, their order and the obligation to fill in.

In the attribute register, they are fully managed: specifying validation, filling instructions, selecting the number of values, and specifying the default value. Maintaining such a registry allows you to reuse attributes in different types of terms.

Thanks to the functionality of importing data into Glossary, the introduction of software into commercial operation is accelerated, the functionality allows not only to create terms, but also to update existing ones.

This version of Arenadata Catalog implements full-text Glossary search: it can be used to find both data catalog objects and objects of the Glossary itself. Users can also subscribe to Glossary objects such as "terms," "subject areas," and "glossaries," which will allow you to track changes that occur in metadata catalog objects through notifications.

The developers have added a user task management interface. For the administrator, it is available to monitor the execution dates, search for tasks without a performer and the ability to delegate the task to other users if the person responsible for coordination is not available.

In addition to the following, in Arenadata Catalog 0.3:

  • revised; adapter Greenplum
  • a connector for Luxms BI is enabled with the ability to create an automatic Data Lineage before the column of the data source table;
  • blocking the publication of terms if it is impossible to determine those responsible for user coordination;
  • It is possible to add the status of the task due date: "At risk," "Expired," "Norm";
  • added an updated algorithm for generating the name of user tasks. User tasks now contain: "Task Type," "Event Type," "Term Name";
  • for terms, a link with a link type is available.

File:Aquote1.png
This is an expected release both among our customers already implementing Arenadata Catalog and among companies conducting pilot projects. The main functionality of Arenadata Catalog 0.3 is focused on building a comprehensive and flexible conceptual data model that allows business and IT to build a single "Glossary" for communication and description of data. We see the demand for this functionality among customers and the need for flexible support from us for various options for implementing data management processes in companies,
commented Ivan Novosyolov, CEO of DataCatalog.
File:Aquote2.png

File:Aquote1.png
Very often we heard from customers wishes to customize the "Glossary" for their special requirements. Moreover, even at the stage of the birth of Arenadata Catalog, we drew attention to the rather scarce capabilities of the tools on the market for setting objects and the composition of the attributes of the Glossary. And in most Open Source tools they are completely absent. Therefore, we decided to make this functionality one of the main features of Arenadata Catalog and worked for a long time to ensure its maximum versatility and convenience. Now users will be able to create attributes of various types, ranging from standard "string," "number" to such specific ones as "calculation formula," "logical value,"
noted Rasil Saifullin, owner of Arenadata Catalog, DataCatalog company.
File:Aquote2.png

We add that for each attribute you can specify different settings for valid values, prompts and instructions for filling. This makes it possible to flexibly implement almost any requirements for the creation of the Glossary, taking into account the individual aspects and nuances of each industry. With extensive options for setting tolerances, you can reduce errors and improve the accuracy of information management, increasing the trust and frequency of use of the tool among business users.

Arenadata Catalog Capabilities

According to information for March 2023, Arenadata Catalog allows:

  • Integrate metadata from different data processing and analysis systems
  • Search for data and collaborate with metadata
  • maintain an enterprise business glossary and ensure its integration with the data catalog.

Arenadata Catalog is based on open source technologies, fully adapted for use in Russian commercial and government organizations, and includes the Unified Register of Russian Software.

2022

In 2022, Arenadata, a supplier of the big data management platform and Luxms Group of Companies, a supplier of BI and ETL systems (Luxms BI and Luxms Data Boring), joined forces to ensure the efficient use of data by Russian companies and organizations in their activities.

The joint venture Datakatalog"" creates a product to support processes Data Governance - Arenadata Catalog.

The basis of the company's strategy is the creation of an open source software product for the needs of the largest companies in Russia implementing Data Governance approaches:

  • support for metadata integration, including Russian and open-source software
  • architecture based on open metadata exchange standards
  • focus on user experience and usability
  • automatic detection of data subject to regulation in Russia (TIN, addresses, etc.)