[an error occurred while processing the directive]
RSS
Логотип
Баннер в шапке 1
Баннер в шапке 2

Pentaho BI

Product
The name of the base system (platform): Red Hat Decision Manager (before JBoss)
Developers: Pentaho
Date of the premiere of the system: 2005
Last Release Date: 2019/07/16
Technology: BI,  Data Mining,  OLAP

Content

Pentaho Businnes Intelligence is the opensource-project of Pentaho company (San Francisco) for an enterprays-class of a reporting, the analysis, date of mining, automation of office-work and document flow (BPEL standard) and constructions the intranet enterprise portal.

Pentaho BI includes all necessary components of a modern corporate system of data analysis. Among them the developed means of preparation and formation of the analytical reporting, data loading (ETL), creation of graphic dashboards (dashboards), production of knowledge (data mining), creation of an OLAP cubes. Besides, on this platform it is possible to organize a uniform workplace for preparation and the analysis of the reporting, including access through mobile devices. Application of Pentaho BI is especially relevant for the organizations which have diverse information systems and are interested in unification of technology of reporting and data analysis.

2019: Pentaho 8.3 with DataOps support

The Hitachi Vantara company provided on July 16, 2019 Pentaho 8.3 — the latest version of a software platform for integration and data analysis. In Pentaho 8.3 there was a number of the functions developed for DataOps support — methodology of shared control by data which allows the companies to implement the potential of the available digital assets completely. The platform increases flexibility of work with data in any environments, from peripheral to multicloud infrastructure, and at the same time the high level of control of security and quality of data is provided.

File:Aquote1.png
The methodology of DataOps is aimed at that customers had the necessary data in the right place at the right time. New features of Pentaho 8.3 also allow to achieve it — John Maggi, the marketing vice president of Hitachi Vantara noted. — We not only aim to provide economic data storage at optimum level of service, but also to provide functions of search, access and data management. At the expense of it customers have an opportunity to generate unique useful knowledge and to use all economic potential of data.
File:Aquote2.png

According to developers, Pentaho 8.3 includes a number of improvements which should help the organizations to upgrade management practice by data, eliminating "barrier" between data and their effective use. Updates are among:

  • The interface for work with data streams from hardly accessible sources
    • the Connector for SAP provides the simple interface for combination, enrichment and unloading of data from systems SAP ERP and Business Warehouse, at the same time strict observance the politician of data access, configured in the SAP solutions is provided. Such opportunities give deeper understanding of specifics of data and increase value analysts, received on the basis of corporate information.
    • The Amazon Kinesis service provides an opportunity for work with data in real time in the environment of AWS. Thanks to integration tools the platform allows AWS developers to accept and process instead of writing of the code stream data in the powerful visual environment and also to combine them with other data, thereby reducing the volume of "manual" transactions.

  • Visualization of data for optimization of management of corporate data

    • Expansion of integration into the Hitachi Content Platform (HCP) platform which simplifies reading record and updating of the user metadata of HCP and allows to execute easily requests of objects using system metadata. Thanks to it users can use enhanced capabilities of search, and process of receiving analytics becomes more managed and convenient.
    • Integration into the IBM Information Governance Catalog (IGC) tool which reduces the volume of the "manual" transactions necessary for management of corporate data. Extends restrictedly in the form of the beta.
    • Simplification of tracking to data source, received under such popular protocols as AMQP, JMS, Kafka and MQTT.

  • Expanded support of multicloud infrastructures

    • Package loading in AWS Redshift. The most widespread method of data transfer from S3 storages in Redshift is cyclic use of the scenarios SQL for coordinating of package loadings. Using functionality of package loading in Redshift users will be able to increase productivity of transactions considerably.
    • The connector for Snowflake. Snowflake becomes one of the most often used storages cloud data. However data and from other sources, including from cloud services are necessary for many analytical projects. Pentaho 8.3 provides a possibility of combination, enrichment and data analysis from Snowflake storage with data from other sources, including AWS and Google Cloud.

2018

Integration with SuiteCRM

According to the message of December 10, 2018 "the Cube Three" implemented the project of integration of the SuiteCRM and Pentaho BI platforms. Read more here.

Solutions Hitachi Vantara on management of models of machine learning

The Hitachi Vantara company, Hitachi Ltd. affiliated enterprise, in March, 2018 announced creation of the solutions on process management of machine learning designed to help processing and data analysis specialists to test and rebuild models of machine learning in the field of production. The innovative developments of Hitachi Vantara Labs are connected to the pipeline of data created by Pentaho. It allows to increase effectiveness of business and to reduce risks due to simplification of process of updating of models.

As you know, after start of model of machine learning it is necessary to perform constantly its monitoring, testing and retraining according to the changing environment conditions, and after that to restart. It is labor-consuming handwork which is executed rather seldom. Besides, after restart of model forecasting accuracy considerably decreases that has an adverse effect on profitability of business.

In general possibilities of the solutions Hitachi Vantara on management of models of intellectual data processing allow to optimize processes of machine learning in three directions:

  • Quick start of models in the production environment
    • Abilities to manage models of machine learning help to estimate correctly them and to increase forecasting accuracy to start of model on production. For further setup operational groups can test them using different techniques of cross validation and extra selective assessment. Data preparation taking into account specifics of specific algorithms is executed automatically now.

  • Increase in forecast accuracy

    • As a rule, after start of model on production the accuracy of its indications decreases in process of receipt of new data. The complex of estimated means revealing models which give inexact evidences helps to avoid it. Various visualization tools and creations of reports help to analyze quality of work and to reveal errors. At any updates or changes it is possible to carry out easily A/B-testing, having compared models with each other.

  • Joint work and management of transactions in required scale

    • The organizations even more often aim to increase transparency of algorithms of decision making. Opportunities which are offered by Hitachi Vantara promote interaction of employees, provide control of origin of data and also transparency of data sources and its primary functions. Similar level of transparency facilitates sharing of data and pipelines of data commands, standardizes algorithms and gives the chance of their repeated application.

File:Aquote1.png
Machine learning and the artificial intelligence (AI) allow to optimize all aspects of business — from customer interaction before operating activities. The management tools training models developed by Hitachi Vantara provide higher transparency of algorithms and extent of automation thanks to what developers of the company can focus on implementation of innovations, without being afraid of quality degradation of work of models — John Magee, the marketing vice president of Hitachi Vantara considers.
File:Aquote2.png

Solutions on management of models of machine learning are available on Pentaho Marketplace since March 6, 2018. So far these modules are available in the test mode. The next versions will be integrated into Pentaho Data Integration (PDI).

2016

Pentaho Data Integration (PDI)

The organizations are faced by the most difficult task connected with management of the growing volumes of more and more various data and extraction from them valuable knowledge. For November, 2016 the system of data integration Pentaho Data Integration (PDI) allows to get data access from complex and diverse sources and to combine them with the available relational data for receiving high-quality ready to information analysis – and all this without uniform line of the code.

Functionally saturated graphic user interface in combination with a multistream subsystem of data translation provides the possibilities of high-performance extraction, conversion and loading (ETL) capable to satisfy all needs for data integration, including receiving and processing of "Big Data".

Pentaho Data Integration provides:

  • The drag and drop interface simplifying and accelerating creation of flows of processing and analizadanny.
  • Connectivities practically to any data sources, including flat files, relational DBMS, "Big Data", the API interfaces and many other things
  • Integration into transaction databases, such as Oracle, DB2, Postgres, MySQL and others
  • Data access of corporate applications, including Salesforce.com, Google Analytics and to others
  • Support of a set of the Hadoop distribution kits and databases of NoSQL
  • Library of ready components for data access, their preprocessing, combination and cleaning
  • The functionality of orchestrating for management of complex workflows including task scheduling and sending notifications
  • Integration into a data stream of expanded models of analytics from R, Python and Weka
  • Administrative tools, scalings and safety of the corporate level

Big Data

(data are relevant for November, 2016)

The environment of visual design for combination of several sources of "Big Data" and data processing in required scale.

  • Integration with the leading Hadoop distribution kits, NoSQL storages and analytical DB and also with given files of magazines and the JSON/XML formats
  • A possibility of creation of transform circuits of data on Hadoop in the visual interface without writing any code which allows to reach 15-fold increase in productivity in comparison with manual programming and to execute calculations on a high-performance cluster of Hadoop
  • The fast connection of data sources to Hadoop on the basis of templates performed using feature set of loading of metadata (metadata injection)
  • The Adaptive Big Data Layer component providing transparent portability of conversions between the different Hadoop distribution kits
  • Practical solutions for creation in the environment of "Big Data" of data marts on demand

Business intelligence

Locating a range of analytical tools, users can create reports and interactive panels and also make visualization and data analysis in several directions, without involving IT specialists or developers. At the same time divisions of IT get advantage of use of the safe, scalable and managed analytics to all enterprise. The solution Pentaho can be deployed in the territory of the organization or in a cloud and also to embed seamlessly in other applications.

For November, 2016 Pentaho Business Analytics provides following features:

Special analysis and visualization:

  • Library of interactive visualization tools, such as maps, thermal cards, bubble charts and other representations
  • High-scalable caching of the data given in memory for accomplishment of the analysis of large volumes "with a thought speed" using the simple drag and drop interface
  • Possibility of visual filtering and change of scale with lasso for the best understanding or exception of sharp deviations
  • Selection of attributes a contrast color for more evident display
  • Detailing (drill down) for detailed studying of data

Interactive panels

  • The designer of interactive panels intended for business users based on the drag and drop web interface
  • Integration into portals and a possibility of modification of the built-in visualization (mashup) for seamless consolidation of an e-business intelligence into other web applications
  • Various visualization tools with opportunities of navigation and detailing and library of controls in the form of filters
  • The development environment of interactive panels providing the possibilities of analytics adapted to user requirements

Independent creation of reports by users

  • Support of the operational and parametrized reports and a possibility of independent interactive creation of reports on data of the transaction systems
  • Intuitive process of creation of interactive reports using the web interface for business users
  • The designer of reports with support of graphic imposition of pixel-perfect for experienced users

Mobile business intelligence

  • Mobile application for end users with opportunities of a research of data, the interactive analysis and visualization on iPad devices.
  • The optimized work from mobile devices with support of the main gestures, such as filtering by contact, transition on the detail levels and the drag and drop activated by contact
  • An opportunity using mobile devices to create new analytical content and also to browse to edit the existing reports

Predictive analytics

In addition to opportunities of a research of data of Pentaho offers algorithms of machine learning and instruments of data processing. It allows data processing and analysts specialists to reveal patterns and correlations which remain unnoticed when using normal means of the analysis and creation of reports. Possibilities of expanded analytics, such as forecasting of time series, help the organizations to plan results of activity, making a start from deeper understanding of performance indicators of business in the past.

Image:АНАЛИЗ С ПОМОЩЬЮ ТЕПЛОВОЙ КАРТЫ В PENTAHO BUSINESS ANALYTICS.jpg

Built-in means of analytics

The Pentaho platform supporting work in a cloud environment is created especially for embedding and integration in the available applications, portals and processes.

  • A possibility of seamless embedding of visualization tools, reports and interactive panels in existing applications
  • The configured user web interface and API based on web services provide control over appearance and functionality of analytical means
  • Possibilities of deployment in the multi-user environment and also effective integration into mechanisms of security and uniform login (SSO)
  • Adaptable learning process and consultations of specialists of level of system architects.

2013: Release of Pentaho 5.0

Pentaho 5.0 provides to the companies using Big Data, a range of analytical tools for any types and amounts of data, any architecture of IT and any required analysis. The interface simplifies work of the user. Pentaho 5.0 contains over 250 new and advanced features.

Pentaho 5.0 allows analysts to combine all data types, to visualize them, to study for deeper understanding and to prepare reports on their basis. Combination of Big Data 'at a source' allows to save degree of controllability, necessary for the exact and reliable analysis, and data security. The exact, combined practically in real time Big Data are necessary for the analysts working with the visualized data in distributed environment for the timely and exact analysis. The combination which is usually created for the end user requires intermediate stages that often leads to obsolescence of data sets. Thanks to Pentaho 5.0 opportunities in the field of integration of Big Data of analytics can combine surely all data almost in real time and immediately analyze the received results.

Representatives of Pentaho consider that in present conditions integration and certification of popular storages of Big Data guarantees to the companies an opportunity to keep up with the changes happening in an ecosystem of Big Data and readiness for the future. Recently integration of Pentaho with Splunk, Amazon Redshift and Cloudera Impala is performed, certification of MongoDB, Cassandra, DataStax, Cloudera, Intel, Hortonworks and MapR is carried out.

In Pentaho 5.0 there were such new opportunities as restart of tasks, rollback and redistribution of loading, new services REST for the simplified embedding of means of the analysis and transfer of reports in the Internet-applications provided as service.

2010: Structure of a product

For April, 2010 Pentaho Businnes Intelligence — the opensource-project of Pentaho company (San Francisco) for an enterprays-class of a reporting, the analysis, date of mining, automation of office-work and document flow (BPEL standard) and constructions the intranet enterprise portal.

A set of the integrated components standard for BI is a part of a product:

  • Pentaho Reporting JFreeReport - the designer of reports, an analog of popular open-source of the BIRT and JasperReports projects. Can use any DBMS supporting the JDBC interface as data source.
  • Pentaho Data Integration Kettle ETL is the ETL module for integration of the initial systems and Pentaho storage
  • Pentaho Analysis Mondrian OLAP Server - OLAP server allowing to create reports for data analysis online, supports language of requests MDX
  • Pentaho Data Mining Weka (machine learning) is the tool for automation date mining
  • Pentaho Dashboards is the instrument of creation of dashboard for monitoring of key indicators of activity of the enterprise.

2008: Start of release of the solution under the license GNU GPL v.2

The first version appeared in 2005. Since July, 2008 is issued under license GNU GPL v2.