RSS
Логотип
Баннер в шапке 1
Баннер в шапке 2

HP Vertica DBMS

Product
Developers: Vertica
Last Release Date: 2014/12/02
Technology: DBMS

Content

2022: Availability from VK Cloud Solutions

On February 3, 2022 the company "VK Digital technologies" reported that clients of VK Cloud Solutions will be able to use Vertica, the high-performance platform for the analysis of big data from a portfolio program obespecheniyamicro Focus, in a cloud. Vertica operates petabytes of data, executing requests in milliseconds, and a compressed storage format saves up to 90% of disk space. High performance and analytics make this DBMS a relevant tool for building business intelligence and big data systems. More details here.

2018

Flexible tables technology

The Vertica Big Data Analysis Platform, developed by Micro Focus, implements technology for storing data in the so-called common or flexible format - flexible tables, or flexible tables. The initial data is loaded into the DB in "raw" form, without changes. This was reported on April 24, 2018 in the company "Complit."

All necessary transformations are defined directly in the process of data processing: with the definition of so-called map maps (map) or using castomized transformation functions (transformation UDx). The first option is suitable for storing and processing CSV, JSON and XML formats. The second allows you to work with any other format, for example, to process data in the ASN.1 BER format.

As a result, the analyst has full access to all the source information within the same ecosystem - this is done using the usual language of structured SQL queries, which is well known to specialists.

An approach to organizing "flexible data" would not be effective if it were not for a cluster infrastructure with mass parallel data processing (MPP). Flexible tables in Vertica are organized as regular tables. The methods of distributing data between cluster nodes apply to these types of tables in the same way as to fixed tables. The transformation of data in them takes place in parallel on the involved nodes of the cluster. This increases processing performance by scaling horizontally. The analyst instantly receives information from flexible tables and based on them can make a decision and revise existing analytical models much faster.

In St. Petersburg, the Vertica stand is deployed in the Complit demo center. There you can quickly organize any pilot and bench project, the warehouse always has the equipment of the desired configuration. The company will provide site and its own data center prepared for technical testing.

2014

HP Vertica OnDemand

On December 2, HP announced the release of Vertica OnDemand, a solution with functionality for enterprise-level Big Data analytics through the cloud.

A wide range of built-in analytic features are available to users with the highest flexibility and performance. Simplicity, according to developers, is one of the main advantages of the solution.

HP Vertica OnDemand is expected to enter the market in Q1 2015.

HP Vertica uses SQL to access Hadoop

On November 19, 2014, it became known that the company joined HP the community of developers using SQL to access Hadoop[1] The company released the HP Vertica for SQL on Hadoop supplement to its analytical cluster DBMS Vertica with column storage.

According to HP, the company's solution supports a wider range of SQL statements, including join and merge, scales better than major competitors and is able to integrate with all popular Hadoop distributions.

Support is also reported for Parquet and ORC file formats and an attractive pricing model for the number of cluster nodes. Vertica for SQL on Hadoop uses its own tools to manage and administer, without relying on the standard YARN (Yet Another Resource Negotiator) for this distributed environment.

HP Vertica "Dragline"

On May 28, 2014, HP announced a new release of Vertica's analytical platform, Dragline. The solution provides access to new ways of obtaining, researching and storing data, is fast, cost effective and serves more users.

To quickly achieve success and lower total cost of ownership, organizations should be able to place information in the most appropriate storage conditions, and quickly study data to extract valuable information from them.

HP Vertica Dragline offers:

  • Technologies created as part of HP's Maverick project, including the Live Lookups feature, significantly accelerates the execution of many simultaneous requests by processing data as it arrives. HP Dynamic Workload Management, in turn, dynamically allocates the necessary amount of resources depending on the complexity of the received request - this can be both a simple situational request and a composite one, which takes much longer;

  • Advanced SQL over Hadoop support and cost-effective storage eliminates the need for data migration and supports more formats, including Parquet, Thrift, Avro, and CEF. With the most appropriate business analysis and visualization environments, enterprises can load, explore, and visualize data faster, without unnecessary complexity.

  • A strategic information lifecycle management plan can now be implemented more cost-effectively by providing access to multiple tiers of storage: older, rarely sought-after production data is proposed to be placed in Hadoop without moving it or using any adapters;

  • Extensive capabilities of specialized analytics. HP Vertica Dragline is equipped with a mechanism for analyzing the emotional color of Twitter posts and any short text messages, as well as an improved system for analyzing geospatial information. By combining information about the emotional color of the text with business data, organizations can quickly find out how members of online communities evaluate a particular brand, product or service;

The text search system allows you to analyze text information of various types, including processing automatically generated transaction logs and analyzing the emotional assessment vocabulary of short texts, such as tweets or product reviews.

HP Vertica Dragline contains analytical tools for solving problems:

  • Increase market share and acquire competitive differences. Utility operators and energy companies will be able to install smart meters and inform subscribers about the level of consumption and opportunities to reduce costs. Telecom operators, in turn, will be easier to implement personalized charging services in accordance with the legal requirements of some countries;

  • Save hardware and system resources. Analysts, reporters, and data experts can manage mixed workloads through dynamic resource management mechanisms that reduce total cost of ownership of systems;

  • Forecasting and preventing outflow of clients, thanks to the use of powerful means of analyzing the emotional coloring of statements in social media. These tools allow you to identify dissatisfied customers and quickly offer them individual discounts;

  • Personalized marketing. By combining Big Data with customer location information, businesses have the ability to run targeted advertising campaigns targeting different geographic areas. Now retailers can use mobile technologies and attract the most promising customers, based on where they are, as well as which products and brands prefer.

Sales of HP Vertica Dragline in the countries of the world will begin in August 2014.

2013

Vertica Crane

On December 2, 2013, HP announced the release of the updated HP Vertica Analytics platform under version 7 and the name HP Vertica Crane.


Description

HP Vertica Crane significantly simplifies the analysis of semi-structured data - "dark data" [2]., Has improved integration with Hadoop, and also offers a higher level of reliability and performance.

Processing semi-structured information from social networks, web logs, sensors, and the Internet of Things remains a major challenge in achieving the benefits of big data platforms. Such data often requires too long loading into traditional analytics and storage tools to become structured and "understandable," as a result of which they are simply not taken into account.

HP Vertica Crane solves the problem by using HP Vertica Flex Zone, an innovative solution that allows you to quickly load, analyze and use various semi-structured data.

HP Vertica Crane has automatic systematization functions, which eliminates the need for complex and time-consuming coding before loading data. In addition, you can quickly create a schema and apply it to a dataset. This allows analysts and business users to visualize information without the need for inefficient and expensive tools to convert data from database and storage.

HP Vertica Analytics supports various standard analytics and visualization tools. An open approach helps you use different dashboards to analyze patterns and dependencies across an array of structured and semi-structured data.

The information stored in Hadoop Distributed File System (HDFS) is valuable for business intelligence, but is often difficult to use in traditional databases. That is why HP Vertica Crane offers the industry's most open SQL-on-Hadoop architecture. Unlike other SQL-on-Hadoop solutions, the HP Vertica platform is compatible with the largest Hadoop infrastructures, which ensures high performance of data analysis of any type and from any source.

In addition, HP Vertica Crane supports direct integration with HCatalog, the Hadoop table storage layer. This allows customers to easily find the right data in Hadoop and upload it to HP Vertica Analytics for analysis.


Advantages

Benefits of HP Vertica Crane Platform:

  • The new Java developer kit, in addition to existing support for C/C + + and statistical language R, enhances analytics and protects investments.
  • Support for the Kerberos network authentication protocol for all database drivers, including the HDFS connector, allows you to meet the most stringent security requirements.
  • The updated Amazon Machine Image (AMI) engine, which includes Cloud Scripts for flexible management of Amazon EC2 cluster environments, simplifies cloud deployment.
  • The MyVertica community offers all the necessary information about the features and improvements of the new platform.


Availability

HP Vertica Analytics Platform 7 and HP Vertica Flex Zone will go on sale worldwide in December. Additional information is available on the developer's website.

2012

Vertica 6

Vertica 6 allows companies to connect to any data source to manage and investigate it. The unique Vertica FlexStore architecture provides flexible Big Data analytics that integrate closely with Autonomy and Hadoop technologies, as well as with any source of structured, unstructured, or semi-structured information.

In the new version, the Vertica distributed computing platform is expanded: it allows you to perform parallel tasks implemented in the analytical programming language R. In addition, Vertica 6 has improved support for deploying in the cloud and SaaS execution and expanded functions designed for environments with mixed workloads. Thus, Vertica 6 is the most comprehensive Big Data analysis platform available today.

As part of HP's strategy under the motto "100% Enterprise Data," the company made it possible to implement the Autonomy Intelligent Data Operating Layer (IDOL10) system in each Hadoop node. This gives users more than 500 HP IDOL features, including automatic classification, clustering, information extraction (education), and hyperlink generation. Consisting of Autonomy IDOL, Vertica 6 and HP AppSystem for Apache Hadoop, the solution kit is an unparalleled platform for processing and interpreting huge sections of heterogeneous data.

Vertica 5.1

Vertica's column-oriented analytical DBMS is designed to quickly load and analyze large amounts of data, and is often used to perform real-time analytics. Last year, Vertica was purchased by Hewlett-Packard. Since then, it has been integrated with the unstructured data indexing system Autonomy IDOL (also acquired by HP in 2011). As a result, a package for analyzing both structured and unstructured data called HP Next Generation Information Platform was born.

In Vertica 5.1, in addition to the new GUI (based on a simple terminal xTerm), various improvements in drivers and data access protocols are implemented. Completely rewritten ODBC and JDBC drivers to connect the DBMS to C and Java applications, respectively. Vertica 5.1 also includes a connector for reading and writing data to the Apache Hadoop system. Perhaps this is the most important feature of the new version, analysts at Ovum say. Big Data analysis tools like Apache Hadoop have so far been missing from the combined Vertica/IDOL platform.

Notes

  1. ON materials www.pcweek.ru/infrastructure/article/detail.php?ID=168821.
  2. Under the term "dark data," Gartner understands the totality of information resources that organizations collect, process and store in their systems, but which are not used for other purposes (analytics, relationship management and direct monetization). Like "dark matter" in physics, "dark data" often constitutes a significant (if not overwhelming!) Part of the organization's information assets. Often organizations don't get rid of dark data just for compliance reasons. In other words, the cost of storing and protecting such data can repeatedly outweigh the value that an organization is able to extract from them