RSS
Логотип
Баннер в шапке 1
Баннер в шапке 2

EMC Greenplum HD

Product
The name of the base system (platform): Apache Hadoop
Developers: Dell EMC
Last Release Date: May, 2011
Technology: BI,  Data Mining,  DBMS

The product family of EMC Greenplum HD allows the organizations to get advantages of analytics of Big Data without laid on costs and complexity available in the market of inconvenient tools today. The software of Greenplum HD which is issued in editions Community and Enterprise provides the finished platform including services of installation, training, global support and the services with added value supplementing Apache distribution.

Apache Hadoop quickly turns into a preferred solution for analytics of Big Data during the work with unstructured information. The organizations which look for a method more effectively to work at the fast changing market, understood that the analysis of Big Data gives competitive advantage. Processing of big arrays of unstructured and structured data on the basis of Hadoop using normal equipment cardinally changed analytics. Taking knowledge from the unstructured data generated by computers, the companies can make the correct decisions for increase in profit, improvement of service and cost reduction.

Unique additional functionality of EMC for Hadoop:

  • EMC Greenplum HD Data Computing Appliance — Apache Hadoop is "seamlessly" integrated with the Greenplum database into Greenplum HD Data Computing Appliance. The solution supports foreign tables of Hadoop that allows users to address the data which are stored in Hadoop Distributed File System (HDFS) without their extraction from the file system. Administrators can read and write files in parallel from Greenplum on HDFS that provides fast and simple sharing of information. The cross-platform analysis can be made, using power of Greenplum SQL and expanded functions of analytics for data access in HDFS. The combined solution implements the only finished platform in the industry for analytics of Big Data.
  • EMC Greenplum HD Enterprise Edition — Enterprise Edition for 100% is compatible via interfaces to Apache Hadoop stack. Being compatible to the Hadoop interfaces, Enterprise Edition provides "seamless" portability of applications and at the same time implements the expanded functions demanded in the big organizations including:
    • control functions by data, including instant pictures and replication on long distances;
    • simple loading and data access using the "native" network file system interface (NFS);
    • complete controllability, including simple deployment of a cluster, automatic recognition of failures and the notification about them, management of several platforms and the rolling upgrades function.

  • EMC Greenplum HD Community Edition — Community Edition is completely certified on compatibility with open source and supports a stack of Apache Hadoop which consists of HDFS, MapReduce, Zookeeper, Hive and HBase. EMC Greenplum provides fault tolerance for Name Node and Job Tracker. Both of these components are single points of failure of standard implementations of Hadoop.