Developers: | Oracle |
Date of the premiere of the system: | October 3, 2011 |
Last Release Date: | November, 2013 |
Technology: | BI, Big Data, Data Mining, DBMS, DWH |
Content |
Oracle officially submitted the Big Data Appliance system during the OpenWorld conference to San Francisco. This complete solution which will fill up the newest line of corporation created on a joint of software products of Oracle and server platforms which got to it after Sun purchase.
Big Data Appliance includes a program framework with the open code Hadoop, Oracle Data Integrator Application (adapted for Hadoop), Oracle Loader for Hadoop, a distribution kit of also open analytical statistical system and the Oracle database of NoSQL.
The vice president of corporation for server technologies and databases Andy Mendelssohn so commented on start of a product: "Today there is a set of data. Most of them have the small importance for business. There are information particles which people need really to find. Hadoop and other tools distil these data in search of significant data. The solution can be used together with hraniliza of data, such as Exedata, for the further analysis", - he noted.
According to the official statement, Oracle is going to provide all products which were included in the package of Big Data Appliance, separately and including without "iron" platform. The exact price of the solution and receipt date it in sale are not called yet. Let's note that Big Data Appliance already has competing products from other vendors - Aster Data, Netezza and Greenplum.
At the OpenWorld Oracle conference made already a number of important announcements. So, the new solution – Exalytics Intelligence Machine was presented a day earlier. It is in-memory the database created specially as the answer of in-memory to the SAP HANA platform.
The machine Exalytics consists of 40 main cores and has 1tb DRAM, thus it has an opportunity due to special technologies of compression to work with amounts of data of 5-10 of Tb. A new system works at software stack from Oracle which in-memory the database TimesTen, BI tools and the Essbase OLAP server (online analytical processing) enter.
All these new solutions will add Exadata. The analyst of Forrester James Kobielus noted that when it comes about processing of large volumes of data, first of all the solution power, information processing rate and a variety of data structures are important. "Exadata has all this. The solution is optimized for the mixed workloads and mass parallel operation and has rich library of algorithms and models of the analysis", - he noted.
In January, 2012 the Oracle corporation announced receipt in sale of Oracle Big Data Appliance, the optimized hardware and software system designed to help customers with obtaining the maximum advantages of use of "Big Data" (Big Data) for business
Oracle Big Data Appliance represents the optimized complex integrating hardware and software products, vklyuchayacloudera's Distribution with Apache Hadoop and Cloudera Manager and also a distribution kit of coding environment R open source.
The Oracle Big Data Appliance complex working running operating system Oracle Linux also includes DBMS Oracle NoSQL Database Community Edition and Java Oracle HotSpot Virtual Machine.
Oracle also announced the beginning of sales of the Oracle Big Data Connectors software product which helps customers to integrate with ease the data saved in Hadoop and Oracle NoSQL Database with Oracle Database 11g.
Oracle Big Data Appliance with a software package of Oracle Big Data Connectors, in combination with the optimized hardware and software systems Oracle Exadata Database Machine, Oracle Exalogic Elastic Cloud, and Oracle Exalytics In-Memory Machine, provides to customers all necessary for receiving, systematization and the analysis of "Big Data" within all corporate information array.
Compliance to requirements for management of "Big Data"
The Oracle Big Data Appliance complex intended for simplification of management and use of "Big Data" is delivered in a complete rack (full rack) configuration from 18 servers and contains in total:
- 864 GB of RAM;
- 216 main cores;
- 648 TB "crude" disk memories;
- Network infrastructure of InfiniBand with capacity of 40 Gbps between nodes of a complex and other optimized hardware and software systems of Oracle; and
- Interfaces of Ethernet with capacity of 10 Gbps for connection to all other components of a data processing center.
The new optimized hardware and software system can be scaled using connection of several racks in a uniform cluster via the network InfiniBand interface that allows to receive, systematize and analyze superlarge volumes of data.
"Oracle Big Data Appliance in combination with Oracle Exadata, Oracle Exalytics and Oracle Exalogic Elastic Cloud offers the most extensive and integrated product portfolio which is designed to help customers with receiving and systematization of different data types and also in the analysis of these and other available corporate data that allows to take new important knowledge and to be most informed at decision making", – Cetin Ozbutun, the vice president of Oracle for the Data Warehousing Technologies direction noted.
The product Cloudera's Distribution, the including Apache Hadoop (CDH) – the most complete is a part of Oracle Big Data Appliance, the checked, steady and widely used in commercial and non-commercial environments Hadoop Oracle Big Data Appliance distribution kit also includes Cloudera Manager, the industry-first application for complex (end-to-end) management of Apache Hadoop.
Released before Oracle NoSQL Database represents the distributed "key/value" (key-value) DBMS intended for management of large volumes of data. Oracle NoSQL Database is horizontally scaled to hundreds of nodes, provides high data availability, predictable levels of capacity and waiting time, requiring at the same time the minimum administration. The Oracle Big Data Appliance complex can work with Oracle NoSQL Database DBMS in editions Community Edition and Enterprise Edition.
The Oracle Big Data Appliance complex is specially designed to help customers:
- To quickly start the scalable system of high availability to management of data bulks;
- Create the high-performance platform for systematization, processing and the analysis of "Big Data" in the environment of Hadoop and also for use of statistical applications in language R with sources of primary data; and
- Control IT costs thanks to preliminary integration of all equipment rooms and program components into single solution for "Big Data" which supplements corporate data warehouses.
Optimization of integration of "Big Data" with corporate data warehouses
The software package Oracle Big Data of Connectors is delivered for use both with the Oracle Big Data Appliance complex, and with other systems based on Apache Hadoop. The delivery includes:
- The Oracle Loader for Hadoop loader – uses the MapReduce mechanism for effective data loading in Oracle Database 11g DBMS;
- The Oracle Data Integrator Application Adapter for Hadoop adapter – allows Oracle Data Integrator to generate the Hadoop MapReduce programs through idle time in use the graphical interface;
- The module of interface Oracle Connector R – provides to users of applications R quick and effective access to a distributed file system of Hadoop Distributed File System (HDFS) and a reference platform of programming of MapReduce; and
- The module of interface Oracle Direct Connector for Hadoop Distributed File System (ODCH) – provides to Oracle Database trouble-free data access from the Hadoop Distributed File System file system through SQL.
Oracle Big Data Connectors and Oracle NoSQL Database DBMS can be delivered as separate software products, irrespective of the optimized hardware and software system Oracle Big Data Appliance.
Oracle Big Data Appliance X3-2
Oracle Big Data Appliance X3-2 is the cost-efficient optimized hardware and software system which underwent upgrade and is equipped with the latest Intel processors, the new version of the Cloudera Distribution of Apache Hadoop (CDH) and Cloudera Manager distribution kit and also the new connected module Oracle Enterprise Manager for Big Data Appliance.
In Oracle Big Data Connectors the accessibility to Hadoop is improved: SQL access from Oracle databases became better, and access from the applications written in language R — is more transparent.
Oracle Big Data Appliance with the Oracle Big Data Connectors software products, in combination with Oracle Exadata Database Machine and Oracle Exalytics, provides to customers a full range of the optimized hardware and software systems for obtaining, systematization and the analysis of "Big Data". New versions increase data processing performance, expand amount of memory, improve integration and abilities to manage.
The Oracle Big Data Appliance X3-2 hardware contains 8-core Intel Xeon processors of E5-2600 series. In comparison with the previous configuration from 18 servers with a capacity of "crude" disk memory of 648 of Tb the new version offers:
- 33% more than computing power thanks to 288 main cores;
- 33% more than RAM counting on a node at the total amount of RAM in 1.1 Tb;
- up to 30%. economy on a power supply and cooling of the equipment.
Oracle Big Data Appliance X3-2 simplifies implementation and management of solutions for "Big Data" thanks to integration of all equipment rooms and program components necessary for collecting, systematization and the analysis of "Big Data". Oracle Big Data Appliance X3-2 includes:
- support of CDH4.1, including software update developed together with Cloudera company for implementation of high availability of NameNode in the environment of Hadoop. It allows to eliminate vulnerable elements which failure leads to failure of all system in the cluster Hadoop configurations;
- the new version of Oracle NoSQL Database Community Edition 2.0 which provides the improved integration about Hadoop and flexible scaling and also contains new interfaces for programming, including support of JSON and C;
- the connected module Oracle Enterprise Manager for Big Data Appliance which supplements possibilities of Cloudera Manager, facilitating management of Hadoop cluster;
- the updated distribution kits of Oracle Linux and Java Oracle Development Kit;
- the updated distribution kit R open source optimized for work with high-performance multistream libraries of mathematical functions.
Oracle Big Data Connectors is a set of software products created by Oracle for integration of Apache Hadoop with DBMS Oracle, Oracle Data Integrator and the Oracle R Distribution distribution kit.
Improvements of Oracle Big Data Connectors expand abilities to integrate "Big Data". The new version of Oracle Big Data Connectors in a dopoleniye to updates of all modules offers:
- the module of interface Oracle SQL Connector for Hadoop Distributed File System for performance improvement of SQL queries to the data saved in Hadoop from Oracle databases. The increase in productivity is reached due to additional automation and improvement of functionality of requests. The new module is also supported in Oracle Data Integrator Application Adapter for Hadoop;
- transparent access to language of requests Hive Query from applications R and implementation of the new analytical techniques executed in Hadoop that increases efficiency of application developers in language R thanks to improvement of access to Hadoop from environment R.
2013: Protection of the basis of the distributed processing
At the Oracle OpenWorld conference on September 22-26, 2013 the corporation announced improvements in processing systems of Big Data. In particular, the hardware and software system Big Data Appliance provides "protection of a corporate class" of a distributed processing system of data of Hadoop now.
Big Data Appliance supports authentication under the Kerberos and LDAP protocols, is integrated with the protective Oracle Audit Vault and Database Firewall system. A system conducts monitoring of magazines of registration of events of Hadoop and generates warning to administrators.
The new software module of Perfect Balance for Big Data Appliance executes balancing of loading, accelerating accomplishment of tasks of MapReduce. In Oracle developed the connector providing a possibility of poll and conversion of XML documents using the XQuery language for Hadoop.
Oracle Big Data Appliance X4-2
On November 14, 2013 the Oracle corporation announced the beginning of sales of the hardware and software system Oracle Big Data Appliance X4-2 as a part of which a complete technology stack of Cloudera Enterprise, support of the disk capacity 33% more - in the amount 864 TB on one hardware rack is implemented.
Description
Oracle Big Data Appliance X4-2 represents the complex platform for work with Big Data optimized both for package and for data processing in real time. The platform uses the software Cloudera Distribution for Apache Hadoop Oracle NoSQL Database, Cloudera Impala and Cloudera Search to provide compliance to requirements to computing resources.
The enterprises will receive more resources for data storage, using Oracle Big Data Appliance X4-2 that will help them to create economically more profitable platform for work with Big Data, helping that with creation of new advantages to business.