
Microsoft Azure Data Lake

Product
Developer: Microsoft
System launch date: 2015/04/30
Technology: SaaS (Software as a Service), DWH

Microsoft Azure Data Lake is an open repository for Big Data that can store data of any type and schema, with no fixed limits on file size or account size.

On April 30, 2015, Microsoft announced Azure Data Lake.

Data Lake is an HDFS-compatible Hadoop file system integrated with Azure HDInsight. Integration with Revolution-R Enterprise, Hortonworks, Cloudera, Spark, Storm, Flume, Sqoop, Kafka and others is planned for the future.

Expanded version of Azure Data Lake released

On September 29, 2015, Microsoft presented an expanded version of Azure Data Lake[1].

Presentation of Azure Data Lake (2015)

"Microsoft pays close attention to developing its platform for working with Big Data. We create convenient tools for processing information of any type and volume, which our customers can use both in the cloud and in on-premises infrastructure," said Dmitry Marchenko, head of cloud platform promotion at Microsoft in Russia. "Our goal is to make Big Data technologies simpler and available to the widest possible range of users: developers, analysts, scientists and IT specialists. And we hope that the update of Azure Data Lake will be a big step toward achieving it."
  • Azure Data Lake Store is a flexible, scalable data repository for working with unstructured, semi-structured and structured information. For the first time, it makes it possible to collect information of any type and size, access it and analyze it while avoiding disruptions to production processes and maintaining a high level of network security, which is crucial, for example, for the stable operation of IoT scenarios. The platform will become available to users in the near future.

  • Azure Data Lake Analytics is a data analysis service built on Apache YARN for work in the cloud. It handles information of any scale and regulates network load. Under its service model, customers pay only for the periods when the service is actually used, and it also supports Azure Active Directory. This makes Azure Data Lake Analytics not only an effective but also an economical solution.

This version of Azure Data Lake included the Azure HDInsight service, built on the Apache Hadoop platform. HDInsight can spin up an unlimited number of nodes in a matter of minutes. As one of the fastest-growing solutions in the Azure cloud, HDInsight gives users the broad capabilities of the Hadoop ecosystem as a managed service supported by Microsoft specialists. The service has now become available to users of the Linux platform. The corporation is working on a version for Ubuntu.

Hadoop ISV, an application package for information management, was also part of this Data Lake release. It includes tools for continuous Big Data analytics, such as Datameer; data protection and information management technologies, Dataguise and BlueTalon; as well as DataTorrent and the visualization tools AtScale and Zoomdata.

Notes