six years ago, an article titled big data requires a big new architecture on forbes put forward the concept of “data lake”, a data storage architecture different from the original data warehouse. numerous original data is saved in the primary formats, including the structured and non-structured data, until the data is processed upon the use.
frankly speaking, many people are certainly in a fog. how can we make such concept easily understood?
it is simpler to understand it in a metaphor!
data warehouse is like a "data lake" of stores of bottled water that are cleaned, standardized and convenient for consumption.
the data size witnesses a geometric growth in the era of big data. the traditional "bottled water store" can no longer afford the numerous data volume and diversification of data source and types. hence, the data lake
- a support of big data which can meet the new architecture of storage demands comes into being.
like a huge water body under the natural state, the data lake gathers the brooks from different data source, including the numerous unordered and non-structured data (text, image, voice, webpage, etc.).
the data lake stores numerous primary data and supports all types of data. users can mine the data value according to their business needs and use scenarios.it can be said that the data lake endows the infrastructures with a new definition, that is, multiple clouds over the data lake.
we need to resolve a few problems before building a data lake.
1. storage of numerous data - building of a lake
2. convergence of data brooks - introduction of water
3. data processing and analysis - utilization
4. satisfaction of different user demands - value
so, the question is whether there is an enterprise that can promote the convergence of data resources of multiple parties, introduce water into the lake, have the advanced technologies and products for the storage of numerous data, utilize the advanced technology for data processing and analysis by water introduction and lake building, meet the customer demands finally by use of lake water, and achieve the value!!! surely of course, there should be.
it owns the good background of central state-owned enterprise, strong brand influence, rich modes of commerce, unique core technologies, powerful financing capacity and practical ability;
this is exactly e-hualu.
in 2016, e-hualu unveiled the "god's perspective". through accurately identifying the technical tendency and direction of industry reform, it made out the main tendency of future industry, and launched e-hualu urban data lake of intelligent and integrated information infrastructure which integrates data sensing, storage and analysis in view of its advantages. firstly, the optomagnetic integrated storage - a necessary device of data lake storage.
through this device
cold and thermal data are combined perfectly!
fearless of virus blackmail!
fearless of the attack of hackers!
fearless of intentional or unconscious data change!
resistant to water and electromagnet, power-saving!
the service life is 50 years. most important - it is rather cheap! secondly, the unparalleled combination of big data platform plus artificial intelligence, supported by various industry sub-lakes, data analysis and presentation, model training and other value-added elements, change the data lake into a data quagmire. at last, the diversified demands of governments - users which can meet the orientation of urban data lake
what we need to consider is not merely gdp!!!!
reform on the supplies
poverty alleviation and development of people's livelihood
mass entrepreneurship and innovation e-hualu urban data lake helps dakang secretary
in 2017, beijing e-hualu information technology co., ltd. entered into the contract on project of east china data lake industry park of china hualu successfully with taizhou. upon the completion, the project will be the big data infrastructure with the largest capacity in the world. the project of data lake project is located in gaoxin district, jiangyan, taizhou, with a total investment of rmb3 million and a planned area of 200mu (133,333.4m2
). the phase-i works will be put in service in 2018, and the phase-ii works is to be compelted in 2020.