[go: up one dir, main page]

CN103064902A - Method and device for storing and reading data in hadoop distributed file system (HDFS) - Google Patents

Method and device for storing and reading data in hadoop distributed file system (HDFS) Download PDF

Info

Publication number
CN103064902A
CN103064902A CN2012105502202A CN201210550220A CN103064902A CN 103064902 A CN103064902 A CN 103064902A CN 2012105502202 A CN2012105502202 A CN 2012105502202A CN 201210550220 A CN201210550220 A CN 201210550220A CN 103064902 A CN103064902 A CN 103064902A
Authority
CN
China
Prior art keywords
cold
data
hot
check code
data blocks
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2012105502202A
Other languages
Chinese (zh)
Inventor
庄进发
章正道
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xiamen Meiya Pico Information Co Ltd
Original Assignee
Xiamen Meiya Pico Information Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xiamen Meiya Pico Information Co Ltd filed Critical Xiamen Meiya Pico Information Co Ltd
Priority to CN2012105502202A priority Critical patent/CN103064902A/en
Publication of CN103064902A publication Critical patent/CN103064902A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides a method and device for storing and reading data in a hadoop distributed file system (HDFS). The storing method includes: obtaining a cold and hot value of a file data block in the HDFS according to a data cold hot distinguishing mechanism, and comparing the cold and hot value and a set distinguishing threshold; if the cold and hot value is not larger than the distinguishing threshold, the file data block is cold data; dividing the file data block of the cold data into n blocks, and calculating m verifying coding blocks corresponding to the n data blocks, wherein m and n are positive integral numbers, and m<n; and storing the n data blocks and the m verifying coding blocks. The storing method and the storing system can achieve distinguishing of the cold data and hot data, stores the cold data in distinguishing mode, and reduces storing cost of the cold data in the HDFS. The reading method and the read system can achieve distinguishing of the cold data and the hot data, read and recover the cold data in distinguishing mode, and reduce time and cost for reading and recovering the cold data.

Description

Data storage method, read method and storage system, reading system among the HDFS
Technical field
The present invention relates to the data storage and read the field, especially, relate to data storage and read method and storage and reading system among a kind of HDFS.
Background technology
Hadoop is by the leading cluster distributed project of Apache fund, mainly comprises two nucleus modules: Map/Reduce programming mode and HDFS(Hadoop distributed File System) distributed file system.Wherein, mainly the many back mechanisms (being generally three parts) by file data blocks, heartbeat mechanism etc. are realized high availability, the cluster extendability of data, the characteristics such as high-speed read-write of data to HDFS.Because the above-mentioned characteristic of HDFS, at present, the esbablished corporation of storage reaches nearly thousand take HDFS as the fundamental construction cloud.
In the storage and read-write of dsc data, there is not any problem in the memory mechanism of HDFS, but in cold data storage and read-write, because this memory mechanism reckons without the differentiation storage to hot and cold data, therefore, can there be the higher problem of financial cost owing to cold data storage takies too many hard-disk capacity.For example, the cost of supposing hard disk is 1 yuan/G, its cost of memory mechanism according to HDFS just needs 3 yuan/G, the data of storage 1TB just need 3*1024=3072 unit, the data of storage 1PB just need 3*1024*1024=3145728 unit, that is to say, the storage of 1 part of storage backup piece of every reduction PB level just will save more than 100 ten thousand.
Hence one can see that, need the urgent technical matters that solves of those skilled in the art to be exactly: how to solve detection, differentiation and the difference storage of cold and hot data among the HDFS of prior art and read cold data, to reduce among the HDFS cold data storage and to read the high problem of cost.
Summary of the invention
Technical matters to be solved by this invention provide in HDFS, can distinguish cold and hot data and can to cold data distinguish storage method and system and cold data are distinguished the method and system that reads.
The invention provides data storage method among a kind of HDFS, it comprises the steps: to obtain by the cold and hot determine mechanism of data the cold and hot value of the file data blocks among the HDFS, and the discrimination threshold of described cold and hot value and setting is compared; If described cold and hot value is not more than described discrimination threshold, then described file data blocks is cold data; The file data blocks of described cold data is divided into the n piece, and calculates m check code piece corresponding to a described n data block, wherein, m, n are positive integer, and m<n; A described n data block and m check code piece are stored.
Preferably, described cold and hot determine mechanism comprises by the cold and hot discriminant parameter value of each file data blocks of testing mechanism acquisition sets up the cold and hot chained list of data, goes out the cold and hot value of each file data blocks in the described cold and hot chained list according to described parameter value calculation; Described cold and hot determine mechanism also can comprise by reading the configuration file that comprises the cold and hot discriminant parameter of data in each file data blocks, to obtain the cold and hot discriminant parameter value of each file data blocks, go out the cold and hot value of described each file data blocks according to described parameter value calculation.
Preferably, described check code piece is RS check code piece, and described RS check code piece calculates by model moral gate matrix.
Preferably, the file backup piece of described cold data is one.
The present invention also provides a kind of read method corresponding with data storage method among the HDFS, it comprises the steps: to obtain by the cold and hot determine mechanism of data the cold and hot value of the file data blocks among the HDFS, and the discrimination threshold of described cold and hot value and setting is compared; If described cold and hot value is not more than described discrimination threshold, then described file data blocks is cold data; N file data blocks in the described cold data read; Calculate m check code piece corresponding to a described n data block, wherein, m, n are positive integer, and m<n; If read failure, then choose the individual file data blocks of m ' according to the check code principle and carry out code restoration; Wherein, m ' [m, n].
Preferably, described check code piece is RS check code piece, and described RS check code piece calculates by model moral gate matrix.
The present invention also provides the corresponding storage system of data storage method among the HDFS a kind of and of the present invention, it comprises: comparing unit, be used for obtaining by the cold and hot determine mechanism of data the cold and hot value of the file data blocks of HDFS, and the discrimination threshold of described cold and hot value and setting is compared; Judgement unit is used for when described cold and hot value is not more than described discrimination threshold, described file data blocks is differentiated be cold data; Cold data division unit is used for the file data blocks of described cold data is divided into the n piece, and calculates m check code piece corresponding to a described n data block, and wherein, m, n are positive integer, and m<n; Cold data storage cell is used for a described n data block and m check code piece are stored.
Preferably, described comparing unit comprises detection sub-unit, sets up the cold and hot chained list of data for the cold and hot discriminant parameter value of each file data blocks that obtains by testing mechanism, goes out the cold and hot value of each file data blocks in the described cold and hot chained list according to described parameter value calculation; Described comparing unit also can comprise the parameter acquiring subelement, be used for reading the configuration file that comprises cold and hot discriminant parameter of each file data blocks, to obtain the cold and hot discriminant parameter value of each file data blocks, go out the cold and hot value of described each file data blocks according to described parameter value calculation.
Preferably, described check code piece is RS check code piece, and described RS check code piece calculates by model moral gate matrix.
Preferably, the file backup piece of described cold data is one.
The present invention also provides the corresponding reading system of data storage system among a kind of and above-mentioned HDFS simultaneously, and it comprises; Comparing unit is used for obtaining by the cold and hot determine mechanism of data the cold and hot value of the file data blocks of HDFS, and the discrimination threshold of described cold and hot value and setting is compared; Judgement unit is used for when described cold and hot value is not more than described discrimination threshold, described file data blocks is differentiated be cold data; Cold data-reading unit is used for n file data blocks of described cold data read; The check code computing unit is used for calculating m check code piece corresponding to a described n data block, and wherein, m, n are positive integer, and m<n; Data recovery unit is used for reading failure when cold data, then chooses the individual file data blocks of m ' according to the check code principle and carries out code restoration; Wherein, m ' [m, n].
Preferably, described check code piece is RS check code piece, and described RS check code piece calculates by model moral gate matrix.
Data storage method and storage system among the HDFS provided by the invention, realization is distinguished cold and hot data, and cold data are distinguished storage, can obviously reduce the high problem of cold data carrying cost among the existing HDFS, especially for the above data storage of PB level, can greatly reduce cold data storage cost among the HDFS; Read method and the reading system of data among the while HDFS provided by the invention, can realize cold and hot data are distinguished, and cold data are distinguished read and recover, thereby can reduce the time that cold data read and recover, simultaneously, carry out data by cryptoprinciple especially RS cryptoprinciple and recover, can obviously reduce the quantity of losing of file data blocks.
Description of drawings
Fig. 1 is the process flow diagram of data storage embodiment of the method among the HDFS of the present invention;
Fig. 2 is the process flow diagram of the read method embodiment of data among the HDFS of the present invention;
Fig. 3 is the schematic diagram of data storage system embodiment among the HDFS of the present invention;
Fig. 4 is the schematic diagram of the reading system embodiment of data among the HDFS of the present invention.
Embodiment
For above-mentioned purpose of the present invention, feature and advantage can be become apparent more, the present invention is further detailed explanation below in conjunction with the drawings and specific embodiments.
With reference to Fig. 1, show the process flow diagram of data storage embodiment of the method among the HDFS of the present invention, comprise: step S11, obtain the cold and hot value of the file data blocks among the HDFS, and the discrimination threshold of described cold and hot value and setting compared by the cold and hot determine mechanism of data; Step S12, if described cold and hot value is not more than described discrimination threshold, then described file data blocks is cold data; Step S13 is divided into the n piece with the file data blocks of described cold data, and calculates m check code piece corresponding to a described n data block, and wherein, m, n are positive integer, and m<n; Step S14 stores a described n data block and m check code piece.
By data storage method among the HDFS of the present invention, can realize cold and hot data are distinguished, and cold data are distinguished storage, can obviously reduce the high problem of cold data carrying cost among the existing HDFS, especially for the above data storage of PB level, can greatly reduce cold data storage cost among the HDFS.
Wherein, in step S11, according to system and user's request, can the different cold and hot determine mechanism of data of free choice for use.Preferably, the cold and hot determine mechanism of described data can be to set up the cold and hot chained list of data by the cold and hot discriminant parameter value of each file data blocks of testing mechanism acquisition, described cold and hot discriminant parameter value comprises data storage and reading frequency value, data storage and reads the time equivalence, goes out the cold and hot value of each file data blocks in the described cold and hot chained list according to described parameter value calculation; The cold and hot determine mechanism of described data also can be by reading the configuration file that comprises the cold and hot discriminant parameter of data in each file data blocks, HCFile.properties for example, to obtain the cold and hot discriminant parameter value of each file data blocks, go out the cold and hot value of described each file data blocks according to described parameter value calculation.
Wherein, described check code capable of choosing multiple mode, such as RS coding (Reed-Solomon, in coded), the LT code, Raptor code etc., but the RS of choice for use coding in the preferred embodiment of the present invention, although it need to set up long RS coding, its overall efficiency is better than other coded systems.Preferably, the RS encoding block calculates by model moral gate matrix.
Especially, the file backup piece of described cold data is set to one, increase simultaneously certain RS check code piece, wherein the quantity m of check code piece is less than the quantity n of file data blocks, therefore, the backup factor of described cold data is 1+m/n, less than the backup factor 3 of giving tacit consent among the HDFS, thereby can reduce the umber of cold data backup, and then reduce cold data carrying cost.
Referring to Fig. 2, show the process flow diagram of the method for reading data embodiment corresponding with data storage method among the above-mentioned HDFS, it comprises: step S21, obtain the cold and hot value of the file data blocks among the HDFS by the cold and hot determine mechanism of data, and the discrimination threshold of described cold and hot value and setting is compared; Step S22, if described cold and hot value is not more than described discrimination threshold, then described file data blocks is cold data; Step S23 merges n file data blocks in the described cold data and to read; Step S24 calculates m check code piece corresponding to a described n data block, and wherein, m, n are positive integer, and m<n; Step S25 if read failure, then chooses the individual file data blocks of m ' according to the check code principle and carries out code restoration; Wherein, m ' [m, n].
Wherein, described check code capable of choosing multiple mode, such as RS coding (Reed-Solomon, in coded), the LT code, Raptor code etc., but the RS of choice for use coding in the preferred embodiment of the present invention, although it need to set up long RS coding, its overall efficiency is better than other coded systems.Preferably, the RS encoding block calculates by model moral gate matrix.
Wherein, be m ' piece and m ' [m, n] owing to carrying out the file data blocks of RS code restoration, hence one can see that, and file data blocks allows at most to lose the m piece.
Read method by data among the HDFS of the present invention, can realize cold and hot data are distinguished, and cold data are distinguished read and recover, carry out data by cryptoprinciple especially RS cryptoprinciple and recover, can obviously reduce the quantity of losing of file data blocks.
Referring to Fig. 3, show the schematic diagram of the data-storage system embodiment corresponding with data storage method among the above-mentioned HDFS, it comprises: comparing unit 31, be used for obtaining by the cold and hot determine mechanism of data the cold and hot value of the file data blocks of HDFS, and the discrimination threshold of described cold and hot value and setting is compared; Judgement unit 32 is used for when described cold and hot value is not more than described discrimination threshold, described file data blocks is differentiated be cold data; Cold data division unit 33 is divided into the n piece with the file data blocks of described cold data, and calculates m verification code block corresponding to a described n data block, and wherein, m, n are positive integer, and m<n; Cold data storage cell 34 is used for a described n data block and m check code piece are stored.
Wherein, described check code capable of choosing multiple mode, such as RS coding (Reed-Solomon, in coded), the LT code, Raptor code etc., but the RS of choice for use coding in the preferred embodiment of the present invention, although it need to set up long RS coding, its overall efficiency is better than other coded systems.Preferably, the RS encoding block calculates by model moral gate matrix.
By data storage system among the HDFS of the present invention, can realize cold and hot data are distinguished, and cold data are distinguished storage, can obviously reduce the high problem of cold data carrying cost among the existing HDFS, especially for the above data storage of PB level, can greatly reduce cold data storage cost among the HDFS.
Wherein, according to system and user's request, but the comparison module that option and installment is different, preferably, described comparing unit 31 can comprise the detection sub-unit (not shown), cold and hot discriminant parameter value for each file data blocks that obtains by testing mechanism is set up the cold and hot chained list of data, and described cold and hot discriminant parameter value comprises data storage and reading frequency value, data storage and reads the time equivalence, goes out the cold and hot value of each file data blocks in the described cold and hot chained list according to described parameter value calculation; Described comparing unit also can comprise the parameter acquiring subelement, be used for reading the configuration file that comprises cold and hot discriminant parameter of each file data blocks, HCFile.properties for example, to obtain the cold and hot discriminant parameter value of each file data blocks, go out the cold and hot value of described each file data blocks according to described parameter value calculation.
Preferably, the file backup piece of described cold data is set to one, increase simultaneously certain RS check code piece, wherein the quantity m of check code piece is less than the quantity n of file data blocks, therefore, the backup factor of described cold data is 1+m/n, less than the backup factor 3 of giving tacit consent among the HDFS, thereby can reduce the umber of cold data backup, and then reduce cold data carrying cost.
Referring to Fig. 4, show with above-mentioned HDFS in data the schematic diagram of the corresponding data reading system embodiment of read method, comprise: comparing unit 41, be used for obtaining by the cold and hot determine mechanism of data the cold and hot value of the file data blocks of HDFS, and the discrimination threshold of described cold and hot value and setting is compared; Judgement unit 42 is used for when described cold and hot value is not more than described discrimination threshold, described file data blocks is differentiated be cold data; Cold data-reading unit 43 is used for n file data blocks of described cold data read; Check code computing unit 44 is used for calculating m check code piece corresponding to a described n data block, and wherein, m, n are positive integer, and m<n; Data recovery unit 45 is used and is read failure when cold data, then chooses the individual file data blocks of m ' according to the check code principle and carries out code restoration; Wherein, m ' [m, n].
Wherein, described check code capable of choosing multiple mode, such as RS coding (Reed-Solomon, in coded), the LT code, Raptor code etc., but the RS of choice for use coding in the preferred embodiment of the present invention, although it need to set up long RS coding, its overall efficiency is better than other coded systems.Preferably, the RS encoding block calculates by model moral gate matrix.
Wherein, be m ' piece and m ' [m, n] owing to carrying out the file data blocks of RS code restoration, hence one can see that, and file data blocks allows at most to lose the m piece.
Reading system by data among the HDFS of the present invention, can realize cold and hot data are distinguished, and cold data are distinguished read and recover, thereby can reduce the time that cold data read and recover, simultaneously, carry out data by cryptoprinciple especially RS cryptoprinciple and recover, can obviously reduce the quantity of losing of file data blocks.
Each embodiment in this instructions all adopts the mode of going forward one by one to describe, and what each embodiment stressed is and the difference of other embodiment that identical similar part is mutually referring to getting final product between each embodiment.For system embodiment because itself and embodiment of the method basic simlarity, so describe fairly simple, relevant part gets final product referring to the part explanation of embodiment of the method.
Above to data storage method and system among the HDFS provided by the present invention, reaching read method and system is described in detail, used specific case herein principle of the present invention and embodiment are set forth, the explanation of above embodiment just is used for helping to understand method of the present invention and core concept thereof; Simultaneously, for one of ordinary skill in the art, according to thought of the present invention, all will change in specific embodiments and applications, in sum, this description should not be construed as limitation of the present invention.

Claims (18)

1. data storage method among the HDFS is characterized in that, comprising:
Obtain the cold and hot value of the file data blocks among the HDFS by the cold and hot determine mechanism of data, and the discrimination threshold of described cold and hot value and setting is compared;
If described cold and hot value is not more than described discrimination threshold, then described file data blocks is cold data;
The file data blocks of described cold data is divided into the n piece, and calculates m check code piece corresponding to a described n data block, wherein, m, n are positive integer, and m<n;
A described n data block and m check code piece are stored.
2. the method for claim 1, it is characterized in that, described cold and hot determine mechanism comprises by the cold and hot discriminant parameter value of each file data blocks of testing mechanism acquisition sets up the cold and hot chained list of data, goes out the cold and hot value of each file data blocks in the described cold and hot chained list according to described parameter value calculation.
3. the method for claim 1, it is characterized in that, described cold and hot determine mechanism comprises by reading the configuration file that comprises the cold and hot discriminant parameter of data in each file data blocks, to obtain the cold and hot discriminant parameter value of each file data blocks, go out the cold and hot value of described each file data blocks according to described parameter value calculation.
4. the method for claim 1 is characterized in that, described check code piece is RS check code piece.
5. method as claimed in claim 4 is characterized in that, described RS check code piece calculates by model moral gate matrix.
6. such as each described method in the claim 1 to 5, it is characterized in that, the file backup piece of described cold data is one.
7. the read method of data among the HDFS is characterized in that, comprising:
Obtain the cold and hot value of the file data blocks among the HDFS by the cold and hot determine mechanism of data, and the discrimination threshold of described cold and hot value and setting is compared;
If described cold and hot value is not more than described discrimination threshold, then described file data blocks is cold data;
N file data blocks in the described cold data read;
Calculate m check code piece corresponding to a described n data block, wherein, m, n are positive integer, and m<n;
If read failure, then choose the individual file data blocks of m ' according to the check code principle and carry out code restoration; Wherein, m ' [m, n].
8. method as claimed in claim 7 is characterized in that, described check code piece is RS check code piece.
9. method as claimed in claim 8 is characterized in that, described RS check code piece calculates by model moral gate matrix.
10. data storage system among the HDFS is characterized in that, comprising:
Comparing unit is used for the cold and hot value by the file data blocks of the cold and hot determine mechanism acquisition of data HDFS, and the discrimination threshold of described cold and hot value and setting is compared;
Judgement unit is used for when described cold and hot value is not more than described discrimination threshold, described file data blocks is differentiated be cold data;
Cold data division unit is used for the file data blocks of described cold data is divided into the n piece, and calculates m check code piece corresponding to a described n data block, and wherein, m, n are positive integer, and m<n;
Cold data storage cell is used for a described n data block and m check code piece are stored.
11. system as claimed in claim 10, it is characterized in that, described comparing unit comprises detection sub-unit, cold and hot discriminant parameter value for each file data blocks that obtains by testing mechanism is set up the cold and hot chained list of data, goes out the cold and hot value of each file data blocks in the described cold and hot chained list according to described parameter value calculation.
12. system as claimed in claim 10, it is characterized in that, described comparing unit comprises the parameter acquiring subelement, be used for reading the configuration file that comprises cold and hot discriminant parameter of each file data blocks, to obtain the cold and hot discriminant parameter value of each file data blocks, go out the cold and hot value of described each file data blocks according to described parameter value calculation.
13. system as claimed in claim 10 is characterized in that, described check code piece is RS check code piece.
14. system as claimed in claim 13 is characterized in that, described RS check code piece calculates by model moral gate matrix.
15. such as each described system in the claim 10 to 14, it is characterized in that, the file backup piece of described cold data is one.
16. the reading system of data is characterized in that among the HDFS, comprises;
Comparing unit is used for obtaining by the cold and hot determine mechanism of data the cold and hot value of the file data blocks of HDFS, and the discrimination threshold of described cold and hot value and setting is compared;
Judgement unit is used for when described cold and hot value is not more than described discrimination threshold, described file data blocks is differentiated be cold data;
Cold data-reading unit is used for n file data blocks of described cold data read;
The check code computing unit is used for calculating m check code piece corresponding to a described n data block, and wherein, m, n are positive integer, and m<n;
Data recovery unit is used for reading failure when cold data, then chooses the individual file data blocks of m ' according to the check code principle and carries out code restoration; Wherein, m ' [m, n].
17. system as claimed in claim 16 is characterized in that, described check code piece is RS check code piece.
18. system as claimed in claim 17 is characterized in that, described RS check code piece calculates by model moral gate matrix.
CN2012105502202A 2012-12-18 2012-12-18 Method and device for storing and reading data in hadoop distributed file system (HDFS) Pending CN103064902A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2012105502202A CN103064902A (en) 2012-12-18 2012-12-18 Method and device for storing and reading data in hadoop distributed file system (HDFS)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2012105502202A CN103064902A (en) 2012-12-18 2012-12-18 Method and device for storing and reading data in hadoop distributed file system (HDFS)

Publications (1)

Publication Number Publication Date
CN103064902A true CN103064902A (en) 2013-04-24

Family

ID=48107532

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2012105502202A Pending CN103064902A (en) 2012-12-18 2012-12-18 Method and device for storing and reading data in hadoop distributed file system (HDFS)

Country Status (1)

Country Link
CN (1) CN103064902A (en)

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103259856A (en) * 2013-04-27 2013-08-21 华为技术有限公司 Method, device and system for data processing
CN104573119A (en) * 2015-02-05 2015-04-29 重庆大学 Energy-saving-oriented Hadoop distributed file system storage policy in cloud computing
CN104917788A (en) * 2014-03-11 2015-09-16 中国移动通信集团公司 Data storage method and apparatus
CN104935481A (en) * 2015-06-24 2015-09-23 华中科技大学 A Data Recovery Method Based on Redundancy Mechanism in Distributed Storage
CN105138422A (en) * 2015-08-10 2015-12-09 联想(北京)有限公司 Control method and electronic device
CN105956128A (en) * 2016-05-09 2016-09-21 南京大学 Self-adaptive encoding storage fault-tolerant method based on simple regenerating code
CN106528608A (en) * 2016-09-27 2017-03-22 中国电力科学研究院 Cold and hot storage method and system for power grid GIS (Geographic Information System) data in cloud architecture
CN106649406A (en) * 2015-11-04 2017-05-10 华为技术有限公司 Method and device for storing file in self-adaption mode
CN107153661A (en) * 2016-03-04 2017-09-12 中国移动通信集团山东有限公司 A kind of storage, read method and its device of the data based on HDFS systems
CN107665224A (en) * 2016-07-29 2018-02-06 北京京东尚科信息技术有限公司 Scan the mthods, systems and devices of HDFS cold datas
CN107870726A (en) * 2016-09-26 2018-04-03 北京优朋普乐科技有限公司 Date storage method and device, server based on redundancy scheme
CN111930555A (en) * 2020-09-02 2020-11-13 平安国际智慧城市科技股份有限公司 Erasure code based file processing method and device and computer equipment
CN111966293A (en) * 2020-08-18 2020-11-20 北京明略昭辉科技有限公司 Cold and hot data analysis method and system
CN112948171A (en) * 2019-12-11 2021-06-11 华为技术有限公司 Data processing method and device, terminal equipment and computer readable storage medium
CN113360553A (en) * 2020-03-03 2021-09-07 中国移动通信集团贵州有限公司 Data cold and hot degree evaluation method and server

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102411542A (en) * 2011-10-26 2012-04-11 上海文广互动电视有限公司 Dynamic hierarchical storage system and method
US20120182891A1 (en) * 2011-01-19 2012-07-19 Youngseok Lee Packet analysis system and method using hadoop based parallel computation

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120182891A1 (en) * 2011-01-19 2012-07-19 Youngseok Lee Packet analysis system and method using hadoop based parallel computation
CN102411542A (en) * 2011-10-26 2012-04-11 上海文广互动电视有限公司 Dynamic hierarchical storage system and method

Cited By (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103259856A (en) * 2013-04-27 2013-08-21 华为技术有限公司 Method, device and system for data processing
CN104917788A (en) * 2014-03-11 2015-09-16 中国移动通信集团公司 Data storage method and apparatus
CN104573119A (en) * 2015-02-05 2015-04-29 重庆大学 Energy-saving-oriented Hadoop distributed file system storage policy in cloud computing
CN104573119B (en) * 2015-02-05 2017-10-27 重庆大学 Towards the Hadoop distributed file system storage methods of energy-conservation in cloud computing
CN104935481A (en) * 2015-06-24 2015-09-23 华中科技大学 A Data Recovery Method Based on Redundancy Mechanism in Distributed Storage
CN104935481B (en) * 2015-06-24 2018-03-09 华中科技大学 Data reconstruction method based on redundancy scheme under a kind of distributed storage
CN105138422B (en) * 2015-08-10 2018-09-21 北京联想核芯科技有限公司 Control method and electronic equipment
CN105138422A (en) * 2015-08-10 2015-12-09 联想(北京)有限公司 Control method and electronic device
US11182080B2 (en) 2015-11-04 2021-11-23 Huawei Technologies Co., Ltd. Adaptive file storage method and apparatus
CN106649406A (en) * 2015-11-04 2017-05-10 华为技术有限公司 Method and device for storing file in self-adaption mode
CN106649406B (en) * 2015-11-04 2020-04-28 华为技术有限公司 A method and device for adaptively storing files
CN107153661A (en) * 2016-03-04 2017-09-12 中国移动通信集团山东有限公司 A kind of storage, read method and its device of the data based on HDFS systems
CN105956128A (en) * 2016-05-09 2016-09-21 南京大学 Self-adaptive encoding storage fault-tolerant method based on simple regenerating code
CN105956128B (en) * 2016-05-09 2019-09-17 南京大学 A kind of adaptive coding storage fault-tolerance approach based on simple regeneration code
CN107665224A (en) * 2016-07-29 2018-02-06 北京京东尚科信息技术有限公司 Scan the mthods, systems and devices of HDFS cold datas
CN107665224B (en) * 2016-07-29 2021-04-30 北京京东尚科信息技术有限公司 Method, system and device for scanning HDFS cold data
CN107870726A (en) * 2016-09-26 2018-04-03 北京优朋普乐科技有限公司 Date storage method and device, server based on redundancy scheme
CN106528608A (en) * 2016-09-27 2017-03-22 中国电力科学研究院 Cold and hot storage method and system for power grid GIS (Geographic Information System) data in cloud architecture
CN106528608B (en) * 2016-09-27 2022-07-01 中国电力科学研究院 Power grid GIS data cold and hot storage method and system under cloud architecture
CN112948171A (en) * 2019-12-11 2021-06-11 华为技术有限公司 Data processing method and device, terminal equipment and computer readable storage medium
CN113360553A (en) * 2020-03-03 2021-09-07 中国移动通信集团贵州有限公司 Data cold and hot degree evaluation method and server
CN111966293A (en) * 2020-08-18 2020-11-20 北京明略昭辉科技有限公司 Cold and hot data analysis method and system
CN111930555A (en) * 2020-09-02 2020-11-13 平安国际智慧城市科技股份有限公司 Erasure code based file processing method and device and computer equipment

Similar Documents

Publication Publication Date Title
CN103064902A (en) Method and device for storing and reading data in hadoop distributed file system (HDFS)
CN102279776B (en) Error checking and correcting ability testing method and device
KR101580454B1 (en) Selecting between non-volatile memory units having different minimum addressable data unit sizes
CN109542342A (en) Metadata management and data reconstruction method, equipment and storage medium
CN102841931A (en) Storage method and storage device of distributive-type file system
US8762771B2 (en) Method for completing write operations to a RAID drive pool with an abnormally slow drive in a timely fashion
CN105138290A (en) High-performance storage pool organization method and device
CN106033320A (en) A high-performance redundant array of inexpensive disc (RAID) mechanism processing method and device
CN101937721A (en) Method for testing memory device
CN103019893A (en) Multi-disk fault-tolerant two-dimensional hybrid disk RAID4 system architecture and read-write method thereof
CN103700396A (en) SRAM (static random access memory)-oriented anti-SEU (single-event upset) error accumulation controller and method
US8037332B2 (en) Quad-state power-saving virtual storage controller
CN104699577A (en) Counter to locate faulty die in a distributed codeword storage system
CN102981783A (en) Cache accelerating method based on Nand Flash
CN103729149A (en) Data storage method
CN103577115A (en) Data configuration processing method and data configuration processing device
CN102591746B (en) Data reconstruction method and storage equipment
CN105094700B (en) The data temperature of bedding storage calculates method and apparatus in cloud storage system
CN102270102B (en) Method for optimizing writing performance of RAID6 (Redundant Array of Independent Disks) disk array
CN109375868B (en) Data storage method, scheduling device, system, equipment and storage medium
Rosenthal The medium-term prospects for long-term storage systems
CN102033717B (en) Data storage method and system based on RAID (Redundant Array of Inexpensive Disk)
US20160292035A1 (en) Indicating rebuild state of storage devices
CN104375906B (en) A kind of extensive Backup Data fast calibration method based on file system
CN103399783A (en) Storage method and device of mirror image documents of virtual machines

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20130424

RJ01 Rejection of invention patent application after publication