[go: up one dir, main page]

CN119311699B - A very cold object magnetic tape storage system, access and management method - Google Patents

A very cold object magnetic tape storage system, access and management method Download PDF

Info

Publication number
CN119311699B
CN119311699B CN202411864081.XA CN202411864081A CN119311699B CN 119311699 B CN119311699 B CN 119311699B CN 202411864081 A CN202411864081 A CN 202411864081A CN 119311699 B CN119311699 B CN 119311699B
Authority
CN
China
Prior art keywords
extremely cold
storage device
data
metadata
cold
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202411864081.XA
Other languages
Chinese (zh)
Other versions
CN119311699A (en
Inventor
文刘飞
王岷兰
陈坚
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Shanyan Intelligent Technology Co ltd
Shenzhen Sandstone Data Technology Co ltd
Original Assignee
Shenzhen Shanyan Intelligent Technology Co ltd
Shenzhen Sandstone Data Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Shanyan Intelligent Technology Co ltd, Shenzhen Sandstone Data Technology Co ltd filed Critical Shenzhen Shanyan Intelligent Technology Co ltd
Priority to CN202411864081.XA priority Critical patent/CN119311699B/en
Publication of CN119311699A publication Critical patent/CN119311699A/en
Application granted granted Critical
Publication of CN119311699B publication Critical patent/CN119311699B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2228Indexing structures
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/214Database migration support
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2282Tablespace storage structures; Management thereof
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/289Object oriented databases
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/907Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Library & Information Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

极冷对象磁带存储系统、访问和管理方法中,极冷对象索引层包括存储设备,存储设备用于保存极冷对象索引;冷对象数据层包括存储设备C1,用于保存极冷对象数据;极冷对象元数据层包括存储设备D1,用于保存极冷对象元数据;极冷对象索引包括极冷对象的名称或编号与极冷对象元数据保存的存储设备D1位置信息;极冷对象元数据中,包括极冷对象名称或编号与极冷对象数据的存储设备C1的位置信息;存储设备C1为磁带存储设备,存储设备D1为磁带存储设备;在极冷对象索引中根据名称或编号找到极冷对象元数据保存的存储设备D1位置信息;获得极冷对象元数据后,获得极冷对象数据的存储设备D1位置信息;再到存储设备D1获得极冷对象数据。

In the extremely cold object tape storage system, access and management method, the extremely cold object index layer includes a storage device, and the storage device is used to store the extremely cold object index; the cold object data layer includes a storage device C1, which is used to store the extremely cold object data; the extremely cold object metadata layer includes a storage device D1, which is used to store the extremely cold object metadata; the extremely cold object index includes the name or number of the extremely cold object and the location information of the storage device D1 storing the extremely cold object metadata; the extremely cold object metadata includes the name or number of the extremely cold object and the location information of the storage device C1 storing the extremely cold object data; the storage device C1 is a tape storage device, and the storage device D1 is a tape storage device; the location information of the storage device D1 storing the extremely cold object metadata is found in the extremely cold object index according to the name or number; after obtaining the extremely cold object metadata, the location information of the storage device D1 storing the extremely cold object data is obtained; and then the extremely cold object data is obtained from the storage device D1.

Description

Extremely cold object tape storage system, access and management method
Technical Field
The application belongs to the technical field of big data, and particularly relates to an extremely cold object tape storage system, an access and management method.
Background
With the development of internet technology, in the rapid evolution of large-scale data storage technology, with the development of electronic commerce business, electronic payment and online file editing and storage of network, each user stores a large amount of personal sensitive data in the network.
Currently, every individual enjoying internet service is generating a large amount of data every day, which data changes over time, with a gradual decrease in the probability of being revisited. In software engineering, data management of different access frequencies is called thermal management. The low-heat data is stored in an inexpensive storage medium, and the high-heat data is stored in a storage medium having a high access speed.
Chinese patent, CN202410373669.9, application date 2024.03.29, application name "a method and device for recovering data in a distributed object storage system", proposes that "recovering backup data from a tape library to a distributed file system by using an object recovering component to obtain the whole content of a recovered object", scanning the whole content of the recovered object, analyzing the whole content of the recovered object to obtain metadata of each object, writing the metadata into a distributed database, and obtaining a corresponding object from the distributed file system according to the metadata of the corresponding object when receiving an object obtaining request initiated by a client.
The Chinese patent CN202110317844.9, application date 2021.03.25, application name "a layered data storage system and method, a backup management server", a backup management server, an application server, a distributed object storage cluster and a tape library are provided, wherein the backup management server is used for generating and sending a first backup request to the application server according to the residual storage capacity and backup strategy of the distributed object storage cluster, generating and sending a second backup request to the distributed object storage cluster according to the residual storage capacity and the retention period of backup data of the tape library, the application server is connected with the distributed object storage cluster and used for storing corresponding data in the application server in the distributed object storage cluster according to the first backup request, and the distributed object storage cluster is connected with the tape library and used for storing corresponding data in the distributed object storage cluster in the tape library according to the second backup request. The method and the device can improve the data backup efficiency and success rate and reduce the operation and maintenance cost. ".
The data protection system and method based on the tape library are provided, wherein the data protection system comprises a data backup server and the tape library, the data backup server comprises a backup data storage module for storing data backed up from a production server and corresponding time point data sets, a data archiving module for establishing a data archiving task, archiving the data stored in the backup data storage module into the tape library by taking the time point data sets as units according to the data archiving task, and recording the position of each data object in the tape library, and a position database for storing the position of the data object in the tape library. Compared with the prior art, the invention not only can effectively protect the data, but also can reduce the cost by utilizing the advantages of the magnetic tape, and can conveniently manage the data in the magnetic tape.
In the above patents, the tape library is used as a general backup database, and the data of the tape library is not generally enabled in the case that the main data is not lost.
In fig. 8, when the tape library is used as a backup database, the object library is partially or wholly stored, and the object metadata and the object data are stored together.
In fig. 9, when the tape library is used as a backup database, the object library is partially or wholly stored, the object metadata is separated from the object data and is stored in different tape spaces, and the object metadata is not responsible for indexing the object data.
In the prior art, the tape device is used as the integral backup of the object database, the tape device is used in the mode, the tape storage device cannot be used for real object-oriented management, and when the data is read, the data in the tape device needs to be integrally restored to the database.
With the development of business, granularity of cold data is smaller and smaller, conversion of cold data and hot data is randomized gradually, such as transaction records of shopping websites, cold data before a few years, extremely cold data are possibly required to be converted into warm data because a certain user inquires about own consumption records. By adopting a cold data system or a tape backup system which is stored in batches, under the impact of the service, the tape backup data needs to be frequently moved to a warm data area, so that the performance of the system is greatly reduced, and the power consumption of the system is rapidly increased.
Abbreviation and key term definitions
MB, english shorthand, a storage unit in a computer, is called MByte in full. MB (all-MByte) a unit of storage in a computer, known as a "megabit".
Object storage, also known as object-based storage, is a generic term used to describe a method of resolving and processing discrete units, which are referred to as objects.
‌ Object Metadata (Metadata), which is a set of name-value pairs, is information describing the properties of an object. In object storage, object metadata includes creation time, modification time, storage type, etc. of the object, which can help manage and retrieve the object.
‌ Object Data (Data), in which managed objects are stored.
SSD storage device the present application refers to a device that uses SSD as a storage medium, such as SSD (Solid State disk STATE DRIVE), which is a storage device that uses flash memory storage technology, unlike traditional mechanical hard disks, where SSD has no mechanical moving parts.
Magnetic disk (disk) refers to a memory that stores data using magnetic recording technology. The magnetic disk is a main storage medium of a computer, can store a large amount of binary data, and can keep the data from losing after power is cut off.
Magnetic tape apparatus magnetic tape is a magnetically laminated tape material for recording sound, images, digital or other signals, and is the most versatile magnetic recording material with the greatest yield. Typically by coating a plastic film substrate (support) with a layer of granular magnetic material or by vapor deposition of a layer of magnetic oxide or alloy film.
Disclosure of Invention
According to the method and the device, through the index relation of the extremely cold object metadata recorded to the extremely cold object data, the two-layer index relation of the extremely cold object index of extremely cold object metadata of extremely cold object data can be realized, and the tape equipment can support object-oriented storage and access. By means of the two-layer index relationship, the tape storage device can be subjected to object-oriented storage addressing management, tape data do not need to be read in batches, and the access efficiency of the device is greatly improved
The extremely cold object tape storage system comprises an extremely cold object index layer, a cold object data layer and an extremely cold object metadata layer, wherein the extremely cold object index layer comprises storage devices used for storing extremely cold object indexes, the cold object data layer comprises storage devices C1 used for storing extremely cold object data, the extremely cold object metadata layer comprises storage devices D1 used for storing extremely cold object metadata, the extremely cold object indexes comprise names or numbers of extremely cold objects and storage device D1 position information stored by the extremely cold object metadata, the extremely cold object metadata comprises the names or numbers of the extremely cold objects and the position information of the storage devices C1 of the extremely cold object data, the storage devices C1 are tape storage devices, the storage devices D1 are tape storage devices, the position information of the storage devices D1 stored by the extremely cold object metadata is found in the extremely cold object indexes according to the names or the numbers, the position information of the storage devices D1 stored by the extremely cold object metadata is obtained after the extremely cold object metadata is obtained, and the extremely cold object data is obtained by the storage device D1.
The extremely cold object index layers may include extremely cold level 1 object index layers, extremely cold level 2 object index layers, and extremely cold level 3 object index layers.
The extremely cold level 1 object index layer may include an SSD storage device for holding extremely cold level 1 object indexes.
The extremely cold level 2 object index layer may include a hard disk storage device for holding extremely cold level 2 object indexes.
The extremely cold 3 level object index layer may include a tape storage device for holding extremely cold 3 level object indexes.
The extremely cold object index may be stored in a database.
The extremely cold object index may be stored by a key-value storage system.
The temperature object data layer comprises a storage device B1, wherein the storage device B1 is used for storing temperature object metadata, the temperature object data layer comprises a storage device B2, the storage device B2 is used for storing temperature object data, the temperature object metadata comprises temperature object names or numbers and position information of the storage device B2 of the temperature object data, the storage device B1 is an SSD storage device, the storage device B2 is a magnetic tape device or a magnetic disk device, the object metadata is acquired firstly, whether the data are on a magnetic disk or a magnetic tape is confirmed based on the content of the object metadata, the temperature object data are stored by an object storage system, and the temperature object metadata are stored by a key-value distributed storage system.
The cold object metadata storage system comprises a cold object data layer, wherein the warm object data layer comprises a storage device B3, the storage device B3 is used for storing cold object metadata, the cold object data layer comprises a storage device C3, the storage device C3 is used for storing cold object data, the cold object metadata comprises a cold object name or a number and position information of the storage device C3 of the cold object data, the storage device B3 is an SSD storage device or a hard disk device, the storage device C3 is a tape storage device, and the cold object metadata is stored by a key-value distributed storage system.
A storage access method of extremely cold objects comprises the steps of searching in an extremely cold object index library, judging whether metadata numbers are inquired or not, returning to the end, judging whether the metadata numbers are found in the extremely cold object index library, and obtaining extremely cold object metadata in extremely cold object metadata storage equipment according to the metadata numbers, wherein the extremely cold data are obtained in the extremely cold data storage equipment according to the extremely cold object metadata, the extremely cold object indexes are stored in SSD, magnetic discs or magnetic tape storage equipment, the storage equipment of the extremely cold object metadata library is magnetic tape equipment, and the storage equipment of the extremely cold data is magnetic tape equipment.
Step A10 is also included before step C10, wherein the step A10 is to search in a temperature object metadata base; A20 refers to whether to query the warm object metadata, A30 refers to obtaining the warm object data according to the warm object metadata and returning, B10 refers to whether to query the cold object metadata in a cold object metadata base, B20 refers to whether to query the cold object metadata, B30 refers to obtaining the cold object data according to the cold object metadata and returning, the storage device of the cold object metadata base is SSD storage device or disk device, the storage device of the cold object data is tape device or disk device, the storage device of the warm object metadata base is SSD storage device, and the storage device of the warm object data is SSD storage device.
It may be that in step C, if the extremely cold object metadata is moved to the cold object metadata base after the extremely cold object metadata is obtained, and in step D20, if the cold object metadata is moved to the SSD storage device after the cold object metadata is obtained.
A storage management method for extremely cold objects comprises the steps of periodically checking an object metadata base, obtaining temperature attributes and latest access time from the object metadata, if the temperature attributes are cold data, if the latest access time is greater than a set value temperature value TC, changing the temperature attributes of the data to be extremely cold data, migrating the object metadata to tape storage equipment, recording index positions of the object metadata in an extremely cold object index base, and deleting the object metadata in the object metadata base.
Before migration of the object metadata, counting the quantity of extremely cold metadata to be migrated, and if the quantity is larger than a migration threshold value, starting migration, wherein the threshold value is the minimum value of single-write data of the tape equipment.
May further include, if the temperature attribute is temperature data, if the latest access time is greater than a set temperature value TH; the method comprises the steps of determining that temperature attributes of object metadata change data are cold data, migrating the object data pointed by the object metadata into a tape storage device, recording tape position information after migration of the object metadata, deleting original object data, releasing storage positions, counting the number of the object data to be migrated before migration of the object data, and starting migration if the number is larger than a migration threshold value, wherein the threshold value is a minimum value of single write data of the tape device.
The technical effect of the technical scheme is that the two-layer index relationship of extremely cold object index, extremely cold object metadata and extremely cold object data can be realized through the index relationship from extremely cold object metadata to extremely cold object data, and the tape equipment can support object-oriented storage and access.
The technical scheme has the technical effects that the tape storage equipment can be subjected to object-oriented storage addressing management through the two-layer index relationship, tape data do not need to be read in batches, and the equipment access efficiency is greatly improved.
One of the technical effects of the technical scheme is that the extremely cold object tape storage system can be independently established and independently managed independently of the existing object storage system.
The technical scheme has the technical effects that the extremely cold object index layer comprises multiple stages, and can realize heat management in a long-span time period.
The technical scheme has the technical effects that the extremely cold object index layer comprises multiple stages, different storage media are indexed by extremely cold objects of different stages, and the storage cost of data with large span is greatly reduced.
The technical scheme has the technical effects that the extremely cold object index of extremely cold level 1 is stored in the SSD storage device, so that the searching speed can be improved.
One of the technical effects of the technical scheme is that the extremely cold object index of extremely cold level 2 is stored in the hard disk storage device, and the hard disk has higher access speed.
One of the technical effects of the technical scheme is that the extremely cold object index of extremely cold level 2 is stored in the tape storage device, so that massive long-span extremely cold data can be stored, and the storage cost is greatly reduced.
The technical scheme has the technical effects that mass long-span extremely-cold data can be stored, and single-object data addressing based on tape equipment can be achieved for extremely-low-frequency data access, so that the cold data access efficiency is high.
The technical scheme has the technical effects that the hierarchical storage of the extremely cold data can effectively control the data waiting tolerance of a user, and the user has waiting tolerance for extremely cold object data.
One of the technical effects of the technical scheme is that a warm object and a cold object are also built into a whole, data are gradually cooled into extremely cold object data and extremely cold object metadata, and data moving cost is low.
The technical scheme has the technical effects that the cold object is converted into the extremely cold object, metadata of the extremely cold object only need to be moved, the extremely cold object data does not need to be moved, and the system moving cost is low.
Drawings
FIG. 1 is a schematic block diagram of an extremely cold object tape storage system;
FIG. 2 is an extremely cold object tape storage system including a warm object data layer;
FIG. 3 is an extremely cold object tape storage system including a cold object data layer;
FIG. 4 is a schematic diagram of an extremely cold object storage access method;
FIG. 5 is a schematic diagram of a warm object store access method;
FIG. 6 is a schematic diagram of an extremely cold object storage management method;
FIG. 7 is a schematic diagram of a warm object storage management method;
FIG. 8 is a schematic diagram of a tape-based object store backup method;
FIG. 9 is a schematic diagram of a tape-based object storage backup method.
Detailed Description
The present application is described in further detail below with reference to the accompanying drawings. The following description of the preferred embodiments of the present application is not intended to limit the present application. The description of the preferred embodiments of the present application is merely illustrative of the general principles of the application. The numbers "first", "second" and "a" and "B" in the present application are for convenience of description only, and do not represent a time or space sequence relationship, and the letter and number combination "TA", "TB" and "H" in the present application are for convenience of description only, and the meaning is determined by the word in which they are referred to.
Referring to fig. 1, the extremely cold object tape storage system comprises an extremely cold object index layer, a cold object data layer and an extremely cold object metadata layer, wherein the extremely cold object index layer comprises storage devices used for storing extremely cold object indexes, the cold object data layer comprises storage devices C1 used for storing extremely cold object data, the extremely cold object metadata layer comprises storage devices D1 used for storing extremely cold object metadata, the extremely cold object indexes comprise names or numbers of extremely cold objects and storage device D1 position information stored by the extremely cold object metadata, the extremely cold object metadata comprises the names or numbers of the extremely cold objects and the position information of the storage devices C1 of the extremely cold object data, the storage devices C1 are tape storage devices, the storage devices D1 are tape storage devices, the position information of the storage devices D1 stored by the extremely cold object metadata is found according to the names or the numbers in the extremely cold object indexes, the position information of the storage devices D1 stored by the extremely cold object metadata is obtained after the extremely cold object metadata is obtained, and the extremely cold object data is obtained by the storage device D1 position information obtained by the extremely cold object metadata.
The storage device C1 and the storage device D1 are tape storage devices, which may be physically isolated devices, or may be the same physical device, and an extremely cold object index layer is adopted to establish an index relationship between an object name and extremely cold object metadata, and although the extremely cold object metadata is stored in the tape device, index searching may be performed on a single object through the extremely cold object index.
By storing extremely cold object metadata and extremely cold object data in the tape equipment, establishing an index relation of the extremely cold object metadata through extremely cold object indexes, and recording the index relation of the extremely cold object metadata to the extremely cold object data, the two-layer index relation of the extremely cold object indexes and the extremely cold object metadata can be realized, and the tape equipment can support object oriented storage and access.
The tape equipment is unidirectional rotation, the access speed is slow in the reading process, the traditional tape data is used for addressing and managing single objects, the time and the labor are wasted, the data volume stored by extremely cold objects is large, and the access frequency is low, but the real-time requirement of users on the data is not too high, for example, the users inquire about the data purchased once a few years ago, the users have the waiting tolerance, and the extremely cold data is stored by the tape equipment, so that the equipment cost pressure of the system can be greatly reduced. By adopting the secondary index, the cache cost of the data can be reduced, and the extremely cold object metadata can be distinguished from the extremely cold object data without completely reading the data of one disk.
The extremely cold object tape storage system can be built independently and managed independently of existing object storage systems.
The extremely cold object index layers may include extremely cold level 1 object index layers, extremely cold level 2 object index layers, and extremely cold level 3 object index layers.
Although the storage of extremely cold object indexes in SSD storage devices or disk devices is not expensive, for large systems such as nationwide shopping platforms, storage of shopping record data for 10 years or 20 years is very huge data even if the object data are indexed. By classifying extremely cold data, such as by far and near from the current year, the extremely cold data is classified into 1 level corresponding to data within 10 years from the current year, 2 level corresponding to data from 10 years to 20 years from the current year, and 3 level corresponding to data above 20 years from the current year.
The extremely cold level 1 object index layer may include an SSD storage device for holding extremely cold level 1 object indexes.
And the extremely cold object index of extremely cold level 1 is stored in the SSD storage device, so that the searching speed can be improved.
The extremely cold level 2 object index layer may include a hard disk storage device for holding extremely cold level 2 object indexes.
And the extremely cold object index of extremely cold level 2 is stored in the hard disk storage device, and the hard disk has a relatively high access speed.
The extremely cold 3 level object index layer may include a tape storage device for holding extremely cold 3 level object indexes.
For extremely cold object indexes of extremely cold level 2, stored on tape storage devices, in extreme cases, 3 separate tape storage devices need to be accessed separately, the access speed of the seek is relatively slowest, but the probability of occurrence of such speed is small, and such time overhead can be borne for individual users.
The extremely cold object index may be stored in a database.
The extremely cold object index is stored by a database, and for the extremely cold 2-level extremely cold object index, the data of the whole data sub-database of the tape storage device needs to be read into a cache disk, such as a magnetic disk, to be accessed. If the data is not stored in a database, but is stored in a sequence table, algorithm searching access, such as dichotomy, can be performed based on the relative position of the magnetic tape, so that the position of the data storage is gradually found.
The extremely cold object index may be stored by a key-value storage system.
For large data platforms, the key-value storage system may be used to store extremely cold object indices. The elements stored in the extremely cold object index are simple, and only the names of the objects, the tape equipment numbers and the positions can be stored, so that massive data can be stored by using a large key-value storage system.
As shown in FIG. 2, the temperature object data layer comprises a storage device B1, wherein the storage device B1 is used for storing temperature object metadata, the temperature object data layer comprises a storage device B2, the storage device B2 is used for storing temperature object data, the temperature object metadata comprises a temperature object name or a number and position information of the storage device B2 of the temperature object data, the storage device B1 is an SSD storage device, the storage device B2 is a magnetic tape device or a magnetic disk device, the object metadata is acquired firstly, whether the data are on a magnetic disk or a magnetic tape is confirmed based on the content of the object metadata, the temperature object data are stored by an object storage system, and the temperature object metadata are stored by a key-value distributed storage system.
The extremely cold object tape storage system may be part of a system that also builds a body of warm and cold objects, with data being gradually cooled into extremely cold object data and extremely cold object metadata.
For the online document data platform, after a file is generated, the file object data, the object metadata and the access frequency of the object data are repeatedly read and modified in a short period, a temperature object data layer can be set, the temperature object data layer stores the temperature object metadata and the temperature object data in an SSD storage device, and the temperature object metadata stores information such as the index position of the temperature object data, the establishment time of the temperature object data, the data property and the like.
As shown in FIG. 3, the cold object data layer comprises a cold object data layer, wherein the warm object data layer comprises a storage device B3, the storage device B3 is used for storing cold object metadata, the cold object data layer comprises a storage device C3, the storage device C3 is used for storing cold object data, the cold object metadata comprises a cold object name or a number and position information of the storage device C3 of the cold object data, the storage device B3 is an SSD storage device or a hard disk device, the storage device C3 is a magnetic tape storage device, and the cold object metadata is stored by a key-value distributed storage system.
For an online document data platform or a short video platform, after a file is generated for a period of time, the frequency of modification is gradually reduced, the frequency of access and reading is gradually reduced, the number of the files is huge, the timely reading of metadata information of the data needs to be reserved, and for object data, the faster reading speed is also needed, the data can be defined as cold data, and the cold data has the characteristic of being degraded by warm data, so that the object metadata of the data is kept still, the object data is moved to a cold data level, in the object metadata, the attribute of the data is modified into a cold object, and the movement of the cold object metadata is reduced.
The cold object metadata is formed by changing warm object metadata, when the access frequency of one data is reduced, in the object metadata, the attribute of the data is modified into an extremely cold object, the extremely cold object metadata is required to be moved to a tape device, an extremely cold object index is built in an extremely cold object index layer, and then the object metadata in the warm object data layer is deleted.
Referring to FIG. 4, the extremely cold object storage access method comprises the steps of searching in an extremely cold object index library, determining whether metadata numbers are inquired, determining whether metadata numbers are found, returning to the end, determining step D10, obtaining extremely cold object metadata in extremely cold object metadata storage equipment according to the metadata numbers, determining step D20, obtaining extremely cold data in the extremely cold data storage equipment according to the extremely cold object metadata, storing the extremely cold object indexes in SSD, a magnetic disk or a magnetic tape storage equipment, wherein the storage equipment of the extremely cold object metadata library is the magnetic tape equipment, and the storage equipment of the extremely cold data is the magnetic tape equipment.
The extremely cold object tape storage system can be independently established for independent management, and when the extremely cold object tape storage system reads data, the object-oriented storage and access of the tape equipment are supported by the two-layer index relationship of extremely cold object index, extremely cold object metadata and extremely cold object data.
The extremely cold object index is stored in the SSD, disk or tape storage device, with the speed of access being the fastest and the slowest. For a system with SSD, disk and tape, the extremely cold object indexes need to be accessed sequentially, and only if the extremely cold object indexes are not found in the tape storage device, the fact that the object is not stored in the system can be judged.
As shown in FIG. 5, before the step C10, the method further comprises the steps of searching in a warm object metadata base, the step A10 of judging whether the warm object metadata is inquired, the step A20 of judging whether the warm object metadata is inquired, the step A30 of judging whether the warm object metadata is obtained according to the warm object metadata, returning, the step B10 of judging whether the cold object metadata is inquired, the step B20 of judging whether the cold object metadata is inquired, the step B30 of judging whether the cold object metadata is obtained according to the cold object metadata, returning, wherein the storage device of the cold object metadata is SSD storage device or disk device, the storage device of the cold object data is tape device or disk device, the storage device of the warm object metadata base is SSD storage device, and the storage device of the warm object data is SSD storage device.
It may be that in step C, if the extremely cold object metadata is moved to the cold object metadata base after the extremely cold object metadata is obtained, and in step D20, if the cold object metadata is moved to the SSD storage device after the cold object metadata is obtained.
For a system with warm or cold objects, it is necessary to find access data in the order of warm, cold, and very cold objects, as the warm, cold, and very cold objects of the data are changing.
Referring to fig. 6, an extremely cold object storage management method includes periodically checking an object metadata base, acquiring a temperature attribute and a latest access time from the object metadata, if the temperature attribute is cold data and the latest access time is greater than a set temperature value TC, changing the temperature attribute of data to extremely cold data from the object metadata, migrating the object metadata to a tape storage device, recording an index position of the object metadata in an extremely cold object index base, and deleting the object metadata in the object metadata base.
The object metadata is periodically queried, for example, according to the characteristics of the service, for example, according to 1 day unit or 1 month unit, the data in the large object data system is detected, the latest accessed time record is detected, the heat of the data is judged, and the object data is moved first and then the object metadata is moved.
Before migration of the object metadata, counting the quantity of extremely cold metadata to be migrated, and if the quantity is larger than a migration threshold value, starting migration, wherein the threshold value is the minimum value of single-write data of the tape equipment.
The tape equipment, single motion, preferably adopts batch writing mode, can promote the utilization efficiency of equipment by a wide margin. When the data to be moved is queried, the label to be moved can be recorded in the object metadata, and the data can be moved once when the size of the movement is reached.
As shown in FIG. 7, the method further comprises the steps of if the temperature attribute is temperature data and the latest access time is greater than a set value temperature value TH, changing the temperature attribute of data to be cold data in the object metadata, migrating the object data pointed by the object metadata to a tape storage device, recording the migrated tape position information of the object metadata, deleting the original object data, releasing a storage position, counting the number of the object data to be migrated before the object data migration, and starting the migration if the number is greater than a migration threshold value, wherein the threshold value is the minimum value of the single write data of the tape device.
While the invention has been illustrated and described in terms of a preferred embodiment and several alternatives, the invention is not limited by the specific description in this specification. Other alternative or equivalent components may also be used in the practice of the present invention.

Claims (10)

1.一种极冷对象磁带存储系统,其特征在于:1. A very cold object magnetic tape storage system, characterized in that: 包括极冷对象索引层、冷对象数据层、极冷对象元数据层;It includes the extremely cold object index layer, the cold object data layer, and the extremely cold object metadata layer; 极冷对象索引层包括存储设备,存储设备用于保存极冷对象索引;The extremely cold object index layer includes a storage device, and the storage device is used to store the extremely cold object index; 冷对象数据层包括存储设备C1,存储设备C1用于保存极冷对象数据;The cold object data layer includes a storage device C1, which is used to store extremely cold object data; 极冷对象元数据层包括存储设备D1,存储设备D1用于保存极冷对象元数据;The extremely cold object metadata layer includes a storage device D1, and the storage device D1 is used to store extremely cold object metadata; 极冷对象索引包括极冷对象的名称或编号与极冷对象元数据保存的存储设备D1位置信息;The extremely cold object index includes the name or number of the extremely cold object and the location information of the storage device D1 where the extremely cold object metadata is stored; 极冷对象元数据中,包括极冷对象名称或编号与极冷对象数据的存储设备C1的位置信息;The extremely cold object metadata includes the extremely cold object name or number and the location information of the storage device C1 of the extremely cold object data; 所述存储设备C1为磁带存储设备,所述存储设备D1为磁带存储设备;The storage device C1 is a magnetic tape storage device, and the storage device D1 is a magnetic tape storage device; 在极冷对象索引中根据名称或编号找到极冷对象元数据保存的存储设备D1位置信息;获得极冷对象元数据后,获得极冷对象数据的存储设备D1位置信息;再到存储设备D1获得极冷对象数据;Find the location information of the storage device D1 where the metadata of the extremely cold object is stored according to the name or number in the extremely cold object index; after obtaining the metadata of the extremely cold object, obtain the location information of the storage device D1 of the extremely cold object data; and then go to the storage device D1 to obtain the extremely cold object data; 所述极冷对象索引用数据库保存;The extremely cold object index is stored in a database; 或所述极冷对象索引用key-value存储系统保存;Or the extremely cold object index is stored in a key-value storage system; 在极冷对象索引库中,记录所述对象元数据的索引位置,In the extremely cold object index library, the index location of the object metadata is recorded. 在极冷对象索引库中查找;查询到元数据编号;Search in the extremely cold object index library; find the metadata number; 根据所述元数据编号,在极冷对象元数据存储设备D1中获得极冷对象元数据;According to the metadata number, obtain the extremely cold object metadata in the extremely cold object metadata storage device D1; 根据所述极冷对象元数据,在极冷数据存储设备中,获得极冷数据。According to the extremely cold object metadata, extremely cold data is obtained in an extremely cold data storage device. 2.根据权利要求1所述的极冷对象磁带存储系统,其特征在于,包括2. The extremely cold object magnetic tape storage system according to claim 1, characterized in that it comprises 特征T10:按照距离当前年份的远近,极冷对象索引层包括极冷1级对象索引层;极冷2级对象索引层;极冷3级对象索引层;Feature T10: According to the distance from the current year, the extremely cold object index layer includes an extremely cold level 1 object index layer; an extremely cold level 2 object index layer; and an extremely cold level 3 object index layer; 包括如下特征的任意一项或多项:Includes any one or more of the following characteristics: 特征T20:极冷1级对象索引层包括SSD存储设备,SSD存储设备用于保存极冷1级对象索引;Feature T20: The ultra-cold level 1 object index layer includes an SSD storage device, and the SSD storage device is used to store the ultra-cold level 1 object index; 特征T30:极冷2级对象索引层包括硬盘存储设备,硬盘存储设备用于保存极冷2级对象索引;Feature T30: The extremely cold level 2 object index layer includes a hard disk storage device, and the hard disk storage device is used to store the extremely cold level 2 object index; 特征T40:极冷3级对象索引层包括磁带存储设备,磁带存储设备用于保存极冷3级对象索引。Feature T40: The deep-cold level 3 object index layer includes a tape storage device, and the tape storage device is used to store the deep-cold level 3 object index. 3.根据权利要求1所述的极冷对象磁带存储系统,其特征在于,3. The extremely cold object magnetic tape storage system according to claim 1, characterized in that: 包括温对象数据层;Includes warm object data layer; 温对象数据层包括存储设备B1,存储设备B1用于保存温对象元数据;The warm object data layer includes a storage device B1, which is used to store warm object metadata; 温对象数据层包括存储设备B2,存储设备B2用于保存温对象数据;The warm object data layer includes a storage device B2, which is used to store warm object data; 温对象元数据中,包括温对象名称或编号与温对象数据的存储设备B2的位置信息;The temperature object metadata includes the temperature object name or number and the location information of the storage device B2 of the temperature object data; 所述存储设备B1为SSD存储设备,所述存储设备B2为磁盘设备;The storage device B1 is an SSD storage device, and the storage device B2 is a disk device; 先获取对象元数据,基于对象元数据的内容,确认数据是在磁盘;First, obtain the object metadata and confirm that the data is on the disk based on the content of the object metadata; 所述温对象数据用对象存储系统保存;The warm object data is stored in an object storage system; 所述温对象元数据用key-value分布式存储系统保存。The warm object metadata is stored using a key-value distributed storage system. 4.根据权利要求3所述的极冷对象磁带存储系统,其特征在于,4. The extremely cold object magnetic tape storage system according to claim 3, characterized in that: 温对象数据层包括存储设备B3,存储设备B3用于保存冷对象元数据;The warm object data layer includes a storage device B3, which is used to store cold object metadata; 冷对象数据层包括存储设备C3,存储设备C3用于保存冷对象数据;The cold object data layer includes a storage device C3, and the storage device C3 is used to store cold object data; 冷对象元数据中,包括冷对象名称或编号与冷对象数据的存储设备C3的位置信息;The cold object metadata includes the cold object name or number and the location information of the storage device C3 of the cold object data; 所述存储设备B3为SSD存储设备或硬盘设备,The storage device B3 is an SSD storage device or a hard disk device. 所述存储设备C3为磁带存储设备;The storage device C3 is a magnetic tape storage device; 所述冷对象元数据用key-value分布式存储系统保存。The cold object metadata is stored in a key-value distributed storage system. 5.一种极冷对象存储访问方法,用于权利要求1至4任意一项所述的极冷对象磁带存储系统,其特征在于,包括5. A method for accessing an extremely cold object storage, used in the extremely cold object tape storage system according to any one of claims 1 to 4, characterized in that it comprises: 步骤C10:在极冷对象索引库中查找;Step C10: Search in the extremely cold object index library; 步骤C20:是否查询到元数据编号,否,返回结束;Step C20: Check whether the metadata number is found, if not, return to the end; 步骤D10:根据所述元数据编号,在极冷对象元数据存储设备中获得极冷对象元数据;Step D10: Obtaining the extremely cold object metadata in an extremely cold object metadata storage device according to the metadata number; 步骤D20:根据所述极冷对象元数据,在极冷数据存储设备中,获得极冷数据;Step D20: Obtaining extremely cold data in an extremely cold data storage device according to the extremely cold object metadata; 所述极冷对象索引存储在SSD、磁盘或磁带存储设备中;The extremely cold object index is stored in an SSD, disk or tape storage device; 同时存在SSD、磁盘、磁带的系统,极冷对象索引需要按照顺序访问;In a system with SSDs, disks, and tapes, the index of extremely cold objects needs to be accessed sequentially. 所述极冷对象元数据的存储设备为磁带设备;The storage device for the extremely cold object metadata is a magnetic tape device; 所述极冷数据的存储设备为磁带设备。The storage device for the extremely cold data is a magnetic tape device. 6.根据权利要求5所述的极冷对象存储访问方法,其特征在于,步骤C10之前,还包括6. The method for accessing extremely cold object storage according to claim 5, characterized in that before step C10, it also includes 步骤A10:在温对象元数据库中查找;Step A10: Search in the warm object metadata database; 步骤A20:是否查询到温对象元数据;Step A20: whether the metadata of the warm object is found; 步骤A30:是,根据所述温对象元数据获得温对象数据,返回;Step A30: yes, obtain the temperature object data according to the temperature object metadata, and return; 步骤B10:否,冷对象元数据库中查找冷对象元数据;Step B10: No, search for cold object metadata in the cold object metadata database; 步骤B20:是否查询到冷对象元数据;Step B20: whether cold object metadata is found; 步骤B30:是,根据所述冷对象元数据获得冷对象数据,返回;Step B30: yes, obtain the cold object data according to the cold object metadata, and return; 所述冷对象元数据库的存储设备为SSD存储设备或磁盘设备;The storage device of the cold object metadata database is an SSD storage device or a disk device; 所述冷对象数据的存储设备为磁带设备或磁盘设备;The storage device of the cold object data is a magnetic tape device or a magnetic disk device; 所述温对象元数据库的存储设备为SSD存储设备;The storage device of the warm object metadata database is an SSD storage device; 所述温对象数据的存储设备为SSD存储设备。The storage device for the warm object data is an SSD storage device. 7.根据权利要求5所述的极冷对象存储访问方法,其特征在于,7. The method for accessing extremely cold object storage according to claim 5, characterized in that: 步骤C中,如果在获得极冷对象元数据之后,将所述极冷对象元数据搬移到冷对象元数据库中;In step C, after obtaining the extremely cold object metadata, the extremely cold object metadata is moved to a cold object metadata database; 步骤D20中,如果在获得冷对象元数据之后,将所述冷对象数据搬移到SSD存储设备中。In step D20, after the cold object metadata is obtained, the cold object data is moved to the SSD storage device. 8.一种极冷对象存储管理方法,用于权利要求1至4任意一项所述的极冷对象磁带存储系统 ,其特征在于,包括8. A method for managing storage of extremely cold objects, used in the extremely cold object tape storage system according to any one of claims 1 to 4, characterized in that it comprises: 周期性检查对象元数据库;在对象元数据中获取温度属性与最近访问时间;Periodically check the object metadata database; obtain the temperature attribute and the last access time in the object metadata; 如果所述温度属性为冷数据,如果最近访问时间大于设定值温度值TC;所述对象元数据改变数据的温度属性为极冷数据;If the temperature attribute is cold data, if the most recent access time is greater than the set temperature value TC; the temperature attribute of the object metadata change data is extremely cold data; 将所述对象元数据迁移到磁带存储设备中,在极冷对象索引库中,记录所述对象元数据的索引位置,删除对象元数据库中所述的对象元数据。The object metadata is migrated to a tape storage device, an index position of the object metadata is recorded in an extremely cold object index library, and the object metadata in the object metadata library is deleted. 9.根据权利要求8所述的极冷对象存储管理方法,其特征在于,9. The extremely cold object storage management method according to claim 8, characterized in that: 在对象元数据迁移前,统计需要迁移的极冷元数据的数量,如果所述数量大于迁移阈值,启动迁移;所述阈值为磁带设备单次写入数据最小值。Before object metadata migration, the number of extremely cold metadata that needs to be migrated is counted, and if the number is greater than a migration threshold, migration is started; the threshold is the minimum value of data written in a single time by a tape device. 10.根据权利要求8所述的极冷对象存储管理方法,其特征在于,还包括10. The extremely cold object storage management method according to claim 8, characterized in that it also includes 如果所述温度属性为温数据,如果最近访问时间大于设定值温度值TH;在所述对象元数据改变数据的温度属性为冷数据;将所述对象元数据指向的对象数据迁移到磁带存储设备中,在所述对象元数据记录迁移后的磁带位置信息;删除原对象数据,释放存储位置;If the temperature attribute is warm data, and if the most recent access time is greater than the set temperature value TH; change the temperature attribute of the data in the object metadata to cold data; migrate the object data pointed to by the object metadata to a tape storage device, and record the tape location information after migration in the object metadata; delete the original object data and release the storage location; 在对象数据迁移前,统计需要迁移的对象数据的数量,如果所述数量大于迁移阈值,启动迁移;所述阈值为磁带设备单次写入数据最小值。Before the object data is migrated, the number of object data to be migrated is counted, and if the number is greater than a migration threshold, the migration is started; the threshold is the minimum value of data written in a single time by the tape device.
CN202411864081.XA 2024-12-18 2024-12-18 A very cold object magnetic tape storage system, access and management method Active CN119311699B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202411864081.XA CN119311699B (en) 2024-12-18 2024-12-18 A very cold object magnetic tape storage system, access and management method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202411864081.XA CN119311699B (en) 2024-12-18 2024-12-18 A very cold object magnetic tape storage system, access and management method

Publications (2)

Publication Number Publication Date
CN119311699A CN119311699A (en) 2025-01-14
CN119311699B true CN119311699B (en) 2025-04-29

Family

ID=94179377

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202411864081.XA Active CN119311699B (en) 2024-12-18 2024-12-18 A very cold object magnetic tape storage system, access and management method

Country Status (1)

Country Link
CN (1) CN119311699B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN120337879A (en) * 2025-06-16 2025-07-18 湖南涉外经济学院 An online document processing method and system based on document processing model

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118444848A (en) * 2024-04-26 2024-08-06 小红书科技有限公司 Data processing method, apparatus, computer device, and computer readable storage medium

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103714145B (en) * 2013-12-25 2016-11-23 中国地质大学(武汉) Relationship type and Key-Value type database spatial data index method
CN104765656A (en) * 2015-04-23 2015-07-08 浪潮电子信息产业股份有限公司 Backup method and device for cold data
GB201807534D0 (en) * 2018-05-09 2018-06-20 Palantir Technologies Inc Systems and methods for indexing and searching
CN115794687A (en) * 2022-12-12 2023-03-14 三星(中国)半导体有限公司 Method and device for key-value data storage based on log-structure merged tree
CN116361252A (en) * 2023-03-09 2023-06-30 北京百度网讯科技有限公司 Data storage and access method, device, equipment and medium
CN116974467A (en) * 2023-06-20 2023-10-31 杭州拓数派科技发展有限公司 Data caching processing method, device and system

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118444848A (en) * 2024-04-26 2024-08-06 小红书科技有限公司 Data processing method, apparatus, computer device, and computer readable storage medium

Also Published As

Publication number Publication date
CN119311699A (en) 2025-01-14

Similar Documents

Publication Publication Date Title
EP3944556B1 (en) Block data storage method and apparatus, and block data access method and apparatus
US9996557B2 (en) Database storage system based on optical disk and method using the system
US7191189B2 (en) Organizing data objects in a storage device
CN114647658A (en) Data retrieval method, device, equipment and machine-readable storage medium
CN107704211B (en) Magneto-optical-electric hybrid optical disk library and management method and management system thereof
US9189494B2 (en) Object file system
US9773059B2 (en) Tape data management
WO2014169587A1 (en) Database storage system based on optical disk library, and method using same
CN113553325B (en) A method and system for synchronizing aggregate objects in an object storage system
CN106909651A (en) A kind of method for being write based on HDFS small documents and being read
JP2021092950A (en) Data processing device and data processing program
CN119311699B (en) A very cold object magnetic tape storage system, access and management method
CN111159176A (en) Method and system for storing and reading mass stream data
CN115827556A (en) A method for object storage data archiving
US20130275394A1 (en) Information processing system
JP6008820B2 (en) File system for realizing WORM (Write OnceReadMany)
CN114064356A (en) Data management platform, method and node
CN120296224A (en) A hierarchical storage and verification system and method for unstructured data lake
CN110704431A (en) Hierarchical storage management method for mass data
CN109634520B (en) Storage system based on HDFS optical disc library
US11474981B2 (en) Database dual-core storage system based on optical disk and method using the system
JPH01184763A (en) Data storage system in file system
JP2005165485A (en) File management apparatus, storage management system, system management method, program, and recording medium
US20070299890A1 (en) System and method for archiving relational database data
CN120743865B (en) Data separation storage method, device, storage medium and computer program product

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant