CN110222035A

CN110222035A - An efficient fault-tolerance method for database pages based on XOR checksum and log recovery

Info

Publication number: CN110222035A
Application number: CN201910495162.XA
Authority: CN
Inventors: 刘碧楠; 谭炜波; 蒋旭; 孙磊; 吴嵩; 姬涛涛; 顾云苏
Original assignee: TIANJIN SHENZHOU GENERAL DATA CO Ltd
Current assignee: TIANJIN SHENZHOU GENERAL DATA CO Ltd
Priority date: 2019-06-10
Filing date: 2019-06-10
Publication date: 2019-09-10

Abstract

The present invention relates to an efficient fault-tolerance method for database pages based on XOR checksum and log recovery, comprising the following steps: calculating the check code corresponding to a data page and storing it in the header of the data page; when reading the data page from the storage medium into memory, determining from the check code whether the data page is damaged; during instance recovery at database startup, skipping damaged data pages so that the database starts normally, and after startup informing the user which database objects contain damaged data pages; after the database has started successfully, determining which tables have page corruption, whereupon the user processes the damaged pages of those tables through a special SQL statement. The invention is reasonably designed: an XOR check code of the data stored in the page is kept inside the data page and is used to detect in real time whether the page is damaged; damaged data can be recovered, reducing the user's data loss.

Description

An efficient fault-tolerance method for database pages based on XOR checksum and log recovery

Technical field

The invention belongs to the technical field of data storage and query, and in particular relates to an efficient fault-tolerance method for database pages based on XOR checksum and log recovery.

Background technique

Usually, the data of a large general-purpose relational database is stored on non-volatile storage media (commonly disks, disk arrays, network storage, or solid-state drives). To reduce I/O overhead, these storage media usually read and write in units of data pages (integer multiples of 512 bytes); that is, even to access 1 byte of data, the entire page must be read or written. In the ShenTong database, the storage page size is 8 KB.

With the data explosion brought about by information technology, the volume of data stored in modern databases keeps growing, and frequent data access causes bad blocks to appear in storage media more and more often. Some bad blocks cannot be discovered by the storage medium's own management system (for cost and performance reasons). Once a bad block appears in a database data file, at best some database objects become inaccessible, and at worst the database system cannot start.

At present, the usual solution is for the operating system's file system to detect block damage in real time and find bad blocks in advance. However, this method cannot fully prevent bad blocks from being accessed. In particular, the kind of bad block that can be read and written normally, but whose returned data content contains flipped bits, is essentially undetectable.

Summary of the invention

The object of the invention is to overcome the deficiencies of the prior art and to propose a reasonably designed, efficient fault-tolerance method for database pages based on XOR checksum and log recovery that ensures safe data access and reduces user data loss.

The present invention solves its technical problem by adopting the following technical solution:

An efficient fault-tolerance method for database pages based on XOR checksum and log recovery, comprising the following steps:

Step 1: before a data page in memory is written to the storage medium, calculate the check code corresponding to the data page and store the check code in the header of the data page;

Step 2: when reading a data page from the storage medium into memory, calculate the check code of the data page that was read and compare it numerically with the check code stored in the page header: if the values are equal, the page has no error and is marked as a normal page; if the values are not equal, the page has an error and is marked as a damaged page, and processing proceeds to step 3;

Step 3: during instance recovery at database startup, skip the damaged data pages so that the database starts normally, and after startup inform the user which database objects contain damaged data pages;

Step 4: after the database has started successfully, determine which tables have page corruption; the user processes the damaged pages of those tables through a special SQL statement.

The check code in step 1 is generated by performing a 64-bit XOR operation on the data in the data page, and the check code is written into the page header.

The check code is 8 bytes long and is stored in the header of the data page.

Step 1 is specifically implemented as follows:

(1) prepare to write the data page in memory to the storage medium;

(2) the data page size is 8192 bytes, so in memory the data page can be regarded as an array of 1024 64-bit integers;

(3) assign 0 to the 1st element of the 64-bit integer array;

(4) XOR the 1st element of the 64-bit integer array successively with the remaining 1023 elements of the array, writing each result back to the 1st element;

(5) write the data page to the storage medium.
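
The write-side procedure above can be sketched in Python as follows. This is a minimal illustration, not the patented implementation: the function names `xor_check_code` and `seal_page` are hypothetical, and little-endian byte order is an assumption the text does not state. Skipping word 0 is equivalent to zeroing it first and then XOR-ing in place, as sub-steps (3) and (4) describe.

```python
import struct

PAGE_SIZE = 8192            # page size in bytes, as stated in the text
WORDS = PAGE_SIZE // 8      # 1024 64-bit integers per page


def xor_check_code(page) -> int:
    """XOR of words 1..1023 of the page; word 0 is the check-code slot."""
    if len(page) != PAGE_SIZE:
        raise ValueError("page must be exactly 8192 bytes")
    # Assumption: words are little-endian unsigned 64-bit integers.
    words = struct.unpack(f"<{WORDS}Q", page)
    code = 0
    for word in words[1:]:
        code ^= word
    return code


def seal_page(page: bytearray) -> None:
    """Store the check code in the first 8 bytes before the page is written."""
    struct.pack_into("<Q", page, 0, xor_check_code(page))
```

A caller would run `seal_page(page)` immediately before handing the page to the storage layer, so that the header always reflects the bytes that are actually written.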

Step 2 is implemented by the following steps:

(1) read the data page from the storage medium into memory;

(2) if the storage medium reports an error while the data page is being read, mark the data page in memory as damaged;

(3) the data page size is 8192 bytes, so in memory the data page can be regarded as an array of 1024 64-bit integers;

(4) using a local variable x with an initial value of 0, XOR it successively with the 1023 elements of the array other than the first element, writing each result back to x;

(5) determine whether the local variable x equals the first element of the 64-bit integer array: if they are equal, mark the data page in memory as valid; if they are not equal, mark the data page in memory as damaged.
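
The read-side check can be sketched in the same way. The function name `is_page_valid` and the `read_error` flag (standing in for an error reported by the storage medium) are hypothetical, and little-endian byte order is again an assumption.

```python
import struct
from functools import reduce
from operator import xor

PAGE_SIZE = 8192  # page size in bytes, as stated in the text


def is_page_valid(page: bytes, read_error: bool = False) -> bool:
    """Return True if the page passes the XOR check, False if it is damaged."""
    # Sub-step (2): a read error from the medium marks the page as damaged.
    if read_error or len(page) != PAGE_SIZE:
        return False
    # Sub-step (3): view the page as 1024 64-bit integers (assumed little-endian).
    words = struct.unpack("<1024Q", page)
    # Sub-step (4): fold XOR over every word except the first, starting from 0.
    x = reduce(xor, words[1:], 0)
    # Sub-step (5): compare with the check code stored in the first word.
    return x == words[0]
```

Because the comparison is a plain XOR fold, a single flipped bit anywhere in the page changes the result, whereas two flips at the same bit position in different words cancel each other out.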

The processing method of step 3 comprises the following steps:

(1) the database starts and begins instance recovery;

(2) initialize the linked list that records damaged data pages;

(3) read a log record from the redo log file;

(4) parse the log record to obtain the data page number;

(5) if the data page number is already in the damaged-page linked list, skip the log record;

(6) read the data page from the storage medium;

(7) if the data page is damaged, add the data page and the database object it belongs to to the damaged-page linked list and skip the log record without redoing it; otherwise, redo the log record;

(8) continue reading the next log record and repeat the above process until all log records have been redone;

(9) after instance recovery completes, check whether the damaged-page linked list contains any records; if it does, issue a warning telling the user which database objects have damaged data pages;

(10) the database starts.
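
The recovery loop above can be sketched as follows. Everything named here is hypothetical: `RedoRecord`, `instance_recovery`, and the three callbacks stand in for the database's real log reader, page reader, catalog lookup, and redo routine. A dictionary replaces the linked list described in the text purely to keep the lookup in sub-step (5) short.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, List, Tuple


@dataclass
class RedoRecord:
    page_no: int   # data page number parsed from the log record
    payload: str   # opaque change description (illustrative only)


def instance_recovery(
    redo_log: Iterable[RedoRecord],
    page_is_valid: Callable[[int], bool],   # reads the page and runs the step 2 check
    owner_of: Callable[[int], str],         # maps a page to its database object
    redo: Callable[[RedoRecord], None],     # applies one log record
) -> Tuple[Dict[int, str], List[str]]:
    damaged: Dict[int, str] = {}            # damaged page number -> owning object
    for record in redo_log:
        if record.page_no in damaged:       # (5) already known damaged: skip
            continue
        if not page_is_valid(record.page_no):   # (6)-(7) read and check the page
            damaged[record.page_no] = owner_of(record.page_no)
            continue                        # skip without redoing
        redo(record)                        # (7) otherwise redo the record
    # (9) build one warning per damaged page for the user.
    warnings = [
        f"data page {page_no} of object {obj} is damaged"
        for page_no, obj in damaged.items()
    ]
    return damaged, warnings
```

The point of the sketch is the control flow: a damaged page never aborts recovery, it only causes the log records that touch it to be skipped and a warning to be reported once startup completes.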

In step 4, processing the damaged pages of the table through SQL comprises the following steps:

(1) place an exclusive lock on the table to prevent other sessions from accessing it;

(2) obtain, one at a time, the data pages not yet marked as damaged from the segment management module and determine whether each data page is damaged; if a data page is damaged, mark it in the tablespace management structure so that the data page is no longer used, and also mark it in the segment management module so that subsequent page searches of the segment skip the page;

(3) continue obtaining data pages until all data pages of the segment have been processed;

(4) release the exclusive lock on the table and return success.
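
The repair pass above can be sketched as a single loop under a lock. The names `Segment`, `repair_table`, and `page_is_damaged` are hypothetical, a `threading.Lock` stands in for the table's exclusive lock, and plain sets stand in for the marker bits kept by the segment management module and the tablespace management structure.

```python
import threading
from dataclasses import dataclass, field
from typing import Callable, List, Set


@dataclass
class Segment:
    pages: List[int]                                  # page numbers owned by the table's segment
    damaged: Set[int] = field(default_factory=set)    # pages already marked as damaged
    lock: threading.Lock = field(default_factory=threading.Lock)


def repair_table(
    segment: Segment,
    tablespace_damaged: Set[int],
    page_is_damaged: Callable[[int], bool],           # runs the step 2 check on one page
) -> int:
    """Mark every newly found damaged page; return how many were found."""
    found = 0
    with segment.lock:                                # (1) exclusive lock, (4) released on exit
        for page_no in segment.pages:                 # (2)-(3) walk all pages of the segment
            if page_no in segment.damaged:
                continue                              # already marked: nothing to do
            if page_is_damaged(page_no):
                tablespace_damaged.add(page_no)       # never allocate this page again
                segment.damaged.add(page_no)          # later segment scans skip it
                found += 1
    return found
```

Marking the page in both structures matters: the segment-level mark keeps table scans away from the page, while the tablespace-level mark keeps the allocator from handing it out again.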

The advantages and positive effects of the present invention are:

The invention is reasonably designed. Eight bytes are allocated in the database page header to store the XOR check code of the data stored in the page, which can be used to detect in real time whether the page is damaged. When a damaged data page is detected, access to the damaged page is skipped, so that the database can still start successfully without loading data that has already been damaged. After the database starts, the database administrator can recover the damaged data by means such as data backups, logs, or mirroring, and write it into undamaged data pages to restore the data. Where these means were not configured and the data therefore cannot be recovered, the user can still process the database objects containing damaged data pages through an SQL command, so that the data in the undamaged data pages of those objects can be accessed normally, reducing the user's data loss.

Brief description of the drawings

Fig. 1 is a flowchart of data page writing according to the invention;

Fig. 2 is a flowchart of data page reading according to the invention;

Fig. 3 is a workflow diagram of fault-tolerant handling of damaged pages according to the invention;

Fig. 4 is a flowchart of damaged-page processing according to the invention.

Specific embodiment

Embodiments of the present invention are further described below with reference to the accompanying drawings.

Step 1: before a data page in memory is written to the storage medium, first calculate the 8-byte check code corresponding to the data page and store it in the first 8 bytes of the data page header. The purpose is to make it possible to identify and handle damaged pages.

To identify damaged pages, the invention performs a 64-bit XOR operation on the data in the data page before the page is written to the storage medium and writes the result into the 8 bytes of the page header. When the data page is read from the storage medium into memory, the same 64-bit XOR operation is performed and its result is compared with the 8 bytes in the page header.

As shown in Fig. 1, the data page writing process comprises the following steps:

(1) prepare to write the data page in memory to the storage medium;

(2) the data page size is 8192 bytes, so in memory the data page can be regarded as an array of 1024 64-bit integers;

(3) assign 0 to the 1st element of the 64-bit integer array;

(4) XOR the 1st element of the 64-bit integer array successively with the remaining 1023 elements of the array, writing each result back to the 1st element;

(5) write the data page to the storage medium.

Step 2: when reading a data page from the storage medium into memory, calculate the 8-byte check code of the page that was read using the same calculation as in step 1, and compare it numerically with the 8 bytes stored in the page header in step 1: if the values are equal, the page has no error and is marked as a normal page; if the values are not equal, the page has an error and is marked as a damaged page, which requires special handling in steps 3 and 4.

As shown in Fig. 2, the data page reading process comprises the following steps:

(1) read the data page from the storage medium into memory;

Step 3: during database startup, the logs must be redone. When the content of the data page related to a log record is read, if step 2 has marked the page as damaged, fault-tolerant processing is required: all damaged pages are saved in a dedicated linked list; because the page is damaged, redo of the log records related to that page must be skipped; and a warning is issued to the user indicating the table to which the damaged page belongs.

If a damaged data page is found during instance recovery at database startup, the page is skipped to ensure that the database can start normally, and after startup the user is informed which database objects contain damaged data pages.

As shown in Fig. 3, the fault-tolerant processing of damaged pages comprises the following steps:

(1) the database starts and begins instance recovery;

(2) initialize the linked list that records damaged data pages;

(3) read a log record from the redo log file;

(4) parse the log record to obtain the data page number;

(5) if the data page number is already in the damaged-page linked list, skip the log record without redoing it;

(6) read the data page from the storage medium;

(7) if the data page is damaged, add the data page and the database object it belongs to to the damaged-page linked list and skip the log record without redoing it; otherwise, redo the log record;

(8) continue reading the next log record and repeat the above process until all log records have been redone;

(9) after instance recovery completes, check whether the damaged-page linked list contains any records; if it does, issue a warning telling the user which database objects have damaged data pages;

(10) the database starts successfully.

Step 4: after the database has started successfully, the warning information from step 3 makes clear which tables have page corruption. The user can process the damaged pages of such a table through a special SQL statement, which mainly involves the following:

In the table's segment management information, traverse all data pages of the segment and perform the page check of step 2; if a page is damaged, it must be flagged, and all subsequent accesses to the table must ignore the page;

In addition, damaged pages found during the segment traversal must also be flagged as damaged in the tablespace management structure, so that these pages are never allocated again.

The invention also provides a mechanism that allows the user to process database objects containing damaged data pages through the SQL command "ALTER TABLE <table name> REPAIR", so that the data in the undamaged data pages of the database object can be accessed normally, reducing the user's data loss. During processing, the damaged data pages are removed from the database object, and a flag bit added for each page in the tablespace management structure is set so that the data page is no longer used.

In the ShenTong database, the data pages of a table are managed by the segment management module: each table has exactly one corresponding data segment; each data segment has a unique segment number; each data page belongs to at most one segment, and the data segment it belongs to can be recorded in the page header; the segment management module also needs to add a flag bit for each data page of the segment to mark whether the page is damaged.
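
The bookkeeping described in this paragraph can be modeled roughly as follows. This is only a sketch of the stated invariants (one segment per table, unique segment numbers, at most one owning segment per page, one damage flag per page); the class and method names are hypothetical and do not reflect the actual segment management module.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterator


@dataclass
class DataSegment:
    segment_no: int                                   # unique segment number
    table_name: str                                   # the one table this segment belongs to
    page_damaged: Dict[int, bool] = field(default_factory=dict)  # page number -> damage flag

    def usable_pages(self) -> Iterator[int]:
        """Yield page numbers in insertion order, skipping damaged pages."""
        return (p for p, bad in self.page_damaged.items() if not bad)


class SegmentManager:
    def __init__(self) -> None:
        self.segments: Dict[int, DataSegment] = {}
        self.page_owner: Dict[int, int] = {}          # page number -> owning segment number

    def create_segment(self, segment_no: int, table_name: str) -> DataSegment:
        if segment_no in self.segments:
            raise ValueError("segment number must be unique")
        segment = DataSegment(segment_no, table_name)
        self.segments[segment_no] = segment
        return segment

    def assign_page(self, segment_no: int, page_no: int) -> None:
        if page_no in self.page_owner:
            raise ValueError("a page belongs to at most one segment")
        self.page_owner[page_no] = segment_no
        self.segments[segment_no].page_damaged[page_no] = False

    def mark_damaged(self, page_no: int) -> None:
        self.segments[self.page_owner[page_no]].page_damaged[page_no] = True
```

With this layout, a table scan would iterate `usable_pages()` rather than the raw page list, which is how a page flagged as damaged drops out of all later accesses.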

As shown in Fig. 4, the method of processing damaged pages in this step comprises the following steps:

(1) place an exclusive lock on the table to prevent other sessions from accessing it;

(2) obtain, one at a time, the data pages not yet marked as damaged from the segment management module and determine whether each data page is damaged; if a data page is damaged, mark it in the tablespace management structure so that the data page is no longer used, and also mark it in the segment management module so that subsequent page searches of the segment skip the page;

(3) continue obtaining data pages until all data pages of the segment have been processed;

(4) release the exclusive lock on the table and return success.

It should be emphasized that the embodiments described herein are illustrative rather than restrictive. The invention therefore includes, but is not limited to, the embodiments described in the detailed description; any other embodiments derived by those skilled in the art from the technical solution of the invention likewise fall within the scope of protection of the invention.

Claims

1. An efficient fault-tolerance method for database pages based on XOR checksum and log recovery, characterized by comprising the following steps:

Step 1: before a data page in memory is written to the storage medium, calculating the check code corresponding to the data page and storing the check code in the header of the data page;

Step 2: when reading a data page from the storage medium into memory, calculating the check code of the data page that was read and comparing it numerically with the check code stored in the page header: if the values are equal, the page has no error and is marked as a normal page; if the values are not equal, the page has an error and is marked as a damaged page, and processing proceeds to step 3;

Step 3: during instance recovery at database startup, skipping the damaged data pages so that the database starts normally, and after startup informing the user which database objects contain damaged data pages;

Step 4: after the database has started successfully, determining which tables have page corruption, the user processing the damaged pages of those tables through a special SQL statement.

2. The efficient fault-tolerance method for database pages based on XOR checksum and log recovery according to claim 1, characterized in that: the check code in step 1 is generated by performing a 64-bit XOR operation on the data in the data page, and the check code is written into the page header.

3. The efficient fault-tolerance method for database pages based on XOR checksum and log recovery according to claim 1 or 2, characterized in that: the check code is 8 bytes long and is stored in the header of the data page.

4. The efficient fault-tolerance method for database pages based on XOR checksum and log recovery according to claim 1 or 2, characterized in that step 1 is specifically implemented as follows:

(1) prepare to write the data page in memory to the storage medium;

(2) the data page size is 8192 bytes, so in memory the data page can be regarded as an array of 1024 64-bit integers;

(3) assign 0 to the 1st element of the 64-bit integer array;

(4) XOR the 1st element of the 64-bit integer array successively with the remaining 1023 elements of the array, writing each result back to the 1st element;

(5) write the data page to the storage medium.

5. The efficient fault-tolerance method for database pages based on XOR checksum and log recovery according to claim 1, characterized in that step 2 is implemented by the following steps:

(1) read the data page from the storage medium into memory;

(3) the data page size is 8192 bytes, so in memory the data page can be regarded as an array of 1024 64-bit integers;

(4) using a local variable x with an initial value of 0, XOR it successively with the 1023 elements of the array other than the first element, writing each result back to x;

(5) determine whether the local variable x equals the first element of the 64-bit integer array: if they are equal, mark the data page in memory as valid; if they are not equal, mark the data page in memory as damaged.

6. The efficient fault-tolerance method for database pages based on XOR checksum and log recovery according to claim 1, characterized in that the processing method of step 3 comprises the following steps:

(1) the database starts and begins instance recovery;

(2) initialize the linked list that records damaged data pages;

(3) read a log record from the redo log file;

(4) parse the log record to obtain the data page number;

(6) read the data page from the storage medium;

(7) if the data page is damaged, add the data page and the database object it belongs to to the damaged-page linked list and skip the log record without redoing it; otherwise, redo the log record;

(9) after instance recovery completes, check whether the damaged-page linked list contains any records; if it does, issue a warning telling the user which database objects have damaged data pages;

(10) the database starts.

7. The efficient fault-tolerance method for database pages based on XOR checksum and log recovery according to claim 1, characterized in that, in step 4, processing the damaged pages of the table through SQL comprises the following steps:

(1) place an exclusive lock on the table to prevent other sessions from accessing it;

(2) obtain, one at a time, the data pages not yet marked as damaged from the segment management module and determine whether each data page is damaged; if a data page is damaged, mark it in the tablespace management structure so that the data page is no longer used, and also mark it in the segment management module so that subsequent page searches of the segment skip the page;

(3) continue obtaining data pages until all data pages of the segment have been processed;

(4) release the exclusive lock on the table and return success.