CN113708981B - Method and system for repairing fault of cloud host system - Google Patents
- Publication number
- CN113708981B (application CN202111258722.3A)
- Authority
- CN
- China
- Prior art keywords
- cloud host
- repair
- fault
- repairing
- backup
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/06—Management of faults, events, alarms or notifications
- H04L41/0654—Management of faults, events, alarms or notifications using network fault recovery
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/01—Protocols
- H04L67/10—Protocols in which an application is distributed across nodes in the network
Landscapes
- Engineering & Computer Science (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The application provides a method and a system for repairing a cloud host system fault, and belongs to the technical field of cloud platforms. The method comprises the following steps: determining a repair method for the cloud host with the system fault based on the acquired fault information of that cloud host; automatically backing up the cloud host with the system fault to obtain a backup cloud host system; repairing the backup cloud host system based on the fault information and the repair method preset for the fault type corresponding to the fault information; and outputting the repair result, the repair content and the modified files. Because the cloud host with the system fault is backed up first, the automatic repair program can trial the repair on the backup cloud host system, and the cloud host with the system fault is itself repaired only after that trial succeeds. The fault is thus repaired without affecting the cloud host with the system fault, the quality and speed of fault repair are improved, and the economic loss caused by a cloud host system fault not being repaired in time is reduced.
Description
Technical Field
The invention belongs to the technical field of cloud platforms, and particularly relates to a method and a system for repairing a cloud host system fault.
Background
With the popularization of the cloud concept, cloud hosts are used more and more widely and their number grows day by day, so system faults have become a prominent problem. At present, handling a cloud host system problem takes a long time, and the losses users suffer from cloud host system faults are increasing.
Disclosure of Invention
The embodiment of the invention aims to provide a method and a system for repairing a cloud host system fault, which are used for repairing the fault under the condition that the system fault cloud host is not influenced, improving the quality and speed of repairing the fault and reducing the economic loss caused by the fact that a user cannot repair the cloud host system fault in time.
In order to achieve the above object, an embodiment of the present invention provides a method for repairing a cloud host system fault, including:
determining a corresponding repairing method of the system fault cloud host based on the acquired fault information of the system fault cloud host;
automatically backing up the cloud host with the system fault to obtain a backup cloud host system;
repairing the backup cloud host system based on the fault information and the repair method corresponding to the fault information;
and outputting the repairing result, the repairing content and the modified file.
In order to achieve the above object, an embodiment of the present invention further provides a cloud host system fault repairing system, including:
the determining module is used for determining a corresponding repairing method of the system fault cloud host based on the acquired fault information of the system fault cloud host;
the collection backup module is used for automatically backing up the cloud host with the system fault to obtain a backup cloud host system;
the repair module is used for repairing the backup cloud host system based on the fault information and a repair method of a preset fault type corresponding to the fault information;
and the output module is used for outputting the repair result, the repair content and the modified file.
The invention has the following beneficial effects: the invention provides a method and a system for rapidly repairing a cloud host system fault, which can repair the cloud host without influencing the cloud host, improve the quality and speed of repairing the fault, and reduce the economic loss caused by the fact that a user cannot repair the cloud host system fault in time.
According to the invention, the system fault cloud host is backed up to obtain the backup cloud host system, the backup cloud host system is used for carrying out a repair experiment on an automatic repair program, and after the repair is successful, the system fault cloud host is repaired.
The embodiment of the invention repairs automatically according to a preset repair scheme program. If the preset scheme fails, the program automatically compares the cloud host with the system fault against its system template and trials a repair on that basis; after the repair succeeds, the new repair method is automatically synchronized into the repair scheme library.
Drawings
Fig. 1 is a flowchart illustrating steps of a method for repairing a failure of a cloud host system according to an embodiment of the present invention;
fig. 2 is a flowchart illustrating steps of a method for repairing a failure of a cloud host system according to an embodiment of the present invention;
fig. 3 is a block diagram illustrating a structure of a cloud host system fault repairing system according to an embodiment of the present invention;
fig. 4 is a block diagram illustrating a structure of a cloud host system fault repairing system according to another embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, embodiments of the present invention are described in detail below with reference to the accompanying drawings. It should be understood that the description is intended to be exemplary only, and is not intended to limit the scope of the present invention. Moreover, in the following description, descriptions of well-known structures and techniques are omitted so as to not unnecessarily obscure the concepts of the present invention.
The embodiment of the invention mainly addresses the repair strategy for cloud hosts with system faults, wherein the system faults include but are not limited to: operating system faults (e.g., file corruption).
With the preset repair methods, a fault can be repaired in time when the cloud host system fails, which improves fault repair efficiency.
A fault disk refers to a disk on which a fault has occurred.
The system failure cloud host refers to a cloud host with a disk failure, and the backup cloud host system refers to a cloud host obtained by copying a failure disk of the system failure cloud host.
The system template is used when the cloud host is created, is equivalent to a system mirror image, and the cloud host can be created through the system template. The system fault cloud host is created through a system template, and the system template is an initial system template before the system fault cloud host fails.
The system core files comprise the core files of a Linux system and the core files of a Windows system. The core files of a Linux system include: /etc, /boot, /dev, /tmp, etc. The core files of a Windows system include: the files under Windows\Boot required for startup, the .inf files required for installing hardware and software, the main components under Windows\System32, and the like.
Example 1
Fig. 1 is a flowchart illustrating steps of a cloud host system fault repairing method according to an embodiment of the present invention. Referring to Fig. 1 and Fig. 2, an embodiment of the present invention provides a cloud host system fault repairing method, where the repairing method includes:
step S110, determining a corresponding repairing method of the system fault cloud host based on the acquired fault information of the system fault cloud host;
step S120, automatically backing up and archiving a fault disk of the system fault cloud host to obtain a backup cloud host system;
step S130, repairing the backup cloud host system based on the fault information and the corresponding repairing method;
and step S140, outputting the repair result, the repair content and the modified file.
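Steps S110 to S140 can be sketched as follows. This is a minimal illustration under stated assumptions, not the implementation described by the patent: the disk is modeled as a plain dictionary, and the names `RepairReport`, `repair_on_backup`, `fix_fstab` and the `fault_type` key are hypothetical.

```python
import copy
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class RepairReport:
    """What step S140 outputs (hypothetical structure)."""
    success: bool
    content: str
    modified_files: List[str] = field(default_factory=list)


# A repair method receives the backup system and returns a report.
RepairMethod = Callable[[dict], RepairReport]


def repair_on_backup(fault_info: dict, fault_disk: dict,
                     methods: Dict[str, RepairMethod]) -> RepairReport:
    # S110: pick the repair method from the fault type in the fault information.
    method = methods[fault_info["fault_type"]]
    # S120: back up the fault disk; only the copy is touched below.
    backup = copy.deepcopy(fault_disk)
    # S130: repair the backup cloud host system.
    report = method(backup)
    # S140: output the repair result, repair content and modified files.
    return report


def fix_fstab(backup: dict) -> RepairReport:
    # Toy repair method used only for this demonstration.
    backup["/etc/fstab"] = "valid"
    return RepairReport(True, "rewrote /etc/fstab", ["/etc/fstab"])


disk = {"/etc/fstab": "corrupt"}
report = repair_on_backup({"fault_type": "os_fault"}, disk, {"os_fault": fix_fstab})
```

The point of the sketch is the ordering: the repair method only ever sees the copy, so the original `disk` is unchanged after the call.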
In this embodiment, fault information of the cloud host with the system fault is acquired first. When a fault such as an operating system fault (e.g., file corruption) occurs in the cloud host, fault information (e.g., a system fault code) is generated. The fault information carries an identifier of the fault type, and the fault type of the cloud host with the system fault is determined from that identifier. The fault types include, but are not limited to: an operating system fault (e.g., file corruption) and a host system downtime fault (e.g., the host system going down or crashing).
A corresponding fault repair strategy is set in advance for each fault type. For example, when the fault type is an operating system fault, the corresponding repair methods are set as: kernel repair, file system repair, blue screen repair, boot repair, and the like.
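The lookup from fault type identifier to preset repair methods might look like the sketch below. The four method names come from the example above; the table name `PRESET_REPAIRS`, the `fault_type` key and the sample fault code are assumptions made for illustration.

```python
from typing import Dict, List

# Preset strategy table: fault type identifier -> repair methods to try.
PRESET_REPAIRS: Dict[str, List[str]] = {
    "operating_system_fault": [
        "kernel_repair",
        "file_system_repair",
        "blue_screen_repair",
        "boot_repair",
    ],
}


def repair_methods_for(fault_info: dict) -> List[str]:
    """Return the preset repair methods for the fault type carried in fault_info."""
    fault_type = fault_info.get("fault_type")
    # An unknown or missing identifier yields no preset method.
    return list(PRESET_REPAIRS.get(fault_type, []))


# "E-001" is a made-up fault code, present only to show extra fields are ignored.
methods = repair_methods_for({"fault_type": "operating_system_fault", "code": "E-001"})
```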
When a system fault occurs in a cloud host, the cloud host fault disk collection system automatically backs up and archives the fault disk of that cloud host to obtain a backup cloud host system, so that the cloud host with the system fault itself is not affected and the economic loss caused by the fault not being repaired in time is reduced. The backup cloud host system is then repaired according to the determined fault type and the corresponding repair method, and after the repair is completed, the repair result, the repair content and the modified files are output.
In an embodiment of the present invention, the repairing the backup cloud host system based on the fault information and the preset fault type corresponding to the fault information specifically includes:
and according to the fault type and the system template code, matching and calling a corresponding repairing method to repair the backup cloud host system.
Optionally, the outputting the repair result, the repair content, and the modification file specifically includes: the repair result, repair contents, and modified files of the repair method are listed.
To prevent secondary damage caused by the repair, and so that the fault disk can be listed in a cloud host fault library after a failed repair, automatically backing up the cloud host with the system fault optionally includes:
and copying the fault disk of the system fault cloud host to obtain a backup cloud host system.
Optionally, if a plurality of repair methods are determined according to a plurality of fault information, copying a backup cloud host system corresponding to each fault information;
and simultaneously repairing each backup cloud host system based on the fault information and the corresponding repairing method.
When there are multiple pieces of fault information, a separate backup cloud host system is copied for each piece of fault information and repaired. A single host fault may also have several candidate repair methods, and trying them one by one is slow; copying several disks and trying the methods on them at the same time improves repair efficiency.
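The simultaneous attempts described above could be sketched with a thread pool, one backup copy per candidate method. This is an illustrative assumption about how concurrency might be arranged; the function and method names are hypothetical, and the disk is again modeled as a dictionary.

```python
import copy
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, List


def try_methods_in_parallel(fault_disk: dict,
                            methods: Dict[str, Callable[[dict], bool]]) -> List[str]:
    """Try every candidate method on its own copy; return the names that succeeded."""

    def attempt(item):
        name, method = item
        backup = copy.deepcopy(fault_disk)  # one backup copy per attempt
        return name, bool(method(backup))

    # max_workers must be at least 1, even when there is nothing to try.
    with ThreadPoolExecutor(max_workers=max(1, len(methods))) as pool:
        results = list(pool.map(attempt, methods.items()))
    return [name for name, ok in results if ok]


def boot_repair(backup: dict) -> bool:
    backup["boot"] = "ok"
    return True


def kernel_repair(backup: dict) -> bool:
    return False  # pretend this method does not apply to the fault


disk = {"boot": "broken"}
succeeded = try_methods_in_parallel(
    disk, {"kernel_repair": kernel_repair, "boot_repair": boot_repair})
```

Because each attempt works on its own copy, a failed or destructive attempt cannot interfere with the others or with the original disk.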
Optionally, after the backup cloud host system is successfully repaired, the repair method is adopted to repair the cloud host with the system fault, and the repair method is archived.
The archived content includes: the repair method; whether that repair method succeeded (this reflects only the repair method currently in use); the type, size and number of the fault disk; and the repair state. The repair state is one of repairing, repair succeeded, or repair failed; it is an overall state and is distinct from whether an individual repair method succeeded.
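One way to picture the archived record is the sketch below. The field names, the unit of the disk size and the sample values are assumptions; only the list of recorded items and the three repair states come from the text.

```python
from dataclasses import dataclass
from enum import Enum


class RepairState(Enum):
    """Overall state of the repair, independent of any single method."""
    REPAIRING = "repairing"
    SUCCEEDED = "repair succeeded"
    FAILED = "repair failed"


@dataclass
class ArchiveRecord:
    repair_method: str
    method_succeeded: bool   # outcome of this one method only
    disk_type: str
    disk_size_gb: int        # unit is an assumption
    disk_number: str
    state: RepairState       # overall repair state


# One method has failed, but other methods are still being tried,
# so the overall state is still "repairing".
record = ArchiveRecord(
    repair_method="kernel_repair",
    method_succeeded=False,
    disk_type="system",
    disk_size_gb=40,
    disk_number="disk-0001",
    state=RepairState.REPAIRING,
)
```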
Optionally, after the repair of the backup cloud host system fails (that is, the repair still fails after all the repair methods preset by the system have been tried), the repair method is archived, the fault disk is listed in the cloud host fault library, and the MD5 values of the system core files of the backup cloud host system are compared with those of the system template to obtain another repair method, which is then used to repair the backup cloud host system. Specifically, the other repair method is as follows: where the MD5 values of a system core file in the backup cloud host and in the system template are consistent, no repair operation is performed; where they are inconsistent, the backup cloud host is repaired to be consistent with the system template.
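The MD5 comparison against the system template can be sketched with the standard `hashlib` module. This is a simplified illustration that treats the backup and the template as two directory trees; the function names are hypothetical, and treating a core file that is missing from the backup as inconsistent is an assumption, not something the text states.

```python
import hashlib
import shutil
from pathlib import Path
from typing import Iterable, List


def file_md5(path: Path) -> str:
    """Return the hex MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def restore_mismatched(backup_root: Path, template_root: Path,
                       core_files: Iterable[str]) -> List[str]:
    """Make each core file in the backup match the template; return what changed."""
    restored = []
    for rel in core_files:
        src = template_root / rel
        dst = backup_root / rel
        if dst.is_file() and file_md5(dst) == file_md5(src):
            continue  # MD5 values consistent: no repair operation
        # Inconsistent (or missing, by assumption): copy the template version.
        dst.parent.mkdir(parents=True, exist_ok=True)
        shutil.copyfile(src, dst)
        restored.append(rel)
    return restored
```

The returned list doubles as the "modified files" output and as the raw material for recording the new repair method.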
In the embodiment of the invention, listing the fault disk in the cloud host fault library means copying a disk of the original system again and placing the copy in the library; when the repair of the backup cloud host system fails, the copied disk can be listed in the cloud host fault library.
If the repair is successful, the other repair method is recorded, and the repair scheme library is checked for an identical method; if none exists, the other repair method is added to the repair scheme library.
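The duplicate check before adding a method to the repair scheme library can be sketched as below. Modeling a repair method as a tuple of fault type and ordered steps is an assumption made for the example, as are all names and sample values.

```python
from typing import List, Tuple

# A repair method is modeled as (fault type, ordered repair steps).
RepairScheme = Tuple[str, Tuple[str, ...]]


def add_if_new(library: List[RepairScheme], scheme: RepairScheme) -> bool:
    """Append scheme unless an identical one is already in the library."""
    if scheme in library:
        return False
    library.append(scheme)
    return True


library: List[RepairScheme] = [("operating_system_fault", ("boot_repair",))]
new_scheme = ("operating_system_fault", ("restore /etc/fstab from template",))
first = add_if_new(library, new_scheme)   # not yet present, so it is added
second = add_if_new(library, new_scheme)  # already present, so it is skipped
```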
The purpose of listing the fault disk in the cloud host fault library is to record the cloud hosts that the fault repairing system cannot repair (their disks are kept for technical staff). The differences between the system template and the current system are then analyzed, which takes a certain amount of time, and another repair method is obtained after the analysis and processing.
Fig. 3 is a block diagram illustrating a structure of a cloud host system fault repair system according to an embodiment of the present invention, and referring to fig. 3, a cloud host system fault repair system according to an embodiment of the present invention includes:
the determining module 210 is configured to determine a corresponding repairing method of the system failure cloud host based on the acquired failure information of the system failure cloud host;
the collection backup module 220 is used for automatically backing up the cloud host with the system fault to obtain a backup cloud host system;
the repairing module 230 is configured to repair the backup cloud host system based on the fault information and a repairing method of a preset fault type corresponding to the fault information;
and an output module 240, configured to output the repair result, the repair content, and the modification file.
Optionally, the repairing module 230 is configured to match and call a corresponding repairing method according to the fault type and the system template code to repair the backup cloud host system.
Optionally, an output module 240 is configured to list the repair result, the repair content, and the modified file of the repair method.
To prevent secondary damage caused by the repair, and so that the fault disk can be listed in a cloud host fault library after a failed repair, the collection backup module 220 is optionally configured to copy the fault disk of the cloud host with the system fault, so as to obtain a backup cloud host system.
Optionally, the collection backup module 220 is further configured to:
if a plurality of repairing methods are determined according to a plurality of fault information, copying a backup cloud host system corresponding to each fault information;
and simultaneously repairing each backup cloud host system based on the fault information and the corresponding repairing method.
When there are multiple pieces of fault information, a separate backup cloud host system is copied for each piece of fault information and repaired. A single host fault may also have several candidate repair methods, and trying them one by one is slow; copying several disks and trying the methods on them at the same time improves repair efficiency.
Optionally, after the backup cloud host system is successfully repaired, the repairing module 230 is configured to repair the system-failed cloud host by using the repairing method, and archive the repairing method.
The archived content includes: the repair method; whether that repair method succeeded (this reflects only the repair method currently in use); the type, size and number of the fault disk; and the repair state. The repair state is one of repairing, repair succeeded, or repair failed; it is an overall state and is distinct from whether an individual repair method succeeded.
Optionally, as shown in fig. 4, the system further includes: an adding module 250 configured to:
after the repair of the backup cloud host system fails (that is, the repair still fails after all the repair methods preset by the system have been tried), archive the repair method, list the fault disk in a cloud host fault library, and compare the MD5 values of the system core files of the backup cloud host system with those of the system template to obtain another repair method, which is then used to repair the backup cloud host system. Specifically, the other repair method is as follows: where the MD5 values of a system core file in the backup cloud host and in the system template are consistent, no repair operation is performed; where they are inconsistent, the backup cloud host is repaired to be consistent with the system template.
In the embodiment of the invention, listing the fault disk in the cloud host fault library means copying a disk of the original system again and placing the copy in the library; when the repair of the backup cloud host system fails, the copied disk can be listed in the cloud host fault library.
If the repair is successful, the other repair method is recorded, and the repair scheme library is checked for an identical method; if none exists, the other repair method is added to the repair scheme library.
The embodiment of the invention also provides a computer-readable storage medium, wherein a computer execution instruction is stored in the computer-readable storage medium, and when the computer execution instruction is executed by a processor, the computer execution instruction is used for implementing the cloud host system fault repairing method described in any one of the above.
It is to be understood that the above-described embodiments of the present invention are merely illustrative of or explaining the principles of the invention and are not to be construed as limiting the invention. Therefore, any modification, equivalent replacement, improvement and the like made without departing from the spirit and scope of the present invention should be included in the protection scope of the present invention. Further, it is intended that the appended claims cover all such variations and modifications as fall within the scope and boundaries of the appended claims or the equivalents of such scope and boundaries.
Claims (12)
1. A method for repairing a cloud host system fault is characterized by comprising the following steps:
determining a corresponding repairing method of the system fault cloud host based on the acquired fault information of the system fault cloud host;
automatically backing up the cloud host with the system fault to obtain a backup cloud host system;
repairing the backup cloud host system based on the fault information and the repair method corresponding to the fault information;
outputting a repair result, repair contents and a modification file;
the system template is an initial system template before the cloud host with the system fault fails, and after the backup cloud host system is failed to be repaired, whether the MD5 values of the backup cloud host system and the system template on a system core file are consistent or not is analyzed, so that another repairing method is obtained, and the backup cloud host system is repaired by the another repairing method;
and if the repair is successful, recording the other repair method.
2. The method according to claim 1, wherein the repairing the backup cloud host system based on the fault information and the repairing method of the preset fault type corresponding to the fault information specifically comprises:
and according to the fault type and the system template code, matching and calling a corresponding repairing method to repair the backup cloud host system.
3. The method according to claim 1 or 2, wherein automatically backing up the cloud host with the system failure specifically comprises:
and copying the fault disk of the system fault cloud host to obtain a backup cloud host system.
4. The method of claim 1,
if a plurality of repairing methods are determined according to a plurality of fault information, copying a backup cloud host system corresponding to each fault information;
and simultaneously repairing each backup cloud host system based on the fault information and the corresponding repairing method.
5. The method according to claim 1 or 4, wherein after the backup cloud host system is successfully repaired, the repair method is adopted to repair the cloud host with the system failure, and the repair method is archived.
6. The method according to claim 3, wherein after the repair of the backup cloud host system fails, the repair method is archived, the failed disk is listed in the cloud host failure library, and whether the MD5 values of the backup cloud host system and the system template on the system core file are consistent or not is analyzed, so as to obtain the other repair method, after the repair of the backup cloud host system by using the other repair method is successful and the other repair method is recorded, whether the other repair method exists or not is searched in the repair solution library, and if the other repair method does not exist, the other repair method is added into the repair solution library.
7. A cloud host system fault repairing system, comprising:
the determining module is used for determining a corresponding repairing method of the system fault cloud host based on the acquired fault information of the system fault cloud host;
the collection backup module is used for automatically backing up the cloud host with the system fault to obtain a backup cloud host system;
the repair module is used for repairing the backup cloud host system based on the fault information and a repair method of a preset fault type corresponding to the fault information;
the output module is used for outputting the repair result, the repair content and the modified file;
the system template is an initial system template before the cloud host with the system fault fails, and the system further comprises: an add module to: after the repair of the backup cloud host system fails, analyzing whether the MD5 values of the backup cloud host system and the system template on the system core file are consistent or not, so as to obtain another repair method, and repairing the backup cloud host system by adopting the another repair method;
and if the repair is successful, recording the other repair method.
8. The system of claim 7, wherein the repair module is to:
and according to the fault type and the system template code, matching and calling a corresponding repairing method to repair the backup cloud host system.
9. The system of claim 7 or 8, wherein the collection backup module is configured to:
and copying the fault disk of the system fault cloud host to obtain a backup cloud host system.
10. The system of claim 7, wherein the collection backup module is further configured to:
if a plurality of repairing methods are determined according to a plurality of fault information, copying a backup cloud host system corresponding to each fault information;
and simultaneously repairing each backup cloud host system based on the fault information and the corresponding repairing method.
11. The system according to claim 7 or 10, wherein after the backup cloud host system is successfully repaired, the repair module is configured to repair the cloud host with the system failure by using the repair method, and archive the repair method.
12. The system of claim 9, wherein the adding module is configured to: after the repair of the backup cloud host system fails, the repair method is archived, the fault disk is listed into a cloud host fault library, whether the MD5 values of the backup cloud host system and the system template on the system core file are consistent or not is analyzed, so that another repair method is obtained, after the backup cloud host system is successfully repaired by the another repair method and the another repair method is recorded, whether the another repair method exists or not is searched in a repair scheme library, and if the another repair method does not exist, the another repair method is added into the repair scheme library.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111258722.3A CN113708981B (en) | 2021-10-28 | 2021-10-28 | Method and system for repairing fault of cloud host system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111258722.3A CN113708981B (en) | 2021-10-28 | 2021-10-28 | Method and system for repairing fault of cloud host system |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113708981A (en) | 2021-11-26 |
CN113708981B (en) | 2022-04-15 |
Family
ID=78647159
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111258722.3A Active CN113708981B (en) | 2021-10-28 | 2021-10-28 | Method and system for repairing fault of cloud host system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113708981B (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103167004A (en) * | 2011-12-15 | 2013-06-19 | 中国移动通信集团上海有限公司 | Cloud platform host system failure repair method and cloud platform front-end control server |
CN107026760A (en) * | 2017-05-03 | 2017-08-08 | 联想(北京)有限公司 | A kind of fault repairing method and monitor node |
CN111124758A (en) * | 2019-12-26 | 2020-05-08 | 厦门市美亚柏科信息股份有限公司 | Data recovery method for failed hard disk |
CN111382006A (en) * | 2018-12-27 | 2020-07-07 | 中国移动通信集团四川有限公司 | Operating system recovery method, device, device and medium |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10516688B2 (en) * | 2017-01-23 | 2019-12-24 | Microsoft Technology Licensing, Llc | Ransomware resilient cloud services |
CN109995574A (en) * | 2018-01-02 | 2019-07-09 | 中兴通讯股份有限公司 | It is a kind of to repair the method for VNFM failure, monitor, VIM, VNFM and storage medium |
Also Published As
Publication number | Publication date |
---|---|
CN113708981A (en) | 2021-11-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110543386B (en) | Data storage method, device, equipment and storage medium | |
CN101641681A (en) | Self-managed processing device | |
CN109032838B (en) | Automatic verification method for consistency of backup and recovery data of virtual machine | |
CN109144789B (en) | A method, device and system for restarting OSD | |
CN114417335A (en) | Malicious file detection method and device, electronic equipment and storage medium | |
CN105022678A (en) | Data backup method and apparatus for virtual machine | |
CN111274065A (en) | Disaster recovery method and device based on software | |
CN110990124A (en) | Cloud host recovery method and device | |
CN112333008B (en) | Cloud host complete machine backup and restoration method, device, equipment and storage medium | |
CN103064759B (en) | The method of data restore and device | |
CN107765990A (en) | A kind of system disk management method and device | |
CN113708981B (en) | Method and system for repairing fault of cloud host system | |
CN113204359A (en) | Online server upgrading method and device | |
CN115098300B (en) | Database backup method, disaster recovery method, device and equipment | |
CN109104314B (en) | Method and device for modifying log configuration file | |
CN109446212B (en) | Dual-active host system switching method and system | |
US20240419559A1 (en) | Method for apparatus for recovering data of dual-machine hot standby system, and medium | |
CN111124737A (en) | Cloud platform operation conflict judgment method and electronic equipment | |
CN116643924A (en) | Data recovery method and device based on Oracle database | |
US20160004607A1 (en) | Information processing apparatus and information processing method | |
CN114253947A (en) | Method, device, equipment and medium for automatically cleaning MySQL filing log | |
CN112256481A (en) | Data recovery method and system in agile development | |
CN113467805A (en) | Firmware rollback method of network equipment and related device | |
JP2001005693A (en) | System and method for automatically restoring fault and recording medium recording automatic fault restoration program | |
CN118519834B (en) | File system repair method, device, storage medium and electronic device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||