[go: up one dir, main page]

CN103150226A - Abnormal dump and recovery system for computer model and dump and recovery method thereof - Google Patents

Abnormal dump and recovery system for computer model and dump and recovery method thereof Download PDF

Info

Publication number
CN103150226A
CN103150226A CN201310110711XA CN201310110711A CN103150226A CN 103150226 A CN103150226 A CN 103150226A CN 201310110711X A CN201310110711X A CN 201310110711XA CN 201310110711 A CN201310110711 A CN 201310110711A CN 103150226 A CN103150226 A CN 103150226A
Authority
CN
China
Prior art keywords
context information
dump
exception
abnormal
module
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201310110711XA
Other languages
Chinese (zh)
Inventor
邢宏伟
丁书耕
张建辉
陈雷动
王衡
陈飞龙
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Electric Power Research Institute of State Grid Shandong Electric Power Co Ltd
Shandong Luneng Software Technology Co Ltd
Original Assignee
Shandong Luneng Software Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shandong Luneng Software Technology Co Ltd filed Critical Shandong Luneng Software Technology Co Ltd
Priority to CN201310110711XA priority Critical patent/CN103150226A/en
Publication of CN103150226A publication Critical patent/CN103150226A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Debugging And Monitoring (AREA)

Abstract

本发明涉及计算机模型,特别涉及一种计算机模型异常转储与恢复系统及其转储与恢复方法。该计算机模型异常转储与恢复系统,其特殊之处在于:该系统以计算机模型的计算引擎为驱动,包括:异常检测模块,用于检测计算模型的运行情况,如果发生异常调用异常转储模块存储上下文信息;异常转储模块,用于保存计算模型上下文信息;异常转储文件读取模块,读取异常转储文件,恢复发生异常时上下文信息。本发明的系统可以在计算模型运行发生异常(计算模型启动后长时间不结束)时,记录所有上下文信息,并在恢复时构造异常发生时的上下文信息,用于验证是否修改有效,且可以模拟出异常发生时的情况,利于分析原因,修改计算模型。

The invention relates to a computer model, in particular to a computer model abnormal dumping and restoring system and a dumping and restoring method thereof. The special feature of this computer model abnormal dump and recovery system is that the system is driven by the computing engine of the computer model, including: an abnormal detection module, which is used to detect the operation of the computing model, and if an abnormality occurs, call the abnormal dump module Store context information; exception dump module, used to save calculation model context information; exception dump file reading module, read exception dump file, restore context information when exception occurs. The system of the present invention can record all context information when the calculation model runs abnormally (the calculation model does not end for a long time after starting), and constructs the context information when the abnormality occurs during recovery to verify whether the modification is effective, and can simulate It is helpful to analyze the cause and modify the calculation model by finding out the situation when the abnormality occurs.

Description

计算机模型异常转储与恢复系统及其转储与恢复方法Computer model exception dump and restore system and its dump and restore method

(一)        技术领域 (1) Technical field

    本发明涉及计算机模型,特别涉及一种计算机模型异常转储与恢复系统及其转储与恢复方法。 The present invention relates to a computer model, in particular to a computer model exception dump and recovery system and its dump and recovery method.

(二)        背景技术 (2) Background technology

对于计算模型在现场运行过程中,由于计算模型的编写错误或测点值读取失败等原因可能导致计算模型发生异常。当现场出现异常后,只能靠以往经验来判断问题的起因,修改计算模型。这样带来了几个弊端:一、由于没有出现异常时的上下文信息,无法验证是否修改有效;二、有可能修改错误,反而增加计算模型运行中的风险;三、修改效率低下,定位不准确。四、需要具有丰富经验的人员参与修改。上述问题急需解决。 During the on-site operation of the calculation model, the calculation model may be abnormal due to reasons such as wrong writing of the calculation model or failure to read measurement point values. When an abnormality occurs on site, the cause of the problem can only be judged by past experience and the calculation model can be modified. This has brought several disadvantages: 1. Since there is no context information when an exception occurs, it is impossible to verify whether the modification is effective; 2. It is possible to modify the error, which will increase the risk in the operation of the calculation model; 3. The modification efficiency is low and the positioning is not accurate. . Fourth, the need for personnel with extensive experience to participate in the revision. The above problems urgently need to be solved.

(三)        发明内容 (3) Contents of the invention

    本发明为了弥补现有技术的不足,提供了一种计算机模型异常转储与恢复系统及其转储与恢复方法,计算模型在运行过程中发生异常时,可以通过异常转储机制记录异常发生时的上下文信息,当修改计算模型后,恢复异常时的上下文的信息,以用于验证是否修改有效。 In order to make up for the deficiencies in the prior art, the present invention provides a computer model abnormality dump and recovery system and its dump and recovery method. When the calculation model is abnormal during operation, it can record the time when the abnormality occurs through the abnormality dump mechanism. Context information, when the calculation model is modified, the context information when the exception is restored is used to verify whether the modification is valid.

本发明是通过如下技术方案实现的:  The present invention is achieved through the following technical solutions:

一种计算机模型异常转储与恢复系统,其特殊之处在于:该系统以计算机模型的计算引擎为驱动,包括: A computer model abnormal dump and recovery system, which is special in that: the system is driven by the calculation engine of the computer model, including:

异常检测模块,用于检测计算模型的运行情况,如果发生异常调用异常转储模块存储上下文信息; Anomaly detection module, used to detect the operation of the calculation model, if an exception occurs, call the exception dump module to store context information;

异常转储模块,用于保存计算模型上下文信息; Exception dump module, used to save computing model context information;

异常转储文件读取模块,读取异常转储文件,恢复发生异常时上下文信息。 The exception dump file reading module reads the exception dump file and restores the context information when an exception occurs.

本发明的二次设备智能状态评估诊断系统,启动计算引擎的同时加载异常检测模块,异常检测模块与定时器连接,通过输入设备调用异常转储文件读取模块在显示器上显示异常时上下文信息,异常转储模块与储存器连接。 In the intelligent state evaluation and diagnosis system of secondary equipment of the present invention, the abnormal detection module is loaded while the computing engine is started, the abnormal detection module is connected with the timer, and the abnormal dump file reading module is called through the input device to display the abnormal context information on the display, The exception dump module is connected with the storage.

本发明的计算机模型异常转储与恢复系统转储与恢复的方法,其特殊之处在于:包括以下步骤: The method for computer model abnormal dumping and recovery system dumping and recovery of the present invention is special in that it includes the following steps:

(1)载入计算模型文件,启动计算引擎运行计算模型,同时记录计算模型运行起始时间,开启异常检测模块启动定时器,定时器每隔一定周期利用当前时间减去计算模型运行起始时间得到运行时长,运行时长与用户配置的最大时长进行比较,如果超出,通过异常转储模块,调用计算引擎来保存上下文信息,并通知计算引擎终止计算任务; (1) Load the calculation model file, start the calculation engine to run the calculation model, and record the start time of the calculation model operation, open the exception detection module to start the timer, and the timer uses the current time to subtract the start time of the calculation model operation every certain period Get the running time, compare the running time with the maximum time configured by the user, if it exceeds, call the computing engine to save the context information through the exception dump module, and notify the computing engine to terminate the computing task;

(2)当发生异常后,通过计算引擎接口获取运行时的上下文信息,将上下文信息组织成XML格式,再将上下文信息保存到文件中; (2) When an exception occurs, obtain the runtime context information through the computing engine interface, organize the context information into XML format, and then save the context information to a file;

(3)异常转储文件读取模块读取步骤(2)保存的上下文信息,然后将该上下文信息通过计算引擎接口设置到计算引擎中,即引擎读取的数据为异常时的数据,启动计算引擎。 (3) The exception dump file reading module reads the context information saved in step (2), and then sets the context information to the computing engine through the computing engine interface, that is, the data read by the engine is the data when the exception occurs, and the calculation is started engine.

本发明的有益效果是:本发明的系统可以在计算模型运行发生异常(计算模型启动后长时间不结束)时,记录所有上下文信息,并在恢复时构造异常发生时的上下文信息,用于验证是否修改有效,且可以模拟出异常发生时的情况,利于分析原因,修改计算模型。 The beneficial effects of the present invention are: the system of the present invention can record all context information when the calculation model runs abnormally (the calculation model does not end for a long time after starting), and construct the context information when the abnormality occurs during recovery for verification Whether the modification is effective, and can simulate the situation when the abnormality occurs, which is conducive to analyzing the cause and modifying the calculation model.

(四)        附图说明 (4) Description of drawings

下面结合附图对本发明作进一步的说明。 The present invention will be further described below in conjunction with the accompanying drawings.

图1为本发明异常转储的流程图; Fig. 1 is the flowchart of abnormal dump of the present invention;

图2 为本发明恢复上下文环境的流程图; Fig. 2 is the flow chart of the present invention recovery context environment;

图3为本发明的框架图。 Fig. 3 is a frame diagram of the present invention.

(五)        具体实施方式 (5) Specific implementation methods

附图为本发明的一种具体实施例。该实施例的系统是以计算机模型的计算引擎为驱动,包括: Accompanying drawing is a kind of specific embodiment of the present invention. The system of this embodiment is driven by the calculation engine of the computer model, including:

异常检测模块,用于检测计算模型的运行情况,如果发生异常调用异常转储模块存储上下文信息; Anomaly detection module, used to detect the operation of the calculation model, if an exception occurs, call the exception dump module to store context information;

异常转储模块,用于保存计算模型上下文信息; Exception dump module, used to save computing model context information;

异常转储文件读取模块,读取异常转储文件,恢复发生异常时上下文信息。 The exception dump file reading module reads the exception dump file and restores the context information when an exception occurs.

本发明的二次设备智能状态评估诊断系统,启动计算引擎的同时加载异常检测模块,异常检测模块与定时器连接,通过输入设备调用异常转储文件读取模块在显示器上显示异常时上下文信息,异常转储模块与储存器连接。 In the intelligent state evaluation and diagnosis system of secondary equipment of the present invention, the abnormal detection module is loaded while the computing engine is started, the abnormal detection module is connected with the timer, and the abnormal dump file reading module is called through the input device to display the abnormal context information on the display, The exception dump module is connected with the storage.

本实施例的计算机模型异常转储与恢复系统转储与恢复的方法,包括以下步骤: The computer model abnormality dump of the present embodiment and the method for recovery system dump and recovery include the following steps:

(1)载入计算模型文件,启动计算引擎运行计算模型,同时记录计算模型运行起始时间,开启异常检测模块启动定时器,定时器每隔一定周期利用当前时间减去计算模型运行起始时间得到运行时长,运行时长与用户配置的最大时长进行比较,如果超出,通过异常转储模块,调用计算引擎来保存上下文信息,并通知计算引擎终止计算任务; (1) Load the calculation model file, start the calculation engine to run the calculation model, and record the start time of the calculation model operation, open the exception detection module to start the timer, and the timer uses the current time to subtract the start time of the calculation model operation every certain period Get the running time, compare the running time with the maximum time configured by the user, if it exceeds, call the computing engine to save the context information through the exception dump module, and notify the computing engine to terminate the computing task;

(2)当发生异常后,通过计算引擎接口获取运行时的上下文信息,将上下文信息组织成XML格式,再将上下文信息保存到文件中; (2) When an exception occurs, obtain the runtime context information through the computing engine interface, organize the context information into XML format, and then save the context information to a file;

(3)异常转储文件读取模块读取步骤(2)保存的上下文信息,然后将该上下文信息通过计算引擎接口设置到计算引擎中,即引擎读取的数据为异常时的数据,启动计算引擎。 (3) The exception dump file reading module reads the context information saved in step (2), and then sets the context information to the computing engine through the computing engine interface, that is, the data read by the engine is the data when the exception occurs, and the calculation is started engine.

本实施的系统便于发现异常发生时的情况,利于分析原因,修改计算模型。 The implemented system is convenient for discovering the situation when an abnormality occurs, for analyzing the cause, and for modifying the calculation model.

Claims (3)

1.一种计算机模型异常转储与恢复系统,其特征在于:该系统以计算机模型的计算引擎为驱动,包括: 1. A computer model abnormal dump and recovery system is characterized in that: the system is driven by the computing engine of the computer model, including: 异常检测模块,用于检测计算模型的运行情况,如果发生异常调用异常转储模块存储上下文信息; Anomaly detection module, used to detect the operation of the calculation model, if an exception occurs, call the exception dump module to store context information; 异常转储模块,用于保存计算模型上下文信息; Exception dump module, used to save computing model context information; 异常转储文件读取模块,读取异常转储文件,恢复发生异常时上下文信息。 The exception dump file reading module reads the exception dump file and restores the context information when an exception occurs. 2.根据权利要求1所述的二次设备智能状态评估诊断系统,其特征在于:启动计算引擎的同时加载异常检测模块,异常检测模块与定时器连接,通过输入设备调用异常转储文件读取模块在显示器上显示异常时上下文信息,异常转储模块与储存器连接。 2. The intelligent state evaluation and diagnosis system for secondary equipment according to claim 1, characterized in that: when the calculation engine is started, the abnormality detection module is loaded, the abnormality detection module is connected with the timer, and the abnormal dump file is called by the input device to read The module displays the abnormal context information on the monitor, and the abnormal dump module is connected with the storage. 3.根据权利要求1或2所述的计算机模型异常转储与恢复系统转储与恢复的方法,其特征在于:包括以下步骤: 3. according to claim 1 and 2 described abnormal dumping of computer model and the method for recovery system dumping and recovery, it is characterized in that: comprise the following steps: (1)载入计算模型文件,启动计算引擎运行计算模型,同时记录计算模型运行起始时间,开启异常检测模块启动定时器,定时器每隔一定周期利用当前时间减去计算模型运行起始时间得到运行时长,运行时长与用户配置的最大时长进行比较,如果超出,通过异常转储模块,调用计算引擎来保存上下文信息,并通知计算引擎终止计算任务; (1) Load the calculation model file, start the calculation engine to run the calculation model, and record the start time of the calculation model operation, open the exception detection module to start the timer, and the timer uses the current time to subtract the start time of the calculation model operation every certain period Get the running time, compare the running time with the maximum time configured by the user, if it exceeds, call the computing engine to save the context information through the exception dump module, and notify the computing engine to terminate the computing task; (2)当发生异常后,通过计算引擎接口获取运行时的上下文信息,将上下文信息组织成XML格式,再将上下文信息保存到文件中; (2) When an exception occurs, obtain the runtime context information through the computing engine interface, organize the context information into XML format, and then save the context information to a file; (3)异常转储文件读取模块读取步骤(2)保存的上下文信息,然后将该上下文信息通过计算引擎接口设置到计算引擎中,即引擎读取的数据为异常时的数据,启动计算引擎。 (3) The exception dump file reading module reads the context information saved in step (2), and then sets the context information to the computing engine through the computing engine interface, that is, the data read by the engine is the data when the exception occurs, and the calculation is started engine.
CN201310110711XA 2013-04-01 2013-04-01 Abnormal dump and recovery system for computer model and dump and recovery method thereof Pending CN103150226A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310110711XA CN103150226A (en) 2013-04-01 2013-04-01 Abnormal dump and recovery system for computer model and dump and recovery method thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310110711XA CN103150226A (en) 2013-04-01 2013-04-01 Abnormal dump and recovery system for computer model and dump and recovery method thereof

Publications (1)

Publication Number Publication Date
CN103150226A true CN103150226A (en) 2013-06-12

Family

ID=48548324

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310110711XA Pending CN103150226A (en) 2013-04-01 2013-04-01 Abnormal dump and recovery system for computer model and dump and recovery method thereof

Country Status (1)

Country Link
CN (1) CN103150226A (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106201751A (en) * 2016-06-29 2016-12-07 北京北信源软件股份有限公司 A kind of detection method of Windows operating system collapse
CN106485526A (en) * 2015-08-31 2017-03-08 阿里巴巴集团控股有限公司 A kind of diagnostic method of data mining model and device
CN106598871A (en) * 2016-12-29 2017-04-26 山东鲁能智能技术有限公司 Automatic analysis method and system for collapse file under Linux
CN111125008A (en) * 2019-12-20 2020-05-08 浪潮电子信息产业股份有限公司 Dynamic repairing method and system of abnormal inode and related components
CN112689827A (en) * 2020-10-27 2021-04-20 华为技术有限公司 Model reasoning exception handling method and device
CN113076182A (en) * 2021-03-24 2021-07-06 成都海光集成电路设计有限公司 Computing task recovery method and device, user equipment and storage medium
CN113256909A (en) * 2020-12-31 2021-08-13 深圳怡化电脑股份有限公司 Method, system, depositing and withdrawing device and storage medium for device driver self-recovery
CN114595197A (en) * 2022-02-28 2022-06-07 浙江卡易智慧医疗科技有限公司 A kind of abnormal data management method and device

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1470992A (en) * 2002-07-23 2004-01-28 华为技术有限公司 A Locating Method for Using Shared Memory Objects to Record Abnormal Exits of Common Processes
CN1755648A (en) * 2004-09-30 2006-04-05 微软公司 Method, system, and apparatus for identifying unresponsive portions of a computer program
US20060168439A1 (en) * 2005-01-26 2006-07-27 Fujitsu Limited Memory dump program boot method and mechanism, and computer-readable storage medium

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1470992A (en) * 2002-07-23 2004-01-28 华为技术有限公司 A Locating Method for Using Shared Memory Objects to Record Abnormal Exits of Common Processes
CN1755648A (en) * 2004-09-30 2006-04-05 微软公司 Method, system, and apparatus for identifying unresponsive portions of a computer program
US20060168439A1 (en) * 2005-01-26 2006-07-27 Fujitsu Limited Memory dump program boot method and mechanism, and computer-readable storage medium

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106485526A (en) * 2015-08-31 2017-03-08 阿里巴巴集团控股有限公司 A kind of diagnostic method of data mining model and device
CN106201751B (en) * 2016-06-29 2019-02-19 北京北信源软件股份有限公司 A kind of detection method of Windows operating system collapse
CN106201751A (en) * 2016-06-29 2016-12-07 北京北信源软件股份有限公司 A kind of detection method of Windows operating system collapse
CN106598871A (en) * 2016-12-29 2017-04-26 山东鲁能智能技术有限公司 Automatic analysis method and system for collapse file under Linux
US11650875B2 (en) 2019-12-20 2023-05-16 Inspur Electronic Information Industry Co., Ltd. Abnormal inode dynamic repair method and system, and related component
CN111125008A (en) * 2019-12-20 2020-05-08 浪潮电子信息产业股份有限公司 Dynamic repairing method and system of abnormal inode and related components
CN111125008B (en) * 2019-12-20 2022-04-22 浪潮电子信息产业股份有限公司 Dynamic repairing method and system of abnormal inode and related components
CN112689827A (en) * 2020-10-27 2021-04-20 华为技术有限公司 Model reasoning exception handling method and device
CN113256909A (en) * 2020-12-31 2021-08-13 深圳怡化电脑股份有限公司 Method, system, depositing and withdrawing device and storage medium for device driver self-recovery
CN113076182A (en) * 2021-03-24 2021-07-06 成都海光集成电路设计有限公司 Computing task recovery method and device, user equipment and storage medium
CN113076182B (en) * 2021-03-24 2024-03-29 成都海光集成电路设计有限公司 Recovery method and device of computing task, user equipment and storage medium
CN114595197A (en) * 2022-02-28 2022-06-07 浙江卡易智慧医疗科技有限公司 A kind of abnormal data management method and device
CN114595197B (en) * 2022-02-28 2025-04-01 浙江卡易智慧医疗科技有限公司 Abnormal data management method and device

Similar Documents

Publication Publication Date Title
CN103150226A (en) Abnormal dump and recovery system for computer model and dump and recovery method thereof
CN104518924B (en) Automatic testing and result comparison method and system
CN103176876B (en) A kind of computer On-line self-diagnosis method of highly effective and safe and self-checking unit
RU2014148734A (en) SYSTEM AND METHOD FOR INITIATING THE CHECK OF THE FLOW METER BY THE COMPUTER OF THE FLOW
CN104317712A (en) Linux-based storage server fatigue test method
CN102244591A (en) Client server and method for full process monitoring on function text of client server
CN106326067A (en) Method and device for monitoring CPU (central processing unit) performance under pressure test
CN107590042A (en) A kind of server method for testing open/close machine and system based on linux system
CN105824388A (en) Power-on/off detection method, device and system
CN102810005A (en) Power state detection system and method
CN108364670B (en) Read-write pressure testing method, device, equipment and storage medium
CN110764960B (en) Solid state disk firmware testing method
CN105302687A (en) Linux-based hard disk sequence detection method
CN105138447A (en) Method for monitoring hard disk and memory in Reboot test process
CN104598283A (en) Method for realizing single-architecture multi-structure BMC firmware program
CN115388991A (en) A liquid level sensor float fault diagnosis method, device and storage medium
CN107133134A (en) A kind of efficient RAID card Auto-Test System and method
JP2008176353A (en) Information processor and log acquisition method, and program
CN102681928A (en) Abnormal information output system of computer system
CN103180745A (en) A method and device for measuring and recording electrical parameters
CN118837002A (en) Method, device, equipment and medium for detecting low-temperature fault of temperature sensor
CN108965053A (en) Upper computer software method for testing pressure, apparatus and system
CN108880881A (en) The method and apparatus of monitoring resource under a kind of cloud environment
CN101145992A (en) Single board task anomaly detection method and system
CN115754511A (en) Burn-in test method, device, equipment and storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
ASS Succession or assignment of patent right

Owner name: ELECTRIC POWER RESEARCH INSTITUTE, STATE GRID SHAN

Effective date: 20141115

C41 Transfer of patent application or patent right or utility model
TA01 Transfer of patent application right

Effective date of registration: 20141115

Address after: Xinluo Avenue high tech Zone of Ji'nan City, Shandong province 250101 silver bearing No. 2008 building B block 5 layer

Applicant after: Shandong Luneng Software Technology Co., Ltd.

Applicant after: Electric Power Research Institute of State Grid Shandong Electric Power Company

Address before: Xinluo Avenue high tech Zone of Ji'nan City, Shandong province 250101 silver bearing No. 2008 building B block 5 layer

Applicant before: Shandong Luneng Software Technology Co., Ltd.

C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20130612