CN106502855A - A kind of test the method for positioning PMC RAID card memory grain problems - Google Patents
A kind of test the method for positioning PMC RAID card memory grain problems Download PDFInfo
- Publication number
- CN106502855A CN106502855A CN201610925560.7A CN201610925560A CN106502855A CN 106502855 A CN106502855 A CN 106502855A CN 201610925560 A CN201610925560 A CN 201610925560A CN 106502855 A CN106502855 A CN 106502855A
- Authority
- CN
- China
- Prior art keywords
- test
- raid card
- pmc
- positioning
- pattern
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000012360 testing method Methods 0.000 title claims abstract description 73
- 238000000034 method Methods 0.000 title claims abstract description 16
- 239000002245 particle Substances 0.000 claims description 19
- 230000001133 acceleration Effects 0.000 abstract 1
- 230000006870 function Effects 0.000 description 4
- 230000005540 biological transmission Effects 0.000 description 1
- 239000008187 granular material Substances 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 238000009662 stress testing Methods 0.000 description 1
- 238000010998 test method Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/22—Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
- G06F11/26—Functional testing
- G06F11/267—Reconfiguring circuits for testing, e.g. LSSD, partitioning
Landscapes
- Engineering & Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Computer Hardware Design (AREA)
- Quality & Reliability (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Techniques For Improving Reliability Of Storages (AREA)
Abstract
Description
技术领域technical field
本发明涉及计算机技术领域,具体地说是一种测试并定位PMC-RAID卡内存颗粒问题的方法。The invention relates to the technical field of computers, in particular to a method for testing and locating PMC-RAID card memory particle problems.
背景技术Background technique
RAID(Redundant Array of Independent Disk,独立冗余磁盘阵列)卡 作为服务器内的关键部件,可以有效地保护用户的数据安全。但是如果 RAID卡本身不能够稳定工作,则数据安全也就无从谈起了。目前业界对于 RAID卡的测试偏重性能较多,对于稳定性的测试也大多局限于长时间的压 力测试。RAID (Redundant Array of Independent Disk, independent redundant disk array) card, as a key component in the server, can effectively protect the user's data security. However, if the RAID card itself cannot work stably, data security will be out of the question. At present, the testing of RAID cards in the industry puts more emphasis on performance, and the testing of stability is mostly limited to long-term stress testing.
RAID卡就是用来实现RAID功能的板卡,通常是由I/O处理器、硬盘控制器、硬盘连接器和缓存等一系列零组件构成的。不同的RAID卡支持的RAID功能不同。支持RAlD0、RAID1、RAID3、RAID4、RAID5、RAID10不等。RAID卡可以让很多磁盘驱动器同时传输数据,而这些磁盘驱动器在逻辑上又是一个磁盘驱动器,所以使用RAID可以达到单个的磁盘驱动器几倍、几十倍甚至上百倍的速率。这也是RAID卡最初想要解决的问题。可以提供容错功能,这是RAID卡的第二个重要功能。A RAID card is a board used to implement the RAID function, and is usually composed of a series of components such as an I/O processor, a hard disk controller, a hard disk connector, and a cache. Different RAID cards support different RAID functions. Support RAlD0, RAID1, RAID3, RAID4, RAID5, RAID10 etc. The RAID card can allow many disk drives to transmit data at the same time, and these disk drives are logically a disk drive, so the use of RAID can achieve several times, dozens of times or even hundreds of times the speed of a single disk drive. This is also the problem that the RAID card originally wanted to solve. It can provide fault tolerance function, which is the second important function of RAID card.
多数RAID卡都配备了一定数量的内存颗粒作为高速缓存(Cache)使用。高速缓存是RAID卡与外部总线交换数据的场所,RAID卡先将数据传送到缓存,再由缓存和外边数据总线交换数据。缓存具有极快的存取速度,实际上就是相对低速的硬盘盘片与相对高速的外部设备(例如内存)之间的缓冲器。缓存的大小与速度是直接关系到RAID卡的实际传输速度的重要因素,大缓存能够大幅度地提高数据命中率从而提高RAID卡整体性能。Most RAID cards are equipped with a certain number of memory granules to be used as high-speed cache (Cache). The cache is the place where the RAID card exchanges data with the external bus. The RAID card first transfers the data to the cache, and then exchanges data between the cache and the external data bus. The cache has an extremely fast access speed, and is actually a buffer between a relatively low-speed hard disk platter and a relatively high-speed external device (such as memory). The size and speed of the cache are important factors directly related to the actual transmission speed of the RAID card. A large cache can greatly improve the data hit rate and thus improve the overall performance of the RAID card.
发明内容Contents of the invention
本发明的技术任务是提供一种测试并定位PMC-RAID卡内存颗粒问题的方法。The technical task of the invention is to provide a method for testing and locating the problem of PMC-RAID card memory particles.
本发明的技术任务是按以下方式实现的,一种测试并定位PMC-RAID卡内存颗粒问题的方法,具体方法如下:Technical task of the present invention is realized in the following manner, a kind of method of testing and positioning PMC-RAID card internal memory particle problem, concrete method is as follows:
S1、将测试工具用烧录器烧录到待测的PMC-RAID卡上;S1, burn the test tool to the PMC-RAID card to be tested with a burner;
S2、将待测的RAID卡安装到机器上,并连接串口工具到工作电脑上;S2, install the RAID card to be tested on the machine, and connect the serial port tool to the working computer;
S3、在工作电脑上使用测试脚本开始测试;S3, use the test script on the work computer to start the test;
S4、多次测试,并收集测试结果,确保问题可以复现;S4. Test multiple times and collect test results to ensure that the problem can be reproduced;
S5、查看测试结果,定位内存颗粒的问题点。S5. Check the test results and locate the problem points of the memory particles.
优选的,所述的S2的具体步骤为,串口工具一端连接待测的RAID卡,另一端连接到工作电脑上,确认com口,使用DDR测试工具根据com口编号打开。Preferably, the specific steps of said S2 are: one end of the serial port tool is connected to the RAID card to be tested, the other end is connected to the working computer, the com port is confirmed, and the DDR test tool is used to open it according to the com port number.
优选的,所述的S3一共包括八个测试项,测试定位内存颗粒可能出现的问题,测试结果输出到制定文件中。Preferably, the S3 includes a total of eight test items to test and locate possible problems of the memory particles, and the test results are output to the formulated file.
优选的,所述的八个测试问题,具体如下: Runs level, margin, mem DMA, memDMC tests,Run pattern walk1, pattern walk0, pattern blink1, pattern blink0tests。Preferably, the eight test questions are specifically as follows: Runs level, margin, mem DMA, memDMC tests, Run pattern walk1, pattern walk0, pattern blink1, pattern blink0tests.
本发明的一种测试并定位PMC-RAID卡内存颗粒问题的方法和现有技术相比,测试项目包括八项,测试结果全面而且准确,可以快速定位问题点;本方法可以用于测试和定位PMC-RAID卡内存颗粒问题,用于解决由于DDR报错而导致的RAID卡卡死以及EEC报错等问题,加速问题处理和解决的速度,提高公司的服务质量。Compared with the prior art, a method for testing and locating PMC-RAID card memory particle problems of the present invention, the test items include eight items, the test results are comprehensive and accurate, and the problem points can be quickly located; the method can be used for testing and locating The problem of PMC-RAID card memory particles is used to solve the problems of RAID card stuck and EEC error caused by DDR error reporting, speed up the speed of problem processing and resolution, and improve the company's service quality.
附图说明Description of drawings
附图1为一种测试并定位PMC-RAID卡内存颗粒问题的方法的流程示意图。Accompanying drawing 1 is a schematic flow chart of a method for testing and locating memory particle problems of a PMC-RAID card.
具体实施方式detailed description
首先,需要将测试工具用烧录器烧录到待测的PMC-RAID卡上,此工具是根据PMC现有的RAID卡的内存颗粒进行编码使用的,只能针对PMC的RAID卡使用。First of all, it is necessary to burn the test tool to the PMC-RAID card to be tested with a burner. This tool is coded according to the memory particles of the existing RAID card of PMC and can only be used for the RAID card of PMC.
其次,将待测RAID卡安装到机器上,并连接上串口工具,用于监控RAID卡信息和操作内存颗粒测试。将串口工具的另一端连接到工作电脑上,确认com口,使用DDR测试工具根据com口编号打开。Secondly, install the RAID card to be tested on the machine and connect it to a serial port tool to monitor the RAID card information and operate the memory particle test. Connect the other end of the serial port tool to the working computer, confirm the com port, and use the DDR test tool to open it according to the com port number.
最后,使用测试脚本开始测试,一共包括8个测试项,分别测试定位内存颗粒可能出现的问题,同时会将测试结果输出到指定的文件中,方便观察。由于内存颗粒问题有的比较难复现,一般会重复进行多次测试,全部PASS即为没有问题。如果有Failed的就可以定位到问题出现点了。Finally, use the test script to start the test, including a total of 8 test items to test the possible problems of locating memory particles, and output the test results to the specified file for easy observation. Because some memory particle problems are more difficult to reproduce, generally the test will be repeated several times, and all PASS means there is no problem. If there is Failed, you can locate the point where the problem occurred.
内存颗粒测试主要包括如下的8项测试:The memory particle test mainly includes the following 8 tests:
- Runs level, margin, mem DMA, mem DMC tests- Runs level, margin, mem DMA, mem DMC tests
- Run pattern walk1, pattern walk0, pattern blink1, pattern blink0 tests- Run pattern walk1, pattern walk0, pattern blink1, pattern blink0 tests
实施例1:Example 1:
下面是我们摘取的一次测试中的三项测试的测试结果,其中mem DMA和mem DMC测试是failed的,而Walk1 pattern测试是passed了,我们会根据测试结果重点去分析mem DMA和mem DMC这两部分内容。The following is the test results of three tests in a test we extracted. Among them, the mem DMA and mem DMC tests are failed, while the Walk1 pattern test is passed. We will focus on analyzing the mem DMA and mem DMC based on the test results. Two parts.
Running mem DMA test...Running mem DMA test...
Writing at 0x0 Writing at 0x0
memcmp failed rv = 0x0memcmp failed rv = 0x0
Errors: Addr = 0xbe04ea04 0x9a7a448 != 0x9a708a6Errors: Addr = 0xbe04ea04 0x9a7a448 != 0x9a708a6
Addr = 0xbe04ea0c 0x269e912 != 0x269c26aAddr = 0xbe04ea0c 0x269e912 != 0x269c26a
Addr = 0xbe04ea14 0xd09a7a45 != 0xd09a309aAddr = 0xbe04ea14 0xd09a7a45 != 0xd09a309a
Addr = 0xbe04ea1c 0x8c269e90 != 0x8c268c27Addr = 0xbe04ea1c 0x8c269e90 != 0x8c268c27
Addr = 0xbe04ea44 0xc1a309a6 != 0xc1a30000Addr = 0xbe04ea44 0xc1a309a6 != 0xc1a30000
Addr = 0xbe04ea4c 0xe068c268 != 0xe0680000Addr = 0xbe04ea4c 0xe068c268 != 0xe0680000
Addr = 0xbe04ea54 0x381a309a != 0x381a0000Addr = 0xbe04ea54 0x381a309a != 0x381a0000
Addr = 0xbe04ea5c 0xde068c27 != 0xde068000Addr = 0xbe04ea5c 0xde068c27 != 0xde068000
Test FAILED!!Test FAILED!!
Running mem DMC test... Running mem DMC test...
If the following memory init does not complete within 2 secondsIf the following memory init does not complete within 2 seconds
the SPD on the cache module may be incorrect.The SPD on the cache module may be incorrect.
Initializing DDR for ECC (size=2048 MiB)Initializing DDR for ECC (size=2048 MiB)
........ DDR fill done........ DDR fill done.
Writing at 0x0 Writing at 0x0
Test FAILED!!Test FAILED!!
Running Walk1 pattern test... Running Walk1 pattern test...
Writing to DDR of size 0x80000000... Writing to DDR of size 0x80000000...
at 0x0 at 0x0
at 0x10000000 at 0x10000000
at 0x20000000 at 0x20000000
at 0x30000000 at 0x30000000
at 0x40000000 at 0x40000000
at 0x50000000 at 0x50000000
at 0x60000000 at 0x60000000
at 0x70000000 at 0x70000000
Done Done
Reading from DDR and comparing... Reading from DDR and comparing...
at 0x0 at 0x0
at 0x10000000 at 0x10000000
at 0x20000000 at 0x20000000
at 0x30000000 at 0x30000000
at 0x40000000 at 0x40000000
at 0x50000000 at 0x50000000
at 0x60000000 at 0x60000000
at 0x70000000 at 0x70000000
Done Done
Test PASSED Test Passed
通过上面具体实施方式,所述技术领域的技术人员可容易的实现本发明。但是应当理解,本发明并不限于上述的几种具体实施方式。在公开的实施方式的基础上,所述技术领域的技术人员可任意组合不同的技术特征,从而实现不同的技术方案。Through the above specific implementation manners, those skilled in the technical field can easily realize the present invention. However, it should be understood that the present invention is not limited to the above-mentioned several specific implementation manners. On the basis of the disclosed embodiments, those skilled in the art can arbitrarily combine different technical features to achieve different technical solutions.
Claims (4)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610925560.7A CN106502855A (en) | 2016-10-24 | 2016-10-24 | A kind of test the method for positioning PMC RAID card memory grain problems |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610925560.7A CN106502855A (en) | 2016-10-24 | 2016-10-24 | A kind of test the method for positioning PMC RAID card memory grain problems |
Publications (1)
Publication Number | Publication Date |
---|---|
CN106502855A true CN106502855A (en) | 2017-03-15 |
Family
ID=58319464
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610925560.7A Pending CN106502855A (en) | 2016-10-24 | 2016-10-24 | A kind of test the method for positioning PMC RAID card memory grain problems |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106502855A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107133123A (en) * | 2017-04-28 | 2017-09-05 | 郑州云海信息技术有限公司 | A kind of method of the wrong test of note on PMC RAID card parity errors |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5615335A (en) * | 1994-11-10 | 1997-03-25 | Emc Corporation | Storage system self-test apparatus and method |
CN105223889A (en) * | 2015-10-13 | 2016-01-06 | 浪潮电子信息产业股份有限公司 | Method for automatically monitoring PMC RAID card log suitable for production line |
CN105354123A (en) * | 2015-12-08 | 2016-02-24 | 浪潮电子信息产业股份有限公司 | RAID card and hard disk pressure test method suitable for customer site |
CN105551526A (en) * | 2015-12-08 | 2016-05-04 | 浪潮电子信息产业股份有限公司 | Method for detecting yield of RAID card memory particles by sending specific test data |
CN205539367U (en) * | 2016-04-05 | 2016-08-31 | 广州红象医疗科技有限公司 | PCBA function automatic test system |
CN106021081A (en) * | 2016-05-12 | 2016-10-12 | 浪潮电子信息产业股份有限公司 | Device, server and method for testing performance of RAID card |
-
2016
- 2016-10-24 CN CN201610925560.7A patent/CN106502855A/en active Pending
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5615335A (en) * | 1994-11-10 | 1997-03-25 | Emc Corporation | Storage system self-test apparatus and method |
CN105223889A (en) * | 2015-10-13 | 2016-01-06 | 浪潮电子信息产业股份有限公司 | Method for automatically monitoring PMC RAID card log suitable for production line |
CN105354123A (en) * | 2015-12-08 | 2016-02-24 | 浪潮电子信息产业股份有限公司 | RAID card and hard disk pressure test method suitable for customer site |
CN105551526A (en) * | 2015-12-08 | 2016-05-04 | 浪潮电子信息产业股份有限公司 | Method for detecting yield of RAID card memory particles by sending specific test data |
CN205539367U (en) * | 2016-04-05 | 2016-08-31 | 广州红象医疗科技有限公司 | PCBA function automatic test system |
CN106021081A (en) * | 2016-05-12 | 2016-10-12 | 浪潮电子信息产业股份有限公司 | Device, server and method for testing performance of RAID card |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107133123A (en) * | 2017-04-28 | 2017-09-05 | 郑州云海信息技术有限公司 | A kind of method of the wrong test of note on PMC RAID card parity errors |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR102773346B1 (en) | Traffic capture and debugging tools for identifying root causes of device failure during automated testing | |
US8291379B2 (en) | Runtime analysis of a computer program to identify improper memory accesses that cause further problems | |
US8880779B2 (en) | Debugging a memory subsystem | |
CN104484274B (en) | A kind of internal memory taking turn audit function method of testing based on ITP instruments | |
CN103473158A (en) | Disk pressure testing method for Linux server | |
CN102521092A (en) | Hard disk test method and device | |
US20090307384A1 (en) | Usb port testing apparatus and method | |
JP4343923B2 (en) | DMA circuit and data transfer method | |
US5878050A (en) | Method and apparatus for data compare detection of memory errors on a computers memory subsystem | |
US20130036255A1 (en) | Testing memory subsystem connectivity | |
CN107832179A (en) | A kind of PCIe Error Enabling method of testings | |
US20130111268A1 (en) | Testing device capable of simulating plugging and unplugging operations and method thereof | |
CN104133751A (en) | Chip debugging method and chip | |
TWI742461B (en) | System for detecting installation state of hard disk | |
CN112579382A (en) | NVMe solid state disk bad block analysis method, device, terminal and storage medium | |
CN102055780A (en) | Disk array test system and method | |
CN100517255C (en) | Serial high-level hard disk architecture interface function test device for motherboard | |
CN114780317A (en) | Hard disk plug-in and pull-out test method, system and related components | |
CN1230885C (en) | Fault-telerance method and system of testing chip for boundary scanning | |
CN106502855A (en) | A kind of test the method for positioning PMC RAID card memory grain problems | |
CN106648969B (en) | A method and system for inspecting damaged data in a disk | |
US10970442B1 (en) | Method of debugging hardware and firmware of data storage | |
CN108052441A (en) | A kind of test method, system, device and the storage medium of hard disk performance level | |
WO2025081833A1 (en) | Data transmission method and apparatus, electronic device, and readable storage medium | |
CN108038031A (en) | A kind of method and device for monitoring hard disk presence in real time |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20170315 |
|
WD01 | Invention patent application deemed withdrawn after publication |