JP2007147459A

JP2007147459A - Information processing apparatus, program, and computer-readable recording medium

Info

Publication number: JP2007147459A
Application number: JP2005342680A
Authority: JP
Inventors: Shigehiko Kanetani; 重彦金谷; Yukiko Nakamura; 由紀子中村; Hideyuki Suzuki; 秀幸鈴木; Nozomi Sakurai; 望櫻井; Daisuke Shibata; 大輔柴田; Masahiko Kitayama; 雅彦北山
Original assignee: SOSHI EDUCATIONAL FOUNDATION; Nara Institute of Science and Technology NUC; Kazusa DNA Research Institute Foundation
Current assignee: SOSHI EDUCATIONAL FOUNDATION; Nara Institute of Science and Technology NUC; Kazusa DNA Research Institute Foundation
Priority date: 2005-11-28
Filing date: 2005-11-28
Publication date: 2007-06-14

Abstract

【課題】複数の分析実験から得られた２次元ピークデータ間で、同一の化合物由来のマススペクトル同士を自動的かつ正確に対応付ける。
【解決手段】スペクトル解析装置１は、それぞれの２次元ピークデータからマススペクトル情報としてピークベクトルを作成する第１ピークベクトル作成部２１と、ピークベクトル同士を比較し、それぞれの２次元ピークデータにおいて内部標準となるピークベクトルのセットを作成する内部標準同定部２２と、内部標準となるピークベクトルに対応する保持時間に基づいて、それぞれの２次元ピークデータにおける保持時間を補正する保持時間補正部２３と、保持時間が補正された２次元ピーク情報から新たなピークベクトルを作成する第２マススペクトル抽出部と、保持時間が補正された２次元ピーク情報間で、保持時間の違いが所定の範囲内の新たなピークベクトル同士を比較する対応判定部２６とを備えている。
【選択図】図１To automatically and accurately associate mass spectra derived from the same compound between two-dimensional peak data obtained from a plurality of analytical experiments.
A spectrum analyzing apparatus compares a peak vector with a first peak vector creating unit that creates a peak vector as mass spectrum information from each two-dimensional peak data, and compares the peak vectors with each other. An internal standard identifying unit 22 for creating a set of standard peak vectors, a holding time correcting unit 23 for correcting the holding time in each two-dimensional peak data based on the holding time corresponding to the peak vector serving as the internal standard, The difference in holding time is within a predetermined range between the second mass spectrum extraction unit that creates a new peak vector from the two-dimensional peak information with the holding time corrected and the two-dimensional peak information with the holding time corrected. And a correspondence determination unit 26 for comparing new peak vectors.
[Selection] Figure 1

Description

本発明は、クロマトグラフィーと質量分析との組み合わせに基づいて得られるデータを解析するための情報処理装置、この情報処理装置を制御するプログラム、及びこのプログラムを記録したコンピュータ読み取り可能な記録媒体に関するものである。 The present invention relates to an information processing apparatus for analyzing data obtained based on a combination of chromatography and mass spectrometry, a program for controlling the information processing apparatus, and a computer-readable recording medium on which the program is recorded. It is.

ガスクロマトグラフィー質量分析（ＧＣ／ＭＳ）装置、液体クロマトグラフィー質量分析（ＬＣ／ＭＳ）装置は、クロマトグラフィーと質量分析とを組み合わせることによって、細胞、組織、又は器官全体に含まれる膨大な種類の生体成分を分離・同定することが可能である。これらの装置では、まず、細胞、組織、又は器官全体から抽出した生体サンプル中に含まれる生体成分を、ガスクロマトグラフィー（ＧＣ）又は液体クロマトグラフィー（ＬＣ）によって単一の化合物に分離する。その後、分離した単一の化合物を一定のエネルギーを有する電子線により開裂させ、フラグメントイオンを得る。そして、フラグメントイオンのイオン強度を質量数と電荷の比（ｍ／ｚ）ごとに検出することにより、ある化合物の開裂パターンを示すマススペクトルを得ることができる。 Gas chromatography / mass spectrometry (GC / MS) and liquid chromatography / mass spectrometry (LC / MS) devices combine a large number of types of cells, tissues, or organs by combining chromatography and mass spectrometry. Biological components can be separated and identified. In these devices, first, biological components contained in a biological sample extracted from cells, tissues, or whole organs are separated into a single compound by gas chromatography (GC) or liquid chromatography (LC). Thereafter, the separated single compound is cleaved with an electron beam having a certain energy to obtain fragment ions. And the mass spectrum which shows the cleavage pattern of a certain compound can be obtained by detecting the ionic strength of a fragment ion for every mass ratio and charge ratio (m / z).

上記の装置ではこれらの処理を連続的に行うため、得られるデータは、図２に示すように、保持時間及びｍ／ｚについてのイオン強度スペクトル、すなわち２次元スペクトルデータになる。この２次元スペクトルデータを保持時間軸と直交する断面で切り出すと、ｍ／ｚ軸に沿ったイオン強度の情報、すなわちマススペクトルを得ることができる。一方、２次元スペクトルデータをｍ／ｚ軸と直交する断面で切り出すと、保持時間軸に沿ったイオン強度の情報、すなわちマスクロマトグラムを得ることができる。 Since the above apparatus performs these processes continuously, as shown in FIG. 2, the obtained data is an ion intensity spectrum with respect to retention time and m / z, that is, two-dimensional spectrum data. When this two-dimensional spectrum data is cut out in a cross section orthogonal to the retention time axis, information on the ion intensity along the m / z axis, that is, a mass spectrum can be obtained. On the other hand, when the two-dimensional spectrum data is cut out in a cross section orthogonal to the m / z axis, information on ion intensity along the retention time axis, that is, a mass chromatogram can be obtained.

ここで、保持時間の分解能を高くすると、マスクロマトグラムでは、図２に示すように、単一のフラグメントイオン又は擬分子イオンのイオン強度が複数の保持時間にまたがった状態になる。従って、図３に示すように、複数の保持時間にまたがったイオン強度を一点に纏めることによって、図２に示す２次元スペクトルデータを、図４上段に示す２次元ピークデータへと変換する必要がある。２次元スペクトルデータから２次元ピークデータへ変換する従来の方法について説明すると次の通りである。 Here, when the resolution of the retention time is increased, in the mass chromatogram, as shown in FIG. 2, the ion intensity of a single fragment ion or quasi-molecular ion is in a state extending over a plurality of retention times. Therefore, as shown in FIG. 3, it is necessary to convert the two-dimensional spectrum data shown in FIG. 2 into the two-dimensional peak data shown in the upper part of FIG. 4 by combining the ion intensities over a plurality of holding times into one point. is there. A conventional method for converting two-dimensional spectrum data into two-dimensional peak data will be described as follows.

従来の方法では、まず、図２に示す２次元スペクトルデータから、あるｍ／ｚについて、図３上段に示すマスクロマトグラムデータを抽出する。そして、マスクロマトグラムにおいてイオン強度が極大となる保持時間を同定し、同定した保持時間がスペクトルのピーク位置であると認定する。 In the conventional method, first, the mass chromatogram data shown in the upper part of FIG. 3 is extracted for a certain m / z from the two-dimensional spectrum data shown in FIG. Then, the retention time at which the ionic strength is maximized in the mass chromatogram is identified, and the identified retention time is recognized as the peak position of the spectrum.

続いて、ピーク位置の近傍に分散するイオン強度の総和をとり、この総和をピーク位置に対応付ける。これにより、図３下段に示すピークデータになる。この処理を全てのｍ／ｚについて行うことにより、図４上段に示す２次元ピークデータが得られるのである。 Subsequently, the sum of the ion intensities dispersed in the vicinity of the peak position is taken, and this sum is associated with the peak position. Thereby, the peak data shown in the lower part of FIG. 3 is obtained. By performing this process for all m / z, the two-dimensional peak data shown in the upper part of FIG. 4 is obtained.

この２次元ピークデータから保持時間軸に垂直な断面を切り出せば、図４下段に示すマススペクトルが得られる。ここで、それぞれのマススペクトルには、１つの化合物の開裂パターンが含まれることになる。従って、マススペクトルの開裂パターンを基に、生体サンプルに含まれる化合物を同定することができる。 If a cross section perpendicular to the holding time axis is cut out from the two-dimensional peak data, the mass spectrum shown in the lower part of FIG. 4 can be obtained. Here, each mass spectrum includes a cleavage pattern of one compound. Therefore, the compound contained in the biological sample can be identified based on the cleavage pattern of the mass spectrum.

近年では、上記の装置によって、様々な生体サンプルに含まれる様々な化合物を網羅的に同定することが行われている。
質量分析法、「有機化合物のスペクトルによる同定法−ＭＳ，ＩＲ，ＮＭＲ，ＵＶの併用−第４版」、ＳｉｌｖｅｒｓｔｅｉｎＲＭ，ＢａｓｓｌｅｒＧＣ，ＭｏｒｒｉｌｌＴＣ著、荒木峻・益子洋一郎・山本修訳、第２章、質量分析法、ｐｐ．３−４０、東京化学同人、東京（１９８３）大倉洋甫、甲斐雅亮、ＩＶ−５質量分析法（マススペクトロメトリー）、「分析化学ＩＩ改訂第４版」、大倉洋甫、田中善正、山口政俊編集、ｐｐ．２２８−２５７、南江堂、東京（１９９７） In recent years, exhaustive identification of various compounds contained in various biological samples has been performed using the above-described apparatus.
Mass spectrometry, “Identification method by spectrum of organic compounds-MS, IR, NMR, UV combined use-4th edition”, Silverstein RM, Bassler GC, by Morrill TC, Shu Araki, Yoichiro Masuko, Osamu Yamamoto, 2nd Chapter, Mass Spectrometry, pp. 3-40, Tokyo Chemical Doujin, Tokyo (1983) Yosuke Okura, Masaaki Kai, IV-5 Mass Spectrometry (Mass Spectrometry), “Analytical Chemistry II Rev. 4th Edition”, Yosuke Okura, Yoshimasa Tanaka, Masatoshi Yamaguchi, pp. 228-257, Nanedo, Tokyo (1997)

しかしながら、異なる測定実験から得られた２次元ピークデータ間で、同一の化合物に由来すると考えられるマススペクトル同士を情報処理装置などによって自動的に対応付ける方法は未だ確立されていない。 However, a method for automatically associating mass spectra considered to be derived from the same compound between two-dimensional peak data obtained from different measurement experiments by an information processing device or the like has not yet been established.

通常、異なる生体サンプルから得られた２次元ピークデータ間で同一の化合物由来のマススペクトルを見出す場合、研究者が２次元ピークデータに含まれる膨大なマススペクトルの中から経験に基づいた勘によってマススペクトルを１つずつ選出し、それぞれのマススペクトル同士を見比べ、対応するか否かを主観的に判定することが一般的に行われている。しかしながら、上述したように、様々な生体サンプルに含まれる様々な化合物を網羅的に同定する場合、研究者が個々のマススペクトル同士を比較検討する従来の方法では、ハイスループットで出力される多量の２次元ピークデータに対応するのは困難である。従って、研究者の勘に頼らず、マススペクトル同士の対応付けを自動的に行うことのできる情報処理装置が求められている。 In general, when finding mass spectra derived from the same compound between two-dimensional peak data obtained from different biological samples, researchers use the intuition based on experience from the huge mass spectra included in the two-dimensional peak data. It is generally performed to select one spectrum at a time, compare each mass spectrum and subjectively determine whether or not they correspond. However, as described above, when comprehensively identifying various compounds contained in various biological samples, a conventional method in which researchers compare individual mass spectra with each other has a large amount of output at high throughput. It is difficult to deal with two-dimensional peak data. Therefore, there is a need for an information processing apparatus that can automatically associate mass spectra without depending on the intuition of a researcher.

ここで、対応付けを自動化する際には、保持時間の問題が鍵となる。例えば、ＧＣ／ＭＳやＬＣ／ＭＳなどによって同一の生体サンプルを同一の分析条件で複数回分析する場合、理論的には、同一の化合物は常に同一の保持時間の画分に含まれると考えられる。ところが、実際には、分析条件の同一性を完全に維持することは不可能であるため、分析実験ごとに保持時間のずれが生じてしまうのは避けられない。 Here, when automating the association, the problem of retention time is the key. For example, when the same biological sample is analyzed multiple times under the same analysis conditions by GC / MS, LC / MS, etc., it is theoretically considered that the same compound is always included in the fraction having the same retention time. . However, in reality, it is impossible to completely maintain the same analysis conditions, and it is inevitable that the holding time will be different for each analysis experiment.

このずれの問題は、生体サンプルの違いなど、分析条件が変化する場合ではさらに顕著になる。従って、この実験ごとにずれている保持時間を補正しなければ、マススペクトルの対応付けを正確に行うことができない。 This problem of deviation becomes even more pronounced when the analysis conditions change, such as differences in biological samples. Therefore, the mass spectrum cannot be accurately associated unless the holding time that is shifted for each experiment is corrected.

さらに、それ以前の問題として、従来の技術は、ピークの位置を正確に同定できない場合があるという問題を有している。 Furthermore, as a problem before that, the conventional technique has a problem that the position of the peak may not be accurately identified.

図５は、ある単一の化合物由来の異なるフラグメントイオンのイオン強度の例を示した図である。図５の上段は、ｍ／ｚがｉとなるフラグメントイオンのイオン強度を示したマスクロマトグラムであり、図５の下段は、ｍ／ｚがｊとなるフラグメントイオンのイオン強度を示したマスクロマトグラムである。これら２つのフラグメントイオンは同一の化合物に由来しており、それぞれのピークは、図５左側に示すように、ともに保持時間ｔ＋２とｔ＋３との間に存在する。そして、これらのフラグメントイオンのイオン強度をサンプリングすると、図５中央に示すようになる。 FIG. 5 is a diagram showing an example of ionic strength of different fragment ions derived from a single compound. The upper part of FIG. 5 is a mass chromatogram showing the ion intensity of a fragment ion with m / z being i, and the lower part of FIG. 5 is a mass chromatogram showing the ion intensity of a fragment ion with m / z being j. Gram. These two fragment ions are derived from the same compound, and each peak exists between retention times t + 2 and t + 3 as shown on the left side of FIG. And when the ion intensity of these fragment ions is sampled, it becomes as shown in the center of FIG.

このとき、これら２つのフラグメントイオンのピークの位置（保持時間）を従来の手法に従って同定すると、従来の手法ではサンプリングされたイオン強度が最大となる位置をピークの位置として同定するため、図５右側に示すように、一方のフラグメントイオンのピークの位置はｔ＋３となり、もう一方のフラグメントイオンのピークの位置はｔ＋２となってしまう。このように、従来の手法では、本来同一の保持時間に対応付けられるべき２つのピークが別々の保持時間に対応付けられてしまい、その結果、これらのフラグメントイオンが異なる化合物に由来するものとして扱われることになる。 At this time, if the peak positions (retention times) of these two fragment ions are identified according to the conventional technique, the position where the sampled ion intensity is maximum is identified as the peak position in the conventional technique. As shown in FIG. 2, the peak position of one fragment ion is t + 3, and the peak position of the other fragment ion is t + 2. Thus, in the conventional method, two peaks that should be associated with the same retention time are associated with different retention times, and as a result, these fragment ions are treated as being derived from different compounds. It will be.

研究者はこのような問題を経験に基づいた勘によって逐次修正しなければならないため、大量の２次元ピークデータを迅速に解析することが困難であった。 Since researchers have to correct such problems sequentially based on experience-based intuition, it was difficult to quickly analyze a large amount of two-dimensional peak data.

本発明は上記課題に鑑みてなされたものであり、その第１の目的は、複数の分析実験から得られた２次元ピークデータ間で、同一の化合物由来のマススペクトル同士を自動的かつ正確に対応付けることのできる情報処理装置の実現を目的とする。 The present invention has been made in view of the above problems, and a first object thereof is to automatically and accurately mass spectra derived from the same compound between two-dimensional peak data obtained from a plurality of analytical experiments. An object is to realize an information processing apparatus that can be associated.

また、本発明の第２の目的は、サンプリングされた２次元スペクトルデータから、個々のピークの位置を正確に同定することのできる情報処理装置の実現を目的とする。 A second object of the present invention is to realize an information processing apparatus capable of accurately identifying the position of each peak from sampled two-dimensional spectrum data.

上記の課題を解決するために、本発明に係る情報処理装置は、クロマトグラフィーと質量分析との組み合わせに基づいて得られる保持時間及びｍ／ｚごとのピークイオン強度の情報である２次元ピークデータが複数含まれる２次元ピークデータセットにおいて、２次元ピークデータ間で対応するマススペクトルを同定する情報処理装置であって、それぞれの２次元ピークデータから保持時間ごとにマススペクトルデータを抽出する第１マススペクトル抽出部と、異なる２次元ピークデータから抽出されたマススペクトルデータ同士を比較することによって、それぞれの２次元ピークデータから抽出された内部標準となるマススペクトルデータが含まれる内部標準マススペクトルデータセットを作成する内部標準同定部と、内部標準マススペクトルデータセットに含まれるそれぞれのマススペクトルデータに対応する保持時間に基づいて、それぞれの２次元ピークデータにおける保持時間を補正する保持時間補正部と、保持時間が補正された２次元ピークデータから保持時間ごとに新たなマススペクトルデータを抽出する第２マススペクトル抽出部と、保持時間が補正された２次元ピークデータ間で、保持時間の違いが所定の範囲内の新たなマススペクトルデータ同士を比較することによって、比較した新たなマススペクトル同士が対応するものであるか否かを判定する対応判定部とを備えていることを特徴とする。 In order to solve the above problems, an information processing apparatus according to the present invention provides two-dimensional peak data that is information on retention time and peak ion intensity for each m / z obtained based on a combination of chromatography and mass spectrometry. 2 is a data processing apparatus for identifying a mass spectrum corresponding to two-dimensional peak data in a two-dimensional peak data set including a plurality of data, and first extracting mass spectrum data for each holding time from each two-dimensional peak data Internal standard mass spectrum data including mass spectrum data as internal standards extracted from each two-dimensional peak data by comparing mass spectrum data extracted from different two-dimensional peak data with the mass spectrum extraction unit Internal standard identification part to create a set and internal standard mass spectrum Based on the retention time corresponding to each mass spectrum data included in the data set, the retention time correction unit for correcting the retention time in each 2D peak data, and the retention time corrected 2D peak data Compare the new mass spectrum data within the predetermined range of the retention time difference between the second mass spectrum extraction unit that extracts new mass spectrum data every time and the two-dimensional peak data with the retention time corrected. And a correspondence determination unit that determines whether or not the new compared mass spectra correspond to each other.

本明細書において、「２次元ピークデータ」とは、クロマトグラフィーと質量分析との組み合わせに基づいて得られるデータであって、かつ、各フラグメントイオン又は擬分子イオンのイオン強度が単一の保持時間に対応付けられたデータのことをいう。そして、その単一の保持時間に対応付けられたイオン強度のことを「ピークイオン強度」という。さらに本明細書において、「ピーク」とは、上記フラグメントイオン又は擬分子イオンのイオン強度が最大となる点のことをいう。また、本明細書において、「マススペクトルデータ」とは、ｍ／ｚごとのイオン強度の情報を示すデータであって、例えば、２次元ピークデータから、或る保持時間についてのｍ／ｚごとのイオン強度の情報を抽出することによって得ることができる。 In the present specification, “two-dimensional peak data” is data obtained based on a combination of chromatography and mass spectrometry, and the ionic strength of each fragment ion or quasi-molecular ion is a single retention time. The data associated with. The ion intensity associated with the single retention time is referred to as “peak ion intensity”. Furthermore, in this specification, the “peak” refers to a point at which the ionic strength of the fragment ion or quasi-molecular ion is maximized. Further, in this specification, “mass spectrum data” is data indicating ion intensity information for each m / z. For example, from two-dimensional peak data, for each m / z for a certain holding time. It can be obtained by extracting ionic strength information.

本発明に係る情報処理装置は、クロマトグラフィーと質量分析との組み合わせに基づいて得られる保持時間及びｍ／ｚごとのピークイオン強度の情報である２次元ピークデータが複数含まれる２次元ピークデータセットにおいて、２次元ピークデータ間で対応するマススペクトルを同定するものである。 An information processing apparatus according to the present invention includes a two-dimensional peak data set including a plurality of two-dimensional peak data, which are information on retention time and peak ion intensity for each m / z obtained based on a combination of chromatography and mass spectrometry. In FIG. 2, the corresponding mass spectrum is identified between the two-dimensional peak data.

上記構成によれば、第１マススペクトル抽出部によって、それぞれの２次元ピークデータから保持時間ごとにマススペクトルデータが抽出される。このマススペクトルデータは化合物を同定する手がかりになるものであり、同一の化合物からは同一のマススペクトルデータが得られる。そして、内部標準同定部によって、異なる２次元ピークデータから抽出されたマススペクトルデータ同士が比較され、それぞれの２次元ピークデータから抽出された互いに類似するマススペクトルデータが含まれる内部標準マススペクトルデータセットが作成される。この内部標準マススペクトルデータセットに含まれるマススペクトルデータは互いに類似するものであり、換言すれば、それぞれの２次元ピークデータに共通して見られるマススペクトルということになる。 According to the above configuration, mass spectrum data is extracted from each two-dimensional peak data for each holding time by the first mass spectrum extraction unit. This mass spectrum data is a clue for identifying a compound, and the same mass spectrum data can be obtained from the same compound. The internal standard identification unit compares the mass spectrum data extracted from the different two-dimensional peak data, and includes an internal standard mass spectrum data set including similar mass spectrum data extracted from each two-dimensional peak data. Is created. The mass spectrum data included in the internal standard mass spectrum data set is similar to each other, in other words, a mass spectrum commonly seen in each two-dimensional peak data.

これらのマススペクトルデータは、互いに類似しているため、同一の化合物から得られたものと推定できる。クロマトグラフィーの原理によれば、同一の化合物は同一の保持時間の画分に含まれるため、上記のセットに含まれるマススペクトルデータは、全て同一の保持時間に対応するべきものである。従って、このセットに含まれるマススペクトルデータを内部標準として、各２次元ピークデータの保持時間のずれを補正することができるのである。 Since these mass spectrum data are similar to each other, it can be estimated that they were obtained from the same compound. According to the chromatographic principle, the same compound is contained in the fraction with the same retention time, so all the mass spectral data contained in the above set should correspond to the same retention time. Accordingly, it is possible to correct a shift in the holding time of each two-dimensional peak data using the mass spectrum data included in this set as an internal standard.

本発明の情報処理装置では、保持時間補正部によって、それぞれの２次元ピーク情報における保持時間が、上述した内部標準マススペクトルデータセットに含まれるそれぞれのマススペクトルデータに対応する保持時間に基づいて補正される。これにより、それぞれの２次元ピークデータごとの保持時間のずれが解消される。 In the information processing apparatus of the present invention, the holding time correction unit corrects the holding time in each two-dimensional peak information based on the holding time corresponding to each mass spectrum data included in the internal standard mass spectrum data set described above. Is done. Thereby, the shift | offset | difference of the holding time for each two-dimensional peak data is eliminated.

続いて、第２マススペクトル抽出部によって、保持時間が補正されたそれぞれの２次元ピークデータから保持時間ごとに新たなマススペクトル情報が抽出される。そして、対応判定部によって、保持時間が補正された２次元ピークデータ間で、保持時間の違いが所定の範囲内の新たなマススペクトルデータ同士が比較され、比較された新たなマススペクトル同士が対応するものであるか否かが判定される。 Subsequently, new mass spectrum information is extracted for each holding time from each two-dimensional peak data with the holding time corrected by the second mass spectrum extraction unit. Then, the correspondence determination unit compares the new mass spectrum data in which the difference in the retention time is within a predetermined range between the two-dimensional peak data whose retention times have been corrected, and the newly compared mass spectra correspond to each other. It is determined whether it is what to do.

つまり、本発明に係る情報処理装置では、最終的に、保持時間の違いが所定の範囲内のマススペクトルデータ同士のみを比較する。従って、保持時間が大きく異なるマススペクトル同士を対応付けてしまうことがない。これにより、マススペクトルの対応付けを正確に行うことができる。さらに、この際用いられる保持時間は、実験ごとのずれが補正された正確なものであるため、対応付けを一層正確に行うことができる。 That is, in the information processing apparatus according to the present invention, finally, only the mass spectrum data in which the difference in retention time is within a predetermined range is compared. Therefore, mass spectra having significantly different retention times are not associated with each other. As a result, mass spectra can be accurately associated. Furthermore, since the holding time used at this time is an accurate one in which the deviation for each experiment is corrected, the association can be performed more accurately.

以上のように、本発明に係る情報処理装置によれば、複数の分析実験から得られた２次元ピークデータ間で、同一の化合物由来のマススペクトル同士を自動的かつ正確に対応付けることができる。 As described above, according to the information processing apparatus of the present invention, mass spectra derived from the same compound can be automatically and accurately associated with each other between two-dimensional peak data obtained from a plurality of analysis experiments.

また、上記内部標準同定部は、２次元ピークデータ間で、保持時間の違いが所定の範囲内のマススペクトルデータ同士を比較することが好ましい。 Moreover, it is preferable that the internal standard identification unit compares mass spectrum data having a retention time difference within a predetermined range between two-dimensional peak data.

上記構成によれば、内部標準同定部によって比較されるマススペクトルデータは、保持時間の違いが所定の範囲内のものとなる。従って、ピークパターンは類似しているものの保持時間が著しく異なるマススペクトルデータのセットを、内部標準として意図せずに用いてしまうのを防止することができる。 According to the above configuration, the mass spectrum data compared by the internal standard identification unit has a difference in retention time within a predetermined range. Therefore, it is possible to prevent unintentional use of a set of mass spectrum data having similar peak patterns but significantly different retention times as an internal standard.

また、上記第１マススペクトル抽出部は、それぞれのマススペクトルデータを、ｍ／ｚごとのイオン強度を要素とするベクトルとして抽出することが好ましい。 Moreover, it is preferable that the said 1st mass spectrum extraction part extracts each mass spectrum data as a vector which makes ion intensity for every m / z an element.

上記構成によれば、マススペクトルデータ同士の類似性の判定は、ベクトル同士の比較に基づいて行われる。ベクトルの類似性を判定する公知の手法には様々な手法があり、類似度を算出することなどにより、マススペクトルデータ同士の類似性を客観的に判定することができるようになる。 According to the above configuration, the similarity between the mass spectrum data is determined based on the comparison between the vectors. There are various known methods for determining the similarity of vectors, and the similarity between mass spectrum data can be objectively determined by calculating the similarity.

また、上記第２マススペクトル抽出部は、それぞれの新たなマススペクトルデータを、ｍ／ｚごとのイオン強度を要素とする新たなベクトルとして抽出し、上記対応判定部は、異なる補正された２次元ピークデータから抽出した新たなベクトル同士の類似度を算出するとともに、算出した類似度を閾値と比較することによって、マススペクトルデータ同士が対応しているか否かを判定することが好ましい。 Further, the second mass spectrum extraction unit extracts each new mass spectrum data as a new vector having an ion intensity for each m / z as an element, and the correspondence determination unit is configured to perform different corrected two-dimensional data. It is preferable to determine whether or not the mass spectrum data corresponds to each other by calculating the similarity between new vectors extracted from the peak data and comparing the calculated similarity with a threshold value.

上記構成によれば、対応判定部によって、それぞれのマススペクトルデータに対応するベクトル同士の類似度が算出され、さらに、算出された類似度と閾値との比較結果に基づいて、マススペクトルデータが対応するものであるか否かが判定される。このように、本発明に係る情報処理装置では、マススペクトルデータの対応付けを、ベクトルの類似度という客観的な指標に基づいて行うことができる。 According to the above configuration, the correspondence determination unit calculates the similarity between vectors corresponding to each mass spectrum data, and further, the mass spectrum data corresponds based on the comparison result between the calculated similarity and the threshold value. It is determined whether it is what to do. As described above, in the information processing apparatus according to the present invention, it is possible to associate mass spectrum data based on an objective index called vector similarity.

また、上記情報処理装置は、上記対応判定部によって対応すると判定された新たなマススペクトル同士について、それぞれの新たなマススペクトルの識別情報が同じ行又は列に配列された対応テーブルを作成するテーブル作成部をさらに備えていることが好ましい。 Further, the information processing apparatus creates a correspondence table in which identification information of each new mass spectrum is arranged in the same row or column for new mass spectra determined to correspond by the correspondence determination unit. It is preferable to further include a portion.

上記構成によれば、作成された対応テーブルを参照することによって、異なる２次元ピークデータ間において、どのマススペクトルとどのマススペクトルとが対応しているかを知ることができる。 According to the above configuration, it is possible to know which mass spectrum corresponds to which mass spectrum between different two-dimensional peak data by referring to the created correspondence table.

また、上記の課題を解決するために、本発明に係る別の情報処理装置は、クロマトグラフィーと質量分析との組み合わせに基づいて得られる保持時間及びｍ／ｚごとのイオン強度の情報である２次元スペクトルデータから、保持時間及びｍ／ｚごとのピークイオン強度の情報である２次元ピークデータを作成する２次元ピークデータ作成部と、上記２次元ピークデータについて、保持時間ごとにピークイオン強度の総和をトータルピーク強度として算出し、保持時間ごとのトータルピーク強度の情報であるトータルピークデータを作成するトータルピーク作成部と、上記トータルピークデータにおいてトータルピーク強度が常に閾値以上になる区間又は常に閾値よりも大きくなる区間を同定し、その区間においてトータルピーク強度が極大になる保持時間を、２次元ピークデータの上記区間に含まれるピークに対応する真の保持時間として同定する第１ピーク位置同定部とを備えていることを特徴とする。 In order to solve the above problems, another information processing apparatus according to the present invention is information on retention time and ion intensity for each m / z obtained based on a combination of chromatography and mass spectrometry 2. A two-dimensional peak data creation unit for creating two-dimensional peak data, which is information on the retention time and peak ion intensity for each m / z, from the two-dimensional spectrum data; A total peak generator that calculates total sum as total peak intensity and generates total peak data that is information of total peak intensity for each holding time, and a section where the total peak intensity is always greater than or equal to the threshold in the total peak data, or always threshold And the total peak intensity is maximized in that section. That the retention time, characterized in that it comprises a first peak position identifying unit for identifying as a true retention time corresponding to the peaks included in the section of the two-dimensional peak data.

本明細書において、「２次元スペクトルデータ」とは、クロマトグラフィーと質量分析との組み合わせに基づいて得られるデータであって、各フラグメントイオン又は擬分子イオンのイオン強度が複数の保持時間にまたがっているデータのことをいう。 In this specification, “two-dimensional spectral data” is data obtained based on a combination of chromatography and mass spectrometry, and the ionic strength of each fragment ion or pseudo-molecular ion extends over a plurality of retention times. It means the data.

上記構成によれば、まず、２次元ピークデータ作成部によって、２次元スペクトルデータから２次元ピークデータが作成される。ここで、一例として、作成された２次元ピークデータにおいて、図９のピーク１０１とピーク１０２のように、本来は同一の保持時間に対応付けられるべきピークが、ランダム誤差のために異なる保持時間（ｔ_１とｔ_２）に対応付けられているとする。上記構成によれば、トータルピーク作成部によって、保持時間ごとにトータルピーク強度が算出される。その結果、保持時間ｔ_１及びｔ_２におけるトータルピーク強度は閾値よりも大きくなる（あるいは閾値以上となる）。従って、保持時間がｔ_１からｔ_２までの区間は、第１ピーク同定部によって、トータルピーク強度が常に閾値以上になる区間（あるいは常に閾値よりも大きくなる区間）として同定される。そして、第１ピーク同定部によって、その区間においてトータルピーク強度が極大になる保持時間ｔ_１が、ピーク１０１とピーク１０２の真の保持時間として同定される。従って、ピーク１０１とピーク１０２は、同一の保持時間ｔ_１に位置し、単一の化合物由来のピークとして扱うことができる。 According to the above configuration, first, the two-dimensional peak data is created from the two-dimensional spectrum data by the two-dimensional peak data creation unit. Here, as an example, in the created two-dimensional peak data, peaks that should be associated with the same retention time, such as the peak 101 and the peak 102 in FIG. Assume that these are associated with t ₁ and t ₂ ). According to the above configuration, the total peak intensity is calculated for each holding time by the total peak creating unit. As a result, the total peak intensity at the holding times t ₁ and t ₂ becomes larger than the threshold (or becomes greater than or equal to the threshold). Therefore, the section from the retention time t ₁ to t ₂ is identified by the first peak identification unit as a section where the total peak intensity is always greater than or equal to the threshold (or a section where the total peak intensity is always greater than the threshold). Then, the holding time t _{1 at} which the total peak intensity is maximized in the section is identified as the true holding time of the peak 101 and the peak 102 by the first peak identifying unit. Thus, the peak 101 and peak 102 is located in the same retention time t _1, it can be treated as the peak derived from a single compound.

以上のように、本発明に係る情報処理装置によれば、サンプリングされた２次元スペクトルデータから、個々のピークの位置を正確に同定することができる。 As described above, according to the information processing apparatus of the present invention, the position of each peak can be accurately identified from the sampled two-dimensional spectrum data.

また、上記第１ピーク位置同定部は、さらに、上記２次元ピークデータの上記区間に含まれるピークイオン強度を、同定した真の保持時間に対応付けることが好ましい。 Moreover, it is preferable that the first peak position identifying unit further associates the peak ion intensity included in the section of the two-dimensional peak data with the identified true retention time.

上記構成によれば、ピークイオン強度が正確な保持時間に対応付けられた２次元ピークデータを得ることができる。 According to the above configuration, two-dimensional peak data in which peak ion intensity is associated with an accurate holding time can be obtained.

また、上記２次元ピークデータ作成部は、クロマトグラフィーと質量分析との組み合わせに基づいて得られる保持時間及びｍ／ｚごとのイオン強度の情報である２次元スペクトルデータからｍ／ｚごとにマスクロマトグラムデータを抽出するとともに、それぞれのマスクロマトグラムデータにおいてイオン強度が極大となる保持時間に基づいて仮のピーク保持時間を同定する第２ピーク位置同定部と、それぞれのマスクロマトグラムデータにおいて、仮のピーク保持時間に隣接する保持時間におけるイオン強度を、仮のピーク保持時間のイオン強度に加算することによってピークイオン強度を算出するとともに、算出したピークイオン強度を仮のピーク保持時間と対応付けることによって、上記２次元ピークデータを作成するスペクトル強度合体部とを含んでいることが好ましい。 In addition, the two-dimensional peak data creation unit performs mass chromatography for each m / z from two-dimensional spectrum data, which is information on retention time and ion intensity for each m / z obtained based on a combination of chromatography and mass spectrometry. In addition to extracting gram data, each mass chromatogram data includes a second peak position identifying unit that identifies a tentative peak retention time based on a retention time at which the ionic strength becomes maximum, and each mass chromatogram data By calculating the peak ion intensity by adding the ion intensity at the holding time adjacent to the peak holding time to the ion intensity at the temporary peak holding time and associating the calculated peak ion intensity with the temporary peak holding time. , Spectral intensity to create the above two-dimensional peak data Preferably contains a body portion.

上記構成によれば、まず、第２ピーク位置同定部によって、２次元スペクトルデータからｍ／ｚごとにマスクロマトグラムデータが抽出される。そして、それぞれのマスクロマトグラムデータにおいてイオン強度が極大となる保持時間に基づいて仮のピーク保持時間が同定される。続いて、スペクトル強度合体部によって、それぞれのマスクロマトグラムデータにおいて、仮のピーク保持時間に隣接する保持時間におけるイオン強度が、仮のピーク保持時間のイオン強度に加算される。この加算されたイオン強度はピークイオン強度となる。そして、このピークイオン強度は、仮のピーク保持時間と対応付けられる。その結果、複数の保持時間にまたがったイオン強度は、仮のピーク保持時間に纏められる。このようにして、２次元スペクトルデータから２次元ピークデータを作成することができる。 According to the above configuration, first, the mass chromatogram data is extracted for each m / z from the two-dimensional spectrum data by the second peak position identification unit. Then, a temporary peak retention time is identified based on the retention time at which the ionic strength is maximized in each mass chromatogram data. Subsequently, in each mass chromatogram data, the ion intensity at the holding time adjacent to the temporary peak holding time is added to the ion intensity at the temporary peak holding time by the spectrum intensity merging unit. This added ion intensity becomes the peak ion intensity. This peak ionic strength is associated with a temporary peak retention time. As a result, the ion intensity over a plurality of holding times is collected into a temporary peak holding time. In this way, two-dimensional peak data can be created from the two-dimensional spectrum data.

また、上記情報処理装置は、クロマトグラフィーと質量分析との組み合わせに基づいて得られる保持時間及びｍ／ｚごとのイオン強度の情報である２次元スペクトルデータからｍ／ｚごとにマスクロマトグラムデータを抽出するとともに、それぞれのマスクロマトグラムデータにおけるイオン強度を平滑化することによって、上記２次元ピークデータ作成部に入力する２次元スペクトルデータを作成する平滑化部をさらに備えていることが好ましい。 In addition, the information processing apparatus obtains mass chromatogram data for each m / z from two-dimensional spectrum data that is information on retention time and ion intensity for each m / z obtained based on a combination of chromatography and mass spectrometry. It is preferable to further include a smoothing unit that creates two-dimensional spectrum data to be input to the two-dimensional peak data creation unit by extracting and smoothing the ion intensity in each mass chromatogram data.

通常、イオン強度をサンプリングする際には、ランダムノイズが混入することが避けられない。ここで、上記構成によれば、平滑化部によって、２次元スペクトルデータにおけるイオン強度が保持時間軸に沿って平滑化されるので、ランダムノイズの影響を抑制することができる。 Usually, when sampling ion intensity, it is inevitable that random noise is mixed. Here, according to the above configuration, since the ion intensity in the two-dimensional spectrum data is smoothed along the holding time axis by the smoothing unit, the influence of random noise can be suppressed.

ところで、上記情報処理装置は、ハードウェアで実現してもよいし、プログラムをコンピュータに実行させることによって実現してもよい。具体的には、本発明に係るプログラムは、上記各部としてコンピュータを動作させるプログラムであり、本発明に係る記録媒体には、当該プログラムが記録されている。 By the way, the information processing apparatus may be realized by hardware or may be realized by causing a computer to execute a program. Specifically, a program according to the present invention is a program that causes a computer to operate as each of the above-described units, and the program is recorded on a recording medium according to the present invention.

これらのプログラムがコンピュータによって実行されると、当該コンピュータは、上記情報処理装置として動作する。したがって、上記情報処理装置と同様の効果を奏する。 When these programs are executed by a computer, the computer operates as the information processing apparatus. Therefore, the same effects as those of the information processing apparatus are achieved.

以上のように、本発明に係る情報処理装置は、それぞれの２次元ピークデータから保持時間ごとにマススペクトルデータを抽出する第１マススペクトル抽出部と、異なる２次元ピークデータから抽出されたマススペクトルデータ同士を比較することによって、それぞれの２次元ピークデータから抽出された互いに類似するマススペクトルデータが含まれる内部標準マススペクトルデータセットを作成する内部標準同定部と、内部標準マススペクトルデータセットに含まれるそれぞれのマススペクトルデータに対応する保持時間に基づいて、それぞれの２次元ピークデータにおける保持時間を補正する保持時間補正部と、保持時間が補正された２次元ピークデータから保持時間ごとに新たなマススペクトルデータを抽出する第２マススペクトル抽出部と、保持時間が補正された２次元ピークデータ間で、保持時間の違いが所定の範囲内の新たなマススペクトルデータ同士を比較することによって、比較した新たなマススペクトル同士が対応するものであるか否かを判定する対応判定部とを備えた構成となっている。従って、上述したように、複数の分析実験から得られた２次元ピークデータ間で、同一の化合物由来のマススペクトル同士を自動的かつ正確に対応付けることができるという効果を奏する。 As described above, the information processing apparatus according to the present invention includes a first mass spectrum extraction unit that extracts mass spectrum data from each two-dimensional peak data for each holding time, and a mass spectrum extracted from different two-dimensional peak data. Included in the internal standard mass spectrum data set and an internal standard mass spectrum data set that creates internal standard mass spectrum data sets that contain similar mass spectrum data extracted from each two-dimensional peak data by comparing the data A holding time correction unit for correcting the holding time in each two-dimensional peak data based on the holding time corresponding to each mass spectrum data, and a new one for each holding time from the two-dimensional peak data in which the holding time is corrected. Second mass spectrum to extract mass spectrum data New mass spectra that correspond to each other by comparing the new mass spectrum data in which the difference in retention time is within a predetermined range between the output portion and the two-dimensional peak data with the retention time corrected. It is the structure provided with the corresponding | compatible determination part which determines whether it is. Therefore, as described above, there is an effect that mass spectra derived from the same compound can be automatically and accurately associated between two-dimensional peak data obtained from a plurality of analysis experiments.

また、本発明に係る別の情報処理装置は、クロマトグラフィーと質量分析との組み合わせに基づいて得られる保持時間及びｍ／ｚごとのイオン強度の情報である２次元スペクトルデータから、保持時間及びｍ／ｚごとのピークイオン強度の情報である２次元ピークデータを作成する２次元ピークデータ作成部と、上記２次元ピークデータについて、保持時間ごとにピークイオン強度の総和をトータルピーク強度として算出し、保持時間ごとのトータルピーク強度の情報であるトータルピークデータを作成するトータルピーク作成部と、上記トータルピークデータにおいてトータルピーク強度が常に閾値以上になる区間又は常に閾値よりも大きくなる区間を同定し、その区間においてトータルピーク強度が極大になる保持時間を、２次元ピークデータの上記区間に含まれるピークに対応する真の保持時間として同定する第１ピーク位置同定部とを備えた構成となっている。従って、上述したように、サンプリングされた２次元スペクトルデータから、個々のピークの位置を正確に同定することができるという効果を奏する。 In addition, another information processing apparatus according to the present invention provides a retention time and m from two-dimensional spectrum data which is information on retention time and ion intensity for each m / z obtained based on a combination of chromatography and mass spectrometry. For the two-dimensional peak data creating unit that creates two-dimensional peak data that is information of peak ion intensity for each / z, and for the two-dimensional peak data, the sum of peak ion intensities for each holding time is calculated as the total peak intensity, A total peak creation unit that creates total peak data that is information on the total peak intensity for each holding time, and a section in which the total peak intensity is always greater than or equal to the threshold in the total peak data or a section that is always greater than the threshold, The retention time at which the total peak intensity becomes maximum in that section is the two-dimensional peak. It has a configuration that includes a first peak position identifying unit for identifying as a true retention time corresponding to the peaks included in the section of the chromatography data. Therefore, as described above, there is an effect that the position of each peak can be accurately identified from the sampled two-dimensional spectrum data.

〔実施形態１〕
本発明に係る情報処理装置の一実施形態について図１から図１３に基づいて説明すると以下の通りである。 Embodiment 1
An embodiment of an information processing apparatus according to the present invention will be described below with reference to FIGS.

本実施形態のスペクトル解析装置（情報処理装置）は、ＧＣ／ＭＳ、ＬＣ／ＭＳなど、クロマトグラフィーと質量分析とを組み合わせた測定装置がサンプリングしたデータを解析する装置である。上記の測定装置によって得られたイオン強度情報のデータは、イオン化による質量数と電荷の比（以下「ｍ／ｚ」という）、及び保持時間についての２次元スペクトルデータであるといえる。本実施形態のスペクトル解析装置は、上記の２次元スペクトルデータに含まれるピークの位置（すなわちピークに対応するｍ／ｚ及び保持時間）を正確に同定するために用いられる。 The spectrum analysis apparatus (information processing apparatus) of the present embodiment is an apparatus that analyzes data sampled by a measurement apparatus that combines chromatography and mass spectrometry, such as GC / MS and LC / MS. It can be said that the data of the ionic strength information obtained by the above measuring device is two-dimensional spectral data on the ratio of mass number to electric charge by ionization (hereinafter referred to as “m / z”) and retention time. The spectrum analysis apparatus of this embodiment is used to accurately identify the position of a peak (that is, m / z and retention time corresponding to the peak) included in the two-dimensional spectrum data.

まず、測定装置によって得られるデータの表記方法について定義する。上述したように、測定装置によって得られるイオン強度情報（以下「スペクトル強度」という）は、ｍ／ｚ及び保持時間ごとに変動する。ここで、得られるスペクトルデータにおいて、ｍ／ｚの番号を値の小さいものから順に１，２，…，Ｍ、保持時間の番号を時間の小さいものから順に１，２，…，Ｋとした場合に、保持時間がｋ番目でｍ／ｚがｍ番目のサンプル点のスペクトル強度を、ｘ_ｋ，ｍと表すことにする。 First, a description method of data obtained by the measuring apparatus is defined. As described above, the ion intensity information (hereinafter referred to as “spectral intensity”) obtained by the measuring apparatus varies for each m / z and holding time. Here, in the obtained spectrum data, the m / z number is 1, 2,..., M in order from the smallest value, and the retention time number is 1, 2,. Let x _{k, m} denote the spectral intensity of the sample point with the retention time kth and m / z mth.

図６は、本実施形態のスペクトル解析装置の要部構成を示す機能ブロック図である。本実施形態のスペクトル解析装置１は、図６に示すように、主としてピークデータ作成部１０及びピーク対応付け部２０を機能ブロックとして備えたマイクロコンピュータである。 FIG. 6 is a functional block diagram showing the main configuration of the spectrum analyzing apparatus according to the present embodiment. As shown in FIG. 6, the spectrum analysis apparatus 1 according to the present embodiment is a microcomputer mainly including a peak data creation unit 10 and a peak association unit 20 as functional blocks.

ピークデータ作成部１０は、ＬＣ／ＭＳ１００（又はＧＣ／ＭＳ）から得られた２次元スペクトルデータにおけるピークの保持時間及びイオン強度を同定し、各フラグメントイオン及び／又は擬分子イオンのイオン強度が単一の保持時間及びｍ／ｚに対応付けられた２次元ピークデータを作成するためのものである。また、ピーク対応付け部２０は、上記のピークデータ作成部１０によって得られた複数の実験に関する２次元ピークデータ同士を比較し、複数の実験間で同一の化合物由来のピーク同士を対応付けるためのものである。 The peak data creation unit 10 identifies the peak retention time and ion intensity in the two-dimensional spectral data obtained from the LC / MS 100 (or GC / MS), and the ion intensity of each fragment ion and / or quasi-molecular ion is simple. This is for creating two-dimensional peak data associated with one holding time and m / z. The peak association unit 20 compares two-dimensional peak data related to a plurality of experiments obtained by the peak data creation unit 10 and associates peaks derived from the same compound among the plurality of experiments. It is.

（１．ピークデータ作成部／ピークデータ作成工程）
図７は、ピークデータ作成部１０の詳細な機能構成を示すブロック図である。図７に示すように、本実施形態におけるピークデータ作成部１０は、平滑化部１１、ピーク時間同定部（第２ピーク位置同定部）１２、スペクトル強度合体部１３、トータルピーク作成部１４、トータルピーク合体部（第１ピーク位置同定部）１５、及びピーク合体部（第１ピーク位置同定部）１６を含んでいる。 (1. Peak data creation unit / peak data creation process)
FIG. 7 is a block diagram showing a detailed functional configuration of the peak data creation unit 10. As shown in FIG. 7, the peak data creation unit 10 in this embodiment includes a smoothing unit 11, a peak time identification unit (second peak position identification unit) 12, a spectral intensity coalescence unit 13, a total peak creation unit 14, and a total A peak merging portion (first peak position identifying portion) 15 and a peak merging portion (first peak position identifying portion) 16 are included.

平滑化部１１は、２次元スペクトルデータからｍ／ｚごとにマスクロマトグラムデータを抽出するとともに、それぞれのマスクロマトグラムにおけるイオン強度を平滑化する。平滑化した２次元スペクトルデータは、ピーク時間同定部１２に入力される。 The smoothing unit 11 extracts the mass chromatogram data for each m / z from the two-dimensional spectrum data, and smoothes the ion intensity in each mass chromatogram. The smoothed two-dimensional spectrum data is input to the peak time identification unit 12.

ピーク時間同定部１２は、平滑化部１１から入力された２次元スペクトルデータからｍ／ｚごとにマスクロマトグラムデータを抽出するとともに、それぞれのマスクロマトグラムにおいてイオン強度が極大となる保持時間に基づいて仮のピーク保持時間（ピークに対応する保持時間）を同定する。この仮のピーク位置の情報は、抽出されたマスクロマトグラムデータとともにスペクトル強度合体部１３に入力される。 The peak time identification unit 12 extracts mass chromatogram data for each m / z from the two-dimensional spectrum data input from the smoothing unit 11, and based on the retention time at which the ion intensity is maximized in each mass chromatogram. The temporary peak retention time (retention time corresponding to the peak) is identified. This temporary peak position information is input to the spectral intensity combining unit 13 together with the extracted mass chromatogram data.

スペクトル強度合体部１３は、ピーク時間同定部１２から入力されたそれぞれのマスクロマトグラムデータにおいて、仮のピーク保持時間に隣接する保持時間におけるイオン強度を、仮のピーク保持時間のイオン強度に加算することによってピークイオン強度を算出し、算出したピークイオン強度を仮のピーク保持時間と対応付ける。この処理により、複数の保持時間にまたがるフラグメントイオン又は擬分子イオンのイオン強度は、単一の保持時間及びｍ／ｚに対応付けられたピークイオン強度となり、その結果、２次元ピークデータが作成される。作成された２次元ピークデータは、トータルピーク作成部１４に入力される。 In each mass chromatogram data input from the peak time identifying unit 12, the spectrum intensity combining unit 13 adds the ion intensity at the holding time adjacent to the temporary peak holding time to the ion intensity at the temporary peak holding time. Thus, the peak ion intensity is calculated, and the calculated peak ion intensity is associated with the temporary peak retention time. By this process, the ionic strength of fragment ions or quasi-molecular ions over a plurality of holding times becomes a single holding time and a peak ion strength associated with m / z, and as a result, two-dimensional peak data is created. The The created two-dimensional peak data is input to the total peak creation unit 14.

トータルピーク作成部１４は、スペクトル強度合体部１３から入力された２次元ピークデータについて、保持時間ごとにピークイオン強度の総和を算出する。算出されたピークイオン強度の総和は、トータルピーク強度と称され、各保持時間の画分に含まれる化合物の相対量を示す指標となる。このトータルピーク強度は保持時間ごとに算出されるため、保持時間とトータルピーク強度との関係を示すトータルピークデータが作成される。このトータルピークデータは、トータルピーク合体部１５に入力される。 The total peak creating unit 14 calculates the sum of peak ion intensities for each holding time for the two-dimensional peak data input from the spectral intensity combining unit 13. The total sum of the calculated peak ion intensities is referred to as total peak intensity and serves as an index indicating the relative amount of the compound contained in each retention time fraction. Since this total peak intensity is calculated for each holding time, total peak data indicating the relationship between the holding time and the total peak intensity is created. The total peak data is input to the total peak combining unit 15.

トータルピーク合体部１５は、トータルピークデータにおいてトータルピーク強度が常に閾値以上になる区間又は常に閾値よりも大きくなる区間を求め、その区間においてトータルピーク強度が極大になる保持時間を同定する。ここで同定された保持時間は、２次元ピークデータの同区間に含まれる全てのピークに対応する真の保持時間となる。ここで求めた区間及び真の保持時間の情報は、２次元ピークデータとともにピーク合体部１６に入力される。さらに、トータルピーク合体部１５は、ここで求めた区間に含まれるトータルピーク強度の総和を算出し、算出した総和を真の保持時間に対応付ける。これにより、更新されたトータルピークデータが作成される。この更新されたトータルピークデータは、ピーク対応付け部２０の第１ピークベクトル作成部２１に入力される。 The total peak merging unit 15 obtains a section in which the total peak intensity is always greater than or equal to the threshold in the total peak data or a section in which the total peak intensity is always greater than the threshold, and identifies a holding time in which the total peak intensity is maximum in that section. The retention time identified here is a true retention time corresponding to all the peaks included in the same section of the two-dimensional peak data. The section and true retention time information obtained here are input to the peak merger 16 together with the two-dimensional peak data. Further, the total peak merging unit 15 calculates the sum of the total peak intensities included in the section obtained here, and associates the calculated sum with the true holding time. Thereby, updated total peak data is created. The updated total peak data is input to the first peak vector creation unit 21 of the peak association unit 20.

ピーク合体部１６は、トータルピーク合体部１５から入力された２次元ピークデータにおいて、上記の区間に含まれる全てのピークの位置を、上記の真の保持時間に対応付ける。具体的には、上記の区間に含まれるピークのピーク強度の総和をｍ／ｚごとに算出し、算出した総和を真の保持時間に対応付ける。これにより、更新された２次元ピークデータが作成される。更新された２次元ピークは、対応付け部２０の第１ピークベクトル作成部２１に入力される。 In the two-dimensional peak data input from the total peak merging unit 15, the peak merging unit 16 associates all peak positions included in the section with the true retention time. Specifically, the sum of peak intensities of peaks included in the section is calculated for each m / z, and the calculated sum is associated with the true retention time. Thereby, updated two-dimensional peak data is created. The updated two-dimensional peak is input to the first peak vector creation unit 21 of the association unit 20.

次に、ピークデータ作成部１０の動作について説明する。ピークデータ作成部１０の処理工程（ピーク解析方法）は、平滑化工程、ピーク時間同定工程、スペクトル強度合体工程、トータルピーク作成工程、トータルピーク合体工程、及びピーク合体工程を含んでいる。これらの工程について順に説明すると以下の通りである。 Next, the operation of the peak data creation unit 10 will be described. The processing steps (peak analysis method) of the peak data creation unit 10 include a smoothing step, a peak time identification step, a spectrum intensity coalescence step, a total peak creation step, a total peak coalescence step, and a peak coalescence step. These steps will be described in order as follows.

（１−１．平滑化部／平滑化工程）
まず、測定装置から得られた２次元スペクトルデータに対して、保持時間軸に沿って平滑化を行う。平滑化の詳細な手順は次の通りである。 (1-1. Smoothing part / smoothing step)
First, the two-dimensional spectrum data obtained from the measuring device is smoothed along the holding time axis. The detailed procedure for smoothing is as follows.

２次元スペクトルをｍ／ｚ軸と直交する平面で切り出して得られるマスクロマトグラムにおいて、保持時間がｋ番目のサンプル点のスペクトル強度をｘ_ｋとする。平滑化は、ｋ番目のサンプル点を中心とする合計Ｎ個のサンプル点についてスペクトル強度の平均値を算出し、算出した平均値をｋ番目のサンプル点のスペクトル強度に割り当てることによって行われる。これにより、ｋ番目のサンプル点の平滑化後のイオン強度ｘ’_ｋは、次の式（１） In a mass chromatogram obtained by cutting out a two-dimensional spectrum on a plane orthogonal to the m / z axis, let x _{k be} the spectral intensity at the k-th sample point with a retention time. Smoothing is performed by calculating an average value of the spectrum intensity for a total of N sample points centered on the kth sample point and assigning the calculated average value to the spectrum intensity of the kth sample point. Thereby, the ion intensity x ′ _k after the smoothing of the k-th sample point is expressed by the following equation (1).

（ただし、Ｎ’＝（Ｎ−１）／２）
によって表される。 (However, N ′ = (N−1) / 2)
Represented by

なお、この処理は、各ｍ／ｚと直交する全ての平面について行われる。従って、保持時間がｋ番目でｍ／ｚがｍ番目のサンプル点の平滑化後のスペクトル強度ｘ’_ｋ，ｍは、次の式（２） This process is performed for all planes orthogonal to each m / z. Accordingly, the smoothed spectral intensity x ′ _{k, m} of the sample point with the retention time kth and m / z _mth is expressed by the following equation (2).

によって表される。この処理により、２次元スペクトルデータは、保持時間軸に沿って平滑化されたものとなる。この平滑化工程は、平滑化部１１によって行われる。 Represented by By this processing, the two-dimensional spectrum data is smoothed along the holding time axis. This smoothing step is performed by the smoothing unit 11.

上記の平滑化工程による効果は次の通りである。通常、ＧＣ／ＭＳやＬＣ／ＭＳなどによって得られるデータでは、測定されたイオン強度がランダム誤差を含んでいる。従って、測定されたイオン強度情報をそのまま用いてピークの保持時間を同定すると、ランダム誤差によりピークの保持時間が本来のものからずれてしまうことがある。しかしながら、上記の平滑化工程を行えば、ランダム誤差による影響を抑制し、ピークの位置を正確に検出することができるようになる。 The effects of the smoothing process are as follows. Usually, in data obtained by GC / MS, LC / MS, or the like, the measured ion intensity includes a random error. Therefore, if the peak retention time is identified using the measured ion intensity information as it is, the peak retention time may deviate from the original due to a random error. However, if the above smoothing step is performed, the influence of random errors can be suppressed and the peak position can be accurately detected.

（１−２．ピーク時間同定部／ピーク時間同定工程）
次に、平滑化後の２次元スペクトルをｍ／ｚ軸と直交する平面で切り出して得られるマスクロマトグラムにおけるピークの位置（ピークに対応する保持時間）を同定する。ピークの位置を同定する際には、ある保持時間のサンプル点がピークであるか否かを検定する作業を保持時間軸に沿って順次行っていく。検定方法は次の通りである。 (1-2. Peak time identification unit / peak time identification step)
Next, the peak position (retention time corresponding to the peak) in the mass chromatogram obtained by cutting the smoothed two-dimensional spectrum along a plane orthogonal to the m / z axis is identified. When identifying the position of the peak, an operation for examining whether or not a sample point at a certain holding time is a peak is sequentially performed along the holding time axis. The test method is as follows.

保持時間がｋ番目のサンプル点がピークであるか否かの判定は、ｋ−１番目のサンプル点における曲線の傾きと、ｋ＋１番目のサンプル点における曲線の傾きとに基づいて行われる。詳細には、ｋ−１番目のサンプル点における傾きが０よりも大きく、かつ、ｋ＋１番目のサンプル点における傾きが０よりも小さい場合に、ｋ番目のサンプル点がピークであると判定する。 The determination as to whether or not the k-th sample point at the holding time is a peak is made based on the slope of the curve at the (k−1) -th sample point and the slope of the curve at the (k + 1) -th sample point. Specifically, when the slope at the (k−1) th sample point is larger than 0 and the slope at the (k + 1) th sample point is smaller than 0, it is determined that the kth sample point is a peak.

なお、各サンプル点における曲線の傾きは、そのサンプル点を中心とする合計Ｎ個のサンプル点のスペクトル強度を基に、線形回帰式を用いて簡便に求めることができる。具体的には、ｋ番目のサンプル点における曲線の傾きをｂ_ｋとすると、ｂ_ｋは、次の式（３） The slope of the curve at each sample point can be easily obtained using a linear regression equation based on the spectral intensities of a total of N sample points with the sample point as the center. Specifically, when the slope of the curve at the k-th sample point and b _k, b _k, the following equation (3)

によって表される。 Represented by

この処理は、各ｍ／ｚと直交する全てのマスクロマトグラムについて行われる。従って、保持時間がｋ番目でｍ／ｚがｍ番目のサンプル点における曲線の傾きをｂ_ｋ、ｍとすると、ｂ_ｋ，ｍは、次の式（４） This process is performed for all mass chromatograms orthogonal to each m / z. Therefore, the inclination of _{b k} of the curve m / z retention time at k-th in the m-th sample _point, when _{_m, b k, m,} the following formula (4)

によって表される。このピーク時間同定工程は、ピーク時間同定部１２によって行われる。 Represented by This peak time identification step is performed by the peak time identification unit 12.

（１−３．スペクトル強度合体部／スペクトル強度合体工程）
続いて、平滑化後の２次元スペクトルをｍ／ｚ軸と直交する断面で切り出してみる。得られたマスクロマトグラムにおいて、極小値−極大値−極小値からなる区間を１つの「山」と定義すると、この山に含まれる全てのサンプル点のスペクトル強度は、その山のピークの位置から派生したものであると考えられる。 (1-3. Spectral intensity coalescence part / spectral intensity coalescence process)
Subsequently, the smoothed two-dimensional spectrum is cut out in a cross section orthogonal to the m / z axis. In the obtained mass chromatogram, if a section consisting of a minimum value, a maximum value, and a minimum value is defined as one “mountain”, the spectral intensities of all sample points included in this mountain are calculated from the peak positions of the peaks. It is considered to be derived.

従って、各山に含まれる全てのサンプル点のスペクトル強度をピークの位置に纏める処理を行う。この処理は、ピーク周辺のサンプル点のスペクトル強度をピークのサンプル点のスペクトル強度に加算し、ピーク周辺のサンプル点のスペクトル強度を０にすることによって行われる。 Therefore, a process for collecting the spectral intensities of all the sample points included in each mountain at the peak position is performed. This processing is performed by adding the spectral intensity of the sample points around the peak to the spectral intensity of the peak sample points, and setting the spectral intensity of the sample points around the peak to zero.

図８は、この処理手順を詳細に説明するフロー図である。ｋ番目のサンプル点がピークと判定されたとする。ここで、ｋ番目の保持時間をｔ_ｋ、保持時間ｔ_ｋにおけるピーク強度を示す変数をＰ（ｔ_ｋ）とすると、まず、Ｐ（ｔ_ｋ）にｋ番目のサンプル点のスペクトル強度ｘ_ｋを代入する（Ｓ３１）。次に、ピークの左側に存在するサンプル点のスペクトル強度をピークに順次加算していくために以下の処理を行う。 FIG. 8 is a flowchart for explaining this processing procedure in detail. Assume that the kth sample point is determined to be a peak. Here, if the k-th holding time is t _k and the variable indicating the peak intensity at the holding time t _k is P (t _k ), first, the spectrum intensity x _k of the k-th sample point is set to P (t _k ). Substitute (S31). Next, the following processing is performed in order to sequentially add the spectral intensity of the sample points existing on the left side of the peak to the peak.

まず、処理対象のサンプル点をピークの左隣にセットするために、処理対象のサンプル番号を示す変数ｓにｋ−１を代入する（Ｓ３２）。続いて、処理対象のサンプル点における曲線の傾きｂ_ｓを０と比較する（Ｓ３３）。ここで、傾きｂ_ｓが０よりも大きい場合は、処理対象のサンプル点が現在のピークに含まれるべきサンプル点であると判断し、次のステップＳ３４に進む。 First, in order to set the sample point to be processed on the left side of the peak, k−1 is assigned to the variable s indicating the sample number to be processed (S32). Subsequently, the slope b _s of the curve at the sample point to be processed is compared with 0 (S33). If the slope b _s is larger than 0, it is determined that the sample point to be processed is a sample point that should be included in the current peak, and the process proceeds to the next step S34.

ステップＳ３４では、処理対象のサンプル点のスペクトル強度ｘ_ｓをＰ（ｔ_ｋ）に加算する。そして、処理対象のサンプル点を左側に１つずらすために、変数ｓの値を一つ減らす（Ｓ３５）。そして、ステップＳ３３に戻る。 In step S34, the spectrum intensity x _s of the sample point to be processed is added to P (t _k ). Then, in order to shift the sample point to be processed by one to the left, the value of the variable s is decreased by one (S35). Then, the process returns to step S33.

一方、ステップＳ３３において、傾きｂ_ｓが０以下である場合は、処理対象のサンプル点が現在のピークに含まれるべきものでないと判断し、ステップＳ３６に進む。そして、今度は、ピークの右側に存在するサンプル点のスペクトル強度をピークに順次加算していくために以下の処理を行う。 On the other hand, if the slope b _s is 0 or less in step S33, it is determined that the sample point to be processed should not be included in the current peak, and the process proceeds to step S36. Then, this time, the following processing is performed in order to sequentially add the spectral intensity of the sample points existing on the right side of the peak to the peak.

まず、処理対象のサンプル点をピークの右隣にセットするために、変数ｓにｋ＋１を代入する（Ｓ３６）。続いて、処理対象のサンプル点における曲線の傾きｂ_ｓを０と比較する（Ｓ３７）。ここで、傾きｂ_ｓが０よりも小さい場合は、処理対象のサンプル点が現在のピークに含まれるべきサンプル点であると判断し、次のステップＳ３８に進む。 First, in order to set the sample point to be processed to the right of the peak, k + 1 is substituted for the variable s (S36). Subsequently, the slope b _s of the curve at the sample point to be processed is compared with 0 (S37). If the slope b _s is smaller than 0, it is determined that the sample point to be processed is a sample point that should be included in the current peak, and the process proceeds to the next step S38.

ステップＳ３８では、処理対象のサンプル点のスペクトル強度ｘ_ｓをＰ（ｔ_ｋ）に加算する。そして、処理対象のサンプル点を右側に１つずらすために、変数ｓの値を一つ増やす（Ｓ３５）。そして、ステップＳ３７に戻る。 In step S38, the spectral intensity x _s of the sample point to be processed is added to P (t _k ). Then, in order to shift the sample point to be processed by one to the right, the value of the variable s is increased by one (S35). Then, the process returns to step S37.

一方、ステップＳ３７において、傾きｂ_ｓが０以上である場合には、処理対象のサンプル点が現在のピークに含まれるべきものでないと判断し、処理を終了する。以上により、１つのピークに属するべき各サンプル点のスペクトル強度が全てピークの位置に纏められる。その結果、ｔ番目の保持時間ｔ_ｋに纏められたピーク強度はＰ（ｔ_ｋ）として得られる。なお、ピーク以外の位置におけるＰ（ｔ_ｋ）は０とする。 On the other hand, if the slope b _s is greater than or equal to 0 in step S37, it is determined that the sample point to be processed should not be included in the current peak, and the process ends. As described above, the spectrum intensities of the sample points that should belong to one peak are all collected at the peak position. As a result, the peak intensities are summarized in t th retention time t _k can be obtained as P (t _k). Note that P (t _k ) at positions other than the peak is 0.

この処理は、各ｍ／ｚと直交する全ての断面に含まれる全てのピークについて行われる。以下では、保持時間がｔ_ｋでｍ／ｚがｍ番目におけるピーク強度をＰ（ｔ_ｋ）_ｍと表現する。また、保持時間、ｍ／ｚ、ピーク強度を要素とするデータのことを２次元ピークデータという。図９の上段は、この２次元ピークデータを可視化したものである。上記のスペクトル強度合体工程は、スペクトル強度合体部１３によって行われる。 This processing is performed for all peaks included in all cross sections orthogonal to each m / z. Hereinafter, the peak intensity at the holding time t _k and m / z is m-th is expressed as P (t _k ) _m . Data having retention time, m / z, and peak intensity as elements are referred to as two-dimensional peak data. The upper part of FIG. 9 is a visualization of this two-dimensional peak data. The spectral intensity combining step is performed by the spectral intensity combining unit 13.

上記の第１ピーク時間同定工程及びスペクトル強度合体工程によれば、同一のフラグメントイオンから得られるイオン強度情報が複数の保持時間にまたがってしまう場合であっても、それぞれのイオン強度情報を纏め、単一の保持時間に対応する一つのピーク強度情報として算出することができる。つまり、各フラグメントイオンに由来するピークの強度と、同ピークに対応する保持時間とを得ることができる。 According to the first peak time identifying step and the spectral intensity merging step, even when ion intensity information obtained from the same fragment ion spans a plurality of holding times, the respective ion intensity information is summarized, It can be calculated as one peak intensity information corresponding to a single holding time. That is, the intensity of the peak derived from each fragment ion and the retention time corresponding to the peak can be obtained.

（１−４．トータルピーク作成部／トータルピーク作成工程）
次に、スペクトル強度合体工程後の２次元ピークデータを保持時間軸と直交する断面で切り出し、得られたマススペクトルに含まれる全てのピークのピーク強度を合計することによって、トータルピーク強度を算出する。この処理は、各保持時間と直交する全てのマススペクトルについて行われる。換言すれば、ある保持時間に対応する全てのピークを１つに纏める作業を全ての保持時間について行う。 (1-4. Total peak creation part / Total peak creation process)
Next, the total peak intensity is calculated by cutting out the two-dimensional peak data after the spectral intensity merging step in a cross section orthogonal to the retention time axis and summing the peak intensities of all the peaks included in the obtained mass spectrum. . This process is performed for all mass spectra orthogonal to each holding time. In other words, the operation of combining all the peaks corresponding to a certain holding time into one is performed for all the holding times.

従って、保持時間ｔ_ｋにおけるトータルピーク強度をＴＩ（ｔ_ｋ）とすると、ＴＩ（ｔ_ｋ）は次の式（５） Therefore, when the total peak intensity at the holding time t _k is TI (t _k ), TI (t _k ) is expressed by the following equation (5).

によって表される。 Represented by

なお、このトータルピーク作成工程は、トータルピーク作成部１４によって行われる。以下では、この保持時間ごとのトータルピーク強度ＴＩ（ｔ_ｋ）のデータのことをトータルピークデータという。図９の中段は、トータルピークデータを可視化したものである。また、このトータルピークデータを作成する際に一つのトータルピークに纏められるピーク群のことをトータルピーク群という。トータルピーク群は、単一の化合物から得られるマススペクトルに相当する。 This total peak creation step is performed by the total peak creation unit 14. Hereinafter, the data of the total peak intensity TI (t _k ) for each holding time is referred to as total peak data. The middle part of FIG. 9 is a visualization of the total peak data. In addition, a peak group that is combined into one total peak when creating the total peak data is referred to as a total peak group. The total peak group corresponds to a mass spectrum obtained from a single compound.

（１−５．トータルピーク合体部／トータルピーク合体工程）
図９の中段に示すトータルピークデータにおいて、保持時間ｔ_１、ｔ_２におけるトータルピーク強度は、同一の化合物に由来するものであると考えられる。従って、トータルピーク合体工程では、これらの同一の化合物に由来すると考えられるトータルピークを一つに纏める処理を、以下の手順に従って行う。 (1-5. Total peak coalescence part / total peak coalescence process)
In the total peak data shown in the middle part of FIG. 9, the total peak intensities at the retention times t ₁ and t ₂ are considered to be derived from the same compound. Therefore, in the total peak merging step, a process of combining the total peaks considered to be derived from these same compounds into one is performed according to the following procedure.

まず、連続する保持時間ｔ_ｉ〜ｔ_ｉ＋ｎ（ただしｎは自然数）においてトータルピーク強度が常に０よりも大きくなる区間を同定する。そして、この区間において、トータルピーク強度が最大（極大）になる保持時間ｔ_ｕと山の両端部（極小点）の保持時間ｔ_ｓｕ、ｔ_ｅｕ（ただしｓｕ＜ｅｕとする）とを同定する。なお、１つの山に含まれるトータルピーク強度の数が２つしかない場合などでは、ｔ_ｕがｔ_ｓｕ又はｔ_ｅｕと等しくなる場合もある。なお、ここで得られた保持時間ｔ_ｕは、後に、２次元ピークデータの区間ｔ_ｓｕ〜ｔ_ｅｕに含まれる全てのピークの真の保持時間になるものである。 First, an interval in which the total peak intensity is always greater than 0 in continuous holding times t _{i to} t _i _{+ n} (where n is a natural number) is identified. In this section, the holding time t _{u at} which the total peak intensity becomes maximum (maximum) and the holding times t _su and t _eu at both ends (minimum points) of the peaks are identified. Note that, for example, when there are only two total peak intensities included in one peak, t _u may be equal to t _su or t _eu . The retention time t _u obtained here becomes the true retention time of all peaks included in the interval t _{su to} t _eu of the two-dimensional peak data later.

そして、保持時間ｔ_ｕを除くｔ_ｓｕからｔ_ｅｕまでのトータルピーク強度を、保持時間ｔ_ｕにおけるトータルピーク強度に加算し、保持時間ｔ_ｕ以外のトータルピーク強度を０にする。この処理は、全ての区間の全ての山について行われる。 Then, the total peak intensity from t _su except holding time t _u to t _eu, adds to the total peak intensities in the retention time t _u, the total peak intensity other than the retention time t _u is zero. This process is performed for all mountains in all sections.

以上により、１つの化合物に由来する複数のトータルピーク強度が、化合物の保持時間を代表する１つの位置（トータルピーク強度が極大になる保持時間）に纏められる。 As described above, a plurality of total peak intensities derived from one compound are collected at one position representing the retention time of the compound (the retention time at which the total peak intensity is maximized).

一例を示すと、図９中段のトータルピークデータでは、連続する保持時間ｔ_１及びｔ_２においてトータルピーク強度が０よりも大きくなっている。この場合、ｔ_ｕ＝ｔ_１、ｔ_ｓｕ＝ｔ_１、ｔ_ｅｕ＝ｔ_２となる。そして、保持時間ｔ_２におけるトータルピーク強度を保持時間ｔ_１におけるトータルピーク強度に加算し、保持時間ｔ_２におけるトータルピーク強度を０にする。その結果、トータルピークデータは、図９下段のようになる。 As an example, in the total peak data in the middle stage of FIG. 9, the total peak intensity is greater than 0 in the continuous holding times t ₁ and t ₂ . In this case, t _u = t ₁ , t _su = t ₁ , and t _eu = t ₂ . Then, by adding the total peak intensities in the retention time t ₂ to the total peak intensities in the retention time t _1, the total peak intensities in the retention time t ₂ to 0. As a result, the total peak data is as shown in the lower part of FIG.

他の例を示すと、図１０の（１）の場合は連続する２つのトータルピークが１つのピークに纏められ、（２）の場合は２つのトータルピークの間に０となる保持時間があることから２つのピークのままであり、（３）の場合は４つのトータルピークが２つのピークに纏められる。 As another example, in FIG. 10 (1), two continuous total peaks are combined into one peak, and in (2), there is a retention time of 0 between the two total peaks. Thus, the two peaks remain, and in the case of (3), the four total peaks are combined into two peaks.

なお、本実施形態では、トータルピーク強度が常に０よりも大きくなる区間を同定したが、トータルピーク強度にノイズ等が含まれる場合は、所定の閾値を設定し、常に閾値よりも大きくなる区間や、常に閾値以上となる区間を同定してもよい。 In this embodiment, a section in which the total peak intensity is always greater than 0 is identified. However, when noise or the like is included in the total peak intensity, a predetermined threshold is set, and a section in which the total peak intensity is always greater than the threshold is set. A section that is always greater than or equal to the threshold may be identified.

このトータルピーク合体工程は、トータルピーク合体部１５によって行われる。 This total peak merging step is performed by the total peak merging unit 15.

（１−６．ピーク合体部／ピーク合体工程）
次に、図４上段の２次元ピークデータについても、上記のトータルピーク合体工程で求めた区間に含まれる各ピーク強度を纏める処理を行う。つまり、上記の保持時間ｔ_ｓｕ〜ｔ_ｅｕに含まれるピークのピーク強度の総和を各ｍ／ｚごとに取り、総和を保持時間ｔ_ｕに対応付ける。 (1-6. Peak coalescence part / peak coalescence process)
Next, for the two-dimensional peak data in the upper part of FIG. 4, a process for collecting the peak intensities included in the section obtained in the total peak coalescing process is performed. In other words, taking the sum of the peak intensities of the peaks included in the retention time _t su _{~t eu} for each m / z, associated to hold the total time _{t u.}

上述した例で説明すると、図９上段の２次元ピークデータにおける、ｍ／ｚがＭのピークについて、ピーク強度Ｐ（ｔ_１）_ＭとＰ（ｔ_２）_Ｍとを加算し、加算したピーク強度を保持時間ｔ_１に対応付ける。ここではＰ（ｔ_２）_Ｍ＝０であるので、処理の前後でピークの位置及び強度に変化はない。 Explaining with the above-described example, the peak intensity P (t ₁ ) _M and P (t ₂ ) _M are added to the peak of m / z of M in the two-dimensional peak data in the upper part of FIG. Is associated with the holding time t ₁ . Here, since P (t ₂ ) _M = 0, there is no change in the position and intensity of the peak before and after the processing.

同様に、ｍ／ｚがｍのピークについても、ピーク強度Ｐ（ｔ_１）_ｍとＰ（ｔ_２）_ｍとを加算し、加算したピーク強度を保持時間ｔ_１に対応付ける。この場合、Ｐ（ｔ_１）_ｍ＝０であるので、処理の前後でピークの強度に変化はないが、ピークの位置がｔ_１に変化する。上記の処理の結果、図１１のようになる。 Similarly, for the peak with m / z of m, the peak intensities P (t ₁ ) _m and P (t ₂ ) _m are added, and the added peak intensity is associated with the holding time t ₁ . In this case, since P (t ₁ ) _m = 0, the peak intensity does not change before and after the processing, but the peak position changes to t ₁ . The result of the above processing is as shown in FIG.

この処理は、トータルピーク合体工程で求めた全ての区間について、全てのｍ／ｚごとに行われる。なお、ピーク合体工程はピーク合体部１６によって行われる。 This process is performed for every m / z for all sections obtained in the total peak merging step. The peak merging step is performed by the peak merging unit 16.

上記のトータルピーク合体工程及びピーク合体工程による効果は以下の通りである。図９上段に示すｍ／ｚごとのピークデータにおいて、ピーク１０１とピーク１０２とは、同一の化合物から解離したフラグメントイオン又は擬分子イオンによるものであると考えられる。しかしながら、測定データにはランダム誤差が含まれるため、スペクトル強度合体工程の結果、図９上段のピーク１０１とピーク１０２とのように、同一の化合物から解離したフラグメントイオンによるピークであっても、互いに異なる保持時間に対応付けられてしまうことがある。その結果、本来同一の化合物に関するピークとして扱うべき２つのピーク１０１・１０２を、別々の化合物に関するピークとして扱ってしまうことになる。 The effects of the total peak merging step and the peak merging step are as follows. In the peak data for each m / z shown in the upper part of FIG. 9, the peak 101 and the peak 102 are considered to be due to fragment ions or pseudo-molecular ions dissociated from the same compound. However, since the measurement data includes random errors, even if the peaks are fragment ions dissociated from the same compound, such as the peak 101 and the peak 102 in the upper part of FIG. It may be associated with different holding times. As a result, the two peaks 101 and 102 that should be treated as peaks related to the same compound are handled as peaks related to different compounds.

しかしながら、上記のトータルピーク合体工程及びピーク合体工程によれば、図１１に示すように、ピーク１０１とピーク１０２とが同じ保持時間に対応付けられる。その結果、ピーク１０１とピーク１０２とを同一の化合物に関するピークとして扱うことができるようになるのである。 However, according to the total peak merging step and the peak merging step described above, as shown in FIG. 11, the peak 101 and the peak 102 are associated with the same holding time. As a result, the peak 101 and the peak 102 can be handled as peaks related to the same compound.

（２．ピーク対応付け部／ピークデータ対応付け工程）
ピーク対応付け部２０は、複数の実験から２次元ピークデータのセットが得られた際に、複数の２次元ピークデータ間で、同一の化合物由来のトータルピーク群（マススペクトル）同士を対応付けるためのものである。なお、ピーク対応付け部２０に入力する２次元ピークデータは、必ずしも上記のピークデータ作成部１０によって作成したものでなくてもよく、同様のデータであれば公知の手法によって作成したものでもよい。 (2. Peak association unit / peak data association step)
The peak association unit 20 associates total peak groups (mass spectra) derived from the same compound between a plurality of two-dimensional peak data when a set of two-dimensional peak data is obtained from a plurality of experiments. Is. Note that the two-dimensional peak data input to the peak association unit 20 does not necessarily have to be created by the above-described peak data creation unit 10, and may be created by a known method as long as it is similar data.

ピークデータ対応付け部２０によるピークデータ対応付け工程の概要について説明すると、次の通りである。ガスクロマトグラフィーや液体クロマトグラフィーなどの分離法では、実験ごとに保持時間に誤差が含まれるため、それぞれの実験から得られた２次元ピークデータ間でトータルピーク群同士を対応付けるためには保持時間の補正が必要となる。そのために、ピークデータ対応付け部２０は、比較したい２次元ピークデータの全てに共通に存在するトータルピーク群を内部標準ピーク群として検出し、この内部標準ピーク群の保持時間を基に２次元ピークデータにおける保持時間を補正する。さらに、ピークデータ対応付け部２０は、補正された保持時間に基づいて、ユーザによって指定された許容範囲内で実験間のトータルピーク群同士を対応づけ、対応テーブルとして表現する。 The outline of the peak data association process by the peak data association unit 20 will be described as follows. In separation methods such as gas chromatography and liquid chromatography, since the retention time includes an error for each experiment, in order to associate the total peak groups between the two-dimensional peak data obtained from each experiment, Correction is required. For this purpose, the peak data association unit 20 detects a total peak group that exists in common in all of the two-dimensional peak data to be compared as an internal standard peak group, and the two-dimensional peak based on the retention time of the internal standard peak group. Correct the retention time in the data. Further, the peak data association unit 20 associates the total peak groups between experiments within an allowable range designated by the user based on the corrected holding time, and represents the result as a correspondence table.

以下では、実験ｓ（ｓ＝１，２，…，Ｓ）からピークデータ作成部１０によって得られる２次元ピークデータを次の表１のように表す。 In the following, the two-dimensional peak data obtained by the peak data creation unit 10 from the experiment s (s = 1, 2,..., S) is expressed as shown in Table 1 below.

ただし、Ｐ^(ｓ)(ｔ_ｋ(ｓ))_ｍは、実験ｓから得られた２次元ピークデータにおいて、保持時間がｔ_ｋ(ｓ)でｍ／ｚがｍ番目のサンプル点のピーク強度を示す。 However, P ^(s) (t _{k (s)} ) _m is the two-dimensional peak data obtained from the experiment s, and the peak intensity at the m-th sample point with the retention time t _{k (s)} and m / z. Show.

また、実験ｓ（ｓ＝１，２，…，Ｓ）からピークデータ作成部１０によって得られるトータルピークデータは次の表２のようになる。 The total peak data obtained by the peak data creation unit 10 from the experiment s (s = 1, 2,..., S) is as shown in Table 2 below.

ただし、ＴＩ^(ｓ)(ｔ_ｋ(ｓ))は、実験ｓから得られたトータルピークデータにおいて、保持時間がｔ_ｋ(ｓ)のトータルピーク強度を示す。 However, TI ^(s) (t _{k (s)} ) represents the total peak intensity with a retention time t _{k (s)} in the total peak data obtained from the experiment s.

図１は、ピーク対応付け部２０の詳細な機能構成を示すブロック図である。図１に示すように、本実施形態におけるピーク対応付け部２０は、第１ピークベクトル作成部（第１マススペクトル抽出部）２１、内部標準同定部２２、内部標準確認部２３、保持時間補正部２４、第２ピークベクトル作成部（第２マススペクトル抽出部）２５、対応判定部２６、及びテーブル作成部２７を含んでいる。 FIG. 1 is a block diagram illustrating a detailed functional configuration of the peak association unit 20. As shown in FIG. 1, the peak association unit 20 in the present embodiment includes a first peak vector creation unit (first mass spectrum extraction unit) 21, an internal standard identification unit 22, an internal standard confirmation unit 23, and a retention time correction unit. 24, a second peak vector creation unit (second mass spectrum extraction unit) 25, a correspondence determination unit 26, and a table creation unit 27.

第１ピークベクトル作成部２１は、それぞれの２次元ピークデータから保持時間ごとにマススペクトルデータを抽出し、それぞれのマススペクトルデータに含まれるピーク強度を要素とするピークベクトルを作成する。なお、本実施形態において、第１ピークベクトル作成部２１は、このピークベクトルの作成を全ての２次元ピークデータについて行う。作製されたピークベクトルは、内部標準同定部２２に入力される。 The first peak vector creation unit 21 extracts mass spectrum data for each holding time from each two-dimensional peak data, and creates a peak vector whose peak intensity is included in each mass spectrum data. In the present embodiment, the first peak vector creation unit 21 creates this peak vector for all two-dimensional peak data. The created peak vector is input to the internal standard identification unit 22.

内部標準同定部２２は、或る２次元ピークデータから作成されたピークベクトルと他の２次元ピークデータから作成されたピークベクトルとを比較し、異なる２次元ピークデータ間で類似するピークベクトルを同定する。なお、この異なる２次元ピークデータ間で類似するピークベクトルのことを内部標準ピークベクトルという。内部標準同定部２２は、全ての２次元ピークデータにおいて共通に見られる内部標準ピークベクトルのセットを作成する。後続の保持時間補正部２４では、このセットに含まれるそれぞれの内部標準ベクトルの保持時間に基づいて、それぞれの２次元ピークデータの保持時間を補正する。なお、内部標準ピークベクトルのセットは、１セットだけ作成してもよいし、複数セット作成してもよい。この内部標準ピークベクトルのセットは、内部標準確認部２３に入力される。 The internal standard identification unit 22 compares a peak vector created from certain two-dimensional peak data with a peak vector created from other two-dimensional peak data, and identifies similar peak vectors between different two-dimensional peak data To do. A peak vector similar between the different two-dimensional peak data is referred to as an internal standard peak vector. The internal standard identification unit 22 creates a set of internal standard peak vectors commonly found in all two-dimensional peak data. The subsequent holding time correction unit 24 corrects the holding time of each two-dimensional peak data based on the holding time of each internal standard vector included in this set. Note that only one set of internal standard peak vectors may be created, or a plurality of sets may be created. This set of internal standard peak vectors is input to the internal standard confirmation unit 23.

内部標準確認部２３は、内部標準同定部２２によって同定された内部標準ピークベクトルが、適切であるか否かを確認するためのものである。具体的には、ユーザに対して内部標準ピークベクトルを視覚的に表示し、ユーザの指示に従って不適切な内部標準ピークベクトルを除外する機能を有している。また、内部標準確認部２３は、ユーザの指示に従って内部標準ピークベクトルを他のピークベクトルに変更することもできる。このようにして変更された内部標準ピークベクトルのセットは、保持時間補正部２４に入力される。 The internal standard confirmation unit 23 is for confirming whether or not the internal standard peak vector identified by the internal standard identification unit 22 is appropriate. Specifically, it has a function of visually displaying the internal standard peak vector to the user and excluding inappropriate internal standard peak vectors according to the user's instructions. Moreover, the internal standard confirmation part 23 can also change an internal standard peak vector into another peak vector according to a user's instruction | indication. The set of internal standard peak vectors changed in this way is input to the holding time correction unit 24.

保持時間補正部２４は、内部標準ピークベクトルのセットに含まれるそれぞれの内部標準ピークベクトルに対応する保持時間に基づいて、異なる実験から得られた２次元ピークデータの保持時間を補正する。具体的には、異なる２次元ピークデータから得られた内部標準ピークベクトル同士が同一の保持時間となるように、それぞれの２次元ピークデータの保持時間を補正する。保持時間が補正された２次元ピークデータは、第２ピークベクトル作成部２５に入力される。 The holding time correction unit 24 corrects the holding time of two-dimensional peak data obtained from different experiments based on the holding time corresponding to each internal standard peak vector included in the set of internal standard peak vectors. Specifically, the retention times of the respective two-dimensional peak data are corrected so that the internal standard peak vectors obtained from different two-dimensional peak data have the same retention time. The two-dimensional peak data with the holding time corrected is input to the second peak vector creation unit 25.

第２ピークベクトル作成部２５は、保持時間が補正された２次元ピークデータから保持時間ごとに１次元ピークデータ（マススペクトルデータ）を切り出し、それぞれのマススペクトルデータに含まれるピーク強度を要素とするピークベクトルを新たに作成する。なお、第２ピークベクトル作成部２５は、このピークベクトルの作成を全ての２次元ピークデータについて行う。新たに作成されたピークベクトルは、対応判定部２６に入力される。 The second peak vector creation unit 25 extracts one-dimensional peak data (mass spectrum data) for each retention time from the two-dimensional peak data with the retention time corrected, and uses the peak intensity included in each mass spectrum data as an element. Create a new peak vector. The second peak vector creating unit 25 creates this peak vector for all two-dimensional peak data. The newly created peak vector is input to the correspondence determination unit 26.

対応判定部２６は、保持時間が補正された２次元ピーク情報間で、保持時間の違いが所定の範囲内のピークベクトル同士を比較することによって、比較したピークベクトル同士が対応するものであるか否かを判定する。 The correspondence determination unit 26 compares the peak vectors in which the difference in retention time is within a predetermined range between the two-dimensional peak information with the retention times corrected, so that the compared peak vectors correspond to each other. Determine whether or not.

テーブル作成部２７は、トータルピーク群の対応関係を示すテーブルを作成するためのものであり、本実施形態では、対応するトータルピーク群の識別情報が同一の列に記載された対応テーブルを作成する。なお、テーブル作成部２７は、対応するトータルピーク群の識別情報が同一の行に記載された対応テーブルを作成してもよい。 The table creation unit 27 is for creating a table showing the correspondence relationship between the total peak groups. In this embodiment, the table creation unit 27 creates a correspondence table in which the identification information of the corresponding total peak group is described in the same column. . The table creation unit 27 may create a correspondence table in which the identification information of the corresponding total peak group is described in the same row.

次に、ピーク対応付け部２０の動作について説明する。ピーク対応付け部２０によるピーク対応付け工程は、第１ピークベクトル作成工程、内部標準同定工程、内部標準確認工程、保持時間補正工程、及び第２ピークベクトル作成工程、対応判定工程、及びテーブル作成工程を含んでいる。これらの工程について順に説明すると以下の通りである。 Next, the operation of the peak association unit 20 will be described. The peak association process by the peak association unit 20 includes a first peak vector creation process, an internal standard identification process, an internal standard confirmation process, a holding time correction process, a second peak vector creation process, a correspondence determination process, and a table creation process. Is included. These steps will be described in order as follows.

（２−１．第１ピークベクトル作成部／第１ピークベクトル作成工程）
まず、上記の表１に示す２次元ピークデータから、保持時間ごとに、トータルピーク群に含まれる各ピークのピーク強度を要素とするピークベクトルを作成する。このピークベクトルの作成は、全ての実験について行われる。 (2-1. First Peak Vector Creation Unit / First Peak Vector Creation Step)
First, from the two-dimensional peak data shown in Table 1 above, a peak vector having the peak intensity of each peak included in the total peak group as an element is created for each holding time. This peak vector is created for all experiments.

ｓ番目の実験における保持時間ｔ_ｋ（ｓ）のピークベクトルをｆ^（ｓ）（ｔ_ｋ（ｓ））とすると、ｆ^（ｓ）（ｔ_ｋ（ｓ））は、次の式（６）
ｆ^（ｓ）（ｔ_ｋ（ｓ））＝（Ｐ^(ｓ)(ｔ_ｋ(ｓ))_１，Ｐ^(ｓ)(ｔ_ｋ(ｓ))_２，…，Ｐ^(ｓ)(ｔ_ｋ(ｓ))_ｍ，…
…，Ｐ^(ｓ)(ｔ_ｋ(ｓ))_Ｍ） …（６）
によって表すことができる。 Assuming that the peak vector of the holding time t _{k (s)} in the s-th experiment is f ^(s) (t _{k (s)} ), f ^(s) (t _{k (s)} ) is expressed by the following equation (6).
f ^(s) (t _{k (s)} ) = (P ^(s) (t _{k (s)} ) ₁ , P ^(s) (t _{k (s)} ) ₂ ,..., P ^(s) (t _{k (s )} ) _m , ...
…, P ^(s) (t _{k (s)} ) _M ) (6)
Can be represented by

この工程は、換言すれば、各実験について、２次元ピークデータから保持時間ごとに１次元ピークデータ（マススペクトルデータ）を抽出する工程であるともいえる。この第１ピークベクトル作成工程は、第１ピークベクトル作成部２１によって行われる。 In other words, it can be said that this step is a step of extracting one-dimensional peak data (mass spectrum data) from the two-dimensional peak data for each holding time for each experiment. This first peak vector creation step is performed by the first peak vector creation unit 21.

（２−２．内部標準同定部／内部標準同定工程）
次に、異なる実験間で類似するピークベクトルのセットを同定する。ピークベクトルは、保持時間ごとに切り出した１次元ピークデータ、すなわちマススペクトルデータであるため、その保持時間の画分に含まれた化合物に依存したベクトルとなる。従って、異なる実験から得られた混合物の中に同一の化合物が含まれていれば、類似するピークベクトルが得られると考えられる。よって、この類似するピークベクトルを内部標準ピークベクトルとして同定し、この内部標準ピークベクトルに対応する保持時間に基づいて、実験ごとの保持時間のずれを補正するのである。 (2-2. Internal Standard Identification Unit / Internal Standard Identification Step)
Next, a set of peak vectors that are similar between different experiments are identified. Since the peak vector is one-dimensional peak data cut out for each holding time, that is, mass spectrum data, the peak vector is a vector depending on the compound included in the fraction of the holding time. Therefore, if the same compound is contained in a mixture obtained from different experiments, it is considered that a similar peak vector can be obtained. Therefore, this similar peak vector is identified as the internal standard peak vector, and the deviation of the holding time for each experiment is corrected based on the holding time corresponding to the internal standard peak vector.

本実施形態では、ピークベクトルを比較する際に３つの閾値ＴＩＰ，ＮＴＰ，Ｔ_ｉｎｔを導入する。これらの閾値は、ユーザによって設定することができる。 In this embodiment, three threshold values TIP, NTP, and _Tint are introduced when comparing peak vectors. These thresholds can be set by the user.

ＴＩＰ及びＮＴＰは、ピークベクトルが内部標準ピークベクトルの候補としてふさわしいか否かを検定するために用いられるものである。ＴＩＰはピーク強度の閾値を示し、一方ＮＴＰはピークの本数の閾値を示す。また、Ｔ_ｉｎｔは、ピークベクトル同士を比較する際の保持時間の許容範囲を規定するための閾値である。 TIP and NTP are used to test whether a peak vector is suitable as a candidate for an internal standard peak vector. TIP indicates the peak intensity threshold, while NTP indicates the peak number threshold. T _int is a threshold value for defining an allowable range of retention time when comparing peak vectors.

図１２は、ピークベクトルの中から内部標準ピークベクトルを探索する工程を示すフロー図である。 FIG. 12 is a flowchart showing a process of searching for an internal standard peak vector from peak vectors.

まず、Ｓ個の実験の中から、任意に１つの実験を選択する。ここでは、説明の簡略化のため、１番目の実験を選択するものとする。そして、１番目の実験の１番目のピークベクトルｆ^（１）（ｔ_１（１））を選択する（Ｓ８０１）。 First, one experiment is arbitrarily selected from S experiments. Here, for simplification of explanation, the first experiment is selected. Then, the first peak vector f ⁽¹⁾ (t _{1 (1)} ) of the first experiment is selected (S801).

次に、このピークベクトルｆ^（１）（ｔ_１（１））が内部標準ピークベクトルの候補としてふさわしいか否かを、上記のＴＩＰ及びＮＴＰを用いて検定する（Ｓ８０２）。具体的には、ピークベクトルｆ^（１）（ｔ_１（１））が有する要素Ｐ^（１）（ｔ_１（１））_ｍ（ｍ＝１，２，…，Ｍ）のうち、ＴＩＰよりも大きい値を有する要素がＮＴＰ個よりも多い場合に、そのピークベクトルは内部標準ピークベクトルの候補になりうると判定され、ステップＳ８０３に進む。一方、上記の条件を満たさない場合は、そのピークベクトルは内部標準ベクトルの候補になりえないと判定され、ステップＳ８１０、Ｓ８１１、Ｓ８０２の順に進み、実験１の次のピークベクトルについて検定する。 Next, whether or not this peak vector f ⁽¹⁾ (t _{1 (1)} ) is suitable as a candidate for the internal standard peak vector is tested using the above TIP and NTP (S802). Specifically, among the elements P ⁽¹⁾ (t _{1 (1)} ) _m (m = 1, 2,..., M) included in the peak vector f ⁽¹⁾ (t _{1 (1)} ), than TIP. If there are more than NTP elements having a large value, it is determined that the peak vector can be a candidate for the internal standard peak vector, and the process proceeds to step S803. On the other hand, if the above condition is not satisfied, it is determined that the peak vector cannot be a candidate for the internal standard vector, and the process proceeds to steps S810, S811, and S802 in order, and the next peak vector of Experiment 1 is verified.

ステップＳ８０３では、１番目の実験と比較する実験を選択する。ここでは、実験２から順に選択していくことにする。 In step S803, an experiment to be compared with the first experiment is selected. Here, the selection is made in order from Experiment 2.

そして、実験２のピークベクトルｆ^（２）（ｔ_ｋ（２））（ｋ＝１，２，…，Ｋ）の中から、ｆ^（１）（ｔ_１（１））と最も類似するピークベクトルをｆ^（２） _ｍａｘとして同定する（Ｓ８０４）。ここで、実験２のピークベクトルｆ^（２）（ｔ_ｋ（２））（ｋ＝１，２，…，Ｋ）のうち、ｆ^（１）（ｔ_１（１））に対して探索対象となるものは、次の式（７）
ｔ_１（１）−Ｔ_ｉｎｔ＜ｔ_ｋ（２）＜ｔ_１（１）＋Ｔ_ｉｎｔ …（７）
を満たすｆ^（２）（ｔ_ｋ（２））である。つまり、式（７）を満たす実験２のピークベクトルｆ^（２）（ｔ_ｋ（２））の中から、実験１のピークベクトルｆ^（１）（ｔ_１（１））と最も類似するピークベクトルをｆ^（２） _ｍａｘとして同定する。 Then, the peak vector most similar to f ⁽¹⁾ (t _{1 (1)} ) among the peak vectors f ⁽²⁾ (t _{k (2)} ) (k = 1, 2,..., K) of Experiment 2 Is identified as f ⁽²⁾ _max (S804). Here, among the peak vectors f ⁽²⁾ (t _{k (2)} ) (k = 1, 2,..., K) of Experiment 2, the search target is set for f ⁽¹⁾ (t _{1 (1)} ). The following formula (7)
t _{1 (1)} −T _int <t _{k (2)} <t _{1 (1)} + T _int (7)
F ⁽²⁾ (t _{k (2)} ) that satisfies That is, the peak vector most similar to the peak vector f ⁽¹⁾ (t _{1 (1)} ) of Experiment 1 among the peak vectors f ⁽²⁾ (t _{k (2)} ) of Experiment 2 that satisfy Equation (7). _Is identified as f ⁽²⁾ _max .

ここで、例えば、ピークベクトルｆ^（２） _ｍａｘとして、ｆ^（２）（ｔ_２（２））が得られたとする。すると今度は、実験２のピークベクトルｆ^（２）（ｔ_２（２））と最も類似する実験１のピークベクトルｆ^（１） _ｍａｘを探索する（Ｓ８０５）。この場合も上記と同様に、探索対象となる実験１のピークベクトルは、Ｔ_ｉｎｔによって規定される許容範囲内のものに限定される。具体的には、次の式（８）
ｔ_２（２）−Ｔ_ｉｎｔ＜ｔ_ｋ（１）＜ｔ_２（２）＋Ｔ_ｉｎｔ …（８）
を満たすｆ^（１）（ｔ_ｋ（１））の中から、ｆ^（２）（ｔ_２（２））と最も類似するピークベクトルを探索する。 Here, for example, it is assumed that f ⁽²⁾ (t _{2 (2)} ) is obtained as the peak vector f ⁽²⁾ _max . Then, the peak vector f ⁽¹⁾ _max of Experiment 1 that is most similar to the peak vector f ⁽²⁾ (t _{2 (2)} ) of Experiment ₂ is searched for (S805). In this case as well, the peak vector of Experiment 1 to be searched is limited to that within the allowable range defined by T _int as described above. Specifically, the following equation (8)
_{_{_{t 2 (2) -T int <}}} t k (1) <t 2 (2) + T int ... (8)
A peak vector most similar to f ⁽²⁾ (t _{2 (2)} ) is searched from f ⁽¹⁾ (t _{k (1)} ) satisfying the above.

次に、得られたピークベクトルｆ^（１） _ｍａｘが、元のピークベクトルｆ^（１）（ｔ_１（１））と一致するか否かが判定される（Ｓ８０６）。つまり、ピークベクトルｆ^（１） _ｍａｘとしてｆ^（１）（ｔ_１（１））が得られた場合は、ピークベクトルｆ^（１）（ｔ_１（１））とｆ^（２）（ｔ_２（２））とが真に類似すると判断され、これらのピークベクトル同士は、内部標準ピークベクトルの候補として対応付けられ、ステップＳ８０７に進む。一方、ピークベクトルｆ^（１） _ｍａｘとして、ｆ^（１）（ｔ_１（１））が得られなかった場合は、ピークベクトルｆ^（１）（ｔ_１（１））は、内部標準ピークベクトルにはなり得ないと判定され、ステップＳ８１０、Ｓ８１１、Ｓ８０２の順に進み、実験１の次のピークベクトルについて検定を行う。 Next, it is determined whether or not the obtained peak vector f ⁽¹⁾ _max matches the original peak vector f ⁽¹⁾ (t _{1 (1)} ) (S806). That is, when f ⁽¹⁾ (t _{1 (1)} ) is obtained as the peak vector f ⁽¹⁾ _max , the peak vectors f ⁽¹⁾ (t _{1 (1)} ) and f ⁽²⁾ (t _{2 ( 2)} ) are determined to be truly similar, and these peak vectors are associated as candidates for the internal standard peak vector, and the process proceeds to step S807. On the other hand, when f ⁽¹⁾ (t _{1 (1)} ) is not obtained as the peak vector f ⁽¹⁾ _max , the peak vector f ⁽¹⁾ (t _{1 (1)} ) is changed to the internal standard peak vector. It is determined that it is not possible to proceed, and the process proceeds to steps S810, S811, and S802, and the next peak vector of Experiment 1 is tested.

図１３（ａ）及び図１３（ｂ）は、このことを一般化して示した図である。このように、本実施形態の内部標準同定工程では、２つの実験間において双方向でピークベクトルの類似性を検討し、２つのピークベクトルが互いに最も類似する関係にある場合に、内部標準ピークベクトルの候補としてこれら２つのピークベクトルを対応付ける。 FIG. 13A and FIG. 13B are generalized views of this. As described above, in the internal standard identification process of the present embodiment, the similarity between the peak vectors is examined bi-directionally between the two experiments, and when the two peak vectors are most similar to each other, the internal standard peak vector is determined. These two peak vectors are associated with each other as candidates.

ステップＳ８０７では、実験１と比較する次の実験が残っているか否かを判定する。そして、次の実験が残っている場合は、次の実験を選択する（Ｓ８０８）。上記の例では、実験３を選択する。そして、実験３についても、上記と同様の手順により、ｆ^（１）（ｔ_１（１））と互いに最も類似するピークベクトルを探索する（Ｓ８０４〜Ｓ８０６）。このことを、Ｓ番目の実験まで繰り返し、ｆ^（１）（ｔ_１（１））と互いに最も類似するピークベクトルが全ての実験について得られた場合に、ｆ^（１）（ｔ_１（１））が内部標準ピークベクトルであると認定し（Ｓ８０９）、ｆ^（１）（ｔ_１（１））及びこれに類似するｆ^（ｓ） _ｍａｘ（ｓ＝２，３，…，Ｓ）を、内部標準ピークベクトルのセットとして対応付ける。そして、ステップＳ８１０に進み、さらなる内部標準ピークベクトルのセットを探索する。 In step S807, it is determined whether or not the next experiment to be compared with experiment 1 remains. If the next experiment remains, the next experiment is selected (S808). In the above example, Experiment 3 is selected. In Experiment 3 as well, a peak vector most similar to f ⁽¹⁾ (t _{1 (1)} ) is searched for in the same procedure as above (S804 to S806). This will be repeated until the S-th ^{experiment, f} (1) _{when (t 1 (1))} and the most similar to the peak vector each other are obtained for all ^{_{experiments, f (1) (t 1}} (1) ) Is an internal standard peak vector (S809), and f ⁽¹⁾ (t _{1 (1)} ) and similar f ^(s) _max (s = 2, 3,..., S) Map as a set of standard peak vectors. In step S810, a further set of internal standard peak vectors is searched.

一方、ｆ^（１）（ｔ_１（１））と互いに最も類似するピークベクトルが全ての実験について得られなかった場合は、ｆ^（１）（ｔ_１（１））は内部標準ピークベクトルにはなり得ないと判断し（Ｓ８１０〜Ｓ８１１）、実験１の次のピークベクトルｆ^（１）（ｔ_２（１））について、同様の処理を行う（Ｓ８０４〜Ｓ８０９）。 On the other ^{hand, if _{f (1) (t 1 (}} 1)) and the most similar to the peak vector each other was not obtained for all ^{experiments, _{f (1) (t 1 (}} 1)) The internal standard peak vector It is determined that this is not possible (S810 to S811), and the same processing is performed for the next peak vector f ⁽¹⁾ (t2 ₍₁₎ ) of Experiment 1 (S804 to S809).

なお、ピークベクトル同士の類似性の判定は、公知の手法に従って行うことができる。具体的には、例えばｍ／ｚが同一のピークの本数や、コサイン係数、スピアマンの相関係数などに基づいて判定することができる。 Note that the similarity between peak vectors can be determined according to a known method. Specifically, for example, determination can be made based on the number of peaks having the same m / z, a cosine coefficient, a Spearman correlation coefficient, and the like.

この処理を、実験１の全てのピークベクトルについて行うことによって、１又は複数の内部標準ピークベクトルのセットを得ることができる。ここで、内部標準ピークベクトルがＮセット得られたとすると、得られた内部標準ピークベクトルＲｆ^(ｓ)(Ｔ_ｎ(ｓ)) （ｎ＝１，２，…，Ｎ）のセットは、次の表３のように表すことができる。 By performing this process for all the peak vectors of Experiment 1, a set of one or more internal standard peak vectors can be obtained. Here, _assuming that N sets of internal standard peak vectors are obtained, the set of the obtained internal standard peak vectors Rf ^(s) (T _{n (s)} ) (n = 1, 2,..., N) is It can be expressed as in Table 3.

ただし、Ｔ_ｎ(ｓ)は、実験ｓにおいてｎセット目の内部標準ピークベクトルとして認定されたピークベクトルの保持時間を示すものであり、また、Ｔ_１(ｓ)＜Ｔ_２(ｓ)＜…＜Ｔ_ｎ(ｓ)＜…＜Ｔ_Ｎ(ｓ)であるものとする。 However, T _{n (s)} indicates the retention time of the peak vector recognized as the n-th internal standard peak vector in the experiment s, and T _{1 (s)} <T _{2 (s)} <. <T _{n (s)} <... <T _{N (s)} .

なお、この内部標準同定工程は、内部標準同定部２２によって行われる。 This internal standard identification step is performed by the internal standard identification unit 22.

（２−３．内部標準確認部／内部標準確認工程）
続いて、内部標準確認工程では、内部標準確認部２３が、上記の内部標準同定工程によって得られた内部標準ピークベクトルの対応関係をユーザに対して視覚的に提示する。具体的には、各実験から得られた内部標準ピークベクトルのそれぞれをマススペクトルとして表示する。ユーザは、表示されたマススペクトル同士を実験間で比較検討し、得られた内部標準ピークが適切なものであるか否かを判断する。 (2-3. Internal standard confirmation part / Internal standard confirmation process)
Subsequently, in the internal standard confirmation step, the internal standard confirmation unit 23 visually presents the correspondence relationship between the internal standard peak vectors obtained in the internal standard identification step to the user. Specifically, each of the internal standard peak vectors obtained from each experiment is displayed as a mass spectrum. The user compares the displayed mass spectra between experiments, and determines whether the obtained internal standard peak is appropriate.

そして、得られた内部標準ピークが不適であると判断した場合、ユーザは、内部標準ピークを削除したり、ある実験についての内部標準ピークベクトルを他のピークベクトルに変更したりするように、スペクトル解析装置１の内部標準確認部２３に対してキーボードやマウス操作によって指示を与える。 If it is determined that the obtained internal standard peak is inappropriate, the user can delete the internal standard peak or change the internal standard peak vector for a certain experiment to another peak vector. An instruction is given to the internal standard confirmation unit 23 of the analysis apparatus 1 by a keyboard or mouse operation.

これを受けて、内部標準確認部２３は、得られた内部標準ピークベクトルのセットをユーザの指示に従って変更する。具体的には、内部標準ピークベクトルの削除が指示された場合は、上記の表３に示すテーブルから、該当する内部標準ピークベクトルを抹消する。一方、内部標準ピークベクトルを他のピークベクトルに変更するよう指示された場合は、上記のテーブルの該当する内部標準ピークベクトルの保持時間を書き換える。 In response to this, the internal standard confirmation unit 23 changes the obtained set of internal standard peak vectors in accordance with a user instruction. Specifically, when the deletion of the internal standard peak vector is instructed, the corresponding internal standard peak vector is deleted from the table shown in Table 3 above. On the other hand, when it is instructed to change the internal standard peak vector to another peak vector, the holding time of the corresponding internal standard peak vector in the above table is rewritten.

（２−４．保持時間補正部／保持時間補正工程）
次に保持時間補正工程では、得られた内部標準ピークベクトルに対応付けられた保持時間に基づいて、実験間で一致しない保持時間を補正する。具体的には、任意の１つの実験を基準実験として選択し、基準実験の内部標準ピークベクトルに対応する保持時間と、他の実験の内部標準ピークベクトルに対応する保持時間とが一致するように、他の実験の保持時間を線形補間によって補正する。 (2-4. Holding Time Correction Unit / Holding Time Correction Step)
Next, in the holding time correction step, the holding time that does not match between experiments is corrected based on the holding time associated with the obtained internal standard peak vector. Specifically, any one experiment is selected as the reference experiment, and the holding time corresponding to the internal standard peak vector of the reference experiment is matched with the holding time corresponding to the internal standard peak vector of the other experiment. The retention time of other experiments is corrected by linear interpolation.

ここで、基準実験としてｃ番目の実験を選択する。そして、実験ｓの保持時間ｔ_ｋ（ｓ）を補正して得られる補正後の保持時間をｔｃ_ｋ（ｓ）とすると、ｔｃ_ｋ（ｓ）は、以下の線形補間により算出することができる。 Here, the c-th experiment is selected as the reference experiment. When the corrected holding time obtained by correcting the holding time t _{k (s)} of the experiment s is tc _{k (s)} , tc _{k (s)} can be calculated by the following linear interpolation.

まず、ｔ_ｋ（ｓ）＜Ｔ_１（ｃ）の場合は、ｔｃ_ｋ（ｓ）は、次の式（９） First, in the case of t _{k (s)} <T _{1 (c)} , tc _{k (s)} is expressed by the following equation (9)

によって算出することができる。一方、Ｔ_ｉ（ｃ）＜ｔ_ｋ（ｓ）＜Ｔ_{ｉ＋１（ｃ）} （ｉ＝１，２，…，Ｎ−１）の場合は、ｔｃ_ｋ（ｓ）は、次の式（１０） Can be calculated. On the other hand, when T _{i (c)} <t _{k (s)} <T _{i + 1 (c)} (i = 1, 2,..., N−1), tc _{k (s)} is expressed by the following equation (10).

によって算出することができる。また、ｔ_ｋ（ｓ）＞Ｔ_Ｎ（ｃ）の場合は、ｔｃ_ｋ（ｓ）は、次の式（１１） Can be calculated. When t _{k (s)} > T _{N (c)} , tc _{k (s)} is expressed by the following equation (11).

によって算出することができる。 Can be calculated.

以上の計算により、基準実験ｃ以外の実験における保持時間が、基準実験ｃの保持時間に沿うように補正される。すなわち、各実験の２次元ピークデータの保持時間は、基準実験ｃの２次元ピークデータの保持時間のスケールに補間されたことになる。この保持時間補正工程は、保持時間補正部２４によって行われる。 With the above calculation, the holding time in the experiment other than the reference experiment c is corrected so as to follow the holding time of the reference experiment c. That is, the retention time of the two-dimensional peak data of each experiment is interpolated to the scale of the retention time of the two-dimensional peak data of the reference experiment c. This holding time correction step is performed by the holding time correction unit 24.

なお、上記の説明では、線形補間によって保持時間を補正したが、本発明はこれに限定されるものではなく、非線形補間によって保持時間を補正してもよい。 In the above description, the holding time is corrected by linear interpolation. However, the present invention is not limited to this, and the holding time may be corrected by nonlinear interpolation.

保持時間が補正された２次元ピークデータは、次の表４のように表すことができる。 The two-dimensional peak data with the holding time corrected can be expressed as shown in Table 4 below.

また、トータルピークデータは次の表５のようになる。 The total peak data is as shown in Table 5 below.

（２−５．第２ピークベクトル作成部／第２ピークベクトル作成工程）
この工程では、第１ピークベクトル作成工程と同様の処理を、保持時間が補正された２次元ピークデータについて行う。すなわち、上記の表４に示す２次元ピークデータから、各保持時間ごとに、トータルピーク群に含まれる各ピークのピーク強度を要素とするピークベクトルを作成する。このピークベクトルの作成は、全ての実験について行われる。 (2-5. Second peak vector creation unit / second peak vector creation step)
In this step, the same processing as in the first peak vector creation step is performed on the two-dimensional peak data with the holding time corrected. That is, a peak vector having the peak intensity of each peak included in the total peak group as an element is created for each holding time from the two-dimensional peak data shown in Table 4 above. This peak vector is created for all experiments.

ｓ番目の実験における保持時間ｔｃ_ｋ（ｓ）のピークベクトルをＦ^（ｓ）（ｔｃ_ｋ（ｓ））とすると、Ｆ^（ｓ）（ｔｃ_ｋ（ｓ））は、次の式（１２）
Ｆ^（ｓ）（ｔｃ_ｋ（ｓ））＝（Ｐ^(ｓ)(ｔｃ_ｋ(ｓ))_１，Ｐ^(ｓ)(ｔｃ_ｋ(ｓ))_２，…
…，Ｐ^(ｓ)(ｔｃ_ｋ(ｓ))_ｍ，…，Ｐ^(ｓ)(ｔｃ_ｋ(ｓ))_Ｍ） …（１２）
によって表すことができる。 Assuming that the peak vector of the holding time tc _{k (s)} in the s-th experiment is F ^(s) (tc _{k (s)} ), F ^(s) (tc _{k (s)} ) is expressed by the following equation (12):
F ^(s) (tck _(s) ) = (P ^(s) (tck _(s) ) ₁ , P ^(s) (tkk _(s) ) ₂ , ...
..., P ^(s) (tck _(s) ) _m , ..., P ^(s) (tkk _(s) ) _M ) (12)
Can be represented by

この工程は、換言すれば、各実験について、保持時間が補正された２次元ピークデータから保持時間ごとにマススペクトルデータを抽出する工程であるともいえる。この第２ピークベクトル作成工程は、第２ピークベクトル作成部２５によって行われる。 In other words, it can be said that this step is a step of extracting mass spectrum data for each holding time from the two-dimensional peak data with the holding time corrected for each experiment. The second peak vector creation step is performed by the second peak vector creation unit 25.

（２−６．対応判定部／対応判定工程、テーブル作成部／テーブル作成工程）
次に、ユーザなどにより、異なる２次元ピークデータから抽出されたピークベクトル同士の類似性を判定する際に用いる閾値Ｃを設定する。また、同様に、ユーザなどにより、ピークベクトル同士を比較する際の保持時間の許容範囲を規定するための閾値Ｔを設定する。 (2-6. Correspondence determination unit / correspondence determination step, table creation unit / table creation step)
Next, a threshold value C used when a user or the like determines the similarity between peak vectors extracted from different two-dimensional peak data is set. Similarly, the user sets a threshold T for defining an allowable range of holding time when comparing peak vectors.

そして、テーブル作成部２７は、実験１の各トータルピーク群（マススペクトル）の識別情報をテーブルの第１行に代入する。このとき、テーブルの各セルに格納される識別情報をＴＩ（ｔｃ_{１ｉ（１）}）^（１）（ｉ＝１，２，…，Ｎ_１）と表記する。ただし、Ｎ_１は実験１の２次元ピークデータに含まれるトータルピーク群の総数を示す。また、第ｋ列においてはじめてテーブルに代入された要素をＴＩ（ｔｃ_{ｋｉ（ｓ）}）^（ｓ）とする。その結果、テーブルは次の表６のようになる。 Then, the table creation unit 27 substitutes the identification information of each total peak group (mass spectrum) of Experiment 1 into the first row of the table. At this time, the identification information stored in each cell of the table is expressed as TI (tc _{1i (1)} ) ⁽¹⁾ (i = 1, 2,..., N ₁ ). N ₁ represents the total number of total peak groups included in the two-dimensional peak data of Experiment 1. In addition, let TI (tc _{ki (s)} ) ^(s) be the element assigned to the table for the first time in the k-th column. As a result, the table is as shown in Table 6 below.

次に、対応判定部２６が、実験２の各ピークベクトルを、実験１のピークベクトルと比較する。まず、対応判定部２６は、実験２のピークベクトルの中から、Ｆ^（２）（ｔｃ_１（２））を選択する。そして、実験１のピークベクトルＦ^（１）（ｔｃ_ｋ（１））（ｋ＝１，２，…，Ｋ）のうち、次の式（１３）
ｔｃ_１（２）−Ｔ＜ｔｃ_ｋ（１）＜ｔｃ_１（２）＋Ｔ …（１３）
を満たすＦ^（１）（ｔｃ_ｋ（１））の全てと、選択したＦ^（２）（ｔｃ_１（２））との類似性をそれぞれ評価する。 Next, the correspondence determination unit 26 compares each peak vector of Experiment 2 with the peak vector of Experiment 1. First, the correspondence determination unit 26 selects F ⁽²⁾ (tc _{1 (2)} ) from the peak vectors in Experiment 2. Of the peak vectors F ⁽¹⁾ (tk _{k (1)} ) (k = 1, 2,..., K) of Experiment 1, the following equation (13)
tc _{1 (2)} -T <tc _{k (1)} <tc _{1 (2)} + T (13)
The similarity between all of F ⁽¹⁾ (tc _{k (1)} ) satisfying and selected F ⁽²⁾ (tc _{1 (2)} ) is evaluated.

類似性の評価には、例えばｍ／ｚが同一のピークの本数や、コサイン係数、スピアマンの相関係数などを用いることができる。対応判定部２６は、これらを用いて２つのピークベクトルの類似度を数値化し、この類似度を閾値Ｃと比較する。ここで、比較対象となる実験１のピークベクトルの中に、類似度が閾値Ｃを超えるものがない場合、対応判定部２６は、Ｆ^（２）（ｔｃ_１（２））は、実験１のピークベクトルの何れとも対応しないと判定される。 For example, the number of peaks having the same m / z, a cosine coefficient, a Spearman correlation coefficient, and the like can be used for evaluating the similarity. The correspondence determination unit 26 quantifies the similarity between the two peak vectors using these, and compares this similarity with the threshold value C. Here, if there is no peak vector of Experiment 1 to be compared that has a similarity exceeding the threshold value C, the correspondence determination unit 26 determines that F ⁽²⁾ (tc _{1 (2)} ) It is determined that none of the peak vectors correspond.

この場合、テーブル作成部２７は、Ｆ^（２）（ｔｃ_１（２））に対応するトータルピーク群の識別情報を第２行目の第Ｎ_１＋１列目に格納する。格納される識別情報は、ＴＩ（ｔｃ_{２１（２）}）^（２）となる。 In this case, the table creation unit 27 stores the identification information of the total peak group corresponding to F ⁽²⁾ (tc _{1 (2)} ) in the N ₁ +1 column of the second row. The stored identification information is TI (tc _{21 (2)} ) ⁽²⁾ .

一方、比較対象となる実験１のピークベクトルの中に、類似度が閾値Ｃを超えるものがある場合、対応判定部２６は、Ｆ^（２）（ｔｃ_１（２））と最も類似度の高い実験１のピークベクトルＦ^（１）（ｔｃ_ｉ（１））が、Ｆ^（２）（ｔｃ_１（２））と対応するベクトルであると判定する。 On the other hand, in the case where the peak vector of Experiment 1 to be compared has a similarity that exceeds the threshold C, the correspondence determination unit 26 has the highest similarity with F ⁽²⁾ (tc _{1 (2)} ). It is determined that the peak vector F ⁽¹⁾ (tc _{i (1)} ) in Experiment 1 is a vector corresponding to F ⁽²⁾ (tc _{1 (2)} ).

この場合、テーブル作成部２７は、Ｆ^（２）（ｔｃ_１（２））に対応するトータルピーク群の識別情報を、第２行目で、実験１のトータルピーク群ＴＩ（ｔｃ_{１ｉ（１）}）^（１）と同じ列に格納する。格納される識別情報は、ＴＩ（ｔｃ_{１１（２）}）^（２）となる。 In this case, the table creating unit 27 displays the identification information of the total peak group corresponding to F ⁽²⁾ (tc _{1 (2)} ) on the second line, the total peak group TI (tc _{1i (1) of} Experiment _1). ) Store in the same column as ⁽¹⁾ . The stored identification information is TI (tc _{11 (2)} ) ⁽²⁾ .

同様にして、実験２の他のピークベクトルＦ^（２）（ｔｃ_ｈ（２））（ｈ＝２，３，…，Ｎ_２）についても、対応判定部２６が、次の式（１４）
ｔｃ_ｈ（２）−Ｔ＜ｔｃ_ｋ（１）＜ｔｃ_ｈ（２）＋Ｔ …（１４）
を満たす実験１のピークベクトルＦ^（１）（ｔｃ_ｋ（１））と比較し、閾値Ｃを超えるものがない場合は、テーブル作成部２７がテーブルの第２行目の右側の新しい列に識別情報を格納し、閾値Ｃを超えるものがある場合は、テーブル作成部２７が第２行目の、最も類似性の高い実験１のピークベクトルに対応するトータルピーク群の識別情報と同じ列に識別情報を格納する。 Similarly, for the other peak vectors F ⁽²⁾ (t c _{h (2)} ) (h = 2, 3,..., N ₂ ) in Experiment 2, the correspondence determination unit 26 uses the following equation (14).
tc _{h (2)} −T <tc _{k (1)} <t _{h h (2)} + T (14)
If there is nothing exceeding the threshold C compared with the peak vector F ⁽¹⁾ (t _{k k (1)} ) of Experiment 1 that satisfies the conditions, the table creation unit 27 identifies the new column on the right side of the second row of the table When information is stored and there is an item that exceeds the threshold C, the table creation unit 27 identifies the second row in the same column as the identification information of the total peak group corresponding to the peak vector of the most similar experiment 1 Store information.

その結果、テーブルは次の表７のようになる。 As a result, the table is as shown in Table 7 below.

同様に、実験ｓについて、対応判定部２６が、ピークベクトルＦ^（ｓ）（ｔｃ_ｊ（ｓ））を実験ｒ（ｒ＝１，２，…，ｓ−１）のピークベクトルＦ^（ｒ）（ｔｃ_ｍ（ｒ））（ただし、ｔｃ_ｊ（ｓ）−Ｔ＜ｔｃ_ｍ（ｒ）＜ｔｃ_ｊ（ｓ）＋Ｔ）と比較する。そして、類似度が閾値Ｃを超えるものがない場合は、テーブル作成部２７が第ｓ行目の新しい列ｚに、Ｆ^（ｓ）（ｔｃ_ｊ（ｓ））に対応するトータルピーク群の識別情報ＴＩ（ｔｃ_{ｚｊ（ｓ）}）^（ｓ）を格納する。 Similarly, for the experiment s, the correspondence determination unit 26 converts the peak vector F ^(s) (tc _{j (s)} ) to the peak vector F ^(r) (b) of the experiment r (r = 1, 2,..., S−1). tc _{m (r)} ) (where tc _{j (s)} −T <tc _{m (r)} <tc _{j (s)} + T). If there is no similarity exceeding the threshold value C, the table creation unit 27 identifies the total peak group identification information corresponding to F ^(s) (tc _{j (s)} ) in the new column z of the s-th row. TI (tc _{zj (s)} ) ^(s) is stored.

一方、類似度が閾値Ｃを超えるものがある場合は、対応判定部２６が最も類似するピークベクトルＦ^（ｒ）（ｔｃ_ｎ（ｒ））を同定し、テーブル作成部２７が第ｓ行目の、ピークベクトルＦ^（ｒ）（ｔｃ_ｎ（ｒ））に対応するトータルピーク群と同じ列に識別情報を格納する。 On the other hand, if the similarity exceeds the threshold C, the correspondence determination unit 26 identifies the most similar peak vector F ^(r) (tcn _(r) ), and the table creation unit 27 performs the s-th row The identification information is stored in the same column as the total peak group corresponding to the peak vector F ^(r) (tcn _(r) ).

以上の処理を、実験Ｓまで繰り返すことによって、テーブルは、次の表８のようになる。なお、対応するトータルピーク群がないセルについては、０が格納されるものとする。 By repeating the above processing until Experiment S, the table becomes as shown in Table 8 below. It is assumed that 0 is stored for a cell having no corresponding total peak group.

以上のようにして得られたテーブルでは、同一の実験から得られたトータルピーク群（マススペクトル）の識別情報が同じ行に配列されるとともに、対応するトータルピーク群（マススペクトル）の識別情報が同じ列に配列されることになる。このテーブルから、同じ列に配列されたトータルピーク群（マススペクトル）同士は、同一の化合物由来のものであることが示される。 In the table obtained as described above, the identification information of the total peak group (mass spectrum) obtained from the same experiment is arranged in the same row, and the identification information of the corresponding total peak group (mass spectrum) is displayed. They will be arranged in the same column. From this table, it is shown that the total peak groups (mass spectra) arranged in the same column are derived from the same compound.

そして、０以外の値が格納されているセルについて、列ごとに、格納されているトータルピーク群に対応する保持時間の平均値を求め、その平均値を列の代表保持時間とする。 Then, with respect to the cells in which values other than 0 are stored, the average value of the retention times corresponding to the stored total peak group is obtained for each column, and the average value is set as the representative retention time of the column.

以上の工程により、同一の化合物由来とみなすことのできるトータルピーク群（マススペクトル）を検出し行列に表現することが可能となった（表８）。それぞれのトータルピーク群に対応するピークベクトルと精製票品又は市販のマスライブラリーのピークベクトルとの共通性により、トータルピーク群の化合物の同定を行うことができる。このとき、同じ列に配列され、グルーピングされたトータルピーク群のピークベクトルについて代表ピークベクトルを定義することにより物質の検索効率を高めることができる。 Through the above steps, a total peak group (mass spectrum) that can be regarded as originating from the same compound can be detected and expressed in a matrix (Table 8). The compound of the total peak group can be identified based on the commonality between the peak vector corresponding to each total peak group and the peak vector of the purification slip or the commercially available mass library. At this time, by defining a representative peak vector for the peak vectors of the total peak group arranged in the same column and grouped, the substance search efficiency can be increased.

以下では、簡単な例を用いてテーブルを作成する工程を説明する。例として、３つの実験から３つの補正された２次元ピークデータが得られたとする。そして、各２次元ピークデータからは、それぞれ３つのマススペクトルが得られたとする。ここで、実験１のピークベクトルは、Ｆ^（１）（ｔｃ_１（１）），Ｆ^（１）（ｔｃ_２（１）），Ｆ^（１）（ｔｃ_３（１））であり、実験２のピークベクトルは、Ｆ^（２）（ｔｃ_１（２）），Ｆ^（２）（ｔｃ_２（２）），Ｆ^（２）（ｔｃ_３（２））であり、実験３のピークベクトルは、Ｆ^（３）（ｔｃ_１（３）），Ｆ^（３）（ｔｃ_２（３）），Ｆ^（３）（ｔｃ_３（３））である。 Hereinafter, a process of creating a table will be described using a simple example. As an example, assume that three corrected two-dimensional peak data are obtained from three experiments. It is assumed that three mass spectra are obtained from each two-dimensional peak data. Here, the peak vectors of Experiment 1 are F ⁽¹⁾ (tc _{1 (1)} ), F ⁽¹⁾ (tc _{2 (1)} ), F ⁽¹⁾ (tc _{3 (1)} ), and Experiment 2 The peak vectors of F ⁽²⁾ (tc _{1 (2)} ), F ⁽²⁾ (tc _{2 (2)} ), F ⁽²⁾ (tc _{3 (2)} ), and the peak vector of Experiment 3 is F ⁽³⁾ (tc _{1 (3)} ), F ⁽³⁾ (tc _{2 (3)} ), F ⁽³⁾ (tc _{3 (3)} ).

また、ｔｃ_１（１）とｔｃ_１（２）とｔｃ_１（２）との保持時間の違いは閾値Ｔ以内であり、同様に、ｔｃ_２（１）とｔｃ_２（２）とｔｃ_２（２）との保持時間の違いも閾値Ｔ以内であり、また、ｔｃ_３（１）とｔｃ_３（２）とｔｃ_３（２）との保持時間の違いも閾値Ｔ以内であるとする。 Also, the difference in retention time between tc _{1 (1)} , tc _{1 (2)} and tc _{1 (2)} is within the threshold T, and similarly, tc _{2 (1)} , tc _{2 (2)} and tc _{2 (} The difference in holding time from ₂₎ is also within the threshold T, and the difference in holding time between tc _{3 (1)} , tc _{3 (2),} and tc _{3 (2)} is also within the threshold T.

そして、Ｆ^（１）（ｔｃ_２（１））とＦ^（２）（ｔｃ_２（２））との類似度は閾値Ｃよりも大きく、Ｆ^（１）（ｔｃ_２（１））とＦ^（３）（ｔｃ_２（３））との類似度も閾値Ｃよりも大きものとする。そして、それ以外のピークベクトル同士は、類似性を示さないものとする。 The similarity between F ⁽¹⁾ (tc _{2 (1)} ) and F ⁽²⁾ (tc _{2 (2)} ) is greater than the threshold C, and F ⁽¹⁾ (tc _{2 (1)} ) and F ^{( 3)} The similarity with (tc _{2 (3)} ) is also larger than the threshold value C. And other peak vectors do not show similarity.

このとき、テーブル作成部２７は、まず、テーブルの第１行目の各セルに、実験１のそれぞれのマススペクトルの識別情報を格納する。ここでは、マススペクトルの識別情報として、ピークベクトルの識別情報を格納することとする。その結果、次の表９のようになる。 At this time, the table creation unit 27 first stores the identification information of each mass spectrum of Experiment 1 in each cell in the first row of the table. Here, it is assumed that peak vector identification information is stored as mass spectrum identification information. As a result, the following Table 9 is obtained.

続いて、対応判定部２６が、実験２のピークベクトルＦ^（２）（ｔｃ_１（２））と、実験１のピークベクトルＦ^（１）（ｔｃ_１（１））とを比較する。ここで、類似度は閾値Ｃ以下であるため、これらのピークベクトル同士は対応しないと判定される。その結果、ピークベクトルＦ^（２）（ｔｃ_１（２））は、第２行第４列目に格納される。 Subsequently, the correspondence determination unit 26 compares the peak vector F ⁽²⁾ (tc _{1 (2)} ) of Experiment 2 with the peak vector F ⁽¹⁾ (tc _{1 (1)} ) of Experiment 1. Here, since the similarity is equal to or less than the threshold value C, it is determined that these peak vectors do not correspond to each other. As a result, the peak vector F ⁽²⁾ (tc _{1 (2)} ) is stored in the second row and the fourth column.

次に、対応判定部２６が、実験２のピークベクトルＦ^（２）（ｔｃ_２（２））と、実験１のピークベクトルＦ^（１）（ｔｃ_２（１））とを比較する。ここで、類似度は閾値Ｃよりも大きく、類似度が最も高いいため、これらのピークベクトル同士は対応すると判定される。その結果、ピークベクトルＦ^（２）（ｔｃ_２（２））は、第２行第２列目に格納される。 Next, the correspondence determination unit 26 compares the peak vector F ⁽²⁾ (tc _{2 (2)} ) of Experiment 2 with the peak vector F ⁽¹⁾ (tc _{2 (1)} ) of Experiment 1. Here, since the similarity is greater than the threshold C and the similarity is the highest, it is determined that these peak vectors correspond to each other. As a result, the peak vector F ⁽²⁾ (tc _{2 (2)} ) is stored in the second row and second column.

続いて、対応判定部２６が、実験２のピークベクトルＦ^（２）（ｔｃ_３（２））と、実験１のピークベクトルＦ^（１）（ｔｃ_３（１））とを比較する。ここで、類似度は閾値Ｃ以下であるため、これらのピークベクトル同士は対応しないと判定される。その結果、ピークベクトルＦ^（２）（ｔｃ_３（２））は、第２行第５列目に格納される。 Subsequently, the correspondence determination unit 26 compares the peak vector F ⁽²⁾ (tc _{3 (2)} ) of Experiment 2 with the peak vector F ⁽¹⁾ (tc _{3 (1)} ) of Experiment 1. Here, since the similarity is equal to or less than the threshold value C, it is determined that these peak vectors do not correspond to each other. As a result, the peak vector F ⁽²⁾ (tc _{3 (2)} ) is stored in the second row and the fifth column.

実験３についても同様にして、ピークベクトルＦ^（３）（ｔｃ_１（３））は第３行第６列目、ピークベクトルＦ^（３）（ｔｃ_２（３））は第３行第２列目、ピークベクトルＦ^（３）（ｔｃ_３（３））は第３行第７列目にそれぞれ格納される。 Similarly for Experiment 3, the peak vector F ⁽³⁾ (tc _{1 (3)} ) is in the third row and the sixth column, and the peak vector F ⁽³⁾ (tc _{2 (3)} ) is in the third row and the second column. The eye and peak vector F ⁽³⁾ (tc _{3 (3)} ) are respectively stored in the third row and the seventh column.

その結果、テーブルは、次の表１０のようになる。 As a result, the table is as shown in Table 10 below.

このテーブルでは、実験１の保持時間ｔｃ_２（１）のマススペクトルと、実験２の保持時間ｔｃ_２（２）のマススペクトルと、実験３の保持時間ｔｃ_２（３）のマススペクトルとが、同一の化合物由来のものであることが表現されている。そして、これらのマススペクトルに含まれる各ピークがマススペクトル間で対応することになる。 In this table, the mass spectrum of retention time tc _{2 (1)} of Experiment 1, the mass spectrum of retention time tc _{2 (2)} of Experiment 2, and the mass spectrum of retention time tc _{2 (3)} of Experiment 3 are It is expressed that they are derived from the same compound. And each peak contained in these mass spectra will correspond between mass spectra.

そして、２列目のマススペクトルの代表保持時間は、次の式（１５）
｛ｔｃ_２（１）＋ｔｃ_２（２）＋ｔｃ_２（３）｝／３ …（１５）
によって求められる。 The representative retention time of the mass spectrum in the second column is given by the following equation (15)
{Tc _{2 (1)} + tc _{2 (2)} + tc _{2 (3)} } / 3 (15)
Sought by.

（３．その他）
最後に、スペクトル解析装置１の各ブロック、特に平滑化部１１、ピーク時間同定部１２、スペクトル強度合体部１３、トータルピーク作成部１４、トータルピーク合体部１５、ピーク合体部１６、第１ピークベクトル作成部２１、内部標準同定部２２、内部標準確認部２３、保持時間補正部２４、第２ピークベクトル作成部２５、対応判定部２６、及びテーブル作成部２７は、ハードウェアロジックによって構成してもよいし、次のようにＣＰＵを用いてソフトウェアによって実現してもよい。 (3. Other)
Finally, each block of the spectrum analyzer 1, particularly the smoothing unit 11, the peak time identification unit 12, the spectral intensity coalescence unit 13, the total peak creation unit 14, the total peak coalescence unit 15, the peak coalescence unit 16, and the first peak vector The creation unit 21, the internal standard identification unit 22, the internal standard confirmation unit 23, the retention time correction unit 24, the second peak vector creation unit 25, the correspondence determination unit 26, and the table creation unit 27 may be configured by hardware logic. Alternatively, it may be realized by software using a CPU as follows.

すなわち、スペクトル解析装置１は、各機能を実現する制御プログラムの命令を実行するＣＰＵ（central processing unit）、上記プログラムを格納したＲＯＭ（read only memory）、上記プログラムを展開するＲＡＭ（random access memory）、上記プログラムおよび各種データを格納するメモリ等の記憶装置（記録媒体）などを備えている。そして、本発明の目的は、上述した機能を実現するソフトウェアであるスペクトル解析装置１の制御プログラムのプログラムコード（実行形式プログラム、中間コードプログラム、ソースプログラム）をコンピュータで読み取り可能に記録した記録媒体を、スペクトル解析装置１に供給し、そのコンピュータ（またはＣＰＵやＭＰＵ）が記録媒体に記録されているプログラムコードを読み出し実行することによっても、達成可能である。 That is, the spectrum analyzer 1 includes a CPU (central processing unit) that executes instructions of a control program that realizes each function, a ROM (read only memory) that stores the program, and a RAM (random access memory) that expands the program. And a storage device (recording medium) such as a memory for storing the program and various data. An object of the present invention is to provide a recording medium in which a program code (execution format program, intermediate code program, source program) of a control program of the spectrum analyzer 1 which is software for realizing the above-described functions is recorded so as to be readable by a computer. This can also be achieved by supplying the spectrum analysis apparatus 1 and reading and executing the program code recorded on the recording medium by the computer (or CPU or MPU).

上記記録媒体としては、例えば、磁気テープやカセットテープ等のテープ系、フロッピー（登録商標）ディスク／ハードディスク等の磁気ディスクやＣＤ−ＲＯＭ／ＭＯ／ＭＤ／ＤＶＤ／ＣＤ−Ｒ等の光ディスクを含むディスク系、ＩＣカード（メモリカードを含む）／光カード等のカード系、あるいはマスクＲＯＭ／ＥＰＲＯＭ／ＥＥＰＲＯＭ／フラッシュＲＯＭ等の半導体メモリ系などを用いることができる。 Examples of the recording medium include a tape system such as a magnetic tape and a cassette tape, a magnetic disk such as a floppy (registered trademark) disk / hard disk, and an optical disk such as a CD-ROM / MO / MD / DVD / CD-R. Card system such as IC card, IC card (including memory card) / optical card, or semiconductor memory system such as mask ROM / EPROM / EEPROM / flash ROM.

また、スペクトル解析装置１を通信ネットワークと接続可能に構成し、上記プログラムコードを通信ネットワークを介して供給してもよい。この通信ネットワークとしては、特に限定されず、例えば、インターネット、イントラネット、エキストラネット、ＬＡＮ、ＩＳＤＮ、ＶＡＮ、ＣＡＴＶ通信網、仮想専用網（virtual private network）、電話回線網、移動体通信網、衛星通信網等が利用可能である。また、通信ネットワークを構成する伝送媒体としては、特に限定されず、例えば、ＩＥＥＥ１３９４、ＵＳＢ、電力線搬送、ケーブルＴＶ回線、電話線、ＡＤＳＬ回線等の有線でも、ＩｒＤＡやリモコンのような赤外線、Ｂｌｕｅｔｏｏｔｈ（登録商標）、８０２．１１無線、ＨＤＲ、携帯電話網、衛星回線、地上波デジタル網等の無線でも利用可能である。なお、本発明は、上記プログラムコードが電子的な伝送で具現化された、搬送波に埋め込まれたコンピュータデータ信号の形態でも実現され得る。 Moreover, the spectrum analyzer 1 may be configured to be connectable to a communication network, and the program code may be supplied via the communication network. The communication network is not particularly limited. For example, the Internet, intranet, extranet, LAN, ISDN, VAN, CATV communication network, virtual private network, telephone line network, mobile communication network, satellite communication. A net or the like is available. Also, the transmission medium constituting the communication network is not particularly limited. For example, even in the case of wired such as IEEE 1394, USB, power line carrier, cable TV line, telephone line, ADSL line, etc., infrared rays such as IrDA and remote control, Bluetooth ( (Registered trademark), 802.11 wireless, HDR, mobile phone network, satellite line, terrestrial digital network, and the like can also be used. The present invention can also be realized in the form of a computer data signal embedded in a carrier wave in which the program code is embodied by electronic transmission.

本発明は上述した各実施形態に限定されるものではなく、請求項に示した範囲で種々の変更が可能であり、異なる実施形態にそれぞれ開示された技術的手段を適宜組み合わせて得られる実施形態についても本発明の技術的範囲に含まれる。 The present invention is not limited to the above-described embodiments, and various modifications are possible within the scope shown in the claims, and embodiments obtained by appropriately combining technical means disclosed in different embodiments. Is also included in the technical scope of the present invention.

上述した実施形態に係るスペクトル解析装置が有用であることを検証する。本実施例では、薬剤処理したアラビドプシス培養細胞Ｔ８７でのＧＣ／ＭＳ時系列測定における保持時間の対応づけを行った。 It is verified that the spectrum analysis apparatus according to the above-described embodiment is useful. In this example, the retention times in the GC / MS time-series measurement were performed on the drug-treated Arabidopsis cultured cells T87.

シロイヌナズナにおいて薬剤Ａに応答した代謝産物の変動を解析するため、薬剤Ａで処理したシロイヌナズナ培養細胞Ｔ８７株のＧＣ／ＭＳ解析を行った。新鮮な液体培地に継代後１０日目のシロイヌナズナ培養細胞に、終濃度５００μＭとなるように薬剤Ａ（ＤＭＳＯに溶解）を添加し、添加直後（０分）又は５分後、及び３０分後、並びに１，２，４，６，８，１０，１２，１６，２０，２４，３６，４８，９６時間後に細胞を回収した。回収した細胞は蒸留水で洗浄したのち、吸引脱水し、-８０℃に保存した。また、薬剤Ａの代わりに同量のＤＭＳＯを添加し、上記と同様の時間後に回収した細胞をコントロールとした。細胞１００ｍｇから、メタノール、水、及びクロロホルムを用いた液層−液層分配法により生体成分を抽出後、水溶性画分をメトキシアミン及びＭＳＴＦＡにより誘導体化したのち、ＬＥＣＯ社のＰｅｇａｓｕｓＩＩＩ(ＧＣ−ＴＯＦ−ＭＳ)により分析を行った。ＴＯＦ−ＭＳによるデータ取得は、８２〜５００までのｍ／ｚについて、２０回／秒で２２分間行った。分析データは、各ｍ／ｚごとのシグナル強度を保持したＣＳＶファイルとして出力し、その結果、１サンプルについて約１４７ＭＢのＣＳＶファイルを得ることができた。このＣＳＶファイルを上述した実施形態のスペクトル解析装置によって解析した。図１４にスペクトル解析装置のメニュー画面を示す。 In order to analyze changes in metabolites in response to drug A in Arabidopsis thaliana, GC / MS analysis of Arabidopsis cultured cell strain T87 treated with drug A was performed. Drug A (dissolved in DMSO) is added to the cultured cells of Arabidopsis thaliana on the 10th day after subculture in a fresh liquid medium, and added immediately (0 minutes), 5 minutes, and 30 minutes after addition. And after 1, 2, 4, 6, 8, 10, 12, 16, 20, 24, 36, 48, 96 hours. The collected cells were washed with distilled water, dehydrated by suction, and stored at −80 ° C. In addition, the same amount of DMSO was added instead of the drug A, and the cells collected after the same time as above were used as a control. A biological component was extracted from 100 mg of cells by a liquid layer-liquid layer partition method using methanol, water, and chloroform, and a water-soluble fraction was derivatized with methoxyamine and MSTFA, and then Pegasus III (GC-) of LECO was used. Analysis was performed by TOF-MS). Data acquisition by TOF-MS was performed at 20 times / second for 22 minutes for m / z from 82 to 500. The analysis data was output as a CSV file holding the signal intensity for each m / z, and as a result, a CSV file of about 147 MB for one sample could be obtained. This CSV file was analyzed by the spectrum analyzer of the embodiment described above. FIG. 14 shows a menu screen of the spectrum analyzer.

まず、各ｍ／ｚごとのクロマトグラム情報を、保持時間とピークエリア（ピーク強度）が対応した情報へと変換した。この結果、平均ファイルサイズが約２．５ＭＢのファイルへと変換することができた。次に、今回の解析に用いたカラムにおける保持時間を側鎖長の異なる標準脂肪酸群を基準とした保持時間指数（ｒｅｔｅｎｔｉｏｎｉｎｄｅｘ）に変換し、サンプル相互で比較できるようにした。分析の際に内部標準として抽出サンプルに加えておいたリビトールを基準として、リビトールのピーク強度が１００となるようにピーク強度の補正を行った。補正されたデータを用い、保持時間指数のサンプル間における補正（ピークドリフトの補正）を行うための内在性内部標準の検出を行った。その結果、薬剤Ａ処理サンプル及びコントロールサンプルのサンプリング時間１６点における３反復実験（合計９６分析）中、内在性の内部標準の候補として、３６のピークを検出することができた。検出された内在性内部標準が、サンプル間で同じフラグメンテーションパターンを示し、同一物質であることを確認するため、評価を行った。この解析においては、保持時間の線形補正に用いる内部標準として、リビトールを含む９の化合物ピークを選択した（図１５（ａ），図１５（ｂ）)。出力されたファイル（平均約２７０ｋＢ）を用い、サンプル間における化合物ピークのマッチングを行った。マッチングを行った結果を可視化した結果を図１６に示す。図１６では、上部の表において、各行は保持時間指数の平均を、各列はサンプルを示しており、ピーク強度の大小を赤色の濃淡で示している。表中、右側４８サンプルが薬剤Ａ処理したＴ８７培養細胞の時系列変化、左側４８サンプルが、無処理コントロールの変化を示す。処理した細胞とコントロール細胞とで化合物蓄積の消長に違いが認められる化合物が検出された。検出されたピークのフラグメンテーションパターンを、既知物質のフラグメンテーションパターン（及び保持時間）と比較することにより、ｍｙｏ‐ｉｎｏｓｉｔｏｌ、ｇａｍｍａ‐Ａｍｉｎｏｂｕｔｙｒａｔｅ、Ｄ‐ｍａｌａｔｅ、Ｅｔｈａｎｏｌａｍｉｎｅ、Ｓｕｃｒｏｓｅなど、いくつかの化合物について、そのピークを同定することができた。 First, chromatogram information for each m / z was converted into information corresponding to retention time and peak area (peak intensity). As a result, the file could be converted into a file having an average file size of about 2.5 MB. Next, the retention time in the column used in this analysis was converted into a retention index based on standard fatty acid groups having different side chain lengths so that the samples could be compared with each other. The peak intensity was corrected so that the peak intensity of ribitol was 100, based on ribitol added to the extracted sample as an internal standard at the time of analysis. Using the corrected data, detection of an endogenous internal standard for correction between retention time index samples (peak drift correction) was performed. As a result, 36 peaks could be detected as endogenous internal standard candidates in 3 replicate experiments (96 analyzes in total) at 16 sampling times of the drug A-treated sample and the control sample. Evaluation was performed to confirm that the detected endogenous internal standard showed the same fragmentation pattern between samples and was the same substance. In this analysis, nine compound peaks containing ribitol were selected as internal standards used for linear correction of retention time (FIGS. 15 (a) and 15 (b)). Using the output file (average of about 270 kB), the compound peaks were matched between samples. The result of visualizing the result of matching is shown in FIG. In FIG. 16, in the upper table, each row indicates the average retention time index, each column indicates a sample, and the magnitude of the peak intensity is indicated by red shading. In the table, 48 samples on the right side show time-series changes of T87 cultured cells treated with drug A, and 48 samples on the left side show changes in untreated control. A compound in which the difference in the accumulation of the compound was recognized between the treated cell and the control cell was detected. By comparing the fragmentation pattern of the detected peak with the fragmentation pattern (and retention time) of known substances, the peak for some compounds such as myo-inositol, gamma-aminobutyrate, D-malate, ethanolamine, sucrose, etc. Could be identified.

本発明によれば、クロマトグラフィーと質量分析との組み合わせに基づいて得られたデータから、化合物の特徴を示すピークの位置を正確に同定することができる。また、本発明によれば、クロマトグラフィーと質量分析との組み合わせに基づいて複数の実験から得られた２次元ピークデータのセットにおいて、同一の化合物に由来するマススペクトルを実験間で対応付けることができる。従って、ＧＣ／ＭＳやＬＣ／ＭＳによって得られたデータを解析する解析装置に利用することができる。 According to the present invention, it is possible to accurately identify the position of a peak indicating the characteristics of a compound from data obtained based on a combination of chromatography and mass spectrometry. Moreover, according to the present invention, in a set of two-dimensional peak data obtained from a plurality of experiments based on a combination of chromatography and mass spectrometry, mass spectra derived from the same compound can be associated between experiments. . Therefore, the present invention can be used for an analysis apparatus that analyzes data obtained by GC / MS or LC / MS.

本発明の一実施形態を示すものであり、スペクトル解析装置のピーク対応付け部の詳細な機能構成を示すブロック図である。1, showing an embodiment of the present invention, is a block diagram illustrating a detailed functional configuration of a peak association unit of a spectrum analyzer. FIG. ＧＣ／ＭＳやＬＣ／ＭＳによって得られた２次元スペクトルデータを可視化した図である。It is the figure which visualized the two-dimensional spectrum data obtained by GC / MS or LC / MS. 従来の技術を示すものであり、マスクロマトグラムデータからピークデータへの変換方法を示す図である。It is a figure which shows the prior art and shows the conversion method from mass chromatogram data to peak data. ２次元ピークデータからマススペクトルを抽出する方法を示す図である。It is a figure which shows the method of extracting a mass spectrum from two-dimensional peak data. 従来の技術を示すものであり、マスクロマトグラムデータからピークデータへ変換する工程を示す図である。It is a figure which shows the prior art and shows the process of converting from mass chromatogram data to peak data. 本発明の一実施形態を示すものであり、スペクトル解析装置の要部構成を示すブロック図である。BRIEF DESCRIPTION OF THE DRAWINGS FIG. 1 illustrates an embodiment of the present invention and is a block diagram illustrating a main configuration of a spectrum analysis apparatus. 本発明の一実施形態を示すものであり、スペクトル解析装置のピークデータ作成部の詳細な機能構成を示すブロック図である。1, showing an embodiment of the present invention, is a block diagram illustrating a detailed functional configuration of a peak data creation unit of a spectrum analyzer. FIG. 本発明の一実施形態を示すものであり、スペクトル強度合体部による処理工程を示す図である。It is a figure which shows one Embodiment of this invention and shows the process process by a spectrum intensity | strength uniting part. 本発明の一実施形態を示すものであり、トータルピーク作成工程及びトータルピーク合体工程によるデータの変換を示す図である。FIG. 5 is a diagram illustrating an embodiment of the present invention, and is a diagram illustrating data conversion by a total peak creation step and a total peak merge step. 本発明の一実施形態を示すものであり、トータルピーク合体工程によるデータの変換を示す図である。FIG. 5 is a diagram illustrating an embodiment of the present invention and illustrating data conversion by a total peak coalescing process. 本発明の一実施形態を示すものであり、ピーク合体工程後の２次元スペクトルデータを示す図である。FIG. 5 is a diagram illustrating one-dimensional embodiment of the present invention and showing two-dimensional spectrum data after a peak coalescence step. 本発明の一実施形態を示すものであり、内部標準同定部による処理工程を示す図である。It is a figure which shows one Embodiment of this invention and shows the process process by an internal standard identification part. 本発明の一実施形態を示すものであり、（ａ）及び（ｂ）は内部標準同定工程におけるピークベクトル同士の比較方法を示す図である。FIG. 2 shows an embodiment of the present invention, and (a) and (b) are diagrams showing a method for comparing peak vectors in an internal standard identification step. 本発明の一実施例を示すものであり、スペクトル解析装置のメニュー画面を示す図である。It is a figure which shows one Example of this invention and shows the menu screen of a spectrum analyzer. 本発明の一実施例を示すものであり、（ａ）は内部標準同定部によって同定した内在性内部標準ピークを評価した例を示す図であり、（ｂ）は内在性内部標準の候補として選ばれた化合物ピークの、各サンプルにおけるフラグメントパターンの例を示す図であり、（ａ）の表中で選択されているピークについて、８サンプルのマスフラグメントパターンを示した。BRIEF DESCRIPTION OF THE DRAWINGS FIG. 1 shows an embodiment of the present invention, (a) is a diagram showing an example of evaluating an intrinsic internal standard peak identified by an internal standard identification unit, and (b) is selected as a candidate for an intrinsic internal standard. It is a figure which shows the example of the fragment pattern in each sample of the obtained compound peak, The mass fragment pattern of 8 samples was shown about the peak selected in the table | surface of (a). 本発明の一実施例を示すものであり、サンプル間のマススペクトルの対応関係を示す図である。It is a figure which shows one Example of this invention and shows the correspondence of the mass spectrum between samples.

Explanation of symbols

１スペクトル解析装置（情報処理装置）
１１平滑化部
１２ピーク時間同定部（２次元ピークデータ作成部、第２ピーク位置同定部）
１３スペクトル強度合体部（２次元ピークデータ作成部）
１４トータルピーク作成部
１５トータルピーク合体部（第１ピーク位置同定部）
１６ピーク合体部（第１ピーク位置同定部）
２１第１ピークベクトル作成部（第１マススペクトル抽出部）
２２内部標準同定部
２４保持時間補正部
２５第２ピークベクトル作成部（第２マススペクトル抽出部）
２６対応判定部
２７テーブル作成部 1 Spectrum analyzer (information processing equipment)
11 Smoothing unit 12 Peak time identification unit (two-dimensional peak data creation unit, second peak position identification unit)
13 Spectral intensity coalescence part (two-dimensional peak data creation part)
14 Total peak creation part 15 Total peak uniting part (1st peak position identification part)
16 Peak coalescence part (first peak position identification part)
21 1st peak vector preparation part (1st mass spectrum extraction part)
22 internal standard identification unit 24 retention time correction unit 25 second peak vector creation unit (second mass spectrum extraction unit)
26 Correspondence determination unit 27 Table creation unit

Claims

Corresponding between two-dimensional peak data in a two-dimensional peak data set that includes multiple two-dimensional peak data, which is information on retention time and peak ion intensity for each m / z obtained based on the combination of chromatography and mass spectrometry An information processing apparatus for identifying a mass spectrum to be performed,
A first mass spectrum extraction unit for extracting mass spectrum data from each two-dimensional peak data for each holding time;
Internal standard identification that creates an internal standard mass spectral data set that includes similar mass spectral data extracted from each two-dimensional peak data by comparing mass spectral data extracted from different two-dimensional peak data And
Based on the retention time corresponding to each mass spectrum data included in the internal standard mass spectrum data set, a retention time correction unit that corrects the retention time in each two-dimensional peak data;
A second mass spectrum extraction unit for extracting new mass spectrum data for each holding time from the two-dimensional peak data with the holding time corrected;
Whether or not the new mass spectra compared with each other correspond to each other by comparing the new mass spectrum data in which the difference in the retention time is within a predetermined range between the two-dimensional peak data whose retention times are corrected. An information processing apparatus comprising: a correspondence determination unit that determines whether or not.

The information processing apparatus according to claim 1, wherein the internal standard identification unit compares mass spectrum data having a retention time difference within a predetermined range between two-dimensional peak data.

The information processing apparatus according to claim 1 or 2, wherein the first mass spectrum extraction unit extracts each mass spectrum data as a vector having an ion intensity for each m / z as an element.

The second mass spectrum extraction unit extracts each new mass spectrum data as a new vector having an ion intensity for each m / z as an element,
The correspondence determination unit calculates the similarity between new vectors extracted from different corrected two-dimensional peak data, and compares the calculated similarity with a threshold value, so that mass spectrum data correspond to each other. The information processing apparatus according to claim 1, wherein it is determined whether or not.

A table creation unit for creating a correspondence table in which identification information of each new mass spectrum is arranged in the same row or column for new mass spectra determined to correspond by the correspondence determination unit. The information processing apparatus according to any one of claims 1 to 4, wherein the information processing apparatus is characterized in that:

Two-dimensional information about retention time and peak ion intensity for each m / z from two-dimensional spectrum data, which is information about retention time and ion intensity for each m / z, based on a combination of chromatography and mass spectrometry A two-dimensional peak data creation unit for creating peak data;
For the two-dimensional peak data, a total peak creation unit that calculates the sum of peak ion intensities for each holding time as total peak intensity and creates total peak data that is information on the total peak intensity for each holding time;
In the total peak data, a section in which the total peak intensity is always greater than or equal to the threshold is identified, and a retention time in which the total peak intensity is maximized in the section is included in the section of the two-dimensional peak data. An information processing apparatus comprising: a first peak position identifying unit that identifies a true retention time corresponding to a peak to be detected.

The information processing apparatus according to claim 6, wherein the first peak position identifying unit further associates the peak ion intensity included in the section of the two-dimensional peak data with the identified true retention time.

The two-dimensional peak data creation unit is
Mass chromatogram data is extracted for each m / z from the two-dimensional spectrum data, which is information on the retention time and ion intensity for each m / z, obtained based on the combination of chromatography and mass spectrometry. A second peak position identifying unit for identifying a temporary peak retention time based on a retention time at which the ionic strength is maximized in the gram data;
In each mass chromatogram data, the peak ion intensity is calculated by adding the ion intensity at the holding time adjacent to the temporary peak holding time to the ion intensity at the temporary peak holding time, and the calculated peak ion intensity is The information processing apparatus according to claim 6, further comprising a spectrum intensity merging unit that creates the two-dimensional peak data by associating with a provisional peak retention time.

Mass chromatogram data is extracted for each m / z from the two-dimensional spectrum data, which is information on the retention time and ion intensity for each m / z, obtained based on the combination of chromatography and mass spectrometry. The smoothing part which produces the two-dimensional spectrum data input into the said two-dimensional peak data creation part by smoothing the ion intensity in Gram data is further provided. The information processing apparatus according to item 1.

A program for operating the information processing apparatus according to any one of claims 1 to 9, wherein the program causes a computer to function as each unit.

A computer-readable recording medium on which the program according to claim 10 is recorded.