JP2025150825A

JP2025150825A - Apparatus, method and program

Info

Publication number: JP2025150825A
Application number: JP2024051947A
Authority: JP
Inventors: 駿河合; Shun Kawai; 宏哉近藤; Hiroya Kondo; 史織飯野; Shiori Iino; 貴子清水; Takako Shimizu
Original assignee: Sumitomo Chemical Co Ltd
Current assignee: Sumitomo Chemical Co Ltd
Priority date: 2024-03-27
Filing date: 2024-03-27
Publication date: 2025-10-09
Also published as: WO2025204577A1

Abstract

To improve analysis accuracy in analyzing the state of various microorganisms for a microbiota.SOLUTION: An apparatus has: an acquisition unit that acquires a spectral image created by spectrally dividing a photographed signal obtained by photographing an object; a feature quantity calculation unit that calculates a feature quantity representing a texture on the basis of the spectral image; and a prediction unit that inputs the feature quantity calculated for an object for prediction to a trained model trained by using data for training including the feature quantity calculated for an object for training and data representing the state of microorganisms included in the object for training, and thereby predicts data representing the state of the microorganisms included in the object for prediction.SELECTED DRAWING: Figure 11

Description

本開示は、装置、方法及びプログラムに関する。 This disclosure relates to an apparatus, a method, and a program.

家畜の腸内細菌等を調べる方法として、例えば、家畜の回腸内容物や糞便等を採取し、細菌叢を解析する方法が知られている。一般に、細菌叢の解析には、培養法による解析や１６ＳｒＲＮＡ解析等が用いられる。 One known method for examining the intestinal bacteria of livestock is to collect the ileal contents or feces of livestock and analyze the bacterial flora. Generally, bacterial flora analysis is performed using culture methods or 16S rRNA analysis.

一方で、細菌叢を含む微生物叢を解析する方法として、対象物に所定の波長の光を照射することで得られる反射スペクトルを用いる方法が提案されている。当該解析方法によれば、対象物に含まれる微生物の種類を、低コストで解析することができる。 On the other hand, a method has been proposed for analyzing microbial flora, including bacterial flora, that uses the reflectance spectrum obtained by irradiating a target object with light of a specific wavelength. This analysis method makes it possible to analyze the types of microorganisms contained in a target object at low cost.

特開２０１３－２３５０１４号公報JP 2013-235014 A

しかしながら、反射スペクトルを用いる上記方法は、対象物から反射する光の各波長での反射強度を解析するものであり、解析結果は、各種微生物の反射特性に依存する。このため、例えば、反射特性の異なる微生物が対象物内に混在する微生物叢の場合、上記方法では、各種微生物の状態（例えば、含有比率等）を精度よく解析することが難しい。 However, the above-mentioned method using reflectance spectra analyzes the reflection intensity at each wavelength of light reflected from the object, and the analysis results depend on the reflection characteristics of the various microorganisms. For this reason, for example, in the case of a microbial flora in which microorganisms with different reflection characteristics coexist within the object, it is difficult to accurately analyze the state of the various microorganisms (e.g., content ratio, etc.) using the above-mentioned method.

本開示は、微生物叢について、各種微生物の状態を解析する際の解析精度を向上させることを目的としている。 The purpose of this disclosure is to improve the analytical accuracy when analyzing the state of various microorganisms in a microbiome.

一態様によれば、装置は、
対象物を撮影した撮影信号を分光することで生成される分光画像を取得する取得部と、
前記分光画像に基づいてテクスチャを表す特徴量を算出する特徴量算出部と、
学習用の対象物について算出された前記特徴量と、前記学習用の対象物に含まれる微生物の状態を表すデータと、を含む学習用データを用いて学習された学習済みモデルに、予測用の対象物について算出された前記特徴量を入力することで、前記予測用の対象物に含まれる微生物の状態を表すデータを予測する予測部とを有する。 According to one aspect, an apparatus includes:
an acquisition unit that acquires a spectral image generated by dispersing an image signal obtained by capturing an image of an object;
a feature amount calculation unit that calculates a feature amount representing a texture based on the spectral image;
The system has a prediction unit that predicts data representing the state of microorganisms contained in an object for prediction by inputting the feature values calculated for the object for prediction into a trained model that has been trained using training data including the feature values calculated for the object for learning and data representing the state of microorganisms contained in the object for learning.

本開示によれば、微生物叢について、各種微生物の状態を解析する際の解析精度を向上させることができる。 This disclosure makes it possible to improve the analytical accuracy when analyzing the state of various microorganisms in a microbiome.

微生物叢解析システムの学習フェーズにおけるシステム構成の一例を示す図である。FIG. 1 is a diagram illustrating an example of a system configuration of a microbiome analysis system in a learning phase. 微生物叢解析装置のハードウェア構成の一例を示す図である。FIG. 1 is a diagram illustrating an example of the hardware configuration of a microbiome analyzer. ハイパースペクトル画像の一例を示す図である。FIG. 1 is a diagram illustrating an example of a hyperspectral image. 微生物叢データの一例を示す図である。FIG. 10 is a diagram showing an example of microbiota data. 微生物叢解析装置の学習フェーズにおける機能構成の一例を示す図である。FIG. 1 is a diagram illustrating an example of the functional configuration of a microbiome analyzer in the learning phase. 画像特徴量算出部による処理の一例を示す図である。FIG. 10 is a diagram illustrating an example of processing by an image feature amount calculation unit. 学習用データ生成部による処理の一例を示す図である。FIG. 10 is a diagram illustrating an example of processing by a learning data generation unit. 学習部による処理の一例を示す図である。FIG. 10 is a diagram illustrating an example of processing by a learning unit. 微生物叢解析システムによる学習処理の流れを示すフローチャートの一例である。1 is an example of a flowchart showing the flow of a learning process performed by the microbiome analysis system. 微生物叢解析システムの予測フェーズにおけるシステム構成の一例を示す図である。FIG. 1 is a diagram illustrating an example of the system configuration of the microbiome analysis system in the prediction phase. 微生物叢解析装置の予測フェーズにおける機能構成の一例を示す図である。FIG. 10 is a diagram illustrating an example of the functional configuration of the microbiome analyzer in the prediction phase. 予測部による処理の一例を示す図である。FIG. 10 is a diagram illustrating an example of processing by a prediction unit. 出力部による処理の一例を示す図である。FIG. 10 is a diagram illustrating an example of processing by an output unit. 微生物叢解析システムによる予測処理の流れを示すフローチャートの一例である。1 is an example of a flowchart showing the flow of prediction processing by the microbiome analysis system. 各種微生物の含有比率を予測する学習済みモデルの予測精度の検証例を示す図である。FIG. 10 is a diagram showing an example of verification of the prediction accuracy of a trained model that predicts the content ratio of various microorganisms. 最も含有比率の高い微生物の種類を予測する学習済みモデルの予測精度の検証例を示す図である。FIG. 10 is a diagram showing an example of verification of the prediction accuracy of a trained model that predicts the type of microorganism with the highest content ratio.

以下、各実施形態について添付の図面を参照しながら説明する。なお、本明細書及び図面において、実質的に同一の機能構成を有する構成要素については、同一の符号を付することにより重複した説明を省略する。 Each embodiment will be described below with reference to the accompanying drawings. Note that in this specification and drawings, components that have substantially the same functional configuration will be assigned the same reference numerals, and redundant explanations will be omitted.

［第１の実施形態］
＜微生物叢解析システムの学習フェーズにおけるシステム構成＞
はじめに、微生物叢を解析する微生物叢解析システムのシステム構成について説明する。第１の実施形態において、微生物叢解析システムは、学習処理を行う学習フェーズと、予測処理を行う予測フェーズとで異なるシステム構成を有する。ここでは、微生物叢解析システムの学習フェーズにおけるシステム構成について説明する。 [First embodiment]
<System configuration in the learning phase of the microbiome analysis system>
First, the system configuration of a microbiome analysis system that analyzes microbiomes will be described. In the first embodiment, the microbiome analysis system has different system configurations for the learning phase, in which learning processing is performed, and the prediction phase, in which prediction processing is performed. Here, the system configuration of the microbiome analysis system in the learning phase will be described.

また、第１の実施形態において、微生物叢解析システムが解析する微生物叢は、家畜の回腸内容物や糞便に含まれる細菌叢であるとする。したがって、第１の実施形態において、「物体」は家畜を有し、「対象サンプル」（対象物）は回腸内容物または糞便を指すものとする。 Furthermore, in the first embodiment, the microbiota analyzed by the microbiota analysis system is the bacterial flora contained in the ileal contents or feces of livestock. Therefore, in the first embodiment, the "object" refers to livestock, and the "target sample" (target object) refers to the ileal contents or feces.

ただし、「物体」は家畜に限定されず、他の生物であってもよい。あるいは、土壌に含まれる微生物叢を解析するシーンにおいては、「物体」は土壌等の非生物であってもよい。また、「対象サンプル」は、回腸内容物や糞便に限定されず、他の腸内容物や排泄物であってもよく、また、物体が土壌の場合にあっては、土壌から採取した土壌の一部であってもよい。つまり、「対象サンプル」は、単一種又は複数種類の微生物を含む生体由来または環境由来のサンプルである。 However, the "object" is not limited to livestock, but may be other living organisms. Alternatively, in a scenario where the microbiome contained in soil is being analyzed, the "object" may be a non-living object such as soil. Furthermore, the "target sample" is not limited to ileal contents or feces, but may be other intestinal contents or excrement, and if the object is soil, it may be a portion of the soil collected from the soil. In other words, the "target sample" is a biologically or environmentally derived sample containing a single or multiple types of microorganisms.

図１は、微生物叢解析システムの学習フェーズにおけるシステム構成の一例を示す図である。図１に示すように、学習フェーズにおける微生物叢解析システム１００は、近赤外光出力機器１１０と、ハイパースペクトルカメラ１２０と、次世代シーケンサ１３０と、第１の実施形態に係る「装置」である微生物叢解析装置１４０とを備える。 Figure 1 shows an example of the system configuration of a microbiome analysis system in the learning phase. As shown in Figure 1, the microbiome analysis system 100 in the learning phase includes a near-infrared light output device 110, a hyperspectral camera 120, a next-generation sequencer 130, and a microbiome analysis device 140, which is the "device" according to the first embodiment.

近赤外光出力機器１１０は、近赤外光（波長＝８００［ｎｍ］～２５００ｎｍの光）を出射する機器である。近赤外光出力機器１１０から出射された近赤外光は、物体１０１から採取された学習用の対象サンプル１０２に照射される。 The near-infrared light output device 110 is a device that emits near-infrared light (light with a wavelength of 800 nm to 2500 nm). The near-infrared light emitted from the near-infrared light output device 110 is irradiated onto a learning target sample 102 collected from an object 101.

ハイパースペクトルカメラ１２０は、学習用の対象サンプル１０２から反射した反射光を撮影し、撮影信号を各波長に分光することで、波長ごとの撮影画像である分光画像（以下では、「ハイパースペクトル画像」と称す）を生成する。ハイパースペクトルカメラ１２０は、各波長のハイパースペクトル画像を、微生物叢解析装置１４０に送信する。 The hyperspectral camera 120 captures the light reflected from the learning target sample 102 and disperses the captured signal into individual wavelengths to generate spectral images (hereinafter referred to as "hyperspectral images") that are captured for each wavelength. The hyperspectral camera 120 transmits the hyperspectral images for each wavelength to the microbiome analyzer 140.

次世代シーケンサ１３０は、微生物が有する１６ＳｒＲＮＡ遺伝子をＰＣＲ（Polymerase Chain Reaction）にて増幅したうえで解析し、微生物の種類及び分布を特定する機器である。物体１０１から採取された学習用の対象サンプル１０２について解析することで、次世代シーケンサ１３０は、学習用の対象サンプル１０２についての微生物叢データを生成する。次世代シーケンサ１３０により生成される微生物叢データは、学習用の対象サンプル１０２に含まれる微生物の状態を表すデータである。第１の実施形態において、学習用の対象サンプル１０２に含まれる微生物の状態を表すデータには、例えば、
・学習用の対象サンプル１０２に含まれる微生物の種類、
・学習用の対象サンプル１０２に含まれる各種微生物の量の、全ての種類の微生物の総量に占める比率を表す含有比率、
・学習用の対象サンプル１０２に含まれる微生物のうち最も含有比率の高い微生物（つまり、学習用の対象サンプル１０２に最も多く含まれる微生物）の種類、
等の情報が含まれる。次世代シーケンサ１３０は、生成した微生物叢データを、微生物叢解析装置１４０に送信する。 The next-generation sequencer 130 is an instrument that amplifies the 16S rRNA genes of microorganisms by PCR (Polymerase Chain Reaction) and then analyzes them to identify the type and distribution of the microorganisms. By analyzing the learning target sample 102 collected from the object 101, the next-generation sequencer 130 generates microbiota data for the learning target sample 102. The microbiota data generated by the next-generation sequencer 130 is data that represents the state of the microorganisms contained in the learning target sample 102. In the first embodiment, the data that represents the state of the microorganisms contained in the learning target sample 102 includes, for example,
The type of microorganisms contained in the learning target sample 102,
A content ratio representing the ratio of the amount of each type of microorganism contained in the learning target sample 102 to the total amount of all types of microorganisms;
The type of microorganism with the highest content ratio among the microorganisms contained in the learning target sample 102 (i.e., the type of microorganism most abundant in the learning target sample 102),
The next-generation sequencer 130 transmits the generated microbiota data to the microbiota analyzer 140.

微生物叢解析装置１４０は、学習フェーズにおいて、ハイパースペクトルカメラ１２０から、学習用の対象サンプル１０２についてのハイパースペクトル画像を取得する。また、微生物叢解析装置１４０は、次世代シーケンサ１３０から、学習用の対象サンプル１０２についての微生物叢データを取得する。 During the learning phase, the microbiome analysis device 140 acquires hyperspectral images of the learning target sample 102 from the hyperspectral camera 120. The microbiome analysis device 140 also acquires microbiome data about the learning target sample 102 from the next-generation sequencer 130.

微生物叢解析装置１４０は、取得したハイパースペクトル画像と微生物叢データとに基づいて、学習用データを生成し、生成した学習用データを用いて、機械学習モデル（以下、単に「モデル」と称す）の学習を行い、学習済みモデルを生成する。微生物叢解析装置１４０により生成される学習済みモデルは、予測用の対象サンプルに含まれる微生物の状態を表すデータを予測する学習済みモデルである。第１の実施形態において、予測用の対象サンプルに含まれる微生物の状態を表すデータを予測する学習済みモデルには、
・予測用の対象サンプルに含まれる各種微生物の含有比率を予測する学習済みモデル、
・予測用の対象サンプルに含まれる微生物のうち、最も含有比率の高い微生物の種類を予測する学習済みモデル、
が含まれる。 The microbiome analyzer 140 generates training data based on the acquired hyperspectral image and microbiome data, and uses the generated training data to train a machine learning model (hereinafter simply referred to as "model") to generate a trained model. The trained model generated by the microbiome analyzer 140 is a trained model that predicts data representing the state of microorganisms contained in a target sample for prediction. In the first embodiment, the trained model that predicts data representing the state of microorganisms contained in a target sample for prediction includes:
- A trained model that predicts the content ratio of various microorganisms contained in the target sample for prediction,
A trained model that predicts the type of microorganism with the highest concentration among the microorganisms contained in the target sample for prediction,
Includes:

＜微生物叢解析装置のハードウェア構成＞
次に、微生物叢解析装置１４０のハードウェア構成について説明する。図２は、微生物叢解析装置のハードウェア構成の一例を示す図である。図２に示すように、微生物叢解析装置１４０は、プロセッサ２０１、メモリ２０２、補助記憶デバイス２０３、接続デバイス２０４、通信デバイス２０５、ドライブ機器２０６を有する。なお、微生物叢解析装置１４０に含まれる各ハードウェアは、バス２０７を介して相互に接続されている。 <Hardware configuration of the microbiota analyzer>
Next, the hardware configuration of the microbiome analyzer 140 will be described. Fig. 2 is a diagram showing an example of the hardware configuration of a microbiome analyzer. As shown in Fig. 2, the microbiome analyzer 140 has a processor 201, memory 202, an auxiliary storage device 203, a connection device 204, a communication device 205, and a drive device 206. The pieces of hardware included in the microbiome analyzer 140 are connected to each other via a bus 207.

プロセッサ２０１は、ＣＰＵ（Central Processing Unit）、ＧＰＵ（Graphics Processing Unit）等の各種演算デバイスを有する。プロセッサ２０１は、各種プログラム（例えば、第１の実施形態に係る「プログラム」である微生物叢解析プログラム等）をメモリ２０２上に読み出して実行する。 The processor 201 has various computing devices such as a CPU (Central Processing Unit) and a GPU (Graphics Processing Unit). The processor 201 reads various programs (such as the microbiome analysis program, which is the "program" according to the first embodiment) into the memory 202 and executes them.

メモリ２０２は、ＲＯＭ（Read Only Memory）、ＲＡＭ（Random Access Memory）等の主記憶デバイスを有する。プロセッサ２０１とメモリ２０２とは、いわゆるコンピュータを形成し、プロセッサ２０１が、メモリ２０２上に読み出した各種プログラムを実行することで、当該コンピュータは各種機能を実現する。 Memory 202 includes a primary storage device such as ROM (Read Only Memory) and RAM (Random Access Memory). The processor 201 and memory 202 form what is known as a computer, and the computer realizes various functions by the processor 201 executing various programs read onto memory 202.

補助記憶デバイス２０３は、各種プログラムや、各種プログラムがプロセッサ２０１によって実行される際に用いられる各種情報を格納する。後述する画像格納部５１１、データ格納部５１２、学習用データ格納部５１３、学習済みモデル格納部５１４は、補助記憶デバイス２０３において実現される。 The auxiliary storage device 203 stores various programs and various information used when the various programs are executed by the processor 201. The image storage unit 511, data storage unit 512, training data storage unit 513, and trained model storage unit 514, which will be described later, are realized in the auxiliary storage device 203.

接続デバイス２０４は、外部機器（操作機器２１１、表示機器２１２等）と接続する接続デバイスである。 The connection device 204 is a connection device that connects to external devices (such as an operation device 211 and a display device 212).

通信デバイス２０５は、ハイパースペクトルカメラ１２０、次世代シーケンサ１３０との間で各種情報を送受信するための通信デバイスである。 The communication device 205 is a communication device for sending and receiving various information between the hyperspectral camera 120 and the next-generation sequencer 130.

ドライブ機器２０６は記録媒体２１３をセットするための機器である。ここでいう記録媒体２１３には、ＣＤ－ＲＯＭ、フレキシブルディスク、光磁気ディスク等のように情報を光学的、電気的あるいは磁気的に記録する媒体が含まれる。また、記録媒体２１３には、ＲＯＭ、フラッシュメモリ等のように情報を電気的に記録する半導体メモリ等が含まれていてもよい。 The drive device 206 is a device for loading the recording medium 213. The recording medium 213 here includes media that record information optically, electrically, or magnetically, such as CD-ROMs, flexible disks, and magneto-optical disks. The recording medium 213 may also include semiconductor memory that records information electrically, such as ROM and flash memory.

なお、補助記憶デバイス２０３にインストールされる各種プログラムは、例えば、配布された記録媒体２１３がドライブ機器２０６にセットされ、該記録媒体２１３に記録された各種プログラムがドライブ機器２０６により読み出されることでインストールされる。あるいは、補助記憶デバイス２０３にインストールされる各種プログラムは、通信デバイス２０５を介して不図示のネットワークからダウンロードされることで、インストールされてもよい。 The various programs installed in the auxiliary storage device 203 are installed, for example, by setting the distributed recording medium 213 in the drive device 206 and reading the various programs recorded on the recording medium 213 using the drive device 206. Alternatively, the various programs installed in the auxiliary storage device 203 may be installed by downloading them from a network (not shown) via the communication device 205.

＜ハイパースペクトル画像の具体例＞
次に、ハイパースペクトルカメラ１２０により生成されるハイパースペクトル画像の具体例について説明する。図３は、ハイパースペクトル画像の一例を示す図である。 <Examples of hyperspectral images>
Next, a specific example of a hyperspectral image generated by the hyperspectral camera 120 will be described. Fig. 3 is a diagram showing an example of a hyperspectral image.

図３において、横軸は波長を表す。また、図３において、符号３００＿１、符号３００＿２、符号３００＿３、・・・符号３００＿８１等は、ハイパースペクトルカメラ１２０により生成された、各波長のハイパースペクトル画像である。 In Figure 3, the horizontal axis represents wavelength. Also, in Figure 3, reference numerals 300_1, 300_2, 300_3, ..., 300_81, etc. represent hyperspectral images of each wavelength generated by the hyperspectral camera 120.

図３の例は、符号３００＿１が波長＝９００［ｎｍ］のハイパースペクトル画像を、符号３００＿２が波長＝９１０［ｎｍ］のハイパースペクトル画像を、符号３００＿３が波長＝９２０［ｎｍ］のハイパースペクトル画像をそれぞれ表している。また、図３の例は、符号３００＿８１が波長＝１７００［ｎｍ］のハイパースペクトル画像を表している。 In the example of Figure 3, reference numeral 300_1 represents a hyperspectral image with a wavelength of 900 [nm], reference numeral 300_2 represents a hyperspectral image with a wavelength of 910 [nm], and reference numeral 300_3 represents a hyperspectral image with a wavelength of 920 [nm]. In the example of Figure 3, reference numeral 300_81 represents a hyperspectral image with a wavelength of 1700 [nm].

このように、ハイパースペクトルカメラ１２０により生成されるハイパースペクトル画像は、
・波長範囲：９００「ｎｍ」～１７００［ｎｍ］、
・波長の刻み幅：１０［ｎｍ］、
・画像枚数：８１、
であるとする。ただし、ハイパースペクトルカメラ１２０により生成されるハイパースペクトル画像は、上記波長範囲に限定されず、異なる波長範囲であってもよい。また、ハイパースペクトルカメラ１２０により生成されるハイパースペクトル画像は、上記波長の刻み幅に限定されず、異なる刻み幅であってもよい。 In this way, the hyperspectral image generated by the hyperspectral camera 120 is
・Wavelength range: 900 "nm" to 1700 [nm],
Wavelength step size: 10 [nm]
・Number of images: 81,
However, the hyperspectral image generated by the hyperspectral camera 120 is not limited to the above wavelength range, and may be in a different wavelength range. Furthermore, the hyperspectral image generated by the hyperspectral camera 120 is not limited to the above wavelength interval, and may be in a different interval.

図３において、符号３００＿１、符号３００＿２、符号３００＿３、・・・符号３００＿８１等として示した、各波長のハイパースペクトル画像の各画素値は、対象サンプル１０２の各位置での、各波長の光の反射強度に相関する。 In Figure 3, each pixel value of the hyperspectral image for each wavelength, indicated by symbols 300_1, 300_2, 300_3, ..., 300_81, etc., correlates with the reflection intensity of light of each wavelength at each position on the target sample 102.

したがって、対象サンプル１０２内に、例えば、特定の波長の光を反射する所定の反射特性を有する微生物が分布していた場合、当該特定の波長のハイパースペクトル画像は、テクスチャ状の画像となる。具体的には、当該特定の波長のハイパースペクトル画像は、当該微生物の対象サンプル１０２内での各位置に対応する画像内の各位置に、濃淡が現れるテクスチャ状の画像となる。テクスチャとは、画像内の何らかの規則的な細かな濃度変化が表す模様を指す。 Therefore, for example, if microorganisms with specific reflection properties that reflect light of a specific wavelength are distributed within the target sample 102, the hyperspectral image of that specific wavelength will be a textured image. Specifically, the hyperspectral image of that specific wavelength will be a textured image in which shading appears at each position within the image that corresponds to each position of the microorganism within the target sample 102. Texture refers to a pattern expressed by some kind of regular, fine variation in density within the image.

なお、ここでいう微生物とは、微生物そのものであってもよいし、微生物によって分解されることで生成された代謝産物等（微生物の関連物質）であってもよい。つまり、対象サンプル１０２に含まれる微生物の状態を表すデータには、微生物及び微生物の関連物質の状態を表すデータが含まれる。 Note that the microorganisms referred to here may be the microorganisms themselves, or may be metabolic products (substances related to the microorganisms) produced by decomposition by the microorganisms. In other words, data representing the state of the microorganisms contained in the target sample 102 includes data representing the state of the microorganisms and substances related to the microorganisms.

＜微生物叢データの具体例＞
次に、次世代シーケンサ１３０により生成される微生物叢データの具体例について説明する。図４は、微生物叢データの一例を示す図である。 <Examples of microbiome data>
Next, a specific example of the microbiome data generated by the next-generation sequencer 130 will be described. Fig. 4 is a diagram showing an example of the microbiome data.

図４に示すように、微生物叢データ４００は、情報の項目として、"対象サンプル"、"微生物種"、"含有比率"、"高含有微生物種"を含む。 As shown in Figure 4, the microbiome data 400 includes information items such as "target sample," "microbial species," "content ratio," and "high-content microbial species."

"対象サンプル"には、微生物叢データ４００の生成に用いられた学習用の対象サンプル（例えば、対象サンプル１０２）を識別するための識別子（図４の例では、「サンプル１」）が格納される。 "Target sample" stores an identifier ("Sample 1" in the example of Figure 4) for identifying the learning target sample (e.g., target sample 102) used to generate the microbiome data 400.

"微生物種"には、微生物叢データ４００の生成に用いられた学習用の対象サンプル（例えば、対象サンプル１０２）に含まれる微生物の種類（図４の例では、「微生物Ａ」、「微生物Ｂ」、「微生物Ｃ」・・・等）が格納される。 "Microorganism species" stores the type of microorganism (in the example of Figure 4, "Microorganism A," "Microorganism B," "Microorganism C," etc.) contained in the learning target sample (e.g., target sample 102) used to generate the microbiome data 400.

"含有比率"には、微生物叢データ４００の生成に用いられた学習用の対象サンプル（例えば、対象サンプル１０２）に含まれる各種微生物の含有比率（図４の例では、「ａ％」、「ｂ％」、「ｃ％」、・・・等）が格納される。 "Content ratio" stores the content ratio (in the example of Figure 4, "a%, "b%, "c%, ...", etc.) of various microorganisms contained in the learning target sample (e.g., target sample 102) used to generate the microbiome data 400.

"高含有微生物種"には、微生物叢データ４００の生成に用いられた学習用の対象サンプル（例えば、対象サンプル１０２）に含まれる微生物のうち、最も含有比率が高い微生物の種類（図４の例では、「微生物Ｆ」）が格納される。 "Highly abundant microbial species" stores the type of microorganism with the highest abundance ("microorganism F" in the example of Figure 4) among the microorganisms contained in the learning target sample (e.g., target sample 102) used to generate the microbiome data 400.

＜微生物叢解析装置の機能構成＞
次に、微生物叢解析装置の学習フェーズにおける機能構成について説明する。図５は、微生物叢解析装置の学習フェーズにおける機能構成の一例を示す図である。上述したように、微生物叢解析装置１４０には、微生物叢解析プログラムがインストールされており、当該微生物叢解析プログラムが学習フェーズにおいて実行されることで、微生物叢解析装置１４０は、
・ハイパースペクトル画像取得部５０１、
・画像特徴量算出部５０２、
・微生物叢データ取得部５０３、
・学習用データ生成部５０４、
・学習部５０５、
として機能する。 <Functional configuration of the microbiome analyzer>
Next, the functional configuration of the microbiome analyzer in the learning phase will be described. FIG. 5 is a diagram showing an example of the functional configuration of the microbiome analyzer in the learning phase. As described above, a microbiome analysis program is installed in the microbiome analyzer 140, and by executing this microbiome analysis program in the learning phase, the microbiome analyzer 140:
Hyperspectral image acquisition unit 501,
Image feature amount calculation unit 502,
- Microbiota data acquisition unit 503,
A learning data generation unit 504,
Learning unit 505,
It functions as:

ハイパースペクトル画像取得部５０１は、ハイパースペクトルカメラ１２０から各波長のハイパースペクトル画像を取得し、画像格納部５１１に格納する。 The hyperspectral image acquisition unit 501 acquires hyperspectral images for each wavelength from the hyperspectral camera 120 and stores them in the image storage unit 511.

画像特徴量算出部５０２は、画像格納部５１１に格納された各波長のハイパースペクトル画像を読み出し、波長ごとに、画像特徴量を算出し、学習用データ生成部５０４に通知する。 The image feature calculation unit 502 reads out the hyperspectral images for each wavelength stored in the image storage unit 511, calculates the image feature for each wavelength, and notifies the learning data generation unit 504.

画像特徴量算出部５０２により算出される画像特徴量は、各波長のハイパースペクトル画像におけるテクスチャを表す特徴量である。テクスチャを表す特徴量とは、テクスチャの性質を定量的に表したものを指す。 The image features calculated by the image feature calculation unit 502 are features that represent the texture in the hyperspectral image for each wavelength. Features that represent the texture refer to quantitative representations of the texture properties.

このように、テクスチャを表す特徴量を算出することで、画像特徴量算出部５０２は、対象サンプルに含まれる微生物の分布を、波長ごとに捉えることができる。このため、従来のように、単に各波長での反射強度を解析する方法（反射スペクトルを用いる方法）と比較して、画像特徴量算出部５０２によれば、対象サンプルに含まれる微生物の量を適確に表す特徴量を算出することが可能になる。 By calculating features representing texture in this way, the image feature calculation unit 502 can capture the distribution of microorganisms contained in the target sample for each wavelength. Therefore, compared to conventional methods that simply analyze the reflection intensity at each wavelength (methods that use the reflection spectrum), the image feature calculation unit 502 can calculate features that accurately represent the amount of microorganisms contained in the target sample.

この結果、当該テクスチャを表す特徴量を用いることで、反射特性の異なる微生物が対象サンプル内に混在する場合であっても、各種微生物の状態（例えば、含有比率等）を精度よく解析することが可能になる。 As a result, by using the features representing this texture, it is possible to accurately analyze the state of various microorganisms (e.g., content ratio, etc.) even when microorganisms with different reflection characteristics are mixed within the target sample.

なお、テクスチャを表す特徴量には、各画素の画素値の度数分布を表すヒストグラムに基づく特徴量（統計量）、差分統計量、濃度共起行列、フーリエ特徴量等、任意の特徴量が含まれるが、第１の実施形態では、このうち、ヒストグラムに基づく特徴量を用いる。 Note that features representing texture include any feature, such as a feature (statistic) based on a histogram representing the frequency distribution of pixel values for each pixel, a difference statistic, a density co-occurrence matrix, or a Fourier feature. Of these, the first embodiment uses a feature based on a histogram.

微生物叢データ取得部５０３は、次世代シーケンサ１３０から微生物叢データ４００を取得し、データ格納部５１２に格納する。 The microbiome data acquisition unit 503 acquires the microbiome data 400 from the next-generation sequencer 130 and stores it in the data storage unit 512.

学習用データ生成部５０４は、画像特徴量算出部５０２から画像特徴量が通知されると、データ格納部５１２から、対応する微生物叢データ４００を読み出し、学習用データを生成する。学習用データ生成部５０４により生成される学習用データは、
・画像特徴量算出部５０２から通知された、学習用の対象サンプル１０２についての画像特徴量の全部または一部を入力データとする。
・データ格納部５１２から読み出された微生物叢データ４００のうち、学習用の対象サンプル１０２に含まれる微生物の状態を表すデータを正解データとする。 When the learning data generation unit 504 receives the image feature amount from the image feature amount calculation unit 502, it reads the corresponding microbiota data 400 from the data storage unit 512 and generates learning data. The learning data generated by the learning data generation unit 504 is
All or part of the image features of the learning target sample 102 notified by the image feature calculation unit 502 are used as input data.
Of the microbiome data 400 read from the data storage unit 512, data that represents the state of the microorganisms contained in the learning target sample 102 is taken as the correct answer data.

具体的には、第１の実施形態において、学習用データ生成部５０４は、
・学習用の対象サンプル１０２に含まれる各種微生物の含有比率を正解データとする学習用データ、
・学習用の対象サンプル１０２に含まれる微生物のうち最も含有比率の高い微生物の種類を正解データとする学習用データ、
の少なくとも２種類の学習用データを生成する。 Specifically, in the first embodiment, the learning data generation unit 504
Learning data in which the content ratios of various microorganisms contained in the learning target sample 102 are used as correct answer data;
Learning data in which the type of microorganism with the highest content ratio among the microorganisms contained in the learning target sample 102 is used as the correct answer data;
At least two types of learning data are generated.

学習用データ生成部５０４は、生成した２種類の学習用データを、学習用データ格納部５１３に格納する。学習用データ格納部５１３には、学習用の複数の対象サンプルについて学習用データ生成部５０４により生成された学習用データが格納される。 The learning data generation unit 504 stores the two types of learning data it has generated in the learning data storage unit 513. The learning data storage unit 513 stores the learning data generated by the learning data generation unit 504 for multiple target samples for learning.

学習部５０５は、学習用データ格納部５１３に格納された学習用データを用いてモデルの学習を行うことで、予測用の対象サンプルに含まれる微生物の状態を表すデータを予測する学習済みモデルを生成し、学習済みモデル格納部５１４に格納する。 The learning unit 505 uses the learning data stored in the learning data storage unit 513 to train the model, thereby generating a trained model that predicts data representing the state of microorganisms contained in the target sample for prediction, and stores the trained model in the trained model storage unit 514.

学習部５０５により生成される学習済みモデルには、
・学習用の対象サンプル１０２に含まれる各種微生物の含有比率を正解データとする学習用データを用いて学習された学習済みモデル（予測用の対象サンプルにおいて、各種微生物の含有比率を予測する学習済みモデル）、
・学習用の対象サンプル１０２に含まれる微生物のうち最も含有比率の高い微生物の種類を正解データとする学習用データを用いて学習された学習済みモデル（予測用の対象サンプルにおいて、最も含有比率の高い微生物の種類を予測する学習済みモデル）、
が含まれる。なお、各種微生物の含有比率を予測する学習済みモデルは、微生物の種類の数に応じた数だけ生成される。 The trained model generated by the training unit 505 includes:
A trained model trained using training data in which the content ratios of various microorganisms contained in the training target sample 102 are used as correct answer data (a trained model that predicts the content ratios of various microorganisms in a prediction target sample),
A trained model trained using training data in which the type of microorganism with the highest content ratio among the microorganisms contained in the training target sample 102 is used as the correct answer data (a trained model that predicts the type of microorganism with the highest content ratio in the prediction target sample),
The number of trained models that predict the content ratio of each type of microorganism is generated according to the number of types of microorganisms.

＜微生物叢解析装置の各機能部による処理の具体例＞
次に、微生物叢解析装置１４０の学習フェーズにおける各機能部（ここでは、画像特徴量算出部５０２、学習用データ生成部５０４、学習部５０５）による処理の具体例について説明する。 <Specific examples of processing by each functional unit of the microbiome analyzer>
Next, a specific example of processing by each functional unit (here, the image feature calculation unit 502, the learning data generation unit 504, and the learning unit 505) in the learning phase of the microbiome analyzer 140 will be described.

（１）画像特徴量算出部５０２による処理の具体例
はじめに、画像特徴量算出部５０２による処理の具体例について説明する。図６は、画像特徴量算出部による処理の一例を示す図である。図６に示すように、画像特徴量算出部５０２は、更に、正規化部６１０、ヒストグラム生成部６２０、特徴量算出部６３０を有する。 (1) Specific Example of Processing by Image Feature Amount Calculation Unit 502 First, a specific example of processing by the image feature amount calculation unit 502 will be described. Fig. 6 is a diagram showing an example of processing by the image feature amount calculation unit. As shown in Fig. 6, the image feature amount calculation unit 502 further includes a normalization unit 610, a histogram generation unit 620, and a feature amount calculation unit 630.

正規化部６１０は、画像格納部５１１から読み出した、各波長のハイパースペクトル画像（符号３００＿１、符号３００＿２、・・・符号３００＿８１）に対して、正規化処理を行う。具体的には、正規化部６１０は、
・各波長のハイパースペクトル画像（特定波長のハイパースペクトル画像を除くハイパースペクトル画像）に含まれる各画素の画素値を、
・特定波長のハイパースペクトル画像に含まれる特定画素の画素値で、
除算することで、正規化処理を行う。 The normalization unit 610 performs normalization processing on the hyperspectral images (reference numerals 300_1, 300_2, ..., and 300_81) of each wavelength read out from the image storage unit 511. Specifically, the normalization unit 610 performs normalization processing on the hyperspectral images (reference numerals 300_1, 300_2, ..., and 300_81) of each wavelength read out from the image storage unit 511.
The pixel value of each pixel included in the hyperspectral image of each wavelength (hyperspectral image excluding the hyperspectral image of a specific wavelength) is
- The pixel value of a specific pixel contained in a hyperspectral image of a specific wavelength,
Normalization is performed by division.

特定波長のハイパースペクトル画像に含まれる特定画素の画素値とは、例えば、波長＝１４４０［ｎｍ］のハイパースペクトル画像に含まれる、水領域の画素の画素値である。このように、正規化処理を行うことで、各波長のハイパースペクトル画像間の、反射強度のばらつきの影響を低減させることができる。 The pixel value of a specific pixel contained in a hyperspectral image of a specific wavelength is, for example, the pixel value of a pixel in the water region contained in a hyperspectral image of wavelength 1440 [nm]. By performing normalization in this way, the effects of variations in reflection intensity between hyperspectral images of each wavelength can be reduced.

正規化部６１０により正規化処理された、正規化後の各波長のハイパースペクトル画像は、ヒストグラム生成部６２０に通知される。 The normalized hyperspectral images for each wavelength, normalized by the normalization unit 610, are notified to the histogram generation unit 620.

ヒストグラム生成部６２０は、テクスチャを表す特徴量を算出するために、正規化後の各波長のハイパースペクトル画像のヒストグラム（各画素の画素値の度数分布）を生成し、特徴量算出部６３０に通知する。図６において、符号６００＿１は、正規化後のハイパースペクトル画像（符号３００＿１）のヒストグラムを表し、符号６００＿２は、正規化後のハイパースペクトル画像（符号３００＿２）のヒストグラムを表す。また、符号６００＿８１は、正規化後のハイパースペクトル画像（符号３００＿８１）のヒストグラムを表す。 To calculate features representing the texture, the histogram generation unit 620 generates histograms (frequency distribution of pixel values for each pixel) of the normalized hyperspectral image for each wavelength and notifies the feature calculation unit 630. In FIG. 6, reference numeral 600_1 represents the histogram of the normalized hyperspectral image (reference numeral 300_1), reference numeral 600_2 represents the histogram of the normalized hyperspectral image (reference numeral 300_2), and reference numeral 600_81 represents the histogram of the normalized hyperspectral image (reference numeral 300_81).

特徴量算出部６３０は、ヒストグラム生成部６２０よりヒストグラムが通知されると、テクスチャを表す特徴量として、当該ヒストグラムに基づく特徴量を算出する。具体的には、特徴量算出部６３０は、ヒストグラムに基づく特徴量として、例えば、
・平均値、
・標準偏差、
・分散値、
・最小値、
・最大値、
・中央値、
・第１四分位数、
・第３四分位数、
・最頻値、
・尖度、
・歪度、
を算出する。 When the histogram is notified from the histogram generation unit 620, the feature calculation unit 630 calculates a feature based on the histogram as a feature representing the texture. Specifically, the feature calculation unit 630 calculates, for example, the following as the feature based on the histogram:
・Average value,
- standard deviation,
- variance value,
- minimum value,
- maximum value,
- Median,
・1st quartile,
・Third quartile,
・Modification value,
·kurtosis,
·skewness,
Calculate.

特徴量算出部６３０は、ヒストグラムに基づく特徴量を、各波長の正規化後のハイパースペクトル画像それぞれについて算出する。このため、ハイパースペクトル画像の画像枚数（波長の数）が８１で、各ヒストグラムに基づく特徴量の数が１１個であるとすると、特徴量算出部６３０により学習用データ生成部５０４に通知される画像特徴量群６０２には、８９１個の画像特徴量が含まれる。 The feature calculation unit 630 calculates feature amounts based on histograms for each hyperspectral image after normalization for each wavelength. Therefore, if the number of images (number of wavelengths) in the hyperspectral image is 81 and the number of feature amounts based on each histogram is 11, the image feature amount group 602 notified to the training data generation unit 504 by the feature amount calculation unit 630 will include 891 image feature amounts.

（２）学習用データ生成部５０４による処理の具体例
次に、学習用データ生成部５０４による処理の具体例について説明する。図７は、学習用データ生成部による処理の一例を示す図である。図７に示すように、学習用データ生成部５０４は、更に、画像特徴量絞り込み部７１０、結合部７２０を有する。 (2) Specific Example of Processing by the Training Data Generation Unit 504 Next, a specific example of processing by the training data generation unit 504 will be described. Fig. 7 is a diagram showing an example of processing by the training data generation unit. As shown in Fig. 7, the training data generation unit 504 further includes an image feature narrowing unit 710 and a combining unit 720.

画像特徴量絞り込み部７１０は、特徴量算出部６３０から画像特徴量群６０２が通知されると、通知された画像特徴量群６０２に含まれる複数の画像特徴量から、所定の画像特徴量を選択し、結合部７２０に通知する。画像特徴量絞り込み部７１０により選択される画像特徴量は、学習用の対象サンプル１０２に含まれる微生物の状態を表すデータを予測するのに有益な画像特徴量であり、予め決定されているものとする。 When the image feature narrowing unit 710 is notified of the image feature group 602 by the feature calculation unit 630, it selects a predetermined image feature from the multiple image features included in the notified image feature group 602 and notifies the combination unit 720. The image feature selected by the image feature narrowing unit 710 is an image feature that is useful for predicting data representing the state of microorganisms contained in the learning target sample 102, and is assumed to have been determined in advance.

なお、画像特徴量絞り込み部７１０が選択する画像特徴量は、画像特徴量の様々な組み合わせのもとで、学習と検証とを繰り返し、最良の検証結果に基づいて予め決定されるものとする。選択する画像特徴量は、結合部７２０により生成される学習用データごとに異なっていてもよいし、学習用データに関わらず共通であってもよい。例えば、各種微生物の含有比率を予測するための学習用データを生成する場合に選択される画像特徴量と、最も含有比率の高い微生物の種類を予測するための学習用データを生成する場合に選択される画像特徴量とは、同じであっても、異なっていてもよい。また、各種微生物の含有比率を予測するための学習用データを生成する場合に選択される画像特徴量は、微生物の種類ごとに異なっていてもよいし、同じであってもよい。 The image features selected by the image feature narrowing unit 710 are determined in advance based on the best verification results obtained by repeating learning and verification using various combinations of image features. The selected image features may be different for each piece of training data generated by the combining unit 720, or may be common regardless of the training data. For example, the image features selected when generating training data for predicting the content ratios of various microorganisms may be the same as or different from the image features selected when generating training data for predicting the type of microorganism with the highest content ratio. Furthermore, the image features selected when generating training data for predicting the content ratios of various microorganisms may be the same or different for each type of microorganism.

結合部７２０は、画像特徴量絞り込み部７１０から、絞り込み後の画像特徴量が通知されると、データ格納部５１２から、対応する微生物叢データ４００を読み出し、学習用データを生成する。 When the combining unit 720 is notified of the refined image features by the image feature refinement unit 710, it reads the corresponding microbiome data 400 from the data storage unit 512 and generates learning data.

図７において、符号７３０＿１は、"微生物Ａ"の含有比率を予測する学習済みモデルを生成する際に用いられる学習用データである。同様に、符号７３０＿２、符号７３０＿３は、"微生物Ｂ"、"微生物Ｃ"の含有比率をそれぞれ予測する学習済みモデルを生成する際に用いられる学習用データである。 In Figure 7, reference numeral 730_1 denotes training data used when generating a trained model that predicts the content ratio of "microorganism A." Similarly, reference numerals 730_2 and 730_3 denote training data used when generating trained models that predict the content ratios of "microorganism B" and "microorganism C," respectively.

符号７３０＿１は、「サンプル１」により識別される学習用の対象サンプル１０２について生成された学習用データであって、
・画像特徴量絞り込み部７１０から通知された、絞り込み後の画像特徴量である、画像特徴量（λ_１）～画像特徴量（λ_８０）を入力データ、
・データ格納部５１２から読み出した微生物叢データ４００に含まれる、微生物種＝"微生物Ａ"、含有比率＝"ａ％"を正解データ、
とする学習用データを示している。 Reference numeral 730_1 denotes training data generated for the training target sample 102 identified by “sample 1,”
The image feature amounts (λ ₁ ) to (λ ₈₀ ), which are the image feature amounts after narrowing down notified by the image feature amount narrowing unit 710, are used as input data.
The microbial species=“microorganism A” and content ratio=“a%” contained in the microbiota data 400 read from the data storage unit 512 are the correct data.
The training data shown is:

なお、符号７３０＿１において、入力データ＝画像特徴量（λ_１）～画像特徴量（λ₈₀）は、画像特徴量絞り込み部７１０によって、５種類の波長が選択されたことを示している。上述したように、各波長は、１１個の画像特徴量を含むため、符号７３０＿１の例の場合、入力データには、５５個の画像特徴量が含まれることになる。 In addition, in reference numeral 730_1, the input data = image feature (λ ₁ ) to image feature (λ ₈₀ ) indicates that five wavelengths have been selected by the image feature narrowing unit 710. As described above, each wavelength includes 11 image features, so in the example of reference numeral 730_1, the input data includes 55 image features.

図７において、符号７４０は、最も含有比率の高い微生物の種類を予測する学習済みモデルを生成する際に用いられる学習用データである。符号７４０は、「サンプル１」により識別される学習用の対象サンプル１０２について生成された学習用データであって、
・画像特徴量絞り込み部７１０から通知された、絞り込み後の画像特徴量である、画像特徴量（λ_１）～画像特徴量（λ_８０）を入力データ、
・データ格納部５１２から読み出した微生物叢データ４００に含まれる、高含有微生物種＝"微生物Ｆ"を正解データ、
とする学習用データを示している。 7, reference numeral 740 denotes training data used to generate a trained model that predicts the type of microorganism with the highest content ratio. Reference numeral 740 denotes training data generated for the training target sample 102 identified by "sample 1,"
The image feature amounts (λ ₁ ) to (λ ₈₀ ), which are the image feature amounts after narrowing down notified by the image feature amount narrowing unit 710, are used as input data.
The highly abundant microbial species = "Microorganism F" included in the microbiota data 400 read from the data storage unit 512 is the correct answer data.
The training data shown is:

なお、符号７３０＿１において、入力データ＝画像特徴量（λ_１）～画像特徴量（λ₈₀）は、画像特徴量絞り込み部７１０によって、５種類の波長が選択されたことを示している。上述したように、各波長は、１１個の画像特徴量を含むため、符号７４０の例の場合、入力データには５５個の画像特徴量が含まれることになる。 In addition, in reference numeral 730_1, the input data = image feature (λ ₁ ) to image feature (λ ₈₀ ) indicates that five wavelengths have been selected by the image feature narrowing unit 710. As described above, each wavelength includes 11 image features, so in the example of reference numeral 740, the input data includes 55 image features.

（３）学習部５０５による処理の具体例
次に、学習部５０５による処理の具体例について説明する。図８は、学習部による処理の一例を示す図である。図８に示すように、学習部５０５は、更に、モデルＡ８０１、モデルＢ８０２、モデルＣ８０３、・・・モデルα８２１と、比較変更部８１１、比較変更部８１２、比較変更部８１３、・・・比較変更部８３１とを有する。 (3) Specific Example of Processing by the Learning Unit 505 Next, a specific example of processing by the learning unit 505 will be described. Fig. 8 is a diagram showing an example of processing by the learning unit. As shown in Fig. 8, the learning unit 505 further includes a model A 801, a model B 802, a model C 803, ..., a model α 821, and a comparison and modification unit 811, a comparison and modification unit 812, a comparison and modification unit 813, ..., a comparison and modification unit 831.

モデルＡ８０１は、"微生物Ａ"の含有比率を予測するためのモデルである。モデルＡ８０１は、符号７３０＿１に示す学習用データの"入力データ"が入力されることで、出力データを出力する。 Model A801 is a model for predicting the content ratio of "Microorganism A." Model A801 outputs output data when "input data" of the learning data indicated by reference numeral 730_1 is input.

比較変更部８１１は、モデルＡ８０１から出力された出力データと、符号７３０＿１に示す学習用データの"正解データ"とを比較し、誤差に基づいて、モデルＡ８０１のモデルパラメータを更新する。このようにして、モデルＡ８０１に対する学習が行われ、学習済みモデルＡが生成される。 The comparison and modification unit 811 compares the output data output from model A801 with the "correct data" of the training data indicated by reference numeral 730_1, and updates the model parameters of model A801 based on the error. In this way, training is performed on model A801, and trained model A is generated.

同様に、モデルＢ８０２は、"微生物Ｂ"の含有比率を予測するためのモデルである。モデルＢ８０２は、符号７３０＿２に示す学習用データの"入力データ"が入力されることで、出力データを出力する。 Similarly, model B802 is a model for predicting the content ratio of "microorganism B." Model B802 outputs output data when "input data" of the learning data indicated by reference numeral 730_2 is input.

比較変更部８１２は、モデルＢ８０２から出力された出力データと、符号７３０＿２に示す学習用データの"正解データ"とを比較し、誤差に基づいて、モデルＢ８０２のモデルパラメータを更新する。このようにして、モデルＢ８０２に対する学習が行われ、学習済みモデルＢが生成される。 The comparison and modification unit 812 compares the output data output from model B802 with the "correct data" of the training data indicated by reference numeral 730_2, and updates the model parameters of model B802 based on the error. In this way, model B802 is trained, and trained model B is generated.

同様に、モデルＣ８０３は、"微生物Ｃ"の含有比率を予測するためのモデルである。モデルＣ８０３は、符号７３０＿３に示す学習用データの"入力データ"が入力されることで、出力データを出力する。 Similarly, model C803 is a model for predicting the content ratio of "microorganism C." Model C803 outputs output data when "input data" of the learning data indicated by reference numeral 730_3 is input.

比較変更部８１３は、モデルＣ８０３から出力された出力データと、符号７３０＿３に示す学習用データの"正解データ"とを比較し、誤差に基づいて、モデルＣ８０３のモデルパラメータを更新する。このようにして、モデルＣ８０３に対する学習が行われ、学習済みモデルＣが生成される。 The comparison and modification unit 813 compares the output data output from model C803 with the "correct data" of the training data indicated by reference numeral 730_3, and updates the model parameters of model C803 based on the error. In this way, training is performed on model C803, and trained model C is generated.

モデルα８２１は、最も含有比率の高い微生物の種類を予測するためのモデルである。モデルα８２１は、符号７４０に示す学習用データの"入力データ"が入力されることで、出力データを出力する。 Model α821 is a model for predicting the type of microorganism with the highest content ratio. Model α821 outputs output data when "input data" of the learning data indicated by reference numeral 740 is input.

比較変更部８３１は、モデルα８２１から出力された出力データと、符号７４０に示す学習用データの"正解データ"とを比較し、誤差に基づいて、モデルα８２１のモデルパラメータを更新する。このようにして、モデルα８２１に対する学習が行われ、学習済みモデルαが生成される。 The comparison and modification unit 831 compares the output data output from model α821 with the "correct data" in the training data indicated by reference numeral 740, and updates the model parameters of model α821 based on the error. In this way, model α821 is trained, and a trained model α is generated.

上記説明から明らかなように、学習済みモデルＡ８０１、学習済みモデルＢ８０２、学習済みモデルＣ８０３、・・・学習済みモデルα８２１は、機械学習によって生成される。ここでいう機械学習には、例えば、
線形回帰、ロジスティック回帰、ナイーブベイズ、決定木、ランダムフォレスト、ブースティング、ＬＡＳＳＯ、Ｒｉｄｇｅ、ＥｌａｓｔｉｃＮｅｔ、ニューラルネットワーク（ディープラーニング、ＣＮＮ、Ｔｒａｎｓｆｏｒｍｅｒ）、
等の任意の機械学習が含まれる。 As is clear from the above description, the trained model A801, the trained model B802, the trained model C803, ..., the trained model α821 are generated by machine learning. The machine learning referred to here includes, for example,
Linear regression, logistic regression, Naive Bayes, decision tree, random forest, boosting, LASSO, Ridge, ElasticNet, neural network (deep learning, CNN, Transformer),
This includes any machine learning algorithms such as:

＜学習処理の流れ＞
次に、微生物叢解析システム１００による学習処理の流れについて説明する。図９は、微生物叢解析システムによる学習処理の流れを示すフローチャートの一例である。 <Learning process flow>
Next, we will explain the flow of the learning process performed by the microbiome analysis system 100. Fig. 9 is an example of a flowchart showing the flow of the learning process performed by the microbiome analysis system.

ステップＳ９０１において、微生物叢解析システム１００は、物体１０１から採取された学習用の対象サンプル１０２を取得する。 In step S901, the microbiome analysis system 100 acquires a learning target sample 102 collected from an object 101.

ステップＳ９０２において、微生物叢解析システム１００は、近赤外光を対象サンプル１０２に照射し、ハイパースペクトルカメラ１２０を用いて、反射光を撮影することで、各波長のハイパースペクトル画像を生成する。 In step S902, the microbiome analysis system 100 irradiates the target sample 102 with near-infrared light and captures the reflected light using the hyperspectral camera 120 to generate hyperspectral images for each wavelength.

ステップＳ９０３において、微生物叢解析システム１００は、取得した各波長のハイパースペクトル画像に対して正規化処理を行う。 In step S903, the microbiome analysis system 100 performs normalization processing on the acquired hyperspectral images for each wavelength.

ステップＳ９０４において、微生物叢解析システム１００は、正規化後の各波長のハイパースペクトル画像について、画像特徴量を算出する。 In step S904, the microbiome analysis system 100 calculates image features for the normalized hyperspectral images at each wavelength.

ステップＳ９０５において、微生物叢解析システム１００は、算出した画像特徴量から所定の画像特徴量を選択することで、画像特徴量の絞り込みを行う。 In step S905, the microbiome analysis system 100 narrows down the image features by selecting specific image features from the calculated image features.

また、ステップＳ９０６において、微生物叢解析システム１００は、次世代シーケンサ１３０を用いて、学習用の対象サンプル１０２について、微生物叢データ４００を生成する。 Furthermore, in step S906, the microbiome analysis system 100 uses the next-generation sequencer 130 to generate microbiome data 400 for the learning target sample 102.

ステップＳ９０７において、微生物叢解析システム１００は、絞り込み後の画像特徴量を入力データとして取得し、微生物叢データ４００に含まれる微生物の状態を表すデータを正解データとして取得する。 In step S907, the microbiome analysis system 100 acquires the narrowed-down image features as input data and acquires data representing the state of the microorganisms contained in the microbiome data 400 as correct answer data.

ステップＳ９１１において、微生物叢解析システム１００は、取得した入力データと、取得した正解データ（ここでは、学習用の対象サンプル１０２に含まれる対象の微生物の含有比率）とに基づいて、学習用データを生成する。 In step S911, the microbiome analysis system 100 generates learning data based on the acquired input data and the acquired correct answer data (here, the content ratio of the target microorganisms contained in the learning target sample 102).

ステップＳ９１２において、微生物叢解析システム１００は、生成した学習用データを用いて、予測用の対象サンプルに含まれる対象の微生物の含有比率を予測するための学習済みモデルを生成する。 In step S912, the microbiome analysis system 100 uses the generated training data to generate a trained model for predicting the content ratio of the target microorganism contained in the target sample for prediction.

ステップＳ９１３において、微生物叢解析システム１００は、全ての種類の微生物について、予測用の対象サンプルに含まれる対象の微生物の含有比率を予測するための学習済みモデルを生成したか否かを判定する。 In step S913, the microbiome analysis system 100 determines whether or not a trained model has been generated for predicting the content ratio of target microorganisms contained in the target sample for prediction for all types of microorganisms.

ステップＳ９１３において、学習済みモデルを生成していない微生物があると判定した場合には（ステップＳ９１３においてＮＯの場合には）、ステップＳ９１１に戻る。 If it is determined in step S913 that there are microorganisms for which a trained model has not been generated (NO in step S913), the process returns to step S911.

一方、ステップＳ９２１において、微生物叢解析システム１００は、取得した入力データと、取得した正解データ（ここでは、学習用の対象サンプル１０２に含まれる微生物のうち最も含有比率の高い微生物の種類）とに基づいて、学習用データを生成する。 On the other hand, in step S921, the microbiome analysis system 100 generates learning data based on the acquired input data and the acquired correct answer data (here, the type of microorganism with the highest content rate among the microorganisms contained in the learning target sample 102).

ステップＳ９２２において、微生物叢解析システム１００は、生成した学習用データを用いて、予測用の対象サンプルに含まれる微生物のうち、最も含有比率の高い微生物の種類を予測する学習済みモデルを生成する。 In step S922, the microbiome analysis system 100 uses the generated training data to generate a trained model that predicts the type of microorganism with the highest abundance among the microorganisms contained in the target sample for prediction.

ステップＳ９１３において、全ての種類の微生物について、学習済みモデルを生成したと判定し、かつ、ステップＳ９２２において学習済みモデルの生成が完了した場合には、微生物叢解析システム１００は、学習処理を終了する。 If it is determined in step S913 that trained models have been generated for all types of microorganisms, and if the generation of trained models is completed in step S922, the microbiome analysis system 100 terminates the training process.

＜微生物叢解析システムの予測フェーズにおけるシステム構成＞
次に、微生物叢解析システムの予測フェーズにおけるシステム構成について説明する。図１０は、微生物叢解析システムの予測フェーズにおけるシステム構成の一例を示す図である。図１０に示すように、予測フェーズにおける微生物叢解析システム１０００は、近赤外光出力機器１１０と、ハイパースペクトルカメラ１２０と、第１の実施形態に係る「装置」である微生物叢解析装置１４０とを備える。 <System configuration for the prediction phase of the microbiome analysis system>
Next, the system configuration of the microbiome analysis system in the prediction phase will be described. Fig. 10 is a diagram showing an example of the system configuration of the microbiome analysis system in the prediction phase. As shown in Fig. 10, the microbiome analysis system 1000 in the prediction phase comprises a near-infrared light output device 110, a hyperspectral camera 120, and a microbiome analysis device 140, which is the "device" according to the first embodiment.

近赤外光出力機器１１０は、近赤外光（波長＝８００［ｎｍ］～２５００ｎｍの光）を出射する機器である。近赤外光出力機器１１０から出射された近赤外光は、物体１００１から採取された予測用の対象サンプル１００２に照射される。 The near-infrared light output device 110 is a device that emits near-infrared light (light with a wavelength of 800 nm to 2500 nm). The near-infrared light emitted from the near-infrared light output device 110 is irradiated onto a target sample 1002 for prediction collected from an object 1001.

ハイパースペクトルカメラ１２０は、予測用の対象サンプル１００２から反射した反射光を撮影し、撮影することにより得られる信号（撮影信号）を各波長に分光することで、波長ごとの撮影画像であるハイパースペクトル画像を生成する。ハイパースペクトルカメラ１２０は、各波長のハイパースペクトル画像を、微生物叢解析装置１４０に送信する。 The hyperspectral camera 120 captures the light reflected from the target sample 1002 for prediction, and disperses the signal (captured signal) obtained by capturing the image into individual wavelengths to generate a hyperspectral image, which is an image captured for each wavelength. The hyperspectral camera 120 transmits the hyperspectral images for each wavelength to the microbiome analyzer 140.

微生物叢解析装置１４０は、予測フェーズにおいて、ハイパースペクトルカメラ１２０から、予測用の対象サンプル１００２についてのハイパースペクトル画像を取得する。微生物叢解析装置１４０は、学習フェーズにおいて生成した学習済みモデルを用いて、取得したハイパースペクトル画像から、予測用の対象サンプルに含まれる微生物の状態を表すデータを予測する。上述したように、学習フェーズにおいて生成した学習済みモデルには、
・予測用の対象サンプル１００２に含まれる各種微生物の含有比率を予測する学習済みモデル、
・予測用の対象サンプル１００２に含まれる微生物のうち、最も含有比率の高い微生物の種類を予測する学習済みモデル、
が含まれる。このため、微生物叢解析装置１４０は、予測用の対象サンプル１００２に含まれる各種微生物の含有比率と、予測用の対象サンプル１００２に含まれる微生物のうち、最も含有比率の高い微生物の種類とを、予測結果として出力する。 In the prediction phase, the microbiome analyzer 140 acquires a hyperspectral image of the target sample 1002 for prediction from the hyperspectral camera 120. Using the trained model generated in the learning phase, the microbiome analyzer 140 predicts data representing the state of microorganisms contained in the target sample for prediction from the acquired hyperspectral image. As described above, the trained model generated in the learning phase includes:
A trained model for predicting the content ratio of various microorganisms contained in the target sample 1002 for prediction;
A trained model that predicts the type of microorganism with the highest content ratio among the microorganisms contained in the target sample 1002 for prediction;
For this reason, the microbiome analyzer 140 outputs as prediction results the content ratios of various microorganisms contained in the target sample for prediction 1002 and the type of microorganism with the highest content ratio among the microorganisms contained in the target sample for prediction 1002.

＜微生物叢解析装置の機能構成＞
次に、微生物叢解析装置の予測フェーズにおける機能構成について説明する。図１１は、微生物叢解析装置の予測フェーズにおける機能構成の一例を示す図である。上述したように、微生物叢解析装置１４０には、微生物叢解析プログラムがインストールされており、当該微生物叢解析プログラムが予測フェーズにおいて実行されることで、微生物叢解析装置１４０は、
・ハイパースペクトル画像取得部５０１、
・画像特徴量算出部５０２、
・画像特徴量絞り込み部７０１、
・予測部１１０１、
・出力部１１０２、
として機能する。 <Functional configuration of the microbiome analyzer>
Next, the functional configuration of the microbiota analyzer in the prediction phase will be described. FIG. 11 is a diagram showing an example of the functional configuration of the microbiota analyzer in the prediction phase. As described above, a microbiota analysis program is installed in the microbiota analyzer 140, and by executing this microbiota analysis program in the prediction phase, the microbiota analyzer 140:
Hyperspectral image acquisition unit 501,
Image feature amount calculation unit 502,
Image feature narrowing unit 701,
Prediction unit 1101,
output unit 1102,
It functions as:

このうち、ハイパースペクトル画像取得部５０１及び画像特徴量算出部５０２は、図５を用いて説明済みであるため、ここでは、説明を省略する。また、画像特徴量絞り込み部７０１は、図７を用いて説明済みであるため、ここでは、説明を省略する。 Of these, the hyperspectral image acquisition unit 501 and the image feature calculation unit 502 have already been explained using Figure 5, so their explanation will be omitted here. Furthermore, the image feature narrowing unit 701 has already been explained using Figure 7, so their explanation will be omitted here.

予測部１１０１は、学習済みモデル格納部５１４に格納された学習済みモデルを読み出し、画像特徴量絞り込み部７０１により通知された絞り込み後の画像特徴量を入力する。これにより、学習済みモデルは、予測用の対象サンプル１００２に含まれる微生物の状態を表すデータを予測する。 The prediction unit 1101 reads the trained model stored in the trained model storage unit 514 and inputs the refined image features notified by the image feature refinement unit 701. As a result, the trained model predicts data representing the state of the microorganisms contained in the target sample 1002 for prediction.

上述したように、学習済みモデルには、
・予測用の対象サンプル１００２に含まれる各種微生物の含有比率を予測する学習済みモデル、
・予測用の対象サンプル１００２に含まれる微生物のうち、最も含有比率の高い微生物の種類を予測する学習済みモデル、
が含まれる。このため、予測部１１０１は、予測用の対象サンプル１００２に含まれる各種微生物の含有比率と、予測用の対象サンプル１００２に含まれる微生物のうち、最も含有比率の高い微生物の種類とを出力部１１０２に通知する。 As mentioned above, the trained model includes:
A trained model for predicting the content ratio of various microorganisms contained in the target sample 1002 for prediction;
A trained model that predicts the type of microorganism with the highest content ratio among the microorganisms contained in the target sample 1002 for prediction;
Therefore, the prediction unit 1101 notifies the output unit 1102 of the content ratios of various microorganisms contained in the target sample for prediction 1002 and the type of microorganism with the highest content ratio among the microorganisms contained in the target sample for prediction 1002.

出力部１１０２は、予測部１１０１から通知された、予測用の対象サンプル１００２に含まれる各種微生物の含有比率と、予測用の対象サンプル１００２に含まれる微生物のうち、最も含有比率の高い微生物の種類とを、予測結果として出力する。 The output unit 1102 outputs the content ratios of various microorganisms contained in the target sample 1002 for prediction, as notified by the prediction unit 1101, and the type of microorganism with the highest content ratio among the microorganisms contained in the target sample 1002 for prediction, as prediction results.

＜微生物叢解析装置の各機能部による処理の具体例＞
次に、微生物叢解析装置１４０の予測フェーズにおける各機能部（ここでは、予測部１１０１、出力部１１０２）による処理の具体例について説明する。 <Specific examples of processing by each functional unit of the microbiome analyzer>
Next, a specific example of processing by each functional unit (here, the prediction unit 1101 and the output unit 1102) in the prediction phase of the microbiome analyzer 140 will be described.

（１）予測部１１０１による処理の具体例
はじめに、予測部１１０１による処理の具体例について説明する。図１２は、予測部による処理の一例を示す図である。図１２に示すように、予測部１１０１は、学習済みモデル格納部５１４から読み出した学習済みモデルを有する。図１２の例は、学習済みモデルとして、学習済みモデルＡ１２０１、学習済みモデルＢ１２０２、学習済みモデルＣ１２０３、・・・学習済みモデルα１２２１が読み出された様子を示している。 (1) Specific Example of Processing by Prediction Unit 1101 First, a specific example of processing by the prediction unit 1101 will be described. Fig. 12 is a diagram showing an example of processing by the prediction unit. As shown in Fig. 12, the prediction unit 1101 has trained models read out from the trained model storage unit 514. The example in Fig. 12 shows how trained models A 1201, trained model B 1202, trained model C 1203, ..., and trained model α 1221 have been read out.

なお、図１２において、符号１２００は、「サンプルＸ」により識別される予測用の対象サンプル１００２について、画像特徴量絞り込み部７０１より、入力データとして、画像特徴量（λ_１）～画像特徴量（λ₈₀）が通知された様子を示している。 In FIG. 12, reference numeral 1200 indicates that the image feature narrowing unit 701 notifies the image feature (λ ₁ ) to the image feature (λ ₈₀ ) as input data for the target sample for prediction 1002 identified by “sample X.”

学習済みモデルＡ１２０１は、"微生物Ａ"の含有比率を予測する学習済みモデルである。符号１２００の"入力データ"が入力されることで、学習済みモデルＡ１２０１は、「サンプルＸ」により識別される予測用の対象サンプル１００２に含まれる微生物Ａの含有比率＝"ａ_Ｘ％"を予測する。 Trained model A1201 is a trained model that predicts the content ratio of "microorganism A." When "input data" 1200 is input, trained model A1201 predicts the content ratio of microorganism A contained in the target sample for prediction 1002 identified by "sample X" = "a _X %."

同様に、学習済みモデルＢ１２０２は、"微生物Ｂ"の含有比率を予測する学習済みモデルである。符号１２００の"入力データ"が入力されることで、学習済みモデルＢ１２０２は、「サンプルＸ」により識別される予測用の対象サンプル１００２に含まれる微生物Ｂの含有比率＝"ｂ_Ｘ％"を予測する。 Similarly, trained model B1202 is a trained model that predicts the content ratio of "microorganism B." When "input data" 1200 is input, trained model B1202 predicts the content ratio of microorganism B contained in the target sample for prediction 1002 identified by "sample X" = "b _X %."

同様に、学習済みモデルＣ１２０３は、"微生物Ｃ"の含有比率を予測する学習済みモデルである。符号１２００の"入力データ"が入力されることで、学習済みモデルＣ１２０３は、「サンプルＸ」により識別される予測用の対象サンプル１００２に含まれる微生物Ｃの含有比率＝"ｃ_Ｘ％"を予測する。 Similarly, trained model C1203 is a trained model that predicts the content ratio of "microorganism C." When the "input data" referenced 1200 is input, trained model C1203 predicts the content ratio of microorganism C contained in the target sample for prediction 1002 identified by "sample X" = "c _X %."

学習済みモデルα１２２１は、最も含有比率の高い微生物の種類を予測する学習済みモデルである。符号１２００の"入力データ"が入力されることで、学習済みモデルα１２２１は、「サンプルＸ」により識別される予測用の対象サンプル１００２に含まれる微生物のうち、最も含有比率の高い微生物の種類＝"微生物Ｇ"を出力する。 Trained model α1221 is a trained model that predicts the type of microorganism with the highest content ratio. When the "input data" referenced 1200 is input, trained model α1221 outputs the type of microorganism with the highest content ratio = "microorganism G" among the microorganisms contained in the target sample for prediction 1002 identified by "sample X."

（２）出力部１１０２による処理の具体例
次に、出力部１１０２による処理の具体例について説明する。図１３は、出力部による処理の一例を示す図である。 (2) Specific Example of Processing by Output Unit 1102 Next, a specific example of processing by the output unit 1102 will be described. Fig. 13 is a diagram showing an example of processing by the output unit.

図１３に示すように、予測結果１３００は、情報の項目として、"対象サンプル"、"微生物種"、"含有比率"、"高含有微生物種"を含む。 As shown in Figure 13, the prediction result 1300 includes the information items "target sample," "microbial species," "content ratio," and "high-content microbial species."

"対象サンプル"には、予測用の対象サンプル（例えば、対象サンプル１００２）を識別するための識別子（図１３の例では、「サンプルＸ」）が格納される。 "Target sample" stores an identifier ("Sample X" in the example of Figure 13) for identifying the target sample for prediction (e.g., target sample 1002).

"微生物種"には、予測用の対象サンプル（例えば、対象サンプル１００２）に含まれる微生物の種類（図１３の例では、「微生物Ａ」、「微生物Ｂ」、「微生物Ｃ」・・・等）が格納される。 "Microorganism species" stores the type of microorganism (in the example of Figure 13, "Microorganism A," "Microorganism B," "Microorganism C," etc.) contained in the target sample for prediction (e.g., target sample 1002).

"含有比率"には、予測用の対象サンプル（例えば、対象サンプル１００２）に含まれる各種微生物の含有比率（図１３の例では、「ａ_Ｘ％」、「ｂ_Ｘ％」、「ｃ_Ｘ％」、・・・等）が格納される。 The "content ratio" stores the content ratio (in the example of FIG. 13, "a _X %", "b _X %", "c _X %", ..., etc.) of various microorganisms contained in the target sample for prediction (e.g., target sample 1002).

"高含有微生物種"には、予測用の対象サンプル（例えば、対象サンプル１００２）に含まれる微生物のうち、最も含有比率が高い微生物の種類（図１３の例では、「微生物Ｇ」）が格納される。 "Highly abundant microbial species" stores the type of microorganism with the highest abundance ratio ("Microorganism G" in the example of Figure 13) among the microorganisms contained in the target sample for prediction (e.g., target sample 1002).

＜予測処理の流れ＞
次に、微生物叢解析システム１０００による予測処理の流れについて説明する。図１４は、微生物叢解析システムによる予測処理の流れを示すフローチャートの一例である。 <Prediction process flow>
Next, we will explain the flow of prediction processing by the microbiome analysis system 1000. Fig. 14 is an example of a flowchart showing the flow of prediction processing by the microbiome analysis system.

ステップＳ１４０１において、微生物叢解析システム１０００は、物体１００１から採取された予測用の対象サンプル１００２を取得する。 In step S1401, the microbiome analysis system 1000 acquires a target sample 1002 for prediction collected from the object 1001.

ステップＳ１４０２において、微生物叢解析システム１０００は、近赤外光を対象サンプル１００２に照射し、ハイパースペクトルカメラ１２０を用いて、反射光を撮影することで、各波長のハイパースペクトル画像を生成する。 In step S1402, the microbiome analysis system 1000 irradiates the target sample 1002 with near-infrared light and captures the reflected light using the hyperspectral camera 120 to generate hyperspectral images for each wavelength.

ステップＳ１４０３において、微生物叢解析システム１０００は、取得した各波長のハイパースペクトル画像に対して正規化処理を行う。 In step S1403, the microbiome analysis system 1000 performs normalization processing on the acquired hyperspectral images for each wavelength.

ステップＳ１４０４において、微生物叢解析システム１０００は、正規化後の各波長のハイパースペクトル画像について、画像特徴量を算出する。 In step S1404, the microbiome analysis system 1000 calculates image features for the normalized hyperspectral images at each wavelength.

ステップＳ１４０５において、微生物叢解析システム１０００は、算出した画像特徴量から所定の画像特徴量を選択することで、画像特徴量の絞り込みを行う。 In step S1405, the microbiome analysis system 1000 narrows down the image features by selecting specific image features from the calculated image features.

ステップＳ１４０６において、微生物叢解析システム１０００は、絞り込み後の画像特徴量を、予測用の対象サンプル１００２に含まれる各種微生物の含有比率を予測する各学習済みモデルに入力し、各種微生物の含有比率を予測する。 In step S1406, the microbiome analysis system 1000 inputs the narrowed-down image features into each trained model that predicts the content ratio of various microorganisms contained in the target sample 1002 for prediction, and predicts the content ratio of various microorganisms.

ステップＳ１４０７において、微生物叢解析システム１０００は、絞り込み後の画像特徴量を、予測用の対象サンプルに含まれる微生物のうち、最も含有比率の高い微生物の種類を予測する学習済みモデルに入力する。これにより、微生物叢解析システム１０００は、最も含有比率の高い微生物の種類を予測する。 In step S1407, the microbiome analysis system 1000 inputs the narrowed-down image features into a trained model that predicts the type of microorganism with the highest content ratio among the microorganisms contained in the target sample for prediction. As a result, the microbiome analysis system 1000 predicts the type of microorganism with the highest content ratio.

ステップＳ１４０８において、微生物叢解析システム１０００は、予測結果を出力する。 In step S1408, the microbiome analysis system 1000 outputs the prediction results.

ステップＳ１４０９において、微生物叢解析システム１０００は、予測用の他の対象サンプルがあるか否かを判定する。ステップＳ１４０９において、予測用の他の対象サンプルがあると判定した場合には（ステップＳ１４０９においてＹＥＳの場合には）、ステップＳ１４０１に戻る。 In step S1409, the microbiome analysis system 1000 determines whether there are other target samples for prediction. If it is determined in step S1409 that there are other target samples for prediction (YES in step S1409), the process returns to step S1401.

一方、ステップＳ１４０９において、予測用の他の対象サンプルがないと判定した場合には（ステップＳ１４０９においてＮＯの場合には）、予測処理を終了する。 On the other hand, if it is determined in step S1409 that there are no other target samples for prediction (NO in step S1409), the prediction process ends.

＜学習済みモデルの予測精度の検証例＞
次に、微生物叢解析装置１４０の予測部１１０１が有する学習済みモデルの予測精度について説明する。 <Example of verification of the predictive accuracy of a trained model>
Next, the prediction accuracy of the trained model possessed by the prediction unit 1101 of the microbiome analyzer 140 will be described.

（１）各種微生物の含有比率を予測する学習済みモデル
図１５は、各種微生物の含有比率を予測する学習済みモデルの予測精度の検証例を示す図である。 (1) Trained model for predicting the content ratio of various microorganisms Figure 15 is a diagram showing an example of verification of the prediction accuracy of a trained model for predicting the content ratio of various microorganisms.

このうち、グラフ１５１０は、ラクトバチルスの含有比率を予測する学習済みモデルの予測精度の検証例を示している。グラフ１５１０において横軸は、検証対象の各対象サンプルについて、次世代シーケンサ１３０を用いて解析することで得られたラクトバチルスの含有比率（実測値％）を示している。また、グラフ１５１０において縦軸は、検証対象の各対象サンプルについて、ラクトバチルスの含有比率を予測する学習済みモデルを用いて予測されたラクトバチルスの含有比率（予測値％）を示している。 Of these, graph 1510 shows an example of verification of the prediction accuracy of a trained model that predicts the lactobacillus content ratio. In graph 1510, the horizontal axis shows the lactobacillus content ratio (actual measured value %) obtained by analyzing each target sample under verification using the next-generation sequencer 130. Also, in graph 1510, the vertical axis shows the lactobacillus content ratio (predicted value %) predicted for each target sample under verification using the trained model that predicts the lactobacillus content ratio.

グラフ１５１０内の各点は、各対象サンプルの実測値と予測値とに対応する位置にプロットされた点である。また、グラフ１５１０内の直線１５１１は、実測値を説明変数、予測値を目的変数とする線形回帰式を示している。グラフ１５１０によれば、決定係数Ｒ^２＝０．７９７１となり、ラクトバチルスの含有比率を予測する学習済みモデルの予測精度が良好であることが示された。 Each point in graph 1510 is plotted at a position corresponding to the actual measured value and predicted value of each target sample. Also, a straight line 1511 in graph 1510 represents a linear regression equation with the actual measured value as the explanatory variable and the predicted value as the objective variable. According to graph 1510, the coefficient of determination ^R2 = 0.7971, indicating that the trained model for predicting the Lactobacillus content ratio has good predictive accuracy.

グラフ１５２０は、ストレプトコッカス（レンサ球菌）の含有比率を予測する学習済みモデルの予測精度の検証例を示している。グラフ１５２０において横軸は、検証対象の各対象サンプルについて、次世代シーケンサ１３０を用いて解析することで得られたストレプトコッカスの含有比率（実測値％）を示している。また、グラフ１５２０において縦軸は、検証対象の各対象サンプルについて、ストレプトコッカスの含有比率を予測する学習済みモデルを用いて予測されたストレプトコッカスの含有比率（予測値％）を示している。 Graph 1520 shows an example of verification of the prediction accuracy of a trained model that predicts the content ratio of streptococcus (Streptococcus). In graph 1520, the horizontal axis shows the content ratio of streptococcus (actual measured value %) obtained by analyzing each target sample under verification using the next-generation sequencer 130. In addition, in graph 1520, the vertical axis shows the content ratio of streptococcus (predicted value %) predicted for each target sample under verification using the trained model that predicts the content ratio of streptococcus.

グラフ１５２０内の各点は、各対象サンプルの実測値と予測値とに対応する位置にプロットされた点である。また、グラフ１５２０内の直線１５２１は、実測値を説明変数、予測値を目的変数とする線形回帰式を示している。グラフ１５２０によれば、決定係数Ｒ^２＝０．７６１６となり、ストレプトコッカスの含有比率を予測する学習済みモデルの予測精度が良好であることが示された。 Each point in graph 1520 is plotted at a position corresponding to the actual measured value and predicted value of each target sample. A straight line 1521 in graph 1520 represents a linear regression equation with the actual measured value as the explanatory variable and the predicted value as the objective variable. According to graph 1520, the coefficient of determination ^R2 = 0.7616, indicating that the trained model for predicting the Streptococcus content ratio has good prediction accuracy.

グラフ１５３０は、クロストリジウムの含有比率を予測する学習済みモデルの予測精度の検証例を示している。グラフ１５３０において横軸は、検証対象の各対象サンプルについて、次世代シーケンサ１３０を用いて解析することで得られたクロストリジウムの含有比率（実測値％）を示している。また、グラフ１５３０において縦軸は、検証対象の各対象サンプルについて、クロストリジウムの含有比率を予測する学習済みモデルを用いて予測されたクロストリジウムの含有比率（予測値％）を示している。 Graph 1530 shows an example of verification of the prediction accuracy of a trained model that predicts the Clostridium content ratio. In graph 1530, the horizontal axis shows the Clostridium content ratio (actual measured value %) obtained by analyzing each target sample under verification using the next-generation sequencer 130. In addition, in graph 1530, the vertical axis shows the Clostridium content ratio (predicted value %) predicted for each target sample under verification using the trained model that predicts the Clostridium content ratio.

グラフ１５３０内の各点は、各対象サンプルの実測値と予測値とに対応する位置にプロットされた点である。また、グラフ１５３０内の直線１５３１は、実測値を説明変数、予測値を目的変数とする線形回帰式を示している。グラフ１５３０によれば、決定係数Ｒ^２＝０．７５８４となり、クロストリジウムの含有比率を予測する学習済みモデルの予測精度が良好であることが示された。 Each point in graph 1530 is plotted at a position corresponding to the actual measured value and predicted value of each target sample. A straight line 1531 in graph 1530 represents a linear regression equation with the actual measured value as the explanatory variable and the predicted value as the objective variable. According to graph 1530, the coefficient of determination ^R2 = 0.7584, indicating that the trained model for predicting the Clostridium content ratio has good prediction accuracy.

グラフ１５４０は、エシェリヒア（大腸菌）の含有比率を予測する学習済みモデルの予測精度の検証例を示している。グラフ１５４０において横軸は、検証対象の各対象サンプルについて、次世代シーケンサ１３０を用いて解析することで得られたエシェリヒアの含有比率（実測値％）を示している。また、グラフ１５４０において縦軸は、検証対象の各対象サンプルについて、エシェリヒアの含有比率を予測する学習済みモデルを用いて予測されたエシェリヒアの含有比率（予測値％）を示している。 Graph 1540 shows an example of verification of the prediction accuracy of a trained model that predicts the content ratio of Escherichia (E. coli). In graph 1540, the horizontal axis shows the content ratio of Escherichia (actual value %) obtained by analyzing each target sample being verified using the next-generation sequencer 130. In addition, in graph 1540, the vertical axis shows the content ratio of Escherichia (predicted value %) predicted for each target sample being verified using the trained model that predicts the content ratio of Escherichia.

グラフ１５４０内の各点は、各対象サンプルの実測値と予測値とに対応する位置にプロットされた点である。また、グラフ１５４０内の直線１５４１は、実測値を説明変数、予測値を目的変数とする線形回帰式を示している。グラフ１５４０によれば、決定係数Ｒ^２＝０．９１５４となり、エシェリヒアの含有比率を予測する学習済みモデルの予測精度が良好であることが示された。 Each point in graph 1540 is plotted at a position corresponding to the actual measured value and predicted value of each target sample. A straight line 1541 in graph 1540 represents a linear regression equation with the actual measured value as the explanatory variable and the predicted value as the objective variable. According to graph 1540, the coefficient of determination ^R2 = 0.9154, indicating that the trained model for predicting the Escherichia content ratio has good prediction accuracy.

（２）最も含有比率の高い微生物の種類を予測する学習済みモデル
図１６は、最も含有比率の高い微生物の種類を予測する学習済みモデルの予測精度の検証例を示す図である。表１６００において、横方向に配列された各細菌種は、検証対象の各対象サンプルについて、最も含有比率の高い微生物の種類を予測する学習済みモデルを用いて予測された、最も含有比率の高い細菌種の種類（予測値）を示している。表１６００において、縦方向に配列された各細菌種は、検証対象の各サンプルについて、次世代シーケンサ１３０を用いて解析することで得られた、最も含有比率の高い微生物の種類（実測値）を示している。 (2) Trained Model for Predicting the Most Prevalent Microorganism Type Figure 16 is a diagram showing an example of verification of the prediction accuracy of a trained model for predicting the most prevalent microorganism type. In table 1600, each bacterial species arranged horizontally indicates the type of bacterial species with the highest prevalence (predicted value) for each target sample being verified, predicted using a trained model that predicts the most prevalent microorganism type. In table 1600, each bacterial species arranged vertically indicates the most prevalent microorganism type (actual value) obtained by analyzing each target sample being verified using the next-generation sequencer 130.

表１６００内の行１６０１は、実測値としてラクトバチルスが最も含有比率が高いことが解析された検証対象の対象サンプルが、１８個得られたことを示している。最も含有比率の高い微生物の種類を予測する学習済みモデルは、このうち１７個の対象サンプルについて、ラクトバチルスが最も含有比率が高い微生物であると予測することができた。 Row 1601 in Table 1600 shows that 18 test samples were analyzed to have the highest content of Lactobacillus as an actual measurement. The trained model, which predicts the type of microorganism with the highest content, was able to predict that Lactobacillus was the microorganism with the highest content in 17 of these test samples.

また、表１６００内の行１６０２は、実測値としてストレプトコッカス（レンサ球菌）が最も含有比率が高いことが解析された検証対象の対象サンプルが、１７個得られたことを示している。最も含有比率の高い微生物の種類を予測する学習済みモデルは、このうち１６個の対象サンプルについて、ストレプトコッカスが最も含有比率が高い微生物であると予測することができた。 Row 1602 in table 1600 shows that 17 target samples were analyzed for validation and were found to have the highest content of Streptococcus (Streptococcus) as an actual measurement. The trained model, which predicts the type of microorganism with the highest content, was able to predict that Streptococcus was the microorganism with the highest content in 16 of these target samples.

また、表１６００内の行１６０３は、実測値としてクロストリジウムが最も含有比率が高いことが解析された検証対象の対象サンプルが、１２個得られたことを示している。最も含有比率の高い微生物の種類を予測する学習済みモデルは、このうち１１個の対象サンプルについて、クロストリジウムが最も含有比率が高い微生物であると予測することができた。 Furthermore, row 1603 in table 1600 shows that 12 target samples were analyzed for validation and were found to have the highest content of Clostridium as an actual measured value. The trained model, which predicts the type of microorganism with the highest content rate, was able to predict that Clostridium was the microorganism with the highest content rate for 11 of these target samples.

また、表１６００内の行１６０４は、実測値としてエシェリヒア（大腸菌）が最も含有比率が高いことが解析された検証対象の対象サンプルが、１８個得られたことを示している。最も含有比率の高い微生物の種類を予測する学習済みモデルは、このうち１８個の対象サンプルについて、エシェリヒアが最も含有比率が高い微生物であると予測することができた。 Furthermore, row 1604 in table 1600 shows that 18 target samples were analyzed for validation and were found to contain the highest percentage of Escherichia (E. coli) as an actual measurement. The trained model, which predicts the type of microorganism with the highest percentage, was able to predict that Escherichia was the microorganism with the highest percentage for 18 of these target samples.

この結果、最も含有比率の高い微生物の種類を予測する学習済みモデルの正答率は９５．４％（＝６２／６５×１００）であることが検証された。 As a result, it was verified that the accuracy rate of the trained model in predicting the most frequently occurring microbial species was 95.4% (= 62/65 x 100).

＜まとめ＞
以上の説明から明らかなように、第１の実施形態に係る装置である微生物叢解析装置１４０は、学習フェーズにおいて、
・学習用の対象サンプル１０２を撮影した信号（撮影信号）を分光することで生成されるハイパースペクトル画像を取得する。
・ハイパースペクトル画像に基づいてテクスチャを表す特徴量を算出する。
・学習用の対象サンプル１０２について算出された画像特徴量と、学習用の対象サンプル１０２に含まれる微生物の状態を表すデータと、を含む学習用データを用いて学習することで学習済みモデルを生成する。 <Summary>
As is clear from the above explanation, the microbiome analyzer 140 according to the first embodiment performs the following in the learning phase:
A hyperspectral image is obtained by dispersing signals (photographed signals) obtained by photographing the target sample 102 for learning.
・Calculate features that represent texture based on hyperspectral images.
- A trained model is generated by training using training data including image features calculated for the training target sample 102 and data representing the state of microorganisms contained in the training target sample 102.

また、第１の実施形態に係る装置である微生物叢解析装置１４０は、予測フェーズにおいて、
・学習済みモデルに、予測用の対象サンプル１００２について算出された、テクスチャを表す特徴量を入力することで、予測用の対象サンプル１００２に含まれる微生物の状態を表すデータを予測する。 Furthermore, the microbiome analyzer 140 according to the first embodiment performs the following in the prediction phase:
- By inputting features representing the texture calculated for the target sample 1002 for prediction into the trained model, data representing the state of microorganisms contained in the target sample 1002 for prediction is predicted.

このように、第１の実施形態に係る装置である微生物叢解析装置１４０では、ハイパースペクトル画像に基づきテクスチャを表す特徴量を算出することで、対象サンプルに含まれる微生物の分布を、波長ごとに捉える。このため、第１の実施形態によれば、従来のように、単に各波長での反射強度を解析する方法（反射スペクトルを用いる方法）と比較して、対象サンプルに含まれる微生物の量を適確に表す特徴量を算出することが可能になる。 In this way, the microbiome analysis device 140 according to the first embodiment calculates features that represent texture based on hyperspectral images, thereby capturing the distribution of microorganisms contained in a target sample for each wavelength. Therefore, according to the first embodiment, it is possible to calculate features that accurately represent the amount of microorganisms contained in a target sample, compared to conventional methods that simply analyze the reflection intensity at each wavelength (methods that use the reflection spectrum).

この結果、第１の実施形態によれば、反射特性の異なる微生物が予測用の対象サンプル内に混在する場合であっても、各種微生物の状態を表すデータを精度よく予測することが可能になる。 As a result, according to the first embodiment, even if microorganisms with different reflection characteristics are mixed within the target sample for prediction, it is possible to accurately predict data representing the state of various microorganisms.

つまり、第１の実施形態に係る装置である微生物叢解析装置１４０によれば、微生物叢について、各種微生物の状態を解析する際の解析精度を向上させることができる。 In other words, the microbiome analyzer 140 according to the first embodiment can improve the analytical accuracy when analyzing the state of various microorganisms in a microbiome.

［第２の実施形態］
上記第１の実施形態では、近赤外光出力機器１１０から出射された近赤外光を、対象サンプルに照射し、ハイパースペクトルカメラ１２０が、対象サンプルからの反射光を撮影するように構成した。しかしながら、ハイパースペクトルカメラ１２０が撮影するのは、対象サンプルからの反射光に限定されない。例えば、ハイパースペクトルカメラ１２０は、対象サンプルを透過した透過光を撮影してもよい。 Second Embodiment
In the first embodiment, the near-infrared light emitted from the near-infrared light output device 110 is irradiated onto a target sample, and the hyperspectral camera 120 captures the light reflected from the target sample. However, the light captured by the hyperspectral camera 120 is not limited to the light reflected from the target sample. For example, the hyperspectral camera 120 may capture the light transmitted through the target sample.

また、上記第１の実施形態では、微生物叢解析システムが近赤外光出力機器を備え、対象サンプルに近赤外光を照射する構成とした。しかしながら、対象サンプルに照射する光は、近赤外光に限定されず、他の波長範囲の光を照射してもよい。 Furthermore, in the first embodiment described above, the microbiome analysis system is equipped with a near-infrared light output device and is configured to irradiate the target sample with near-infrared light. However, the light irradiated onto the target sample is not limited to near-infrared light, and light in other wavelength ranges may also be irradiated.

また、上記第１の実施形態では、微生物叢解析システムがハイパースペクトルカメラ１２０を備え、ハイパースペクトル画像を生成する構成とした。しかしながら、微生物叢解析システムの構成はこれに限定されず、ハイパースペクトルカメラ１２０に代えて、例えば、近赤外線カメラと分光フィルタとを配し、撮影信号を分光することで分光画像を生成する構成としてもよい。 Furthermore, in the first embodiment described above, the microbiome analysis system is configured to include a hyperspectral camera 120 and generate hyperspectral images. However, the configuration of the microbiome analysis system is not limited to this, and instead of the hyperspectral camera 120, for example, a near-infrared camera and a spectral filter may be provided, and a spectral image may be generated by spectrally dispersing the captured signal.

また、上記第１の実施形態では、正規化部６１０が正規化する際の特定画素の画素値として、波長＝１４４０ｎｍのハイパースペクトル画像に含まれる、水領域の画素の画素値を用いた。しかしながら、正規化部６１０が正規化する際に用いる特定画素の画素値はこれに限定されず、他の波長のハイパースペクトル画像に含まれる他の画素の画素値を用いてもよい。 In addition, in the first embodiment described above, the pixel value of a pixel in the water region included in the hyperspectral image at wavelength = 1440 nm was used as the pixel value of the specific pixel when the normalization unit 610 normalized. However, the pixel value of the specific pixel used by the normalization unit 610 when normalizing is not limited to this, and the pixel value of another pixel included in the hyperspectral image at another wavelength may also be used.

また、上記第１の実施形態では、テクスチャを表す特徴量には任意の特徴量が含まれるものとし、具体例として、各画素の画素値の度数分布を表すヒストグラムに基づく特徴量（統計量）、差分統計量、濃度共起行列、フーリエ特徴量等を示した。しかしながら、ここでいう任意の特徴量には、古典的な画像処理技術を用いて算出される特徴量に限定されず、例えば、ＣＮＮ、ＶｉｓｉｏｎＴｒａｎｓｆｏｒｍｅｒ等の深層学習技術を用いて算出される変数等であってもよい。 Furthermore, in the first embodiment described above, the features representing texture include arbitrary features, and specific examples include features (statistics) based on a histogram representing the frequency distribution of pixel values for each pixel, difference statistics, a density co-occurrence matrix, and Fourier features. However, the arbitrary features referred to here are not limited to features calculated using classical image processing techniques, and may also include, for example, variables calculated using deep learning techniques such as CNN and Vision Transformer.

また、上記第１の実施形態では、画像特徴量算出部５０２の処理の具体例として、画像特徴量算出部５０２が、正規化、ヒストグラムの生成、特徴量算出の順で処理を行う場合について説明した。しかしながら、画像特徴量算出部５０２が行う処理の順序はこれに限定されず、画像特徴量算出部５０２は、例えば、ヒストグラムの生成、特徴量算出、正規化の順に処理を行ってもよい。ただし、いずれの処理順序であっても、画像特徴量算出部５０２は、正規化処理した画像特徴量を算出するものとする。 Furthermore, in the above first embodiment, as a specific example of the processing of the image feature amount calculation unit 502, a case has been described in which the image feature amount calculation unit 502 performs processing in the order of normalization, histogram generation, and feature amount calculation. However, the order of processing performed by the image feature amount calculation unit 502 is not limited to this, and the image feature amount calculation unit 502 may perform processing in the order of, for example, histogram generation, feature amount calculation, and normalization. However, regardless of the processing order, the image feature amount calculation unit 502 calculates image feature amounts that have been normalized.

また、上記第１の実施形態では、画像特徴量算出部５０２における正規化処理の一例として、
・各波長のハイパースペクトル画像（特定波長のハイパースペクトル画像を除くハイパースペクトル画像）に含まれる各画素の画素値を、
・特定波長のハイパースペクトル画像に含まれる特定画素の画素値で、
除算する方法を挙げたが、正規化処理の方法はこれに限定されない。例えば、除算することに代えて、減算することで、あるいは、他の演算を行うことで正規化処理を行ってもよい。なお、画像特徴量算出部５０２における正規化処理は、必須の構成ではなく、画像特徴量算出部５０２は、正規化部６１０を有していなくてもよい。 In the first embodiment, as an example of normalization processing in the image feature amount calculation unit 502,
The pixel value of each pixel included in the hyperspectral image of each wavelength (hyperspectral image excluding the hyperspectral image of a specific wavelength) is
- The pixel value of a specific pixel contained in a hyperspectral image of a specific wavelength,
Although the division method has been described above, the normalization method is not limited to this. For example, normalization may be performed by subtraction instead of division, or by performing other operations. Note that the normalization process in the image feature amount calculation unit 502 is not an essential component, and the image feature amount calculation unit 502 does not necessarily have to include the normalization unit 610.

なお、上記実施形態に挙げた構成等に、その他の要素との組み合わせ等、ここで示した構成に本発明が限定されるものではない。これらの点に関しては、本発明の趣旨を逸脱しない範囲で変更することが可能であり、その応用形態に応じて適切に定めることができる。 Note that the present invention is not limited to the configurations described in the above embodiments, and may be combined with other elements. These aspects may be modified without departing from the spirit of the present invention, and may be determined appropriately depending on the application form.

１００：微生物叢解析システム
１１０：近赤外光出力機器
１２０：ハイパースペクトルカメラ
１３０：次世代シーケンサ
１４０：微生物叢解析装置
４００：微生物叢データ
５０１：ハイパースペクトル画像取得部
５０２：画像特徴量算出部
５０３：微生物叢データ取得部
５０４：学習用データ生成部
５０５：学習部
６１０：正規化部
６２０：ヒストグラム生成部
６３０：特徴量算出部
７１０：画像特徴量絞り込み部
７２０：結合部
１１０１：予測部
１１０２：出力部 100: Microbiota analysis system 110: Near-infrared light output device 120: Hyperspectral camera 130: Next-generation sequencer 140: Microbiota analysis device 400: Microbiota data 501: Hyperspectral image acquisition unit 502: Image feature calculation unit 503: Microbiota data acquisition unit 504: Learning data generation unit 505: Learning unit 610: Normalization unit 620: Histogram generation unit 630: Feature calculation unit 710: Image feature narrowing unit 720: Combination unit 1101: Prediction unit 1102: Output unit

Claims

an acquisition unit that acquires a spectral image generated by dispersing an image signal obtained by capturing an image of an object;
a feature amount calculation unit that calculates a feature amount representing a texture based on the spectral image;
and a prediction unit that predicts data representing the state of microorganisms contained in an object for prediction by inputting the feature amounts calculated for the object for prediction into a trained model that has been trained using training data including the feature amounts calculated for the object for learning and data representing the state of microorganisms contained in the object for learning.

The imaging signal is a signal obtained by irradiating a target with light in a predetermined wavelength range.
10. The apparatus of claim 1.

some of the feature amounts of some of the spectroscopic images are selected from the plurality of feature amounts of the respective spectroscopic images calculated by the feature amount calculation unit, and are used for the learning or the prediction.
10. The apparatus of claim 1.

the feature amount includes a statistical amount of pixel values of pixels included in the spectroscopic image;
10. The apparatus of claim 1.

the feature amount calculation unit calculates the feature amount after normalization processing;
10. The apparatus of claim 1.

the normalization process is a process of performing an operation on pixel values of a spectroscopic image of a specific wavelength among the spectroscopic images acquired by the acquisition unit, excluding the spectroscopic image of the specific wavelength, using pixel values of the spectroscopic image of the specific wavelength.
6. The apparatus of claim 5.

The subject is a biologically or environmentally derived sample containing a single or multiple species of microorganisms.
10. The apparatus of claim 1.

The data representing the state of the microorganisms includes any one of the type, amount, and content ratio of the microorganisms contained in the target object.
10. The apparatus of claim 1.

The data representing the state of microorganisms is a content ratio of each type of microorganism, which represents a ratio of the amount of each type of microorganism contained in the target object to the total amount of all types of microorganisms,
the prediction unit predicts the content ratio of each type of microorganism contained in the prediction target object using trained models, the number of which corresponds to the number of types of microorganisms;
9. The apparatus of claim 8.

the data representing the state of the microorganisms is data representing the type of microorganism most commonly found in the target object;
the prediction unit predicts the type of microorganism most frequently contained in the object for prediction using a trained model that predicts the type of microorganism most frequently contained in the object;
9. The apparatus of claim 8.

acquiring a spectral image generated by dispersing an image signal obtained by capturing an image of an object;
calculating a feature quantity representing a texture based on the spectral image;
and predicting data representing the state of microorganisms contained in an object for prediction by inputting the feature values calculated for the object for prediction into a trained model trained using training data including the feature values calculated for the object for training and data representing the state of microorganisms contained in the object for training.

acquiring a spectral image generated by dispersing an image signal obtained by capturing an image of an object;
calculating a feature quantity representing a texture based on the spectral image;
and a step of predicting data representing the state of microorganisms contained in an object for prediction by inputting the feature values calculated for the object for prediction into a trained model trained using training data including the feature values calculated for the object for learning and data representing the state of microorganisms contained in the object for learning.