JP5206366B2

JP5206366B2 - 3D data creation device

Info

Publication number: JP5206366B2
Application number: JP2008302126A
Authority: JP
Inventors: 一記喜多
Original assignee: Casio Computer Co Ltd
Current assignee: Casio Computer Co Ltd
Priority date: 2008-11-27
Filing date: 2008-11-27
Publication date: 2013-06-12
Anticipated expiration: 2028-11-27
Also published as: JP2010128742A

Description

本発明は、３次元データ作成装置に関する。本発明の産業上の利用分野としては、例えば、デジタルカメラ等で撮影した画像データを受信（入力）して胸像や銅像等の３次元人物像を作成するための加工用モデルデータや、ＣＧキャラクタ、アバータ用の３次元データの作成と、画像データの受付から３次元人物像加工用データ作成装置で作成した３次元データを用いたシステムを構築することである。 The present invention relates to a three-dimensional data creation device. Industrial application fields of the present invention include, for example, processing model data for receiving (inputting) image data taken by a digital camera or the like and creating a three-dimensional human image such as a chest image or a bronze image, or a CG character The creation of a three-dimensional data for an averter and the construction of a system using the three-dimensional data created by the three-dimensional human image processing data creation device from the reception of the image data.

従来、胸像や銅像などを作成するには、対象者の周囲からの画像を撮影して、彫刻作家や鋳物師などに注文し、彫刻作家は写真と実物モデルとを元に、彫塑原型を手作業で作成し、それを元に石膏型取りし、石膏像を基に鋳物師が銅像などに加工し、着色加工して仕上げたりしていた。このような彫刻家による従来工法では、ユーザーにとってもはじっとしている必要があるなどの手間がかかり、彫刻家にとっても作成日数がかかるなど難点があった。また、彫刻家の技量やセンスに左右されるため、芸術性は高いがモデルに対する忠実度（リアリティ）は必ずしも高くなかった。 Traditionally, to create a bust or bronze statue, take an image of the subject's surroundings and place an order with a sculptor or founder. The sculptor manually works on the sculpture prototype based on the photograph and the actual model. The mold was made from this, and a plaster mold was made based on it, and the founder processed it into a copper image based on the plaster image, and finished it by coloring it. In the conventional method by such a sculptor, there is a problem that it is necessary for the user to be reluctant, and the sculptor takes a long time to create. Also, because it depends on the skill and sense of the sculptor, the artistry is high, but the fidelity (reality) to the model is not necessarily high.

また、従来から、人体モデルの立体像やレリーフ像（浮彫像）を制作する方法として、モデルを被写体として写真撮影し、この写真（撮影画像）をもとに像を制作する方法が各種提案されている。
このような方法の例として、モデルを被写体として写真撮影し、または平行線スクリーン（縞模様）などをモデルに投影して写真撮影し、この写真像を、粘土など材料表面上に投影して、それをなぞるように、手動で彫塑加工して、像を制作する方法などが、提案されている（例えば、特許文献１参照）。同特許文献１には、被写物の周囲に、多数のカメラと投光機と多数の平行線を描いたスクリーンを配して、被写物に平行線スクリーンを投影したときの画像を多数のカメラで撮影した後、相似位置の投光機で原型の材料面上に、前記撮影された写真画像と前記平行線スクリーンを投影し、材料上でのスクリーンと写真上の線条が相重なるように、材料を加工する立体写真像製作方法が開示されている。 In addition, various methods have been proposed for producing stereoscopic images and relief images (relief images) of human models, and taking images of the model as a subject and producing images based on these photos (photographed images). ing.
As an example of such a method, a model is photographed as a subject, or a parallel line screen (striped pattern) is projected on the model and photographed, and this photographic image is projected on the surface of a material such as clay, In order to trace this, a method of manually engraving and creating an image has been proposed (see, for example, Patent Document 1). In Patent Document 1, many cameras, projectors, and screens depicting a large number of parallel lines are arranged around the object, and a large number of images are obtained when the parallel line screen is projected onto the object. After shooting with the above camera, the photographed photographic image and the parallel line screen are projected onto the original material surface with a projector at a similar position, and the screen on the material and the line on the photograph overlap. Thus, a method for producing a stereoscopic photographic image by processing a material is disclosed.

しかし、上述の立体写真像製作方法では、撮影や彫塑加工作業などを人手で行なうもので、彫刻家による方法よりはいくらか自動化されているものの、半手動式であるため、効率が悪く、加工精度や忠実度（リアリティ）に難点があった。 However, in the above-described stereoscopic photo image production method, photographing and sculpting work are performed manually, and although it is somewhat automated than the method by the sculptor, it is semi-manual, so it is inefficient and processing accuracy is low. And fidelity (reality).

このような立体像（３次元像）作成の問題点を解消するものとして、自動撮影機などで撮影した周囲３６０度からの画像から３次元データを計測する技術がある（例えば、特許文献２参照）。同特許文献２には、被写物の周囲に、多数のカメラを配して、同一の被写物を撮影した後、前記カメラと相似位置の投光機で原型の材料面上に、前記カメラ撮影された写真画像を投影し、写真による材料上での線条が互いに重なるように、材料を加工する立体写真像製作方法が開示されている。 As a technique for solving such a problem of creating a three-dimensional image (three-dimensional image), there is a technique for measuring three-dimensional data from an image taken from a surrounding 360 degrees photographed by an automatic photographing machine or the like (for example, see Patent Document 2). ). In Patent Document 2, a large number of cameras are arranged around the object, and the same object is photographed. Then, on the original material surface with a projector similar to the camera, There has been disclosed a method for producing a stereoscopic photographic image by projecting a photographic image taken by a camera and processing the material so that the stripes on the material by the photographic material overlap each other.

そして、このような技術に基づいて、自動切削機や３次元粉体プリンターなどで、胸像や銅像などを製作するサービスも実施されている。また、工業用の製品や部品の試作やサンプル作成には、最近では、光造形加工や、３Ｄの粉体プリンターなども用いられるようになってきた。 Based on such technology, a service for producing a chest image, a copper image, etc. is also carried out with an automatic cutting machine, a three-dimensional powder printer, or the like. In addition, recently, optical modeling and 3D powder printers have come to be used for prototyping of industrial products and parts and sample preparation.

また、自動撮影機などで撮影した画像から３次元データを生成するソフトウェア技術がある（例えば非特許文献１参照）。 Further, there is a software technique for generating three-dimensional data from an image photographed by an automatic photographing machine (see, for example, Non-Patent Document 1).

「デジタルカメラ画像から簡単に３次元データを生成するソフトウェア技術」ＳＡＮＹＯＴＥＣＨＮＩＣＡＬＲＥＶＩＥＷＣＯＬ．３５ＮＯ．１ＪＵＮ．２００３“Software Technology for Easily Generating 3D Data from Digital Camera Images” SANYO TECHNIC REVIEW COL. 35 NO. 1 JUN. 2003 特開昭５４−１１４２３１号公報JP 54-114231 A 特開平０１−１１３７４４号公報Japanese Patent Laid-Open No. 01-113744

しかしながら、特許文献２に開示されているように、自動撮影機から３次元データを自動作成する場合でも、周囲３６０度から多数枚の画像やアクティブスキャン画像などを撮影する３６０度撮影台や専用撮影機を必要とするので、ユーザーは、専用撮影機の設置場所まで赴いて写真を撮影する必要があり、面倒であった。また、写真画像から３次元データを生成するには、専門家による調整作業や胸像加工用のデータ変換作業などが必要であった。 However, as disclosed in Patent Document 2, even when three-dimensional data is automatically created from an automatic photographing machine, a 360-degree photographing stand for photographing a large number of images, active scan images, and the like from 360 degrees around or dedicated photographing. Since the camera is required, the user has to go to the place where the dedicated camera is installed to take pictures, which is troublesome. Further, in order to generate three-dimensional data from a photographic image, adjustment work by an expert, data conversion work for bust processing, and the like are necessary.

本発明の主たる目的は、同一被写体を任意の角度で撮影した複数の画像から自動的に３次元データを作成する３次元データ作成装置を提供することである。 A main object of the present invention is to provide a three-dimensional data creation apparatus that automatically creates three-dimensional data from a plurality of images obtained by photographing the same subject at an arbitrary angle.

上記課題を解決するために、請求項１に記載の発明では、立体像の標準モデルデータからなるデータベースと、複数枚の多視点画像の画像データを取得する画像データ取得手段と、３次元データ作成装置を、前記複数枚の多視点画像から３次元形状データを生成する３次元形状データ生成手段として機能させる３次元形状データ生成プログラムと、前記画像データ取得手段により取得した複数枚の画像データからそれぞれ人物領域を抽出し、抽出した人物領域から検出した顔の特徴部位と前記データベースの標準モデルデータとの対応点を取得してから、前記３次元形状データ生成プログラムにより３次元形状データを得る３次元データ作成制御手段と、前記３次元形状データを補正して顔の表情変化モデルデータを得る表情変化補正手段と、を備え、前記表情変化補正手段は、表情の種類と表情動作単位の組合せを対応付けた対応表データと、表情動作単位と顔の筋肉の収縮部位を対応付けた対応表データを含み、顔の表情を表情単位動作の組合せで表現し、各表上単位動作を対応する部位の顔の筋肉の収縮に変換することにより、喜怒哀楽を示している表情のデータを中立顔のデータに補正する、または中立顔のデータを喜怒哀楽を示している表情のデータに補正する、ことを特徴とする３次元データ作成装置を提供する。 In order to solve the above-described problems, in the invention described in claim 1, a database including standard model data of a stereoscopic image, image data acquisition means for acquiring image data of a plurality of multi-view images, and three-dimensional data generation A three-dimensional shape data generation program that causes the apparatus to function as a three-dimensional shape data generation unit that generates three-dimensional shape data from the plurality of multi-viewpoint images, and a plurality of image data acquired by the image data acquisition unit, respectively. A three-dimensional shape is obtained by extracting a human region and obtaining corresponding points between the feature portion of the face detected from the extracted human region and the standard model data of the database, and obtaining three-dimensional shape data by the three-dimensional shape data generation program a data generation control unit, and the facial expression change correction means the three-dimensional shape data corrected by the obtaining expression change model data of the face Wherein the expression change correcting means includes a correspondence table data that associates the combination of facial expression types and expression operation unit, the correspondence table data that associates muscle contraction portion expressions operation unit and the face, the face By expressing facial expressions as a combination of facial expression unit movements and converting each table unit movement to the contraction of the facial muscles of the corresponding part, the facial expression data showing emotions is corrected to neutral face data. Or a three-dimensional data creation device , wherein the neutral face data is corrected to facial expression data indicating emotions .

また、請求項２に記載の発明では、前記画像データ取得手段は、取得する画像データの撮影位置や撮影方向等のカメラ情報が既知の場合にそのカメラ情報を取得するカメラ情報取得手段を含み、前記立体像の標準モデルデータは、人物の顔および頭部の標準モデルデータ、表情変化の標準モデルデータを含み、前記３次元データ作成制御手段は、抽出した人物領域から検出した顔の特徴部位と前記人物の顔の標準モデルデータとの対応点から表情データを取得し、表情データにおける顔の表情と前記標準モデルデータの表情とを比較して表情データにおける顔の表情の変化が閾値より大きい場合には前記表情変化のモデルデータに基づいて顔の表情を所定の表情に補正し、前記カメラ情報取得手段によりカメラ情報が既知の場合には３次元形状データ生成プログラムに含まれている第１の３次元形状データ生成プログラムにより前記人物の３次元形状データを生成し、カメラ情報が未知の場合には３次元形状データ生成プログラムに含まれている第２の３次元形状データ生成プログラムにより前記人物の３次元形状データを生成する、ことを特徴とする請求項１に記載の３次元データ作成装置を提供する。
また、請求項３に記載の発明では、前記第１の３次元形状データ生成プログラムは、３次元データ作成装置を、前記画像データ取得手段によって取得した複数枚の多視点画像の画像データと前記カメラ情報取得手段によって取得されたカメラ情報を元に３次元形状データを生成する３次元形状データ生成手段として機能することを特徴とする請求項１に記載の３次元データ作成装置を提供する。 Further, in the invention according to claim 2, the image data acquisition means includes camera information acquisition means for acquiring the camera information when the camera information such as the shooting position and the shooting direction of the acquired image data is known, The standard model data of the three-dimensional image includes standard model data of a person's face and head, and standard model data of facial expression change. The three-dimensional data creation control means includes a facial feature portion detected from the extracted person region, When facial expression data is acquired from corresponding points with the standard model data of the person's face and the facial expression in the facial expression data is compared with the facial expression in the standard model data, and the change in facial expression in the facial expression data is greater than the threshold The facial expression is corrected to a predetermined facial expression based on the facial expression change model data, and if the camera information is known by the camera information acquisition means, the three-dimensional shape is corrected. The 3D shape data of the person is generated by the first 3D shape data generation program included in the data generation program, and the second information included in the 3D shape data generation program when the camera information is unknown. The three-dimensional data creation apparatus according to claim 1, wherein the three-dimensional shape data of the person is generated by the three-dimensional shape data generation program.
According to a third aspect of the present invention, the first three-dimensional shape data generation program uses a three-dimensional data generation apparatus to obtain image data of a plurality of multi-viewpoint images acquired by the image data acquisition unit and the camera. The three-dimensional data creation device according to claim 1, which functions as a three-dimensional shape data generation unit that generates three-dimensional shape data based on camera information acquired by the information acquisition unit.

また、請求項４に記載の発明では、前記第２の３次元形状データ生成プログラムは、３次元データ作成装置を、前記画像データ取得手段によって取得した複数枚の多視点画像の画像データを元に視体積交差法によって３次元形状データを生成する３次元形状データ生成手段として機能することを特徴とする請求項１に記載の３次元データ作成装置を提供する。 According to a fourth aspect of the present invention, the second three-dimensional shape data generation program uses a three-dimensional data generation device based on image data of a plurality of multi-viewpoint images acquired by the image data acquisition unit. The three-dimensional data creation device according to claim 1, which functions as a three-dimensional shape data generation unit that generates three-dimensional shape data by a visual volume intersection method.

また、請求項５に記載の発明では、前記第２の３次元形状データ生成プログラムは、３次元データ作成装置を、前記画像データ取得手段によって取得した複数枚の多視点画像の画像データを元に因子分解法によって３次元形状データを生成する３次元形状データ生成手段として機能することを特徴とする請求項１に記載の３次元データ作成装置を提供する。 According to a fifth aspect of the present invention, the second three-dimensional shape data generation program uses a three-dimensional data generation device based on image data of a plurality of multi-viewpoint images acquired by the image data acquisition unit. The three-dimensional data creation device according to claim 1, which functions as a three-dimensional shape data generation unit that generates three-dimensional shape data by a factorization method.

また、請求項６に記載の発明では、前記立体像の標準モデルデータは、人物の顔および頭部の標準モデルデータ、表情変化の標準モデルデータ、服装および髪型の標準モデルデータを含み、前記３次元データ作成制御手段は、抽出した人物領域から検出した顔の特徴部位を元に各画像データから頭部の形および顔貌の特徴を抽出し、前記人物の顔および頭部の標準モデルデータを検索して頭部の形および顔貌の特徴に最も類似する３次元モデルデータを取得し、取得した３次元モデルデータの曲面上に前記複数枚の多視点画像の画像データを射影変換して顔および頭部の３次元形状データを生成する、ことを特徴とする請求項１に記載の３次元データ作成装置を提供する。 Further, in the invention described in claim 6, the standard model data of the stereoscopic image includes the standard model data of the face and head of a person, the standard model data of facial expressions, the standard model data of clothing and hair, the 3 Dimensional data creation control means extracts head shape and facial features from each image data based on the facial feature detected from the extracted person region, and searches the standard model data of the person's face and head Then, 3D model data most similar to the shape of the head and the facial features is acquired, and the image data of the plurality of multi-viewpoint images are projectively transformed onto the curved surface of the acquired 3D model data to obtain the face and head. The three-dimensional data creation apparatus according to claim 1, wherein the three-dimensional shape data of the part is generated.

本発明によれば、任意の角度（および任意の時期、服装、表情）で撮影された画像でも、同一人物が写った複数の画像があればそれらの画像に基づいて、自動的に３次元データを生成できるので、従来のように、周囲３６０度の所定角度から撮影したり、専用の撮影台や自動撮影機を必要とせず、わざわざ自動撮影機の設置場所や彫刻作家の所に赴かなくても、写真や画像データを送付や送信するだけで作成される３次元データに基づいてリアルな胸像やフィギュアを、簡単且つ安価に作成できる。 According to the present invention, even if an image is taken at an arbitrary angle (and at an arbitrary time, clothes, facial expression), if there are a plurality of images showing the same person, three-dimensional data is automatically generated based on those images. As usual, you can shoot from a predetermined angle of 360 degrees around, and do not need a dedicated shooting stand or automatic camera, so you can go to the place of the automatic camera and the sculptor. However, it is possible to easily and inexpensively create a realistic bust and figure based on the three-dimensional data that is created simply by sending and transmitting photographs and image data.

図１は、本発明の３次元データ作成装置を用いた３次元画像作成受託システム１００の構成例を示す図であり、３次元画像作成受託システム１００は３次元形状標準モデルデータ等の標準モデルデータを検索可能に格納した標準モデルデータベース３０（図４参照）を備え、任意の異なった角度で撮影した複数写真１０の画像データから３次元データを作成する３次元データ作成装置１（図３参照）、３次元データ作成装置１とインターネット等の通信ネットワーク４を介して接続するカメラ付き携帯電話５やデジタルカメラ６等の撮像端末や、３次元データ作成装置１で作成された３次元データを用いて３次元画像を表示するパソコン７等の表示端末、３次元データ作成装置１で作成された３次元データ（実施例では３次元モデル加工データ）を用いて３次元画像を自動的に製作する３次元像加工装置８（例えば、３次元自動切削加工機８−１、または光造形加工機８−２、または３次元粉体プリンター８−３）からなる。 FIG. 1 is a diagram showing a configuration example of a 3D image creation commissioning system 100 using the 3D data creation apparatus of the present invention. The 3D image creation commissioning system 100 is a standard model data such as 3D shape standard model data. 3D data creation device 1 (see FIG. 3) that creates 3D data from image data of a plurality of photographs 10 taken at arbitrary different angles. Using an imaging terminal such as a mobile phone with camera 5 and a digital camera 6 connected to the 3D data creation device 1 via a communication network 4 such as the Internet, or 3D data created by the 3D data creation device 1 3D data generated by a display terminal such as a personal computer 7 that displays a 3D image and the 3D data generation apparatus 1 (in the embodiment, 3D model processing data) 3D image processing apparatus 8 (for example, 3D automatic cutting machine 8-1, stereolithography machine 8-2, or 3D powder printer 8-3) that automatically manufactures a 3D image using ).

図１で、３次元データ作成装置１、標準モデルデータベース３０、および３次元画像加工装置３は受託会社側の装置であり、カメラ付き携帯電話５やデジタルカメラ６等の撮像端末や、パソコン７等の表示端末、３次元像加工装置８は委託者側の装置である。 In FIG. 1, a three-dimensional data creation device 1, a standard model database 30, and a three-dimensional image processing device 3 are devices on the side of a contract company, such as an imaging terminal such as a camera-equipped mobile phone 5 and a digital camera 6, a personal computer 7, etc. The display terminal 3D image processing apparatus 8 is a consignor side apparatus.

ここで、３次元自動切削加工機８−１、光造形加工機８−２、３次元粉体プリンター８−３は公知の３次元像加工装置であり、３次元自動切削加工機８−１は作成された３次元データを元に金属や固形樹脂、ガラス、木等を自動的に切削して胸像等の立体像を製作する装置であり、光造形加工機８−２は作成された３次元データを元に紫外線硬化樹脂を自動的にレーザ加工して立体像を製作する装置であり、３次元粉体プリンター８−３は作成された３次元データを元に粉体と接着剤の混合物をプリント技術により積層加工して立体像を製作する装置である。 Here, the three-dimensional automatic cutting machine 8-1, the stereolithography processing machine 8-2, and the three-dimensional powder printer 8-3 are known three-dimensional image processing apparatuses, and the three-dimensional automatic cutting machine 8-1 is It is a device that automatically cuts metal, solid resin, glass, wood, etc. based on the created three-dimensional data to produce a three-dimensional image such as a breast image. This is a device that automatically produces a 3D image by laser processing UV curable resin based on the data. The 3D powder printer 8-3 uses a mixture of powder and adhesive based on the created 3D data. It is a device that produces a three-dimensional image by laminating using a printing technique.

なお、実施例では３次元像加工装置８−１，８−２、８−３、・・は３次元データ作成装置１から通信ネットワーク７を介してデータを受信するための通信制御を備えているものとしたが、パソコン等７等の通信制御機能を備えた端末に接続するように構成し、３次元データ作成装置１から通信ネットワーク７を介してパソコン等７等の通信制御機能を備えた端末が受信した３次元データを用いて３次元画像を自動的に製作するようにしてもよい。 In the embodiment, the three-dimensional image processing apparatuses 8-1, 8-2, 8-3,... Have communication control for receiving data from the three-dimensional data creation apparatus 1 via the communication network 7. However, it is configured to connect to a terminal having a communication control function such as a personal computer 7 and the like, and a terminal having a communication control function such as a personal computer 7 from the three-dimensional data creation device 1 via the communication network 7 A three-dimensional image may be automatically produced using the three-dimensional data received.

また、図１に示すように、３次元データ作成装置１に３次元像加工装置３（例えば、３次元像加工装置８−１，８−２、８−３とほぼ同様な機能を有する３次元自動切削加工機３−１、または光造形加工機３−２、または３次元粉体プリンター３−３）を接続し、作成された３次元データを用いて３次元画像を自動的に製作するように構成してもよい。 Further, as shown in FIG. 1, the three-dimensional data creation device 1 includes a three-dimensional image processing device 3 (for example, a three-dimensional image having substantially the same function as the three-dimensional image processing devices 8-1, 8-2, and 8-3). Connect automatic cutting machine 3-1, stereolithography machine 3-2, or 3D powder printer 3-3) to automatically produce 3D images using the created 3D data You may comprise.

図２は、図１に示した３次元画像作成受託システム１００における３次元画像の受託作成プロセスを示すプロセスチャートである。図２で、委託者は、撮像端末５または６で撮影した多視点カメラ画像データ（同一人物を同じ撮影日時、同じ場所で、任意の異なった角度で撮影した複数の画像データ（図７（ｂ）参照）または同一人物を異なる撮影日時、異なる場所で撮影した異なる複数の画像データ（図７（ａ）参照）と、納入日、委託内容、作成委託するデータの種類、納入条件等の委託条件を、インターネット４を介して３次元データ作成装置１に送信する（プロセスＰ１）。なお、このプロセス１で画像データを送信する代わりに同一人物を任意の異なった角度で撮影した複数の写真または同一人物の複数の写真（撮影時期は異なっていてもよい）１０や人物画を郵送等の方法で委託者に渡すようにしてもよい。 FIG. 2 is a process chart showing a 3D image contract creation process in the 3D image creation contract system 100 shown in FIG. In FIG. 2, the consignor uses the multi-viewpoint camera image data captured by the imaging terminal 5 or 6 (a plurality of pieces of image data captured from the same person at the same shooting date and time and at the same place at any different angle (FIG. 7B )) Or multiple different image data of the same person taken at different shooting dates and times (see Fig. 7 (a)), delivery date, details of commission, type of data to be commissioned, delivery conditions, etc. Are transmitted to the three-dimensional data creation apparatus 1 via the Internet 4 (process P1), instead of transmitting image data in this process 1, a plurality of photographs taken by the same person at arbitrary different angles or the same A plurality of photographs of people (photographing times may be different) 10 or portraits may be delivered to the consignor by mail or the like.

３次元データ作成装置１は、画像データを受信すると受信した委託条件をメモリに登録すると共に、委託者の認証、課金処理等を行ない、受信した画像データをＲＡＭ等の一時記憶メモリに記憶する（プロセスＰ２）。
なお、プロセスＰ１で郵送等の方法で委託者から写真や人物画等を受け取った場合は、このプロセス２で受託者側のオペレータの指示操作に基づいて３次元データ作成装置１の画像入力処理部１４で写真等の読取および画像データ作成を行うと共に受付登録、委託者の認証、課金処理を行ない、画像データをＲＡＭ等の一時記憶メモリに記憶する。 When receiving the image data, the three-dimensional data creation device 1 registers the received consignment conditions in the memory, performs authentication of the consignor, billing processing, etc., and stores the received image data in a temporary storage memory such as a RAM ( Process P2).
When a photograph or a portrait is received from the contractor by mail or the like in the process P1, the image input processing unit of the three-dimensional data creation device 1 is based on the instruction operation of the operator on the trustee side in this process 2. In step 14, a photograph or the like is read and image data is created, reception registration, entrustee authentication, and billing processing are performed, and the image data is stored in a temporary storage memory such as a RAM.

次に、３次元データ作成装置１は、多視点カメラ画像データからそれぞれ顔領域を検出して、顔の大きさや向き、回転などを正規化した後、「Lucas-Kanade法（勾配法）」や、「Kanade-Lucas-Tomasiトラッカー」などにより、特徴点の移動や相違を追跡し、対応点を探索し、各点の移動、距離の深さ、カメラ位置、エッジモーションなどを解析して、カメラの位置や動き情報を求めて、任意の多視点画像とカメラ情報から形状データを求める（図１２参照）（プロセスＰ３）。 Next, the three-dimensional data creation apparatus 1 detects each face area from the multi-viewpoint camera image data, normalizes the size, orientation, rotation, etc. of the face, and then performs “Lucas-Kanade method (gradient method)” , "Kanade-Lucas-Tomasi tracker" etc. to track the movement and difference of feature points, search for corresponding points, analyze the movement of each point, distance depth, camera position, edge motion, etc. The position data and the movement information are obtained, and shape data is obtained from an arbitrary multi-viewpoint image and camera information (see FIG. 12) (process P3).

また、プロセスＰ３として、「射影グリッド空間における視体積交差法」などを用いて、３次元ボクセル（立体画素）空間を多数の立体格子に分割し、異なる視点の各画像におけるシルエット画像を３次元ボクセル空間に逆投影して（すなわち、各ボクセルが各画像に投影される場合にそのシルエットに含まれるか否かを判断して）、顔および頭部の３次元データを生成してもよい（図１３参照）。 In addition, as a process P3, a “3D voxel (stereopixel) space” is divided into a number of 3D grids using “view volume intersection method in projective grid space” or the like, and silhouette images in each image from different viewpoints are 3D voxels. 3D data of the face and head may be generated by backprojecting to space (ie, determining whether each voxel is included in the silhouette when projected on each image) (see FIG. 13).

また、プロセスＰ３として、複数画像における対応点から、「因子分解法」などにより、（計測行列）＝（移動行列）×（形状行列）における（移動行列）と（形状行列）とを同時に求め、（形状行列）から顔および頭部の３次元データを生成する方法等、他の方法を用いても良い（図１４参照）。 Further, as process P3, (measurement matrix) = (movement matrix) × (movement matrix) in (shape matrix) and (shape matrix) are simultaneously obtained from corresponding points in a plurality of images by a “factorization method” or the like. Other methods such as a method of generating face and head 3D data from the (shape matrix) may be used (see FIG. 14).

３次元データ作成装置１は、プロセス２で登録した委託条件を調べ（プロセスＰ４）、３次元データの作成の委託の場合は通信ネットワーク４を介して委託者のパソコン５または携帯電話６に作成した３次元モデルデータを送信すると共に請求書を発行する（プロセスＰ５）。 The 3D data creation device 1 checks the commission condition registered in the process 2 (process P4), and creates the 3D data on the commissioner's personal computer 5 or mobile phone 6 via the communication network 4 when commissioning the creation of the 3D data. The three-dimensional model data is transmitted and an invoice is issued (process P5).

委託条件が加工用３次元モデルデータの作成または立体像（または原型）の製作の場合は、加工装置の種類に応じた加工用３次元モデルデータを作成し、委託条件が加工用３次元モデルデータの納入の場合は納入条件に基づいて通信ネットワーク４を介して委託者の３次元像加工装置８（またはそれを接続するコンピュータ）に作成した加工用３次元モデルデータを送信すると共に請求書を発行する。なお、納入条件がＣＤ等の記録媒体納入の場合は作成した３次元モデルデータをＣＤに記録すると共に請求書を発行する（プロセスＰ６）。 If the consignment conditions are creation of 3D model data for processing or 3D image (or prototype) production, 3D model data for processing corresponding to the type of processing equipment is created, and the consignment conditions are 3D model data for processing. In the case of delivery, the processing 3D model data created and transmitted to the contractor's 3D image processing device 8 (or the computer to which it is connected) via the communication network 4 based on the delivery conditions and the invoice is issued To do. If the delivery condition is delivery of a recording medium such as a CD, the created three-dimensional model data is recorded on the CD and an invoice is issued (process P6).

また、委託条件が立体像（または原型）の製作の場合は、作成した加工用３次元モデルデータを受託者側の３次元像加工装置３−１、または３−２、または３−３に送信して加工を行わせ、加工が終了すると請求書を発行する（プロセス７）。 Further, when the consignment condition is production of a three-dimensional image (or a prototype), the created three-dimensional model data for processing is transmitted to the three-dimensional image processing device 3-1, 3-2, or 3-3 on the trustee side. Then, processing is performed, and when processing is completed, a bill is issued (process 7).

図３は本発明の３次元データ作成装置の構成例を示す図である。３次元データ作成装置１は、パソコン程度の処理能力を有するコンピュータからなり、３次元データ作成装置１全体の動作制御を行うと共に、３次元データ作成時には３次元データ作成プログラム（後述）に基づいて３次元データ作成処理制御を行うＣＰＵ１１、プログラムやデータを一時記憶したり作業領域として用いるＲＡＭ等の一時記憶メモリ１２、インターネット等の通信ネットワーク４を介して画像データや撮影位置や方向等のカメラ情報等のデータの送受信を行う送受信処理部１３、写真やデザイン画等の画像を読み取って画像データを作成する画像入力処理部１４、各種プログラム２１〜２３や顔形標準モデルデータベース３０や、委託者から受信した委託条件や課金情報、作成した３次元画像データ等を記録した保存メモリ２、操作用のキーボード１５、表示部１６および、図示していないが、作成した３次元データをＣＤ等の記録媒体に記録可能な媒体記録装置を備えている。なお、画像入力処理部１４、表示部１６は必須ではない。 FIG. 3 is a diagram showing a configuration example of the three-dimensional data creation apparatus of the present invention. The three-dimensional data creation device 1 is composed of a computer having a processing capability similar to that of a personal computer. The three-dimensional data creation device 1 controls the operation of the entire three-dimensional data creation device 1 and, at the time of creating three-dimensional data, is based on a three-dimensional data creation program (described later). CPU 11 for controlling dimensional data creation processing, temporary storage memory 12 such as RAM that temporarily stores programs and data or used as a work area, camera data such as image data, shooting position and direction, etc. via communication network 4 such as the Internet Received from the transmission / reception processing unit 13 that transmits and receives data, the image input processing unit 14 that reads images such as photographs and design images and creates image data, the various programs 21 to 23, the face shape standard model database 30, and the consignor Storage memory that records the commissioned conditions, billing information, created 3D image data, etc. , Keyboard 15 for operation, the display unit 16 and, although not shown, includes a recordable medium recording apparatus in a recording medium such as a CD 3D data created. The image input processing unit 14 and the display unit 16 are not essential.

また、３次元データ作成装置１に３次元像加工装置３−１、３−２、３−３等の３次元像加工装置３や３次元像加工装置８−１、８−２、８−３等の３次元像加工装置８に作成した３次元データを送信するためのインターフェイス１７を備えるようにしてもよい。 Further, the three-dimensional data creation device 1 includes a three-dimensional image processing device 3 such as a three-dimensional image processing device 3-1, 3-2, 3-3, or a three-dimensional image processing device 8-1, 8-2, 8-3. An interface 17 for transmitting the created three-dimensional data to the three-dimensional image processing apparatus 8 such as the above may be provided.

図４は３次元標準モデルデータベース３０の構成例を示す図であり、３次元標準モデルデータベース３０は、顔（人物の顔や頭部）の３次元標準モデルデータ３１や、表情変化の３次元標準モデルデータ３２、および服装、髪型等の３次元モデルデータ３３を体系的かつ検索可能に格納してなる。図３の例では３次元標準モデルデータベース３０を保存メモリ２に格納するようにしたが、３次元標準モデルデータベース３０を格納したメモリと保存メモリ２は別体でもよい。 FIG. 4 is a diagram illustrating a configuration example of the three-dimensional standard model database 30. The three-dimensional standard model database 30 includes three-dimensional standard model data 31 of a face (a person's face and head) and a three-dimensional standard of facial expression change. The model data 32 and the three-dimensional model data 33 such as clothes and hairstyle are stored in a systematic and searchable manner. In the example of FIG. 3, the 3D standard model database 30 is stored in the storage memory 2, but the memory storing the 3D standard model database 30 and the storage memory 2 may be separate.

顔の３次元標準モデルデータ３１は、同一人物を任意の角度で撮影した複数枚の画像（任意の多視点画像）データの顔領域から得られる顔や頭部の特徴点から導かれる立体形状に対応する顔の３次元標準モデルの形状を構成する諸元（特徴点およびそのモデルの形状を構成する形状データ（例えば、正規化された座標等からなる形状行列））である。なお、顔や頭部の立体形状に対応する顔の３次元標準モデルは一つとは限られない。顔の３次元標準モデルデータ３１には、通常、顔や頭部の特徴点の組ごとに複数の３次元モデルが対応付けられている。図８に顔の３次元標準モデルの一例を示す。 The three-dimensional standard model data 31 of the face has a three-dimensional shape derived from face and head feature points obtained from the face area of a plurality of images (arbitrary multi-viewpoint images) obtained by photographing the same person at an arbitrary angle. These are specifications (shape data (for example, a shape matrix made up of normalized coordinates, etc.) constituting feature points and the shape of the model) constituting the shape of the corresponding three-dimensional standard model of the face. Note that the three-dimensional standard model of the face corresponding to the three-dimensional shape of the face or head is not limited to one. The face three-dimensional standard model data 31 is usually associated with a plurality of three-dimensional models for each set of face and head feature points. FIG. 8 shows an example of a three-dimensional standard model of a face.

表情変化の標準モデルデータ３２は、顔の表情表現の要素（例えば、目の開閉、大きさ、眼球の位置、口の開閉、口元の形状、鼻のふくらみ等）からなる表情と表情形成（例えば、笑い、怒り、安堵、喜び、悲しみ等の大きさを引き起こす筋肉部位とその収縮の度合いを示す指数）を対応付けたデータからなる。顔の表情変化の標準モデルデータ３２は３次元モデルデータ作成時に３次元標準モデルデータ３１を元に得た人物像の表情を補正若しくは補完するために用いられる。図９に、表情動作と表情変化を引き起こす表情筋などの筋肉の収縮部位等を対応付けた対応表データ３２−１および表情（表情データ）とＡＵの組み合わせ表データ３２−２からなる顔の表情変化の標準モデルデータの一例を示す。 The standard model data 32 of facial expression changes includes facial expression and facial expression (for example, opening / closing of eyes, size, position of eyeball, opening / closing of mouth, shape of mouth, swelling of nose, etc.). , Muscular sites that cause magnitudes of laughter, anger, relief, joy, sadness, etc., and an index indicating the degree of contraction). The standard model data 32 of facial expression change is used to correct or complement the facial expression of a human image obtained based on the three-dimensional standard model data 31 when creating the three-dimensional model data. FIG. 9 shows facial expression composed of correspondence table data 32-1 and facial expression (expression data) and AU combination table data 32-2, in which facial expression movements and muscle contraction sites such as facial muscles causing facial expression changes are associated with each other. An example of the standard model data of change is shown.

服装、髪型等の標準モデルデータ３３は、図示していないが、さまざまな服装、髪型等の３次元データを体系的に登録してなる。服装、髪型等の標準モデルデータ３３は、３次元データ作成時に受信（入力）した画像の胴部や頭部から抽出した服装や髪の形状と登録されている標準モデルデータとの比較により最も類似する服装や髪型のモデルデータが選定され、３次元標準モデルデータ３１を元に得た人物像を補正若しくは補完するために用いられる。 The standard model data 33 for clothes, hairstyles, and the like, although not shown, is a systematic registration of three-dimensional data such as various clothes and hairstyles. The standard model data 33 such as clothes and hairstyle is most similar by comparing the registered standard model data with the clothes and hair shapes extracted from the torso and head of the image received (input) at the time of creation of the three-dimensional data. The model data of the clothes and hairstyle to be selected is selected and used to correct or complement the human image obtained based on the three-dimensional standard model data 31.

図５は保存メモリ２に格納されているプログラムの例を示す図であり、保存メモリ２のプログラム格納エリア２０には、図５（ａ）に示すように、３次元データ作成装置１全体の動作を制御するための制御プログラム（例えば、ＯＳ）、通信ネットワーク４を介して端末５、６、７、・・・等とのデータ（電子メール、画像データ、制御データ等）の送受信を制御する通信制御プログラム等の制御プログラム群２１、委託者からの３次元データの受託管理処理（委託受付処理、認証処理、課金処理、請求書発行処理等）を行う受託管理プログラム群２２、任意の角度で撮影した同一人物の複数の画像の画像データ（以下、「複数の画像データ」と記す）から３次元データを作成する３次元データ作成プログラム２３、加工用３次元モデルデータ作成プログラム２４、３次元データ出力プログラム２５等が格納されている。 FIG. 5 is a diagram showing an example of a program stored in the storage memory 2. In the program storage area 20 of the storage memory 2, as shown in FIG. A communication program for controlling transmission / reception of data (e-mail, image data, control data, etc.) with the terminals 5, 6, 7,... A control program group 21 such as a control program, a trust management program group 22 that performs trust management processing (outsourcing reception processing, authentication processing, billing processing, invoicing issuance processing, etc.) of three-dimensional data from a contractor, photographing at an arbitrary angle A three-dimensional data creation program 23 for creating three-dimensional data from image data of a plurality of images of the same person (hereinafter referred to as “plurality of image data”), three-dimensional model data for processing Creation program 24,3 dimensional data output program 25, etc. are stored.

なお、制御プログラム群２１、受託管理プログラム群２２、および３次元データ作成プログラム２３等のプログラムは保存メモリ２とは別体のメモリに保存してもよく、ＲＡＭ１２に常駐しているものであってもよい。また、受託管理プログラム群２２、および３次元データ作成プログラム２３、加工用３次元モデルデータ作成プログラム２４、３次元データ出力プログラム２５等のプログラムは、３次元データ作成依頼のある都度、通信ネットワーク４を介して図示しないサーバからダウンロードされるものであってもよい。 Programs such as the control program group 21, the commission management program group 22, and the three-dimensional data creation program 23 may be stored in a memory separate from the storage memory 2 and are resident in the RAM 12. Also good. Further, the commission management program group 22, the 3D data creation program 23, the machining 3D model data creation program 24, the 3D data output program 25, etc., each time the 3D data creation request is made, the communication network 4 is used. It may be downloaded from a server (not shown).

３次元データ作成プログラム２３は図２のステップＰ３に相当する動作を３次元データ作成装置１に実行させるプログラムであり、図２のプロセスＰ２で受信した複数の画像データ（若しくは画像入力部１４で読み取った写真、人物画等から生成された画像データ）の正規化等の前処理を行ってから人物の顔の特徴部位を検出し、検出された特徴に基づいて各画像から人物の領域を抽出する人物領域抽出プログラム２３１、抽出された各画像の人物領域の顔画像から所定の特徴点を検出し、標準モデルデータベース３０に格納されている顔の３次元標準モデルデータ３１の特徴点との対応付けを行う特徴点対応付プログラム２３２、顔の表情の違いや変化が所定値より大きい場合に（中立顔やすまし顔、微笑顔などの）所定の表情の顔画像になるよう顔の表情等の変化の補正を行う表情変化の補正処理プログラム２３３、所定の方法により人物の３次元形状データを生成する３次元形状データ生成プログラム２３４等のサブプログラムからなる。 The three-dimensional data creation program 23 is a program that causes the three-dimensional data creation apparatus 1 to execute an operation corresponding to step P3 in FIG. 2, and is read by a plurality of image data (or image input unit 14) received in the process P2 in FIG. Image data generated from a photograph, a person image, etc.) is pre-processed and the like is detected, and then a feature region of the person's face is detected, and a person region is extracted from each image based on the detected feature The person area extraction program 231 detects a predetermined feature point from the face image of the person area of each extracted image, and associates it with the feature point of the 3D standard model data 31 of the face stored in the standard model database 30 If the difference or change in facial expression is greater than a predetermined value, the feature point correspondence program 232 for performing a facial expression with a predetermined expression (such as a neutral face, a smiling face, a smile) So that facial expressions change in correction program 233 to correct the change in facial expressions, a three-dimensional shape data generating program 234 and the like subprograms for generating a three-dimensional shape data of the person by a predetermined method.

加工用３次元モデルデータ作成プログラム２４は、図２のステップＰ６に相当する動作を３次元データ作成装置１に実行させるプログラムであり、３次元データ作成プログラム２３により作成された３次元形状データから３次元像加工装置の種類に応じた加工用３次元モデルデータを作成する。 The machining 3D model data creation program 24 is a program that causes the 3D data creation apparatus 1 to execute an operation corresponding to step P6 in FIG. Three-dimensional model data for processing corresponding to the type of the three-dimensional image processing apparatus is created.

また、３次元データ出力プログラム２５は、図２のステップＰ５に相当する動作を３次元データ作成装置１に実行させるプログラムであり、加工用３次元モデルデータ作成プログラム２３５により作成された３次元データや３次元モデルデータを委託者の指定する方法で出力（通信ネットワーク４を介した委託者端末への送信、ＣＤへの記録、受託者側の３次元像加工処理用のデータ出力）する。 The three-dimensional data output program 25 is a program that causes the three-dimensional data creation apparatus 1 to execute an operation corresponding to step P5 in FIG. 2, and includes the three-dimensional data created by the machining three-dimensional model data creation program 235, The 3D model data is output by a method designated by the consignor (transmission to the consignor terminal via the communication network 4, recording on the CD, and data output for 3D image processing on the consignor side).

なお、３次元データ作成プログラム２３の変形例として、図５（ｂ）に示すように、表情変化の補正処理プログラム２３３および３次元形状データ生成プログラム２３４を特徴点対応付けプログラム２３２によって対応付けされた対応点から対象者の頭部の形および顔貌の特徴点を抽出する特徴点抽出プログラム２３６、顔の３次元標準モデルデータ３１を検索し、特徴点抽出プログラム２３６によって抽出された対象者の頭部や顔の特徴点に最も類似する３次元モデルデータを得る３次元モデルデータ検索プログラム２３７、３次元モデルデータ検索プログラム２３７によって検索されたモデルの３次元形状（曲面）上に射影変換して顔および頭部の３次元データを生成する３次元データ生成プログラム２３８、および表情変化の標準モデルデータ３２や服装・髪型等の標準モデルデータ３３を元に顔の表情の変化や服装・髪型等の補正を行う表情変化等の補正処理プログラム２３９に代えてもよい（図１５参照）。 As a modification of the three-dimensional data creation program 23, the facial expression change correction processing program 233 and the three-dimensional shape data generation program 234 are associated by the feature point association program 232 as shown in FIG. A feature point extraction program 236 that extracts feature points of the subject's head shape and facial appearance from the corresponding points, and the face three-dimensional standard model data 31 are searched, and the subject's head extracted by the feature point extraction program 236 3D model data search program 237 that obtains the most similar 3D model data to the feature points of the face and the three-dimensional shape (curved surface) of the model searched by the 3D model data search program 237 by projective transformation. 3D data generation program 238 for generating 3D data of the head, and standard model of facial expression change It may be replaced into the correction processing program 239 of the expression change for performing the data 32 and clothing, hairstyle standard model data 33 based on facial expression changes and clothing, hair style correction, such as the like (see FIG. 15).

図６は、本発明に基づく３次元形状データ作成過程の説明図である。３次元形状データ生成の際、同一人物が写った任意の複数枚の画像を用いる場合に、同一日時における同一被写体の撮影画像の代りに、同一人物だが、異なる日時や場所の画像を用いる場合には、同一人物でも、角度や向きだけでなく、髪形や服装、化粧などが変わっていたり、表情が異なっていたりする場合が生じるので、実施例では表情の変化が著しい場合に「無表情」または「微笑顔」等の「中立顔」の画像に補正するように構成したが、髪形や服装、化粧などが変わっていた場合にも同様に（例えば、標準的な服装や髪型に）補正することもできる。 FIG. 6 is an explanatory diagram of a process of creating three-dimensional shape data based on the present invention. When generating arbitrary 3D images of the same person when generating 3D shape data, when using images of the same person but different dates and places instead of captured images of the same subject at the same date and time The same person may change not only the angle and direction, but also the hairstyle, clothes, makeup, etc. or the expression may be different. Although it is configured to correct to “neutral face” images such as “smile”, it should be corrected in the same way (for example, standard clothes and hairstyles) when the hairstyle, clothes, makeup, etc. have changed. You can also.

図６で、（ａ）は同一人物を撮影した（撮影日時および撮影場所が異なる）複数枚の画像を示し、（ｂ）は（ａ）に示した各画像の表情の変化の補正処理を施した画像を示し、（ｃ）は対象物（人物）を多視点、つまり、任意の異なる視点（位置）から撮影した（撮影日時および撮影場所が同じ複数枚の画像）を示す。また、（ｄ）は、上記（ｂ）または（ｄ）の各画像から得る対象人物の顔部分の多視点画像データを示し、（ｅ）は（ｄ）の多視点画像データから生成される３次元形状データの例を示す。 In FIG. 6, (a) shows a plurality of images of the same person (with different shooting date and time and shooting location), and (b) shows the correction processing for the change in facial expression of each image shown in (a). (C) shows an object (person) photographed from multiple viewpoints, that is, from arbitrary different viewpoints (positions) (a plurality of images having the same photographing date and time and photographing location). Further, (d) shows multi-viewpoint image data of the face portion of the target person obtained from each image of (b) or (d), and (e) is generated from the multi-viewpoint image data of (d) 3 An example of dimensional shape data is shown.

つまり、（ａ）のように同一人物の撮影画像であっても、撮影日時および撮影場所が異なると各画像の表情等が異なる場合があるので、（ｂ）のように表情等の変化の補正処理（図７のステップＳ６、Ｓ７、図１０参照）を施してから、（ｄ）のような多視点画像データを得て（ｅ）のような３次元形状データを生成する（図７のステップＳ９、図１２〜図１４参照）。また、（ｃ）のように同一人物を同じ撮影日時および撮影場所で角度を代えて撮影した複数の画像では表情等の変化は小さいので、（ｂ）のように表情等の変化の補正処理なしに（ｄ）のような多視点画像データを得て（ｅ）のような３次元形状データを生成する。 That is, even in the case of a photographed image of the same person as in (a), the facial expression and the like of each image may differ depending on the photographing date and time and the photographing location. After performing the processing (see steps S6, S7, and FIG. 10 in FIG. 7), multi-viewpoint image data as shown in (d) is obtained, and three-dimensional shape data as shown in (e) is generated (step in FIG. 7). S9, see FIGS. 12 to 14). In addition, since changes in facial expressions and the like are small in a plurality of images obtained by photographing the same person at the same shooting date and time and shooting location as shown in (c), there is no correction processing for changes in facial expressions and the like as shown in (b). In addition, multi-viewpoint image data as shown in (d) is obtained to generate three-dimensional shape data as shown in (e).

図７は、３次元データ作成装置１の３次元データ作成動作の一実施例を示すフローチャートである。
３次元データ作成装置１のＣＰＵ１１は、３次元データ作成プログラム２３により図７のステップＳ２〜Ｓ９に示すような動作（図２のステップＰ３の動作に相当する機能）を実行し加工用３次元モデルデータ作成プログラム２４により、図７のステップＳ１０〜Ｓ１１に示すような動作（図２のステップＰ５の動作に相当する機能）を実行し、３次元データ出力プログラム２５により、図７のステップＳ１０〜Ｓ１３に示すような動作（図２のステップＰ６の動作に相当する機能）を実行する。つまり、３次元データ作成装置１の動作は制御プログラム群２１の制御下でＣＰＵ２１によって保存メモリ２から取り出され、ＲＡＭ１２の実行プログラム領域に記憶される受託管理プログラム群２２、３次元データ作成プログラム２３、加工用３次元モデルデータ作成プログラム２４、および３次元データ出力プログラム２５に基づいて実行される。 FIG. 7 is a flowchart showing an embodiment of the three-dimensional data creation operation of the three-dimensional data creation device 1.
The CPU 11 of the three-dimensional data creation device 1 executes the operations shown in steps S2 to S9 in FIG. 7 (functions corresponding to the operations in step P3 in FIG. 2) by the three-dimensional data creation program 23 to perform a three-dimensional model for machining. The data creation program 24 executes operations as shown in steps S10 to S11 in FIG. 7 (functions corresponding to the operations in step P5 in FIG. 2), and the three-dimensional data output program 25 performs steps S10 to S13 in FIG. (The function corresponding to the operation of step P6 in FIG. 2) is executed. That is, the operation of the three-dimensional data creation device 1 is taken out from the storage memory 2 by the CPU 21 under the control of the control program group 21 and stored in the execution program area of the RAM 12, the three-dimensional data creation program 23, This is executed based on the machining three-dimensional model data creation program 24 and the three-dimensional data output program 25.

ＣＰＵ１１は、委託者の撮像端末から任意の多視点画像データ（およびカメラ情報を含む）撮影情報若しくは連続動画像データを受信すると受信した画像データをＲＡＭ１２の所定領域に記憶する。なお、委託者から郵送等の手段により３次元データ作成委託データを受け取った場合は、画像入力処理部１４で各画像を読み取ってそれぞれの画像データを生成し、生成した画像データをＲＡＭ１２の所定領域に記憶する（ステップＳ１）。 When the CPU 11 receives arbitrary multi-viewpoint image data (including camera information) shooting information or continuous moving image data from the imaging terminal of the consignor, the CPU 11 stores the received image data in a predetermined area of the RAM 12. When the 3D data creation consignment data is received from the consignor by mail or the like, each image is read by the image input processing unit 14 to generate each image data, and the generated image data is stored in a predetermined area of the RAM 12. (Step S1).

次に、ＣＰＵ１１は、人物領域抽出プログラム２３１により、ＲＡＭ１２の所定領域に記憶されている各画像データの傾き補正、ノイズ除去、鮮鋭化処理、および画像サイズ等を合わせる正規化等の前処理を行う（ステップＳ２）。傾き補正、ノイズ除去、鮮鋭化処理、および正規化等の前処理は公知の画像処理技術を適用することができる。 Next, the CPU 11 uses the person region extraction program 231 to perform preprocessing such as inclination correction, noise removal, sharpening processing, and normalization that matches the image size of each image data stored in a predetermined region of the RAM 12. (Step S2). Known image processing techniques can be applied to pre-processing such as tilt correction, noise removal, sharpening processing, and normalization.

ＣＰＵ１１は、人物領域抽出プログラム２３１により、ＲＡＭ１２の所定領域に記憶されている各画像データからそれぞれ人物の顔の特徴部位を検出し（ステップＳ３）、検出された特徴に基づいて各画像から人物の領域を抽出する（ステップＳ４）。
多視点画像からの人物の顔の特徴部位の抽出についても公知の画像処理技術、例えば、「Lucas-Kanade法（勾配法）」や、「Kanade-Lucas-Tomasiトラッカー法」などを摘要して特徴点の移動や相違を追跡し、対応点を探索することによって行うことができる。 The CPU 11 detects a human face feature part from each image data stored in a predetermined area of the RAM 12 by the person area extraction program 231 (step S3), and based on the detected feature, the person's face feature part is detected. An area is extracted (step S4).
Extraction of human facial features from multi-viewpoint images using well-known image processing techniques such as “Lucas-Kanade method (gradient method)” and “Kanade-Lucas-Tomasi tracker method” This can be done by tracking point movement and differences and searching for corresponding points.

ここで、同一人物が写った任意の複数枚の画像を用いる場合に、同一日時における同一被写体の撮影画像の代りに、同一人物だが、異なる日時や場所の画像を用いる場合には、同一人物でも、角度や向きだけでなく、髪形や服装、化粧などが変わっていたり、表情が異なっていたりする場合が生じる。ＣＰＵ１１は、特徴点対応付プログラム２３２により、ステップＳ３、Ｓ４で異なる複数枚の多視点画像から検出された顔の特徴と、標準モデルデータベース３０に格納されている顔の３次元標準モデルデータ３１の特徴点との対応付けを行って、顔の表情データを検出する（ステップＳ５）。 Here, when using a plurality of arbitrary images of the same person, the same person is used instead of the photographed image of the same subject at the same date and time. , Not only the angle and direction, but also the hairstyle, clothes, makeup, etc. may change or the expression may differ. The CPU 11 stores the facial features detected from a plurality of different multi-viewpoint images in steps S3 and S4 by the feature point association program 232, and the facial 3D standard model data 31 stored in the standard model database 30. Correspondence with feature points is performed to detect facial expression data (step S5).

次に、ＣＰＵ１１は、表情変化の補正処理プログラム２３３により、多視点画像のうち、当該視点画像の顔の特徴や表情の違いや変化が顔の３次元標準モデルデータ３１における顔の特徴や表情と比べて所定の閾値より大きい場合、または、表情変化の補正機能がＯＮ設定されている場合にはステップＳ７に進み、そうでない場合はステップＳ８に進む（ステップＳ６）。 Next, the CPU 11 uses the facial expression change correction processing program 233 to change the facial features and facial expressions in the three-dimensional standard model data 31 of the facial features and facial expressions of the viewpoint images. If it is larger than the predetermined threshold value or if the facial expression change correction function is set to ON, the process proceeds to step S7, and if not, the process proceeds to step S8 (step S6).

表情変化の補正処理プログラム２３３は、当該視点の画像における人物の顔領域の画像に対して、（中立顔やすまし顔、微笑顔などの）所定の表情の顔画像になるよう顔の表情等の変化の補正処理（図１０参照）を施してからステップＳ９に進む（ステップＳ７）。ステップＳ７の顔の表情の変化の補正処理動作により、下記ステップＳ９で行う３次元形状データ生成処理により、より精度の高い３次元形状データを生成することができる。 The facial expression change correction processing program 233 performs a facial expression such as a facial expression so that the facial image of a predetermined facial expression (such as a neutral facial expression, a smiling face, or a smile) is obtained with respect to the image of the human face region in the viewpoint image. After performing the change correction process (see FIG. 10), the process proceeds to step S9 (step S7). With the correction processing operation of the facial expression change in step S7, more accurate three-dimensional shape data can be generated by the three-dimensional shape data generation processing performed in step S9 below.

ＣＰＵ１１は、処理対称の複数枚の多視点画像の撮影位置や撮影方向などのカメラ情報が既知か否かを調べ（ステップＳ８）、既知の場合は３次元形状データ作成による、図１２に示すような複数枚の多視点カメラ画像からの３次元形状データ作成動作により３次元形状データを生成してステップＳ１０に進み、撮影情報が既知でない場合は図１４に示すような因子分解法による３次元形状データ作成動作により３次元形状データを生成してステップＳ１０に進む（ステップＳ９）。なお、撮影情報が既知でない場合に、図１３に示すような視体積交差法による３次元形状データ作成動作により３次元形状データを生成するようにしてもよい。 The CPU 11 checks whether or not camera information such as the shooting positions and shooting directions of a plurality of process-symmetric multi-viewpoint images is known (step S8). If it is known, three-dimensional shape data creation is performed, as shown in FIG. The three-dimensional shape data is generated from a plurality of multi-viewpoint camera images to generate three-dimensional shape data, and the process proceeds to step S10. If the photographing information is not known, the three-dimensional shape by the factorization method as shown in FIG. Three-dimensional shape data is generated by the data creation operation, and the process proceeds to step S10 (step S9). In addition, when imaging information is not known, you may make it produce | generate 3D shape data by 3D shape data creation operation | movement by a visual volume intersection method as shown in FIG.

次に、ＣＰＵ１１は、加工用３次元モデルデータ作成プログラム２４により、図２のプロセス２で登録した委託条件を調べ、委託条件がＣＧキャラクタ、アバータ用３Ｄモデルモデルデータ等の３次元データ作成依頼の場合はステップＳ１１に進み、委託条件が加工用３次元モデルデータの作成、または立体像（または原型）の製作の場合はステップＳ１２に進む（ステップＳ１０）。 Next, the CPU 11 checks the consignment conditions registered in the process 2 of FIG. 2 by the machining three-dimensional model data creation program 24, and the consignment condition is a request for creating a three-dimensional data such as a CG character, 3D model model data for averter, etc. If this is the case, the process proceeds to step S11. If the consignment condition is creation of 3D model data for processing or production of a stereoscopic image (or prototype), the process proceeds to step S12 (step S10).

委託条件がＣＧキャラクタ、アバータ用３Ｄモデルモデルデータ等の３次元データ作成依頼の場合は、委託者のパソコン５または携帯電話６に作成したＣＧキャラクタ、アバータ用３Ｄモデルモデルデータを作成してネットワーク４を介して送信すると共に受託管理プログラム群２２に含まれている請求書発行プログラムにより請求書を発行する（ステップＳ１１）。 If the consignment condition is a request to create 3D data such as a CG character and 3D model data for averter, the network 4 The bill is issued by the bill issuing program included in the trust management program group 22 (step S11).

委託条件が加工用３次元モデルデータの作成、または立体像（または原型）の製作の場合は、ＣＰＵ１１は、加工用モデルデータ作成プログラム２３６により、加工装置の種類に応じた加工用３次元モデルデータを作成し、ステップＳ１３に進む（ステップＳ１２）。 When the consignment condition is creation of machining 3D model data or production of a three-dimensional image (or prototype), the CPU 11 uses the machining model data creation program 236 to process the 3D model data for machining according to the type of the machining apparatus. And proceed to step S13 (step S12).

委託条件が立体像（または原型）の製作の場合は（図２のプロセスＰ６で立体像（または原型）の製作加工を行わせるために）作成した加工用３次元モデルデータを受託者側の３次元像加工装置３−１、または３−２、または３−３に送信する。また、加工用３次元モデルデータの納入の場合は（図２のプロセスＰ５に示したように）納入条件に基づいて通信ネットワーク４を介して委託者の３次元像加工装置８（またはそれを接続するコンピュータ）に作成した加工用３次元モデルデータを送信すると共に受託管理プログラム群２２に含まれている請求書発行プログラムにより請求書を発行する。なお、納入条件がＣＤ等の記録媒体納入の場合は作成した３次元モデルデータをＣＤに記録すると共に請求書を発行する（ステップＳ１３）。 When the consignment condition is production of a three-dimensional image (or prototype) (to make a three-dimensional image (or prototype) produced by process P6 in FIG. 2), the processing 3D model data created by the contractor 3 The data is transmitted to the three-dimensional image processing device 3-1, 3-2, or 3-3. In the case of delivery of 3D model data for processing (as shown in process P5 in FIG. 2), the 3D image processing device 8 of the consignor (or connection thereof) is connected via the communication network 4 based on the delivery conditions. The processing three-dimensional model data created is transmitted to the computer, and a bill is issued by the bill issuing program included in the trust management program group 22. If the delivery condition is delivery of a recording medium such as a CD, the created three-dimensional model data is recorded on the CD and an invoice is issued (step S13).

上述したように、本発明の３次元データ作成装置によれば、任意の角度（および任意の時期、服装、表情）で撮影された画像でも、同一人物が写った複数の画像があればそれらの画像に基づいて、自動的に３次元データを生成できるので、従来のように、周囲３６０度の所定角度から撮影したり、専用の撮影台や自動撮影機を必要とせず、わざわざ自動撮影機の設置場所や彫刻作家の所に赴かなくても、写真や画像データを送付や送信するだけで作成される３次元データに基づいてリアルな胸像やフィギュアを、簡単且つ安価に作成できる。また、インターネット等の通信ネットワークや電子メールを利用して、パソコンやカメラ付携帯電話から画像を送信して胸像やフィギュアを発注することもできる。 As described above, according to the three-dimensional data creation device of the present invention, even if an image is taken at an arbitrary angle (and at an arbitrary time, clothes, facial expression), if there are a plurality of images showing the same person, those Since 3D data can be automatically generated on the basis of images, it is not necessary to shoot from a predetermined angle of 360 degrees around as in the past, or to use an automatic shooter without the need for a dedicated shooting stand or automatic shooter. Even without going to the place of installation or the sculptor, a realistic bust or figure can be created easily and inexpensively based on the three-dimensional data created simply by sending or sending photographs and image data. It is also possible to order busts and figures by sending images from a personal computer or camera-equipped mobile phone using a communication network such as the Internet or electronic mail.

図８は、顔の３次元標準モデルおよび表情変化の補正後の３次元モデルの例を示す図であり、（ａ）は顔の標準モデルデータ３１から生成される３次元標準モデルの一例を示し、（ｂ）は、（ａ）に示した次元標準モデルに図９に示すような顔の括約筋と線形筋モデルによって補正された顔の表情変化モデルデータ３２の例を示す。 FIG. 8 is a diagram showing an example of a three-dimensional standard model of a face and a three-dimensional model after correction of expression changes, and (a) shows an example of a three-dimensional standard model generated from the standard model data 31 of the face. , (B) shows an example of facial expression change model data 32 corrected by the face sphincter and linear muscle models as shown in FIG. 9 in the dimensional standard model shown in (a).

図９は、顔の表情変化の標準モデルデータ３２の一実施例を示す図であり、（ａ）はＡＵ番号、ＡＵ（Action Unit、表情動作単位）とＡＵを引き起こす表情筋などの筋肉の収縮部位等を対応付けた表情変化と筋肉収縮対応表データ３２−１を示し、（ｂ）は表情（表情データ）とＡＵの組み合わせ表データ３２−２を示す。 FIG. 9 is a diagram showing an example of the standard model data 32 of facial expression changes. (A) is a contraction of muscles such as AU number, AU (Action Unit) and facial muscles that cause AU. A facial expression change and muscle contraction correspondence table data 32-1 associated with a part or the like is shown, and (b) shows a combination table data 32-2 of facial expressions (facial expression data) and AU.

図１０は、表情変化の補正処理プログラムによる顔画像の表情の変化の補正処理動作の一実施例を示すフローチャートであり、図６、図８、図９を元に説明する。
図１０において、図６のステップＳ５で検出した顔の表情データを、表情とＡＵの組み合わせ表データ３２−２の表情データと比較して、ＡＵ（表情動作単位）の組み合わせを得る。つまり、検出された顔の表情を、例えば、「驚き」＝ＡＵ１（内眉を上げる）＋ＡＵ２（外眉を上げる）＋ＡＵ５（上瞼を上げる）＋ＡＵ２６（口を開ける）といった表情動作単位ＡＵで表現する（ステップＳ７−１）。 FIG. 10 is a flowchart showing an example of correction processing operation of facial expression change by a facial expression change correction processing program, which will be described with reference to FIGS. 6, 8, and 9.
In FIG. 10, the facial expression data detected in step S5 in FIG. 6 is compared with the facial expression data in the facial expression / AU combination table data 32-2 to obtain a combination of AUs (facial expression operation units). That is, the expression of the detected face is expressed in the expression operation unit AU, for example, “surprise” = AU1 (inner eyebrow raised) + AU2 (outer eyebrow raised) + AU5 (upper eyelid raised) + AU26 (mouth opened). (Step S7-1).

次に、各ＡＵを表情変化と筋肉収縮対応表データ３２−１を用いて対応する表情筋の収縮に変換する。例えば、ＡＵ（ｉ）→内側前頭筋（０．１４）＋外側前頭筋（０．１７）＋眼瞼部眼輪筋（−０．４８）といったように表情筋の収縮に変換する（ステップＳ７−２）。 Next, each AU is converted into a corresponding facial muscle contraction using the facial expression change and muscle contraction correspondence table data 32-1. For example, conversion into facial muscle contraction such as AU (i) → inner frontal muscle (0.14) + outer frontal muscle (0.17) + eyelid eyelid muscle (−0.48) (step S7−). 2).

そして、各表情に対応する、ステップＳ７−２で得た表情筋を収縮させた３次元顔画像をＣＧ描画再生して３次元画像データを取得し、図７のステップＳ８に進む（ステップＳ７−３）。 Then, the 3D face image obtained by contracting the facial expression muscle obtained in step S7-2 corresponding to each facial expression is drawn and reproduced by CG to obtain 3D image data, and the process proceeds to step S8 in FIG. 7 (step S7-). 3).

図１１は、表情変化の補正処理プログラムによる顔の表情等の補正の説明図であり、図１０の表情変化の補正処理プログラム２３３（または２３５）では、（ａ）〜（ｆ）に示すような、「喜び」、「驚き」、「悲しみ」、「恐れ」、「嫌悪」、「怒り」といったような喜怒哀楽を示している表情の画像を、補正して、（ｇ）に示す「無表情」または「微笑顔」等の「中立顔」の画像とする。また、逆に、（ｇ）に示す「無表情」または「微笑顔」等の「中立顔」の画像から（ａ）〜（ｆ）に示すいずれかの表情の画像に補正することもできる。 FIG. 11 is an explanatory diagram of correction of facial expression and the like by the expression change correction processing program. In the expression change correction processing program 233 (or 235) of FIG. 10, as shown in (a) to (f) of FIG. , “Joy”, “surprise”, “sadness”, “fear”, “disgust”, “anger”, and other expressions of facial expressions showing emotions such as “anger” are corrected to “nothing” shown in (g). The image is a “neutral face” image such as “expression” or “smile”. Conversely, the image of “neutral face” such as “no expression” or “smiling smile” shown in (g) can be corrected to the image of any expression shown in (a) to (f).

＜複数枚のカメラ画像からの３次元形状データの作成方法＞：
特に、複数枚の撮影画像から３次元形状データを生成する方法について、以下に詳しく説明する。 <Method for creating three-dimensional shape data from a plurality of camera images>:
In particular, a method for generating three-dimensional shape data from a plurality of photographed images will be described in detail below.

（Ａ）：画素座標、カメラ座標、画像座標；
カメラを基準とした３次元空間座標を「カメラ（Camera）座標」と、２次元画像を表現する「画像（Image）座標」とを、カメラ座標系（X,Y,Z）の原点（０，０，０）を光軸上のカメラ中心とし、撮像画像面に平行なX軸、Y軸と光軸方向のZ軸との正規直交座標系として設定すると、カメラ座標が（X,Y,Z）^Tである３次元空間の点と、その透視射影として得られる２次元画像の画像座標（ｘ、ｙ）^Tには、次式が成り立つ。
ｘ＝ｌ×X／Z、ｙ＝ｌ×Y／Z（ただし、ｌ：カメラの焦点距離）・・・（１） (A): pixel coordinates, camera coordinates, image coordinates;
The three-dimensional space coordinates based on the camera are the “camera coordinates” and the “image coordinates” representing the two-dimensional image are the origin (0, 0) of the camera coordinate system (X, Y, Z). (0, 0) is the camera center on the optical axis, and the camera coordinates are set to (X, Y, Z) when set as an orthonormal coordinate system of the X axis, Y axis parallel to the captured image plane, and the Z axis in the optical axis direction. ) The following expression holds for a point in the three-dimensional space ^T and the image coordinates (x, y) ^T of the two-dimensional image obtained as the perspective projection.
x = 1 × X / Z, y = 1 × Y / Z (where l is the focal length of the camera) (1)

ここで、透視射影による３次元空間の像を記述する「画像（Image）座標」と、モニター表示画面などの「画素（Pixel）座標」の間には、個々のカメラに固有の１対１の写像関係がある。複数の画像における点対応からカメラ運動と相対的位置関係（３次元形状）とを復元する場合には、点対応は、まず「画像座標」で与えることができるので、全てのカメラに対して統一的に扱うためには、この「画素（Pixel）座標」と「画像（Image）座標」間の１対１の写像関係を求めること（「カメラキャリブレーション」と呼ばれる）ができれば、「画素（Pixel）座標」と「画像（Image）座標」とを自由に変換できることになる。 Here, there is a one-to-one characteristic unique to each camera between “Image coordinates” describing an image in a three-dimensional space by perspective projection and “Pixel coordinates” such as a monitor display screen. There is a mapping relationship. When restoring camera motion and relative positional relationship (three-dimensional shape) from point correspondence in multiple images, point correspondence can be given first by "image coordinates", so it is unified for all cameras In order to deal with the problem, if a one-to-one mapping relationship between the “pixel coordinates” and the “image coordinates” can be obtained (referred to as “camera calibration”), the “pixels (Pixel coordinates)” can be obtained. ) Coordinates "and" image coordinates "can be freely converted.

カメラモデルを表現する透視射影において、複数の画像における点対応からカメラ運動と３次元形状とを求める問題は、非線形写像の逆問題となるので、非線形最適化問題に帰着し、非線形最適化問題はノイズに敏感で、初期値依存性が高く、数値計算が不安定であるなど、問題があるため、安定して３次元形状を復元することは難しい。
そこで、非特許文献１にあるように、カメラの位置や姿勢、視点方向などに関する外部情報、すなわち、カメラ運動情報を入力するか、参照マーカーなど人工的特徴を付加することによって、カメラ運動情報を求めやすくする方法がある。 In perspective projection that expresses a camera model, the problem of obtaining camera motion and 3D shape from point correspondence in multiple images is an inverse problem of the nonlinear mapping, resulting in a nonlinear optimization problem. Since there are problems such as sensitivity to noise, high initial value dependency, and unstable numerical calculation, it is difficult to stably restore a three-dimensional shape.
Therefore, as described in Non-Patent Document 1, external information on the camera position, posture, viewpoint direction, or the like, that is, camera motion information is input, or camera motion information is added by adding an artificial feature such as a reference marker. There are ways to make it easier to find.

あるいは、特開２００４−２２０３１２号公報や、後述の視体積交差法の説明にあるように）、複数の多視点カメラ（または視点）の中から、なるべく直交する関係の２台のカメラ（または視点）を基底カメラとして選択して、基底カメラによる射影グリッド空間を用いて、複数のカメラ間または視点間の相互関係情報を付加するか、制限して、例えば、カメラ間のエピポーラ幾何関係を表す基底行列Ｆ（Fundamental）行列等を用いて、射影グリッド空間上の点（ボクセル）を各カメラ画像へ逆投影し、各カメラ画像のシルエット画像において、各点（ボクセル）が対象物の内部に存在するか外部かを判定して、３次元形状データを復元する方法がある。 Alternatively, two cameras (or viewpoints) that are orthogonal to each other as much as possible from among a plurality of multi-viewpoint cameras (or viewpoints) as disclosed in Japanese Patent Application Laid-Open No. 2004-220312 and the explanation of the visual volume intersection method described later. ) As a base camera, and using a projection grid space by the base camera, add or restrict correlation information between multiple cameras or viewpoints, for example, a base representing an epipolar geometric relationship between cameras A point (voxel) on the projection grid space is back-projected onto each camera image using a matrix F (Fundamental) matrix or the like, and each point (voxel) exists inside the object in the silhouette image of each camera image. There is a method of restoring the three-dimensional shape data by determining whether it is external or external.

あるいは、金出武雄ほか、「因子分解法による物体形状とカメラ運動の復元」、電子通信学会論文誌、J76−D−II、No.8（19930825）、pp．1497−1505や、藤木淳（産総研）、「点対応を用いた複数の２次元画像からの３次元形状復元−因子分解法の数理−」、統計数理、第４９巻第１号、pp77〜107、2001年にあるように、「因子分解法」などにより、理想的カメラモデルである透視射影をアフィン射影に近似して、正射影モデル等のアフィン近似射影に基づいた複数の２次元画像からカメラ運動情報と３次元形状情報とを同時に復元する手法などを用いることができる。 Alternatively, Takeo Kanade et al., “Restoring Object Shape and Camera Motion by Factorization”, IEICE Transactions, J76-D-II, No.8 (19930825), pp. 1497-1505, Satoshi Fujiki (AIST), “Reconstruction of 3D shape from multiple 2D images using point correspondence -Mathematical factorization method”, Statistical mathematics, Vol. 49, No. 1, pp77- As in 107 and 2001, the perspective projection, which is an ideal camera model, is approximated to an affine projection by “factorization method”, etc., and a plurality of two-dimensional images based on an affine approximate projection such as an orthographic projection model are used. A technique for simultaneously restoring camera motion information and three-dimensional shape information can be used.

２次元画像がアフィン近似射影で得られると仮定すると、複数のアフィン近似射影画像における点対応からのカメラ運動と３次元形状の復元問題は、線形写像の逆問題となるので、復元の精度は劣るが、非線形写像の場合に比べ数値計算上安定して解くことができるようになる。 Assuming that a two-dimensional image is obtained by affine approximate projection, the camera motion from point correspondence in a plurality of affine approximate projection images and the three-dimensional shape restoration problem are inverse problems of the linear mapping, so the restoration accuracy is inferior. However, it can be solved more stably in numerical calculation than in the case of nonlinear mapping.

Ｂ：複数の画像間における点の対応付け；
まず、複数の画像間における点特徴（輝度や色、輪郭形状、テクスチャーなど）の対応付けを行なう。３次元空間では遠く離れた点も、２次元画像では近くに投影されることがある。２次元画像におけるわずかな誤差が３次元空間での認識や理解に重大な影響を及ぼすので、複数の画像間における点の対応を精度良く行なう必要がある。 B: point association between multiple images;
First, point features (luminance, color, contour shape, texture, etc.) are associated between a plurality of images. Points that are far away in the three-dimensional space may be projected closer in the two-dimensional image. Since a slight error in a two-dimensional image has a significant effect on recognition and understanding in a three-dimensional space, it is necessary to accurately correspond points between a plurality of images.

複数の画像間における特徴点の対応付けには、「Lucas-Kanade法（勾配法）」（1981年）や、「Kanade-Lucas-Tomasiトラッカー」（Shi and Tomasi、1994年）などの手法を用いることができる。時間的に離れた画像を事前知識無しに対応付けるのは難しいが、複数の点で対応が既知であれば、同一の３次元空間を撮影した多視点の複数画像間の幾何的な関係（エピポーラ幾何学）を用いて、他の対応点の存在可能な領域を絞りこむことができる。
例えば、複数のフレーム画像間における対応や追跡には、一般に、見え方（局所画像）、または、エッジ（輪郭）、色ヒストグラムなどの画像特徴の類似（相関）や相違に基づいて、隣接する他フレーム画像との間で、最も類似する領域を探索し、探索された点へ対象物が移動したと判定する方法（ブロックマッチング）が良く用いられる。 For matching feature points between multiple images, methods such as “Lucas-Kanade method (gradient method)” (1981) and “Kanade-Lucas-Tomasi tracker” (Shi and Tomasi, 1994) are used. be able to. It is difficult to correlate images that are separated in time without prior knowledge, but if the correspondence is known at multiple points, the geometric relationship between multiple images taken from the same three-dimensional space (epipolar geometry) The area where other corresponding points can exist can be narrowed down.
For example, for correspondence and tracking between a plurality of frame images, in general, adjacent to each other based on appearance (local image) or similarity (correlation) or difference of image features such as edges (contours) and color histograms. A method (block matching) that searches for the most similar area between the frame images and determines that the object has moved to the searched point is often used.

つまり、座標ｘ＝（ｘ，ｙ）における画素値I（x）毎に、移動量（変位）ｄ＝（ｄ_x，ｄ_y）を逐次変えながら、次式であらわされるような、
差分二乗和（二乗誤差）ε（ｄ）＝Σ｛I_t（ｘ＋ｄ）−I_t-1（ｘ）｝²、もしくは、
差分絶対値和ε（ｄ）＝Σ｜I_t（ｘ−ｄ）−I_t-1（ｘ）｜、もしくは、
相互相関γ（ｄ）＝Σ｛I_t（ｘ−ｄ）−I^￣ _t｝｛I_t-1（ｘ）−I^￣ _t-1｝／｜I_t（ｘ−ｄ）−I^￣ _t｜｜I_t-1（ｘ）−I^￣ _t-1｜・・・（２）
などを計算し、その中から、
相違度最小d＾=min_d｛ε（ｄ）｝、または、類似度（相関）最大d＾=max_d｛γ（ｄ）｝
・・・（３）
となる変位ｄを求めれば良い。しかし、これを全探索すると、計算量が多くなったり、変位量が離散的で連続しないなどの難点があった。局所画像を回転や拡大縮小してマッチングする場合には、さらに膨大な計算が必要になる。 That is, for each pixel value I (x) at the coordinate x = (x, y), the movement amount (displacement) d = (d _x , _dy ) is sequentially changed, and is expressed by the following equation:
Sum of squared differences (square error) ε (d) = Σ {I _t (x + d) −I _t−1 (x)} ² , or
Sum of absolute differences ε (d) = Σ | I _t (x−d) −I _t−1 (x) |
Cross-correlation γ (d) = Σ {I _t (x−d) −I ^￣ _t } {I _t−1 (x) −I ^￣ _t−1 } / | I _t (x−d) −I ^￣ _t | | I _t-1 (x) −I ^￣ _t-1 | (2)
Etc., and from that,
Difference minimum d ^ = min _d {ε (d)} or similarity (correlation) maximum d ^ = max _d {γ (d)}
... (3)
What is necessary is just to obtain | require the displacement d which becomes. However, if this is fully searched, there is a problem that the amount of calculation increases and the amount of displacement is discrete and not continuous. In the case of matching by rotating or enlarging / reducing a local image, further enormous calculation is required.

このとき、「Lucas-Kanade法（勾配法）」では、暫定解の周りの勾配（傾き）にもとづいて、山登り（または山降り）することにより、極大値（または極小値）を効率よく求めることができる。すなわち、差分二乗和（二乗誤差、SSD）の勾配を、
ε＝Σ｛I（ｘ＋δｘ，ｙ＋δｙ，ｔ＋δｔ）−I（ｘ，ｙ，ｔ）｝² ・・・（４）
とすると、この第１項のテーラー展開は、
I（ｘ＋δｘ，ｙ＋δｙ，ｔ＋δｔ）
＝I（ｘ，ｙ，ｔ）＋δｘ｛∂I（ｘ，ｙ，ｔ）/δｘ｝＋δｙ｛∂I（ｘ，ｙ，ｔ）/δｙ｝＋δｔ｛∂I（ｘ，ｙ，ｔ）/δｔ｝＋・・・
このとき、２次以降の項を、変位が微小であるとして無視できる（ｘ周辺で線形近似できる）とすると、
δｘ＝ｄ_x、δｙ＝ｄ_y、δｔ＝１として、
ε＝Σ｛ｄ_xI_x（ｘ，ｙ，ｔ）＋ｄ_yI_y（ｘ，ｙ，ｔ）＋I_t（ｘ，ｙ，ｔ）｝²
・・・（５）
ただし、上式で、I_x(x,y,t)＝∂I(x,y,t)/δｘ、I_y(x,y,t)＝∂I(x,y,t)/δｙ、I_t(x,y,t)＝∂I(x,y,t)/δｔ
相違度最小の変位d＾=min_dεは、∂ε/∂ｄ_x＝0、∂ε/∂ｄ_y＝0となるｄを求めれば良いので、
∂ε/∂ｄ_x＝Σ２I_x（ｘ，ｙ，ｔ）｛ｄ_xI_x（ｘ，ｙ，ｔ）＋ｄ_yI_y（ｘ，ｙ，ｔ）＋I_t（ｘ，ｙ，ｔ）｝＝0、
∂ε/∂ｄ_y＝Σ２I_y（ｘ，ｙ，ｔ）｛ｄ_xI_x（ｘ，ｙ，ｔ）＋ｄ_yI_y（ｘ，ｙ，ｔ）＋I_t（ｘ，ｙ，ｔ）｝＝0 ・・・（６）
ここで、

・・・（７）
とおくと、A^TAd−A^Tｂ＝0 → A^TAd＝A^Tｂ

・・・（８）
となるので、A^TAが正則なとき、d＾は解を持ち、
d＾＝min_dε＝（A^TA）^-1 A^Tｂ・・・（９）
となり、全探索しなくても、相違度最小となる変位量を求めることができる。
特徴点検出と上記のような追跡法とを統合した手法は、「Kanade-Lucas-Tomasi（KLT）トラッカー」と呼ばれる。 At this time, in the “Lucas-Kanade method (gradient method)”, the maximum value (or minimum value) is efficiently obtained by climbing (or descending) on the basis of the gradient (slope) around the provisional solution. Can do. That is, the slope of the sum of squared differences (square error, SSD)
ε = Σ {I (x + δx, y + δy, t + δt) −I (x, y, t)} ² (4)
Then, the Taylor expansion of this first term is
I (x + δx, y + δy, t + δt)
= I (x, y, t) + δx {∂I (x, y, t) / δx} + δy {∂I (x, y, t) / δy} + δt {∂I (x, y, t) / δt } + ...
At this time, if the terms after the second order can be ignored as the displacement is minute (can be linearly approximated around x),
As δx = d _x , δy = d _y , δt = 1,
ε = Σ {d _x I _x (x, y, t) + d _y I _y (x, y, t) + I _t (x, y, t)} ²
... (5)
Where I _x (x, y, t) = ∂I (x, y, t) / δx, I _y (x, y, t) = ∂I (x, y, t) / δy, I _t (x, y, t) = ∂I (x, y, t) / δt
The displacement d ^ = min _d ε with the smallest difference is obtained by obtaining d such that ∂ε / ∂d _x = 0 and ∂ε / ∂d _y = 0.
∂ε / ∂d _x = Σ2I _x (x, y, t) {d _x I _x (x, y, t) + d _y I _y (x, y, t) + I _t (x, y, t)} = 0,
_{_{∂ε / ∂d y = Σ2I y (}} x, y, t) {d x I x (x, y, t) + d y I y (x, y, t) + I t (x, y, t)} = 0 (6)
here,

... (7)
A ^T Ad−A ^T b = 0 → A ^T Ad = A ^T b

... (8)
Therefore, when A ^T A is regular, d ^ has a solution,
d ^ = min _d ε = (A ^T A) ⁻¹ A ^T b (9)
Thus, the displacement amount that minimizes the dissimilarity can be obtained without performing a full search.
A method in which feature point detection and the tracking method as described above are integrated is called a “Kanade-Lucas-Tomasi (KLT) tracker”.

Ｃ：３次元形状データの生成方法
（Ｃ−１）：カメラ位置情報や参照マーカーを用いる方法；
例えば、全周囲３６０度からの多視点角度から、または、周囲に配した複数台のカメラから撮影した画像データを入力して、対象物の３次元形状データを作成するソフトウェアが各種開発されている。
これらでは、予めカメラを所定の位置や角度に配して撮影したカメラ位置情報が既知のカメラで撮影された複数枚の多視点画像を用いるか、または、例えば、非特許文献１などにあるように、回転台に印刷された参照マーカーと、回転台に載せた対象物体とを一緒に撮影した複数枚のカメラ画像から、撮影時のカメラ位置を自動計算する。 C: Three-dimensional shape data generation method (C-1): Method using camera position information and reference markers;
For example, various types of software have been developed for creating three-dimensional shape data of an object by inputting image data taken from multiple viewpoint angles from 360 degrees around the periphery or from a plurality of cameras arranged around the periphery. .
In these, a plurality of multi-viewpoint images taken with a camera whose camera position information is previously taken with a camera placed at a predetermined position and angle are used, or as described in Non-Patent Document 1, for example In addition, the camera position at the time of shooting is automatically calculated from a plurality of camera images obtained by shooting together the reference marker printed on the turntable and the target object placed on the turntable.

また、得られたカメラ位置情報と物体画像とから、形状とテクスチャの３次元情報を自動計算し、その３次元情報をユーザーが適宜修正して、デジタルカメラで手持ち撮影した画像から、３次元データを生成する。対象物を撮影した視点の異なる１０〜２０枚程度以上のJPEGフォーマットの撮影画像データを入力して、カメラ位置の計算、形状情報の生成などの一連の処理を実行して、３次元形状データを作成する（カメラ位置の計算については次に述べる図１２のステップＳ９−１−１の説明参照、形状情報の生成についてはステップＳ９−１−２の説明参照、３次元形状データの出力についてはステップＳ９−１−３の説明参照）。 In addition, the 3D information of the shape and texture is automatically calculated from the obtained camera position information and the object image, the 3D information is appropriately corrected by the user, and the 3D data is obtained from the image hand-held by the digital camera. Is generated. Input 10 to 20 or more JPEG-format captured image data from different viewpoints for capturing an object, and execute a series of processes such as camera position calculation and shape information generation to obtain 3D shape data. Create (refer to the description of step S9-1-1 in FIG. 12 described below for the calculation of the camera position, refer to the description of step S9-1-2 for the generation of shape information, and step for the output of three-dimensional shape data) (See description of S9-1-3).

図１２は、図７のステップＳ９における複数枚の多視点カメラ画像からの３次元形状データ作成動作の一実施例を示す詳細フローチャートである。
図７のステップＳ８でカメラ情報が既知の場合には、図１２で、ＣＰＵ１１は、座標系におけるカメラの３次元位置パラメータ（回転成分α，β，γと平行移動成分（x,y,z）をHough変換などに基づいてカメラ位置を計算する。つまり、撮影された画像から、参照マーカーを抽出し、抽出された参照マーカーの任意の３点と、あらかじめ登録されている参照マーカー中の任意の３点の位置関係を組合せて決定される連立方程式を解くことにより、位置パラメータを計算する（ステップＳ９−１−１）。 FIG. 12 is a detailed flowchart showing an example of the operation for creating three-dimensional shape data from a plurality of multi-viewpoint camera images in step S9 of FIG.
If the camera information is known in step S8 in FIG. 7, in FIG. 12, the CPU 11 determines the three-dimensional position parameters of the camera in the coordinate system (rotation components α, β, γ and translation components (x, y, z)). The camera position is calculated based on the Hough transform, etc. That is, the reference marker is extracted from the photographed image, and any three of the extracted reference markers and any of the reference markers registered in advance are extracted. A position parameter is calculated by solving simultaneous equations determined by combining the positional relationships of the three points (step S9-1-1).

次に、所定の背景の色情報を用いて物体と背景を分離し、対象物のシルエット（輪郭）の抽出を行い（ステップＳ９−１−２）、カメラの位置と対象物の２次元輪郭情報に基づいて、３次元ボクセル空間へのボーティング（投票）処理により、３次元形状を再構成し、得られたボクセルデータから、ポリゴン（多角形）データへ変換し、三角ポリゴン表現形式やＳＳＦ形式など、３次元形状データとして出力し、図７のステップＳ１０に進む（ステップＳ９−１−３）。なお、VRMLやXVL、IGES、STL、PLYなど、３次元CADや３次元CGで用いられている出力形式に準拠した３次元形状データ形式に変換して出力してもよい。 Next, the object and background are separated using color information of a predetermined background, and the silhouette (contour) of the target is extracted (step S9-1-2), and the camera position and the two-dimensional contour information of the target are extracted. Based on the above, the 3D shape is reconstructed by voting to the 3D voxel space, and the obtained voxel data is converted to polygon (polygon) data. Etc., and output as three-dimensional shape data, and proceeds to step S10 in FIG. 7 (step S9-1-3). Note that VRML, XVL, IGES, STL, PLY, and the like may be converted into a three-dimensional shape data format that conforms to an output format used in three-dimensional CAD or three-dimensional CG.

ここでHough変換（ハフ変換、ヒュー変換）は、エッジ画像から直線を求める問題等において多く用いられる方法であり、エッジ画像中の各々の点（x,y）について、それがある直線上にある点と仮定した場合に可能性のある全ての直線を、極座標ρ＝x cosθ＋y sinθによるパラメータ空間（ρ，θ）へ曲線として投票し、最終的にパラメータ空間においてピークとなる（ρ，θ）が、求める直線のパラメータとして得られる。 Here, the Hough transform (Hough transform, Hugh transform) is a method often used in problems such as obtaining a straight line from an edge image, and each point (x, y) in the edge image is on a certain straight line. Voting all possible straight lines assuming a point as a curve to the parameter space (ρ, θ) with polar coordinates ρ = x cosθ + y sinθ, and finally (ρ, θ) peaks in the parameter space. Is obtained as a parameter of the straight line to be obtained.

（Ｃ−２）：視体積交差法；
複数の多視点からの画像に対して、「視体積交差法」を用いても、３次元形状モデルの生成を行うことができる。視体積交差法では、実空間内に設置した複数（位置）のカメラで撮影した画像から、物体のシルエットを抽出し、空間に逆投影し、シルエットの交わりを計算することによって、３次元モデルを求める。複数の多視点画像から３次元モデルを生成する手順をフローチャートに示す。 (C-2): visual volume intersection method;
A three-dimensional shape model can also be generated by using the “view volume intersection method” for a plurality of images from multiple viewpoints. In the visual volume intersection method, the silhouette of an object is extracted from images taken by multiple (position) cameras installed in real space, back-projected into space, and the intersection of the silhouettes is calculated. Ask. A procedure for generating a three-dimensional model from a plurality of multi-viewpoint images is shown in the flowchart.

図１３は、図７のステップＳ９における視体積交差法による３次元形状データ作成の動作の一実施例を示す詳細フローチャートである。
図７のステップＳ８でカメラ情報が未知の場合には、図１３で、ＣＰＵ１１は、まず、形状を構成する３次元空間（ボクセル空間）を立方体格子に分割し（ステップＳ９−２−１）、視体積交差法により、多視点が像のシルエット画像を入力して各ボクセルに対して正射影による逆投影を行う（ステップＳ９−２−２）。 FIG. 13 is a detailed flowchart showing an example of the operation of creating the three-dimensional shape data by the visual volume intersection method in step S9 of FIG.
If the camera information is unknown in step S8 of FIG. 7, in FIG. 13, the CPU 11 first divides the three-dimensional space (voxel space) constituting the shape into cubic lattices (step S9-2-1). By the view volume intersection method, a multi-viewpoint image silhouette image is input, and back projection by orthographic projection is performed on each voxel (step S9-2-2).

次に、各ボクセル上に当該画像のシルエットが存在するか、しないかを判定し、シルエットが存在する場合はステップ９−２−４に進み、シルエットが存在しない場合はステップ９−２−４に進む（ステップＳ９−２−３）。 Next, it is determined whether or not the silhouette of the image exists on each voxel. If the silhouette exists, the process proceeds to Step 9-2-4. If the silhouette does not exist, the process proceeds to Step 9-2-4. Proceed (step S9-2-3).

シルエットが存在する場合はそのボクセルを残してステップＳ９−２−６に進み（ステップＳ９−２−４）、シルエットが存在しない場合はそのボクセルを削除してステップＳ９−２−６に進む（ステップＳ９−２−５）。 If the silhouette exists, the voxel is left and the process proceeds to step S9-2-6 (step S9-2-4). If the silhouette does not exist, the voxel is deleted and the process proceeds to step S9-2-6 (step S9-2-6). S9-2-5).

ステップＳ９−２−１で分割したすべてのボクセルについてシルエットの存否を調べたか否かを判定し、すべてのボクセルを調べ済みの場合はステップＳ９−２−７に進み、調べていないボクセルがある場合は次のボクセルを調べるためにステップＳ９−２−２に戻る（ステップＳ９−２−６）。 It is determined whether or not the existence of the silhouette has been checked for all the voxels divided in step S9-2-1. If all the voxels have been checked, the process proceeds to step S9-2-7, and there is a voxel that has not been checked. Returns to step S9-2-2 to check the next voxel (step S9-2-6).

全ての多視点画像について上記ステップＳ９−２−１〜Ｓ９−２−６の判定動作を行ったか否かを判定し、全ての多視点画像について判定済みの場合はステップＳ９−２−８に進み、判定していない画像がある場合は次の画像の判定を行うためにステップＳ９−２−１に戻る（ステップＳ９−２−７）。そして、最終的に存在するボクセル集合を３次元形状とみなし、３次元形状の内部にあるボクセルを削除した３次元形状データを生成し、図７のステップＳ１０に進む（ステップＳ９−２−８）。 It is determined whether or not the determination operations in steps S9-2-1 to S9-2-6 have been performed for all the multi-view images. If all the multi-view images have been determined, the process proceeds to step S9-2-8. If there is an image that has not been determined, the process returns to step S9-2-1 to determine the next image (step S9-2-7). Then, the finally existing voxel set is regarded as a three-dimensional shape, and three-dimensional shape data is generated by deleting the voxels inside the three-dimensional shape, and the process proceeds to step S10 in FIG. 7 (step S9-2-8). .

視体積交差法と射影グリッド空間について；
「視体積交差法」に基づいて３次元モデルを復元する原理について簡単に説明する。
視体積とは、視点を頂点と、対象物のシルエットを断面とする錐体のことで、「視体積交差法」は、全ての視点における対象物の視体積の共通部分を求めることにより、対象物の形状を復元する手法である。 Visual volume intersection method and projective grid space;
The principle of restoring the three-dimensional model based on the “visual volume intersection method” will be briefly described.
The visual volume is a cone whose viewpoint is the apex and whose silhouette is the cross-section of the object, and the “visual volume intersection method” is used to calculate the common part of the visual volume of the object at all viewpoints. This is a technique for restoring the shape of an object.

複数台のカメラ（または複数位置からのカメラ画像）のうち、任意の２つを基底カメラ（または基底位置からのカメラ画像）１、２として、２台の基底カメラのそれぞれの視点から、中心投影によって３次元空間を定義する。
ここで、３次元空間を、射影グリッド空間（PGS：Projective Grid Space）として考え、空間中のボクセルA（p,q,r）は、基底カメラ１から撮影画像１上の点a₁（p,q）へ、基底カメラ２からの撮影画像２上の点a₂（r,s）へ、投影されるものと定義する。 Arbitrary two of the multiple cameras (or camera images from multiple positions) are the base cameras (or camera images from the base positions) 1 and 2, and central projection from the respective viewpoints of the two base cameras Defines a three-dimensional space.
Here, the three-dimensional space is considered as a projective grid space (PGS), and the voxel A (p, q, r) in the space is a point a ₁ (p, q) is defined as being projected onto a point a ₂ (r, s) on the captured image 2 from the base camera 2.

交差計算をする際には、画像間の幾何関係や、カメラ座標と空間座標との対応関係が必要であり、それらはＦ（Fundamental）行列を算出することで既知となる（Ｆ行列では、２画像間の９点以上の点対応によって決定できる）。 When performing the intersection calculation, a geometric relationship between images and a correspondence relationship between camera coordinates and space coordinates are necessary, and these are known by calculating an F (Fundamental) matrix (in the F matrix, 2). It can be determined by the correspondence of 9 or more points between images).

Ｆ行列を利用して以下のような投影を行なう。
１）まず、空間上のA（p,q,r）に対する画像１上の投影点a₁については、射影グリッド空間PGSの定義より、a₁（p,q）に投影される。
２）次に、画像２上の投影点a₂については、Ｆ行列Ｆ₂₁を用いて画像２にエピポーラ線L₂₁として投影すると、a₂はL₂₁上に存在するため、直線L₂₁は次式で定義できる。

・・・（１０）
a₂のx座標は射影グリッド空間PGSの定義よりrであるから、y座標sも定まる。
３）そして、基底カメラ以外のカメラからの撮影画像（または、基底位置以外からのカメラ画像）ｉに対する投影点の座標x_i,y_iは、次のようにして定まる。
基底カメラ２への投影と同様に、Ｆ_i1を用いて点a₁を画像ｉ上に直線L_i1として投影する。
またＦ_i2を用いて、点a₂を画像ｉ上に直線L_i2として投影する。
４）２本のエピポーラ線L_i1、L_i2の交点が、画像ｉの投影点の座標である。
５）この処理を、全視点の画像に対して行なう。
このようにして、注目ボクセルに対する全視点の画像の座標値を求めることができ、３次元モデルが復元できる。 The following projection is performed using the F matrix.
1) First, the projection point a ₁ on the image 1 with respect to A (p, q, r) in the space is projected onto a ₁ (p, q) by the definition of the projection grid space PGS.
2) Next, the projection point a ₂ on the image 2, when projected as epipolar line L ₂₁ in the image 2 by using the F matrix F _21, for a ₂ is present on L _21, the straight line L ₂₁ following Can be defined by an expression.

... (10)
Since the x coordinate of a ₂ is r from the definition of the projection grid space PGS, the y coordinate s is also determined.
3) Then, the coordinates x _i and y _i of the projection point with respect to the photographed image from the camera other than the base camera (or the camera image from other than the base position) _i are determined as follows.
Like the projection on the base camera 2 projects as a straight line L _i1 the point a ₁ on the image i with F _i1.
Further, the point a ₂ is projected onto the image i as a straight line L _i2 using F _i2 .
4) The intersection of the two epipolar lines L _i1 and L _i2 is the coordinates of the projection point of the image i.
5) This process is performed on all viewpoint images.
In this way, the coordinate values of the images of all viewpoints for the target voxel can be obtained, and the three-dimensional model can be restored.

交差計算による３次元モデル復元は、定義された射影グリッド空間PGS上で、空間に含まれるボクセルを一枚のシルエット上に投影し、シルエット上にないボクセルを全て削除し、次のシルエット画像に投影するという処理を、基底カメラ１、基底カメラ２、その他のカメラの順に行ない、全ての入力視点画像のシルエットに含まれるボクセルだけを「存在」とみなし、３次元モデルを復元することができる。 Three-dimensional model restoration by intersection calculation is performed by projecting voxels contained in the space onto one silhouette on the defined projective grid space PGS, deleting all voxels not on the silhouette, and projecting them to the next silhouette image. The process of performing is performed in the order of the base camera 1, the base camera 2, and the other cameras, and only the voxels included in the silhouettes of all the input viewpoint images are regarded as “exist” and the three-dimensional model can be restored.

（Ｃ−３）：因子分解法：
上記のような参照マーカー等を用いずに、複数の多視点カメラ画像データや連続動画像データだけから３次元形状データを生成する方法として、因子分解法がある。
一般に、カメラ位置や視点方向の制限も設けずに、対象物周囲の任意の複数枚の２次元画像から、対象物の３次元形状を求めるには、膨大な計算処理が必要で、解も不安定になる。「因子分解法（Factorization）」（Tomasi and Kanade（金出武雄）、1992年）では、実際のカメラモデルである透視射影をアフィン射影で近似することにより、問題を簡略化し、数値計算を高速かつ解を安定化させることができる。また、複数のアフィン近似射影画像から、カメラ運動と対象物体の立体形状とを同時に復元できる優れた方法として知られている。 (C-3): Factorization method:
There is a factorization method as a method of generating three-dimensional shape data from only a plurality of multi-view camera image data and continuous moving image data without using the reference marker as described above.
In general, in order to obtain the three-dimensional shape of an object from any two-dimensional images around the object without limiting the camera position and the viewpoint direction, a huge amount of calculation processing is required and the solution is not good. Become stable. "Factorization" (Tomasi and Kanade (Takeo Kanade), 1992) simplifies the problem by approximating the perspective projection, which is an actual camera model, with an affine projection, and makes numerical calculations faster and faster. The solution can be stabilized. It is also known as an excellent method that can simultaneously restore camera motion and the three-dimensional shape of a target object from a plurality of affine approximate projection images.

図１４は、図７のステップＳ９における因子分解法による３次元形状データ作成動作の一実施例を示す詳細フローチャートである。
図７のステップＳ８でカメラ情報が未知の場合には、図１４で、ＣＰＵ１１は、まず、各画像から、対象とする人物の輪郭外形や顔の特徴部位を表す線分や、曲線、特徴点を抽出する（ステップ９−３−１）。 FIG. 14 is a detailed flowchart showing an example of the operation for creating three-dimensional shape data by the factorization method in step S9 of FIG.
If the camera information is unknown in step S8 of FIG. 7, in FIG. 14, the CPU 11 firstly, from each image, a line segment, a curve, or a feature point representing the contour outline of the subject person or the feature part of the face. Is extracted (step 9-3-1).

次に、各画像の主要点の点特徴を抽出し、Kanade-Lucas-Tomasi法等を用いて各特徴点を対応付け（ステップ９−３−２）、多視点画像における各点座標（計測行列）から、因子分解法等により、カメラの動き情報（運動行列）と対象物の３次元形状情報（形状行列）を復元して３次元形状データを生成し、図７のステップＳ１０に進む（ステップ９−３−３）。 Next, point features of main points of each image are extracted, and each feature point is associated using the Kanade-Lucas-Tomasi method or the like (step 9-3-2), and each point coordinate (measurement matrix) in the multi-viewpoint image is obtained. ), The camera motion information (motion matrix) and the three-dimensional shape information (shape matrix) of the object are restored by factorization or the like to generate three-dimensional shape data, and the process proceeds to step S10 in FIG. 7 (step S10). 9-3-3).

「因子分解法」については、前述した文献（金出武雄ほか、「因子分解法による物体形状とカメラ運動の復元」、電子通信学会論文誌、J76−D−II、No.8（19930825）、pp．1497−1505や、藤木淳（産総研）、「点対応を用いた複数の２次元画像からの３次元形状復元−因子分解法の数理−」、統計数理、第４９巻第１号、pp77〜107、2001年）などに詳しく説明されているので、ここでは煩雑を避けて、以下に概略のみ説明する。 Regarding the "factor decomposition method", the above-mentioned literature (Takeo Kanade et al., "Restoring object shape and camera motion by factor decomposition method", IEICE Transactions, J76-D-II, No. 8 (19930825), pp. 1495-1505, Satoshi Fujiki (AIST), “3D shape restoration from multiple 2D images using point correspondence-Mathematical factorization method”, Statistical mathematics, Vol. 49, No. 1, (pp77-107, 2001) and the like are described in detail, so that only the outline will be described below to avoid complication.

すでに、上述した「Ｂ：複数の画像間における点の対応付け」のステップにより、複数の画像における点特徴の対応付けが既に求められ、画像座標として与えられているとする。アフィン近似射影においては、カメラ撮影による写像は、３次元空間の対象物から、２次元画像へのカメラの位置と方向によって決まるアフィン射影となる。
画像がｆ枚、特徴点がＰ個与えられるとき、Ｐ個の３次元座標のＦ個のアフィン射影によるＦＰ個の画像座標が得られるとすると、因子分解法では、このＦＰ個の条件を行列の形に並べて、複数の２次元画像からの３次元形状復元問題を単純な形で表現することができる。すなわち、
（計測行列）＝（運動行列）×（形状行列）・・・（１１）
ここで、計測行列はＦＰ個の画像座標を並べた２Ｆ×Ｐ行列、運動行列はＦ個のアフィン射影の表現行列を並べた２Ｆ×３行列、形状行列はＰ個の特徴点の３次元座標を並べた３×Ｐ行列である。つまり、複数の２次元画像からの３次元形状の復元問題は、計測行列の因子分解に帰着できる。 It is assumed that the association of point features in a plurality of images has already been obtained and given as image coordinates by the step “B: Point association between a plurality of images” described above. In the affine approximate projection, the mapping by the camera shooting is an affine projection determined by the position and direction of the camera from the object in the three-dimensional space to the two-dimensional image.
When f images and P feature points are given, assuming that FP image coordinates are obtained by F affine projections of P three-dimensional coordinates, the factorization method uses the FP conditions as a matrix. The three-dimensional shape restoration problem from a plurality of two-dimensional images can be expressed in a simple form. That is,
(Measurement matrix) = (motion matrix) × (shape matrix) (11)
Here, a measurement matrix is a 2F × P matrix in which FP image coordinates are arranged, a motion matrix is a 2F × 3 matrix in which F affine projection expression matrices are arranged, and a shape matrix is a three-dimensional coordinate of P feature points. Is a 3 × P matrix. That is, the problem of restoring a three-dimensional shape from a plurality of two-dimensional images can be reduced to factorization of a measurement matrix.

このように、透視射影によって得られた計測行列の成分からアフィン近似射影により投影されたとき得られる計測行列を推定できれば、後は、（因子分解法のアルゴリズムにしたがって）計測行列を運動行列と形状行列の積に分解するだけで、カメラの運動情報と物体の３次元形状とを復元することができる。
ただし、画像座標が正規直交基底による表現であるため、正しい復元解を得るには、画像座標の基底が正規直交基底となるように分解する必要がある。アフィン射影モデル（Mundy and Zisserman,1992）は、校正されていないカメラに対するモデルとして、X_fpからｘ_fpへの変換が次式のアフィン射影の形で表される。
ｘ_fp＝A_f X_fp＋u_f ・・・（１２）
ここで、A_fとu_fは未知パラメータである。アフィン射影モデルでは、A_fには何の仮定もされていないので、対象物のアフィン空間における位置関係を知ること（アフィン復元）はできても、対象物体の対象物体の長さや角度など計量情報を知ること（ユークリッド復元）はできない。 Thus, if the measurement matrix obtained when projected by the affine approximate projection can be estimated from the components of the measurement matrix obtained by the perspective projection, the measurement matrix and the shape of the motion matrix and the shape are obtained (in accordance with the factorization algorithm). The camera motion information and the three-dimensional shape of the object can be restored simply by decomposing the matrix product.
However, since the image coordinates are expressed by orthonormal bases, it is necessary to decompose the image coordinate bases to be orthonormal bases in order to obtain a correct restoration solution. The affine projection model (Mundy and Zisserman, 1992) is a model for an uncalibrated camera, and the conversion from X _fp to x _fp is expressed in the form of an affine projection of the following equation.
x _fp = A _f X _fp + u _f (12)
Here, A _f and u _f are unknown parameters. In the affine projection model, no assumptions are made for A _f , so even if the positional relationship of the target object in the affine space can be known (affine reconstruction), the measurement information such as the length and angle of the target object of the target object can be obtained. It is impossible to know (Euclidean restoration).

そこで、対象物体のユークリッド復元を行なうためのモデルとして、計量アフィン射影モデル（ＭＡＰモデル）が考えられた。対象物体のユークリッド復元を行なうには、A_fから奥行きパラメータλ_f*＝ｔZ_f*をくくりだした残りである行列B_fの成分が既知である必要がある。

（B_fは既知）・・・（１３） Therefore, a metric affine projection model (MAP model) has been considered as a model for performing Euclidean reconstruction of the target object. In order to perform the Euclidean reconstruction of the target object, it is necessary to know the components of the matrix B _f that is the remainder obtained by deducting the depth parameter λ _{f *} = tZ _{f *} from A _f .

(B _f is known) (13)

さらに、カメラの位置の復元を行なうためには、u_fが既知である必要がある。
以上の仮定より、

（ここで、ｌ：焦点距離）・・・（１４）
このような仮定を加えたアフィン射影モデルを、「計量アフィン射影」（ＭＡＰ：Metric Affine Projection）モデルと呼ぶ。また、A_fをＭＡＰ行列と呼ぶ。 Furthermore, u _f needs to be known in order to restore the position of the camera.
From the above assumptions,

(Where l is the focal length) (14)
The affine projection model to which such an assumption is added is called a “metric affine projection” (MAP) model. A _f is called a MAP matrix.

立体（対象物体）が固定され、カメラが運動していると仮定すると、カメラの運動情報と物体の３次元形状とを復元するためには、カメラモデルを立体に固定された座標系（世界座標系）で表す必要がある。第ｆ画像におけるカメラ位置の世界座標をｔ_f、第ｆ画像面上の正規直交基底を｛i_f,j_f｝、カメラ光軸方向の単位ベクトルをｋ_fとして、
世界座標におけるカメラの向きを表す行列（カメラの基底行列）をC＝（i_f，j_f，ｋ_f）^T、
また、第ｐ特徴点の世界座標をｓ_p、第ｆ画像のカメラ座標系における空間座標をX_fpとすると、
ｓ_p＝ｔ_f＋C_f ^T X_fp ・・・（１５）
この表現を、ある特徴点ｓ_*からの相対座標ｓ^* _p＝ｓ_p−ｓ_*、ｔ^* _f＝ｔ_f−ｓ_*で表すと、
ｓ^* _p＝ｔ^* _f＋C_f ^T X_fp、X_fp＝C_f（ｓ^* _p−ｔ^* _f）・・・（１６）
上記のＭＡＰモデルを世界座標系で表すと、

（ここで、ｌ：焦点距離）・・・（１７）
Ｐ個の点特徴の画像がＦ枚得られたとき、複数の２次元画像から、カメラの運動情報と物体の３次元形状情報とを復元する問題は、上式（１７）から｛C_f｝と、｛ｓ^* _p｝を求める問題になる。 Assuming that the solid (target object) is fixed and the camera is moving, to restore the camera motion information and the three-dimensional shape of the object, the coordinate system (world coordinates in which the camera model is fixed to the solid) System). The world coordinate of the camera position in the f-th image is t _f , the orthonormal basis on the f-th image plane is {i _f , j _f }, and the unit vector in the camera optical axis direction is k _f ,
C = (i _f , j _f , k _f ) ^T , a matrix representing the camera orientation in world coordinates (camera base matrix)
Furthermore, the world coordinate s _p of the p feature points, the spatial coordinates in the camera coordinate system of the f image and X _fp,
_{_{_{s p = t f + C f}}} T X fp ··· (15)
The expression, relative coordinates s ^* _{_p} = s _p -s from one feature point s _* _*, expressed in ^{_{_{_{t * f = t f -s *}}}} ,
^{_{^{_{s * p = t * f +}}}} C f T X fp, X fp = C f (s * p -t * f) ··· (16)
When the above MAP model is expressed in the world coordinate system,

(Where l is the focal length) (17)
When F images of P point features are obtained, the problem of restoring camera motion information and object three-dimensional shape information from a plurality of two-dimensional images is as follows: {C _f } And {s ^* _p }.

因子分解法では、ＦＰ個の上式（ｆ=１〜F、ｐ＝１〜P）から作られた行列を分解することによって、カメラの向きを表す行列｛C_f｝（ｆ＝１〜F）と、対象物の特徴点の世界座標｛ｓ^* _p｝（ｐ＝１〜P）を求める。 In the factorization method, a matrix {C _f } (f = 1 to _F ) representing the camera direction is obtained by decomposing a matrix formed from the above FP equations (f = 1 to F, p = 1 to P). ) And world coordinates {s ^* _p } (p = 1 to P) of feature points of the object.

ここで、計測行列W^*、運動行列M、形状行列S^*を、

・・・（１８）
と定義すると、
W^*＝M（２F×３行列）・S^*（３×P行列）・・・（１９）
が成立する。
Mには、カメラ運動に関する未知数｛C_f｝（ｆ＝１〜F）および｛λ^* _f｝（ｆ＝１〜F）のみが、S^*には、３次元形状に関する未知数｛ｓ^* _p｝（ｐ＝１〜P）のみが含まれていることから、計測行列W^*を、運動行列Mと形状行列S^*の積に分解することができれば、カメラの運動情報と物体の３次元形状とが復元できる。 Here, the measurement matrix W ^* , the motion matrix M, and the shape matrix S ^* are

... (18)
Defined as
W ^* = M (2F × 3 matrix) · S ^* (3 × P matrix) (19)
Is established.
M contains only the unknowns {C _f } (f = 1 to F) and {λ ^* _f } (f = 1 to _F ) related to the camera motion, and S ^* represents the unknown {s ^* _p } related to the three-dimensional shape. Since only (p = 1 to P) is included, if the measurement matrix W ^* can be decomposed into the product of the motion matrix M and the shape matrix S ^* , the motion information of the camera and the three-dimensional shape of the object Can be restored.

ここで、W^*のMとS^*の積への分解において、｛C_f｝が３次元回転行列であることから、次の条件（計量拘束）が満たされる必要がある。
M_f M_f ^T＝A_f A_f ^T＝(1/λ^2* _f)B_f B_f ^T ・・・（２０） Here, in the decomposition of W ^* into the product of M and S ^* , since {C _f } is a three-dimensional rotation matrix, the following condition (metric constraint) needs to be satisfied.
M _f M _f ^T = A _f A _f ^T = (1 / λ ^{2 *} _f ) B _f B _f ^T (20)

因子分解法のアルゴリズム；
実際の因子分解法のアルゴリズムでは、「Affine復元」、「Euclid復元」の順に、計測行列W^*を分解して、カメラの運動情報と物体の３次元形状情報を復元する。
まず、特異的分解（SVD）などにより、計測行列W^*を、M＾（２Ｆ×３）と、S＾^*（３×P）の積に一時的に（暫定的に）分解して、Affine復元する。

・・・（２１）
このとき、M、S^*、と暫定解のM＾、S＾^*の間には、
M＝M＾A、 S^*＝A^-1・S＾^*（ただし、A^-1はAの逆行列）・・・（２２）
の関係を満たす３×３可逆行列Aが存在するので、この暫定的な分解によって、運動と形状がアフィン復元されていることになる。 Factorization algorithm;
In an actual factorization algorithm, the measurement matrix W ^* is decomposed in the order of “Affine restoration” and “Euclid restoration” to restore camera motion information and object three-dimensional shape information.
First, the measurement matrix W ^* is temporarily (provisionally) decomposed into a product of M ^ (2F × 3) and S ^ ^* (3 × P) by specific decomposition (SVD) or the like, and Affine Restore.

... (21)
At this time, between M, S ^* , and M ^, S ^ ^* of the provisional solution,
M = M ^ A, S ^* = A- ¹ · S ^ ^* (where A- ¹ is the inverse matrix of A) (22)
Since there is a 3 × 3 reversible matrix A that satisfies the relationship, the motion and shape are restored to affine by this provisional decomposition.

２）暫定解のアフィン復元解からユークリッド復元解を求めることは、３×３可逆行列Aを求めることに帰着する。Q＝AA^Tとおくと、前記の計量拘束条件から、Aの満たすべき条件は、
M＾_f Q M＾_f ^T＝A_f A_f ^T ＝(1/λ^2* _f) B_f B_f ^T ・・・（２３）
ここで、式（１４）における未知量は｛λ^* _f｝（ｆ＝１〜F）とQとであり、B_fは既知であるから、
B_fの特異値分解をB_f＝R_fΣ_fD_fとすると、R_f、Σ_fは既知であり、
P＾_f＝（ｐ＾_f，ｑ＾_f）^T＝R_f ^TM＾_f、P_f＝R_f ^TM_f ・・・（２４）
とおくと、式（１４）の拘束条件は、
P＾_f Q P＾_f ^T＝(1/λ^2* _f) Σ² _f ・・・（２５）
と単純になる。このとき、
ｐ＾_f ^TQ ｐ＾_f＝ｐ² _f/λ^2* _f、ｐ＾_f ^TQ ｑ＾_f＝0、ｑ＾_f ^TQ ｑ＾_f＝ｑ² _f/λ^2* _f、
すなわち、（ｐ＾_f ^TQ ｐ＾_f）/（ｐ² _f）＝（ｑ＾_f ^TQ ｑ＾_f）/（ｑ² _f）＝１/λ^2* _f、ｐ＾_f ^TQ ｑ＾_f＝0、・・・（２６）
よって、次式のように、｛λ^* _f｝を含まない、Qに関する線型同次連立方程式が得られる。
ｑ² _f（ｐ＾_f ^TQ ｐ＾_f）−ｐ² _f（ｑ＾_f ^TQ ｑ＾_f）＝0、ｐ＾_f ^TQ ｑ＾_f＝0、
・・・（２７）
この連立方程式を解くことによって、｛λ^* _f｝による定数倍の不定性を除いて、３×３の正値対称行列であるQを一意的に求めることができる。 2) Obtaining the Euclidean restoration solution from the affine restoration solution of the provisional solution results in obtaining the 3 × 3 reversible matrix A. Putting a Q = AA ^T, from the metering constraint, condition to be satisfied by A is
_{_{^{M ^ f QM ^ f T =}}} A f A f T = (1 / λ 2 * f) B f B f T ··· (23)
Here, the unknown quantities in the equation (14) are {λ ^* _f } (f = 1 to F) and Q, and _Bf is known.
_If the singular value decomposition of B _f is B _f = R _f Σ _f D _f , R _f and Σ _f are known,
_{_{P ^ f = (p ^ f}} , q ^ f) T = R f T M ^ f, P f = R f T M f ··· (24)
Then, the constraint condition of the equation (14) is
P ^ _f QP ^ _f ^T = (1 / λ ^{2 *} _f ) Σ ² _f (25)
And become simple. At this time,
_{^{_{p ^ f T Q p ^ f}}} = p 2 f / λ 2 * f, p ^ f T Q q ^ f = 0, q ^ f T Q q ^ f = q 2 f / λ 2 * f,
In other _{^{words, (p ^ f T Q p}} ^ f) / (p 2 f) = (q ^ f T Q q ^ f) / (q 2 f) = 1 / λ 2 * f, p ^ f T Q q ^ _f = 0, (26)
Therefore, a linear simultaneous equation regarding Q that does not include {λ ^* _f } is obtained as in the following equation.
q ² _f (p ^ _f ^T Q p ^ _f ) −p ² _f (q ^ _f ^T Q q ^ _f ) = 0, p ^ _f ^T Q q ^ _f = 0,
... (27)
By solving these simultaneous equations, Q, which is a 3 × 3 positive symmetric matrix, can be uniquely obtained without the indefiniteness of constant multiplication by {λ ^* _f }.

３）対称行列Qのコレスキー分解を、Q＝LL^Tとすると、
Aの一般解は、A=L^TUとなり、
運動行列M、形状行列S^*の一般解は、M＝M＾L^TU、S^*＝U^TL S＾^* ・・・（２８）
で求まる。 3) The Cholesky decomposition of the symmetric matrix Q, and the Q = LL ^T,
The general solution of A is A = L ^T U,
The general solutions of the motion matrix M and the shape matrix S ^* are M = M ^ L ^T U, S ^* = U ^T LS ^ ^* (28)
It is obtained by

上記のようにして、計測行列W^*（ＦＰ個の画像座標を並べた２Ｆ×Ｐ行列）データから、カメラの運動情報を含む運動行列Ｍデータ（Ｆ個のアフィン射影の表現行列を並べた２Ｆ×３行列）と、物体の３次元形状情報を含む形状行列S^*データ（Ｐ個の特徴点の３次元座標を並べた３×Ｐ行列）を同時に復元することができる。 As described above, from the measurement matrix W ^* (2F × P matrix in which FP image coordinates are arranged) data, the motion matrix M data including the camera motion information (F affine projection expression matrices arranged in 2F) × 3 matrix) and shape matrix S ^* data (3 × P matrix in which the three-dimensional coordinates of P feature points are arranged) including the three-dimensional shape information of the object can be restored simultaneously.

因子分解法アルゴリズムの詳細は、前記の文献などに詳しい。また、因子分解法をリアルタイム処理に対応するために逐次的に計算する「逐次型因子分解法」や、アフィン近似射影による画像を因子分解法を用いて反復的に推定して、カメラ運動と立体形状とをより高精度に復元する「C-H法」（Christy and Horaud，1996年）など、様々な改良法も提案されているが、ここでは、省略する。 Details of the factorization algorithm are detailed in the above-mentioned literature. In addition, the “sequential factorization method” that sequentially calculates the factorization method to support real-time processing, and the iterative estimation of the image by the affine approximate projection using the factorization method, the camera motion and the stereoscopic Various improved methods such as the “CH method” (Christy and Horaud, 1996) for restoring the shape with higher accuracy have been proposed, but are omitted here.

実際の因子分解法のアルゴリズムでは、「Affine復元」、「Euclid復元」の順に、計測行列W^*を分解して、カメラの運動情報と物体の３次元形状情報を復元する。
１）まず、特異的分解（SVD）などにより、計測行列W^*を、M＾（２Ｆ×３）と、S＾^*（３×Ｐ）の積に一時的に（暫定的に）分解して、Affine復元する。 In an actual factorization algorithm, the measurement matrix W ^* is decomposed in the order of “Affine restoration” and “Euclid restoration” to restore camera motion information and object three-dimensional shape information.
1) First, the measurement matrix W ^* is temporarily (provisionally) decomposed into the product of M ^ (2F × 3) and S ^ ^* (3 × P) by specific decomposition (SVD). Restore Affine.

なお、逆行列（inverse matrix）とは、n次正方行列Aに対して、AX=XA=I（Iは単位行列）となるn次正方行列Xが存在するとき、Aはn次正則行列、あるいは「正則である」という。このとき、XをAの「逆行列」と呼び、A^-1と書く。また、正方行列（square matrix）とは、行要素の数と列要素の数とが一致する行列のことであり、可逆行列（invertible matrix）：あるいは、正則行列とは、行列の通常の積に関する逆元である逆行列を持つ正方行列のことである。
また、対称行列（symmetric matrix）とは、正方行列Aのうち、Aの転置行列A^TがA自身と一致する行列をいう。 The inverse matrix (inverse matrix) is an n-order square matrix A when an n-order square matrix X with AX = XA = I (I is a unit matrix) exists for an n-order square matrix A. Or they say "regular". At this time, X is called an “inverse matrix” of A and is written as A ⁻¹ . A square matrix is a matrix in which the number of row elements matches the number of column elements. Invertible matrix: Or, a regular matrix is related to the normal product of matrices. It is a square matrix with an inverse matrix that is the inverse element.
Also, the symmetric matrix (symmetric matrix), of a square matrix A, means a matrix transposed matrix A ^T of A coincides with A itself.

また、コレスキー分解（Cholesky decomposition）：とは、本来は、正定値エルミート行列Aを下三角行列LとLの共役転置行列L^*との積に分解すること。実対称行列の場合には、共役転置は転置に単純化されるので、対称行列AをA＝LL^Tに分解することに相当する。 Cholesky decomposition (Cholesky decomposition): Originally, a positive definite Hermitian matrix A is decomposed into a product of a lower triangular matrix L and a conjugate transpose matrix L ^* of L. In the case of a real symmetric matrix, conjugate transposition is simplified to transposition, which is equivalent to decomposing the symmetric matrix A into A = LL ^T.

（変形例）
図１５は、３次元データ作成装置１の３次元形状データ作成動作の一変形例を示すフローチャートであり、図７のステップＳ６〜Ｓの動作をステップＳ６’〜Ｓ９’に置き換えて、より簡易に３次元形状データ作成が可能なように構成した例である。 (Modification)
FIG. 15 is a flowchart showing a modification of the three-dimensional shape data creation operation of the three-dimensional data creation device 1, and the operations in steps S6 to S in FIG. 7 are replaced with steps S6 ′ to S9 ′, thereby simplifying the operation. In this example, three-dimensional shape data can be created.

図１５で、ＣＰＵ１１は、特徴点抽出プログラム２３６により、図７のステップＳ４で抽出された各画像の人物領域から頭部の形、顔貌の特徴点を抽出する（ステップＳ６’）。 In FIG. 15, the CPU 11 extracts feature points of the shape of the head and the face from the person region of each image extracted in step S4 of FIG. 7 by the feature point extraction program 236 (step S6 ').

次に、３次元モデルデータ検索プログラム２３７により、顔の３次元標準モデルデータ３１を検索し、ステップＳ９−４−１で抽出した対象者の頭部や顔の特徴点に最も類似する３次元モデルデータを得る（ステップＳ７’）。 Next, the three-dimensional model data retrieval program 237 retrieves the three-dimensional standard model data 31 of the face, and the three-dimensional model most similar to the subject's head and facial feature points extracted in step S9-4-1. Data is obtained (step S7 ′).

次に、３次元データ生成プログラム２３８により、対象者の顔の画像データをステップ１０−４−２で得た顔の３次元標準モデルの３次元形状（曲面）（図８（ａ）参照））上に射影変換して顔（および頭部）の３次元形状データを生成する（ステップＳ８’）。 Next, the three-dimensional shape (curved surface) of the three-dimensional standard model of the face obtained in step 10-4-2 by the three-dimensional data generation program 238 as the face image data of the subject (see FIG. 8A)) Projective transformation is performed on the face to generate three-dimensional shape data of the face (and head) (step S8 ′).

ステップＳ９−４−３で生成した顔（および頭部）の３次元形状データからなる３次元画像の顔の表情の違いや変化が所定値より大きい場合に表情変化等の補正処理プログラム２３９により（中立顔やすまし顔、微笑顔などの）所定の表情の顔画像になるよう顔の表情変化の標準モデルデータ３２や服装、髪型等の標準モデルデータ３３を用いて顔の表情や服装や髪形等の変化の補正処理を施し、図７のステップＳ１０に進む（ステップＳ９’）。 If the facial expression difference or change in the 3D image composed of the 3D shape data of the face (and head) generated in step S9-4-3 is greater than a predetermined value, the facial expression change correction processing program 239 ( Using standard model data 32 for changing facial expressions and standard model data 33 for clothes, hairstyles, etc. so that a facial image with a predetermined expression (such as a neutral face, a smiling face, a smile) is used. And the process proceeds to step S10 in FIG. 7 (step S9 ′).

上記変形例によれば、被写体の頭部や顔に類似する３次元モデルデータの顔に、対象者の顔の画像データを射影変換して頭部の３次元データを生成できるようにしたので、画像処理能力が低い装置や小型装置でも処理できるとともに、処理時間が短縮される。例えば、ゲームセンターに設置してあるようなガチャガチャ式の自動販売機などでも、３次元データ作成装置にカメラを装備したり、デジタルカメラや携帯電話から画像を近距離送信したり、ケーブル接続により入力するようにして、その場で顔画像から３次元フィギュアや胸像を製作加工して、安価なフィギュアを製造販売提供できる。 According to the above modification, the image data of the subject's face is projectively transformed to the face of the 3D model data similar to the head and face of the subject, so that the 3D data of the head can be generated. It is possible to process even a device having a low image processing capability or a small device, and the processing time is shortened. For example, even in a gambling vending machine installed in a game center, a 3D data creation device is equipped with a camera, an image is transmitted from a digital camera or a mobile phone, or input by a cable connection. In this way, it is possible to manufacture and sell inexpensive figures by producing and processing 3D figures and busts from face images on the spot.

本発明の３次元データ作成装置を用いた３次元画像作成受託システムの構成例を示す図である。It is a figure which shows the structural example of the three-dimensional image creation commissioned system using the three-dimensional data creation apparatus of this invention. 図１に示した３次元画像作成受託システムにおける３次元画像の受託・作成プロセスを示すプロセスチャートである。3 is a process chart showing a 3D image entrusting / creating process in the 3D image creating entrusting system shown in FIG. 1. 本発明の３次元データ作成装置の構成例を示す図である。It is a figure which shows the structural example of the three-dimensional data creation apparatus of this invention. 標準モデルデータベースの構成例を示す図である。It is a figure which shows the structural example of a standard model database. 保存メモリに格納されているプログラムの例を示す図である。It is a figure which shows the example of the program stored in the preservation | save memory. 本発明に基づく３次元形状データ作成過程の説明図である。It is explanatory drawing of the three-dimensional shape data creation process based on this invention. ３次元データ作成装置による３次元胸像データ作成動作の一実施例を示すフローチャートである。It is a flowchart which shows one Example of the three-dimensional breast image data creation operation | movement by a three-dimensional data creation apparatus. 顔の３次元標準モデルおよび顔の表情変化の３次元モデルの例を示す図である。It is a figure which shows the example of the 3D standard model of a face, and the 3D model of the facial expression change. 顔の表情変化の標準モデルデータの構成例を示す図である。It is a figure which shows the structural example of the standard model data of the facial expression change. 表情変化の補正処理プログラムによる顔画像の表情の変化の補正処理動作の一実施例を示すフローチャートである。It is a flowchart which shows one Example of the correction process operation | movement of the change of the expression of the face image by the correction process program of a facial expression change. 表情変化の補正処理プログラムによる顔の表情等の補正の説明図である。It is explanatory drawing of correction | amendment of the facial expression etc. by the correction process program of facial expression change. 図７のステップＳ９における多視点カメラ画像からの３次元形状データ作成動作の一実施例を示す詳細フローチャートである。FIG. 8 is a detailed flowchart illustrating an example of an operation for creating three-dimensional shape data from a multi-viewpoint camera image in step S <b> 9 of FIG. 7. 図７のステップＳ９における視体積交差法による３次元形状データ作成の動作の一実施例を示す詳細フローチャートである。It is a detailed flowchart which shows one Example of the operation | movement of three-dimensional shape data preparation by the visual volume intersection method in FIG.7 S9. 図７のステップＳ９における因子分解法による３次元形状データ作成動作の一実施例を示す詳細フローチャートである。It is a detailed flowchart which shows one Example of the three-dimensional shape data creation operation | movement by the factorization method in step S9 of FIG. ３次元データ作成装置による３次元形状データ作成動作の一変形例を示すフローチャートである。It is a flowchart which shows the modification of the three-dimensional shape data creation operation | movement by a three-dimensional data creation apparatus.

Explanation of symbols

１３次元データ作成装置
２データベース
３、８３次元画像加工装置
４通信ネットワーク
１１ＣＰＵ
２２３次元データ作成プログラム
３１顔や頭部の３次元標準モデルデータ
３２顔の表情変化の３次元標準モデルデータ
３３服装、髪型等の３次元標準モデルデータ DESCRIPTION OF SYMBOLS 1 3D data preparation apparatus 2 Database 3, 8 3D image processing apparatus 4 Communication network 11 CPU
22 3D data creation program 31 3D standard model data of face and head 32 3D standard model data of facial expression changes 33 3D standard model data of clothes, hairstyle, etc.

Claims

A database of standard model data of stereoscopic images;
Image data acquisition means for acquiring image data of a plurality of multi-viewpoint images;
A three-dimensional shape data generation program for causing a three-dimensional data creation device to function as a three-dimensional shape data generation means for generating three-dimensional shape data from the plurality of multi-viewpoint images;
Extracting a person area from each of a plurality of pieces of image data acquired by the image data acquisition means, acquiring corresponding points between the feature part of the face detected from the extracted person area and the standard model data of the database, 3D data creation control means for obtaining 3D shape data by a 3D shape data generation program;
Facial expression change correction means for correcting facial expression change model data by correcting the three-dimensional shape data;
With
The facial expression change correction means includes correspondence table data in which combinations of facial expression types and facial expression action units are associated, and correspondence table data in which facial expression motion units and facial muscle contraction sites are associated with each other. By expressing the combination of unit movements and converting the unit movement on each table into the contraction of the facial muscles of the corresponding part, the facial expression data showing emotions is corrected to neutral face data, or neutral Correct facial data to facial expression data showing emotions,
A three-dimensional data creation device characterized by that.

The image data acquisition means includes camera information acquisition means for acquiring camera information when camera information such as a shooting position and a shooting direction of image data to be acquired is known,
The standard model data of the stereoscopic image includes standard model data of a person's face and head, standard model data of facial expression changes,
The three-dimensional data creation control means acquires facial expression data from corresponding points between the facial feature portion detected from the extracted person region and the standard model data of the person's face, and the facial expression in the facial expression data and the standard model If the facial expression change in the facial expression data is greater than a threshold value by comparing with the facial expression of the data, the facial facial expression is corrected to a predetermined facial expression based on the facial expression change model data, and the camera information acquisition means If the information is known, the first 3D shape data generation program included in the 3D shape data generation program generates the 3D shape data of the person. If the camera information is unknown, the 3D shape data is generated. Generating the 3D shape data of the person by a second 3D shape data generation program included in the data generation program;
The three-dimensional data creation apparatus according to claim 1.

The first three-dimensional shape data generation program uses a three-dimensional data creation device based on image data of a plurality of multi-view images acquired by the image data acquisition unit and camera information acquired by the camera information acquisition unit. The three-dimensional data creation apparatus according to claim 1, wherein the three-dimensional data creation device functions as three-dimensional shape data generation means for generating three-dimensional shape data.

The second three-dimensional shape data generation program generates a three-dimensional shape data by using a three-dimensional data creation device based on a visual volume intersection method based on image data of a plurality of multi-viewpoint images acquired by the image data acquisition unit. The three-dimensional data creation apparatus according to claim 1, wherein the three-dimensional data creation device functions as a three-dimensional shape data generation unit.

The second three-dimensional shape data generation program generates a three-dimensional shape data by a factorization method based on the image data of a plurality of multi-view images acquired by the image data acquisition unit. The three-dimensional data creation apparatus according to claim 1, which functions as a three-dimensional shape data generation unit.

Standard model data of the stereoscopic image includes the standard model data of the face and head of a person, the standard model data of facial expressions, the standard model data of clothing and hair,
The three-dimensional data creation control means extracts a head shape and facial features from each image data based on a facial feature portion detected from the extracted person region, and standard model data of the person's face and head To obtain three-dimensional model data most similar to the shape of the head and facial features, and projectively transform the image data of the plurality of multi-viewpoint images onto the curved surface of the acquired three-dimensional model data. And three-dimensional shape data of the head,
The three-dimensional data creation apparatus according to claim 1 .