JP2004021388A

JP2004021388A - Image processing apparatus and photographing system including the same

Info

Publication number: JP2004021388A
Application number: JP2002172571A
Authority: JP
Inventors: Hideki Mitsumine; 三ッ峰　秀樹; Yuiko Yamauchi; 山内　結子; Takashi Fukaya; 深谷　崇史; Seiki Inoue; 井上　誠喜
Original assignee: Nippon Hoso Kyokai NHK; Japan Broadcasting Corp
Current assignee: Japan Broadcasting Corp
Priority date: 2002-06-13
Filing date: 2002-06-13
Publication date: 2004-01-22

Abstract

【課題】所定の照明条件の映像を実時間で順次生成することが可能な技術を提供することである。
【解決手段】入力画像として映像される被写体の反射係数を演算する反射係数演算手段と、前記入力画像と異なる照明条件を設定する新規照明条件設定手段と、前記設定された照明条件と前記反射係数とに基づいて、前記入力画像から前記新規照明条件設定手段で設定される照明条件の画像を生成する照明条件付加手段とを有する画像処理装置において、前記入力画像は前記被写体の奥行き情報を含む画像からなり、前記反射係数演算手段は、前記被写体の奥行き情報に基づいて前記入力画像から鏡面反射領域を特定し鏡面反射画素を補正する補正手段と、前記補正された入力画像から前記被写体の反射係数を演算する反射係数演算手段とを備えた
【選択図】　　　　図１An object of the present invention is to provide a technique capable of sequentially generating images under predetermined lighting conditions in real time.
A reflection coefficient calculating means for calculating a reflection coefficient of a subject imaged as an input image, a new lighting condition setting means for setting an illumination condition different from the input image, the set illumination condition and the reflection coefficient An illumination condition adding unit that generates an image of the illumination condition set by the new illumination condition setting unit from the input image based on the input image, wherein the input image includes an image including depth information of the subject. The reflection coefficient calculation means comprises: a correction means for specifying a specular reflection area from the input image based on the depth information of the subject to correct specular reflection pixels; and a reflection coefficient of the subject from the corrected input image. And a reflection coefficient calculating means for calculating the coefficient.

Description

【０００１】
【発明の属する技術分野】
本発明は、画像処理装置及びそれを備えた撮影システムに関し、特に、所定の照明条件下で連続的に撮影した被写体の撮影画像から他の照明条件下で見込まれる撮影画像を生成する技術に関するものである。
【０００２】
【従来の技術】
従来の仮想スタジオシステムでは、合成元となる予め用意された映像（背景映像）と、該背景映像と同等の照明条件をスタジオ内で再現し、この照明条件の元で撮影された出演者（被写体）の映像とを合成することによって、リアルタイムで違和感のない合成映像を得ていた。
【０００３】
しかしながら、刻一刻と変化する照明条件をスタジオ内で連続して再現することは不可能であり、また、スタジオ内で再現可能な照明条件は限られたものとなる。このために、刻一刻と変化する照明条件やスタジオで再現不可能な背景映像を用いた仮想スタジオシステムでは、代表的な照明条件を再現し、出演者はこの代表的な照明条件で演技を行うことによって、違和感の小さい合成映像を得る構成となっていた。
【０００４】
一方、近年のＣＧ（コンピュータグラフィック）技術の進展に伴い、背景映像に表現される照明条件も複雑なものとなっており、背景映像に合致した照明条件をスタジオ内で再現することが困難である。
【０００５】
この問題を解決する技術として、出演者を撮影した映像を３次元ＣＧ処理することによって、背景映像と同等な照明条件の映像を生成することが検討されている。この３次元ＣＧ技術を用いた方法では、所定の照明条件で撮影された出演者の映像から出演者の形状及び表面の反射特性（テクスチャ）を作成し、背景映像の照明条件に合致するように出演者の映像を加工するものである。
【０００６】
【発明が解決しようとする課題】
本発明者は、前記従来技術を検討した結果、以下の問題点を見いだした。
被写体形状および表面の反射特性を作成することは非常に煩雑な作業が必要であり、現実味のあるリアルな３次元ＣＧ映像を作成するためには、作成者の高い能力や膨大な作業時間が必要となっていた。
【０００７】
このために、映画や録画映像のように、撮影から放送（放映）までの期間に、３次元ＣＧ映像の作成に要する時間を確保することができる場合には適用可能である。しかしながら、生放送のようにテレビカメラで撮影した映像を順次放送する場合には、テレビカメラが次の映像の撮影を終了するまでに３次元ＣＧ映像の作成を終了する必要があるので、生放送の番組には適用できないという問題がった。
【０００８】
ただし、従来の被写体表面における反射特性の計測方法は、例えば暗室に被写体を設置し、光源情報のわかるものを一点に設置した環境下で、その設置位置が既知である計測装置によって計測するものであった。また、従来の被写体表面の反射特性の推定方法には、例えば、特願２００１−１４５１９２（以下、文献１と記す）に開示される被写体表面の反射特性計測装置があった。
【０００９】
本発明の目的は、所定の照明条件の映像を実時間で順次生成することが可能な技術を提供することにある。
本発明の前記ならびにその他の目的と新規な特徴は、本明細書の記述及び添付図面によって明らかになるであろう。
【００１０】
【課題を解決するための手段】
本願において開示される発明のうち、代表的なものの概要を簡単に説明すれば、下記のとおりである。
【００１１】
（１）、入力画像として映像される被写体の反射係数を演算する反射係数演算手段と、前記入力画像と異なる照明条件を設定する新規照明条件設定手段と、前記設定された照明条件と前記反射係数とに基づいて、前記入力画像から前記新規照明条件設定手段で設定される照明条件の画像を生成する照明条件付加手段とを有する画像処理装置において、前記入力画像は前記被写体の奥行き情報を含む画像からなり、前記反射係数演算手段は、前記被写体の奥行き情報に基づいて前記入力画像から鏡面反射領域を特定し鏡面反射画素を補正する補正手段と、前記補正された入力画像から前記被写体の反射係数を演算する拡散反射係数演算手段とを備えた。
【００１２】
（２）被写体の奥行き情報を含む撮影画像を撮影する撮影手段と、前記撮影画像に映像される前記被写体の反射係数を演算する反射係数演算手段と、前記画像と異なる照明条件を設定する新規照明条件設定手段と、前記設定された照明条件と前記反射係数とに基づいて、前記撮影画像から前記新規照明条件設定手段で設定される照明条件の画像を生成する照明条件付加手段とを有する撮影システムにおいて、前記反射係数演算手段は、前記被写体の奥行き情報に基づいて前記撮影画像から鏡面反射領域を特定し鏡面反射画素を補正する補正手段と、前記補正された撮影画像から前記被写体の反射係数を演算する拡散反射係数演算手段とを備えた。
【００１３】
前述した手段によれば、入力画像として被写体の奥行き情報を含む画像が入力されると、まず、被写体の奥行き情報に基づいて、補正手段が入力画像から鏡面反射領域を特定し、この領域内の画素である鏡面反射画素を補正する。次に、補正手段により補正された入力画像から、拡散反射係数演算手段が被写体の反射係数を演算する。この後に、新規照明条件設定手段で設定された照明条件と、拡散反射係数演算手段で演算された反射係数とに基づいて、照明条件付加手段が鏡面反射の影響が補正された入力画像から新規照明条件の画像を生成する構成となっている、すなわち膨大な演算が必要となる鏡面反射係数の算出処理を必要としない構成となっているので、所定の照明条件の映像を実時間（リアルタイム）で順次生成することが可能となり、１フレーム期間内での画像処理を実現できる。
【００１４】
従って、被写体の奥行き情報を含む撮影画像を撮影する撮影手段で撮影された撮影画像を入力画像とすることによって、所定の照明条件の映像を実時間（リアルタイム）で順次生成する、１フレーム期間内での画像処理が可能な撮影システムを構成できる。
【００１５】
【発明の実施の形態】
以下、本発明について、発明の実施の形態（実施例）とともに図面を参照して詳細に説明する。
なお、発明の実施の形態を説明するための全図において、同一機能を有するものは同一符号を付け、その繰り返しの説明は省略する。
【００１６】
（実施の形態１）
図１は本発明の実施の形態１の撮影システムの概略構成を説明するための図である。ただし、以下の説明では、周知のビデオカメラの撮影と同じ赤、青、緑の色毎の２次元の輝度分布画像を、例えば３０分の１秒毎に順次撮影し出力する装置をＲＧＢカメラと記す。
図１において、１０１は被写体、１０２は計測部、１０３は半透鏡（ハーフミラー）、１０４は照明光源、１０５はレンズ、１０６は奥行き抽出ＲＧＢカメラ、１０７は法線推定部、１０８は入射光量推定部、１０９は閾値設定部、１１０は照明効果除去部、１１１は新規照明条件設定部、１１２は新規照明条件付加部、１１３は分光測色計、１１４は領域Ａを示す。
【００１７】
図１から明らかなように、実施の形態１の撮影システムでは、被写体１０１となる出演者を照明するための照明光源１０４と、この照明光源１０４から照射される照明光を出演者側に照射するハーフミラー１０３と、照明光源１０４から照射される照明光のＲＧＢの各色毎の分光特性を計測する分光測色計１１３と、被写体１０１の体表で反射されハーフミラー１０３を介して入射された光線（出演者の光学像）を結像させるレンズ１０５と、出演者の光学像を撮影する奥行き抽出ＲＧＢカメラ１０６とから計測部１０２が形成される。
【００１８】
ただし、実施の形態１の奥行き抽出ＲＧＢカメラ１０６は、ＲＧＢカメラで得られる画像の座標毎に、レンズ主点から被写体までの奥行き距離が例えば３０分の１秒毎に実時間で順次に得られる周知のカメラである。また、ＲＧＢの３枚の画像の内の少なくとも一枚のＲＧＢ画像に対応する奥行き情報の集まりを、撮影奥行き画像と記す。さらには、実施の形態１では、奥行き距離が３０分の１秒毎に得られることが必要条件ではなく、６０分の１秒毎以外にもさらに高速なものや、あるいは低速なものでも適用可能であることはいうまでもない。この機能を有するカメラ（奥行き抽出ＲＧＢカメラ１０６）については、既にいくつかの手法が実用化されており、周知のアクシビジョンと称されるテレビカメラや、２台のカメラを所定間隔で配置したステレオカメラによる手法などがあるが、実施の形態１では、他の手法を適用したカメラでも適用可能なことはいうまでもない。しかしながら、得られるＲＧＢ画像の各画素の奥行き情報が、１対１の対応で得られる必要がある。すなわち、ＲＧＢ画像を取得する際のレンズ主点と奥行き情報を得るためのレンズ主点は同一位置となる必要がある。
【００１９】
また、実施の形態１の撮影システムでは、奥行き抽出ＲＧＢカメラ１０６で撮影された奥行き画像（撮影奥行き画像）から各画素位置での法線ベクトルを演算する法線推定部１０７と、法線ベクトルに基づいて出演者の体表面における照明光の入射光量を演算する入射光量推定部１０８と、奥行き抽出ＲＧＢカメラ１０６で撮影されたＲＧＢの各色毎の映像及び分光測色計１１３で計測されたＲＧＢの各色毎の分光特性並びに各画素毎の入射光量情報及び閾値情報に基づいて反射係数情報を演算する照明効果除去部１１０と、閾値を設定する閾値設定部１０９と、法線情報及び反射係数情報に基づいて新規照明条件で画像を生成する新規照明条件付加部１１２と、この新規照明条件付加部１１２に新たな照明条件を設定する新規照明条件設定部１１１とから画像情報の処理部が形成される。
【００２０】
従って、実施の形態１の撮影システムでは、照明光源１０４から照射された照明光で撮影された光学像（奥行き情報を有する撮影奥行き画像）に基づいて、まず、法線推定部１０７により前記撮影奥行き画像の各画素における法線ベクトルが演算され、得られた法線ベクトルが入射光量推定部１０８に入力される。入射光量推定部１０８に入力された法線ベクトルは、出演者の体表面における照明光の入射光量を演算する際の基準データとされ、入射光量推定部１０８により撮影奥行き画像と法線ベクトルとに基づいた照明光の入射光量が演算され、得られた入射光量が照明効果除去部１１０に出力される。この入射光量推定部１０８で得られた入射光量と、照明光源１０４から照射された照明光で撮影されたＲＧＢの光学像（撮影ＲＧＢ画像）と、閾値設定部１０９からの閾値に基づいて、照明効果除去部１１０により撮影ＲＧＢ画像中での各画素毎の反射係数が算出され、反射係数情報として新規照明条件付加部１１２に出力される。
【００２１】
このとき、実施の形態１の撮影システムでは、法線推定部１０７からの法線情報と、新規照明条件として新規照明条件設定部１１１からの新規照明条件情報とが、新規照明条件付加部１１２に入力される構成となっている。従って、実施の形態１の新規照明条件付加部１１２では、各画素毎の反射係数、法線情報及び新規照明条件に基づいて、撮影ＲＧＢ画像の各画素毎に彩度及び輝度情報を演算して、新規照明条件に適合した画像（出力画像）を生成する。
【００２２】
このように、実施の形態１の撮影システムでは、被写体１０１の奥行き情報に基づいて算出された法線ベクトルを基準として、入射光量及び各画素における反射係数が算出されると共に、この法線ベクトル情報に基づいて新規の照明条件での撮影ＲＧＢ画像が生成される構成となっている。すなわち、リアルタイムで１フレーム期間内での画像処理を実現することが可能となる。
【００２３】
次に、実施の形態１の撮影システムの各部の詳細構成を説明する。
【００２４】
（計測部）
図２は、実施の形態１の計測部の光学的構成を説明するための図である。ただし、以下の説明では、ＲＧＢカメラ（奥行き抽出ＲＧＢカメラ１０６）をピンホールカメラと想定した場合のピンホール位置をレンズ主点２０１と記す。また、照明光源１０４を点光源と想定した場合の光源位置を照明主点２０２と記す。
【００２５】
実施の形態１の計測部１０２では、被写体１０１に対し、ハーフミラー１０３を用いて仮想的に同一光軸上に被写体１０１を撮影する奥行き抽出ＲＧＢカメラ１０６、および照明光源１０４が設置される構成となっている。この際、奥行き抽出ＲＧＢカメラ１０６のレンズ主点２０１と、照明光源１０４の照明主点２０２とは、被写体１０１からの光路長が図２のＬ１，Ｌ２，Ｌ３を用いて、下記（式１）に示すように、同一となるように設置する。
【００２６】
【式１】
Ｌ１＋Ｌ２＝Ｌ１＋Ｌ３　　　・・・・（式１）
このように、実施の形態１の撮影システムでは、レンズ主点２０１と照明主点２０２とが一致する配置で計測部１０２を形成することにより、被写体１０１の形状によっては、被写体１０１自身により照明光が遮蔽され、被写体１０１の表面に影が撮影されてしまうことを防止する構成となっている。すなわち、レンズ１０５と照明光源１０４の主点２０１，２０２が一致していないことに起因する被写体１０１自身の影が撮影画像中の被写体上に生じることとなるが、前記した計測部１０２の形成により後述する画像処理過程における陰部分の演算精度の低下を防止するものである。
【００２７】
ここで、ハーフミラー１０３を用いる目的の一つは、照明光源１０４から照射される照明光を、ハーフミラー１０３による反射を介して被写体１０１に照射するためである。また、ハーフミラー１０３を用いる他の目的は、被写体像のハーフミラー１０３による透過を利用して、奥行き抽出ＲＧＢカメラ１０６で撮影を行うためである。奥行き抽出ＲＧＢカメラ１０６には、カメラ側から赤外線を被写体１０１に投光し、その反射光を撮像する必要のある手法があるが、この点についても、ハーフミラー１０３が透過する性質を利用することで、奥行き抽出ＲＧＢカメラ１０６の機能を制限することは無い。
【００２８】
また、実施の形態１では、ハーフミラー１０３を用いた場合に説明するが、ハーフミラー１０３の代わりに、周知の光学プリズムを用いることにより、同一の機能を実現することは可能である。また、レンズ主点２０１と照明主点２０２とが同一となるような条件が得られる場合には、ハーフミラー１０３やカメラのレンズ１０５などの被写体に対する前後関係が、図１に示す構成と異なっても問題ない。例えば、図３に示すように、ハーフミラー１０３をレンズ１０５と奥行き抽出カメラ１０６との間に配置してた場合であっても、実施の形態１の計測部１０２を構成可能となる。
【００２９】
特に、図３に示すように、レンズ１０５、ハーフミラー１０３及び奥行き抽出ＲＧＢカメラ１０６を配置した場合、図２に示す構成に比較して、照明光源１０４の主点である照明主点２０２と、奥行き抽出ＲＧＢカメラ１０６に光学像を結像させるためのレンズ１０５の主点であるレンズ主点２０１とを合致させるために必要となる照明光源１０４の移動量が少なくできるので、装置全体を小型化できるという効果を得ることも可能となる。従って、専用システムとして構成する場合は、計測部１０２は、図３に示すように、被写体１０１の側からレンズ１０５、ハーフミラー１０３、奥行き抽出ＲＧＢカメラ１０６の順に配置した構成がよい。
【００３０】
一方、図１に点線で示す領域Ａ１１４以外の部分を組み込み接続する形態とすることによって、普段は奥行き抽出ＲＧＢカメラ１０６として使用し、本願発明の目的を達成したい場合にのみ、奥行き抽出ＲＧＢカメラ１０６を多目的に利用できる。
【００３１】
（法線推定部）
実施の形態１の法線推定部１０７は、計測部１０２から得られた撮影奥行き画像における被写体１０１の各部位の法線ベクトルを推定する手段である。ただし、以下の説明では、法線ベクトルとは被写体１０１の表面に垂直なベクトルであり、面の向きを表すものである。特に、実施の形態１では、撮影奥行き画像を構成する各画素における法線ベクトルを推定する。当該画素の法線ベクトルは、当該画素と周辺の画素との奥行き情報を用いることで演算できる。
【００３２】
例えば、計測部１０２で得られた撮影奥行き画像が、横６４０画素，縦４８０画素で図４の（ａ）に示すように構成されている場合には、画像の左下の２次元座標を基準位置（０，０）とする。任意の画素をこの左下の座標を基準に画素数の距離で座標を与えるものとする。
【００３３】
２次元座標（１００，１００）の法線ベクトルを計算する場合には、図４の（ｂ）に示すように、当該画素である（１００，１００）の奥行き情報と、当該画素に隣接する画素として周囲４画素（座標（１００，１０１）、（９９，１００）、（１０１，１００）、（１００，９９））の情報を用いる。まず、それぞれの２次元座標と奥行き情報とは、レンズ主点２０１を原点とした３次元座標に変換可能となるので、実施の形態１では予め３次元座標に変換する。次に、当該画素と周囲画素とに対応する３次元座標がなす線分をそれぞれ求める。さらに、隣り合う線分同士で外積を求めることによって、線分が成す平面の法線ベクトルが求まる。この演算により４つの法線ベクトルが求まるが、実施の形態１では、４つの法線ベクトルを平均し正規化することで、当該画素の法線ベクトルとする構成となっている。ただし、法線ベクトルの計算手法は限定しないが、前述するようなコンピュータグラフィックス分野で用いられている一般的な計算手法で実現できる。また、法線推定部１０７で得られた法線ベクトル情報および３次元座標は、それぞれ次段の入射光量推定部１０８に送られる。
【００３４】
（入射光量推定部）
実施の形態１の入射光量推定部１０８は、被写体１０１の各部位への入射光量を推定する手段である。ただし、入射光量の推定には、照明光源１０４からの距離と照明光の被写体１０１への入射角とが必要となる。
【００３５】
従って、実施の形態１の撮影システムでは、入射角度および照明光源１０４からの距離は、前段の法線推定部１０７で得られた被写体１０１の各部位の法線ベクトル情報と３次元座標とを用いて求める構成となっている。このとき、前段である放線推定部１０７で得られた３次元座標は、レンズ主点２０１すなわち照明主点２０２が原点になっており、法線ベクトル情報とともに用いることで容易に照明の入射角度が推定できる。また、照明光源１０４から被写体１０１の各部位までの距離についても、各部位の３次元座標と原点（０，０，０）との間の距離を求めることで推定可能である。例えば、被写体１０１の所定部位の座標を（ｘ_１，ｙ_１，ｚ_１）、その法線ベクトルを（ｘ_２，ｙ_２，ｚ_２）とした場合、入射角度θは（ｘ_１，ｙ_１，ｚ_１）、（ｘ_２，ｙ_２，ｚ_２）をベクトルとした場合の内積から求まる。
【００３６】
まず、照明光源１０４からの距離Ｌ１は、下記の式２となる。
【００３７】
【数１】
Ｌ１＝（Ｘ_１ ^２＋ｙ_１ ^２＋ｚ_１ ^２）^１／２　　　　・・・・（式２）
照明光源１０４から距離Ｌ１離れた場合の照明の光強度が距離Ｌ１の２乗に反比例することから照明光源１０４の光強度の基準をｐとすると、距離Ｌ１離れた場所への光強度ｐ_１は、下記の式３となる。
【００３８】
【数２】
ｐ_１＝ｐ／Ｌ１^２　　　・・・・（式３）
また、被写体への入射光量Ｉは、コンピュータグラフィックスの分野で古典的なＬａｍｂｅｒｔの余弦則から、下記の式４となる。
【００３９】
【数３】
Ｉ＝ｐ_１／ｃｏｓθ　　　・・・・（式４）
入射光量推定部１０８では、以上に説明した演算により、入射光量Ｉを画像の全画素にわたって計算し、得られた入射光量Ｉを次段の照明効果除去部１１０に送る。
【００４０】
（照明効果除去部）
実施の形態１の照明効果除去部１１０は、計測部１０２で得られた撮影ＲＧＢ画像と、前段の入射光量推定部１０８で得られた入射光量Ｉとを用いて、照明効果を除去した被写体１０１の表面の反射係数を求める手段である。
【００４１】
被写体１０１からの反射光の成分は、これまで種々提案されている反射モデルごとに異なる分類がなされている。しかし、その多くは経験的に広く用いられている２色性反射モデルに基づき、鏡面反射成分と拡散反射成分とに大別されている。鏡面反射成分は、一部の被写体領域で拡散反射成分に比べ極端に支配的となる現象を引き起こし、照明効果の除去を行う上で妨害となる。この領域は、ハイライト領域と呼ばれている。
【００４２】
これらの反射成分（鏡面反射成分と拡散反射成分）は、多様に提案されている反射モデル毎に表し方が異なる。拡散反射成分については、多くの反射モデルで照明光の入射角度に依存したＬａｍｂｅｒｔの余弦則が用いられており、実施の形態１においても、同様に利用する。一方、鏡面反射成分は、照明光の入射角度との関係だけでは容易に表せず、なんらかの反射モデルに当てはめたとしても高品質、高速かつ被写体１０１の材質を選ばず万能的に適用できる除去手法は未だない。従って、実施の形態１の照明効果除去部１１０では、反射モデルを考慮しない手法として、ハイライト領域を推定し、その周りの画素値で内挿する仕組みを組み込んだ構成となっている。ただし、将来、より高品質で精度の良い鏡面反射成分の除去手法が提案されれば、実施の形態１に組み込むことが可能であることはいうまでもない。
【００４３】
鏡面反射をしている部位の特徴は、画像中で比較的明るくかつ物体の色よりも光源色が支配的になる。例えば、白色照明下にある赤い物体色の表面で鏡面反射を起こしている部位は白く輝いて見える。この様な領域はハイライト領域と称されており、被写体１０１の反射係数推定の妨害となる。また、被写体１０１の材質に依存するが、被写体１０１の表面への照明光の入射角と被写体１０１の表面からカメラ側への反射光の出射角度とが近い値であると、鏡面反射が生じやすくなることが知られている。すなわち、実施の形態１の光学的条件下では、図５に示すように、レンズ主点２０１から被写体１０１の各部位に伸ばした直線Ｌ４に対して、被写体１０１の表面が垂直であるほど鏡面反射を生じやすくなる。
【００４４】
従って、実施の形態１の照明効果除去部１１０では、前述した性質から次の３つの判定基準を複合的に用いて、鏡面反射部分すなわちハイライト領域を判定する構成となっている。
（直線Ｌ４に対する判定に用いる値）
まず、レンズ主点２０１から被写体１０１の各部位に伸ばした直線Ｌ４と、被写体１０１の表面の法線ベクトルとがなす角度をθａとする。このθａは被写体１０１の法線ベクトル情報および３次元座標より算出できる。
（色の判定に用いる値）
次に、当該被写体部位の色と光源色の比較とは、以下のように行う。予め光原色の赤、青、緑の分光特性を計測しておき、図６に示すように、赤、青、緑の強度を軸とする３次元座標上に正規化して、点Ｓ_１として投影する。一方、奥行き抽出ＲＧＢカメラ１０６で撮影した撮影ＲＧＢ画像から被写体１０１の当該部位の赤、青、緑の各成分を、赤、青、緑の強度を軸とする３次元座標上に正規化して、点Ｓ_２として３次元座標上に投影する。このときの光源色点Ｓ_１と被写体当該部位の色点Ｓ_２との３次元座標上での距離を求め、この距離をｎ_１とする。このように距離ｎ_１を設定することにより、距離ｎ_１が小さいほど色が似通っていると判定できる。ただし、実施の形態１では、予め光源色の分光特性を計測し、この計測値を用いる構成としているが、比較基準をこれに限定するものではなく、計測部内の照明近くあるいは図１に示すような部位に分光測色計１１３を内蔵し、直接に労力を掛けずに分光特性を自動で計測し、利用することも可能である。
（明るさの判定に用いる値）
図７に示すように、奥行き抽出ＲＧＢカメラ１０６で撮影した撮影ＲＧＢ画像の輝度値のピーク値Ｉ_ｍａｘと、全画素の輝度値に対するヒストグラムとを作成し、最も明るい輝度値Ｉ_ｍａｘから一つ目のヒストグラム上のピークとなる輝度値Ｉ_ｐｅａｋを演算する。次に、Ｉ_ｐｅａｋ−（Ｉ_ｍａｘ−Ｉ_ｐｅａｋ）なる輝度値をＩ_ｓとする。このように輝度値Ｉ_ｓを設定することによって、鏡面反射によるハイライト部分が、輝度値Ｉ_ｐｅａｋを中心かつ最大値としたガウス関数で表せる分布に近似出来るので、輝度値Ｉ_ｐｅａｋを中心にＩ_ｐｅａｋ−Ｉ_ｍａｘ間を折り返した左右対称の分布となる。ただし、輝度値Ｉ_ｓは輝度値Ｉ_ｍａｘを折り返した部分に相当する。
（３つの判定基準の利用）
３つの判定基準に対し、予め手動で第１〜第３の閾値Ｔｈ_１，Ｔｈ_２，Ｔｈ_３を閾値設定部１０９により設定しておく。
【００４５】
まず、奥行き抽出ＲＧＢカメラ１０６で撮影した撮影ＲＧＢ画像の各画素の角度θａと第１の閾値Ｔｈ_１とを比較し、角度θａが第１の閾値Ｔｈ_１より小さい場合には、鏡面反射による反射が支配的な部位とみなし、この条件を満たす領域の情報Ｒ_１を保存する。
【００４６】
次に、Ｒ_１の領域に限り、領域内の全画素の輝度値Ｉ_ｏｕｔと輝度値Ｉ_ｓとの差分値Ｉ_ｄｉｆｆを、下記の式５より求める。
【００４７】
【数４】
Ｉ_ｄｉｆｆ＝Ｉ_ｏｕｔ−Ｉ_ｓ　　　・・・・（式５）
差分値Ｉ_ｄｉｆｆが第２の閾値Ｔｈ_２よりも大きい場合は、鏡面反射による反射が支配的な部位とみなし、この条件を満たす領域の情報Ｒ_２を保存する。
【００４８】
最後にＲ_２の領域に限り、領域内の全画素について距離ｎ_１を求める。ここで、距離ｎ_１が第３の閾値Ｔｈ_３より小さい部分は、鏡面反射による反射が支配的な部位とみなし、この条件を満たす領域の情報Ｒ_３を保存する。
【００４９】
最終的にＲ_３の領域を鏡面反射が支配的な部位とみなす。ここで、Ｒ_３の領域について、一般的な画像処理手法であるセグメンテーション処理を行い、Ｒ_３の領域を画像中で最小面積の閉区間に分割し、それぞれ領域ごとに周りの画素値でその領域の画素値を置き換える。
【００５０】
以上のように鏡面反射が原因となる反射係数が処理された撮影ＲＧＢ画像について、実施の形態１の撮影システムでは、以下に示す手順で拡散反射が原因となる反射係数を演算する。
拡散反射光Ｉ_ｄは、入射光をＩで拡散反射係数をＫ_ｄで表すと、下記の式６及び式７となる。
【００５１】
【数５】
Ｉ_ｄ＝Ｋ_ｄ×Ｉ　　　・・・・（式６）
Ｋ_ｄ＝Ｉ_ｄ／Ｉ　　　・・・・（式７）
拡散反射光Ｉ_ｄに対応する観測された撮影ＲＧＢ画像の赤成分、青成分、緑成分を（Ｃ_ｒ，Ｃ_ｂ，Ｃ_ｇ）とし、入射光Ｉの赤成分、青成分、緑成分を（Ｉ_ｒ，Ｉ_ｂ，Ｉ_ｇ）とすると、拡散反射係数Ｋ_ｄの赤成分、青成分、緑成分（Ｋ_ｒ，Ｋ_ｂ，Ｋ_ｇ）は、下記の式８〜式１０により計算される。
【００５２】
【数６】
Ｋ_ｒ＝Ｃ_ｒ／Ｉ_ｒ　　　・・・・（式８）
Ｋ_ｂ＝Ｃ_ｂ／Ｉ_ｂ　　　・・・・（式９）
Ｋ_ｇ＝Ｃ_ｇ／Ｉ_ｇ　　　・・・・（式１０）
この得られた拡散反射係数（Ｋ_ｒ，Ｋ_ｂ，Ｋ_ｇ）は、照明効果を除去した情報として、次段の新規照明条件付加部１１２に送られる。
【００５３】
図８は実施の形態１の照明効果除去部の概略構成を説明するための図である。ただし、８０１は照射入射角判定部（第１の領域特定手段）、８０２は色判定部（分光特性演算手段、第２の領域特定手段）、８０３は明るさ判定部（輝度値演算手段、第３の領域特定手段）、８０４は領域内挿部（補正手段）、８０５は拡散反射光除去部（拡散反射係数演算手段）を示す。
【００５４】
図８において、照明入射角判定部８０１は、入射光量推定部１０８からの入射光量情報、法線推定部１０７からの法線ベクトルと３次元位置情報、奥行き抽出ＲＧＢカメラ１０６からの撮影ＲＧＢ画像、及び閾値設定部１０９からの第１の閾値Ｔｈ_１が入力される構成となっており、前述する「直線Ｌ４に対する判定に用いる値」に記載した判定処理を行う手段である。この照明入射角判定部８０１で得られた領域情報Ｒ_１は、色判定部８０２に入力される。
【００５５】
色判定部８０２は、入射光量推定部１０８からの入射光量情報、法線推定部１０７からの法線ベクトルと３次元位置情報、奥行き抽出ＲＧＢカメラ１０６からの撮影ＲＧＢ画像、及び閾値設定部１０９からの第２の閾値Ｔｈ_２が入力される構成となっており、前述する「色の判定に用いる値」に記載した判定処理を行う手段である。ただし、色判定部８０２は、前述するように、照明入射角判定部８０１から入力される領域情報Ｒ_１に基づいて、照明入射角判定部８０１で得られた領域に対してのみ判定処理を行う。この色判定部８０２で得られた領域情報Ｒ_２は、次段の明るさ判定部８０３に入力される。
【００５６】
明るさ判定部８０３は、入射光量推定部１０８からの入射光量情報、法線推定部１０７からの法線ベクトルと３次元位置情報、奥行き抽出ＲＧＢカメラ１０６からの撮影ＲＧＢ画像、及び閾値設定部１０９からの第３の閾値Ｔｈ_３が入力される構成となっており、前述する「明るさの判定に用いる値」に記載した判定処理を行う手段である。ただし、明るさ判定部８０３は、前述するように、色判定部８０２から入力される領域情報Ｒ_２に基づいて、色判定部８０２で得られた領域に対してのみ判定処理を行う。この明るさ判定部８０３で得られた領域情報Ｒ_３は、次段の領域内挿部８０４に入力される。
【００５７】
領域内挿部８０４は、明るさ判定部８０３で得られた領域情報Ｒ_３に基づいて、撮影ＲＧＢ画像の当該領域をその周囲の画素値で内挿する手段であり、この内挿処理は、前述する「３つの判定基準の利用」に記載した処理である。この領域内挿部８０４で得られた内挿処理後の撮影ＲＧＢ画像は、次段の拡散反射光除去部８０５に入力される。
【００５８】
拡散反射光除去部８０５は、前述する式８、式９、式１０に基づき拡散反射係数を計算する手段であり、得られた拡散反射係数（Ｋ_ｒ，Ｋ_ｂ，Ｋ_ｇ）が、照明効果除去部１１０の出力として、新規照明条件付加部１１２に入力される。
【００５９】
（閾値設定部）
照明効果除去部１１０で用いる第１〜第３の閾値Ｔｈ_１，Ｔｈ_２，Ｔｈ_３を設定する手段である。例えば、閾値の設定釦としてつまみ状のボリュームを３つ配置し、これらを操作することで各閾値を手動で設定することが可能となる。この第１〜第３の閾値Ｔｈ_１，Ｔｈ_２，Ｔｈ_３の設定は、使用前あるいは使用中に出力画像を見ながら、操作者の判断で行うことが可能である。ただし、本願発明では、このユーザインターフェースを限定しない。また、計測部内の分光測色計より照明の輝度値Ｉ_１を求め、この輝度値Ｉ_１に比例するように第２の閾値Ｔｈ_２を自動的に設定するなど、自動的にこれらの閾値を設定することも可能であることはいうまでもない。
【００６０】
（新規照明条件設定部）
新規照明条件設定部１１１は、必要とされる照明条件を設定する手段である。実施の形態１の新規照明条件設定部１１１では、設定項目は、照明の数、３次元空間での配置、配光特性、色温度などを設定することが可能となっている。
【００６１】
これらの設定項目は、周知のコンピュータグラフィックスによる画像を製作する上で必要となる項目で、一般的なコンピュータグラフィックス用ソフトウェアの機能の一部と同等の機能を、新規照明条件設定部１１１に組み込むことでも実現可能である。特に、実時間で順次照明条件を変更する必要がない場合には、予め決定されている照明条件を送り続ける仕組みでも良い。また、照明条件の設定の類の操作性に慣れていない人でも直感的に操作が可能とする場合には、例えば、照明の方向のみを周知のジョイスティックで指定し、指定された方向の情報を次段の新規照明条件付加部１１２に順次送るような構成でもよい。さらには、設定項目を限定することも可能である。
【００６２】
また、テレビ番組の映像制作手法である仮想スタジオシステムと組み合わせて用いることも可能である。ただし、仮想スタジオシステムとは、被写体１０１を撮影しているカメラと同一の条件で背景となるコンピュータグラフィックス映像を生成し、この生成された画像とカメラで撮影された被写体画像とを合成し、あたかもコンピュータグラフィックスで生成した映像上に被写体１０１が存在するかのような映像効果を生成するシステムである。このコンピュータグラフィックスで生成している背景映像の照明条件は、コンピュータの中に設定してあるので、この設定条件を本手法の新規照明条件設定部１１１に与えることが可能である。
また、実際の日時あるいは使用者が指定した日時に応じた屋外の照明条件を物理法則に則り推定し、次段の新規照明条件付加部１１２に送ることも可能である。
【００６３】
実施の形態１では、このユーザインターフェースについては限定しないが、照明条件を実時間で、順次次段の新規照明条件付加部１１２に送る必要があることはいうまでもない。
【００６４】
（新規照明条件付加部）
新規照明条件付加部１１２は、法線推定部１０７からの法線ベクトル情報、照明効果除去部１１０からの反射係数（鏡面反射係数と拡散反射係数）、及び新規照明条件設定部１１１からの新規照明条件の情報が入力される構成となっており、これらの入力情報に基づいて、新規照明条件で見込まれる被写体映像を生成し出力する手段である。ただし、新規条件の被写体映像の生成は、周知のコンピュータグラフィックスで用いられる一般的な手法で実現できるので、詳細な説明は省略する。
【００６５】
ここで、新規照明条件設定部１１１に仮想スタジオシステムから得られる照明条件を与えた場合、仮想スタジオシステムで生成したコンピュータグラフィックス映像と同一の照明条件の被写体映像が生成できる。このとき、奥行き情報を利用して、被写体１０１と仮想スタジオの背景に用いるコンピュータグラフィックスのセットとの前後関係を判定する手段を設けることによって、コンピュータグラフィックスのセットより前に位置する被写体１０１の領域のみを、コンピュータグラフィックスのセットによる背景映像に合成することが可能となるので、被写体１０１が背景映像を撮影した場所とは異なる場所にいる場合であっても、被写体１０１が背景映像の撮影位置で撮影しているような効果（合成効果）が得られる。
【００６６】
なお、以上説明を行った装置のカメラや同期発生器は、３０分の１秒や６０分の１秒を基準に説明を行ったが、ＮＴＳＣ信号の機器を用いた場合は同期信号が５９．９４Ｈｚであるので、５９．９４分の１秒あるいは２／５９．９４秒となることはいうまでもない。また、高速度カメラや、長時間露光カメラなど、様々な周波数のビデオカメラが存在するが、本願発明はその周波数を限定するものではなく、画像を構成する最小単位毎に処理が順次繰り返されるように構成することによって、他の周波数の撮影カメラを用いた撮影システムにも適用可能である。
【００６７】
また、以上に説明した実施の形態１の撮影システムでは、被写体１０１は暗室にあり、計測部１０２にある照明以外の照明が存在しない環境で被写体１０１の映像を撮影する必要があることはいうまでもない。
【００６８】
以上説明したように、実施の形態１の撮影システムでは、照明光源１０４から照射された照明光のもと、奥行き抽出ＲＧＢカメラ１０６を用いて撮影された撮影奥行き画像に基づいて、法線推定部１０７が撮影奥行き画像の各画素における法線ベクトルを演算し、入射光量推定部１０８が入力された法線ベクトルから出演者である被写体１０１の体表面における照明光の入射光量を演算し、照明効果除去部１１０が入射光量推定部１０８で得られた入射光量と、照明光源１０４から照射された照明光で撮影された撮影ＲＧＢ画像と、閾値設定部１０９からの第１〜第３の閾値Ｔｈ_１，Ｔｈ_２，Ｔｈ_３に基づいて、撮影ＲＧＢ画像中での各画素毎の反射係数を算出し、新規照明条件付加部１１２が反射係数に基づいて新規照明条件での被写体映像を生成し出力する。
【００６９】
このとき、実施の形態１の撮影システムでは、被写体１０１の反射特性すなわち反射係数を演算する照明効果除去部１１０が、照射入射角判定部８０１、色判定部８０２、明るさ判定部８０３、領域内挿部８０４、及び拡散反射光除去部８０５で構成されている。
【００７０】
ここで、鏡面反射に係わる演算として、まず、照明入射角判定部８０１が入射光量情報、法線ベクトル、３次元位置情報、撮影ＲＧＢ画像、及び第１の閾値Ｔｈ_１に基づいた領域の特定を行う。次に、照明入射角判定部８０１で特定された領域に対して、色判定部８０２が入射光量情報、法線ベクトル、３次元位置情報、撮影ＲＧＢ画像、及び第２の閾値Ｔｈ_２に基づいた領域の特定を行う。次に、色判定部８０２で特定された領域に対して、明るさ判定部８０３が入射光量情報、法線ベクトル、３次元位置情報、撮影ＲＧＢ画像、及び第３の閾値Ｔｈ_３に基づいた領域の特定を行う。次に、明るさ判定部８０３で特定された領域に対して、領域内挿部８０４が撮影ＲＧＢ画像の当該領域をその周囲の画素値で内挿する。この後に、領域内挿部８０４でハイライト領域が内挿（補正）された撮影ＲＧＢ画像に対して、拡散反射光除去部８０５が拡散反射光に係わる反射係数を演算する構成となっている。
【００７１】
このように、実施の形態１の照明効果除去部１１０が奥行き抽出ＲＧＢカメラ１０６で得られた撮影ＲＧＢ画像から鏡面反射の影響を取り除いた撮影ＲＧＢ画像を生成する構成、すなわち膨大な演算が必要となる鏡面反射係数の算出処理を必要としない構成となっているので、所定の照明条件の映像を実時間（リアルタイム）で順次生成することが可能となり、特に１フレーム期間内での画像処理を必要とする生放送への適用が可能となる。
【００７２】
（実施の形態２）
図９は本発明の実施の形態２の撮影システムの概略構成を説明するための図である。ただし、９０１は光学シャッタ、９０２は同期発生器、９０３は分周器、９０４は画像メモリ部、９０５は差分画像生成部を示す。また、以下の説明では、実施の形態１の撮影システムと構成が異なる、光学シャッタ９０１、同期発生器９０２、分周器９０３、画像メモリ部９０４、及び差分画像生成部９０５に係わる動作及びその効果について、詳細に説明する。
【００７３】
図９から明らかなように、実施の形態１の撮影システムでは、被写体１０１となる出演者を照明するための照明光源１０４と、この照明光源１０４から照射される照明光を被写体１０１の側に照射するハーフミラー１０３と、照明光源１０４とハーフミラー１０３との間に配置されて照明光源１０４から出射される照明光の透過と遮蔽とを制御する光学シャッタ９０１と、照明光源１０４から照射される照明光のＲＧＢの各色毎の分光特性を計測する分光測色計１１３と、被写体１０１の体表で反射されハーフミラー１０３を介して入射された光線（被写体１０１の光学像）を結像させるレンズ１０５と、被写体１０１の光学像を撮影する奥行き抽出ＲＧＢカメラ１０６と、奥行き抽出ＲＧＢカメラ１０６が備える外部同期入力端子に適合した同期信号を生成する同期発生器９０２と、同期発生器９０２からの同期信号を分周する分周器９０３とから計測部１０２が形成される。
【００７４】
このように、実施の形態２の計測部１０２では、同期発生器９０２で生成された同期信号に同期して、奥行き抽出ＲＧＢカメラ１０６の撮影タイミングを制御する構成となっている。また、同期信号を分周器で分周した信号（分周信号）により、光学シャッタ９０１の動作を制御する構成となっている。従って、奥行き抽出ＲＧＢカメラ１０６による被写体１０１の撮影と、被写体１０１への照明とを同期させることが可能となっている。なお、計測部１０２の詳細については後述する。
【００７５】
また、実施の形態１の撮影システムでは、奥行き抽出カメラ１０６で撮影された奥行き画像（撮影奥行き画像）から各画素位置での法線ベクトルを演算する法線推定部１０７と、法線ベクトルに基づいて出演者の体表面における照明光の入射光量を演算する入射光量推定部１０８と、奥行き抽出ＲＧＢカメラ１０６で撮影された撮影ＲＧＢ画像を順次格納する画像メモリ部９０４と、画像メモリ部９０４から読み出した１フレーム（１撮影周期）分遅れた画像（遅延画像）と奥行き抽出ＲＧＢカメラ１０６で撮影された撮影ＲＧＢ画像とからＲＧＢの各色毎の差分画像を生成する差分画像生成部９０５と、差分画像生成部で生成されたＲＧＢの各色毎の映像及び分光測色計１１３で計測されたＲＧＢの各色毎の分光特性並びに各画素毎の入射光量情報及び閾値情報に基づいて反射係数情報を演算する照明効果除去部１１０と、閾値を設定する閾値設定部１０９と、法線情報及び反射係数情報に基づいて新規照明条件で画像を生成する新規照明条件付加部１１２と、この新規照明条件付加部１１２に新たな照明条件を設定する新規照明条件設定部１１１とから画像情報の処理部が形成される。
【００７６】
このように、実施の形態２の処理部では、奥行き抽出ＲＧＢカメラ１０６で撮影された撮影ＲＧＢ画像を画像メモリ部９０４に一旦格納し、この格納した撮影ＲＧＢ画像とリアルタイムで撮影される撮影ＲＧＢ画像との差分画像を順次差分画像生成部９０５で生成し、この差分画像を照明効果除去部１１０が用いる構成となっている。
【００７７】
図９において、同期発生器９０２は、奥行き抽出ＲＧＢカメラ１０６のカメラ画像の撮像周波数である３０分の１秒、あるいは６０分の１秒単位の同期信号をパルスとして発生される周知の同期信号発生手段である。ただし、同期信号は奥行き抽出ＲＧＢカメラ１０６を構成するビデオカメラが有する外部同期入力端子に適合した波形であり、この同期信号がビデオカメラに入力される。このビデオカメラは入力した同期信号に同期して撮像するように調整する。ただし、一般的な業務用テレビカメラには外部同期入力端子あるいはゲンロック端子と呼ばれる外部同期信号を入力するための端子があり、内蔵されるＰＬＬ回路（ＰｈａｓｅＬｏｃｋｅｄ　Ｌｏｏｐ：位相同期回路）により、外部同期信号に同期してカメラ内部回路用の同期信号を発生させるための発振を行う回路が組み込まれている。従って、実施の形態２における同期処理は容易に実現できる。
【００７８】
また、分周器９０３は、３０分の１秒あるいは６０分の１秒の同期信号を２分周する手段であり、同期信号の半分の１５分の１秒あるいは３０分の１秒の周期の信号（分周信号）を生成する。特に、実施の形態２では、分周信号のデューティ比を５０％のデジタル信号としておき、分周信号が入力される各部での判定が容易となるように、波形整形する。
【００７９】
また、照明投光方向に対して垂直に配置される光学シャッタ９０１は、電気信号で開閉が可能な周知の光学シャッタであり、閉状態で光を遮断し、開状態で光を透過する構成となっている。また、実施の形態２の光学シャッタ９０１は、駆動信号あるいは制御信号として入力される分周信号が、Ｈｉｇｈ（１）の期間では閉状態となり、Ｌｏｗ（０）の期間では開状態となる。なお、光学シャッタ９０１としては、機械式のシャッタや液晶シャッタを用いることにより、実施の形態２の光学シャッタを実現できる。
【００８０】
画像メモリ部９０４は、奥行き抽出ＲＧＢカメラ１０６で撮影された撮影ＲＧＢ画像を順次格納する手段であり、実施の形態２では、次の撮影ＲＧＢ画像の入力タイミングで、格納している一つ前の入力タイミングで格納した撮影ＲＧＢ画像を出力する構成となっている。すなわち、画像メモリ部９０４は、撮影ＲＧＢ画像が入力し終わり、次の撮影ＲＧＢ画像が奥行き抽出ＲＧＢカメラ１０６より送られてくると、そのタイミングで画像メモリに蓄えられた内容すなわち撮影ＲＧＢ画像を、次段の差分画像生成部９０５に送る。すなわち、画像メモリ部９０４を通過した撮影ＲＧＢ画像は、奥行き抽出ＲＧＢカメラ１０６から出力されるＲＧＢ画像に対して、画像一枚分遅れることとなる。
【００８１】
差分画像生成部９０５は、奥行き抽出ＲＧＢカメラ１０６から直接に入力される現在の撮影ＲＧＢ画像Ｇ_ｐと、画像メモリ部９０４から入力される画像一枚分遅延した画像Ｇ_ｄとの差の画像（差分画像）Ｇ_ｄｉｆｆを生成する手段である。特に、実施の形態２の差分画像生成部９０５では、下記の式１１と式１２とを同期発生器を分周器で２分周した信号を基に、３０分の１秒あるいは６０分の一秒ごとに切り替えることによって、差分画像Ｇ_ｄｉｆｆを得る構成となっている。
【００８２】
【数７】
Ｇ_ｄｉｆｆ＝Ｇ_ｐ−Ｇ_ｄ　　　・・・・（式１１）
Ｇ_ｄｉｆｆ＝Ｇ_ｄ−Ｇ_ｐ　　　・・・・（式１２）
特に、実施の形態２の差分画像生成部９０５では、計測部１０２の照明光源１０４が点灯した際の画像が右辺の左の変数となり、消灯している際の画像が右辺の右の変数となるように制御する構成となっている。すなわち、点灯している状態の画像から点灯していない状態の画像を差し引くことで、計測部１０２内の照明光源１０４による反射光のみを、差分画像Ｇ_ｄｉｆｆとして取り出す構成となっている。
【００８３】
これを実現するために、分周器９０３からの分周信号が０の際には式１１、分周信号が１の際には式１２を選択して差分画像を取得し、次段の照明効果除去部１１０に出力する。
【００８４】
このような工夫を行うことで、計測部１０２以外の他の光源により被写体１０１が照明されている場合であっても、実施の形態１の撮影システムと同様の効果を得る構成となっている。
【００８５】
図１０は実施の形態２における奥行き抽出カメラによる撮影ＲＧＢ画像の収集タイミングと照明光の照射タイミングとを説明するための図である。ただし、図１０に示す画像名は、奥行き抽出ＲＧＢカメラ１０６で順次撮影される撮影ＲＧＢ画像を示すものであり、本明細書中では１，２，３，４，５，６・・・の自然数で示される連続する番号を画像名とする。
【００８６】
まず、図１０に基づいて、計測部１０２における撮影ＲＧＢ画像の撮影動作について説明する。
【００８７】
前述するように、実施の形態２の計測部１０２では、奥行き抽出ＲＧＢカメラ１０６には同期信号が入力される構成となっているのに対して、光学シャッタ９０１には同期信号が２分周された分周信号が入力される構成となっている。従って、実施の形態２の計測部１０２では、同期信号に同期して奥行き抽出ＲＧＢカメラ１０６は同期信号の入力に同期して撮影を行うこととなる。
【００８８】
一方、光学シャッタ９０１では、同期信号が２分周されたデューティ比５０％の分周信号のＨｉｇｈ（１）期間とＬｏｗ（０）期間に応じて、閉状態と開状態と順番に切り替わる構成となっている。すなわち、実施の形態２の光学シャッタ９０１では、同期信号に同期して照明光源１０４から照射された照明光の透過と遮蔽とが切り替え制御されることとなるので、照明光源１０４から被写体１０１への照明光の照射も同期信号に同期した照射となる。
【００８９】
その結果、図１０に示すように、画像１，２，３，４，５，６・・・と連続する撮影ＲＧＢ画像の撮影が、同期信号に同期して動作する奥行き抽出ＲＧＢカメラによってなされた場合には、分周器９０３の出力である分周信号は、例えば画像１、３、５・・・を撮影中は１となり、画像２，４，６・・・を撮影中は０となる。従って、画像１、３、５・・・を撮影中すなわち分周信号が１の期間では、光学シャッタ９０１は閉じて照明光源１０４からの照明光を遮蔽するので、この照明光による被写体１０１の照明は行われないこととなる。これに対して、画像２、４、６・・・を撮影中すなわち分周信号が０の期間では、光学シャッタ９０１は開いて照明光源１０４からの照明光を透過するので、この照明光により被写体１０１が照明されることとなる。
【００９０】
このように、実施の形態２の計測部１０２では、奥行き抽出ＲＧＢカメラ１０６の撮影周期（フレーム期間）毎に、照明光源１０４からの照明光で被写体１０１を照明した撮影ＲＧＢ画像と、照明光源１０４からの照明光で被写体１０１を照明していない撮影ＲＧＢ画像とを撮影する構成となっている。
【００９１】
次に、図１０に基づいて、処理部における出力画像の生成動作を説明する。ただし、照明効果除去部１１０に入力される画像データが異なる以外は、実施の形態１の処理部における出力画像の生成動作と同じとなる。従って、以下の説明では、画像メモリ部９０４及び差分画像生成部９０５による差分画像Ｇ_ｄｉｆｆの生成動作についてのみ詳細に説明する。
【００９２】
奥行き抽出ＲＧＢカメラ１０６で撮影された撮影ＲＧＢ画像は、順次画像メモリ部９０４に格納される。このとき、同じ撮影ＲＧＢ画像が差分画像生成部９０５にも入力される。従って、図１０に示す画像名１の撮影ＲＧＢ画像が撮影された場合には、画像メモリ部９０４と差分画像生成部９０５には、この画像名１の撮影ＲＧＢ画像が現在の撮影ＲＧＢ画像Ｇ_ｐとして入力されることとなる。このとき、画像メモリ部９０４にはそれ以前の画像が格納されていないので、差分画像生成部９０５からの差分画像Ｇ_ｄｉｆｆの出力もなされないこととなる。
【００９３】
次の撮影周期で画像名２の撮影ＲＧＢ画像が撮影された場合には、画像メモリ部９０４と差分画像生成部９０５には、この画像名２の撮影ＲＧＢ画像が現在の撮影ＲＧＢ画像Ｇ_ｐとして入力されることとなる。このとき、まず、差分画像生成部９０５は画像メモリ部９０４から一つ前の撮影周期で撮影された画像となる、画像名１の撮影ＲＧＢ画像を画像一枚分遅延した画像Ｇ_ｄとして読み出す。次に、差分画像生成部９０５は画像メモリ部９０４からの画像名１の撮影ＲＧＢ画像（画像一枚分遅延した画像Ｇ_ｄ）と、奥行き抽出ＲＧＢカメラ１０６からの画像名２の撮影ＲＧＢ画像（現在の撮影ＲＧＢ画像Ｇ_ｐ）とから、その差分画像Ｇ_ｄｉｆｆを生成する。このときの演算は、前述するように、式１１に従って、照明光源１０４からの照射光が照射されて撮影された画像である画像名２の撮影ＲＧＢ画像（現在の撮影ＲＧＢ画像Ｇ_ｐ）から、照明光源１０４からの照射光が照射されないときに撮影された画像名１の撮影ＲＧＢ画像（画像一枚分遅延した画像Ｇ_ｄ）を減算することによって、差分画像Ｇ_ｄｉｆｆを生成する。
【００９４】
次の撮影周期で画像名３の撮影ＲＧＢ画像が撮影された場合には、画像メモリ部９０４と差分画像生成部９０５には、この画像名３の撮影ＲＧＢ画像が現在の撮影ＲＧＢ画像Ｇ_ｐとして入力される。このとき、差分画像生成部９０５は画像メモリ部９０４から一つ前の撮影周期で撮影された画像となる、画像名２の撮影ＲＧＢ画像を画像一枚分遅延した画像Ｇ_ｄとして読み出す。次に、差分画像生成部９０５は画像メモリ部９０４からの画像名２の撮影ＲＧＢ画像（画像一枚分遅延した画像Ｇ_ｄ）と、奥行き抽出ＲＧＢカメラ１０６からの画像名３の撮影ＲＧＢ画像（現在の撮影ＲＧＢ画像Ｇ_ｐ）とから、その差分画像Ｇ_ｄｉｆｆを生成する。このときの演算は、前述するように、式１２に従って、照明光源１０４からの照射光が照射されて撮影された画像である画像名２の撮影ＲＧＢ画像（画像一枚分遅延した画像Ｇ_ｄ）から、照明光源１０４からの照射光が照射されないときに撮影された画像名３の撮影ＲＧＢ画像（現在の撮影ＲＧＢ画像Ｇ_ｐ）を減算することによって、差分画像Ｇ_ｄｉｆｆを生成する。
【００９５】
以上に説明した画像メモリ部９０４への現在の撮影ＲＧＢ画像Ｇ_ｐの格納と、差分画像生成手段９０５による、式１１もしくは式１２に従った、照射光が照射されて撮影された撮影ＲＧＢ画像から、照明光源１０４からの照射光が照射されないときに撮影された撮影ＲＧＢ画像の減算処理を順次行うことにより得られた差分画像Ｇ_ｄｉｆｆは、照明効果除去部１１０に出力され、実施の形態１と同様の処理がなされることとなるので、実施の形態１の撮影システムと同様の効果が得られることとなる。
【００９６】
また、実施の形態２では、以上に説明した画像メモリ部９０４への現在の撮影ＲＧＢ画像Ｇ_ｐの格納と、差分画像生成手段９０５による、式１１もしくは式１２に従った、照射光が照射されて撮影された撮影ＲＧＢ画像から、照明光源１０４からの照射光が照射されないときに撮影された撮影ＲＧＢ画像の減算処理を順次行うことにより、計測部１０２の照明光源１０４による反射光のみを差分画像Ｇ_ｄｉｆｆとして取り出すことが可能となる。その結果、計測部１０２以外の照明光源１０４により、被写体１０１が照明されている場合であっても、暗室に被写体１０１を入れて撮影した実施の形態１の撮影システムと同様の効果が得られるという格別の効果を有する。
【００９７】
（実施の形態３）
図１１は本発明の実施の形態３の撮影システムにおける照明光源の概略構成を説明するための図である。ただし、実施の形態３の撮影システムは、計測部１０３を構成する奥行き抽出ＲＧＢカメラ１０６及び照明光源１０４の構成を除く他の構成は、実施の形態１もしくは実施の形態２の撮影システムと同様の構成となる。従って、以下の説明では、計測部１０２の構成について、詳細に説明する。
【００９８】
図１１において、１１０１はテレビカメラ、１１０２はレンズリモコン、１１０３は第１のＡ／Ｄ変換器、１１０４は第２のＡ／Ｄ変換器、１１０５はルックアップテーブル検索部、１１０６は主点位置情報電圧変換部、１１０７はサーボモータ、１１０８は回転軸、１１０９は摺動部、１１１０は光源ランプを示す。
【００９９】
図１１に示すように、実施の形態３の撮影システムでは、レンズリモコン１１０２から遠隔操作信号（例えば、電圧信号等）に応じて、奥行き抽出ＲＧＢカメラ１０６とレンズ１０５とからなるテレビカメラ１１０１の図示しないズーム機構及びフォーカス機構が動作して、テレビカメラ１１０１の撮影視野（画角）が任意に設定可能なテレビカメラ１１０１を用いることによって、奥行き抽出カメラ１０６とレンズ１０５とが構成されている。また、ズーム機構及びフォーカス機構を制御する遠隔操作信号は、第１及び第２のＡ／Ｄ変換器１１０３，１１０４に入力される構成となっている。
【０１００】
第１及び第２のＡ／Ｄ変換器１１０３，１１０４では、レンズリモコン１１０２からの遠隔操作信号（レンズのズーム量及びフォーカス量）がデジタル信号に変換され、ルックアップテーブル検索部１１０５に出力される。このデジタル信号に変換されたズーム量及びフォーカス量が入力されたルックアップテーブル検索部１１０５は、ズーム量及びフォーカス量に応じた照明主点２０２の位置情報を格納する図示しないテーブルを参照して、ズーム量及びフォーカス量の組み合わせに適合した照明主点２０２を検索する。検索によって得られた照明主点２０２の位置情報は、主点位置情報電圧変換部１１０６により、サーボモータ１１０７を駆動する駆動電力に変換され、サーボモータ１１０７が駆動される。
【０１０１】
ここで、実施の形態３では、サーボモータ１１０７の回転軸１１０８にはネジ山が形成され、この回転軸１１０８のネジ山と摺動部１１０９のネジ山とが嵌合されている。その結果、回転軸１１０８の回転量に応じて、光源ランプ１１１０が取り付けられる摺動部１１０９が図中に矢印で示す方向（照明光源１０４の照明主点２０２が増減する方向）に移動して、光源ランプ１１１０の位置すなわち照明主点２０２の位置を移動させる構成となっている。
【０１０２】
従って、サーボモータ１１０７の駆動によって、光源ランプ１１１０と共に摺動部１１０９が回転軸１１０８に沿って移動し、光源ランプ１１１０がズーム量及びフォーカス量に応じた照明主点２０２に移動される。
【０１０３】
このように、実施の形態３の撮影システムでは、フォーカス量とズーム量とに対する主点（照明主点）の位置をルックアップテーブルに予め格納しておき、いわゆるリモコンズームレンズのリモコンであるレンズリモコン１１０２から出力される遠隔操作信号から、フォーカス量とズーム量に応じた照明主点２０２の位置を得る。次に、得られた照明主点２０２の位置情報に基づいて、光源ランプ１１１０を照明主点２０２に移動制御する構成となっているので、ズーム機能付きのテレビカメラ１１０１を使用した場合であっても、照明の死角をなくした撮影が可能となる。
【０１０４】
なお、実施の形態３の撮影システムでは、レンズリモコン１１０２から出力される遠隔操作信号を監視することによって、レンズ１０５のズーム量及びフォーカス量に対応したレンズ主点２０１に光源ランプ１１１０を移動させる構成となっているが、これに限定されることはなく、テレビカメラ１１０１のズーム量及びフォーカス量を直接検出し、この検出量に応じて照明主点２０２を移動させる構成でもよい。
【０１０５】
図１２はテレビカメラのズーム量及びフォーカス量を直接検出する機構を備えた照明光源の概略構成を説明するための図である。以下、図１２に基づいて、ズーム量及びフォーカス量の検出機構及び検出量に応じた照明主点２０２の移動を説明する。ただし、図１２に示す照明光源１０４は、フォーカスリング１２０１、第１のロータリーエンコーダ１２０２、ズームリング１２０３、第２のロータリーエンコーダ１２０４、第１のカウンタ１２０５、及び第２のカウンタ１２０６を除く他の構成は、図１１に示す実施の形態３の照明光源１０４と同様となる。
【０１０６】
図１２に示すように、他の実施の形態３の撮影システムでは、テレビカメラ１１０１がフォーカスリング１２０１及びズームリング１２０３の移動量を検出する第１及び第２のロータリーエンコーダ１２０２，１２０４を備える構成となっている。すなわち、フォーカスリング１２０１の回転量を検出することによりフォーカス量を検出し、ズームリング１２０３の回転量を検出することによりズーム量を検出する構成となっている。
【０１０７】
第１のロータリーエンコーダ１２０２の出力は、第１のカウンタ１２０５により計数されており、その計数値がルックアップテーブル検索部１１０５に入力される構成となっている。また、第２のロータリーエンコーダ１２０４の出力は、第２のカウンタ１２０６により計数されており、その計数値もルックアップテーブル検索部１１０５に入力される構成となっている。
【０１０８】
ここで、第１及び第２のカウンタ１２０５，１２０６からの計数値に基づいて、ルックアップテーブル検索部１１０５がズーム量及びフォーカス量に応じた照明主点２０２の位置情報を格納する図示しないテーブルを参照して、ズーム量及びフォーカス量の組み合わせに適合した照明主点２０２を検索する。検索によって得られた照明主点２０２の位置情報は、主点位置情報電圧変換部１１０６により、サーボモータ１１０７を駆動する駆動電力に変換され、サーボモータ１１０７が駆動されることとなるので、前述する実施の形態３の照明光源１０４と同様にして、光源ランプ１１１０が照明主点２０２に移動する。その結果、照明の死角をなくした撮影が可能となる。
【０１０９】
このとき、この図１２に示すように、フォーカスリング１２０１及びズームリング１２０３の回転量を機械的に読み出す構成とするほうが、図１０に示す実施の形態３の構成よりも、ズーム量やフォーカス量によるレンズ主点２０１の位置を正確に把握することができるので、正確な制御が可能となる。
【０１１０】
ただし、図１２に示す構成は、フォーカスリング１２０１及びズームリング１２０３の回転を取り出す機構及びロータリーエンコーダ１２０２，１２０４等が必要となるので、図９に示す実施の形態３の構成とした方が簡易な構成とすることができる。
【０１１１】
なお、実施の形態３の照明光源１０４は、実施の形態１及び実施の形態２の撮影システムに適用可能なことはいうまでもない。
【０１１２】
また、実施の形態１〜３の撮影システムでは、照明効果除去部１１０において、鏡面反射による影響を受けた画素を当該画素の周辺の画素値で補正する構成としたが、鏡面反射がないものとして、拡散反射係数のみを演算して反射係数としてもよいことはいうまでもない。
【０１１３】
また、実施の形態１〜３の撮影システムは、シールプリント装置のように、被写体１０１である人物の映像と、予め用意している背景映像とを合成するシステムにも適用可能である。この場合、背景映像に合致した照明条件の人物映像を生成し、背景映像と合成することで違和感の無いプリントシールを生成できる。この場合、実時間かつ順次照明条件を変更できる効果を利用して、シールへの印刷前に合成の具合を利用者が確認できるとともに、撮影時の照明の死角がないため、人物の画像全体に一様に、十分な光量で撮影できる。
【０１１４】
また、既存のシールプリント装置等の簡易な撮影システムでは、被写体１０１となる人物の後ろにフラッシュライトを反射しやすいシートを設置し、撮影画像の明るさで人物領域と背景領域とを判定し合成しているが、本願発明では距離情報を同時に取得しているので、特殊なシートを人物の後ろに配置しなくても、距離情報を基に、人物領域を判定できる。
【０１１５】
また、現在普及しつつある従来のインターネットあるいは電話回線など通信媒体を用いたテレビ電話やテレビ会議などでは、一般家庭内で見栄えが良く自然な照明を設営することは、スペース的に困難であり、現実的には部屋の天井に設置してある照明あるいはテレビ電話付近にある照明を用いることになる。しかしながら、本願発明を適用することにより、種々に照明条件を変更できるので、自然な順光の照明に設定できる。
【０１１６】
さらには、本願発明を適用することにより、撮影日時に応じて新規照明条件設定部で照明条件を変更することが可能となるので、屋内にいても季節感や、時間（例えば秋の夕方）などの効果を自動的に表現できる。また、奥行き情報を利用して、人物よりも、奥にある領域は別の画像に差し替えることで、自室を相手に見せないことで、プライバシーを守ることが出来る。
【０１１７】
以上、本発明者によってなされた発明を、前記発明の実施の形態に基づき具体的に説明したが、本発明は、前記発明の実施の形態に限定されるものではなく、その要旨を逸脱しない範囲において種々変更可能であることは勿論である。
【０１１８】
【発明の効果】
本願において開示される発明のうち代表的なものによって得られる効果を簡単に説明すれば、下記の通りである。
【０１１９】
（１）膨大な演算が必要となる鏡面反射係数の算出処理を必要としない構成となっているので、所定の照明条件の映像を実時間（リアルタイム）で順次生成することができる。
【０１２０】
（２）照射光が照射されて撮影された撮影ＲＧＢ画像から、照射光が照射されないときに撮影された撮影ＲＧＢ画像の減算処理を順次行うことにより得られた差分画像に基づいて、照明効果除去部が鏡面反射係数の算出処理を必要としない反射係数の算出を行うので、被写体が照明されている場合であっても、所定の照明条件の映像を実時間（リアルタイム）で順次生成することができる。
【０１２１】
（３）テレビカメラのズーム量及びフォーカス量に応じて照明主点に光源ランプを移動することができるので、テレビカメラにズームレンズを使用した場合であっても、照明の死角をなくした撮影ができる。
【図面の簡単な説明】
【図１】本発明の実施の形態１の撮影システムの概略構成を説明するための図である。
【図２】実施の形態１の計測部の光学的構成を説明するための図である。
【図３】実施の形態１の計測部の他の光学的構成を説明するための図である。
【図４】実施の形態１の法線推定部の動作を説明するための図である。
【図５】実施の形態１の照明効果除去部の動作を説明するための図である。
【図６】実施の形態１の照明効果除去部における分光特性の測定原理を説明するための図である。
【図７】実施の形態１の照明効果除去部における明るさの判定に用いる値の算出手順を説明するための図である。
【図８】実施の形態１の照明効果除去部の概略構成を説明するための図である。
【図９】本発明の実施の形態２の撮影システムの概略構成を説明するための図である。
【図１０】実施の形態２における奥行き抽出カメラによる撮影ＲＧＢ画像の収集タイミングと照明光の照射タイミングとを説明するための図である。
【図１１】本発明の実施の形態３の撮影システムにおける照明光源の概略構成を説明するための図である。
【図１２】本発明の実施の形態３の撮影システムにおける他の照明光源の概略構成を説明するための図である。
【符号の説明】
１０１…被写体　　　　　　　　　　　１０２…計測部
１０３…半透鏡（ハーフミラー）　　　１０４…照明光源
１０５…レンズ　　　　　　　　　　　１０６…奥行き抽出ＲＧＢカメラ
１０７…法線推定部　　　　　　　　　１０８…入射光量推定部
１０９…閾値設定部　　　　　　　　　１１０…照明効果除去部
１１１…新規照明条件設定部　　　　　１１２…新規照明条件付加部
１１３…分光測色計　　　　　　　　　１１４…領域Ａ
２０１…レンズ主点　　　　　　　　　２０２…照明主点
８０１…照射入射角判定部　　　　　　８０２…色判定部
８０３…明るさ判定部　　　　　　　　８０４…領域内挿部
８０５…拡散反射光除去部
９０１…光学シャッタ　　　　　　　　９０２…同期発生器
９０３…分周器　　　　　　　　　　　９０４…画像メモリ部
９０５…差分画像生成部
１１０１…テレビカメラ　　　　　　　１１０２…レンズリモコン
１１０３…第１のＡ／Ｄ変換器　　　　１１０４…第２のＡ／Ｄ変換器
１１０５…ルックアップテーブル検索部
１１０６…主点位置情報電圧変換部　　１１０７…サーボモータ
１１０８…回転軸　　　　　　　　　　１１０９…摺動部
１１１０…光源ランプ
１２０１…フォーカスリング　　　　　１２０２…第１のロータリーエンコーダ
１２０３…ズームリング　　　　　　　１２０４…第２のロータリーエンコーダ
１２０５…第１のカウンタ　　　　　　１２０６…第２のカウンタ[0001]
TECHNICAL FIELD OF THE INVENTION
The present invention relates to an image processing apparatus and an imaging system including the same, and more particularly, to a technique for generating a captured image that can be expected under another illumination condition from a captured image of a subject continuously captured under a predetermined illumination condition. It is.
[0002]
[Prior art]
In a conventional virtual studio system, a prepared video (background video) serving as a synthesis source and lighting conditions equivalent to the background video are reproduced in the studio, and performers (subjects) photographed under the lighting conditions are reproduced. By combining the video with the video of (2), a synthesized video without a sense of incongruity was obtained in real time.
[0003]
However, it is impossible to continuously reproduce the lighting conditions that change every moment in the studio, and the lighting conditions that can be reproduced in the studio are limited. For this reason, in a virtual studio system using lighting conditions that change every moment or background images that cannot be reproduced in the studio, typical lighting conditions are reproduced, and the performers perform in these typical lighting conditions. As a result, it is configured to obtain a composite image having a small uncomfortable feeling.
[0004]
On the other hand, with the progress of CG (computer graphic) technology in recent years, lighting conditions expressed in a background image have become complicated, and it is difficult to reproduce lighting conditions that match the background image in a studio. .
[0005]
As a technique for solving this problem, generation of a video under illumination conditions equivalent to a background video by performing three-dimensional CG processing on a video of a performer has been studied. According to the method using the three-dimensional CG technique, the shape and the reflection characteristics (texture) of the performer are created from the performer's video taken under a predetermined lighting condition, and the performance of the performer is adjusted to match the lighting condition of the background video. This is to process the performer's video.
[0006]
[Problems to be solved by the invention]
The present inventor has found the following problems as a result of studying the above-mentioned conventional technology.
Creating the shape of the subject and the reflection characteristics of the surface requires extremely complicated work, and the creation of realistic and realistic three-dimensional CG images requires the creator's high ability and enormous work time. It was.
[0007]
For this reason, the present invention can be applied to a case where the time required for creating a three-dimensional CG image can be ensured during a period from shooting to broadcasting (broadcasting), such as a movie or a recorded video. However, in the case of sequentially broadcasting video shot by a television camera as in live broadcasting, it is necessary to finish creating a three-dimensional CG video before the TV camera finishes shooting the next video. There was a problem that could not be applied to.
[0008]
However, the conventional method of measuring the reflection characteristics on the surface of a subject is, for example, to place the subject in a dark room and measure it with a measuring device whose installation position is known in an environment in which light source information is known at one point. there were. As a conventional method of estimating the reflection characteristic of the subject surface, for example, there has been an apparatus for measuring the reflection characteristic of the subject surface disclosed in Japanese Patent Application No. 2001-145192 (hereinafter referred to as Document 1).
[0009]
An object of the present invention is to provide a technique capable of sequentially generating images under predetermined lighting conditions in real time.
The above and other objects and novel features of the present invention will become apparent from the description of the present specification and the accompanying drawings.
[0010]
[Means for Solving the Problems]
The following is a brief description of an outline of typical inventions disclosed in the present application.
[0011]
(1) reflection coefficient calculating means for calculating a reflection coefficient of a subject imaged as an input image, new lighting condition setting means for setting lighting conditions different from the input image, the set lighting conditions and the reflection coefficient An illumination condition adding unit that generates an image of illumination conditions set by the new illumination condition setting unit from the input image based on the input image. The reflection coefficient calculating means comprises: a correction means for specifying a specular reflection area from the input image based on the depth information of the subject and correcting specular reflection pixels; and a reflection coefficient of the subject from the corrected input image. And a diffuse reflection coefficient calculating means for calculating
[0012]
(2) photographing means for photographing a photographed image including depth information of a subject, reflection coefficient computing means for computing a reflection coefficient of the subject imaged in the photographed image, and new lighting for setting lighting conditions different from the image A photographing system comprising: a condition setting unit; and an illumination condition adding unit configured to generate an image of the illumination condition set by the new illumination condition setting unit from the captured image based on the set illumination condition and the reflection coefficient. In the reflection coefficient calculating means, a correction means for specifying a specular reflection region from the captured image based on the depth information of the subject and correcting specular reflection pixels, and a reflection coefficient of the subject from the corrected captured image And a diffuse reflection coefficient calculating means for calculating.
[0013]
According to the above-described means, when an image including depth information of a subject is input as an input image, first, based on the depth information of the subject, the correction unit specifies a specular reflection region from the input image, and within this region, The specular reflection pixel which is a pixel is corrected. Next, the diffuse reflection coefficient calculation means calculates the reflection coefficient of the subject from the input image corrected by the correction means. Thereafter, based on the illumination condition set by the new illumination condition setting means and the reflection coefficient calculated by the diffuse reflection coefficient calculation means, the illumination condition adding means sets a new illumination from the input image in which the influence of the specular reflection has been corrected. Since it is configured to generate an image of the condition, that is, the configuration does not require the calculation processing of the specular reflection coefficient which requires an enormous amount of operation, the image of the predetermined illumination condition can be generated in real time (real time). This can be sequentially generated, and image processing within one frame period can be realized.
[0014]
Therefore, by using a photographed image photographed by the photographing means for photographing a photographed image including depth information of a subject as an input image, images under predetermined lighting conditions are sequentially generated in real time (real time). An imaging system capable of performing image processing on a computer can be configured.
[0015]
BEST MODE FOR CARRYING OUT THE INVENTION
Hereinafter, the present invention will be described in detail with reference to the drawings together with embodiments (examples) of the present invention.
In all the drawings for describing the embodiments of the present invention, components having the same functions are denoted by the same reference numerals, and their repeated description will be omitted.
[0016]
(Embodiment 1)
FIG. 1 is a diagram for explaining a schematic configuration of the imaging system according to the first embodiment of the present invention. However, in the following description, a device that sequentially captures and outputs two-dimensional luminance distribution images of the same red, blue, and green colors as in a known video camera, for example, every 1/30 second is referred to as an RGB camera. Write.
1, reference numeral 101 denotes a subject, 102 denotes a measurement unit, 103 denotes a semi-transparent mirror (half mirror), 104 denotes an illumination light source, 105 denotes a lens, 106 denotes a depth extraction RGB camera, 107 denotes a normal line estimation unit, and 108 denotes an incident light amount estimation. Unit, 109 a threshold setting unit, 110 a lighting effect removing unit, 111 a new lighting condition setting unit, 112 a new lighting condition adding unit, 113 a spectrocolorimeter, and 114 an area A.
[0017]
As is clear from FIG. 1, in the imaging system according to the first embodiment, an illumination light source 104 for illuminating a performer serving as a subject 101, and illumination light emitted from the illumination light source 104 is applied to the performer. A half mirror 103, a spectrocolorimeter 113 for measuring the spectral characteristics of each color of RGB of the illumination light emitted from the illumination light source 104, and a ray reflected by the body surface of the subject 101 and incident via the half mirror 103 A measurement unit 102 is formed by a lens 105 for forming an optical image of the performer and a depth extracting RGB camera 106 for capturing an optical image of the performer.
[0018]
However, the depth extraction RGB camera 106 of the first embodiment can sequentially obtain the depth distance from the lens principal point to the subject in real time, for example, every 1/30 second for each coordinate of the image obtained by the RGB camera. It is a well-known camera. Also, a group of depth information corresponding to at least one of the three RGB images will be referred to as a photographed depth image. Further, in the first embodiment, it is not a necessary condition that the depth distance is obtained every 1/30 second, and a higher speed or a lower speed can be applied other than every 1/60 second. Needless to say, For a camera having this function (the depth extraction RGB camera 106), several methods have already been put into practical use, and a television camera called a well-known axivision and a stereo camera in which two cameras are arranged at a predetermined interval are used. Although there is a method using a camera and the like, it goes without saying that the first embodiment can be applied to a camera to which another method is applied. However, the depth information of each pixel of the obtained RGB image needs to be obtained in a one-to-one correspondence. That is, the principal point of the lens for acquiring the RGB image and the principal point of the lens for obtaining the depth information need to be at the same position.
[0019]
Further, in the imaging system of the first embodiment, a normal estimating unit 107 that calculates a normal vector at each pixel position from a depth image (photographed depth image) captured by the depth extraction RGB camera 106, An incident light amount estimating unit 108 that calculates an incident light amount of illumination light on the performer's body surface based on the performer; and an image of each color of RGB captured by the depth extracting RGB camera 106 and RGB values measured by the spectrophotometer 113. An illumination effect removing unit 110 that calculates reflection coefficient information based on spectral characteristics of each color, incident light amount information and threshold information of each pixel, a threshold setting unit 109 that sets a threshold, and a normal line information and a reflection coefficient information. A new lighting condition adding unit 112 for generating an image based on a new lighting condition based on the new lighting condition, and a new lighting condition setting for setting a new lighting condition in the new lighting condition adding unit 112 Processor of the image information is formed from 111..
[0020]
Therefore, in the imaging system according to the first embodiment, based on an optical image (an imaging depth image having depth information) captured by the illumination light emitted from the illumination light source 104, the imaging depth is first determined by the normal estimation unit 107. A normal vector at each pixel of the image is calculated, and the obtained normal vector is input to the incident light amount estimation unit 108. The normal vector input to the incident light amount estimating unit 108 is used as reference data when calculating the incident light amount of the illumination light on the performer's body surface, and is converted into a shooting depth image and a normal vector by the incident light amount estimating unit 108. The amount of incident light of the illumination light is calculated based on the calculated amount of light, and the obtained amount of incident light is output to the illumination effect removing unit 110. Lighting is performed based on the incident light amount obtained by the incident light amount estimating unit 108, an RGB optical image (photographed RGB image) captured by the illumination light emitted from the illumination light source 104, and a threshold from the threshold setting unit 109. The effect removing unit 110 calculates a reflection coefficient for each pixel in the captured RGB image, and outputs the reflection coefficient information to the new illumination condition adding unit 112.
[0021]
At this time, in the imaging system of the first embodiment, the normal line information from the normal line estimating unit 107 and the new lighting condition information from the new lighting condition setting unit 111 as the new lighting condition are transmitted to the new lighting condition adding unit 112. It is configured to be input. Therefore, the new illumination condition adding unit 112 according to the first embodiment calculates saturation and luminance information for each pixel of the captured RGB image based on the reflection coefficient, normal line information, and the new illumination condition for each pixel. , An image (output image) suitable for the new lighting condition is generated.
[0022]
As described above, in the imaging system according to the first embodiment, the incident light amount and the reflection coefficient at each pixel are calculated based on the normal vector calculated based on the depth information of the subject 101, and the normal vector information is calculated. Is configured to generate a captured RGB image under a new lighting condition based on. That is, it is possible to realize image processing within one frame period in real time.
[0023]
Next, a detailed configuration of each unit of the imaging system according to the first embodiment will be described.
[0024]
(Measurement unit)
FIG. 2 is a diagram for explaining the optical configuration of the measurement unit according to the first embodiment. However, in the following description, the pinhole position when the RGB camera (depth extraction RGB camera 106) is assumed to be a pinhole camera is described as a lens principal point 201. The light source position when the illumination light source 104 is assumed to be a point light source is referred to as an illumination principal point 202.
[0025]
The measurement unit 102 according to the first embodiment has a configuration in which a depth extraction RGB camera 106 and an illumination light source 104 that virtually photograph the subject 101 on the same optical axis using the half mirror 103 are provided for the subject 101. Has become. At this time, the lens principal point 201 of the depth extraction RGB camera 106 and the illumination principal point 202 of the illumination light source 104 have the optical path length from the subject 101 using L1, L2, and L3 in FIG. As shown in the figure, they are installed to be the same.
[0026]
(Equation 1)
L1 + L2 = L1 + L3 (Formula 1)
As described above, in the imaging system according to the first embodiment, the measurement unit 102 is formed in an arrangement in which the lens principal point 201 and the illumination principal point 202 coincide with each other. Is shielded to prevent a shadow from being photographed on the surface of the subject 101. In other words, the shadow of the subject 101 itself occurs on the subject in the captured image due to the fact that the principal points 201 and 202 of the lens 105 and the illumination light source 104 do not match. This is to prevent a decrease in calculation accuracy of a hidden part in an image processing process described later.
[0027]
Here, one of the purposes of using the half mirror 103 is to irradiate the subject 101 with illumination light emitted from the illumination light source 104 via reflection by the half mirror 103. Another purpose of using the half mirror 103 is to use the transmission of the subject image by the half mirror 103 to capture an image with the depth extraction RGB camera 106. The depth extraction RGB camera 106 has a method that needs to project infrared light from the camera side to the subject 101 and take an image of the reflected light, and this point also uses the property that the half mirror 103 transmits. Therefore, the function of the depth extraction RGB camera 106 is not limited.
[0028]
In the first embodiment, the case where the half mirror 103 is used will be described. However, the same function can be realized by using a known optical prism instead of the half mirror 103. In addition, when a condition is obtained such that the lens principal point 201 and the illumination principal point 202 are the same, the anteroposterior relationship with respect to the subject such as the half mirror 103 and the camera lens 105 differs from the configuration shown in FIG. No problem. For example, as shown in FIG. 3, even when the half mirror 103 is disposed between the lens 105 and the depth extraction camera 106, the measurement unit 102 according to the first embodiment can be configured.
[0029]
In particular, as shown in FIG. 3, when the lens 105, the half mirror 103, and the depth extraction RGB camera 106 are arranged, compared with the configuration shown in FIG. 2, the illumination principal point 202 which is the principal point of the illumination light source 104, Since the amount of movement of the illumination light source 104 required to match the lens principal point 201 which is the principal point of the lens 105 for forming an optical image on the depth extraction RGB camera 106 can be reduced, the entire apparatus can be downsized. It is also possible to obtain the effect of being able to do so. Therefore, when configured as a dedicated system, as shown in FIG. 3, the measurement unit 102 is preferably arranged in the order of the lens 105, the half mirror 103, and the depth extraction RGB camera 106 from the subject 101 side.
[0030]
On the other hand, by incorporating and connecting a part other than the area A114 indicated by the dotted line in FIG. Can be used for multiple purposes.
[0031]
(Normal estimation unit)
The normal estimating unit 107 according to the first embodiment is a unit that estimates a normal vector of each part of the subject 101 in the captured depth image obtained from the measuring unit 102. However, in the following description, the normal vector is a vector perpendicular to the surface of the subject 101 and represents the direction of the surface. In particular, in the first embodiment, the normal vector at each pixel constituting the captured depth image is estimated. The normal vector of the pixel can be calculated by using the depth information between the pixel and peripheral pixels.
[0032]
For example, in the case where the photographed depth image obtained by the measuring unit 102 is composed of 640 pixels horizontally and 480 pixels vertically as shown in FIG. 4A, the lower left two-dimensional coordinates of the image are used as the reference position. (0, 0). Assume that given pixels are given coordinates at a distance of the number of pixels based on the lower left coordinates.
[0033]
When calculating the normal vector of the two-dimensional coordinates (100, 100), as shown in FIG. 4B, the depth information of the pixel (100, 100) and the pixel adjacent to the pixel are calculated. Information of four surrounding pixels (coordinates (100, 101), (99, 100), (101, 100), (100, 99)). First, since the respective two-dimensional coordinates and depth information can be converted into three-dimensional coordinates with the lens principal point 201 as the origin, the first embodiment converts them into three-dimensional coordinates in advance. Next, line segments formed by the three-dimensional coordinates corresponding to the pixel and the surrounding pixels are obtained. Further, by obtaining an outer product between adjacent line segments, a normal vector of a plane formed by the line segments is obtained. Four normal vectors are obtained by this operation. In the first embodiment, the normal vector of the pixel is obtained by averaging and normalizing the four normal vectors. However, the calculation method of the normal vector is not limited, but can be realized by a general calculation method used in the computer graphics field as described above. The normal vector information and the three-dimensional coordinates obtained by the normal estimating unit 107 are sent to the incident light amount estimating unit 108 at the next stage.
[0034]
(Incident light quantity estimation unit)
The incident light amount estimating unit 108 according to the first embodiment is means for estimating the amount of incident light on each part of the subject 101. However, the estimation of the amount of incident light requires the distance from the illumination light source 104 and the angle of incidence of the illumination light on the subject 101.
[0035]
Therefore, in the imaging system according to the first embodiment, the incident angle and the distance from the illumination light source 104 are determined by using the normal vector information and the three-dimensional coordinates of each part of the subject 101 obtained by the normal estimating unit 107 in the preceding stage. Configuration. At this time, the three-dimensional coordinates obtained by the radiation estimating unit 107 in the former stage have the origin at the lens principal point 201, that is, the illumination principal point 202, and when used together with normal vector information, the incident angle of illumination can be easily determined. Can be estimated. The distance from the illumination light source 104 to each part of the subject 101 can also be estimated by obtaining the distance between the three-dimensional coordinates of each part and the origin (0, 0, 0). For example, the coordinates of a predetermined part of the subject 101 are represented by (x₁, Y₁, Z₁) And its normal vector to (x₂, Y₂, Z₂), The incident angle θ is (x₁, Y₁, Z₁), (X₂, Y₂, Z₂) As a vector.
[0036]
First, the distance L1 from the illumination light source 104 is represented by the following equation (2).
[0037]
(Equation 1)
L1 = (X₁ ²+ Y₁ ²+ Z₁ ²)^1/2... (Equation 2)
Since the light intensity of the illumination at a distance L1 from the illumination light source 104 is inversely proportional to the square of the distance L1, the light intensity of the illumination light source 104 at a location L1 away from the light intensity p₁Becomes the following Expression 3.
[0038]
(Equation 2)
p₁= P / L1²... (Equation 3)
Further, the amount of incident light I on the subject is expressed by the following equation 4 based on Lambert's cosine law, which is classic in the field of computer graphics.
[0039]
(Equation 3)
I = p₁/ Cos θ (Equation 4)
The incident light amount estimating unit 108 calculates the incident light amount I for all the pixels of the image by the above-described calculation, and sends the obtained incident light amount I to the illumination effect removing unit 110 at the next stage.
[0040]
(Lighting effect removal section)
The illumination effect removal unit 110 according to the first embodiment uses the captured RGB image obtained by the measurement unit 102 and the incident light amount I obtained by the incident light amount estimation unit 108 in the preceding stage to remove the illumination effect of the subject 101. Is a means for determining the reflection coefficient of the surface.
[0041]
The components of the reflected light from the subject 101 are classified differently for each of various reflection models proposed so far. However, many of them are roughly classified into specular reflection components and diffuse reflection components based on a dichroic reflection model widely used empirically. The specular reflection component causes a phenomenon that is extremely dominant in some object regions as compared with the diffuse reflection component, and hinders the removal of the illumination effect. This area is called a highlight area.
[0042]
These reflection components (specular reflection component and diffuse reflection component) are expressed differently for each of various reflection models proposed. As for the diffuse reflection component, Lambert's cosine law depending on the incident angle of illumination light is used in many reflection models, and is used in the first embodiment in the same manner. On the other hand, the specular reflection component is not easily represented only by the relationship with the incident angle of the illumination light, and even if it is applied to any reflection model, a high-quality, high-speed and universally applicable removal method regardless of the material of the subject 101 is required. Not yet. Therefore, the illumination effect removal unit 110 according to the first embodiment has a configuration in which a mechanism for estimating a highlight region and interpolating the same with pixel values around the highlight region is incorporated as a method not considering the reflection model. However, it goes without saying that if a higher quality and more accurate specular reflection component removal method is proposed in the future, it can be incorporated in the first embodiment.
[0043]
The feature of the portion that is specularly reflected is relatively bright in the image, and the light source color is dominant over the color of the object. For example, a part of the surface of a red object color under white illumination that causes specular reflection appears to glow white. Such an area is called a highlight area and interferes with the estimation of the reflection coefficient of the subject 101. Further, depending on the material of the subject 101, if the incident angle of the illumination light on the surface of the subject 101 and the exit angle of the reflected light from the surface of the subject 101 to the camera side are close to each other, mirror reflection easily occurs. It is known to be. That is, under the optical conditions of the first embodiment, as shown in FIG. 5, the more the surface of the subject 101 is perpendicular to the straight line L4 extending from the lens principal point 201 to each part of the subject 101, the more the specular reflection occurs. Tends to occur.
[0044]
Therefore, the illumination effect removing unit 110 according to the first embodiment is configured to determine the specular reflection portion, that is, the highlight region, by using the following three determination criteria in combination from the above-described properties.
(Value used for determination on straight line L4)
First, an angle between a straight line L4 extending from the lens principal point 201 to each part of the subject 101 and a normal vector of the surface of the subject 101 is defined as θa. This θa can be calculated from the normal vector information of the subject 101 and the three-dimensional coordinates.
(Value used for color judgment)
Next, the comparison between the color of the subject part and the light source color is performed as follows. The red, blue and green spectral characteristics of the light primary colors are measured in advance, and as shown in FIG. 6, the points S are normalized on three-dimensional coordinates with the intensities of red, blue and green as axes.₁Projected as On the other hand, the red, blue, and green components of the relevant portion of the subject 101 are normalized on three-dimensional coordinates around the red, blue, and green intensities from the captured RGB image captured by the depth extraction RGB camera 106, Point S₂Projected on three-dimensional coordinates. The light source color point S at this time₁And the color point S of the subject₂And the distance on the three-dimensional coordinate with₁And Thus the distance n₁By setting the distance n₁Is smaller, it can be determined that the colors are more similar. However, in the first embodiment, the spectral characteristics of the light source color are measured in advance, and the measured values are used. However, the comparison standard is not limited to this. It is also possible to incorporate the spectrocolorimeter 113 in an appropriate part and automatically measure and use the spectral characteristics without direct effort.
(Value used to judge brightness)
As shown in FIG. 7, the peak value I of the luminance value of the captured RGB image captured by the depth extraction RGB camera 106_maxAnd a histogram for the luminance values of all the pixels, and the brightest luminance value I_maxBrightness value I that becomes the peak on the first histogram_peakIs calculated. Then I_peak− (I_max-I_peak) Is I_sAnd Thus, the luminance value I_sIs set, the highlight portion due to the specular reflection becomes the luminance value I_peakCan be approximated to a distribution that can be expressed by a Gaussian function with the center value and the maximum value._peakI around_peak-I_maxThe distribution is symmetrical with the space turned back. However, the luminance value I_sIs the luminance value I_maxCorresponds to the folded part.
(Use of three criteria)
The first to third threshold values Th are manually determined in advance for the three determination criteria.₁, Th₂, Th₃Is set by the threshold setting unit 109 in advance.
[0045]
First, the angle θa of each pixel of the captured RGB image captured by the depth extraction RGB camera 106 and the first threshold Th₁And the angle θa is equal to the first threshold Th.₁If it is smaller, it is considered that the reflection by specular reflection is the dominant part, and the information R of the region satisfying this condition is considered.₁Save.
[0046]
Next, R₁, The brightness value I of all pixels in the region_outAnd luminance value I_sAnd the difference value I_diffIs calculated by the following equation (5).
[0047]
(Equation 4)
I_diff= I_out-I_s... (Equation 5)
Difference value I_diffIs the second threshold Th₂If it is larger than this, it is considered that the reflection by the specular reflection is dominant, and the information R of the region satisfying this condition is considered.₂Save.
[0048]
Finally R₂Distance n for all pixels in the region₁Ask for. Where the distance n₁Is the third threshold Th₃The smaller portion is regarded as a portion where the reflection by specular reflection is dominant, and information R of a region satisfying this condition is obtained.₃Save.
[0049]
Finally R₃Is regarded as a region where specular reflection is dominant. Where R₃Is subjected to a segmentation process, which is a general image processing technique,₃Is divided into closed sections having the minimum area in the image, and the pixel values of the area are replaced with the surrounding pixel values for each area.
[0050]
With respect to the captured RGB image in which the reflection coefficient caused by specular reflection has been processed as described above, the imaging system of the first embodiment calculates the reflection coefficient caused by diffuse reflection in the following procedure.
Diffuse reflection light I_dMeans that the incident light is I and the diffuse reflection coefficient is K_dAre represented by the following Expressions 6 and 7.
[0051]
(Equation 5)
I_d= K_d× I (Equation 6)
K_d= I_d/ I (Equation 7)
Diffuse reflection light I_dThe red, blue, and green components of the observed captured RGB image corresponding to_r, C_b, C_g), And the red, blue, and green components of the incident light I are represented by (I_r, I_b, I_g), The diffuse reflection coefficient K_dRed component, blue component, green component (K_r, K_b, K_g) Is calculated by the following Expressions 8 to 10.
[0052]
(Equation 6)
K_r= C_r/ I_r... (Equation 8)
K_b= C_b/ I_b... (Equation 9)
K_g= C_g/ I_g... (Equation 10)
The obtained diffuse reflection coefficient (K_r, K_b, K_g) Is sent to the new lighting condition adding unit 112 at the next stage as information from which the lighting effect has been removed.
[0053]
FIG. 8 is a diagram for explaining a schematic configuration of the illumination effect removing unit according to the first embodiment. Here, reference numeral 801 denotes an irradiation incident angle determination unit (first region specifying unit), 802 denotes a color determination unit (spectral characteristic calculation unit, second region specification unit), and 803 denotes a brightness determination unit (brightness value calculation unit; 3, an area interpolating unit (correction unit) 804, and a diffuse reflection light removing unit (diffuse reflection coefficient calculation unit) 805.
[0054]
8, the illumination incident angle determination unit 801 includes: an incident light amount information from the incident light amount estimation unit 108; a normal vector and three-dimensional position information from the normal line estimation unit 107; a captured RGB image from the depth extraction RGB camera 106; And the first threshold Th from the threshold setting unit 109₁Is a means for performing the determination processing described in “Value used for determination on straight line L4” described above. Region information R obtained by this illumination incident angle determination unit 801₁Is input to the color determination unit 802.
[0055]
The color determination unit 802 receives the incident light amount information from the incident light amount estimation unit 108, the normal vector and three-dimensional position information from the normal line estimation unit 107, the captured RGB image from the depth extraction RGB camera 106, and the threshold setting unit 109. Of the second threshold Th₂Is a means for performing the determination processing described in the above-mentioned “value used for color determination”. However, as described above, the color determination unit 802 uses the region information R input from the illumination incident angle determination unit 801.₁, The determination process is performed only on the region obtained by the illumination incident angle determination unit 801. The region information R obtained by the color determination unit 802₂Is input to the next-stage brightness determination unit 803.
[0056]
The brightness determination unit 803 includes the incident light amount information from the incident light amount estimation unit 108, the normal vector and three-dimensional position information from the normal line estimation unit 107, the captured RGB image from the depth extraction RGB camera 106, and the threshold setting unit 109. From the third threshold Th₃Is a means for performing the determination processing described in the above “value used for determining brightness”. However, as described above, the brightness determination unit 803 uses the region information R input from the color determination unit 802.₂, The determination process is performed only on the region obtained by the color determination unit 802. The region information R obtained by the brightness determination unit 803₃Is input to the area interpolation unit 804 at the next stage.
[0057]
The region interpolation unit 804 outputs the region information R obtained by the brightness determination unit 803.₃Is used to interpolate the area of the captured RGB image with the pixel values around the area based on the above. This interpolation processing is the processing described in the above-mentioned “Use of Three Determination Criteria”. The captured RGB image after interpolation processing obtained by the area interpolation unit 804 is input to the diffuse reflection light removal unit 805 at the next stage.
[0058]
The diffuse reflection light removing unit 805 is a unit that calculates the diffuse reflection coefficient based on the above-described equations 8, 9, and 10, and obtains the obtained diffuse reflection coefficient (K_r, K_b, K_g) Is input to the new lighting condition adding unit 112 as an output of the lighting effect removing unit 110.
[0059]
(Threshold setting part)
First to third threshold values Th used in the lighting effect removing unit 110₁, Th₂, Th₃Is a means for setting. For example, three knob-shaped volumes are arranged as threshold setting buttons, and these thresholds can be manually set by operating these knobs. The first to third threshold values Th₁, Th₂, Th₃Can be set at the discretion of the operator while viewing the output image before or during use. However, the present invention does not limit the user interface. Also, the luminance value I of the illumination is obtained from the spectral colorimeter in the measuring section.₁And obtain the luminance value I₁The second threshold value Th is proportional to₂It is needless to say that these thresholds can be automatically set, for example, by automatically setting the threshold.
[0060]
(New lighting condition setting section)
The new lighting condition setting unit 111 is a means for setting a required lighting condition. In the new lighting condition setting unit 111 according to the first embodiment, the setting items can set the number of lights, arrangement in a three-dimensional space, light distribution characteristics, color temperature, and the like.
[0061]
These setting items are necessary for producing an image by well-known computer graphics, and a function equivalent to a part of the function of general computer graphics software is added to the new lighting condition setting unit 111. It can also be realized by incorporating. In particular, when it is not necessary to sequentially change the lighting conditions in real time, a mechanism that continues to send predetermined lighting conditions may be used. In addition, in a case where even a person who is not used to the operability of setting the lighting conditions can intuitively operate, for example, only the direction of the lighting is designated by a well-known joystick, and information of the designated direction is obtained. It is also possible to adopt a configuration in which the information is sequentially sent to the new lighting condition adding unit 112 at the next stage. Further, the setting items can be limited.
[0062]
Further, it can be used in combination with a virtual studio system which is a video production method of a television program. However, the virtual studio system generates a computer graphics image serving as a background under the same conditions as the camera that is shooting the subject 101, and combines the generated image with the subject image shot by the camera, This is a system that generates a video effect as if the subject 101 exists on a video generated by computer graphics. Since the lighting conditions of the background image generated by the computer graphics are set in the computer, the setting conditions can be given to the new lighting condition setting unit 111 of the present method.
It is also possible to estimate outdoor lighting conditions according to the actual date and time or the date and time designated by the user in accordance with physical laws, and to send it to the new lighting condition adding unit 112 at the next stage.
[0063]
In the first embodiment, the user interface is not limited, but it is needless to say that the lighting conditions need to be sequentially transmitted in real time to the new lighting condition adding unit 112 at the next stage.
[0064]
(New lighting condition addition section)
The new lighting condition adding unit 112 includes normal vector information from the normal estimating unit 107, the reflection coefficient (specular reflection coefficient and diffuse reflection coefficient) from the lighting effect removing unit 110, and the new lighting from the new lighting condition setting unit 111. This is a means for inputting condition information, and for generating and outputting a subject image expected under new lighting conditions based on the input information. However, since the generation of the subject image under the new condition can be realized by a general method used in known computer graphics, a detailed description is omitted.
[0065]
Here, when the lighting conditions obtained from the virtual studio system are given to the new lighting condition setting unit 111, a subject image under the same lighting conditions as the computer graphics video generated by the virtual studio system can be generated. At this time, by using the depth information, means for determining the order of the subject 101 and the set of computer graphics used for the background of the virtual studio is provided, so that the subject 101 located before the set of computer graphics is provided. Since only the area can be combined with the background video by the set of computer graphics, even when the subject 101 is in a place different from the location where the background video was captured, the subject 101 can capture the background video. An effect (compositing effect) as if shooting at a position is obtained.
[0066]
The camera and the synchronization generator of the apparatus described above have been described on the basis of 1/30 second or 1/60 second. However, when the equipment of the NTSC signal is used, the synchronization signal is 59. Since it is 94 Hz, it goes without saying that it is 1 / 59.94 second or 2 / 59.94 second. In addition, there are video cameras of various frequencies such as a high-speed camera and a long-time exposure camera, but the present invention does not limit the frequency, and the processing is sequentially repeated for each minimum unit constituting an image. , The present invention can be applied to a photographing system using a photographing camera of another frequency.
[0067]
In the above-described imaging system according to the first embodiment, it is needless to say that the subject 101 needs to be captured in an environment where there is no illumination other than the illumination in the measurement unit 102 in the dark room. Nor.
[0068]
As described above, in the imaging system according to the first embodiment, the normal estimating unit based on the captured depth image captured using the depth extraction RGB camera 106 under the illumination light emitted from the illumination light source 104. 107 calculates a normal vector at each pixel of the photographed depth image, and an incident light amount estimating unit 108 calculates an incident light amount of illumination light on the body surface of the subject 101 as a performer based on the input normal vector. The removal unit 110 detects the incident light amount obtained by the incident light amount estimation unit 108, the captured RGB image captured by the illumination light emitted from the illumination light source 104, and the first to third threshold values Th from the threshold value setting unit 109.₁, Th₂, Th₃, A reflection coefficient for each pixel in the captured RGB image is calculated, and the new lighting condition adding unit 112 generates and outputs a subject image under new lighting conditions based on the reflection coefficient.
[0069]
At this time, in the imaging system according to the first embodiment, the illumination effect removing unit 110 that calculates the reflection characteristic of the subject 101, that is, the reflection coefficient, includes the irradiation incident angle determination unit 801, the color determination unit 802, the brightness determination unit 803, An insertion section 804 and a diffuse reflection light removing section 805 are provided.
[0070]
Here, as an operation related to specular reflection, first, the illumination incident angle determination unit 801 sets the incident light amount information, the normal vector, the three-dimensional position information, the captured RGB image, and the first threshold Th.₁The region is specified based on. Next, for the area specified by the illumination incident angle determination unit 801, the color determination unit 802 performs the incident light amount information, the normal vector, the three-dimensional position information, the captured RGB image, and the second threshold Th.₂The region is specified based on. Next, for the area specified by the color determination unit 802, the brightness determination unit 803 determines the incident light amount information, the normal vector, the three-dimensional position information, the captured RGB image, and the third threshold value Th.₃The region is specified based on. Next, with respect to the area specified by the brightness determination unit 803, the area interpolation unit 804 interpolates the area of the captured RGB image with the pixel values around the area. Thereafter, the diffuse reflection light removing unit 805 calculates a reflection coefficient related to the diffuse reflection light with respect to the captured RGB image in which the highlight region is interpolated (corrected) by the region interpolation unit 804.
[0071]
As described above, the configuration in which the illumination effect removing unit 110 according to the first embodiment generates a captured RGB image in which the influence of specular reflection is removed from the captured RGB image obtained by the depth extraction RGB camera 106, that is, an enormous amount of calculation is required. Since the configuration does not require the calculation processing of the specular reflection coefficient, it is possible to sequentially generate an image under a predetermined illumination condition in real time (real time). In particular, image processing within one frame period is required. To live broadcasts.
[0072]
(Embodiment 2)
FIG. 9 is a diagram for explaining a schematic configuration of the imaging system according to the second embodiment of the present invention. Here, reference numeral 901 denotes an optical shutter, 902 denotes a synchronization generator, 903 denotes a frequency divider, 904 denotes an image memory unit, and 905 denotes a difference image generation unit. Further, in the following description, operations and effects related to the optical shutter 901, the synchronization generator 902, the frequency divider 903, the image memory unit 904, and the difference image generation unit 905, which are different in configuration from the imaging system of the first embodiment. Will be described in detail.
[0073]
As is apparent from FIG. 9, in the imaging system according to the first embodiment, an illumination light source 104 for illuminating a performer who is an object 101 and illumination light emitted from the illumination light source 104 is applied to the object 101 side. A half mirror 103, an optical shutter 901 disposed between the illumination light source 104 and the half mirror 103 to control transmission and blocking of illumination light emitted from the illumination light source 104, and illumination emitted from the illumination light source 104. A spectrocolorimeter 113 for measuring the spectral characteristics of each of the RGB colors of light, and a lens 105 for forming a light beam (optical image of the subject 101) reflected by the body surface of the subject 101 and incident via the half mirror 103. And a depth extracting RGB camera 106 for photographing an optical image of the subject 101, and an external synchronization input terminal of the depth extracting RGB camera 106. A sync generator 902 for generating a synchronization signal, the measurement unit 102 from the frequency divider 903 for dividing the synchronization signal from sync generator 902 is formed.
[0074]
As described above, the measuring unit 102 according to the second embodiment is configured to control the shooting timing of the depth extraction RGB camera 106 in synchronization with the synchronization signal generated by the synchronization generator 902. Further, the operation of the optical shutter 901 is controlled by a signal (frequency-divided signal) obtained by dividing the synchronization signal by the frequency divider. Therefore, it is possible to synchronize the shooting of the subject 101 with the depth extraction RGB camera 106 and the illumination of the subject 101. The details of the measuring unit 102 will be described later.
[0075]
Further, in the imaging system according to the first embodiment, a normal estimating unit 107 that calculates a normal vector at each pixel position from a depth image (captured depth image) captured by the depth extraction camera 106, and An incident light amount estimating unit 108 for calculating an incident light amount of illumination light on the performer's body surface, an image memory unit 904 for sequentially storing captured RGB images captured by the depth extracting RGB camera 106, and reading from the image memory unit 904. A difference image generation unit 905 that generates a difference image for each color of RGB from an image (delayed image) delayed by one frame (one shooting cycle) and the captured RGB image captured by the depth extraction RGB camera 106; The image for each color of RGB generated by the generation unit, the spectral characteristics of each color of RGB measured by the spectral colorimeter 113, and the incidence for each pixel. A lighting effect removing unit 110 that calculates reflection coefficient information based on the amount information and the threshold information, a threshold setting unit 109 that sets a threshold, and a new image generating unit that generates an image under new lighting conditions based on the normal information and the reflection coefficient information. An image information processing unit is formed by the lighting condition adding unit 112 and the new lighting condition setting unit 111 for setting a new lighting condition in the new lighting condition adding unit 112.
[0076]
As described above, the processing unit according to the second embodiment temporarily stores the captured RGB image captured by the depth extraction RGB camera 106 in the image memory unit 904, and stores the stored captured RGB image and the captured RGB image captured in real time. Are sequentially generated by the difference image generation unit 905, and the difference image is used by the illumination effect removal unit 110.
[0077]
In FIG. 9, a synchronization generator 902 is a well-known synchronization signal generator that generates a pulse of a synchronization signal in units of 1/30 second or 1/60 second, which is the imaging frequency of the camera image of the depth extraction RGB camera 106. Means. However, the synchronization signal has a waveform suitable for an external synchronization input terminal of the video camera constituting the depth extraction RGB camera 106, and this synchronization signal is input to the video camera. This video camera adjusts so as to capture an image in synchronization with the input synchronization signal. However, a general commercial TV camera has a terminal for inputting an external synchronization signal called an external synchronization input terminal or a genlock terminal, and an external synchronization signal is output by a built-in PLL circuit (PhaseLocked @ Loop: phase synchronization circuit). A circuit for oscillating to generate a synchronization signal for a camera internal circuit in synchronism with the circuit is incorporated. Therefore, the synchronization processing in the second embodiment can be easily realized.
[0078]
The frequency divider 903 is a means for dividing the synchronization signal of 1/30 or 1/60 second by 2, and has a period of 1/15 second or 1/30 second, which is half of the synchronization signal. Generate a signal (divided signal). In particular, in the second embodiment, the duty ratio of the frequency-divided signal is set to a digital signal of 50%, and the waveform is shaped so that the determination at each unit to which the frequency-divided signal is input is easy.
[0079]
An optical shutter 901 arranged perpendicularly to the illumination projection direction is a well-known optical shutter that can be opened and closed by an electric signal, and blocks light in a closed state and transmits light in an open state. Has become. In addition, the optical shutter 901 according to the second embodiment is in a closed state when a frequency-divided signal input as a drive signal or a control signal is High (1), and is opened in a Low (0) period. The optical shutter of the second embodiment can be realized by using a mechanical shutter or a liquid crystal shutter as the optical shutter 901.
[0080]
The image memory unit 904 is a unit for sequentially storing captured RGB images captured by the depth extraction RGB camera 106. In the second embodiment, at the input timing of the next captured RGB image, It is configured to output the captured RGB images stored at the input timing. In other words, when the captured RGB image has been input and the next captured RGB image is sent from the depth extraction RGB camera 106, the image memory unit 904 stores the content stored in the image memory, that is, the captured RGB image at that timing. This is sent to the difference image generation unit 905 at the next stage. That is, the captured RGB image that has passed through the image memory unit 904 is delayed by one image from the RGB image output from the depth extraction RGB camera 106.
[0081]
The difference image generation unit 905 outputs the current captured RGB image G directly input from the depth extraction RGB camera 106._pAnd an image G delayed by one image input from the image memory unit 904_dImage (difference image) G from the difference_diffIs a means for generating. In particular, in the difference image generation unit 905 of the second embodiment, 1/30 second or 1/60 of the following equation 11 and equation 12 are obtained based on the signal obtained by dividing the synchronization generator by 2 using the frequency divider. By switching every second, the difference image G_diffIs obtained.
[0082]
(Equation 7)
G_diff= G_p-G_d··· (Equation 11)
G_diff= G_d-G_p... (Equation 12)
In particular, in the difference image generation unit 905 of the second embodiment, the image when the illumination light source 104 of the measurement unit 102 is turned on becomes the left variable on the right side, and the image when it is turned off becomes the right variable on the right side. Control. That is, by subtracting the non-lighted image from the lighted image, only the reflected light from the illumination light source 104 in the measuring unit 102 is converted into the difference image G_diffIt is configured to take out as.
[0083]
In order to realize this, when the frequency-divided signal from the frequency divider 903 is 0, Expression 11 is selected, and when the frequency-divided signal is 1, Expression 12 is selected to obtain a difference image. Output to the effect removing unit 110.
[0084]
By performing such a contrivance, even when the subject 101 is illuminated by a light source other than the measuring unit 102, the same effect as the imaging system of the first embodiment is obtained.
[0085]
FIG. 10 is a diagram for explaining the collection timing of captured RGB images and the irradiation timing of illumination light by the depth extraction camera according to the second embodiment. However, the image names shown in FIG. 10 indicate captured RGB images sequentially captured by the depth extraction RGB camera 106, and in this specification, natural numbers of 1, 2, 3, 4, 5, 6,. The consecutive numbers indicated by are designated as image names.
[0086]
First, the photographing operation of the photographed RGB image in the measurement unit 102 will be described based on FIG.
[0087]
As described above, the measuring unit 102 according to the second embodiment has a configuration in which a synchronization signal is input to the depth extraction RGB camera 106, whereas the synchronization signal is divided into two by the optical shutter 901. The divided frequency signal is inputted. Therefore, in the measurement unit 102 according to the second embodiment, the depth extraction RGB camera 106 shoots in synchronization with the input of the synchronization signal in synchronization with the synchronization signal.
[0088]
On the other hand, the optical shutter 901 is configured to switch between the closed state and the open state in order according to the High (1) period and the Low (0) period of a frequency-divided signal having a duty ratio of 50% obtained by dividing the synchronization signal by two. Has become. That is, in the optical shutter 901 of the second embodiment, transmission and shielding of the illumination light emitted from the illumination light source 104 are controlled to be switched in synchronization with the synchronization signal. The irradiation of the illumination light is also synchronized with the synchronization signal.
[0089]
As a result, as shown in FIG. 10, shooting of captured RGB images continuous with images 1, 2, 3, 4, 5, 6,... Was performed by a depth extraction RGB camera operating in synchronization with a synchronization signal. In this case, the frequency-divided signal output from the frequency divider 903 becomes 1 while, for example, images 1, 3, 5,... Are taken, and becomes 0 while images 2, 4, 6,. . .., I.e., while the frequency-divided signal is 1, the optical shutter 901 is closed to block the illumination light from the illumination light source 104, and the illumination light illuminates the subject 101. Will not be performed. On the other hand, during shooting of images 2, 4, 6,..., That is, during the period in which the frequency-divided signal is 0, the optical shutter 901 is opened and the illumination light from the illumination light source 104 is transmitted. 101 will be illuminated.
[0090]
As described above, in the measuring unit 102 according to the second embodiment, the captured RGB image obtained by illuminating the subject 101 with the illumination light from the illumination light source 104 and the illumination light source 104 for each imaging cycle (frame period) of the depth extraction RGB camera 106 And a captured RGB image that does not illuminate the subject 101 with illumination light from the camera.
[0091]
Next, an operation of generating an output image in the processing unit will be described with reference to FIG. However, the operation is the same as the operation of generating the output image in the processing unit of the first embodiment except that the image data input to the illumination effect removing unit 110 is different. Accordingly, in the following description, the difference image G by the image memory unit 904 and the difference image generation unit 905 will be described._diffOnly the generating operation will be described in detail.
[0092]
The captured RGB images captured by the depth extraction RGB camera 106 are sequentially stored in the image memory unit 904. At this time, the same captured RGB image is also input to the difference image generation unit 905. Accordingly, when the captured RGB image of the image name 1 shown in FIG. 10 is captured, the image memory unit 904 and the difference image generation unit 905 store the captured RGB image of the image name 1 in the current captured RGB image G._pWill be entered as At this time, since the previous image is not stored in the image memory unit 904, the difference image G from the difference image generation unit 905 is output._diffIs not output.
[0093]
When the captured RGB image of the image name 2 is captured in the next capturing cycle, the captured RGB image of the image name 2 is stored in the image memory unit 904 and the difference image generating unit 905._pWill be entered as At this time, first, the difference image generation unit 905 outputs the image G obtained by delaying the captured RGB image of the image name 1 by one image from the image memory unit 904, which is an image captured in the immediately preceding imaging cycle._dRead as Next, the difference image generation unit 905 outputs the captured RGB image of the image name 1 from the image memory unit 904 (the image G delayed by one image)._d) And the captured RGB image of the image name 2 from the depth extraction RGB camera 106 (the current captured RGB image G_p) And the difference image G_diffGenerate The calculation at this time is, as described above, the photographed RGB image of the image name 2 (the current photographed RGB image G), which is an image photographed by irradiating the irradiation light from the illumination light source 104 according to Expression 11._p), The photographed RGB image of the image name 1 photographed when the irradiation light from the illumination light source 104 is not irradiated (the image G delayed by one image)_d), The difference image G_diffGenerate
[0094]
When the captured RGB image of the image name 3 is captured in the next capturing cycle, the captured RGB image of the image name 3 is stored in the image memory unit 904 and the difference image generating unit 905._pIs entered as At this time, the difference image generation unit 905 outputs the image G obtained by delaying the captured RGB image of the image name 2 by one image from the image memory unit 904, which is an image captured in the immediately preceding imaging cycle._dRead as Next, the difference image generation unit 905 outputs the captured RGB image of the image name 2 from the image memory unit 904 (the image G delayed by one image)._d) And a captured RGB image of the image name 3 from the depth extraction RGB camera 106 (current captured RGB image G_p) And the difference image G_diffGenerate The calculation at this time is, as described above, a captured RGB image of image name 2 (an image G delayed by one image), which is an image captured by irradiation with the illumination light from the illumination light source 104, according to Expression 12._d), The photographed RGB image of the image name 3 photographed when the irradiation light from the illumination light source 104 is not irradiated (the current photographed RGB image G_p), The difference image G_diffGenerate
[0095]
The currently captured RGB image G stored in the image memory unit 904 described above_pAnd a photograph taken when the irradiation light from the illumination light source 104 is not irradiated, from the photographed RGB image photographed by irradiation with the irradiation light according to the expression 11 or 12 by the differential image generation means 905. Difference image G obtained by sequentially performing subtraction processing of RGB images_diffIs output to the illumination effect removing unit 110, and the same processing as in the first embodiment is performed. Therefore, the same effect as in the imaging system of the first embodiment can be obtained.
[0096]
In the second embodiment, the currently captured RGB image G stored in the image memory unit 904 described above is stored._pAnd a photograph taken when the irradiation light from the illumination light source 104 is not irradiated, from the photographed RGB image photographed by irradiation with the irradiation light according to the expression 11 or 12 by the differential image generation means 905. By sequentially performing the subtraction processing of the RGB images, only the light reflected by the illumination light source 104 of the measurement unit 102 is subtracted from the difference image G._diffCan be taken out. As a result, even when the subject 101 is illuminated by the illumination light source 104 other than the measurement unit 102, the same effect as that of the imaging system according to the first embodiment in which the subject 101 is placed in a dark room and imaged is obtained. It has a special effect.
[0097]
(Embodiment 3)
FIG. 11 is a diagram for explaining a schematic configuration of an illumination light source in the imaging system according to the third embodiment of the present invention. However, the configuration of the imaging system according to the third embodiment is the same as that of the imaging system according to the first or second embodiment except for the configurations of the depth extracting RGB camera 106 and the illumination light source 104 that configure the measuring unit 103. Configuration. Therefore, in the following description, the configuration of the measurement unit 102 will be described in detail.
[0098]
11, 1101 is a television camera, 1102 is a lens remote controller, 1103 is a first A / D converter, 1104 is a second A / D converter, 1105 is a lookup table search unit, 1106 is principal point position information A voltage conversion unit, 1107 is a servomotor, 1108 is a rotating shaft, 1109 is a sliding unit, and 1110 is a light source lamp.
[0099]
As shown in FIG. 11, in the imaging system according to the third embodiment, a television camera 1101 including a depth-extracting RGB camera 106 and a lens 105 according to a remote operation signal (for example, a voltage signal or the like) from a lens remote controller 1102 is illustrated. The depth extraction camera 106 and the lens 105 are configured by using the television camera 1101 in which the zoom mechanism and the focus mechanism that do not operate and the shooting field of view (angle of view) of the television camera 1101 can be arbitrarily set. Further, a remote operation signal for controlling the zoom mechanism and the focus mechanism is input to the first and second A / D converters 1103 and 1104.
[0100]
In the first and second A / D converters 1103 and 1104, remote operation signals (the zoom amount and the focus amount of the lens) from the lens remote controller 1102 are converted into digital signals and output to the lookup table search unit 1105. . The lookup table search unit 1105, to which the zoom amount and the focus amount converted into the digital signal are input, refers to a table (not shown) that stores the position information of the illumination principal point 202 corresponding to the zoom amount and the focus amount, An illumination principal point 202 suitable for the combination of the zoom amount and the focus amount is searched. The position information of the illumination principal point 202 obtained by the search is converted into driving power for driving the servo motor 1107 by the principal point position information voltage converter 1106, and the servo motor 1107 is driven.
[0101]
Here, in the third embodiment, a thread is formed on the rotating shaft 1108 of the servomotor 1107, and the thread of the rotating shaft 1108 and the thread of the sliding portion 1109 are fitted. As a result, the sliding portion 1109 to which the light source lamp 1110 is attached moves in the direction indicated by the arrow in the drawing (the direction in which the illumination principal point 202 of the illumination light source 104 increases or decreases) in accordance with the amount of rotation of the rotation shaft 1108, The position of the light source lamp 1110, that is, the position of the illumination principal point 202 is moved.
[0102]
Therefore, by driving the servo motor 1107, the sliding portion 1109 moves along the rotation axis 1108 together with the light source lamp 1110, and the light source lamp 1110 is moved to the illumination principal point 202 according to the zoom amount and the focus amount.
[0103]
As described above, in the imaging system according to the third embodiment, the positions of the principal points (illumination principal points) with respect to the focus amount and the zoom amount are stored in the look-up table in advance, and the lens remote controller which is a remote controller for a so-called remote control zoom lens is used. From the remote control signal output from 1102, the position of the illumination principal point 202 according to the focus amount and the zoom amount is obtained. Next, since the movement of the light source lamp 1110 is controlled to the lighting principal point 202 based on the obtained position information of the lighting principal point 202, this is the case where the television camera 1101 with a zoom function is used. In addition, it is possible to take a picture without blind spots of the lighting.
[0104]
In the imaging system according to the third embodiment, the light source lamp 1110 is moved to the lens principal point 201 corresponding to the zoom amount and the focus amount of the lens 105 by monitoring a remote operation signal output from the lens remote controller 1102. However, the present invention is not limited to this, and the configuration may be such that the zoom amount and the focus amount of the television camera 1101 are directly detected, and the illumination principal point 202 is moved according to the detected amounts.
[0105]
FIG. 12 is a diagram for explaining a schematic configuration of an illumination light source provided with a mechanism for directly detecting a zoom amount and a focus amount of a television camera. Hereinafter, a mechanism for detecting the zoom amount and the focus amount and the movement of the illumination principal point 202 according to the detected amount will be described with reference to FIG. Note that the illumination light source 104 illustrated in FIG. 12 has another configuration except for a focus ring 1201, a first rotary encoder 1202, a zoom ring 1203, a second rotary encoder 1204, a first counter 1205, and a second counter 1206. Is the same as the illumination light source 104 of the third embodiment shown in FIG.
[0106]
As shown in FIG. 12, in the imaging system according to the third embodiment, the television camera 1101 includes first and second rotary encoders 1202 and 1204 for detecting the movement amounts of the focus ring 1201 and the zoom ring 1203. Has become. That is, the focus amount is detected by detecting the rotation amount of the focus ring 1201, and the zoom amount is detected by detecting the rotation amount of the zoom ring 1203.
[0107]
The output of the first rotary encoder 1202 is counted by a first counter 1205, and the counted value is input to the lookup table search unit 1105. The output of the second rotary encoder 1204 is counted by a second counter 1206, and the counted value is also input to the lookup table search unit 1105.
[0108]
Here, based on the count values from the first and second counters 1205 and 1206, the lookup table search unit 1105 stores a table (not shown) in which the position information of the illumination principal point 202 according to the zoom amount and the focus amount is stored. The illumination principal point 202 suitable for the combination of the zoom amount and the focus amount is searched with reference to the reference. The position information of the illumination principal point 202 obtained by the search is converted into drive power for driving the servo motor 1107 by the principal point position information voltage conversion unit 1106, and the servo motor 1107 is driven. The light source lamp 1110 moves to the main illumination point 202 in the same manner as the illumination light source 104 of the third embodiment. As a result, it is possible to take a picture without eliminating the blind spot of the illumination.
[0109]
At this time, as shown in FIG. 12, the configuration in which the rotation amounts of the focus ring 1201 and the zoom ring 1203 are read out mechanically is more effective than the configuration of the third embodiment shown in FIG. Since the position of the lens principal point 201 can be accurately grasped, accurate control becomes possible.
[0110]
However, the configuration shown in FIG. 12 requires a mechanism for extracting the rotation of the focus ring 1201 and the zoom ring 1203, the rotary encoders 1202 and 1204, and the like. Therefore, the configuration of the third embodiment shown in FIG. It can be configured.
[0111]
It is needless to say that the illumination light source 104 according to the third embodiment can be applied to the imaging systems according to the first and second embodiments.
[0112]
In the imaging systems of the first to third embodiments, the illumination effect removing unit 110 corrects a pixel affected by specular reflection with a pixel value around the pixel, but it is assumed that there is no specular reflection. Needless to say, only the diffuse reflection coefficient may be calculated and used as the reflection coefficient.
[0113]
Further, the photographing systems according to the first to third embodiments are also applicable to a system that combines a video of a person who is the subject 101 and a background video prepared in advance, such as a sticker printing apparatus. In this case, a print sticker without a sense of incongruity can be generated by generating a person image under lighting conditions that match the background image and combining it with the background image. In this case, by utilizing the effect that the lighting conditions can be changed in real time and sequentially, the user can check the degree of synthesis before printing on the sticker, and since there is no blind spot of lighting at the time of shooting, the entire image of the person is A uniform amount of light can be taken.
[0114]
In a simple photographing system such as an existing sticker printing apparatus, a sheet that easily reflects a flashlight is installed behind a person who is the subject 101, and a person area and a background area are determined and synthesized based on the brightness of a photographed image. However, in the present invention, since the distance information is obtained at the same time, the person area can be determined based on the distance information without arranging a special sheet behind the person.
[0115]
In addition, it is difficult to set up good-looking and natural lighting in ordinary households in terms of space, for example, in the conventional telephone or video conference using a communication medium such as the Internet or a telephone line, which is currently spreading, In practice, the lighting installed on the ceiling of the room or the lighting near the videophone is used. However, by applying the invention of the present application, the illumination conditions can be changed in various ways, so that the illumination can be set to a natural normal light.
[0116]
Furthermore, by applying the present invention, it is possible to change the lighting conditions in the new lighting condition setting unit according to the shooting date and time, so that even when indoors, the sense of season, time (for example, autumn evening), etc. The effect of can be expressed automatically. In addition, by using the depth information, an area that is deeper than the person is replaced with another image, so that privacy can be protected by not showing the room to the other party.
[0117]
As described above, the invention made by the inventor has been specifically described based on the embodiment of the present invention. However, the present invention is not limited to the embodiment of the present invention, and does not depart from the gist of the invention. It goes without saying that various changes can be made in.
[0118]
【The invention's effect】
The following is a brief description of an effect obtained by a representative one of the inventions disclosed in the present application.
[0119]
(1) Since the configuration does not require the calculation processing of the specular reflection coefficient which requires an enormous amount of calculation, it is possible to sequentially generate images under predetermined illumination conditions in real time (real time).
[0120]
(2) Lighting effect removal based on a difference image obtained by sequentially performing a subtraction process of a photographed RGB image photographed when irradiation light is not irradiated from a photographed RGB image photographed by irradiation light irradiation. Since the unit calculates the reflection coefficient which does not require the calculation processing of the specular reflection coefficient, even if the subject is illuminated, it is possible to sequentially generate images under predetermined illumination conditions in real time (real time). it can.
[0121]
(3) Since the light source lamp can be moved to the main lighting point according to the zoom amount and the focus amount of the TV camera, even when a zoom lens is used for the TV camera, it is possible to take a picture without eliminating the blind spot of the lighting. it can.
[Brief description of the drawings]
FIG. 1 is a diagram for explaining a schematic configuration of an imaging system according to a first embodiment of the present invention.
FIG. 2 is a diagram illustrating an optical configuration of a measurement unit according to the first embodiment.
FIG. 3 is a diagram for explaining another optical configuration of the measurement unit according to the first embodiment.
FIG. 4 is a diagram for explaining an operation of a normal estimation unit according to the first embodiment;
FIG. 5 is a diagram for explaining an operation of a lighting effect removing unit according to the first embodiment.
FIG. 6 is a diagram for explaining the principle of measuring spectral characteristics in the illumination effect removing unit according to the first embodiment.
FIG. 7 is a diagram for explaining a procedure of calculating a value used for determining brightness in the illumination effect removing unit according to the first embodiment.
FIG. 8 is a diagram for explaining a schematic configuration of a lighting effect removing unit according to the first embodiment.
FIG. 9 is a diagram for explaining a schematic configuration of an imaging system according to a second embodiment of the present invention.
FIG. 10 is a diagram for explaining the collection timing of captured RGB images and the irradiation timing of illumination light by a depth extraction camera according to Embodiment 2.
FIG. 11 is a diagram for explaining a schematic configuration of an illumination light source in a photographing system according to a third embodiment of the present invention.
FIG. 12 is a diagram for explaining a schematic configuration of another illumination light source in the imaging system according to the third embodiment of the present invention.
[Explanation of symbols]
101 subject 102 measuring unit
103: semi-transparent mirror (half mirror) 104: illumination light source
105: Lens # 106: Depth extraction RGB camera
107: normal line estimation unit # 108: incident light amount estimation unit
109: threshold setting unit # 110: lighting effect removing unit
111: New lighting condition setting unit # 112: New lighting condition adding unit
113: spectral colorimeter 測 114: area A
201: Lens principal point 202: Illumination principal point
801: Irradiation incident angle determination unit # 802: Color determination unit
803: brightness determination unit 804: region interpolation unit
805: Diffuse reflected light removal unit
901: Optical shutter # 902: Synchronous generator
903: frequency divider $ 904: image memory unit
905: Difference image generation unit
1101 ... TV camera $ 1102 ... Lens remote control
1103: first A / D converter # 1104: second A / D converter
1105 Lookup table search unit
1106: Principal point position information voltage converter # 1107: Servo motor
1108: Rotating shaft # 1109: Sliding part
1110 ... Light source lamp
1201: Focus ring # 1202: First rotary encoder
1203: Zoom ring # 1204: Second rotary encoder
1205: first counter 第 1206: second counter

Claims

A reflection coefficient calculating means for calculating a reflection coefficient of a subject imaged as an input image, a new lighting condition setting means for setting a lighting condition different from the input image, and based on the set lighting condition and the reflection coefficient. An illumination condition adding unit that generates an image of the illumination condition set by the new illumination condition setting unit from the input image, wherein the input image comprises an image including depth information of the subject, A reflection coefficient calculation unit configured to specify a specular reflection region from the input image based on the depth information of the subject and correct a specular reflection pixel; and a diffusion unit configured to calculate a reflection coefficient of the subject from the corrected input image. An image processing apparatus comprising: a reflection coefficient calculating unit.

2. The image processing apparatus according to claim 1, wherein the correction unit estimates a normal vector of each part of the subject based on depth information of the subject imaged in the input image; An image processing apparatus, comprising: first area specifying means for specifying the specular reflection area in the input image based on a normal vector.

3. The image processing apparatus according to claim 2, wherein the input image includes red, green, and blue captured images captured using illumination light whose spectral characteristics have been measured in advance, and wherein the correction unit performs pixel-by-pixel correction on the input image. A spectral characteristic calculating unit that calculates the spectral characteristics of the pixels, and a region in which the spectral characteristics of each of the pixels and the spectral characteristics of the illumination light are close to each other from the region specified by the first region specifying unit. An image processing apparatus comprising: a second area specifying unit for specifying.

4. The image processing apparatus according to claim 3, wherein the correction unit generates a histogram of luminance values for the input image, and focuses on a luminance value at a peak on a high luminance side of the histogram. A luminance value computing unit that computes a reference luminance value obtained by folding back to a lower luminance side, and a region that is equal to or larger than the reference luminance value from the region specified by the second region specifying unit and is specified as the specular reflection region. An image processing apparatus comprising: a third area specifying unit.

The image processing apparatus according to any one of claims 2 to 4, further comprising a threshold setting unit that sets a threshold of the first to third region identification units.

Photographing means for photographing a photographed image including depth information of a subject; reflection coefficient calculating means for calculating a reflection coefficient of the subject imaged in the photographed image; and new lighting condition setting means for setting lighting conditions different from the image And an illumination condition adding unit configured to generate an image of an illumination condition set by the new illumination condition setting unit from the captured image based on the set illumination condition and the reflection coefficient. A reflection coefficient calculation unit configured to specify a specular reflection area from the captured image based on the depth information of the subject and correct a specular reflection pixel; and a diffusion unit configured to calculate a reflection coefficient of the subject from the corrected captured image. An imaging system comprising: a reflection coefficient calculating unit.

7. The imaging system according to claim 6, wherein the correction unit estimates a normal vector of each part of the subject based on depth information of the subject imaged in the captured image, and the normal vector estimation unit. An imaging system comprising: a first area identification unit that identifies the specular reflection area in the captured image based on a line vector.

8. The image capturing system according to claim 7, wherein the image capturing unit includes a unit that captures red, green, and blue captured images under illumination light whose spectral characteristics have been measured in advance, and the correction unit includes a pixel for the captured image. A spectral characteristic calculating unit that calculates spectral characteristics for each pixel; and a region where the spectral characteristics of each pixel and the spectral characteristics of the illumination light are close to each other from the region specified by the first region specifying unit, and And a second area specifying means for specifying the image data.

9. The imaging system according to claim 8, wherein the correction unit generates a histogram of luminance values for the photographed image, and calculates a last luminance value of the histogram with a luminance value at a peak on a high luminance side of the histogram as a center. A brightness value calculating means for calculating a reference brightness value folded back to the low brightness side; and a third means for selecting an area having the reference brightness value or more from the area specified by the second area specifying means and specifying the selected area as the specular reflection area. An imaging system comprising: an area specifying unit.

The photographing system according to any one of claims 6 to 9, further comprising a threshold setting unit that sets a threshold of the first to third region specifying units.