JP3347087B2

JP3347087B2 - 3D structure reconstruction method from 2D video

Info

Publication number: JP3347087B2
Application number: JP02562899A
Authority: JP
Inventors: 利春向井; 昇大西
Original assignee: RIKEN Institute of Physical and Chemical Research
Current assignee: RIKEN Institute of Physical and Chemical Research
Priority date: 1999-02-03
Filing date: 1999-02-03
Publication date: 2002-11-20
Anticipated expiration: 2019-02-03
Also published as: JP2000222580A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、ビデオカメラ等の
２次元動画像から、並進運動と回転運動を識別して３次
元構造を復元しかつ対象物の実際の大きさを知る方法に
関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a method for recognizing a translational motion and a rotational motion from a two-dimensional moving image of a video camera or the like, restoring a three-dimensional structure, and knowing the actual size of an object.

【０００２】[0002]

【従来の技術】ビデオカメラ等で撮像される動画像は観
測者の動きや対象の構造に関する重要な情報を含んでい
る。例えば、街を歩きながらビデオカメラである建物の
周りを動きながら撮影してデータを取れば、後でコンピ
ュータにより３次元構造を復元し、自由な角度から眺め
ることが可能となる。従って、かかる２次元動画像から
の３次元構造の復元は、コンピュータビジョンの重要な
課題の１つであり、この技術が確立されれば、３次元モ
デリング、トラッキング、パッシブ・ナビゲーション、
ロボットビジョンなどの多くの分野に応用可能である。2. Description of the Related Art A moving image picked up by a video camera or the like contains important information on the movement of an observer and the structure of an object. For example, if data is taken by moving around a building, which is a video camera while walking in a city, data is acquired, and a three-dimensional structure can be restored later by a computer and viewed from any angle. Therefore, restoration of a three-dimensional structure from such a two-dimensional moving image is one of the important issues of computer vision, and if this technology is established, three-dimensional modeling, tracking, passive navigation,
It can be applied to many fields such as robot vision.

【０００３】この分野の研究は、おおまかに２種類に分
類することができる。１つは、異なる時刻に得られた画
像上の点の対応関係を使う手段であり、もう１つは、画
像上の速度（オプティカルフロー）を利用する手段であ
る。前者に較べてオプティカルフローを使う手段は、
（１）オプティカルフローは画像上の対応点よりも容易
に得られ、（２）速度はオプティカルフローから得られ
るが対応点からは得られない、等の利点がある。[0003] Research in this area can be broadly classified into two types. One is a means for using the correspondence between points on the image obtained at different times, and the other is a means for using the speed (optical flow) on the image. The means of using optical flow compared to the former is
(1) The optical flow can be obtained more easily than the corresponding point on the image, and (2) the speed can be obtained from the optical flow but cannot be obtained from the corresponding point.

【０００４】更に、画像上のオプティカルフローから対
象の構造を復元する手段として、（１）平行投影像を使
うものと（２）透視投影像を使うものがある。前者は後
者の近似であり、この近似は対象がカメラから遠方にあ
る時にのみ成立する。従って、後者の透視投影像を使う
手段の方が高い精度を得ることができる。Further, as means for restoring an object structure from an optical flow on an image, there are (1) a method using a parallel projection image and (2) a method using a perspective projection image. The former is an approximation of the latter, and this approximation only holds when the object is far from the camera. Therefore, the latter means using the perspective projection image can obtain higher accuracy.

【０００５】透視投影像のオプティカルフローから対象
の構造を復元する手段としては、従来、特別な仮定を設
けない限り、非線形連立方程式を繰り返し法を用いて解
く必要があった。特別な仮定とは、被観測点が平面上に
ある場合、運動が回転だけ、又は並進だけの場合であ
る。従って、一般的には、非線形連立方程式を解く必要
があるが、その場合、解の一意性が保証されない、繰り
返し法による探索が必要になる、等の問題点があった。As a means for restoring the structure of an object from an optical flow of a perspective projection image, it has conventionally been necessary to solve a system of nonlinear equations by an iterative method unless special assumptions are made. A special assumption is that when the point to be observed lies on a plane, the movement is only rotation or only translation. Therefore, in general, it is necessary to solve a system of nonlinear equations. However, in this case, there are problems that the uniqueness of the solution is not guaranteed and that a search by an iterative method is required.

【０００６】これらの問題点を解決するために、本発明
の発明者等は、剛体的な運動をする点から透視投影で得
られたオプティカルフロー画像を使って、線形方程式を
解くだけで構造を復元する方法を提案した（「オプティ
カルフロー画像からの線形計算による３次元運動パラメ
ータと構造の復元」，計測自動制御学会論文集，Ｖｏ
ｌ．３４，Ｎｏ．５，４３８／４４４（１９９８））。
この方法により、非線形方程式を解く必要がなく、解の
一意性が保証され、かつ被観測点を増やすことにより精
度も容易に向上できる。[0006] In order to solve these problems, the inventors of the present invention have developed a structure simply by solving a linear equation using an optical flow image obtained by perspective projection from a point performing rigid motion. A method for restoration is proposed (“Reconstruction of 3D motion parameters and structure by linear calculation from optical flow image”, Transactions of the Society of Instrument and Control Engineers, Vo
l. 34, no. 5,438 / 444 (1998)).
According to this method, it is not necessary to solve the nonlinear equation, the uniqueness of the solution is guaranteed, and the accuracy can be easily improved by increasing the number of observation points.

【０００７】[0007]

【発明が解決しようとする課題】しかし、上述した３次
元構造復元方法には、以下の問題点があった。（１）図１に例示するように、カメラ２と対象物１との
距離に対して対象物１の奥行きが相対的に小さい場合、
並進運動（Ａ）と回転運動（Ｂ）は動きが小さい場合に
は画像上では似たようなオプティカルフローとなるの
で、区別するのは非常に難しい。その結果、対象物の構
造も正しく復元することが困難となる。（２）対象物の大きさとカメラの並進運動は相対的な値
としてしか求まらず、絶対的な大きさはわからない。な
ぜなら、小さな対象物が近くにあって少し動いた場合
と、大きな対象物が遠くにあって大きく動いた場合とで
は、画像上では全く同じ結果が得られるからである。However, the above-mentioned three-dimensional structure restoring method has the following problems. (1) As illustrated in FIG. 1, when the depth of the object 1 is relatively small with respect to the distance between the camera 2 and the object 1,
When the motion is small, the translational motion (A) and the rotational motion (B) have a similar optical flow on the image, and it is very difficult to distinguish them. As a result, it is difficult to correctly restore the structure of the object. (2) The size of the object and the translational movement of the camera are obtained only as relative values, and the absolute size is not known. This is because the same result is obtained on an image when a small object moves nearby and moves a little, and when a large object moves far and largely.

【０００８】本発明は、かかる問題点を解決するために
創案されたものである。すなわち、本発明の目的は、非
線形方程式を解く必要がなく、解の一意性が保証され、
かつ並進運動と回転運動を容易に識別でき、更に対象物
の絶対的な大きさも得られる２次元動画像からの３次元
構造復元方法を提供することにある。The present invention has been made to solve such a problem. That is, the object of the present invention is to eliminate the need to solve nonlinear equations, guarantee the uniqueness of the solution,
Another object of the present invention is to provide a method for restoring a three-dimensional structure from a two-dimensional moving image in which a translational motion and a rotational motion can be easily distinguished and an absolute size of an object can be obtained.

【０００９】[0009]

【課題を解決するための手段】上述した問題点は原理的
なものであり画像処理だけでは解決できない。しかし、
カメラに加速度・角速度センサを付加して情報を補うこ
とにより、これらの欠点を回避することができる。本発
明はかかる新規の着想に基づくものである。The above-mentioned problems are fundamental and cannot be solved only by image processing. But,
These disadvantages can be avoided by supplementing information by adding an acceleration / angular velocity sensor to the camera. The present invention is based on such a new idea.

【００１０】すなわち、本発明によれば、静止している
対象物（１）の動画像を撮像するカメラ（２）に加速度
と角速度を計測するセンサを一体的に取り付け、動画像
と加速度及び角速度のデータを同期させて記録し、得ら
れた角速度データを基に角速度が０になるように動画像
を画像処理して並進運動だけを含む動画像とし、この動
画像から３次元構造を復元し、得られた加速度データか
ら求めた速度と、並進運動だけを含む動画像から求めた
速度との比ｓから、この比ｓにカメラの単位時間あたり
の移動距離を単位として求めた大きさを積算して対象物
の大きさを求める、ことを特徴とする３次元構造復元方
法が提供される。That is, according to the present invention, a sensor for measuring acceleration and angular velocity is integrally attached to a camera (2) for capturing a moving image of a stationary object (1), and the moving image and the acceleration and angular velocity are integrated. to synchronize the data recording to give al
Moving image so that the angular velocity becomes 0 based on the extracted angular velocity data
Is processed into an image containing only translational motion.
3D structure is restored from the image, and the obtained acceleration data
Calculated from the moving image including only the translational motion
From the ratio s to the speed, this ratio s
The target object is calculated by integrating the size calculated using the travel distance of
A three-dimensional structure restoring method for determining the size of the three-dimensional structure.

【００１１】３次元構造（形）を復元したい対象物は静
止しており、その周りを加速度・角速度センサを取り付
けたカメラを動かしながら撮影し、画像と加速度・角速
度センサの出力を同期させて取り込む。次に、加速度・
角速度センサの出力から角速度データが得られるので、
動画像に対し回転をキャンセルするような操作を施す
と、並進運動だけを含む動画像が得られる。これにより
カメラの動きの自由度は減るので、制限された状況下で
動画像からカメラの動きと対象物の再構成が行える。そ
の結果、カメラの動きが以前より正確にわかるので、対
象物の３次元構造の復元値の精度が向上する。An object whose three-dimensional structure (shape) is desired to be restored is stationary, and an image is taken around the object while moving a camera provided with an acceleration / angular velocity sensor, and the image is synchronized with the output of the acceleration / angular velocity sensor. . Next, acceleration
Since angular velocity data can be obtained from the output of the angular velocity sensor,
By performing an operation to cancel the rotation of the moving image, a moving image including only the translational motion can be obtained. As a result, the degree of freedom of the movement of the camera is reduced, so that the movement of the camera and the reconstruction of the target object can be performed from the moving image under a limited situation. As a result, the movement of the camera can be more accurately recognized than before, and the accuracy of the restored value of the three-dimensional structure of the object is improved.

【００１２】更に、カメラ速度と対象物の構造を動画像
から求めた値には後述する同一の未知スケールｓが掛か
っている。そこで、加速度データから求めた実際の速度
と、並進運動だけを含む動画像から求めた画像上の速度
を比較することにより、その比としてこの未知スケール
ｓを求めることができ、この比ｓにカメラの単位時間あ
たりの移動距離を単位として求めた大きさを積算すれば
対象物の大きさを求めることができる。Furthermore, the same unknown scale s, which will be described later, is applied to the values obtained from the moving image for the camera speed and the structure of the object. Therefore, the actual speed obtained from acceleration data, by comparing the speed of the image obtained from a moving image that contains only translational motion can be determined the unknown scale s as the ratio, camera in this ratio s Unit time
The size of the target object can be obtained by integrating the sizes obtained in units of the travel distance of the object.

【００１３】[0013]

【発明の実施の形態】以下、本発明の実施形態を具体的
に説明する。１．座標系の定義以下の説明で位置や速度を表すベクトルを用いるが、ベ
クトルに関して、要素の値を表すために基準となる座標
系を３種類定義する。第１は、世の中に固定したワ−ル
ド座標系であり、ベクトルをこれを基準にして表す場合
は右肩にＢを付ける。これを以下「基準ワ−ルド座標
系」と呼ぶ。第２は、カメラと共に動くカメラ座標系で
あり、この場合は右肩にＣを付ける。第３は、世の中に
対して固定したワ−ルド座標系であるが、この座標系は
各時刻でのカメラ座標系に重なるように取る。この場
合、右肩にＷを付ける。この座標系を以下「瞬時ワ−ル
ド座標系」と呼ぶ。DESCRIPTION OF THE PREFERRED EMBODIMENTS Hereinafter, embodiments of the present invention will be described specifically. 1. Definition of Coordinate System In the following description, vectors representing positions and velocities will be used. Regarding the vectors, three types of reference coordinate systems are defined to represent element values. The first is a world coordinate system fixed in the world. When a vector is expressed based on this, a B is added to the right shoulder. This is hereinafter referred to as a “reference world coordinate system”. The second is a camera coordinate system that moves with the camera. In this case, C is attached to the right shoulder. Third, a world coordinate system fixed to the world is set so as to overlap the camera coordinate system at each time. In this case, W is attached to the right shoulder. This coordinate system is hereinafter referred to as an “instant-world coordinate system”.

【００１４】基準ワ−ルド座標系は動画像を撮る時間を
通して１つだけであるが、瞬時ワ−ルド座標系はカメラ
の移動に伴い、各時刻ごとに異なったものとなる。例え
ば、カメラがある基準ワ−ルド座標系Ｂ₁に対してｖ^Bで
動いている場合、カメラ座標系はカメラと共に動いてい
るので、当然ｖ^C＝０となる。また、この瞬間のカメラ
座標系と重なるように取った瞬時ワ−ルド座標系Ｗ₂か
ら見た速度は、ｖ^W＝Ｒｖ^Bとなる。ここで、ＲはＢ₁か
らＷ₂への変換を表す回転である。ベクトルは基準位置
が並進移動しても要素の値は変わらないので、回転だけ
で座標系間の関係がつけられる。加速度・角速度センサ
の出力は、瞬時ワ−ルド座標系に関する加速度ａ^Wと角
速度ω^Wである。There is only one reference world coordinate system throughout the time when a moving image is taken, but the instant world coordinate system changes at each time as the camera moves. For example, the reference word is a camera - if running in v ^B relative to field coordinate system B _1, the camera coordinate system is so moving together with the camera, a naturally v ^C = 0. Further, the instantaneous word took as to overlap the camera coordinate system of the instant - speed viewed from the field coordinate system W ₂ becomes v ^W = Rv ^B. Here, R is a rotation representing the conversion from B ₁ to W ₂ . Since the values of the elements of the vector do not change even if the reference position is translated, the relationship between the coordinate systems can be established only by rotation. The outputs of the acceleration / angular velocity sensor are acceleration a ^W and angular velocity ω ^W with respect to the instantaneous world coordinate system.

【００１５】２．一時刻のオプティカルフロ−から対象物構造を復元
する方法図２は、対象物とカメラの位置変化との関係図である。
カメラの中心３から対象物上の点４に向かう単位ベクト
ルをｑ^W、カメラ中心の移動をあらわすベクトルをδｕ^W
とすると、時刻ｔとそれから少し後の時刻ｔ＋δｔで図
２のような関係がある。つまり、ｑ^W（ｔ）、ｑ^W（ｔ＋
δｔ）、δｕ^Wは同一平面上にあるので、スカラー三重
積が０になることから、（式１）が成立する。[0015] 2. Method for Reconstructing Object Structure from Optical Flow at One Time FIG. 2 is a diagram showing the relationship between the object and a change in the position of the camera.
The unit vector from the camera center 3 to the point 4 on the object is q ^W , and the vector representing the movement of the camera center is δu ^W
Then, there is a relationship as shown in FIG. 2 between the time t and the time t + δt slightly later than the time t. That is, q ^W (t), q ^W (t +
Since δt) and δu ^W are on the same plane, the scalar triple product becomes 0, so (Equation 1) holds.

【数１】 (Equation 1)

【００１６】（１）式をカメラで観測した画像から直接
得られるカメラ座標系での対象物上の点の方向を表す単
位ベクトルｑ^Cとその時間微分（つまり、オプティカル
フロー）ｄｑ^C／ｄｔを使って書き換えると、（式２）
となる。ただし、ｖ^W、ω^Wはカメラの速度、角速度を表
す。これが対象物上の各点について成り立つ。A unit vector q ^C representing the direction of a point on an object in a camera coordinate system obtained directly from an image obtained by observing the equation (1) with a camera and its time derivative (that is, optical flow) dq ^C / dt are represented by When rewritten using (Equation 2)
Becomes However, v ^{^W,} ω ^W represents speed of the camera, the angular speed. This holds for each point on the object.

【数２】 (Equation 2)

【００１７】ｖ^W、ω^Wの要素から構成されるベクトル
（式３）を定義する。A vector (Equation 3) composed of v ^W and ω ^W elements is defined.

【数３】 (Equation 3)

【００１８】（式２）を変形することにより、最終的に
は観測値（８点以上必要）だけから得られる行列Ｇを使
って、Ｇｘ＝０という方程式が得られる。この式を解くことによってｖ
^W、ω^Wが得られる。ただし、ｖ^Wに関してはスケールは
未知である。ここで得られたｖ^W、ω^Wを使って対象物上
の点の位置も復元される。ただし、スケール未知のｖ^W
を使うので、対象物のスケールも未知となる。つまり、
速度と点の位置全体について１つの未知数（後述する
ｓ）があり、全てはこれが掛かった形で求まる。言い換
えると、点の位置の復元値はカメラの単位時間あたりの
移動距離を単位として求まる。By transforming (Equation 2), an equation Gx = 0 is finally obtained using a matrix G obtained only from observation values (8 or more points are required). By solving this equation, v
^W and ω ^W are obtained. However, the scale of v ^W is unknown. The position of the point on the object is restored using v ^W and ω ^W obtained here. Where v ^{W of} unknown scale
, The scale of the object is also unknown. That is,
There is one unknown (s, which will be described later) for the speed and the entire position of the point, all of which are obtained by multiplying them. In other words, the restored value of the position of the point is obtained in units of the moving distance of the camera per unit time.

【００１９】３．異なる時刻の復元結果を融合する方法動画像からは各時刻でオプティカルフローが得られるの
で、各時刻で対象物構造やカメラの動きが復元される。
しかし、各時刻での復元値は観測した時刻のカメラ座標
系に一致するように決めた瞬時ワールド座標系を基準に
して求まり、さらにスケールはその時刻でのカメラ速度
を基準として求まるので、カメラが移動する結果、同一
の点でも時刻によって座標値は異なったものとなる。3. Method of merging restoration results at different times Since an optical flow is obtained from a moving image at each time, the object structure and the motion of the camera are restored at each time.
However, the restoration value at each time is obtained based on the instantaneous world coordinate system determined to match the camera coordinate system at the observed time, and the scale is obtained based on the camera speed at that time. As a result, even at the same point, the coordinate value differs depending on the time.

【００２０】しかし、対象物の形自体は各時刻で変わら
ないので、各時刻間の対象物の復元値はスケール、並
進、回転の変換を適切に行えば、重なり合うはずであ
る。これにより、各時刻間の座標系の関係を表すスケー
ル、並進、回転が求まる。例えば図３は、ある時刻にお
いて復元した対象物の３次元構造を示しており、（Ａ）
は時刻ｔ１、（Ｂ）は時刻ｔ２におけるものである。対
象物が静止している場合には、形自体は各時刻で変わら
ないので、各時刻間の対象物の復元値はスケール、並
進、回転の変換を適切に行うことにより、重なり合わせ
ることができる。However, since the shape of the object itself does not change at each time, the restored values of the object between the times should overlap if the scale, translation, and rotation are appropriately converted. As a result, a scale, translation, and rotation representing the relationship of the coordinate system between the times are obtained. For example, FIG. 3 shows a three-dimensional structure of an object restored at a certain time, and FIG.
Is at time t1, and (B) is at time t2. When the object is stationary, the shape itself does not change at each time, so the restoration value of the object between each time can be overlapped by appropriately performing scale, translation, rotation conversion. .

【００２１】求まったスケール、並進、回転を使って各
時刻間の変換を行い、重ね合わせた結果について平均を
取ることによって、対象物の形の精度が向上する。速度
については、スケール変換だけを行えば時刻間でのスケ
ールの比が正しい関係にあるｖ^W（ｔ）が求まるし、ス
ケールと共に回転も行えば同一座標系（重ね合わせる基
準に使ったワールド座標系）での時刻に伴う変遷ｖ
^B（ｔ）が求まる。角速度についてはスケールは画像か
ら正しく求まっている、つまり、画像からω^W（ｔ）は
融合前からわかっているので、回転だけを行えばω
^B（ｔ）が求まる。ただし、これでも各時刻のカメラ速
度と復元した点の位置全体に掛かるスケール自体は未知
のままである。By performing conversion between respective times using the obtained scale, translation, and rotation, and averaging the superimposed results, the accuracy of the shape of the object is improved. As for the speed, if only scale conversion is performed, v ^W (t) in which the scale ratio between times is in a correct relationship is obtained. If rotation is performed together with scale, the same coordinate system (the world coordinate system used as a reference for superposition) is used. Changes with time at)
^B (t) is obtained. Regarding the angular velocity, the scale is correctly obtained from the image, that is, since ω ^W (t) is known from the image before the fusion, if only rotation is performed, ω
^B (t) is obtained. However, even in this case, the camera speed at each time and the scale applied to the entire position of the restored point remain unknown.

【００２２】４．加速度・角速度センサを用いて問題点（１）を解決
する方法加速度・角速度センサから角速度が出力されるので、動
画像に対し回転をキャンセルするような操作を施すと、
並進運動だけを含む画像が得られる。これによりカメラ
の動きの自由度は減るので、制限された状況下で動画像
からカメラの動きと対象物の再構成が行える。言い換え
ると、（式２）でω^Wが加速度・角速度センサから求ま
るので、画像と加速度・角速度センサ出力から求まる行
列をＨとして、（式４）という式を解けば良くなる。[0022] 4. Method for Solving Problem (1) Using Acceleration / Angular Velocity Sensor The angular velocity is output from the acceleration / angular velocity sensor.
An image containing only translational motion is obtained. As a result, the degree of freedom of the movement of the camera is reduced, so that the movement of the camera and the reconstruction of the target object can be performed from the moving image under a limited situation. In other words, since ω ^W is obtained from the acceleration / angular velocity sensor in (Equation 2), it is sufficient to solve the equation (Equation 4) with H as a matrix obtained from the image and the output of the acceleration / angular velocity sensor.

【数４】その結果、カメラの動きが以前より正確にわかるので、
対象物の３次元構造の復元値の精度が向上する。(Equation 4) As a result, you can see the movement of the camera more accurately than before,
The accuracy of the restoration value of the three-dimensional structure of the object is improved.

【００２３】５．加速度・角速度センサを用いて問題点２を解決する
方法加速度・角速度センサ出力を用いてカメラの速度を求
め、これを画像から求めた速度と比較することによって
未知スケールを求めることを考える。この未知スケール
は対象物のスケールでもあるので、結局、対象物の大き
さが求まる。まず、加速度・角速度センサ出力を用いて
カメラの速度を求める方法について述べる。これは理論
的には基準ワールド座標系で表したベクトルを用いて
（式５）式のように書ける。ここでｖ^B（ｔ₀）は速度の
初期値、ａ^Bは加速度、ｇ^Bは重力加速度である。[0023] 5. Method of Solving Problem 2 Using Acceleration / Angular Velocity Sensor Consider a method of obtaining an unknown scale by obtaining the speed of a camera using the output of an acceleration / angular speed sensor and comparing the obtained speed with the speed obtained from an image. Since this unknown scale is also the scale of the object, the size of the object is finally obtained. First, a method of obtaining the speed of the camera using the output of the acceleration / angular velocity sensor will be described. This can be theoretically written as Expression (5) using a vector expressed in the reference world coordinate system. Here, v ^B (t ₀ ) is the initial value of the velocity, a ^B is the acceleration, and g ^B is the gravitational acceleration.

【数５】 (Equation 5)

【００２４】加速度・角速度センサはカメラに取り付け
てあるのでセンサ出力はカメラ座標系と一致する瞬時ワ
ールド座標系に関して得られるから、（式５）を変形し
て（式６）と（式７）が得られる。ここでＲ（ｔ）は、
基準ワールド座標系から、時刻ｔでの瞬時ワールド座標
系への回転を表す。Since the acceleration / angular velocity sensor is attached to the camera, the sensor output can be obtained with respect to the instantaneous world coordinate system which coincides with the camera coordinate system. Therefore, the expression (5) is modified to obtain the expressions (6) and (7). can get. Where R (t) is
It represents the rotation from the reference world coordinate system to the instantaneous world coordinate system at time t.

【数６】 (Equation 6)

【数７】このＲ（ｔ）自体は、初期値がわかっていれば加速度・
角速度センサ出力の角速度を時間積分することによって
求められる。(Equation 7) This R (t) itself can be calculated as acceleration /
It is determined by integrating the angular velocity of the output of the angular velocity sensor over time.

【００２５】上述した原理を用いて加速度・角速度セン
サから求めた速度をｖ^W _G（ｔ）と表記し、真の速度ｖ^W
（ｔ）との関係を（式８）と表す。ここで、ｂ（ｔ）は
未知の初期速度とドリフトの効果を含めて表したもので
ある。The velocity obtained from the acceleration / angular velocity sensor based on the above principle is expressed as v ^W _G (t), and the true velocity v ^W
The relationship with (t) is represented by (Equation 8). Here, b (t) represents the unknown initial velocity and the effects of drift.

【数８】 (Equation 8)

【００２６】ｂ（ｔ）はセンサのノイズのために少しず
つ変化するが、その変化の値は小さいので、ｄｔ／ｄｂ
（ｔ）≒０とできる。一方、画像から求めた速度をｖ^W _I
（ｔ）とするとこれはスケール未知なので（式９）とな
る。よって（式１０）という関係が得られる。B (t) changes little by little due to sensor noise, but since the value of the change is small, dt / db
(T) ≒ 0. On the other hand, the speed obtained from the image is represented by v ^W _I
If (t) is used, the scale is unknown, so that (Expression 9) is obtained. Therefore, the relationship of (Equation 10) is obtained.

【数９】 (Equation 9)

【数１０】 (Equation 10)

【００２７】図４は、画像から求めた速度と真の速度と
の関係図である。この図に示すように、計測時間にわた
ってｂ（ｔ）がほとんど変化しない場合にはｖ^W _I（ｔ）
とｖ^W _G（ｔ）をプロットしたグラフはほぼ直線状に並
び、直線の傾きからスケールｓがわかる。FIG. 4 is a diagram showing the relationship between the speed obtained from the image and the true speed. As shown in this figure, when b (t) hardly changes over the measurement time, v ^W _I (t)
And a graph plotting v ^W _G (t) are substantially linearly arranged, and the scale s can be determined from the slope of the straight line.

【００２８】計測時間中にｂ（ｔ）が変化するとした場
合にも、その単位時間あたりの変化分は小さいので、
（式１０）の時間微分を取ると、（式１１）となるか
ら、この関係からｓが得られる。Even if b (t) changes during the measurement time, the change per unit time is small.
Taking the time derivative of (Equation 10) gives (Equation 11), and s is obtained from this relationship.

【数１１】以上の方法で画像から得られた速度と復元位置全体にか
かるスケルールｓが求まるので、最終的に、この比ｓに
カメラの単位時間あたりの移動距離を単位として求めた
大きさを積算すれば対象物の大きさも求まる。[Equation 11] Since such Sukeruru s is obtained in the entire speed and restoring the position obtained from the images by the above method, finally, to the ratio s
The distance traveled by the camera per unit time was obtained as a unit.
If the size is integrated , the size of the target object can also be obtained.

【００２９】上述したように、本発明の方法によれば、
動画像を撮像するカメラに加速度と角速度を計測するセ
ンサを一体的に取り付け、動画像と加速度及び角速度の
データを同期させて記録する。また、角速度データを基
に角速度が０になるように動画像を画像処理して並進運
動だけを含む動画像とし、この動画像から３次元構造を
復元する。更に、加速度データから求めた速度と、並進
運動だけを含む動画像から求めた速度との比ｓから、こ
の比ｓにカメラの単位時間あたりの移動距離を単位とし
て求めた大きさを積算すれば対象物の大きさを求める。As described above, according to the method of the present invention,
A sensor that measures acceleration and angular velocity is integrally attached to a camera that captures a moving image, and data of the moving image and the acceleration and angular velocity are recorded in synchronization. Further, based on the angular velocity data, the moving image is subjected to image processing so that the angular velocity becomes zero, thereby obtaining a moving image including only the translational motion, and the three-dimensional structure is restored from the moving image. Furthermore, the speed obtained from acceleration data, the ratio s between the speed obtained from a moving image that contains only translational motion, this
Is the ratio of camera movement distance per unit time to unit s
The size of the target object is obtained by integrating the sizes obtained by the above .

【００３０】カメラと加速度・角速度センサの出力を同
期させて取り込むことにより、加速度・角速度センサの
出力から角速度データが得られるので、動画像に対し回
転をキャンセルするような操作を施すと、並進運動だけ
を含む動画像が得られる。これによりカメラの動きの自
由度は減るので、制限された状況下で動画像からカメラ
の動きと対象物の再構成が行える。その結果、カメラの
動きが以前より正確にわかるので、対象物の３次元構造
の復元値の精度が向上する。更に、カメラ速度と対象物
の構造を動画像から求めた値には同一の未知スケールｓ
（式９〜１１参照）が掛かっているが、加速度データか
ら求めた実際の速度と、並進運動だけを含む動画像から
求めた画像上の速度を比較することにより、その比とし
てこの未知スケールｓを求めることができ、この比ｓに
カメラの単位時間あたりの移動距離を単位として求めた
大きさを積算すれば対象物の大きさを求めることができ
る。By synchronizing and capturing the output of the camera and the acceleration / angular velocity sensor, angular velocity data can be obtained from the output of the acceleration / angular velocity sensor. Is obtained. As a result, the degree of freedom of the movement of the camera is reduced, so that the movement of the camera and the reconstruction of the target object can be performed from the moving image under a limited situation. As a result, the movement of the camera can be more accurately recognized than before, and the accuracy of the restored value of the three-dimensional structure of the object is improved. Further, the same unknown scale s is included in the values obtained from the moving image for the camera speed and the structure of the object.
(Equations 9 to 11) are applied, but by comparing the actual speed obtained from the acceleration data with the speed on the image obtained from the moving image including only the translational motion, the unknown scale s is obtained as a ratio. And this ratio s
The distance traveled by the camera per unit time was obtained as a unit.
By integrating the sizes, the size of the object can be obtained.

【００３１】なお、本発明は上述した実施形態に限定さ
れず、本発明の要旨を逸脱しない範囲で種々に変更でき
ることは勿論である。It should be noted that the present invention is not limited to the above-described embodiment, but can be variously modified without departing from the gist of the present invention.

【００３２】[0032]

【発明の効果】上述したように、本発明の２次元動画像
からの３次元構造復元方法は、非線形方程式を解く必要
がなく、解の一意性が保証され、かつ並進運動と回転運
動を容易に識別でき、更に対象物の絶対的な大きさも得
られる、等の優れた効果を有する。As described above, the method for restoring a three-dimensional structure from a two-dimensional moving image according to the present invention does not require solving a nonlinear equation, guarantees uniqueness of the solution, and facilitates translation and rotation. And the absolute size of the object can be obtained.

[Brief description of the drawings]

【図１】対象物とカメラとの関係を示す図である。FIG. 1 is a diagram showing a relationship between an object and a camera.

【図２】対象物とカメラの位置変化との関係図である。FIG. 2 is a relationship diagram between an object and a change in the position of a camera.

【図３】復元した形状を模式的に示す図である。FIG. 3 is a diagram schematically showing a restored shape.

【図４】画像から求めた速度と真の速度との関係図であ
る。FIG. 4 is a relationship diagram between a speed obtained from an image and a true speed.

[Explanation of symbols]

１対象物２カメラ３カメラの中心４対象物上の点 1 object 2 camera 3 camera center 4 point on object

───────────────────────────────────────────────────── フロントページの続き (72)発明者大西昇愛知県名古屋市守山区大字下志段味字穴ケ洞2271−130 サイエンスパーク研究開発センター内理化学研究所バイオ・ミメティックコントロール研究センター内 (56)参考文献特開平９−81790（ＪＰ，Ａ) 特開平11−306363（ＪＰ，Ａ) 特開平10−23465（ＪＰ，Ａ) 向井利春，大西昇，「三次元形状モデル作成のためのビデオカメラとジャイロセンサを用いたセンサシステム」，日本バーチャルリアリティ学会第４回大会論文集，日本，日本バーチャルリアリティ学会，1999年９月29日，ｐ．213−216 向井利春，大西昇，「オプティカルフロー画像からの線形計算による３次元運動パラメータと構造の復元」，計測自動制御学会論文集，日本，計測自動制御学会，1998年５月31日，Ｖｏｌ．34，Ｎｏ．５，ｐ．438−444 (58)調査した分野(Int.Cl.⁷，ＤＢ名) G06T 1/00 315 G01B 11/24 G06T 7/00 G06T 7/20 G06T 7/20 100 ────────────────────────────────────────────────── ─── Continuing on the front page (72) Inventor Noboru Noboru 2271-130 Science Park Research and Development Center, Shimo-shi-dami-ji, Moriyama-ku, Nagoya-shi, Aichi Pref. RIKEN Bio-Mimetic Control Research Center (56) Reference References JP-A-9-81790 (JP, A) JP-A-11-306363 (JP, A) JP-A-10-23465 (JP, A) Toshiharu Mukai, Noboru Onishi, "3D Shape Model Creation" Sensor System Using Video Camera and Gyro Sensor, "Proceedings of the 4th Annual Meeting of the Virtual Reality Society of Japan, Japan, Virtual Reality Society of Japan, September 29, 1999, p. 213-216 Toshiharu Mukai, Noboru Onishi, "Reconstruction of 3D Motion Parameters and Structure by Linear Calculation from Optical Flow Image", Transactions of the Society of Instrument and Control Engineers, Japan, Society of Instrument and Control Engineers, May 31, 1998 Date, Vol. 34, No. 5, p. 438-444 (58) Fields investigated (Int.Cl. ⁷ , DB name) G06T 1/00 315 G01B 11/24 G06T 7/00 G06T 7/20 G06T 7/20 100

Claims

(57) [Claims]

1. A sensor for measuring acceleration and angular velocity is integrally attached to a camera for capturing a moving image of a stationary object, and data of the acceleration and angular velocity are synchronized with the camera. Record and move so that the angular velocity becomes 0 based on the obtained angular velocity data.
The image is processed into a moving image that contains only translational motion.
Of the three-dimensional structure from the moving image of the target, and the speed and translational motion obtained from the obtained acceleration data
From the ratio s to the speed obtained from the moving image containing
The distance traveled by the camera per unit time was obtained as a unit.
A three-dimensional structure restoring method, wherein a size of an object is obtained by integrating the sizes .