JP6854629B2

JP6854629B2 - Image processing device, image processing method

Info

Publication number: JP6854629B2
Application number: JP2016228295A
Authority: JP
Inventors: 知宏西山
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2016-11-24
Filing date: 2016-11-24
Publication date: 2021-04-07
Anticipated expiration: 2036-11-24
Also published as: JP2018084997A

Description

本発明は、画像間のオプティカルフローを求めるための技術に関するものである。 The present invention relates to a technique for obtaining an optical flow between images.

近年、画像間の各画素の対応付けを行う技術の重要性が高まっている。対応とは、基準画像の画素と同一であるとみなす参照画像の画素との関係であり、二点の座標で表現できる。ステレオ画像や多視点画像を入力した場合は、画素の対応関係から被写体の奥行きを算出できるため、３次元画像処理に応用することも可能である。また、連続して撮像された画像（動画）を入力し、その対応関係を相対座標として表現すれば、それは動きベクトルとなる。画素ごとの動きベクトル（以下オプティカルフローと称する）を利用することによって、動体追跡、動画の防振などが可能となる。 In recent years, the importance of a technique for associating each pixel between images has increased. Correspondence is the relationship with the pixels of the reference image which are considered to be the same as the pixels of the reference image, and can be expressed by the coordinates of two points. When a stereo image or a multi-viewpoint image is input, the depth of the subject can be calculated from the correspondence of pixels, so that it can be applied to three-dimensional image processing. Further, if images (moving images) captured continuously are input and the correspondence is expressed as relative coordinates, it becomes a motion vector. By using a motion vector for each pixel (hereinafter referred to as an optical flow), it is possible to track a moving object, isolate a moving image, and the like.

オプティカルフローを取得する代表的な手法としては勾配法がある。勾配法では、画素の時空間の輝度変化の方向と大きさから、オプティカルフローを算出する。勾配法は大きく２種類に分けることができる。一つ目の勾配法では、着目画素の周辺の画素は同じ動きであると仮定し、着目画素を中心とするパッチ内の画素の平均的な時空間の輝度変化からオプティカルフローを算出する（以下、パッチベースの手法と呼称する）。二つ目の勾配法では、画素ごとに画像間の輝度差と、オプティカルフローの滑らかさを表す平滑化項を重みづけして加算し、すべての画素で総和をとったものをエネルギーとして、エネルギーを最適化する（以下、エネルギー最適化法と呼称する）。 The gradient method is a typical method for acquiring optical flow. In the gradient method, the optical flow is calculated from the direction and magnitude of the spatiotemporal luminance change of the pixel. The gradient method can be roughly divided into two types. In the first gradient method, it is assumed that the pixels around the pixel of interest have the same movement, and the optical flow is calculated from the average spatiotemporal brightness change of the pixels in the patch centered on the pixel of interest (hereinafter). , Called a patch-based method). In the second gradient method, the brightness difference between images and the smoothing term representing the smoothness of the optical flow are weighted and added for each pixel, and the sum of all the pixels is used as energy. (Hereinafter referred to as energy optimization method).

パッチベースの手法の代表的なものには、非特許文献１に記載のＬｕｃａｓＫａｎａｄｅ法（以下、ＬＫ法と呼称する）があり、特許文献１でも同様の考え方が用いられている。特許文献２では、エネルギー最適化法が用いられている。 A typical patch-based method is the Lucas-Kanade method (hereinafter referred to as the LK method) described in Non-Patent Document 1, and the same concept is used in Patent Document 1. In Patent Document 2, an energy optimization method is used.

国際公開第０６/０７５３９４International Publication No. 06/075394 特開平９−１７８７６４号公報Japanese Unexamined Patent Publication No. 9-178764

ＰｙｒａｍｉｄａｌＩｍｐｌｅｍｅｎｔａｔｉｏｎｏｆｔｈｅＬｕｃａｓＫａｎａｄｅＦｅａｔｕｒｅＴｒａｃｋｅｒＤｅｓｃｒｉｐｔｉｏｎｏｆｔｈｅａｌｇｏｒｉｔｈｍＪｅａｎ−ＹｖｅｓＢｏｕｇｕｅｔ [online] [retrieved on 2016-11-07] Retrieved from the Internet: ＜URL:ｈｔｔｐ：／／ｒｏｂｏｔｓ．ｓｔａｎｆｏｒｄ．ｅｄｕ／ｃｓ２２３ｂ０４／ａｌｇｏ＿ｔｒａｃｋｉｎｇ．ｐｄｆ＞Pyramidal Implementation of the Lucas Kanade Footure Tracker Description of the algorithm Jean-Yves Bouget [online] [retrieved on 2016-11-07] Retrieved from the Internet stanford. edu / cs223b04 / algo_tracking. pdf>

しかしながら、特許文献２に代表されるエネルギー最適化法では、エネルギー最適化のための反復計算が必要となり、演算量が増大するという課題がある。一方、非特許文献１に代表されるパッチベースの手法では、反復計算が不要なため、高速にオプティカルフローを推定できる。しかし、拘束条件が明確に考慮されていないため、正解値から外れたフローベクトルが推定される可能性が高くなり、推定が不安定になるという課題がある。 However, the energy optimization method represented by Patent Document 2 requires iterative calculation for energy optimization, and has a problem that the amount of calculation increases. On the other hand, the patch-based method represented by Non-Patent Document 1 does not require iterative calculation, so that the optical flow can be estimated at high speed. However, since the constraint conditions are not clearly considered, there is a high possibility that a flow vector deviating from the correct answer value is estimated, and there is a problem that the estimation becomes unstable.

特許文献１の手法は、階層処理の中で、推定したオプティカルフローを平滑化するようパッチベースの手法を改良したものである。このことにより、正解から外れたフローベクトルの出現を抑制できるが、テクスチャが少ない領域では、推定値が不安定になるという課題がある。 The method of Patent Document 1 is an improvement of the patch-based method so as to smooth the estimated optical flow in the hierarchical processing. As a result, the appearance of the flow vector deviating from the correct answer can be suppressed, but there is a problem that the estimated value becomes unstable in the region where the texture is small.

本発明はこのような問題に鑑みてなされたものであり、少ない演算量で高精度にオプティカルフローを推定するための技術を提供する。 The present invention has been made in view of such a problem, and provides a technique for estimating an optical flow with high accuracy with a small amount of calculation.

本発明の一様態は、第１の画像及び該第１の画像を規定の縮小率で再帰的に縮小した複数の縮小画像を要素とする第１の集合と、第２の画像及び該第２の画像を前記規定の縮小率で再帰的に縮小した複数の縮小画像を要素とする第２の集合と、を取得する取得手段と、前記第２の集合に属する画像を画像サイズが小さい順に選択する選択手段と、前記選択手段が今回選択した選択画像の各画素を、前記選択手段が前回選択した画像に対応するオプティカルフローを該選択画像のサイズに応じて変換した変換済みオプティカルフローに従って移動させた移動済み選択画像を生成する生成手段と、前記第１の集合に属する画像のうち前記選択画像と同サイズの画像と前記移動済み選択画像との差分である第１の差分と、前記変換済みオプティカルフローと該変換済みオプティカルフローに対して平滑化処理を施した処理済みオプティカルフローとの差分である第２の差分と、に基づく評価値を極小化するオプティカルフローを、前記選択画像に対応するオプティカルフローとして求める計算手段と、前記計算手段が求めた、前記第２の画像に対応するオプティカルフローを出力する出力手段とを備えることを特徴とする。 The uniformity of the present invention comprises a first set of elements of a first image and a plurality of reduced images obtained by recursively reducing the first image at a specified reduction ratio, a second image, and the second image. A second set having a plurality of reduced images recursively reduced at the specified reduction rate, an acquisition means for acquiring the image, and an image belonging to the second set are selected in ascending order of image size. The selection means to be selected and each pixel of the selected image selected by the selection means this time are moved according to the converted optical flow obtained by converting the optical flow corresponding to the image previously selected by the selection means according to the size of the selected image. The generation means for generating the moved selected image, the first difference between the image belonging to the first set and the image having the same size as the selected image and the moved selected image, and the converted image. The second difference, which is the difference between the optical flow and the processed optical flow obtained by smoothing the converted optical flow, and the optical flow that minimizes the evaluation value based on the second difference correspond to the selected image. It is characterized by including a calculation means obtained as an optical flow and an output means for outputting the optical flow corresponding to the second image obtained by the calculation means.

本発明の構成によれば、少ない演算量で高精度にオプティカルフローを推定することができる。 According to the configuration of the present invention, the optical flow can be estimated with high accuracy with a small amount of calculation.

コンピュータ装置のハードウェア構成例を示すブロック図。A block diagram showing a hardware configuration example of a computer device. オプティカルフローを説明する図。The figure explaining the optical flow. 画像処理装置の機能構成例を示すブロック図。The block diagram which shows the functional configuration example of an image processing apparatus. オプティカルフローを生成するための処理のフローチャート。Flowchart of processing for generating optical flow. 画像処理装置の機能構成例を示すブロック図。The block diagram which shows the functional configuration example of an image processing apparatus. オプティカルフローを生成するための処理のフローチャート。Flowchart of processing for generating optical flow. 参考オプティカルフローを得るための処理について説明する図。Reference The figure explaining the process for obtaining an optical flow. 画像処理装置の機能構成例を示すブロック図。The block diagram which shows the functional configuration example of an image processing apparatus. 画像処理装置の機能構成例を示すブロック図。The block diagram which shows the functional configuration example of an image processing apparatus.

以下、添付図面を参照し、本発明の実施形態について説明する。なお、以下説明する実施形態は、本発明を具体的に実施した場合の一例を示すもので、特許請求の範囲に記載した構成の具体的な実施例の１つである。 Hereinafter, embodiments of the present invention will be described with reference to the accompanying drawings. In addition, the embodiment described below shows an example when the present invention is concretely implemented, and is one of the specific examples of the configuration described in the claims.

［第１の実施形態］
本実施形態では、次のような構成を有する画像処理装置の一例について説明する。この画像処理装置は、第１の画像及び該第１の画像を規定の縮小率で再帰的に縮小した複数の縮小画像を要素とする第１の集合と、第２の画像及び該第２の画像を上記規定の縮小率で再帰的に縮小した複数の縮小画像を要素とする第２の集合と、を取得する。そして画像処理装置は、第２の集合に属する画像を画像サイズが小さい順に選択する。そして画像処理装置は、今回選択した選択画像の各画素を、前回選択した画像に対応するオプティカルフローを該選択画像のサイズに応じて変換した変換済みオプティカルフローに従って移動させた移動済み選択画像を生成する。そして画像処理装置は、第１の集合に属する画像のうち選択画像と同サイズの画像と移動済み選択画像との差分である第１の差分と、変換済みオプティカルフローと該変換済みオプティカルフローに対して平滑化処理を施した処理済みオプティカルフローとの差分である第２の差分と、に基づく評価値を極小化するオプティカルフローを、選択画像に対応するオプティカルフローとして求める（計算）。そして画像処理装置は、この計算により求めた、第２の画像に対応するオプティカルフローを出力する。 [First Embodiment]
In this embodiment, an example of an image processing apparatus having the following configuration will be described. This image processing device includes a first set having a first image and a plurality of reduced images recursively reduced at a predetermined reduction ratio as elements, a second image, and the second image. A second set having a plurality of reduced images obtained by recursively reducing the images at the above-specified reduction ratio is acquired. Then, the image processing device selects the images belonging to the second set in ascending order of image size. Then, the image processing device generates a moved selected image in which each pixel of the selected image selected this time is moved according to the converted optical flow in which the optical flow corresponding to the previously selected image is converted according to the size of the selected image. To do. Then, the image processing device relates to the first difference, which is the difference between the image having the same size as the selected image and the moved selected image among the images belonging to the first set, and the converted optical flow and the converted optical flow. The second difference, which is the difference from the processed optical flow that has undergone the smoothing process, and the optical flow that minimizes the evaluation value based on the difference are obtained as the optical flow corresponding to the selected image (calculation). Then, the image processing device outputs the optical flow corresponding to the second image obtained by this calculation.

先ず、本実施形態を含む以下の各実施形態において用いる様々な定義について説明する。以下の説明における「オプティカルフロー」（以下、ＯＦと称する場合がある）とは、基準画像に対する対象画像の動きベクトルを画素毎に登録したマップ画像である。つまり、オプティカルフローは対象画像と同じ解像度（縦横の画素数）を有し、対象画像の各画素に対応するオプティカルフローの要素は二次元ベクトルとなっている。 First, various definitions used in each of the following embodiments including the present embodiment will be described. The "optical flow" (hereinafter, may be referred to as OF) in the following description is a map image in which the motion vector of the target image with respect to the reference image is registered for each pixel. That is, the optical flow has the same resolution (the number of pixels in the vertical and horizontal directions) as the target image, and the element of the optical flow corresponding to each pixel of the target image is a two-dimensional vector.

以下では、画像をＩと表記した場合、該画像上の画素位置（ｘ、ｙ）における画素値はＩ（ｘ，ｙ）と表記する。オプティカルフローにおいて、基準画像Ｉ上の画素位置（ｘ、ｙ）に対応する要素は（ｕ（ｘ，ｙ），ｖ（ｘ，ｙ））と表記する。ｕ（ｘ，ｙ）は、基準画像Ｉの画素位置（ｘ、ｙ）に対応する動きベクトルの水平方向成分（Ｘ成分）を表し、ｖ（ｘ，ｙ）は、基準画像Ｉの画素位置（ｘ、ｙ）に対応する動きベクトルの垂直方向成分（Ｙ成分）を表している。 In the following, when the image is expressed as I, the pixel value at the pixel position (x, y) on the image is expressed as I (x, y). In the optical flow, the element corresponding to the pixel position (x, y) on the reference image I is expressed as (u (x, y), v (x, y)). u (x, y) represents the pixel position of the reference image I (x, y) the horizontal component (X component) of the motion vector corresponding to, v (x, y) is the pixel position of the reference image I ( It represents the vertical component (Y component) of the motion vector corresponding to x, y).

オプティカルフローについて図２を例にとり説明する。図２では、画像２０１に対する画像２０２のオプティカルフローについて説明する。画像２０１は、人物２０３が移動しているシーンを撮像装置を動かしながら撮像した動画像中のＮ（Ｎは１以上の整数）フレーム目の画像であり、画像２０２は該動画像における（Ｎ＋Ｎ’）（Ｎ’は１以上の整数）フレーム目の画像である。画像２０１及び画像２０２には被写体として人物２０３と家２０４とが含まれている。動きベクトル２０５は、画像２０１中の人物２０３から画像２０２中の人物２０３への動きベクトルを表しており、動きベクトル２０６は、画像２０１中の家２０４から画像２０２中の家２０４への動きベクトルを表している。一般的に、画像中の人物２０３（家２０４）の領域を構成するそれぞれの画素に対する動きベクトルは全く同じではないが、図２では説明を簡単にするために、オブジェクト内の各画素の動きベクトルは全て同じであるものとする。つまり図２では、画像２０１中の人物２０３の領域内の各画素の動きベクトルは全て動きベクトル２０５とし、画像２０１中の家２０４の領域内の各画素の動きベクトルは全てベクトル２０６としている。ここで、動きベクトル２０５の成分を（１０，５）、動きベクトル２０６の成分を（−５，０）とする。このとき、画像２０１上の画素位置（ｘ、ｙ）が人物２０３の領域に含まれている場合、画像２０１に対するオプティカルフローにおいて画素位置（ｘ、ｙ）に対応する要素（ｕ（ｘ，ｙ），ｖ（ｘ，ｙ））＝（１０，５）となる。また、画像２０１上の画素位置（ｘ、ｙ）が家２０４の領域に含まれている場合、画像２０１に対するオプティカルフローにおいて画素位置（ｘ、ｙ）に対応する要素（ｕ（ｘ，ｙ），ｖ（ｘ，ｙ））＝（−５，０）となる。なお、画像２０１上の画素位置（ｘ、ｙ）が背景領域（人物２０３及び家２０４以外の領域）に含まれている場合、画像２０１に対するオプティカルフローにおいて画素位置（ｘ、ｙ）に対応する要素（ｕ（ｘ，ｙ），ｖ（ｘ，ｙ））＝（０，０）とする。 The optical flow will be described by taking FIG. 2 as an example. FIG. 2 describes the optical flow of the image 202 with respect to the image 201. The image 201 is an image of the N (N is an integer of 1 or more) frame in the moving image of the scene in which the person 203 is moving while moving the imaging device, and the image 202 is the (N + N'in the moving image. ) (N'is an integer of 1 or more) This is the image of the frame. Images 201 and 202 include a person 203 and a house 204 as subjects. The motion vector 205 represents a motion vector from the person 203 in the image 201 to the person 203 in the image 202, and the motion vector 206 represents a motion vector from the house 204 in the image 201 to the house 204 in the image 202. Represents. Generally, the motion vectors for each pixel constituting the area of the person 203 (house 204) in the image are not exactly the same, but in FIG. 2, for the sake of simplicity, the motion vector of each pixel in the object is shown. Are all the same. That is, in FIG. 2, the motion vectors of each pixel in the region of the person 203 in the image 201 are all motion vectors 205, and the motion vectors of each pixel in the region of the house 204 in the image 201 are all vector 206. Here, the component of the motion vector 205 is (10, 5), and the component of the motion vector 206 is (-5, 0). At this time, when the pixel position (x, y) on the image 201 is included in the area of the person 203, the element (u (x, y) corresponding to the pixel position (x, y) in the optical flow with respect to the image 201). , V (x, y)) = (10, 5). Further, when the pixel position (x, y) on the image 201 is included in the area of the house 204, the element (u (x, y), which corresponds to the pixel position (x, y) in the optical flow with respect to the image 201). v (x, y)) = (-5,0). When the pixel position (x, y) on the image 201 is included in the background area (area other than the person 203 and the house 204), the element corresponding to the pixel position (x, y) in the optical flow with respect to the image 201. (U (x, y), v (x, y)) = (0,0).

本実施形態では、単一の撮像装置において互いに異なる時刻に撮像された第１の画像及び第２の画像（第１の画像の撮像時刻は第２の撮像時刻よりも早い）を取得し、該第１の画像に対する該第２の画像のオプティカルフローを生成する。なお、第１の画像及び第２の画像は単一の撮像装置において互いに異なる時刻に撮像された画像に限らず、複数台の撮像装置において同時刻に撮像された画像であっても良いし、複数台の撮像装置において互いに異なる時刻に撮像された画像であっても良い。 In the present embodiment, the first image and the second image (the imaging time of the first image is earlier than the second imaging time) captured at different times by a single imaging device are acquired, and the image is taken. Generate an optical flow of the second image with respect to the first image. The first image and the second image are not limited to images captured at different times by a single imaging device, and may be images captured at the same time by a plurality of imaging devices. Images may be captured at different times by a plurality of image pickup devices.

次に、本実施形態に係る画像処理装置の機能構成例及びその動作について、画像処理装置の機能構成例を示すブロック図である図３、画像処理装置がオプティカルフローを生成するために行う処理のフローチャートを示す図４、を用いて説明する。なお、図面においてＯＦはオプティカルフローを表す。また、図４に示したフローチャートに従った処理は、１枚の画像に対するオプティカルフローを求めるための処理である。然るに、例えば、複数枚の画像のそれぞれについてオプティカルフローを求める場合には、該複数の画像のそれぞれについて図４のフローチャートに従った処理を行えばよい。 Next, regarding the functional configuration example of the image processing device and its operation according to the present embodiment, FIG. 3, which is a block diagram showing the functional configuration example of the image processing device, the processing performed by the image processing device to generate the optical flow. This will be described with reference to FIG. 4, which shows a flowchart. In the drawings, OF represents an optical flow. Further, the process according to the flowchart shown in FIG. 4 is a process for obtaining an optical flow for one image. However, for example, when obtaining an optical flow for each of a plurality of images, processing may be performed for each of the plurality of images according to the flowchart of FIG.

ステップＳ４０１では、画像データ取得部３０１は、上記の第１の画像及び第２の画像を取得する。以下では、２枚の画像のみを取得する場合について説明するが、複数枚の画像や動画像を取得しても構わない。画像が３枚以上ある場合や動画像の場合は、対象となる２枚の画像、もしくはフレームを選択して以降の処理を進める。 In step S401, the image data acquisition unit 301 acquires the first image and the second image described above. Hereinafter, the case where only two images are acquired will be described, but a plurality of images or moving images may be acquired. If there are three or more images or a moving image, select the two target images or frames and proceed with the subsequent processing.

ステップＳ４０２では、画像縮小部３０２は、第１の画像Ｉ_１を縮小率ｓｃａｌｅ＿ｆａｃｔｏｒ（以下ｓｆと呼称する：０＜ｓｆ＜１）に従って再帰的に縮小して複数枚の縮小画像を生成する。更に画像縮小部３０２は、第２の画像Ｉ_２を縮小率ｓｆに従って再帰的に縮小して複数枚の縮小画像を生成する。具体的には、画像縮小部３０２は先ず、Ｉ_１及びＩ_２から生成する縮小画像の枚数である最大階層数（ｍａｘ＿ｌｖ）を取得する。最大階層数ｍａｘ＿ｌｖは予め画像処理装置１００に設定されていたものであっても良いし、ユーザに入力させても良い。本実施形態では、Ｉ_１（Ｉ_２）を縮小率ｓｆに従ってｍａｘ＿ｌｖ回縮小した縮小画像のサイズ（縦及び／又は横サイズ）がＩ_１（Ｉ_２）のサイズ（縦及び／又は横サイズ）の５％以下となるまで縮小を行うものとした。然るにこの場合、以下の式１に示す如く、ｍａｘ＿ｌｖ＝５となる。 In step S402, the image reduction unit 302 _{recursively reduces the first image I 1 according} to the reduction ratio scale_factor (hereinafter referred to as sf: 0 <sf <1) to generate a plurality of reduced images. Further, the image reduction unit 302 _{recursively reduces the second image I 2} according to the reduction ratio sf to generate a plurality of reduced images. Specifically, the image reduction unit 302 first acquires the maximum number of layers (max_lv), which is the number of reduced images generated from _{I 1} and I _2. The maximum number of layers max_lv may be set in advance in the image processing device 100, or may be input by the user. In the present embodiment, _{the size (vertical and / or horizontal size) of the reduced image obtained by reducing I 1} (I ₂ ) max_lv times according to the reduction ratio sf is the size (vertical and / or horizontal size) of I ₁ (I ₂ ). It was decided to reduce the size until it became 5% or less. However, in this case, max_lv = 5 as shown in the following equation 1.

以下では、Ｉ_１を縮小率ｓｆに従ってｌｖ（ｌｖは０〜ｍａｘ＿ｌｖを満たす整数）回縮小した縮小画像をＩ_１［ｌｖ］と表記する。また、Ｉ_２を縮小率ｓｆに従ってｌｖ回縮小した縮小画像をＩ_２［ｌｖ］と表記する。つまり、Ｉ_１＝Ｉ_１［０］、Ｉ_２＝Ｉ_２［０］である。Ｉ_１に対するＩ_１［ｌｖ］の縮小率ｓ（Ｉ_２に対するＩ_２［ｌｖ］の縮小率ｓ）は以下の式２で表される。 In the following, _{a reduced image obtained by reducing I 1} by lv (where lv is an integer satisfying 0 to max_lv) according to the reduction ratio sf is referred to as I ₁ [lv]. Further, a reduced image obtained by reducing lv times according reduction ratio sf the _{I 2} is expressed as _I 2 [lv]. That is, I ₁ = I ₁ [0] and I ₂ = I ₂ [0]. Reduction ratio of _I 1 [lv] for I ₁ s (reduction ratio s of _I 2 [lv] for _{I 2)} is expressed by the following equation 2.

つまり、Ｉ_１を縮小率ｓに従って縮小することでＩ_１［ｌｖ］が得られ、Ｉ_２を縮小率ｓに従って縮小することでＩ_２［ｌｖ］が得られる。以下では一例として、ｓｆ＝０．５であるものとするが、ｓｆの値は０より大きく１未満であれば如何なる値であっても良い。なお、Ｉ_１［ｍａｘ＿ｌｖ］（Ｉ_２［ｍａｘ＿ｌｖ］）のサイズは、画像間の動き検出対象の動きが大きいほど小さく設定すべきであるが、処理時間など様々な要素によって応じて最適な設定をすることが望ましい。また、画像の縮小処理の際には、バイキュービック法を用いても良いし、Ｌａｎｃｚｏｓ３−ｌｏｂｅｄ法などの方法を用いてもよい。 That is, I ₁ [lv] is obtained by reducing I ₁ according to the reduction ratio s, and I ₂ [lv] is obtained by _{reducing I 2 according to the reduction ratio s.} In the following, as an example, it is assumed that sf = 0.5, but any value may be used as long as the value of sf is greater than 0 and less than 1. The size of I ₁ [max_lv] (I ₂ [max_lv]) should be set smaller as the movement of the motion detection target between images is larger, but the optimum setting should be made according to various factors such as the processing time. It is desirable to do. Further, in the image reduction processing, a bicubic method may be used, or a method such as the Lanczos 3-loved method may be used.

ステップＳ４０３では、制御部３９９は、変数ｌｖの値にｍａｘ＿ｌｖを設定する。以下では、Ｉ_１［ｌｖ］及びＩ_２［ｌｖ］を階層ｌｖにおける画像、Ｉ_１［０］及びＩ_２［０］は最小階層における画像、Ｉ_１［ｍａｘ＿ｌｖ］及びＩ_２［ｍａｘ＿ｌｖ］は最大階層における画像と称する場合がある。 In step S403, the control unit 399 sets max_lv as the value of the variable lv. In the following, I ₁ [lv] and I ₂ [lv] are images in the hierarchy lv, I ₁ [0] and I ₂ [0] are images in the minimum hierarchy, and I ₁ [max_lv] and I ₂ [max_lv] are maximum. Sometimes referred to as an image in the hierarchy.

ステップＳ４０４では、制御部３９９は、ｌｖ＜ｍａｘ＿ｌｖであるか否かを判断する。この判断の結果、ｌｖ＜ｍａｘ＿ｌｖであれば、処理はステップＳ４０５に進み、ｌｖ＝ｍａｘ＿ｌｖであれば、処理はステップＳ４０８に進む。 In step S404, the control unit 399 determines whether or not lv <max_lv. As a result of this determination, if lv <max_lv, the process proceeds to step S405, and if lv = max_lv, the process proceeds to step S408.

ステップＳ４０８では、ＯＦ初期化部３０３は、階層ｍａｘ＿ｌｖにおけるオプティカルフローの全ての要素の値を０に初期化する。以下では、階層ｌｖにおけるオプティカルフローをＯＦ［ｌｖ］と表記する。ＯＦ［ｌｖ］の解像度はＩ_１［ｌｖ］、Ｉ_２［ｌｖ］の解像度と同じである。そして処理はステップＳ４０９に進む。 In step S408, the OF initialization unit 303 initializes the values of all the elements of the optical flow in the layer max_lv to 0. In the following, the optical flow in the layer lv is referred to as OF [lv]. Resolution OF [lv] is _I 1 _[lv], is the same as the resolution of I 2 [lv]. Then, the process proceeds to step S409.

一方、ステップＳ４０５でＯＦ拡大部３０７は、最近求めたオプティカルフロー（ＯＦ［ｌｖ＋１］）の各要素の値（動きベクトルの成分値）を１／ｓｆ倍してから、該オプティカルフローの縦横サイズを１／ｓｆ倍に拡大したＯＦ’［ｌｖ＋１］を生成する。拡大では、ＲＧＢ画像の拡大においてＲ、Ｇ，Ｂそれぞれの成分を独立して処理するのと同様に、動きベクトルのＸ成分、Ｙ成分を独立して処理する。この拡大には、バイリニア補間を用いても良いし、バイキュービック法等の他の方法を採用しても良い。ＯＦ［ｌｖ＋１］における要素ｕ（ｘ，ｙ）、ｖ（ｘ，ｙ）とＯＦ’［ｌｖ＋１］における要素ｕ’（ｘ，ｙ）、ｖ’（ｘ，ｙ）との関係を以下の式３に示す。 On the other hand, in step S405, the OF enlargement unit 307 multiplies the value (component value of the motion vector) of each element of the recently obtained optical flow (OF [lv + 1]) by 1 / sf, and then increases the vertical and horizontal sizes of the optical flow. Generate OF'[lv + 1] magnified 1 / sf times. In the enlargement, the X component and the Y component of the motion vector are processed independently in the same manner as the R, G, and B components are processed independently in the enlargement of the RGB image. Bilinear interpolation may be used for this expansion, or another method such as a bicubic method may be adopted. The relationship between the elements u (x, y) and v (x, y) in OF [lv + 1] and the elements u'(x, y) and v'(x, y) in OF'[lv + 1] is expressed by the following equation 3 Shown in.

ステップＳ４０６では、画像変形部３０５は、Ｉ_２［０］〜Ｉ_２［ｍａｘ＿ｌｖ］のうちＩ_２［ｌｖ］を選択し、該選択したＩ_２［ｌｖ］内の各画素を、ＯＦ’［ｌｖ＋１］に従って移動（ワーピング）させた画像Ｉ_２ｗ［ｌｖ］を生成する。つまり、以下の式４に示す如く、Ｉ_２［ｌｖ］内の画素位置（ｘ、ｙ）における画素を、ＯＦ’［ｌｖ＋１］内のｕ’（ｘ，ｙ）、ｖ’（ｘ，ｙ）によって規定される動きベクトルの方向に、該動きベクトルの長さだけ移動させた画像Ｉ_２ｗ［ｌｖ］を生成する。 In step S406, the image transforming unit 305 _{selects I 2} [lv] from _{I 2} [0] to I ₂ [max_lv], and sets _{each pixel in the selected I 2} [lv] to OF'[lv + 1]. ], The image I _2w [lv] moved (warping) is generated. That is, as shown in the following equation 4, _{the pixels at the pixel positions (x, y) in I 2} [lv] are the u'(x, y), v'(x, y) in OF'[lv + 1]. _{The image I 2w} [lv] is generated by moving the motion vector in the direction defined by the motion vector by the length of the motion vector.

ステップＳ４０７では、ＯＦ平滑化部３０４は、ステップＳ４０５で生成したオプティカルフローＯＦ’［ｌｖ＋１］に対して平滑化フィルタを適用して、平滑化処理済みのオプティカルフローＯＦ”［ｌｖ＋１］を生成する。平滑化フィルタとしては、例えば、平均フィルタや、ジョイントバイラテラルフィルタなどを用いることができる。ジョイントバイラテラルフィルタを用いる場合は、Ｉ_１［ｌｖ］の画素値を参照することで、被写体境界の再現性を向上させることができる。また、メディアンフィルタなどの非線形フィルタを用いてもよい。つまり、オプティカルフローＯＦ’［ｌｖ＋１］を平滑化できる手法であれば、如何なる手法を用いても構わない。本実施形態では、フィルタサイズが７ｘ７の平均フィルタを用いてオプティカルフローＯＦ’［ｌｖ＋１］に対する平滑化処理を行うものとする。 In step S407, the OF smoothing unit 304 applies a smoothing filter to the optical flow OF'[lv + 1] generated in step S405 to generate a smoothed optical flow OF "[lv + 1]. As the smoothing filter, for example, an average filter, a joint bilateral filter, or the like can be used. When a joint bilateral filter is used, the subject boundary is reproduced by referring to the pixel value of _{I 1 [lv].} In addition, a non-linear filter such as a median filter may be used. That is, any method may be used as long as it can smooth the optical flow OF'[lv + 1]. In the embodiment, it is assumed that the smoothing process for the optical flow OF'[lv + 1] is performed using an average filter having a filter size of 7x7.

ステップＳ４０９では、エネルギー関数生成部３０６は、Ｉ_１［ｌｖ］とＩ_２ｗ［ｌｖ］との差分である第１の差分と、ＯＦ’［ｌｖ＋１］とＯＦ”［ｌｖ＋１］との差分である第２の差分と、に基づく関数であるエネルギー関数を生成する。ステップＳ４０９における処理の詳細については後述する。 In step S409, the energy function generation unit 306 is the _first difference, which is the difference between I 1 [lv] and I _2w [lv], and the difference between OF'[lv + 1] and OF "[lv + 1]. An energy function, which is a function based on the difference between 2 and 2, is generated. Details of the processing in step S409 will be described later.

ステップＳ４１０では、ＯＦ算出部３０８は、ステップＳ４０９で生成したエネルギー関数を極小化するようなオプティカルフローＯＦ［ｌｖ］を生成する。ステップＳ４１０における処理の詳細については後述する。 In step S410, the OF calculation unit 308 generates an optical flow OF [lv] that minimizes the energy function generated in step S409. The details of the process in step S410 will be described later.

ステップＳ４１１では、制御部３９９は、変数ｌｖの値が０であるか否かを判断する。この判断の結果、変数ｌｖの値が０であれば、ＯＦ算出部３０８は、ステップＳ４１０で生成したオプティカルフローＯＦ［０］を、画像Ｉ_１を基準とする画像Ｉ_２のオプティカルフローとして出力する。ＯＦ算出部３０８によるオプティカルフローＯＦ［０］の出力先については画像処理装置１００内のメモリや外部のメモリ、外部の装置など、特定の出力先に限るものではない。そして図４のフローチャートに従った処理は終了する。 In step S411, the control unit 399 determines whether or not the value of the variable lv is 0. As a result of this determination, if the value of the variable lv is 0, the OF calculation unit 308 outputs the optical flow OF [0] generated in step S410 as the optical flow of the image I ₂ _{with reference to the image I 1.} .. The output destination of the optical flow OF [0] by the OF calculation unit 308 is not limited to a specific output destination such as a memory in the image processing device 100, an external memory, or an external device. Then, the process according to the flowchart of FIG. 4 is completed.

一方、変数ｌｖの値が０ではない場合には、処理はステップＳ４１２に進む。ステップＳ４１２では、制御部３９９は、変数ｌｖの値を１つデクリメントし、その後、処理はステップＳ４０４に進む。 On the other hand, if the value of the variable lv is not 0, the process proceeds to step S412. In step S412, the control unit 399 decrements the value of the variable lv by one, and then the process proceeds to step S404.

次に、上記のステップＳ４０９における処理の詳細について説明する。エネルギー関数を最小化するようにオプティカルフローを推定する方法は、一般的に勾配法と呼ばれる。基本となるのはデータタームと呼ばれる項であり、データタームは以下の式で定義される。 Next, the details of the process in step S409 will be described. The method of estimating the optical flow so as to minimize the energy function is generally called the gradient method. The basis is a term called a data term, and the data term is defined by the following formula.

ｆは、Ｉ_１とＩ_２ｗとの差分を求める関数であり、Ｉ_１とＩ_２ｗとの差の絶対値を求める関数であっても良いし、Ｉ_１とＩ_２ｗとの差の二乗を求める関数であっても良い。勾配法のエネルギー関数は主に２種類に分類することができる。 f is a function for obtaining the difference between _{I 1} and _{I 2w,} may be a function for obtaining the absolute value of the difference between _{I 1} and _{I 2w,} obtains the square of the difference between _{I 1} and _{I 2w} It may be a function. The energy functions of the gradient method can be classified into two main types.

一つ目は、データタームをあるパッチの範囲で総和をとったものをエネルギー関数と定義するタイプであり、以下の式６で定義される。以下、この手法をパッチベースの手法と呼称する。パッチベースの手法では、画素ごとに以下のエネルギー関数を最小にするオプティカルフローを算出する。 The first is a type in which the sum of data terms in a certain patch range is defined as an energy function, which is defined by the following equation 6. Hereinafter, this method will be referred to as a patch-based method. In the patch-based method, the optical flow that minimizes the following energy function is calculated for each pixel.

ここで、Ｂは画素位置（ｘ、ｙ）を中心としたパッチ領域を表しており、例えば７×７のパッチを考えた場合、ｐはｘ−３からｘ＋３まで、ｑはｙ−３からｙ＋３までの整数値をとる。この手法の利点は、ρとして例えば差分２乗を採用した場合、最小となるオプティカルフローを解析的に求めることができる点である。一方で、推定されるオプティカルフローは正解から外れた値になることが多く、高精度に推定することが困難である。 Here, B represents a patch region centered on the pixel position (x, y). For example, when considering a 7 × 7 patch, p is from x-3 to x + 3, and q is from y-3 to y + 3. Takes an integer value up to. The advantage of this method is that the minimum optical flow can be obtained analytically when, for example, the difference square is adopted as ρ. On the other hand, the estimated optical flow often deviates from the correct answer, and it is difficult to estimate it with high accuracy.

二つ目は、上記の問題を解決するために、拘束条件として、オプティカルフローを滑らかにするための平滑化項を追加する。エネルギー関数は以下の式で定義されることが多い。 Second, in order to solve the above problem, a smoothing term for smoothing the optical flow is added as a constraint condition. The energy function is often defined by the following equation.

ここで、λは適当な重み係数であり、∇ｕ，∇ｖはオプティカルフローの勾配である。パッチベースの手法では、Σはパッチ領域内の和をとっていたが、ここでは全体画素の和をとる。ｇは平滑化項であり、ＴＶノルムや、Ｌ２ノルムを用いることが多い。勾配は、例えば以下の式で算出される。 Here, λ is an appropriate weighting coefficient, and ∇u and ∇v are the gradients of the optical flow. In the patch-based method, Σ is the sum within the patch area, but here it is the sum of all pixels. g is a smoothing term, and a TV norm or an L2 norm is often used. The gradient is calculated by, for example, the following formula.

平滑化項を用いた手法では、式７で表されるような画像全体のエネルギー関数を最小化するように全ての画素のオプティカルフローを最適化する。以下、この手法をエネルギー最適化法と呼称する。エネルギー最適化法は、精度のよいオプティカルフローを求めることができる一方で、最適化を行うために反復計算が必要となり、演算量が増大するという課題がある。 In the method using the smoothing term, the optical flow of all pixels is optimized so as to minimize the energy function of the entire image as represented by Equation 7. Hereinafter, this method is referred to as an energy optimization method. While the energy optimization method can obtain an accurate optical flow, it has a problem that iterative calculation is required to perform the optimization and the amount of calculation increases.

本実施形態では、パッチベースの手法、エネルギー最適化法のそれぞれの問題点に鑑み、パッチベースの手法に擬似的な平滑化項を追加してエネルギー最適化法の考え方を取り入れつつ、パッチベースの手法とほぼ同等の演算量でオプティカルフローを推定する。本実施形態に係るエネルギー関数を以下の式９に示す。 In this embodiment, in view of the problems of the patch-based method and the energy optimization method, a pseudo-smoothing term is added to the patch-based method to incorporate the concept of the energy optimization method, and the patch-based method is used. Estimate the optical flow with almost the same amount of calculation as the method. The energy function according to this embodiment is shown in Equation 9 below.

式９のエネルギー関数は、画素位置（ｘ、ｙ）に対するものである。なお、式９ではφ（）についてはパッチ内の総和を計算していないが、ρ（）と同様にパッチ内の総和を計算しても良い。式９におけるρ（）、φ（）を、以下の式１０に示す。 The energy function of Equation 9 is for pixel positions (x, y). In Equation 9, the sum in the patch is not calculated for φ (), but the sum in the patch may be calculated in the same manner as in ρ (). Ρ () and φ () in Equation 9 are shown in Equation 10 below.

式１０においてｐ、ｑは、画素位置（ｘ、ｙ）を中心とするパッチ領域内のｘ座標値、ｙ座標値を示す。ステップＳ４１０では、Ｅ（ｘ、ｙ）が極小（最小）となるｄｕ［ｌｖ］（ｘ、ｙ）、ｄｖ［ｌｖ］（ｘ、ｙ）を、画像Ｉ_２［ｌｖ］に対応するオプティカルフローにおいて、画像Ｉ_２［ｌｖ］中の画素位置（ｘ、ｙ）に対する動きベクトルのＸ成分及びＹ成分として求める。 In Equation 10, p and q indicate the x-coordinate value and the y-coordinate value in the patch region centered on the pixel position (x, y). In step S410, du [lv] (x, y) and dv [lv] (x, y) in which E (x, y) becomes the minimum (minimum) are set in the optical flow corresponding to the _{image I 2 [lv].} , Obtained as the X and Y components of the motion vector with respect to the pixel position (x, y) in the image I _{2 [lv].}

ρ（ｐ、ｑ）は、画像Ｉ_２ｗ［ｌｖ］中の画素位置（ｐ、ｑ）からｄｕ［ｌｖ］（ｘ、ｙ）、ｄｖ［ｌｖ］（ｘ、ｙ）によって規定される動きベクトルの分だけ移動させた画素位置の画素値と、画像Ｉ_１［ｌｖ］中の画素位置（ｐ、ｑ）における画素値との差の二乗を表している。なお、ρ（）は、差の二乗に限らず、差の絶対値等、「画像Ｉ_２ｗ［ｌｖ］中の画素位置（ｐ、ｑ）からｄｕ［ｌｖ］（ｘ、ｙ）、ｄｖ［ｌｖ］（ｘ、ｙ）によって規定される動きベクトルの分だけ移動させた画素位置の画素値と、画像Ｉ_１［ｌｖ］中の画素位置（ｐ、ｑ）における画素値との差」を表す様々な式を適用しても構わない。 ρ (p, q) is a motion vector defined by du [lv] (x, y) and dv [lv] (x, y) from the pixel position (p, q) in the image I _{2w [lv].} It represents the square of the difference between the pixel value of the pixel position moved by the minute and the pixel value _{at the pixel position (p, q) in the image I 1 [lv].} Note that ρ () is not limited to the square of the difference, but the absolute value of the difference, etc., is "du [lv] (x, y), dv [lv] from the pixel position (p, q) in the _{image I 2w [lv].} ] (X, y), the difference between the pixel value of the pixel position moved by the motion vector and the pixel value at the pixel position (p, q) in _{the image I 1 [lv] ”.} Expression may be applied.

式１０においてφ（ｘ、ｙ）は、ＯＰ’［ｌｖ＋１］におけるＸ成分であるｕ’（ｘ、ｙ）にｄｕ［ｌｖ］（ｘ、ｙ）を加えたものと、ＯＰ”［ｌｖ＋１］におけるＸ成分であるｕ_ａｖｅ（ｘ、ｙ）と、の差の二乗と、ＯＰ’［ｌｖ＋１］におけるＹ成分であるｖ’（ｘ、ｙ）にｄｖ［ｌｖ］（ｘ、ｙ）を加えたものと、ＯＰ”［ｌｖ＋１］におけるＹ成分であるｖ_ａｖｅ（ｘ、ｙ）と、の差の二乗と、の和を表している。なお、φ（）は、差の二乗和に限らず、例えば、前者の差の絶対値と後者の差の絶対値との和であっても良い。 In Equation 10, φ (x, y) is the sum of u'(x, y), which is the X component in OP'[lv + 1], plus du [lv] (x, y), and in OP'[lv + 1]. _{The square of the difference between u ave} (x, y), which is the X component, and v'(x, y), which is the Y component in OP'[lv + 1], plus dv [lv] (x, y). It represents the sum of and the square of the difference between _{v ave} (x, y), which is the Y component in OP ”[lv + 1]. Note that φ () is not limited to the sum of squares of differences, and may be, for example, the sum of the absolute value of the former difference and the absolute value of the latter difference.

エネルギー関数にφ（）の項を加えることで、ｕ_ａｖｅ（ｘ、ｙ）、ｖ_ａｖｅ（ｘ、ｙ）はもともとのオプティカルフローに比べて滑らかで、外れ値が抑制された結果となるので、ｕ’とｕ_ａｖｅの値が乖離しないように推定値が算出され、この項が平滑化項としての役割を果たす。これはｖについても同様である。 By adding the term φ () to the energy function, u _ave (x, y) and v _ave (x, y) are smoother than the original optical flow, resulting in suppressed outliers. Estimated values are calculated so that the values of u'and u _ave do not deviate, and this term serves as a smoothing term. This also applies to v.

上記の式９においてλ＝０の場合は、階層型のＬｕｃａｓーＫａｎａｄｅ法に帰着する。ここで、上記のｄｕ、ｄｖが小さいとして、ρをテイラー展開すると、以下の式１１が得られる。 When λ = 0 in the above equation 9, it is reduced to the hierarchical Lucas-Kanade method. Here, assuming that the above du and dv are small, the Taylor expansion of ρ gives the following equation 11.

ここで、Ｉ_２ｘｗは、式４におけるＩ_２の代わりに画像Ｉ_２ｗのｘ方向の１次偏微分画像、式４におけるＩ_２ｗの代わりにＩ_２ｘｗを当てはめて計算されるものである。同様に、Ｉ_２ｙｗは、式４におけるＩ_２の代わりに画像Ｉ_２ｗのｙ方向の１次偏微分画像、式４におけるＩ_２ｗの代わりにＩ_２ｙｗを当てはめて計算されるものである。画像Ｉの１次偏微分は、例えば以下の式１２で求めることが可能である。 _{Here, I 2Xw} are those calculated by applying 1 Tsugihen differential image in the x direction of the image _{I 2w} instead of _{I 2} in Formula 4, the _{I 2Xw} instead of _{I 2w} in Equation 4. _{Similarly, I 2Yw} are those calculated by applying 1 Tsugihen differential image in the y direction of the image _{I 2w} instead of _{I 2} in Formula 4, the _{I 2Yw} instead of _{I 2w} in Equation 4. The first partial differential of the image I can be obtained, for example, by the following equation 12.

それ以外にも、水平、垂直のＳｏｂｅｌフィルタなどを作用させて求めてもよい。求めるべき解析解ｄｕ、ｄｖは以下の連立方程式を満たす。なお、式１４、１５は階層によらないため、階層表記は省いている。 In addition to that, a horizontal or vertical Sobel filter or the like may be applied to obtain the result. The analytical solutions du and dv to be obtained satisfy the following simultaneous equations. Since equations 14 and 15 do not depend on the hierarchy, the hierarchy notation is omitted.

式１３の両辺にＡの逆行列をかけることで、ｄｕ、ｄｖを求めることができる。このように、本実施形態によれば、前の階層のオプティカルフローに対して平滑化した結果と、算出するオプティカルフローとの差分が小さくなるようにエネルギーを極小化することで、演算量を増加させることなく、精度を向上させることができる。 By multiplying both sides of Equation 13 by the inverse matrix of A, du and dv can be obtained. As described above, according to the present embodiment, the amount of calculation is increased by minimizing the energy so that the difference between the smoothed result of the optical flow of the previous layer and the calculated optical flow becomes small. The accuracy can be improved without causing the problem.

［第２の実施形態］
以下では、第１の実施形態との差分について重点的に説明し、以下で特に触れない限りは第１の実施形態と同様であるものとする。第１の実施形態では、エネルギー関数に使用するオプティカルフローは、現階層ｌｖよりも１つ上の階層（ｌｖ＋１）におけるオプティカルフローを使用した。これに対し、本実施形態では、現フレームよりも１つ前のフレームの画像について求めたオプティカルフローをエネルギー関数に使用する。以下では、現フレームの画像Ｉ_２に対するオプティカルフローを、該フレームよりも１フレーム前の画像Ｉ_１について求めたオプティカルフローを使用して求める例について説明する。 [Second Embodiment]
In the following, the differences from the first embodiment will be mainly described, and unless otherwise specified below, the same as the first embodiment. In the first embodiment, the optical flow used for the energy function is the optical flow in the layer (lv + 1) one level higher than the current layer lv. On the other hand, in the present embodiment, the optical flow obtained for the image of the frame immediately before the current frame is used for the energy function. In the following, an example will be described in which the optical flow for _{the image I 2} _{of the current frame is obtained by using the optical flow obtained for the image I 1} one frame before the frame.

本実施形態に係る画像処理装置の機能構成例、画像Ｉ_２に対するオプティカルフローを求めるために画像処理装置１００が行う処理について、図５のブロック図、図６のフローチャートを用いて説明する。なお、図５において、図３に示した機能部と同じ機能部には同じ参照番号を付しており、該機能部に係る説明は省略する。また、図６のフローチャートにおいて、図４に示した処理ステップと同じ処理ステップには同じステップ番号を付しており、該処理ステップに係る説明は省略する。なお、図６に示したフローチャートに従った処理は、１枚の画像に対するオプティカルフローを求めるための処理である。然るに、例えば、複数枚の画像のそれぞれについてオプティカルフローを求める場合には、該複数の画像のそれぞれについて図６のフローチャートに従った処理を行えばよい。 An example of the functional configuration of the image processing apparatus according to the present embodiment and the processing _{performed by the image processing apparatus 100 for obtaining the optical flow for the image I 2} will be described with reference to the block diagram of FIG. 5 and the flowchart of FIG. In FIG. 5, the same functional unit as the functional unit shown in FIG. 3 is assigned the same reference number, and the description relating to the functional unit will be omitted. Further, in the flowchart of FIG. 6, the same processing step as the processing step shown in FIG. 4 is assigned the same step number, and the description relating to the processing step will be omitted. The process according to the flowchart shown in FIG. 6 is a process for obtaining an optical flow for one image. However, for example, when obtaining an optical flow for each of a plurality of images, processing may be performed for each of the plurality of images according to the flowchart of FIG.

ステップＳ６０１では、ＯＦ変形部５０１は、画像Ｉ_１について過去に求めたオプティカルフローを、画像Ｉ_２のオプティカルフローを生成するためのエネルギー関数に使用する参考オプティカルフローに変換する。この変換方法には様々な方法が考えられる。 In step S601, the OF deformer 501 converts the previously obtained optical flow for _{image I 1} into a reference optical flow used for the energy function to generate the optical flow for _{image I 2.} Various methods can be considered for this conversion method.

例えば、画像Ｉ_１について求めたオプティカルフローは、画像Ｉ_１よりも１フレーム前の画像Ｉ_０に対する画像Ｉ_１のオプティカルフローであり、該オプティカルフローの要素は、画像Ｉ_０からの動きベクトルを表している。ここで、フレーム間の時間間隔が充分に短い場合、画像中のオブジェクトの動きは等速直線運動と見なせるため、画像Ｉ_１について求めたオプティカルフローの各要素を、該オプティカルフローの要素が示す動きベクトルに従って移動させたものを、上記の参考オプティカルフローとして使用することができる。この移動により、参考オプティカルフローには、動きベクトルが格納されない要素が存在する可能性があるため、そのような要素はフィルタ処理などによって周囲の動きベクトルから穴埋めする。 For example, the optical flow obtained for the image I ₁ is the optical flow of the image I _{1 with} respect to _{the image I 0} one frame before the _{image I 1} , and the element of the optical flow represents the motion vector from the _{image I 0.} ing. Here, when the time interval between frames is sufficiently short, the movement of the object in the image can be regarded as a constant velocity linear motion. Therefore, _{each element of the optical flow obtained for the image I 1} is the movement indicated by the element of the optical flow. The one moved according to the vector can be used as the above reference optical flow. Due to this movement, there may be an element in the reference optical flow in which the motion vector is not stored. Therefore, such an element is filled in from the surrounding motion vector by filtering or the like.

なお、画像Ｉ_１を基準とした画像Ｉ_０のオプティカルフローが得られている場合には、このオプティカルフローの要素の符号を逆にしたものを上記の参考オプティカルフローとしても良い。 In the case where the optical flow of the image I ₀ relative to the image I ₁ has been obtained, those in which the sign of the elements of this optical flow in the opposite may be the above references optical flow.

参考オプティカルフローを得るための処理について、図７を例にとり説明する。画像７０１〜７０３はそれぞれ画像Ｉ_０〜Ｉ_２であり、何れの画像にも人物２０３及び家２０４が含まれている。 Reference The process for obtaining the optical flow will be described by taking FIG. 7 as an example. Images 701 to 703 are images I _{0 to} I ₂ , respectively, and each image includes a person 203 and a house 204.

画像Ｉ_０を基準とした画像Ｉ_１における人物２０３の動きベクトル７１３を該動きベクトル７１３の分だけ移動させた動きベクトルを、画像Ｉ_１を基準とした画像Ｉ_２における人物２０３の動きベクトル７０７として求める。もし、画像Ｉ_１を基準とした画像Ｉ_０における人物２０３の動きベクトル７０５が得られている場合には、これを反転させたものを動きベクトル７０７としても良い。画像Ｉ_０を基準とした画像Ｉ_１における家２０４の動きベクトル７０４を該動きベクトル７０４の分だけ移動させた動きベクトルを、画像Ｉ_１を基準とした画像Ｉ_２における家２０４の動きベクトル７０８として求める。もし、画像Ｉ_１を基準とした画像Ｉ_０における家２０４の動きベクトル７０６が得られている場合には、これを反転させたものを動きベクトル７０８としても良い。このようにして求めた動きベクトル７０７，７０８が上記の参考オプティカルフローとなる。 By the amount motion vector obtained by the movement of the motion vector 713 a-out animal vector 713 of a person 203 in the image _{I 1} on the basis of the image _{I 0,} as the motion vector 707 of a person 203 in the image _{I 2} relative to the image _{I 1} Ask. If the motion vector 705 of the person 203 in the image I ₀ with respect to the image I ₁ is obtained, the motion vector 707 may be an inverted version of the motion vector 705. The motion vector obtained by moving the motion vector 704 of the house 204 in the image I ₁ with respect to the image I ₀ by the amount of the motion vector 704 is used as the motion vector 708 of the house 204 in the image I ₂ _{with respect to the image I 1.} Ask. If the motion vector 706 of the house 204 in the image I ₀ with respect to the image I ₁ is obtained, the motion vector 708 may be an inverted version of the motion vector 706. The motion vectors 707 and 708 obtained in this way serve as the above-mentioned reference optical flow.

図６に戻って、次にステップＳ６０２では、ＯＦ平滑化部３０４は、ステップＳ６０１で生成した参考オプティカルフローに対して、第１の実施形態で説明したオプティカルフローに対する平滑化処理を行う。 Returning to FIG. 6, next, in step S602, the OF smoothing unit 304 performs a smoothing process on the optical flow described in the first embodiment with respect to the reference optical flow generated in step S601.

ステップＳ６０３では、ＯＦ縮小部５０２は、ステップＳ６０２で平滑化処理を施した参考オプティカルフローの各要素の値をｓｆ^ｌｖ倍してから、該参考オプティカルフローの縦横サイズをｓｆ^ｌｖ倍に縮小したオプティカルフローを生成する。 In step S603, the OF reduction unit 502 multiplies the value of each element of the reference optical flow smoothed in step S602 by sf ^lv , and then reduces the vertical and horizontal size of the reference optical flow to sf ^{lv times.} Generate a flow.

そして以降は、ステップＳ６０３で生成したオプティカルフローのｕ（ｘ，ｙ）、ｖ（ｘ，ｙ）をｕ_ａｖｅ（ｘ，ｙ）、ｖ_ａｖｅ（ｘ，ｙ）として使用してエネルギー関数を構成する以外は第１の実施形態と同様である。なお、図６のフローチャートでは、全ての階層について、ステップＳ６０３で生成したオプティカルフローのｕ（ｘ，ｙ）、ｖ（ｘ，ｙ）をｕ_ａｖｅ（ｘ，ｙ）、ｖ_ａｖｅ（ｘ，ｙ）として使用してエネルギー関数を構成している。しかし、特定の階層、例えば、最終回層以外の階層については第１の実施形態と同様にしてエネルギー関数を構成し、最終階層については、ステップＳ６０３で生成したオプティカルフローのｕ（ｘ，ｙ）、ｖ（ｘ，ｙ）をｕ_ａｖｅ（ｘ，ｙ）、ｖ_ａｖｅ（ｘ，ｙ）として使用してエネルギー関数を構成しても良い。 After that, the u (x, y) and v (x, y) of the optical flow generated in step S603 are used as u _ave (x, y) and v _ave (x, y) to construct an energy function. Other than that, it is the same as that of the first embodiment. In the flowchart of FIG. 6, u (x, y) and v (x, y) of the optical flow generated in step S603 are set to u _ave (x, y) and v _ave (x, y) for all layers. To construct an energy function. However, for a specific layer, for example, a layer other than the final layer, the energy function is configured in the same manner as in the first embodiment, and for the final layer, u (x, y) of the optical flow generated in step S603. , V (x, y) may be used as u _ave (x, y), v _ave (x, y) to construct an energy function.

なお、第１の実施形態と同様に、前の階層のオプティカルフローを平滑化した結果をエネルギー関数に追加してもよい。ステップＳ６０３で生成したオプティカルフローのｕ（ｘ，ｙ）、ｖ（ｘ，ｙ）のそれぞれをｕ_ａｖｅ１（ｘ、ｙ）、ｖ_ａｖｅ１（ｘ、ｙ）、ＯＰ”［ｌｖ＋１］におけるＸ成分、Ｙ成分のそれぞれをｕ_ａｖｅ２（ｘ、ｙ）、ｖ_ａｖｅ２（ｘ、ｙ）とすると、エネルギー関数は以下のようになる。 As in the first embodiment, the result of smoothing the optical flow of the previous layer may be added to the energy function. Each of u (x, y) and v (x, y) of the optical flow generated in step S603 is u _ave1 (x, y), _vave1 (x, y), the X component in OP "[lv + 1], Y. _Assuming that each of the components is uave2 (x, y) and _vave2 (x, y), the energy function is as follows.

なお、式１６ではφ_１（）、φ_２（）についてはパッチ内の総和を計算していないが、ρ（）と同様にパッチ内の総和を計算しても良い。本実施形態によれば、オプティカルフローの時間的な連続性も考慮しつつ、演算量を抑えて高精度にオプティカルフローを算出することができる。なお、図４，６に示した全てのステップは上記の説明の通り上から順に実行されることに限らず、一部の処理ステップで順番を入れ替えても良いし、一部の処理ステップを並列に実行しても良い。 In Equation 16, _{the sum in the patch is not calculated for φ 1} () and φ ₂ (), but the sum in the patch may be calculated in the same manner as in ρ (). According to this embodiment, it is possible to calculate the optical flow with high accuracy while suppressing the amount of calculation while considering the temporal continuity of the optical flow. Note that all the steps shown in FIGS. 4 and 6 are not limited to being executed in order from the top as described above, and the order may be changed in some processing steps, or some processing steps may be performed in parallel. You may execute it.

［第３の実施形態］
第１，２の実施形態で説明したオプティカルフローの生成処理によって生成されたオプティカルフローは様々な用途に装用できる。オプティカルフローを算出することで、動いている被写体の特定や、カメラが動いている方向を推定することができる。このことにより、被写体の追跡や動画の防振など様々な用途に適用することが可能である。また、撮影した画像や動画に対し、映像効果を付与することも可能である。例えば、撮影した画像に対して、オプティカルフローの方向にブラーを付けることで、動きのある被写体を強調した躍動感のある画像を生成することができる。以下では、動画の防振と、ある特定のフレームに対して動きに基づいたブラーを付与する場合について説明する。 [Third Embodiment]
The optical flow generated by the optical flow generation process described in the first and second embodiments can be worn for various purposes. By calculating the optical flow, it is possible to identify a moving subject and estimate the direction in which the camera is moving. This makes it possible to apply it to various applications such as tracking a subject and vibration isolation of a moving image. It is also possible to add a video effect to the captured image or moving image. For example, by adding a blur in the direction of the optical flow to the captured image, it is possible to generate a dynamic image that emphasizes a moving subject. In the following, vibration isolation of moving images and a case of applying motion-based blur to a specific frame will be described.

動画の防振にオプティカルフローを用いる画像処理装置の機能構成例について、図８のブロック図を用いて説明する。図８の画像処理装置８００は、上記の画像処理装置１００内に納められた装置であっても良い。 An example of a functional configuration of an image processing device that uses an optical flow for vibration isolation of moving images will be described with reference to the block diagram of FIG. The image processing device 800 of FIG. 8 may be a device housed in the above-mentioned image processing device 100.

ＯＦデータ取得部８０１は、上記の画像処理装置１００が生成して出力したオプティカルフローを取得する。ＯＦデータ取得部８０１によるオプティカルフローの取得方法については特定の取得方法に限らない。例えば、画像処理装置１００から無線若しくは有線のネットワーク、若しくは有線と無線の組み合わせによるネットワークを介してオプティカルフローを取得しても良いし、外部の記憶装置に格納されているオプティカルフローを取得しても良い。 The OF data acquisition unit 801 acquires the optical flow generated and output by the image processing apparatus 100. The method of acquiring the optical flow by the OF data acquisition unit 801 is not limited to a specific acquisition method. For example, the optical flow may be acquired from the image processing device 100 via a wireless or wired network, or a network formed by a combination of wired and wireless, or the optical flow stored in an external storage device may be acquired. good.

算出部８０２は、ＯＦデータ取得部８０１が取得したオプティカルフローを用いてグローバルモーションを算出する。グローバルモーションとは、画像全体に対して最も支配的な動きの方向であり、一つのベクトルで表される。グローバルモーションは、例えばオプティカルフローのヒストグラムを生成して最頻値を取得することにより算出することが可能である。なお、画像全体の動きを算出することができれば、別の手法で算出しても構わない。 The calculation unit 802 calculates the global motion using the optical flow acquired by the OF data acquisition unit 801. Global motion is the most dominant direction of movement for the entire image and is represented by a single vector. Global motion can be calculated, for example, by generating a histogram of optical flow and acquiring the mode. If the movement of the entire image can be calculated, another method may be used for calculation.

平滑部８０３は、グローバルモーションの時間方向の高周波成分を除去する。これは、時間方向に対する、動画の振動を除去するためである。例えば、時間方向にフーリエ変換して高周波を除去したり、時間方向に平滑化フィルタを作用させることで実現することができる。 The smoothing portion 803 removes high frequency components in the time direction of the global motion. This is to eliminate the vibration of the moving image in the time direction. For example, it can be realized by Fourier transforming in the time direction to remove high frequencies or by operating a smoothing filter in the time direction.

防振部８０４は、各時刻のグローバルモーションに基づいて、画像データ取得部８０５が取得する各フレームの画像のうち対応する時刻の画像を電子的にシフトして位置合わせする。 The vibration isolation unit 804 electronically shifts and aligns the image at the corresponding time among the images of each frame acquired by the image data acquisition unit 805 based on the global motion at each time.

次に、動きに基づいたブラーを付与する画像処理装置の機能構成例について、図９のブロック図を用いて説明する。図９の画像処理装置９００は、上記の画像処理装置１００内に納められた装置であっても良い。図９において図８と同じ機能部には同じ参照番号を付しており、該機能部に係る説明は省略する。なお、以下では処理対象の画像を画像１として説明する。 Next, an example of a functional configuration of an image processing device that imparts blur based on movement will be described with reference to the block diagram of FIG. The image processing device 900 of FIG. 9 may be a device housed in the above-mentioned image processing device 100. In FIG. 9, the same functional unit as in FIG. 8 is assigned the same reference number, and the description of the functional unit will be omitted. In the following, the image to be processed will be described as image 1.

画像変形部９０１は、ｋ＝１〜ｎ−１としたとき、ＯＦデータ取得部８０１が取得したオプティカルフロー内の各要素（動きベクトルの成分）をｋ／ｎ倍した動きベクトルを用いて、式４に従って画像１をシフトしたシフト画像を生成する。例えば、ｎ＝１０とすると、ｋ＝１〜９に対して、ｎ−１枚分のシフトしたシフト画像を生成する。画像合成部９０２は、ｎ−１枚の変形画像と画像１とを画素毎に合成した合成画像を生成し、該合成画像の各画素の画素値をｎで除算することにより、ブラーが付与された画像を生成する。動きの大きな被写体ほどオプティカルフローベクトルが大きく、静止している被写体は、オプティカルフローベクトルが０になるため、動きが大きいほどブラーが発生した画像が生成される。本実施形態では、ｎとして固定値を用いたが、画像中のオプティカルフローの長さの最大値から決めてもよい。例えば、オプティカルフローの長さの最大値が５０ｐｉｘであれば、ｎ＝５０とする。また、ユーザーがブラーの強度を指定できる場合は、強度に応じてオプティカルフローをリスケールし、同様の処理を行ってもよい。例えば、ブラーの効果を強くする場合は、元のオプティカルフローを何倍かして処理を行えばよい。本実施形態によれば、オプティカルフローを用いることで、カメラ機能を高速化・高精度化したり、映像効果を付与することが可能になる。また、異なる撮像装置で同一時刻に撮影された画像の場合は、オプティカルフローから被写体の奥行きを算出することも可能である。 When k = 1 to n-1, the image deformation unit 901 uses a motion vector obtained by multiplying each element (component of the motion vector) in the optical flow acquired by the OF data acquisition unit 801 by k / n. A shift image in which the image 1 is shifted according to 4 is generated. For example, assuming that n = 10, shift images for n-1 images are generated for k = 1-9. The image synthesizing unit 902 generates a composite image obtained by synthesizing n-1 deformed images and image 1 for each pixel, and divides the pixel value of each pixel of the composite image by n to add blur. Generate an image. The larger the movement, the larger the optical flow vector, and for a stationary subject, the optical flow vector becomes 0. Therefore, the larger the movement, the more blurred the image is generated. In the present embodiment, a fixed value is used as n, but it may be determined from the maximum value of the length of the optical flow in the image. For example, if the maximum value of the optical flow length is 50 pix, n = 50. If the user can specify the intensity of the blur, the optical flow may be rescaled according to the intensity and the same processing may be performed. For example, if you want to increase the effect of blurring, you can multiply the original optical flow by several times. According to the present embodiment, by using the optical flow, it is possible to increase the speed and accuracy of the camera function and to add a video effect. Further, in the case of images taken at the same time by different imaging devices, it is possible to calculate the depth of the subject from the optical flow.

［第４の実施形態］
図３，５に示した画像処理装置１００を構成する各機能部は何れもハードウェアで実装しても良いが、ソフトウェア（コンピュータプログラム）で実装しても良い。後者の場合、このコンピュータプログラムを実行可能なプロセッサを有するコンピュータ装置は、上記の画像処理装置１００に適用することができる。画像処理装置１００に適用可能なコンピュータ装置のハードウェア構成例について、図１のブロック図を用いて説明する。 [Fourth Embodiment]
Each of the functional units constituting the image processing apparatus 100 shown in FIGS. 3 and 5 may be implemented by hardware, but may also be implemented by software (computer program). In the latter case, a computer device having a processor capable of executing this computer program can be applied to the image processing device 100 described above. An example of a hardware configuration of a computer device applicable to the image processing device 100 will be described with reference to the block diagram of FIG.

ＣＰＵ１０１は、ＲＡＭ１０２やＲＯＭ１０３に格納されているコンピュータプログラムやデータを用いて各種の処理を実行する。これによりＣＰＵ１０１は、コンピュータ装置全体の動作制御を行うと共に、画像処理装置１００が行うものとして上述した各処理を実行若しくは制御する。 The CPU 101 executes various processes using computer programs and data stored in the RAM 102 and the ROM 103. As a result, the CPU 101 controls the operation of the entire computer device, and executes or controls each of the above-described processes as performed by the image processing device 100.

ＲＡＭ１０２は、ＲＯＭ１０３や記憶部１０４からロードされたコンピュータプログラムやデータを格納するためのエリアを有する。更にＲＡＭ１０２は、ＣＰＵ１０１が各種の処理を実行する際に用いるワークエリアを有する。このようにＲＡＭ１０２は、各種のエリアを適宜提供することができる。ＲＯＭ１０３には、書き換え不要の設定データやブートプログラムなどが格納されている。 The RAM 102 has an area for storing computer programs and data loaded from the ROM 103 and the storage unit 104. Further, the RAM 102 has a work area used by the CPU 101 when executing various processes. As described above, the RAM 102 can appropriately provide various areas. The ROM 103 stores setting data and a boot program that do not need to be rewritten.

記憶部１０４は、ハードディスクドライブ装置に代表される大容量情報記憶装置である。記憶部１０４には、ＯＳ（オペレーティングシステム）や、画像処理装置１００が行うものとして上述した各処理をＣＰＵ１０１に実行させるためのコンピュータプログラムやデータが保存されている。記憶部１０４に保存されているコンピュータプログラムには、図３，５に示した各機能部の機能をＣＰＵ１０１に実行させるためのコンピュータプログラムが含まれている。また、記憶部１０４に保存されているデータには、上記の説明において既知の情報として説明したものや、処理対象となる画像や動画像のデータが含まれている。記憶部１０４に保存されているコンピュータプログラムやデータは、ＣＰＵ１０１による制御に従って適宜ＲＡＭ１０２にロードされ、ＣＰＵ１０１による処理対象となる。 The storage unit 104 is a large-capacity information storage device typified by a hard disk drive device. The storage unit 104 stores an OS (operating system) and computer programs and data for causing the CPU 101 to execute each of the above-described processes as performed by the image processing device 100. The computer program stored in the storage unit 104 includes a computer program for causing the CPU 101 to execute the functions of the functional units shown in FIGS. 3 and 5. Further, the data stored in the storage unit 104 includes the data described as known information in the above description, and the data of the image or moving image to be processed. The computer programs and data stored in the storage unit 104 are appropriately loaded into the RAM 102 according to the control by the CPU 101, and are processed by the CPU 101.

なお、記憶部１０４としては、ハードディスクドライブ装置以外にも、ＣＤ−ＲＯＭやＤＶＤ−ＲＯＭ等の記憶媒体から情報を読み取る機器、フラッシュメモリ、ＵＳＢメモリなどのメモリ装置を適用することもできる。 In addition to the hard disk drive device, the storage unit 104 can also be applied to a device that reads information from a storage medium such as a CD-ROM or a DVD-ROM, or a memory device such as a flash memory or a USB memory.

出力インターフェース１０６には表示装置１０９が接続されている。表示装置１０９は、ＣＲＴや液晶画面、プロジェクタ装置などにより構成されており、ＣＰＵ１０１による処理結果を画像や文字などでもって表示もしくは投影することができる。 A display device 109 is connected to the output interface 106. The display device 109 is composed of a CRT, a liquid crystal screen, a projector device, and the like, and can display or project the processing result by the CPU 101 with images, characters, and the like.

ＣＰＵ１０１、ＲＡＭ１０２、ＲＯＭ１０３、記憶部１０４、出力インターフェース１０６は何れもバス１０７に接続されている。なお、図１に示した構成は、画像処理装置１００に適用可能なコンピュータ装置の構成の一例に過ぎない。 The CPU 101, RAM 102, ROM 103, storage unit 104, and output interface 106 are all connected to the bus 107. The configuration shown in FIG. 1 is only an example of the configuration of a computer device applicable to the image processing device 100.

また、図８，９に示した画像処理装置８００，９００の各機能部についても同様で、何れもハードウェアで実装しても良いが、ソフトウェア（コンピュータプログラム）で実装しても良い。後者の場合、このコンピュータプログラムを実行可能なプロセッサを有するコンピュータ装置は、上記の画像処理装置８００，９００として機能するので、このコンピュータ装置に図１に示した構成を適用可能であることはいうまでもない。また、画像処理装置８００や画像処理装置９００を画像処理装置１００内に納めた場合には、図１のコンピュータ装置は、画像処理装置８００や画像処理装置９００の機能をも実現することになる。 The same applies to the functional units of the image processing devices 800 and 900 shown in FIGS. 8 and 9, and all of them may be implemented by hardware, but may be implemented by software (computer program). In the latter case, a computer device having a processor capable of executing this computer program functions as the above-mentioned image processing devices 800, 900, and it goes without saying that the configuration shown in FIG. 1 can be applied to this computer device. Nor. Further, when the image processing device 800 and the image processing device 900 are housed in the image processing device 100, the computer device of FIG. 1 also realizes the functions of the image processing device 800 and the image processing device 900.

（その他の実施例）
本発明は、上述の実施形態の１以上の機能を実現するプログラムを、ネットワーク又は記憶媒体を介してシステム又は装置に供給し、そのシステム又は装置のコンピュータにおける１つ以上のプロセッサーがプログラムを読出し実行する処理でも実現可能である。また、１以上の機能を実現する回路（例えば、ＡＳＩＣ）によっても実現可能である。 (Other Examples)
The present invention supplies a program that realizes one or more functions of the above-described embodiment to a system or device via a network or storage medium, and one or more processors in the computer of the system or device reads and executes the program. It can also be realized by the processing to be performed. It can also be realized by a circuit (for example, ASIC) that realizes one or more functions.

３０１：画像データ取得部３０２：画像縮小部３０３：ＯＦ初期化部３０４：ＯＦ平滑化部３０５：画像変形部３０６：エネルギー関数生成部３０７：ＯＦ拡大部３０８：ＯＦ算出部 301: Image data acquisition unit 302: Image reduction unit 303: OF initialization unit 304: OF smoothing unit 305: Image deformation unit 306: Energy function generation unit 307: OF enlargement unit 308: OF calculation unit

Claims

A first set having a first image and a plurality of reduced images recursively reduced at a specified reduction rate as elements, and a second image and the second image reduced by the specified reduction. A second set whose elements are a plurality of reduced images recursively reduced by a rate, an acquisition means for acquiring, and an acquisition means.
A selection means for selecting images belonging to the second set in ascending order of image size, and
A moved selection image in which each pixel of the selected image selected by the selection means is moved according to a converted optical flow in which the optical flow corresponding to the image previously selected by the selection means is converted according to the size of the selected image. And the generation means to generate
Among the images belonging to the first set, the first difference, which is the difference between the image having the same size as the selected image and the moved selected image, and smoothing with respect to the converted optical flow and the converted optical flow. A calculation means for obtaining the second difference, which is the difference from the processed optical flow that has undergone the conversion process, and the optical flow that minimizes the evaluation value based on the optical flow, as the optical flow corresponding to the selected image.
An image processing apparatus including an output means for outputting an optical flow corresponding to the second image, which is obtained by the calculation means.

The converted optical flow obtained by converting the optical flow corresponding to the image previously selected by the selection means according to the size of the selected image is a motion vector which is an element of the optical flow corresponding to the image previously selected by the selection means. The image processing apparatus according to claim 1, wherein the component value and the size of the optical flow corresponding to the image selected last time by the selection means are converted according to the size of the selected image.

A first set having a first image and a plurality of reduced images recursively reduced at a specified reduction rate as elements, and a second image and the second image reduced by the specified reduction. A second set whose elements are a plurality of reduced images recursively reduced by a rate, an acquisition means for acquiring, and an acquisition means.
A selection means for selecting images belonging to the second set in ascending order of image size, and
A moved selection image in which each pixel of the selected image selected by the selection means is moved according to a converted optical flow in which the optical flow corresponding to the image previously selected by the selection means is converted according to the size of the selected image. And the generation means to generate
Among the images belonging to the first set, the first difference, which is the difference between the image having the same size as the selected image and the moved selected image, and the converted optical flow and the optical flow for the first image are shown. The second difference, which is the difference from the processed optical flow that has been converted according to the size of the selected image and then smoothed, and the optical flow that minimizes the evaluation value based on the second difference correspond to the selected image. The calculation method to be calculated as the optical flow to be performed,
An image processing apparatus including an output means for outputting an optical flow corresponding to the second image, which is obtained by the calculation means.

The image processing apparatus according to any one of claims 1 to 3, wherein any one of an average filter, a joint bilateral filter, and a median filter is used for the smoothing process.

In addition
Any one of claims 1 to 4, wherein the global motion in the image is obtained by using the optical flow output by the output means, and the vibration isolating means for shifting the image based on the obtained global motion is provided. The image processing apparatus according to the section.

In addition
A means for generating a plurality of optical flows from the optical flows output by the output means, generating a plurality of shift images obtained by shifting the images using the plurality of optical flows, and synthesizing the images and the plurality of shift images. The image processing apparatus according to any one of claims 1 to 4, wherein the image processing apparatus is characterized by the above.

Any one of claims 1 to 6, wherein each of the first image and the second image is an image captured at the same time by a plurality of image pickup devices or at different times from each other. The image processing apparatus according to.

The image according to any one of claims 1 to 6, wherein each of the first image and the second image is an image captured at different times by a single image pickup apparatus. Processing equipment.

This is an image processing method performed by an image processing device.
The acquisition means of the image processing apparatus includes a first set including a first image and a plurality of reduced images in which the first image is recursively reduced at a predetermined reduction rate, a second image, and the like. An acquisition step of acquiring a second set whose elements are a plurality of reduced images obtained by recursively reducing the second image at the specified reduction ratio.
A selection step in which the selection means of the image processing device selects images belonging to the second set in ascending order of image size.
The generation means of the image processing apparatus has converted each pixel of the selected image selected this time in the selection step into an optical flow corresponding to the image previously selected in the selection step according to the size of the selected image. A generation process that generates a moved selected image that has been moved according to the flow,
The calculation means of the image processing apparatus includes a first difference, which is a difference between an image having the same size as the selected image and the moved selected image among the images belonging to the first set, and the converted optical flow. The second difference, which is the difference between the converted optical flow and the processed optical flow that has been smoothed, and the optical flow that minimizes the evaluation value based on the second difference, are used as the optical flow corresponding to the selected image. The required calculation process and
An image processing method characterized in that the output means of the image processing apparatus includes an output step of outputting an optical flow corresponding to the second image obtained in the calculation step.

This is an image processing method performed by an image processing device.
The acquisition means of the image processing apparatus includes a first set including a first image and a plurality of reduced images in which the first image is recursively reduced at a predetermined reduction rate, a second image, and the like. An acquisition step of acquiring a second set whose elements are a plurality of reduced images obtained by recursively reducing the second image at the specified reduction ratio.
A selection step in which the selection means of the image processing device selects images belonging to the second set in ascending order of image size.
The generation means of the image processing apparatus has converted each pixel of the selected image selected this time in the selection step into an optical flow corresponding to the image previously selected in the selection step according to the size of the selected image. A generation process that generates a moved selected image that has been moved according to the flow,
The calculation means of the image processing apparatus includes a first difference, which is a difference between an image having the same size as the selected image and the moved selected image among the images belonging to the first set, and the converted optical flow. The evaluation value based on the second difference, which is the difference between the optical flow for the first image and the processed optical flow that has been subjected to the smoothing process after being converted according to the size of the selected image, is minimized. A calculation process for obtaining the optical flow as an optical flow corresponding to the selected image, and
An image processing method characterized in that the output means of the image processing apparatus includes an output step of outputting an optical flow corresponding to the second image obtained in the calculation step.

A computer program for causing a computer to function as each means of the image processing apparatus according to any one of claims 1 to 8.