EP1382017B1 - Image composition evaluation - Google Patents
Image composition evaluation Download PDFInfo
- Publication number
- EP1382017B1 EP1382017B1 EP01272086.8A EP01272086A EP1382017B1 EP 1382017 B1 EP1382017 B1 EP 1382017B1 EP 01272086 A EP01272086 A EP 01272086A EP 1382017 B1 EP1382017 B1 EP 1382017B1
- Authority
- EP
- European Patent Office
- Prior art keywords
- image
- interest
- crop
- composition
- region
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 238000011156 evaluation Methods 0.000 title claims description 29
- 238000000034 method Methods 0.000 claims description 36
- 230000008569 process Effects 0.000 claims description 15
- 238000012545 processing Methods 0.000 claims description 12
- 230000000007 visual effect Effects 0.000 claims description 7
- 238000004458 analytical method Methods 0.000 claims description 6
- 230000007717 exclusion Effects 0.000 claims description 5
- 230000004044 response Effects 0.000 claims description 5
- 239000003086 colorant Substances 0.000 description 11
- 238000005259 measurement Methods 0.000 description 5
- 230000000694 effects Effects 0.000 description 4
- 230000036961 partial effect Effects 0.000 description 4
- 238000005070 sampling Methods 0.000 description 4
- 238000009826 distribution Methods 0.000 description 3
- 230000006872 improvement Effects 0.000 description 3
- 230000003287 optical effect Effects 0.000 description 3
- 238000003909 pattern recognition Methods 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 2
- 230000001186 cumulative effect Effects 0.000 description 2
- 238000013442 quality metrics Methods 0.000 description 2
- 230000002829 reductive effect Effects 0.000 description 2
- 241000586542 Aonidiella citrina Species 0.000 description 1
- 241001677188 Coccus viridis Species 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000009432 framing Methods 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 238000005286 illumination Methods 0.000 description 1
- 238000003709 image segmentation Methods 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 230000008447 perception Effects 0.000 description 1
- 230000011514 reflex Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N1/00—Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
- H04N1/387—Composing, repositioning or otherwise geometrically modifying originals
- H04N1/3872—Repositioning or masking
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T3/00—Geometric image transformations in the plane of the image
- G06T3/40—Scaling of whole images or parts thereof, e.g. expanding or contracting
- G06T3/403—Edge-driven scaling; Edge-based scaling
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T3/00—Geometric image transformations in the plane of the image
- G06T3/40—Scaling of whole images or parts thereof, e.g. expanding or contracting
- G06T3/4038—Image mosaicing, e.g. composing plane images from plane sub-images
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/40—Analysis of texture
- G06T7/41—Analysis of texture based on statistical description of texture
- G06T7/44—Analysis of texture based on statistical description of texture using image operators, e.g. filters, edge density metrics or local histograms
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20092—Interactive image processing based on input by user
- G06T2207/20104—Interactive definition of region of interest [ROI]
Definitions
- the present invention relates to method of providing image composition evaluation, or advice, and to an image composition evaluation system utilising the method.
- the capturing of an image nearly always first involves composing the image prior to capture. This is normally accomplished through a view finder or other similar arrangement associated with the camera apparatus. Successfully composing an image to produce a pleasing picture is a skill normally acquired over time and is a skill that inexperienced camera users often find difficult to acquire. Even experienced photographers or camera users can sometimes overlook some compositional aspects of an image due to a lack of concentration at the time of capture of the image, or due to capturing the image in a hurry.
- a pleasingly composed image often follows one or more known composition rules.
- These rules include, for example, positioning elements of interest according to the "rule of thirds", ensuring that elements of interest are not positioned too close to the edge of the image frame, framing the main area of interest by placing areas of very little interest at the edge of the image, for example placing relatively dark areas, or areas containing relatively little activity, at the edge of the image, and positioning strong diagonal lines to run towards the main subject of the image.
- the "rule of thirds” involves subdividing the image area using two equally spaced horizontal lines and two equally spaced vertical lines (in the manner of a "noughts and crosses” board) and positioning elements of interest on the intersections of the horizontal and vertical lines, or equally placing strong horizontal or vertical features on one of the horizontal and vertical lines.
- composition index numbers in respect of an image viewed by a camera, and visually suggesting changes in the direction of pointing of the camera to improve the image composition.
- composition index numbers comprise:
- a method of providing an evaluation of the composition of an image comprising acquiring an image, processing the acquired image to evaluate its composition by applying one or more compositional rules, and providing a report concerning composition of the image such that the user has an opportunity to recompose the image prior to recording the image; wherein processing of the acquired image comprises:
- apparatus for evaluating the composition of an image comprising:
- Figure 1 schematically illustrates an image composition evaluation system constituting an embodiment of the present invention, in conjunction with a film camera system.
- a subject 1 is optically focussed by a lens system 3 which receives light from the subject and focuses it onto a view finder 5.
- the view finder 5 is part of both the image composition evaluation system and the optical camera system.
- the image 1 is also focused on a image converter 9.
- the image converter 9 may, for example, be a photo-detector array.
- the image converter 9 converts the optical image signals into an electrical image signal.
- an image processor 11 Connected to the image converter 9 is an image processor 11 that performs image composition evaluation on the received image.
- a warning signal is generated.
- the warning signal may be an audio warning signal produced using an audio loudspeaker 13, or alternatively the warning signal may be a visual warning signal produced, for example, by illuminating a warning lamp 15 or by giving a textural message.
- the visual warning may be visible through the image view finder 5. Equally, both an audio warning signal and visual warning signal may be generated.
- the image 1 may also be captured on photographic film 17 by the focussing elements 3 and the mirror 7. The capture of the image 1 on the photographic film 17 may take place after the image composition evaluation system has evaluated the composition of the image, thus providing the user of the camera system with the opportunity of recomposing the image 1 to finally capture a more pleasingly composed image.
- FIG. 2 shows an alternative arrangement of an image composition evaluation system constituting an embodiment of the present invention, in conjunction with an electronic image capture system. Unless otherwise stated, those parts that are common between the system shown in Figure 2 and the system shown in Figure 1 are numbered with identical reference numerals.
- An image 1 is optically focused using lens system 3 that receives the image 1 and focuses it onto an image conversion device 9 to convert the optical image into an electronic image signal.
- the image signal is then passed to an image processor 11 where image composition evaluation is performed.
- the image processor may also output the image signal to an image view finder 5 which may be implemented as a display, such as a LCD display.
- the electronic image signal from the image convertor 9 may be fed directly to the image view finder 5, as indicated by hash line 23 of Figure 2 .
- the image processor may provide a visual warning system in the image view finder 5 in addition to displaying the image.
- the visual warning system may simply be the illumination, or flashing, of a symbol, message, warning lamp or LED within the image view finder 5, or may be represented by highlighting, dimming, or otherwise indicating, those areas of the image that specifically contravene one or more of the predefined set of composition rules.
- the image composition evaluation may occur continuously, as just described, or may only occur in response to an image composition evaluation request signal provided from an composition evaluation request switch 19.
- the composition evaluation switch 19 may also serve as an image capture switch as part of an electronic image capture system that causes the image processor to capture the image 1 onto an electronic recording media 21.
- the image composition evaluation switch 19 may in this case be a two, or three position switch where the first position generates the image composition evaluation request signal but the switch 19 must be manipulated into a further, for example the second, position to cause image capture to occur.
- the method of performing the image composition evaluation comprises identifying one or more regions of compositional significance or interest within the image and applying one or more predefined compositional rules to the identified regions.
- Suitable compositional rules may, for example, include the "rules of thirds", rejecting any regions of compositional significance that are too close to the edge of the image, ensuring that a "main region of interest” is always in the image, and more likely in the centre of the image, ensuring that relatively large areas containing very few regions of interest or significance are, if possible, not included in the image.
- Figure 3 shows an image composed in a casual manner, perhaps by an inexperienced photographer.
- the girl on the swing 25 represents the main subject of the image.
- other potential regions of interest include the flowers 27 located at the bottom left hand corner of the image.
- a serious compositional error has been made by only including part of a figure 29 shown at the right hand side of the image.
- the system of the present invention may, for example, issue a message indication that Figure 29 should be fully included or fully excluded from the image.
- Figure 4 illustrates one possible recomposition of the image that satisfies most compositional rules.
- the main region of interest, the girl 25, is now the sole region of interest within the image, the final image having been recomposed to exclude the flowers 27 and the partial figure 29 of the original image shown in Figure 3 .
- Figure 5 shows an alternative recomposition of the image of Figure 3 where only the partial figure 29 has been excluded.
- other possible recompositions exist, such as recomposing the image to include the totality of the figure 29 shown in Figure 3 or recomposing to only include the flowers 27 as the sole region of interest.
- the final composition of the image remains the choice of the operator of the composition evaluation system.
- the user may instruct the image processor to present them for review. This may occur by activation of a further switch, or other input means such as existing keys or voice commands.
- An automated image processing system has no a-priori knowledge of the subject matter of the photograph and therefore needs to process it in order to extract some form of representation which will indicate where the compositionally significant regions of the photograph lie.
- the photograph 10 may have been taken with a camera having in excess of 2,000,000 active pixels. Analysing such a large number of pixels would be computationally very significant indeed.
- the image processor down samples the image in order to reduce the number of pixels therein.
- Figure 6 schematically illustrates the same image as shown in Figure 3 , but after down sampling to 240 by 180 pixels. This down sampling has reduced the number of active pixels to 43,200.
- the down sampled image is then converted into an image having compressed colour variation whilst still retaining intensity variations.
- An example of such a processing is converting the image to the YCC colour space format. It should be noted that this is not the only colour space representation which could be used. Thus, the CIELAB colour space system can also be used.
- This system is well known, and defines a space in which the lightness L*, which is a measure of how bright a colour is, is plotted against the vertical axis, and two further measurements a* and b* are defined as linear axes with a* defining the colour from a red to green scale and the b* axis indicating colour on a blue to yellow scale.
- the measurements a* and b* are in the horizontal colour plane and are perpendicular to each other such that this colour system defines an orthogonal cartesian space.
- Each of the L*, a* and b* axis are defined in such a way that one unit on any of the scales has approximately the same "visibility" making this system both linear and isotropic as regards human perception.
- the L* axis has a scale from zero (black) to 100 (white) whilst the a* and b* scales range from -60 to +60 each.
- This system has the advantage that a colour difference of one unit has substantially the same visibility at any part of the colour space.
- areas within the converted image having similar colour and intensity are generated and grown.
- This process commences by blurring the image, and then the blurred image is analysed in order to form "seed areas" that have a smooth colour and intensity.
- the seed areas are then grown by adding areas adjacent to the boundary of the seed areas where those adjacent areas have a sufficiently similar colour and intensity.
- a test is made to determine whether all of the pixels within the colour compressed image have been allocated to seed areas. If not the blur and region grow process is repeated in an iterative manner.
- Figure 7 schematically illustrates the image of Figure 6 once all of the image has been blurred and assigned to regions. At this stage the image shown in Figure 7 contains approximately 2,800 regions, some 2,200 of which contain 10 or less pixels.
- the image processing then continues by merging adjacent areas of the image which are separated by "weak edges".
- "Weak edges” are those boundaries that separate areas of the picture which have a relatively low colour or intensity differences. In other words, the regions which are close to one another within the YCC or CIELAB space. Adjacent areas with similar mean colours are then merged together, and then the image is analysed to determine if small areas, that is areas whose size is less than a threshold value, are completely enclosed by another larger area. If so, then the small area is merged into the larger area. A test may be made to determine whether the number of individual regions has fallen to below a predetermined threshold number.
- Figure 8 shows the image following the region merging.
- the image is further analysed in order to cluster similar colours together until such time as the number of colours has dropped to an appropriate number, which is typically in the region of 20 or so.
- the image of clustered colours is schematically illustrated in Figure 9 .
- a region is a spatially connected sub-area of the image.
- a cluster is a collection of similar regions, but the regions do not need to be adjacent to one another.
- an interest metric is formed on the basis of the unusualness of the colour, and the image is analysed to determine the compositionally significant properties therein from amongst a plurality of different possible properties.
- One such analysis that may be performed is the analysis of the clustered colours shown in Figure 9 to determine how unusual they are.
- the image shown in Figure 9 as noted hereinbefore, comprises approximately 20 or so different colour clusters. These clusters are then sorted in order to identify how many pixels belong to each one of the colours.
- Each of the colour clusters is processed in turn.
- the colour distance between it and each of the other colour clusters is calculated, the clusters are then sorted in order of colour distance from the colour cluster being processed.
- a cumulative histogram can then be formed for the colour cluster under test, by counting the cumulative sum of image pixels which are included in an increasing number of clusters along the colour distance dimension.
- Clusters which, together with closely coloured neighbouring clusters, occupy a relatively large proportion of the pixels of the image are deemed to be background.
- cluster colours which together with closely coloured neighbouring clusters occupy only a relatively small proportion of the pixels of the image are deemed to be foreground.
- cluster colours can be allocated a default saliency based on the likelihood that they are foreground colours.
- colour mapping is not the only process that is applied in order to determine a saliency image.
- those regions which are located towards the edges of the image may be penalised as they may belong to objects which are not fully in frame.
- a search may be made to identify bodies or faces as a result of comparing areas within the image against models held within a model library.
- Figure 10 schematically illustrates a saliency image of Figure 3 following the conclusion of the one or more analysis processes.
- the saliency image is processed to subdivide it into a small number of large areas (typically rectangles) which enclose the majority of the saliency in the image. Thus, the selected areas enclose the bright regions of the saliency image.
- One method of doing this is to form the sums of saliency pixel values along each row, and separately, down each column. Plotting these sums against the vertical and horizontal axes respectively, shows the vertical and horizontal distributions of saliency. These can then be analysed to find the widest minimum in either the vertical or horizontal saliency distribution.
- the image can then be split into three parts at this minimum.
- a first part comprises a horizontal, or as the case may be vertical, band through the image having a width substantially corresponding to that of the minimum.
- This part can be ignored as non salient. This will then leave two parts of the image each side of this minimum band which will contain saliency (except in the case where the minimum band is adjacent one of the edges of the image in which case there will only be one non-empty or salient side). These parts can each be processed by the same algorithm.
- the part with the widest minimum can be split in an analogous manner, discarding the width of the minimum and hence splitting that part into two smaller parts. This process can continue with each stage splitting the part about the best minimum until one of the following limiting conditions is reached:
- the saliency map can now include regions of the image which are defined as include regions and exclude regions.
- the girl has been identified as an "include” region and has been framed by a crop boundary 60 which represents the minimum boundary possible to include all of the girl therein.
- the flowers have been identified as an include region and have been framed by a crop boundary 61 representing the minimum crop required to include the flowers.
- “must exclude” regions have been identified and enclosed by crop boundaries 64 and 66 respectively. However, these regions may be highlighted to a user such that he is given the opportunity to recompose the image in such a way that these features are fully included.
- the user may be presented with an image corresponding to or based on that shown in Figure 10 where compositionally significant regions are highlighted; or an image similar to that of Figure 11a where minimum crop boundaries are represented.
- images could be colour coded such that the include regions are presented in one colour, for example green, whereas the exclude regions are presented in another colour, such as red.
- the exclude regions may be significant and for example, it may be desired to re-frame the image such that the partial Figure 29 is fully framed within the image.
- This compositional option can be given to a user of the present invention. However, it is self evident that the Figure 29 could not be included in a system which utilised only post capture image processing.
- the user may be presented with suggested recompositions of the image. In order to do this, some further processing is required. An example of these additional processes is given below.
- a further minimum crop boundary 70 can be defined which includes both the girl and the flowers (with partial exclusion of the flowers being allowed because they are so close to the edge), and a further maximum crop boundary 72 has also been defined which extends to the upper and lower edges of the photograph, to the left hand edge, but abuts the must exclude regions 64 and 66 at the right hand edge thereof.
- the saliency map is analysed in order to determine how many areas of interest exist therein.
- N distinct areas of interest for example areas of interest separated by some area of non-interest as determined by some adaptively set threshold
- possible minimum cropping rectangles can be generated which contain alternative combinations of between 1 and N areas of interest where the minimum cropping rectangle contains a selected combination of areas of interest and excludes other areas.
- minimum cropping rectangle 60, 61 and 70 in Figures 11A and 11C .
- the maximum cropping rectangle for the each single or combination of areas of interest is the maximum rectangle which contains the areas of interest but excludes the non-selected areas of interest. Thus this corresponds to rectangles 68 and 72 in Figures 11B and 11C .
- Each minimum cropping rectangle 60, 61 and 70 and its associated maximum cropping limit (of which only cropping limits 68 and 72 are shown in Figures 11B and 11C ) is then processed in turn. However, some initial sorting may reduce the processing required.
- One of the compositional rules may require that a large well centred interesting area in the image is deemed to be essential. If we apply this rule then only minimum cropping boundaries 60 and 70 are permitted, with the flowers as defined by crop boundary 61 being excluded.
- the first step is to select a first one of the minimum cropping boundaries 60 and 70 as a potential cropping candidate, together with its cropping limits. As search is then made to identify possible edge locations for each of the edges.
- each of the columns between P and Q is examined in turn in order to generate a metric of how good that column would be as a border of the cropping rectangle.
- the metric is constructed such that dark areas or slowly changing pixels along the column incur a low cost penalty, whereas brighter areas or alternatively rapidly changing colours in a row of pixels achieve a high penalty rating.
- the rating may also be modified with regards to the proximity of that column to the minimum and maximum crop boundaries, or indeed the proximity of that column to the edge of the picture.
- the edge quality metric is a function of:
- a penalty measurement is formed, and the penalty measurement can then be plotted with respect to column thereby obtaining a penalty measurement profile 90.
- the profile 90 can then be examined to determine the position of minima therein, such as broad minima 92 or the sharper minima 94 and 96 which are then deemed to be potential cropping boundaries.
- This process can be repeated for each of the left, right, bottom and top crop boundaries individually, and may be repeated on a iterative basis such that for example those pixels in the column which lie above the upper crop limit or below the lower crop limit are excluded from the next iteration of the crop boundary.
- These candidate crops can then be subject to further constraints. In practice, there will be too many constraints to satisfy all of the constraints simultaneously. Constraints may include implementing the "rule of thirds" in respect of the horizon line. Similarly. the "rule of thirds" can be introduced to act on the main feature of interest to place it 1 ⁇ 3 of a distance from the edge of the crop.
- the final crop is also be constrained by the aspect ratio of the camera.
- a crop candidate Once a crop candidate has been identified, it is then evaluated by applying one or more rules. Each rule is implemented as a heuristically evaluated measure on the image.
- Heuristic measures are used for compositional rules such as eliminating distractions close to the edge of the frame, minimum edge quality, a preference for dark or low activity boundaries, and so on.
- compositional rules used may have different weightings associated with them to vary the importance of those rules for example, particular attention may be paid to identify large boring areas, distractions at the edges of the image, or horizon lines that are centrally placed or placed very close to the top or bottom of the image frames.
- the weightings may vary depending on image content.
- a criterion that may be attributed particular significance may be that of identifying regions of interest that extend beyond the edge of the frame.
- the user of the image evaluation system may be advised by means of the audio or visual warning signals to attempt to fully capture these features, or if the region of interest identified by the image composition evaluation system is actually a combination of multiple objects, to reposition the image capture system such that these two objects are not aligned.
- This advice it is more likely to produce an image composition where the true subject is well isolated from competing regions of interest, for example by being separated by background regions.
- the user can be instructed to zoom and pan in order to match the fully framed image to the suggested crop.
- an image of the suggested crop may be faintly displayed on the viewfinder in addition to the "current" image seen by the camera.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Probability & Statistics with Applications (AREA)
- Image Analysis (AREA)
- Studio Devices (AREA)
- Testing, Inspecting, Measuring Of Stereoscopic Televisions And Televisions (AREA)
- Exposure Control For Cameras (AREA)
Description
- The present invention relates to method of providing image composition evaluation, or advice, and to an image composition evaluation system utilising the method.
- The capturing of an image, whether that image is being captured permanently on photographic film or as a digital still image, or whether the image is being captured as a moving image using a video camera or television camera or the like, nearly always first involves composing the image prior to capture. This is normally accomplished through a view finder or other similar arrangement associated with the camera apparatus. Successfully composing an image to produce a pleasing picture is a skill normally acquired over time and is a skill that inexperienced camera users often find difficult to acquire. Even experienced photographers or camera users can sometimes overlook some compositional aspects of an image due to a lack of concentration at the time of capture of the image, or due to capturing the image in a hurry.
- A pleasingly composed image often follows one or more known composition rules. These rules include, for example, positioning elements of interest according to the "rule of thirds", ensuring that elements of interest are not positioned too close to the edge of the image frame, framing the main area of interest by placing areas of very little interest at the edge of the image, for example placing relatively dark areas, or areas containing relatively little activity, at the edge of the image, and positioning strong diagonal lines to run towards the main subject of the image. The "rule of thirds" involves subdividing the image area using two equally spaced horizontal lines and two equally spaced vertical lines (in the manner of a "noughts and crosses" board) and positioning elements of interest on the intersections of the horizontal and vertical lines, or equally placing strong horizontal or vertical features on one of the horizontal and vertical lines.
- Although these rules of composition are well known and can be studied, inexperienced photographers often find them difficult to apply. In particular, the lack of immediate feedback concerning the proposed composition of an image makes the learning process time consuming and difficult.
- It has been proposed by the present applicant in the co-pending United Kingdom patent application number
GB 0031423.7 - It would therefore be advantageous to provide an indication of how well a proposed image conforms to these well known composition rules prior to capture of that image. This can be considered as providing automatic composition advice.
- The following two papers disclose retrieval and classification systems utilizing image composition as a relevant parameter:
- " Image retrieval based on compositional features and interactive query specification" (Hachimura K et al; Proceedings 15th International Conference On Pattern Recognition. ICPR-2000, Proceedings Of 15th International Conference On Pattern Recognition, Barcelona, Spain, 3-7 Sept. 2000, pages 262-266 vol.4, XP010533070 2000, Los Alamitos, CA, USA, IEEE Comput. Soc, USA ISBN: 0-7695-0750-6 ). This paper discloses a sy stem in which compositional parameters are extracted from images by image segmentation and object extraction procedures. T specify a query a user first selects one of seven composition templates, each of which represents a typical composition. The system then successively generates patterns of "composition models" according to the templates selected. These composition models describe the pattern of object distribution within an image. A user iteratively selects composition models, which properly describe the composition he/she wants.
- " Integrating image matching and classification for multimedia retrieval on the Web" (Hirata K et al; Multimedia Computing And Systems. 1999. IEEE International Conference On Florence. Italy 7-11 June 1999, Los Alamitos, CA, USA,IEEE Comput. Soc, US, 7 June 1999 (1999-06-07), pages 237-263, XP010342881 ISBN: 0-7695-0253-9 ) This paper discloses a technique for image classification based on color, shape and composition using the primary objects.
-
US 5,831,670 describes deriving three composition index numbers in respect of an image viewed by a camera, and visually suggesting changes in the direction of pointing of the camera to improve the image composition. These composition index numbers comprise: - a columnar-object (pole, tree, etc.) composition index number G1 which is a measure of verticality of such objects in the image - improvement suggestions seek to make the objects vertical;
- a main-subject-position composition index number G2 which is a measure of the subject's vertical centering in the image - improvement suggestions seek to centre the subject;
- a horizontal-divide (e.g. horizon) composition index number G3 which is a measure of whether a horizontal divide crosses too close to the middle of the image or slants - improvement suggestions seek to move the divide from the vertical centre and make it level.
- According to a first aspect of the present invention there is provided a method of providing an evaluation of the composition of an image, the method comprising acquiring an image, processing the acquired image to evaluate its composition by applying one or more compositional rules, and providing a report concerning composition of the image such that the user has an opportunity to recompose the image prior to recording the image; wherein processing of the acquired image comprises:
- analyzing the acquired image to identify regions of interest within the image; and
- for each identified region of interest, applying one or more compositional rules to determine compositionally-desirable boundaries for the region;
- According to a second aspect of the present invention, there is provided apparatus for evaluating the composition of an image, the apparatus comprising:
- an image receiving element arranged to convert an image into an electrical image signal; and
- an image processor arranged to receive said electrical image signal and process it to evaluate its composition by applying one or more compositional rules, the image processor being further arranged to provide a report concerning composition of the image such that the user has an opportunity to recompose the image prior to recording the image;
- analysing the image represented by the signal to identify regions of interest within the image; and
- for each identified region of interest, applying one or more composition rules to determine compositionally-desirable boundaries for the region;
- The present invention will now be described, by way of example, with reference to the accompanying drawings in which:
-
Figure 1 schematically illustrates an image composition evaluation system according to an embodiment of the present invention incorporated within a film camera; -
Figure 2 schematically illustrates an image composition evaluation system according to a further embodiment of the present invention incorporated within an electronic image capture system; -
Figure 3 shows an image on which composition evaluation may be performed according to an embodiment of the present invention; -
Figure 4 shows an improved composition of the image ofFigure 3 ; -
Figure 5 shows an alternative improved composition of the image ofFigure 3 ; -
Figure 6 shows the image ofFigure 3 after re-sampling to reduce the number of active pixels; -
Figure 7 shows the image ofFigure 6 once the image has been blurred and subdivided into regions of similar appearance; -
Figure 8 shows the image ofFigure 7 following further region merging; -
Figure 9 shows the image ofFigure .8 following colour clustering; -
Figure 10 shows a saliency image for the image ofFigure 3 ; -
Figure 11a, 11b and 11c illustrate potential minimum and maximum cropping boundaries; and -
Figure 12 illustrates a way of determining suitable positions for crop boundaries. -
Figure 1 schematically illustrates an image composition evaluation system constituting an embodiment of the present invention, in conjunction with a film camera system. Asubject 1 is optically focussed by alens system 3 which receives light from the subject and focuses it onto aview finder 5. In this particular example theview finder 5 is part of both the image composition evaluation system and the optical camera system. Using ahinged mirror 7, of the type well known from single lens reflex cameras, theimage 1 is also focused on aimage converter 9. Theimage converter 9 may, for example, be a photo-detector array. Theimage converter 9 converts the optical image signals into an electrical image signal. Connected to theimage converter 9 is an image processor 11 that performs image composition evaluation on the received image. Should the image processor 11 determine that the received image contravenes one or more of a predefined set of composition rules, a warning signal is generated. The warning signal may be an audio warning signal produced using anaudio loudspeaker 13, or alternatively the warning signal may be a visual warning signal produced, for example, by illuminating awarning lamp 15 or by giving a textural message. The visual warning may be visible through theimage view finder 5. Equally, both an audio warning signal and visual warning signal may be generated. Theimage 1 may also be captured onphotographic film 17 by the focussingelements 3 and themirror 7. The capture of theimage 1 on thephotographic film 17 may take place after the image composition evaluation system has evaluated the composition of the image, thus providing the user of the camera system with the opportunity of recomposing theimage 1 to finally capture a more pleasingly composed image. -
Figure 2 shows an alternative arrangement of an image composition evaluation system constituting an embodiment of the present invention, in conjunction with an electronic image capture system. Unless otherwise stated, those parts that are common between the system shown inFigure 2 and the system shown inFigure 1 are numbered with identical reference numerals. Animage 1 is optically focused usinglens system 3 that receives theimage 1 and focuses it onto animage conversion device 9 to convert the optical image into an electronic image signal. The image signal is then passed to an image processor 11 where image composition evaluation is performed. The image processor may also output the image signal to animage view finder 5 which may be implemented as a display, such as a LCD display. Alternatively, the electronic image signal from theimage convertor 9 may be fed directly to theimage view finder 5, as indicated byhash line 23 ofFigure 2 . Having evaluated the composition of the image, the image processor may provide a visual warning system in theimage view finder 5 in addition to displaying the image. The visual warning system may simply be the illumination, or flashing, of a symbol, message, warning lamp or LED within theimage view finder 5, or may be represented by highlighting, dimming, or otherwise indicating, those areas of the image that specifically contravene one or more of the predefined set of composition rules. The image composition evaluation may occur continuously, as just described, or may only occur in response to an image composition evaluation request signal provided from an compositionevaluation request switch 19. Thecomposition evaluation switch 19 may also serve as an image capture switch as part of an electronic image capture system that causes the image processor to capture theimage 1 onto anelectronic recording media 21. The imagecomposition evaluation switch 19 may in this case be a two, or three position switch where the first position generates the image composition evaluation request signal but theswitch 19 must be manipulated into a further, for example the second, position to cause image capture to occur. - In one embodiment of the present invention, the method of performing the image composition evaluation comprises identifying one or more regions of compositional significance or interest within the image and applying one or more predefined compositional rules to the identified regions. Suitable compositional rules may, for example, include the "rules of thirds", rejecting any regions of compositional significance that are too close to the edge of the image, ensuring that a "main region of interest" is always in the image, and more likely in the centre of the image, ensuring that relatively large areas containing very few regions of interest or significance are, if possible, not included in the image.
-
Figure 3 shows an image composed in a casual manner, perhaps by an inexperienced photographer. The girl on theswing 25 represents the main subject of the image. However, other potential regions of interest include theflowers 27 located at the bottom left hand corner of the image. Also, a serious compositional error has been made by only including part of a figure 29 shown at the right hand side of the image. The system of the present invention may, for example, issue a message indication that Figure 29 should be fully included or fully excluded from the image.Figure 4 illustrates one possible recomposition of the image that satisfies most compositional rules. InFigure 4 the main region of interest, thegirl 25, is now the sole region of interest within the image, the final image having been recomposed to exclude theflowers 27 and the partial figure 29 of the original image shown inFigure 3 .Figure 5 shows an alternative recomposition of the image ofFigure 3 where only the partial figure 29 has been excluded. Of course other possible recompositions exist, such as recomposing the image to include the totality of the figure 29 shown inFigure 3 or recomposing to only include theflowers 27 as the sole region of interest. The final composition of the image remains the choice of the operator of the composition evaluation system. Where multiple recompositions exist, the user may instruct the image processor to present them for review. This may occur by activation of a further switch, or other input means such as existing keys or voice commands. - Various schemes are known for selecting a region of interest from an electronic image. One such scheme is described in the present applicants co-pending UK patent application number
GB 0031423.7 GB 0031423.7 Figure 3 . It will be appreciated that although only shown using a grey scale image to a reproduction, the original image ofFigure 3 was in colour. - An automated image processing system has no a-priori knowledge of the subject matter of the photograph and therefore needs to process it in order to extract some form of representation which will indicate where the compositionally significant regions of the photograph lie.
- The photograph 10 may have been taken with a camera having in excess of 2,000,000 active pixels. Analysing such a large number of pixels would be computationally very significant indeed. Thus prior to performing any other processing stamps, the image processor down samples the image in order to reduce the number of pixels therein.
Figure 6 schematically illustrates the same image as shown inFigure 3 , but after down sampling to 240 by 180 pixels. This down sampling has reduced the number of active pixels to 43,200. Following the down sampling, the down sampled image is then converted into an image having compressed colour variation whilst still retaining intensity variations. An example of such a processing is converting the image to the YCC colour space format. It should be noted that this is not the only colour space representation which could be used. Thus, the CIELAB colour space system can also be used. This system is well known, and defines a space in which the lightness L*, which is a measure of how bright a colour is, is plotted against the vertical axis, and two further measurements a* and b* are defined as linear axes with a* defining the colour from a red to green scale and the b* axis indicating colour on a blue to yellow scale. The measurements a* and b* are in the horizontal colour plane and are perpendicular to each other such that this colour system defines an orthogonal cartesian space. Each of the L*, a* and b* axis are defined in such a way that one unit on any of the scales has approximately the same "visibility" making this system both linear and isotropic as regards human perception. The L* axis has a scale from zero (black) to 100 (white) whilst the a* and b* scales range from -60 to +60 each. This system has the advantage that a colour difference of one unit has substantially the same visibility at any part of the colour space. - Following conversion of the image to a colour space, areas within the converted image having similar colour and intensity are generated and grown. This process commences by blurring the image, and then the blurred image is analysed in order to form "seed areas" that have a smooth colour and intensity. The seed areas are then grown by adding areas adjacent to the boundary of the seed areas where those adjacent areas have a sufficiently similar colour and intensity. A test is made to determine whether all of the pixels within the colour compressed image have been allocated to seed areas. If not the blur and region grow process is repeated in an iterative manner.
-
Figure 7 schematically illustrates the image ofFigure 6 once all of the image has been blurred and assigned to regions. At this stage the image shown inFigure 7 contains approximately 2,800 regions, some 2,200 of which contain 10 or less pixels. - The image processing then continues by merging adjacent areas of the image which are separated by "weak edges". "Weak edges" are those boundaries that separate areas of the picture which have a relatively low colour or intensity differences. In other words, the regions which are close to one another within the YCC or CIELAB space. Adjacent areas with similar mean colours are then merged together, and then the image is analysed to determine if small areas, that is areas whose size is less than a threshold value, are completely enclosed by another larger area. If so, then the small area is merged into the larger area. A test may be made to determine whether the number of individual regions has fallen to below a predetermined threshold number. If it is judged that there are still too many regions, the merge can be repeated, possibly with the definition of what constitutes a weak edge being changed such that the distance in the colour space by which colours must be separated before they are regarded as sufficiently different not to be merged may be increased.
Figure 8 shows the image following the region merging. - The image is further analysed in order to cluster similar colours together until such time as the number of colours has dropped to an appropriate number, which is typically in the region of 20 or so. The image of clustered colours is schematically illustrated in
Figure 9 . - It should be noted that as used herein a region is a spatially connected sub-area of the image. However a cluster is a collection of similar regions, but the regions do not need to be adjacent to one another.
- It can be seem with reference to
Figure 9 that the main part of theflowers 27 have been merged into areas of uniform colour. Similarly, the girl's face has been merged into an area ofuniform colour 51 as have hertrousers 52. Large areas of the background have also been merged into areas of substantially uniform colour, for example the tree towards the left hand side of the image. - Next an interest metric is formed on the basis of the unusualness of the colour, and the image is analysed to determine the compositionally significant properties therein from amongst a plurality of different possible properties.
- One such analysis that may be performed is the analysis of the clustered colours shown in
Figure 9 to determine how unusual they are. The image shown inFigure 9 , as noted hereinbefore, comprises approximately 20 or so different colour clusters. These clusters are then sorted in order to identify how many pixels belong to each one of the colours. - Each of the colour clusters is processed in turn. When a colour is processed, the colour distance between it and each of the other colour clusters is calculated, the clusters are then sorted in order of colour distance from the colour cluster being processed. A cumulative histogram can then be formed for the colour cluster under test, by counting the cumulative sum of image pixels which are included in an increasing number of clusters along the colour distance dimension.
- Clusters which, together with closely coloured neighbouring clusters, occupy a relatively large proportion of the pixels of the image are deemed to be background. Conversely, cluster colours which together with closely coloured neighbouring clusters occupy only a relatively small proportion of the pixels of the image are deemed to be foreground. By this analysis, cluster colours can be allocated a default saliency based on the likelihood that they are foreground colours.
- However, colour mapping is not the only process that is applied in order to determine a saliency image. In general, those regions which are located towards the edges of the image may be penalised as they may belong to objects which are not fully in frame.
- Further processes, such as pattern recognition may also be applied to the image. Thus, a search may be made to identify bodies or faces as a result of comparing areas within the image against models held within a model library.
-
Figure 10 schematically illustrates a saliency image ofFigure 3 following the conclusion of the one or more analysis processes. - The saliency image is processed to subdivide it into a small number of large areas (typically rectangles) which enclose the majority of the saliency in the image. Thus, the selected areas enclose the bright regions of the saliency image. One method of doing this is to form the sums of saliency pixel values along each row, and separately, down each column. Plotting these sums against the vertical and horizontal axes respectively, shows the vertical and horizontal distributions of saliency. These can then be analysed to find the widest minimum in either the vertical or horizontal saliency distribution. The image can then be split into three parts at this minimum. A first part comprises a horizontal, or as the case may be vertical, band through the image having a width substantially corresponding to that of the minimum. This part can be ignored as non salient. This will then leave two parts of the image each side of this minimum band which will contain saliency (except in the case where the minimum band is adjacent one of the edges of the image in which case there will only be one non-empty or salient side). These parts can each be processed by the same algorithm. The part with the widest minimum can be split in an analogous manner, discarding the width of the minimum and hence splitting that part into two smaller parts. This process can continue with each stage splitting the part about the best minimum until one of the following limiting conditions is reached:
- a. No minimum can be found in any of the remaining parts. I.e. no minimum is found which is sufficiently wide and sufficiently low in saliency.
- b. The fraction of the total saliency of the image which is outside of the retained block reaches some predetermined limit, such as 5%.
- The result of this process is that a small set of rectangular blocks which enclose the major areas of saliency of the image are derived.
- Suppose that the image is initially framed such that, as shown in
Figure 3 , it includes unwanted features. - Once features relevant to the composition of the image have been identified, the saliency map can now include regions of the image which are defined as include regions and exclude regions. Thus, considering
Figure 11A the girl has been identified as an "include" region and has been framed by acrop boundary 60 which represents the minimum boundary possible to include all of the girl therein. Similarly, the flowers have been identified as an include region and have been framed by acrop boundary 61 representing the minimum crop required to include the flowers. Furthermore, "must exclude" regions have been identified and enclosed bycrop boundaries - Thus, at this point, and optionally in response to preferences set by the user, the user may be presented with an image corresponding to or based on that shown in
Figure 10 where compositionally significant regions are highlighted; or an image similar to that ofFigure 11a where minimum crop boundaries are represented. Indeed such images could be colour coded such that the include regions are presented in one colour, for example green, whereas the exclude regions are presented in another colour, such as red. - The exclude regions may be significant and for example, it may be desired to re-frame the image such that the partial Figure 29 is fully framed within the image. This compositional option can be given to a user of the present invention. However, it is self evident that the Figure 29 could not be included in a system which utilised only post capture image processing.
- Other problems might also be indicated at this point, such as the inclusion of large boring areas, badly placed horizons, the image being tilted, people looking out of frame, the camera pointing directly into the sun, edges with unnecessarily high levels of activity, and so on. This gives the user the opportunity to recompose the photograph.
- As a further alternative, the user may be presented with suggested recompositions of the image. In order to do this, some further processing is required. An example of these additional processes is given below.
- Having identified the minimum crop boundary, it is then advantageous to identify the maximum crop boundary. With regards to
Figure 11B , one potentialmaximum crop boundary 68 has been identified. This crop boundary abuts the must excluderegions region 61. The boundary also extends between the upper and lower edges of the photograph. Thiscrop boundary 68 represents the maximum crop boundary available to include the girl but to exclude the flowers. However, an alternative crop boundary is available which includes both the girl and the flowers. Thus, as shown inFigure 11C a furtherminimum crop boundary 70 can be defined which includes both the girl and the flowers (with partial exclusion of the flowers being allowed because they are so close to the edge), and a furthermaximum crop boundary 72 has also been defined which extends to the upper and lower edges of the photograph, to the left hand edge, but abuts the must excluderegions - The saliency map is analysed in order to determine how many areas of interest exist therein. Thus, if the saliency map shows N distinct areas of interest (for example areas of interest separated by some area of non-interest as determined by some adaptively set threshold) possible minimum cropping rectangles can be generated which contain alternative combinations of between 1 and N areas of interest where the minimum cropping rectangle contains a selected combination of areas of interest and excludes other areas. Thus this corresponds to generation of
minimum cropping rectangle Figures 11A and 11C . It should be noted that not all combinations may be possible as they may not be contained within a single rectangle that excludes one or more of the non-selected areas. The maximum cropping rectangle for the each single or combination of areas of interest is the maximum rectangle which contains the areas of interest but excludes the non-selected areas of interest. Thus this corresponds torectangles Figures 11B and 11C . - Each
minimum cropping rectangle limits Figures 11B and 11C ) is then processed in turn. However, some initial sorting may reduce the processing required. One of the compositional rules may require that a large well centred interesting area in the image is deemed to be essential. If we apply this rule then onlyminimum cropping boundaries crop boundary 61 being excluded. The first step is to select a first one of theminimum cropping boundaries - Supposing that minimum and maximum crop rectangles have been defined, and that it is now desired to find the position of suitable crop boundaries between the minimum and maximum limits. For the purpose of this description, we are going to locate the edge of one boundary, occurring to the left hand side of the minimum crop rectangle. Given that the digital image can be considered as consisting of a plurality of columns, the left hand edge of the maximum crop rectangle is located in column P, whereas the left hand edge of the minimum crop rectangle is located in column Q. Columns P and Q are not adjacent.
- Sequentially each of the columns between P and Q is examined in turn in order to generate a metric of how good that column would be as a border of the cropping rectangle. Thus, the metric is constructed such that dark areas or slowly changing pixels along the column incur a low cost penalty, whereas brighter areas or alternatively rapidly changing colours in a row of pixels achieve a high penalty rating. Furthermore, the rating may also be modified with regards to the proximity of that column to the minimum and maximum crop boundaries, or indeed the proximity of that column to the edge of the picture.
- In a preferred embodiment of the present invention, the edge quality metric is a function of:
- i. Brightness. That is dark edges are preferred and hence incur only a low penalty.
- ii. Activity. That is the sum of the colour differences between regions crossed by a row or column is analysed, with low sums scoring a lower penalty.
- iii. Saliency. That is the sum of the saliency values for pixels in the row or column is formed, with low saliency incurring a lower penalty.
- iv. Distance from strong colour transitions parallel-to, and on the inside of, the column or row being tested. The distance should not be too close nor too far and a weighted distance term is used to accomplish this. This latter criteria is used to avoid cropping too close to a feature, even if it is not part of the minimum cropping rectangle.
- These factors are independently smoothed and normalised before being combined in order to form a weighted sum to generate the edge quality metric as shown in
Figure 12 . - Thus for each one of the individual columns, a penalty measurement is formed, and the penalty measurement can then be plotted with respect to column thereby obtaining a
penalty measurement profile 90. Theprofile 90 can then be examined to determine the position of minima therein, such asbroad minima 92 or thesharper minima - The final crop is also be constrained by the aspect ratio of the camera.
- Once a crop candidate has been identified, it is then evaluated by applying one or more rules. Each rule is implemented as a heuristically evaluated measure on the image.
- Heuristic measures are used for compositional rules such as eliminating distractions close to the edge of the frame, minimum edge quality, a preference for dark or low activity boundaries, and so on.
- The combination of different rule penalties by a weighted sum allows some rules to be considered as more important than others. Again, the weightings are determined heuristically.
- Other known methods of identifying regions of interest from electronic image may equally be applied to embodiments of the present invention.
- Thus, as noted hereinbefore, the different compositional rules used may have different weightings associated with them to vary the importance of those rules for example, particular attention may be paid to identify large boring areas, distractions at the edges of the image, or horizon lines that are centrally placed or placed very close to the top or bottom of the image frames. The weightings may vary depending on image content.
- A criterion that may be attributed particular significance may be that of identifying regions of interest that extend beyond the edge of the frame. The user of the image evaluation system may be advised by means of the audio or visual warning signals to attempt to fully capture these features, or if the region of interest identified by the image composition evaluation system is actually a combination of multiple objects, to reposition the image capture system such that these two objects are not aligned. By following this advice it is more likely to produce an image composition where the true subject is well isolated from competing regions of interest, for example by being separated by background regions.
- Where candidate crops are presented to a user, and the crops lie wholly within the original badly composed image, the user can be instructed to zoom and pan in order to match the fully framed image to the suggested crop. In order to achieve this an image of the suggested crop may be faintly displayed on the viewfinder in addition to the "current" image seen by the camera.
Claims (20)
- A method of providing an evaluation of the composition of an image, the method comprising acquiring an image, processing the acquired image to evaluate its composition by applying one or more compositional rules, and providing a report concerning composition of the image such that the user has an opportunity to recompose the image prior to recording the image; characterized in that processing of the acquired image comprises:analyzing the acquired image to identify regions of interest within the image; andfor each identified region of interest, applying one or more compositional rules to determine compositionally-desirable crop boundaries for the region;said report being indicative of the regions of interest and their compositionally-desirable crop boundaries.
- A method according to claim 1, wherein said analysis is performed in response to a composition evaluation request signal.
- A method according to claim 1, wherein said report is presented visually.
- A method according to claim 3, wherein said report is displayed on an image viewer.
- A method according to claim 1, wherein said analyzing of the acquired image includes deriving a saliency map of the image on the basis of unusualness of one of the colour and brightness in the image.
- A method according to claim 1, wherein providing the report includes presenting, as the compositionally-desirable boundaries of a region of interest, a suggested crop of the image, this crop being presented on an output device.
- A method according to claim 1, wherein the candidate crop presented in the report is presented visually.
- A method according to claim 1, wherein said analyzing of the acquired image identifies regions for exclusion, the determination of compositionally desirable boundaries for the regions of interest comprising for each such region:determining a minimum crop boundary around the region of interest;determining a maximum crop boundary around the region of interest, this maximum crop boundary excluding any region identified for exclusion but potentially including one or more other regions of interest;determining candidate crop boundaries between the minimum and maximum crop boundaries; andevaluating the crop candidates using said one or more compositional rules.
- Apparatus for evaluating the composition of an image (1), the apparatus comprising:an image receiving element (9) arranged to convert an image into an electrical image signal; andan image processor (11) arranged to receive said electrical image signal and process it to evaluate its composition by applying one or more compositional rules, the image processor being further arranged to provide a report concerning composition of the image such that the user has an opportunity to recompose the image prior to recording the image;characterized in that the image processor (11) is arranged to process the said electrical image signal by:analysing the image represented by the signal to identify regions of interest within the image; andfor each identified region of interest, applying one or more composition rules to determine crop boundaries for the region;said report being indicative of the regions of interest and their compositionally-desirable crop boundaries.
- Apparatus according to claim 9, wherein said image processor (11) is arranged to perform its processing in response to receiving a composition evaluation request signal.
- Apparatus according to claim 10, wherein said composition evaluation request signal is generated in response to the actuation of a composition evaluation request switch (19) of the apparatus.
- Apparatus according to claim 11, wherein said composition evaluation request switch (19) comprises an image capture switch arranged to activate an image capture system to capture the image represented by the electrical image signal.
- Apparatus according to claim 9, wherein the apparatus is arranged to provide said report visually.
- Apparatus according to claim 13, wherein the apparatus is arranged to present said report on a view finder (5) of the apparatus.
- Apparatus according to claim 9, wherein the image processor (11) is arranged to analyze the acquired image by deriving a saliency map of the image on the basis of unusualness of one of the colour and brightness in the image.
- Apparatus according to claim 9, wherein the image processor (11) is arranged to provide the report by presenting, on an output device (5) of the apparatus, a suggested crop of the image as the compositionally-desirable boundaries of a region of interest.
- Apparatus according to claim 16, wherein said output device is a visual display (5).
- Apparatus according to claim 9, wherein the image processor (11), in analyzing the image, is arranged to identifies regions for exclusion; the image processor (11) being arranged to determine compositionally desirable boundaries for a said region of interest by:determining a minimum crop boundary around the region of interest;determining a maximum crop boundary around the region of interest, this maximum crop boundary excluding any region identified for exclusion but potentially including one or more other regions of interest;determining candidate crop boundaries between the minimum and maximum crop boundaries; andevaluating the crop candidates using said one or more compositional rules.
- A motion picture camera including apparatus according to any one of claims 9 to 18.
- A still picture camera including apparatus according to any one of claims 9 to 18.
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB0031423 | 2000-12-22 | ||
GB0031423A GB2370438A (en) | 2000-12-22 | 2000-12-22 | Automated image cropping using selected compositional rules. |
GB0116082 | 2001-06-30 | ||
GB0116082A GB0116082D0 (en) | 2001-06-30 | 2001-06-30 | Image composition evaluation |
PCT/GB2001/005675 WO2002052839A2 (en) | 2000-12-22 | 2001-12-20 | Image composition evaluation |
Publications (2)
Publication Number | Publication Date |
---|---|
EP1382017A2 EP1382017A2 (en) | 2004-01-21 |
EP1382017B1 true EP1382017B1 (en) | 2018-08-22 |
Family
ID=26245478
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP01272086.8A Expired - Lifetime EP1382017B1 (en) | 2000-12-22 | 2001-12-20 | Image composition evaluation |
Country Status (4)
Country | Link |
---|---|
US (1) | US7035477B2 (en) |
EP (1) | EP1382017B1 (en) |
JP (1) | JP2004526179A (en) |
WO (1) | WO2002052839A2 (en) |
Families Citing this family (43)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB2370438A (en) * | 2000-12-22 | 2002-06-26 | Hewlett Packard Co | Automated image cropping using selected compositional rules. |
GB0116113D0 (en) * | 2001-06-30 | 2001-08-22 | Hewlett Packard Co | Tilt correction of electronic images |
GB2378340A (en) * | 2001-07-31 | 2003-02-05 | Hewlett Packard Co | Generation of an image bounded by a frame or of overlapping images |
US7327890B2 (en) | 2002-12-20 | 2008-02-05 | Eastman Kodak Company | Imaging method and system for determining an area of importance in an archival image |
US7680342B2 (en) | 2004-08-16 | 2010-03-16 | Fotonation Vision Limited | Indoor/outdoor classification in digital images |
US7606417B2 (en) | 2004-08-16 | 2009-10-20 | Fotonation Vision Limited | Foreground/background segmentation in digital images with differential exposure calculations |
US20050030388A1 (en) * | 2003-08-06 | 2005-02-10 | Stavely Donald J. | System and method for improving image capture ability |
US7675528B2 (en) * | 2003-11-14 | 2010-03-09 | Vistaprint Technologies Limited | Image cropping system and method |
EP1544792A1 (en) * | 2003-12-18 | 2005-06-22 | Thomson Licensing S.A. | Device and method for creating a saliency map of an image |
US20060072847A1 (en) * | 2004-10-01 | 2006-04-06 | Microsoft Corporation | System for automatic image cropping based on image saliency |
JP4581084B2 (en) * | 2004-11-08 | 2010-11-17 | 国立大学法人横浜国立大学 | Image processing apparatus, image processing method, computer-readable recording medium storing program, and program |
US7609847B2 (en) * | 2004-11-23 | 2009-10-27 | Hewlett-Packard Development Company, L.P. | Methods and systems for determining object layouts |
KR100624473B1 (en) * | 2004-12-14 | 2006-09-18 | 삼성전자주식회사 | Image processing apparatus and method |
US7760956B2 (en) | 2005-05-12 | 2010-07-20 | Hewlett-Packard Development Company, L.P. | System and method for producing a page using frames of a video stream |
US7692696B2 (en) * | 2005-12-27 | 2010-04-06 | Fotonation Vision Limited | Digital image acquisition system with portrait mode |
EP1987436B1 (en) * | 2006-02-14 | 2015-12-09 | FotoNation Limited | Image blurring |
IES20060559A2 (en) * | 2006-02-14 | 2006-11-01 | Fotonation Vision Ltd | Automatic detection and correction of non-red flash eye defects |
IES20060564A2 (en) | 2006-05-03 | 2006-11-01 | Fotonation Vision Ltd | Improved foreground / background separation |
US20080101761A1 (en) * | 2006-10-31 | 2008-05-01 | Simon Widdowson | Weighted occlusion costing |
US8565552B2 (en) * | 2006-11-14 | 2013-10-22 | Codonics, Inc. | Assembling multiple medical images into a single film image |
US8132096B1 (en) * | 2007-03-26 | 2012-03-06 | Hewlett-Packard Development Company, L.P. | Image composition evaluation |
DE102008012307A1 (en) * | 2007-03-27 | 2008-10-02 | Denso Corp., Kariya | display device |
JP5432714B2 (en) * | 2007-08-03 | 2014-03-05 | 学校法人慶應義塾 | Composition analysis method, image apparatus having composition analysis function, composition analysis program, and computer-readable recording medium |
JP4894712B2 (en) * | 2007-10-17 | 2012-03-14 | ソニー株式会社 | Composition determination apparatus, composition determination method, and program |
JP5083138B2 (en) * | 2008-09-12 | 2012-11-28 | 株式会社ニコン | Image classification device |
US8180178B2 (en) * | 2009-06-24 | 2012-05-15 | Hewlett-Packard Development Company, L.P. | Autocropping and autolayout method for digital images |
CN102804759B (en) | 2009-06-24 | 2016-03-02 | 惠普开发有限公司 | Image albums creates |
US8478053B2 (en) * | 2009-07-15 | 2013-07-02 | Nikon Corporation | Image sorting apparatus |
JP5471130B2 (en) * | 2009-07-31 | 2014-04-16 | カシオ計算機株式会社 | Image processing apparatus and method |
US8923629B2 (en) | 2011-04-27 | 2014-12-30 | Hewlett-Packard Development Company, L.P. | System and method for determining co-occurrence groups of images |
US8849014B2 (en) * | 2011-09-06 | 2014-09-30 | Mediatek Inc. | Photographic system |
US9165187B2 (en) * | 2012-01-12 | 2015-10-20 | Kofax, Inc. | Systems and methods for mobile image capture and processing |
US11620733B2 (en) | 2013-03-13 | 2023-04-04 | Kofax, Inc. | Content-based object detection, 3D reconstruction, and data extraction from digital images |
US10783615B2 (en) | 2013-03-13 | 2020-09-22 | Kofax, Inc. | Content-based object detection, 3D reconstruction, and data extraction from digital images |
US9881202B2 (en) | 2013-05-14 | 2018-01-30 | Google Llc | Providing visual effects for images |
CN104346801B (en) * | 2013-08-02 | 2018-07-20 | 佳能株式会社 | Image composition apparatus for evaluating, information processing unit and its method |
US10019823B2 (en) * | 2013-10-24 | 2018-07-10 | Adobe Systems Incorporated | Combined composition and change-based models for image cropping |
US9299004B2 (en) | 2013-10-24 | 2016-03-29 | Adobe Systems Incorporated | Image foreground detection |
US9330334B2 (en) | 2013-10-24 | 2016-05-03 | Adobe Systems Incorporated | Iterative saliency map estimation |
US11062176B2 (en) | 2017-11-30 | 2021-07-13 | Kofax, Inc. | Object detection and image cropping using a multi-detector approach |
CN110288533B (en) * | 2019-07-02 | 2022-12-06 | 河北农业大学 | A Fast Stitching Method for Non-rotating Images |
CN111062861A (en) * | 2019-12-13 | 2020-04-24 | 广州市玄武无线科技股份有限公司 | Method and device for generating display image samples |
JP7609512B2 (en) | 2020-10-26 | 2025-01-07 | 株式会社Jvis | Automatic trimming program, automatic trimming device, and automatic trimming method |
Family Cites Families (39)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS5840172B2 (en) * | 1973-07-28 | 1983-09-03 | 沖電気工業株式会社 | camera |
US4317620A (en) | 1980-01-17 | 1982-03-02 | Polaroid Corporation | Variable proportional fill flash |
US4384336A (en) | 1980-08-29 | 1983-05-17 | Polaroid Corporation | Method and apparatus for lightness imaging |
GB2124055B (en) | 1982-07-09 | 1987-06-24 | Fuji Photo Film Co Ltd | Picture image input/output system |
US4423936A (en) | 1982-07-26 | 1984-01-03 | Polaroid Corporation | Photographic exposure control system and method |
US4541704A (en) | 1984-09-10 | 1985-09-17 | Polaroid Corporation | Photographic camera with electronic image enhancement |
US4605970A (en) | 1984-10-01 | 1986-08-12 | Tektronix, Inc. | Method and apparatus for calibrating an optical document digitizer |
US4724330A (en) | 1986-09-24 | 1988-02-09 | Xerox Corporation | Self aligning raster input scanner |
JPH0514271Y2 (en) | 1990-02-16 | 1993-04-16 | ||
EP0456414A3 (en) | 1990-05-11 | 1992-05-20 | Matsushita Electric Industrial Co., Ltd | Slant correction for a video camera apparatus |
US5091654A (en) | 1990-08-28 | 1992-02-25 | Xerox Corporation | Method of automatically setting document registration and locating calibration strip |
JP3139028B2 (en) * | 1991-02-04 | 2001-02-26 | 株式会社ニコン | Imaging equipment |
JPH0659430A (en) | 1992-08-07 | 1994-03-04 | Toppan Printing Co Ltd | Layout scanner and trimming and layout method |
JP3074967B2 (en) | 1992-10-27 | 2000-08-07 | 松下電器産業株式会社 | High dynamic range imaging / synthesis method and high dynamic range imaging apparatus |
EP0622722B1 (en) | 1993-04-30 | 2002-07-17 | Xerox Corporation | Interactive copying system |
US5384615A (en) * | 1993-06-08 | 1995-01-24 | Industrial Technology Research Institute | Ambient depth-of-field simulation exposuring method |
US5517242A (en) | 1993-06-29 | 1996-05-14 | Kabushiki Kaisha Toyota Chuo Kenkyusho | Image sensing device having expanded dynamic range |
JPH07219017A (en) * | 1994-02-04 | 1995-08-18 | Nikon Corp | Camera with warning function |
AU2601195A (en) | 1994-05-23 | 1995-12-18 | Out-Takes, Inc. | Seamless composite photography system |
US5666186A (en) * | 1994-07-20 | 1997-09-09 | Eastman Kodak Company | Text interface for still photographic camera with data recording capability |
JPH08336069A (en) * | 1995-04-13 | 1996-12-17 | Eastman Kodak Co | Electronic still camera |
GB9614837D0 (en) | 1996-07-12 | 1996-09-04 | Rank Xerox Ltd | Interactive desktop system with multiple image capture and display modes |
GB2350251B (en) | 1996-07-29 | 2001-01-03 | Eastman Kodak Co | A system for custom imprinting a variety of articles with images obtained from a variety of different sources |
US5978519A (en) | 1996-08-06 | 1999-11-02 | Xerox Corporation | Automatic image cropping |
US5973734A (en) | 1997-07-09 | 1999-10-26 | Flashpoint Technology, Inc. | Method and apparatus for correcting aspect ratio in a camera graphical user interface |
IL134557A0 (en) | 1997-08-28 | 2001-04-30 | Qualia Computing Inc | Method and system for automated detection of clustered microcalcifications from digital mammograms |
AUPO960197A0 (en) * | 1997-10-03 | 1997-10-30 | Canon Information Systems Research Australia Pty Ltd | Multi-media editing method and apparatus |
DE69822958T2 (en) | 1997-10-23 | 2005-03-10 | Olympus Corporation | Image recording device with means for expanding the dynamic range |
US5987270A (en) | 1997-12-17 | 1999-11-16 | Hewlett-Packard Company | Digital copying machine that automasks data for small originals if a document feeder is present |
US6310984B2 (en) | 1998-04-09 | 2001-10-30 | Hewlett-Packard Company | Image processing system with image cropping and skew correction |
WO2000038417A1 (en) | 1998-12-22 | 2000-06-29 | Koninklijke Philips Electronics N.V. | Method of and device for forming an image of an object from a plurality of images |
JP2000244800A (en) * | 1999-02-22 | 2000-09-08 | Victor Co Of Japan Ltd | Image pickup device |
JP2001148843A (en) * | 1999-11-18 | 2001-05-29 | Mega Chips Corp | Image pickup device and criminal prevention system using the image pickup device |
US6671405B1 (en) * | 1999-12-14 | 2003-12-30 | Eastman Kodak Company | Method for automatic assessment of emphasis and appeal in consumer images |
US6545743B1 (en) | 2000-05-22 | 2003-04-08 | Eastman Kodak Company | Producing an image of a portion of a photographic image onto a receiver using a digital image of the photographic image |
US6747690B2 (en) | 2000-07-11 | 2004-06-08 | Phase One A/S | Digital camera with integrated accelerometers |
US6654507B2 (en) | 2000-12-14 | 2003-11-25 | Eastman Kodak Company | Automatically producing an image of a portion of a photographic image |
GB2370438A (en) * | 2000-12-22 | 2002-06-26 | Hewlett Packard Co | Automated image cropping using selected compositional rules. |
GB2378340A (en) | 2001-07-31 | 2003-02-05 | Hewlett Packard Co | Generation of an image bounded by a frame or of overlapping images |
-
2001
- 2001-12-20 WO PCT/GB2001/005675 patent/WO2002052839A2/en active Application Filing
- 2001-12-20 EP EP01272086.8A patent/EP1382017B1/en not_active Expired - Lifetime
- 2001-12-20 US US10/169,835 patent/US7035477B2/en not_active Expired - Lifetime
- 2001-12-20 JP JP2002553817A patent/JP2004526179A/en active Pending
Non-Patent Citations (1)
Title |
---|
None * |
Also Published As
Publication number | Publication date |
---|---|
JP2004526179A (en) | 2004-08-26 |
WO2002052839A2 (en) | 2002-07-04 |
EP1382017A2 (en) | 2004-01-21 |
US7035477B2 (en) | 2006-04-25 |
US20020191860A1 (en) | 2002-12-19 |
WO2002052839A3 (en) | 2003-11-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP1382017B1 (en) | Image composition evaluation | |
US7133571B2 (en) | Automated cropping of electronic images | |
KR101199804B1 (en) | Image processing device and method | |
JP4862930B2 (en) | Image processing apparatus, image processing method, and program | |
JP5744437B2 (en) | TRACKING DEVICE, TRACKING METHOD, AND PROGRAM | |
US9344642B2 (en) | Method and apparatus for capturing a first image using a first configuration of a camera and capturing a second image using a second configuration of a camera | |
JP4611069B2 (en) | Device for selecting an image of a specific scene, program, and recording medium recording the program | |
JP5432714B2 (en) | Composition analysis method, image apparatus having composition analysis function, composition analysis program, and computer-readable recording medium | |
RU2466438C2 (en) | Method of simplifying focusing | |
US6950554B2 (en) | Learning type image classification apparatus, method thereof and processing recording medium on which processing program is recorded | |
US20070122034A1 (en) | Face detection in digital images | |
CN106604005B (en) | A kind of projection TV Atomatic focusing method and system | |
US20050200722A1 (en) | Image capturing apparatus, image capturing method, and machine readable medium storing thereon image capturing program | |
US20060036948A1 (en) | Image selection device and image selecting method | |
US20110222796A1 (en) | Image Processing Apparatus, Image Processing Method, Program, and Imaging Apparatus | |
KR101012729B1 (en) | String Extraction System Using Marker and Its Method | |
JP5287965B2 (en) | Image processing apparatus, image processing method, and program | |
CN117455792B (en) | Method for synthesizing and processing 360-degree panoramic image built-in vehicle | |
JP5499818B2 (en) | Image processing apparatus and method, and program | |
JPH1021390A (en) | Device for measuring occupancy ratio for specific face area | |
Hao | Improved sampling-based alpha matting in images and video | |
García-Valdecasas et al. | TWO-DIMENSIONAL IMAGING |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20030717 |
|
AK | Designated contracting states |
Kind code of ref document: A2 Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR |
|
AX | Request for extension of the european patent |
Extension state: AL LT LV MK RO SI |
|
RBV | Designated contracting states (corrected) |
Designated state(s): AT BE DE GB |
|
17Q | First examination report despatched |
Effective date: 20100122 |
|
RAP1 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: HEWLETT-PACKARD DEVELOPMENT COMPANY, L.P. |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R079 Ref document number: 60150923 Country of ref document: DE Free format text: PREVIOUS MAIN CLASS: G06T0007000000 Ipc: H04N0005232000 |
|
GRAJ | Information related to disapproval of communication of intention to grant by the applicant or resumption of examination proceedings by the epo deleted |
Free format text: ORIGINAL CODE: EPIDOSDIGR1 |
|
GRAP | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOSNIGR1 |
|
GRAP | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOSNIGR1 |
|
RIC1 | Information provided on ipc code assigned before grant |
Ipc: G06T 3/40 20060101ALI20180219BHEP Ipc: G06T 7/44 20170101ALI20180219BHEP Ipc: G06T 7/00 20170101ALI20180219BHEP Ipc: H04N 1/387 20060101ALI20180219BHEP Ipc: H04N 5/232 20060101AFI20180219BHEP |
|
RBV | Designated contracting states (corrected) |
Designated state(s): DE GB |
|
INTG | Intention to grant announced |
Effective date: 20180321 |
|
GRAS | Grant fee paid |
Free format text: ORIGINAL CODE: EPIDOSNIGR3 |
|
GRAA | (expected) grant |
Free format text: ORIGINAL CODE: 0009210 |
|
AK | Designated contracting states |
Kind code of ref document: B1 Designated state(s): DE GB |
|
REG | Reference to a national code |
Ref country code: GB Ref legal event code: FG4D |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R096 Ref document number: 60150923 Country of ref document: DE |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: DE Payment date: 20181126 Year of fee payment: 18 |
|
RAP2 | Party data changed (patent owner data changed or rights of a patent transferred) |
Owner name: HEWLETT-PACKARD DEVELOPMENT COMPANY, L.P. |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R097 Ref document number: 60150923 Country of ref document: DE |
|
PLBE | No opposition filed within time limit |
Free format text: ORIGINAL CODE: 0009261 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT |
|
26N | No opposition filed |
Effective date: 20190523 |
|
GBPC | Gb: european patent ceased through non-payment of renewal fee |
Effective date: 20181220 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: GB Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20181220 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R119 Ref document number: 60150923 Country of ref document: DE |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: DE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20200701 |