WO2015196462A1 - Method and device for displaying a video sequence - Google Patents
Method and device for displaying a video sequence Download PDFInfo
- Publication number
- WO2015196462A1 WO2015196462A1 PCT/CN2014/080979 CN2014080979W WO2015196462A1 WO 2015196462 A1 WO2015196462 A1 WO 2015196462A1 CN 2014080979 W CN2014080979 W CN 2014080979W WO 2015196462 A1 WO2015196462 A1 WO 2015196462A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- image
- video frame
- determining
- aspect ratio
- output
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 22
- 230000000873 masking effect Effects 0.000 claims abstract description 20
- 238000004590 computer program Methods 0.000 claims description 6
- 238000004891 communication Methods 0.000 claims description 4
- 238000010586 diagram Methods 0.000 description 8
- 230000008685 targeting Effects 0.000 description 7
- 230000008569 process Effects 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 238000007796 conventional method Methods 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 1
- 230000003467 diminishing effect Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/44—Receiver circuitry for the reception of television signals according to analogue transmission standards
- H04N5/46—Receiver circuitry for the reception of television signals according to analogue transmission standards for receiving on more than one standard at will
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/431—Generation of visual interfaces for content selection or interaction; Content or additional data rendering
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/44—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
- H04N21/4402—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
- H04N21/440263—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display by altering the spatial resolution, e.g. for displaying on a connected PDA
- H04N21/440272—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display by altering the spatial resolution, e.g. for displaying on a connected PDA for performing aspect ratio conversion
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/01—Conversion of standards, e.g. involving analogue television standards or digital television standards processed at pixel level
- H04N7/0117—Conversion of standards, e.g. involving analogue television standards or digital television standards processed at pixel level involving conversion of the spatial resolution of the incoming video signal
Definitions
- the present disclosure relates to image processing, and more particularly relates to a method and a device for displaying a video sequence.
- an image has an aspect ratio.
- the aspect ratio describes the proportional relationship between the image's width and the image's height, i.e. it is the ratio of width to height.
- the most common aspect ratios used today in the presentation of films in cinemas are 1.85: 1 and 2.39: 1.
- Two common videographic aspect ratios are 4:3 (1.33:1 ), which is the universal video format of the 20th century, and 16:9 (1.77:1 ), which is universal for high-definition television and European digital television.
- Other cinema and video aspect ratios exist, but are used infrequently.
- the most common aspect ratios are 4:3, 3:2, and more recently being found in consumer cameras 16:9.
- Other aspect ratios, such as 5:3, 5:4, and 1 : 1 (square format) are used in photography as well, particularly in medium format and large format.
- a method for displaying a video sequence comprising the steps of receiving (401 ) a video frame of the video sequence, wherein the video frame consists of an image and masking; determining (402) an image region of the image within the video frame of the video sequence; determining (403) an output aspect ratio; and generating (404) an output video frame based on the video frame, the image region, the output aspect ratio and a predefined value indicating a filling manner for filling the image into the output video frame.
- a device for displaying a video sequence comprising a storage; and a processor for receiving a video frame of the video sequence, wherein the video frame consists of an image and masking; determining an image region of the image within the video frame of the video sequence; determining an output aspect ratio; generating an output video frame based on the video frame, the image region, the output aspect ratio and a predefined value indicating a filling manner for filling the image into the output video frame, wherein the predefined value is stored in the storage.
- non- transitory computer-readable medium comprising a computer program product recorded thereon and capable of being run by a processor, including program code instructions for implementing the steps of above method.
- Fig. 1 is a diagram showing a 2.35:1 widescreen image letterboxed in a 1.33:1 format according to prior art
- Fig. 2 is a diagram showing an image is placed in a wider aspect ratio format according to prior art
- Fig. 3 is a diagram showing an image displayed on TV when the aspect ratio of a video frame having masking is different from aspect ratio of a display device according to prior art;
- Fig. 4 is a flowchart showing a method for displaying a video sequence according to an embodiment of the present invention
- Fig. 5 is a diagram showing an image displayed on TV according to the embodiment of the present invention.
- Fig. 6 is a block diagram showing a STB according to the embodiment of the present invention.
- the aspect ratio of image format for an original image is different from an aspect ratio of a targeting format (or called receiving format), e.g. the aspect ratio of the original image is 4:3 and aspect ratio of the targeting format is 16:9.
- the targeting format can be used for storage e.g. in a videotape or hard disk or for displaying on e.g. a TV). In this case, format conversion is necessary.
- converting formats of unequal ratios is achieved a) by enlarging the original image to fill the targeting format's display area and cutting off any excess picture information (zooming and cropping), b) by adding horizontal mattes (letterboxing) or vertical mattes (pillarboxing) to retain the original format's aspect ratio for the original image, c) by stretching (hence distorting) the original image to fill the targeting format's aspect ratio, and d) by scaling by different factors in both directions, possibly scaling by a different factor in the center and at the edges (as in Wide Zoom mode).
- Letterboxing is the practice of transferring a video sequence, e.g. a film shot, with a widescreen aspect ratio to standard-width video formats while preserving the video sequence's original aspect ratio.
- the resulting image in the resulting video sequence has mattes (also called black bars or masking) above and below the original image in the standard-width video formats, i.e. the mattes are on the top and bottom of the resulting image. These mattes are part of the resulting image.
- Fig. 1 is a diagram showing a 2.35:1 widescreen image letterboxed in a 1.33:1 format according to prior art.
- the rectangle in grid pattern is the original widescreen image and the rectangle including both the rectangle in grid pattern and mattes is the targeting format.
- Pillarbox occurs in widescreen video displays when mattes are placed on the sides of the original image. It becomes necessary when video or film that was not originally designed for widescreen is shown on a widescreen displaying device or stored in a widescreen format, or when the original image is displayed or stored in a wider aspect ratio. The original image is placed in the middle of the targeting format.
- Fig. 2 is a diagram showing an image is placed in a wider aspect ratio format according to prior art.
- a set top box (STB) has been processed with lettebox, pillarbox and the like. It means that there is masking around the original image, e.g. as shown in the Fig. 1 and Fig. 2.
- the STB can't display the original image (which doesn't have masking) in an appropriate aspect ratio on TV.
- a Standard Definition (SD) camera shots a video with aspect ratio of 4:3.
- the broadcast operator delivers the video in a High Definition channel with aspect ratio of 16:9.
- pillarbox is used and the format of video sequence is like what is shown in the Fig. 2.
- the video sequence is displayed in 16:9, the displayed video is like what is shown in the Fig. 2.
- the present disclosure provides a method and a device for display a video sequence.
- Fig. 4 is a flowchart showing a method for displaying a video sequence according to an embodiment of the present invention.
- the STB receives the video sequence. It is implemented by tuning to a channel, starting to receive video signal that conveys the video sequence and decoding the video signal to obtain the video sequence.
- a video frame in the video sequence is made up of the original image and masking around the original image, e.g. as shown in the Fig. 5.
- the STB determines image region of the original image in the video frame.
- the video frame is YUV picture in the buffer of the STB.
- the STB can obtain the size of the video frame, e.g. its width is a and height is b, i.e. it has b rows of pixels and a columns of pixels.
- the STB can obtain data of every pixel in the video frame.
- the value for masking in the video frame is set to a fixed value. The fixed value may vary from implementations, e.g. different chipsets for image processing may use different values.
- n1 the STB scans horizontally from top to bottom. For n2, it scans horizontally from bottom to top. For ml , it scans vertically from left to right. And for m2, it scans horizontally from right to left. For any scanning for n1 , n2, ml and m2, if pixels in the scanning row have the same value, then it's determined that the scanning row belongs to masking and the scanning continues to next row or column. If they don't have same value, the scanning process stops.
- the STB determines the masking and determines that the real image region starts at n1 row and ml column and ends at (b-n2) row and (a-m2) column, and the size of the image region is (a-m1 -m2, b-n1 -n2).
- the STB repeats the scanning process for a predetermined number of video frames, e.g. 30 frames to obtain an average values for ml , m2, n1 , n2.
- the STB determines an output aspect ratio.
- the output aspect ratio can be an input from the user, a predefined value by the user or automatically detected by the STB based on communication protocol between the STB and a displaying device.
- the STB In the step 404, the STB generates an output video frame based on the original image, the output aspect ratio and a predefined filling parameter value.
- the original image is determined by the video frame and the determined image region.
- the predefined filling parameter indicates a filling manner in which the original image is fitted into the output video frame.
- Values of filling parameter comprises a) scaling by a same factor (i.e. enlarging or diminishing in a linear manner) the original image to fill entire output video frame and cutting off excess portion of the enlarged image; b) scaling by a same factor the original image so as to make one pair of edges of the original image (e.g.
- the value of the predefined filling parameter is selected by the user from above 4 values and stored in the storage of the STB.
- the user can use remote to set the predefined filling parameter value in a STB menu.
- the STB converts the original image from its original aspect ratio to the output aspect ratio by using a filling manner indicated by the predefined filling parameter value.
- steps 401 to 404 are performed before the STB starts to display the output video frame to the users. In a variant of the embodiment, it is performed in a real time manner, i.e. before the STB determines the image region of the original image, the STB displays the video frame of the video sequence to the user, and after the STB determines the image region, the STB displays the output video frame with original image to the user.
- the STB selects one value close to aspect ratio of the image region from 4:3, 16:9 and 14:9, and uses the selected one as a verified aspect ratio for the image region. So it provides compatibility to existing filling functions among different aspect ratios.
- Fig. 6 is a block diagram showing a STB according to the embodiment of the present invention.
- the STB comprises a tuner, a decoder, a processor and a storage.
- the tuner receives a digital broadcasting signal to generate a digital data stream conveying a video sequence, and transmits the digital data stream to the decoder.
- the decoder e.g. MPEG-2 decoder, decodes the digital data stream to obtain the video sequence.
- the obtained video sequence can be either digital signal or analog signal depending on the acceptability of signal to the displaying device, e.g. TV or monitor. If TV only accepts analog signal, then a digital to analog converter shall be added and used.
- the storage is used to store the predefined filling parameter value.
- the buffer (not shown in the Fig. 5) is used to store temporary data, such as video frames of the video sequence, output video frames etc.
- the processor is used for receiving a video frame of the video sequence, wherein the video frame consists of an image and masking; determining an image region of the image within the video frame of the video sequence; determining an output aspect ratio; generating an output video frame based on the video frame, the image region, the output aspect ratio and a predefined filling parameter value indicating a filling manner for filling the image into the output video frame, wherein the predefined filling parameter value is stored in the storage.
- the processor is further used for receiving a predefined number of video frames; determining image regions for each video frames; and determining average value of the image regions of the predefined number of video frames as a common image region.
- the processor is further used for determining a value closed to aspect ratio of the image region from 4:3, 16:9 and 14:9 as a verified aspect ratio of the image region.
- the processor is further used for performing in the video frame scanning horizontally from top to bottom, scanning horizontally from bottom to top, scanning vertically from left to right and scanning vertically from right to left; during each scanning, if pixels in a row or column have a same value determining the row or the column belongs to masking; and determining the image region.
- the processor is further used for receiving an input selectable from a) scaling by a same factor the image to fill entire output video frame and cutting off excess portion of the enlarged image; b) scaling by a same factor the image so as to make one pair of edges of the image overlap with one pair of edges of the output video frame while keeping the other pair of edges of the image within the output video frame; c) stretching the image to fill entire output video frame; and d) scaling by different factors in the center and at the edges of the image to fill entire output video frame; and storing the input as the predefined filling parameter value in the storage.
- STB can be used for a player to play a video sequence stored on a storage device, e.g. local hard disk or remote hard disk or to play a VOD (Video on Demand) video as long as the video frame consists of an original image (e.g. shot image by a camera) and masking.
- a storage device e.g. local hard disk or remote hard disk
- VOD Video on Demand
- the tuner is not necessary.
- non- transitory computer-readable medium comprising a computer program product recorded thereon and capable of being run by a processor, including program code instructions for implementing the steps of above method.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
Abstract
It is provided a method for displaying a video sequence,comprising the steps of receiving (401) a video frame of the video sequence,wherein the video frame consists of an image and masking; determining (402) an image region of the image within the video frame of the video sequence; determining (403) an output aspect ratio; and generating (404) an output video frame based on the video frame,the image region,the output aspect ratio and a predefined value indicating a filling manner for filling the image into the output video frame.
Description
METHOD AND DEVICE FOR DISPLAYING A VIDEO SEQUENCE
TECHNICAL FIELD
The present disclosure relates to image processing, and more particularly relates to a method and a device for displaying a video sequence.
BACKGROUND
In technical field of image processing, an image has an aspect ratio. The aspect ratio describes the proportional relationship between the image's width and the image's height, i.e. it is the ratio of width to height.
The most common aspect ratios used today in the presentation of films in cinemas are 1.85: 1 and 2.39: 1. Two common videographic aspect ratios are 4:3 (1.33:1 ), which is the universal video format of the 20th century, and 16:9 (1.77:1 ), which is universal for high-definition television and European digital television. Other cinema and video aspect ratios exist, but are used infrequently. In still camera photography, the most common aspect ratios are 4:3, 3:2, and more recently being found in consumer cameras 16:9. Other aspect ratios, such as 5:3, 5:4, and 1 : 1 (square format), are used in photography as well, particularly in medium format and large format.
SUMMARY
According to an aspect of the present disclosure, it is provided a method for displaying a video sequence, comprising the steps of receiving (401 ) a video frame of the video sequence, wherein the video frame consists of an image and masking; determining (402) an image region of the image within the video frame of the video sequence; determining (403) an output aspect ratio; and generating (404) an output video frame based on the video frame, the image region, the output aspect ratio and a predefined value indicating a filling manner for filling the image into the output video frame.
According to an aspect of the present disclosure, it is provided a device for displaying a video sequence, comprising a storage; and a processor for receiving a video frame of the video sequence, wherein the video frame consists of an image and masking; determining an image region of the image within the video frame of the video sequence; determining an output aspect ratio; generating an output video frame based on the video frame, the image region, the output aspect ratio and a predefined value indicating a filling manner for filling the image into the output video frame, wherein the predefined value is stored in the storage.
According to an aspect of the present disclosure, it is provided computer program product downloadable from a communication network and/or recorded on a medium readable by computer and/or executable by a processor, comprising program code instructions for implementing the steps of above method.
According to an aspect of the present disclosure, it is provided non- transitory computer-readable medium comprising a computer program product recorded thereon and capable of being run by a processor, including program code instructions for implementing the steps of above method.
It is to be understood that more aspects and advantages of the disclosure will be found in the following detailed description of the present invention.
BRIEF DESCRIPTION OF THE DRAWINGS
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this application, will be used to illustrate an embodiment of the invention, as explained by the description. The invention is not limited to the embodiment.
In the drawings:
Fig. 1 is a diagram showing a 2.35:1 widescreen image letterboxed in a 1.33:1 format according to prior art;
Fig. 2 is a diagram showing an image is placed in a wider aspect ratio format according to prior art;
Fig. 3 is a diagram showing an image displayed on TV when the aspect ratio of a video frame having masking is different from aspect ratio of a display device according to prior art;
Fig. 4 is a flowchart showing a method for displaying a video sequence according to an embodiment of the present invention;
Fig. 5 is a diagram showing an image displayed on TV according to the embodiment of the present invention; and
Fig. 6 is a block diagram showing a STB according to the embodiment of the present invention.
DETAILED DESCRIPTION
Sometimes, the aspect ratio of image format for an original image is different from an aspect ratio of a targeting format (or called receiving format), e.g. the aspect ratio of the original image is 4:3 and aspect ratio of the targeting format is 16:9. (the targeting format can be used for storage e.g. in a videotape or hard disk or for displaying on e.g. a TV). In this case, format conversion is necessary. In conventional methods, converting formats of unequal ratios is achieved a) by enlarging the original image to fill the targeting format's display area and cutting off any excess picture information (zooming and cropping), b) by adding horizontal mattes (letterboxing) or vertical mattes (pillarboxing) to retain the original format's aspect ratio for the original image, c) by stretching (hence distorting) the original image to fill the targeting format's aspect ratio, and d) by scaling by different factors in both directions, possibly scaling by a different factor in the center and at the edges (as in Wide Zoom mode).
As to above conventional method b), below gives some description about letterboxing and pillarboxing.
Letterboxing is the practice of transferring a video sequence, e.g. a film shot, with a widescreen aspect ratio to standard-width video formats while preserving
the video sequence's original aspect ratio. The resulting image in the resulting video sequence has mattes (also called black bars or masking) above and below the original image in the standard-width video formats, i.e. the mattes are on the top and bottom of the resulting image. These mattes are part of the resulting image. Fig. 1 is a diagram showing a 2.35:1 widescreen image letterboxed in a 1.33:1 format according to prior art. The rectangle in grid pattern is the original widescreen image and the rectangle including both the rectangle in grid pattern and mattes is the targeting format.
Pillarbox occurs in widescreen video displays when mattes are placed on the sides of the original image. It becomes necessary when video or film that was not originally designed for widescreen is shown on a widescreen displaying device or stored in a widescreen format, or when the original image is displayed or stored in a wider aspect ratio. The original image is placed in the middle of the targeting format. Fig. 2 is a diagram showing an image is placed in a wider aspect ratio format according to prior art.
Sometimes, video sequences received by a set top box (STB) have been processed with lettebox, pillarbox and the like. It means that there is masking around the original image, e.g. as shown in the Fig. 1 and Fig. 2. In this case, the STB can't display the original image (which doesn't have masking) in an appropriate aspect ratio on TV. For example, a Standard Definition (SD) camera shots a video with aspect ratio of 4:3. The broadcast operator delivers the video in a High Definition channel with aspect ratio of 16:9. In this case, pillarbox is used and the format of video sequence is like what is shown in the Fig. 2. At the user end, if the video sequence is displayed in 16:9, the displayed video is like what is shown in the Fig. 2. But if the video sequence is displayed in 4:3 on TV, it is like what is shown in the Fig. 3. It affects user experience when there is masking around the original image. Similarly, if a HD video is delivered in a SD channel by broadcast operator and the user device displays the received video with aspect ratio of 16:9, the video displayed on TV is like what is shown in the
Fig. 3. In order to solve it, the present disclosure provides a method and a device for display a video sequence.
The embodiment of the present invention will now be described in detail in conjunction with the drawings. In the following description, some detailed descriptions of known functions and configurations may be omitted for clarity and conciseness.
Fig. 4 is a flowchart showing a method for displaying a video sequence according to an embodiment of the present invention.
In the step 401 , the STB receives the video sequence. It is implemented by tuning to a channel, starting to receive video signal that conveys the video sequence and decoding the video signal to obtain the video sequence. A video frame in the video sequence is made up of the original image and masking around the original image, e.g. as shown in the Fig. 5.
In the step 402, the STB determines image region of the original image in the video frame. Below gives an example for obtaining image region of the original image. In the example, the video frame is YUV picture in the buffer of the STB. The STB can obtain the size of the video frame, e.g. its width is a and height is b, i.e. it has b rows of pixels and a columns of pixels. Besides, the STB can obtain data of every pixel in the video frame. Generally, the value for masking in the video frame is set to a fixed value. The fixed value may vary from implementations, e.g. different chipsets for image processing may use different values. If pixels in a row (or column) have the same value, then it is very likely that the row (or column) belongs to masking. In order to obtain n1 , the STB scans horizontally from top to bottom. For n2, it scans horizontally from bottom to top. For ml , it scans vertically from left to right. And for m2, it scans horizontally from right to left. For any scanning for n1 , n2, ml and m2, if pixels in the scanning row have the same value, then it's determined that the scanning row belongs to masking and the scanning continues to next row or column. If they don't have same value, the scanning process stops. In this way, the STB determines the masking and determines that the real image region starts at n1
row and ml column and ends at (b-n2) row and (a-m2) column, and the size of the image region is (a-m1 -m2, b-n1 -n2).
According to a variant, the STB repeats the scanning process for a predetermined number of video frames, e.g. 30 frames to obtain an average values for ml , m2, n1 , n2.
In the step 403, the STB determines an output aspect ratio. The output aspect ratio can be an input from the user, a predefined value by the user or automatically detected by the STB based on communication protocol between the STB and a displaying device.
In the step 404, the STB generates an output video frame based on the original image, the output aspect ratio and a predefined filling parameter value. The original image is determined by the video frame and the determined image region. The predefined filling parameter indicates a filling manner in which the original image is fitted into the output video frame. Values of filling parameter comprises a) scaling by a same factor (i.e. enlarging or diminishing in a linear manner) the original image to fill entire output video frame and cutting off excess portion of the enlarged image; b) scaling by a same factor the original image so as to make one pair of edges of the original image (e.g. left and right edges, or top and bottom edges) overlap with one pair of edges of the output video frame while keeping the other pair of edges of the original image within the output video frame; c) stretching the original image to fill the entire output video frame; and d) scaling by different factors in the center and at the edges of the original image to fill entire output video frame. The value of the predefined filling parameter is selected by the user from above 4 values and stored in the storage of the STB. The user can use remote to set the predefined filling parameter value in a STB menu. In other words, in this step the STB converts the original image from its original aspect ratio to the output aspect ratio by using a filling manner indicated by the predefined filling parameter value.
It shall note that steps 401 to 404 are performed before the STB starts to display the output video frame to the users. In a variant of the embodiment, it is
performed in a real time manner, i.e. before the STB determines the image region of the original image, the STB displays the video frame of the video sequence to the user, and after the STB determines the image region, the STB displays the output video frame with original image to the user.
According to a variant of the embodiment, the STB selects one value close to aspect ratio of the image region from 4:3, 16:9 and 14:9, and uses the selected one as a verified aspect ratio for the image region. So it provides compatibility to existing filling functions among different aspect ratios.
Fig. 6 is a block diagram showing a STB according to the embodiment of the present invention.
The STB comprises a tuner, a decoder, a processor and a storage.
The tuner receives a digital broadcasting signal to generate a digital data stream conveying a video sequence, and transmits the digital data stream to the decoder.
The decoder, e.g. MPEG-2 decoder, decodes the digital data stream to obtain the video sequence. The obtained video sequence can be either digital signal or analog signal depending on the acceptability of signal to the displaying device, e.g. TV or monitor. If TV only accepts analog signal, then a digital to analog converter shall be added and used.
The storage is used to store the predefined filling parameter value.
The buffer (not shown in the Fig. 5) is used to store temporary data, such as video frames of the video sequence, output video frames etc.
The processor is used for receiving a video frame of the video sequence, wherein the video frame consists of an image and masking; determining an image region of the image within the video frame of the video sequence; determining an output aspect ratio; generating an output video frame based on the video frame, the image region, the output aspect ratio and a predefined filling parameter value indicating a filling manner for filling the image into the output video frame, wherein the predefined filling parameter value is stored in the storage.
According to a variant, the processor is further used for receiving a predefined number of video frames; determining image regions for each video frames; and determining average value of the image regions of the predefined number of video frames as a common image region.
According to a variant, the processor is further used for determining a value closed to aspect ratio of the image region from 4:3, 16:9 and 14:9 as a verified aspect ratio of the image region.
According to a variant, the processor is further used for performing in the video frame scanning horizontally from top to bottom, scanning horizontally from bottom to top, scanning vertically from left to right and scanning vertically from right to left; during each scanning, if pixels in a row or column have a same value determining the row or the column belongs to masking; and determining the image region.
According to a variant, the processor is further used for receiving an input selectable from a) scaling by a same factor the image to fill entire output video frame and cutting off excess portion of the enlarged image; b) scaling by a same factor the image so as to make one pair of edges of the image overlap with one pair of edges of the output video frame while keeping the other pair of edges of the image within the output video frame; c) stretching the image to fill entire output video frame; and d) scaling by different factors in the center and at the edges of the image to fill entire output video frame; and storing the input as the predefined filling parameter value in the storage.
According to a variant of the embodiment, it is not limited to STB, can be used for a player to play a video sequence stored on a storage device, e.g. local hard disk or remote hard disk or to play a VOD (Video on Demand) video as long as the video frame consists of an original image (e.g. shot image by a camera) and masking. In this case, the tuner is not necessary.
According to the embodiment of the present invention, it is provided computer program product downloadable from a communication network and/or recorded on a medium readable by computer and/or executable by a processor,
comprising program code instructions for implementing the steps of above method.
According to the embodiment of the present invention, it is provided non- transitory computer-readable medium comprising a computer program product recorded thereon and capable of being run by a processor, including program code instructions for implementing the steps of above method.
A number of implementations have been described. Nevertheless, it will be understood that various modifications may be made. For example, elements of different implementations may be combined, supplemented, modified, or removed to produce other implementations. Additionally, one of ordinary skill will understand that other structures and processes may be substituted for those disclosed and the resulting implementations will perform at least substantially the same function(s), in at least substantially the same way(s), to achieve at least substantially the same result(s) as the implementations disclosed. Accordingly, these and other implementations are contemplated by this application and are within the scope of the invention as defined by the appended claims.
Claims
1. A method for displaying a video sequence, comprising the steps of:
receiving (401 ) a video frame of the video sequence, wherein the video frame consists of an image and masking;
determining (402) an image region of the image within the video frame of the video sequence;
determining (403) an output aspect ratio; and
generating (404) an output video frame based on the video frame, the image region, the output aspect ratio and a predefined value indicating a filling manner for filling the image into the output video frame.
2. The method of the claim 1 , wherein
receiving a predefined number of video frames;
determining image regions for each video frames; and
determining average value of the image regions of the predefined number of video frames as a common image region.
3. The method of the claim 1 , wherein
determining a value closed to aspect ratio of the image region from 4:3, 16:9 and 14:9 as a verified aspect ratio of the image region.
4. The method of the claim 1 , wherein the step of determining image region further comprises:
performing in the video frame scanning horizontally from top to bottom, scanning horizontally from bottom to top, scanning vertically from left to right and scanning vertically from right to left;
during each scanning, if pixels in a row or column have a same value determining the row or the column belongs to masking; and
determining the image region.
5. The method of the claim 1 , wherein
receiving an input for the predefine value selectable from a) scaling by a same factor the image to fill entire output video frame and cutting off excess portion of the enlarged image; b) scaling by a same factor the image so as to make one pair of edges of the image overlap with one pair of edges of the output video frame while keeping the other pair of edges of the image within the output video frame; c) stretching the image to fill entire output video frame; and d) scaling by different factors in the center and at the edges of the image to fill entire output video frame.
6. A device for displaying a video sequence, comprising
a storage; and
a processor for receiving a video frame of the video sequence, wherein the video frame consists of an image and masking; determining an image region of the image within the video frame of the video sequence;
determining an output aspect ratio; generating an output video frame based on the video frame, the image region, the output aspect ratio and a predefined value indicating a filling manner for filling the image into the output video frame, wherein the predefined value is stored in the storage.
7. The device of the claim 6, wherein the processor is further used for
receiving a predefined number of video frames;
determining image regions for each video frames; and
determining average value of the image regions of the predefined number of video frames as a common image region.
8. The device of the claim 6, wherein the processor is further used for determining a value closed to aspect ratio of the image region from 4:3, 16:9 and 14:9 as a verified aspect ratio of the image region.
9. The device of the claim 6, wherein the processor is further used for
performing in the video frame scanning horizontally from top to bottom, scanning horizontally from bottom to top, scanning vertically from left to right and scanning vertically from right to left;
during each scanning, if pixels in a row or column have a same value determining the row or the column belongs to masking; and
determining the image region.
10. The device of the claim 6, wherein the processor is further used for
receiving an input selectable from a) scaling by a same factor the image to fill entire output video frame and cutting off excess portion of the enlarged image; b) scaling by a same factor the image so as to make one pair of edges of the image overlap with one pair of edges of the output video frame while keeping the other pair of edges of the image within the output video frame; c) stretching the image to fill entire output video frame; and d) scaling by different factors in the center and at the edges of the image to fill entire output video frame; and
storing the input as the predefined value in the storage.
1 1. Computer program product downloadable from a communication network and/or recorded on a medium readable by computer and/or executable by a processor, comprising program code instructions for implementing the steps of a method according to at least one of claims 1 to 5.
12. Non-transitory computer-readable medium comprising a computer program product recorded thereon and capable of being run by a processor, including program code instructions for implementing the steps of a method according to at least one of claims 1 to 5.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2014/080979 WO2015196462A1 (en) | 2014-06-27 | 2014-06-27 | Method and device for displaying a video sequence |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2014/080979 WO2015196462A1 (en) | 2014-06-27 | 2014-06-27 | Method and device for displaying a video sequence |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2015196462A1 true WO2015196462A1 (en) | 2015-12-30 |
Family
ID=54936523
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2014/080979 WO2015196462A1 (en) | 2014-06-27 | 2014-06-27 | Method and device for displaying a video sequence |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2015196462A1 (en) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4962521A (en) * | 1987-12-17 | 1990-10-09 | Mitsubishi Denki Kabushiki Kaisha | Apparatus for still picture video telephone apparatus and methods for transmitting and displaying still picture image for use therein |
US5208669A (en) * | 1991-01-08 | 1993-05-04 | Sony Broadcast & Communications Limited | Up-conversion of a video signal format with expansion of active portion of video field |
US20080084503A1 (en) * | 2006-10-10 | 2008-04-10 | Sony Corporation | Apparatus, method, and computer program for processing image |
CN102332096A (en) * | 2011-10-17 | 2012-01-25 | 中国科学院自动化研究所 | A method of video subtitle text extraction and recognition |
CN102523390A (en) * | 2011-12-08 | 2012-06-27 | 成都东方盛行电子有限责任公司 | Automatic aspect ratio control method based on AFD (automatic feeding device) |
-
2014
- 2014-06-27 WO PCT/CN2014/080979 patent/WO2015196462A1/en active Application Filing
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4962521A (en) * | 1987-12-17 | 1990-10-09 | Mitsubishi Denki Kabushiki Kaisha | Apparatus for still picture video telephone apparatus and methods for transmitting and displaying still picture image for use therein |
US5208669A (en) * | 1991-01-08 | 1993-05-04 | Sony Broadcast & Communications Limited | Up-conversion of a video signal format with expansion of active portion of video field |
US20080084503A1 (en) * | 2006-10-10 | 2008-04-10 | Sony Corporation | Apparatus, method, and computer program for processing image |
CN102332096A (en) * | 2011-10-17 | 2012-01-25 | 中国科学院自动化研究所 | A method of video subtitle text extraction and recognition |
CN102523390A (en) * | 2011-12-08 | 2012-06-27 | 成都东方盛行电子有限责任公司 | Automatic aspect ratio control method based on AFD (automatic feeding device) |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
TWI544450B (en) | Scale-independent maps | |
US8139081B1 (en) | Method for conversion between YUV 4:4:4 and YUV 4:2:0 | |
CN108063976B (en) | Video processing method and device | |
US11509858B2 (en) | Automatic program formatting for TV displays | |
CN105230033B (en) | It is converted using adaptive frequency intensity-controlled to create the details in image | |
US10504263B2 (en) | Adaptive high dynamic range (HDR) tone mapping with overlay indication | |
US20110262055A1 (en) | Image scaling curve generation | |
US20160316274A1 (en) | Method and system for switching video playback resolution | |
KR102197579B1 (en) | Creating details in an image with frequency lifting | |
US9066025B2 (en) | Control of frequency lifting super-resolution with image features | |
US20110175988A1 (en) | 3d video graphics overlay | |
US9471958B2 (en) | Image processing method and apparatus | |
US8873642B2 (en) | Video content analysis methods and systems | |
US9161030B1 (en) | Graphics overlay system for multiple displays using compressed video | |
US7129992B2 (en) | Method and system for video display with automatic reframing | |
US9053752B1 (en) | Architecture for multiple graphics planes | |
US8483389B1 (en) | Graphics overlay system for multiple displays using compressed video | |
US20130293549A1 (en) | Method for partitioning and processing a digital image | |
WO2015196462A1 (en) | Method and device for displaying a video sequence | |
EP1848203A1 (en) | Method and system for video image aspect ratio conversion | |
EP2239934A1 (en) | Image apparatus for determining type of image data and method for processing image data applicable thereto | |
US20230039717A1 (en) | Automatic program formatting for tv displays | |
JP7613085B2 (en) | Image capture device and image capture processing method | |
WO2022172617A1 (en) | Imaging device and imaging processing method | |
JP2010093856A (en) | Image processing apparatus and method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 14895836 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 14895836 Country of ref document: EP Kind code of ref document: A1 |