CN102014267B - Subtitle area detection method - Google Patents
- Publication number
- CN102014267B (application number CN 200910173122)
- Authority
- CN
- China
- Prior art keywords
- subtitle
- pixels
- pixel
- scan line
- target pixel
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Landscapes
- Television Systems (AREA)
Abstract
Description
Technical Field
The present invention relates to de-interlacing conversion, and in particular to a subtitle area detection method used to perform de-interlacing.
Background
Television broadcast video signals generally use an interlaced format, such as NTSC, PAL, or SECAM, in which odd fields and even fields are displayed alternately on the television screen, relying on persistence of vision to present the video content of a frame. An interlaced video signal can deliver acceptable video quality using only a low bandwidth; its drawbacks, however, are reduced vertical resolution and flicker of lines or regions. The video signal of a typical computer monitor instead uses a non-interlaced (progressive) format, which displays the video content of a frame directly on the monitor.
For an interlaced video signal to be displayed on a progressive-format display (for example, a computer monitor), it must first be converted into a non-interlaced (progressive) video signal. This format conversion is called de-interlacing, or double-rate progressive conversion. De-interlacing conversion combines the original odd field and even field to produce a frame.
De-interlacing conversion methods for video signals fall into two categories: spatial conversion and temporal conversion. Spatial conversion uses only pixels of the same field to generate new pixels, so it is also called intra-field conversion. Temporal conversion uses pixels of adjacent, different fields to generate new pixels, so it is also called inter-field conversion. Usually, static areas are de-interlaced with temporal (inter-field) interpolation, while moving areas are de-interlaced with spatial (intra-field) interpolation.
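The choice between the two interpolation families can be illustrated with a minimal sketch. The specific interpolators used here (copying the co-located pixel of the adjacent field for the temporal case, averaging the vertical neighbors for the spatial case) and the function name `interpolate_missing_pixel` are assumptions made for illustration; the description does not specify which interpolators are used.

```python
def interpolate_missing_pixel(above, below, adjacent_field_pixel, is_static):
    """Return a luminance value for a missing pixel of the current field.

    above, below         -- vertical neighbors in the same field
    adjacent_field_pixel -- co-located pixel in the adjacent field
    is_static            -- motion-detection result for this pixel
    """
    if is_static:
        # Temporal (inter-field): reuse the co-located pixel of the adjacent field.
        return adjacent_field_pixel
    # Spatial (intra-field): average the vertical neighbors in the same field.
    return (above + below) // 2
```

A static pixel keeps full vertical detail because it is taken from the adjacent field, while a moving pixel avoids combing artifacts at the cost of vertical resolution.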
Motion detection is usually judged by the difference between corresponding pixels of same-parity fields: a large difference indicates motion, and a small difference indicates no motion. However, if the background of a subtitle area is moving, the subtitle area may be treated as a moving area during de-interlacing, which causes the subtitle area to flicker.
Because conventional techniques cannot correctly identify the subtitle area and therefore cause it to flicker, a novel motion detection and de-interlacing mechanism is needed that correctly detects the subtitle area to support de-interlacing.
Summary of the Invention
In view of the above, an embodiment of the present invention provides a subtitle area detection method that correctly detects the subtitle area for use in de-interlacing, thereby reducing flicker in the subtitle area.
According to an embodiment of the invention, subtitle pixel detection is first performed on each target pixel of the current scan line to determine whether each pixel of the current scan line is a subtitle pixel. The number of subtitle pixels in the previous frame (spf) is counted; when it is greater than a preset value, the subtitle pixels of the current scan line are set as static pixels. The number of subtitle pixels in the previous scan line of the current frame (spl) is counted; when it is greater than a preset value, some of the non-static pixels of the current scan line are adjusted to static pixels based on the previous frame and the next frame.
According to one embodiment, the subtitle pixel detection step comprises the following steps. First, motion detection is performed on the target pixel. The luminance values of the pixels adjacent to the target pixel are examined to determine whether an especially large or especially small luminance value is present. Next, it is checked whether the pixels adjacent to the target pixel's position in the previous frame are in the static state. Finally, it is detected whether the target pixel lies on a subtitle boundary.
Brief Description of the Drawings
FIG. 1 illustrates some of the scan lines of three consecutive frames.
FIG. 2 shows a flowchart of the subtitle pixel detection method.
FIG. 3 shows a flowchart of the subtitle area detection method according to an embodiment of the present invention.
FIG. 4 illustrates a scan line containing subtitles and the corresponding scan lines of the previous and next frames.
[Description of Main Reference Symbols]
21-24: subtitle pixel detection steps
31-38: subtitle area detection steps
PA, PC, PE: scan lines of the previous frame
B, D: scan lines of the current frame
NA, NC, NE: scan lines of the next frame
pa, pb, pc, pd, pe: pixels of the previous frame
b, d: pixels of the current frame
na, nc, ne: pixels of the next frame
Detailed Description
FIG. 1 illustrates some of the scan lines of three consecutive frames (that is, the previous frame, the current frame, and the next frame). The previous frame and the next frame are odd fields, and the current frame is an even field. The previous frame shows scan lines PA, PC, and PE; the current frame shows scan lines B and D; and the next frame shows scan lines NA, NC, and NE.
FIG. 2 shows a flowchart of the subtitle pixel detection method, which examines the current pixel t to be de-interlaced (also called the target pixel) to decide whether it may be a subtitle pixel. Although this embodiment examines the target pixel t with steps 21 through 24 in sequence, the order of these steps may be changed. Moreover, some steps may be omitted, and additional steps may be added.
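First, step 21 performs motion detection on the target pixel t. Subtitles generally remain on screen for several seconds so that viewers can read them. Therefore, when motion detection is performed on a subtitle region, the difference between at least two same-parity fields is especially small. In this embodiment, the same-parity field difference is obtained by measuring the sum of absolute differences (SAD) between a plurality of pixels of the previous frame corresponding to the target pixel t (for example, the pixels in a window covering pixel pc) and a plurality of pixels of the next frame (for example, the pixels in a window covering pixel nc). The SAD value can be expressed as:

$$\mathrm{SAD} = \sum_{i=1}^{k} \lvert nc_i - pc_i \rvert$$

where $nc_i$ denotes a pixel on scan line NC of the next frame, $pc_i$ denotes a pixel on scan line PC of the previous frame, and $k$ denotes the number of pixels in the window.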
If the SAD is less than a preset value T, the state of the target pixel t is marked as static and the subsequent steps continue; if the SAD is greater than the preset value T, the state is marked as motion, the flow of FIG. 2 ends, and the target pixel t is determined not to be a subtitle pixel.
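Step 21 can be sketched as follows. This is a minimal illustration that assumes each scan line is a list of luminance values and that the window is centered on the target column; the names `window_sad` and `is_static`, the `half_width` parameter, and the clipping of the window at the line ends are assumptions, not details taken from the description.

```python
def window_sad(prev_line, next_line, center, half_width=2):
    """Sum of absolute differences between scan line PC (previous frame)
    and scan line NC (next frame) over a window around column `center`."""
    lo = max(0, center - half_width)                    # clip at the left edge
    hi = min(len(prev_line), center + half_width + 1)   # clip at the right edge
    return sum(abs(n - p) for p, n in zip(prev_line[lo:hi], next_line[lo:hi]))


def is_static(prev_line, next_line, center, threshold, half_width=2):
    """Step 21: the target pixel is marked static when SAD < T."""
    return window_sad(prev_line, next_line, center, half_width) < threshold
```

For example, with `prev_line = [10, 10, 10, 10, 10]` and `next_line = [10, 12, 10, 9, 10]`, the SAD over the five-pixel window centered on column 2 is 3, so the pixel is static for T = 4 but not for T = 3.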
Next, step 22 examines the luminance values of pixels adjacent to the target pixel t (for example, pixel b above and pixel d below the target pixel t in the current frame, and pixel pc corresponding to the target pixel t in the previous frame) to detect whether an especially large or especially small luminance value is present. To make subtitles stand out from the background, they usually combine a dark outline (especially small luminance) with bright inner text (especially large luminance) to strengthen contrast. Therefore, when step 22 detects an especially large or especially small luminance value, the target pixel t may lie in a subtitle area and the subsequent steps continue; otherwise, the flow of FIG. 2 ends and the target pixel t is determined not to be a subtitle pixel. In this embodiment, step 22 determines whether pixels b, d, and pc have an especially large luminance value (greater than a preset value T2) or an especially small luminance value (less than a preset value T3); that is, b > T2 or d > T2 or pc > T2 or b < T3 or d < T3 or pc < T3.
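The step 22 condition translates directly into a small predicate. The sketch below assumes 8-bit luminance values; the default thresholds (`t2=200`, `t3=40`) and the function name `has_extreme_luma` are hypothetical values chosen only for illustration, since the description does not give numeric values for T2 and T3.

```python
def has_extreme_luma(b, d, pc, t2=200, t3=40):
    """Step 22: true when any of the neighboring pixels b, d, pc is
    especially bright (> T2) or especially dark (< T3)."""
    return any(v > t2 or v < t3 for v in (b, d, pc))
```

A mid-gray neighborhood such as `(120, 130, 128)` fails the test, while a neighborhood containing bright text (`235`) or a dark outline (`20`) passes and proceeds to step 23.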
However, if the pixel falls in a smooth region (that is, a region of generally uniform luminance), steps 21 and 22 are not sufficient to identify subtitles and may even falsely detect many subtitle pixels. This embodiment therefore continues with steps 23 and 24 for further detection.
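In step 23, it is checked whether the pixels adjacent to the target pixel t's position in the previous frame (for example, pixels pb and pd) are in the static state. If so, the subsequent steps continue; otherwise, the flow of FIG. 2 ends and the target pixel t is determined not to be a subtitle pixel. In this embodiment, the check is whether the states of pixels pb and pd are both static.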
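Step 23 relies on state information retained from processing the previous frame. A minimal sketch, assuming the static/motion states of the previous frame's pixels on lines B and D are kept as per-line lists of booleans (this storage layout and the name `neighbors_static` are assumptions):

```python
def neighbors_static(prev_state_b, prev_state_d, x):
    """Step 23: both pb and pd (previous-frame pixels above and below the
    target position, at column x) must have been marked static."""
    return prev_state_b[x] and prev_state_d[x]
```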
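Finally, in step 24, it is detected whether the target pixel t lies on a subtitle boundary. A subtitle boundary generally exhibits one of two cases: (a) the (absolute) difference between the scan lines directly above and below the target pixel t (for example, scan lines B and D) is large; or (b) the (absolute) difference between the scan lines above and below the target pixel t is small, but the (absolute) difference between those scan lines and the scan line of the previous frame corresponding to the target pixel t (for example, scan line PC) is large. If case (a) or (b) holds, the target pixel is determined to be a subtitle pixel; otherwise, the flow of FIG. 2 ends and the target pixel t is determined not to be a subtitle pixel. In this embodiment, case (a) tests whether the absolute difference between the window average of scan line B (covering pixel b) and the window average of scan line D (covering pixel d) is greater than a preset value T4; that is, |B-D| > T4. Case (b) tests whether the absolute difference between the window average of scan line B and the window average of scan line D is less than a preset value T5 (that is, |B-D| < T5) and whether the absolute difference between scan line B or scan line D and scan line PC (covering pixel pc) is greater than the preset value T4 (that is, |B-PC| > T4 or |D-PC| > T4).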
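The two boundary cases of step 24 can be sketched as below. Scan lines are assumed to be lists of luminance values; using a window average for scan line PC as well as for B and D, the `half_width` parameter, and the function names are assumptions made for illustration.

```python
def window_mean(line, center, half_width=2):
    """Average luminance over a window around column `center`."""
    lo = max(0, center - half_width)
    hi = min(len(line), center + half_width + 1)
    window = line[lo:hi]
    return sum(window) / len(window)


def on_subtitle_boundary(line_b, line_d, line_pc, x, t4, t5, half_width=2):
    """Step 24: case (a) |B-D| > T4, or case (b) |B-D| < T5 and
    (|B-PC| > T4 or |D-PC| > T4)."""
    b = window_mean(line_b, x, half_width)
    d = window_mean(line_d, x, half_width)
    pc = window_mean(line_pc, x, half_width)
    case_a = abs(b - d) > t4
    case_b = abs(b - d) < t5 and (abs(b - pc) > t4 or abs(d - pc) > t4)
    return case_a or case_b
```

Case (a) corresponds to a horizontal edge between lines B and D in the current field; case (b) corresponds to lines B and D agreeing with each other while differing strongly from line PC of the previous frame.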
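FIG. 3 shows a flowchart of the subtitle area detection method according to an embodiment of the present invention; the motion detection result it produces can be used for subsequent de-interlacing (not shown in the flowchart). First, in step 31, subtitle pixel detection is performed on each target pixel of the current scan line in sequence, following the flow of FIG. 2, to determine whether each pixel of the scan line is a subtitle pixel.
Next, in step 32, the number of subtitle pixels in the previous frame (spf) is counted. Because the same subtitle appears in many consecutive frames, the result for the previous frame indicates whether the current frame contains subtitles. In general, when the number of subtitle pixels in a frame (spf) is large enough, the frame is likely to actually contain subtitles.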
When the number of subtitle pixels in the previous frame (spf) is greater than a preset value T5 (step 33), the subtitle pixels of the current scan line are set as static pixels (step 34); otherwise, de-interlacing is performed according to the result of ordinary motion detection (step 35). For example, static areas are de-interlaced with temporal (inter-field) interpolation, and moving areas are de-interlaced with spatial (intra-field) interpolation. Next, in step 36, the number of subtitle pixels in the previous scan line of the current frame (spl) is counted.
If the number of subtitle pixels in the previous scan line (spl) is greater than a preset value T6 (step 37), the current scan line is a scan line containing subtitles; otherwise, de-interlacing is performed according to the result of ordinary motion detection (step 35). According to this embodiment, whether the current frame has subtitles can be learned from data already computed for the previous frame (for example, spf). To save the storage space that recording additional information would require, the position of the subtitle area is obtained from data of the current frame (for example, spl).
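The gating performed in steps 33 through 37 can be sketched as a single function. The sketch assumes per-pixel boolean lists for the current scan line and assumes that step 37 is reached only when the step 33 test succeeds, which is one reading of the flow described above; the name `gate_scan_line` and the return convention are hypothetical.

```python
def gate_scan_line(subtitle_flags, static_flags, spf, spl, t5, t6):
    """Return (static flags after gating, whether the line holds subtitles).

    subtitle_flags -- per-pixel result of the FIG. 2 flow for this line
    static_flags   -- per-pixel result of ordinary motion detection
    spf            -- subtitle pixel count of the previous frame
    spl            -- subtitle pixel count of the previous scan line
    """
    if spf <= t5:
        # Step 35: fall back to ordinary motion detection.
        return list(static_flags), False
    # Step 34: force detected subtitle pixels of this line to static.
    forced = [s or sub for s, sub in zip(static_flags, subtitle_flags)]
    # Step 37: the line counts as a subtitle line only if spl > T6.
    return forced, spl > t6
```

Only two counters (spf and spl) carry information between frames and lines, which matches the stated goal of avoiding extra storage for the subtitle position.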
Within a scan line determined to contain subtitles, some pixels are static and some are not. If the absolute difference between the previous-frame pixel and the next-frame pixel corresponding to a non-static pixel is less than a preset value T7, step 38 adjusts that pixel to a static pixel. FIG. 4 illustrates a scan line containing subtitles and the corresponding scan lines of the previous and next frames. In this example, the third pixel t3 of the subtitle scan line is a non-static pixel; its corresponding previous-frame pixel is pc3 and its corresponding next-frame pixel is nc3. If the absolute difference between pixel nc3 and pixel pc3 is less than the preset value T7 (that is, |nc-pc| < T7), pixel t3 is adjusted to a static pixel.
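Step 38 can be sketched as a per-pixel pass over a subtitle scan line. The list-based representation and the name `promote_to_static` are assumptions for illustration.

```python
def promote_to_static(static_flags, prev_line, next_line, t7):
    """Step 38: a non-static pixel becomes static when |nc - pc| < T7.
    Pixels that are already static are left unchanged."""
    return [
        s or abs(n - p) < t7
        for s, p, n in zip(static_flags, prev_line, next_line)
    ]
```

In a line where the third pixel is non-static but its previous-frame and next-frame values differ by only 3, a threshold of T7 = 8 promotes it to static, while a pixel whose values differ by 80 stays non-static.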
According to this embodiment, the subtitle area and its neighboring area can be determined to be a static area, so that its motion detection is not affected by background motion, and this static area is de-interlaced with temporal (inter-field) interpolation.
The above describes only preferred embodiments of the present invention and is not intended to limit the scope of the appended claims; any equivalent change or modification made without departing from the spirit disclosed by the invention shall fall within the scope of the appended claims.
Claims (8)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 200910173122 CN102014267B (en) | 2009-09-07 | 2009-09-07 | Subtitle area detection method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 200910173122 CN102014267B (en) | 2009-09-07 | 2009-09-07 | Subtitle area detection method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102014267A CN102014267A (en) | 2011-04-13 |
CN102014267B (en) | 2013-01-09 |
Family
ID=43844256
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN 200910173122 Expired - Fee Related CN102014267B (en) | 2009-09-07 | 2009-09-07 | Subtitle area detection method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN102014267B (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1103526A (en) * | 1993-12-04 | 1995-06-07 | 行健电讯股份有限公司 | Method and system for superimposing subtitles on live broadcast satellite |
CN1170309A (en) * | 1996-05-03 | 1998-01-14 | 三星电子株式会社 | Audience-selective subtitle broadcasting and receiving method suitable for phoneme characters |
CN1176557A (en) * | 1996-09-06 | 1998-03-18 | 三星电子株式会社 | Subtitle signal playback method for viewer-selective subtitle playback |
TWI255140B (en) * | 2004-11-04 | 2006-05-11 | Himax Tech Inc | Caption detection and compensation for interlaced image |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2330854A1 (en) * | 2001-01-11 | 2002-07-11 | Jaldi Semiconductor Corp. | A system and method for detecting a non-video source in video signals |
Also Published As
Publication number | Publication date |
---|---|
CN102014267A (en) | 2011-04-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR100303728B1 (en) | Deinterlacing method of interlaced scanning video | |
JP4253327B2 (en) | Subtitle detection apparatus, subtitle detection method, and pull-down signal detection apparatus | |
CN101309385B (en) | A Deinterlacing Method Based on Motion Detection | |
US8497937B2 (en) | Converting device and converting method of video signals | |
US8115867B2 (en) | Image processing device | |
WO2004017634A1 (en) | Image processing device and method, video display device, and recorded information reproduction device | |
USRE45306E1 (en) | Image processing method and device thereof | |
CN102014267B (en) | Subtitle area detection method | |
TWI403160B (en) | Method of detecting a subtitle | |
JP5206313B2 (en) | COMBING NOISE DETECTION DEVICE AND COMBING NOISE DETECTION METHOD | |
JP2008009182A (en) | Liquid crystal display device | |
CN102186046A (en) | Video image de-interlacing scanning method and device | |
JP4433949B2 (en) | Image processing apparatus and method | |
CN101009768A (en) | Adaptive angle de-interleaving device and de-interleaving method | |
JP4801678B2 (en) | Color difference signal IP conversion method | |
CN101483747B (en) | Movement detection method suitable for deinterlacing technique | |
CN102308576A (en) | Video processing device | |
JP5172999B2 (en) | Video signal processing apparatus and image display apparatus | |
JP4744150B2 (en) | 3D vector method of motion detection between fields | |
US8237860B2 (en) | Poor video editing detection system | |
CN101106686A (en) | Method and device for deinterlacing conversion of remote-end interpolation | |
US8045820B2 (en) | System and method for edge direction detection for spatial deinterlace | |
US8373798B2 (en) | Text protection device and related motion adaptive de-interlacing device | |
CN101106687B (en) | Dynamic de-interlacing conversion method and device | |
TWI500316B (en) | Method of de-interlacing a horizontal moving subtitle |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee | Granted publication date: 20130109 |