US8195038B2 - Brief and high-interest video summary generation - Google Patents
Brief and high-interest video summary generation Download PDFInfo
- Publication number
- US8195038B2 US8195038B2 US12/290,033 US29003308A US8195038B2 US 8195038 B2 US8195038 B2 US 8195038B2 US 29003308 A US29003308 A US 29003308A US 8195038 B2 US8195038 B2 US 8195038B2
- Authority
- US
- United States
- Prior art keywords
- video
- subshots
- determining
- shots
- acoustic features
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related, expires
Links
- 238000003064 k means clustering Methods 0.000 claims abstract description 9
- 239000003550 marker Substances 0.000 claims abstract description 6
- 238000005070 sampling Methods 0.000 claims abstract description 6
- 238000000034 method Methods 0.000 claims description 56
- 230000000007 visual effect Effects 0.000 claims description 18
- 238000004590 computer program Methods 0.000 claims description 6
- 238000009877 rendering Methods 0.000 claims description 6
- 230000000694 effects Effects 0.000 claims description 5
- 238000001514 detection method Methods 0.000 description 7
- 238000013459 approach Methods 0.000 description 6
- 238000012360 testing method Methods 0.000 description 6
- 230000011218 segmentation Effects 0.000 description 5
- 230000009471 action Effects 0.000 description 4
- 230000002123 temporal effect Effects 0.000 description 4
- 230000008569 process Effects 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 230000008520 organization Effects 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 238000012706 support-vector machine Methods 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- 102100024170 Cyclin-C Human genes 0.000 description 1
- 101000980770 Homo sapiens Cyclin-C Proteins 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 238000012805 post-processing Methods 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 238000003908 quality control method Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/14—Picture signal circuitry for video frequency region
- H04N5/147—Scene change detection
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/70—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F16/73—Querying
- G06F16/738—Presentation of query results
- G06F16/739—Presentation of query results in form of a video summary, e.g. the video summary being a video sequence, a composite still image or having synthesized frames
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/41—Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/46—Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames
- G06V20/47—Detecting features for summarising video content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/49—Segmenting video sequences, i.e. computational techniques such as parsing or cutting the sequence, low-level clustering or determining units such as shots or scenes
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/02—Editing, e.g. varying the order of information signals recorded on, or reproduced from, record carriers
- G11B27/031—Electronic editing of digitised analogue information signals, e.g. audio or video signals
- G11B27/034—Electronic editing of digitised analogue information signals, e.g. audio or video signals on discs
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/10—Indexing; Addressing; Timing or synchronising; Measuring tape travel
- G11B27/19—Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier
- G11B27/28—Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier by using information signals recorded by the same method as the main recording
- G11B27/32—Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier by using information signals recorded by the same method as the main recording on separate auxiliary tracks of the same or an auxiliary record carrier
- G11B27/322—Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier by using information signals recorded by the same method as the main recording on separate auxiliary tracks of the same or an auxiliary record carrier used signal is digitally coded
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/10—Indexing; Addressing; Timing or synchronising; Measuring tape travel
- G11B27/34—Indicating arrangements
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/85—Assembly of content; Generation of multimedia applications
- H04N21/854—Content authoring
- H04N21/8549—Creating video summaries, e.g. movie trailer
Definitions
- the present invention relates generally to video summarization and more particularly to relevancy in video summarization.
- Video summarization may be used to present the main theme of video sequences to a user. This summarization is especially useful if the volume of the video is large and the content is highly redundant.
- any digital device is capable of capturing video.
- these videos include raw footage of a short event (e.g., a child's first words, etc.), a series of re-takes to capture a single event to be edited in production, or a single long relatively static recording (e.g., surveillance video, etc.).
- a short event e.g., a child's first words, etc.
- a series of re-takes to capture a single event to be edited in production e.g., surveillance video, etc.
- a single long relatively static recording e.g., surveillance video, etc.
- summarization systems and methods are used. Summaries are shorter than the original video source and usually aim to capture only small parts of the content as important events or scenes, such as the utterance of a phrase, a person, an action, or a setting.
- To compose a summary several events or scenes are arranged in a way that is easy for the user to understand. In this way, the original meaning and even context of
- rushes are captured for subsequent editing, they generally contain large numbers of repeated scenes and ideas. For example, actors in a television drama may be asked to repeat a particular scene or action multiple times so that video editors and directors have multiple “takes” to choose from. Additionally, this video also contains non-essential (e.g., “junk”) content such as color bars to signal the starting of a new recording session and calibrate equipment or physical clappers used to guarantee synchronization between the audio and video streams and provide metadata (e.g., date, series name, scene or take information, etc.) about the next recorded scene.
- the creation of a rush video summary involves many as yet unrealized goals: reduce junk content, identify and reduce the count of repeated ideas, prioritize and choose only the most interesting ideas, and present the summarized set of ideas in a comprehensible and logical manner.
- a method of video summarization includes determining if a video contains one or more junk frames, modifying one or more boundaries of shots of the video based at least in part on the determination of if the video contains one or more junk frames, sampling a plurality of the shots of the video into a plurality of subshots, clustering the plurality of subshots with a multiple step k-means clustering, and creating a video summary based at least in part on the clustered plurality of subshots.
- the video is segmented into a plurality of shots and a keyframe from each of the plurality of shots is extracted.
- the distance among the plurality of subshots as a weighted summation of a motion compensated matching error, a histogram difference in red, green, blue color space, and a histogram difference in hue, saturation, value color space is determined and a distance between a plurality of clusters as a maximum distance between pairs of subshots within the plurality of clusters is determined. If the determined distance between the clusters is smaller than a predetermined threshold, the clusters are merged. If a difference in a keyframe file size in the cluster exceeds a predetermined threshold, a cluster is divided.
- the method also includes determining an importance of the subshots in the clustered plurality of subshots and creating a video summary based on the determined importance of the subshots in the clustered plurality of subshots and a time budget.
- the created video summary is rendered by displaying playback rate information for the rendered video summary, displaying a currently playing subshot marker with the rendered video summary, and displaying an indication of similar content in the rendered video summary.
- FIG. 4 shows a monochrome frame
- FIG. 5 shows a clapper frame
- FIG. 6 depicts a flowchart of a method of video summarization according to an embodiment of the present invention
- FIG. 7 depicts a rendered video summary according to an embodiment of the present invention.
- a time budget (e.g., the total duration of the target video summary) is also allocated to the clusters and subshots based on their importance values.
- image salience is computed for the subshots with high importance values and the most salient segments are selected for summarization using durations determined by the subshot budget.
- these video segments are sorted based on their time stamps and they are stitched together as the summarized video.
- two takes 108 a and 108 b may be made of scene 110 a .
- Each take 108 a and 108 b has two shots, respective shots 104 a - b and 104 c - d .
- shots 104 a and 104 c may be similar (e.g., they are versions of the first event in the scene) and shots 104 b and 104 d may be similar (e.g., they are versions of the second event in the scene).
- shots 104 a and 104 c may be similar (e.g., they are versions of the first event in the scene)
- shots 104 b and 104 d may be similar (e.g., they are versions of the second event in the scene).
- FIG. 2 depicts a flowchart of a method 200 of video summarization according to an embodiment of the present invention.
- the method 200 may be performed by any appropriate device.
- one or more steps of method 200 are performed by a computer adapted specifically to perform such method steps.
- computer 800 described below with respect to FIG. 8 may be configured as a special-purpose computer for performing one or more steps of method 200 .
- the method starts at step 202 .
- shots are determined. That is, shots (e.g., shots 104 a - h ) are determined using shot segmentation. Any appropriate method of shot segmentation may be used. The most basic approaches in shot segmentation analyze the change in content like scene changes, object motion, and camera motion between subsequent video frames. More complex shot segmentation approaches maintain a state-based system to monitor and test against criterion for changes such as scene cuts, dissolves, wipes, and zoom in/out effects.
- Junk frames are frames that do not convey meaningful semantics in the video and should not be included in the video summary.
- FIGS. 3-5 show these classes of junk frames.
- FIG. 3 shows a TV testing signal (e.g. color bars)
- FIG. 4 shows monochrome frames (e.g., white or black frames)
- FIG. 5 shows clapper frames.
- TV test signals (e.g., color bars) as shown in FIG. 3 are normally accompanied by a pure tone in audio channel, so a multi-modal approach can best detect this junk frame.
- some visual features extracted in the shot boundary detection of step 206 are used, like the edge energies and the histogram variance in red, green, blue channels, as well as the intensity. Given the nature of the color bars, the vertical edge energy is significantly greater than the horizontal edge energy, and the histogram variance in all channels are bigger than those in a regular scene are.
- acoustic features are extracted like volume, frequency centroid, frequency bandwidth, energy ratio in sub-bands, and voice activity. With a set of threshold-based criterion, the tone signal can be reliably detected.
- the final detection result of a TV test signal is a fusion of result scores from both modalities.
- step 214 subshots are clustered using a multiple step k-means clustering.
- Clustering is used to remove redundancy among subshots.
- a very small cluster size is targeted using only grid color moments.
- the number of clusters to form is computed based on the total number of subshots known. This solution places a high reliance on the shot boundary detection algorithm to correctly identify temporal discontinuities.
- K-means clustering selects a centroid point then iteratively updates the cluster labels and cluster centroids until intra-cluster distance falls below a threshold. To avoid local minima in cluster configurations, several rounds with randomly initialized centroids are evaluated.
- FIG. 6 depicts a flowchart of a method 600 of video summarization according to an embodiment of the present invention.
- the method 600 may be performed by any appropriate device.
- one or more steps of method 600 are performed by a computer adapted specifically to perform such method steps.
- computer 800 described below with respect to FIG. 8 may be configured as a special-purpose computer for performing one or more steps of method 600 .
- the method starts at step 602 .
- visual difference is measured by three components: motion estimations, RGB color histograms, and LUV histograms. These values are computed during shot boundary detection in step 206 of method 200 , so no additional computation is required for importance determination. To balance the contribution of both scores, an average of normalized salience scores and visual difference values determine the importance value.
- Each cluster C i contains L i subshots S i j , 1 ⁇ j ⁇ L i , and each subshot S S i j starts from frame B i j , and ends at frame E i j .
- the importance value for frame k is denoted as Im f (k)
- the importance value for subshot m is denoted as Im s (m)
- the value for cluster i is denoted as Im c (i).
- the subshot importance value is the summation of the frame importance values of all the frames within corresponding subshot
- the cluster importance value is the summation of the subshot importance values of all member subshots.
- the video summary may be created by allocating budgets for all clusters, allocating budges for subshots within each cluster, and picking the video segment from each subshot. First, all clusters are sorted based on their importance values. If the budget B is bigger than the number of clusters N, each cluster is assigned with round(B/N) time units (e.g., seconds) at the beginning. The rest of the budget is assigned in the importance order. A predetermined time unit (e.g., one second) is assigned to the cluster with the maximum importance value and a predetermined time unit (e.g., one second) is assigned to the next important cluster until all budgets are used up. In a similar way, the budget of each cluster is assigned to all member subshots.
- round(B/N) time units e.g., seconds
- Cluster C i has a budget of b i . If it is bigger than the number of all subshots in this cluster, L i , then each subshot S i j is allocated with round(b i /L i ) time units (e.g., seconds). The remaining budget is assigned based on the subshot importance order—one time unit (e.g., second) to the most important subshot and one time unit to the next important subshot until no budget is left.
- the beginning three seconds of the subshot is ignored if it is longer than six seconds. If the subshot is shorter than the assigned budgets, then the ending frame is extended to meet the budget. Normally, the assigned budget is much shorter than the subshot duration. In this case, the continuous segment whose overall importance value is the maximum within the subshot is chosen.
- the selected video segments are sorted in temporal order, and the result is a list of all selected video segments and some side information for rending the summary video.
- FIG. 7 depicts a rendered video summary 700 according to an embodiment of the present invention.
- the rendered video summary 700 is simple in that it has no text to read, but also informative with indications of position with position and frequency bar 702 and time remaining on the current subshot with timer 704 indicating speed and fraction of clip remaining.
- Frequency bar 702 has a currently playing marker 706 and similar content markers 708 .
- Currently playing marker 706 indicates the portion of the video 100 depicted in video summary 700 that is currently playing.
- Similar content markers 708 indicate keyframes in the video summary 700 that are similar to each other as discussed above.
- Computer 800 contains devices that form a controller including a processor 802 that controls the overall operation of the computer 800 by executing computer program instructions, which define such operation.
- the computer program instructions may be stored in a storage device 804 (e.g., magnetic disk, database, etc.) and loaded into memory 806 when execution of the computer program instructions is desired.
- applications for performing the herein-described method steps, such as those described below with respect to methods 200 and 600 are defined by the computer program instructions stored in the memory 806 and/or storage 804 and controlled by the processor 802 executing the computer program instructions.
- the computer 800 may also include one or more network interfaces 808 for communicating with other devices via a network.
- the computer 800 also includes input/output devices 810 that enable operator interaction with the computer 800 .
- instructions of a program may be read into memory 806 , such as from a ROM device to a RAM device or from a LAN adapter to a RAM device. Execution of sequences of the instructions in the program may cause the computer 800 to perform one or more of the method steps described herein.
- hard-wired circuitry or integrated circuits may be used in place of, or in combination with, software instructions for implementation of the processes of the present invention.
- embodiments of the present invention are not limited to any specific combination of hardware, firmware, and/or software.
- the memory 806 may store the software for the computer 800 , which may be adapted to execute the software program and thereby operate in accordance with the present invention and particularly in accordance with the methods described in detail above.
- the invention as described herein could be implemented in many different ways using a wide range of programming techniques as well as general purpose hardware sub-systems or dedicated controllers.
- Such programs may be stored in a compressed, uncompiled, and/or encrypted format.
- the programs furthermore may include program elements that may be generally useful, such as an operating system, a database management system, and device drivers for allowing the portable communication device to interface with peripheral devices and other equipment/components.
- Appropriate general purpose program elements are known to those skilled in the art, and need not be described in detail herein.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Databases & Information Systems (AREA)
- Signal Processing (AREA)
- Computing Systems (AREA)
- Software Systems (AREA)
- Computer Security & Cryptography (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- Television Signal Processing For Recording (AREA)
Abstract
Description
Claims (18)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12/290,033 US8195038B2 (en) | 2008-10-24 | 2008-10-24 | Brief and high-interest video summary generation |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12/290,033 US8195038B2 (en) | 2008-10-24 | 2008-10-24 | Brief and high-interest video summary generation |
Publications (2)
Publication Number | Publication Date |
---|---|
US20100104261A1 US20100104261A1 (en) | 2010-04-29 |
US8195038B2 true US8195038B2 (en) | 2012-06-05 |
Family
ID=42117593
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US12/290,033 Expired - Fee Related US8195038B2 (en) | 2008-10-24 | 2008-10-24 | Brief and high-interest video summary generation |
Country Status (1)
Country | Link |
---|---|
US (1) | US8195038B2 (en) |
Cited By (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20120183219A1 (en) * | 2011-01-13 | 2012-07-19 | Sony Corporation | Data segmenting apparatus and method |
US20160078297A1 (en) * | 2014-09-17 | 2016-03-17 | Xiaomi Inc. | Method and device for video browsing |
US9471663B1 (en) | 2014-01-22 | 2016-10-18 | Google Inc. | Classification of media in a media sharing system |
US9529840B1 (en) * | 2014-01-14 | 2016-12-27 | Google Inc. | Real-time duplicate detection of videos in a massive video sharing system |
US9723344B1 (en) | 2015-12-29 | 2017-08-01 | Google Inc. | Early detection of policy violating media |
US10141023B2 (en) | 2014-12-29 | 2018-11-27 | Industrial Technology Research Institute | Method and system for multimedia summary generation |
US10735811B2 (en) | 2018-12-10 | 2020-08-04 | At&T Intellectual Property I, L.P. | System for content curation with user context and content leverage |
US11393209B2 (en) * | 2017-10-12 | 2022-07-19 | Google Llc | Generating a video segment of an action from a video |
US11475063B2 (en) | 2020-02-24 | 2022-10-18 | Motorola Solutions, Inc. | Device, system and method for providing indications of a discrepancy between a video and a textual description of an incident |
US11729478B2 (en) | 2017-12-13 | 2023-08-15 | Playable Pty Ltd | System and method for algorithmic editing of video content |
US11836181B2 (en) | 2019-05-22 | 2023-12-05 | SalesTing, Inc. | Content summarization leveraging systems and processes for key moment identification and extraction |
Families Citing this family (92)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8472792B2 (en) | 2003-12-08 | 2013-06-25 | Divx, Llc | Multimedia distribution system |
US8488839B2 (en) * | 2006-11-20 | 2013-07-16 | Videosurf, Inc. | Computer program and apparatus for motion-based object extraction and tracking in video |
US8379915B2 (en) * | 2006-11-20 | 2013-02-19 | Videosurf, Inc. | Method of performing motion-based object extraction and tracking in video |
US8059915B2 (en) * | 2006-11-20 | 2011-11-15 | Videosurf, Inc. | Apparatus for and method of robust motion estimation using line averages |
WO2009065137A1 (en) * | 2007-11-16 | 2009-05-22 | Divx, Inc. | Hierarchical and reduced index structures for multimedia files |
US8364698B2 (en) * | 2008-07-11 | 2013-01-29 | Videosurf, Inc. | Apparatus and software system for and method of performing a visual-relevance-rank subsequent search |
US8364660B2 (en) * | 2008-07-11 | 2013-01-29 | Videosurf, Inc. | Apparatus and software system for and method of performing a visual-relevance-rank subsequent search |
US9443147B2 (en) * | 2010-04-26 | 2016-09-13 | Microsoft Technology Licensing, Llc | Enriching online videos by content detection, searching, and information aggregation |
US9508011B2 (en) | 2010-05-10 | 2016-11-29 | Videosurf, Inc. | Video visual and audio query |
US8599316B2 (en) * | 2010-05-25 | 2013-12-03 | Intellectual Ventures Fund 83 Llc | Method for determining key video frames |
US20120078899A1 (en) * | 2010-09-27 | 2012-03-29 | Fontana James A | Systems and methods for defining objects of interest in multimedia content |
US8737727B2 (en) * | 2010-12-30 | 2014-05-27 | Pelco, Inc. | Color similarity sorting for video forensics search |
US8625889B2 (en) * | 2010-12-30 | 2014-01-07 | Samsung Electronics Co., Ltd. | System for food recognition method using portable devices having digital cameras |
WO2013023063A1 (en) | 2011-08-09 | 2013-02-14 | Path 36 Llc | Digital media editing |
CN103136763B (en) * | 2011-12-02 | 2017-07-21 | 富泰华工业(深圳)有限公司 | Electronic installation and its method for detecting the abnormal paragraph of video sequence |
US8953836B1 (en) * | 2012-01-31 | 2015-02-10 | Google Inc. | Real-time duplicate detection for uploaded videos |
US9414038B2 (en) | 2012-10-16 | 2016-08-09 | Amanjyot Singh JOHAR | Creating time lapse video in real-time |
US9116924B2 (en) * | 2013-01-14 | 2015-08-25 | Xerox Corporation | System and method for image selection using multivariate time series analysis |
US9122931B2 (en) * | 2013-10-25 | 2015-09-01 | TCL Research America Inc. | Object identification system and method |
KR102025362B1 (en) * | 2013-11-07 | 2019-09-25 | 한화테크윈 주식회사 | Search System and Video Search method |
US9754159B2 (en) | 2014-03-04 | 2017-09-05 | Gopro, Inc. | Automatic generation of video from spherical content using location-based metadata |
US10796163B2 (en) * | 2014-03-07 | 2020-10-06 | Eagle Eye Networks, Inc. | Surveillance video activity summary system and access method of operation (VASSAM) |
US11042588B2 (en) * | 2014-04-24 | 2021-06-22 | Nokia Technologies Oy | Apparatus, method, and computer program product for video enhanced photo browsing |
US20150348587A1 (en) * | 2014-05-27 | 2015-12-03 | Thomson Licensing | Method and apparatus for weighted media content reduction |
US10664687B2 (en) | 2014-06-12 | 2020-05-26 | Microsoft Technology Licensing, Llc | Rule-based video importance analysis |
US9984293B2 (en) | 2014-07-23 | 2018-05-29 | Gopro, Inc. | Video scene classification by activity |
US9685194B2 (en) | 2014-07-23 | 2017-06-20 | Gopro, Inc. | Voice-based video tagging |
WO2016014724A1 (en) * | 2014-07-23 | 2016-01-28 | Gopro, Inc. | Scene and activity identification in video summary generation |
US9430848B1 (en) * | 2014-09-02 | 2016-08-30 | Google Inc. | Monochromatic image determination |
US10424341B2 (en) * | 2014-11-12 | 2019-09-24 | Massachusetts Institute Of Technology | Dynamic video summarization |
US9734870B2 (en) | 2015-01-05 | 2017-08-15 | Gopro, Inc. | Media identifier generation for camera-captured media |
AT516770A1 (en) * | 2015-01-27 | 2016-08-15 | J&P Medical Res Ltd | METHOD FOR RECORDING AND CERTIFYING DATA |
US9679605B2 (en) | 2015-01-29 | 2017-06-13 | Gopro, Inc. | Variable playback speed template for video editing application |
US9607223B2 (en) * | 2015-04-09 | 2017-03-28 | Facebook, Inc. | Systems and methods for defining and analyzing video clusters based on video image frames |
US10186012B2 (en) | 2015-05-20 | 2019-01-22 | Gopro, Inc. | Virtual lens simulation for video and photo cropping |
US9894393B2 (en) | 2015-08-31 | 2018-02-13 | Gopro, Inc. | Video encoding for reduced streaming latency |
US10204273B2 (en) | 2015-10-20 | 2019-02-12 | Gopro, Inc. | System and method of providing recommendations of moments of interest within video clips post capture |
US9721611B2 (en) | 2015-10-20 | 2017-08-01 | Gopro, Inc. | System and method of generating video from video clips based on moments of interest within the video clips |
US10356456B2 (en) * | 2015-11-05 | 2019-07-16 | Adobe Inc. | Generating customized video previews |
US9805269B2 (en) | 2015-11-20 | 2017-10-31 | Adobe Systems Incorporated | Techniques for enhancing content memorability of user generated video content |
US10652298B2 (en) * | 2015-12-17 | 2020-05-12 | Intel Corporation | Media streaming through section change detection markers |
US10095696B1 (en) | 2016-01-04 | 2018-10-09 | Gopro, Inc. | Systems and methods for generating recommendations of post-capture users to edit digital media content field |
US10109319B2 (en) | 2016-01-08 | 2018-10-23 | Gopro, Inc. | Digital media editing |
US9812175B2 (en) | 2016-02-04 | 2017-11-07 | Gopro, Inc. | Systems and methods for annotating a video |
KR20170098079A (en) * | 2016-02-19 | 2017-08-29 | 삼성전자주식회사 | Electronic device method for video recording in electronic device |
US9972066B1 (en) | 2016-03-16 | 2018-05-15 | Gopro, Inc. | Systems and methods for providing variable image projection for spherical visual content |
US10402938B1 (en) | 2016-03-31 | 2019-09-03 | Gopro, Inc. | Systems and methods for modifying image distortion (curvature) for viewing distance in post capture |
US9838730B1 (en) | 2016-04-07 | 2017-12-05 | Gopro, Inc. | Systems and methods for audio track selection in video editing |
US9838731B1 (en) | 2016-04-07 | 2017-12-05 | Gopro, Inc. | Systems and methods for audio track selection in video editing with audio mixing option |
US9794632B1 (en) | 2016-04-07 | 2017-10-17 | Gopro, Inc. | Systems and methods for synchronization based on audio track changes in video editing |
US9998769B1 (en) | 2016-06-15 | 2018-06-12 | Gopro, Inc. | Systems and methods for transcoding media files |
US10250894B1 (en) | 2016-06-15 | 2019-04-02 | Gopro, Inc. | Systems and methods for providing transcoded portions of a video |
US9922682B1 (en) | 2016-06-15 | 2018-03-20 | Gopro, Inc. | Systems and methods for organizing video files |
US10045120B2 (en) | 2016-06-20 | 2018-08-07 | Gopro, Inc. | Associating audio with three-dimensional objects in videos |
US10185891B1 (en) | 2016-07-08 | 2019-01-22 | Gopro, Inc. | Systems and methods for compact convolutional neural networks |
US10469909B1 (en) | 2016-07-14 | 2019-11-05 | Gopro, Inc. | Systems and methods for providing access to still images derived from a video |
CN106210878A (en) * | 2016-07-25 | 2016-12-07 | 北京金山安全软件有限公司 | Picture extraction method and terminal |
CN106060629A (en) * | 2016-07-25 | 2016-10-26 | 北京金山安全软件有限公司 | Picture extraction method and terminal |
US10395119B1 (en) | 2016-08-10 | 2019-08-27 | Gopro, Inc. | Systems and methods for determining activities performed during video capture |
US9836853B1 (en) | 2016-09-06 | 2017-12-05 | Gopro, Inc. | Three-dimensional convolutional neural networks for video highlight detection |
US10282632B1 (en) * | 2016-09-21 | 2019-05-07 | Gopro, Inc. | Systems and methods for determining a sample frame order for analyzing a video |
US10268898B1 (en) * | 2016-09-21 | 2019-04-23 | Gopro, Inc. | Systems and methods for determining a sample frame order for analyzing a video via segments |
US10002641B1 (en) | 2016-10-17 | 2018-06-19 | Gopro, Inc. | Systems and methods for determining highlight segment sets |
US10284809B1 (en) | 2016-11-07 | 2019-05-07 | Gopro, Inc. | Systems and methods for intelligently synchronizing events in visual content with musical features in audio content |
US10262639B1 (en) | 2016-11-08 | 2019-04-16 | Gopro, Inc. | Systems and methods for detecting musical features in audio content |
US10534966B1 (en) | 2017-02-02 | 2020-01-14 | Gopro, Inc. | Systems and methods for identifying activities and/or events represented in a video |
US10339443B1 (en) | 2017-02-24 | 2019-07-02 | Gopro, Inc. | Systems and methods for processing convolutional neural network operations using textures |
US10127943B1 (en) | 2017-03-02 | 2018-11-13 | Gopro, Inc. | Systems and methods for modifying videos based on music |
US10185895B1 (en) | 2017-03-23 | 2019-01-22 | Gopro, Inc. | Systems and methods for classifying activities captured within images |
US10083718B1 (en) | 2017-03-24 | 2018-09-25 | Gopro, Inc. | Systems and methods for editing videos based on motion |
US10187690B1 (en) | 2017-04-24 | 2019-01-22 | Gopro, Inc. | Systems and methods to detect and correlate user responses to media content |
US10395122B1 (en) | 2017-05-12 | 2019-08-27 | Gopro, Inc. | Systems and methods for identifying moments in videos |
US10402698B1 (en) | 2017-07-10 | 2019-09-03 | Gopro, Inc. | Systems and methods for identifying interesting moments within videos |
US10614114B1 (en) | 2017-07-10 | 2020-04-07 | Gopro, Inc. | Systems and methods for creating compilations based on hierarchical clustering |
US10402656B1 (en) | 2017-07-13 | 2019-09-03 | Gopro, Inc. | Systems and methods for accelerating video analysis |
US10909378B2 (en) | 2017-11-07 | 2021-02-02 | Comcast Cable Communications, Llc | Processing content based on natural language queries |
CN108307229B (en) * | 2018-02-02 | 2023-12-22 | 新华智云科技有限公司 | Video and audio data processing method and device |
US10311913B1 (en) * | 2018-02-22 | 2019-06-04 | Adobe Inc. | Summarizing video content based on memorability of the video content |
US10412391B1 (en) * | 2018-08-06 | 2019-09-10 | AnyClip Ltd. | Minimize number of encoded video stream frames for content recognition |
CN109194978A (en) * | 2018-10-15 | 2019-01-11 | 广州虎牙信息科技有限公司 | Live video clipping method, device and electronic equipment |
US11122099B2 (en) | 2018-11-30 | 2021-09-14 | Motorola Solutions, Inc. | Device, system and method for providing audio summarization data from video |
US10775977B2 (en) | 2019-01-25 | 2020-09-15 | Google Llc | Image display with selective depiction of motion |
CN109819338B (en) * | 2019-02-22 | 2021-09-14 | 影石创新科技股份有限公司 | Automatic video editing method and device and portable terminal |
US11003915B2 (en) | 2019-03-29 | 2021-05-11 | Wipro Limited | Method and system for summarizing multimedia content |
CN110730380B (en) * | 2019-08-28 | 2022-11-22 | 咪咕文化科技有限公司 | Video synthesis method, electronic device and storage medium |
KR102245349B1 (en) * | 2020-02-11 | 2021-04-28 | 한국과학기술원 | Method and apparatus for extracting color scheme from video |
US11314970B1 (en) * | 2020-11-19 | 2022-04-26 | Adobe Inc. | Reinforcement learning techniques for automated video summarization |
CN112468877B (en) * | 2021-02-01 | 2021-05-04 | 北京中科大洋科技发展股份有限公司 | Intelligent news cataloging method based on AI content analysis and OCR recognition |
CN113361426A (en) * | 2021-06-11 | 2021-09-07 | 爱保科技有限公司 | Vehicle loss assessment image acquisition method, medium, device and electronic equipment |
CN113343026B (en) * | 2021-06-17 | 2022-07-15 | 中国科学技术大学 | Method for generating summary of content of online video course |
US12260639B2 (en) * | 2021-11-12 | 2025-03-25 | International Business Machines Corporation | Annotating a video with a personalized recap video based on relevancy and watch history |
US12262115B2 (en) | 2022-01-28 | 2025-03-25 | Gopro, Inc. | Methods and apparatus for electronic image stabilization based on a lens polynomial |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6710822B1 (en) * | 1999-02-15 | 2004-03-23 | Sony Corporation | Signal processing method and image-voice processing apparatus for measuring similarities between signals |
US20080232687A1 (en) * | 2007-03-22 | 2008-09-25 | Christian Petersohn | Method and device for selection of key-frames for retrieving picture contents, and method and device for temporal segmentation of a sequence of successive video pictures or a shot |
US20090080853A1 (en) * | 2007-09-24 | 2009-03-26 | Fuji Xerox Co., Ltd. | System and method for video summarization |
-
2008
- 2008-10-24 US US12/290,033 patent/US8195038B2/en not_active Expired - Fee Related
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6710822B1 (en) * | 1999-02-15 | 2004-03-23 | Sony Corporation | Signal processing method and image-voice processing apparatus for measuring similarities between signals |
US20080232687A1 (en) * | 2007-03-22 | 2008-09-25 | Christian Petersohn | Method and device for selection of key-frames for retrieving picture contents, and method and device for temporal segmentation of a sequence of successive video pictures or a shot |
US20090080853A1 (en) * | 2007-09-24 | 2009-03-26 | Fuji Xerox Co., Ltd. | System and method for video summarization |
Non-Patent Citations (7)
Title |
---|
Chen, F., et al., "Video Summarization Preserving Dynamic Content", TRECVID Video Summarization Workshop ACM Multimedia, 2007. |
Itti, L., et al., "A Model of Saliency-Based Visual Attention for Rapid Scene Analysis", IEEE Transactions on Pattern Analysis and Machine Intelligence, 1998. |
Lee et al. "A Fast Clustering Algorithm for Video Abstraction", 2003, pp. 563-568. * |
Liu, Z., et al., "A Fast, Comprehensive Shot Boundray Determination System", IEEE International Conference on Multimedia and Expo, 2007. |
Liu, Z., et al., "AT&T Research at TRECVID 2007", TRECVID 2007 Workshop, 2007. |
Ma et al, "Video Snapshot: A Bird View of Video Sequence", Proceedings of the 11th International Multimedia Conference (MMM'05), 2005. * |
Petersohn, "Sub-Shots-Basic Units of Video", Tehnical University of Berlin, pp. 323-326, date unknown. * |
Cited By (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20120183219A1 (en) * | 2011-01-13 | 2012-07-19 | Sony Corporation | Data segmenting apparatus and method |
US8831347B2 (en) * | 2011-01-13 | 2014-09-09 | Sony Corporation | Data segmenting apparatus and method |
US10198441B1 (en) | 2014-01-14 | 2019-02-05 | Google Llc | Real-time duplicate detection of videos in a massive video sharing system |
US9529840B1 (en) * | 2014-01-14 | 2016-12-27 | Google Inc. | Real-time duplicate detection of videos in a massive video sharing system |
US9471663B1 (en) | 2014-01-22 | 2016-10-18 | Google Inc. | Classification of media in a media sharing system |
US9799376B2 (en) * | 2014-09-17 | 2017-10-24 | Xiaomi Inc. | Method and device for video browsing based on keyframe |
US20160078297A1 (en) * | 2014-09-17 | 2016-03-17 | Xiaomi Inc. | Method and device for video browsing |
US10141023B2 (en) | 2014-12-29 | 2018-11-27 | Industrial Technology Research Institute | Method and system for multimedia summary generation |
US9723344B1 (en) | 2015-12-29 | 2017-08-01 | Google Inc. | Early detection of policy violating media |
US11393209B2 (en) * | 2017-10-12 | 2022-07-19 | Google Llc | Generating a video segment of an action from a video |
US11663827B2 (en) | 2017-10-12 | 2023-05-30 | Google Llc | Generating a video segment of an action from a video |
US11729478B2 (en) | 2017-12-13 | 2023-08-15 | Playable Pty Ltd | System and method for algorithmic editing of video content |
US12231747B2 (en) | 2017-12-13 | 2025-02-18 | Playable Pty Ltd | System and method for algorithmic editing of video content |
US11228806B2 (en) | 2018-12-10 | 2022-01-18 | At&T Intellectual Property I, L.P. | System for content curation with user context and content leverage |
US10735811B2 (en) | 2018-12-10 | 2020-08-04 | At&T Intellectual Property I, L.P. | System for content curation with user context and content leverage |
US11836181B2 (en) | 2019-05-22 | 2023-12-05 | SalesTing, Inc. | Content summarization leveraging systems and processes for key moment identification and extraction |
US12210560B2 (en) | 2019-05-22 | 2025-01-28 | SalesTing, Inc. | Content summarization leveraging systems and processes for key moment identification and extraction |
US12235888B2 (en) | 2019-05-22 | 2025-02-25 | SalesTing, Inc. | Content summarization leveraging systems and processes for key moment identification and extraction |
US11475063B2 (en) | 2020-02-24 | 2022-10-18 | Motorola Solutions, Inc. | Device, system and method for providing indications of a discrepancy between a video and a textual description of an incident |
Also Published As
Publication number | Publication date |
---|---|
US20100104261A1 (en) | 2010-04-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8195038B2 (en) | Brief and high-interest video summary generation | |
US8363960B2 (en) | Method and device for selection of key-frames for retrieving picture contents, and method and device for temporal segmentation of a sequence of successive video pictures or a shot | |
CN108882057B (en) | Video abstract generation method and device | |
US9646227B2 (en) | Computerized machine learning of interesting video sections | |
Ngo et al. | Automatic video summarization by graph modeling | |
JP5355422B2 (en) | Method and system for video indexing and video synopsis | |
KR100590537B1 (en) | Summary Method and Device of Multiple Images | |
Gao et al. | Unsupervised video-shot segmentation and model-free anchorperson detection for news video story parsing | |
Truong et al. | Scene extraction in motion pictures | |
US9881215B2 (en) | Apparatus and method for identifying a still image contained in moving image contents | |
US7483618B1 (en) | Automatic editing of a visual recording to eliminate content of unacceptably low quality and/or very little or no interest | |
JP4253139B2 (en) | Frame information description method, frame information generation apparatus and method, video reproduction apparatus and method, and recording medium | |
Srinivas et al. | An improved algorithm for video summarization–a rank based approach | |
JP6557592B2 (en) | Video scene division apparatus and video scene division program | |
US20190364211A1 (en) | System and method for editing video contents automatically technical field | |
EP2022054A2 (en) | A video browsing user interface | |
CN113255423B (en) | Method and device for extracting color scheme from video | |
KR101195613B1 (en) | Apparatus and method for partitioning moving image according to topic | |
JP2009123095A (en) | Image analysis device and image analysis method | |
Jiang et al. | Advances in video summarization and skimming | |
JP5096259B2 (en) | Summary content generation apparatus and summary content generation program | |
Baber et al. | Video segmentation into scenes using entropy and SURF | |
Carbonneau et al. | Real-time visual play-break detection in sport events using a context descriptor | |
Liu et al. | Brief and high-interest video summary generation: evaluating the AT&T labs rushes summarizations | |
Chen et al. | On the preview of digital movies |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: AT&T INTELLECTUAL PROPERTY I, L.P.,NEVADA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:LIU, ZHU;BASSO, ANDREA;GIBBON, DAVID CRAWFORD;AND OTHERS;SIGNING DATES FROM 20090226 TO 20090310;REEL/FRAME:022407/0766 Owner name: AT&T INTELLECTUAL PROPERTY I, L.P., NEVADA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:LIU, ZHU;BASSO, ANDREA;GIBBON, DAVID CRAWFORD;AND OTHERS;SIGNING DATES FROM 20090226 TO 20090310;REEL/FRAME:022407/0766 |
|
FEPP | Fee payment procedure |
Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Free format text: PAYER NUMBER DE-ASSIGNED (ORIGINAL EVENT CODE: RMPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
ZAAA | Notice of allowance and fees due |
Free format text: ORIGINAL CODE: NOA |
|
ZAAB | Notice of allowance mailed |
Free format text: ORIGINAL CODE: MN/=. |
|
STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
FPAY | Fee payment |
Year of fee payment: 4 |
|
MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 8TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1552); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Year of fee payment: 8 |
|
FEPP | Fee payment procedure |
Free format text: MAINTENANCE FEE REMINDER MAILED (ORIGINAL EVENT CODE: REM.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
STCH | Information on status: patent discontinuation |
Free format text: PATENT EXPIRED DUE TO NONPAYMENT OF MAINTENANCE FEES UNDER 37 CFR 1.362 |
|
FP | Lapsed due to failure to pay maintenance fee |
Effective date: 20240605 |