WO2017156793A1 - Geographic location-based video processing method - Google Patents
- Publication number
- WO2017156793A1 (PCT/CN2016/077182)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- video
- additional information
- video data
- geographic location
- information
- Prior art date
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/60—Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client
- H04N21/63—Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
- H04N21/647—Control signaling between network components and server or clients; Network processes for video distribution between server and clients, e.g. controlling the quality of the video stream, by dropping packets, protecting content from unauthorised alteration within the network, monitoring of network load, bridging between two different networks, e.g. between IP and wireless
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N23/00—Cameras or camera modules comprising electronic image sensors; Control thereof
- H04N23/60—Control of cameras or camera modules
- H04N23/62—Control of parameters via user interfaces
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/70—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F16/78—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/7867—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using information manually generated, e.g. tags, keywords, comments, title and artist information, manually generated time, location and usage information, user ratings
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N1/00—Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
- H04N1/00127—Connection or combination of a still picture apparatus with another apparatus, e.g. for storage, processing or transmission of still picture signals or of information associated with a still picture
- H04N1/00249—Connection or combination of a still picture apparatus with another apparatus, e.g. for storage, processing or transmission of still picture signals or of information associated with a still picture with a photographic apparatus, e.g. a photographic printer or a projector
Definitions
- the present invention relates to video processing technologies, and in particular, to a geographic location based video processing technology.
- a geographic location-based video processing method is applied to a mobile shooting terminal, the mobile shooting terminal includes a camera, and the method includes:
- the additional information includes at least current positioning information of the mobile camera terminal
- the method further includes: detecting a rotation angle of the camera in real time during shooting; the additional information further includes a rotation angle of the camera.
- the method further includes: acquiring user identity information of the photographer corresponding to the current video frame; the additional information further includes the user identity information.
- the method further includes: acquiring a shooting time of the current video frame in real time during shooting; the additional information further includes the shooting time.
- the method further includes: obtaining an instruction input by a user; the additional information further includes the instruction.
- the method further includes: receiving a label and/or textual introduction input by the user; the additional information further includes the label and/or textual introduction.
- the above method further comprises: encrypting at least part of the content of the additional information, the additional information including at least the content processed by the encryption.
- the method further includes: calculating corresponding verification information according to the video data of the current frame; and the verification information is included in the encrypted processed content.
- a geographic location-based video processing method applied to a cloud server system including:
- the additional information includes at least the current positioning information of the mobile shooting terminal
- the method further includes: parsing out, from the video data packet, an instruction corresponding to each video frame; and executing a data processing procedure corresponding to the instruction.
- the data processing process includes:
- the method further includes: parsing a label and/or a text description of the video from the video data package;
- each video frame of the video data includes positioning information, and may further include other additional information; based on the additional information, the video data can be subjected to geo-location-based retrieval, on the basis of which a rich set of video applications can be provided.
- FIG. 1 is a schematic structural diagram of a geographic location-based video information processing system according to an embodiment of the present invention.
- FIG. 2 is a block diagram showing the structure of a mobile photographing terminal of the video information processing system of FIG. 1.
- FIG. 3 is a schematic diagram showing the data structure of video data uploaded by the mobile camera terminal of FIG. 2.
- FIG. 4 is a schematic flow chart of processing video data uploaded by a mobile camera terminal by the video information processing system of FIG. 1.
- FIG. 5 is a schematic diagram of an interface of an application provided by the video information processing system of FIG. 1 based on video data with geographic locations.
- FIG. 6 is a schematic diagram of an interface of another application provided by the video information processing system of FIG. 1 based on video data with geographic location.
- FIG. 7 is a schematic flow chart of the video information processing system of FIG. 1 providing a panoramic preview function based on video data with geographic locations.
- FIG. 8 is a flow chart showing the video information processing system of FIG. 1 providing a video search service based on video data with geographic location.
- FIG. 9 is an expanded schematic diagram of the video information processing system of FIG. 1.
- FIG. 10 is a flow diagram of the video information processing system of FIG. 1 providing a video application based on video data with geographic location, tags, and/or textual introduction.
- FIG. 1 is a schematic structural diagram of a geographic location-based video information processing system according to a first embodiment of the present invention.
- the video information processing system 100 may include a mobile photography terminal 10, a cloud server system 20, and a client 30.
- the mobile photographing terminal 10 may be any mobile electronic terminal with a camera such as a mobile phone, a tablet computer, a drone, or the like.
- FIG. 2 is a schematic structural diagram of the mobile photographing terminal 10.
- the mobile photographing terminal 10 includes a memory 102, a memory controller 104, one or more processors 106 (only one is shown in the figure), a peripheral interface 108, a network module 110, an audio circuit 111, a GPS (Global Positioning System) module 112, a sensor 114, a camera module 116, and a power module 122. These components communicate with one another via one or more communication buses/signal lines.
- the structure shown in FIG. 2 is merely illustrative and does not limit the structure of the mobile camera terminal 10.
- the mobile photography terminal 10 may also include more or less components than those shown in FIG. 2, or have a different configuration than that shown in FIG. 2.
- the memory 102 can be used to store software programs and modules, such as program instructions/modules corresponding to the methods and devices in the embodiments of the present invention.
- the processor 106 executes various functional applications and performs data processing by running the software programs and modules stored in the memory 102.
- Memory 102 can include high speed random access memory and can also include non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid state memory.
- memory 102 can further include memory remotely located relative to processor 106, which can be connected to the mobile photographing terminal 10 via a network. Examples of such networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof. Access to the memory 102 by the processor 106 and other possible components can be performed under the control of the memory controller 104.
- Peripheral interface 108 couples various input/output devices to processor 106.
- the processor 106 runs various software within the memory 102, instructing the mobile photographing terminal 10 to perform various functions and data processing.
- peripheral interface 108, processor 106, and memory controller 104 can be implemented in a single chip. In other instances, they can be implemented by separate chips.
- the network module 110 is configured to receive and transmit network signals.
- the network signal described above may include a wireless signal.
- the network module 110 is substantially a radio frequency module that receives and transmits electromagnetic waves, converting between electromagnetic waves and electrical signals, thereby communicating with a communication network or other devices.
- the radio frequency module can include various existing circuit components for performing these functions, such as an antenna, a radio frequency transceiver, a digital signal processor, an encryption/decryption chip, a Subscriber Identity Module (SIM) card, a memory, and the like.
- the RF module can communicate with various networks such as the Internet, intranets, wireless networks or with other devices over a wireless network.
- the wireless network described above may include a cellular telephone network, a wireless local area network, or a metropolitan area network.
- the above wireless network can use various communication standards, protocols, and technologies, including but not limited to Global System for Mobile Communication (GSM), Enhanced Data GSM Environment (EDGE), Wideband Code Division Multiple Access (W-CDMA), Code Division Multiple Access (CDMA), Time Division Multiple Access (TDMA), Wireless Fidelity (WiFi), Voice over Internet Protocol (VoIP), Wi-Max, other protocols for mail, instant messaging, and short messages, as well as any other suitable communication protocol, and may even include protocols that have not yet been developed.
- the audio circuit 111 provides an interface for recording on the mobile photographing terminal 10. Specifically, the audio circuit 111 receives an electrical signal from the microphone, converts the electrical signal into sound data, and transmits the sound data to the processor 106 for further processing.
- the GPS module 112 is configured to receive a positioning signal of a GPS satellite broadcast, and calculate its own position according to the positioning signal.
- the above positions can be expressed, for example, by longitude, latitude, and altitude. It will be appreciated that the manner in which positioning is achieved is not limited to GPS systems.
- other available satellite positioning systems include the Compass Navigation Satellite System (CNSS) or the Global Navigation Satellite System (GLONASS).
- positioning is not limited to satellite positioning technology; wireless positioning technologies may also be used, such as positioning based on wireless base stations or WiFi positioning.
- the GPS module 112 can be replaced with a corresponding module, or positioning can be performed directly by the processor 106 executing a particular positioning program.
- sensors 114 include, but are not limited to, light sensors, attitude sensors, and other sensors.
- the ambient light sensor can sense the brightness of the ambient light, and shooting parameters can be adjusted accordingly.
- the attitude sensor may include, for example, an acceleration sensor, a gravimeter, a gyroscope, or the like, which can detect the spatial attitude of the mobile photographing terminal 10, for example, its rotation angle in each direction. It can be understood that the rotation angle of the mobile photographing terminal 10 in each direction corresponds to the shooting direction.
- Other sensors may include barometers, hygrometers, thermometers, and the like.
- the camera module 116 is used to take a photo or video.
- the photos or videos taken may be stored in the memory 102 and may be transmitted through the network module 110.
- the camera module 116 may specifically include components such as a lens module, an image sensor, and a flash.
- the lens module is used to image the captured object and map the formed image to the image sensor.
- the image sensor is used to receive light from the lens module and expose accordingly, so as to record image information.
- the image sensor can be implemented based on a Complementary Metal Oxide Semiconductor (CMOS), a Charge-coupled Device (CCD), or other image sensing principle.
- the flash is used to compensate for exposure when shooting.
- the flash of the mobile photographing terminal 10 may be a Light Emitting Diode (LED) flash.
- the power module 122 is used to provide power to the processor 106 and other components.
- the power module 122 may include a power management system, one or more power sources (such as a battery or alternating current), a charging circuit, a power failure detection circuit, an inverter, a power status indicator, and any other components related to the generation, management, and distribution of power in the mobile shooting terminal 10.
- the software and program modules stored in the memory 102 may include an operating system 130 and applications running on the operating system 130.
- the operating system 130 can include various software components and/or drivers for managing system tasks (such as memory management, storage device control, power management, etc.) and communicates with various hardware or software components to provide an operating environment for other software components.
- the application program may include: a shooting module 131, an additional information adding module 132, a video data encapsulating module 133, and a data sending module 134.
- the shooting module 131 is configured to invoke the camera module 116 to capture video data
- the additional information adding module 132 is configured to acquire additional information corresponding to the current video frame and add the additional information to the current video frame.
- the video data encapsulating module 133 is configured to package the data of one or more video frames to which additional information has been added; the data sending module 134 is configured to send the packaged video data to the cloud server system 20, so that the cloud server system 20 can provide various information services based on the additional information within the received video data.
- multiple video frames may be included in the same video data packet; each video frame includes both the additional information and the video data of the frame, and the video data may be stored in any format (for example, H.264 or MPEG4).
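The per-frame layout described above (additional information carried alongside the video payload in one packet) could be sketched as follows. The length-prefixed layout, the JSON encoding, and all field names are illustrative assumptions, not the patent's actual protocol:

```python
import json
import struct

def pack_frames(frames):
    """Pack video frames, each carrying its own additional information,
    into one data packet (hypothetical length-prefixed layout)."""
    packet = bytearray()
    for frame in frames:
        info = json.dumps(frame["additional_info"]).encode("utf-8")
        data = frame["video_data"]  # raw encoded bytes, e.g. H.264 NAL units
        # per-frame header: length of the info blob, then of the video payload
        packet += struct.pack(">II", len(info), len(data))
        packet += info + data
    return bytes(packet)

frames = [{
    "additional_info": {"lat": 39.9042, "lon": 116.4074, "alt": 44.0,
                        "time": 1458000000.0, "user_id": "u123"},
    "video_data": b"\x00\x00\x00\x01\x65...",  # placeholder payload
}]
packet = pack_frames(frames)
```

The receiver would read the two length fields, slice out the additional information and video payload, and repeat until the packet is exhausted.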
- the additional information may include two types. One is editable additional information: the user may modify, add, or delete such information through a specific application, and it may generally be used to store information input by the user. The other is non-editable additional information: once written into a video frame, the user can no longer edit it.
- the non-editable additional information can generally be used to store the status information acquired in real time.
- the editable additional information may include: a label input by a user, a text introduction, and the like.
- the editable additional information described above may include the code of the instruction entered by the user. Instructions entered by the user may include sharing, reporting, and the like.
- the non-editable additional information described above may include positioning information, such as latitude, longitude, and altitude acquired by the GPS module 112.
- the non-editable additional information may include posture information of the mobile camera terminal 10, for example, a rotation angle of the mobile camera terminal 10 or the camera module 116 in each direction.
- the attitude information of the mobile photographing terminal 10 can be acquired by the sensor 114.
- the non-editable additional information may include: a shooting time of the current video frame.
- the non-editable additional information described above may include: user identification information of the video photographer.
- the user identification information herein may be, for example, an account of the user in a network account system, or other information capable of uniquely determining the user account in a network account system.
- the user of the mobile photographing terminal 10, that is, the video photographer, can be limited to only one person.
- the user may be a user account bound to the mobile camera terminal 10 or authorized to use the user account of the mobile camera terminal 10.
- the non-editable additional information may include: verification information of the video data of the current video frame.
- the verification information is calculated, for example, by using a hash algorithm according to the video data, and can be used to verify whether the video data is modified.
- the video data can be verified based on the verification information, so that the authenticity of the video data can be further confirmed, which provides a technical guarantee for the video as judicial evidence.
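The verification scheme described above — verification information computed from the frame's video data and checked later — might look like this minimal sketch. SHA-256 is an assumed choice; the patent only says a hash algorithm may be used:

```python
import hashlib

def frame_verification_info(video_data: bytes) -> str:
    """Verification information for one frame: a hash of the raw payload,
    intended to be written into the (encrypted) additional information."""
    return hashlib.sha256(video_data).hexdigest()

def verify_frame(video_data: bytes, recorded_hash: str) -> bool:
    """Recompute the hash and compare it with the recorded one
    to detect whether the video data was modified."""
    return frame_verification_info(video_data) == recorded_hash

data = b"\x00\x00\x00\x01\x65..."  # placeholder frame payload
recorded = frame_verification_info(data)
ok = verify_frame(data, recorded)
tampered = verify_frame(data + b"x", recorded)
```

Because the hash travels inside the encrypted additional information, a party without the server's private key cannot both alter the frame and forge a matching hash.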
- editable additional information may be written to only some of the video frames. For example, among the multiple video frames generated within one second (or another length of time), the editable additional information may be written to only one fixed video frame (for example, the first frame). A video frame carrying editable additional information can be defined as the key video frame of that interval. In this way, editable additional information can be written directly into a video frame while the storage space it occupies is minimized.
- non-editable additional information is generally obtained in real time, so it can be written into every frame. However, this manner is not limiting: it is still possible to write non-editable additional information into only some of the video frames, for example, into one video frame per second.
- the non-editable additional information may be encrypted with an asymmetric encryption algorithm before being written to the video frame.
- each video capture terminal 10 may store the same public key, and the public key may be used to encrypt the non-editable additional information.
- the private key corresponding to the public key is available only in the cloud server system 20; that is, only the cloud server system 20 can interpret the encrypted additional information written in a video frame.
- the video data packet uploaded by the mobile photographing terminal 10 includes both the video data and the additional information described above.
- the cloud server system 20 can include a video processing server 21, a database 22, a distributed file storage system 23, and an application server 24.
- the video processing server 21 is configured to receive the video data packet uploaded by the mobile camera terminal 10, and further process the received video data packet.
- the video processing server 21 performs further processing on the received video data packet, including the following steps:
- Step S101 extracting additional information of each frame of video from the video data packet.
- the video data packet is unpacked to obtain all the video frames, and then the additional information is parsed from the video frame according to a predefined protocol.
- Step S102 processing the video data into a format suitable for storage.
- the video data itself may be subjected to compression, format conversion, and the like.
- the processing in this step is only for the video data itself, and the processing does not affect the additional information. That is to say, even for the processed video data, the same additional information as before the processing is included in each frame.
- step S102 can be omitted; that is, after the additional information of each video frame is extracted from the video data packet, the received video data packet is used directly as the storage format.
- Step S103 Store the video data in the distributed file storage system and obtain a corresponding storage index. That is, the video data or the video data packet acquired in step S102 is stored in the distributed file storage system, and the distributed file storage system returns a storage index, which is used to implement access to the video data.
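Step S103's store-and-retrieve-by-index behavior can be mimicked with a toy in-memory store. The content-derived index here is purely an assumption standing in for whatever index the real distributed file storage system returns:

```python
import hashlib

class FileStore:
    """Toy stand-in for the distributed file storage system: stores a
    blob of video data and returns a storage index for later access."""

    def __init__(self):
        self._blobs = {}

    def put(self, blob: bytes) -> str:
        # content-derived index (an assumption; any opaque handle would do)
        index = hashlib.sha1(blob).hexdigest()
        self._blobs[index] = blob
        return index

    def get(self, index: str) -> bytes:
        return self._blobs[index]

store = FileStore()
idx = store.put(b"processed video data")
```

The returned `idx` is what gets stored in the database alongside the additional information, so that a metadata query can be resolved back to the actual video bytes.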
- the additional information is stored in the database in association with the storage index.
- a relational database may be used to store the additional information and the storage index, and the different items within the additional information (e.g., coordinates, shooting time, user ID, instruction code, posture information, tags, etc.) may be stored in separate fields. It can be understood that if the additional information is encrypted, it needs to be decrypted first.
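Storing the decrypted additional information fields alongside the storage index in a relational database, then retrieving by coordinates, could look like this SQLite sketch. The table name, column set, and bounding-box query are hypothetical illustrations:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("""CREATE TABLE frame_info (
    storage_index TEXT,   -- index returned by the file storage system
    lat REAL, lon REAL, alt REAL,
    shot_time REAL, user_id TEXT, tag TEXT)""")

def index_frame(storage_index, info):
    """Write one frame's additional information fields into the database."""
    conn.execute("INSERT INTO frame_info VALUES (?,?,?,?,?,?,?)",
                 (storage_index, info["lat"], info["lon"], info.get("alt"),
                  info["time"], info["user_id"], info.get("tag")))

index_frame("abc123", {"lat": 39.9042, "lon": 116.4074,
                       "time": 1458000000.0, "user_id": "u123",
                       "tag": "street"})

# geo-location-based retrieval: storage indexes of frames shot in a box
rows = conn.execute("""SELECT storage_index FROM frame_info
                       WHERE lat BETWEEN ? AND ? AND lon BETWEEN ? AND ?""",
                    (39.0, 40.0, 116.0, 117.0)).fetchall()
```

Keeping each item of additional information in its own column is what makes the later retrieval, statistics, and sorting operations simple SQL queries.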
- the video data can be retrieved, counted, analyzed, outputted and the like based on the additional information, thereby providing various video applications to the user, and the specific process can be implemented by the application server 24.
- the client 30 may include, for example, a smart phone 31, a notebook computer 32, a desktop computer 33, a tablet computer 34, and any other smart terminal not shown in FIG. 1, such as smart glasses, augmented reality helmets, wearable smart devices, and the like.
- the client 30 interacts with the application server 24 so that the various video applications provided by the application server 24 can be used. This is described below in conjunction with specific scenarios.
- a certain mobile camera terminal 10 moves from position A to position B.
- the mobile camera terminal 10 keeps shooting and uploading video data carrying additional information to the cloud server system 20.
- the cloud server system 20 can forward the received data packet to the client.
- the mobile camera terminal 10 can also send the data packet directly to the client in a point-to-point manner.
- the client parses the positioning information from the received data packet, and can generate a trajectory line in the electronic map 301 according to the positioning information, and at the same time, the client can simultaneously output the video screen 302.
- through the electronic map 301, the position of the mobile shooting terminal 10 is known in real time, and the currently captured video image can be viewed in real time through the video screen 302, thereby realizing simultaneous viewing of the moving position and the real-time picture; this is particularly suitable for real-time tracking of a target.
- such tracking is meaningful only when the video comes from the same mobile camera terminal 10 and the photographer's identity remains the same.
- the angle of view of the video can also be displayed in the electronic map 301.
- each point in the trajectory corresponds to a coordinate.
- the coordinates corresponding to the click point can be obtained according to the preset mapping relationship; it is then possible to find, among all the received video frames, the video frame whose positioning information is closest to the coordinates, and to switch the video picture 302 to that video frame.
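Finding the received video frame whose positioning information is closest to the clicked coordinate can be sketched as a simple nearest-neighbor search. The equirectangular distance approximation and the frame fields are illustrative assumptions:

```python
import math

def nearest_frame(frames, click_lat, click_lon):
    """Return the frame whose positioning information is closest to the
    clicked map coordinate (equirectangular approximation, fine for the
    short distances of a single trajectory)."""
    def dist(frame):
        dlat = frame["lat"] - click_lat
        # shrink longitude differences by cos(latitude)
        dlon = (frame["lon"] - click_lon) * math.cos(math.radians(click_lat))
        return math.hypot(dlat, dlon)
    return min(frames, key=dist)

frames = [{"id": 1, "lat": 39.90, "lon": 116.40},
          {"id": 2, "lat": 39.95, "lon": 116.45},
          {"id": 3, "lat": 40.00, "lon": 116.50}]
hit = nearest_frame(frames, 39.96, 116.44)
```

The video picture 302 would then be switched to play from the returned frame.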
- in the above scenario, the client is viewing the video captured by a particular mobile camera terminal 10; however, the form of attaching video services to the electronic map is not limited to this manner, as the following embodiment shows.
- the cloud server system 20 can provide an online electronic map service to provide electronic map data to the client.
- a corresponding electronic map application is run in the client 30, which acquires electronic map data from the cloud server system 20 and displays it in the interface 61.
- a menu 62 can be displayed in the interface 61, with various additional functions, such as an entry for "panoramic preview". When the user triggers the "panoramic preview" function, the electronic map application acquires the coordinates of the user's click location, generates a preview request based on the coordinates, and sends the preview request to the cloud server system 20.
- the process of processing a panoramic preview request in the cloud server system 20 includes the following steps:
- Step S201 receiving a panoramic preview request.
- Step S202 parsing the preview coordinates from the panoramic preview request.
- the preview coordinates are obtained by the electronic map application when the user triggers the panoramic preview function.
- Step S203 retrieving the matched video data according to the preview coordinates and the additional information of the video data.
- the additional information (including at least positioning information) of all the video data is stored in the database; therefore, additional information matching the preview coordinates can be retrieved, and the corresponding video data can be acquired via the additional information.
- the search results can be sorted according to a certain order, for example, by shooting time, image sharpness, and the like; video data captured by at least one mobile shooting terminal is then taken from the sorted results as the retrieval result.
- the transmitted video data may include only one frame without transmitting all the video data, thereby reducing the network transmission amount.
- only video data that the photographing user has authorized for disclosure can be retrieved.
- Step S204 the retrieved data is sent to the client for display.
- after receiving the video data sent by the cloud server system 20, the client unpacks (if compressed) and decrypts (if encrypted) it according to a predefined protocol, and then outputs the content in the interface.
- the direction selected by the user may also be included in the preview request.
- in the retrieval step of step S203, in addition to matching the preview coordinates, the shooting direction of the video must be matched as well; only video data whose coordinates and direction both match can be used as matched video data.
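Matching both coordinates and shooting direction, as this variant of step S203 requires, might be sketched as follows. The coordinate-offset and angle thresholds are invented for illustration; the patent does not specify tolerances:

```python
def angle_diff(a, b):
    """Smallest absolute difference between two bearings, in degrees."""
    d = abs(a - b) % 360
    return min(d, 360 - d)

def matches(frame, preview_lat, preview_lon, preview_dir,
            max_offset=0.001, max_angle=30.0):
    """A frame matches when its coordinates lie near the preview point AND
    its recorded shooting direction is close to the requested one."""
    near = (abs(frame["lat"] - preview_lat) <= max_offset and
            abs(frame["lon"] - preview_lon) <= max_offset)
    aligned = angle_diff(frame["direction"], preview_dir) <= max_angle
    return near and aligned

frame = {"lat": 39.9042, "lon": 116.4074, "direction": 350.0}
ok = matches(frame, 39.9043, 116.4075, 10.0)  # 350° vs 10° differ by 20°
```

Wrapping the bearing difference around 360° matters here: a frame shot facing 350° should match a request for 10°, which a naive subtraction would reject.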
- the panoramic preview function in the electronic map can thus be realized directly on the basis of the video data captured by a mass of mobile shooting terminals, without using a special street-view shooting car to take street-view photos, effectively reducing the cost of building real-scene browsing capabilities.
- the cloud server system 20 also provides a geographic location based video search service. As shown in FIG. 8, the flow of providing a video search service by the cloud server system 20 includes the following steps:
- Step S301 receiving a video search request sent by the client
- Step S302 parsing the search coordinates and the search time from the video search request
- Step S303 searching for matching video data according to the search coordinates and the search time
- Step S304 sending the searched video data to the client for display
- Step S305 adjusting the order of the video data captured by different video capturing terminals according to user feedback gathered while the video is watched.
- With this service, the corresponding video data can be obtained simply by providing the location and time of an event, and the ordering of the video data is adjusted according to the feedback gathered while users view it.
- The adjusted ordering in turn ensures that users see the video content of greatest interest first.
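Step S305's feedback-driven reordering might be sketched like this. The scoring heuristic (a per-video feedback score, e.g. completed views minus early skips) is an assumption for illustration; the patent does not fix a scoring scheme.

```python
def reorder_by_feedback(videos, feedback):
    """Reorder search results so videos with better viewer feedback come first.

    `videos` is a list of video ids in their current order; `feedback`
    maps video id -> a score (assumed here: completed views minus skips).
    Unrated videos keep a neutral score of 0; ties preserve the prior
    order because Python's sort is stable.
    """
    return sorted(videos, key=lambda vid: -feedback.get(vid, 0))
```

For example, `reorder_by_feedback(["a", "b", "c"], {"b": 5, "c": -2})` promotes `"b"` and demotes `"c"`.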
- In the above application scenarios, video data is retrieved and matched based on the geographic location information it carries, and specific video applications are implemented on that basis.
- However, the uses of the additional information within the video data are not limited to these scenarios.
- The user can also input instructions, such as sharing, traffic violation reporting, alarm, insurance, first aid, and the like.
- An instruction may be entered directly through a button or touch screen on the mobile photographing terminal 10, or through the mobile electronic terminal 30 connected to the mobile photographing terminal 10.
- The codes for these instructions can be added to the data packet of the video frame.
- After receiving the video data, the cloud server system 20 can parse out the instruction codes, execute the corresponding data processing procedure, or forward the instructions to the servers of third-party organizations for further processing.
- The third-party institutions here may include a self-media platform, a social network platform, a traffic violation handling agency, the police, an insurance institution, an emergency medical institution, and the like.
- When the instruction is a sharing instruction, the cloud server system 20 can automatically extract a video clip of a predetermined length (starting, for example, from the moment the sharing instruction is received) or of a user-specified length, and share it to the self-media platform or social network platform bound to the mobile shooting terminal 10 (such as WeChat Moments, QQ Zone, Weibo, etc.). In this way, when users encounter interesting events, beautiful scenery, or any other content they want to share, they can share the captured video with one click.
- When sharing, the user may add a label, a text introduction, and the like to the shared content.
- If the mobile photographing terminal 10 includes an input interface, the user can enter these directly on the terminal; otherwise, the input can be performed through the mobile electronic terminal bound to the mobile photographing terminal 10.
- These labels and text descriptions can be stored by the cloud server system 20 and used for video retrieval.
- Similarly, when the instruction is a traffic violation report, the cloud server system 20 can automatically extract the relevant video segment and send it to the server of the traffic violation handling agency, so that the user can report a traffic violation with one click on the mobile shooting terminal 10.
- When the instruction is an alarm, the cloud server system 20 can automatically extract the relevant video segment and send it to the alarm server of the police dispatch agency. The user can thus raise an alarm with one button on the mobile shooting terminal 10, and because the video frame data packets directly contain positioning information, the police can conveniently and promptly locate the position of the alarm.
- When the instruction is an insurance claim, the cloud server system 20 can automatically extract the relevant video segment and send it to the server of the insurance institution. The user can thus file a claim with one key on the mobile shooting terminal 10, and since the video can faithfully reproduce the situation at the scene, the insurance institution can provide a remote claims service without visiting the scene.
- When the instruction is a first-aid call, the cloud server system 20 can automatically extract the relevant video segment and send it to the server of the emergency medical institution, so that the user can call emergency services with one touch on the mobile photographing terminal 10. Because the video frames carry positioning information, the emergency institution can quickly locate the caller and shorten the time spent communicating the position.
- In summary, instructions can be embedded directly in the additional information of the video data, so that the cloud server system 20 executes the corresponding data processing procedures and functions, allowing the video data to be applied much more widely.
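The instruction-dispatch behavior described above can be sketched as a lookup from an instruction code to a server-side action. The numeric codes and handler descriptions below are purely illustrative assumptions; the patent does not specify an encoding.

```python
# Assumed numeric instruction codes; the patent does not fix an encoding.
SHARE, REPORT, ALARM, INSURE, FIRST_AID = 1, 2, 3, 4, 5

HANDLERS = {
    SHARE: "share clip to bound self-media or social platform",
    REPORT: "forward clip to traffic violation agency server",
    ALARM: "forward clip to police alarm server",
    INSURE: "forward clip to insurance institution server",
    FIRST_AID: "forward clip to emergency institution server",
}

def dispatch(frame_packet):
    """Parse the instruction code from a frame's additional information
    and return the action the cloud server should perform (or forward)."""
    code = frame_packet.get("additional", {}).get("instruction")
    return HANDLERS.get(code, "no action")
```

A frame carrying code 3 in its additional information would thus be routed to the police alarm server; frames without an instruction fall through to "no action".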
- As described above, when sharing a video captured by the mobile shooting terminal 10, the user can input a label or a text introduction.
- Based on these labels and text introductions, the cloud server system 20 can also automatically discover and mine hotspot events and related videos.
- The automatic mining of hotspot events and related videos based on video labels and text includes the following steps:
- Step S301: extracting keywords from the received video labels and text introductions.
- Video labels can generally be used directly as keywords.
- For text introductions, keywords can be extracted through steps such as word segmentation and word frequency statistics.
- Step S302: counting, per geographic location, the frequency of occurrence of each keyword and/or other parameters.
- That is, the frequency with which a given keyword appears, along with other parameters, is tallied separately for each geographic location.
- The other parameters here may include, for example, density, rate of increase, and the like.
- Step S303: when the frequency and/or other parameters of a keyword at a certain place exceed a preset threshold, publishing the video content related to that keyword and place in a content distribution system.
- When a keyword's frequency and/or other parameters at a place exceed the preset values, it can be regarded as indicating a hotspot event or hotspot location.
- The video content of that place related to the keyword can then be published in a content distribution system for users to browse.
- The content publishing system here may be, for example, a video website, an app, or another content publishing system.
- The published video content may be video captured by a single mobile shooting terminal, or video data captured by multiple mobile shooting terminals 10.
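Steps S301–S303 above amount to thresholded per-location keyword counting. A minimal sketch follows; the idea of reducing coordinates to a coarse "location cell" id before counting is an assumption made for the example, as is the threshold semantics.

```python
from collections import Counter

def detect_hotspots(records, threshold):
    """Detect candidate hotspot events from (location_cell, keyword) pairs.

    `records` is an iterable of (location_cell, keyword) tuples, where the
    location cell is assumed to be a coarse grid id derived from the frame
    coordinates. Returns {(cell, keyword): count} for pairs whose frequency
    exceeds the preset threshold, i.e. the candidates for publication.
    """
    counts = Counter(records)
    return {pair: n for pair, n in counts.items() if n > threshold}
```

For example, four videos tagged "fire" in the same cell against a threshold of 3 would surface `("cell_1", "fire")` as a hotspot, while a single "parade" tag would not.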
Abstract
The present invention relates to a geographic location-based video processing method for use by a mobile filming terminal, wherein said mobile filming terminal comprises a camera, and said method comprises: invoking the camera to film so as to obtain video data; obtaining a current video frame to be sent according to the video data; obtaining additional information corresponding to the current video frame, wherein the additional information at least contains current positioning information of the mobile filming terminal; adding the additional information to a data packet of the current video frame; and sending the data packet of the current video frame to which the additional information has been added to either a cloud server or a remote client, thus allowing the cloud server or remote client to process the video frame according to the positioning information within the received data packet. In the above method, each video frame contains positioning information and may also contain other additional information; on the basis of this additional information, video data can be looked up by geographic location, enabling a rich range of video applications.
Description
The present invention relates to video processing technologies, and in particular to a geographic-location-based video processing technology.
With the continuous development of network communication technologies and network infrastructure, it has become possible to provide all kinds of multimedia content on the Internet. Meanwhile, with the spread of mobile electronic terminals such as smartphones, massive numbers of user-shot videos have been uploaded to the network. These videos record a wealth of information, yet existing video retrieval systems are generally based only on the names or tags set by the uploader, so the rich information within the videos cannot be retrieved or exploited.
Summary of the invention
In view of the above, it is necessary to provide a video processing method and system that can solve the prior-art problem that information contained in videos cannot be retrieved and utilized.
A geographic location-based video processing method, applied to a mobile shooting terminal, where the mobile shooting terminal includes a camera, the method including:
calling the camera to shoot so as to obtain video data;
obtaining a current video frame to be sent according to the video data;
obtaining additional information corresponding to the current video frame, where the additional information includes at least the current positioning information of the mobile shooting terminal;
adding the additional information to a data packet of the current video frame; and
sending the data packet of the current video frame, with the additional information added, to a cloud server or a remote client, so that the cloud server or the remote client processes the video frame according to the positioning information in the received data packet.
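The packet-building step of the method above can be sketched as follows. The length-prefixed layout (4-byte header length, JSON-encoded additional information, then the frame bytes) is an illustrative assumption; the patent does not prescribe a wire format, and the frame bytes would in practice be H.264/MPEG-4 data.

```python
import json
import time

def build_frame_packet(frame_bytes, lat, lon, alt):
    """Attach additional information (at least the current positioning
    information, here plus a timestamp) to the current video frame
    before it is sent to the cloud server or remote client."""
    additional = {
        "lat": lat,
        "lon": lon,
        "alt": alt,
        "timestamp": time.time(),
    }
    header = json.dumps(additional).encode("utf-8")
    # Assumed layout: 4-byte big-endian header length, header, frame data.
    return len(header).to_bytes(4, "big") + header + frame_bytes
```

The receiver can read the 4-byte prefix, decode that many bytes of JSON, and treat the remainder as the frame's video data.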
In one embodiment, the above method further includes: detecting the rotation angle of the camera in real time during shooting; the additional information further includes the rotation angle of the camera.
In one embodiment, the above method further includes: acquiring the user identity information of the photographer corresponding to the current video frame; the additional information further includes the user identity information.
In one embodiment, the above method further includes: acquiring the shooting time of the current video frame in real time during shooting; the additional information further includes the shooting time.
In one embodiment, the above method further includes: obtaining an instruction input by the user; the additional information further includes the instruction.
In one embodiment, the above method further includes: receiving a label and/or text introduction input by the user; the additional information further includes the label and/or text introduction.
In one embodiment, the above method further includes: encrypting at least part of the content of the additional information, so that the additional information includes at least the encrypted content.
In one embodiment, the above method further includes: calculating corresponding verification information according to the video data of the current frame; the encrypted content includes the verification information.
A geographic location-based video processing method, applied to a cloud server system, including:
receiving a video data packet sent by a mobile shooting terminal;
parsing video data and corresponding additional information from the video data packet, where the additional information includes at least the current positioning information of the mobile shooting terminal;
storing the video data and obtaining a corresponding index;
storing the additional information in association with the index; and
retrieving and acquiring corresponding video data based on the additional information, and providing a video data service to a client based on the retrieved video data.
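The server-side steps above (parse, store, index the additional information) can be sketched as follows, assuming a simple length-prefixed packet layout (4-byte header length, JSON-encoded additional information, then the frame bytes) and an insertion-order index; both are illustrative assumptions, not details from the patent.

```python
import json

def ingest_packet(packet, video_store, index):
    """Parse video data and additional information from a packet,
    store the video data, and store the additional information in
    association with the resulting index."""
    n = int.from_bytes(packet[:4], "big")
    additional = json.loads(packet[4:4 + n])
    frame_bytes = packet[4 + n:]
    video_id = len(video_store)      # simplistic index: insertion order
    video_store.append(frame_bytes)
    index[video_id] = additional     # additional info keyed by the index
    return video_id
```

Geo-location retrieval then reduces to scanning (or spatially indexing) the `index` mapping and fetching the matching entries from `video_store`.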
In one embodiment, the above method further includes: parsing, from the video data packet, the instruction corresponding to each video frame, and executing a data processing procedure corresponding to the instruction.
In one embodiment, the data processing procedure includes:
extracting a video clip of a predetermined or specified length and sharing it to the self-media platform or social networking platform bound to the mobile shooting terminal; or
extracting a relevant video clip and sending it to the server of a traffic violation handling agency; or
automatically extracting a relevant video clip and sending it to the alarm server of a police dispatch agency; or
automatically extracting a relevant video clip and sending it to the server of an emergency medical institution.
In one embodiment, the above method further includes: parsing the label and/or text introduction of the video from the video data packet;
storing the label and/or text introduction;
extracting keywords from the label and/or text introduction;
performing statistical analysis on the keywords based on geographic location; and
when the statistical value of a keyword at a certain geographic location exceeds a preset threshold, acquiring the related video data and publishing it in a content distribution system.
According to the above technical solutions, each video frame of the video data includes positioning information and may further include other additional information. Based on this additional information, the video data can be retrieved by geographic location, and a rich set of video applications can be provided on that basis.
To make the above and other objects, features, and advantages of the present invention more comprehensible, preferred embodiments are described in detail below with reference to the accompanying drawings.
FIG. 1 is a schematic architecture diagram of a geographic-location-based video information processing system according to an embodiment of the present invention.
FIG. 2 is a structural block diagram of the mobile shooting terminal of the video information processing system of FIG. 1.
FIG. 3 is a schematic diagram of the data structure of the video data uploaded by the mobile shooting terminal of FIG. 2.
FIG. 4 is a schematic flowchart of the video information processing system of FIG. 1 processing video data uploaded by a mobile shooting terminal.
FIG. 5 is a schematic diagram of an interface of an application provided by the video information processing system of FIG. 1 based on video data with geographic locations.
FIG. 6 is a schematic diagram of an interface of another application provided by the video information processing system of FIG. 1 based on video data with geographic locations.
FIG. 7 is a schematic flowchart of the video information processing system of FIG. 1 providing a panoramic preview function based on video data with geographic locations.
FIG. 8 is a schematic flowchart of the video information processing system of FIG. 1 providing a video search service based on video data with geographic locations.
FIG. 9 is an expanded schematic diagram of the video information processing system of FIG. 1.
FIG. 10 is a schematic flowchart of the video information processing system of FIG. 1 providing video applications based on video data with geographic locations, labels, and/or text introductions.
To further explain the technical means and effects adopted by the present invention to achieve its intended objects, specific embodiments, structures, features, and effects of the present invention are described in detail below with reference to the accompanying drawings and preferred embodiments.
Referring to FIG. 1, which is a schematic architecture diagram of a geographic-location-based video information processing system according to a first embodiment of the present invention: as shown in FIG. 1, the video information processing system 100 may include a mobile shooting terminal 10, a cloud server system 20, and a client 30.
The mobile shooting terminal 10 may specifically be any mobile electronic terminal with a camera, such as a mobile phone, a tablet computer, or a drone. Referring to FIG. 2, a schematic structural diagram of the mobile shooting terminal 10, the terminal includes a memory 102, a memory controller 104, one or more processors 106 (only one is shown), a peripheral interface 108, a network module 110, an audio circuit 111, a GPS (Global Positioning System) module 112, sensors 114, a camera module 116, and a power module 122. These components communicate with one another via one or more communication buses/signal lines.
It will be understood by those of ordinary skill in the art that the structure shown in FIG. 2 is merely illustrative and does not limit the structure of the mobile shooting terminal 10. For example, the mobile shooting terminal 10 may include more or fewer components than shown in FIG. 2, or have a configuration different from that shown in FIG. 2.
The memory 102 can be used to store software programs and modules, such as the program instructions/modules corresponding to the methods and devices in the embodiments of the present invention. The processor 106 executes various functional applications and performs data processing by running the software programs and modules stored in the memory 102.
The memory 102 may include high-speed random access memory and may also include non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory. In some examples, the memory 102 may further include memory located remotely from the processor 106, connected over a network. Examples of such networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof. Access to the memory 102 by the processor 106 and other possible components is performed under the control of the memory controller 104.
The peripheral interface 108 couples various input/output devices to the processor 106. The processor 106 runs the various software and instructions in the memory 102 to perform various functions and process data. In some embodiments, the peripheral interface 108, the processor 106, and the memory controller 104 may be implemented in a single chip; in other examples, they may each be implemented by a separate chip.
The network module 110 is configured to receive and transmit network signals, which may include wireless signals. In one embodiment, the network module 110 is essentially a radio frequency module that receives and transmits electromagnetic waves and converts between electromagnetic waves and electrical signals, thereby communicating with a communication network or other devices. The radio frequency module may include various existing circuit components for performing these functions, such as an antenna, a radio frequency transceiver, a digital signal processor, an encryption/decryption chip, a Subscriber Identity Module (SIM) card, and memory. The radio frequency module can communicate with various networks such as the Internet, intranets, and wireless networks, or communicate with other devices over a wireless network. The wireless network may include a cellular telephone network, a wireless local area network, or a metropolitan area network, and may use various communication standards, protocols, and technologies, including but not limited to Global System for Mobile Communication (GSM), Enhanced Data GSM Environment (EDGE), Wideband Code Division Multiple Access (W-CDMA), Code Division Multiple Access (CDMA), Time Division Multiple Access (TDMA), Wireless Fidelity (WiFi) (such as IEEE 802.11a, IEEE 802.11b, IEEE 802.11g, and/or IEEE 802.11n), Voice over Internet Protocol (VoIP), Worldwide Interoperability for Microwave Access (Wi-Max), other protocols for mail, instant messaging, and short messages, and any other suitable communication protocol, including protocols not yet developed.
The audio circuit 111 provides an interface for recording on the mobile shooting terminal 10. Specifically, the audio circuit 111 receives an electrical signal from a microphone, converts it into sound data, and transmits the sound data to the processor 106 for further processing.
The GPS module 112 is configured to receive positioning signals broadcast by GPS satellites and to calculate its own position from them. The position may be expressed, for example, as longitude, latitude, and altitude. It will be appreciated that positioning is not limited to the GPS system; other available satellite positioning systems include the Compass Navigation Satellite System (CNSS) and the Global Navigation Satellite System (GLONASS). Moreover, positioning is not limited to satellite technology: wireless positioning techniques, such as positioning based on wireless base stations or WiFi, may also be used. In that case, the GPS module 112 can be replaced with a corresponding module, or the positioning can be performed directly by the processor 106 executing a dedicated positioning program.
Examples of the sensors 114 include, but are not limited to, light sensors, attitude sensors, and other sensors. An ambient light sensor can sense the brightness of ambient light so that shooting can be adjusted accordingly. The attitude sensors may include, for example, an acceleration sensor, a gravimeter, and a gyroscope, which can detect the spatial attitude of the mobile shooting terminal 10, such as its rotation angle in each direction. It can be understood that the rotation angles of the mobile shooting terminal 10 in each direction correspond to the shooting direction. Other sensors may include barometers, hygrometers, thermometers, and the like.
The camera module 116 is used to take photos or videos, which may be stored in the memory 102 and transmitted through the network module 110. The camera module 116 may specifically include a lens module, an image sensor, and a flash. The lens module images the photographed target and maps the image onto the image sensor, which receives the light from the lens module and records the image information. Specifically, the image sensor may be implemented based on a Complementary Metal Oxide Semiconductor (CMOS) sensor, a Charge-Coupled Device (CCD), or another image sensing principle. The flash is used for exposure compensation when shooting; in general, the flash of the mobile shooting terminal 10 may be a Light Emitting Diode (LED) flash.
The power module 122 provides power to the processor 106 and the other components. Specifically, the power module 122 may include a power management system, one or more power sources (such as a battery or alternating current), a charging circuit, a power failure detection circuit, an inverter, a power status indicator, and any other components related to the generation, management, and distribution of power within the mobile shooting terminal 10.
The software and program modules stored in the memory 102 may include an operating system 130 and applications running on it. The operating system 130 may include various software components and/or drivers for managing system tasks (such as memory management, storage device control, and power management) and can communicate with various hardware and software components, thereby providing a running environment for the other software components. The applications may include: a shooting module 131, an additional information adding module 132, a video data encapsulation module 133, and a data sending module 134.
The shooting module 131 invokes the camera module 116 to capture video data. The additional information adding module 132 acquires the additional information corresponding to the current video frame and adds it to that frame. The video data encapsulation module 133 packages the data of one or more video frames to which additional information has been added. The data sending module 134 sends the packaged video data to the cloud server system 20, so that the cloud server system 20 can provide various information services based on the additional information in the received video data.
As shown in FIG. 3, a single video data packet may contain multiple video frames, and each video frame contains both the additional information and the video data of that frame; the video data may be stored in any format (for example, H.264 or MPEG-4).
The additional information falls into two categories. Editable additional information can be modified, added, or deleted by the user through a specific application and is generally used to store user-entered information. Non-editable additional information, once written into a video frame, can no longer be edited by the user and is generally used to store status information acquired in real time.
In a specific embodiment, the editable additional information may include information entered by the user, such as labels and text introductions.
In a specific embodiment, the editable additional information may include the codes of instructions entered by the user, such as sharing or reporting instructions.
In a specific embodiment, the non-editable additional information may include positioning information, such as the longitude, latitude, and altitude acquired by the GPS module 112.
In a specific embodiment, the non-editable additional information may include the attitude information of the mobile shooting terminal 10, for example, the rotation angles of the mobile shooting terminal 10 or the camera module 116 in each direction, which can be acquired through the sensors 114.
In a specific embodiment, the non-editable additional information may include the shooting time of the current video frame.
In a specific embodiment, the non-editable additional information may include the user identity information of the video photographer. This may be, for example, the user's account in a network account system, or any other information that uniquely identifies the user's account in such a system. At any given moment, the user of the mobile shooting terminal 10, that is, the video photographer, can be limited to a single person: the user account bound to the mobile shooting terminal 10, or a user account authorized to use the mobile shooting terminal 10.
在一个具体的实施例中,上述的不可编辑的附加信息可包括:所述当前视频帧的视频数据的校验信息。所述校验信息例如是采用哈希算法根据所述视频数据计算得到,可以用于校验所述视频数据是否被修改。如此无论该视频帧如何复制、传输,均可以基于该校验信息校验视频数据是否被修改,从而可以使得视频数据的真实性可以进一步确认,这给视频作为司法证据提供了技术上的保障。In a specific embodiment, the non-editable additional information may include verification information for the video data of the current video frame. The verification information is computed from the video data, for example with a hash algorithm, and can be used to check whether the video data has been modified. Thus, no matter how the video frame is copied or transmitted, the video data can be checked against its verification information, further confirming its authenticity and providing a technical safeguard for using the video as judicial evidence.
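As a rough sketch of this per-frame verification idea (SHA-256 is one possible choice of hash algorithm; the patent does not name one, and the function names here are illustrative only):

```python
import hashlib

def compute_checksum(frame_bytes: bytes) -> str:
    # A SHA-256 digest of the raw frame data serves as the
    # verification information carried with the frame.
    return hashlib.sha256(frame_bytes).hexdigest()

def verify_frame(frame_bytes: bytes, checksum: str) -> bool:
    # Recompute the digest and compare; any modification of the
    # video data changes the digest and the check fails.
    return compute_checksum(frame_bytes) == checksum

frame = b"\x00\x01raw-frame-bytes"
tag = compute_checksum(frame)
assert verify_frame(frame, tag)             # unmodified frame passes
assert not verify_frame(frame + b"x", tag)  # tampered frame fails
```

Because the digest travels inside the frame's data packet, any copy of the frame can be re-verified without contacting the original capture terminal.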
对于可编辑的附加信息,其可以仅写入部分视频帧,例如,对于一秒(也可以为其他时间长度)内产生的多个视频帧,可编辑的附加信息可仅写入一个固定的视频帧(例如第一帧)内。这个写有可编辑的附加信息的视频帧可以定义为该时间内的关键视频帧。采用这种方式,既可以直接在视频帧内写入可编辑的附加信息,也可以最大程度地减少可编辑的附加信息占据的存储空间。Editable additional information may be written to only some of the video frames: for example, of the multiple frames generated within one second (or any other time span), it may be written into a single fixed frame (for example, the first one). The frame carrying the editable additional information can be defined as the key video frame for that interval. In this way, editable additional information can still be written directly into the video frames while the storage space it occupies is minimized.
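A minimal sketch of this key-frame scheme (the frame layout, with a timestamp field `t` in seconds, is an assumption made for illustration):

```python
def attach_editable_info(frames, info):
    # frames: list of dicts with a "t" timestamp in seconds (assumed
    # layout).  Editable additional information is written only to the
    # first frame of each one-second window; that frame becomes the
    # key video frame for the window.
    seen_seconds = set()
    for f in frames:
        sec = int(f["t"])
        if sec not in seen_seconds:
            f["editable"] = dict(info)   # key frame carries the info
            seen_seconds.add(sec)
    return frames

frames = [{"t": 0.0}, {"t": 0.5}, {"t": 1.0}, {"t": 1.5}]
out = attach_editable_info(frames, {"label": "demo"})
assert "editable" in out[0] and "editable" not in out[1]
assert "editable" in out[2] and "editable" not in out[3]
```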
对于不可编辑的附加信息,一般都是实时获取的,因此,可以在每帧中都写入。然而,并不限于这种方式,仍然可以是仅在部分视频帧内写入不可编辑的附加信息。例如,每一秒在一个视频帧中写入不可编辑的附加信息。Non-editable additional information is generally acquired in real time, so it can be written into every frame. This is not the only option, however: non-editable additional information may still be written into only some of the frames, for example into one video frame per second.
此外,为了防止不可编辑的附加信息被破坏或者篡改,不可编辑的附加信息可以采用非对称加密算法进行加密处理后写入视频帧。例如,每个移动拍摄终端10内可以存储有相同的公钥,利用该公钥对不可编辑的附加信息进行加密处理。而与该公钥相对应的私钥只有云端服务器系统20中才有,也就是说,只有云端服务器系统20才能够解读写入视频帧内的加密后的附加信息。In addition, to prevent the non-editable additional information from being destroyed or tampered with, it may be encrypted with an asymmetric encryption algorithm before being written into the video frames. For example, every mobile capture terminal 10 may store the same public key and use it to encrypt the non-editable additional information, while the corresponding private key exists only in the cloud server system 20; that is, only the cloud server system 20 can decrypt the encrypted additional information written into the video frames.
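A toy illustration of the asymmetric split described above, using textbook RSA with deliberately tiny parameters. None of these numbers come from the patent, and a real deployment would use a vetted cryptography library with full-size keys; the point is only that the encrypting side (every terminal) and the decrypting side (the cloud server) hold different keys:

```python
# Toy RSA parameters, for illustration only.
p, q = 61, 53
n = p * q                # public modulus, part of both keys
e = 17                   # public exponent, stored on every capture terminal
d = 2753                 # private exponent, held only by the cloud server

def encrypt_byte(m: int) -> int:
    # terminal side: encrypt one byte of additional information
    return pow(m, e, n)

def decrypt_byte(c: int) -> int:
    # server side: only the private-key holder can recover the byte
    return pow(c, d, n)

payload = b"GPS"
cipher = [encrypt_byte(b) for b in payload]
assert bytes(decrypt_byte(c) for c in cipher) == payload
```

Since every terminal carries only the public key, compromising a terminal does not reveal how to read (or forge a decryption of) the protected additional information.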
如上所述,在本实施例的视频信息处理系统中,移动拍摄终端10上传的视频数据包内包括了视频数据以及上述的附加信息。As described above, in the video information processing system of this embodiment, the video data packet uploaded by the mobile capture terminal 10 contains both the video data and the additional information described above.
而如图1所示,云端服务器系统20可包括视频处理服务器21、数据库22、分布式文件存储系统23以及应用服务器24。As shown in FIG. 1, the cloud server system 20 may include a video processing server 21, a database 22, a distributed file storage system 23, and an application server 24.
其中,视频处理服务器21用于接收移动拍摄终端10上传的视频数据包,并对接收到的视频数据包进行进一步的处理。The video processing server 21 is configured to receive the video data packet uploaded by the mobile camera terminal 10, and further process the received video data packet.
参阅图4,在一个具体的实施例中,视频处理服务器21对接收到的视频数据包进行进一步处理包括以下步骤:Referring to FIG. 4, in a specific embodiment, the video processing server 21 performs further processing on the received video data packet, including the following steps:
步骤S101,从视频数据包内提取出每帧视频的附加信息。首先,对视频数据包进行解包处理,得到所有的视频帧,然后根据预定义的协议从视频帧中解析出附加信息。Step S101, extracting additional information of each frame of video from the video data packet. First, the video data packet is unpacked to obtain all the video frames, and then the additional information is parsed from the video frame according to a predefined protocol.
步骤S102,将视频数据处理成适于存储的格式。例如,对视频数据本身进行一定的压缩处理、转换格式等等。但值得注意的是,此步骤中的处理仅仅是针对视频数据本身,处理过程并不影响附加信息。也就是说,即使是处理后的视频数据,每帧内仍然包括与处理前相同的附加信息。此外,步骤S102是可省略的,也就是说,在从视频数据包内提取出每帧的附加信息后,直接将接收到的视频数据包作为存储格式。Step S102: process the video data into a format suitable for storage, for example by compressing the video data itself, converting its format, and so on. Note that the processing in this step applies only to the video data itself and does not affect the additional information; even in the processed video data, each frame still carries the same additional information as before processing. Step S102 may also be omitted: after the additional information of each frame has been extracted from the video data packet, the received packet itself can be used directly as the storage format.
步骤S103,将视频数据存储于分布式文件存储系统中并获取对应的存储索引。即,将步骤S102中获取的视频数据或者视频数据包存入分布式文件存储系统中,分布式文件存储系统会返回存储索引,该存储索引用于实现对该视频数据的存取。Step S103: store the video data in the distributed file storage system and obtain the corresponding storage index. That is, the video data (or video data packet) obtained in step S102 is deposited in the distributed file storage system, which returns a storage index used to access that video data.
步骤S104,将附加信息与存储索引关联存储于数据库中。例如,可以采用关系型数据库存储附加信息与存储索引,而附加信息内的不同信息(例如坐标、拍摄时间、用户ID、指令代码、姿态信息、标签等)可以分别存储于不同字段内。可以理解,如果附加信息经过了加密,还需要先进行解密处理。Step S104: store the additional information in a database in association with the storage index. For example, a relational database may store the additional information together with the storage index, with the different items inside the additional information (e.g., coordinates, shooting time, user ID, instruction codes, attitude information, tags) kept in separate fields. Naturally, if the additional information was encrypted, it must first be decrypted.
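Steps S101 to S104 can be sketched as follows, with an in-memory dict standing in for the distributed file store and sqlite3 standing in for the metadata database (all table names, field names, and the packet layout are assumptions, not from the patent):

```python
import sqlite3

# Stand-ins: a dict plays the distributed file store and an
# in-memory sqlite database plays the relational metadata store.
file_store = {}

db = sqlite3.connect(":memory:")
db.execute("""CREATE TABLE frames (
    idx TEXT, lat REAL, lng REAL, shot_at TEXT, user_id TEXT)""")

def process_packet(packet):
    # S101: split the packet into video data and additional information
    video, extra = packet["video"], packet["extra"]
    # S102 (optional): transcode/compress `video` here; `extra` untouched
    # S103: store the video and obtain a storage index
    index = f"vid-{len(file_store)}"
    file_store[index] = video
    # S104: persist the additional information keyed by that index
    db.execute("INSERT INTO frames VALUES (?, ?, ?, ?, ?)",
               (index, extra["lat"], extra["lng"],
                extra["shot_at"], extra["user_id"]))
    return index

idx = process_packet({"video": b"frame-bytes",
                      "extra": {"lat": 31.2, "lng": 121.5,
                                "shot_at": "2016-03-15T08:00:00",
                                "user_id": "u1"}})
row = db.execute("SELECT lat, lng FROM frames WHERE idx=?",
                 (idx,)).fetchone()
assert file_store[idx] == b"frame-bytes" and row == (31.2, 121.5)
```

Keeping the bulky video bytes in the file store and only the small, queryable metadata in the database is what makes the later location-based retrieval cheap.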
经过以上的处理过程,就可以基于这些附加信息对视频数据进行检索、统计、分析、输出等处理,从而提供各种视频应用程序给用户,而具体的过程可由应用服务器24实现。After the above processing, the video data can be retrieved, aggregated, analyzed, and output on the basis of the additional information, enabling various video applications to be offered to users; the specific processes can be implemented by the application server 24.
客户端30可包括例如智能手机31、笔记本电脑32、台式电脑33、平板电脑34、以及其他任意未显示在图1中的智能终端,例如智能眼镜、增强现实头盔、可穿戴智能设备等等。The clients 30 may include, for example, a smartphone 31, a notebook computer 32, a desktop computer 33, a tablet computer 34, and any other smart terminal not shown in FIG. 1, such as smart glasses, augmented-reality headsets, wearable smart devices, and the like.
客户端30与应用服务器24进行交互,从而可以使用应用服务器24提供的各种视频应用程序。以下结合具体的应用场景进行描述。The client 30 interacts with the application server 24 and can thus use the various video applications it provides. This is described below in connection with specific application scenarios.
参阅图5,在一个具体的应用场景中,某个移动拍摄终端10从位置A移动到了位置B,在此过程中,移动拍摄终端10一直在拍摄并向云端服务器系统20上传带有附加信息视频帧的数据包。云端服务器系统20可以将接收到的数据包转发给客户端。当然,移动拍摄终端10也可以采用点对点的方式将数据包直接发送给客户端。客户端从接收到的数据包中解析出定位信息,根据定位信息可以在电子地图301中生成轨迹线,与此同时,客户端还可以同时输出视频画面302。如此,客户端的用户既可以通过电子地图301实时了解移动拍摄终端10的位置,又可以通过视频画面302实时观看当前拍摄到的视频画面,实现了移动位置与实时画面的同时观看,尤其适用于对某个目标的实时追踪。但可以理解的是,这种追踪只针对同一个移动拍摄终端10且拍摄者身份相同的时候才有意义。Referring to FIG. 5, in a specific application scenario, a mobile capture terminal 10 moves from position A to position B while continuously shooting and uploading data packets of video frames carrying additional information to the cloud server system 20. The cloud server system 20 can forward the received packets to a client; alternatively, the mobile capture terminal 10 can send the packets directly to the client in a point-to-point manner. The client parses the positioning information out of the received packets and uses it to draw a track line on an electronic map 301 while simultaneously outputting the video picture 302. The client's user can thus follow the position of the mobile capture terminal 10 on the electronic map 301 in real time while watching the currently captured footage in the video picture 302, viewing the moving position and the live picture at the same time; this is particularly useful for real-time tracking of a target. It should be understood, however, that such tracking is only meaningful for one and the same mobile capture terminal 10 with the same photographer identity.
进一步地,当附加信息内包括移动拍摄终端10的姿态信息时,还可以在电子地图301中显示视频的视角。Further, when the posture information of the mobile photographing terminal 10 is included in the additional information, the angle of view of the video can also be displayed in the electronic map 301.
可以理解,在电子地图301中,轨迹线中的每一个点对应于一个坐标,当用户点击轨迹线上的某个点时,可以根据预设的映射关系获取到点击点所对应的坐标,然后就可以在接收到的所有的视频帧中寻找定位信息最接近于该坐标的视频帧,并可将视频画面302切换至该视频帧。It will be appreciated that each point on the track line in the electronic map 301 corresponds to a coordinate. When the user clicks a point on the track line, the coordinate of the clicked point is obtained from a preset mapping; the client can then search all received video frames for the frame whose positioning information is closest to that coordinate and switch the video picture 302 to it.
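The closest-frame lookup can be sketched as follows (the coordinate layout is assumed; over a small map area a squared-distance comparison is enough to pick the winner):

```python
def nearest_frame(frames, click):
    # frames: list of (frame_id, (lat, lng)) pairs parsed from the
    # received packets; click: (lat, lng) of the clicked track point.
    def sq_dist(p):
        return (p[0] - click[0]) ** 2 + (p[1] - click[1]) ** 2
    # return the id of the frame whose positioning info is closest
    return min(frames, key=lambda f: sq_dist(f[1]))[0]

frames = [("f1", (31.20, 121.50)),
          ("f2", (31.21, 121.52)),
          ("f3", (31.25, 121.55))]
assert nearest_frame(frames, (31.212, 121.521)) == "f2"
```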
在上述的应用场景中,客户端是在查看某个特定的移动拍摄终端10拍摄的视频,然而,在电子地图中附加视频服务的形式并不限于上述的这种方式,例如,在另一个具体的应用场景中,云端服务器系统20可以提供在线的电子地图服务,向客户端提供电子地图数据。In the application scenario above, the client views video captured by one particular mobile capture terminal 10. The ways of attaching a video service to an electronic map are not limited to this, however. In another specific application scenario, for example, the cloud server system 20 can provide an online electronic map service, supplying electronic map data to clients.
参阅图6,在客户端30中运行有对应的电子地图应用程序,其从云端服务器系统20中获取电子地图数据并在界面61中加以显示。如图6所示,当被触发时,在界面61中可以显示菜单62,在菜单62中具有各种附加功能,例如“全景预览”的入口,当用户触发“全景预览”功能时,电子地图应用程序获取用户点击位置的坐标,根据该坐标生成预览请求,并将预览请求发送给云端服务器系统20。Referring to FIG. 6, a corresponding electronic map application runs on the client 30; it obtains electronic map data from the cloud server system 20 and displays it in an interface 61. As shown in FIG. 6, when triggered, a menu 62 can be displayed in the interface 61 offering various additional functions, such as an entry for "panoramic preview." When the user triggers the panoramic-preview function, the electronic map application obtains the coordinates of the position the user clicked, generates a preview request from those coordinates, and sends the request to the cloud server system 20.
参阅图7,在云端服务器系统20内处理全景预览请求的流程包括以下步骤:Referring to FIG. 7, the process of processing a panoramic preview request in the cloud server system 20 includes the following steps:
步骤S201,接收全景预览请求。Step S201, receiving a panoramic preview request.
步骤S202,从所述全景预览请求中解析出预览坐标。Step S202, parsing the preview coordinates from the panoramic preview request.
预览坐标就是指用户触发全景预览功能时,电子地图应用程序获取的用户点击位置的坐标,或者以其他方式标记的点的坐标。The preview coordinates are the coordinates of the position the user clicked, obtained by the electronic map application when the user triggered the panoramic-preview function, or the coordinates of a point marked in some other way.
步骤S203,根据所述预览坐标以及视频数据的附加信息检索匹配的视频数据。Step S203, retrieving the matched video data according to the preview coordinates and the additional information of the video data.
如上所述,在数据库中存储有所有视频数据的附加信息(至少包括地址信息),因此,可以检索与预览坐标匹配的附加信息,再经由这些附加信息获取对应的视频数据。As described above, additional information (including at least address information) of all the video data is stored in the database, and therefore, additional information matching the preview coordinates can be retrieved, and corresponding video data can be acquired via the additional information.
由于在同一位置,有可能具有多个移动拍摄终端拍摄的视频数据,因此,在搜索结果的呈现时,可以根据一定的顺序,例如,拍摄的时间、图像的清晰度等,对搜索结果进行排序,然后从排序后的结果内获取至少一个移动拍摄终端拍摄的视频数据作为检索结果。Since the same location may have video data captured by multiple mobile capture terminals, the search results can be sorted by some criterion, for example shooting time or image sharpness, before being presented; the video data of at least one mobile capture terminal is then taken from the sorted results as the retrieval result.
值得注意的是,当用户在电子地图中预览全景时,发送的视频数据内可仅包括一帧,而无须将所有的视频数据都进行传输,从而可以降低网络传输量。It is worth noting that when the user previews a panorama in the electronic map, the transmitted video data may contain only a single frame; there is no need to transmit all of the video data, which reduces the amount of network traffic.
此外,在步骤S203的检索步骤中,能够进行检索的视频数据必须是那些已经被拍摄用户授权公开的视频数据。Furthermore, in the retrieval of step S203, the searchable video data must be limited to video data whose capturing users have already authorized its disclosure.
步骤S204,将检索得到的数据发送给客户端以供展示。Step S204, the retrieved data is sent to the client for display.
客户端在接收到云端服务器系统20发送的视频数据后,根据预定义的协议对视频数据进行解包(如果有压缩)、解密(如果有加密)等处理,然后就可以在界面中进行输出。After receiving the video data sent by the cloud server system 20, the client unpacks (if compressed), decrypts (if encrypted), and the like according to a predefined protocol, and then outputs the content in the interface.
由于用户在使用全景预览功能时,除了位置,还会涉及方向,因此,在预览请求中,还可以包括用户选择的方向。在此情形下,在步骤S203的检索步骤中,除了检索预览坐标,还必须检索视频的拍摄方向,只有当坐标与方向均匹配的视频数据才能作为匹配的视频数据。Since the panoramic-preview function involves a direction as well as a position, the preview request can also include a direction selected by the user. In that case, the retrieval in step S203 must match not only the preview coordinates but also the shooting direction of the video; only video data matching both the coordinates and the direction qualifies as matched video data.
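A sketch of the matching in step S203 with the optional direction constraint (the record field names, distance threshold, and angle tolerance are all assumptions made for illustration):

```python
def match_preview(records, coord, direction=None,
                  max_dist=0.001, max_angle=30.0):
    # records: list of dicts with "lat", "lng", "heading", "shot_at".
    # A record matches when it lies within max_dist of the preview
    # coordinate and, if a direction was requested, within max_angle
    # degrees of it.
    hits = []
    for r in records:
        if abs(r["lat"] - coord[0]) > max_dist or \
           abs(r["lng"] - coord[1]) > max_dist:
            continue
        if direction is not None:
            diff = abs(r["heading"] - direction) % 360
            if min(diff, 360 - diff) > max_angle:
                continue
        hits.append(r)
    # present newest footage first (one possible ordering criterion)
    return sorted(hits, key=lambda r: r["shot_at"], reverse=True)

records = [
    {"lat": 31.2000, "lng": 121.5000, "heading": 90,  "shot_at": "2016-03-10"},
    {"lat": 31.2001, "lng": 121.5001, "heading": 85,  "shot_at": "2016-03-14"},
    {"lat": 31.2001, "lng": 121.5001, "heading": 270, "shot_at": "2016-03-15"},
]
out = match_preview(records, (31.2, 121.5), direction=90)
assert [r["shot_at"] for r in out] == ["2016-03-14", "2016-03-10"]
```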
根据上述的实施方式,可以直接基于海量移动拍摄终端拍摄的视频数据实现电子地图中的全景预览功能,而无须采用专门的街景拍摄车去拍摄街景照片,有效降低实景浏览功能的构建成本。According to the above embodiments, the panoramic-preview function of an electronic map can be built directly on the video data captured by a large number of mobile capture terminals, without sending out dedicated street-view cars to take street-view photos, effectively reducing the cost of building a real-scene browsing function.
参阅图8,在另一个具体的应用场景中,云端服务器系统20还提供基于地理位置的视频搜索服务。如图8所示,云端服务器系统20提供视频搜索服务的流程包括以下步骤:Referring to FIG. 8, in another specific application scenario, the cloud server system 20 also provides a geographic location based video search service. As shown in FIG. 8, the flow of providing a video search service by the cloud server system 20 includes the following steps:
步骤S301,接收客户端发送的视频搜索请求;Step S301, receiving a video search request sent by the client;
步骤S302,从该视频搜索请求内解析出搜索坐标以及搜索时间;Step S302, parsing the search coordinates and the search time from the video search request;
步骤S303,根据所述搜索坐标以及搜索时间搜索匹配的视频数据;Step S303, searching for matching video data according to the search coordinates and the search time;
步骤S304,将搜索到的视频数据发送给客户端进行展示;以及Step S304, sending the searched video data to the client for display;
步骤S305,根据用户观看视频过程中的反馈修改不同视频拍摄终端拍摄的视频数据的排序。Step S305: adjust the ranking of the video data captured by different capture terminals according to feedback gathered while users watch the videos.
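Steps S303 to S305 can be sketched as follows (the record layout, the matching tolerances, and the simple additive scoring rule are assumptions, not from the patent):

```python
def search(catalog, coord, time_range):
    # catalog: list of records with "lat", "lng", "shot_at",
    # "terminal", "score".  S303: match by coordinate and time window.
    lo, hi = time_range
    hits = [r for r in catalog
            if abs(r["lat"] - coord[0]) < 0.01
            and abs(r["lng"] - coord[1]) < 0.01
            and lo <= r["shot_at"] <= hi]
    # S304/S305: order by the accumulated feedback score
    return sorted(hits, key=lambda r: r["score"], reverse=True)

def feedback(catalog, terminal, delta):
    # S305: viewer feedback (likes, watch time, ...) adjusts the
    # ranking score of everything a given terminal uploaded.
    for r in catalog:
        if r["terminal"] == terminal:
            r["score"] += delta

catalog = [
    {"lat": 31.2, "lng": 121.5, "shot_at": "10:00", "terminal": "A", "score": 0},
    {"lat": 31.2, "lng": 121.5, "shot_at": "10:05", "terminal": "B", "score": 0},
]
feedback(catalog, "B", 5)
ranked = search(catalog, (31.2, 121.5), ("09:00", "11:00"))
assert [r["terminal"] for r in ranked] == ["B", "A"]
```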
根据这种方式,当某个地点发生热点事件时,无须进行专门的拍摄,只要提供事件发生的地点与时间,就可以获取到对应的视频数据;而根据用户观看过程中的反馈结果修正视频数据的排序又能够保证用户优先观看的是最感兴趣的视频内容。In this way, when a hot event occurs somewhere, no dedicated filming is needed: given the place and time of the event, the corresponding video data can be retrieved, and re-ranking the video data according to viewer feedback ensures that users see the most interesting video content first.
根据以上各种实施方式,主要是基于视频数据内的地理位置信息实现视频数据的检索、匹配,并在此基础上实现具体的视频的应用场景。然而,视频数据内的附加信息并不限于这些应用场景。The various embodiments above mainly use the geographic location information in the video data to retrieve and match video data, and build concrete video application scenarios on that basis. The uses of the additional information in the video data are not limited to these scenarios, however.
例如,在移动拍摄终端10拍摄并上传视频数据的过程中,用户还可以输入一些指令,例如分享、交通违章举报、报警、报保险、急救等。指令的输入方法可以是直接通过移动拍摄终端10上的按键、触控屏等方式输入,也可以通过与移动拍摄终端10连接的移动电子终端30输入。这些指令的代码可被添加至视频帧的数据包内。相应地,云端服务器系统20在接收到视频帧的数据包后,可以从其中解析出这些指令代码,并执行相应的数据处理过程,或者将这些指令转发给其他的第三方机构的服务器进行进一步的处理。For example, while the mobile capture terminal 10 is shooting and uploading video data, the user can also input instructions such as share, traffic-violation report, police alarm, insurance claim, or emergency call. Instructions may be entered directly through buttons or a touch screen on the mobile capture terminal 10, or through a mobile electronic terminal 30 connected to it. The codes of these instructions can be added to the data packets of the video frames. Correspondingly, after receiving a video-frame data packet, the cloud server system 20 can parse out these instruction codes and execute the corresponding data-processing procedures, or forward the instructions to the servers of third-party organizations for further processing.
如图9所示,此处的第三方机构可包括:自媒体平台、社交网络平台、交通违章处理机构、警方、保险机构、急救机构等。As shown in FIG. 9, the third-party organizations here may include self-media platforms, social network platforms, traffic-violation handling authorities, the police, insurance companies, emergency services, and so on.
例如,当接收到用户的分享指令时,云端服务器系统20可以自动截取预定长度(可从接收到分享指令时刻开始)或者指定长度的视频片断,将其分享至与移动拍摄终端10相绑定的自媒体平台或者社交网络平台(例如微信朋友圈、QQ空间、微博等等)中去。如此,当用户碰到有趣的事件、漂亮的风景或者其他任意想要分享的内容时,即可以一键将拍摄的视频分享出去。For example, upon receiving a user's share instruction, the cloud server system 20 can automatically cut a video clip of a predetermined length (starting, for instance, from the moment the share instruction was received) or of a specified length and share it to the self-media platform or social network platform (for example WeChat Moments, QQ Zone, Weibo, and so on) bound to the mobile capture terminal 10. Thus, when the user encounters an interesting event, beautiful scenery, or anything else worth sharing, the captured video can be shared with a single tap.
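The clip extraction triggered by a share instruction can be sketched as follows (the frame layout and the default clip length are assumptions made for illustration):

```python
def clip_for_share(frames, share_time, length=10.0):
    # frames: list of (timestamp, data) pairs; a share instruction
    # received at share_time cuts a clip of `length` seconds starting
    # at that moment.
    return [f for f in frames
            if share_time <= f[0] < share_time + length]

frames = [(t, b"") for t in range(0, 30, 5)]   # one frame every 5 s
clip = clip_for_share(frames, share_time=10, length=10)
assert [t for t, _ in clip] == [10, 15]
```

The same extraction serves the report, alarm, insurance, and emergency instructions described below; only the destination server differs.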
进一步地,用户对于分享的内容,还可以添加标签、文字介绍等。当移动拍摄终端10包括输入界面时,用户可以直接在移动拍摄终端10上输入;当移动拍摄终端10不包括输入界面时,可以通过与移动拍摄终端10绑定的移动电子终端进行输入。这些标签、文字介绍可被云端服务器系统20存储并用于进行视频的检索。Further, the user can add tags, text descriptions, and the like to the shared content. When the mobile capture terminal 10 has an input interface, the user can type directly on it; when it does not, input can be made through the mobile electronic terminal bound to it. These tags and text descriptions can be stored by the cloud server system 20 and used for video retrieval.
当接收到交通违章举报指令时,云端服务器系统20即可自动截取一段相关的视频片段发送至交通违章处理机构的服务器中,如此,用户通过移动拍摄终端10可以实现一键举报交通违法、违章现象。Upon receiving a traffic-violation report instruction, the cloud server system 20 can automatically cut a relevant video clip and send it to the server of the traffic-violation handling authority, so that the user can report traffic violations with a single tap on the mobile capture terminal 10.
当接收到报警指令时,云端服务器系统20即可自动截取一段相关的视频片段发送至报警受理机构的报警服务器中。如此,用户通过移动拍摄终端10可以实现一键报警,而且由于视频帧数据包内直接包含了定位信息,可以方便警方及时定位报警的位置。Upon receiving an alarm instruction, the cloud server system 20 can automatically cut a relevant video clip and send it to the alarm server of the alarm-handling authority. The user can thus raise an alarm with a single tap on the mobile capture terminal 10, and because the video-frame data packets directly contain positioning information, the police can promptly locate the position of the alarm.
当接收到报保险指令时,云端服务器系统20即可自动截取一段相关的视频片段发送至保险机构的服务器中。如此,用户通过移动拍摄终端10可以实现一键报保险,由于视频可以有效还原现场的实况,保险机构无须去现场即可实现远程的理赔服务。Upon receiving an insurance-claim instruction, the cloud server system 20 can automatically cut a relevant video clip and send it to the insurance company's server. The user can thus file a claim with a single tap on the mobile capture terminal 10, and since the video can faithfully reconstruct the scene, the insurer can settle the claim remotely without visiting the site.
当接收到急救指令时,云端服务器系统20即可自动截取一段相关的视频片段发送至急救机构的服务器中,如此用户通过移动拍摄终端10即可实现一键呼叫急救服务的功能。而且由于视频帧数据包内直接包含了定位信息,可以让急救机构迅速地定位呼叫位置,减少位置沟通时间。Upon receiving an emergency instruction, the cloud server system 20 can automatically cut a relevant video clip and send it to the server of the emergency service, so that the user can call for emergency help with a single tap on the mobile capture terminal 10. Moreover, since the video-frame data packets directly contain positioning information, the emergency service can quickly locate the caller and reduce the time spent communicating the location.
根据上述的实施方式,在视频数据的附加信息内还可以直接嵌入指令,让云端服务器系统20执行对应的数据处理过程及功能,使得视频数据可被更加广泛的应用。According to the above embodiments, instructions can be embedded directly in the additional information of the video data, causing the cloud server system 20 to execute the corresponding data-processing procedures and functions, so that the video data can be put to far wider use.
如上所述,用户在分享移动拍摄终端10拍摄的视频时,可以输入标签或者文字介绍,在云端服务器系统20中,基于这些文字介绍,还可以实现热点事件及相关视频的自动发现与挖掘。As described above, when sharing video captured by the mobile capture terminal 10, the user can enter tags or text descriptions; based on these, the cloud server system 20 can also automatically discover and mine hotspot events and related videos.
参阅图10,基于视频标签与文字介绍的热点事件及相关视频的自动挖掘过程包括以下步骤:Referring to FIG. 10, the automatic mining process of hotspot events and related videos based on video tags and texts includes the following steps:
步骤S301,从接收到的视频标签以及文字介绍中提取出关键词。Step S301, extracting keywords from the received video tags and text introduction.
视频标签一般可以直接作为关键词。文字介绍可进行分词处理、词频统计等步骤,从中提取出关键词。Video tags can generally be used directly as keywords. Text descriptions can be put through word segmentation, word-frequency statistics, and similar steps to extract keywords from them.
步骤S302,基于地理位置分别统计关键词的出现频率和/或其他参数。In step S302, the frequency of occurrence of the keyword and/or other parameters are separately calculated based on the geographic location.
由于与视频数据同时发送的附加信息内还包括定位信息,如此,就可以基于地理位置统计某个关键词出现的频率以及其他参数。此处的其他参数例如可包括密度、增长速度等等。Since the additional information sent along with the video data also contains positioning information, the frequency of occurrence of a keyword and other parameters can be tallied per geographic location. Other parameters here may include, for example, density, rate of increase, and so on.
步骤S303,当某个地点某个关键词出现的频率和/或其他参数超过预设的阈值时,在一个内容发布系统中发布基于该地点与该关键词相关的视频的内容。Step S303, when the frequency and/or other parameters of a certain keyword appearing in a certain place exceed a preset threshold, the content of the video related to the keyword based on the location is released in a content distribution system.
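The hotspot detection of steps S301 to S303 can be sketched as follows (the observation format, the frequency-only statistic, and the threshold are assumptions made for illustration):

```python
from collections import Counter

def hotspots(observations, threshold):
    # observations: (location, keyword) pairs extracted from the tags
    # and text descriptions uploaded with the video (S301/S302).
    counts = Counter(observations)
    # S303: a (location, keyword) pair whose frequency reaches the
    # threshold is treated as a hotspot event worth publishing.
    return [pair for pair, n in counts.items() if n >= threshold]

obs = ([("parkA", "concert")] * 4
       + [("parkA", "dog")]
       + [("mallB", "sale")] * 2)
assert hotspots(obs, threshold=3) == [("parkA", "concert")]
```

A production system would additionally track density and rate of increase, as the text notes, but the thresholding idea is the same.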
当某个关键词在一个地点出现的频率和/或其他参数高于预设值时,可以视为该地点发生了某个热点事件,或者具有热点的内容。此时可以在一个内容发布系统中发布该地点与该关键词相关的视频内容以供用户浏览。此处的内容发布系统,例如是一个视频网站、APP或者其他内容发布系统。When the frequency and/or other parameters of a keyword at a location exceed preset values, a hotspot event or trending content can be considered to have occurred there. Video content related to that keyword and location can then be published in a content distribution system for users to browse. The content distribution system here may be, for example, a video website, an app, or another publishing system.
发布的视频内容,既可以是单个移动拍摄终端所拍摄的视频,也可以是多个移动拍摄终端10所拍摄的视频数据经剪辑而成。The published video content may be footage shot by a single mobile capture terminal, or may be edited together from video data shot by multiple mobile capture terminals 10.
根据上述的这种方式,可以自动发现生活中各种热门的事件、场景、内容,并直接以视频的内容呈现给用户,无须专门的拍摄团队去现场拍摄。In this way, popular events, scenes, and content can be discovered automatically and presented to users directly as video, without sending a dedicated film crew to the scene.
以上所述,仅是本发明的较佳实施例而已,并非对本发明作任何形式上的限制。虽然本发明已以较佳实施例揭示如上,然而并非用以限定本发明,任何本领域技术人员,在不脱离本发明技术方案范围内,当可利用上述揭示的技术内容做出些许更动或修饰为等同变化的等效实施例,但凡是未脱离本发明技术方案内容,依据本发明的技术实质对以上实施例所作的任何简单修改、等同变化与修饰,均仍属于本发明技术方案的范围内。The above are merely preferred embodiments of the present invention and do not limit the invention in any form. Although the invention has been disclosed above through preferred embodiments, these are not intended to restrict it; any person skilled in the art may, without departing from the scope of the technical solution of the invention, use the technical content disclosed above to make minor changes or modifications amounting to equivalent embodiments. Any simple modification, equivalent change, or refinement made to the above embodiments in accordance with the technical essence of the invention, without departing from the content of its technical solution, still falls within the scope of the technical solution of the present invention.
Claims (12)
- 一种基于地理位置的视频处理方法,应用于移动拍摄终端中,所述移动拍摄终端包括摄像头,其特征在于,所述方法包括:A geographic location-based video processing method applied in a mobile capture terminal, the mobile capture terminal including a camera, the method comprising:调用所述摄像头拍摄以获取视频数据;invoking the camera to shoot and obtain video data;根据所述视频数据获取待发送的当前视频帧;obtaining, from the video data, a current video frame to be sent;获取与该当前视频帧对应的附加信息,所述附加信息至少包括所述移动拍摄终端当前的定位信息;obtaining additional information corresponding to the current video frame, the additional information including at least the current positioning information of the mobile capture terminal;将所述附加信息添加至所述当前视频帧的数据包内;以及adding the additional information to a data packet of the current video frame; and将添加有附加信息的当前视频帧的数据包发送给云端服务器或者远端的客户端,以使所述云端服务器或者远端的客户端根据接收到的数据包内的定位信息对视频帧进行处理。sending the data packet of the current video frame with the additional information added to a cloud server or a remote client, so that the cloud server or remote client processes the video frame according to the positioning information in the received data packet.
- 如权利要求1所述的基于地理位置的视频处理方法,其特征在于,还包括:在拍摄时实时检测所述摄像头的旋转角度;所述附加信息还包括所述摄像头的旋转角度。The method according to claim 1, further comprising: detecting a rotation angle of the camera in real time during shooting; the additional information further comprising a rotation angle of the camera.
- 如权利要求1所述的基于地理位置的视频处理方法,其特征在于,还包括:获取与所述当前视频帧对应的拍摄者的用户身份识别信息;所述附加信息还包括所述用户身份识别信息。The method for processing a geographic location based video according to claim 1, further comprising: acquiring user identification information of a photographer corresponding to the current video frame; the additional information further including the user identification information.
- 如权利要求1所述的基于地理位置的视频处理方法,其特征在于,还包括:在拍摄时实时获取所述当前视频帧的拍摄时间;所述附加信息还包括所述拍摄时间。The geographic location-based video processing method according to claim 1, further comprising: acquiring a shooting time of the current video frame in real time during shooting; the additional information further including the shooting time.
- 如权利要求1所述的基于地理位置的视频处理方法,其特征在于,还包括:获取用户输入的指令;所述附加信息还包括所述指令。The geographic location-based video processing method according to claim 1, further comprising: obtaining an instruction input by the user; the additional information further includes the instruction.
- 如权利要求1所述的基于地理位置的视频处理方法,其特征在于,还包括:接收用户输入的标签和/或文字介绍;所述附加信息还包括所述标签和/或文字介绍。The geographic location-based video processing method according to claim 1, further comprising: receiving a tag and/or text description input by the user; the additional information further includes the tag and/or text description.
- 如权利要求1所述的基于地理位置的视频处理方法,其特征在于,还包括:对所述附加信息的至少部分内容进行加密处理,所述附加信息至少包括经过加密处理的内容。The geographic location-based video processing method of claim 1, further comprising: encrypting at least a portion of the content of the additional information, the additional information including at least the encrypted processed content.
- 如权利要求7所述的基于地理位置的视频处理方法,其特征在于,还包括:根据所述当前帧的视频数据计算对应的校验信息;所述经过加密处理的内容内包括所述校验信息。The geographic location-based video processing method according to claim 7, further comprising: calculating corresponding verification information from the video data of the current frame; the encrypted content includes the verification information.
- 一种基于地理位置的视频处理方法,其特征在于,包括:A geographic location-based video processing method, comprising:接收移动拍摄终端发送的视频数据包;Receiving a video data packet sent by the mobile shooting terminal;从所述视频数据包中解析出视频数据以及对应的附加信息,所述附加信息至少包括所述移动拍摄终端当前的定位信息;And parsing the video data and the corresponding additional information from the video data packet, where the additional information includes at least the current positioning information of the mobile shooting terminal;存储所述视频数据并获取对应的索引;Storing the video data and obtaining a corresponding index;将所述附加信息与所述索引关联存储;以及Storing the additional information in association with the index;基于所述附加信息检索并获取对应的视频数据,并基于检索到的视频数据向客户端提供视频数据服务。Retrieving and acquiring corresponding video data based on the additional information, and providing a video data service to the client based on the retrieved video data.
- 如权利要求9所述的基于地理位置的视频处理方法,其特征在于,还包括:从所述视频数据包中解析出与每个视频帧对应的指令;并执行与所述指令对应的数据处理过程。 The geographic location-based video processing method according to claim 9, further comprising: parsing an instruction corresponding to each video frame from the video data packet; and performing data processing corresponding to the instruction process.
- 如权利要求10所述的基于地理位置的视频处理方法,其特征在于,所述数据处理过程包括:The method of processing a geographic location based video according to claim 10, wherein the data processing process comprises:截取预定长度或者指定长度的视频片断,将其分享至与所述移动拍摄终端相绑定的自媒体平台或者社交网络平台;或者Capturing a video clip of a predetermined length or a specified length and sharing it to a self-media platform or a social networking platform bound to the mobile camera terminal; or截取一段相关的视频片段发送至交通违章处理机构的服务器中;或者Intercepting a relevant video clip to be sent to the server of the traffic violation handling agency; or自动截取一段相关的视频片段发送至报警受理机构的报警服务器中;或者Automatically intercepting a related video clip and sending it to the alarm server of the alarm receiving institution; or自动截取一段相关的视频片段发送至急救机构的服务器中。Automatically intercepts a related video clip and sends it to the server of the emergency department.
- 如权利要求9所述的基于地理位置的视频处理方法,其特征在于,还包括:从所述视频数据包中解析出视频的标签和/或文字介绍;The method according to claim 9, further comprising: parsing a label and/or a text of the video from the video data package;将所述标签和/或文字介绍存储;Store the label and/or text description;从所述标签和/或文字介绍中提取出关键词;Extracting keywords from the label and/or text introduction;基于地理位置对所述关键词进行统计分析;Perform statistical analysis on the keywords based on geographic location;当某个地理位置的某个关键词的统计值超过预设阈值时获取相关的视频数据,并在一个内容发布系统中发布所述获取的相关的视频数据。 When the statistical value of a certain keyword of a certain geographical location exceeds a preset threshold, relevant video data is acquired, and the acquired related video data is published in a content distribution system.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610147581.0 | 2016-03-15 | ||
CN201610147581.0A CN105812660A (en) | 2016-03-15 | 2016-03-15 | Video processing method based on geographic position |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2017156793A1 true WO2017156793A1 (en) | 2017-09-21 |
Family
ID=56468429
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2016/077182 WO2017156793A1 (en) | 2016-03-15 | 2016-03-24 | Geographic location-based video processing method |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN105812660A (en) |
WO (1) | WO2017156793A1 (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110163050A (en) * | 2018-07-23 | 2019-08-23 | 腾讯科技(深圳)有限公司 | A kind of method for processing video frequency and device, terminal device, server and storage medium |
CN113222637A (en) * | 2021-02-26 | 2021-08-06 | 深圳前海微众银行股份有限公司 | Architecture method, device, equipment, medium and program product of store visitor information |
CN113704554A (en) * | 2021-07-13 | 2021-11-26 | 湖南中惠旅智能科技有限责任公司 | Video retrieval method and system based on electronic map |
CN114040006A (en) * | 2021-11-01 | 2022-02-11 | 达维信息技术(深圳)有限公司 | Multi-mobile-terminal data sharing method and system based on digital asset management |
Families Citing this family (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108460037A (en) * | 2017-02-20 | 2018-08-28 | 北京金奔腾汽车科技有限公司 | A method of stroke video is preserved and retrieved based on geographical location |
CN110019628B (en) * | 2017-12-27 | 2023-12-29 | 努比亚技术有限公司 | Positioning method, mobile terminal and computer readable storage medium |
CN108109188B (en) * | 2018-01-12 | 2022-02-08 | 维沃移动通信有限公司 | An image processing method and mobile terminal |
CN108833767A (en) * | 2018-03-28 | 2018-11-16 | 深圳市语图科技有限公司 | A kind of positioning system and method applied to record motion profile |
CN112004046A (en) * | 2019-05-27 | 2020-11-27 | 中兴通讯股份有限公司 | Image processing method and device based on video conference |
CN111327860A (en) * | 2020-01-21 | 2020-06-23 | 成都纵横自动化技术股份有限公司 | Synchronous transmission method for figures and electronic equipment |
CN111353168A (en) * | 2020-02-27 | 2020-06-30 | 闻泰通讯股份有限公司 | Multimedia file management method, device, equipment and storage medium |
CN111444385B (en) * | 2020-03-27 | 2023-03-03 | 西安应用光学研究所 | Electronic map real-time video mosaic method based on image corner matching |
CN111770107B (en) * | 2020-07-07 | 2022-11-22 | 广州通达汽车电气股份有限公司 | Streaming media transmission method, system, storage medium and computer equipment for bearing dynamic data |
CN113505259A (en) * | 2021-06-28 | 2021-10-15 | 惠州Tcl云创科技有限公司 | Media file labeling method, device, equipment and medium based on intelligent identification |
CN115695924A (en) * | 2021-07-30 | 2023-02-03 | 瑞庭网络技术(上海)有限公司 | Data processing method, client, server, and computer-readable recording medium |
CN114326764A (en) * | 2021-11-29 | 2022-04-12 | 上海岩易科技有限公司 | Rtmp transmission-based smart forestry unmanned aerial vehicle fixed-point live broadcast method and unmanned aerial vehicle system |
CN114422856B (en) * | 2022-01-07 | 2024-06-04 | 北京达佳互联信息技术有限公司 | Video data verification method, device, equipment and storage medium |
CN115455275B (en) * | 2022-11-08 | 2023-02-03 | 广东卓维网络有限公司 | Video processing system integrated with inspection equipment |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6741790B1 (en) * | 1997-05-29 | 2004-05-25 | Red Hen Systems, Inc. | GPS video mapping system |
CN1551664A (en) * | 2003-04-14 | 2004-12-01 | ��ʽ����Ntt����Ħ | Mobile communication system, mobile communication terminal and its program |
CN101610376A (en) * | 2008-06-17 | 2009-12-23 | 三星电子株式会社 | Method and apparatus for recording and reproducing moving pictures with position information |
CN103716584A (en) * | 2013-11-30 | 2014-04-09 | 南京大学 | Context sensing-based intelligent mobile terminal field monitoring method |
CN103984710A (en) * | 2014-05-05 | 2014-08-13 | 深圳先进技术研究院 | Video interaction inquiry method and system based on mass data |
CN105022801A (en) * | 2015-06-30 | 2015-11-04 | 北京奇艺世纪科技有限公司 | Hot video mining method and hot video mining device |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101867730B (en) * | 2010-06-09 | 2011-11-16 | 马明 | Multimedia integration method based on user trajectory |
CN102289520A (en) * | 2011-09-15 | 2011-12-21 | 山西四和交通工程有限责任公司 | Traffic video retrieval system and realization method thereof |
CN103686239A (en) * | 2013-12-11 | 2014-03-26 | 深圳先进技术研究院 | Network sharing crime evidence collection system and method based on positioning video |
CN104679873A (en) * | 2015-03-09 | 2015-06-03 | 深圳市道通智能航空技术有限公司 | Aircraft tracing method and aircraft tracing system |
- 2016
- 2016-03-15 CN CN201610147581.0A patent/CN105812660A/en active Pending
- 2016-03-24 WO PCT/CN2016/077182 patent/WO2017156793A1/en active Application Filing
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6741790B1 (en) * | 1997-05-29 | 2004-05-25 | Red Hen Systems, Inc. | GPS video mapping system |
CN1551664A (en) * | 2003-04-14 | 2004-12-01 | ��ʽ����Ntt����Ħ | Mobile communication system, mobile communication terminal and its program |
CN101610376A (en) * | 2008-06-17 | 2009-12-23 | 三星电子株式会社 | Method and apparatus for recording and reproducing moving pictures with position information |
CN103716584A (en) * | 2013-11-30 | 2014-04-09 | 南京大学 | Context sensing-based intelligent mobile terminal field monitoring method |
CN103984710A (en) * | 2014-05-05 | 2014-08-13 | 深圳先进技术研究院 | Video interaction inquiry method and system based on mass data |
CN105022801A (en) * | 2015-06-30 | 2015-11-04 | 北京奇艺世纪科技有限公司 | Hot video mining method and hot video mining device |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110163050A (en) * | 2018-07-23 | 2019-08-23 | 腾讯科技(深圳)有限公司 | A kind of method for processing video frequency and device, terminal device, server and storage medium |
CN110163050B (en) * | 2018-07-23 | 2022-09-27 | 腾讯科技(深圳)有限公司 | Video processing method and device, terminal equipment, server and storage medium |
CN113222637A (en) * | 2021-02-26 | 2021-08-06 | 深圳前海微众银行股份有限公司 | Architecture method, device, equipment, medium and program product of store visitor information |
CN113704554A (en) * | 2021-07-13 | 2021-11-26 | 湖南中惠旅智能科技有限责任公司 | Video retrieval method and system based on electronic map |
CN113704554B (en) * | 2021-07-13 | 2024-03-29 | 湖南中惠旅智能科技有限责任公司 | Video retrieval method and system based on electronic map |
CN114040006A (en) * | 2021-11-01 | 2022-02-11 | 达维信息技术(深圳)有限公司 | Multi-mobile-terminal data sharing method and system based on digital asset management |
CN114040006B (en) * | 2021-11-01 | 2024-02-27 | 北京流通宝数据科技服务有限公司 | Multi-mobile terminal data sharing method and system based on digital asset management |
Also Published As
Publication number | Publication date |
---|---|
CN105812660A (en) | 2016-07-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2017156793A1 (en) | Geographic location-based video processing method | |
US12189827B2 (en) | Systems and methods for authenticating photographic image data | |
US10019774B2 (en) | Authentication and validation of smartphone imagery | |
CN105975570B (en) | Video searching method and system based on geographical location | |
JP6521864B2 (en) | Method using portable electronic device, portable electronic device, and computer program | |
CN105827959A (en) | Geographic position-based video processing method | |
CN105933651B (en) | Method and apparatus based on target route jumper connection video | |
US9792287B2 (en) | Method and apparatus for providing metadata search codes to multimedia | |
US10873846B2 (en) | Peripheral video presence detection | |
CN112866222A (en) | Data processing method and device and data processing device | |
CN115834584A (en) | Cross-network data transmission method, device, equipment and medium | |
JP2017220728A (en) | Verification system, information processor, and verification method | |
CN104967987A (en) | Method and server for mobile terminal positioning | |
US20150113039A1 (en) | Method and apparatus for defining hot spot based task for multimedia data | |
KR101465186B1 (en) | Method for data accessing among remote devices | |
US20250097696A1 (en) | Message communication method, and device | |
US20250078548A1 (en) | Authentication of an event through certification and verification of a computer file | |
WO2025108339A1 (en) | Satellite communication method and related apparatus | |
JP5793916B2 (en) | Distribution server device and distribution method | |
KR101476800B1 (en) | Apparatus and method for managing object based on panorama media | |
GB2563087A (en) | Methods, systems and devices for image acquisition | |
WO2020164726A1 (en) | Mobile communications device and media server | |
HK1200941B (en) | Method and apparatus for providing metadata search codes to multimedia |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 16893958 Country of ref document: EP Kind code of ref document: A1 |
|
32PN | Ep: public notification in the ep bulletin as address of the addressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 11.03.2019) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 16893958 Country of ref document: EP Kind code of ref document: A1 |