CN107509021A

CN107509021A - Shooting method, shooting device and storage medium

Info

Publication number: CN107509021A
Application number: CN201710584382.0A
Authority: CN
Inventors: 刘昕; 廖宇
Original assignee: MIGU Music Co Ltd; MIGU Culture Technology Co Ltd
Current assignee: MIGU Music Co Ltd; MIGU Culture Technology Co Ltd
Priority date: 2017-07-18
Filing date: 2017-07-18
Publication date: 2017-12-22
Anticipated expiration: 2037-07-18
Also published as: CN107509021B

Abstract

The invention discloses a shooting method, which comprises the steps of acquiring state information of a target object in a state of acquiring an audio signal input by the target object; judging whether the state information meets shooting conditions or not; and if the state information accords with shooting conditions, controlling the camera device to shoot the multimedia aiming at the target object. The invention also discloses a shooting device and a storage medium.

Description

A shooting method, device and storage medium

技术领域technical field

本发明涉及拍摄技术领域，尤其涉及一种拍摄方法、装置及存储介质。The present invention relates to the technical field of photographing, in particular to a photographing method, device and storage medium.

背景技术Background technique

随着人们生活水平的不断提高，可供人们选择的娱乐方式也越来越多，其中去K歌房唱歌，是一种非常普遍的娱乐方式。With the continuous improvement of people's living standards, there are more and more entertainment methods for people to choose from. Among them, going to the karaoke room to sing is a very common entertainment method.

近来在商场、影院、餐厅等人流量大的地方，纷纷出现了一种小型化、精致化、便捷化的娱乐设备。该类娱乐设备是一种集唱歌、听歌、录歌等功能于一体的玻璃房子，其外观类似于封闭的电话亭，因此，将其称之为“移动KTV”、“迷你练歌房”，或者“迷你K歌房”(以下统称为“迷你K歌房”)。人们可以在迷你K歌房内不受他人的干涉而尽情歌唱，并且在演唱过程中，不禁会流露出各种丰富的表情。Recently, a kind of miniaturized, exquisite and convenient entertainment equipment has appeared in shopping malls, cinemas, restaurants and other places with a large flow of people. This type of entertainment equipment is a glass house that integrates singing, listening to songs, and recording songs. Its appearance is similar to a closed telephone booth. Therefore, it is called "mobile KTV" and "mini karaoke room". Or "mini karaoke room" (hereinafter collectively referred to as "mini karaoke room"). People can sing in the mini karaoke room without the interference of others, and in the process of singing, they can't help showing various rich expressions.

然而，现有的迷你K歌房中要拍摄用户的表情，往往需要用户通过自己掏出自带的手机或相机等终端设备，进行手动拍摄，效率低且繁琐，导致用户体验较差。However, in the existing mini karaoke room, to capture the user's expression, the user often needs to take out his own mobile phone or camera and other terminal devices to perform manual shooting, which is inefficient and cumbersome, resulting in poor user experience.

发明内容Contents of the invention

有鉴于此，本发明实施例期望提供一种拍摄方法、装置及存储介质，能够实现自动拍摄。In view of this, the embodiments of the present invention hope to provide a shooting method, device and storage medium, which can realize automatic shooting.

为达到上述目的，本发明实施例的技术方案是这样实现的：In order to achieve the above object, the technical solution of the embodiment of the present invention is achieved in this way:

本发明实施例提供一种拍摄方法，所述方法应用于设置有摄像装置的设备，所述方法包括：An embodiment of the present invention provides a shooting method, the method is applied to equipment provided with a camera device, and the method includes:

在采集目标对象输入的音频信号的状态下，获取目标对象的状态信息；In the state of collecting the audio signal input by the target object, obtain the state information of the target object;

判断所述状态信息是否符合拍摄条件；judging whether the state information meets the shooting conditions;

若所述状态信息符合拍摄条件，则控制所述摄像装置拍摄针对所述目标对象的多媒体。If the state information meets the shooting condition, the camera is controlled to shoot the multimedia for the target object.

上述方案中，所述获取目标对象的状态信息之前，还包括：In the above solution, before acquiring the status information of the target object, it also includes:

调整至少一个摄像装置的拍摄角度，使得所述目标对象处于所述至少一个摄像装置的拍摄范围内；或者，若所述目标对象位于所述摄像装置拍摄的预览图像的指定区域外，则提示所述目标对象改变位置。adjusting the shooting angle of at least one camera so that the target object is within the shooting range of the at least one camera; or, if the target object is located outside the specified area of the preview image captured by the camera, prompting the The target object changes position.

上述方案中，所述调整至少一个摄像装置的拍摄角度，包括：In the above solution, the adjustment of the shooting angle of at least one camera device includes:

自动调整所述摄像装置的拍摄角度；或者，automatically adjust the shooting angle of the camera; or,

响应于接收到的拍摄角度调整指令，对所述摄像装置的拍摄角度进行调整。In response to the received shooting angle adjustment instruction, the shooting angle of the camera device is adjusted.

上述方案中，所述获取目标对象的状态信息，包括获取下述信息中的至少一个：In the above solution, the acquisition of the status information of the target object includes acquisition of at least one of the following information:

所述目标对象的音调；the tone of voice of the target audience;

所述目标对象的音量；the volume of the target object;

所述目标对象的面部表情特征。The facial expression characteristics of the target object.

上述方案中，当所述目标对象的状态信息包括所述目标对象的音调时，所述判断所述状态信息是否符合拍摄条件，包括：当所述目标对象在音频信号输出过程中的音调大于音调阈值时，判定符合拍摄条件；In the above solution, when the state information of the target object includes the tone of the target object, the judging whether the state information meets the shooting conditions includes: when the tone of the target object in the audio signal output process is greater than the tone When the threshold is reached, it is determined that the shooting conditions are met;

当所述目标对象的状态信息包括所述目标对象的音量时，所述判断所述状态信息是否符合拍摄条件，包括：当所述目标对象在音频信号输出过程中的音量大于音量阈值时，判定符合拍摄条件；When the state information of the target object includes the volume of the target object, the judging whether the state information meets the shooting conditions includes: when the volume of the target object in the audio signal output process is greater than a volume threshold, judging Meet the shooting conditions;

当所述目标对象的状态信息包括所述目标对象的面部表情特征时，所述判断所述状态信息是否符合拍摄条件，包括：判断表情库中是否存在与所述目标对象的面部表情特征的相似度大于相似度阈值的预设面部表情特征；若存在，则判定符合拍摄条件；其中，所述表情库，为用于存储预设面部表情特征的数据库；所述预设面部表情特征，为预先设置的、用以表征所述目标对象的状态信息符合拍摄条件的面部表情特征。When the state information of the target object includes the facial expression features of the target object, the judging whether the state information meets the shooting conditions includes: judging whether there is similarity with the facial expression features of the target object in the expression database Degree is greater than the preset facial expression feature of the similarity threshold; if it exists, it is determined that it meets the shooting conditions; wherein, the expression library is a database for storing preset facial expression features; the preset facial expression feature is a preset facial expression feature The state information set to characterize the target object conforms to the facial expression feature of the shooting condition.

上述方案中，所述方法还包括：播放预设的多媒体；In the above solution, the method further includes: playing preset multimedia;

若所述状态信息不符合拍摄条件，则当所述预设的多媒体的播放进度达到预设时间点时，控制所述摄像装置拍摄针对目标对象的多媒体。If the state information does not meet the shooting condition, when the preset playing progress of the multimedia reaches a preset time point, the camera is controlled to shoot the multimedia targeted at the target object.

上述方案中，所述设置有摄像装置的设备应用于迷你K歌房。In the above solution, the device provided with the camera device is applied to a mini karaoke room.

上述方案中，所述针对目标对象的多媒体，包括下述之一：In the above scheme, the multimedia for the target object includes one of the following:

图片；视频；picture; video;

所述方法还包括：The method also includes:

对针对所述目标对象的多媒体进行预处理，并将经过预处理后的多媒体进行存储；Preprocessing the multimedia for the target object, and storing the preprocessed multimedia;

其中，所述预处理至少包括以下方式之一：根据获取的多媒体生成表情包；将获取的多媒体制作成个性化视频。Wherein, the preprocessing includes at least one of the following methods: generating an emoticon package according to the acquired multimedia; making the acquired multimedia into a personalized video.

上述方案中，所述方法还包括：In the above scheme, the method also includes:

若所述状态信息符合拍摄条件，则控制所述设备对所述目标对象当前输入的音频信号进行录制；If the state information meets the shooting conditions, then controlling the device to record the audio signal currently input by the target object;

将录制的音频信号和针对所述目标对象的多媒体合成为一个视频。Combining the recorded audio signal and the multimedia for the target object into one video.

利用合成的所述视频，替换所述预设的多媒体。The synthesized video is used to replace the preset multimedia.

将所述针对所述目标对象的多媒体和/或所述视频，发送至指定的终端设备。Send the multimedia and/or the video for the target object to a designated terminal device.

本发明实施例还提供一种拍摄装置，所述装置包括：获取模块、判别模块和拍摄模块；其中，The embodiment of the present invention also provides a photographing device, which includes: an acquisition module, a discrimination module, and a photographing module; wherein,

所述获取模块，用于在采集目标对象输入的音频信号的状态下，获取目标对象的状态信息；The acquisition module is used to acquire the state information of the target object in the state of collecting the audio signal input by the target object;

所述判别模块，用于判断所述状态信息是否符合拍摄条件；The judging module is used to judge whether the state information meets the shooting conditions;

所述拍摄模块，用于若所述状态信息符合拍摄条件，则控制所述摄像装置拍摄针对所述目标对象的多媒体。The photographing module is configured to control the photographing device to photograph the multimedia for the target object if the state information meets the photographing conditions.

上述方案中，所述装置还包括调整模块，用于调整至少一个摄像装置的拍摄角度，使得所述目标对象处于所述至少一个摄像装置的拍摄范围内；或者，若所述目标对象位于所述摄像装置拍摄的预览图像的指定区域外，则提示所述目标对象改变位置。In the above solution, the device further includes an adjustment module, configured to adjust the shooting angle of at least one camera device, so that the target object is within the shooting range of the at least one camera device; or, if the target object is located in the If the preview image captured by the camera device is outside the specified area, the target object is prompted to change its position.

上述方案中，所述调整模块，具体用于：In the above solution, the adjustment module is specifically used for:

上述方案中，所述获取模块，具体用于：获取下述信息中的至少一个：In the above solution, the acquisition module is specifically used to: acquire at least one of the following information:

所述目标对象的音调；the tone of voice of the target audience;

所述目标对象的音量；the volume of the target object;

上述方案中，所述判别模块，具体用于：当所述获取模块获取的所述目标对象的状态信息包括所述目标对象的音调，且所述目标对象在音频信号输出过程中的音调大于音调阈值时，判定符合拍摄条件；In the above solution, the judging module is specifically configured to: when the state information of the target object acquired by the acquisition module includes the tone of the target object, and the tone of the target object in the audio signal output process is greater than the tone When the threshold is reached, it is determined that the shooting conditions are met;

还具体用于：当所述获取模块获取的所述目标对象的状态信息包括所述目标对象的音量，且所述目标对象在音频信号输出过程中的音量大于音量阈值时，判定符合拍摄条件；It is also specifically used for: when the state information of the target object acquired by the acquisition module includes the volume of the target object, and the volume of the target object during the audio signal output process is greater than a volume threshold, determine that the shooting condition is met;

还具体用于：当所述获取模块获取的所述目标对象的状态信息包括所述目标对象的面部表情特征，且判断表情库中是否存在与所述目标对象的面部表情特征的相似度大于相似度阈值的预设面部表情特征；若存在，则判定符合拍摄条件；其中，所述表情库，为用于存储预设面部表情特征的数据库；所述预设面部表情特征，为预先设置的、用以表征所述目标对象的状态信息符合拍摄条件的面部表情特征。It is also specifically used for: when the state information of the target object acquired by the acquisition module includes the facial expression characteristics of the target object, and it is judged whether there is a similarity degree to the facial expression characteristics of the target object in the expression database The preset facial expression feature of the degree threshold; if it exists, it is determined that it meets the shooting conditions; wherein, the expression storehouse is a database for storing preset facial expression features; the preset facial expression feature is a preset, The state information used to characterize the target object meets the facial expression features of the shooting conditions.

上述方案中，所述装置还包括预设模块，具体用于：In the above solution, the device also includes a preset module, specifically for:

播放预设的多媒体；Play preset multimedia;

若所述状态信息不符合拍摄条件，则当所述预设的多媒体的播放进度达到预设时间点时，控制所述摄像装置拍摄针对所述目标对象的多媒体。If the state information does not meet the shooting condition, when the preset playing progress of the multimedia reaches a preset time point, the camera is controlled to shoot the multimedia for the target object.

上述方案中，所述装置还包括预处理模块，用于对针对所述目标对象的多媒体进行预处理，并将经过预处理后的多媒体进行存储；其中，所述预处理至少包括以下方式之一：根据获取的多媒体生成表情包；将获取的多媒体制作成个性化视频。In the above solution, the device further includes a preprocessing module, configured to preprocess the multimedia for the target object, and store the preprocessed multimedia; wherein, the preprocessing includes at least one of the following methods : Generate an emoticon package according to the acquired multimedia; make the acquired multimedia into a personalized video.

上述方案中，所述预处理模块，具体用于：若所述状态信息符合拍摄条件，则控制所述设备对所述目标对象当前输入的音频信号进行录制；将录制的音频信号和针对所述目标对象的多媒体合成为一个视频。In the above solution, the preprocessing module is specifically used to: if the state information meets the shooting conditions, control the device to record the audio signal currently input by the target object; combine the recorded audio signal with the The multimedia of the target object is synthesized into one video.

上述方案中，所述装置还包括替换模块，用于利用合成的所述视频，替换所述预设的多媒体。In the above solution, the device further includes a replacement module, configured to use the synthesized video to replace the preset multimedia.

上述方案中，所述装置还包括发送模块，用于将针对所述目标对象的多媒体和/或所述视频，发送至指定的终端设备。In the solution above, the apparatus further includes a sending module, configured to send the multimedia and/or the video for the target object to a designated terminal device.

本发明实施例还提供一种存储介质，其上存储有可执行程序，所述可执行程序被处理器执行时实现上述技术方案中的步骤。An embodiment of the present invention also provides a storage medium on which an executable program is stored, and when the executable program is executed by a processor, the steps in the above technical solution are implemented.

本发明实施例还提供一种拍摄装置，包括存储器、处理器及存储在存储器上并能够由所述处理器运行的可执行程序，所述处理器运行所述可执行程序时执行上述技术方案中的步骤。An embodiment of the present invention also provides a photographing device, including a memory, a processor, and an executable program stored on the memory and capable of being run by the processor. When the processor runs the executable program, it executes the above-mentioned technical solutions. A step of.

本发明实施例提供的以上任意一种方案，通过在采集目标对象输入的音频信号的状态下，获取目标对象的状态信息，进而对所述目标对象的状态信息进行判定。当所述目标对象的状态信息符合拍摄条件时，便利用摄像装置拍摄针对所述目标对象的多媒体。可见，整个拍摄过程无需用户通过自己掏出自带的手机或相机等终端设备进行手动拍摄，而是实现了对目标对象(用户)的自动拍摄。本方案相对于现有技术而言效率高，且对于用户来说，由于无需自己掏出设备手动拍摄，从而拍摄过程变得便捷。In any one of the above solutions provided by the embodiments of the present invention, the state information of the target object is acquired in the state of collecting the audio signal input by the target object, and then the state information of the target object is determined. When the state information of the target object meets the shooting conditions, the camera device is used to shoot the multimedia for the target object. It can be seen that the entire shooting process does not require the user to take out the mobile phone or camera and other terminal equipment that comes with it for manual shooting, but realizes the automatic shooting of the target object (user). Compared with the prior art, this solution has high efficiency, and for the user, since there is no need to take out the equipment and shoot manually, the shooting process becomes convenient.

附图说明Description of drawings

图1为本发明实施例提供的拍摄方法的实现流程示意图；FIG. 1 is a schematic diagram of the implementation flow of the shooting method provided by the embodiment of the present invention;

图2为本发明实施例提供的拍摄方法的详细流程示意图；Fig. 2 is a detailed flow chart of the photographing method provided by the embodiment of the present invention;

图3为本发明实施例提供的拍摄装置的组成结构示意图；FIG. 3 is a schematic diagram of the composition and structure of the photographing device provided by the embodiment of the present invention;

图4为本发明实施例提供的拍摄装置的硬件结构示意图。FIG. 4 is a schematic diagram of a hardware structure of a photographing device provided by an embodiment of the present invention.

具体实施方式detailed description

实施例一、Embodiment one,

本发明实施例中，拍摄方法的实现流程示意图如图1所示，包括以下步骤：In the embodiment of the present invention, a schematic diagram of the implementation flow of the shooting method is shown in Figure 1, including the following steps:

步骤101：在采集目标对象输入的音频信号的状态下，获取目标对象的状态信息；Step 101: Obtain the state information of the target object in the state of collecting the audio signal input by the target object;

步骤102：判断所述状态信息是否符合拍摄条件；Step 102: judging whether the state information meets the shooting conditions;

步骤103：若所述状态信息符合拍摄条件，则控制所述摄像装置拍摄针对所述目标对象的多媒体。Step 103: If the state information meets the shooting condition, control the camera device to shoot multimedia for the target object.

这里，所述拍摄方法应用于设置有摄像装置的设备。其中，所述设置有摄像装置的设备，比如可以应用于迷你K歌房。Here, the photographing method is applied to equipment provided with a photographing device. Wherein, the device provided with the camera device, for example, can be applied to a mini karaoke room.

这里，所述目标对象为处于音频输出状态下，由多媒体处理系统控制摄像设备进行多媒体拍摄的用户。例如，在迷你K歌房中演唱的过程中，由K歌系统控制摄像头进行图片或视频拍摄的用户。进一步的，当所述目标对象输出音频信号时，多媒体处理系统将自动获取所述音频信号，此时多媒体处理系统便处于所述采集目标对象输入的音频信号的状态。例如，目标对象在迷你K歌房中开始演唱，K歌系统自动采集所述目标对象的声音信号。所述获取目标对象的状态信息之前，还包括：调整至少一个摄像装置的拍摄角度，使得所述目标对象处于所述至少一个摄像装置的拍摄范围内；或者，若所述目标对象位于所述摄像装置拍摄的预览图像的指定区域外，则提示所述目标对象改变位置。具体的，当所述目标对象处于摄像装置的拍摄范围内，在拍摄角度不佳需要调整拍摄范围时，调整摄像装置的拍摄角度，使得所述目标对象处于至少一个摄像装置的拍摄范围内；当所述目标对象处于所述摄像装置拍摄的预览图像的指定区域外时，则出现提示信息，提示所述目标对象改变位置，使得所述目标对象处于至少一个摄像装置的拍摄范围内。其中，所述调整至少一个摄像装置的拍摄角度，包括：自动调整所述摄像装置的拍摄角度；或者，响应于接收到的拍摄角度调整指令，对所述摄像装置的拍摄角度进行调整。Here, the target object is a user who is in an audio output state and the multimedia processing system controls the camera device to perform multimedia shooting. For example, in the process of singing in the mini karaoke room, the karaoke system controls the camera to take pictures or videos of users. Further, when the target object outputs an audio signal, the multimedia processing system will automatically acquire the audio signal, and at this time the multimedia processing system is in the state of collecting the audio signal input by the target object. For example, the target object starts to sing in the mini karaoke room, and the karaoke system automatically collects the sound signal of the target object. Before the acquisition of the state information of the target object, it also includes: adjusting the shooting angle of at least one camera, so that the target object is within the shooting range of the at least one camera; or, if the target object is located in the camera If the device is outside the designated area of the preview image captured by the device, the target object is prompted to change its position. Specifically, when the target object is within the shooting range of the camera device, and the shooting angle needs to be adjusted when the shooting angle is not good, the shooting angle of the camera device is adjusted so that the target object is within the shooting range of at least one camera device; When the target object is outside the specified area of the preview image captured by the camera, a prompt message appears, prompting the target object to change its position so that the target object is within the shooting range of at least one camera device. Wherein, the adjusting the shooting angle of at least one camera includes: automatically adjusting the shooting angle of the camera; or adjusting the shooting angle of the camera in response to a received shooting angle adjustment instruction.

这里，所述获取目标对象的状态信息，包括获取下述信息中的至少一个：所述目标对象的音调；所述目标对象的音量；所述目标对象的面部表情特征。其中，所述目标对象的音调是多媒体处理系统在获取到所述目标对象的音频信号后，通过语音音调识别的方式获取到所述目标对象的音调；所述目标对象的音量是多媒体处理系统在获取到所述目标对象的音频信号后，通过语音音量识别的方式获取到所述目标对象的音量；所述目标对象的面部表情特征是由多媒体处理系统通过处于工作状态下的摄像设备监测并获取的。Here, the acquiring the status information of the target object includes acquiring at least one of the following information: the tone of the target object; the volume of the target object; and the facial expression features of the target object. Wherein, the tone of the target object is the tone of the target object obtained by the multimedia processing system through voice tone recognition after acquiring the audio signal of the target object; the volume of the target object is the tone of the target object obtained by the multimedia processing system After the audio signal of the target object is obtained, the volume of the target object is obtained through voice volume recognition; the facial expression characteristics of the target object are monitored and obtained by the multimedia processing system through the camera equipment in working condition of.

进一步的，当所述目标对象的状态信息包括所述目标对象的音调时，所述判断所述状态信息是否符合拍摄条件，包括：当所述目标对象在音频信号输出过程中的音调大于音调阈值时，判定符合拍摄条件；Further, when the state information of the target object includes the tone of the target object, the judging whether the state information meets the shooting conditions includes: when the tone of the target object in the audio signal output process is greater than a tone threshold , it is determined that the shooting conditions are met;

此外，所述方法还包括：播放预设的多媒体；若所述状态信息不符合拍摄条件，则当所述预设的多媒体的播放进度达到预设时间点时，控制所述摄像装置拍摄针对目标对象的多媒体。其中，所述预设的多媒体为已经存在的多媒体。可选的，所述预设的多媒体中可以预先设置有至少一个预设时间点。为何要在多媒体中设置预设时间点，后文将进行介绍，此处不再赘述。In addition, the method further includes: playing preset multimedia; if the status information does not meet the shooting conditions, when the playing progress of the preset multimedia reaches a preset time point, controlling the camera to shoot the target Objects of multimedia. Wherein, the preset multimedia is existing multimedia. Optionally, at least one preset time point may be preset in the preset multimedia. The reason for setting the preset time point in the multimedia will be introduced later, and will not be repeated here.

进一步的，所述针对目标对象的多媒体，包括下述之一：图片；视频；具体的，可以在采集目标对象输入的音频信号的状态下，对所述目标对象进行图片拍摄；或者，可以在采集目标对象输入的音频信号的状态下，通过所述摄像装置对所述目标对象进行视频录制。Further, the multimedia for the target object includes one of the following: pictures; videos; specifically, the target object can be photographed in the state of collecting the audio signal input by the target object; or, it can be In the state of collecting the audio signal input by the target object, video recording is performed on the target object through the camera device.

进一步的，所述方法还包括：对针对所述目标对象的多媒体进行预处理，并将经过预处理后的多媒体进行存储；其中，所述预处理至少包括以下方式之一：根据获取的多媒体生成表情包；将获取的多媒体制作成个性化视频。其中，所述表情包为以拍摄的目标对象的图片为素材，配上一系列与所述图片相匹配的文字、符号等，进而制作成用于表达特定情感的一组图片。所述个性化视频为以拍摄的目标对象的图片为素材，通过合成软件将音频、文字及所述图片进行合成，进而制作成满足制作者意愿的视频。进一步的，所述方法还包括：若所述状态信息符合拍摄条件，则控制所述设备对所述目标对象当前输入的音频信号进行录制；将录制的音频信号和针对所述目标对象的多媒体合成为一个视频。进一步的，所述方法还包括：利用合成的所述视频，替换所述预设的多媒体。进一步的，所述方法还包括：将所述针对所述目标对象的多媒体和/或所述视频，发送至指定的终端设备。Further, the method further includes: preprocessing the multimedia for the target object, and storing the preprocessed multimedia; wherein, the preprocessing includes at least one of the following methods: generating Emoticon package; make the acquired multimedia into a personalized video. Wherein, the emoticon package is a set of pictures for expressing specific emotions, which is made by taking pictures of the target object as the material, adding a series of words, symbols, etc. matching the pictures. The personalized video is based on the picture of the target object taken as the material, and the audio, text and the picture are synthesized through the synthesis software, and then produced into a video that meets the producer's wishes. Further, the method further includes: if the state information meets the shooting conditions, controlling the device to record the audio signal currently input by the target object; synthesizing the recorded audio signal with the multimedia for the target object for a video. Further, the method further includes: using the synthesized video to replace the preset multimedia. Further, the method further includes: sending the multimedia for the target object and/or the video to a designated terminal device.

本实施例提供的方案，在采集目标对象输入的音频信号的状态下，获取目标对象的状态信息，然后判别所述目标对象的所述状态信息是否符合拍摄条件，当判定目标对象的状态信息符合拍摄条件时，便控制摄像装置拍摄针对所述目标对象的多媒体；或者，在确定所述目标对象的状态信息不符合拍摄条件的情况下，在预设的多媒体的播放进度达到预设时间点时，则控制摄像装置拍摄针对所述目标对象的多媒体。如此，便实现了对所述目标对象的自动拍摄。In the solution provided by this embodiment, the state information of the target object is obtained under the state of collecting the audio signal input by the target object, and then it is judged whether the state information of the target object meets the shooting conditions. When it is determined that the state information of the target object meets the When the shooting conditions are met, the camera is controlled to shoot the multimedia for the target object; or, when it is determined that the state information of the target object does not meet the shooting conditions, when the preset multimedia playback progress reaches the preset time point , then control the camera device to shoot multimedia for the target object. In this way, the automatic shooting of the target object is realized.

实施例二、Embodiment two,

本发明实施例中，拍摄方法的实现流程如图1所示，包括以下步骤：In the embodiment of the present invention, the implementation process of the shooting method is shown in Figure 1, including the following steps:

在本发明实施例中的步骤101中，目标对象所在的空间中布置至少一个摄像装置，并且通过有线或无线的方式与多媒体处理系统实现通信连接。具体的，摄像装置可以采用无线局域网(Wireless Local Area Networks，WLAN)、蓝牙、近场通信(Near FieldCommunication，NFC)等无线方式连接到多媒体处理系统；或者，可以通过有线网络连接到多媒体处理系统。此外，终端设备通过扫码、输入验证信息等方式登录到多媒体处理系统，实现终端设备与多媒体处理系统的通信连接。其中，所述终端设备包括但不限于手机、平板电脑、智能穿戴设备等。进一步的，所述摄像装置在接收到启动指令后，进行启动；或者，可以在连接到多媒体处理系统后自动启动。具体的，所摄像装置直接接收到启动指令后进行启动；或者，所述多媒体处理系统接收启动所述摄像装置的启动指令，启动所述摄像装置；或者，所述摄像装置在连接到所述多媒体处理系统后便自动启动。此外，所述摄像装置在开启后便设定了一种拍摄模式，可以在需要的时候进行更改。In step 101 in the embodiment of the present invention, at least one camera device is arranged in the space where the target object is located, and communicates with the multimedia processing system in a wired or wireless manner. Specifically, the camera device can be connected to the multimedia processing system in a wireless manner such as Wireless Local Area Networks (WLAN), Bluetooth, or Near Field Communication (NFC); or, it can be connected to the multimedia processing system through a wired network. In addition, the terminal device logs into the multimedia processing system by scanning a code, inputting verification information, etc., so as to realize the communication connection between the terminal device and the multimedia processing system. Wherein, the terminal device includes, but is not limited to, a mobile phone, a tablet computer, a smart wearable device, and the like. Further, the camera starts up after receiving the startup instruction; or, it can start up automatically after being connected to the multimedia processing system. Specifically, the camera device is started after receiving the startup command directly; or, the multimedia processing system receives the startup command to start the camera device, and starts the camera device; or, the camera device is connected to the multimedia It starts automatically after processing the system. In addition, after the camera is turned on, a shooting mode is set, which can be changed when needed.

这里，所述拍摄方法应用于设置有摄像装置的设备。其中，所述设置有摄像装置的设备应用于迷你K歌房。Here, the photographing method is applied to equipment provided with a photographing device. Wherein, the device provided with the camera device is applied to a mini karaoke room.

这里，所述获取目标对象的状态信息，包括获取下述信息中的至少一个：所述目标对象的音调；所述目标对象的音量；所述目标对象的面部表情特征。其中，所述多媒体包括图像、视频等形式。此外，所述获取目标对象的状态信息之前，还包括：调整至少一个摄像装置的拍摄角度，使得所述目标对象处于所述至少一个摄像装置的拍摄范围内；或者，若所述目标对象位于所述摄像装置拍摄的预览图像的指定区域外，则提示所述目标对象改变位置。具体的，当所述目标对象处于摄像装置的拍摄范围内，在拍摄角度不佳需要调整拍摄范围时，调整摄像装置的拍摄角度，使得所述目标对象处于至少一个摄像装置的拍摄范围内；当所述目标对象处于所述摄像装置拍摄的预览图像的指定区域外时，则多媒体处理系统中出现提示信息，提示所述目标对象改变位置，使得所述目标对象处于至少一个摄像装置的拍摄范围内。Here, the acquiring the status information of the target object includes acquiring at least one of the following information: the tone of the target object; the volume of the target object; and the facial expression features of the target object. Wherein, the multimedia includes image, video and other forms. In addition, before the acquisition of the state information of the target object, it also includes: adjusting the shooting angle of at least one camera so that the target object is within the shooting range of the at least one camera device; or, if the target object is located in the If the target object is outside the designated area of the preview image captured by the camera device, the target object is prompted to change its position. Specifically, when the target object is within the shooting range of the camera device, and the shooting angle needs to be adjusted when the shooting angle is not good, the shooting angle of the camera device is adjusted so that the target object is within the shooting range of at least one camera device; When the target object is outside the specified area of the preview image captured by the camera, a prompt message appears in the multimedia processing system, prompting the target object to change its position so that the target object is within the shooting range of at least one camera device .

其中，所述调整至少一个摄像装置的拍摄角度，包括：多媒体处理系统自动调整所述摄像装置的拍摄角度；或者，响应于接收到的拍摄角度调整指令，对所述摄像装置的拍摄角度进行调整。进一步的，调整摄像装置的拍摄角度可以采用至少以下几种方式之一：多媒体处理系统自动调整摄像装置的角度；或者，在接收到调整指令后，在多媒体处理系统的显示屏上的进行拍摄角度的调整；或者，在接收到调整指令后，直接在摄像装置上进行拍摄角度的调整。具体的，所述多媒体处理系统将所述摄像装置拍摄的多媒体发送至终端设备，所述目标对象便可在所述终端设备的屏幕中得到拍摄的预览信息，所述多媒体处理系统便可以根据所述预览信息，自动调整拍摄的角度；或者，所述摄像装置在接收到调整指令后，根据所述终端设备的屏幕中显示的预览信息，直接进行拍摄角度的调整；或者，根据所述终端设备的屏幕中显示的预览信息，多媒体处理系统根据接收到的调整指令对摄像装置进行拍摄角度的调整。Wherein, the adjusting the shooting angle of at least one camera includes: the multimedia processing system automatically adjusts the shooting angle of the camera; or, in response to the received shooting angle adjustment instruction, adjusts the shooting angle of the camera . Further, at least one of the following methods can be adopted to adjust the shooting angle of the camera device: the multimedia processing system automatically adjusts the angle of the camera device; or, after receiving the adjustment instruction, directly adjust the shooting angle on the camera device. Specifically, the multimedia processing system sends the multimedia captured by the camera to the terminal device, and the target object can obtain the captured preview information on the screen of the terminal device, and the multimedia processing system can The above preview information automatically adjusts the shooting angle; or, after receiving the adjustment instruction, the camera device directly adjusts the shooting angle according to the preview information displayed on the screen of the terminal device; or, according to the terminal device The preview information displayed on the screen of the multimedia processing system adjusts the shooting angle of the camera device according to the received adjustment instruction.

此外，所述摄像装置的拍摄角度调整后，所述终端设备可以关闭预览状态；或者，可以保持开启预览状态。In addition, after the shooting angle of the camera device is adjusted, the terminal device may turn off the preview state; or, may keep the preview state on.

在步骤102中，所述多媒体处理系统判断所述状态信息是否符合拍摄条件。具体的，判定所述状态信息符合拍摄条件可以通过以下方式中的至少一个：In step 102, the multimedia processing system judges whether the status information meets the shooting conditions. Specifically, determining that the state information meets the shooting conditions may be done in at least one of the following ways:

当所述目标对象的状态信息包括所述目标对象的音调时，所述多媒体处理系统判断所述状态信息是否符合拍摄条件，包括：当所述目标对象在音频信号输出过程中的音调大于音调阈值时，判定符合拍摄条件；当所述目标对象的状态信息包括所述目标对象的音量时，所述多媒体处理系统判断所述状态信息是否符合拍摄条件，包括：当所述目标对象在音频信号输出过程中的音量大于音量阈值时，判定符合拍摄条件；When the state information of the target object includes the tone of the target object, the multimedia processing system determines whether the state information meets the shooting conditions, including: when the tone of the target object in the audio signal output process is greater than a tone threshold when the shooting condition is met; when the state information of the target object includes the volume of the target object, the multimedia processing system judges whether the state information meets the shooting condition, including: when the target object is in the audio signal output When the volume in the process is greater than the volume threshold, it is determined that the shooting condition is met;

当所述目标对象的状态信息包括所述目标对象的面部表情特征时，所述多媒体处理系统判断所述状态信息是否符合拍摄条件，包括：判断表情库中是否存在与所述目标对象的面部表情特征的相似度大于相似度阈值的预设面部表情特征；若存在，则判定符合拍摄条件；其中，所述表情库，为用于存储预设面部表情特征的数据库；所述预设面部表情特征，为预先设置的、用以表征所述目标对象的状态信息符合拍摄条件的面部表情特征。When the state information of the target object includes the facial expression feature of the target object, the multimedia processing system judges whether the state information meets the shooting conditions, including: judging whether there is a facial expression similar to the target object in the expression database The similarity of the feature is greater than the preset facial expression feature of the similarity threshold; if it exists, it is determined that it meets the shooting conditions; wherein, the expression storehouse is a database for storing preset facial expression features; the preset facial expression feature , is a preset facial expression feature used to represent that the state information of the target object meets the shooting conditions.

当然，也可以通过上述方式中的两种及两种以上来判定所述状态信息是否符合拍摄条件。例如，当所述目标对象的状态信息包括所述目标对象的音调和音量时，所述多媒体处理系统判断所述状态信息是否符合拍摄条件，包括：当所述目标对象在音频信号输出过程中的音量大于音量阈值且所述目标对象在音频信号输出过程中的音调大于音调阈值时，判定符合拍摄条件。Of course, it is also possible to use two or more of the above methods to determine whether the state information meets the shooting conditions. For example, when the state information of the target object includes the tone and volume of the target object, the multimedia processing system judging whether the state information meets the shooting conditions includes: when the target object is in the audio signal output process When the volume is greater than the volume threshold and the tone of the target object during the audio signal output process is greater than the tone threshold, it is determined that the shooting condition is met.

具体的，所述目标对象在音频信号输出过程中输出的语音信号的音调和音量会不断变化，在音调较高的情况下往往会有鲜明的状态信息，比如丰富的面部表情、多样的肢体动作，所以当所述目标对象在音频信号输出过程中的音调大于音调阈值时，则判定符合拍摄条件。其中，所述音调阈值为经验数值，可以将音调中的某一数值设定为所述音调阈值，比如将2000美设定为音调阈值；或者，当所述目标对象在音频信号输出过程中音量发生明显变化，大于音量阈值时，则判定符合拍摄条件。其中，所述音量阈值为经验数值，可以将音量中的某一数值设定为所述音量阈值，比如将40分贝设定为音量阈值；或者，判断表情库中是否存在与所述目标对象的面部表情特征的相似度大于相似度阈值的预设面部表情特征；若存在，则判定符合拍摄条件；其中，所述相似度阈值为经验数值，可以将相似度大于80％的面部表情设定为相似度阈值。所述预设表情可以是忧伤的表情、开心的表情、狰狞的表情等。Specifically, the pitch and volume of the voice signal output by the target object during the audio signal output process will constantly change, and in the case of a high pitch, there will often be clear state information, such as rich facial expressions and various body movements. , so when the pitch of the target object during the audio signal output process is greater than the pitch threshold, it is determined that the shooting condition is met. Wherein, the tone threshold is an empirical value, and a certain value in the tone can be set as the tone threshold, for example, 2000 US is set as the tone threshold; or, when the target object is in the audio signal output process, the volume When there is a significant change and is greater than the volume threshold, it is determined that the shooting condition is met. Wherein, the volume threshold is an empirical value, and a certain value in the volume can be set as the volume threshold, for example, 40 decibels is set as the volume threshold; The similarity of facial expression feature is greater than the preset facial expression feature of similarity threshold; If it exists, it is judged to meet the shooting conditions; Wherein, the similarity threshold is an empirical value, and the facial expression with similarity greater than 80% can be set as similarity threshold. The preset expression may be a sad expression, a happy expression, a ferocious expression, and the like.

此外，所述方法还包括：播放预设的多媒体；若所述状态信息不符合拍摄条件，则当所述预设的多媒体的播放进度达到预设时间点时，控制所述摄像装置拍摄针对目标对象的多媒体。其中，所述预设的多媒体为已经存在的多媒体，并且通过对所述预设的多媒体的历史信息的统计，从而对所有所述预设的多媒体进行了设置，使得所述预设的多媒体包含至少一个预设时间点。其中，所述预设时间点可以为微笑的表情、悲伤的表情等出现的时间点。In addition, the method further includes: playing preset multimedia; if the status information does not meet the shooting conditions, when the playing progress of the preset multimedia reaches a preset time point, controlling the camera to shoot the target Objects of multimedia. Wherein, the preset multimedia is already existing multimedia, and through the statistics of the historical information of the preset multimedia, all the preset multimedia are set, so that the preset multimedia includes At least one preset time point. Wherein, the preset time point may be a time point when a smiling expression, a sad expression, etc. appear.

在步骤103中，若所述状态信息符合拍摄条件，则控制所述摄像装置拍摄针对所述目标对象的多媒体。其中，所述针对目标对象的多媒体，包括下述之一：图片；视频；具体的，可以在采集目标对象输入的音频信号的状态下，对所述目标对象进行图片拍摄；或者，可以在采集目标对象输入的音频信号的状态下，通过所述摄像装置对所述目标对象进行视频录制。In step 103, if the state information meets the shooting condition, the camera is controlled to shoot multimedia for the target object. Wherein, the multimedia for the target object includes one of the following: pictures; video; specifically, the target object can be photographed in the state of collecting the audio signal input by the target object; or, it can be collected In the state of the audio signal input by the target object, video recording is performed on the target object through the camera device.

这里，所述多媒体处理系统控制所述摄像装置拍摄针对所述目标对象的多媒体之后，对针对所述目标对象的多媒体进行预处理，并将经过预处理后的多媒体进行存储；其中，所述预处理至少包括以下方式之一：根据获取的多媒体生成表情包；或者，可以对获取的所述目标对象的照片或视频进行图像处理、音效处理；或者，将获取的多媒体制作成个性化视频。具体的，所述多媒体处理系统控制所述设备对所述目标对象当前输入的音频信号进行录制；将录制的音频信号和针对所述目标对象的多媒体合成为一个视频。进一步的，所述多媒体处理系统利用合成的所述视频，替换所述预设的多媒体。进一步的，所述多媒体处理系统将所述针对所述目标对象的多媒体和/或所述视频，发送至指定的终端设备。Here, after the multimedia processing system controls the camera device to capture the multimedia of the target object, it preprocesses the multimedia of the target object and stores the preprocessed multimedia; wherein, the preprocessing The processing includes at least one of the following methods: generating an emoticon package according to the acquired multimedia; or, performing image processing and sound effect processing on the acquired photo or video of the target object; or making the acquired multimedia into a personalized video. Specifically, the multimedia processing system controls the device to record the audio signal currently input by the target object; and synthesizes the recorded audio signal and the multimedia for the target object into one video. Further, the multimedia processing system replaces the preset multimedia with the synthesized video. Further, the multimedia processing system sends the multimedia for the target object and/or the video to a designated terminal device.

实施例三、Embodiment three,

下面结合实例，以目标对象A在迷你K歌房中的演唱过程中，对目标对象A进行抓拍或录制视频为例，对本发明实施例的拍摄方法作进一步详细的描述。The shooting method of the embodiment of the present invention will be further described in detail below in conjunction with an example, taking target object A's singing process in a mini karaoke room as an example to capture or record a video of the target object A.

本发明实施例中，拍摄方法的详细流程示意图如图2所示，包括以下步骤：In the embodiment of the present invention, the detailed flowchart of the shooting method is shown in Figure 2, including the following steps:

步骤201：调整摄像头的拍摄角度或提示目标对象A改变所处位置；Step 201: Adjust the shooting angle of the camera or prompt the target object A to change its location;

这里，在迷你K歌房中安装至少一个摄像头，可以实现从不同的方位对目标对象A进行拍摄。比如，可以从正面、左侧及右侧对目标对象A进行拍摄。进一步的，所述摄像头通过有线或无线的方式与K歌系统实现通信连接。具体的，摄像头可以采用WLAN、蓝牙、NFC等无线方式连接到K歌系统；或者，可以通过有线网络连接到K歌系统。Here, at least one camera is installed in the mini karaoke room, so that the target object A can be photographed from different directions. For example, the target object A may be photographed from the front, left and right sides. Further, the camera communicates with the karaoke system in a wired or wireless manner. Specifically, the camera can be connected to the karaoke system in a wireless manner such as WLAN, Bluetooth, or NFC; or, it can be connected to the karaoke system through a wired network.

此外，目标对象A的终端设备，如手机，可以通过扫描微信、QQ或微博的二维码的方式登录K歌系统；或者，可以通过输入验证码登录K歌系统，实现终端设备与K歌系统的通信连接。In addition, the terminal device of the target object A, such as a mobile phone, can log in to the karaoke system by scanning the QR code of WeChat, QQ or Weibo; System communication connections.

进一步的，摄像头在接收到启动指令后，进行启动；或者，可以在连接到K歌系统后自动启动。具体的，摄像头直接接收到启动指令后进行启动；或者，K歌系统接收启动指令，启动摄像头；或者，摄像头连接到K歌系统后自动启动。其中，每个摄像头都需要进行独立的开启，可以根据需要开启不同的摄像头。此外，摄像头在开启后便设定了一种拍摄模式，可以在需要的时候进行更改。Further, the camera starts up after receiving the startup command; or, it can start up automatically after being connected to the karaoke system. Specifically, the camera starts up after receiving the startup command directly; or, the karaoke system receives the startup command and starts up the camera; or, the camera starts up automatically after being connected to the karaoke system. Wherein, each camera needs to be turned on independently, and different cameras can be turned on as required. In addition, the camera is set to a shooting mode when it is turned on, which can be changed when needed.

接下来，目标对象A在正式演唱前，所在的位置若不在任何一个摄像头的拍摄范围内，则调整至少一个摄像头的拍摄角度，使得目标对象A处于所述至少一个摄像头的拍摄范围内；或者，若目标对象A位于摄像头拍摄的预览图像的指定区域外，则提示目标对象A改变位置。具体的，当目标对象A处于摄像头的拍摄范围内，在拍摄角度不佳需要调整拍摄范围时，调整摄像头的拍摄角度，使得目标对象A处于至少一个摄像头的拍摄范围内；当目标对象A处于摄像头拍摄的预览图像的指定区域外时，则K歌系统的显示屏中出现提示信息，提示目标对象A改变位置，使得目标对象A处于至少一个摄像头的拍摄范围内。Next, before the official singing, if the position of the target object A is not within the shooting range of any camera, adjust the shooting angle of at least one camera so that the target object A is within the shooting range of the at least one camera; or, If the target object A is located outside the specified area of the preview image captured by the camera, the target object A is prompted to change its position. Specifically, when the target object A is within the shooting range of the camera, and the shooting angle needs to be adjusted when the shooting angle is not good, adjust the shooting angle of the camera so that the target object A is within the shooting range of at least one camera; When the preview image taken is outside the specified area, a prompt message appears on the display screen of the karaoke system, prompting the target object A to change its position so that the target object A is within the shooting range of at least one camera.

其中，所述调整至少一个摄像头的拍摄角度，包括：K歌系统自动调整摄像头的拍摄角度；或者，响应于接收到的拍摄角度调整指令，对摄像头的拍摄角度进行调整。Wherein, said adjusting the shooting angle of at least one camera includes: the karaoke system automatically adjusts the shooting angle of the camera; or, adjusts the shooting angle of the camera in response to the received shooting angle adjustment instruction.

进一步的，调整摄像头的拍摄角度可以采用至少以下几种方式之一：Further, at least one of the following methods may be used to adjust the shooting angle of the camera:

K歌系统将摄像头拍摄的照片发送至目标对象A的手机，目标对象A便可在手机的屏幕中得到拍摄的预览信息，K歌系统便可以获取目标对象A在手机显示屏中预览框的位置，进而自动调整拍摄的角度；或者，摄像头在接收到调整指令后，根据目标对象A在手机显示屏中预览框的位置，直接进行拍摄角度的调整；或者，根据目标对象A在手机显示屏中预览框的位置，K歌系统根据接收到的调整指令对摄像头进行拍摄角度的调整。此外，所述摄像头的拍摄角度调整后，摄像头将保持当前预览拍摄状态，目标对象A的手机可以关闭预览状态；或者，可以保持开启预览状态。The karaoke system sends the photos taken by the camera to the mobile phone of the target object A, and the target object A can get the preview information of the shooting on the screen of the mobile phone, and the karaoke system can obtain the position of the preview frame of the target object A on the mobile phone display , and then automatically adjust the shooting angle; or, after receiving the adjustment command, the camera directly adjusts the shooting angle according to the position of the target object A in the preview frame on the mobile phone display; or, according to the target object A in the mobile phone display The position of the preview frame, the karaoke system adjusts the shooting angle of the camera according to the received adjustment instructions. In addition, after the shooting angle of the camera is adjusted, the camera will maintain the current preview shooting state, and the mobile phone of the target object A can turn off the preview state; or, can keep the preview state on.

步骤202：在采集目标对象A演唱过程中，获取目标对象A的状态信息；Step 202: Obtain the status information of the target object A during the process of collecting the target object A's singing;

这里，在开启至少一个摄像头，并调整好拍摄的角度后，目标对象A开始演唱，K歌系统实时对目标对象A进行监测，在目标对象A的演唱过程中，获取目标对像A的状态信息。Here, after turning on at least one camera and adjusting the shooting angle, the target object A starts to sing, and the karaoke system monitors the target object A in real time, and obtains the status information of the target object A during the singing process of the target object A .

其中，所述获取目标对象A的状态信息，包括获取下述信息中的至少一个：所述目标对象A的音调；所述目标对象A的音量；所述目标对象A的面部表情特征。Wherein, said obtaining the status information of the target object A includes obtaining at least one of the following information: the tone of the target object A; the volume of the target object A; the facial expression features of the target object A.

步骤203：判断所述状态信息是否符合拍摄条件；Step 203: judging whether the state information meets the shooting conditions;

这里，K歌系统判断获取的目标对象A的状态信息是否符合拍摄条件。具体的，判定所述状态信息符合拍摄条件可以通过以下方式中的至少一个：Here, the karaoke system judges whether the acquired state information of the target object A meets the shooting conditions. Specifically, determining that the state information meets the shooting conditions may be done in at least one of the following ways:

当目标对象A的状态信息包括目标对象A的音调时，K歌系统判断所述状态信息是否符合拍摄条件，包括：当目标对象A在演唱过程中的音调大于音调阈值时，判定符合拍摄条件；When the state information of the target object A includes the tone of the target object A, the karaoke system judges whether the state information meets the shooting conditions, including: when the tone of the target object A in the singing process is greater than the tone threshold, it is determined that the shooting conditions are met;

当目标对象A的状态信息包括目标对象A的音量时，K歌系统判断所述状态信息是否符合拍摄条件，包括：当目标对象A在演唱过程中的音量大于音量阈值时，判定符合拍摄条件；When the state information of the target object A includes the volume of the target object A, the karaoke system judges whether the state information meets the shooting conditions, including: when the volume of the target object A in the singing process is greater than the volume threshold, it is determined that the shooting conditions are met;

当目标对象A的状态信息包括目标对象A的面部表情特征时，K歌系统判断所述状态信息是否符合拍摄条件，包括：判断表情库中是否存在与目标对象A的面部表情特征的相似度大于相似度阈值的预设面部表情特征；若存在，则判定符合拍摄条件；其中，所述表情库，为用于存储预设面部表情特征的数据库；所述预设面部表情特征，为预先设置的、用以表征所述目标对象的状态信息符合拍摄条件的面部表情特征。When the state information of the target object A includes the facial expression feature of the target object A, the karaoke system judges whether the state information meets the shooting conditions, including: judging whether there is a similarity with the facial expression feature of the target object A greater than The preset facial expression feature of the similarity threshold; if it exists, it is judged to meet the shooting conditions; wherein, the expression storehouse is a database for storing preset facial expression features; the preset facial expression feature is preset , the facial expression feature used to characterize the state information of the target object conforming to the shooting condition.

当然，也可以通过上述方式中的两种及两种以上来判定所述状态信息是否符合拍摄条件。例如，当目标对象A的状态信息包括所述目标对象的音调和音量时，K歌系统判断所述状态信息是否符合拍摄条件，包括：当目标对象A在演唱过程中的音量大于音量阈值且音调大于音调阈值时，判定符合拍摄条件。Of course, it is also possible to use two or more of the above methods to determine whether the state information meets the shooting conditions. For example, when the state information of the target object A includes the tone and volume of the target object, the karaoke system judges whether the state information meets the shooting conditions, including: when the volume of the target object A during singing is greater than the volume threshold and the tone When it is greater than the tone threshold, it is determined that the shooting condition is met.

具体的，目标对象A在演唱过程中的音调和音量会不断变化，在音调较高的情况下往往会有鲜明的状态信息，比如丰富的面部表情、多样的肢体动作，所以当目标对象A在演唱过程中的音调大于音调阈值时，则判定符合拍摄条件。其中，所述音调阈值为经验数值，可以将音调中的某一数值设定为所述音调阈值，比如将2000美设定为音调阈值；或者，当目标对象A在演唱过程中的音量发生明显变化，大于音量阈值时，则判定符合拍摄条件。其中，所述音量阈值为经验数值，可以将音量中的某一数值设定为所述音量阈值，比如将40分贝设定为音量阈值；或者，判断表情库中是否存在与目标对象A的面部表情特征的相似度大于相似度阈值的预设面部表情特征；若存在，则判定符合拍摄条件；其中，所述相似度阈值为经验数值，可以将相似度大于80％的面部表情设定为相似度阈值。所述预设表情可以是忧伤的表情、开心的表情、狰狞的表情等。Specifically, the pitch and volume of the target object A will change continuously during the singing process. In the case of high pitch, there will often be clear state information, such as rich facial expressions and various body movements. Therefore, when the target object A is in the When the pitch during the singing is greater than the pitch threshold, it is determined that the shooting condition is met. Wherein, the tone threshold is an empirical value, and a certain value in the tone can be set as the tone threshold, for example, 2000 US is set as the tone threshold; When the change is greater than the volume threshold, it is determined that the shooting condition is met. Wherein, the volume threshold is an empirical value, and a certain value in the volume can be set as the volume threshold, for example, 40 decibels is set as the volume threshold; The similarity of the expression feature is greater than the preset facial expression feature of the similarity threshold; if it exists, it is determined to meet the shooting conditions; wherein, the similarity threshold is an empirical value, and the facial expression with a similarity greater than 80% can be set as similar degree threshold. The preset expression may be a sad expression, a happy expression, a ferocious expression, and the like.

此外，K歌系统播放预设的多媒体；若所述状态信息不符合拍摄条件，则当所述预设的多媒体的播放进度达到预设时间点时，控制摄像头拍摄针对目标对象A的多媒体。其中，所述预设的多媒体为K歌系统中存在的多媒体，并且通过对所述预设的多媒体的历史信息的统计，从而对所有所述预设的多媒体进行了设置，使得所述预设的多媒体包含至少一个预设时间点。其中，所述预设时间点可以为微笑的表情、悲伤的表情等出现的时间点。In addition, the karaoke system plays preset multimedia; if the state information does not meet the shooting conditions, when the playback progress of the preset multimedia reaches a preset time point, the camera is controlled to shoot the multimedia for the target object A. Wherein, the preset multimedia is the multimedia existing in the karaoke system, and through the statistics of the historical information of the preset multimedia, all the preset multimedia are set, so that the preset The multimedia contains at least one preset time point. Wherein, the preset time point may be a time point when a smiling expression, a sad expression, etc. appear.

步骤204：若所述状态信息符合拍摄条件，则控制摄像头拍摄针对目标对象A的多媒体。Step 204: If the state information meets the shooting condition, control the camera to shoot the multimedia for the target object A.

若所述状态信息符合拍摄条件，则控制摄像头拍摄针对目标对象A的多媒体。其中，所述针对目标对象A的多媒体，包括下述之一：图片；视频；If the state information meets the shooting conditions, the camera is controlled to shoot multimedia for the target object A. Wherein, the multimedia for the target object A includes one of the following: pictures; videos;

具体的，当摄像头的当前模式为拍摄格式，可以在目标对象A在演唱过程中对目标对象A进行照片的拍摄；或者，当摄像头的当前模式为录制格式，可以在目标对象A在演唱过程中通过摄像头对目标对象A进行视频录制。Specifically, when the current mode of the camera is the shooting format, the target object A can be photographed during the singing process of the target object A; or, when the current mode of the camera is the recording format, the target object A can be taken during the singing process Video recording of the target object A is performed through the camera.

此外，K歌系统控制摄像头拍摄针对目标对象A的多媒体之后，对针对目标对象A的多媒体进行预处理，并将经过预处理后的多媒体进行存储；其中，所述预处理至少包括以下方式之一：根据获取的多媒体生成表情包；或者，可以对获取的所述目标对象的照片或视频进行图像处理、音效处理；或者，将获取的多媒体制作成个性化视频。具体的，K歌系统控制所述设备对目标对象A当前的演唱的歌曲进行录制；将录制的歌曲和针对目标对象A的多媒体合成为一个视频，并且利用合成的所述视频，替换K歌系统中预设的多媒体。此外，K歌系统可以将针对目标对象A的多媒体和/或所述视频，发送至手机，通过手机实现目标对象A的演唱过程的网络直播。In addition, after the karaoke system controls the camera to shoot the multimedia for the target object A, preprocess the multimedia for the target object A, and store the preprocessed multimedia; wherein, the preprocessing includes at least one of the following methods : generating an emoticon package according to the acquired multimedia; or, performing image processing and sound effect processing on the acquired photo or video of the target object; or, making the acquired multimedia into a personalized video. Specifically, the karaoke system controls the device to record the song currently sung by the target object A; synthesizes the recorded song and the multimedia for the target object A into a video, and uses the synthesized video to replace the karaoke system Multimedia preset in . In addition, the karaoke system can send the multimedia and/or the video for the target object A to the mobile phone, and realize the webcast of the target object A's singing process through the mobile phone.

为实现上述拍摄方法，本发明实施例还提供了一种拍摄装置，所述装置的组成结构示意图如图3所示，包括：获取模块31、判别模块32和拍摄模块33；其中，In order to realize the above-mentioned photographing method, an embodiment of the present invention also provides a photographing device. The schematic diagram of the composition and structure of the device is shown in FIG.

所述获取模块31，用于在采集目标对象输入的音频信号的状态下，获取目标对象的状态信息；The acquisition module 31 is used to acquire the state information of the target object under the state of collecting the audio signal input by the target object;

所述判别模块32，用于判断所述状态信息是否符合拍摄条件；The judging module 32 is configured to judge whether the status information meets the shooting conditions;

所述拍摄模块33，用于若所述状态信息符合拍摄条件，则控制所述摄像装置拍摄针对所述目标对象的多媒体。其中，所述针对目标对象的多媒体，包括下述之一：图片；视频；The photographing module 33 is configured to control the photographing device to photograph the multimedia for the target object if the state information meets the photographing conditions. Wherein, the multimedia for the target object includes one of the following: pictures; videos;

这里，所述获取模块，具体用于：获取下述信息中的至少一个：Here, the acquiring module is specifically configured to: acquire at least one of the following information:

所述目标对象的音调；the tone of voice of the target audience;

所述目标对象的音量；the volume of the target object;

这里，所述装置还包括调整模块，用于调整至少一个摄像装置的拍摄角度，使得所述目标对象处于所述至少一个摄像装置的拍摄范围内；或者，若所述目标对象位于所述摄像装置拍摄的预览图像的指定区域外，则提示所述目标对象改变位置。具体的，当所述目标对象处于摄像装置的拍摄范围内，在拍摄角度不佳需要调整拍摄范围时，调整摄像装置的拍摄角度，使得所述目标对象处于至少一个摄像装置的拍摄范围内；当所述目标对象处于所述摄像装置拍摄的预览图像的指定区域外时，则出现提示信息，提示所述目标对象改变位置，以便所述目标对象进入至少一个摄像装置的拍摄范围，进而通过调整使得所述目标对象处于至少一个摄像装置的拍摄范围内。进一步的，所述调整模块，具体用于：自动调整所述摄像装置的拍摄角度；或者，响应于接收到的拍摄角度调整指令，对所述摄像装置的拍摄角度进行调整。Here, the device further includes an adjustment module, configured to adjust the shooting angle of at least one camera device, so that the target object is within the shooting range of the at least one camera device; or, if the target object is located in the camera device If the captured preview image is outside the specified area, the target object is prompted to change its position. Specifically, when the target object is within the shooting range of the camera device, and the shooting angle needs to be adjusted when the shooting angle is not good, the shooting angle of the camera device is adjusted so that the target object is within the shooting range of at least one camera device; When the target object is outside the specified area of the preview image taken by the camera, a prompt message appears, prompting the target object to change its position so that the target object enters the shooting range of at least one camera device, and then adjusted so that The target object is within the shooting range of at least one camera device. Further, the adjustment module is specifically configured to: automatically adjust the shooting angle of the camera; or adjust the shooting angle of the camera in response to a received shooting angle adjustment instruction.

进一步的，所述判别模块，具体用于：当所述获取模块获取的所述目标对象的状态信息包括所述目标对象的音调，且所述目标对象在音频信号输出过程中的音调大于音调阈值时，判定符合拍摄条件；Further, the judging module is specifically configured to: when the state information of the target object acquired by the acquiring module includes the tone of the target object, and the tone of the target object during the audio signal output process is greater than a tone threshold , it is determined that the shooting conditions are met;

进一步的，所述装置还包括预设模块，具体用于：播放预设的多媒体；若所述状态信息不符合拍摄条件，则当所述预设的多媒体的播放进度达到预设时间点时，控制所述摄像装置拍摄针对所述目标对象的多媒体。其中，所述预设的多媒体为已经存在的、预先设置有至少一个预设时间点的多媒体。Further, the device further includes a preset module, specifically configured to: play preset multimedia; if the state information does not meet the shooting conditions, when the playback progress of the preset multimedia reaches a preset time point, The camera is controlled to capture multimedia for the target object. Wherein, the preset multimedia is existing multimedia with at least one preset time point preset.

具体的，所述目标对象在音频信号输出过程中输出的语音信号的音调和音量会不断变化，在音调较高的情况下往往会有鲜明的状态信息，比如丰富的面部表情、多样的肢体动作，所以当所述获取模块获取的所述目标对象的状态信息包括所述目标对象的音调，且所述目标对象在音频信号输出过程中的音调大于音调阈值时，判定符合拍摄条件。其中，所述音调阈值为经验数值，可以将音调中的某一数值设定为所述音调阈值，比如将2000美设定为音调阈值；或者，当所述获取模块获取的所述目标对象的状态信息包括所述目标对象的音量，且所述目标对象在音频信号输出过程中的音量大于音量阈值时，判定符合拍摄条件。其中，所述音量阈值为经验数值，可以将音量中的某一数值设定为所述音量阈值，比如将40分贝设定为音量阈值；或者，当所述获取模块获取的所述目标对象的状态信息包括所述目标对象的面部表情特征，且判断表情库中是否存在与所述目标对象的面部表情特征的相似度大于相似度阈值的预设面部表情特征；其中，所述相似度阈值为经验数值，可以将相似度大于80％的面部表情设定为相似度阈值。所述预设表情可以是忧伤的表情、开心的表情、狰狞的表情等。Specifically, the pitch and volume of the voice signal output by the target object during the audio signal output process will constantly change, and in the case of a high pitch, there will often be clear state information, such as rich facial expressions and various body movements. , so when the state information of the target object acquired by the acquisition module includes the tone of the target object, and the tone of the target object during the audio signal output process is greater than the tone threshold, it is determined that the shooting condition is met. Wherein, the tone threshold is an empirical value, and a certain value in the tone can be set as the tone threshold, for example, 2000 US is set as the tone threshold; or, when the target object acquired by the acquisition module The status information includes the volume of the target object, and when the volume of the target object during the audio signal output process is greater than a volume threshold, it is determined that the shooting condition is met. Wherein, the volume threshold is an empirical value, and a certain value in the volume can be set as the volume threshold, for example, 40 decibels is set as the volume threshold; or, when the target object acquired by the acquisition module The state information includes the facial expression feature of the target object, and it is judged whether there is a preset facial expression feature whose similarity with the facial expression feature of the target object is greater than a similarity threshold in the expression storehouse; wherein, the similarity threshold is Empirical values, facial expressions with a similarity greater than 80% can be set as the similarity threshold. The preset expression may be a sad expression, a happy expression, a ferocious expression, and the like.

进一步的，所述装置还包括预处理模块，用于对针对所述目标对象的多媒体进行预处理，并将经过预处理后的多媒体进行存储；其中，所述预处理至少包括以下方式之一：根据获取的多媒体生成表情包；将获取的多媒体制作成个性化视频。Further, the device further includes a preprocessing module, configured to preprocess the multimedia for the target object, and store the preprocessed multimedia; wherein, the preprocessing includes at least one of the following methods: Generate emoticons according to the acquired multimedia; make the acquired multimedia into a personalized video.

进一步的，所述预处理模块，具体用于：若所述状态信息符合拍摄条件，则控制所述设备对所述目标对象当前输入的音频信号进行录制；将录制的音频信号和针对所述目标对象的多媒体合成为一个视频。Further, the preprocessing module is specifically configured to: if the state information meets the shooting conditions, control the device to record the audio signal currently input by the target object; The object's multimedia is composited into one video.

进一步的，所述装置还包括替换模块，用于利用合成的所述视频，替换所述预设的多媒体。Further, the device further includes a replacement module, configured to use the synthesized video to replace the preset multimedia.

进一步的，所述装置还包括发送模块，用于将针对所述目标对象的多媒体和/或所述视频，发送至指定的终端设备。Further, the apparatus further includes a sending module, configured to send the multimedia and/or the video for the target object to a designated terminal device.

在实际应用中，所述获取模块31、判别模块32和拍摄模块33、调整模块、预设模块、预处理模块、替换模块及发送模块均可由位于多媒体处理系统中的中央处理器(CPU，Central Processing Unit)、微处理器(MPU，Micro Processor Unit)、数字信号处理器(DSP，Digital Signal Processor)、或现场可编程门阵列(FPGA，Field ProgrammableGate Array)等实现。In practical applications, the acquisition module 31, the discrimination module 32, the photographing module 33, the adjustment module, the preset module, the preprocessing module, the replacement module and the sending module can be controlled by a central processing unit (CPU, Central) located in the multimedia processing system. Processing Unit), microprocessor (MPU, Micro Processor Unit), digital signal processor (DSP, Digital Signal Processor), or Field Programmable Gate Array (FPGA, Field Programmable Gate Array).

需要说明的是：上述实施例提供的拍摄装置在进行拍摄时，仅以上述各程序模块的划分进行举例说明，实际应用中，可以根据需要而将上述处理分配由不同的程序模块完成，即将装置的内部结构划分成不同的程序模块，以完成以上描述的全部或者部分处理。另外，上述实施例提供的拍摄装置与拍摄方法实施例属于同一构思，其具体实现过程详见方法实施例，这里不再赘述。It should be noted that: when the photographing device provided in the above-mentioned embodiment performs photographing, the division of the above-mentioned program modules is used as an example for illustration. The internal structure of the program is divided into different program modules to complete all or part of the processing described above. In addition, the photographing device provided in the above embodiments and the photographing method embodiments belong to the same idea, and the specific implementation process thereof is detailed in the method embodiments, and will not be repeated here.

为实现上述方法，本发明实施例还提供了另一种拍摄装置，该装置包括存储器、处理器及存储在存储器上并能够由所述处理器运行的可执行程序，所述处理器运行所述可执行程序时，执行以下操作：In order to realize the above method, an embodiment of the present invention also provides another shooting device, which includes a memory, a processor, and an executable program stored on the memory and capable of being run by the processor, and the processor runs the When the program is executable, do the following:

所述处理器还用于运行所述可执行程序时，执行以下操作：The processor is also configured to perform the following operations when running the executable program:

所述获取目标对象的状态信息，包括获取下述信息中的至少一个：The acquiring state information of the target object includes acquiring at least one of the following information:

所述目标对象的音调；the tone of voice of the target audience;

所述目标对象的音量；the volume of the target object;

当所述目标对象的状态信息包括所述目标对象的音调时，所述判断所述状态信息是否符合拍摄条件，包括：当所述目标对象在音频信号输出过程中的音调大于音调阈值时，判定符合拍摄条件；When the state information of the target object includes the tone of the target object, the judging whether the state information meets the shooting conditions includes: when the tone of the target object in the audio signal output process is greater than a tone threshold, judging Meet the shooting conditions;

播放预设的多媒体；Play preset multimedia;

所述设置有摄像装置的设备应用于迷你K歌房。The device provided with the camera device is applied to a mini karaoke room.

所述针对目标对象的多媒体，包括下述之一：图片；视频；The multimedia for the target object includes one of the following: pictures; videos;

所述方法还包括：The method also includes:

下面以拍摄装置实施为用于拍摄的服务器或终端为例，对该拍摄装置的硬件结构做进一步说明。The hardware structure of the photographing device will be further described below by taking the photographing device implemented as a server or terminal for photographing as an example.

图4给出了本发明实施例的拍摄装置的硬件结构示意图，图4所示的拍摄装置400包括：至少一个处理器401、存储器402、用户接口403和至少一个网络接口404。所述拍摄装置400中的各个组件通过总线系统405耦合在一起。可理解，总线系统405用于实现这些组件之间的连接通信。总线系统405除包括数据总线之外，还包括电源总线、控制总线和状态信号总线。但是为了清楚说明起见，在图4中将各种总线都标为总线系统405。FIG. 4 shows a schematic diagram of a hardware structure of a camera according to an embodiment of the present invention. The camera 400 shown in FIG. 4 includes: at least one processor 401 , a memory 402 , a user interface 403 and at least one network interface 404 . Various components in the camera 400 are coupled together through a bus system 405 . It can be understood that the bus system 405 is used to realize connection and communication between these components. In addition to the data bus, the bus system 405 also includes a power bus, a control bus and a status signal bus. However, for clarity of illustration, the various buses are labeled as bus system 405 in FIG. 4 .

其中，用户接口403可以包括显示器、键盘、鼠标、轨迹球、点击轮、按键、按钮、触感板或者触摸屏等。Wherein, the user interface 403 may include a display, a keyboard, a mouse, a trackball, a click wheel, keys, buttons, a touch panel or a touch screen, and the like.

可以理解，存储器402可以是易失性存储器或非易失性存储器，也可包括易失性和非易失性存储器两者。It can be understood that the memory 402 may be a volatile memory or a non-volatile memory, and may also include both volatile and non-volatile memories.

本发明实施例中的存储器402用于存储各种类型的数据以支持拍摄装置400的操作。这些数据的示例包括：用于在拍摄装置400上操作的任何计算机程序，如可执行程序4021，实现本发明实施例方法的程序可以包含在可执行程序4021中。The memory 402 in the embodiment of the present invention is used to store various types of data to support the operation of the camera 400 . Examples of these data include: any computer program for operating on the photographing device 400 , such as an executable program 4021 , and the program for implementing the method of the embodiment of the present invention may be included in the executable program 4021 .

上述本发明实施例揭示的方法可以应用于处理器401中，或者由处理器401实现。处理器401可能是一种集成电路芯片，具有信号的处理能力。在实现过程中，上述方法的各步骤可以通过处理器401中的硬件的集成逻辑电路或者软件形式的指令完成。上述的处理器401可以是通用处理器、DSP，或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。处理器401可以实现或者执行本发明实施例中的公开的各方法、步骤及逻辑框图。通用处理器可以是微处理器或者任何常规的处理器等。结合本发明实施例所公开的方法的步骤，可以直接体现为硬件译码处理器执行完成，或者用译码处理器中的硬件及软件模块组合执行完成。软件模块可以位于存储介质中，该存储介质位于存储器402，处理器401读取存储器402中的信息，结合其硬件完成前述方法的步骤。The methods disclosed in the foregoing embodiments of the present invention may be applied to the processor 401 or implemented by the processor 401 . The processor 401 may be an integrated circuit chip and has signal processing capabilities. In the implementation process, each step of the above method may be completed by an integrated logic circuit of hardware in the processor 401 or instructions in the form of software. The aforementioned processor 401 may be a general-purpose processor, DSP, or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, and the like. The processor 401 may implement or execute various methods, steps, and logic block diagrams disclosed in the embodiments of the present invention. A general purpose processor may be a microprocessor or any conventional processor or the like. The steps of the methods disclosed in the embodiments of the present invention may be directly implemented by a hardware decoding processor, or implemented by a combination of hardware and software modules in the decoding processor. The software module may be located in a storage medium, and the storage medium is located in the memory 402. The processor 401 reads the information in the memory 402, and completes the steps of the foregoing method in combination with its hardware.

在示例性实施例中，本发明实施例还提供了一种存储介质，其上存储有可执行程序，所述可执行程序被拍摄装置400的处理器401运行时，执行以下操作：In an exemplary embodiment, the embodiment of the present invention also provides a storage medium on which an executable program is stored, and when the executable program is run by the processor 401 of the photographing device 400, the following operations are performed:

所述可执行程序被拍摄装置400的处理器401运行时，还执行以下操作：When the executable program is run by the processor 401 of the photographing device 400, the following operations are also performed:

所述目标对象的音调；the tone of voice of the target audience;

所述目标对象的音量；the volume of the target object;

播放预设的多媒体；Play preset multimedia;

所述方法还包括：The method also includes:

本领域内的技术人员应明白，本发明的实施例可提供为方法、系统、或可执行程序产品。因此，本发明可采用硬件实施例、软件实施例、或结合软件和硬件方面的实施例的形式。而且，本发明可采用在一个或多个其中包含有计算机可用程序代码的计算机可用存储介质(包括但不限于磁盘存储器和光学存储器等)上实施的可执行程序产品的形式。Those skilled in the art should understand that the embodiments of the present invention may be provided as methods, systems, or executable program products. Accordingly, the present invention can take the form of a hardware embodiment, a software embodiment, or an embodiment combining software and hardware aspects. Furthermore, the present invention may take the form of an executable program product embodied on one or more computer-usable storage media (including but not limited to magnetic disk storage, optical storage, etc.) having computer-usable program code embodied therein.

本发明是参照根据本发明实施例的方法、设备(系统)、和可执行程序产品的流程图和/或方框图来描述的。应理解可由可执行程序指令实现流程图和/或方框图中的每一流程和/或方框、以及流程图和/或方框图中的流程和/或方框的结合。可提供这些可执行程序指令到通用计算机、专用计算机、嵌入式处理机或参考可编程数据处理设备的处理器以产生一个机器，使得通过计算机或参考可编程数据处理设备的处理器执行的指令产生用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的装置。The present invention is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and executable program products according to embodiments of the invention. It should be understood that each procedure and/or block in the flowchart and/or block diagram, and a combination of procedures and/or blocks in the flowchart and/or block diagram can be realized by executable program instructions. These executable program instructions can be provided to a general purpose computer, special purpose computer, embedded processor or processor of a reference programmable data processing device to produce a machine such that the instructions executed by the computer or a processor of a reference programmable data processing device produce Means for realizing the functions specified in one or more procedures of the flowchart and/or one or more blocks of the block diagram.

这些可执行程序指令也可存储在能引导计算机或参考可编程数据处理设备以特定方式工作的计算机可读存储器中，使得存储在该计算机可读存储器中的指令产生包括指令装置的制造品，该指令装置实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能。These executable program instructions may also be stored in a computer-readable memory capable of directing a computer or reference programmable data processing apparatus to operate in a specific manner, such that the instructions stored in the computer-readable memory produce an article of manufacture comprising instruction means, the The instruction means implements the functions specified in one or more procedures of the flowchart and/or one or more blocks of the block diagram.

这些可执行程序指令也可装载到计算机或参考可编程数据处理设备上，使得在计算机或参考可编程设备上执行一系列操作步骤以产生计算机实现的处理，从而在计算机或参考可编程设备上执行的指令提供用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的步骤。These executable program instructions can also be loaded onto a computer or reference programmable data processing device, causing a series of operational steps to be performed on the computer or reference programmable device to produce a computer-implemented process for execution on the computer or reference programmable device The instructions provide steps for implementing the functions specified in the procedure or procedures of the flowchart and/or the block or blocks of the block diagram.

以上所述，仅为本发明的较佳实施例而已，并非用于限定本发明的保护范围。The above descriptions are only preferred embodiments of the present invention, and are not intended to limit the protection scope of the present invention.

Claims

1. A shooting method, characterized in that, the method is applied to equipment provided with a camera, and the method comprises:

In the state of collecting the audio signal input by the target object, obtain the state information of the target object;

judging whether the state information meets the shooting conditions;

If the state information meets the shooting condition, the camera is controlled to shoot the multimedia for the target object.

2. The method according to claim 1, further comprising:

adjusting the shooting angle of at least one camera so that the target object is within the shooting range of the at least one camera; or, if the target object is located outside the specified area of the preview image captured by the camera, prompting the The target object changes position.

3. The method according to claim 2, wherein said adjusting the shooting angle of at least one camera device comprises:

automatically adjust the shooting angle of the camera; or,

In response to the received shooting angle adjustment instruction, the shooting angle of the camera device is adjusted.

4. The method according to claim 1, wherein said obtaining the status information of the target object comprises obtaining at least one of the following information:

the tone of voice of the target audience;

the volume of the target object;

The facial expression characteristics of the target object.

5. The method of claim 4, wherein,

When the state information of the target object includes the tone of the target object, the judging whether the state information meets the shooting conditions includes: when the tone of the target object in the audio signal output process is greater than a tone threshold, judging Meet the shooting conditions;

When the state information of the target object includes the volume of the target object, the judging whether the state information meets the shooting conditions includes: when the volume of the target object in the audio signal output process is greater than a volume threshold, judging Meet the shooting conditions;

When the state information of the target object includes the facial expression features of the target object, the judging whether the state information meets the shooting conditions includes: judging whether there is similarity with the facial expression features of the target object in the expression database Degree is greater than the preset facial expression feature of the similarity threshold; if it exists, it is determined that it meets the shooting conditions; wherein, the expression library is a database for storing preset facial expression features; the preset facial expression feature is a preset facial expression feature The state information set to characterize the target object conforms to the facial expression feature of the shooting condition.

6. according to the described method of claim 5, it is characterized in that, described method also comprises:

Play preset multimedia;

If the state information does not meet the shooting condition, when the preset playing progress of the multimedia reaches a preset time point, the camera is controlled to shoot the multimedia targeted at the target object.

7. The method according to claim 6, wherein the multimedia for the target object comprises one of the following:

picture; video;

The method also includes:

Preprocessing the multimedia for the target object, and storing the preprocessed multimedia;

Wherein, the preprocessing includes at least one of the following methods: generating an emoticon package according to the acquired multimedia; making the acquired multimedia into a personalized video.

8. The method according to claim 7, wherein the method further comprises:

If the state information meets the shooting conditions, then controlling the device to record the audio signal currently input by the target object;

Combining the recorded audio signal and the multimedia for the target object into one video.

9. The method according to claim 8, wherein the method further comprises:

The synthesized video is used to replace the preset multimedia.

10. The method according to claim 9, further comprising:

Send the multimedia and/or the video for the target object to a designated terminal device.

11. The method according to any one of claims 1-10, characterized in that, the device provided with the camera device is applied to a mini karaoke room.

12. A photographing device, characterized in that the device comprises: an acquisition module, a discrimination module, and a photographing module; wherein,

The acquisition module is used to acquire the state information of the target object in the state of collecting the audio signal input by the target object;

The judging module is used to judge whether the state information meets the shooting conditions;

The photographing module is configured to control the photographing device to photograph the multimedia for the target object if the state information meets the photographing conditions.

13. The device according to claim 12, further comprising an adjustment module, configured to adjust the shooting angle of at least one camera device, so that the target object is within the shooting range of the at least one camera device or, if the target object is located outside the specified area of the preview image captured by the camera device, prompting the target object to change its location.

14. The device according to claim 13, wherein the adjustment module is specifically used for:

automatically adjust the shooting angle of the camera; or,

15. The device according to claim 12, wherein the acquiring module is specifically configured to: acquire at least one of the following information:

the tone of voice of the target audience;

the volume of the target object;

The facial expression characteristics of the target object.

16. The apparatus of claim 15, wherein:

The judging module is specifically configured to: when the state information of the target object acquired by the acquisition module includes the tone of the target object, and the tone of the target object during the audio signal output process is greater than a tone threshold, determine Meet the shooting conditions;

It is also specifically used for: when the state information of the target object acquired by the acquisition module includes the volume of the target object, and the volume of the target object during the audio signal output process is greater than a volume threshold, determine that the shooting condition is met;

It is also specifically used for: when the state information of the target object acquired by the acquisition module includes the facial expression characteristics of the target object, and it is judged whether there is a similarity degree to the facial expression characteristics of the target object in the expression database The preset facial expression feature of the degree threshold; if it exists, it is determined that it meets the shooting conditions; wherein, the expression storehouse is a database for storing preset facial expression features; the preset facial expression feature is a preset, The state information used to characterize the target object meets the facial expression features of the shooting conditions.

17. The device according to claim 16, characterized in that the device further comprises a preset module, specifically for:

Play preset multimedia;

If the state information does not meet the shooting condition, when the preset playing progress of the multimedia reaches a preset time point, the camera is controlled to shoot the multimedia for the target object.

18. The device according to claim 12, further comprising a preprocessing module configured to preprocess the multimedia for the target object and store the preprocessed multimedia; wherein, The preprocessing includes at least one of the following methods: generating an emoticon package according to the acquired multimedia; making the acquired multimedia into a personalized video.

19. The device according to claim 19, wherein the preprocessing module is specifically configured to: if the state information meets the shooting conditions, control the device to record the audio signal currently input by the target object ; Synthesizing the recorded audio signal and the multimedia for the target object into one video.

20. The device according to claim 19, further comprising a replacement module, configured to use the synthesized video to replace the preset multimedia.

21. The device according to claim 20, further comprising a sending module, configured to send the multimedia and/or the video for the target object to a designated terminal device.

22. A storage medium, on which an executable program is stored, wherein when the executable program is executed by a processor, the steps of the method according to any one of claims 1 to 11 are implemented.

23. A photographing device, comprising a memory, a processor, and an executable program stored on the memory and capable of being run by the processor, characterized in that, when the processor runs the executable program, it executes claims 1 to 10. 11. The steps of any one of the methods.