JP2011250439A

JP2011250439A - Imaging apparatus, recording method, and program

Info

Publication number: JP2011250439A
Application number: JP2011147894A
Authority: JP
Inventors: Yuji Kuriyama; 祐司栗山
Original assignee: Casio Computer Co Ltd
Current assignee: Casio Computer Co Ltd
Priority date: 2011-07-04
Filing date: 2011-07-04
Publication date: 2011-12-08

Abstract

PROBLEM TO BE SOLVED: To realize an imaging apparatus, and a program therefor, that improve the reproduction accuracy of sound recorded while other sound is being played back.
SOLUTION: A CPU 10 includes a subtraction processing unit 31. When moving-image shooting with sound is performed while sound is being played back, the sound data being played back is emitted from a built-in speaker 26, that sound data is subtracted from the sound data collected by a built-in microphone 21, and the sound data after subtraction is recorded in a flash memory 13. As a result, even when sound is played back during shooting and recording, the subject's voice in the recording is not drowned out by the played-back sound, and the reproduction accuracy of sound recorded during playback is improved.
SELECTED DRAWING: Figure 2

Description

The present invention relates to an imaging apparatus and a program therefor, and more particularly to an imaging apparatus having a sound recording function and a program therefor.

In recent years, imaging apparatuses such as electronic cameras have appeared that have a function of photographing a subject while recording sound (shooting with sound) (Patent Document 1) and a function of playing back sound data such as music (Patent Document 2).

Japanese Patent Laid-Open No. 2007-036427
Japanese Patent Laid-Open No. 2002-176578

With the conventional technology, however, when a subject is shot with sound while music or other sound is being played back, the subject's voice is drowned out by the played-back sound and cannot be recorded well, so the recording is hard to listen to when it is later played back.

The present invention has been made in view of this problem, and its object is to provide an imaging apparatus, and a program therefor, that improve the reproduction accuracy of sound recorded while other sound is being played back.

To achieve the above object, the imaging apparatus according to claim 1 comprises:
a speaker that emits sound;
sound emission control means for causing the speaker to emit sound data;
a microphone that collects sound;
recording control means for recording the sound data collected by the microphone in recording means; and
subtracting means for subtracting, while the sound data collected by the microphone is being recorded in the recording means by the recording control means, the sound data emitted by the sound emission control means from the sound data collected by the microphone.

Further, for example, as described in claim 2, the subtracting means may perform the subtraction before the sound data collected by the microphone is recorded in the recording means, and the recording control means may record the sound data after subtraction by the subtracting means in the recording means.

Further, for example, as described in claim 3, the subtracting means may perform the subtraction after the sound data collected by the microphone has been recorded in the recording means.

Further, for example, as described in claim 4, the apparatus may comprise reproducing means for playing back the sound data recorded by the recording control means; the subtracting means may perform the subtraction when the sound data recorded in the recording means is played back, and the reproducing means may play back the sound data after subtraction by the subtracting means.

Further, for example, as described in claim 5, the apparatus may comprise generating means for generating sound data in antiphase with the sound data to be emitted from the speaker; the speaker may include a first speaker facing the subject and a second speaker facing the photographer; and the sound emission control means may cause the first speaker to emit the sound data to be emitted and the second speaker to emit the antiphase sound data generated by the generating means.
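As a minimal sketch of what the generating means of claim 5 might compute, antiphase data can be obtained by inverting the sign of every sample. The function name `antiphase` and the use of floating-point samples are assumptions made for illustration; the text does not specify the sample format or how the inversion is implemented.

```python
def antiphase(samples):
    """Return a phase-inverted copy of a block of audio samples.

    Assumption: samples are floats; the source does not state the format.
    """
    return [-s for s in samples]


front = [0.0, 0.5, -0.25, 1.0]   # data for the subject-side (first) speaker
rear = antiphase(front)          # data for the photographer-side (second) speaker

# In the idealized case where both signals arrive with equal level and delay,
# they cancel exactly. Real acoustics would only approximate this.
residual = [f + r for f, r in zip(front, rear)]
```

With fixed-width integer samples the most negative value has no positive counterpart, so a real implementation would need to clamp or saturate.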

Further, for example, as described in claim 6, the apparatus may comprise voice recognition means for performing voice recognition on the sound data collected by the microphone, and the subtracting means may, while the sound data collected by the microphone is being recorded in the recording means by the recording control means, subtract ambient sound other than voice from the sound data collected by the microphone on the basis of the recognition result of the voice recognition means.

Further, for example, as described in claim 7, the subtracting means may subtract from the sound data collected by the microphone, among the ambient sounds other than voice, the sound data other than that emitted by the sound emission control means, and may also subtract the sound data emitted by the sound emission control means from the sound data collected by the microphone.

Further, for example, as described in claim 8, the apparatus may comprise voice recognition means for performing voice recognition on the sound data collected by the microphone, and the subtracting means may, while the sound data collected by the microphone is being recorded in the recording means by the recording control means, subtract voice data having a high sound level from the sound data collected by the microphone on the basis of the recognition result of the voice recognition means.

To achieve the above object, the program according to claim 9 is a program to be executed by an imaging apparatus comprising imaging means for imaging a subject, a speaker that emits sound, and a microphone that collects sound, the program including:
a sound emission control process of causing the speaker to emit sound data;
a recording control process of recording, on a recording medium, image data captured by the imaging means in association with sound data collected by the microphone; and
a subtraction process of subtracting, while the sound data collected by the microphone is being recorded on the recording medium by the recording control process, the sound data emitted by the sound emission control process from the sound data collected by the microphone.

According to the present invention, the reproduction accuracy of sound recorded while other sound is being played back can be improved.

FIG. 1 is a block diagram of a digital camera according to an embodiment of the present invention.
FIG. 2 is a block diagram for explaining the functions of the audio processing unit and the CPU 10 of the present invention.
FIG. 3 is a block diagram for explaining the functions of the audio processing unit and the CPU 10 of the present invention.
FIG. 4 is a flowchart showing the operation when the played-back sound data is subtracted at recording time.
FIG. 5 is a flowchart showing the operation of moving-image shooting with sound when subtraction is performed at playback time.
FIG. 6 is a flowchart showing the operation of moving-image playback with sound when subtraction is performed at playback time.
FIG. 7 is a diagram for explaining the functions of the audio processing unit 15 and the CPU 10 in the second embodiment.
FIG. 8 is a flowchart showing the operation of the digital camera 1 of the second embodiment.
FIG. 9 is a diagram for explaining the functions of the audio processing unit 15 and the CPU 10 in the third embodiment.
FIG. 10 is a flowchart showing the operation of the digital camera 1 of the third embodiment.

Hereinafter, an embodiment in which the imaging apparatus of the present invention is applied to a digital camera will be described in detail with reference to the drawings.
[First embodiment]
A. Configuration of the digital camera
FIG. 1 is a block diagram showing the schematic electrical configuration of a digital camera 1 that implements the imaging apparatus of the present invention.
The digital camera 1 includes a photographing lens 2, a lens driving block 3, a diaphragm 4, a CCD 5, a driver 6, a TG (timing generator) 7, a unit circuit 8, a memory 9, a CPU 10, a DRAM 11, an image display unit 12, a flash memory 13, a key input unit 14, an audio processing unit 15, and a bus 16.

The photographing lens 2 includes a focus lens, a zoom lens, and the like, each composed of a plurality of lens groups (not shown). The lens driving block 3 is connected to the photographing lens 2. The lens driving block 3 consists of a focus motor and a zoom motor that drive the focus lens and the zoom lens, respectively, along the optical axis, and a focus motor driver and a zoom motor driver that drive the focus motor and the zoom motor in accordance with control signals sent from the CPU 10 (not shown).

The diaphragm 4 includes a drive circuit (not shown), which operates the diaphragm 4 in accordance with control signals sent from the CPU 10.
The diaphragm 4 is a mechanism that controls the amount of light entering through the photographing lens 2.

The CCD 5 is driven by the driver 6; at fixed intervals it photoelectrically converts the light intensity of each RGB color of the subject image and outputs the result to the unit circuit 8 as an imaging signal. The operation timing of the driver 6 and the unit circuit 8 is controlled by the CPU 10 via the TG 7. The CCD 5 has a Bayer-array color filter and also functions as an electronic shutter. The shutter speed of this electronic shutter is controlled by the CPU 10 via the driver 6 and the TG 7.

The TG 7 is connected to the unit circuit 8. The unit circuit 8 consists of a CDS (Correlated Double Sampling) circuit that samples and holds the imaging signal output from the CCD 5 by correlated double sampling, an AGC (Automatic Gain Control) circuit that performs automatic gain adjustment on the sampled imaging signal, and an A/D converter that converts the gain-adjusted analog imaging signal into a digital signal. The imaging signal output from the CCD 5 passes through the unit circuit 8 and is sent to the CPU 10 as a digital signal.

The CPU 10 is a one-chip microcomputer that stores the image data sent from the unit circuit 8 and the sound data collected by the built-in microphone of the audio processing unit 15 in a buffer memory (DRAM 11), performs image processing on the stored image data such as gamma correction, interpolation, white balance processing, and generation of luminance and color-difference signals (YUV data), records image data and sound data in the flash memory 13, and controls each part of the digital camera 1.
In particular, the CPU 10 has a subtraction unit that subtracts the sound data played back during recording from the recorded sound data.

The memory 9 stores the control program and data that the CPU 10 needs in order to control each part, and the CPU 10 operates in accordance with this program.
The DRAM 11 is used as a buffer memory that temporarily stores the image data captured by the CCD 5 and sent to the CPU 10 and the sound data collected by the built-in microphone of the audio processing unit 15, and is also used as the working memory of the CPU 10.
The flash memory 13 is a recording medium that stores image data and sound data.

The image display unit 12 includes a color LCD and its drive circuit. In the shooting standby state it displays the subject imaged by the CCD 5 as a through image, and when a recorded image is played back it displays the recorded image read from the flash memory 13 and decompressed.
The key input unit 14 includes a plurality of operation keys such as a shutter button, a mode switching key, a sound playback key, a cross key, and a SET key, and outputs operation signals corresponding to the user's key operations to the CPU 10.

The audio processing unit 15 includes a built-in microphone, an amplifier/low-pass filter, an A/D converter, a D/A converter, a low-pass filter/amplifier, a built-in speaker, and the like. During moving-image shooting with sound, it converts the sound collected by the built-in microphone into a digital signal and sends it to the CPU 10. The CPU 10 sequentially stores the received sound data in the buffer memory (DRAM 11) and records the stored sound data in the flash memory 13 in association with the captured moving image data. In other words, it records moving image data with sound.

The audio processing unit 15 also plays back sound data sent from the CPU 10 by emitting it from the built-in speaker.
The built-in microphone and the built-in speaker are both provided on the face of the digital camera 1 on which the photographing lens 2 is mounted. This allows the subject to hear the played-back sound clearly and allows the subject's voice to be recorded well. The image display unit 12 is provided on the face opposite the one on which the photographing lens 2 is mounted (the photographer's side).

B. Functions of the digital camera 1
Next, the functions of the components of the digital camera 1 of the present invention are described.
FIGS. 2 and 3 are block diagrams for explaining the functions of the audio processing unit and the CPU 10 of the present invention.
FIG. 2 is a diagram for explaining the function of the CPU 10 when moving-image shooting with sound (recording processing) is performed while sound data recorded in the flash memory 13 or the memory 9 is being played back.
As FIG. 2 shows, the audio processing unit 15 includes a built-in microphone 21, an amplifier/low-pass filter 22, an A/D converter 23, a D/A converter 24, a low-pass filter/amplifier 25, and a built-in speaker 26, and the CPU 10 includes a subtraction processing unit 31.

First, the built-in microphone 21 converts the collected sound, such as the subject's voice, into an electrical signal and outputs the sound data to the amplifier/low-pass filter 22. The amplifier/low-pass filter 22 amplifies the sound data (AMP), cuts unnecessary frequency bands (LPF), and outputs the result to the A/D converter 23. The A/D converter 23 converts the input sound data into a digital signal and outputs it to the CPU 10.

At this time, the CPU 10 stores digital sound data recorded in advance in the flash memory 13 or the memory 9 (music, novelty sounds, and the like) in the buffer memory (DRAM 11) and outputs the stored sound data to the D/A converter 24. The D/A converter 24 converts the received sound data into an analog signal and outputs it to the low-pass filter/amplifier 25. The low-pass filter/amplifier 25 cuts unnecessary frequency bands from the input sound data, amplifies it, and outputs it to the built-in speaker 26, which converts the input sound data into sound and emits it.

In this case, if sound is emitted from the built-in speaker 26 while recording is in progress, the sound played back from the built-in speaker 26 (the playback sound) is collected by the built-in microphone 21 at a higher sound level than the subject's voice, and the subject's voice is drowned out. This is because the built-in microphone 21 is much closer to the built-in speaker 26 than to the subject, and because the built-in speaker 26 emits sound at a level loud enough for the subject to hear.

Accordingly, the subtraction processing unit 31 of the CPU 10 subtracts the sound data being played back through the built-in speaker 26 from the sound data collected by the built-in microphone 21, so that the subject's voice and the playback sound, both as collected by the built-in microphone 21, have the same sound level (volume). The purpose of this subtraction is to equalize the volume of the collected voice and the playback sound; it is not necessarily to eliminate the playback sound. The playback sound data is subtracted in synchronization with the playback sound contained in the recording.
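The text does not say how the playback data is synchronized with the playback sound captured by the microphone. One plausible approach, shown here purely as an assumption, is to estimate the speaker-to-microphone delay by cross-correlation. The function name `estimate_delay`, the `max_delay` search bound, and the floating-point sample lists are all hypothetical.

```python
def estimate_delay(mic, ref, max_delay):
    """Return the lag in samples (0..max_delay) at which ref best lines up with mic.

    Assumption: the playback sound reaches the microphone as a delayed,
    scaled copy of ref. The patent does not specify an alignment method.
    """
    best_lag, best_score = 0, float("-inf")
    for lag in range(max_delay + 1):
        overlap = min(len(ref), len(mic) - lag)
        # Cross-correlation of the reference with the microphone signal at this lag.
        score = sum(mic[i + lag] * ref[i] for i in range(overlap))
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


ref = [0.0, 1.0, 0.0, -1.0, 0.5, 0.0, 0.0, 0.0]   # data sent to the speaker
mic = [0.0] * 3 + [0.8 * x for x in ref]           # same data, 3 samples late, attenuated
print(estimate_delay(mic, ref, max_delay=5))       # prints 3
```

In practice the subject's voice and room reflections are mixed into the microphone signal, so the correlation peak is less clean than in this toy input.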

The user may be allowed to select the degree of subtraction, and subtraction may be performed to the selected degree. For example, with a high degree of subtraction the playback sound is recorded at a lower volume (sound level), and with a low degree of subtraction it is recorded at a higher volume.

The sound data after subtraction by the subtraction processing unit 31 is then stored in the buffer memory and recorded in the flash memory 13.
In this way, the volume of the playback sound from the built-in speaker 26 as collected by the built-in microphone 21 can be adjusted before recording.
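The subtraction with a user-selectable degree can be sketched as follows. This is an illustration under stated assumptions, not the patented implementation: the names `subtract_playback`, `delay`, `gain`, and `degree` are hypothetical, and the delay and pickup gain of the playback sound are taken as already known.

```python
def subtract_playback(mic, ref, delay, gain, degree=1.0):
    """Subtract the delayed, scaled playback data from the microphone data.

    delay  -- speaker-to-microphone lag in samples (assumed known)
    gain   -- level of the playback sound as picked up by the microphone (assumed known)
    degree -- user-selected degree of subtraction, 0.0 (none) to 1.0 (full)
    """
    out = []
    for n, m in enumerate(mic):
        k = n - delay
        r = ref[k] if 0 <= k < len(ref) else 0.0
        out.append(m - degree * gain * r)
    return out


voice = [0.1, -0.1, 0.2, 0.0, 0.1]            # subject's voice at the microphone
ref = [1.0, -1.0, 0.5]                        # data sent to the speaker
# Microphone signal: voice plus the playback sound, one sample late, at 0.8 gain.
mic = [v + 0.8 * (ref[n - 1] if 1 <= n <= 3 else 0.0) for n, v in enumerate(voice)]

full = subtract_playback(mic, ref, delay=1, gain=0.8)               # playback sound removed
half = subtract_playback(mic, ref, delay=1, gain=0.8, degree=0.5)   # playback sound halved
```

A `degree` below 1.0 leaves some playback sound in the recording, which matches the stated aim of balancing the two levels rather than necessarily eliminating the playback sound.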

FIG. 3 is a diagram for explaining the function of the CPU 10 when playing back sound data that was recorded by moving-image shooting with sound (recording processing) performed while sound data was being played back, with no subtraction processing applied at recording time.
In sound recorded without subtracting the sound data played back during moving-image shooting with sound (recording processing) from the sound data collected by the built-in microphone 21, the played-back sound is far louder than the subject's voice, and the subject's voice is drowned out.

Accordingly, when such sound is played back, the subtraction processing unit 31 of the CPU 10 subtracts the sound data that was played back during recording from the recorded sound data, so that the subject's voice recorded during recording processing and the sound played back during recording processing have the same sound level. The purpose of this subtraction is to equalize the volume of the voice collected by the built-in microphone 21 and the playback sound collected by the built-in microphone 21; it is not necessarily to eliminate the sound played back during recording.
The user may be allowed to select the degree of subtraction, and subtraction may be performed to the selected degree. For example, with a high degree of subtraction the sound played back during recording is reproduced at a lower volume (sound level), and with a low degree of subtraction it is reproduced at a higher volume.

The sound data after subtraction by the subtraction processing unit 31 is input to the D/A converter 24, which converts it into an analog signal and outputs it to the low-pass filter/amplifier 25. The low-pass filter/amplifier 25 cuts unnecessary frequency bands from the input sound data, amplifies it, and outputs it to the built-in speaker 26, which converts the input sound data into sound and emits it. In this way, the volume of the sound that was played back during recording can be adjusted at playback time.

C. Operation of the digital camera 1
The operation of the digital camera 1 in this embodiment is described separately for the case where subtraction is performed at recording time and the case where it is performed at playback time.
C-1. First, the operation when subtraction is performed at recording time is described.
The operation when the played-back sound data is subtracted at recording time is described with reference to the flowchart of FIG. 4.

When the user sets the shooting-with-sound mode by operating the mode switching key of the key input unit 14, in step S1 the CPU 10 causes the CCD 5 to start imaging the subject at a predetermined frame rate (starts moving-image capture), generates luminance and color-difference frame image data (YUV data) from the frame image data obtained in sequence, stores the generated frame image data in the buffer memory (DRAM 11), and starts through-image display, in which the stored frame image data of the subject is displayed in sequence on the image display unit 12.

Next, in step S2, the CPU 10 determines whether sound is to be played back.
When the user operates the sound playback key, the CPU 10 displays a list of the sound data recorded in the flash memory 13 or the memory 9 (music, novelty sounds, and the like) superimposed on the through-image display, and the user can select any sound data by operating the cross key and the SET key. When this series of operations has been performed, the CPU 10 determines that sound is to be played back.

If it is determined in step S2 that sound is to be played back, the process proceeds to step S3, where the CPU 10 starts playback of the sound data selected by the user, and then proceeds to step S4. That is, the CPU 10 reads the selected sound data, stores it in the buffer memory, and sequentially inputs the stored sound data to the audio processing unit 15, which starts converting the input sound data to analog and emitting it from the built-in speaker 26. This playback ends automatically when the sound data has been played to the end, and also ends when the user performs an operation instructing playback to end.
If it is determined in step S2 that sound is not to be played back, the process proceeds directly to step S4.

In step S4, the CPU 10 determines whether to start video recording. This determination is based on whether an operation signal corresponding to pressing of the shutter button has been sent from the key input unit 14.
If it is determined in step S4 that video recording is not to be started, the process proceeds to step S5, where the CPU 10 determines whether sound playback is currently in progress, that is, whether sound is currently being emitted from the built-in speaker 26.
If it is determined in step S5 that sound playback is not in progress, the process returns to step S2; if it is determined that sound playback is in progress, the process returns to step S4.
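The standby loop of steps S2 to S5 can be sketched as a small event loop. The event names (`"play"`, `"playback_end"`, `"shutter"`) and the function `standby` are hypothetical stand-ins for the key operation signals; this is an illustrative reading of the flowchart, not firmware from the source.

```python
def standby(events):
    """Walk steps S2-S5 over a sequence of events.

    Returns True or False (whether sound is playing) at the moment recording
    starts, or None if the shutter button is never pressed.
    """
    playing = False
    for ev in events:
        if ev == "play" and not playing:
            # S2 -> S3: a playback request is only examined while nothing is playing.
            playing = True
        elif ev == "playback_end":
            # Playback reached the end or the user stopped it.
            playing = False
        elif ev == "shutter":
            # S4: shutter pressed, leave standby and start recording (S6).
            return playing
    return None


print(standby(["play", "shutter"]))                  # True
print(standby(["play", "playback_end", "shutter"]))  # False
```

The return value corresponds to the check made in step S8 after recording starts, which decides whether subtraction begins immediately.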

If it is determined in step S4 that video recording is to be started, the process proceeds to step S6, where the CPU 10 starts moving-image recording. That is, it starts recording in the flash memory 13 the frame image data captured in sequence by the CCD 5 and stored in the buffer memory.
Next, in step S7, the CPU 10 starts sound recording. That is, it starts the process in which the sound collected by the built-in microphone 21 is converted into an electrical signal and then into a digital signal, the resulting sound data is stored in sequence in the buffer memory, and the stored sound data is recorded in the flash memory 13.

Next, in step S8, the CPU 10 determines whether sound playback is currently in progress.
If it is determined in step S8 that sound playback is in progress, the process proceeds to step S12. If it is determined that sound playback is not in progress, the process proceeds to step S9, where the CPU 10 determines whether sound is to be played back, in the same way as in step S2.

ステップＳ９で、音を再生しないと判断すると、ステップＳ１０に進み、ＣＰＵ１０は、録画を終了するか否かを判断する。この判断は、シャッタボタンの押下に対応する操作信号がキー入力１４から送られてきたか否かにより判断する。
なお、ここでは、シャッタボタンの押下と共に録画を開始し、再びシャッタボタンが押下されると録画を終了すると判断するようにしたが、シャッタボタンの押下中は録画を行い、シャッタボタンの押下が解除されると録画を終了すると判断するようにしてもよい。
ステップＳ１０で、録画を終了しないと判断するとステップＳ９に戻る。 If it is determined in step S9 that no sound is reproduced, the process proceeds to step S10, and the CPU 10 determines whether or not to end the recording. This determination is made based on whether or not an operation signal corresponding to pressing of the shutter button is sent from the key input 14.
Here, recording is started when the shutter button is pressed, and it is determined that the recording is to be ended when the shutter button is pressed again. Alternatively, recording may be performed while the shutter button is held down, and it may be determined that the recording is to be ended when the shutter button is released.
If it is determined in step S10 that the recording is not finished, the process returns to step S9.

一方、ステップＳ９で、音を再生すると判断すると、ステップＳ１１に進み、ＣＰＵ１０は、ユーザによって選択された音データの再生処理を開始させて、ステップＳ１２に進む。
ステップＳ１２に進むと、ＣＰＵ１０の減算処理部３１は、内蔵マイク２１から集音された音データから、現在内蔵スピーカ２６から放音している（再生している）音データを減算する処理を開始させる。この減算処理により減算された音データが、音声録音処理によってバッファメモリに記憶され、フラッシュメモリ１３に記録されていることになる。 On the other hand, if it is determined in step S9 that the sound is to be reproduced, the process proceeds to step S11, and the CPU 10 starts the reproduction process of the sound data selected by the user, and proceeds to step S12.
In step S12, the subtraction processing unit 31 of the CPU 10 starts a process of subtracting the sound data currently being emitted (reproduced) from the built-in speaker 26 from the sound data collected by the built-in microphone 21. The sound data resulting from this subtraction is stored in the buffer memory by the sound recording process and recorded in the flash memory 13.
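
The subtraction started in step S12 can be illustrated with a minimal sketch. It assumes 16-bit PCM samples that are already aligned sample by sample; the function name `subtract_playback` and the `gain` parameter (a stand-in for how loudly the speaker output reaches the microphone) are hypothetical and do not appear in the description. A real device would also have to compensate for delay and the acoustic path between speaker and microphone, which this sketch ignores.

```python
from typing import List, Sequence

PCM_MIN, PCM_MAX = -32768, 32767  # assumed 16-bit sample range


def subtract_playback(mic: Sequence[int],
                      playback: Sequence[int],
                      gain: float = 1.0) -> List[int]:
    """Subtract the (scaled) playback samples from the microphone samples.

    Samples beyond the end of `playback` are treated as silence, so the
    microphone data passes through unchanged there.
    """
    out = []
    for i, m in enumerate(mic):
        p = playback[i] if i < len(playback) else 0
        value = int(round(m - gain * p))
        # Clamp to the assumed 16-bit range to avoid overflow.
        out.append(max(PCM_MIN, min(PCM_MAX, value)))
    return out
```

With `gain=1.0` the playback component is removed entirely; a smaller value leaves part of it in the recording.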

次いで、ステップＳ１３に進み、ＣＰＵ１０は、録画を終了するか否かを判断する。この判断は、ステップＳ１０と同様の判断を行う。
ステップＳ１０で、録画を終了しないと判断すると、ステップＳ１４に進み、ＣＰＵ１０は、音の再生処理が終了したか否かを判断する。この音の再生処理が終了したと判断する場合としては、再生している音が最後まで再生された場合や、ユーザによって再生終了の指示操作が行なわれた場合等がある。このユーザによって再生終了の指示操作が行なわれると、ＣＰＵ１０は、音の再生処理を中止させる。 Next, the process proceeds to step S13, and the CPU 10 determines whether or not to end the recording. This determination is the same as in step S10.
If it is determined in step S10 that the recording is not to be ended, the process proceeds to step S14, and the CPU 10 determines whether or not the sound reproduction processing has ended. The case where it is determined that the sound reproduction processing has been completed includes the case where the sound being reproduced has been reproduced to the end, the case where the user has performed an instruction to end reproduction, and the like. When the user performs a reproduction end instruction operation, the CPU 10 stops the sound reproduction process.

ステップＳ１４で、音の再生処理が終了していないと判断すると、ステップＳ１３に戻り、ステップＳ１４で音の再生処理が終了したと判断すると、ステップＳ１５に進み、ＣＰＵ１０は、減算処理を終了させて、ステップＳ９に戻る。減算処理が終了されると、内蔵マイク２１によって取得された音データがそのままバッファメモリに記憶され、フラッシュメモリ１３に記録されることになる。つまり、図２の減算処理部３１は、Ａ／Ｄ変換器２３から送られてきた音データを減算させることなくそのままスルーさせてバッファメモリ等に記憶、記録させることになる。 If it is determined in step S14 that the sound reproduction process has not ended, the process returns to step S13. If it is determined in step S14 that the sound reproduction process has ended, the process proceeds to step S15, where the CPU 10 ends the subtraction process and returns to step S9. Once the subtraction process has ended, the sound data acquired by the built-in microphone 21 is stored as it is in the buffer memory and recorded in the flash memory 13. That is, the subtraction processing unit 31 in FIG. 2 passes the sound data sent from the A / D converter 23 through without subtraction, so that it is stored and recorded in the buffer memory and the like.
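
The switch between subtracting (from step S12) and passing the microphone data straight through (after step S15) can be sketched as a small stateful unit. The class name `SubtractionUnit` and its methods are hypothetical illustrations, not names from the description, and the per-sample interface is an assumption made for brevity.

```python
class SubtractionUnit:
    """Toy model of a unit that either subtracts or passes samples through."""

    def __init__(self) -> None:
        self.active = False  # no subtraction until playback starts

    def start(self) -> None:
        """Corresponds to starting the subtraction process (step S12)."""
        self.active = True

    def stop(self) -> None:
        """Corresponds to ending the subtraction process (step S15)."""
        self.active = False

    def process(self, mic_sample: int, playback_sample: int) -> int:
        """Return the sample to be buffered and recorded."""
        if self.active:
            return mic_sample - playback_sample
        # Pass-through: the microphone sample is recorded as it is.
        return mic_sample
```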

一方、ステップＳ１０で録画を終了すると判断した場合、ステップＳ１３で録画を終了したと判断した場合は、ステップＳ１６に進み、ＣＰＵ１０は、動画記録処理によりフラッシュメモリ１３に記録されたフレーム画像データと、録音処理によりフラッシュメモリ１３に記録された音データとから音付動画ファイルを生成する。なお、音付動画ファイルを生成するようにしたが、要は音データと動画データとが関連付けられて記録されるようにすればよい。
これにより、音を再生しながら録音した場合であっても、再生された音によって被写体の音声がかき消されることがなく、録音中に音を再生する場合の被写体音声の録音精度を向上することができる。 On the other hand, if it is determined in step S10 that the recording is to be ended, or if it is determined in step S13 that the recording is ended, the process proceeds to step S16, and the CPU 10 stores the frame image data recorded in the flash memory 13 by the moving image recording process, A sound-added moving image file is generated from the sound data recorded in the flash memory 13 by the recording process. Although the sound-added moving image file is generated, the sound data and the moving image data may be recorded in association with each other.
As a result, even if recording is performed while sound is being reproduced, the subject's voice is not drowned out by the reproduced sound, and the recording accuracy of the subject's voice when sound is reproduced during recording can be improved.

Ｃ−２．次に、再生時に減算する場合の動作を説明する。
この再生時に減算する場合の動作は、更に、音付動画撮影と音付動画再生に分けて説明する。
まず、再生時に減算する場合の音付動作撮影の動作を図５のフローチャートに従って説明する。 C-2. Next, the operation when subtracting during reproduction will be described.
The operation in the case of subtraction at the time of reproduction will be described separately for sound-added moving image shooting and sound-added moving image reproduction.
First, the operation of sound-added moving image shooting when the subtraction is performed at reproduction time will be described with reference to the flowchart of FIG. 5.

ユーザのキー入力部１４のモード切替キーの音付動画撮影モードに設定されると、ステップＳ３１で、ＣＰＵ１０は、動画撮像処理を開始して、被写体のスルー画像表示を開始させる。
次いで、ステップＳ３２で、ＣＰＵ１０は、音を再生させるか否かを判断する。この判断は、図４のステップＳ２と同様の判断を行う。
ステップＳ３２で、音を再生させると判断すると、ステップＳ３３に進み、ＣＰＵ１０は、ユーザによって選択された音データの再生処理を開始させて、ステップＳ３４に進む。
一方、ステップＳ３２で、音を再生させないと判断するとそのままステップＳ３４に進む。 When the user sets the sound-added moving image shooting mode with the mode switching key of the key input unit 14, in step S31 the CPU 10 starts moving image capturing processing and starts displaying a through image of the subject.
Next, in step S32, the CPU 10 determines whether or not to reproduce the sound. This determination is the same as step S2 in FIG. 4.
If it is determined in step S32 that the sound is to be reproduced, the process proceeds to step S33, and the CPU 10 starts the reproduction process of the sound data selected by the user, and proceeds to step S34.
On the other hand, if it is determined in step S32 that the sound is not reproduced, the process proceeds to step S34 as it is.

ステップＳ３４に進むと、ＣＰＵ１０は、録画を開始させるか否かを判断する。この判断は、図４のステップＳ４と同様の判断を行う。
ステップＳ３４で、録画を開始しないと判断すると、ステップＳ３５に進み、ＣＰＵ１０は、現在、音の再生処理中であるか否かを判断する。つまり、現在、内蔵スピーカから音が放音されているか否かを判断する。
ステップＳ３５で、音の再生処理が行なわれていないと判断するとステップＳ３２に戻り、音の再生処理が行なわれていると判断するとステップＳ３４に戻る。 In step S34, the CPU 10 determines whether to start recording. This determination is the same as step S4 in FIG. 4.
If it is determined in step S34 that the recording is not started, the process proceeds to step S35, and the CPU 10 determines whether or not a sound reproduction process is currently being performed. That is, it is determined whether or not sound is currently being emitted from the built-in speaker.
If it is determined in step S35 that sound reproduction processing is not being performed, the process returns to step S32. If it is determined that sound reproduction processing is being performed, the process returns to step S34.

一方、ステップＳ３４で、録画を開始すると判断すると、ステップＳ３６に進み、ＣＰＵ１０は、動画記録処理を開始する。つまり、ＣＣＤ５により順次撮像されバッファメモリに記憶されたフレーム画像データをフラッシュメモリ１３に記録させていく処理を開始する。
次いで、ＣＰＵ１０は、ステップＳ３７に進み、ＣＰＵ１０は、録音処理を開始する。つまり、内蔵マイク２６により集音された音が電気信号に変換され、デジタル信号に変換された音データを順次バッファメモリに記憶し、該記憶した音データをフラッシュメモリ１３に記録させていく処理を開始する。 On the other hand, if it is determined in step S34 that recording is started, the process proceeds to step S36, and the CPU 10 starts moving image recording processing. That is, the process of recording the frame image data sequentially captured by the CCD 5 and stored in the buffer memory in the flash memory 13 is started.
Next, the CPU 10 proceeds to step S37 and starts the recording process. That is, it starts a process in which the sound collected by the built-in microphone 26 is converted into an electrical signal, the resulting digital sound data is sequentially stored in the buffer memory, and the stored sound data is recorded in the flash memory 13.

次いで、ステップＳ３８に進み、ＣＰＵ１０は、現在、音の再生処理中であるか否かを判断する。つまり、現在、内蔵スピーカから音が放音されているか否かを判断する。
ステップＳ３８で、音の再生処理が行なわれていないと判断すると、ステップＳ３９に進み、ＣＰＵ１０は、音を再生させるか否かを判断する。この判断は、図４のステップＳ２と同様の判断を行う。 Next, the process proceeds to step S38, and the CPU 10 determines whether or not a sound reproduction process is currently being performed. That is, it is determined whether or not sound is currently being emitted from the built-in speaker.
If it is determined in step S38 that sound reproduction processing is not being performed, the process proceeds to step S39, and the CPU 10 determines whether or not to reproduce sound. This determination is the same as step S2 in FIG. 4.

ステップＳ３９で、音を再生させると判断すると、ステップＳ４０に進み、ＣＰＵ１０は、ユーザによって選択された音データの再生処理を開始させて、ステップＳ４１に進む。この再生処理は、最後まで再生させると自動的に終了するが、ユーザによって再生終了の指示操作が行なわれた場合も終了する。
一方、ステップＳ３８で現在、音の再生処理が行なわれていないと判断した場合、ステップＳ３９で音を再生させないと判断した場合は、そのままステップＳ４１に進む。 If it is determined in step S39 that the sound is to be reproduced, the process proceeds to step S40, where the CPU 10 starts the reproduction process of the sound data selected by the user, and proceeds to step S41. This reproduction processing is automatically terminated when the reproduction is completed to the end, but is also terminated when a reproduction end instruction operation is performed by the user.
On the other hand, if it is determined in step S38 that no sound reproduction process is currently being performed, or if it is determined in step S39 that no sound is to be reproduced, the process proceeds directly to step S41.

ステップＳ４１に進むと、ＣＰＵ１０は、録画を終了するか否かを判断する。この判断は、図４のステップＳ１０と同様の判断を行う。
ステップＳ４１で録画を終了しないと判断するとステップＳ３８に戻り、ステップＳ４１で録画を終了すると判断すると、ステップＳ４２に進み、ＣＰＵ１０は、動画記録処理によりフラッシュメモリ１３に記録されたフレーム画像データと、録音処理によりフラッシュメモリ１３に記録された音データとから音付動画ファイルを生成すると共に、再生された音データの情報（音情報）も関連付けて記録させる。この音情報とは、再生された音データの所在情報、どのタイミングで音データが再生されたか、どのタイミングで音データの再生が終了したか、再生された音データのうち、どの部分から録音されたか等の情報である。例えば、録音処理開始時に既に音データが再生されていた場合は、該再生されていた音データのうち、どのタイミングから該音データが録音されたかの情報や、録音処理開始されてから音データが再生された場合は、録画、録音のどのタイミングで音データが再生されたかの情報である。 In step S41, the CPU 10 determines whether to end the recording. This determination is the same as step S10 in FIG. 4.
If it is determined in step S41 that the recording is not to be ended, the process returns to step S38. If it is determined in step S41 that the recording is to be ended, the process proceeds to step S42, where the CPU 10 generates a sound-added moving image file from the frame image data recorded in the flash memory 13 by the moving image recording process and the sound data recorded in the flash memory 13 by the recording process, and also records information on the reproduced sound data (sound information) in association with the file. This sound information includes the location of the reproduced sound data, the timing at which the sound data was reproduced, the timing at which reproduction of the sound data ended, and the position in the reproduced sound data from which it was recorded. For example, if sound data was already being reproduced when the recording process started, the sound information indicates the point in that sound data from which it was recorded; if the sound data was reproduced after the recording process started, it indicates the point in the video and audio recording at which the sound data was reproduced.
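
The sound information recorded in step S42 could be represented along the following lines. This is only a sketch under stated assumptions: the description does not specify a data format, so the `SoundInfo` fields, the helper `make_sound_info`, the use of seconds on a shared clock, and the example path are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class SoundInfo:
    source_path: str             # location of the reproduced sound data
    start_in_recording_s: float  # when, within the recording, the sound begins
    end_in_recording_s: float    # when, within the recording, the sound ends
    offset_in_source_s: float    # position in the source where recording began


def make_sound_info(path: str,
                    record_start: float, record_end: float,
                    play_start: float, play_end: float) -> SoundInfo:
    """Build sound information from timestamps on a common clock (seconds)."""
    if play_start <= record_start:
        # Playback was already running when recording began.
        start = 0.0
        offset = record_start - play_start
    else:
        # Playback began after recording had started.
        start = play_start - record_start
        offset = 0.0
    end = min(play_end, record_end) - record_start
    return SoundInfo(source_path=path,
                     start_in_recording_s=start,
                     end_in_recording_s=end,
                     offset_in_source_s=offset)
```

For instance, `make_sound_info("music/dance.wav", 10.0, 25.0, 4.0, 30.0)` (with a made-up path) describes music that had already been playing for 6 seconds when recording began.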

次に、再生時に減算する場合の音付動作再生に動作を図６のフローチャートに従って説明する。
ユーザのキー入力部１４のモード切替キーの音付動画撮影モードに設定されると、ステップＳ５１に進み、ＣＰＵ１０は、ユーザの操作にしたがって、再生対象となる音付動画ファイルを選択する。 Next, the operation of sound-added moving image reproduction when the subtraction is performed at reproduction time will be described with reference to the flowchart of FIG. 6.
When the user sets the sound-added moving image shooting mode with the mode switching key of the key input unit 14, the process proceeds to step S51, and the CPU 10 selects a sound-added moving image file to be reproduced in accordance with the user's operation.

次いで、ステップＳ５２で、ＣＰＵ１０は、該選択した音付動画ファイルの再生処理を開始する。つまり、音付動画ファイルを読み出してバッファメモリに記憶させ、フレーム画像データを順次画像表示部１２に表示させていくと共に、音データを順次内蔵スピーカ２６から放音させる処理を開始させる。
このとき、該選択した音付動画ファイルに音情報が関連付けて記録されている場合は、該音情報に基づいて録音時に再生された音データを特定し、フラッシュメモリ１３やメモリ９から読み出してバッファメモリに音データを記憶させておく。 Next, in step S52, the CPU 10 starts reproduction processing of the selected moving image file with sound. In other words, the sound-added moving image file is read out and stored in the buffer memory, the frame image data is sequentially displayed on the image display unit 12, and the process of sequentially emitting sound data from the built-in speaker 26 is started.
At this time, if sound information is recorded in association with the selected moving image file with sound, sound data reproduced at the time of recording is specified based on the sound information, read from the flash memory 13 or the memory 9, and buffered. The sound data is stored in the memory.

次いで、ステップＳ５３で、ＣＰＵ１０は、撮影時に再生された音の録音が開始されたタイミングが到来したか否かを判断する。この判断は、ステップＳ５１で選択した音付動画ファイルに関連付けられて記録されている音情報に基づいて判断する。この撮影時に再生された音の録音が開始されたタイミングとは、例えば、動画記録、録音開始時に既に音データが再生されている場合は、動画記録、録音が開始されたタイミングのことであり、動画記録、録音が開始され、その後、音データの再生が開始された場合は、該音データの再生開始が開始されたタイミングのことである。 Next, in step S53, the CPU 10 determines whether or not the timing at which recording of the sound reproduced at the time of shooting has started has arrived. This determination is made based on the sound information recorded in association with the sound-added moving image file selected in step S51. The timing at which recording of the sound played back at the time of shooting is started, for example, when sound data has already been played back at the start of video recording and recording, is the timing at which video recording and recording started, When moving image recording and recording are started and then reproduction of sound data is started, this is the timing at which the start of reproduction of the sound data is started.

ステップＳ５３で、再生された音の録音が開始されたタイミングが到来していないと判断すると、ステップＳ５４に進み、ＣＰＵ１０は、音付動画ファイルの再生処理が終了したか否かを判断する。
ステップＳ５４で、音付動画ファイルの再生処理が終了していないと判断するとステップＳ５３に戻る。 If it is determined in step S53 that the recording timing of the reproduced sound has not started, the process proceeds to step S54, and the CPU 10 determines whether or not the reproduction processing of the sound-added moving image file has been completed.
If it is determined in step S54 that the reproduction process of the moving image file with sound has not been completed, the process returns to step S53.

一方、ステップＳ５３で、再生された音の録音が開始されたタイミングが到来したと判断すると、ステップＳ５５に進み、ＣＰＵ１０の減算処理部３１は、現在再生している音付動画ファイルの音データから、バッファメモリに記憶されている録音時に再生された音データを減算する処理を開始させる。このときは、録音時に再生された音と同期させて、音付動画ファイルの音データから、再生させた音データを減算させる。この減算された音データが内蔵スピーカ２６から放音されることになる。 On the other hand, if it is determined in step S53 that the recording timing of the reproduced sound has started, the process proceeds to step S55, and the subtraction processing unit 31 of the CPU 10 determines from the sound data of the sound-added moving image file currently being reproduced. Then, the process of subtracting the sound data reproduced during recording stored in the buffer memory is started. At this time, the reproduced sound data is subtracted from the sound data of the sound-added moving image file in synchronization with the sound reproduced at the time of recording. The subtracted sound data is emitted from the built-in speaker 26.
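
The synchronized subtraction of step S55 can be sketched as follows, assuming both signals are lists of samples at the same sampling rate and that the sound information has already been converted to sample indices. The function `subtract_synced` and its parameters `start`, `offset`, and `end` are hypothetical names introduced for illustration.

```python
from typing import List, Optional, Sequence


def subtract_synced(recorded: Sequence[int],
                    source: Sequence[int],
                    start: int,
                    offset: int,
                    end: Optional[int] = None) -> List[int]:
    """Subtract `source` from `recorded` over the range [start, end).

    `start`  : index in `recorded` where the reproduced sound begins
    `offset` : index in `source` that lines up with `recorded[start]`
    `end`    : index in `recorded` where the reproduced sound stops
    """
    if end is None:
        end = len(recorded)
    out = list(recorded)  # samples outside the range stay untouched
    for i in range(start, min(end, len(recorded))):
        j = offset + (i - start)
        if j >= len(source):
            break  # the source ran out; the rest passes through
        out[i] = recorded[i] - source[j]
    return out
```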

次いで、ステップＳ５６で、ＣＰＵ１０は、音付動画ファイルの再生が終了したか否かを判断する。この判断は、音付動画ファイルを最後まで再生させた場合や、ユーザのキー入力部１４の操作により再生を終了する旨の操作信号が送られてくると、音付動画ファイルの再生が終了したと判断する。
ステップＳ５６で、音付動画ファイルの再生が終了していないと判断すると、ステップＳ５７に進み、ＣＰＵ１０は、撮影時（録音時）に再生された音の録音が終了したタイミングが到来したか否かを判断する。この判断も、ステップＳ５１で選択した音付動画ファイルに関連付けられて記録されている音情報に基づいて行なう。この撮影時に再生された音の録音が終了したタイミングとして、撮影時に再生された音が最後まで再生され、自動的に音の再生が終了したタイミング、若しくは撮影時にユーザによって再生している音の再生終了操作が行なわれたタイミングなどがある。 Next, in step S56, the CPU 10 determines whether or not the reproduction of the sound-added moving image file has ended. The CPU 10 determines that the reproduction has ended when the sound-added moving image file has been played to the end, or when an operation signal indicating the end of reproduction is sent as a result of the user's operation of the key input unit 14.
If it is determined in step S56 that the reproduction of the sound-added moving image file has not ended, the process proceeds to step S57, and the CPU 10 determines whether or not the timing at which recording of the sound reproduced at the time of shooting (recording) ended has arrived. This determination is also made based on the sound information recorded in association with the sound-added moving image file selected in step S51. Examples of this timing include the point at which the sound reproduced at the time of shooting was played to the end and its reproduction ended automatically, and the point at which the user performed an operation to end reproduction of the sound during shooting.

ステップＳ５７で、撮影時に再生された音の録音が終了したタイミングが到来していないと判断するとステップＳ５６に戻り、ステップＳ５７で、撮影時に再生された音の録音が終了したタイミングが到来したと判断すると、ステップＳ５８に進み、ＣＰＵ１０は、現在再生している音付動画ファイルの音データから録音時に再生された音データの減算処理を終了して、ステップＳ５３に戻る。
一方、ステップＳ５４、ステップＳ５６で、音付動画ファイルの再生を終了すると判断すると、ステップＳ５９に進み、ＣＰＵ１０は、音付動画ファイルの再生処理を終了させる。
これにより、音が再生されながら録音された音を再生する場合であっても、該録音時に再生された音によって被写体の音声がかき消されることなく、録音された音の再生精度を向上することができる。 If it is determined in step S57 that the recording of the sound reproduced at the time of shooting has not ended, the process returns to step S56, and in step S57, it is determined that the timing of the recording of the sound reproduced at the time of shooting has come. Then, the process proceeds to step S58, and the CPU 10 ends the subtraction process of the sound data reproduced at the time of recording from the sound data of the currently-reproduced moving image file with sound, and returns to step S53.
On the other hand, if it is determined in steps S54 and S56 that the reproduction of the sound-added moving image file is finished, the process proceeds to step S59, and the CPU 10 ends the sound-added moving image file reproduction process.
As a result, even when sound that was recorded while other sound was being reproduced is played back, the subject's voice is not drowned out by the sound reproduced at the time of recording, and the reproduction accuracy of the recorded sound can be improved.
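
Because the flow returns from step S58 to step S53, subtraction may be switched on and off several times during one playback. One way to picture this is as a list of time intervals derived from the sound information; the function `subtraction_active` and the interval representation below are assumptions for illustration only.

```python
from typing import List, Tuple


def subtraction_active(t: float,
                       intervals: List[Tuple[float, float]]) -> bool:
    """Return True if playback time `t` lies in any [start, end) interval.

    Each interval marks a span of the recording during which sound was
    being reproduced at shooting time (start at S53, end at S57).
    """
    return any(start <= t < end for start, end in intervals)
```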

以上のように、第１の実施の形態においては、音付動画撮影中に、音データが再生された場合であっても、内蔵マイク２１により集音された音データから該再生する音データを減算するので、撮影録音時に音を再生した場合であっても、再生された音によって被写体の音声がかき消されることがなく、音が再生されながら録音された音の再生の精度を向上することができる。
例えば、ダンス等の曲を再生ながら音付動画撮影を行なった場合であっても、動画再生の際にダンスの曲のみが極端に大きな音で再生されることがない。 As described above, in the first embodiment, even when sound data is reproduced during sound-added moving image shooting, the reproduced sound data is subtracted from the sound data collected by the built-in microphone 21. Therefore, even if sound is reproduced during shooting and recording, the subject's voice is not drowned out by the reproduced sound, and the reproduction accuracy of sound recorded while other sound was being reproduced can be improved.
For example, even when a sound-added moving image is shot while music such as dance music is being reproduced, the dance music alone is not reproduced at an extremely loud volume when the moving image is played back.

なお、上記第１の実施の形態において、被写体の音声と、録音時に再生された音とが同じ音レベルとなるように、内蔵マイク２１から集音された音データから該再生された音データを減算するようにしたが、録音時に再生された音が無くなるように（音レベルがゼロ若しくは略ゼロ）となるように、内蔵マイク２１から集音された音データから該再生された音データを減算し、該減算後の音データと録音時に再生された音データとを合成するようにしてもよい。この場合は、録音時に再生された音データの音レベルと被写体の音声の音レベルとが同じになるように合成するようにしてもよいし、ユーザによって選択された音レベルとなるように該録音時に再生された音データを合成させるようにしてもよい。これにより、再生された音を綺麗な音で録音することができる。
また、内蔵マイク２１により集音された音データをそのまま記録し、記録後に、記録した音データから、録音時に再生された音データを減算した音データを生成して記録するようにしてもよい。これにより、撮影時、再生時の処理負担を軽減することができる。 In the first embodiment, the reproduced sound data is subtracted from the sound data collected by the built-in microphone 21 so that the subject's voice and the sound reproduced at the time of recording have the same sound level. Alternatively, the reproduced sound data may be subtracted so that the sound reproduced at the time of recording is eliminated (its sound level becomes zero or substantially zero), and the subtracted sound data may then be synthesized with the sound data reproduced at the time of recording. In this case, the synthesis may be performed so that the sound level of the sound data reproduced at the time of recording equals the sound level of the subject's voice, or so that the reproduced sound data has a sound level selected by the user. Thereby, the reproduced sound can be recorded with clean sound quality.
Alternatively, sound data collected by the built-in microphone 21 may be recorded as it is, and after recording, sound data obtained by subtracting sound data reproduced during recording from the recorded sound data may be generated and recorded. Thereby, it is possible to reduce the processing load at the time of shooting and reproduction.
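
The variation in which the reproduced sound is first removed and then synthesized back at a chosen level can be sketched as below. The names `remix`, `echo_estimate` (the reproduced sound as picked up by the microphone), `clean_playback` (the original sound data), and `level` are hypothetical; how the picked-up component would actually be estimated is not covered by the description and is simply assumed to be given here.

```python
from typing import List, Sequence


def remix(mic: Sequence[float],
          echo_estimate: Sequence[float],
          clean_playback: Sequence[float],
          level: float = 1.0) -> List[float]:
    """Remove the picked-up playback, then mix the clean source back in."""
    out = []
    for m, e, c in zip(mic, echo_estimate, clean_playback):
        subject_only = m - e                   # playback removed (ideally to zero)
        out.append(subject_only + level * c)   # clean source at the chosen level
    return out
```

Setting `level` to 0 keeps only the subject's voice, while larger values raise the music relative to the voice.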

［第２の実施の形態］
次に第２の実施の形態について説明する。
第２の実施の形態では、音付動画撮影を行なうときや行なっている最中に、音データを再生した場合であっても、被写体の音声が再生した音にかき消されないよう十分に被写体の声が撮影者に聞こえるようにするというものである。 [Second Embodiment]
Next, a second embodiment will be described.
In the second embodiment, even when sound data is reproduced before or during sound-added moving image shooting, the subject's voice is made sufficiently audible to the photographer so that it is not drowned out by the reproduced sound.

Ｄ.デジタルカメラ１の機能
第２の実施の形態も、図１に示したものと同様の構成を有するデジタルカメラ１を用いることにより本発明の撮像装置を実現する。
但し、第２の実施の形態においては、音声処理部１５は、内蔵スピーカ、ローパスフィルタ／アンプ、Ａ／Ｄ変換器を２つずつ備え、一方の内蔵スピーカ（第１の実施の形態で説明した内蔵スピーカ２１）は撮影レンズ２が設けられている面（被写体側）に備えられ、他方の内蔵スピーカは画像表示部１２が設けられている反対側の面（撮影者側）に備えられている。また、ＣＰＵ１０は、音データと逆位相の音データを生成する逆位相生成部を備えている。 D. Function of Digital Camera 1 The second embodiment also implements the imaging apparatus of the present invention by using the digital camera 1 having the same configuration as that shown in FIG.
However, in the second embodiment, the sound processing unit 15 includes two each of the built-in speaker, the low-pass filter / amplifier, and the A / D converter. One built-in speaker (the built-in speaker 21 described in the first embodiment) is provided on the surface where the taking lens 2 is provided (subject side), and the other built-in speaker is provided on the opposite surface, where the image display unit 12 is provided (photographer side). In addition, the CPU 10 includes an anti-phase generation unit that generates sound data having a phase opposite to that of the sound data.

図７は、第２の実施の形態における音声処理部１５及びＣＰＵ１０の機能を説明するための図である。
図７を見るとわかるように、音声処理部１５は、Ｄ／Ａ変換器２４、ローパスフィルタ／アンプ２５、内蔵スピーカ２６（第１のスピーカ２６）に加え、更に、Ｄ／Ａ変換器２７、ローパスフィルタ／アンプ２８、内蔵スピーカ２９（第２のスピーカ２９）を備えている。なお、内蔵マイク２１、アンプ／ローパスフィルタ２２、Ａ／Ｄ変換器２３は、図示を省略している。 FIG. 7 is a diagram for explaining the functions of the audio processing unit 15 and the CPU 10 in the second embodiment.
As can be seen from FIG. 7, the audio processing unit 15 includes, in addition to the D / A converter 24, the low-pass filter / amplifier 25, and the built-in speaker 26 (first speaker 26), a D / A converter 27, a low-pass filter / amplifier 28, and a built-in speaker 29 (second speaker 29). The built-in microphone 21, the amplifier / low-pass filter 22, and the A / D converter 23 are not shown.

まず、ＣＰＵ１０は、フラッシュメモリ１３に記録されているデジタル信号の音データ（音楽等の音データ）をバッファメモリに記憶させ、ＣＰＵ１０の逆位相生成部３２は、該記憶させた音データから、該音データと逆位相の音声データを順次生成さていく。この逆位相の音声データの生成は既に周知技術なので説明を割愛する。 First, the CPU 10 stores sound data (sound data such as music) of a digital signal recorded in the flash memory 13 in a buffer memory, and the anti-phase generation unit 32 of the CPU 10 calculates the sound data from the stored sound data. The sound data with the opposite phase to the sound data is sequentially generated. Since the generation of the audio data with the opposite phase is already a well-known technique, the description is omitted.

そして、ＣＰＵ１０は、該記憶させた音データを順次Ｄ／Ａ変換器２４に出力すると共に、該生成された逆位相の音データをＤ／Ａ変換器２７に順次出力させていく。このとき、音データと生成された逆位相の音データとを同期させて、Ｄ／Ａ変換器２４、２７にそれぞれ出力させる。
Ａ／Ｄ変換器２４、２７は、該入力されたデジタル信号の音データからアナログ信号の音データに変換して、ローパスフィルタ／アンプ２５、２８にそれぞれ出力し、ローパスフィルタ／アンプ２５、２８は、入力された音データの不要な周波数帯域をカットして、増幅してから内蔵スピーカ２６、２９にそれぞれ出力し、内蔵スピーカ２６、２９は該入力された音データを音に変換して放音する。 The CPU 10 sequentially outputs the stored sound data to the D / A converter 24 and sequentially outputs the generated antiphase sound data to the D / A converter 27. At this time, the sound data and the generated sound data of the opposite phase are synchronized and output to the D / A converters 24 and 27, respectively.
The A / D converters 24 and 27 convert the input digital sound data into analog sound data and output it to the low-pass filters / amplifiers 25 and 28, respectively. The low-pass filters / amplifiers 25 and 28 cut unnecessary frequency bands from the input sound data, amplify it, and output it to the built-in speakers 26 and 29, respectively. The built-in speakers 26 and 29 convert the input sound data into sound and emit it.

このように、被写体側の内蔵スピーカ２６には、音データがそのまま放音され、撮影者側の内蔵スピーカ２９には音データと逆位相の音データが放音されるので、被写体には内蔵スピーカ２６から放音された音がそのまま伝わり、撮影者には、内蔵スピーカ２６から放音された音が、内蔵スピーカ２９から放音された逆位相の音によって減殺されるので、内蔵スピーカ２６から放音される音が小さくなり、被写体の声がかき消されることなく撮影者に十分に伝わる。 In this way, the sound data is emitted as it is from the built-in speaker 26 on the subject side, and sound data having the opposite phase is emitted from the built-in speaker 29 on the photographer side. The sound emitted from the built-in speaker 26 therefore reaches the subject unchanged, while on the photographer side it is attenuated by the opposite-phase sound emitted from the built-in speaker 29. As a result, the sound from the built-in speaker 26 becomes quieter for the photographer, and the subject's voice reaches the photographer sufficiently without being drowned out.
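
The description treats anti-phase generation as a well-known technique and gives no details. As a minimal sketch, for digital PCM samples it can be as simple as negating each sample; the function names `antiphase` and `heard_by_photographer`, the 16-bit range, and the `leak` factor (the fraction of the subject-side sound that reaches the photographer) are assumptions for illustration and ignore real acoustic delay and directivity.

```python
from typing import List, Sequence

PCM_MIN, PCM_MAX = -32768, 32767  # assumed 16-bit sample range


def antiphase(samples: Sequence[int]) -> List[int]:
    """Return the phase-inverted signal, clamped to the 16-bit range."""
    return [max(PCM_MIN, min(PCM_MAX, -s)) for s in samples]


def heard_by_photographer(front: Sequence[int],
                          rear: Sequence[int],
                          leak: float = 1.0) -> List[float]:
    """Toy superposition of the two speaker outputs at the photographer."""
    return [leak * f + r for f, r in zip(front, rear)]
```

In this idealized model, with `leak=1.0`, the two signals cancel completely on the photographer side; in practice the cancellation would only be partial.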

Ｅ．デジタルカメラ１の動作
以下、第２の実施の形態のデジタルカメラ１の動作を図８のフローチャートにしたがって説明する。
まず、ステップＳ７１で、ＣＰＵ１０は、音を再生させるか否かを判断する。この判断は、図４のステップＳ２や、ステップＳ９と同様の判断であり、ユーザによって音再生キーの操作を行なわれ、スルー画像表示に重ねて表示されたフラッシュメモリ１３等に記録されている音データ（音楽や、お遊びの音等の音データ）がユーザの十字キー、ＳＥＴキーの操作によって選択された場合は、音を再生させると判断する。 E. Operation of Digital Camera 1 The operation of the digital camera 1 according to the second embodiment will be described below with reference to the flowchart of FIG. 8.
First, in step S71, the CPU 10 determines whether or not to reproduce a sound. This determination is the same as in steps S2 and S9 in FIG. 4: when the user operates the sound reproduction key and then selects, with the cross key and the SET key, one of the sound data items (music, playful sound effects, and the like) recorded in the flash memory 13 or elsewhere and displayed superimposed on the through image, the CPU 10 determines that the sound is to be reproduced.

ステップＳ７１で、音を再生させると判断すると、ステップＳ７２に進み、ＣＰＵ１０の逆位相生成部３２は、ユーザによって選択された音データと逆位相の音データを生成する処理を開始する。
次いで、ステップＳ７３で、ユーザによって選択された音データを、第１のスピーカから放音させることにより該音データの再生処理を開始させる。
次いで、ステップＳ７４で、該生成された逆位相の音データを、第２のスピーカから放音させることにより該音データの再生処理を開始させる。このとき、逆位相の音データを、ステップＳ７３で再生される音データと同期させて放音させる。 If it is determined in step S71 that the sound is to be reproduced, the process proceeds to step S72, and the anti-phase generation unit 32 of the CPU 10 starts a process of generating sound data having an anti-phase with the sound data selected by the user.
Next, in step S73, the sound data selected by the user is emitted from the first speaker, thereby starting the reproduction processing of the sound data.
Next, in step S74, the sound data reproduction process is started by emitting the generated sound data of opposite phase from the second speaker. At this time, the sound data in the opposite phase is emitted in synchronization with the sound data reproduced in step S73.

次いで、ステップＳ７５で、音の再生処理を終了したか否かを判断する。この判断は、図４のステップＳ１４と同様の判断であり、再生している音が最後まで再生された場合や、ユーザによって再生を中止する操作が行なわれた場合等は、再生処理が終了したと判断する。
ステップＳ７５で、音の再生処理が終了していないと判断すると、終了するまでステップＳ７５に戻り、音の再生処理が終了したと判断すると、ステップＳ７６に進み、ＣＰＵ１０は、逆位相の生成処理、ステップＳ７３、７４の再生処理を終了してステップＳ７１に戻る。 Next, in step S75, it is determined whether or not the sound reproduction processing has ended. This determination is the same as step S14 in FIG. 4: when the sound being reproduced has been played to the end, or when the user has performed an operation to stop reproduction, it is determined that the reproduction processing has ended.
If it is determined in step S75 that the sound reproduction processing has not ended, the process stays at step S75 until it ends. If it is determined that the sound reproduction processing has ended, the process proceeds to step S76, where the CPU 10 ends the anti-phase generation processing and the reproduction processing of steps S73 and S74, and returns to step S71.

以上のように、第２の実施の形態においては、再生している音データと逆位相の音データを生成し、再生している音データを被写体側に向けられた第１のスピーカから放音し、該生成した逆位相の音データを撮影者側に向けられた第２のスピーカから放音するようにしたので、再生している音によって被写体の音声がかき消されること無く撮影者にも聞こえる。 As described above, in the second embodiment, sound data having an opposite phase to the sound data being reproduced is generated, and the sound data being reproduced is emitted from the first speaker directed toward the subject side. Since the generated sound data having the opposite phase is emitted from the second speaker directed to the photographer, the photographer can hear the sound of the subject without being erased by the sound being reproduced. .

［第３の実施の形態］
次に第３の実施の形態について説明する。
第３の実施の形態では、録音される音のうち、被写体の音声以外の周辺音を抑えることによって、被写体の音声が周辺音によってかき消されないようにするというものである。 [Third Embodiment]
Next, a third embodiment will be described.
In the third embodiment, ambient sounds other than the subject's voice are suppressed in the recorded sound so that the subject's voice is not drowned out by the ambient sound.

Ｆ．デジタルカメラ１の機能
第３の実施の形態も図１に示したものと同様の構成を有するデジタルカメラ１を用いることにより本発明の撮像装置を実現する。
但し、第３の実施の形態においては、ＣＰＵ１０は、音声認識処理部、音抽出部、減算処理部を有している。 F. Functions of Digital Camera 1 In the third embodiment, the imaging apparatus of the present invention is realized by using the digital camera 1 having the same configuration as that shown in FIG.
However, in the third embodiment, the CPU 10 includes a voice recognition processing unit, a sound extraction unit, and a subtraction processing unit.

図９は、第３の実施の形態における音声処理部１５及びＣＰＵ１０の機能を説明するための図である。
図９を見るとわかるように、音声処理部１５の内蔵マイク２１によって変換され、アンプ／ローパスフィルタ２２を介してＡ／Ｄ変換器２３から出力されるデジタル信号の音データ（音声と周辺音を含む音データ）は、ＣＰＵ１０の音声認識処理部３３に送られる。なお、音声処理部１５のＤ／Ａ変換器２４、ローパスフィルタ／アンプ２５、内蔵スピーカ２６は、図示を省略している。 FIG. 9 is a diagram for explaining the functions of the audio processing unit 15 and the CPU 10 in the third embodiment.
As can be seen from FIG. 9, the digital sound data (sound data including voice and ambient sound) picked up by the built-in microphone 21 of the audio processing unit 15 and output from the A / D converter 23 via the amplifier / low-pass filter 22 is sent to the speech recognition processing unit 33 of the CPU 10. The D / A converter 24, the low-pass filter / amplifier 25, and the built-in speaker 26 of the audio processing unit 15 are not shown.

そして、音声認識処理部３３は、該送られてきた音データ（音声と周辺音を含む音データ）に対して音声認識処理を行うことにより音データの音声部分を認識して、音抽出部３４に出力する。この音声認識により、音声（人の声）と、それ以外の周辺音（動物の鳴き声や、車の音、海の波の音、風によって木がなびく音）とを区別することが可能となる。この音声認識処理は既に周知技術なので説明を割愛する。
音抽出部３４は、音声認識処理部の認識結果に基づいて、該送られてきた音データ（音声と周辺音を含む音データ）から周辺音の音データを抽出する。この音抽出部３４は、送られてきた音データ（音声と周辺音を含む音データ）を減算処理部３１にそのまま出力すると共に、抽出した周辺音の音データも減算処理部３１に出力する。 Then, the voice recognition processing unit 33 performs voice recognition processing on the received sound data (sound data including voice and ambient sounds) to recognize the voice portion of the sound data, and outputs the result to the sound extraction unit 34. This voice recognition makes it possible to distinguish between voice (human voice) and other ambient sounds (animal calls, car sounds, the sound of ocean waves, trees rustling in the wind). Since this voice recognition processing is a well-known technique, its description is omitted.
The sound extraction unit 34 extracts sound data of peripheral sound from the transmitted sound data (sound data including sound and peripheral sound) based on the recognition result of the sound recognition processing unit. The sound extraction unit 34 outputs the received sound data (sound data including sound and peripheral sounds) to the subtraction processing unit 31 as it is, and also outputs the extracted sound data of the peripheral sounds to the subtraction processing unit 31.

The subtraction processing unit 31 subtracts the received ambient sound data from the received sound data (sound data containing both voice and ambient sound). The subtraction may remove the ambient sound completely, or it may leave the ambient sound faintly audible. The user may also be allowed to select the degree of subtraction freely.
The subtracted sound data output from the subtraction processing unit 31 is stored in the buffer memory and recorded in the flash memory 13.
This makes it possible to record with the volume of the ambient sound suppressed, so that the subject's voice is not drowned out by the ambient sound.
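The subtraction with a selectable degree can be illustrated with a minimal Python sketch. It assumes that the sound extraction unit 34 has already produced an ambient sound estimate of the same length as the microphone data; the function name `subtract_ambient`, the `degree` parameter, and its 0.0 to 1.0 range are hypothetical and are not specified in this document.

```python
def subtract_ambient(mixed, ambient, degree=1.0):
    """Return mixed minus a scaled ambient estimate, sample by sample.

    degree=1.0 removes the ambient estimate completely; smaller values
    leave some ambient sound audible. The parameter name and its range
    are illustrative assumptions, not taken from the patent.
    """
    if len(mixed) != len(ambient):
        raise ValueError("mixed and ambient must have the same length")
    if not 0.0 <= degree <= 1.0:
        raise ValueError("degree must be between 0.0 and 1.0")
    return [m - degree * a for m, a in zip(mixed, ambient)]


# Toy signals; the values are chosen to be exactly representable as floats.
voice = [0.5, -0.25, 0.75, 0.0]
ambient = [0.25, 0.25, -0.5, 0.5]
mixed = [v + a for v, a in zip(voice, ambient)]

full = subtract_ambient(mixed, ambient, degree=1.0)     # ambient removed
partial = subtract_ambient(mixed, ambient, degree=0.5)  # ambient halved
```

With `degree=1.0` the toy voice signal is recovered exactly; with `degree=0.5` half of the ambient estimate remains in the output. A real ambient estimate would not match the true ambient sound this precisely.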

G. Operation of Digital Camera 1
The operation of the digital camera 1 according to the third embodiment is described below with reference to the flowchart of FIG. 10.
When the user operates the mode switching key of the key input unit 14 to set the moving image shooting mode with sound, the process proceeds to step S91, where the CPU 10 starts the moving image capturing process and starts displaying a through image of the subject.

Next, in step S92, the CPU 10 determines whether to start video recording. This determination is made in the same way as in step S4 of FIG. 4.
If the CPU 10 determines in step S92 that video recording is not to be started, the process remains at step S92 until it determines that video recording is to be started. If it determines that video recording is to be started, the process proceeds to step S93, where the CPU 10 starts the moving image recording process. That is, it starts the process of recording, in the flash memory 13, the frame image data sequentially captured by the CCD 5 and stored in the buffer memory.

Next, in step S94, the CPU 10 starts the sound recording process. That is, it starts the process of sequentially storing in the buffer memory the sound data that has been converted into an electric signal by the built-in microphone 21 and then into a digital signal, and recording the stored sound data in the flash memory 13.
Next, in step S95, the voice recognition processing unit 33 of the CPU 10 performs voice recognition processing on the digital sound data acquired by the built-in microphone 21 and sequentially converted by the A/D converter 23, thereby recognizing the voice portion, and outputs the received sound data unchanged to the sound extraction unit 34.

Next, in step S96, the CPU 10 determines, based on the result of the voice recognition processing, whether the subject's voice is included.
If it determines in step S96 that the subject's voice is included, the process proceeds to step S97. There, the sound extraction unit 34 of the CPU 10 extracts, based on the voice recognition result, the ambient sound data from the sound data sent from the voice recognition processing unit 33, and the subtraction processing unit 31 subtracts the extracted ambient sound data from the received sound data. The process then proceeds to step S98. The subtracted sound data is stored in the buffer memory and recorded in the flash memory 13.

If, on the other hand, it determines in step S96 that the subject's voice is not included, the process proceeds directly to step S98. In this case, the sound extraction unit 34 and the subtraction processing unit 31 pass the received sound data through, so the sound data sent to the voice recognition processing unit 33 is stored in the buffer memory and recorded in the flash memory 13 as is, without subtraction.
In step S98, the CPU 10 determines whether to end video recording. This determination is made in the same way as in step S10 of FIG. 4.
If it determines in step S98 that video recording is not to be ended, the process returns to step S96. If it determines in step S98 that video recording is to be ended, the process proceeds to step S99, where the CPU 10 generates a moving image file with sound based on the captured and recorded frame image data and the recorded sound data.
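The branch at steps S96 and S97 can be sketched as a loop over blocks of microphone data. This is a simplified illustration only: `contains_voice` and `extract_ambient` are hypothetical callables standing in for the voice recognition processing unit 33 and the sound extraction unit 34, whose internals this document does not describe, and the toy detectors below are placeholders rather than real recognition.

```python
def process_blocks(blocks, contains_voice, extract_ambient):
    """Apply the S96/S97 branch to each block and collect the output.

    contains_voice and extract_ambient are hypothetical stand-ins for
    units 33 and 34; they are assumptions made for this sketch.
    """
    recorded = []
    for block in blocks:
        if contains_voice(block):                  # step S96: voice present?
            ambient = extract_ambient(block)       # step S97: extract ambient
            block = [s - a for s, a in zip(block, ambient)]  # subtract it
        recorded.append(list(block))               # otherwise passed through
    return recorded


def toy_contains_voice(block):
    # Placeholder rule: treat any sample of magnitude >= 0.5 as voice.
    return max(abs(s) for s in block) >= 0.5


def toy_extract_ambient(block):
    # Placeholder rule: assume a constant ambient level of 0.25.
    return [0.25] * len(block)


blocks = [[0.25, 0.25], [0.75, 1.0]]
recorded = process_blocks(blocks, toy_contains_voice, toy_extract_ambient)
```

In this toy run the first block contains no voice and is recorded unchanged, while the second block has the assumed ambient level subtracted before it is recorded.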

As described above, in the third embodiment, voice recognition is performed on the sound data collected by the built-in microphone 21, and the sound data of ambient sounds other than voice is subtracted from the sound data collected by the built-in microphone 21 based on the voice recognition result. Therefore, even when the ambient sound is loud, the subject's voice is not drowned out by the ambient sound, the sound can be recorded so that the subject's voice is the loudest, and the accuracy of recording and playback can be improved.

In the third embodiment, the ambient sound data is subtracted from the sound data collected by the built-in microphone 21 before the sound data is recorded. Alternatively, the sound data may be recorded as is, without subtraction, at the time of recording, and voice recognition processing may be performed at the time of playback so that the ambient sound data is subtracted from the sound data being played back.
Alternatively, the sound data collected by the built-in microphone 21 may be recorded as is, and after recording, voice recognition processing may be performed to generate and record sound data from which the ambient sound has been subtracted. This reduces the processing load at the time of shooting and at the time of playback.

Further, when the photographer and the subject converse, for example, the voice of the photographer, who is closer to the built-in microphone 21, is louder. Voice recognition may therefore be performed on the sound data collected by the built-in microphone 21 to distinguish a voice with a high sound level (a loud voice) from a voice with a low sound level (a quiet voice). Then, so that the two voices have the same sound level, either the voice data with the high sound level may be subtracted from the sound data collected by the built-in microphone 21, or the voice with the low sound level may be amplified to raise its sound level. This allows the voices of the photographer and the subject to be recorded and played back at the same volume, improving recording accuracy and playback accuracy.
In this case as well, the user may be allowed to specify the degree of subtraction and the degree of amplification.
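The two ways of matching sound levels can be sketched as follows. The sketch assumes that the loud and quiet voices have already been separated into two sample lists, which this document does not explain how to do, and it uses peak amplitude as the sound level. Attenuation is expressed as a gain factor, which is equivalent to subtracting a fraction (1 - gain) of the louder voice; the names `peak`, `equalize_levels`, and `mode` are hypothetical.

```python
def peak(samples):
    """Peak absolute amplitude, used here as a simple sound level."""
    return max(abs(s) for s in samples)


def equalize_levels(loud, quiet, mode="attenuate"):
    """Bring two voices to the same peak level.

    mode="attenuate" scales the louder voice down to the quieter level;
    mode="amplify" scales the quieter voice up to the louder level.
    Both modes and the peak measure are assumptions for illustration.
    """
    loud_peak, quiet_peak = peak(loud), peak(quiet)
    if loud_peak == 0 or quiet_peak == 0:
        return list(loud), list(quiet)  # nothing meaningful to match
    if mode == "attenuate":
        gain = quiet_peak / loud_peak
        return [s * gain for s in loud], list(quiet)
    if mode == "amplify":
        gain = loud_peak / quiet_peak
        return list(loud), [s * gain for s in quiet]
    raise ValueError("mode must be 'attenuate' or 'amplify'")


photographer = [1.0, -0.5]   # closer to the microphone, so louder
subject = [0.25, -0.125]     # farther away, so quieter

attenuated = equalize_levels(photographer, subject, mode="attenuate")
amplified = equalize_levels(photographer, subject, mode="amplify")
```

After either call both voices share the same peak level. A user-selectable degree could be added by moving the gain only part of the way toward the target level.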

In the first and third embodiments, voice recognition may be performed on the sound data collected by the built-in microphone 21, and the sound level of the voice may be raised for recording and playback, either without performing subtraction on the sound data collected by the built-in microphone 21 or together with such subtraction.

The above embodiments may also be combined in any way.
When the first embodiment and the third embodiment are combined, the portion of the ambient sound data other than the sound data played back during recording may be subtracted from the sound data collected by the built-in microphone 21, and the sound data played back during recording may be subtracted as well.
For example, when music with vocals is played back during recording, the played-back sound may be split into a part that is recognized as ambient sound and a part that is not, so the played-back music cannot be subtracted properly. The played-back music and the ambient sound other than that music are therefore handled separately, and each is subtracted from the sound data collected by the built-in microphone 21. This improves playback accuracy.
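The separate handling of played-back music and other ambient sound can be sketched as two successive subtractions. The sketch makes a strong simplifying assumption that the microphone picks up the played-back data exactly as emitted (no delay, gain change, or room response) and that an estimate of the remaining ambient sound is available; the function name `subtract_playback_and_ambient` is hypothetical.

```python
def subtract_playback_and_ambient(mic, playback, other_ambient):
    """Subtract the known playback data, then the remaining ambient estimate.

    Assumes the playback reaches the microphone unchanged, which is a
    simplification made for this sketch.
    """
    if not (len(mic) == len(playback) == len(other_ambient)):
        raise ValueError("all inputs must have the same length")
    after_playback = [m - p for m, p in zip(mic, playback)]
    return [x - a for x, a in zip(after_playback, other_ambient)]


# Toy signals; the values are chosen to be exactly representable as floats.
voice = [0.5, 0.0, -0.5]
playback = [0.25, -0.25, 0.25]         # music emitted during recording
other_ambient = [0.125, 0.125, 0.125]  # ambient sound other than the music
mic = [v + p + a for v, p, a in zip(voice, playback, other_ambient)]

cleaned = subtract_playback_and_ambient(mic, playback, other_ambient)
```

Because the playback data is known to the device, it does not depend on whether voice recognition classifies vocals in the music as voice or as ambient sound.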

Each of the above embodiments describes moving image shooting with sound, but the invention is also applicable to still image shooting with sound and to simple sound recording without shooting.

The above embodiments of the present invention are merely examples of the best mode, described so that the principle and structure of the present invention can be better understood, and are not intended to limit the scope of the appended claims.
Accordingly, all of the various variations and modifications that can be made to the above embodiments of the present invention are to be understood as falling within the scope of the present invention and as protected by the appended claims.

Finally, each of the above embodiments describes the case where the imaging apparatus of the present invention is applied to the digital camera 1, but the invention is not limited to the above embodiments. In short, it is applicable to any device that can photograph a subject and record sound.

1 Digital camera
2 Shooting lens
3 Lens drive block
4 Aperture
5 CCD
6 Driver
7 TG
8 Unit circuit
9 Memory
10 CPU
11 DRAM
12 Image display unit
13 Flash memory
14 Key input unit
15 Audio processing unit
16 Bus

To achieve the above object, the imaging apparatus of the present invention comprises:
imaging means;
acquisition means for acquiring sound data collected by a microphone;
recognition means for performing voice recognition that recognizes a human voice in the acquired sound data;
subtracting means for subtracting sounds other than the human voice from the acquired sound data based on the result of the voice recognition; and
recording control means for recording, in a recording device, the image data captured by the imaging means in association with the sound data from which the sounds other than the human voice have been subtracted.

According to the present invention, it is possible to improve the playback accuracy of recorded sound.

Claims

An imaging apparatus comprising:
imaging means for imaging a subject;
a speaker that emits sound;
sound emission control means for emitting sound data from the speaker;
a microphone that collects sound;
recording control means for recording, in recording means, image data captured by the imaging means in association with sound data collected by the microphone; and
subtracting means for subtracting the sound data emitted by the sound emission control means from the sound data collected by the microphone while the sound data collected by the microphone is being recorded in the recording means by the recording control means.

2. The imaging apparatus according to claim 1, wherein
the subtracting means performs the subtraction before the sound data collected by the microphone is recorded in the recording means, and
the recording control means records the sound data subtracted by the subtracting means in the recording means.

The imaging apparatus according to claim 1, wherein the subtracting means performs the subtraction after the sound data collected by the microphone has been recorded in the recording means.

4. The imaging apparatus according to claim 3, further comprising reproducing means for reproducing the sound data recorded by the recording control means, wherein
the subtracting means performs the subtraction when the sound data recorded in the recording means is reproduced, and
the reproducing means reproduces the sound data subtracted by the subtracting means.

5. The imaging apparatus according to any one of the above claims, further comprising generating means for generating sound data having an opposite phase to the sound data emitted from the speaker, wherein
the speaker includes a first speaker directed toward the subject and a second speaker directed toward the photographer, and
the sound emission control means emits the sound data from the first speaker and emits the opposite-phase sound data generated by the generating means from the second speaker.

6. The imaging apparatus according to claim 1, further comprising voice recognition means for performing voice recognition on the sound data collected by the microphone, wherein
the subtracting means subtracts, based on the recognition result of the voice recognition means, ambient sounds other than voice from the sound data collected by the microphone while the sound data collected by the microphone is being recorded in the recording means by the recording control means.

The imaging apparatus according to claim 6, wherein the subtracting means subtracts, from the sound data collected by the microphone, the portion of the ambient sounds other than voice that is not the sound data emitted by the sound emission control means, and also subtracts the sound data emitted by the sound emission control means from the sound data collected by the microphone.

Voice recognition means for performing voice recognition on the sound data collected by the microphone;
The subtracting means is
While the sound data collected by the microphone is being recorded in the recording means by the recording control means, a voice having a high sound level among a plurality of sounds is based on the recognition result by the voice recognition means. 6. The imaging apparatus according to claim 1, wherein subtraction is performed from sound data collected by the microphone.

A program to be executed by an imaging apparatus that includes an imaging unit that images a subject, a speaker that emits sound, and a microphone that collects sound, the program comprising:
sound emission control processing for emitting sound data from the speaker;
recording control processing for recording, on a recording medium, image data captured by the imaging unit in association with sound data collected by the microphone; and
subtraction processing for subtracting the sound data emitted by the sound emission control processing from the sound data collected by the microphone while the sound data collected by the microphone is being recorded on the recording medium by the recording control processing.