CN105261365A

CN105261365A - Audio output method and device

Info

Publication number: CN105261365A
Application number: CN201510587775.8A
Authority: CN
Inventors: 石武
Original assignee: Beijing Yunzhisheng Information Technology Co Ltd
Current assignee: Beijing Yunzhisheng Information Technology Co Ltd
Priority date: 2015-09-15
Filing date: 2015-09-15
Publication date: 2016-01-20
Also published as: WO2017045413A1

Abstract

The invention discloses an audio output method and device which are used for realizing multipath audio data collection in voice recognition. The method comprises the steps of: receiving multipath audio data; converting the multipath audio data into single path audio data; and transmitting the single path audio data to a processor through an audio output interface. According to the technical scheme, the received multipath audio data is converted into the single path audio data, so that the multipath audio data can be received in the voice recognition, the problem that a central processor of a voice recognition application platform has only one set of audio input interface and fails to receive the multipath audio data is solved, and the requirements of voice recognition are met to the largest degree.

Description

A kind of audio-frequency inputting method and device

Technical field

The present invention relates to multimedia technology field, particularly relate to a kind of audio-frequency inputting method and device.

Background technology

The collection of voice data, process and transmission are the important component parts of multimedia technology.Along with developing rapidly of multimedia technology, numerous digital audio systems enters consumption market, such as Digital Audio Tape, Digital Sound Processor etc.For equipment and manufacturer, standardized information transmission structure can improve the adaptability of system.IIS (Inter-ICSound, audio input interface) bus, as a kind of bus standard of the audio data transmission between digital audio-frequency apparatus, is responsible for the data transmission between audio frequency apparatus, is widely used in various multimedia system.In correlation technique, in the application of speech recognition, a lot of Mike (6 Mikes or 8 Mikes) is needed to input, be convenient to do source of sound location and noise elimination etc., but, the CPU (CenterProcessUnit, central processing unit) being used for doing speech recognition only has one group of audio input interface IIS usually, cannot meet the demand of speech recognition.

Summary of the invention

The embodiment of the present invention provides a kind of audio-frequency inputting method and device, for realizing the collection to multi-path audio-frequency data in speech recognition.

A kind of audio-frequency inputting method, is applied to field programmable gate array, comprises the following steps:

Receive the multi-path audio-frequency data of multichannel voice frequency input interface input;

Described multi-path audio-frequency data is converted into SCVF single channel voice frequency data;

Described SCVF single channel voice frequency data are exported by SCVF single channel voice frequency output interface.

In one embodiment, the multi-path audio-frequency data of described reception multichannel voice frequency input interface input, comprising:

According to the quantity of described multichannel voice frequency input interface and the first sampling rate of default described SCVF single channel voice frequency data, determine the second sampling rate of described multi-path audio-frequency data;

According to described second sampling rate, determine the clock number that described multi-path audio-frequency data single inputs;

The multi-path audio-frequency data of described multichannel voice frequency input interface input is received according to described clock number.

In one embodiment, described described multi-path audio-frequency data is converted into SCVF single channel voice frequency data, comprises:

According to multi-path audio-frequency data described in described clock number buffer memory;

The preset order of the multi-path audio-frequency data of described buffer memory according to described multichannel voice frequency input interface is sorted, obtains the SCVF single channel voice frequency data after sequence.

In one embodiment, the described SCVF single channel voice frequency data of described output, comprising:

Export the SCVF single channel voice frequency data after described sequence.

The multi-path audio-frequency data of multichannel voice frequency input interface input is received at the rising edge of clock;

Describedly export described SCVF single channel voice frequency data by SCVF single channel voice frequency output interface, comprising:

Described SCVF single channel voice frequency data are exported by SCVF single channel voice frequency output interface at the negative edge of described clock.

The multi-path audio-frequency data of multichannel voice frequency input interface input is received according to prefixed time interval.

Some beneficial effects of the embodiment of the present invention can comprise:

Technique scheme, export by the multi-path audio-frequency data received is converted into SCVF single channel voice frequency data, make can receive in speech recognition and output multi-channel voice data, when solving speech recognition, central processing unit only has one group of audio input interface and causes receiving the problem of multi-path audio-frequency data, meets the demand of speech recognition to a greater extent.

A kind of audio-frequency inputting method, is applied to central processing unit, comprises the following steps:

When receiving SCVF single channel voice frequency data, be multi-path audio-frequency data by described SCVF single channel voice frequency Data Division;

Export described multi-path audio-frequency data.

In one embodiment, described is multi-path audio-frequency data by described SCVF single channel voice frequency Data Division, comprising:

Obtain the quantity of multichannel voice frequency input interface;

According to described clock number, be multi-path audio-frequency data by described SCVF single channel voice frequency Data Division.

In one embodiment, described SCVF single channel voice frequency data are single channel left audio data or single channel right audio channel data; Described when receiving SCVF single channel voice frequency data, be multi-path audio-frequency data by described SCVF single channel voice frequency Data Division, comprise:

When receiving described single channel left audio data, be multichannel left audio data by described single channel left audio Data Division;

According to the described clock number of multi-path audio-frequency data single input and the clock number of the described SCVF single channel voice frequency data received, determine the first invalid data in the multichannel left audio data after described fractionation;

Abandon described first invalid data, obtain multichannel left audio valid data;

When receiving described single channel right audio channel data, be multichannel right audio channel data by described single channel right audio channel Data Division;

According to the described clock number of multi-path audio-frequency data single input and the clock number of the described SCVF single channel voice frequency data received, determine the second invalid data in the multichannel right audio channel data after described fractionation;

Abandon described second invalid data, obtain multichannel right audio channel valid data;

According to the corresponding relation between described multichannel left audio valid data and multichannel right audio channel valid data, in conjunction with described multichannel left audio valid data and described multichannel right audio channel valid data, obtain described multi-path audio-frequency data.

Technique scheme, by being that multi-path audio-frequency data exports by the SCVF single channel voice frequency received Data Division, the voice data received with single group audio input interface in speech recognition can be exported with multichannel form, thus complete voice data can be exported, when solving speech recognition, central processing unit only has one group of audio input interface and causes receiving the problem of multi-path audio-frequency data, meets the demand of speech recognition to a greater extent.

A kind of audio output device, be applied to field programmable gate array, described device comprises:

Receiver module, for receiving the multi-path audio-frequency data of multichannel voice frequency input interface input;

Conversion module, for being converted into SCVF single channel voice frequency data by described multi-path audio-frequency data;

First output module, for exporting described SCVF single channel voice frequency data by SCVF single channel voice frequency output interface.

In one embodiment, described receiver module comprises:

First determines submodule, for according to the quantity of described multichannel voice frequency input interface and the first sampling rate of default described SCVF single channel voice frequency data, determines the second sampling rate of described multi-path audio-frequency data;

Second determines submodule, for according to described second sampling rate, determines the clock number that described multi-path audio-frequency data single inputs;

First receives submodule, for receiving the multi-path audio-frequency data of described multichannel voice frequency input interface input according to described clock number.

In one embodiment, described conversion module comprises:

First cache sub-module, for according to multi-path audio-frequency data described in described clock number buffer memory;

Sorting sub-module, for the preset order of the multi-path audio-frequency data of described buffer memory according to described multichannel voice frequency input interface being sorted, obtains the SCVF single channel voice frequency data after sequence.

In one embodiment, described first output module comprises:

First output sub-module, for exporting the SCVF single channel voice frequency data after described sequence.

In one embodiment, described receiver module comprises:

Second receives submodule, for receiving the multi-path audio-frequency data of multichannel voice frequency input interface input at the rising edge of clock;

Described first output module, comprising:

Second output sub-module, exports described SCVF single channel voice frequency data for the negative edge at described clock by SCVF single channel voice frequency output interface.

In one embodiment, described receiver module comprises:

3rd receives submodule, for receiving the multi-path audio-frequency data of multichannel voice frequency input interface input according to prefixed time interval.

Said apparatus, export by the multi-path audio-frequency data received is converted into SCVF single channel voice frequency data, make can receive in speech recognition and output multi-channel voice data, when solving speech recognition, central processing unit only has one group of audio input interface and causes receiving the problem of multi-path audio-frequency data, meets the demand of speech recognition to a greater extent.

A kind of audio output device, be applied to central processing unit, described device comprises:

Splitting module, for when receiving SCVF single channel voice frequency data, is multi-path audio-frequency data by described SCVF single channel voice frequency Data Division;

Second output module, for exporting described multi-path audio-frequency data.

In one embodiment, described fractionation module comprises:

Obtain submodule, for obtaining the quantity of multichannel voice frequency input interface;

3rd determines submodule, for according to the quantity of described multichannel voice frequency input interface and the first sampling rate of default described SCVF single channel voice frequency data, determines the second sampling rate of described multi-path audio-frequency data;

4th determines submodule, for according to described second sampling rate, determines the clock number that described multi-path audio-frequency data single inputs;

First splits submodule, for according to described clock number, is multi-path audio-frequency data by described SCVF single channel voice frequency Data Division.

In one embodiment, described SCVF single channel voice frequency data are single channel left audio data or single channel right audio channel data; Described fractionation module comprises:

Second splits submodule, for when receiving described single channel left audio data, is multichannel left audio data by described single channel left audio Data Division;

5th determines submodule, for according to the described clock number of multi-path audio-frequency data single input and the clock number of the described SCVF single channel voice frequency data received, determines the first invalid data in the multichannel left audio data after described fractionation;

First abandons submodule, for abandoning described first invalid data, obtains multichannel left audio valid data;

3rd splits submodule, for when receiving described single channel right audio channel data, is multichannel right audio channel data by described single channel right audio channel Data Division;

6th determines submodule, for according to the described clock number of multi-path audio-frequency data single input and the clock number of the described SCVF single channel voice frequency data received, determines the second invalid data in the multichannel right audio channel data after described fractionation;

Second abandons submodule, for abandoning described second invalid data, obtains multichannel right audio channel valid data;

Zygote module, for according to the corresponding relation between described multichannel left audio valid data and multichannel right audio channel valid data, in conjunction with described multichannel left audio valid data and described multichannel right audio channel valid data, obtain described multi-path audio-frequency data.

Said apparatus, by being that multi-path audio-frequency data exports by the SCVF single channel voice frequency received Data Division, the voice data received with single group audio input interface in speech recognition can be exported with multichannel form, thus complete voice data can be exported, when solving speech recognition, central processing unit only has one group of audio input interface and causes receiving the problem of multi-path audio-frequency data, meets the demand of speech recognition to a greater extent.

Other features and advantages of the present invention will be set forth in the following description, and, partly become apparent from instructions, or understand by implementing the present invention.Object of the present invention and other advantages realize by structure specifically noted in write instructions, claims and accompanying drawing and obtain.

Below by drawings and Examples, technical scheme of the present invention is described in further detail.

Accompanying drawing explanation

Accompanying drawing is used to provide a further understanding of the present invention, and forms a part for instructions, together with embodiments of the present invention for explaining the present invention, is not construed as limiting the invention.In the accompanying drawings:

Fig. 1 is the process flow diagram of a kind of audio-frequency inputting method in the embodiment of the present invention;

Fig. 2 is the process flow diagram of a kind of audio-frequency inputting method in the embodiment of the present invention;

Fig. 3 is the process flow diagram of a kind of audio-frequency inputting method in the embodiment of the present invention;

Fig. 4 is the process flow diagram of step S32 in a kind of audio-frequency inputting method in the embodiment of the present invention;

Fig. 5 is the process flow diagram of step S32 in a kind of audio-frequency inputting method in the embodiment of the present invention;

Fig. 6 is the block diagram of a kind of audio-frequency inputting method sound intermediate frequency recognition system in the embodiment of the present invention;

Fig. 7 is the sequential chart of IIS signal in a kind of audio-frequency inputting method in the embodiment of the present invention;

Fig. 8 is the process flow diagram of a kind of audio-frequency inputting method in the present invention one specific embodiment;

Fig. 9 is the block diagram of a kind of audio output device in the embodiment of the present invention;

Figure 10 is the block diagram of the first receiver module in a kind of audio output device in the embodiment of the present invention;

Figure 11 is the block diagram of conversion module in a kind of audio output device in the embodiment of the present invention;

Figure 12 is the block diagram of a kind of audio output device in the embodiment of the present invention;

Figure 13 is the block diagram of a kind of audio output device in the embodiment of the present invention;

Figure 14 is the block diagram splitting module in the embodiment of the present invention in a kind of audio output device;

Figure 15 is the block diagram splitting module in the embodiment of the present invention in a kind of audio output device.

Embodiment

Below in conjunction with accompanying drawing, the preferred embodiments of the present invention are described, should be appreciated that preferred embodiment described herein is only for instruction and explanation of the present invention, is not intended to limit the present invention.

A kind of audio-frequency inputting method that the embodiment of the present invention provides relates to two executive agents, comprise on-site programmable gate array FPGA and central processor CPU, wherein, FPGA is used for receiving multi-path audio-frequency data from multichannel voice frequency input interface, and multi-path audio-frequency data is converted into SCVF single channel voice frequency data and exports to CPU, it is that multi-path audio-frequency data exports that CPU is used for the SCVF single channel voice frequency Data Division received, thus realizes the identification to multi-path audio-frequency data in speech recognition.Below from the angle of FPGA and CPU two executive agents, a kind of audio-frequency inputting method that the embodiment of the present invention provides is described respectively respectively.

FPGA side

Fig. 1 is the process flow diagram of a kind of audio-frequency inputting method in the embodiment of the present invention.As shown in Figure 1, the method is used in FPGA, and wherein, be provided with multiple serial buffer corresponding with multichannel voice frequency input interface, transmit block and SCVF single channel voice frequency output interface in FPGA, this audio-frequency inputting method comprises the following steps S11-S13:

Step S11, receives the multi-path audio-frequency data of multichannel voice frequency input interface input.

In this step, the multi-path audio-frequency data of multichannel voice frequency input interface input can be received according to prefixed time interval.Such as, prefixed time interval is set to 10ms.

Step S12, is converted into SCVF single channel voice frequency data by multi-path audio-frequency data.

Step S13, exports SCVF single channel voice frequency data by SCVF single channel voice frequency output interface.

In one embodiment, as shown in Figure 2, said method also can be embodied as following steps S21-S26:

Step S21, according to the quantity of multichannel voice frequency input interface and the first sampling rate of default SCVF single channel voice frequency data, determines the second sampling rate of multi-path audio-frequency data.

Such as, the quantity of multichannel voice frequency input interface is 3, first sampling rate of the SCVF single channel voice frequency data preset is 96KHz, in embodiments of the present invention, voice data divides left audio data and right audio channel data, and therefore, 3 multichannel voice frequency input interfaces can receive the audio frequency input of 6 Mikes, then the second sampling rate of multi-path audio-frequency data is 1/6 of the first sampling rate, i.e. 16KHz.Wherein, 16KHz and 96KHz is the sampling rate of standard.

Step S22, according to the second sampling rate, determines the clock number that multi-path audio-frequency data single inputs.

In this step, because FPGA once samples at the rising edge of each clock, therefore, when the second sampling rate is 16KHz, the clock number of multi-path audio-frequency data single input is 16 clocks.

Step S23, receives the multi-path audio-frequency data of multichannel voice frequency input interface input according to clock number.

Step S24, according to clock number buffer memory multi-path audio-frequency data.

In this step, in FPGA, there is the serial buffer corresponding with multichannel voice frequency input interface, be used for the multi-path audio-frequency data that buffer memory multichannel voice frequency input interface receives,

Step S25, sorts the preset order of the multi-path audio-frequency data of buffer memory according to multichannel voice frequency input interface, obtains the SCVF single channel voice frequency data after sequence.

Such as, FPGA has these three interfaces of audio input interface 1, audio input interface 2 and audio input interface 3, and preset order is audio input interface 1-audio input interface 2-audio input interface 3, when then the multi-path audio-frequency data of buffer memory being sorted, the voice data that audio input interface 1 receives comes, then be the data that audio input interface 2 receives, last then be the data that audio input interface 3 receives, by above-mentioned sort method, multi-path audio-frequency data SCVF single channel voice frequency data are changed into.

Step S26, exports the SCVF single channel voice frequency data after sequence.

In this step, SCVF single channel voice frequency data are sent by the transmit block in FPGA, export to CPU through SCVF single channel voice frequency output interface, then are that multi-path audio-frequency data exports by CPU by SCVF single channel voice frequency Data Division, finally obtain complete multi-path audio-frequency data.

In said method, step S21-S23 is a kind of embodiment of step S11, and step S24-S25 is a kind of embodiment of step S12, and step S26 is a kind of embodiment of step S13.In specific implementation process, step S21-S26 circulation is carried out, and the operation that serial buffer buffered audio data in FPGA and transmit block send voice data is carried out simultaneously.In addition, because the voice data related in the embodiment of the present invention is divided into left audio data and right audio channel data, therefore FPGA needs to circulate above-mentioned flow process twice, and the left audio data exported respectively for twice and right audio channel data combine through CPU, and split into multi-path audio-frequency data, just can complete the acquisition of a frame voice data.

In one embodiment, step S11 can be embodied as following steps: the multi-path audio-frequency data receiving the input of multichannel voice frequency input interface at the rising edge of clock; Now, step S13 can be embodied as following steps: export SCVF single channel voice frequency data at the negative edge of same clock by SCVF single channel voice frequency output interface.Adopt the technical scheme of the present embodiment, can guarantee once to sample and export to carry out in same clock.

Adopt the technical scheme that the embodiment of the present invention provides, export by the multi-path audio-frequency data received is converted into SCVF single channel voice frequency data, make can receive in speech recognition and output multi-channel voice data, when solving speech recognition, central processing unit only has one group of audio input interface and causes receiving the problem of multi-path audio-frequency data, meets the demand of speech recognition to a greater extent.

In above-mentioned any embodiment, the circuit of FPGA composition all comprises a reset signal, resets, to guarantee the accuracy of system works when system powers at every turn to data.

CPU side

Fig. 3 is the process flow diagram of a kind of audio-frequency inputting method in the embodiment of the present invention.As shown in Figure 3, the method is used for CPU side, comprises the following steps S31-S32:

SCVF single channel voice frequency Data Division, when receiving SCVF single channel voice frequency data, is multi-path audio-frequency data by step S31.

In this step, the SCVF single channel voice frequency data that CPU receives are the SCVF single channel voice frequency data that the SCVF single channel voice frequency output interface in FPGA exports.

Step S32, output multi-channel voice data.

In one embodiment, as shown in Figure 4, step S31 can be embodied as following steps S311-S314:

Step S311, obtains the quantity of multichannel voice frequency input interface.

Step S312, according to the quantity of multichannel voice frequency input interface and the first sampling rate of default SCVF single channel voice frequency data, determines the second sampling rate of multi-path audio-frequency data.

Such as, the quantity of multichannel voice frequency input interface is 3, first sampling rate of the SCVF single channel voice frequency data preset is 96KHz, in embodiments of the present invention, voice data divides left audio data and right audio channel data, and therefore, 3 multichannel voice frequency input interfaces can receive the audio frequency input of 6 Mikes, then the second sampling rate of multi-path audio-frequency data is 6 frequency divisions of the first sampling rate, i.e. 16KHz.Wherein, 16KHz and 96KHz is the sampling rate of standard.

Step S313, according to the second sampling rate, determines the clock number that multi-path audio-frequency data single inputs.

SCVF single channel voice frequency Data Division, according to clock number, is multi-path audio-frequency data by step S314.

In one embodiment, because left audio data and right audio channel data gather respectively, therefore, as shown in Figure 5, step S31 can be embodied as following steps S51-S57:

Single channel left audio Data Division, when receiving single channel left audio data, is multichannel left audio data by step S51.

Step S52, according to the clock number of multi-path audio-frequency data single input and the clock number of the SCVF single channel voice frequency data received, determines the first invalid data in the multichannel left audio data after splitting.

In this step, the difference of the clock number of the clock number of the SCVF single channel voice frequency data that CPU receives and the input of multi-path audio-frequency data single, is the figure place of the first invalid data in the multichannel left audio data after fractionation.

Step S53, abandons the first invalid data, obtains multichannel left audio valid data.

Single channel right audio channel Data Division, when receiving single channel right audio channel data, is multichannel right audio channel data by step S54.

Step S55, according to the clock number of multi-path audio-frequency data single input and the clock number of the SCVF single channel voice frequency data received, determines the second invalid data in the multichannel right audio channel data after splitting.

In this step, the difference of the clock number of the clock number of the SCVF single channel voice frequency data that CPU receives and the input of multi-path audio-frequency data single, is the figure place of the second invalid data in the multichannel right audio channel data after fractionation.

Step S56, abandons the second invalid data, obtains multichannel right audio channel valid data.

Step S57, according to the corresponding relation between multichannel left audio valid data and multichannel right audio channel valid data, in conjunction with multichannel left audio valid data and multichannel right audio channel valid data, obtains multi-path audio-frequency data.

In this embodiment, be multichannel left audio data by single channel left audio Data Division, or when being multichannel right audio channel data by single channel right audio channel Data Division, be all carry out splitting according to the clock number of multi-path audio-frequency data single input.

Adopt the technical scheme that the embodiment of the present invention provides, by being that multi-path audio-frequency data exports by the SCVF single channel voice frequency received Data Division, the voice data received with single group audio input interface in speech recognition can be exported with multichannel form, thus complete voice data can be collected, when solving speech recognition, central processing unit only has one group of audio input interface and causes receiving the problem of multi-path audio-frequency data, meets the demand of speech recognition to a greater extent.

Below by way of a specific embodiment, audio-frequency inputting method provided by the invention is described.

In this specific embodiment, audio recognition systems comprises on-site programmable gate array FPGA and central processor CPU, and the block diagram of this system as shown in Figure 6.As can be seen from Figure 6, this audio recognition systems has 6 Mike's inputs, be convenient to do source of sound location and noise elimination etc., FPGA61 has 3 groups of voice datas that 3 multichannel voice frequency input interfaces (interface 1, interface 2 and interface 3) and a SCVF single channel voice frequency output interface 4,3 multichannel voice frequency input interfaces receive 6 Mikes' inputs respectively.Wherein, 3 the serial buffers (serial buffer 1, serial buffer 2 and serial buffer 3) corresponding respectively with interface 1, interface 2 and interface 3 are had in FPGA61.This audio recognition systems adopts serial digital audio bus protocol IIS (Inter-ICSoundbus), and IIS signal is defined as follows shown in table 1, and sequential chart as shown in Figure 7.

Table 1

Signal name	Sense	Signal description
			BCLK	Export	Position sampling clock
LRCK	Export	Left and right acoustic channels synchronous clock
			RXD	Input	Input signal
TXD	Export	Output signal

In this embodiment, the each 16bit of left and right acoustic channels, the sampling rate of SCVF single channel voice frequency data is set to 96KHz, namely LRCK is 96KHz, due to BCLK=LRCK* about data width * number of channels, wherein, data width is 16bit, left and right number of channels is 2 (voice data related in the present embodiment is stereo data, and therefore left and right number of channels is 2), and therefore BCLK is 3.072MHz.According to the quantity of multichannel voice frequency input interface and the sampling rate of default SCVF single channel voice frequency data, can determine that the sampling rate of multi-path audio-frequency data is 1/6 of the sampling rate of SCVF single channel voice frequency data, namely the clock signal of multi-path audio-frequency data carries out 6 frequency divisions by the clock signal of SCVF single channel voice frequency data to obtain, the sequential chart of SCVF single channel voice frequency data and multi-path audio-frequency data as shown in Figure 7, Fig. 7 shows BCLK in SCVF single channel voice frequency data, LRCK, the sequential chart of RXD and TXD, and the sequential chart of BCLK1 in multi-path audio-frequency data, wherein, BCLK1 represents the position sampling clock of a wherein road voice data in multi-path audio-frequency data.Represent the left and right acoustic channels synchronous clock of multi-path audio-frequency data with LRCKm (m=1/2/3), then LRCK1, LRCK2 and LRCK3 are 16KHz, and wherein, 16KHz and 96KHz is the sampling rate of standard.Simultaneously, can show that CPU62 often reads a sampled data, can read 96 clock signals, namely BCLK exports 96 clock signals at every turn, represent the position sampling clock of multi-path audio-frequency data with BCLKn (n=1/2/3), then BCLK1, BCLK2 and BCLK3 export 16 clock signals all at every turn.In addition, for enabling BCLK export 96 clock signals at every turn, CPU62 needs each data reading 12 bytes from the SCVF single channel voice frequency output interface 4 of FPGA61.

Fig. 8 is the process flow diagram of a kind of audio-frequency inputting method in this specific embodiment.As shown in Figure 8, S801-S814 is comprised the following steps:

Step S801, audio recognition systems powers on.

Step S802, FPGA61, according to the sampling rate 96KHz of the quantity of multichannel voice frequency input interface with the SCVF single channel voice frequency data preset, determine that the sampling rate of multi-path audio-frequency data is 16KHz.

Step S803, FPGA61, according to the sampling rate of multi-path audio-frequency data, determine that the clock number that multi-path audio-frequency data single inputs is 16 clocks.

Multichannel voice frequency input interface 1 in step S804, FPGA61, interface 2 and interface 3 synchronously receive multichannel left audio data.Wherein, the multichannel left audio data received are inputted by Mike and are converted audio analog signals to digital signal through analog to digital converter (A/D converter) and obtain.

3 serial buffers in step S805, FPGA61 are the left audio data of 16 clocks that receive of cache interface 1, interface 2 and interface 3 respectively, and CPU62 receives the sampled data of 12 bytes from the SCVF single channel voice frequency output interface 4 of FPGA61 simultaneously.Wherein, the left audio data that serial buffer 1 cache interface 1 receives, the left audio data that serial buffer 2 cache interface 2 receives, the left audio data that serial buffer 3 cache interface 3 receives, the left audio data of buffer memory use WORD1 [15..0] respectively, and WORD2 [15..0] and WORD3 [15..0] represents.The sampled data that CPU62 reads the data of buffer memory in serial buffer to be sent by transmit block to obtain, because this step is buffered audio data first, therefore in transmit block, valid data can not be sent to CPU62, that is, the sampled data that CPU62 reads is invalid data.

The invalid data of reading abandons by step S806, CPU62.

Step S807,3 serial buffers in FPGA61 are the right audio channel data of 16 clocks that receive of cache interface 1, interface 2 and interface 3 respectively, and the left audio data of buffer memory last in serial buffer are sorted, the left audio data after sequence send by transmit block simultaneously.In this step, it is that preset order according to serial buffer sorts that the left audio data of FPGA61 to buffer memory last in serial buffer sort, such as, in serial buffer 1, the left audio data of buffer memory come foremost, then be the left audio data of buffer memory in serial buffer 2, finally the left audio data of buffer memory in serial buffer 3, the right audio channel data of buffer memory use WORD1 [31..16] respectively, and WORD2 [31..16] and WORD3 [31..16] represents.

Step S808, CPU62 receives the left audio data of 12 bytes from the SCVF single channel voice frequency interface 4 of FPGA61, and the left audio data received are split, obtain the left channel audio signal WORD1 [15..0] of 16 clocks respectively, WORD2 [15..0] and WORD3 [15..0].

Step S809, CPU62 abandon WORD1 [15..0], the invalid data in WORD2 [15..0] and WORD3 [15..0], obtain left audio valid data.What receive due to CPU62 only has left audio data, but CPU62 receives the sampled data of 12 bytes at every turn, i.e. 96 clock signals, therefore signal WORD1 [15..0], in WORD2 [15..0] and WORD3 [15..0], front 6 bytes are left audio valid data respectively, and rear 6 bytes are then invalid data.

Step S810,3 serial buffers in FPGA61 are the left audio data of 16 clocks that receive of cache interface 1, interface 2 and interface 3 respectively, and the right audio channel data of buffer memory last in serial buffer are sorted, the right audio channel data after sequence send by transmit block simultaneously.The left audio data of buffer memory use WORD1 [15..0] respectively, and WORD2 [15..0] and WORD3 [15..0] represents.

Step S811, CPU62 receives the right audio channel data of 12 bytes from the SCVF single channel voice frequency interface 4 of FPGA61, and the right audio channel data received are split, obtain the right channel audio signal WORD1 [31..16] of 16 clocks respectively, WORD2 [31..16] and WORD3 [31..16].

Step S812, CPU62 abandon WORD1 [31..16], the invalid data in WORD2 [31..16] and WORD3 [31..16], obtain right audio channel valid data.What receive due to CPU62 only has right audio channel data, but CPU62 receives the sampled data of 12 bytes at every turn, i.e. 96 clock signals, therefore signal WORD1 [31..16], WORD2 [31..16] and WORD3 is [in 31..16, front 6 bytes are right audio channel valid data respectively, and rear 6 bytes are then invalid data.

Step S813, CPU62 is according to the corresponding relation between each road left audio valid data and each road right audio channel valid data, the left audio valid data of acquisition and right audio channel valid data are combined, obtains the multi-path audio-frequency data that a frame is complete.Return step step S807.

Step S814, CPU62 output multi-channel voice data.

Adopt the technical scheme in the present embodiment, by FPGA, the multi-path audio-frequency data received is converted into SCVF single channel voice frequency data to export, SCVF single channel voice frequency data are received again by CPU, and split into multi-path audio-frequency data and export, when solving speech recognition, CPU only has one group of audio input interface and causes receiving the problem of multi-path audio-frequency data, meets the demand of speech recognition to a greater extent.

Corresponding to a kind of audio-frequency inputting method in above-described embodiment, the present invention also provides a kind of audio output device, uses to perform the above method.

Fig. 9 is the block diagram of a kind of audio output device in the embodiment of the present invention.As shown in Figure 9, this device is used for on-site programmable gate array FPGA, comprising:

Receiver module 91, for receiving the multi-path audio-frequency data of multichannel voice frequency input interface input;

Conversion module 92, for being converted into SCVF single channel voice frequency data by multi-path audio-frequency data;

First output module 93, for exporting SCVF single channel voice frequency data by SCVF single channel voice frequency output interface.

In one embodiment, as shown in Figure 10, receiver module 91 comprises:

First determines submodule 911, for according to the quantity of multichannel voice frequency input interface and the first sampling rate of default SCVF single channel voice frequency data, determines the second sampling rate of multi-path audio-frequency data;

Second determines submodule 912, for according to the second sampling rate, determines the clock number that multi-path audio-frequency data single inputs;

First receives submodule 913, for receiving the multi-path audio-frequency data of multichannel voice frequency input interface input according to clock number.

In one embodiment, as shown in figure 11, conversion module 92 comprises:

First cache sub-module 921, for according to clock number buffer memory multi-path audio-frequency data;

Sorting sub-module 922, for the preset order of the multi-path audio-frequency data of buffer memory according to multichannel voice frequency input interface being sorted, obtains the SCVF single channel voice frequency data after sequence.

In one embodiment, the first output module 93 comprises:

First output sub-module, for exporting the SCVF single channel voice frequency data after sequence.

In one embodiment, as shown in figure 12, receiver module 91 comprises:

Second receives submodule 914, for receiving the multi-path audio-frequency data of multichannel voice frequency input interface input at the rising edge of clock;

First output module 93, comprising:

Second output sub-module 931, exports SCVF single channel voice frequency data for the negative edge at clock by SCVF single channel voice frequency output interface.

In one embodiment, receiver module 91 comprises:

Adopt the device that the embodiment of the present invention provides, export by the multi-path audio-frequency data received is converted into SCVF single channel voice frequency data, make can receive in speech recognition and output multi-channel voice data, when solving speech recognition, central processing unit only has one group of audio input interface and causes receiving the problem of multi-path audio-frequency data, meets the demand of speech recognition to a greater extent.

Figure 13 is the block diagram of a kind of audio output device in the embodiment of the present invention.As shown in figure 13, this device is used for central processor CPU, comprising:

Splitting module 131, for when receiving SCVF single channel voice frequency data, is multi-path audio-frequency data by SCVF single channel voice frequency Data Division;

Second output module 132, for output multi-channel voice data.

In one embodiment, as shown in figure 14, split module 131 to comprise:

Obtain submodule 1311, for obtaining the quantity of multichannel voice frequency input interface;

3rd determines submodule 1312, for according to the quantity of multichannel voice frequency input interface and the first sampling rate of default SCVF single channel voice frequency data, determines the second sampling rate of multi-path audio-frequency data;

4th determines submodule 1313, for according to the second sampling rate, determines the clock number that multi-path audio-frequency data single inputs;

First splits submodule 1314, for according to clock number, is multi-path audio-frequency data by SCVF single channel voice frequency Data Division.

In one embodiment, as shown in figure 15, SCVF single channel voice frequency data are single channel left audio data or single channel right audio channel data; Split module 131 to comprise:

Second splits submodule 1315, for when receiving described single channel left audio data, is multichannel left audio data by single channel left audio Data Division;

5th determines submodule 1316, for the clock number of the clock number inputted according to multi-path audio-frequency data single and the SCVF single channel voice frequency data received, determines the first invalid data in the multichannel left audio data after splitting;

First abandons submodule 1317, for abandoning the first invalid data, obtains multichannel left audio valid data;

3rd splits submodule 1318, for when receiving described single channel right audio channel data, is multichannel right audio channel data by described single channel right audio channel Data Division;

6th determines submodule 1319, for according to the described clock number of multi-path audio-frequency data single input and the clock number of the described SCVF single channel voice frequency data received, determines the second invalid data in the multichannel right audio channel data after described fractionation;

Second abandons submodule 13110, for abandoning described second invalid data, obtains multichannel right audio channel valid data;

Zygote module 13111, for according to the corresponding relation between multichannel left audio valid data and multichannel right audio channel valid data, in conjunction with multichannel left audio valid data and multichannel right audio channel valid data, obtains multi-path audio-frequency data.

Adopt the device that the embodiment of the present invention provides, by being that multi-path audio-frequency data exports by the SCVF single channel voice frequency received Data Division, the voice data received with single group audio input interface in speech recognition can be exported with multichannel form, thus complete voice data can be exported, when solving speech recognition, central processing unit only has one group of audio input interface and causes receiving the problem of multi-path audio-frequency data, meets the demand of speech recognition to a greater extent.

Those skilled in the art should understand, embodiments of the invention can be provided as method, system or computer program.Therefore, the present invention can adopt the form of complete hardware embodiment, completely software implementation or the embodiment in conjunction with software and hardware aspect.And the present invention can adopt in one or more form wherein including the upper computer program implemented of computer-usable storage medium (including but not limited to magnetic disk memory and optical memory etc.) of computer usable program code.

The present invention describes with reference to according to the process flow diagram of the method for the embodiment of the present invention, equipment (system) and computer program and/or block scheme.Should understand can by the combination of the flow process in each flow process in computer program instructions realization flow figure and/or block scheme and/or square frame and process flow diagram and/or block scheme and/or square frame.These computer program instructions can being provided to the processor of multi-purpose computer, special purpose computer, Embedded Processor or other programmable data processing device to produce a machine, making the instruction performed by the processor of computing machine or other programmable data processing device produce device for realizing the function of specifying in process flow diagram flow process or multiple flow process and/or block scheme square frame or multiple square frame.

These computer program instructions also can be stored in can in the computer-readable memory that works in a specific way of vectoring computer or other programmable data processing device, the instruction making to be stored in this computer-readable memory produces the manufacture comprising command device, and this command device realizes the function of specifying in process flow diagram flow process or multiple flow process and/or block scheme square frame or multiple square frame.

These computer program instructions also can be loaded in computing machine or other programmable data processing device, make on computing machine or other programmable devices, to perform sequence of operations step to produce computer implemented process, thus the instruction performed on computing machine or other programmable devices is provided for the step realizing the function of specifying in process flow diagram flow process or multiple flow process and/or block scheme square frame or multiple square frame.

Obviously, those skilled in the art can carry out various change and modification to the present invention and not depart from the spirit and scope of the present invention.Like this, if these amendments of the present invention and modification belong within the scope of the claims in the present invention and equivalent technologies thereof, then the present invention is also intended to comprise these change and modification.

Claims

1. an audio-frequency inputting method, is characterized in that, is applied to field programmable gate array, and described method comprises:

Described multi-path audio-frequency data is exported by SCVF single channel voice frequency output interface.

2. method according to claim 1, is characterized in that, the multi-path audio-frequency data of described reception multichannel voice frequency input interface input, comprising:

3. method according to claim 2, is characterized in that, described described multi-path audio-frequency data is converted into SCVF single channel voice frequency data, comprising:

4. method according to claim 3, is characterized in that, the described SCVF single channel voice frequency data of described output, comprising:

Export the SCVF single channel voice frequency data after described sequence.

5. method according to claim 1, is characterized in that,

The multi-path audio-frequency data of described reception multichannel voice frequency input interface input, comprising:

Multi-path audio-frequency data is received at the rising edge of clock;

6. method according to claim 1, is characterized in that, the multi-path audio-frequency data of described reception multichannel voice frequency input interface input, comprising:

7. an audio-frequency inputting method, is characterized in that, is applied to central processing unit, and described method comprises:

Export described multi-path audio-frequency data.

8. method according to claim 7, is characterized in that, described is multi-path audio-frequency data by described SCVF single channel voice frequency Data Division, comprising:

Obtain the quantity of multichannel voice frequency input interface;

9. method according to claim 7, is characterized in that, described SCVF single channel voice frequency data are single channel left audio data or single channel right audio channel data; Described when receiving SCVF single channel voice frequency data, be multi-path audio-frequency data by described SCVF single channel voice frequency Data Division, comprise:

10. an audio output device, is characterized in that, is applied to field programmable gate array, and described device comprises:

11. devices according to claim 10, is characterized in that, described receiver module comprises:

12. devices according to claim 11, is characterized in that, described conversion module comprises:

13. devices according to claim 12, is characterized in that, described first output module comprises:

14. devices according to claim 10, is characterized in that,

Described receiver module comprises:

Described first output module, comprising:

15. devices according to claim 10, is characterized in that, described receiver module comprises:

16. 1 kinds of audio output devices, is characterized in that, be applied to central processing unit, and described device comprises:

Second output module, for exporting described multi-path audio-frequency data.

17. devices according to claim 16, is characterized in that, described fractionation module comprises:

18. devices according to claim 16, is characterized in that, described SCVF single channel voice frequency data are single channel left audio data or single channel right audio channel data; Described fractionation module comprises: