EP1834322A1 - Verfahren zum codieren eines analogen signals - Google Patents
Verfahren zum codieren eines analogen signalsInfo
- Publication number
- EP1834322A1 EP1834322A1 EP05815885A EP05815885A EP1834322A1 EP 1834322 A1 EP1834322 A1 EP 1834322A1 EP 05815885 A EP05815885 A EP 05815885A EP 05815885 A EP05815885 A EP 05815885A EP 1834322 A1 EP1834322 A1 EP 1834322A1
- Authority
- EP
- European Patent Office
- Prior art keywords
- signal
- excitation signal
- values
- exe
- exc
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 19
- 230000005284 excitation Effects 0.000 claims abstract description 42
- 230000003044 adaptive effect Effects 0.000 claims abstract description 25
- 230000015572 biosynthetic process Effects 0.000 claims abstract description 8
- 238000003786 synthesis reaction Methods 0.000 claims abstract description 6
- 238000005070 sampling Methods 0.000 claims description 13
- 238000004891 communication Methods 0.000 claims description 7
- 230000009467 reduction Effects 0.000 claims description 3
- 230000005540 biological transmission Effects 0.000 description 9
- 230000006872 improvement Effects 0.000 description 4
- 238000001228 spectrum Methods 0.000 description 4
- 230000008901 benefit Effects 0.000 description 3
- 230000001419 dependent effect Effects 0.000 description 2
- 238000001914 filtration Methods 0.000 description 2
- 230000006978 adaptation Effects 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 238000010295 mobile communication Methods 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000001308 synthesis method Methods 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 230000001755 vocal effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/09—Long term prediction, i.e. removing periodical redundancies, e.g. by using adaptive codebook or pitch predictor
Definitions
- the invention relates to a method for coding an analog signal by means of an analysis by synthesis methods.
- bandwidth is a limited resource.
- D. H that the predetermined, limited bandwidth must be distributed to a large number of users. If the bandwidth for a user is now increased, a reduction of the bandwidth available to the remaining users must inevitably occur given a constant number of users.
- Bandwidth for example 8 kHz bandwidth from 0 to 8 kHz.
- An analog signal is decomposed into frames for coding and a synthesized signal is timed to match the analog signal.
- the synthetic signal is generated as an output signal of a synthesis filter, which is excited by an excitation signal as an input signal.
- At least one adaptive codebook is used in which the excitation signal is present for earlier time frames.
- the earlier excitation signal is shown here as a plurality of samples.
- a segment corresponding to the length of the current time frame is output of the plurality of samples present in the adaptive codebook.
- the selection is made by means of a reference parameter dependent on a basic speech frequency, which can also assume non-integer values, ie. H . refers to spaces for intermediate values lying between the actually present samples.
- the speech fundamental frequency parameter now assumes a non-integer value, corresponding intermediate values are selected for the sampled values in the selected segment.
- the segment corresponds in length to the current time frame and its position in the adaptive codebook is determined by the speech fundamental frequency parameter.
- This formation of intermediate values takes place, for example, by interpolation.
- an interpolation can take place with a (sin x) / x function.
- the core of the invention is now that the entirety of samples and interpolation values is used to form the excitation signal.
- the speech fundamental frequency parameter determines the beginning of the selected segment and refers to the value 5 1/3
- the corresponding intermediate values 5 1/3, 6 1/3, 7 1/3 etc are formed and only these for generation of the excitation signal and maintained in the adaptive codebook.
- the values 5 1/3, 5 2/3, 6, 6 1/3, 6 2/3 etc would be used, which can be done without additional transmission of information.
- a quality improvement is generated.
- the distance between a sample and an intermediate value is reduced to H or. 1.3.
- N is greater than or equal to 3
- the distance between two intermediate values is reduced to the same value.
- the excitation signal can also be generated in particular by means of a fixed codebook. For example, fixed excitation signals are present in a fixed codebook.
- the fixed codebook in its originally predetermined bandwidth or. maintain the original samples and achieve higher bandwidth only with the adaptive codebook. This has the advantage of a particularly simple implementation.
- a shift of a fixed codebook entry can take place while maintaining the time intervals between the signal components. For example, has a fixed codebook entry of length 4 a signal component at times 1 and 3, and no or. a zero value of the signal component at times 0, 2 and 4, then a shift to the times 1/3 to 4 1/3 would take place.
- a white, d. H . essentially frequency-independent noise signal can be used to generate the excitation signal.
- the fixed codebook can be saved. It has been found that, in particular with speech signals, a very satisfactory quality of the signal generated on the receiver side can be ensured.
- the noise signal is picked up from the environment or generated by means of a noise generator.
- filtering of the formed excitation signal can be provided, in particular before it is used as input signal for the synthesis filter becomes.
- a Wiener FIR finite impulse response
- the proposed methods can be used in a communication terminal with an encoding unit, such as a mobile telephone, a personal digital assistant (PDA), a computer or a landline telephone, etc. occur .
- an encoding unit such as a mobile telephone, a personal digital assistant (PDA), a computer or a landline telephone, etc. occur .
- a corresponding receiver for example transition elements between different communication systems, a TRAU (transmission and rate adaptation unit) has a corresponding decoding unit.
- TRAU transmission and rate adaptation unit
- a suitable communication system has at least one communication terminal and one receiver.
- FIG. 1 a the representation of the generation of a synthesized signal
- FIG. 1b The representation of the generation of an excitation signal for a broadband solution
- FIG. 3 shows an exemplary bandwidth extension in the adaptive codebook.
- FIG. 1 a shows the use of an excitation signal exe to excite a synthesis filter A (z).
- the synthesis filter A (z) simulates in the case of speech signals in the human vocal tract, so that in this case a synthetic acoustic signal AS syn is generated by means of a suitable excitation signal exe. This is compared by means of a comparator C with the actual acoustic signal as. Successively, the excitation signal exe is adjusted so that the synthetic acoustic signal AS_syn resembles the actual acoustic signal as as well as possible.
- FIG. 1b shows the generation of an excitation signal exe in the case of a broadband solution.
- broadband solution is understood to mean that the bandwidth of the signal reconstructed on the receiver side is greater than originally - for example.
- B. provided by the design of codebooks.
- an extension of the G.729 is spoken by a 4kHz bandwidth signal as a narrowband signal, and a broadband signal is added to 8kHz bandwidth.
- an adaptive codebook ACB is provided, with which harmonic components of the acoustic signal are displayed.
- the adaptive codebook includes earlier excitation signals old_exc, d. H . those from previous time frames resp. Time periods.
- the selection of an entry from the adaptive codebook ACB takes place via a non-integer speech frequency parameter p, which is represented by its integer part N * (int p), where N represents an integer, and the fraction p frac.
- FIG. 2 sample values of the excitation signal exe for different sampling rates are shown. Depending on the sampling rate results in a 4 kHz bandwidth (case A), an 8 kHz bandwidth (case B) or a 12 kHz bandwidth (case C). The individual sample values are shown as dots, the different sample rates are made clear by different time intervals between the sample values on the time axis. In the following, Fig. Ib directed.
- a fixed codebook SCB is also provided, which is often referred to as an innovative codebook.
- a reference idx s to the fixed codebook SCB By means of a reference idx s to the fixed codebook SCB, a specific entry is selected from the fixed codebook SCB. This is amplified by a suitable amplification factor g_s. The resulting signal forms the fixed excitation signal exe s.
- values are optionally set in the fixed codebook between the existing values.
- the number of values inserted depends on the desired bandwidth expansion. This intermediate is to be clarified by the entry int N.
- FIG. 3 shows the history (history ACB) recorded in the adaptive codebook ACB, as well as a current time frame (actual frame).
- the current current frame is shown on the right hand side of the dashed line, whereby the continuous time on a time axis (t) is to be expressed to the right.
- the frame is shown above the samples and intermediate values present in the adaptive codebook.
- a sample is the value sampled at an original first sampling frequency.
- Intermediate values are the initially artificially interposed values, which first assume the value 0 and then values ⁇ 0 as a function of the respective new time frames of the signal.
- the intermediate values are intermediate values.
- the adaptive codebook ACB is empty, i. H . there are only zero values at the times which correspond to a desired sampling rate. At the same time, zeroes are already inserted as intermediate values, so that in the adaptive codebook, zero values are present in the line a) at the times which already correspond to a higher sampling rate.
- the first frame is present only at a first sampling rate, for example 4 kHz, such as the non-zero values of the current frame in line a, but a subsequent encoding is for a threefold sampling rate, for example 12 kHz, then a corresponding number Null values are set between the existing samples. This is also shown in line a for the current frame.
- a first sampling rate for example 4 kHz, such as the non-zero values of the current frame in line a
- a subsequent encoding is for a threefold sampling rate, for example 12 kHz
- the first frame is already contained in the adaptive codebook.
- a suitable segment is selected from the adaptive codebook.
- Ml MO * M3 if MO is the number of Ml at the first sampling rate, ie. B. at 4 kHz, present values.
- the lower first sampling rate (of for example 4 kHz) against that between the original ones Samples intermediate intermediate values for non-integer basic speech frequency parameters p.
- the second frame is represented, for example, by the elliptically rounded segment from the adaptive codebook ACB.
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
Claims
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE102005000828A DE102005000828A1 (de) | 2005-01-05 | 2005-01-05 | Verfahren zum Codieren eines analogen Signals |
PCT/EP2005/056479 WO2006072519A1 (de) | 2005-01-05 | 2005-12-05 | Verfahren zum codieren eines analogen signals |
Publications (2)
Publication Number | Publication Date |
---|---|
EP1834322A1 true EP1834322A1 (de) | 2007-09-19 |
EP1834322B1 EP1834322B1 (de) | 2015-02-18 |
Family
ID=35697206
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP05815885.8A Active EP1834322B1 (de) | 2005-01-05 | 2005-12-05 | Verfahren zum codieren eines analogen signals |
Country Status (5)
Country | Link |
---|---|
US (1) | US7957978B2 (de) |
EP (1) | EP1834322B1 (de) |
CN (2) | CN102655004B (de) |
DE (1) | DE102005000828A1 (de) |
WO (1) | WO2006072519A1 (de) |
Family Cites Families (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5495555A (en) * | 1992-06-01 | 1996-02-27 | Hughes Aircraft Company | High quality low bit rate celp-based speech codec |
EP0704836B1 (de) * | 1994-09-30 | 2002-03-27 | Kabushiki Kaisha Toshiba | Vorrichtung zur Vektorquantisierung |
US5664055A (en) * | 1995-06-07 | 1997-09-02 | Lucent Technologies Inc. | CS-ACELP speech compression system with adaptive pitch prediction filter gain based on a measure of periodicity |
US6073092A (en) * | 1997-06-26 | 2000-06-06 | Telogy Networks, Inc. | Method for speech coding based on a code excited linear prediction (CELP) model |
US6173257B1 (en) * | 1998-08-24 | 2001-01-09 | Conexant Systems, Inc | Completed fixed codebook for speech encoder |
CA2252170A1 (en) * | 1998-10-27 | 2000-04-27 | Bruno Bessette | A method and device for high quality coding of wideband speech and audio signals |
JP3343082B2 (ja) * | 1998-10-27 | 2002-11-11 | 松下電器産業株式会社 | Celp型音声符号化装置 |
US7039581B1 (en) * | 1999-09-22 | 2006-05-02 | Texas Instruments Incorporated | Hybrid speed coding and system |
US7139700B1 (en) * | 1999-09-22 | 2006-11-21 | Texas Instruments Incorporated | Hybrid speech coding and system |
US7222070B1 (en) * | 1999-09-22 | 2007-05-22 | Texas Instruments Incorporated | Hybrid speech coding and system |
GB0025659D0 (en) * | 2000-10-19 | 2000-12-06 | Radioscape Ltd | Hybrid analogue/digital transmission or communication system |
DE10124420C1 (de) * | 2001-05-18 | 2002-11-28 | Siemens Ag | Verfahren zur Codierung und zur Übertragung von Sprachsignalen |
-
2005
- 2005-01-05 DE DE102005000828A patent/DE102005000828A1/de not_active Withdrawn
- 2005-12-05 CN CN201210137786.2A patent/CN102655004B/zh not_active Expired - Fee Related
- 2005-12-05 EP EP05815885.8A patent/EP1834322B1/de active Active
- 2005-12-05 WO PCT/EP2005/056479 patent/WO2006072519A1/de active Application Filing
- 2005-12-05 CN CN2005800460485A patent/CN101099198B/zh not_active Expired - Fee Related
- 2005-12-05 US US11/794,790 patent/US7957978B2/en active Active
Non-Patent Citations (1)
Title |
---|
See references of WO2006072519A1 * |
Also Published As
Publication number | Publication date |
---|---|
CN102655004B (zh) | 2015-06-17 |
US20090276226A1 (en) | 2009-11-05 |
CN101099198B (zh) | 2012-06-27 |
CN101099198A (zh) | 2008-01-02 |
DE102005000828A1 (de) | 2006-07-13 |
CN102655004A (zh) | 2012-09-05 |
WO2006072519A1 (de) | 2006-07-13 |
US7957978B2 (en) | 2011-06-07 |
EP1834322B1 (de) | 2015-02-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE69910240T2 (de) | Vorrichtung und verfahren zur wiederherstellung des hochfrequenzanteils eines überabgetasteten synthetisierten breitbandsignals | |
DE60121405T2 (de) | Transkodierer zur Vermeidung einer Kaskadenkodierung von Sprachsignalen | |
DE60034484T2 (de) | Verfahren und vorrichtung in einem kommunikationssystem | |
EP1979901B1 (de) | Verfahren und anordnungen zur audiosignalkodierung | |
DE19604273C2 (de) | Verfahren und Vorrichtung zum Durchführen einer Suche in einem Kodebuch im Hinblick auf das Kodieren eines Klangsignales, Zellkommunikationssystem, Zellnetzwerkelement und mobile Zell-Sender-/Empfänger-Einheit | |
DE60006271T2 (de) | Celp sprachkodierung mit variabler bitrate mittels phonetischer klassifizierung | |
DE69521164T2 (de) | System zum Kodieren und Dekodieren von Signalen | |
DE69730721T2 (de) | Verfahren und vorrichtungen zur geräuschkonditionierung von signalen welche audioinformationen darstellen in komprimierter und digitalisierter form | |
DE3736193C2 (de) | ||
DE69121411T2 (de) | Methode und gerät zur codierung von analogen signalen | |
DE69033510T3 (de) | Numerischer sprachcodierer mit verbesserter langzeitvorhersage durch subabtastauflösung | |
EP2245621A1 (de) | Verfahren und mittel zur enkodierung von hintergrundrauschinformationen | |
EP1953739A2 (de) | Verfahren und Vorrichtung zur Geräuschunterdrückung | |
DE102008042579A1 (de) | Verfahren zur Fehlerverdeckung bei fehlerhafter Übertragung von Sprachdaten | |
DE69820362T2 (de) | Nichtlinearer Filter zur Geräuschunterdrückung in linearen Prädiktions-Sprachkodierungs-Vorrichtungen | |
EP1023777B1 (de) | Verfahren und vorrichtung zur erzeugung eines bitratenskalierbaren audio-datenstroms | |
DE69028434T2 (de) | System zur Codierung von Breitbandaudiosignalen | |
DE4491015C2 (de) | Verfahren zum Erzeugen eines Spektralrauschbewertungsfilters zur Verwendung in einem Sprachcoder | |
DE69611607T2 (de) | Quantisierung einer aufgeteilten vorhersagematrix mit spektralparametern zur wirksamen sprachkodierung | |
DE60016305T2 (de) | Verfahren zum Betrieb eines Sprachkodierers | |
DE68907098T2 (de) | Differentieller Kodierer mit auto-adaptivem Prädiktorfilter und dazugehörigem Dekodierer. | |
DE69837296T2 (de) | Verfahren und Vorrichtung zur Audiokodierung mittels einer mehrstufigen Mehrimpulsanregung | |
DE2303497C2 (de) | Verfahren zur Übertragung von Sprachsignalen | |
DE69127339T2 (de) | Methode und Einrichtung zur Kodierung und Dekodierung eines abgetasteten Analogsignals mit Wiederholungseigenschaften | |
DE68917552T2 (de) | Verfahren und Einrichtung zur Codierung und Decodierung von Sprachsignalen unter Anwendung von Multipuls-Anregung. |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20070531 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): DE FR GB IT SE |
|
17Q | First examination report despatched |
Effective date: 20071214 |
|
DAX | Request for extension of the european patent (deleted) | ||
RBV | Designated contracting states (corrected) |
Designated state(s): DE FR GB IT SE |
|
RAP1 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: SIEMENS ENTERPRISE COMMUNICATIONS GMBH & CO. KG |
|
RAP1 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: UNIFY GMBH & CO. KG |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R079 Ref document number: 502005014685 Country of ref document: DE Free format text: PREVIOUS MAIN CLASS: G10L0019080000 Ipc: G10L0019090000 |
|
GRAP | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOSNIGR1 |
|
RIC1 | Information provided on ipc code assigned before grant |
Ipc: G10L 19/09 20130101AFI20140702BHEP |
|
INTG | Intention to grant announced |
Effective date: 20140718 |
|
GRAS | Grant fee paid |
Free format text: ORIGINAL CODE: EPIDOSNIGR3 |
|
GRAA | (expected) grant |
Free format text: ORIGINAL CODE: 0009210 |
|
AK | Designated contracting states |
Kind code of ref document: B1 Designated state(s): DE FR GB IT SE |
|
REG | Reference to a national code |
Ref country code: GB Ref legal event code: FG4D Free format text: NOT ENGLISH |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R081 Ref document number: 502005014685 Country of ref document: DE Owner name: UNIFY GMBH & CO. KG, DE Free format text: FORMER OWNER: SIEMENS AKTIENGESELLSCHAFT, 80333 MUENCHEN, DE |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R096 Ref document number: 502005014685 Country of ref document: DE Effective date: 20150402 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: SE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20150218 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R097 Ref document number: 502005014685 Country of ref document: DE |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 11 |
|
PLBE | No opposition filed within time limit |
Free format text: ORIGINAL CODE: 0009261 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT |
|
26N | No opposition filed |
Effective date: 20151119 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: IT Payment date: 20151222 Year of fee payment: 11 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R082 Ref document number: 502005014685 Country of ref document: DE Representative=s name: SCHAAFHAUSEN PATENTANWAELTE PARTNERSCHAFTSGESE, DE Ref country code: DE Ref legal event code: R082 Ref document number: 502005014685 Country of ref document: DE Representative=s name: FRITZSCHE PATENTANWAELTE, DE Ref country code: DE Ref legal event code: R081 Ref document number: 502005014685 Country of ref document: DE Owner name: UNIFY GMBH & CO. KG, DE Free format text: FORMER OWNER: UNIFY GMBH & CO. KG, 81379 MUENCHEN, DE |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 12 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: IT Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20161205 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 13 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R082 Ref document number: 502005014685 Country of ref document: DE Representative=s name: SCHAAFHAUSEN PATENTANWAELTE PARTNERSCHAFTSGESE, DE |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R081 Ref document number: 502005014685 Country of ref document: DE Owner name: UNIFY BETEILIGUNGSVERWALTUNG GMBH & CO. KG, DE Free format text: FORMER OWNER: UNIFY GMBH & CO. KG, 80807 MUENCHEN, DE |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: DE Payment date: 20241001 Year of fee payment: 20 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: GB Payment date: 20241001 Year of fee payment: 20 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: FR Payment date: 20241001 Year of fee payment: 20 |