[go: up one dir, main page]

CN111798831B - Sound particle synthesis method and device - Google Patents

Sound particle synthesis method and device Download PDF

Info

Publication number
CN111798831B
CN111798831B CN202010549930.8A CN202010549930A CN111798831B CN 111798831 B CN111798831 B CN 111798831B CN 202010549930 A CN202010549930 A CN 202010549930A CN 111798831 B CN111798831 B CN 111798831B
Authority
CN
China
Prior art keywords
sound
particles
particle
audio
transition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010549930.8A
Other languages
Chinese (zh)
Other versions
CN111798831A (en
Inventor
卢炽华
谢丽萍
刘惟伊
刘志恩
魏浩钦
刘浩
孙唯
张光
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuhan University of Technology WUT
Dongfeng Motor Corp
Original Assignee
Wuhan University of Technology WUT
Dongfeng Motor Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuhan University of Technology WUT, Dongfeng Motor Corp filed Critical Wuhan University of Technology WUT
Priority to CN202010549930.8A priority Critical patent/CN111798831B/en
Publication of CN111798831A publication Critical patent/CN111798831A/en
Application granted granted Critical
Publication of CN111798831B publication Critical patent/CN111798831B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/04Details of speech synthesis systems, e.g. synthesiser structure or memory management
    • G10L13/047Architecture of speech synthesisers
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02TCLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
    • Y02T90/00Enabling technologies or technologies with a potential or indirect contribution to GHG emissions mitigation

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Stereophonic System (AREA)

Abstract

本发明属于数字信号处理技术领域,公开了一种声音粒子合成方法及装置,方法包括采集原始声音样本,根据原始声音样本构建声音粒子样本库,并添加声音粒子的索引,根据车况输入信息,基于声音粒子的索引读取声音粒子样本库中的声音粒子,根据索引读取的声音粒子,构建过渡音频,通过过渡音频拼接相邻的声音粒子,合成目标声音样本;装置包括采集单元、声音粒子样本库构建单元、索引单元、声音粒子读取单元、过渡音频构建单元、目标声音样本合成单元。本发明解决了现有技术中声音粒子在拼接处会产生“阶跃”现象,导致合成音频声中参杂“爆音”,使得合成音不够连续自然的问题,能够保证合成音的连贯性,满足对目标声品质的要求。

The invention belongs to the technical field of digital signal processing and discloses a sound particle synthesis method and device. The method includes collecting original sound samples, constructing a sound particle sample library based on the original sound samples, adding an index of sound particles, inputting information according to vehicle conditions, and based on The index of sound particles reads the sound particles in the sound particle sample library, constructs transition audio based on the sound particles read by the index, splices adjacent sound particles through the transition audio, and synthesizes the target sound sample; the device includes a collection unit, sound particle samples Library building unit, indexing unit, sound particle reading unit, transition audio building unit, target sound sample synthesis unit. The invention solves the problem in the prior art that sound particles will produce a "step" phenomenon at the splicing point, resulting in "pops" being mixed into the synthesized audio sound, making the synthesized sound not continuous and natural enough. It can ensure the coherence of the synthesized sound and satisfy the requirements. Requirements for target sound quality.

Description

一种声音粒子合成方法及装置A sound particle synthesis method and device

技术领域Technical field

本发明涉及数字信号处理技术领域,尤其涉及一种声音粒子合成方法及装置。The present invention relates to the technical field of digital signal processing, and in particular to a sound particle synthesis method and device.

背景技术Background technique

汽车产业已进入内涵式发展的稳健时期,当汽车各个系统的NVH(Noise、Vibration、Harshness,噪声、振动与声振粗糙度)问题得到改善后,人们更加注重噪声对乘坐人员的主观感受的影响,突出体现为“从静音到声品质”的变革。ASD(active sounddesign,主动声音设计)是一种通过主动发声的形式来满足驾乘者“声趣”的体验和提高车内声品质目标要求的手段。目前主要从声音设计、声音合成算法以及主动发声控制系统三大部分对ASD技术开展研究。The automobile industry has entered a steady period of connotative development. When the NVH (Noise, Vibration, Harshness) problems of various automobile systems are improved, people pay more attention to the impact of noise on the subjective feelings of passengers. , highlighted as the transformation of “from silence to sound quality”. ASD (active sound design) is a method that uses active sound to satisfy the "sound fun" experience of drivers and passengers and improve the target requirements of in-car sound quality. At present, ASD technology is mainly researched from three parts: sound design, sound synthesis algorithm and active sound control system.

其中,声音合成算法是保证目标声音播放连贯性的关键,当前针对车内声音合成算法主要包括谐波算法与粒子算法,其中谐波算法是通过设定输入信号(如转速)与合成阶次声幅值与频率的函数关系,以此合成阶次声;粒子算法则是将录制好的声音样本切割成小的声音粒子,根据工况输入读取对应样本,并进行声音粒子的拼接,从而播放出随工况变化的声音。Among them, the sound synthesis algorithm is the key to ensuring the coherence of the target sound playback. The current sound synthesis algorithm for the car mainly includes the harmonic algorithm and the particle algorithm. The harmonic algorithm is based on setting the input signal (such as rotation speed) and synthesizing the order sound. The functional relationship between amplitude and frequency is used to synthesize order sound; the particle algorithm cuts the recorded sound samples into small sound particles, reads the corresponding samples according to the working condition input, and splices the sound particles to play The sound changes with the working conditions.

声音粒子算法和谐波算法相比,能够高度还原原始声音的声音品质。为了满足顾客多元化车内声音需求、声音粒子算法成为最有效的声音合成方案之一。声音粒子在拼接过程中的连贯性,是保证合成音声品质的关键,但通常情况下声音粒子在拼接处会产生“阶跃”现象,导致合成音频声中参杂“爆音”,使得合成音不够连续自然。Compared with the harmonic algorithm, the sound particle algorithm can highly restore the sound quality of the original sound. In order to meet the diversified in-car sound needs of customers, the sound particle algorithm has become one of the most effective sound synthesis solutions. The coherence of sound particles during the splicing process is the key to ensuring the quality of synthetic sound. However, usually sound particles will produce a "step" phenomenon at the splicing point, resulting in "pops" in the synthesized audio sound, making the synthesized sound insufficient. Continuous and natural.

发明内容Contents of the invention

本申请实施例通过提供一种声音粒子合成方法及装置,解决了现有技术中声音粒子在拼接处会产生“阶跃”现象,导致合成音频声中参杂“爆音”,使得合成音不够连续自然的问题。By providing a method and device for synthesizing sound particles, the embodiments of the present application solve the problem of the "step" phenomenon in the existing technology where sound particles are spliced together, resulting in "pops" being mixed into the synthesized audio sound, making the synthesized sound not continuous enough. Natural question.

本申请实施例提供一种声音粒子合成方法,包括以下步骤:The embodiment of the present application provides a sound particle synthesis method, which includes the following steps:

步骤1、采集原始声音样本;Step 1. Collect original sound samples;

步骤2、根据所述原始声音样本,构建声音粒子样本库,并添加声音粒子的索引;Step 2. Construct a sound particle sample library based on the original sound sample, and add the index of the sound particle;

步骤3、根据车况输入信息,基于所述声音粒子的索引读取所述声音粒子样本库中的声音粒子;Step 3: According to the vehicle condition input information, read the sound particles in the sound particle sample library based on the index of the sound particles;

步骤4、根据所述索引读取的声音粒子,构建过渡音频;通过所述过渡音频拼接相邻的声音粒子,合成目标声音样本。Step 4: Construct transitional audio based on the sound particles read by the index; splice adjacent sound particles through the transitional audio to synthesize the target sound sample.

优选的,所述步骤1的实现方式为:选取目标车型,录制所述目标车型对应的音频信号作为所述原始声音样本;Preferably, step 1 is implemented by: selecting a target car model and recording the audio signal corresponding to the target car model as the original sound sample;

其中,所述音频信号包括但不限于发动机声音信息。Wherein, the audio signal includes but is not limited to engine sound information.

优选的,所述步骤2的实现方式为:根据所述原始声音样本,定位截取标记点并提取声音粒子,构建所述声音粒子样本库;Preferably, step 2 is implemented by: locating and intercepting mark points and extracting sound particles according to the original sound sample, and constructing the sound particle sample library;

其中,所述声音粒子样本库中的声音粒子的首尾相位均为0。Wherein, the first and last phases of the sound particles in the sound particle sample library are both 0.

优选的,所述步骤2中,所述添加声音粒子的索引的实现方式为:提取所述声音粒子样本库中的声音粒子片段,并将其顺序摆放;对每个声音粒子进行编号,并制成一个索引数组;Preferably, in step 2, the method of adding an index of sound particles is to: extract the sound particle fragments in the sound particle sample library and place them in order; number each sound particle, and Make an index array;

所述步骤3中,根据车况输入信息获得对应的声音粒子的索引号,并根据索引号读取所述声音粒子样本库中的声音粒子;所述车况输入信息包括但不限于发动机转速信息、车速信息、扭矩信息。In step 3, the index number of the corresponding sound particle is obtained according to the vehicle condition input information, and the sound particles in the sound particle sample library are read according to the index number; the vehicle condition input information includes but is not limited to engine speed information, vehicle speed information, torque information.

优选的,所述步骤4中,根据所述索引读取的声音粒子,构建过渡音频拼接相邻的声音粒子;其中,将音频信号和希尔伯特变换的函数表达式进行类比,得到音频信号函数表达式;Preferably, in step 4, based on the sound particles read by the index, a transitional audio is constructed to splice adjacent sound particles; wherein, the audio signal is compared with the functional expression of the Hilbert transform to obtain the audio signal function expression;

通过埃尔米特插值算法得到过渡音频的相位信息、频率信息,通过插值算法得到幅值信息;The phase information and frequency information of the transition audio are obtained through the Hermitian interpolation algorithm, and the amplitude information is obtained through the interpolation algorithm;

根据所述音频信号函数表达式,以及所述过渡音频的相位信息、频率信息、幅值信息,构建所述过渡音频。The transition audio is constructed according to the audio signal function expression and the phase information, frequency information, and amplitude information of the transition audio.

优选的,所述音频信号函数表达式y(t)如下:Preferably, the audio signal function expression y(t) is as follows:

y(t)=A(t)sin(Φ(t))y(t)=A(t)sin(Φ(t))

式中,A(t)为音频信号的瞬时幅值、Φ(t)为音频信号的瞬时相位、为音频信号瞬时相位的导数、F(t)为音频信号的瞬时频率、C为积分常数、t2-t1为音频信号的时长。In the formula, A(t) is the instantaneous amplitude of the audio signal, Φ(t) is the instantaneous phase of the audio signal, is the derivative of the instantaneous phase of the audio signal, F(t) is the instantaneous frequency of the audio signal, C is the integration constant, and t2-t1 is the duration of the audio signal.

优选的,所述通过埃尔米特插值算法得到过渡音频的相位信息的实现方式为:Preferably, the implementation method of obtaining the phase information of transition audio through the Hermitian interpolation algorithm is:

设相邻的声音粒子分别为声音粒子i、声音粒子i+1,则用于拼接所述声音粒子i和所述声音粒子i+1的过渡音频记为过渡音频i;Assume that the adjacent sound particles are sound particle i and sound particle i+1 respectively, then the transition audio used to splice the sound particle i and the sound particle i+1 is recorded as transition audio i;

拼接后得到的合成音的瞬时相位曲线连续可导;The instantaneous phase curve of the synthesized sound obtained after splicing is continuously differentiable;

其中,所述声音粒子i对应的时间区间为ti~ti+1,所述声音粒子i在t i+1时刻的瞬时相位为Φ(ti+1);所述声音粒子i+1对应的时间区间为ti+2~ti+3,所述声音粒子i+1在ti+2时刻的瞬时相位为Φ(ti+2);Wherein, the time interval corresponding to the sound particle i is t i ~ t i+1 , and the instantaneous phase of the sound particle i at time t i+1 is Φ(t i+1 ); the sound particle i+1 The corresponding time interval is t i+2 ~ t i+3 , and the instantaneous phase of the sound particle i+1 at time t i+2 is Φ(t i+2 );

所述过渡音频i对应的时间区间为ti+1~ti+2;所述过渡音频i的左端点为所述声音粒子i在ti+1时刻的音频信号,所述过渡音频i的右端点为所述声音粒子i+1在ti+2时刻的音频信号;The time interval corresponding to the transition audio i is ti +1 to ti +2 ; the left endpoint of the transition audio i is the audio signal of the sound particle i at time t i+1 , and the transition audio i The right endpoint is the audio signal of the sound particle i+1 at time t i+2 ;

所述声音粒子i在ti+1时刻的瞬时相位的积分常数为C1;所述声音粒子i+1在ti+2时刻的瞬时相位的积分常数为C2;The integral constant of the instantaneous phase of the sound particle i at time t i+1 is C1; the integral constant of the instantaneous phase of the sound particle i+1 at time t i+2 is C2;

预设ti为0,C1=0,根据所述过渡音频i左端点的瞬时相位Φ(ti+1)、左右端点瞬时相位的一阶导数和/>左右端点瞬时相位的二阶导数/>和/>得到四次Hermite插值法求解过渡音频i相位的表达式Φ(t)如下:Preset ti is 0, C1=0, according to the instantaneous phase Φ(t i+1 ) of the left endpoint of the transition audio i and the first-order derivative of the instantaneous phase of the left and right endpoints and/> The second derivative of the instantaneous phase of the left and right endpoints/> and/> The expression Φ(t) for solving the phase of transition audio i using the quartic Hermite interpolation method is obtained as follows:

Φ(t)=at4+bt3+ct2+dt+eΦ(t)=at 4 +bt 3 +ct 2 +dt+e

式中,a、b、c、d、e为待求的多项式系数;In the formula, a, b, c, d, and e are the polynomial coefficients to be found;

采用第一插值条件,得到第一线性方程组表示为:Using the first interpolation condition, the first linear equation system is obtained as:

其中,rank(A|y)=rank(A)=5,则埃尔米特插值的多项式系数有唯一解,可得过渡音频相位的表达式Φ(t),同时求得C2=2π*Φ(ti+2);Among them, rank(A|y)=rank(A)=5, then the polynomial coefficient of Hermitian interpolation has a unique solution, and the expression of the transition audio phase Φ(t) can be obtained, and C2=2π*Φ can be obtained. (t i+2 );

根据所述第一插值条件求得的C2得到的sin(C2)不为0时,采用第二插值条件替换所述第一插值条件,采用第二线性方程组替换所述第一线性方程组;所述第二线性方程组表示为:When sin(C2) obtained from C2 obtained according to the first interpolation condition is not 0, the second interpolation condition is used to replace the first interpolation condition, and the second linear equation set is used to replace the first linear equation set; The second system of linear equations is expressed as:

其中,rank(A|y)=rank(A)<5,则插值系数向量X有无数解,从中获取一个满足sin(C1)=0且sin(C2)=0的过渡音频相位的表达式Φ(t)。Among them, rank(A|y)=rank(A)<5, then the interpolation coefficient vector X has countless solutions, from which an expression Φ of the transition audio phase satisfying sin(C1)=0 and sin(C2)=0 is obtained. (t).

优选的,所述过渡音频的频率信息满足以下约束条件:Preferably, the frequency information of the transition audio satisfies the following constraints:

所述过渡音频的频率介于左右两端点的频率之间;The frequency of the transition audio is between the frequencies of the left and right endpoints;

所述过渡音频的频率单调。The frequency of the transition audio is monotonic.

优选的,所述过渡音频的幅值满足连续条件;根据所述声音粒子i和所述声音粒子i+1在拼接点处的幅值A(ti+1)和A(ti+2),插值得到所述过渡音频的幅值信息。Preferably, the amplitude of the transition audio satisfies the continuity condition; according to the amplitudes A(t i+1 ) and A(t i+2 ) of the sound particle i and the sound particle i+1 at the splicing point , interpolate to obtain the amplitude information of the transition audio.

本申请实施例提供一种声音粒子合成装置,包括:An embodiment of the present application provides a sound particle synthesis device, including:

采集单元,用于采集获取原始声音样本;A collection unit is used to collect and obtain original sound samples;

声音粒子样本库构建单元,用于根据所述原始声音样本,得到声音粒子样本库;A sound particle sample library construction unit is used to obtain a sound particle sample library based on the original sound sample;

索引单元,用于基于所述声音粒子样本库,得到声音粒子的索引;An indexing unit, used to obtain the index of sound particles based on the sound particle sample library;

声音粒子读取单元,用于根据车况输入信息,基于所述声音粒子的索引读取所述声音粒子样本库中的声音粒子;A sound particle reading unit is used to input information according to the vehicle condition and read the sound particles in the sound particle sample library based on the index of the sound particles;

过渡音频构建单元,用于根据所述索引读取的声音粒子,得到过渡音频;A transition audio building unit, used to obtain transition audio based on the sound particles read by the index;

目标声音样本合成单元,用于通过所述过渡音频拼接相邻的声音粒子,合成得到目标声音样本;A target sound sample synthesis unit is used to splice adjacent sound particles through the transition audio to synthesize the target sound sample;

所述声音粒子合成装置用于实现上述声音粒子合成方法。The sound particle synthesis device is used to implement the above sound particle synthesis method.

本申请实施例中提供的一个或多个技术方案,至少具有如下技术效果或优点:One or more technical solutions provided in the embodiments of this application have at least the following technical effects or advantages:

在本申请实施例中,首先采集原始声音样本,然后根据原始声音样本构建声音粒子样本库,并添加声音粒子的索引,接着根据车况输入信息,基于声音粒子的索引读取声音粒子样本库中的声音粒子,之后根据索引读取的声音粒子,构建过渡音频,并通过过渡音频拼接相邻的声音粒子,合成目标声音样本。本发明通过构建过渡音频拼接相邻声音粒子,实现声音粒子的拼接自然,解决了现有声音粒子拼接过程中出现的“阶跃”现象,避免合成音中的“爆音”出现,保证了合成音的连贯性,满足对目标声品质的要求。In the embodiment of this application, original sound samples are first collected, then a sound particle sample library is constructed based on the original sound samples, and the index of the sound particle is added. Then, information is input according to the vehicle condition, and the sound particle sample library is read based on the index of the sound particle. Sound particles, and then construct transition audio based on the sound particles read by the index, and splice adjacent sound particles through the transition audio to synthesize the target sound sample. The present invention realizes the natural splicing of sound particles by constructing transitional audio to splice adjacent sound particles, solves the "step" phenomenon that occurs in the splicing process of existing sound particles, avoids the occurrence of "pops" in synthesized sounds, and ensures that the synthesized sounds are coherence to meet the requirements for target sound quality.

附图说明Description of the drawings

为了更清楚地说明本实施例中的技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图是本发明的一个实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to explain the technical solution in this embodiment more clearly, the drawings needed to be used in the description of the embodiment will be briefly introduced below. Obviously, the drawing in the following description is an embodiment of the present invention. For those of ordinary skill in the art, other drawings can also be obtained based on these drawings without exerting creative efforts.

图1为本发明实施例提供的一种声音粒子合成方法的流程图;Figure 1 is a flow chart of a sound particle synthesis method provided by an embodiment of the present invention;

图2为本发明实施例提供的一种声音粒子合成方法的示意图;Figure 2 is a schematic diagram of a sound particle synthesis method provided by an embodiment of the present invention;

图3为本发明实施例提供的一种声音粒子合成方法中通过“过渡音频”拼接相邻声音粒子的示意图;Figure 3 is a schematic diagram of splicing adjacent sound particles through "transition audio" in a sound particle synthesis method provided by an embodiment of the present invention;

图4为直接拼接相邻声音粒子的合成音的频谱图;Figure 4 is a spectrogram of a synthetic sound that directly splices adjacent sound particles;

图5为利用“过渡音频”为桥梁拼接相邻声音粒子的合成音的频谱图。Figure 5 is a spectrogram of a synthetic sound that uses "transition audio" to splice adjacent sound particles into a bridge.

其中,1-声音粒子1、2-过渡音频1、3-声音粒子2、4-过渡音频2、5-声音粒子3。Among them, 1-sound particle 1, 2-transition audio 1, 3-sound particle 2, 4-transition audio 2, 5-sound particle 3.

具体实施方式Detailed ways

本实施例提供一种声音粒子合成方法,包括以下步骤:This embodiment provides a sound particle synthesis method, which includes the following steps:

步骤1、采集原始声音样本。Step 1. Collect original sound samples.

具体的,选取目标车型,录制所述目标车型对应的音频信号作为所述原始声音样本。其中,所述音频信号包括但不限于发动机声音信息。Specifically, a target car model is selected, and the audio signal corresponding to the target car model is recorded as the original sound sample. Wherein, the audio signal includes but is not limited to engine sound information.

步骤2、根据所述原始声音样本,构建声音粒子样本库,并添加声音粒子的索引。Step 2: Construct a sound particle sample library based on the original sound sample, and add the index of the sound particle.

具体的,根据所述原始声音样本,定位截取标记点并提取声音粒子,构建所述声音粒子样本库。其中,所述声音粒子样本库中的声音粒子的首尾相位均为0。Specifically, according to the original sound sample, the interception mark point is located and the sound particles are extracted, and the sound particle sample library is constructed. Wherein, the first and last phases of the sound particles in the sound particle sample library are both 0.

所述添加声音粒子的索引的一种实现方式为:提取所述声音粒子样本库中的声音粒子片段,并将其顺序摆放;对每个声音粒子进行编号,并制成一个索引数组。One implementation method of adding the index of sound particles is to: extract the sound particle fragments in the sound particle sample library and place them in order; number each sound particle and create an index array.

步骤3、根据车况输入信息,基于所述声音粒子的索引读取所述声音粒子样本库中的声音粒子。Step 3: According to the vehicle condition input information, read the sound particles in the sound particle sample library based on the index of the sound particles.

具体的,根据车况输入信息获得对应的声音粒子的索引号,并根据索引号读取所述声音粒子样本库中的声音粒子。所述车况输入信息包括但不限于发动机转速信息、车速信息、扭矩信息。Specifically, the index number of the corresponding sound particle is obtained according to the vehicle condition input information, and the sound particles in the sound particle sample library are read according to the index number. The vehicle condition input information includes but is not limited to engine speed information, vehicle speed information, and torque information.

步骤4、根据所述索引读取的声音粒子,构建过渡音频;通过所述过渡音频拼接相邻的声音粒子,合成目标声音样本。Step 4: Construct transitional audio based on the sound particles read by the index; splice adjacent sound particles through the transitional audio to synthesize the target sound sample.

具体的,根据所述索引读取的声音粒子,构建过渡音频拼接相邻的声音粒子。其中,将音频信号和希尔伯特变换的函数表达式进行类比,得到音频信号函数表达式。通过埃尔米特插值算法得到过渡音频的相位信息、频率信息、幅值信息。根据所述音频信号函数表达式,以及所述过渡音频的相位信息、频率信息、幅值信息,构建所述过渡音频。Specifically, based on the sound particles read by the index, a transition audio is constructed to splice adjacent sound particles. Among them, the audio signal is compared with the function expression of the Hilbert transform to obtain the audio signal function expression. The phase information, frequency information, and amplitude information of the transition audio are obtained through the Hermitian interpolation algorithm. The transition audio is constructed according to the audio signal function expression and the phase information, frequency information, and amplitude information of the transition audio.

其中,所述音频信号函数表达式y(t)如下:Wherein, the audio signal function expression y(t) is as follows:

y(t)=A(t)sin(Φ(t))y(t)=A(t)sin(Φ(t))

式中,A(t)为音频信号的瞬时幅值、Φ(t)为音频信号的瞬时相位、为音频信号瞬时相位的导数、F(t)为音频信号的瞬时频率、C为积分常数、t2-t1为音频信号的时长。In the formula, A(t) is the instantaneous amplitude of the audio signal, Φ(t) is the instantaneous phase of the audio signal, is the derivative of the instantaneous phase of the audio signal, F(t) is the instantaneous frequency of the audio signal, C is the integration constant, and t2-t1 is the duration of the audio signal.

所述通过埃尔米特插值算法得到过渡音频的相位信息的实现方式为:设相邻的声音粒子分别为声音粒子i、声音粒子i+1,则用于拼接所述声音粒子i和所述声音粒子i+1的过渡音频记为过渡音频i;拼接后得到的合成音的瞬时相位曲线连续可导。The implementation method of obtaining the phase information of the transition audio through the Hermitian interpolation algorithm is as follows: assuming that the adjacent sound particles are sound particle i and sound particle i+1 respectively, then it is used to splice the sound particle i and the The transition audio of sound particle i+1 is recorded as transition audio i; the instantaneous phase curve of the synthesized sound obtained after splicing is continuously differentiable.

其中,所述声音粒子i对应的时间区间为ti~ti+1,所述声音粒子i在t i+1时刻的瞬时相位为Φ(ti+1);所述声音粒子i+1对应的时间区间为ti+2~ti+3,所述声音粒子i+1在ti+2时刻的瞬时相位为Φ(ti+2)。所述过渡音频i对应的时间区间为ti+1~ti+2;所述过渡音频i的左端点为所述声音粒子i在ti+1时刻的音频信号,所述过渡音频i的右端点为所述声音粒子i+1在ti+2时刻的音频信号。所述声音粒子i在ti+1时刻的瞬时相位的积分常数为C1;所述声音粒子i+1在ti+2时刻的瞬时相位的积分常数为C2。Wherein, the time interval corresponding to the sound particle i is t i ~ t i+1 , and the instantaneous phase of the sound particle i at time t i+1 is Φ(t i+1 ); the sound particle i+1 The corresponding time interval is t i+2 ~ t i+3 , and the instantaneous phase of the sound particle i+1 at time t i+2 is Φ(t i+2 ). The time interval corresponding to the transition audio i is ti +1 to ti +2 ; the left endpoint of the transition audio i is the audio signal of the sound particle i at time t i+1 , and the transition audio i The right endpoint is the audio signal of the sound particle i+1 at time t i+2 . The integral constant of the instantaneous phase of the sound particle i at time t i+1 is C1; the integral constant of the instantaneous phase of the sound particle i+1 at time t i+2 is C2.

预设ti为0,C1=0,根据所述过渡音频i左端点的瞬时相位Φ(ti+1)、左右端点瞬时相位的一阶导数和/>左右端点瞬时相位的二阶导数/>和/>得到四次Hermite插值法求解过渡音频i相位的表达式Φ(t)如下:Preset ti is 0, C1=0, according to the instantaneous phase Φ(t i+1 ) of the left endpoint of the transition audio i and the first-order derivative of the instantaneous phase of the left and right endpoints and/> The second derivative of the instantaneous phase of the left and right endpoints/> and/> The expression Φ(t) for solving the phase of transition audio i using the quartic Hermite interpolation method is obtained as follows:

Φ(t)=at4+bt3+ct2+dt+eΦ(t)=at 4 +bt 3 +ct 2 +dt+e

式中,a、b、c、d、e为待求的多项式系数。In the formula, a, b, c, d, and e are the polynomial coefficients to be found.

采用第一插值条件,得到第一线性方程组表示为:Using the first interpolation condition, the first linear equation system is obtained as:

其中,rank(A|y)=rank(A)=5,则埃尔米特插值的多项式系数有唯一解,可得过渡音频相位的表达式Φ(t),同时求得C2=2π*Φ(ti+2)。Among them, rank(A|y)=rank(A)=5, then the polynomial coefficient of Hermitian interpolation has a unique solution, and the expression of the transition audio phase Φ(t) can be obtained, and C2=2π*Φ can be obtained. (t i+2 ).

根据所述第一插值条件求得的C2得到的sin(C2)不为0时,采用第二插值条件替换所述第一插值条件,采用第二线性方程组替换所述第一线性方程组;所述第二线性方程组表示为:When sin(C2) obtained from C2 obtained according to the first interpolation condition is not 0, the second interpolation condition is used to replace the first interpolation condition, and the second linear equation set is used to replace the first linear equation set; The second system of linear equations is expressed as:

其中,rank(A|y)=rank(A)<5,则插值系数向量X有无数解,从中获取一个满足sin(C1)=0且sin(C2)=0的过渡音频相位的表达式Φ(t)。Among them, rank(A|y)=rank(A)<5, then the interpolation coefficient vector X has countless solutions, from which an expression Φ of the transition audio phase satisfying sin(C1)=0 and sin(C2)=0 is obtained. (t).

由于音频信号的瞬时频率是音频信号瞬时相位的导数,因此根据相位信息可以得到频率信息。此外,所述过渡音频的频率信息满足以下约束条件:所述过渡音频的频率介于左右两端点的频率之间;所述过渡音频的频率单调。Since the instantaneous frequency of the audio signal is the derivative of the instantaneous phase of the audio signal, frequency information can be obtained based on the phase information. In addition, the frequency information of the transition audio satisfies the following constraints: the frequency of the transition audio is between the frequencies of the left and right end points; the frequency of the transition audio is monotonic.

所述过渡音频的幅值满足连续条件;根据所述声音粒子i和所述声音粒子i+1在拼接点处的幅值A(ti+1)和A(ti+2),插值得到所述过渡音频的幅值信息。具体的,此处采用线性插值即可实现,根据两端点的幅值信息,插值得到过渡音频的幅值信息,合成音的幅值曲线仅需满足连续条件即可。The amplitude of the transition audio satisfies the continuity condition; according to the amplitudes A(t i+1 ) and A(t i+2 ) of the sound particle i and the sound particle i+1 at the splicing point, interpolation is obtained Amplitude information of the transition audio. Specifically, linear interpolation can be used here. According to the amplitude information of the two end points, the amplitude information of the transition audio is obtained by interpolation. The amplitude curve of the synthetic sound only needs to meet the continuity condition.

与上述合成方法对应,本实施例还提供一种声音粒子合成装置,包括:Corresponding to the above synthesis method, this embodiment also provides a sound particle synthesis device, including:

采集单元,用于采集获取原始声音样本;A collection unit is used to collect and obtain original sound samples;

声音粒子样本库构建单元,用于根据所述原始声音样本,得到声音粒子样本库;A sound particle sample library construction unit is used to obtain a sound particle sample library based on the original sound sample;

索引单元,用于基于所述声音粒子样本库,得到声音粒子的索引;An indexing unit, used to obtain the index of sound particles based on the sound particle sample library;

声音粒子读取单元,用于根据车况输入信息,基于所述声音粒子的索引读取所述声音粒子样本库中的声音粒子;A sound particle reading unit is used to input information according to the vehicle condition and read the sound particles in the sound particle sample library based on the index of the sound particles;

过渡音频构建单元,用于根据所述索引读取的声音粒子,得到过渡音频;A transition audio building unit, used to obtain transition audio based on the sound particles read by the index;

目标声音样本合成单元,用于通过所述过渡音频拼接相邻的声音粒子,合成得到目标声音样本;A target sound sample synthesis unit is used to splice adjacent sound particles through the transition audio to synthesize the target sound sample;

所述声音粒子合成装置用于实现上述声音粒子合成方法。The sound particle synthesis device is used to implement the above sound particle synthesis method.

为了更好的理解上述技术方案,下面将结合说明书附图以及具体的实施方式对上述技术方案进行详细的说明。In order to better understand the above technical solution, the above technical solution will be described in detail below with reference to the accompanying drawings and specific implementation modes.

本实施例提供了一种声音粒子合成方法,如图1、图2所示,主要包括以下步骤:This embodiment provides a sound particle synthesis method, as shown in Figures 1 and 2, which mainly includes the following steps:

步骤1、采集原始声音样本。Step 1. Collect original sound samples.

采集原始声音样本包括:针对顾客对声音品质的期望,选取市面上相匹配的车型作为目标车型,录制其车内或发动机声音。Collecting original sound samples includes: targeting customers' expectations for sound quality, selecting matching models on the market as target models, and recording their interior or engine sounds.

一种具体的录制方法是:在发动机舱或车内合理布置一个微型传声器,然后挂空档运行发动机,缓慢踩下油门,让发动机转速从怠速缓慢加速到发动机能达到的最高转速,以获取足够长的声音样本用于声音样本库的制备。A specific recording method is to reasonably arrange a micro microphone in the engine compartment or car, then run the engine in neutral, slowly step on the accelerator, and let the engine speed slowly accelerate from idle to the highest speed the engine can reach, in order to obtain enough Long sound samples are used for sound sample library preparation.

步骤2、构建声音粒子样本库。Step 2. Build a sound particle sample library.

声音粒子样本库的制备包括:主谐波的提取、声音样本截取标记点的定位与选择以及声音粒子的提取。The preparation of the sound particle sample library includes: extraction of main harmonics, positioning and selection of sound sample interception mark points, and extraction of sound particles.

主要技术要点包括时变窗函数的设计,以满足声音粒子切割大小随工况的变化,一旦从截取标记点列表中确定截取标记,将使用一个以截取标记为中心的时间窗来提取声音粒子。实现将录制好的音频信号切割成首尾相位均为0的声音粒子,并保存为声音粒子样本库。The main technical points include the design of the time-varying window function to meet the changes in the sound particle cutting size with the working conditions. Once the interception mark is determined from the interception mark point list, a time window centered on the interception mark will be used to extract the sound particles. It is possible to cut the recorded audio signal into sound particles whose first and last phases are both 0, and save them as a sound particle sample library.

步骤3、声音粒子的索引。Step 3. Index of sound particles.

声音粒子的索引包括:提取声音粒子片段并将其顺序摆放,对每个声音粒子进行编号制成一个索引数组。并通过获得发动机转速、车速、扭矩等输入参数计算对应声音粒子的索引号,根据索引号读取声音样本库中的声音粒子。The indexing of sound particles includes: extracting sound particle fragments and placing them in order, and numbering each sound particle to create an index array. And by obtaining input parameters such as engine speed, vehicle speed, and torque, the index number of the corresponding sound particle is calculated, and the sound particles in the sound sample library are read according to the index number.

步骤4、构建过渡音频、合成目标声音样本。Step 4. Construct transition audio and synthesize target sound samples.

主要包括根据索引读取的声音粒子,构建过渡音频;通过过渡音频拼接相邻的声音粒子,合成目标声音样本。It mainly includes constructing transition audio based on the sound particles read by the index; splicing adjacent sound particles through the transition audio to synthesize the target sound sample.

例如,将声音粒子1、2、3用过渡音频1、2进行拼接,实现声音的合成,参看图3。For example, sound particles 1, 2, and 3 are spliced with transition audio 1 and 2 to achieve sound synthesis, see Figure 3.

具体的,步骤4主要包括以下步骤:Specifically, step 4 mainly includes the following steps:

(1)将音频信号和希尔伯特变换的函数表达式进行类比。(1) Compare the audio signal to the functional expression of Hilbert transform.

其中,希尔伯特变换(Hilbert变换)可以实现将一个窄带的一维的信号解析成二维复平面上的信号,复数的模和幅角代表了信号的幅度和相位,对于一个实信号x(t),其Hilbert变换为:Among them, Hilbert transform can analyze a narrow-band one-dimensional signal into a signal on a two-dimensional complex plane. The module and argument of the complex number represent the amplitude and phase of the signal. For a real signal x (t), its Hilbert transform is:

式中,*表示卷积运算,对应的解析信号为:In the formula, * represents the convolution operation, and the corresponding analytical signal is:

其中,对于载波频率为fs的窄带实信号x(t),利用所述的Hilbert变换可以求解解析信号,从而得到信号瞬时振幅a(t)和瞬时相位并可以利用瞬时相位的导数求解瞬时频率f(t),具体表达式如下:Among them, for the narrow-band real signal x(t) with the carrier frequency f s , the Hilbert transform can be used to solve the analytical signal, thereby obtaining the instantaneous amplitude a(t) and instantaneous phase of the signal. And the instantaneous phase derivative can be used to solve for the instantaneous frequency f(t). The specific expression is as follows:

因此,所述解析信号可以重新表达为:Therefore, the analytical signal can be re-expressed as:

式中, In the formula,

本发明根据上述对希尔波特变换的分析,将音频信号表达式和希尔伯特变换的解析式进行类比理解,设音频信号函数表达式为y(t),因此:Based on the above analysis of the Hilbert transform, the present invention understands the audio signal expression and the analytical formula of the Hilbert transform by analogy. Suppose the audio signal function expression is y(t), therefore:

y(t)=A(t)sin(Φ(t))y(t)=A(t)sin(Φ(t))

式中,C为积分常数、A(t)为音频信号的瞬时幅值(包络)、Φ(t)为音频信号的瞬时相位、F(t)为音频信号的瞬时频率,t2-t1为音频信号的时长。In the formula, C is the integration constant, A(t) is the instantaneous amplitude (envelope) of the audio signal, Φ(t) is the instantaneous phase of the audio signal, F(t) is the instantaneous frequency of the audio signal, t2-t1 is The duration of the audio signal.

(2)利用埃尔米特插值算法插值出“过渡音频”的瞬时相位。(2) Use the Hermitian interpolation algorithm to interpolate the instantaneous phase of the "transition audio".

所述音频信号的瞬时频率为瞬时相位的导数。因此令时间t=0时,对应的相位为0,则当对相邻声音粒子进行拼接时,第一段声音粒子的瞬时相位的积分常数C1=0,频率积分可由MATLAB函数CUMSUM完成。但令时间t=t2时,第二段声音粒子的瞬时相位的积分常数C2则无法确定,且C2应保证相邻声音粒子的瞬时相位曲线拼接后连续可导。The instantaneous frequency of the audio signal is the derivative of the instantaneous phase. Therefore, when time t=0, the corresponding phase is 0, then when adjacent sound particles are spliced, the integration constant C1 of the instantaneous phase of the first sound particle=0, and the frequency integration can be completed by the MATLAB function CUMSUM. However, when time t = t 2 , the integral constant C2 of the instantaneous phase of the second segment of sound particles cannot be determined, and C2 should ensure that the instantaneous phase curves of adjacent sound particles are continuously differentiable after splicing.

具体的,根据所述的第二段声音粒子的瞬时相位的积分常数C2的求解方法:以“过渡音频”连接的形式保证拼接后的声音粒子的瞬时相位曲线均连续可导,所述“过渡音频”的瞬时相位可由埃尔米特插值法插值得到。Specifically, according to the solution method of the integral constant C2 of the instantaneous phase of the sound particles in the second section: in the form of "transition audio" connection, it is ensured that the instantaneous phase curves of the spliced sound particles are all continuously differentiable, and the "transition audio" connection is used. The instantaneous phase of "audio" can be interpolated by the Hermitian interpolation method.

其中,根据所述的埃尔米特插值法:Among them, according to the Hermitian interpolation method:

假设我们已经得到了节点a≤x0<x1...<xn≤b,节点处的函数值h(xj)以及该点处的导数值h'(xj),则由这n+1个节点通过Hermite插值一个2n+1次与原函数近似的多项式H2n+1(x)。Assume that we have obtained the node a≤x 0 <x 1 ...<x n ≤b, the function value h(x j ) at the node and the derivative value h'(x j ) at the point, then from this n +1 nodes interpolate a polynomial H 2n +1 (x) of degree 2n+1 that approximates the original function through Hermite.

具体的,以“过渡音频”连接的形式,并利用埃尔米特插值法插值出“过渡音频”的瞬时相位求解方法包括:Specifically, the instantaneous phase solution method for interpolating the "transition audio" in the form of a "transition audio" connection and using the Hermitian interpolation method includes:

已知待插区间左端点的数值、两端点的一阶导数、以及两端点的二阶导数、因此可利用四次埃尔米特插值法求解瞬时相位的表达式Φ(t),设:It is known that the value of the left endpoint of the interval to be interpolated, the first-order derivative of the two endpoints, and the second-order derivative of the two endpoints are known. Therefore, the fourth-order Hermitian interpolation method can be used to solve the instantaneous phase expression Φ(t). Suppose:

Φ(t)=at4+bt3+ct2+dt+eΦ(t)=at 4 +bt 3 +ct 2 +dt+e

式中,a、b、c、d、e为待求的多项式系数,因此可得线性方程组:In the formula, a, b, c, d, and e are the polynomial coefficients to be found, so a system of linear equations can be obtained:

其中,rank(A|y)=rank(A)=5,埃尔米特插值的多项式系数有唯一解,此时可得到过渡音频相位的表达式Φ(t),同时求得C2=2π*Φ(t2)。Among them, rank(A|y)=rank(A)=5, the polynomial coefficient of Hermitian interpolation has a unique solution. At this time, the expression of the transition audio phase Φ(t) can be obtained, and C2=2π* can be obtained at the same time. Φ(t 2 ).

某些情况下,根据所述的利用四次埃尔米特插值法求解求出的C2未必能保证sin(C2)=0。此时可以适当放宽埃尔米特插值系数方程组的插值条件,如去掉某个已知条件,则线性方程组变为:In some cases, the C2 obtained by using the quartic Hermitian interpolation method may not guarantee sin(C2)=0. At this time, the interpolation conditions of the Hermitian interpolation coefficient equations can be appropriately relaxed. If a certain known condition is removed, the linear equations become:

具体的,此时rank(A|y)=rank(A)<5,则插值系数向量X有无数解,可从解中获取一个满足sin(C1)=0且sin(C2)=0要求的解(即得到过渡音频相位的表达式Φ(t))。Specifically, at this time rank(A|y)=rank(A)<5, then the interpolation coefficient vector X has countless solutions, and one can be obtained from the solutions that satisfies the requirements of sin(C1)=0 and sin(C2)=0 Solution (that is, obtain the expression Φ(t) of the transition audio phase).

注意到第二个声音粒子的相位信号的初始相位为0,需保证C2=2π*Φ(t2)为π的整数倍,即Φ(t2)∈N,N为整数。由得基础解析U,在搜寻合适的基础解系系数时,可以通过最优化方法,令Φ(t2)∈N尽可能接近整数。Note that the initial phase of the phase signal of the second sound particle is 0, and it is necessary to ensure that C2=2π*Φ(t 2 ) is an integer multiple of π, that is, Φ(t 2 )∈N, N is an integer. Depend on Obtain the basic analysis U. When searching for appropriate basic solution coefficients, we can use optimization methods to make Φ(t 2 )∈N as close to an integer as possible.

(3)根据音频信号函数表达式,利用插值得到的瞬时相位,构建“过渡音频”。(3) According to the audio signal function expression, use the instantaneous phase obtained by interpolation to construct "transition audio".

其中,根据所述的通过埃尔米特插值求出“过渡音频”的瞬时相位,并利用音频信号函数表达式y(t)=A(t)sin(Φ(t))构建“过渡音频”。Among them, the instantaneous phase of the "transition audio" is obtained through Hermitian interpolation, and the "transition audio" is constructed using the audio signal function expression y(t)=A(t)sin(Φ(t)) .

此外,为保证过渡段频率不发生突变,优选添加适当的约束条件如下:In addition, in order to ensure that the frequency of the transition section does not mutate, it is best to add appropriate constraints as follows:

约束条件1:过渡段(即过渡音频)的频率介于两端点(即过渡音频左右两端拼接的两个声音粒子)的频域之间;约束条件2:过渡段的频率单调。Constraint 1: The frequency of the transition section (i.e., the transition audio) is between the frequency domains of the two end points (i.e., the two sound particles spliced at the left and right ends of the transition audio); Constraint 2: The frequency of the transition section is monotonic.

(4)根据构建的“过渡音频”,合成目标声音样本。(4) Based on the constructed "transition audio", the target sound sample is synthesized.

如图3所示,根据构建的过渡音频1和2,以此为桥梁分别连接声音粒子1和声音粒子2,连接声音粒子2和声音粒子3,实现声音粒子的合成。As shown in Figure 3, based on the constructed transition audio 1 and 2, this is used as a bridge to connect sound particles 1 and 2, and to connect sound particles 2 and 3, respectively, to achieve the synthesis of sound particles.

即本实施例提供了一种声音粒子合成方法,根据录制的音频信号,定位截取标记点并提取声音粒子样本,构建声音粒子样本库。根据所述声音粒子样本库,在声音合成阶段,将声音信号和希尔伯特变换的函数表达式进行类比,通过埃尔米特插值算法插值出声音信号的瞬时相位和瞬时频率,通过插值算法得到瞬时幅值,生成“过渡函数”。所述“过渡函数”保证相邻音频粒子在瞬时幅值、频率、相位连续可导,并利用“过渡函数”合成“过渡音频”,以此为桥梁拼接相邻声音粒子,合成目标声音样本。That is, this embodiment provides a sound particle synthesis method. According to the recorded audio signal, the interception mark points are located and sound particle samples are extracted to construct a sound particle sample library. According to the sound particle sample library, in the sound synthesis stage, the sound signal is compared with the functional expression of the Hilbert transform, and the instantaneous phase and instantaneous frequency of the sound signal are interpolated through the Hermitian interpolation algorithm. Instantaneous amplitude, generating a "transition function". The "transition function" ensures that adjacent audio particles are continuously differentiable in instantaneous amplitude, frequency, and phase, and uses the "transition function" to synthesize "transition audio", which serves as a bridge to splice adjacent sound particles and synthesize target sound samples.

为了验证本发明提出的一种声音粒子合成方法的有效性和优越性,本发明构建声音粒子1和声音粒子2,并将这两个声音粒子分别进行直接合成和利用拼接优化算法构建的“过渡音频”进行合成(即利用本发明提供的方法进行合成),得到两种拼接方式合成音的频谱。In order to verify the effectiveness and superiority of a sound particle synthesis method proposed by the present invention, the present invention constructs sound particles 1 and 2, and directly synthesizes these two sound particles and uses a splicing optimization algorithm to construct a "transition" "Audio" is synthesized (that is, synthesized using the method provided by the present invention), and the spectrum of the synthesized sound in two splicing methods is obtained.

如图4所示,未使用“过渡音频”直接将两个单频声音粒子进行合成,可以明显看出在拼接处,声波在拼接点处出现了冲击,当播放合成音时,听到令人不悦的“爆音”。如图5所示,利用本发明提出的声音粒子合成方法构建的“过渡音频”,连接相邻的两个声音粒子,能够有效的消除拼接点处的冲击,保证声波的连贯性,避免声音播放出现卡顿。As shown in Figure 4, two single-frequency sound particles are directly synthesized without using "transition audio". It can be clearly seen that at the splicing point, the sound wave has an impact at the splicing point. When the synthesized sound is played, an amazing sound is heard. Displeased "pop". As shown in Figure 5, the "transitional audio" constructed using the sound particle synthesis method proposed by the present invention connects two adjacent sound particles, which can effectively eliminate the impact at the splicing point, ensure the continuity of the sound wave, and avoid sound playback Stuttering occurs.

本发明实施例提供的一种声音粒子合成方法及装置至少包括如下技术效果:A sound particle synthesis method and device provided by embodiments of the present invention at least include the following technical effects:

(1)本发明通过构建过渡音频拼接相邻声音粒子,实现声音粒子的拼接自然,解决了现有声音粒子拼接过程中出现的“阶跃”现象,避免合成音中的“爆音”出现,保证了合成音的连贯性。(1) The present invention realizes the natural splicing of sound particles by constructing transitional audio to splice adjacent sound particles, solves the "step" phenomenon that occurs in the splicing process of existing sound particles, avoids the occurrence of "pops" in synthetic sounds, and ensures improves the coherence of synthesized sounds.

(2)本发明根据希尔伯特变换的物理意义,并结合埃尔米特插值算法,提出一种声音粒子合成方法,该方法根据录制的声音样本,利用SHC函数提取声音样本主谐波,定位截取标记并提取声音样本,构建声音粒子样本库,在声音合成阶段,将声音信号和Hilbert变换进行类比,通过Hermite插值算法插值出声音信号的瞬时相位和瞬时频率,通过插值算法得到瞬时幅值,生成“过渡函数”保证了相邻音频粒子在幅值、频率、相位连续可导,并利用“过渡函数”合成“过渡音频”,以此拼接相邻声音粒子,实现声音粒子的拼接自然。(2) Based on the physical meaning of the Hilbert transform and combined with the Hermitian interpolation algorithm, the present invention proposes a sound particle synthesis method. This method uses the SHC function to extract the main harmonics of the sound sample based on the recorded sound sample. Locate and intercept markers and extract sound samples to build a sound particle sample library. In the sound synthesis stage, the sound signal is compared with the Hilbert transform, and the instantaneous phase and frequency of the sound signal are interpolated through the Hermite interpolation algorithm. The instantaneous amplitude is obtained through the interpolation algorithm. , generating a "transition function" to ensure that adjacent audio particles are continuously differentiable in amplitude, frequency, and phase, and using the "transition function" to synthesize "transition audio" to splice adjacent sound particles to achieve natural splicing of sound particles.

(3)本发明提出的声音粒子合成方法与其他的声音合成算法(例如,文Mcaulay R等人提出的“死亡轨迹-新生轨迹”的频率追踪算法)相比,本发明的计算量小、效率高,优化效果更优,能够有效地消除拼接过程中产生的“阶跃”现象,保证合成音的连贯性。(3) Compared with other sound synthesis algorithms (for example, the "death trajectory-rebirth trajectory" frequency tracking algorithm proposed by Wen Mcaulay R et al.), the sound particle synthesis method proposed by the present invention has a small amount of calculation and high efficiency. High, the optimization effect is better, it can effectively eliminate the "step" phenomenon produced during the splicing process, and ensure the coherence of the synthesized sound.

(4)本发明提出的声音粒子合成方法适用范围广、简单快捷、灵活性强,保证了主动声音设计中对目标声音声品质的要求。(4) The sound particle synthesis method proposed by the present invention has a wide range of applications, is simple, fast, and highly flexible, and ensures the sound quality requirements of the target sound in active sound design.

最后所应说明的是,以上具体实施方式仅用以说明本发明的技术方案而非限制,尽管参照实例对本发明进行了详细说明,本领域的普通技术人员应当理解,可以对本发明的技术方案进行修改或者等同替换,而不脱离本发明技术方案的精神和范围,其均应涵盖在本发明的权利要求范围当中。Finally, it should be noted that the above specific embodiments are only used to illustrate the technical solutions of the present invention and are not limiting. Although the present invention has been described in detail with reference to examples, those of ordinary skill in the art will understand that the technical solutions of the present invention can be carried out. Modifications or equivalent substitutions without departing from the spirit and scope of the technical solution of the present invention shall be included in the scope of the claims of the present invention.

Claims (7)

1. A method of synthesizing sound particles, comprising the steps of:
step 1, collecting an original sound sample;
step 2, constructing a sound particle sample library according to the original sound sample, and adding indexes of sound particles;
step 3, according to the vehicle condition input information, reading the sound particles in the sound particle sample library based on the indexes of the sound particles;
step 4, constructing transition audio according to the sound particles read by the index; splicing adjacent sound particles through the transition audio frequency to synthesize a target sound sample;
in the step 4, adjacent sound particles of transition audio splicing are constructed according to the sound particles read by the index; the audio signal and the Hilbert transform function expression are subjected to analogy to obtain an audio signal function expression;
obtaining phase information and frequency information of transition audio through a Hermite interpolation algorithm, and obtaining amplitude information through the interpolation algorithm;
constructing the transition audio according to the audio signal function expression, and the phase information, the frequency information and the amplitude information of the transition audio;
the audio signal function expression y (t) is as follows:
y(t)=A(t)sin(Φ(t))
wherein A (t) is the instantaneous amplitude of the audio signal, phi (t) is the instantaneous phase of the audio signal,Is audio frequencyThe derivative of the instantaneous phase of the signal, F (t) being the instantaneous frequency of the audio signal, C being the integration constant, t2-t1 being the duration of the audio signal;
the implementation mode of obtaining the phase information of the transition audio through the Hermite interpolation algorithm is as follows:
if adjacent sound particles are respectively sound particles i and sound particles i+1, the transition audio used for splicing the sound particles i and the sound particles i+1 is recorded as transition audio i;
the instantaneous phase curve of the synthesized sound obtained after splicing is continuous and conductive;
wherein the time interval corresponding to the sound particle i is t i ~t i+1 The sound particles i are at t i+1 The instantaneous phase of the moment is phi (t i+1 ) The method comprises the steps of carrying out a first treatment on the surface of the The time interval corresponding to the sound particle i+1 is t i+2 ~t i+3 The sound particle i+1 is at t i+2 The instantaneous phase of the moment is phi (t i+2 );
The corresponding time interval of the transition audio i is t i+1 ~t i+2 The method comprises the steps of carrying out a first treatment on the surface of the The left end point of the transition audio i is t of the sound particle i i+1 An audio signal at a moment, wherein the right end point of the transition audio i is the sound particle i+1 at t i+2 An audio signal of time;
the sound particle i is at t i+1 The integral constant of the instantaneous phase at the moment is C1; the sound particle i+1 is at t i+2 The integral constant of the instantaneous phase at the moment is C2;
preset t i Is 0, c1=0, according to the instantaneous phase Φ (t i+1 ) First derivative of instantaneous phase of left and right endpointsAnd->Second derivative of left and right endpoint instantaneous phase +.>And->The expression Φ (t) for solving the phase of the transition audio i by four Hermite interpolation methods is obtained as follows:
Φ(t)=at 4 +bt 3 +ct 2 +dt+e
wherein a, b, c, d, e is a polynomial coefficient of the Hermite interpolation to be solved;
using a first interpolation condition, the first set of linear equations is expressed as:
wherein, rank (a|y) =rank (a) =5, the polynomial coefficient of the hermite interpolation has a unique solution, and the expression Φ (t) of the audio phase can be transited, and c2=2pi×Φ (t) i+2 );
When sin (C2) obtained by C2 obtained according to the first interpolation condition is not 0, replacing the first interpolation condition by adopting a second interpolation condition, and replacing the first linear equation set by adopting a second linear equation set; the second system of linear equations is expressed as:
where rank (a|y) =rank (a) < 5, there are innumerable solutions to the polynomial coefficients of the hermite interpolation, from which an expression Φ (t) is obtained that satisfies the transition audio phase of sin (C1) =0 and sin (C2) =0.
2. The sound particle synthesis method according to claim 1, wherein the implementation manner of step 1 is as follows: selecting a target vehicle type, and recording an audio signal corresponding to the target vehicle type as the original sound sample;
wherein the audio signal includes engine sound information.
3. The sound particle synthesis method according to claim 1, wherein the implementation manner of the step 2 is as follows: positioning a interception mark point and extracting sound particles according to the original sound sample, and constructing the sound particle sample library;
and the head-tail phases of the sound particles in the sound particle sample library are all 0.
4. The method of synthesizing sound particles according to claim 1, wherein in the step 2, the index of the added sound particles is implemented by: extracting sound particle fragments in the sound particle sample library, and sequentially placing the sound particle fragments; numbering each sound particle and making an index array;
in the step 3, obtaining the index number of the corresponding sound particle according to the vehicle condition input information, and reading the sound particle in the sound particle sample library according to the index number; the vehicle condition input information comprises engine rotation speed information, vehicle speed information and torque information.
5. The sound particle synthesis method according to claim 1, wherein the frequency information of the transition audio satisfies the following constraint:
the frequency of the transition audio frequency is between the frequencies of the left endpoint and the right endpoint;
the transition audio is monotonic in frequency.
6. The sound particle synthesis method according to claim 1, wherein the amplitude of the transition audio satisfies a continuous condition; according to the amplitude A (t i+1 ) And A (t) i+2 ) And interpolating to obtain the amplitude information of the transition audio.
7. An acoustic particle synthesizer, comprising:
the acquisition unit is used for acquiring an original sound sample;
the sound particle sample library construction unit is used for obtaining a sound particle sample library according to the original sound sample;
the index unit is used for obtaining indexes of the sound particles based on the sound particle sample library;
a sound particle reading unit for reading sound particles in the sound particle sample library based on the index of the sound particles according to the vehicle condition input information;
the transition audio construction unit is used for obtaining transition audio according to the sound particles read by the index;
the target sound sample synthesis unit is used for splicing adjacent sound particles through the transition audio frequency to synthesize a target sound sample;
the sound particle synthesizing apparatus is for realizing the sound particle synthesizing method according to any one of claims 1 to 6.
CN202010549930.8A 2020-06-16 2020-06-16 Sound particle synthesis method and device Active CN111798831B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010549930.8A CN111798831B (en) 2020-06-16 2020-06-16 Sound particle synthesis method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010549930.8A CN111798831B (en) 2020-06-16 2020-06-16 Sound particle synthesis method and device

Publications (2)

Publication Number Publication Date
CN111798831A CN111798831A (en) 2020-10-20
CN111798831B true CN111798831B (en) 2023-11-28

Family

ID=72802959

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010549930.8A Active CN111798831B (en) 2020-06-16 2020-06-16 Sound particle synthesis method and device

Country Status (1)

Country Link
CN (1) CN111798831B (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112700762B (en) * 2020-12-23 2022-10-04 武汉理工大学 Automobile sound synthesis method and device based on cylinder pressure signal
CN112863530B (en) * 2021-01-07 2024-08-27 广州欢城文化传媒有限公司 Sound work generation method and device
CN113990287B (en) * 2021-11-18 2025-03-07 腾讯音乐娱乐科技(深圳)有限公司 A speech synthesis method, device and storage medium
CN114758662B (en) * 2022-03-31 2025-02-28 上汽通用五菱汽车股份有限公司 Method and device for generating vehicle and motor audio data
CN115662470B (en) * 2022-12-12 2023-05-26 科大讯飞(苏州)科技有限公司 Audio particle extraction method, sound wave synthesis device, equipment and medium

Citations (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5890115A (en) * 1997-03-07 1999-03-30 Advanced Micro Devices, Inc. Speech synthesizer utilizing wavetable synthesis
AU2005207606A1 (en) * 2004-01-16 2005-08-04 Nuance Communications, Inc. Corpus-based speech synthesis based on segment recombination
KR20060084134A (en) * 2005-01-17 2006-07-24 엘지전자 주식회사 Pop noise canceling device and method
CN101101700A (en) * 2006-07-06 2008-01-09 中国船舶重工集团公司第七一五研究所 Underwater sound communications and alarming method and device
EP1944755A1 (en) * 2007-01-15 2008-07-16 France Télécom Modification of a voice signal
CN101661569A (en) * 2009-09-18 2010-03-03 北京科技大学 Intelligent emotional robot multi-modal behavioral associative expression system
CN102023074A (en) * 2010-11-10 2011-04-20 中国第一汽车集团公司 Performance analysis method for vibratory noise of automobile engine oil pan
WO2016042244A1 (en) * 2014-09-18 2016-03-24 Peugeot Citroen Automobiles Sa Sound synthesis device intended for the active coloration of the noise of a vehicle engine
CN108140396A (en) * 2015-09-22 2018-06-08 皇家飞利浦有限公司 Audio signal processing
AU2018247327B1 (en) * 2018-07-23 2018-12-06 Q-CTRL Pty Ltd Multi-qubit control
WO2019019801A1 (en) * 2017-07-24 2019-01-31 比亚迪股份有限公司 New energy vehicle and active noise reduction method and system therefor
CN109449909A (en) * 2018-12-21 2019-03-08 伏诺瓦(天津)科技有限公司 Interactive voice lightning protection box
WO2019144386A1 (en) * 2018-01-26 2019-08-01 大连理工大学 Method for predicting key performance parameters of aviation engine transition state
WO2019183543A1 (en) * 2018-03-23 2019-09-26 John Rankin System and method for identifying a speaker's community of origin from a sound sample
CN110718207A (en) * 2019-09-06 2020-01-21 中国第一汽车股份有限公司 Sound synthesis precision verification method for active sound production system and active sound production system

Patent Citations (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5890115A (en) * 1997-03-07 1999-03-30 Advanced Micro Devices, Inc. Speech synthesizer utilizing wavetable synthesis
AU2005207606A1 (en) * 2004-01-16 2005-08-04 Nuance Communications, Inc. Corpus-based speech synthesis based on segment recombination
KR20060084134A (en) * 2005-01-17 2006-07-24 엘지전자 주식회사 Pop noise canceling device and method
CN101101700A (en) * 2006-07-06 2008-01-09 中国船舶重工集团公司第七一五研究所 Underwater sound communications and alarming method and device
EP1944755A1 (en) * 2007-01-15 2008-07-16 France Télécom Modification of a voice signal
CN101661569A (en) * 2009-09-18 2010-03-03 北京科技大学 Intelligent emotional robot multi-modal behavioral associative expression system
CN102023074A (en) * 2010-11-10 2011-04-20 中国第一汽车集团公司 Performance analysis method for vibratory noise of automobile engine oil pan
WO2016042244A1 (en) * 2014-09-18 2016-03-24 Peugeot Citroen Automobiles Sa Sound synthesis device intended for the active coloration of the noise of a vehicle engine
CN108140396A (en) * 2015-09-22 2018-06-08 皇家飞利浦有限公司 Audio signal processing
WO2019019801A1 (en) * 2017-07-24 2019-01-31 比亚迪股份有限公司 New energy vehicle and active noise reduction method and system therefor
WO2019144386A1 (en) * 2018-01-26 2019-08-01 大连理工大学 Method for predicting key performance parameters of aviation engine transition state
WO2019183543A1 (en) * 2018-03-23 2019-09-26 John Rankin System and method for identifying a speaker's community of origin from a sound sample
AU2018247327B1 (en) * 2018-07-23 2018-12-06 Q-CTRL Pty Ltd Multi-qubit control
CN109449909A (en) * 2018-12-21 2019-03-08 伏诺瓦(天津)科技有限公司 Interactive voice lightning protection box
CN110718207A (en) * 2019-09-06 2020-01-21 中国第一汽车股份有限公司 Sound synthesis precision verification method for active sound production system and active sound production system

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
航空发动机整机动力学研究进展与展望;陈予恕;张华彪;;航空学报(第08期);全文 *

Also Published As

Publication number Publication date
CN111798831A (en) 2020-10-20

Similar Documents

Publication Publication Date Title
CN111798831B (en) Sound particle synthesis method and device
JP7160968B2 (en) Improved harmonic transposition based on subband blocks
CN102985970B (en) Improved magnitude response and temporal alignment in phase vocoder based bandwidth extension for audio signals
JP3528258B2 (en) Method and apparatus for decoding encoded audio signal
JP4396233B2 (en) Complex exponential modulation filter bank signal analysis method, signal synthesis method, program thereof, and recording medium thereof
JP2010249939A (en) Noise reducing device and noise determination method
Shao et al. Order tracking by discrete Gabor expansion
CN102257567A (en) Sound signal processing apparatus, sound encoding apparatus and sound decoding apparatus
US9805704B1 (en) Method and system for artificial reverberation using modal decomposition
CN106797526B (en) Apparatus for processing audio, method and computer readable recording medium
CN111739544A (en) Voice processing method, device, electronic device and storage medium
Wiesmeyr et al. Efficient algorithms for discrete Gabor transforms on a nonseparable lattice
CN102483943A (en) Multi-channel audio decoding method and apparatus thereof
JP4119112B2 (en) Mixed sound separator
Wright et al. Supporting the sound description interchange format in the max/msp environment
JP6087760B2 (en) Sound field recording / reproducing apparatus, method, and program
Timoney et al. Dynamic signal phase distortion using coefficient-modulated allpass filters
JP4170459B2 (en) Time-axis compression / expansion device for waveform signals
Savioja et al. Digital waveguide mesh for room acoustic modeling
Amatriain et al. Spectral modeling for higher-level sound transformations
RU2800676C1 (en) Improved harmonic transformation based on a block of sub-bands
Polotti et al. Fractal additive synthesis
Petrausch et al. Sound synthesis by physical modeling using the functional transformation method: Efficient implementations with polyphase-filterbanks
De Poli Sound synthesis by fractional waveshaping
Smith et al. Overview of the commuted piano synthesis technique

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant