CN107610691A - English vowel sounding error correction method and device

Info

Publication number: CN107610691A
Application number: CN201710803552.XA
Authority: CN
Inventors: 王红岩
Original assignee: Shenzhen University
Current assignee: Shenzhen University
Priority date: 2017-09-08
Filing date: 2017-09-08
Publication date: 2018-01-19
Anticipated expiration: 2037-09-08
Also published as: CN107610691B

Abstract

The present invention relates to an English vowel pronunciation error correction method, comprising: step 1, pre-storing a standard English vowel pronunciation acoustic model; step 2, recording the English speech of a test subject; step 3, identifying the vowels in the test subject's English speech; step 4, recording the test subject's speech reading the identified vowels aloud; step 5, performing English vowel pronunciation acoustic analysis on the test subject's speech reading the identified vowels aloud; step 6, comparing the test subject's English vowel pronunciation acoustic analysis data with the standard English vowel pronunciation acoustic model to obtain a first degree of deviation; step 7, correcting the test subject's English vowel pronunciation according to the first degree of deviation. By analyzing the acoustics of the test subject's English vowel pronunciation and comparing them with the pre-stored standard English vowel pronunciation acoustic model, the test subject's English vowel pronunciation is corrected and thereby made more accurate.

Description

English vowel pronunciation error correction method and device

Technical Field

The invention relates to the technical field of speech recognition, and in particular to an English vowel pronunciation error correction method and device.

Background

Language is a bridge for human communication, yet linguistic diversity is also an obstacle to it. English is the dominant and most frequently used lingua franca. "Varieties of English" with different pronunciation characteristics in different countries have become an obstacle in communication. Studying the features of English spoken against different native-language backgrounds is the key to breaking through communication barriers and improving speakers' English pronunciation. Recognizing a speaker's speech characteristics is an unavoidable step in recognizing the content of that speech. To improve speakers' English pronunciation and reduce communication barriers between people of different nationalities, it is particularly important to correct a speaker's English pronunciation when it deviates from the phonetic norms of the target language beyond a certain range.

When a vowel is pronounced, the airflow exhaled from the lungs passes through the oral cavity, which acts as a resonator, producing a speech sound with minimal obstruction and no friction noise. Although the vocal cords normally vibrate when vowels are pronounced, they can also be kept from vibrating, yielding a voiceless or whispered sound.

From the viewpoint of articulatory phonetics, vowels are usually classified by the position of the tongue and the shape of the lips. For high vowels the tongue body is raised close to the palate; for low vowels the tongue is relatively low and flat, with a somewhat larger distance between the tongue surface and the palate; for mid vowels the tongue is in an intermediate position. High, mid and low vowels are further classified as front or back. The position of the tongue surface and the shape of the lips thus serve as a criterion for classifying vowels.

According to articulatory phonetics, vowels are sounds formed when the airflow sets the vocal cords vibrating and meets no obstruction in the oral cavity; different shapes of the oral cavity produce different vowels. Consonants are formed when the airflow is obstructed in the oral cavity; different places or manners of articulation produce different consonants.

Summary of the Invention

The purpose of the present invention is to correct errors in a test subject's English vowel pronunciation, with English as the target language, by comparing the test subject's pronunciation with a standard English vowel pronunciation model.

In one aspect, the present invention provides an English vowel pronunciation error correction method, comprising: step 1, pre-storing a standard English vowel pronunciation acoustic model; step 2, recording the English speech of a test subject; step 3, identifying the vowels in the test subject's English speech; step 4, recording the test subject's speech reading the identified vowels aloud; step 5, performing English vowel pronunciation acoustic analysis on the test subject's speech reading the identified vowels aloud; step 6, comparing the test subject's English vowel pronunciation acoustic analysis data with the standard English vowel pronunciation acoustic model to obtain a first degree of deviation; step 7, correcting the test subject's English vowel pronunciation according to the first degree of deviation.

Step 1 comprises: recording the English speech of a plurality of standard-English sample subjects; identifying the vowels in the English speech of the plurality of standard-English sample subjects; performing English vowel pronunciation acoustic analysis on the vowels of each sample subject separately; and generating the standard English vowel pronunciation acoustic model from the results of the English vowel pronunciation acoustic analysis.

Step 2 comprises: providing speech material according to the nationality of the test subject, and recording the English speech of the test subject reading the speech material aloud.

Step 3 comprises: identifying the vowels in the test subject's English speech according to the formant values of the vowels.

Step 3 comprises: identifying the vowels in the test subject's English speech according to the formant values of the vowels and the durations of the vowels.

The formant values include a first formant value and a second formant value.

Step 7 comprises: guiding the adjustment of the test subject's English vowel pronunciation by means of a visual image, based on the English vowel pronunciation acoustic analysis data and the standard English vowel pronunciation acoustic model.

After step 7, the method further comprises: recording the test subject's speech reading the vowels aloud again; performing English vowel pronunciation acoustic analysis on the speech read aloud again by the test subject; comparing the English vowel pronunciation acoustic analysis data of the speech read aloud again with the standard English vowel pronunciation acoustic model to obtain a second degree of deviation; and outputting an English vowel pronunciation evaluation text for the test subject according to the first degree of deviation and the second degree of deviation.

The English vowel pronunciation acoustic analysis comprises: measuring the formant values of the recorded English vowel pronunciation; measuring the duration of the recorded English vowel pronunciation; and generating the English vowel pronunciation acoustic analysis data of the recording from the formant values and the duration.

The present invention also provides a storage device storing a plurality of instructions, the instructions being adapted to be loaded and executed by a processor to perform: step 1, pre-storing a standard English vowel pronunciation acoustic model; step 2, recording the English speech of a test subject; step 3, identifying the vowels in the test subject's English speech; step 4, recording the test subject's speech reading the identified vowels aloud; step 5, performing English vowel pronunciation acoustic analysis on the test subject's speech reading the identified vowels aloud; step 6, comparing the test subject's English speech pronunciation acoustic analysis data with the standard English vowel pronunciation acoustic model to obtain a first degree of deviation; step 7, correcting the test subject's English vowel pronunciation according to the first degree of deviation.

Step 2 comprises: providing speech material according to the nationality of the test subject, and recording the English speech of the test subject reading the speech material aloud.

Step 3 further comprises: identifying the vowels in the test subject's English speech according to the formant values of the vowels and the durations of the vowels.

After step 7, the instructions further perform: recording the test subject's speech reading the identified vowels aloud again; performing English vowel pronunciation acoustic analysis on the speech read aloud again by the test subject; comparing the English vowel pronunciation acoustic analysis data of the speech read aloud again with the standard English vowel pronunciation acoustic model to obtain a second degree of deviation; and outputting an English vowel pronunciation evaluation text for the test subject according to the first degree of deviation and the second degree of deviation.

The English vowel pronunciation acoustic analysis comprises: measuring the formant values of the recorded English vowel pronunciation; measuring the duration of the recorded vowel pronunciation; and generating the English vowel pronunciation acoustic analysis data of the recording from the formant values and the duration.

The present invention also provides an English vowel pronunciation error correction device, comprising: a processor adapted to execute instructions; and a storage device adapted to store a plurality of instructions, the instructions being adapted to be loaded and executed by the processor to perform: step 1, pre-storing a standard English vowel pronunciation acoustic model; step 2, recording the English speech of a test subject; step 3, identifying the vowels in the test subject's English speech; step 4, recording the test subject's speech reading the identified vowels aloud; step 5, performing English vowel pronunciation acoustic analysis on the test subject's speech reading the identified vowels aloud; step 6, comparing the test subject's English vowel pronunciation acoustic analysis with the standard English vowel pronunciation acoustic model to obtain a first degree of deviation; step 7, correcting the test subject's English vowel pronunciation according to the first degree of deviation.

Step 2 comprises: providing speech material according to the nationality of the sample subject, and recording the English speech of the sample subject reading the speech material aloud.

After step 7, the instructions further perform: recording the test subject's speech reading the identified vowels aloud again; performing English vowel pronunciation acoustic analysis on the speech read aloud again by the test subject; comparing the English vowel pronunciation acoustic analysis data of the speech read aloud again with the standard English vowel pronunciation acoustic model to obtain a second degree of deviation; and outputting an English vowel pronunciation evaluation text for the test subject according to the first degree of deviation and the second degree of deviation.

The beneficial effect of the present invention is that, by analyzing the acoustics of the test subject's English vowel pronunciation and comparing them with the pre-stored standard English vowel pronunciation acoustic model, the test subject's English vowel pronunciation is corrected and thereby made more accurate.

Brief Description of the Drawings

The present invention is further described below with reference to the accompanying drawings and embodiments. In the drawings:

FIG. 1 is a flowchart of an English vowel pronunciation error correction method 100 according to an embodiment of the present invention;

FIG. 2 is a flowchart of a method 200 for selecting sample subjects when forming a standard English vowel pronunciation model according to an embodiment of the present invention;

FIG. 3 is a flowchart of a method 300 for forming a standard English vowel pronunciation model according to an embodiment of the present invention;

FIG. 4 is a visual English vowel pronunciation error correction chart for the English vowel pronunciation error correction method according to an embodiment of the present invention;

FIG. 5 is a flowchart of an English vowel pronunciation error correction method 500 according to another embodiment of the present invention;

FIG. 6 is a flowchart of an English vowel pronunciation acoustic analysis method 600 according to an embodiment of the present invention;

FIG. 7 shows English vowel pronunciation acoustic analysis charts of male and female test subjects from different countries, generated with the English vowel pronunciation acoustic analysis method 600 of FIG. 6 according to an embodiment of the present invention;

FIG. 8 shows a standard English vowel pronunciation model chart generated with the English vowel pronunciation acoustic analysis method 600 of FIG. 6 according to an embodiment of the present invention.

Detailed Description

Preferred embodiments of the present invention are now described in detail with reference to the accompanying drawings.

FIG. 1 shows a flowchart of an English vowel pronunciation error correction method 100 according to an embodiment of the present invention.

Step 101: a standard English vowel pronunciation model is first pre-stored. For example, the vowel pronunciation model of sample subjects who are native English speakers may be saved as the standard English vowel pronunciation model. In the present invention, a sample subject is a speaker selected for forming the standard English vowel pronunciation model.

Step 103: the English speech of the test subject is recorded. In specific embodiments, speech material may be provided for the test subject to read aloud, or the test subject may read aloud any other English words or sentences. When the test subject reads from speech material, that material may be the same as, or different from, the material used to build the standard English vowel pronunciation model.

Step 105: the vowels in the recorded English speech of the test subject are identified.

Step 107: the test subject's speech reading the identified vowels aloud is recorded.

Step 109: English vowel pronunciation acoustic analysis is performed on the recorded speech of the test subject reading the identified vowels aloud, yielding the vowel pronunciation acoustic analysis data.

Step 111: the resulting English vowel pronunciation acoustic analysis data are compared with the pre-stored standard English vowel pronunciation model to obtain a first degree of deviation.

Step 113: the test subject's English vowel pronunciation is corrected according to the first degree of deviation.

In a specific embodiment, after the English speech read aloud by a test subject has been recorded, the vowels in that speech are first identified; the test subject's speech reading the identified vowels aloud is then recorded and subjected to English vowel pronunciation acoustic analysis. Comparing the resulting acoustic analysis data with the pre-stored standard English vowel pronunciation model reveals the difference between the test subject's English vowel pronunciation and the standard pronunciation, and the test subject's pronunciation can then be corrected according to that difference.
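The comparison in step 111 can be sketched as follows. The source does not define how the degree of deviation is calculated, so this sketch assumes it is the Euclidean distance between the measured and the standard vowel in the F1/F2 plane, with both formants already expressed in Bark. The function name `degree_of_deviation` and the dictionary layout are hypothetical.

```python
import math

def degree_of_deviation(measured, standard):
    """Return a per-vowel deviation between a speaker and the standard model.

    Both arguments map a vowel label to an (F1, F2) pair in Bark.
    ASSUMPTION: deviation is the Euclidean distance in the F1/F2 plane;
    the patent text does not specify the metric.
    """
    return {
        vowel: math.hypot(f1 - standard[vowel][0], f2 - standard[vowel][1])
        for vowel, (f1, f2) in measured.items()
        if vowel in standard  # vowels missing from the model are skipped
    }

# Illustrative call with made-up numbers (not measurements from the patent).
first_deviation = degree_of_deviation({"u:": (4.0, 9.0)}, {"u:": (3.0, 9.0)})
```

A per-vowel result keeps the correction targeted: only vowels whose deviation is large need to be practiced again.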

In one embodiment, standard-English sample subjects must be selected to build the standard English vowel pronunciation acoustic model; this can be done with the method of FIG. 2.

Step 201: speech material is selected. The present invention takes English as the target language and vowels as the main object of study; the corpus covers all English vowels as they appear in word structures and sentence structures. The sentence structures include all five simple sentence patterns of English. Semantically, the material covers predictable and unpredictable sentences, and the predictable sentences include both high-predictability and low-predictability sentences. All words are high-frequency words, yet together they cover all English vowels. The speech material may be designed by world-renowned phoneticians. Whether a sentence is predictable affects the perception of its vowels. For example, take the complete sentence "spread some butter on the bread": even if only "spread some butter on the" is uttered, without "bread", or if "butter" is not recognized, listeners usually still know that the word is "butter" or "bread". In other words, the sentence is predictable, so the word can still be recognized.

Step 203: standard-English sample subjects are selected to read the speech material aloud, and a speech corpus is built. In practice, adult native speakers of English in California, USA may be selected as the standard-English sample subjects; once selected, each of them reads the chosen speech material aloud and is recorded to form the speech corpus.

Step 205: representative standard-English sample subjects are selected. Listeners in the United States with the same language background as the sample subjects may be recruited for a mid-stage perception test, and the male and female speakers rated at the median in that test are selected as the most representative sample subjects.

Step 207: an overall English vowel pronunciation acoustic analysis is performed on the most representative sample subjects selected by the mid-stage perception test, forming the standard English vowel pronunciation acoustic model.

In one embodiment, after the most representative group of standard-English speakers has been selected with the method of FIG. 2, the standard English vowel pronunciation acoustic model can be generated with the method of FIG. 3.

Step 301: the English speech of a plurality of standard-English sample subjects is recorded. The recorded speech forms the speech material; natural connected speech contains more than one vowel, so this yields the basic vowel data.

Step 303: the vowels in the English speech of the plurality of standard-English sample subjects are identified. In one embodiment, the vowels in the speech may be identified from their formant values; to further improve recognition accuracy, they may also be identified from the formant values combined with the vowel durations. In specific embodiments, the formant values may be the first formant value and the second formant value of the vowel. The first formant value F1 represents the lip dimension, that is, the vertical dimension of articulation; the second formant value F2 represents the tongue dimension, that is, the front-back dimension of articulation.

Step 305: English vowel pronunciation acoustic analysis is performed on the vowels in the speech of each standard-English sample subject.

Step 307: the standard English vowel pronunciation acoustic model is generated from the English vowel pronunciation acoustic analysis data of the plurality of standard-English sample subjects obtained in step 305. In specific embodiments, the characteristics of an individual vowel differ from one sample subject to another, but each person's front, back, high and low vowels fall within a certain range. Exploiting this property, each speaker's vowels are measured as a whole, and the speaker's basic characteristics are then determined from the range of the vowels.
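One way to picture step 307 is to pool the per-speaker measurements and reduce them to one reference point per vowel. The source does not state how the individual analyses are combined, so the averaging below is an assumption, and `build_standard_model` and its data layout are hypothetical names chosen for illustration.

```python
from statistics import mean

def build_standard_model(speakers):
    """Combine per-speaker vowel measurements into one reference model.

    `speakers` is a list of dicts, one per sample subject, each mapping a
    vowel label to a tuple (F1 in Bark, F2 in Bark, duration in seconds).
    ASSUMPTION: the model stores the arithmetic mean of each quantity;
    the patent text does not specify the aggregation.
    """
    pooled = {}
    for measurements in speakers:
        for vowel, values in measurements.items():
            pooled.setdefault(vowel, []).append(values)
    # zip(*rows) regroups the tuples column by column: all F1s, all F2s, all durations.
    return {
        vowel: tuple(mean(column) for column in zip(*rows))
        for vowel, rows in pooled.items()
    }

# Illustrative input with made-up numbers (not measurements from the patent).
model = build_standard_model([
    {"i:": (3.0, 13.0, 0.20)},
    {"i:": (4.0, 14.0, 0.40)},
])
```

A fuller model could also keep the spread of each vowel, which would match the source's remark that a speaker's vowels fall within a certain range.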

In one embodiment, speech material may be provided according to the nationality of the test subject, and the English speech of the test subject reading that material aloud is recorded. For example, for Chinese speakers the vowels with a high error rate are /i:～ /, /ε～ / and /u:～ /, so more speech material containing these vowels can be provided. More of the correction effort then goes to the vowels the test subject often gets wrong, which makes the correction of English vowel pronunciation more targeted.

In one embodiment, correcting the test subject's English vowel pronunciation according to the first degree of deviation may comprise: guiding the adjustment of the test subject's English vowel pronunciation by means of a visual image, based on the English vowel pronunciation acoustic analysis data and the standard English vowel pronunciation acoustic model. FIG. 4 shows a visual English vowel pronunciation error correction chart for the English vowel pronunciation error correction method according to an embodiment of the present invention. For example, when the English vowel pronunciation of a Chinese speaker is corrected, the speech material provided is "a good looking woman". When the error correction method of the present invention is applied, the dots in FIG. 4 mark the coordinates of the first and second formants of the Chinese speaker's vowels, and the triangles mark the coordinates of the first and second formants of the standard pronunciation. This visual comparison shows the speaker more intuitively how to adjust his or her English vowel pronunciation.

FIG. 5 is a flowchart of an English vowel pronunciation error correction method 500 according to another embodiment of the present invention. Steps 501 to 513 are the same as steps 101 to 113 in FIG. 1. Step 515: the test subject's speech reading the identified vowels aloud is recorded again. Step 517: English vowel pronunciation acoustic analysis is performed on the speech read aloud again by the test subject. Step 519: the English vowel pronunciation acoustic analysis data obtained in step 517 are compared with the standard English vowel pronunciation acoustic model to obtain a second degree of deviation. Step 521: an English vowel pronunciation evaluation text for the test subject is output according to the first degree of deviation and the second degree of deviation.
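Step 521 can be sketched as a function that turns the two degrees of deviation into a short evaluation text. The source does not prescribe the wording or any threshold, so the `tolerance` parameter, the verdict phrases and the function name `evaluation_text` are all assumptions made for illustration, with deviations taken to be distances in Bark.

```python
def evaluation_text(vowel, first_deviation, second_deviation, tolerance=0.5):
    """Summarize progress on one vowel from two degrees of deviation.

    ASSUMPTIONS: deviations are non-negative distances in Bark, and a vowel
    counts as acceptable when its deviation is at most `tolerance`; neither
    the threshold nor the wording comes from the patent text.
    """
    if second_deviation <= tolerance:
        verdict = "within tolerance of the standard model"
    elif second_deviation < first_deviation:
        verdict = "improved but still outside tolerance"
    else:
        verdict = "not improved; further practice is suggested"
    return (
        f"/{vowel}/: deviation {first_deviation:.2f} -> "
        f"{second_deviation:.2f} Bark, {verdict}"
    )

# Illustrative call with made-up deviations.
report = evaluation_text("u:", 1.5, 0.4)
```

In a fuller implementation the text would be accompanied by the original and corrected analysis charts that the source describes as part of the evaluation.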

In specific embodiments, the English vowel pronunciation evaluation text may include information such as the test subject's original English vowel pronunciation acoustic analysis chart and the chart obtained after correction. This helps test subjects understand their own English pronunciation and the problems that still need correcting, so that they can carry out purposeful pronunciation practice suited to their own characteristics and improve their English pronunciation.

When generating the standard English vowel pronunciation acoustic model, or when performing English vowel pronunciation acoustic analysis on a test subject, the English vowel pronunciation acoustic analysis method of FIG. 6 can be used.

Step 601: English speech is first recorded. When the standard English vowel pronunciation acoustic model is being generated, the speech recorded is that of sample subjects with standard English pronunciation; when a test subject is being analyzed, the speech recorded is that of the test subject.

Step 603: the vowels in the English speech are identified. In one embodiment, the vowels in the speech may be identified from their formant values; to further improve recognition accuracy, they may also be identified from the formant values combined with the vowel durations. In specific embodiments, the formant values may be the first formant value and the second formant value of the vowel.
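The identification in step 603 can be illustrated with a nearest-reference lookup. The source only says that formant values, optionally combined with duration, are used; the nearest-neighbor rule, the `duration_weight` scaling and every number in `REFERENCE` below are assumptions invented for this sketch, not values from the patent.

```python
import math

# PLACEHOLDER reference table: vowel label -> (F1 in Bark, F2 in Bark, duration in s).
# These numbers are invented for illustration and are not measured data.
REFERENCE = {
    "i:": (3.0, 13.5, 0.24),
    "I": (4.0, 12.5, 0.12),
    "u:": (3.2, 8.0, 0.23),
}

def identify_vowel(f1, f2, duration=None, duration_weight=10.0):
    """Return the reference vowel closest to the measured segment.

    ASSUMPTION: closeness is Euclidean distance over F1 and F2, extended by
    a weighted duration difference when `duration` is given.
    """
    def distance(ref):
        d = math.hypot(f1 - ref[0], f2 - ref[1])
        if duration is not None:
            d = math.hypot(d, duration_weight * (duration - ref[2]))
        return d

    return min(REFERENCE, key=lambda vowel: distance(REFERENCE[vowel]))
```

With these placeholder values, a segment measured at F1 = 3.5 and F2 = 13.0 lies equally far from /i:/ and /I/ in the formant plane, and supplying a duration of 0.12 s is what tips the decision toward /I/. This is the kind of case in which the source says duration improves recognition accuracy.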

Step 605: the first formant value F1 and the second formant value F2 of the vowel are measured. F1 and F2 of a vowel are not linearly related; in specific embodiments, the formant values in hertz may be converted to Bark values using the following formula:

Bark = [(26.81 × F) / (1960 + F)] − 0.53, where F is the formant value in hertz.
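As a worked example, the conversion formula above translates directly into a small function. The function name `hz_to_bark` is chosen here for illustration; the constants are those given in the formula.

```python
def hz_to_bark(freq_hz: float) -> float:
    """Convert a formant value in hertz to the Bark scale.

    Implements Bark = (26.81 * F) / (1960 + F) - 0.53 from the text.
    """
    return (26.81 * freq_hz) / (1960.0 + freq_hz) - 0.53

# At F = 1960 Hz the fraction is 26.81 / 2 = 13.405, so the result is 12.875 Bark.
bark_at_1960 = hz_to_bark(1960.0)
```

Because the denominator grows with F, equal steps in hertz map to progressively smaller steps in Bark at higher frequencies.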

Step 607: the duration of the vowel is measured.

Step 609: the English vowel pronunciation acoustic analysis data are generated from the data measured in steps 605 and 607.

When the method is used to generate the standard English vowel pronunciation acoustic model, English vowel pronunciation acoustic analysis is performed on a plurality of standard-English sample subjects, and the standard model is finally generated from the acoustic analysis data of those sample subjects.

FIG. 7 shows the English vowel pronunciation acoustic analysis charts of male and female test subjects of different nationalities, generated with the English vowel pronunciation acoustic analysis method 600 of FIG. 6. Each chart is the acoustic feature chart obtained from the vowel pronunciation acoustic analysis of one particular speaker, with male speakers on the left and female speakers on the right. The abscissa is the F2 value of the second formant of the vowel and the ordinate is the F1 value of the first formant; F1 and F2 have been converted from hertz to Bark values.

The top row of the figure shows the acoustic features of test subjects whose native language is Chinese: their vowels show no clear distinction between tense and lax vowels, reflecting obvious interference from Chinese vowels and a marked Chinese accent. The middle row shows the acoustic features of test subjects whose native language is Dutch: some of their vowels show a tense/lax distinction, while others show clear negative transfer from Dutch phonetics. The bottom row shows the acoustic features of test subjects who are native speakers of American English: their vowels show a clear tense/lax distinction, reflecting the acoustic features of native English speakers, and can serve as the standard English vowel pronunciation acoustic model.

In one embodiment, FIG. 8 shows the standard English vowel pronunciation model diagram generated, according to an embodiment of the present invention, by using the English vowel pronunciation acoustic analysis method 600 shown in FIG. 6 with Americans selected as the standard-English sample subjects.
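A stored standard model of this kind is the reference against which a test subject's analysis data are compared to obtain the degree of deviation. The source does not define how that deviation is computed. The sketch below assumes, purely for illustration, a Euclidean distance between (F1, F2) points expressed in Bark; the names `degree_of_deviation` and `deviations_by_vowel` are hypothetical.

```python
import math


def degree_of_deviation(measured, standard):
    """Euclidean distance between two (F1, F2) points, both in Bark.

    The distance metric is an assumption; the source only states that the
    analysis data are compared with the standard model.
    """
    d_f1 = measured[0] - standard[0]
    d_f2 = measured[1] - standard[1]
    return math.hypot(d_f1, d_f2)


def deviations_by_vowel(measured_vowels, standard_model):
    """Return {vowel: deviation} for vowels present in both mappings."""
    return {
        vowel: degree_of_deviation(point, standard_model[vowel])
        for vowel, point in measured_vowels.items()
        if vowel in standard_model
    }
```

Working in Bark rather than Hertz keeps the distance closer to perceived vowel differences, which is one plausible reason for the conversion used in the figures.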

It should be understood that the present invention does not limit the execution order of the steps in the English vowel pronunciation error correction method; the order may be adjusted according to actual needs, provided that the technical solution of the present invention can still be realized.

Those skilled in the art will understand that each step of the English vowel pronunciation error correction method of the present invention can be implemented as a system, a method, or a computer program product. Accordingly, aspects of the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment (including firmware, resident software, microcode, etc.), or an embodiment combining hardware and software aspects.

It should be understood that each block of the above flowcharts, and combinations of blocks in the flowcharts, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general-purpose computer, special-purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, when executed by the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in one or more blocks of the flowcharts. Computer program code for carrying out operations for aspects of the present invention may be written in any combination of one or more programming languages, including object-oriented programming languages such as Java and C++, as well as C and similar programming languages.

These computer program instructions may also be stored in a computer-readable medium. They cause a computer, other programmable data processing apparatus, or other device to operate in a particular manner, so that the instructions stored in the computer-readable medium produce an article of manufacture including instructions that implement the functions/acts specified in one or more blocks of the flowcharts and/or block diagrams. The computer-readable storage medium may be a portable computer disk, a hard disk, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the above. A computer-readable storage medium may be any tangible medium that contains or stores a program for use by or in connection with an instruction execution system, apparatus, or device.

The computer program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other device, causing a series of operational steps to be performed on the computer, other programmable apparatus, or other device to produce a computer-implemented process, such that the instructions executed on the computer or other programmable apparatus provide processes for implementing the functions/acts specified in one or more blocks of the flowcharts and/or block diagrams.

It should be understood that the above embodiments are intended only to illustrate the technical solutions of the present invention, not to limit them. Those skilled in the art may modify the technical solutions described in the above embodiments, or make equivalent replacements of some of their technical features; all such modifications and replacements shall fall within the protection scope of the appended claims of the present invention.

Claims

1. A method for correcting English vowel pronunciation errors, characterized by comprising:

Step 1, pre-storing a standard English vowel pronunciation acoustic model;

Step 2, recording the English speech of a subject under test;

Step 3, identifying vowels in the English speech of the subject under test;

Step 4, recording the speech of the subject under test reading the identified vowels aloud;

Step 5, performing an English vowel pronunciation acoustic analysis on the speech of the subject under test reading the identified vowels aloud;

Step 6, comparing the English vowel pronunciation acoustic analysis data of the subject under test with the standard English vowel pronunciation acoustic model to obtain a first degree of deviation;

Step 7, correcting the English vowel pronunciation of the subject under test according to the first degree of deviation.

2. The English vowel pronunciation error correction method as claimed in claim 1, characterized in that said step 1 comprises:

recording the English speech of a plurality of standard-English sample subjects;

identifying vowels in the English speech of the plurality of standard-English sample subjects;

performing the English vowel pronunciation acoustic analysis on the vowels of each sample subject separately;

generating the standard English vowel pronunciation acoustic model according to the results of the English vowel pronunciation acoustic analysis.

3. The English vowel pronunciation error correction method as claimed in claim 1, characterized in that said step 2 comprises: providing speech material according to the nationality of the subject under test, and recording the English speech of the subject under test reading the speech material aloud.

4. The English vowel pronunciation error correction method according to claim 1, characterized in that, said step 3 comprises: identifying the vowel in the English speech of the subject under test according to the resonance peak of the vowel.

5. The English vowel pronunciation error correction method as claimed in claim 1, characterized in that said step 3 comprises: identifying the vowels in the English speech of the subject under test according to the resonance peak of the vowel and the duration of the vowel.

6. The method for correcting errors in English vowel pronunciation according to claim 4 or 5, wherein the resonance peak comprises a first resonance peak and a second resonance peak.

7. The English vowel pronunciation error correction method as claimed in claim 1, characterized in that said step 7 comprises: correcting the English vowel pronunciation of the subject under test with a visual image, according to the English vowel pronunciation acoustic analysis data and the standard English vowel pronunciation acoustic model.

8. The English vowel pronunciation error correction method as claimed in claim 1, characterized in that, after said step 7, the method further comprises:

Recording again the voice of the subject under test reading the recognized vowel sound aloud again;

Carrying out acoustic analysis of English vowel pronunciation to the voice read aloud again by the subject under test;

The English vowel sounding acoustic analysis data of the speech read aloud again by the subject under test is compared with the standard English vowel sounding acoustic model to obtain a second degree of deviation;

Outputting the English vowel pronunciation evaluation text of the measured object according to the first degree of deviation and the second degree of deviation.

9. The English vowel pronunciation error correction method as claimed in claim 1, 2 or 8, characterized in that said English vowel pronunciation acoustic analysis comprises:

Measuring the resonance peak of the recorded English vowel sounds;

Measuring the duration of the recorded English vowel sounds;

Generating the English vowel pronunciation acoustic analysis data of the subject under test according to the measured resonance peak and the measured duration of the recorded English vowel sounds.

10. The method for correcting errors in English vowel pronunciation according to claim 9, wherein the resonance peak comprises a first resonance peak and a second resonance peak.

11. A storage device in which a plurality of instructions are stored, said instructions being adapted to be loaded by a processor and executed as:

Step 1, pre-storing a standard English vowel pronunciation acoustic model;

Step 2, recording the English speech of a subject under test;

Step 3, identifying vowels in the English speech of the tested object;

Step 7, correcting the English vowel pronunciation of the subject under test according to the first degree of deviation.

12. The storage device according to claim 11, wherein the step 1 comprises:

recording the English speech of a plurality of standard-English sample subjects;

13. The storage device according to claim 11, wherein the step 2 comprises: providing voice materials according to the nationality of the subject under test, and recording the English voice of the subject under test reading the voice material aloud.

14. The storage device according to claim 11, wherein the step 3 comprises: identifying the vowel in the English speech of the object under test according to the resonance peak of the vowel.

15. The storage device according to claim 11, wherein the step 3 comprises: recognizing the vowel in the English speech of the object under test according to the resonance peak of the vowel and the duration of the vowel.

16. The storage device according to claim 14 or 15, wherein the resonance peak comprises a first resonance peak and a second resonance peak.

17. The storage device according to claim 11, characterized in that said step 7 comprises: adjusting the English vowel pronunciation of the subject under test with a visual image, according to the English vowel pronunciation acoustic analysis data and the standard English vowel pronunciation acoustic model.

18. The storage device according to claim 11, further comprising:

Comparing the English vowel pronunciation acoustic analysis data of the speech read aloud again by the subject under test with the standard English vowel pronunciation acoustic model to obtain a second degree of deviation;

Outputting an English vowel pronunciation evaluation text of the subject under test according to the first degree of deviation and the second degree of deviation.

19. The storage device according to claim 11, 12 or 18, wherein the acoustic analysis of English vowel sounds comprises:

Measuring the resonance peak of the recorded English vowel sounds;

Measuring the vocalization duration of the recorded vowels;

The recorded English vowel sound acoustic analysis data is generated according to the recorded resonance peak of English vowel sound and the duration.

20. The storage device according to claim 19, wherein the resonance peak comprises a first resonance peak and a second resonance peak.

21. An English vowel pronunciation error correction device, comprising:

a processor adapted to implement the instructions; and

a storage device adapted to store a plurality of instructions adapted to be loaded and executed by the processor as:

Step 1, pre-storing a standard English vowel pronunciation acoustic model;

Step 2, recording the English speech of a subject under test;

Step 3, identifying vowels in the English speech of the tested object;

22. The English vowel pronunciation error correction device as claimed in claim 21, characterized in that said step 1 comprises:

recording the English speech of a plurality of standard-English sample subjects;

23. The English vowel pronunciation error correction device as claimed in claim 21, characterized in that said step 2 comprises: providing speech material according to the nationality of the subject under test, and recording the English speech of the subject under test reading the speech material aloud.

24. The device for correcting errors in English vowel pronunciation according to claim 21, characterized in that, said step 3 comprises: identifying the vowel in the English speech of the subject under test according to the resonance peak value of the vowel.

25. The English vowel pronunciation error correction device as claimed in claim 21, wherein said step 3 comprises: identifying the vowels in the English speech of the subject under test according to the resonance peak of the vowel and the duration of the vowel.

26. The English vowel pronunciation error correction device according to claim 24 or 25, wherein the resonance peak comprises a first resonance peak and a second resonance peak.

27. The English vowel pronunciation error correction device as claimed in claim 21, wherein said step 7 comprises: adjusting the English vowel pronunciation of the subject under test with a visual image, according to the English vowel pronunciation acoustic analysis data and the standard English vowel pronunciation acoustic model.

28. The English vowel pronunciation error correction device as claimed in claim 21, characterized in that, after said step 7, the steps further comprise:

Outputting the speaker's English vowel pronunciation evaluation text according to the first degree of deviation and the second degree of deviation.

29. The English vowel pronunciation error correction device as claimed in claim 21, 22 or 28, wherein said English vowel pronunciation acoustic analysis comprises:

Measuring the resonance peak of the recorded English vowel sounds;

Measuring the vocalization duration of the recorded vowels;

30. The English vowel pronunciation error correction device according to claim 29, wherein the resonance peak comprises a first resonance peak and a second resonance peak.