[go: up one dir, main page]

CN102509483A - Distributive automatic grading system for spoken language test and method thereof - Google Patents

Distributive automatic grading system for spoken language test and method thereof Download PDF

Info

Publication number
CN102509483A
CN102509483A CN2011103361794A CN201110336179A CN102509483A CN 102509483 A CN102509483 A CN 102509483A CN 2011103361794 A CN2011103361794 A CN 2011103361794A CN 201110336179 A CN201110336179 A CN 201110336179A CN 102509483 A CN102509483 A CN 102509483A
Authority
CN
China
Prior art keywords
scoring
task
module
examinee
voice
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2011103361794A
Other languages
Chinese (zh)
Other versions
CN102509483B (en
Inventor
周伟达
俞凯
王欢良
薛峰
朱宏杰
张李
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Chisheng Information Technology Co Ltd
Original Assignee
Suzhou Speech Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou Speech Information Technology Co Ltd filed Critical Suzhou Speech Information Technology Co Ltd
Priority to CN201110336179.4A priority Critical patent/CN102509483B/en
Publication of CN102509483A publication Critical patent/CN102509483A/en
Application granted granted Critical
Publication of CN102509483B publication Critical patent/CN102509483B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Electrically Operated Instructional Devices (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

一种口语考试的分布式自动评分系统,包括一台评分管理装置管理若干考试客户端装置,评分管理装置和至少一台评分任务调度装置进行通信,一台评分任务调度装置管理若干评分计算装置,其中用于自动评分的模块包括:录音模块,用于检测并录制考生语音;语音处理模块,用于对考生语音信号进行处理,提取声学特征,识别语音对应的文本,得到声学单元的时间边界;发音评估模块,用于根据题目信息和语音处理模块输出的结果,对用户发音的不同方面进行分析和评估;语流特征提取模块,用于提取反映考生口语水平的语流综合特征,包括声学、语法和语义的特征;评分子模块:利用语流综合特征,对用户语音进行综合评分;其中除录音模块外,其他评分模块可根据考试客户端装置和后台的评分计算装置的计算负载以及网络负载来动态调度部署。

Figure 201110336179

A distributed automatic scoring system for oral exams, comprising a scoring management device to manage several test client devices, the scoring management device communicates with at least one scoring task scheduling device, and one scoring task scheduling device manages several scoring computing devices, The modules used for automatic scoring include: a recording module, which is used to detect and record the candidate's voice; a voice processing module, which is used to process the candidate's voice signal, extract acoustic features, recognize the text corresponding to the voice, and obtain the time boundary of the acoustic unit; The pronunciation evaluation module is used to analyze and evaluate different aspects of the user's pronunciation according to the topic information and the output results of the speech processing module; the speech flow feature extraction module is used to extract the comprehensive speech flow features that reflect the candidate's oral level, including acoustics, Grammatical and semantic features; Scoring sub-module: use the comprehensive features of language flow to comprehensively score the user's voice; except for the recording module, other scoring modules can be based on the calculation load and network load of the test client device and the scoring calculation device in the background to dynamically schedule deployments.

Figure 201110336179

Description

一种口语考试的分布式自动评分系统及其方法A distributed automatic scoring system and method for speaking test

技术领域 technical field

本专利涉及计算机语音信号处理和分析、语言评估、模式识别和计算机辅助教育领域,特别是计算机辅助的分布式大规模口语考试自动评分方法和系统领域。This patent relates to the fields of computer speech signal processing and analysis, language assessment, pattern recognition and computer-aided education, especially the field of computer-aided distributed large-scale oral examination automatic scoring method and system.

背景技术 Background technique

传统考试可分为两大类,一类是书面考试;一类是口语考试(重点考查学生的口语语言能力)。随着素质教育理念的不断深入,口语能力越来越被重视,相应地,口语教学和考试也在不断普及,比如,全国性考试和省级考试中都开始测试口语能力。此外各种专业等级考试中,口语能力也成为测试重点,比如普通话水平测试,大学英语等级考试、雅思、托福等。Traditional examinations can be divided into two categories, one is written examinations; the other is oral examinations (focusing on the oral language ability of students). With the deepening of the concept of quality education, more and more attention has been paid to oral English ability. Correspondingly, oral English teaching and examinations are also becoming more popular. For example, oral English ability has been tested in both national and provincial examinations. In addition, in various professional level examinations, oral English ability has also become the focus of the test, such as Putonghua proficiency test, college English level test, IELTS, TOEFL, etc.

和书面考试不同,口语考试通常有两种方式:一种是考生和考官进行面对面交流;另一种是录制特定语境中考生的口语语音给考官进行分析。口语考试的评分也有两种方式,一种是考官根据交流体验现场给出评分;另一种是通过对考生语音进行听测分析来评分。可见,无论哪种方式,口语考试都需要耗费大量的人力和物力,尤其是大规模考试中。另一方面,在大规模考试中,通常要求在有限时间内完成所有评分,需要提高评分效率。另外,口语考试属于主观题,其评分工作量巨大,并且主观性较大,一致性较差。因此,在大规模口语考试中,需要一种既可以提供高度一致性评分又具有高效率低成本的考试系统和方法。Different from written exams, there are usually two ways of oral exams: one is face-to-face communication between candidates and examiners; the other is to record the spoken voice of candidates in a specific context for the examiner to analyze. There are also two ways to score the speaking test. One is that the examiner gives the score on the spot based on the communication experience; the other is to score by listening and analyzing the candidates' speech. It can be seen that no matter which method is used, the speaking test needs to consume a lot of manpower and material resources, especially in large-scale tests. On the other hand, in large-scale examinations, it is usually required to complete all scoring within a limited time, and the scoring efficiency needs to be improved. In addition, the speaking test is a subjective question, and its scoring workload is huge, and the subjectivity is relatively high, and the consistency is poor. Therefore, in a large-scale oral test, a test system and method that can provide highly consistent scores and have high efficiency and low cost are needed.

计算机技术已被应用于教育考试领域,可以降低考试成本,提高考试效率。比较典型的应用包括:通用的网络化考试管理系统【CN101964152A,CN1801244A,CN1732490B,CN101827124A】,针对口语的网络化考试系统【CN101419795B,英语口语考试系统的设计与实现(中科大硕士论文)】,网上阅卷评分系统【CN1700200A,CN1598845A,CN1987889A】等。然而,这些工作都没有涉及大规模口语考试的组织管理和自动评分问题。Computer technology has been applied in the field of educational examinations, which can reduce examination costs and improve examination efficiency. Typical applications include: general networked test management system [CN101964152A, CN1801244A, CN1732490B, CN101827124A], networked test system for oral English [CN101419795B, Design and Implementation of English Spoken Test System (Master’s Thesis of University of Science and Technology of China)], online marking Scoring system [CN1700200A, CN1598845A, CN1987889A], etc. However, none of these works dealt with the organizational management and automatic scoring of large-scale speaking tests.

针对当前口语考试专家评分的主观性问题,本发明利用计算机和语音分析技术,对考生语音进行高度一致性的自动评分。针对大规模口语考试评分任务工作量巨大问题,本发明利用计算机网络和计算服务器对大规模考生语音进行分布式的并行评分。Aiming at the subjectivity problem of expert scoring in the current speaking test, the present invention uses computer and voice analysis technology to automatically score candidates' voices with high consistency. Aiming at the huge workload of scoring tasks in large-scale oral exams, the invention utilizes computer networks and computing servers to perform distributed parallel scoring on the voices of large-scale candidates.

发明内容 Contents of the invention

本发明公开一套口语考试的分布式自动评分系统及其评分方法,解决当前口语考试中评分效率低和一致性差的问题,尤其是为大规模口语考试提供一套可靠、高效和低成本的评分解决方案。The invention discloses a set of distributed automatic scoring system and its scoring method for speaking tests, which solves the problems of low scoring efficiency and poor consistency in current speaking tests, and especially provides a set of reliable, high-efficiency and low-cost scoring for large-scale speaking tests solution.

本发明提供一种口语考试的分布式自动评分系统,包括一台评分管理装置管理若干考试客户端装置,评分管理装置和至少一台评分任务调度装置进行通信,一台评分任务调度装置管理若干评分计算装置,其中用于自动评分的模块包括:The present invention provides a distributed automatic scoring system for oral exams, which includes a scoring management device to manage several test client devices, the scoring management device communicates with at least one scoring task scheduling device, and one scoring task scheduling device manages several scoring Computing means, where modules for automatic scoring include:

录音模块,用于检测并录制考生语音;The recording module is used to detect and record the examinee's voice;

语音处理模块,用于对考生语音信号进行处理,提取声学特征,识别语音对应的文本,得到声学单元的时间边界;The speech processing module is used to process the examinee's speech signal, extract the acoustic feature, recognize the text corresponding to the speech, and obtain the time boundary of the acoustic unit;

发音评估模块,用于根据题目信息和语音处理模块输出的结果,对用户发音的不同方面进行分析和评估;The pronunciation evaluation module is used to analyze and evaluate different aspects of the user's pronunciation according to the topic information and the output results of the speech processing module;

语流特征提取模块,用于提取反映考生口语水平的语流综合特征,包括声学、语法和语义的特征;The speech flow feature extraction module is used to extract the speech flow comprehensive features reflecting the examinee's oral level, including acoustic, grammatical and semantic features;

综合评分模块:利用语流综合特征,对用户语音进行综合评分;其中Comprehensive scoring module: use the comprehensive features of speech flow to comprehensively score the user's voice;

除录音模块外,其他评分模块可根据考试客户端装置和后台的评分计算装置的计算负载以及网络负载来动态调度部署。In addition to the recording module, other scoring modules can be dynamically scheduled and deployed according to the computing load and network load of the test client device and the background scoring calculation device.

还提供了一种口语考试的分布式自动评分方法,其中评分的具体步骤包括:Also provided is a distributed automatic scoring method for oral exams, wherein the specific steps of scoring include:

步骤1、录音设备检测考生语音并录制;Step 1. The recording device detects the examinee's voice and records it;

步骤2、提取语音特征,利用声学模型和语言模型,对语音进行识别,得到对应文本及其音素单元时间边界和似然度得分;Step 2, extracting speech features, using the acoustic model and language model to recognize the speech, and obtaining the corresponding text and its phoneme unit time boundary and likelihood score;

步骤3、利用音素单元时间边界和似然度得分以及题目相关知识,对语音进行分析评估,得到不同语音单元的发音准确性、韵律准确性、发音完整度和发音流利度等;Step 3. Using the phoneme unit time boundary and likelihood score and topic-related knowledge, analyze and evaluate the speech, and obtain the pronunciation accuracy, prosodic accuracy, pronunciation completeness and pronunciation fluency of different phonetic units;

步骤4、综合语音处理结果、评估结果和题目信息,提取反映考生口语水平的声学、语法和语义方面的语流综合特征;Step 4, integrating speech processing results, evaluation results and topic information, and extracting comprehensive characteristics of speech flow in terms of acoustics, grammar and semantics that reflect the examinee's oral level;

步骤5、基于预先训练的模型和设计规则,使用语流综合特征对考生口语水平进行综合评分。Step 5. Based on the pre-trained model and design rules, use the comprehensive features of speech flow to comprehensively score the oral proficiency of candidates.

进一步提供了一种口语考试的分布式自动评分方法,具体包括如下步骤:A distributed automatic scoring method for an oral test is further provided, which specifically includes the following steps:

步骤1、考生获得考试题目并按要求回答,考试客户端装置录制考生语音,根据题目信息和要求对考生语音进行第一阶段评分计算,依据数据交换标准把考生语音和评分结果发送到评分管理装置;Step 1. Candidates obtain the test questions and answer according to the requirements. The test client device records the candidate's voice, performs the first-stage scoring calculation on the testee's voice according to the question information and requirements, and sends the testee's voice and scoring results to the scoring management device according to the data exchange standard ;

步骤2、评分管理装置接收考试客户端设备发送的考生答案和第一阶段评分结果,解析评分结果,如果评分结果不是最终评分,则依据数据交换标准把考生语音和评分中间结果及题目信息组织成评分任务,加密后发送到评分任务调度装置进行进一步评分;Step 2. The scoring management device receives the examinee's answer sent by the test client device and the scoring result of the first stage, analyzes the scoring result, and if the scoring result is not the final scoring, organizes the candidate's voice, the intermediate scoring result and the topic information according to the data exchange standard The scoring task is encrypted and sent to the scoring task scheduling device for further scoring;

步骤3、评分任务调度装置接收到评分管理装置发送的评分任务,解析评分任务得到评分数据和任务优先级,根据评分模块域的设置选择所需评分模块和算法,生成评分计算程序,按照任务优先级提交到后台的评分计算装置进行计算;Step 3. The scoring task scheduling device receives the scoring task sent by the scoring management device, analyzes the scoring task to obtain the scoring data and task priority, selects the required scoring module and algorithm according to the setting of the scoring module field, generates a scoring calculation program, and prioritizes according to the task Submit to the scoring calculation device in the background for calculation;

步骤4、评分计算装置接收任务调度装置发送的评分程序,执行第二阶段评分,把最终评分结果发送到评分任务调度装置,然后评分任务调度装置再把评分结果转发到评分管理装置。Step 4. The scoring calculation device receives the scoring program sent by the task scheduling device, executes the second-stage scoring, and sends the final scoring result to the scoring task scheduling device, and then the scoring task scheduling device forwards the scoring result to the scoring management device.

附图说明: Description of drawings:

图1口语考试的分布式自动评分系统的组织结构示意图。Figure 1 Schematic diagram of the organizational structure of the distributed automatic scoring system for the oral test.

图2口语考试的分布式自动评分系统和方法的评分流程示意图。Fig. 2 is a schematic diagram of the scoring process of the distributed automatic scoring system and method for the speaking test.

图3口语考试的分布式自动评分系统的工作流程示意图。Figure 3 Schematic diagram of the workflow of the distributed automatic scoring system for the oral test.

图4一种典型的口语考试的分布式自动评分模块的部署示意图。Figure 4 is a schematic diagram of the deployment of a typical distributed automatic scoring module for an oral test.

图5接入语音云端设备的口语考试的分布式自动评分系统的组织结构示意图。Figure 5 is a schematic diagram of the organizational structure of the distributed automatic scoring system for the oral test connected to the voice cloud device.

图6精简的口语考试的分布式自动评分系统环境的结构示意图。Fig. 6 is a schematic structural diagram of a distributed automatic scoring system environment for a simplified speaking test.

具体实施方式 Detailed ways

下面结合附图和具体实施例对本发明做进一步的详细描述:Below in conjunction with accompanying drawing and specific embodiment the present invention is described in further detail:

图1,给出了本发明所公开系统的组织结构示意图。一种口语考试的分布式自动评分系统由四大装置组成:考试客户端装置101,评分管理装置102,评分任务调度装置103和评分计算装置104。Fig. 1 shows a schematic diagram of the organizational structure of the system disclosed in the present invention. A distributed automatic scoring system for oral exams consists of four devices: a test client device 101 , a scoring management device 102 , a scoring task scheduling device 103 and a scoring calculation device 104 .

考试客户端设备101是考生提供考试服务的终端设备,通常为计算机或者嵌入式移动设备等。至少具有如下硬件配置:音频播放单元,音频录制单元,计算单元,网络通信单元。The test client device 101 is a terminal device for test takers to provide test services, usually a computer or an embedded mobile device. It has at least the following hardware configuration: audio playback unit, audio recording unit, computing unit, and network communication unit.

可选地,支持图形用户界面和触控操作。Optionally, GUI and touch operation are supported.

其通过图形界面向考生显示考题内容,通过音频播放单元(如:头戴耳机或扬声器)播放考题音频,通过音频录制单元(如:麦克风)录制考生语音,通过本地计算单元对答案(即考生语音)进行预处理和一定的评分计算(简称为第一阶段评分),通过网络或者其他方式发送考生答案和评分结果(通常是中间评分结果)到评分管理装置。并且可以暂存考生语音,进行离线异步传输。It displays the content of the test questions to candidates through a graphical interface, plays the audio of the test questions through an audio playback unit (such as: headphones or speakers), records the voice of candidates through an audio recording unit (such as: a microphone), and uses the local computing unit to analyze the answers (that is, the voice of candidates) ) to perform preprocessing and certain scoring calculations (referred to as the first stage of scoring), and send candidates’ answers and scoring results (usually intermediate scoring results) to the scoring management device through the network or other means. And it can temporarily store the candidate's voice for offline asynchronous transmission.

还可以在考试客户端设备上配置一个专用考试客户端程序或者一个具有考试插件的浏览器以完成上述录音和评分工作。It is also possible to configure a dedicated test client program or a browser with a test plug-in on the test client device to complete the above-mentioned recording and scoring work.

评分管理设备102是获取考生答案和第一阶段评分结果、组织并提交评分任务的计算机。Scoring management device 102 is a computer for acquiring candidates' answers and first-stage scoring results, organizing and submitting scoring tasks.

其获取来自考试客户端设备的考生答案和第一阶段评分结果,利用该评分结果以及题目信息构造评分任务,发送到评分任务调度装置,并接收来自评分任务调度装置的最终评分结果。It obtains the examinee's answer and the scoring result of the first stage from the test client device, uses the scoring result and topic information to construct a scoring task, sends it to the scoring task scheduling device, and receives the final scoring result from the scoring task scheduling device.

具有可靠的大容量存储设备,一般可配置高数据传输宽带。保存考生答案和最终评分结果,并暂存来至第一阶段评分的中间评分结果。It has a reliable large-capacity storage device and can generally be configured with high data transmission bandwidth. Save candidates' answers and final scoring results, and temporarily save intermediate scoring results from the first stage of scoring.

评分任务调度设备103是用于接收评分任务、解析并构造评分计算任务、提交计算任务和管理评分计算装置的计算机。The scoring task scheduling device 103 is a computer for receiving scoring tasks, analyzing and constructing scoring calculation tasks, submitting calculation tasks and managing scoring calculation devices.

其接收评分管理设备发送的评分任务,解析任务,抽取评分数据,根据题目信息和评分所需模块等信息自动组织评分计算程序,生成评分计算任务,通过调度算法提交计算任务到评分计算装置,最后从计算设备获取最终评分结果。It receives the scoring task sent by the scoring management device, analyzes the task, extracts the scoring data, automatically organizes the scoring calculation program according to the topic information and the modules required for scoring, generates the scoring calculation task, submits the calculation task to the scoring calculation device through the scheduling algorithm, and finally Obtain final scoring results from the computing device.

特别地,所述评分任务调度装置具有可靠的大容量存储设备,且一般可具有高数据传输宽带。In particular, the scoring task scheduling device has a reliable large-capacity storage device and generally can have a high data transmission bandwidth.

特别地,所述评分任务调度装置保留一段内存空间,维护一个优先级排序的计算任务队列。In particular, the scoring task scheduling device reserves a section of memory space and maintains a queue of computing tasks sorted by priority.

特别地,所述评分任务调度装置的任务调度算法为:若任务队列不满,则把任务根据其优先级插入队列相应位置;否则如果任务优先级高于队列中的某些任务优先级,则把优先级最低的任务移出到缓存区或者外部存储器,把该任务插入队列相应位置;否则把该任务存储到缓存区或者外部存储器;定时查询任务队列,若有空间,则把缓存区任务按其优先级加入队列;定时轮询每个评分计算设备,若有CPU空闲,则从任务队列中取任务到该CPU进行运算。In particular, the task scheduling algorithm of the scoring task scheduling device is: if the task queue is not full, then insert the task into the corresponding position of the queue according to its priority; otherwise, if the task priority is higher than some task priority in the queue, then insert the task The task with the lowest priority is moved out to the cache area or external memory, and the task is inserted into the corresponding position of the queue; otherwise, the task is stored in the cache area or external memory; the task queue is queried regularly, and if there is space, the task in the cache area is prioritized Level joins the queue; regularly polls each scoring calculation device, if there is a CPU idle, then fetches tasks from the task queue to the CPU for calculation.

评分计算设备104是由若干高性能CPU组成的计算集群,所有CPU都可以并行工作。其执行评分任务调度装置提交的评分计算程序,经过计算得到评分结果并输出给评分任务调度装置。The score calculation device 104 is a calculation cluster composed of several high-performance CPUs, all of which can work in parallel. It executes the scoring calculation program submitted by the scoring task scheduling device, obtains the scoring result through calculation, and outputs it to the scoring task scheduling device.

口语考试的分布式自动评分系统其特点在于所述评分任务调度装置可以把多个评分任务组织在一起构成一个可批处理的计算任务,提交给后台的评分计算设备。The distributed automatic scoring system for the speaking test is characterized in that the scoring task scheduling device can organize multiple scoring tasks together to form a batch-processing computing task, which is submitted to the background scoring computing device.

口语考试的分布式自动评分系统其特点在于所述考试客户端装置可以部署多个评分模块。在保证考试录音过程不受影响的情况下利用本地CPU完成第一阶段的评分工作。所述第一阶段评分工作可与考试同时进行,为避免对考试造成任何影响,也可在考试之外的空闲时间定时运行。The distributed automatic grading system for the speaking test is characterized in that the test client device can deploy multiple grading modules. In the case of ensuring that the test recording process is not affected, the first stage of scoring is completed by using the local CPU. The first stage of grading work can be carried out simultaneously with the examination, and in order to avoid any impact on the examination, it can also be run regularly during the free time outside the examination.

口语考试的分布式自动评分系统其特点在于所述评分任务调度装置和评分管理装置可以通过广域网进行连接,也可以通过局域网连接。The distributed automatic scoring system for the speaking test is characterized in that the scoring task scheduling device and the scoring management device can be connected through a wide area network or a local area network.

口语考试的分布式自动评分系统其特点在于所述后台的评分计算装置可以是普通计算机,也可以高性能计算服务器。这些评分计算装置通过局域网络连接,构成一个计算集群。The distributed automatic scoring system for the speaking test is characterized in that the scoring calculation device in the background can be an ordinary computer or a high-performance computing server. These scoring calculation devices are connected through a local area network to form a computing cluster.

所述的口语考试的分布式自动评分系统的结构是:一台评分管理装置管理若干考试客户端装置,评分管理装置和至少一台所述评分任务调度装置进行通信,一台评分任务调度装置管理若干后台的评分计算装置。The structure of the distributed automatic scoring system of the oral test is: a scoring management device manages several test client devices, the scoring management device communicates with at least one scoring task scheduling device, and a scoring task scheduling device manages A number of score calculation devices in the background.

所述的口语考试的分布式自动评分系统的网络结构是:评分管理装置和考试客户端装置之间通过局域网或者专用网络连接;评分任务调度装置和后台的评分计算装置之间通过局域网连接;评分管理装置和评分任务调度装置之间的数据通信既通过任意加密网络也可通过其他保密的数据传输方式进行。The network structure of the distributed automatic scoring system of the oral exam is: between the scoring management device and the test client device, connect through a local area network or a dedicated network; connect between the scoring task scheduling device and the scoring calculation device at the background; The data communication between the management device and the scoring task scheduling device can be carried out not only through any encrypted network but also through other confidential data transmission methods.

特定地,评分管理装置和考试客户端装置之间以及评分管理装置和评分任务调度装置之间也可以通过移动存储设备来进行数据传输。Specifically, data transmission between the scoring management device and the test client device and between the scoring management device and the scoring task scheduling device can also be performed through a mobile storage device.

图2给出了口语考试的分布式自动评分系统和方法的评分模块和流程。口语考试的分布式自动评分系统和方法的核心模块包括:录音模块201,语音处理模块202,发音评估模块203,语流特征提取模块204和综合评分模块205。Fig. 2 shows the scoring module and process of the distributed automatic scoring system and method for the speaking test. The core modules of the distributed automatic scoring system and method for speaking test include: recording module 201 , speech processing module 202 , pronunciation evaluation module 203 , speech flow feature extraction module 204 and comprehensive scoring module 205 .

录音模块201:检测并录制考生语音。Recording module 201: detect and record the candidate's voice.

所述录音模块利用语音端点检测方法来自动检测考生语音,或者通过接收考生手工操作命令来启动录音。The recording module automatically detects the examinee's voice by using the voice endpoint detection method, or starts recording by receiving the examinee's manual operation command.

语音处理模块202:对考生语音信号进行处理,提取声学特征,识别语音对应的文本,得到声学单元的时间边界。Speech processing module 202: process the examinee's speech signal, extract the acoustic feature, recognize the text corresponding to the speech, and obtain the time boundary of the acoustic unit.

所述语音处理模块其特点在于支持实时在线处理和离线批处理两种方式。所述语音处理模块中的语音识别过程既可以采用本地服务来完成,也可以通过网络访问语音云端服务来完成。The voice processing module is characterized in that it supports real-time online processing and offline batch processing. The voice recognition process in the voice processing module can be completed by using local services, or by accessing voice cloud services through the network.

发音评估模块203:根据题目信息和语音处理模块202输出的结果,对用户发音的不同方面进行分析和评估。Pronunciation evaluation module 203: according to the topic information and the output result of the speech processing module 202, analyze and evaluate different aspects of the user's pronunciation.

所述发音评估模块其特点在于发音评估至少包含四个方面:不同单元的发音准确性、韵律准确性(如重音、声调、语调等)、发音完整度和发音流利度。The feature of the pronunciation evaluation module is that the pronunciation evaluation includes at least four aspects: pronunciation accuracy of different units, prosody accuracy (such as stress, tone, intonation, etc.), pronunciation completeness and pronunciation fluency.

语流特征提取模块204:提取反映考生口语水平的语流综合特征,包括声学、语法和语义的特征。Speech flow feature extraction module 204: extract the comprehensive features of speech flow that reflect the examinee's spoken language level, including acoustic, grammatical and semantic features.

所述语流特征提取模块其特征在于所提取的特征一部分来自于发音评估模块203,一部分来自题目信息,还有一部分来自语音处理模块202。The speech flow feature extraction module is characterized in that part of the extracted features come from the pronunciation evaluation module 203 , part of it comes from topic information, and part of it comes from the speech processing module 202 .

所述考试题目信息至少包括题目类型、题目要求、题目内容、考察重点以及标准答案文本等。The information about the test questions at least includes question types, question requirements, question content, examination focus, standard answer texts, and the like.

评分子模块205:利用语流综合特征,对用户语音进行综合评分。Scoring sub-module 205: using the comprehensive features of the speech flow to perform comprehensive scoring on the user's voice.

所述评分子模块其特征在于语流综合特征来自于语流特征提取模块204,综合评分算法是统计和规则相结合的。The scoring sub-module is characterized in that the speech flow comprehensive feature comes from the speech flow feature extraction module 204, and the comprehensive scoring algorithm is a combination of statistics and rules.

口语考试的分布式自动评分方法的评分方法包括如下步骤:The scoring method of the distributed automatic scoring method of the speaking test comprises the following steps:

步骤一,通过录音模块检测考生语音并录制;Step 1: Detect and record the examinee’s voice through the recording module;

步骤二,提取语音特征,利用声学模型和语言模型,对语音进行识别,得到对应文本及其声学单元时间边界和似然度得分;Step 2, extracting speech features, using the acoustic model and language model to recognize the speech, and obtaining the corresponding text and its acoustic unit time boundary and likelihood score;

步骤三,利用声学单元时间边界和似然度得分以及题目相关知识,对语音进行分析评估,得到不同语音单元的发音准确性、韵律准确性、发音完整度和发音流利度等的评分;Step 3: Using the acoustic unit time boundary, likelihood score and topic-related knowledge to analyze and evaluate the speech, and obtain the scores of pronunciation accuracy, prosodic accuracy, pronunciation completeness and pronunciation fluency of different speech units;

步骤四,从语音处理模块和发音评估模块的输出结果以及考试题目信息中提取反映考生口语水平的声学、语法和语义方面的语流综合特征;Step 4, extracting from the output results of the speech processing module and the pronunciation evaluation module and the information of the test questions, the integrated features of the acoustic, grammatical and semantic aspects of the oral language level of the candidates are extracted;

步骤五,基于预先训练的模型和设计的规则,使用语流综合特征对考生口语水平进行综合评分。Step five, based on the pre-trained model and designed rules, use the comprehensive features of speech flow to comprehensively score the oral level of candidates.

所述口语考试的分布式自动评分方法其特征在于各评分模块之间是前后依赖的,每个评分模块的输入信息来自于前一模块的计算输出,其中语音处理模块202的计算复杂性最高。The distributed automatic scoring method of the speaking test is characterized in that the scoring modules are sequentially dependent, the input information of each scoring module comes from the calculation output of the previous module, and the calculation complexity of the speech processing module 202 is the highest.

图3给出了口语考试的分布式自动评分系统的工作流程:Figure 3 shows the workflow of the distributed automatic scoring system for oral exams:

第一步,考生获得考试题目并按要求回答,考试客户端设备录制考生语音,根据题目信息和要求对考生语音进行第一阶段评分计算,依据数据交换标准把考生语音和评分结果发送到评分管理装置;In the first step, candidates obtain the test questions and answer them as required. The test client device records the candidate’s voice, performs the first-stage scoring calculation on the candidate’s voice according to the question information and requirements, and sends the testee’s voice and scoring results to the scoring management system according to the data exchange standard. device;

第二步,评分管理装置接收考试客户端装置发送的考生答案和评分结果,解析评分结果,如果评分结果不是最终评分,则依据数据交换标准把考生语音和评分中间结果组织成评分任务,加密后发送到评分任务调度装置进行进一步评分;In the second step, the scoring management device receives the examinee's answers and scoring results sent by the test client device, and analyzes the scoring results. If the scoring result is not the final scoring, it organizes the candidate's voice and scoring intermediate results into a scoring task according to the data exchange standard. After encryption Sent to scoring task scheduling device for further scoring;

第三步,评分任务调度装置接收到评分管理装置发送的评分任务,解析评分任务得到评分数据和任务优先级,根据评分模块域的设置选择所需评分模块和算法,生成评分计算程序,按照任务优先级提交到后台的评分计算装置进行计算;In the third step, the scoring task scheduling device receives the scoring task sent by the scoring management device, analyzes the scoring task to obtain the scoring data and task priority, selects the required scoring module and algorithm according to the setting of the scoring module field, generates a scoring calculation program, and calculates the score according to the task. The priority is submitted to the scoring calculation device in the background for calculation;

第四步,评分计算装置接收评分任务调度装置发送的评分程序,执行第二阶段评分,把最终评分结果发送到评分任务调度装置,然后评分任务调度装置再把评分结果转发到评分管理装置。In the fourth step, the scoring calculation device receives the scoring program sent by the scoring task scheduling device, executes the second stage of scoring, and sends the final scoring result to the scoring task scheduling device, and then the scoring task scheduling device forwards the scoring result to the scoring management device.

所公开的口语考试的分布式自动评分系统其关键特点在于自动评分过程可分为两个阶段:位于考试客户端装置上的第一阶段评分和位于后台的评分计算装置上的第二阶段评分。The key feature of the disclosed distributed automatic scoring system for oral exams is that the automatic scoring process can be divided into two stages: the first stage of scoring on the test client device and the second stage of scoring on the background scoring calculation device.

所公开的口语考试的分布式自动评分系统其关键特征在于所述的自动评分模块可分别归属于第一评分阶段和第二评分阶段。The key feature of the disclosed distributed automatic grading system for oral exams is that the automatic grading modules can be assigned to the first grading stage and the second grading stage respectively.

所公开的口语考试的分布式自动评分系统其关键特征在于所述第一阶段评分部署在考试客户端装置上,第二阶段评分部署在评分计算装置上。The key feature of the disclosed distributed automatic scoring system for oral exams is that the first-stage scoring is deployed on the test client device, and the second-stage scoring is deployed on the scoring calculation device.

所公开的口语考试的分布式自动评分系统其关键特征在于部署在评分计算装置上的第二阶段评分支持大规模评分任务的批量计算,多个评分计算任务独立并行执行。The key feature of the disclosed distributed automatic scoring system for oral exams is that the second-stage scoring deployed on the scoring calculation device supports batch calculation of large-scale scoring tasks, and multiple scoring calculation tasks are executed independently and in parallel.

所公开的口语考试的分布式自动评分系统其关键特征还在于第二阶段评分计算任务可同时包含多个考生的多道题目的评分。同一评分计算任务中不同考生和考题的相同评分模块可共享相同的评分资源,减少资源重复加载的时间和空间浪费。The key feature of the disclosed distributed automatic scoring system for oral exams is that the scoring calculation task in the second stage can simultaneously include the scoring of multiple questions of multiple examinees. The same scoring modules for different candidates and test questions in the same scoring calculation task can share the same scoring resources, reducing the waste of time and space for repeated resource loading.

所公开的口语考试的分布式自动评分系统其特征还在于既支持大规模离线批量评分,也支持小规模在线评分。The disclosed distributed automatic scoring system for the speaking test is also characterized in that it supports both large-scale offline batch scoring and small-scale online scoring.

所述口语考试的分布式自动评分系统其特征还在于支持小规模在线评分,即每次评分任务只处理一个考生的一道考题答案,在考试完成之后直接反馈评分。The distributed automatic grading system for the speaking test is also characterized in that it supports small-scale online grading, that is, each grading task only processes the answer to one test question of one examinee, and directly feedbacks the grading after the test is completed.

所公开的口语考试的分布式自动评分系统其关键特征在于除录音模块外地其他评分模块可根据考试客户端装置和后台的评分计算装置的计算负载以及网络负载来动态调度部署。The key feature of the disclosed distributed automatic scoring system for oral exams is that other scoring modules except the recording module can be dynamically scheduled and deployed according to the computing load and network load of the test client device and the scoring calculation device in the background.

所公开的口语考试的分布式自动评分系统其关键特征在于部署在考试客户端装置上的评分模块中,录音模块具有最高优先级,保证其他模块的执行不会影响录音功能。The key feature of the disclosed distributed automatic scoring system for oral exams is that among the scoring modules deployed on the test client device, the recording module has the highest priority, ensuring that the execution of other modules will not affect the recording function.

根据评分模块部署的位置,口语考试的分布式自动评分系统具有如下四种评分模块部署方案:According to the location where the scoring module is deployed, the distributed automatic scoring system for the speaking test has the following four scoring module deployment schemes:

表1口语考试的分布式自动评分系统的四种评分模块部署方案Table 1 Four scoring module deployment schemes for the distributed automatic scoring system of the speaking test

Figure BSA00000601809500071
Figure BSA00000601809500071

特定地,在方案4中,评分任务不再需要送入后台的评分计算装置进行计算。Specifically, in solution 4, the scoring task no longer needs to be sent to the background scoring calculation device for calculation.

图4给出了方案1的评分模块部署示意图。整个口语考试的分布式自动评分系统可以分为两个大模块:前端模块和后端模块。Figure 4 shows a schematic diagram of the scoring module deployment of scheme 1. The distributed automatic scoring system of the entire speaking test can be divided into two major modules: the front-end module and the back-end module.

所述前端模块包括考试客户端装置和评分管理装置。其中考试客户端装置上部署有录音模块和语音处理模块。The front-end module includes an examination client device and a scoring management device. A recording module and a voice processing module are deployed on the test client device.

所述后端模块包括评分任务调度装置和后台的评分计算装置。其中后台的评分计算装置上部署有发音评估模块,语流特征提取模块和综合评分模块。The back-end module includes scoring task scheduling means and background scoring calculation means. The score calculation device in the background is equipped with a pronunciation evaluation module, a speech flow feature extraction module and a comprehensive scoring module.

典型地,方案1把录音模块和语音处理模块部署前段模块上,其它模块部署在后端模块上。Typically, in Solution 1, the recording module and voice processing module are deployed on the front-end module, and other modules are deployed on the back-end module.

部署方案1的好处是充分利用考试客户端装置的计算能力来完成语音处理功能,可以减轻后台的评分计算装置的压力,同时又不至于造成考试客户端装置的负载太重而影响录音工作。The advantage of deployment scheme 1 is that the computing power of the test client device can be fully used to complete the voice processing function, which can reduce the pressure on the score calculation device in the background, and at the same time, it will not cause too heavy load on the test client device and affect the recording work.

所述口语考试的分布式自动评分系统其特征还在于考试客户端装置可通过网络和语音云端系统相连接,可充分利用云端计算资源。The distributed automatic scoring system of the speaking test is also characterized in that the test client device can be connected to the voice cloud system through the network, and the cloud computing resources can be fully utilized.

所述口语考试的分布式自动评分系统的部署方案1其特征还在于,语音处理模块的语音识别计算可以提交到语音云端完成,从而减轻考试客户端的计算负载。The deployment plan 1 of the distributed automatic grading system for the speaking test is also characterized in that the speech recognition calculation of the speech processing module can be submitted to the speech cloud for completion, thereby reducing the computing load of the test client.

图5是接入语音云系统的口语考试的分布式自动评分系统的组织结构示意图。考试客户端装置101通过网络接入语音云端系统105。Fig. 5 is a schematic diagram of the organizational structure of the distributed automatic scoring system of the speaking test connected to the voice cloud system. The test client device 101 accesses the voice cloud system 105 through the network.

语音云端系统105上部署有强大的计算资源和精准的语音模型资源。语音处理模块中的语音识别过程是整个评分过程最消耗计算资源的,通过利用语音云端设备的计算能力和模型资源,不仅可减轻考试客户端的计算负载,而且还可得到更准确的识别结果。Powerful computing resources and accurate speech model resources are deployed on the speech cloud system 105 . The speech recognition process in the speech processing module consumes the most computing resources in the entire scoring process. By using the computing power and model resources of the speech cloud device, not only can the computing load of the test client be reduced, but also more accurate recognition results can be obtained.

在评分模块部署方案1的情况下,考试客户端装置101首先录制考生语音;然后进行语音处理,把考生语音或者考生语音的声学特征发送到语音云端系统上进行语音识别,并获得语音识别结果,在完成其它语音处理工作后,把题目信息、考生语音以及语音处理的结果一起发送到评分管理装置102,由评分管理装置102进行后续的评分处理工作。In the case of scoring module deployment scheme 1, the test client device 101 first records the examinee's voice; then performs voice processing, sends the examinee's voice or the acoustic features of the examinee's voice to the voice cloud system for voice recognition, and obtains the voice recognition result, After completing other speech processing tasks, the topic information, examinee's speech and speech processing results are sent to the scoring management device 102, and the scoring management device 102 performs subsequent scoring processing.

所述口语考试的分布式自动评分系统其特征在于如果后台的评分设备和评分管理装置在同一个局域网中,或者数据传输通过移动存储设备,则所述口语考试的分布式自动评分系统中的评分管理装置和评分任务调度装置可以合并。该系统是一个精简的口语考试的分布式自动评分系统。The distributed automatic scoring system of the oral exam is characterized in that if the scoring equipment and the scoring management device in the background are in the same local area network, or the data transmission passes through the mobile storage device, the scoring in the distributed automatic scoring system of the oral exam The management means and scoring task scheduling means can be combined. The system is a distributed automatic scoring system for a streamlined oral exam.

图6给出了一种精简的口语考试的分布式自动评分系统环境的结构示意图。Fig. 6 shows a structural schematic diagram of a distributed automatic scoring system environment for a simplified speaking test.

所述的精简的口语考试的分布式自动评分系统包括考试客户端装置301,任务管理设备302和评分计算装置303。The distributed automatic grading system for the simplified speaking test includes a test client device 301 , a task management device 302 and a score calculation device 303 .

考试客户端装置301和图1中101模块的功能和配置相同。Examination client device 301 has the same function and configuration as module 101 in FIG. 1 .

评分计算装置303和图1中104模块的功能和配置相同。The score calculating means 303 has the same function and configuration as the module 104 in FIG. 1 .

任务管理设备302:接收考生答案和评分结果,解析评分结果,利用中间评分数据和题目信息构造评分计算程序,提交评分计算任务,接收最终评分结果。Task management device 302: Receive candidates' answers and scoring results, analyze the scoring results, use the intermediate scoring data and topic information to construct a scoring calculation program, submit scoring calculation tasks, and receive the final scoring results.

特定地,可以利用移动存储设备把每个考试客户端装置上的考生答案和评分结果移动到评分管理装置。评分管理装置上的评分管理程序解析评分结果,抽取中间评分数据、考生语音和题目信息,和所需评分模块的评分程序进行连接,汇总多个评分任务生成批量评分计算任务,然后提交给后台的评分计算设备进行计算。Specifically, the examinee's answers and scoring results on each test client device can be moved to the scoring management device by using a mobile storage device. The scoring management program on the scoring management device analyzes the scoring results, extracts the intermediate scoring data, examinee voice and question information, and connects with the scoring program of the required scoring module, aggregates multiple scoring tasks to generate batch scoring calculation tasks, and then submits them to the background The score calculation device performs the calculation.

所述口语考试的分布式自动评分系统其特点还在于具有一套计算容错机制,以保证评分计算的稳定性。具体措施包括:The distributed automatic scoring system of the speaking test is also characterized in that it has a set of calculation fault tolerance mechanism to ensure the stability of scoring calculation. Specific measures include:

1)配置备用计算设备:为后台的评分计算设备配置冗余的CPU以应对突发硬件故障。若有评分计算设备出现故障,评分任务调度设备自动启用备用的评分计算设备,并把该计算机上的评分任务重新提交,并发出警报信息。备用的评分计算设备一直处于等待状态,评分任务调度设备可以随时启用该设备。通常设置评分计算设备可用CPU的十分之一作为备用计算设备。1) Configure backup computing equipment: configure redundant CPUs for the scoring computing equipment in the background to deal with unexpected hardware failures. If any scoring calculation equipment breaks down, the scoring task scheduling device automatically activates the standby scoring calculation equipment, resubmits the scoring task on the computer, and sends out an alarm message. The standby scoring calculation device is always in a waiting state, and the scoring task scheduling device can enable the device at any time. Usually one-tenth of the available CPU of the scoring computing device is set as a backup computing device.

2)记录计算流程的日志:为每道题的评分任务都保持一个日志文件,从录音开始一直到获得最后评分,记录每个评分子模块中每个步骤的执行情况,至少包括执行的指令,数据目录,执行时间,执行状态等信息。2) Record the log of the calculation process: keep a log file for the scoring task of each question, from the beginning of the recording to the final scoring, record the execution of each step in each scoring sub-module, at least including the executed instructions, Data directory, execution time, execution status and other information.

3)检测录音异常:在考试正式开始之前,录音模块要求考生试录音。通过分析从考生录音设备获取的音频信号,检测录音设备的异常,比如:没有语音,音量太大或者音量太小等异常情况都要提示考生。3) Detect recording abnormalities: Before the exam officially starts, the recording module requires candidates to try recording. By analyzing the audio signal obtained from the candidate's recording device, detect the abnormality of the recording device, such as: no voice, too much volume or too low volume and other abnormal conditions will prompt candidates.

4)检测评分计算任务异常:后端模块的评分任务调度装置监控每个评分任务的计算过程,若发现有任务计算失败,如果是计算硬件问题导致的,则启用备用计算资源,并重新提交计算任务;否则记录错误信息,发出警告信息。4) Detect abnormal scoring calculation tasks: the scoring task scheduling device of the back-end module monitors the calculation process of each scoring task, if any task calculation failure is found, if it is caused by a computing hardware problem, then enable the backup computing resource and resubmit the calculation task; otherwise, log an error message and issue a warning message.

5)检测数据传输异常:考试客户端装置和评分管理装置监控数据传输过程,若有数据传输失败,间隔若干时间重新传输,并记录错误信息,发出警告信息。5) Abnormal detection of data transmission: the test client device and scoring management device monitor the data transmission process, if any data transmission fails, retransmit at intervals, record error messages, and issue warning messages.

6)数据备份:在考试客户端装置上临时保存考生语音和第一阶段评分的结果,直到该数据成功传输。在前端评分管理装置上保存考生语音、评分任务和最终评分结果。在后端的评分任务调度装置上临时保存评分任务和最终评分结果。6) Data backup: Temporarily save the examinee's voice and the results of the first stage of scoring on the test client device until the data is successfully transmitted. The examinee's speech, scoring tasks and final scoring results are saved on the front-end scoring management device. The scoring task and the final scoring result are temporarily saved on the back-end scoring task scheduling device.

当然,本发明还可有其他多种实施例,在不背离本发明精神及其实质的情况下,熟悉本领域的技术人员当可根据本发明作出各种相应的改变和变形,但是这些相应的改变和变形都应属于本发明所附的权利要求的保护范围。Certainly, the present invention also can have other multiple embodiments, without departing from the spirit and essence of the present invention, those skilled in the art can make various corresponding changes and deformations according to the present invention, but these corresponding Changes and deformations should belong to the scope of protection of the appended claims of the present invention.

Claims (14)

1. the distributed automatic scoring system of a SET; Comprise one the scoring management devices manage some examination client terminal devices; Scoring management devices and at least one scoring task scheduling apparatus communicate; A scoring task scheduling apparatus is managed some score calculation devices, and the module that wherein is used for automatic scoring comprises:
Recording module is used for detecting and recording examinee's voice;
Speech processing module is used for examinee's voice signal is handled, and extracts acoustic feature, and the text that recognizing voice is corresponding obtains the time border of acoustic elements;
The pronunciation evaluation module is used for the result according to exercise question information and speech processing module output, and the different aspect of user pronunciation is analyzed and assessed;
The flow characteristic extracting module is used to extract the flow comprehensive characteristics that reflects examinee's spoken language proficiency, comprises the characteristic of acoustics, syntax and semantics;
Comprehensive grading module: utilize the flow comprehensive characteristics, user speech is carried out comprehensive grading; Wherein
Except that recording module, other grading module can be come the dynamic dispatching deployment according to the computational load and the offered load of the score calculation device on examination client terminal device and backstage.
2. the distributed automatic scoring system of SET as claimed in claim 1, wherein
The examination client terminal device; Be used for recording examinee's voice through said recording module; Through said local computing unit pre-service is carried out in examinee's answer; Calculate certain middle appraisal result, send to the scoring management devices to examinee's answer and middle appraisal result thereof through network communication unit or other modes;
The scoring management devices; Be used to obtain examinee's answer and middle appraisal result thereof from the examination client terminal device; Appraisal result and exercise question information structuring scoring task sends to the scoring task scheduling apparatus in the middle of utilizing, and receives the final appraisal result from the scoring task scheduling apparatus;
The scoring task scheduling apparatus; Be used to receive the scoring task that said scoring management devices sends; Resolve and extract score data, organize the score calculation program automatically, generate the score calculation task according to information such as exercise question information and the required modules of scoring; Submit to calculation task to the score calculation device through dispatching algorithm, obtain final appraisal result from the score calculation device at last;
The score calculation device is used to receive and carry out the scoring procedures that said scoring task scheduling apparatus sends, and sends to said scoring task scheduling apparatus to final appraisal result.
3. the distributed automatic scoring system of SET as claimed in claim 2, the client terminal device of wherein taking an examination can be disposed a plurality of grading module, is guaranteeing to utilize local cpu to accomplish certain scoring work under the impregnable situation of Recording Process; Said scoring work can be carried out with examination simultaneously, for avoiding that examination is had any impact, and also can be at the free time timing operation outside the examination.
4. the distributed automatic scoring system of SET as claimed in claim 2, the management devices of wherein marking disposes reliable mass-memory unit with the scoring task scheduling apparatus, generally can have high data transfer bandwidth; Said scoring task scheduling apparatus is safeguarded the calculation task formation of a prioritization in internal memory.
5. the distributed automatic scoring system of SET as claimed in claim 2; The network data transmission mode of wherein taking an examination between client terminal device, scoring management devices and the scoring task scheduling apparatus can use other data transfer mode to substitute, and all the other devices and workflow still remain unchanged; Consider from data confidentiality, transmit data and can carry out encryption.
6. the distributed automatic scoring system of SET as claimed in claim 4, the task scheduling algorithm of wherein said scoring task scheduling apparatus is: if task queue is discontented, then insert the formation relevant position to task according to its priority; Task priority is higher than some task priority in the formation else if, then is moved out to buffer area or external memory storage to the minimum task of priority, inserts the formation relevant position to this task; Otherwise store this task into buffer area or external memory storage; Regularly query task formation if having living space, then adds formation to the buffer area task by its priority; Each score calculation device of automatic regular polling, if there is CPU idle, the task of then from task queue, getting is carried out computing to this CPU.
7. the distributed automatic scoring system of SET as claimed in claim 4, but wherein said scoring task scheduling apparatus is submitted to the score calculation device to the calculation task that a plurality of scoring taskings constitute a batch processing together.
8. the distributed automatic scoring system of SET as claimed in claim 1, wherein the flow characteristic extracting module combines exam question information to extract the flow comprehensive characteristics according to the output result of speech processing module and pronunciation evaluation module; Said exam question information comprises exercise question type, exercise question requirement, item content, investigation emphasis and model answer text at least.
9. the distributed automatic scoring system of SET as claimed in claim 1, its automatic scoring process is divided into two stages: the phase one scoring is deployed on the examination client terminal device; The subordinate phase scoring is deployed on the score calculation device; Wherein: the phase one scoring comprises recording module.
10. the distributed automatic scoring system of SET as claimed in claim 1, its automatic scoring module is successively to rely on; Except that recording module, other modules can be on phase one scoring and subordinate phase be marked flexible allocation; Concrete scheme is: the phase one scoring also comprises speech processing module; The subordinate phase scoring is made up of pronunciation evaluation module, flow characteristic extracting module and comprehensive grading module.
11. the distributed automatic scoring system of SET as claimed in claim 9, the subordinate phase scoring that is deployed on the score calculation device supports the batch of extensive scoring task to calculate, and a plurality of score calculation task independent parallels are carried out; Each score calculation task of subordinate phase scoring can comprise the scoring of answer of a plurality of examinees' multiple tracks exercise question or a whole set of examination question.
12. the distributed automatic scoring method of a SET, wherein the concrete steps of scoring comprise:
Step 1, detect examinee's voice and record through recording module;
Step 2, extraction phonetic feature utilize acoustic model and language model, and voice are discerned, and obtain corresponding text and phoneme unit time border and likelihood score score;
Step 3, utilize phoneme unit time border and likelihood score score and exercise question relevant knowledge, voice are carried out analysis and evaluation, obtain pronunciation accuracy, rhythm accuracy, pronunciation integrity degree and the fluent degree of pronunciation etc. of different phonetic unit;
Step 4, integrated voice result, assessment result and exercise question information are extracted the acoustics of reflection examinee spoken language proficiency, the flow comprehensive characteristics of syntax and semantics aspect;
Step 5, based on the model and the designed regular of training in advance, use the flow comprehensive characteristics that examinee's spoken language proficiency is carried out comprehensive grading.
13. the distributed automatic scoring method of a SET specifically comprises the steps:
Step 1, examinee obtain exam question and answer on request; The examination client terminal device is recorded examinee's voice; With requiring examinee's voice are carried out the phase one score calculation according to exercise question information, send to the scoring management devices to the phase one appraisal result of examinee's voice and acquisition according to the self-defining data exchange standard;
Step 2, scoring management devices receive examinee's answer and the phase one appraisal result that the examination client terminal device sends; Resolve appraisal result; If appraisal result is not final scoring; Then be made into the scoring task to examinee's voice and scoring intermediate result and exercise question message block, send to the scoring task scheduling apparatus and carry out the subordinate phase scoring according to the self-defining data exchange standard;
Step 3, scoring task scheduling apparatus receive the scoring task that the scoring management devices sends; Resolve the scoring task and obtain score data and task priority; Select required grading module and algorithm according to being provided with of grading module territory; Generate the score calculation program, the score calculation device that is submitted to the backstage according to task priority calculates;
Step 4, score calculation device receive the scoring procedures that task scheduling apparatus sends, and carry out the subordinate phase scoring, send to the scoring task scheduling apparatus to final appraisal result, and the task scheduling apparatus of marking then is forwarded to the scoring management devices to appraisal result again.
14. the distributed automatic scoring system of SET as claimed in claim 1, said automatic scoring system comprises that a cover calculates fault tolerant mechanism, specifically comprises:
1) dispose subsequent use computational resource: for the CPU of backstage score calculation device configuring redundancy with reply burst hardware fault; If there is the score calculation device to break down, said task scheduling apparatus is launched subsequent use score calculation device CPU automatically, and resubmits the scoring task on this computing machine, and sends a warning; Subsequent use score calculation device is in waiting status always, and said task scheduling apparatus can be launched the CPU of this device at any time;
2) daily record of record calculation process: the scoring task of inscribing for per pass all keeps a journal file; Up to obtaining last scoring, write down the implementation status of each step in each grading module from the recording beginning, comprise the order of execution at least; Data directory, execution time and execution state information;
3) detect recording unusually: before examination formally began, said recording module required examinee's rehearsal sound.Through examinee's sound signal that analysis is obtained, it is unusual to detect sound-recording function, and prompting examinee adjustment;
4) detect the score calculation task abnormity: the computation process of said each scoring task of scoring task scheduling apparatus monitoring, if find to have the task computation failure,, then launch subsequent use computational resource, and resubmit task if the computing hardware problem causes; Otherwise misregistration information, information gives a warning;
5) detect data transmission exception: said examination client terminal device and scoring management devices monitor data transmission course, if data transmission fails is arranged, the some time transmits again at interval, and misregistration information, information gives a warning;
6) data backup: said examination client terminal device is preserved the result of user speech and phase one scoring temporarily, successfully transmits up to these data; Said scoring management devices is preserved examinee's voice, scoring task and final appraisal result; Said scoring task scheduling apparatus is preserved scoring task and final appraisal result temporarily.
CN201110336179.4A 2011-10-31 2011-10-31 A kind of distributed Auto-Evaluation System of SET and method thereof Active CN102509483B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201110336179.4A CN102509483B (en) 2011-10-31 2011-10-31 A kind of distributed Auto-Evaluation System of SET and method thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201110336179.4A CN102509483B (en) 2011-10-31 2011-10-31 A kind of distributed Auto-Evaluation System of SET and method thereof

Publications (2)

Publication Number Publication Date
CN102509483A true CN102509483A (en) 2012-06-20
CN102509483B CN102509483B (en) 2015-10-21

Family

ID=46221559

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201110336179.4A Active CN102509483B (en) 2011-10-31 2011-10-31 A kind of distributed Auto-Evaluation System of SET and method thereof

Country Status (1)

Country Link
CN (1) CN102509483B (en)

Cited By (30)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102779435A (en) * 2012-07-30 2012-11-14 无锡南理工科技发展有限公司 Wireless intelligent answering system supporting multi-network and multi-terminal and data transmission method thereof
CN103151042A (en) * 2013-01-23 2013-06-12 中国科学院深圳先进技术研究院 Full-automatic oral language evaluating management and scoring system and scoring method thereof
CN103258544A (en) * 2013-04-15 2013-08-21 深圳市海云天科技股份有限公司 Recording testing method, recording testing device, examination terminal and examination system
CN103605492A (en) * 2013-11-28 2014-02-26 中国科学院深圳先进技术研究院 Self-adaption language training method and platform
CN103685313A (en) * 2013-12-26 2014-03-26 北京中电普华信息技术有限公司 Communication method and system applied to online examination
CN104240543A (en) * 2013-06-19 2014-12-24 一宇数位科技股份有限公司 Cloud course market teaching system
CN104347071A (en) * 2013-08-02 2015-02-11 安徽科大讯飞信息科技股份有限公司 Method and system for generating oral test reference answer
CN104575490A (en) * 2014-12-30 2015-04-29 苏州驰声信息科技有限公司 Spoken language pronunciation detecting and evaluating method based on deep neural network posterior probability algorithm
CN104715640A (en) * 2015-03-14 2015-06-17 陕西学前师范学院 Language teaching system
CN105556564A (en) * 2013-07-16 2016-05-04 株式会社倍乐生 Portable information processing device, test assistance system, and test assistance method
CN106205635A (en) * 2016-07-13 2016-12-07 中南大学 Speech processing method and system
CN106409283A (en) * 2016-08-31 2017-02-15 上海交通大学 Audio frequency-based man-machine mixed interaction system and method
CN107316255A (en) * 2017-04-07 2017-11-03 苏州清睿教育科技股份有限公司 A kind of efficient competition method competed online that shuttles
CN107578778A (en) * 2017-08-16 2018-01-12 南京高讯信息科技有限公司 A kind of method of spoken scoring
CN107767862A (en) * 2017-11-06 2018-03-06 深圳市领芯者科技有限公司 Voice data processing method, system and storage medium
CN108877308A (en) * 2018-07-14 2018-11-23 汤凯 A kind of VR safety education evaluating method
CN109087667A (en) * 2018-09-19 2018-12-25 平安科技(深圳)有限公司 The recognition methods of voice fluency, device, computer equipment and readable storage medium storing program for executing
CN109147765A (en) * 2018-11-16 2019-01-04 安徽听见科技有限公司 Audio quality comprehensive evaluating method and system
CN109785698A (en) * 2017-11-13 2019-05-21 上海流利说信息技术有限公司 Method, apparatus, electronic equipment and medium for spoken language proficiency evaluation and test
CN109889938A (en) * 2019-01-17 2019-06-14 东南大学 Interpreting test recording device
CN110164422A (en) * 2019-04-03 2019-08-23 苏州驰声信息科技有限公司 A kind of the various dimensions appraisal procedure and device of speaking test
CN110164447A (en) * 2019-04-03 2019-08-23 苏州驰声信息科技有限公司 A kind of spoken language methods of marking and device
CN110827835A (en) * 2019-11-21 2020-02-21 上海好学网络科技有限公司 Spoken language examination system and method
CN111161577A (en) * 2019-12-25 2020-05-15 上海好学网络科技有限公司 Spoken language examination system and method
CN111833853A (en) * 2020-07-01 2020-10-27 腾讯科技(深圳)有限公司 Voice processing method and device, electronic equipment and computer readable storage medium
CN113572995A (en) * 2021-04-09 2021-10-29 深圳市即构科技有限公司 An interview video recording device, an interview system and an interview cabin
CN113674766A (en) * 2021-08-18 2021-11-19 上海复深蓝软件股份有限公司 Voice evaluation method and device, computer equipment and storage medium
CN114037574A (en) * 2021-11-18 2022-02-11 国网江苏省电力有限公司电力科学研究院 Lightweight dialogue capability evaluation system, method, equipment and medium
CN114245194A (en) * 2021-12-23 2022-03-25 深圳市优必选科技股份有限公司 Video teaching interaction method and device and electronic equipment
CN119763610A (en) * 2025-01-06 2025-04-04 科大讯飞股份有限公司 Oral language assessment method, device, electronic device, storage medium and program product

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101094104A (en) * 2007-07-30 2007-12-26 中兴通讯股份有限公司 Method and device for carrying out device management through proxy of security network management
CN101197084A (en) * 2007-11-06 2008-06-11 安徽科大讯飞信息科技股份有限公司 Automatic spoken English evaluating and learning system
CN101739868A (en) * 2008-11-19 2010-06-16 中国科学院自动化研究所 Automatic evaluation and diagnosis method of text reading level for oral test
CN201570038U (en) * 2009-12-09 2010-09-01 丁肇芬 Blinding control system for spoken language tests

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101094104A (en) * 2007-07-30 2007-12-26 中兴通讯股份有限公司 Method and device for carrying out device management through proxy of security network management
CN101197084A (en) * 2007-11-06 2008-06-11 安徽科大讯飞信息科技股份有限公司 Automatic spoken English evaluating and learning system
CN101739868A (en) * 2008-11-19 2010-06-16 中国科学院自动化研究所 Automatic evaluation and diagnosis method of text reading level for oral test
CN201570038U (en) * 2009-12-09 2010-09-01 丁肇芬 Blinding control system for spoken language tests

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
严可等: "面向大规模英语口语机考的复述题自动评分技术", 《清华大学学报(自然科学版)》 *

Cited By (41)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102779435A (en) * 2012-07-30 2012-11-14 无锡南理工科技发展有限公司 Wireless intelligent answering system supporting multi-network and multi-terminal and data transmission method thereof
CN103151042B (en) * 2013-01-23 2016-02-24 中国科学院深圳先进技术研究院 Full-automatic oral evaluation management and points-scoring system and methods of marking thereof
CN103151042A (en) * 2013-01-23 2013-06-12 中国科学院深圳先进技术研究院 Full-automatic oral language evaluating management and scoring system and scoring method thereof
CN103258544A (en) * 2013-04-15 2013-08-21 深圳市海云天科技股份有限公司 Recording testing method, recording testing device, examination terminal and examination system
CN103258544B (en) * 2013-04-15 2016-05-25 深圳市海云天科技股份有限公司 A kind of examination system
CN104240543A (en) * 2013-06-19 2014-12-24 一宇数位科技股份有限公司 Cloud course market teaching system
CN105556564A (en) * 2013-07-16 2016-05-04 株式会社倍乐生 Portable information processing device, test assistance system, and test assistance method
CN104347071A (en) * 2013-08-02 2015-02-11 安徽科大讯飞信息科技股份有限公司 Method and system for generating oral test reference answer
CN103605492A (en) * 2013-11-28 2014-02-26 中国科学院深圳先进技术研究院 Self-adaption language training method and platform
CN103605492B (en) * 2013-11-28 2016-08-17 中国科学院深圳先进技术研究院 A kind of self adaptation speech training method and platform
CN103685313A (en) * 2013-12-26 2014-03-26 北京中电普华信息技术有限公司 Communication method and system applied to online examination
CN104575490A (en) * 2014-12-30 2015-04-29 苏州驰声信息科技有限公司 Spoken language pronunciation detecting and evaluating method based on deep neural network posterior probability algorithm
CN104575490B (en) * 2014-12-30 2017-11-07 苏州驰声信息科技有限公司 Spoken language pronunciation evaluating method based on deep neural network posterior probability algorithm
CN104715640A (en) * 2015-03-14 2015-06-17 陕西学前师范学院 Language teaching system
CN106205635A (en) * 2016-07-13 2016-12-07 中南大学 Speech processing method and system
CN106409283B (en) * 2016-08-31 2020-01-10 上海交通大学 Man-machine mixed interaction system and method based on audio
CN106409283A (en) * 2016-08-31 2017-02-15 上海交通大学 Audio frequency-based man-machine mixed interaction system and method
CN107316255A (en) * 2017-04-07 2017-11-03 苏州清睿教育科技股份有限公司 A kind of efficient competition method competed online that shuttles
CN107578778A (en) * 2017-08-16 2018-01-12 南京高讯信息科技有限公司 A kind of method of spoken scoring
CN107767862A (en) * 2017-11-06 2018-03-06 深圳市领芯者科技有限公司 Voice data processing method, system and storage medium
CN107767862B (en) * 2017-11-06 2024-05-21 深圳市领芯者科技有限公司 Voice data processing method, system and storage medium
CN109785698A (en) * 2017-11-13 2019-05-21 上海流利说信息技术有限公司 Method, apparatus, electronic equipment and medium for spoken language proficiency evaluation and test
CN108877308A (en) * 2018-07-14 2018-11-23 汤凯 A kind of VR safety education evaluating method
CN109087667A (en) * 2018-09-19 2018-12-25 平安科技(深圳)有限公司 The recognition methods of voice fluency, device, computer equipment and readable storage medium storing program for executing
CN109087667B (en) * 2018-09-19 2023-09-26 平安科技(深圳)有限公司 Voice fluency recognition method and device, computer equipment and readable storage medium
WO2020056995A1 (en) * 2018-09-19 2020-03-26 平安科技(深圳)有限公司 Method and device for determining speech fluency degree, computer apparatus, and readable storage medium
CN109147765A (en) * 2018-11-16 2019-01-04 安徽听见科技有限公司 Audio quality comprehensive evaluating method and system
CN109889938B (en) * 2019-01-17 2021-03-26 东南大学 Oral interpretation examination recording device
CN109889938A (en) * 2019-01-17 2019-06-14 东南大学 Interpreting test recording device
CN110164447B (en) * 2019-04-03 2021-07-27 苏州驰声信息科技有限公司 Spoken language scoring method and device
CN110164422A (en) * 2019-04-03 2019-08-23 苏州驰声信息科技有限公司 A kind of the various dimensions appraisal procedure and device of speaking test
CN110164447A (en) * 2019-04-03 2019-08-23 苏州驰声信息科技有限公司 A kind of spoken language methods of marking and device
CN110827835A (en) * 2019-11-21 2020-02-21 上海好学网络科技有限公司 Spoken language examination system and method
CN111161577A (en) * 2019-12-25 2020-05-15 上海好学网络科技有限公司 Spoken language examination system and method
CN111833853A (en) * 2020-07-01 2020-10-27 腾讯科技(深圳)有限公司 Voice processing method and device, electronic equipment and computer readable storage medium
CN111833853B (en) * 2020-07-01 2023-10-27 腾讯科技(深圳)有限公司 Voice processing method and device, electronic equipment and computer readable storage medium
CN113572995A (en) * 2021-04-09 2021-10-29 深圳市即构科技有限公司 An interview video recording device, an interview system and an interview cabin
CN113674766A (en) * 2021-08-18 2021-11-19 上海复深蓝软件股份有限公司 Voice evaluation method and device, computer equipment and storage medium
CN114037574A (en) * 2021-11-18 2022-02-11 国网江苏省电力有限公司电力科学研究院 Lightweight dialogue capability evaluation system, method, equipment and medium
CN114245194A (en) * 2021-12-23 2022-03-25 深圳市优必选科技股份有限公司 Video teaching interaction method and device and electronic equipment
CN119763610A (en) * 2025-01-06 2025-04-04 科大讯飞股份有限公司 Oral language assessment method, device, electronic device, storage medium and program product

Also Published As

Publication number Publication date
CN102509483B (en) 2015-10-21

Similar Documents

Publication Publication Date Title
CN102509483A (en) Distributive automatic grading system for spoken language test and method thereof
US8682241B2 (en) Method and system for improving the quality of teaching through analysis using a virtual teaching device
Li et al. An automated assessment framework for atypical prosody and stereotyped idiosyncratic phrases related to autism spectrum disorder
CN110457432A (en) Interview methods of marking, device, equipment and storage medium
US9652999B2 (en) Computer-implemented systems and methods for estimating word accuracy for automatic speech recognition
CN107680019A (en) A kind of implementation method of Examination Scheme, device, equipment and storage medium
CN107274738A (en) Chinese-English translation teaching points-scoring system based on mobile Internet
WO2020006558A1 (en) System and method for generating dialogue graphs
CN114333787A (en) Scoring method, apparatus, equipment, storage medium and program product for oral examination
CN117745494A (en) A multi-terminal integrated 3D video digital OSCE examination system
CN113158690B (en) Test method and device of conversation robot
CN112148845A (en) Method and device for inputting verbal resources of robot, electronic equipment and storage medium
CN118248164A (en) Processing method and system based on large model and talent operation quality inspection of tablet equipment
EP4542543A1 (en) System and method for digital assistance through a display device
CN205814303U (en) A kind of Chinese standard aphasia evaluation system
US12406685B2 (en) Methods and systems for cross-correlating and aligning parallel speech utterances to improve quality assurance
CN118538200A (en) Recording quality detection method, electronic equipment and storage medium
CN113593553B (en) Voice recognition method, voice recognition apparatus, voice management server, and storage medium
CN109754648A (en) A teaching method and device for analyzing the quality of clinical interviews
CN116756348A (en) Data processing methods, devices, electronic equipment and storage media
CN108958715A (en) A kind of On-line Examining system based on J2EE framework
CN115130841A (en) Method and system for evaluating explanation quality of instructor
CN114339132A (en) Intelligent meeting minutes method, device and computer equipment for video conferencing
Sun Research on Intelligent Oral English Teaching Training System Based on Computer Multimedia Technology
Nandwalkar et al. IntelliMeet: AI-powered meeting summarization with FLAN-T5 and cosine clustering

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
DD01 Delivery of document by public notice

Addressee: Suzhou gallop Mdt InfoTech Ltd patent controller (collect)

Document name: Notification that Application Deemed not to be Proposed

DD01 Delivery of document by public notice

Addressee: Suzhou gallop Mdt InfoTech Ltd patent controller (collect)

Document name: Notification that Application Deemed not to be Proposed

ASS Succession or assignment of patent right

Owner name: SUZHOU CHISHENG INFORMATION TECHNOLOGY CO., LTD.

Free format text: FORMER OWNER: SUZHOU AISPEECH INFORMATION TECHNOLOGY CO., LTD.

Effective date: 20140926

C41 Transfer of patent application or patent right or utility model
COR Change of bibliographic data

Free format text: CORRECT: ADDRESS; FROM: 215021 SUZHOU, JIANGSU PROVINCE TO: 215000 SUZHOU, JIANGSU PROVINCE

TA01 Transfer of patent application right

Effective date of registration: 20140926

Address after: 102, room 210, Xinghai street, Suzhou Industrial Park, Jiangsu, Suzhou 215000, China

Applicant after: Suzhou Chisheng Information Technology Co., Ltd.

Address before: 215021, Jiangsu Province, Suzhou Industrial Park, No. 258 benevolence Road, alone lake library, C106

Applicant before: Suzhou Speech Information Technology Co., Ltd.

C53 Correction of patent of invention or patent application
CB03 Change of inventor or designer information

Inventor after: Zhou Weida

Inventor after: Lin Yuandong

Inventor after: Yu Kai

Inventor after: Wang Huanliang

Inventor after: Xue Feng

Inventor after: Zhu Hongjie

Inventor after: Zhang Li

Inventor before: Zhou Weida

Inventor before: Yu Kai

Inventor before: Wang Huanliang

Inventor before: Xue Feng

Inventor before: Zhu Hongjie

Inventor before: Zhang Li

COR Change of bibliographic data

Free format text: CORRECT: INVENTOR; FROM: ZHOU WEIDA YU KAI WANG HUANLIANG XUE FENG ZHU HONGJIE ZHANG LI TO: ZHOU WEIDA LIN YUANDONG YU KAI WANG HUANLIANG XUE FENG ZHU HONGJIE ZHANG LI

C14 Grant of patent or utility model
GR01 Patent grant