CN117112760A

CN117112760A - Intelligent education big model based on knowledge base

Info

Publication number: CN117112760A
Application number: CN202311082062.7A
Authority: CN
Inventors: 许辉; 张浩楠; 向瑞睿; 向生建; 刘才广; 张勇雄; 刘成; 李云雪; 彭彬; 莫智文
Original assignee: Sichuan Xinlong Xinchuang Technology Co ltd
Current assignee: Sichuan Xinlong Xinchuang Technology Co ltd
Priority date: 2023-08-26
Filing date: 2023-08-26
Publication date: 2023-11-24

Abstract

The invention belongs to the technical field of intelligent education, in particular to an intelligent education big model based on a knowledge base, which comprises a big model training and fine tuning module, a voice recognition module, a text preprocessing module and an intelligent dialogue module, wherein the voice recognition module and the text preprocessing module cooperatively process user input, and the intelligent dialogue module takes the big model as a base, cooperates with the voice recognition and text preprocessing module, responds to a user request to conduct intelligent reasoning and replies based on prior information provided by a self-research knowledge base. Specifically, the voice recognition module is used for processing voice input of a user in a complex noise environment, and then the voice input is converted into text information by the text preprocessing module and is input into the subsequent intelligent dialogue module. According to the invention, the safety and controllability of the model are enhanced, the safety shielding design is carried out in the process of training corpus and outputting the model, the safety and privacy of educational users can be better protected, and the model is prevented from misleading students or generating harmful information.

Description

A large model of intelligent education based on knowledge base

技术领域Technical field

本发明属于智能教育技术领域，具体为一种基于知识库的智能教育大模型。The invention belongs to the field of intelligent education technology, and is specifically a large intelligent education model based on a knowledge base.

背景技术Background technique

在人工智能的发展中，大规模预训练模型的创建与应用已经取得了显著的成就，尤其在数据提取和决策支持方面，然而，尽管这些模型已经实现了巨大的进步，但在面对安全可控性及其在垂直行业(如教育)的应用时，仍存在显著的问题和局限性。In the development of artificial intelligence, the creation and application of large-scale pre-trained models have made remarkable achievements, especially in data extraction and decision support. However, although these models have achieved great progress, they still have to face the challenges of security and reliability. There are still significant problems and limitations when it comes to controllability and its application in vertical industries such as education.

首先，从安全可控的角度考虑，大规模模型的发展起源于国外，其在国内商业化应用时可能存在安全风险，特别是在涉及机密信息的场景中，此外，大模型的训练语料往往缺乏对中文和国内语义环境的持续学习适配，导致基于中文的大模型相对较少，在垂直应用场景，如智能教育行业中，大模型目前主要应用于对话式问答，却在具体的应用场景研究方面仍有所欠缺。First of all, from the perspective of security and controllability, the development of large-scale models originated abroad. There may be security risks when they are commercially applied in China, especially in scenarios involving confidential information. In addition, training corpus for large models is often lacking. Continuous learning and adaptation to Chinese and domestic semantic environments have resulted in relatively few large models based on Chinese. In vertical application scenarios, such as the intelligent education industry, large models are currently mainly used in conversational question and answer, but in specific application scenario research There are still some shortcomings.

其次，现有的预训练模型训练数据多源于互联网，这些数据知识并不等同于教育领域的专业知识，因此，当现有大模型面临如物理、化学、英语等专业领域知识迁移时，可能会出现困难，这可能会导致教育行业的用户获取到误导信息，对于这些问题，尚未存在一种有效的解决策略或方案。Secondly, the existing pre-training model training data mostly comes from the Internet. This data knowledge is not equivalent to professional knowledge in the education field. Therefore, when existing large models face the transfer of knowledge in professional fields such as physics, chemistry, English, etc., it may Difficulties will arise, which may lead to users in the education industry receiving misleading information, and there is not yet an effective solution strategy or solution to these problems.

因此，需要一种基于知识库的智能教育大模型来解决上述提到的问题。Therefore, a large intelligent education model based on knowledge base is needed to solve the above-mentioned problems.

发明内容Contents of the invention

针对上述情况，为克服现有技术的缺陷，本发明提供一种基于知识库的智能教育大模型，有效的解决了上述背景技术中提出的问题。In view of the above situation, in order to overcome the shortcomings of the existing technology, the present invention provides a large intelligent education model based on a knowledge base, which effectively solves the problems raised in the above background technology.

为实现上述目的，本发明提供如下技术方案：一种基于知识库的智能教育大模型，包括大模型训练与微调模块、语音识别模块、文本预处理模块和智能对话模块，所述语音识别模块与文本预处理模块协同处理用户输入，所述智能对话模块以大模型为底座，协同语音识别和文本预处理模块响应用户请求，进行智能推理并回复。具体地，所述语音识别模块用于处理复杂噪声环境下用户的语音输入，随后由文本预处理模块将其转化为文字信息输入后续智能对话模块，交由大模型进行推理预测并回复。同时，本发明通过增强模型的安全性和可控性，在模型训练语料和输出后处理进行安全屏蔽设计、可以更好地保护教育用户的安全和隐私，防止模型误导学生或生成有害信息。In order to achieve the above objectives, the present invention provides the following technical solution: a large intelligent education model based on a knowledge base, including a large model training and fine-tuning module, a speech recognition module, a text preprocessing module and an intelligent dialogue module. The speech recognition module and The text preprocessing module collaboratively processes user input. The intelligent dialogue module is based on a large model and collaborates with the speech recognition and text preprocessing modules to respond to user requests, perform intelligent reasoning and reply. Specifically, the speech recognition module is used to process the user's voice input in a complex noise environment, and then the text preprocessing module converts it into text information and inputs it into the subsequent intelligent dialogue module, which is handed over to the large model for inference prediction and reply. At the same time, by enhancing the security and controllability of the model, the present invention performs a security shielding design on the model training corpus and output post-processing, which can better protect the security and privacy of education users and prevent the model from misleading students or generating harmful information.

语音识别模块：该模块主要用于处理复杂嘈杂噪声环境下用户的语音输入，并负责语音去噪等一系列音频预处理工作，其半转录后的语音信号会随之送入后续的文本预处理模块，用于进一步的语音转文字流程。所述语音识别模块的设计和工作流程为：Speech recognition module: This module is mainly used to process the user's voice input in complex and noisy environments, and is responsible for a series of audio preprocessing tasks such as voice denoising. Its semi-transcribed speech signal will then be sent to subsequent text preprocessing. Module for further speech-to-text process. The design and workflow of the speech recognition module are:

S1、使用包含各类背景音/噪音的音频数据集微调语音识别模块，提升其鲁棒性；S1. Use audio data sets containing various background sounds/noises to fine-tune the speech recognition module to improve its robustness;

S2、收集用户对于指定文本的发音，用以确定用户的发音准确性以及发音偏好，进一步微调语音识别模块，提升其识别准确性；S2. Collect the user's pronunciation of the specified text to determine the user's pronunciation accuracy and pronunciation preference, and further fine-tune the speech recognition module to improve its recognition accuracy;

S3、在系统使用过程中，针对用户输入的音频进行降噪、矫正等预处理步骤，输出给后续模块；S3. During the use of the system, preprocessing steps such as noise reduction and correction are performed on the audio input by the user, and the output is output to subsequent modules;

文本预处理模块：该模块用于接收来自语音识别模块的数据，完成语音转录，分词、文本向量化等任务。所述文本预处理模块的设计和工作流程为：Text preprocessing module: This module is used to receive data from the speech recognition module and complete tasks such as speech transcription, word segmentation, and text vectorization. The design and workflow of the text preprocessing module are:

S1、对于来自语音识别模块的输出，进行语音转文本的转录工作；针对转化的文本，进行分词，将其分割为单词或子词的序列；S1. For the output from the speech recognition module, perform speech-to-text transcription; perform word segmentation on the converted text and segment it into a sequence of words or sub-words;

S2、分析句子中的单词是否包含非学习相关的词汇及违规内容，如果有则发出对应的提示信息提醒用户，要求用户重新输入或拒绝响应；S2. Analyze whether the words in the sentence contain non-learning related vocabulary and illegal content. If so, send a corresponding prompt message to remind the user and ask the user to re-enter or refuse to respond;

S3、对每个词汇进行词性标注；S3. Perform part-of-speech tagging for each word;

S4、识别和标记文本中的命名实体；S4. Identify and mark named entities in text;

S5、将文本分割成独立的句子，作为智能对话模块的输入；S5. Split the text into independent sentences as input to the intelligent dialogue module;

协同自研知识库的智能对话模块：该模块旨在以国内大模型为底座进行进一步微调，协同语音识别和文本预处理模块，基于自研知识库提供的先验信息，响应用户请求，进行智能推理并回复。所述智能对话模块的工作流程为：Intelligent dialogue module that collaborates with the self-developed knowledge base: This module is designed to be further fine-tuned based on the domestic large model, and collaborates with the speech recognition and text preprocessing modules to respond to user requests and perform intelligent intelligent dialogue based on the prior information provided by the self-developed knowledge base. Reason and reply. The workflow of the intelligent dialogue module is:

S1、基于机器学习和深度学习算法，使用国内中文底座模型在高质量教育数据上进行微调，使其具备语法错误识别，混淆词纠正等能力，更适用于教育场景，获得一个面向教育的专用大模型；S1. Based on machine learning and deep learning algorithms, use the domestic Chinese base model to fine-tune on high-quality educational data, so that it has the ability to identify grammatical errors, correct confusing words, etc., and is more suitable for educational scenarios, obtaining a dedicated large-scale education-oriented Model;

S2、用户设定语音练习的偏好场景，如商务对话，英语演讲，日常对话等场景，从而实现对智能对话模块的使用场景微调；S2. Users set preferred scenarios for voice practice, such as business conversations, English speeches, daily conversations, etc., to fine-tune the usage scenarios of the intelligent dialogue module;

S3、更新并维护自研知识库，及时更新并扩展模型的知识储备；S3. Update and maintain the self-researched knowledge base, and promptly update and expand the knowledge reserve of the model;

S4、微调后的语言大模型基于其背景知识，场景自适应的提示词和具备高度时效性的知识文本，生成用户所需的应答文本；S4. The fine-tuned large language model generates the response text required by the user based on its background knowledge, scene-adaptive prompt words and highly time-sensitive knowledge text;

S5、使用智能对话模块中的语音转换子模块，生成应答文本对应的语音，反馈给用户。S5. Use the voice conversion sub-module in the intelligent dialogue module to generate the voice corresponding to the response text and feed it back to the user.

进一步的，所述的基于机器学习和深度学习算法，使用国内中文底座模型在高质量教育数据上进行微调S1，其工作流程如下：Further, based on the machine learning and deep learning algorithms, the domestic Chinese base model is used to fine-tune S1 on high-quality education data. The workflow is as follows:

S1-1、使用爬虫等技术从互联网数据中构建出千万级规模微调数据集对底座模型进行有监督训练，增强其对教育相关知识的响应能力和推理性能；S1-1. Use crawler and other technologies to construct tens of millions of scale fine-tuning data sets from Internet data to conduct supervised training of the base model to enhance its responsiveness and reasoning performance for education-related knowledge;

S1-2、面向教育行业，针对性地收集强相关的高质量数据进行进一步的参数微调，增强模型在细化专业领域的知识理解；S1-2. Facing the education industry, collect highly relevant high-quality data in a targeted manner for further fine-tuning of parameters to enhance the knowledge understanding of the model in detailed professional fields;

S1-3、基于专业的教育从业人士对模型输出内容的反馈，让模型学习专业人士的语言风格以及对具体问题的分析过程，增强模型输出的专业性。S1-3. Based on the feedback of professional education practitioners on the model output content, the model can learn the language style of professionals and the analysis process of specific problems to enhance the professionalism of the model output.

进一步的，更新并维护自研知识库，及时更新并扩展模型的知识储备S3，其工作流程如下：Further, update and maintain the self-researched knowledge base, and timely update and expand the model's knowledge reserve S3. The workflow is as follows:

S3-1、基于数据收集子模块，实时从互联网中抓取并更新数据库中各科目相关文档知识，如教材，教案，视频媒体中的文字信息等，形成海量且及时的数据库；S3-1. Based on the data collection sub-module, capture and update document knowledge related to each subject in the database in real time from the Internet, such as textbooks, lesson plans, text information in video media, etc., forming a massive and timely database;

S3-2、基于数据分块子模块，按照知识的特征进行分类并分块，用以构建向量数据库，供模型分析和使用；S3-2. Based on the data segmentation sub-module, it is classified and segmented according to the characteristics of knowledge to build a vector database for model analysis and use;

S3-3、基于用户query向量化模块，将结构化的文本信息投射至嵌入空间，提取出文本的特征向量，结合向量检索技术，从数据库中获取并返回最相关的特征向量，从而实现知识的快速查询。S3-3. Based on the user query vectorization module, project the structured text information into the embedding space, extract the feature vector of the text, and combine it with vector retrieval technology to obtain and return the most relevant feature vector from the database, thereby realizing knowledge Quick Search.

与现有技术相比，本发明的有益效果是：Compared with the prior art, the beneficial effects of the present invention are:

1、本发明，通过自研教育专用大模型，收集行业内千万级规模的真实数据，完成高效的智能技术部署和教育行业落地应用。1. This invention collects tens of millions of real data in the industry through self-developed large models dedicated to education, and completes efficient intelligent technology deployment and practical application in the education industry.

2、本发明具备应用扩展能力，通过增强模型的专有训练数据，能实现将这些技术应用从英语迁移到更多的教育学科，如语文、数学、物理、化学等，实现更广泛的社会效益。2. The present invention has application expansion capabilities. By enhancing the proprietary training data of the model, it can realize the migration of these technical applications from English to more educational subjects, such as Chinese, mathematics, physics, chemistry, etc., to achieve wider social benefits. .

3、本发明通过增强模型的安全性和可控性，在模型训练语料和输出后处理进行安全屏蔽设计、可以更好地保护教育用户的安全和隐私，防止模型误导学生或生成有害信息。3. By enhancing the security and controllability of the model, the present invention performs a security shielding design on the model training corpus and output post-processing, which can better protect the security and privacy of education users and prevent the model from misleading students or generating harmful information.

附图说明Description of drawings

附图用来提供对本发明的进一步理解，并且构成说明书的一部分，与本发明的实施例一起用于解释本发明，并不构成对本发明的限制。在附图中：The drawings are used to provide a further understanding of the present invention and constitute a part of the specification. They are used to explain the present invention together with the embodiments of the present invention and do not constitute a limitation of the present invention. In the attached picture:

图1为本发明的模块示意图。Figure 1 is a schematic diagram of the module of the present invention.

具体实施方式Detailed ways

下面将结合本发明实施例中的附图，对本发明实施例中的技术方案进行清楚、完整地描述，显然，所描述的实施例仅仅是本发明一部分实施例，而不是全部的实施例；基于本发明中的实施例，本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例，都属于本发明保护的范围。The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them; based on The embodiments of the present invention and all other embodiments obtained by those of ordinary skill in the art without creative efforts fall within the scope of protection of the present invention.

实施例一，由图1给出，本发明公开了一种基于知识库的智能教育大模型，一种基于知识库的智能教育大模型，包括大模型训练与微调模块、语音识别模块、文本预处理模块和智能对话模块，所述语音识别模块与文本预处理模块协同处理用户输入，所述智能对话模块以大模型为底座，协同语音识别和文本预处理模块响应用户请求，进行智能推理并回复。具体地，所述语音识别模块用于处理复杂噪声环境下用户的语音输入，随后由文本预处理模块将其转化为文字信息输入后续智能对话模块，交由大模型进行推理预测并回复。同时，本发明通过增强模型的安全性和可控性，在模型训练语料和输出后处理进行安全屏蔽设计、可以更好地保护教育用户的安全和隐私，防止模型误导学生或生成有害信息。Embodiment 1, as shown in Figure 1, the present invention discloses a large intelligent education model based on a knowledge base. A large intelligent education model based on a knowledge base includes a large model training and fine-tuning module, a speech recognition module, and a text pre-processing module. processing module and intelligent dialogue module. The speech recognition module and the text preprocessing module cooperate to process user input. The intelligent dialogue module uses a large model as a base to cooperate with the speech recognition and text preprocessing module to respond to user requests, perform intelligent reasoning and reply. . Specifically, the speech recognition module is used to process the user's voice input in a complex noise environment, and then the text preprocessing module converts it into text information and inputs it into the subsequent intelligent dialogue module, which is handed over to the large model for inference prediction and reply. At the same time, by enhancing the security and controllability of the model, the present invention performs a security shielding design on the model training corpus and output post-processing, which can better protect the security and privacy of education users and prevent the model from misleading students or generating harmful information.

需要说明的是，在本文中，诸如第一和第二等之类的关系术语仅仅用来将一个实体或者操作与另一个实体或操作区分开来，而不一定要求或者暗示这些实体或操作之间存在任何这种实际的关系或者顺序。而且，术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含，从而使得包括一系列要素的过程、方法、物品或者设备不仅包括那些要素，而且还包括没有明确列出的其他要素，或者是还包括为这种过程、方法、物品或者设备所固有的要素。It should be noted that in this article, relational terms such as first and second are only used to distinguish one entity or operation from another entity or operation, and do not necessarily require or imply that these entities or operations are mutually exclusive. any such actual relationship or sequence exists between them. Furthermore, the terms "comprises," "comprises," or any other variations thereof are intended to cover a non-exclusive inclusion such that a process, method, article, or apparatus that includes a list of elements includes not only those elements, but also those not expressly listed other elements, or elements inherent to the process, method, article or equipment.

尽管已经示出和描述了本发明的实施例，对于本领域的普通技术人员而言，可以理解在不脱离本发明的原理和精神的情况下可以对这些实施例进行多种变化、修改、替换和变型，本发明的范围由所附权利要求及其等同物限定。Although the embodiments of the present invention have been shown and described, those of ordinary skill in the art will understand that various changes, modifications, and substitutions can be made to these embodiments without departing from the principles and spirit of the invention. and modifications, the scope of the invention is defined by the appended claims and their equivalents.

Claims

1. A large intelligent education model based on a knowledge base, including a speech recognition module, a text processing module and an intelligent dialogue module, characterized in that: the speech recognition module and the text preprocessing module collaboratively process user input, and the intelligent dialogue module Using the large model as the base, it collaborates with the speech recognition and text preprocessing modules, and based on the prior information provided by the self-developed knowledge base, responds to user requests for intelligent reasoning and replies;

Speech recognition module: This module is mainly used to process the user's voice input in complex and noisy environments, and is responsible for a series of audio preprocessing tasks such as voice denoising. Its semi-transcribed speech signal will then be sent to subsequent text preprocessing. module for further speech-to-text process; the design and workflow of the speech recognition module are:

S1. Use audio data sets containing various background sounds/noises to fine-tune the speech recognition module to improve its robustness;

S2. Collect the user's pronunciation of the specified text to determine the user's pronunciation accuracy and pronunciation preference, and further fine-tune the speech recognition module to improve its recognition accuracy;

S3. During the use of the system, preprocessing steps such as noise reduction and correction are performed on the audio input by the user, and the output is output to subsequent modules;

Text preprocessing module: This module is used to receive data from the speech recognition module and complete tasks such as speech transcription, word segmentation, and text vectorization; the design and workflow of the text preprocessing module are:

S1. For the output from the speech recognition module, perform speech-to-text transcription; perform word segmentation on the converted text and segment it into a sequence of words or sub-words;

S2. Analyze whether the words in the sentence contain non-learning related vocabulary and illegal content. If so, send a corresponding prompt message to remind the user and ask the user to re-enter or refuse to respond;

S3. Perform part-of-speech tagging for each word;

S4. Identify and mark named entities in text;

S5. Split the text into independent sentences as input to the intelligent dialogue module;

Intelligent dialogue module that collaborates with the self-developed knowledge base: This module is designed to be further fine-tuned based on the domestic large model, and collaborates with the speech recognition and text preprocessing modules to respond to user requests and perform intelligent intelligent dialogue based on the prior information provided by the self-developed knowledge base. Reason and reply; the workflow of the intelligent dialogue module is:

S1. Based on machine learning and deep learning algorithms, use the domestic Chinese base model to fine-tune on high-quality educational data, so that it has the ability to identify grammatical errors, correct confusing words, etc., and is more suitable for educational scenarios, obtaining a dedicated large-scale education-oriented Model;

S2. Users set preferred scenarios for voice practice, such as business conversations, English speeches, daily conversations, etc., to fine-tune the usage scenarios of the intelligent dialogue module;

S3. Update and maintain the self-researched knowledge base, and promptly update and expand the knowledge reserve of the model;

S4. The fine-tuned large language model generates the response text required by the user based on its background knowledge, scene-adaptive prompt words and highly time-sensitive knowledge text;

S5. Use the voice conversion sub-module in the intelligent dialogue module to generate the voice corresponding to the response text and feed it back to the user.

2. A large intelligent education model based on knowledge base according to claim 1, characterized in that: the domestic Chinese base model is used to fine-tune S1 on high-quality education data, and its work flow is as follows:

S1-1. Use crawler and other technologies to construct tens of millions of scale fine-tuning data sets from Internet data to conduct supervised training of the base model to enhance its responsiveness and reasoning performance for education-related knowledge;

S1-2. Facing the education industry, collect highly relevant high-quality data in a targeted manner for further fine-tuning of parameters to enhance the knowledge understanding of the model in detailed professional fields;

S1-3. Based on the feedback of professional education practitioners on the model output content, the model can learn the language style of professionals and the analysis process of specific problems to enhance the professionalism of the model output.

3. A large intelligent education model based on knowledge base according to claim 1, characterized in that: updating and maintaining the self-developed knowledge base, timely updating and expanding the knowledge reserve S3 of the model, and its work flow is as follows:

S3-1. Based on the data collection sub-module, capture and update document knowledge related to each subject in the database in real time from the Internet, such as textbooks, lesson plans, text information in video media, etc., forming a massive and timely database;

S3-2. Based on the data segmentation sub-module, it is classified and segmented according to the characteristics of knowledge to build a vector database for model analysis and use;

S3-3. Based on the user query vectorization module, project the structured text information into the embedding space, extract the feature vector of the text, and combine it with vector retrieval technology to obtain and return the most relevant feature vector from the database, thereby realizing knowledge Quick Search.