CN118349661A - A data processing method, device and equipment based on large language model - Google Patents
A data processing method, device and equipment based on large language model Download PDFInfo
- Publication number
- CN118349661A CN118349661A CN202410772348.6A CN202410772348A CN118349661A CN 118349661 A CN118349661 A CN 118349661A CN 202410772348 A CN202410772348 A CN 202410772348A CN 118349661 A CN118349661 A CN 118349661A
- Authority
- CN
- China
- Prior art keywords
- target
- description
- access
- data
- parameter
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/332—Query formulation
- G06F16/3329—Natural language query formulation
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/335—Filtering based on additional data, e.g. user or group profiles
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Artificial Intelligence (AREA)
- Human Computer Interaction (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
本申请提供一种基于大语言模型的数据处理方法、装置及设备,该方法包括:将原始输入问题和提示词输入给大语言模型,通过所述大语言模型基于所述提示词对所述原始输入问题进行处理得到目标问题参数和目标问题描述;其中,所述提示词包括问题样例、问题参数样例和问题描述样例;从特征数据库中查询所述目标问题描述对应的目标访问参数;其中,所述特征数据库包括访问描述与访问参数之间的对应关系;基于所述目标访问参数和所述目标问题参数生成运维系统操作数据;通过所述运维系统操作数据对运维系统进行操作,得到与所述原始输入问题对应的目标运维数据,将所述目标运维数据返回给用户。通过本申请的技术方案,能够节省计算资源,节省时间开销。
The present application provides a data processing method, device and equipment based on a large language model, the method comprising: inputting an original input question and a prompt word into a large language model, processing the original input question based on the prompt word by the large language model to obtain a target question parameter and a target question description; wherein the prompt word includes a question sample, a question parameter sample and a question description sample; querying a target access parameter corresponding to the target question description from a feature database; wherein the feature database includes a correspondence between an access description and an access parameter; generating operation and maintenance system operation data based on the target access parameter and the target question parameter; operating the operation and maintenance system through the operation and maintenance system operation data to obtain the target operation and maintenance data corresponding to the original input question, and returning the target operation and maintenance data to the user. The technical solution of the present application can save computing resources and time overhead.
Description
技术领域Technical Field
本申请涉及人工智能技术领域,尤其是涉及一种基于大语言模型(LargeLanguage Model,简称LLM)的数据处理方法、装置及设备。The present application relates to the field of artificial intelligence technology, and in particular to a data processing method, device and equipment based on a large language model (LLM).
背景技术Background technique
随着网络技术的快速发展,网络数据越来越多,为了从大量网络数据中检索到用户所需数据,需要由用户提供原始输入问题,服务器基于原始输入问题从大量网络数据中检索到用户所需数据,并将数据返回给用户。With the rapid development of network technology, there is more and more network data. In order to retrieve the data required by users from a large amount of network data, users are required to provide original input questions. The server retrieves the data required by users from a large amount of network data based on the original input questions and returns the data to the user.
然而,由于网络数据的数据量很大,服务器需要消耗大量计算资源才能够从大量网络数据中检索到用户所需数据,且服务器需要花费大量时间,从而导致服务器的大量资源消耗,且用户很长时间才能够获知所需数据。However, due to the large amount of network data, the server needs to consume a lot of computing resources to retrieve the data required by the user from a large amount of network data, and the server needs to spend a lot of time, resulting in a large amount of server resource consumption, and the user can only obtain the required data for a long time.
发明内容Summary of the invention
本申请提供一种基于大语言模型的数据处理方法,所述方法包括:The present application provides a data processing method based on a large language model, the method comprising:
将原始输入问题和提示词输入给大语言模型,通过所述大语言模型基于所述提示词对所述原始输入问题进行处理得到目标问题参数和目标问题描述;其中,所述提示词包括问题样例、问题参数样例和问题描述样例;Inputting the original input question and the prompt word into the large language model, and processing the original input question based on the prompt word by the large language model to obtain target question parameters and target question description; wherein the prompt word includes question examples, question parameter examples and question description examples;
从特征数据库中查询所述目标问题描述对应的目标访问参数;其中,所述特征数据库包括访问描述与访问参数之间的对应关系;Querying the target access parameter corresponding to the target problem description from a feature database; wherein the feature database includes a correspondence between the access description and the access parameter;
基于所述目标访问参数和所述目标问题参数生成运维系统操作数据;Generate operation data of the operation and maintenance system based on the target access parameter and the target problem parameter;
通过所述运维系统操作数据对运维系统进行操作,得到与所述原始输入问题对应的目标运维数据,将所述目标运维数据返回给用户。The operation and maintenance system is operated through the operation and maintenance system operation data to obtain target operation and maintenance data corresponding to the original input question, and the target operation and maintenance data is returned to the user.
本申请提供一种基于大语言模型的数据处理装置,所述装置包括:The present application provides a data processing device based on a large language model, the device comprising:
处理模块,用于将原始输入问题和提示词输入给大语言模型,通过所述大语言模型基于所述提示词对所述原始输入问题进行处理得到目标问题参数和目标问题描述;其中,所述提示词包括问题样例、问题参数样例和问题描述样例;A processing module, used to input the original input question and the prompt word into the large language model, and process the original input question based on the prompt word through the large language model to obtain target question parameters and target question description; wherein the prompt word includes question examples, question parameter examples and question description examples;
查询模块,用于从特征数据库中查询所述目标问题描述对应的目标访问参数;其中,所述特征数据库包括访问描述与访问参数之间的对应关系;A query module, used to query the target access parameter corresponding to the target problem description from a feature database; wherein the feature database includes a correspondence between the access description and the access parameter;
操作模块,用于基于所述目标访问参数和所述目标问题参数生成运维系统操作数据,并通过所述运维系统操作数据对运维系统进行操作,得到与所述原始输入问题对应的目标运维数据,将所述目标运维数据返回给用户。An operation module is used to generate operation and maintenance system operation data based on the target access parameters and the target question parameters, and operate the operation and maintenance system through the operation and maintenance system operation data to obtain target operation and maintenance data corresponding to the original input question, and return the target operation and maintenance data to the user.
本申请提供一种电子设备,包括:处理器和机器可读存储介质,所述机器可读存储介质存储有能够被所述处理器执行的机器可执行指令;所述处理器用于执行机器可执行指令,以实现上述基于大语言模型的数据处理方法。The present application provides an electronic device, comprising: a processor and a machine-readable storage medium, wherein the machine-readable storage medium stores machine-executable instructions that can be executed by the processor; the processor is used to execute the machine-executable instructions to implement the above-mentioned data processing method based on a large language model.
由以上技术方案可见,本申请实施例中,在得到原始输入问题之后,借助大语言模型获取原始输入问题对应的目标运维数据,将目标运维数据返回给用户,只需要消耗少量计算资源就能够获取目标运维数据,获取目标运维数据的时间更短,从而节省服务器的计算资源,并节省时间开销。能够快速确定出原始输入问题对应的目标运维数据,提供逻辑分析推理能力,利用大语言模型的分析能力给出准确可靠的目标运维数据。通过一个Copilot(助手)代理作为中间层,用户可以与Copilot代理进行自然语言交互,完成运维需求的交互,Copilot代理将自然语言转化为指令传递给运维系统,实现对话式智能运维系统,提升运维体验,实现用户与运维系统的无障碍人机交互体验,提升用户体验。基于大语言模型的能力,解决自然语言理解的障碍,让用户的输入从页面点击方式变为人机语言交互方式。基于大语言模型的能力,针对运维系统的功能,如概览、诊断、自愈闭环、报表等,实现基于语言交互的自动实现能力。It can be seen from the above technical solutions that in the embodiment of the present application, after obtaining the original input question, the target operation and maintenance data corresponding to the original input question is obtained with the help of a large language model, and the target operation and maintenance data is returned to the user. Only a small amount of computing resources is needed to obtain the target operation and maintenance data, and the time to obtain the target operation and maintenance data is shorter, thereby saving the computing resources of the server and saving time. It is possible to quickly determine the target operation and maintenance data corresponding to the original input question, provide logical analysis and reasoning capabilities, and use the analysis capabilities of the large language model to provide accurate and reliable target operation and maintenance data. Through a Copilot (assistant) agent as an intermediate layer, users can interact with the Copilot agent in natural language to complete the interaction of operation and maintenance requirements. The Copilot agent converts natural language into instructions and passes them to the operation and maintenance system to realize a conversational intelligent operation and maintenance system, improve the operation and maintenance experience, and realize barrier-free human-computer interaction experience between users and the operation and maintenance system, thereby improving user experience. Based on the capabilities of the large language model, the obstacles to natural language understanding are solved, and the user's input is changed from page clicks to human-computer language interaction. Based on the capabilities of the large language model, the automatic implementation capability based on language interaction is realized for the functions of the operation and maintenance system, such as overview, diagnosis, self-healing closed loop, and reporting.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
图1是本申请一种实施方式中的基于大语言模型的数据处理方法的流程图;FIG1 is a flow chart of a data processing method based on a large language model in one embodiment of the present application;
图2是本申请一种实施方式中的Copilot代理的结构示意图;FIG2 is a schematic diagram of the structure of a Copilot agent in one embodiment of the present application;
图3是本申请一种实施方式中的基于大语言模型的数据处理方法的流程图;FIG3 is a flow chart of a data processing method based on a large language model in one embodiment of the present application;
图4是本申请一种实施方式中的基于大语言模型的数据处理方法示意图;FIG4 is a schematic diagram of a data processing method based on a large language model in one embodiment of the present application;
图5是本申请一种实施方式中的Copilot代理支持四种能力的示意图;FIG5 is a schematic diagram of four capabilities supported by the Copilot agent in one embodiment of the present application;
图6是本申请一种实施方式中的单任务流程的示意图;FIG6 is a schematic diagram of a single task process in one embodiment of the present application;
图7是本申请一种实施方式中的多任务流程的示意图;FIG7 is a schematic diagram of a multi-task process in one embodiment of the present application;
图8是本申请一种实施方式中的基于大语言模型的数据处理装置的结构图;FIG8 is a structural diagram of a data processing device based on a large language model in one embodiment of the present application;
图9是本申请一种实施方式中的电子设备的硬件结构图。FIG. 9 is a hardware structure diagram of an electronic device in one embodiment of the present application.
具体实施方式Detailed ways
本申请实施例提出一种基于大语言模型的数据处理方法,可以应用于电子设备,如服务器,参见图1所示,为该方法的流程示意图,该方法可以包括:The present application embodiment proposes a data processing method based on a large language model, which can be applied to an electronic device, such as a server. Referring to FIG1 , which is a flow chart of the method, the method may include:
步骤101、将原始输入问题和提示词输入给大语言模型,通过大语言模型基于该提示词对该原始输入问题进行处理得到目标问题参数和目标问题描述;其中,该提示词可以包括问题样例、问题参数样例和问题描述样例。Step 101: input an original input question and a prompt word into a large language model, and process the original input question based on the prompt word through the large language model to obtain target question parameters and a target question description; wherein the prompt word may include a question sample, a question parameter sample, and a question description sample.
步骤102、从特征数据库中查询该目标问题描述对应的目标访问参数;其中,该特征数据库可以包括访问描述与访问参数之间的对应关系。Step 102: query the target access parameter corresponding to the target problem description from the feature database; wherein the feature database may include a correspondence between the access description and the access parameter.
步骤103、基于该目标访问参数和该目标问题参数生成运维系统操作数据。Step 103: Generate operation data of the operation and maintenance system based on the target access parameter and the target problem parameter.
步骤104、通过该运维系统操作数据对运维系统进行操作,得到与该原始输入问题对应的目标运维数据,将该目标运维数据返回给用户。Step 104: operate the operation and maintenance system through the operation data of the operation and maintenance system to obtain target operation and maintenance data corresponding to the original input question, and return the target operation and maintenance data to the user.
示例性的,从特征数据库中查询该目标问题描述对应的目标访问参数,可以包括但不限于:若特征数据库中存在与该目标问题描述匹配的第一访问描述,则可以从特征数据库中获取该第一访问描述对应的第一数据结构,该第一数据结构可以包括该第一访问描述和访问参数。在此基础上,可以将该第一数据结构中的访问参数确定为目标问题描述对应的目标访问参数。Exemplarily, querying the target access parameters corresponding to the target problem description from the feature database may include, but is not limited to: if there is a first access description matching the target problem description in the feature database, then a first data structure corresponding to the first access description may be obtained from the feature database, and the first data structure may include the first access description and access parameters. On this basis, the access parameters in the first data structure may be determined as the target access parameters corresponding to the target problem description.
示例性的,从特征数据库中查询该目标问题描述对应的目标访问参数之前,还可以获取运维系统的私域数据,该私域数据可以包括访问描述信息和访问参数信息;从该访问描述信息中提取第一描述关键词,基于该第一描述关键词生成访问描述;从该访问参数信息中提取第一参数关键词,基于该第一参数关键词生成访问参数;在此基础上,可以在特征数据库中记录该访问描述与该访问参数之间的对应关系。Exemplarily, before querying the target access parameters corresponding to the target problem description from the feature database, private domain data of the operation and maintenance system can also be obtained, and the private domain data can include access description information and access parameter information; extract the first description keyword from the access description information, and generate an access description based on the first description keyword; extract the first parameter keyword from the access parameter information, and generate an access parameter based on the first parameter keyword; on this basis, the correspondence between the access description and the access parameter can be recorded in the feature database.
示例性的,从特征数据库中查询该目标问题描述对应的目标访问参数,可以包括但不限于:若特征数据库中存在与目标问题描述匹配的第二访问描述,则从特征数据库中获取第二访问描述对应的第二数据结构,第二数据结构可以包括第二访问描述和多个操作步骤描述;针对每个操作步骤描述,若特征数据库中存在与该操作步骤描述匹配的第三访问描述,则从特征数据库中获取第三访问描述对应的第三数据结构,第三数据结构可以包括第三访问描述和访问参数,将第三数据结构中的访问参数确定为该操作步骤描述对应的访问参数;将每个操作步骤描述对应的访问参数确定为所述目标访问参数。Exemplarily, querying the target access parameters corresponding to the target problem description from the feature database may include but is not limited to: if there is a second access description matching the target problem description in the feature database, obtaining a second data structure corresponding to the second access description from the feature database, the second data structure may include a second access description and multiple operation step descriptions; for each operation step description, if there is a third access description matching the operation step description in the feature database, obtaining a third data structure corresponding to the third access description from the feature database, the third data structure may include a third access description and access parameters, and determining the access parameters in the third data structure as the access parameters corresponding to the operation step description; and determining the access parameters corresponding to each operation step description as the target access parameters.
示例性的,从特征数据库中查询该目标问题描述对应的目标访问参数之前,还可以获取运维系统的私域数据,私域数据包括访问描述信息和多个操作步骤描述信息;其中,针对每个操作步骤描述信息,私域数据还包括该操作步骤描述信息对应的访问描述信息和访问参数信息。从访问描述信息中提取第二描述关键词,基于该第二描述关键词生成访问描述;针对每个操作步骤描述信息,从该操作步骤描述信息中提取操作步骤关键词,基于该操作步骤关键词生成操作步骤描述;在特征数据库中记录该访问描述与该操作步骤描述的对应关系。针对每个操作步骤描述信息,从该操作步骤描述信息对应的访问描述信息中提取第三描述关键词,基于该第三描述关键词生成访问描述;从该操作步骤描述信息对应的访问参数信息中提取第二参数关键词,基于该第二参数关键词生成访问参数;在特征数据库中记录该访问描述与该访问参数之间的对应关系。Exemplarily, before querying the target access parameters corresponding to the target problem description from the feature database, the private domain data of the operation and maintenance system can also be obtained, and the private domain data includes access description information and multiple operation step description information; wherein, for each operation step description information, the private domain data also includes access description information and access parameter information corresponding to the operation step description information. Extract the second description keyword from the access description information, and generate an access description based on the second description keyword; for each operation step description information, extract the operation step keyword from the operation step description information, and generate an operation step description based on the operation step keyword; record the correspondence between the access description and the operation step description in the feature database. For each operation step description information, extract the third description keyword from the access description information corresponding to the operation step description information, and generate an access description based on the third description keyword; extract the second parameter keyword from the access parameter information corresponding to the operation step description information, and generate an access parameter based on the second parameter keyword; record the correspondence between the access description and the access parameter in the feature database.
在一种可能的实施方式中,目标问题参数包括目标设备标识,目标问题描述包括目标URL描述,目标访问参数包括目标URL地址,且该目标URL地址可以包括已配置的固定设备标识。基于此,基于目标访问参数和目标问题参数生成运维系统操作数据,可以包括但不限于:将目标URL地址中的固定设备标识替换为目标设备标识得到修改后的URL地址,作为运维系统操作数据。通过运维系统操作数据对运维系统进行操作,得到与原始输入问题对应的目标运维数据,可以包括但不限于:访问运维系统的与修改后的URL地址对应的运维管理页面,该运维管理页面可以包括目标运维数据。In a possible implementation, the target question parameter includes a target device identifier, the target question description includes a target URL description, the target access parameter includes a target URL address, and the target URL address may include a configured fixed device identifier. Based on this, the operation and maintenance system operation data is generated based on the target access parameter and the target question parameter, which may include but is not limited to: replacing the fixed device identifier in the target URL address with the target device identifier to obtain a modified URL address as the operation and maintenance system operation data. The operation and maintenance system is operated through the operation and maintenance system operation data to obtain the target operation and maintenance data corresponding to the original input question, which may include but is not limited to: accessing the operation and maintenance management page of the operation and maintenance system corresponding to the modified URL address, and the operation and maintenance management page may include the target operation and maintenance data.
在一种可能的实施方式中,目标问题参数包括目标设备标识,目标问题描述包括目标API描述,目标访问参数包括目标API,且目标API可以包括已配置的固定设备标识。基于此,基于目标访问参数和目标问题参数生成运维系统操作数据,可以包括但不限于:将目标API中的固定设备标识替换为目标设备标识得到修改后的API,作为运维系统操作数据。In a possible implementation, the target problem parameter includes a target device identifier, the target problem description includes a target API description, the target access parameter includes a target API, and the target API may include a configured fixed device identifier. Based on this, the operation data of the operation and maintenance system is generated based on the target access parameter and the target problem parameter, which may include but is not limited to: replacing the fixed device identifier in the target API with the target device identifier to obtain a modified API as the operation data of the operation and maintenance system.
通过运维系统操作数据对运维系统进行操作,得到与原始输入问题对应的目标运维数据,可以包括但不限于:调用修改后的API对运维系统进行操作得到操作结果数据,并基于该操作结果数据确定目标运维数据。The operation and maintenance system is operated through the operation and maintenance system operation data to obtain target operation and maintenance data corresponding to the original input problem, which may include but is not limited to: calling the modified API to operate the operation and maintenance system to obtain operation result data, and determining the target operation and maintenance data based on the operation result data.
在一种可能的实施方式中,目标问题参数包括目标数据表标识,目标问题描述包括目标SQL描述,目标访问参数包括目标SQL定义。基于此,基于目标访问参数和目标问题参数生成运维系统操作数据,包括但不限于:基于目标SQL定义和目标数据表标识生成目标SQL语句,作为运维系统操作数据,目标SQL语句包括目标数据表标识。通过运维系统操作数据对运维系统进行操作,得到与原始输入问题对应的目标运维数据,包括但不限于:通过执行目标SQL语句,对运维系统的目标数据表标识对应的数据表进行操作,得到数据表的操作结果数据;基于数据表的操作结果数据确定目标运维数据。In a possible implementation, the target problem parameter includes a target data table identifier, the target problem description includes a target SQL description, and the target access parameter includes a target SQL definition. Based on this, the operation and maintenance system operation data is generated based on the target access parameter and the target problem parameter, including but not limited to: generating a target SQL statement based on the target SQL definition and the target data table identifier as the operation and maintenance system operation data, the target SQL statement includes the target data table identifier. The operation and maintenance system is operated through the operation and maintenance system operation data to obtain the target operation and maintenance data corresponding to the original input problem, including but not limited to: operating the data table corresponding to the target data table identifier of the operation and maintenance system by executing the target SQL statement to obtain the operation result data of the data table; determining the target operation and maintenance data based on the operation result data of the data table.
一个例子中,上述执行顺序只是为了方便描述给出的示例,在实际应用中,还可以改变步骤之间的执行顺序,对此执行顺序不做限制。而且,在其它实施例中,并不一定按照本说明书示出和描述的顺序来执行相应方法的步骤,其方法所包括的步骤可以比本说明书所描述的更多或更少。此外,本说明书中所描述的单个步骤,在其它实施例中可能被分解为多个步骤进行描述。本说明书中所描述的多个步骤,在其它实施例也可能被合并为单个步骤进行描述。In one example, the above execution order is only for the convenience of describing the examples given. In practical applications, the execution order between the steps can also be changed, and there is no limitation on this execution order. Moreover, in other embodiments, the steps of the corresponding method are not necessarily executed in the order shown and described in this specification, and the steps included in the method may be more or less than those described in this specification. In addition, the single step described in this specification may be decomposed into multiple steps for description in other embodiments. The multiple steps described in this specification may also be combined into a single step for description in other embodiments.
由以上技术方案可见,本申请实施例中,在得到原始输入问题之后,借助大语言模型获取原始输入问题对应的目标运维数据,将目标运维数据返回给用户,只需要消耗少量计算资源就能够获取目标运维数据,获取目标运维数据的时间更短,从而节省服务器的计算资源,并节省时间开销。能够快速确定出原始输入问题对应的目标运维数据,提供逻辑分析推理能力,利用大语言模型的分析能力给出准确可靠的目标运维数据。通过一个Copilot(助手)代理作为中间层,用户可以与Copilot代理进行自然语言交互,完成运维需求的交互,Copilot代理将自然语言转化为指令传递给运维系统,实现对话式智能运维系统,提升运维体验,实现用户与运维系统的无障碍人机交互体验,提升用户体验。基于大语言模型的能力,解决自然语言理解的障碍,让用户的输入从页面点击方式变为人机语言交互方式。基于大语言模型的能力,针对运维系统的功能,如概览、诊断、自愈闭环、报表等,实现基于语言交互的自动实现能力。It can be seen from the above technical solutions that in the embodiment of the present application, after obtaining the original input question, the target operation and maintenance data corresponding to the original input question is obtained with the help of a large language model, and the target operation and maintenance data is returned to the user. Only a small amount of computing resources is needed to obtain the target operation and maintenance data, and the time to obtain the target operation and maintenance data is shorter, thereby saving the computing resources of the server and saving time. It is possible to quickly determine the target operation and maintenance data corresponding to the original input question, provide logical analysis and reasoning capabilities, and use the analysis capabilities of the large language model to provide accurate and reliable target operation and maintenance data. Through a Copilot (assistant) agent as an intermediate layer, users can interact with the Copilot agent in natural language to complete the interaction of operation and maintenance requirements. The Copilot agent converts natural language into instructions and passes them to the operation and maintenance system to realize a conversational intelligent operation and maintenance system, improve the operation and maintenance experience, and realize barrier-free human-computer interaction experience between users and the operation and maintenance system, thereby improving user experience. Based on the capabilities of the large language model, the obstacles to natural language understanding are solved, and the user's input is changed from page clicks to human-computer language interaction. Based on the capabilities of the large language model, the automatic implementation capability based on language interaction is realized for the functions of the operation and maintenance system, such as overview, diagnosis, self-healing closed loop, and reporting.
以下结合具体应用场景,对本申请实施例的上述技术方案进行说明。The above technical solutions of the embodiments of the present application are described below in combination with specific application scenarios.
随着ICT(Information and Communications Technology,信息与通信技术)基础设施的演进、企业数字化转型的发展,用户的运维需求增多,运维功能的复杂度越来越高,运维的易用性面临挑战,运维的自动化和智能化成为主题。运维是指,对网络、服务器、服务的生命周期的各个阶段进行运营与维护。With the evolution of ICT (Information and Communications Technology) infrastructure and the development of enterprise digital transformation, users' O&M needs have increased, O&M functions have become increasingly complex, and the ease of use of O&M has been challenged. Automation and intelligence of O&M have become the theme. O&M refers to the operation and maintenance of networks, servers, and services at all stages of their life cycle.
为了从运维系统检索到用户所需数据,需要由用户提供原始输入问题,运维系统基于原始输入问题检索到用户所需数据,将数据返回给用户。由于运维系统的数据量很大,需要消耗大量计算资源才能够检索到用户所需数据,需要花费大量时间,从而导致大量资源消耗,且用户很长时间才能够获知所需数据。In order to retrieve the data required by the user from the operation and maintenance system, the user needs to provide the original input question. The operation and maintenance system retrieves the data required by the user based on the original input question and returns the data to the user. Since the amount of data in the operation and maintenance system is large, a large amount of computing resources is required to retrieve the data required by the user, which takes a lot of time, resulting in a large amount of resource consumption, and the user takes a long time to obtain the required data.
针对上述发现,可以借助大语言模型(Large Language Model,LLM)对数据进行分析。比如说,借助大语言模型获取原始输入问题对应的目标运维数据,将目标运维数据返回给用户,只需要消耗少量计算资源就能够获取目标运维数据,获取目标运维数据的时间更短,从而节省服务器的计算资源,并节省时间开销。能够快速确定出原始输入问题对应的目标运维数据,提供逻辑分析推理能力,利用大语言模型的分析能力给出准确可靠的目标运维数据。In view of the above findings, the data can be analyzed with the help of a large language model (LLM). For example, with the help of a large language model, the target operation and maintenance data corresponding to the original input question can be obtained, and the target operation and maintenance data can be returned to the user. Only a small amount of computing resources are needed to obtain the target operation and maintenance data, and the time to obtain the target operation and maintenance data is shorter, thereby saving server computing resources and time costs. It can quickly determine the target operation and maintenance data corresponding to the original input question, provide logical analysis and reasoning capabilities, and use the analysis capabilities of the large language model to provide accurate and reliable target operation and maintenance data.
比如说,ChatGPT是一种基于自然语言处理和机器学习的大语言模型,可以模拟人类的语言交流,实现与用户的智能对话,广泛应用于智能客服、聊天机器人、语音助手等领域,为人们提供更方便、快捷和智能化的服务。基于此,可以借助ChatGPT对数据进行分析,得到原始输入问题对应的目标运维数据。For example, ChatGPT is a large language model based on natural language processing and machine learning. It can simulate human language communication and realize intelligent dialogue with users. It is widely used in intelligent customer service, chatbots, voice assistants and other fields to provide people with more convenient, fast and intelligent services. Based on this, ChatGPT can be used to analyze the data and obtain the target operation and maintenance data corresponding to the original input question.
在上述应用场景下,本申请实施例中提出一种基于大语言模型的数据处理方法,该方法可以应用于电子设备,如运维系统的服务器。通过在服务器部署Copilot代理(助手代理),将Copilot代理作为中间层,用户(如运维人员)可以与Copilot代理进行自然语言交互,传达业务需求。Copilot代理将语言转化为指令传递给运维系统,实现用户与运维系统的无障碍人机交互体验。In the above application scenario, a data processing method based on a large language model is proposed in an embodiment of the present application, which can be applied to electronic devices, such as servers of an operation and maintenance system. By deploying a Copilot agent (assistant agent) on the server and using the Copilot agent as an intermediate layer, users (such as operation and maintenance personnel) can interact with the Copilot agent in natural language to convey business needs. The Copilot agent converts language into instructions and passes them to the operation and maintenance system, realizing a barrier-free human-computer interaction experience between users and the operation and maintenance system.
参见图2所示,为Copilot代理的结构示意图,用户向Copilot代理提供原始输入问题,原始输入问题可以是语音格式(Voice)的原始输入问题,可以是文本格式(Text)的原始输入问题,可以是其它格式的原始输入问题,如视频格式等,对此不做限制。显然,用户可以通过语音或文字与Copilot代理进行交互。As shown in FIG. 2 , which is a schematic diagram of the structure of the Copilot agent, the user provides the Copilot agent with an original input question, which can be an original input question in voice format (Voice), an original input question in text format (Text), or an original input question in other formats, such as video format, etc., without limitation. Obviously, the user can interact with the Copilot agent through voice or text.
Copilot代理支持大语言模型(LLM)、RAG(Retrieval Augmented Generation,检索增强生成)和Agent(代理)等功能。基于大语言模型的功能,Copilot代理可以对自然语言(如原始输入问题)进行识别和理解,通过微调训练,大语言模型可以具备运维系统的运维知识,从而更好的对原始输入问题进行识别和理解。基于大语言模型的能力,解决自然语言理解的障碍,让用户的输入从页面点击方式变为人机语言交互,针对运维系统的功能,如概览功能、诊断功能、自愈闭环功能、报表功能等,实现基于语言交互的自动实现能力。Copilot agent supports functions such as large language model (LLM), RAG (Retrieval Augmented Generation) and Agent. Based on the functions of the large language model, Copilot agent can recognize and understand natural language (such as original input questions). Through fine-tuning training, the large language model can have the operation and maintenance knowledge of the operation and maintenance system, so as to better recognize and understand the original input questions. Based on the capabilities of the large language model, the obstacles to natural language understanding are solved, and the user's input is changed from page clicking to human-computer language interaction. For the functions of the operation and maintenance system, such as overview function, diagnosis function, self-healing closed loop function, reporting function, etc., the automatic realization capability based on language interaction is realized.
基于RAG的功能,Copilot代理可以结合RAG和大语言模型,来解决大语言模型的幻觉、知识局限等问题,从而提高大语言模型的准确率。Based on the functions of RAG, the Copilot agent can combine RAG and large language models to solve problems such as hallucinations and knowledge limitations of large language models, thereby improving the accuracy of large language models.
基于Agent的功能,Copilot代理可以实现与运维系统的调用对接,从而完成与原始输入问题相关的任务,并向用户返回原始输入问题对应的目标运维数据。参见图2所示,Copilot代理可以与不同的运维系统实现对接。Based on the functions of the Agent, the Copilot agent can realize the call docking with the operation and maintenance system, thereby completing the tasks related to the original input problem and returning the target operation and maintenance data corresponding to the original input problem to the user. As shown in Figure 2, the Copilot agent can be docked with different operation and maintenance systems.
本申请实施例中提出一种基于大语言模型的数据处理方法,在运维系统中,可以实现基于大语言模型(私域大模型)的对话式人机交互。参见图3所示,为该数据处理方法的流程示意图,该数据处理方法可以包括:In the embodiment of the present application, a data processing method based on a large language model is proposed. In the operation and maintenance system, a conversational human-computer interaction based on a large language model (private domain large model) can be realized. Referring to FIG3 , a flow chart of the data processing method is shown. The data processing method may include:
步骤301、Copilot代理获取运维系统的私域数据。比如说,该私域数据可以包括访问描述信息和访问参数信息,和/或,该私域数据可以包括访问描述信息和多个操作步骤描述信息;其中,针对每个操作步骤描述信息,该私域数据还可以包括该操作步骤描述信息对应的访问描述信息和访问参数信息。Step 301: The Copilot agent obtains private domain data of the operation and maintenance system. For example, the private domain data may include access description information and access parameter information, and/or the private domain data may include access description information and multiple operation step description information; wherein, for each operation step description information, the private domain data may also include access description information and access parameter information corresponding to the operation step description information.
示例性的,在数据准备阶段,Copilot代理可以获取与运维系统相关的私域数据,私域数据可以包括访问描述信息和访问参数信息、访问描述信息和操作步骤描述信息、操作步骤描述信息对应的访问描述信息和操作步骤描述信息对应的访问参数信息。此外,私域数据还可以包括运维知识等,对此不做限制。Exemplarily, during the data preparation phase, the Copilot agent can obtain private domain data related to the operation and maintenance system, and the private domain data can include access description information and access parameter information, access description information and operation step description information, access description information corresponding to the operation step description information, and access parameter information corresponding to the operation step description information. In addition, private domain data can also include operation and maintenance knowledge, etc., which is not limited.
示例性的,访问参数信息可以是与访问参数有关的信息,而访问参数是用于对运维系统进行访问的参数,如URL地址、API、SQL等,对此访问参数不做限制。访问描述信息可以是与访问描述有关的信息,访问描述是用于对问题进行描述的信息,如对问题的关键词进行描述的信息,对此访问描述不做限制。Exemplarily, the access parameter information may be information related to access parameters, and the access parameters are parameters used to access the operation and maintenance system, such as URL addresses, APIs, SQL, etc., and there is no restriction on the access parameters. The access description information may be information related to access descriptions, and the access descriptions are information used to describe the problem, such as information describing the keywords of the problem, and there is no restriction on the access descriptions.
示例性的,当一个任务对应多个操作步骤时,私域数据可以包括访问描述信息和多个操作步骤描述信息,该访问描述信息可以是与任务有关的信息,是针对该任务的描述信息。针对每个操作步骤描述信息,该操作步骤描述信息可以是针对操作步骤的描述信息,可以是针对该操作步骤的问题描述。Exemplarily, when a task corresponds to multiple operation steps, the private domain data may include access description information and multiple operation step description information, where the access description information may be information related to the task and is description information for the task. For each operation step description information, the operation step description information may be description information for the operation step and may be a problem description for the operation step.
针对每个操作步骤描述信息,私域数据还可以包括该操作步骤描述信息对应的访问描述信息和访问参数信息,访问参数信息可以是与访问参数有关的信息,而访问参数是用于对运维系统进行访问的参数,如URL地址、API、SQL等。该访问描述信息可以是与访问描述有关的信息,而访问描述是用于对问题(即操作步骤描述信息对应的问题)进行描述的信息,对此访问描述不做限制。For each operation step description information, the private domain data may also include access description information and access parameter information corresponding to the operation step description information. The access parameter information may be information related to access parameters, and access parameters are parameters used to access the operation and maintenance system, such as URL address, API, SQL, etc. The access description information may be information related to the access description, and the access description is information used to describe the problem (i.e., the problem corresponding to the operation step description information), and there is no restriction on this access description.
步骤302、Copilot代理采用私域数据对大语言模型进行增量训练,即对大语言模型进行微调,得到训练后的大语言模型,从而使大语言模型学习私域数据的知识。关于大语言模型的增量训练过程,本实施例中不做限制。Step 302: The Copilot agent uses the private domain data to perform incremental training on the large language model, that is, fine-tunes the large language model to obtain a trained large language model, so that the large language model learns the knowledge of the private domain data. The incremental training process of the large language model is not limited in this embodiment.
步骤303、Copilot代理采用私域数据维护RAG功能的特征数据库。Step 303: The Copilot agent uses private domain data to maintain the feature database of the RAG function.
示例性的,为了解决大语言模型的幻觉和不准确问题,可以对私域数据进行提取、整理、向量化(embedding),并注入到RAG功能的特征数据库,从而借助RAG功能的特征数据库来提升大语言模型的准确性。For example, in order to solve the hallucination and inaccuracy problems of the large language model, the private domain data can be extracted, sorted, vectorized (embedded), and injected into the feature database of the RAG function, thereby improving the accuracy of the large language model with the help of the feature database of the RAG function.
示例性的,若私域数据包括访问描述信息和访问参数信息,则可以对访问描述信息进行处理,得到访问描述,该访问描述可以是用于对问题进行描述的信息。可以对访问参数信息进行处理,得到访问参数,该访问参数可以是用于对运维系统进行访问的参数,如URL地址、API、SQL等。For example, if the private domain data includes access description information and access parameter information, the access description information can be processed to obtain an access description, which can be information used to describe the problem. The access parameter information can be processed to obtain access parameters, which can be parameters used to access the operation and maintenance system, such as URL addresses, APIs, SQL, etc.
对访问描述信息(访问参数信息的处理过程类似)进行处理,得到访问描述是指:从该访问描述信息中提取第一描述关键词,基于该第一描述关键词生成访问描述。比如说,可以过滤访问描述信息中的无效信息,对访问描述信息进行关键词提取、整理、分词等处理,得到访问描述信息中的第一描述关键词(如AP信道利用率高等),将第一描述关键词作为访问描述,对上述处理过程不做限制。或者,在得到第一描述关键词之后,还可以对第一描述关键词进行向量化处理,将向量化处理后的关键词作为访问描述。Processing the access description information (the processing process of the access parameter information is similar) to obtain the access description means: extracting the first description keyword from the access description information, and generating the access description based on the first description keyword. For example, invalid information in the access description information can be filtered, and the access description information can be processed by keyword extraction, sorting, and word segmentation to obtain the first description keyword in the access description information (such as high AP channel utilization), and the first description keyword is used as the access description, and the above processing process is not limited. Alternatively, after obtaining the first description keyword, the first description keyword can also be vectorized and the vectorized keyword can be used as the access description.
对访问描述信息的第一描述关键词进行向量化处理是指:将高维度的数据(如文字、图片、音频等)映射到低维度空间的过程,embedding向量是一个由实数构成的向量,它将输入的数据表示成一个连续的数值空间中的点。Vectorizing the first description keyword of the access description information refers to the process of mapping high-dimensional data (such as text, pictures, audio, etc.) into a low-dimensional space. The embedding vector is a vector composed of real numbers, which represents the input data as a point in a continuous numerical space.
在得到访问描述信息对应的访问描述和访问参数信息对应的访问参数之后,可以在特征数据库中记录访问描述与访问参数之间的对应关系。After obtaining the access description corresponding to the access description information and the access parameters corresponding to the access parameter information, the corresponding relationship between the access description and the access parameters may be recorded in the feature database.
示例性的,若私域数据包括访问描述信息和多个操作步骤描述信息,则可以对访问描述信息进行处理,得到访问描述(如从访问描述信息中提取第二描述关键词,基于该第二描述关键词生成访问描述),该访问描述可以是针对任务的描述信息。针对每个操作步骤描述信息,可以对该操作步骤描述信息进行处理,得到操作步骤描述(如从该操作步骤描述信息中提取操作步骤关键词,基于该操作步骤关键词生成操作步骤描述),该操作步骤描述是针对操作步骤的问题描述信息。Exemplarily, if the private domain data includes access description information and multiple operation step description information, the access description information can be processed to obtain an access description (such as extracting a second description keyword from the access description information and generating an access description based on the second description keyword), and the access description can be description information for the task. For each operation step description information, the operation step description information can be processed to obtain an operation step description (such as extracting an operation step keyword from the operation step description information and generating an operation step description based on the operation step keyword), and the operation step description is problem description information for the operation step.
在得到访问描述信息对应的访问描述和操作步骤描述信息对应的操作步骤描述之后,在特征数据库中记录访问描述与操作步骤描述之间的对应关系。After obtaining the access description corresponding to the access description information and the operation step description corresponding to the operation step description information, the corresponding relationship between the access description and the operation step description is recorded in the feature database.
针对每个操作步骤描述信息,私域数据可以包括该操作步骤描述信息对应的访问描述信息和访问参数信息,基于此,针对每个操作步骤描述信息,还可以对该操作步骤描述信息对应的访问描述信息进行处理,得到访问描述(如从该操作步骤描述信息对应的访问描述信息中提取第三描述关键词,基于该第三描述关键词生成访问描述),可以对该操作步骤描述信息对应的访问参数信息进行处理,得到访问参数(如从该操作步骤描述信息对应的访问参数信息中提取第二参数关键词,基于该第二参数关键词生成访问参数)。在得到访问描述信息对应的访问描述和访问参数信息对应的访问参数后,在特征数据库中记录访问描述与访问参数之间的对应关系。For each operation step description information, the private domain data may include the access description information and access parameter information corresponding to the operation step description information. Based on this, for each operation step description information, the access description information corresponding to the operation step description information may also be processed to obtain an access description (such as extracting a third description keyword from the access description information corresponding to the operation step description information, and generating an access description based on the third description keyword), and the access parameter information corresponding to the operation step description information may be processed to obtain an access parameter (such as extracting a second parameter keyword from the access parameter information corresponding to the operation step description information, and generating an access parameter based on the second parameter keyword). After obtaining the access description corresponding to the access description information and the access parameter corresponding to the access parameter information, the corresponding relationship between the access description and the access parameter is recorded in the feature database.
比如说,参见表1和表2所示,为该特征数据库的一个示例。For example, see Table 1 and Table 2, which are examples of the feature database.
表1Table 1
表2Table 2
步骤304、Copilot代理获取原始输入问题,该原始输入问题可以是用户输入的文本格式的原始输入问题、或语音格式的原始输入问题、或其它格式的原始输入问题,对此不做限制,后续以文本格式的原始输入问题为例。Step 304: The Copilot agent obtains the original input question, which may be an original input question in text format input by the user, or an original input question in voice format, or an original input question in another format. There is no limitation to this, and the original input question in text format is used as an example below.
步骤305、Copilot代理将原始输入问题和提示词输入给大语言模型,通过大语言模型基于该提示词对该原始输入问题进行处理得到目标问题参数和目标问题描述,该提示词可以包括问题样例、问题参数样例和问题描述样例。Step 305: The Copilot agent inputs the original input question and the prompt word into the large language model. The large language model processes the original input question based on the prompt word to obtain target question parameters and target question description. The prompt word may include question examples, question parameter examples, and question description examples.
示例性的,Copilot代理在得到原始输入问题之后,可以将原始输入问题和提示词一起输入给大语言模型,大语言模型可以根据提示词的语义理解,对原始输入问题进行结构化拆分,得到目标问题参数和目标问题描述。For example, after obtaining the original input question, the Copilot agent can input the original input question and the prompt word together into the large language model. The large language model can perform structural decomposition on the original input question based on the semantic understanding of the prompt word to obtain the target question parameters and the target question description.
比如说,提示词用于辅助大语言模型对原始输入问题进行结构化拆分,即大语言模型可以基于提示词理解原始输入问题的结构化拆分方式,继而对原始输入问题进行结构化拆分,得到目标问题参数和目标问题描述。For example, prompt words are used to assist the large language model in performing structural splitting of the original input question. That is, the large language model can understand the structural splitting method of the original input question based on the prompt words, and then perform structural splitting on the original input question to obtain the target question parameters and target question description.
比如说,提示词可以包括问题样例、问题参数样例和问题描述样例,原始输入问题与问题样例对应。基于问题样例中的问题参数样例的表现形式,大语言模型可以从原始输入问题中拆分出目标问题参数,目标问题参数与问题参数样例对应。基于问题样例中的问题描述样例的表现形式,大语言模型可以从原始输入问题中拆分出目标问题描述,目标问题描述与问题描述样例对应。For example, the prompt words may include question examples, question parameter examples, and question description examples, and the original input question corresponds to the question example. Based on the representation of the question parameter examples in the question example, the large language model can separate the target question parameters from the original input question, and the target question parameters correspond to the question parameter examples. Based on the representation of the question description examples in the question example, the large language model can separate the target question description from the original input question, and the target question description corresponds to the question description example.
步骤306、Copilot代理从特征数据库中查询目标问题描述对应的目标访问参数。比如说,Copilot代理通过大语言模型从特征数据库中查询目标访问参数。Step 306: The Copilot agent queries the target access parameters corresponding to the target question description from the feature database. For example, the Copilot agent queries the target access parameters from the feature database using a large language model.
示例性的,若特征数据库中存在与目标问题描述匹配的第一访问描述,则可以从特征数据库中获取该第一访问描述对应的第一数据结构,该第一数据结构可以包括该第一访问描述和访问参数。将该第一数据结构中的访问参数确定为目标问题描述对应的目标访问参数,且大语言模型可以得到目标访问参数。Exemplarily, if there is a first access description matching the target problem description in the feature database, a first data structure corresponding to the first access description can be obtained from the feature database, and the first data structure can include the first access description and access parameters. The access parameters in the first data structure are determined as target access parameters corresponding to the target problem description, and the large language model can obtain the target access parameters.
比如说,若特征数据库以向量形式存储访问描述,在对目标问题描述进行向量化之后,比较特征数据库中的每个访问描述与目标问题描述的相似度。若特征数据库以关键词形式存储访问描述,则直接比较特征数据库中的每个访问描述与目标问题描述(未向量化)的相似度。若相似度大于阈值,则该访问描述是与目标问题描述匹配的第一访问描述,若相似度不大于阈值,则该访问描述不是与目标问题描述匹配的第一访问描述。基于此,可以找到第一访问描述。For example, if the feature database stores access descriptions in vector form, after vectorizing the target problem description, the similarity between each access description in the feature database and the target problem description is compared. If the feature database stores access descriptions in keyword form, the similarity between each access description in the feature database and the target problem description (not vectorized) is directly compared. If the similarity is greater than a threshold, the access description is the first access description that matches the target problem description. If the similarity is not greater than the threshold, the access description is not the first access description that matches the target problem description. Based on this, the first access description can be found.
在得到第一访问描述后,可以将该第一访问描述对应的访问参数作为目标访问参数,将目标访问参数返回给大语言模型。大语言模型在得到目标访问参数之后,可以对目标访问参数进行语义检查,若发现目标访问参数是单任务结果,如URL地址或者API等,则确认得到目标访问参数,执行后续步骤。After obtaining the first access description, the access parameter corresponding to the first access description can be used as the target access parameter, and the target access parameter can be returned to the large language model. After obtaining the target access parameter, the large language model can perform a semantic check on the target access parameter. If it is found that the target access parameter is a single task result, such as a URL address or API, etc., it is confirmed that the target access parameter is obtained and the subsequent steps are executed.
示例性的,若特征数据库中存在与目标问题描述匹配的第二访问描述,则从特征数据库中获取第二访问描述对应的第二数据结构,第二数据结构包括第二访问描述和多个操作步骤描述。在此基础上,针对每个操作步骤描述:若特征数据库中存在与该操作步骤描述匹配的第三访问描述,则从特征数据库中获取第三访问描述对应的第三数据结构,第三数据结构可以包括第三访问描述和访问参数,将第三数据结构中的访问参数确定为该操作步骤描述对应的访问参数;将每个操作步骤描述对应的访问参数确定为该目标问题描述对应的目标访问参数,且大语言模型可以得到目标访问参数(多个访问参数)。Exemplarily, if there is a second access description matching the target problem description in the feature database, a second data structure corresponding to the second access description is obtained from the feature database, and the second data structure includes the second access description and multiple operation step descriptions. On this basis, for each operation step description: if there is a third access description matching the operation step description in the feature database, a third data structure corresponding to the third access description is obtained from the feature database, and the third data structure may include the third access description and access parameters, and the access parameters in the third data structure are determined as the access parameters corresponding to the operation step description; the access parameters corresponding to each operation step description are determined as the target access parameters corresponding to the target problem description, and the large language model can obtain the target access parameters (multiple access parameters).
比如说,比较特征数据库中的每个访问描述与目标问题描述的相似度。若相似度大于阈值,则该访问描述是与目标问题描述匹配的第二访问描述,若相似度不大于阈值,则该访问描述不是与目标问题描述匹配的第二访问描述。在得到第二访问描述之后,参见表1所示,若第二访问描述对应的是多个操作步骤描述,而不是访问参数,则可以将多个操作步骤描述返回给大语言模型。For example, the similarity between each access description in the feature database and the target problem description is compared. If the similarity is greater than a threshold, the access description is a second access description that matches the target problem description. If the similarity is not greater than a threshold, the access description is not a second access description that matches the target problem description. After obtaining the second access description, as shown in Table 1, if the second access description corresponds to multiple operation step descriptions rather than access parameters, the multiple operation step descriptions can be returned to the large language model.
针对每个操作步骤描述,大语言模型将该操作步骤描述作为输入问题(类似于原始输入问题),对该操作步骤描述进行处理得到问题描述,比如说,通过提示词让大语言模型进行思维链多步骤拆解,将每个操作步骤描述的问题结构化后,得到该操作步骤描述对应的问题描述。比如说,可以将该操作步骤描述作为问题描述,或者,可以从该操作步骤描述中提取关键词作为问题描述,对此不做限制,后续以该操作步骤描述作为问题描述为例。For each operation step description, the large language model uses the operation step description as an input question (similar to the original input question) and processes the operation step description to obtain a problem description. For example, the large language model uses prompt words to decompose the thought chain into multiple steps, and after structuring the problem of each operation step description, the problem description corresponding to the operation step description is obtained. For example, the operation step description can be used as a problem description, or keywords can be extracted from the operation step description as a problem description. There is no restriction on this, and the operation step description is used as a problem description as an example in the following.
比较特征数据库中的每个访问描述与该操作步骤描述的相似度。若相似度大于阈值,则该访问描述是与该操作步骤描述匹配的第三访问描述,若相似度不大于阈值,则该访问描述不是与该操作步骤描述匹配的第三访问描述。Compare the similarity between each access description in the feature database and the operation step description. If the similarity is greater than a threshold, the access description is a third access description that matches the operation step description; if the similarity is not greater than the threshold, the access description is not a third access description that matches the operation step description.
在得到第三访问描述之后,参见表2所示,若第三访问描述对应有访问参数,则可以将该第三访问描述对应的访问参数作为目标访问参数,将目标访问参数返回给大语言模型,这样,大语言模型可以得到目标访问参数。After obtaining the third access description, as shown in Table 2, if the third access description corresponds to an access parameter, the access parameter corresponding to the third access description can be used as the target access parameter, and the target access parameter is returned to the large language model, so that the large language model can obtain the target access parameter.
大语言模型在得到多个操作步骤描述对应的多个目标访问参数之后,可以对这些目标访问参数进行语义检查,若发现目标访问参数是多个任务结果,如URL地址或者API等,则确认得到目标访问参数,执行后续步骤。After obtaining multiple target access parameters corresponding to multiple operation step descriptions, the large language model can perform semantic checks on these target access parameters. If it is found that the target access parameters are multiple task results, such as URL addresses or APIs, the target access parameters are confirmed to be obtained and subsequent steps are executed.
步骤307、Copilot代理基于目标访问参数和目标问题参数生成运维系统操作数据。比如说,可以将目标问题参数添加到目标访问参数中,得到修改后的访问参数,而修改后的访问参数可以作为运维系统操作数据。Step 307: The Copilot agent generates operation data of the operation system based on the target access parameter and the target problem parameter. For example, the target problem parameter can be added to the target access parameter to obtain the modified access parameter, and the modified access parameter can be used as the operation data of the operation system.
比如说,目标访问参数可以包括固定设备标识,目标问题参数可以是目标设备标识,可以将目标访问参数中的固定设备标识替换为目标设备标识,得到修改后的访问参数,运维系统操作数据为修改后的访问参数。For example, the target access parameter may include a fixed device identifier, the target problem parameter may be a target device identifier, the fixed device identifier in the target access parameter may be replaced with the target device identifier to obtain a modified access parameter, and the operation data of the operation and maintenance system is the modified access parameter.
步骤308、Copilot代理通过该运维系统操作数据对运维系统进行操作,得到与该原始输入问题对应的目标运维数据,将该目标运维数据返回给用户。Step 308: The Copilot agent operates the operation and maintenance system through the operation data of the operation and maintenance system to obtain target operation and maintenance data corresponding to the original input question, and returns the target operation and maintenance data to the user.
示例性的,Copilot代理可以支持Agent的功能,基于Agent的功能,Copilot代理可以实现与运维系统的调用对接,因此,可以将运维系统操作数据提供给Agent,Agent调用该运维系统操作数据对运维系统进行操作,得到与该原始输入问题对应的目标运维数据,并将该目标运维数据返回给用户。Exemplarily, the Copilot agent can support the functions of the Agent. Based on the functions of the Agent, the Copilot agent can realize the call docking with the operation and maintenance system. Therefore, the operation data of the operation and maintenance system can be provided to the Agent. The Agent calls the operation data of the operation and maintenance system to operate the operation and maintenance system, obtains the target operation and maintenance data corresponding to the original input problem, and returns the target operation and maintenance data to the user.
一个例子中,参见图4所示,为基于大语言模型的数据处理方法的示意图。红色箭头的1、2、3对应步骤301-步骤303,1表示获取运维系统的私域数据(私域知识语料),2、3表示维护特征数据库(向量数据库)。Embedding model(向量化模型)表示对私域数据进行向量化处理,Semantic Vector(语义向量)表示对私域数据进行语义向量处理,经过上述处理,可以维护特征数据库。In one example, see Figure 4, which is a schematic diagram of a data processing method based on a large language model. The red arrows 1, 2, and 3 correspond to steps 301-303, 1 represents obtaining the private domain data (private domain knowledge corpus) of the operation and maintenance system, and 2 and 3 represent maintaining the feature database (vector database). Embedding model (vectorization model) represents vectorization processing of private domain data, and Semantic Vector (semantic vector) represents semantic vector processing of private domain data. After the above processing, the feature database can be maintained.
蓝色箭头的1-11对应步骤304-步骤308,1表示获取原始输入问题(用户提问),2表示将原始输入问题和提示词输入给大语言模型(LLM大模型)。The blue arrows 1-11 correspond to steps 304-308, 1 represents obtaining the original input question (user question), and 2 represents inputting the original input question and prompt words into the large language model (LLM large model).
3-8表示查询目标访问参数,3表示将目标问题描述输入给Embedding model进行向量化处理,4、5表示比较特征数据库中的每个访问描述与目标问题描述的相似度,确定第一访问描述。若第一访问描述对应访问参数,则可以将第一访问描述对应的访问参数作为目标访问参数。3-8 represent querying the target access parameters, 3 represents inputting the target problem description into the Embedding model for vectorization, 4 and 5 represent comparing the similarity between each access description in the feature database and the target problem description to determine the first access description. If the first access description corresponds to the access parameter, the access parameter corresponding to the first access description can be used as the target access parameter.
6表示第二访问描述对应多个操作步骤描述(知识),7表示将多个操作步骤描述和提示词输入给大语言模型,8表示大语言模型确定每个操作步骤描述对应的目标访问参数(即对复杂任务分解,得到多个目标访问参数)。6 indicates that the second access description corresponds to multiple operation step descriptions (knowledge), 7 indicates that multiple operation step descriptions and prompt words are input into the large language model, and 8 indicates that the large language model determines the target access parameters corresponding to each operation step description (i.e., decomposing the complex task to obtain multiple target access parameters).
9表示基于目标访问参数和目标问题参数生成运维系统操作数据,将运维系统操作数据输入给Agents。10表示Agents通过该运维系统操作数据对运维系统进行操作,得到目标运维数据,11表示将目标运维数据返回给用户。9 indicates that the operation data of the operation system is generated based on the target access parameter and the target problem parameter, and the operation data of the operation system is input to the Agents. 10 indicates that the Agents operate the operation system through the operation data of the operation system to obtain the target operation data, and 11 indicates that the target operation data is returned to the user.
一个例子中,对话式运维系统可以提供四种能力,参见图5所示,为Copilot代理支持四种能力的示意图。1、运维系统支持页面(webpage)访问功能,Copilot代理支持Text2URL功能,用户界面支持navigator(导航)功能。基于Text2URL功能,Copilot代理可以将原始输入问题转换为URL地址,通过URL地址访问运维系统的webpage,并在用户界面显示webpage的访问结果。In an example, the conversational operation and maintenance system can provide four capabilities, as shown in Figure 5, which is a schematic diagram of the four capabilities supported by the Copilot agent. 1. The operation and maintenance system supports the webpage access function, the Copilot agent supports the Text2URL function, and the user interface supports the navigator function. Based on the Text2URL function, the Copilot agent can convert the original input question into a URL address, access the webpage of the operation and maintenance system through the URL address, and display the access result of the webpage in the user interface.
在该功能下,Copilot代理可以将用户的自然语言需求转化到任何一个运维管理页面,用户可以通过语言交流直接访问运维管理页面,无需用户自身寻找和点击链接,该场景下,用户可以查看直接的概览和详细数据。With this feature, the Copilot agent can convert the user's natural language needs into any operation and maintenance management page. The user can directly access the operation and maintenance management page through language communication without the user having to search and click on the link themselves. In this scenario, the user can view a direct overview and detailed data.
2、运维系统支持RestAPI调用功能,Copilot代理支持Text2API功能,用户界面支持troubleshooting(诊断)功能。基于Text2API功能,Copilot代理可以将原始输入问题转换为API,调用API对运维系统进行操作,得到操作结果,并将操作结果返回给用户,用户基于操作结果获知运维系统的诊断结果。2. The operation and maintenance system supports the RestAPI call function, the Copilot agent supports the Text2API function, and the user interface supports the troubleshooting (diagnosis) function. Based on the Text2API function, the Copilot agent can convert the original input problem into an API, call the API to operate the operation and maintenance system, obtain the operation result, and return the operation result to the user. The user can obtain the diagnosis result of the operation and maintenance system based on the operation result.
在该功能下,Copilot代理可以将用户的自然语言需求转化到API调用,对于非原有页面的需求或动作下发需求,通过理解语义后完成REST API调用实现相关功能的展示和下发,该场景下,能够对用户要诊断的问题进行闭环解决。With this function, the Copilot agent can convert the user's natural language requirements into API calls. For requirements for non-original pages or action delivery requirements, it can complete the REST API call after understanding the semantics to display and deliver related functions. In this scenario, it can provide a closed-loop solution to the problem that the user wants to diagnose.
3、运维系统支持SQLDB(数据库查询)功能,Copilot代理支持Text2SQL功能,用户界面支持Report(报表)功能。基于Text2SQL功能,Copilot代理可以将原始输入问题转换为SQL语句,通过SQL语句对运维系统的数据表进行查询,得到数据表查询结果,并将数据表查询结果返回给用户。3. The operation and maintenance system supports the SQLDB (database query) function, the Copilot agent supports the Text2SQL function, and the user interface supports the Report function. Based on the Text2SQL function, the Copilot agent can convert the original input question into an SQL statement, query the data table of the operation and maintenance system through the SQL statement, obtain the data table query result, and return the data table query result to the user.
在该功能下,Copilot代理可以将用户的自然语言需求转化到SQL,支持用户的一些图表汇总和查询,该场景下,可以为用户提供个性化的统计报表。With this function, the Copilot agent can convert the user's natural language requirements into SQL, support some of the user's chart summaries and queries, and provide users with personalized statistical reports in this scenario.
4、运维系统支持Knowledge(知识问答)功能,Copilot代理支持QA(问答)功能,用户界面支持Help(帮助)功能。基于QA功能,Copilot代理可以将原始输入问题转换为问答句子,通过问答句子从运维系统查询问答结果。4. The operation and maintenance system supports the Knowledge function, the Copilot agent supports the QA function, and the user interface supports the Help function. Based on the QA function, the Copilot agent can convert the original input question into a question and answer sentence, and query the question and answer results from the operation and maintenance system through the question and answer sentence.
在该功能下,Copilot代理可以将用户的自然语言问题转化到答案,可以支持用户咨询专业性的技术问题和软件本身的使用帮助问题。With this feature, Copilot agents can convert users' natural language questions into answers, and can support users in asking professional technical questions and asking questions about the software itself.
针对Copilot代理支持Text2URL功能的情况,本申请实施例中提出一种基于大语言模型的数据处理方法,该数据处理方法可以包括:In view of the fact that the Copilot agent supports the Text2URL function, an embodiment of the present application proposes a data processing method based on a large language model, which may include:
步骤S11、Copilot代理获取运维系统的私域数据,该私域数据可以包括URL描述信息和URL地址信息。其中,URL地址信息可以是与URL地址有关的信息,URL地址是用于对运维系统进行访问的参数。URL描述信息可以是与URL描述有关的信息,而URL描述是用于对问题进行描述的信息。Step S11, the Copilot agent obtains private domain data of the operation and maintenance system, and the private domain data may include URL description information and URL address information. The URL address information may be information related to the URL address, and the URL address is a parameter for accessing the operation and maintenance system. The URL description information may be information related to the URL description, and the URL description is information for describing the problem.
步骤S12、Copilot代理采用私域数据对大语言模型进行增量训练。Step S12: The Copilot agent uses private domain data to perform incremental training on the large language model.
步骤S13、Copilot代理采用私域数据维护RAG功能的特征数据库。Step S13: The Copilot agent uses private domain data to maintain the feature database of the RAG function.
若私域数据包括URL描述信息和URL地址信息,则可以对URL描述信息进行处理,得到URL描述,该URL描述可以是用于对问题进行描述的信息,如可以包括URL地址对应页面的描述信息、用于对该页面进行访问的问题的描述信息等,对此不做限制。可以对URL地址信息进行处理,得到URL地址,该URL地址可以是用于对运维系统进行访问的参数。在得到URL描述和URL地址之后,可以在特征数据库中记录URL描述与URL地址之间的对应关系。If the private domain data includes URL description information and URL address information, the URL description information can be processed to obtain a URL description, which can be information used to describe the problem, such as description information of the page corresponding to the URL address, description information of the problem for accessing the page, etc., without limitation. The URL address information can be processed to obtain a URL address, which can be a parameter for accessing the operation and maintenance system. After obtaining the URL description and URL address, the corresponding relationship between the URL description and the URL address can be recorded in the feature database.
步骤S14、Copilot代理获取原始输入问题。Step S14: The Copilot agent obtains the original input question.
步骤S15、Copilot代理将原始输入问题和提示词输入给大语言模型,通过大语言模型基于该提示词对该原始输入问题进行处理得到目标设备标识和目标URL描述,该提示词可以包括问题样例、问题参数样例和问题描述样例。Step S15, the Copilot agent inputs the original input question and prompt words into the large language model, and the large language model processes the original input question based on the prompt words to obtain the target device identification and target URL description. The prompt words may include question samples, question parameter samples, and question description samples.
步骤S16、Copilot代理从特征数据库中查询目标URL描述对应的目标URL地址,且该目标URL地址可以包括已配置的固定设备标识。Step S16: The Copilot agent queries the target URL address corresponding to the target URL description from the feature database, and the target URL address may include a configured fixed device identifier.
示例性的,若特征数据库存在与目标URL描述匹配的第一URL描述,且第一URL描述对应有URL地址,则将该第一URL描述对应的URL地址确定为目标URL描述对应的目标URL地址,且大语言模型可以得到目标URL地址。Exemplarily, if the feature database contains a first URL description that matches the target URL description, and the first URL description corresponds to a URL address, the URL address corresponding to the first URL description is determined as the target URL address corresponding to the target URL description, and the large language model can obtain the target URL address.
步骤S17、Copilot代理将目标URL地址中的固定设备标识(即URL地址中预先配置的设备标识)替换为目标设备标识得到修改后的URL地址。Step S17: The Copilot agent replaces the fixed device identifier in the target URL address (ie, the device identifier pre-configured in the URL address) with the target device identifier to obtain a modified URL address.
步骤S18、Copilot代理访问运维系统的与修改后的URL地址对应的运维管理页面,并将运维管理页面的目标运维数据返回给用户。Step S18: The Copilot agent accesses the operation and maintenance management page of the operation and maintenance system corresponding to the modified URL address, and returns the target operation and maintenance data of the operation and maintenance management page to the user.
比如说,在对话式运维系统的展示界面下,Copilot代理可以在展示界面接收自然语言指令,在展示界面直接回答简单的文字问题或者生成Chart(图表),并在展示界面跳转到维管理页面,以显示维管理页面的目标运维数据。For example, in the display interface of the conversational operation and maintenance system, the Copilot agent can receive natural language instructions on the display interface, directly answer simple text questions or generate a chart on the display interface, and jump to the maintenance management page on the display interface to display the target operation and maintenance data of the maintenance management page.
以下结合一个单任务流程例子,对步骤S11-步骤S18的处理过程进行说明。The following describes the processing of step S11 to step S18 in conjunction with a single task flow example.
参见图6所示,为单任务流程的示意图,以查看AP信息为例,描述单任务调用的流程,图6中的具体问题和格式描述均为样例,对此不做固定限制。See Figure 6, which is a schematic diagram of a single task process. Taking viewing AP information as an example, the process of single task calling is described. The specific questions and format descriptions in Figure 6 are all examples and there is no fixed restriction on this.
首先,用户需要查看某个AP的基本信息,用户给Copilot代理(Copilot助手)发送了“查看名称为A1208-11的AP的信息”的原始输入问题。First, the user needs to view the basic information of a certain AP. The user sends the original input question "View the information of the AP named A1208-11" to the Copilot agent (Copilot assistant).
然后,为提高知识库的向量匹配度,需要准确提取原始输入问题中的问题参数,为此,Copilot代理还需要获取提示词,且Copilot代理将原始输入问题和提示词一起发送给大语言模型(LLM大模型)。通过使用大语言模型的one-shot的提示词技巧,通过在提示词中构造一个问题样例、问题参数样例和问题描述样例,能够激发大语言模型的泛化能力,使得大语言模型可以按照问题样例来完成后面的任务要求。在图6中,“将问题按照参数部分和意图部分解析,并按照json返回;示例:…}”表示提示词,而“问题是:查看名称为A1208-11的AP基本信息”表示原始输入问题。Then, in order to improve the vector matching degree of the knowledge base, it is necessary to accurately extract the question parameters in the original input question. For this purpose, the Copilot agent also needs to obtain the prompt word, and the Copilot agent sends the original input question and the prompt word to the large language model (LLM large model). By using the one-shot prompt word technique of the large language model, by constructing a question sample, a question parameter sample, and a question description sample in the prompt word, the generalization ability of the large language model can be stimulated, so that the large language model can complete the subsequent task requirements according to the question sample. In Figure 6, "Parse the question according to the parameter part and the intent part, and return it in json; example: ...}" represents the prompt word, and "The question is: View the basic information of the AP named A1208-11" represents the original input question.
然后,大语言模型基于该提示词对该原始输入问题进行结构化处理,得到{apname:A1208-11;desc:AP信息}的结构化问题。在该结构化问题中,apname后面的A1208-11表示目标设备标识(即设备名称),表示需要对该目标设备标识对应的设备进行操作,AP信息表示目标URL描述,是针对原始输入问题的问题描述,表示需要对设备的“AP信息”对应的页面进行操作。Then, the large language model performs structured processing on the original input question based on the prompt word, and obtains a structured question of {apname: A1208-11; desc: AP information}. In this structured question, A1208-11 after apname represents the target device identifier (i.e., the device name), indicating that the device corresponding to the target device identifier needs to be operated, and AP information represents the target URL description, which is the question description for the original input question, indicating that the page corresponding to the device's "AP information" needs to be operated.
然后,大语言模型从特征数据库中查询AP信息对应的目标URL地址。比如说,由于“AP基本信息”与AP信息的相似度较大,因此,“AP基本信息”是第一URL描述,将“AP基本信息”对应的URL地址“http://url?apname=$apname”确定为目标URL地址,这样,大语言模型可以得到该目标URL地址。Then, the large language model queries the target URL address corresponding to the AP information from the feature database. For example, since the similarity between "AP basic information" and AP information is relatively large, "AP basic information" is the first URL description, and the URL address "http://url?apname=$apname" corresponding to "AP basic information" is determined as the target URL address, so that the large language model can obtain the target URL address.
在图6中,AP基本信息是URL描述的主要描述信息,AP射频和AP信道是URL描述的辅助描述信息,若AP基本信息与目标URL描述的相似度未满足条件,但是AP射频与目标URL描述的相似度满足条件,和/或,AP信道与目标URL描述的相似度满足条件,也可以将AP基本信息作为第一URL描述。In Figure 6, AP basic information is the main description information of the URL description, AP radio frequency and AP channel are auxiliary description information of the URL description. If the similarity between the AP basic information and the target URL description does not meet the conditions, but the similarity between the AP radio frequency and the target URL description meets the conditions, and/or the similarity between the AP channel and the target URL description meets the conditions, the AP basic information can also be used as the first URL description.
然后,大语言模型基于目标URL地址和结构化的参数进行替换,如将目标URL地址“http://url?apname=$apname”中的$apname替换为A1208-11,得到修改后的URL地址“http://url?apname=A1208-11”,$apname表示固定设备标识。Then, the large language model performs replacement based on the target URL address and structured parameters, such as replacing $apname in the target URL address "http://url?apname=$apname" with A1208-11 to obtain the modified URL address "http://url?apname=A1208-11", where $apname represents a fixed device identifier.
然后,agent调用http://url?apname=A1208-11,运维系统自动导航到对应的运维管理页面,用户查看运维管理页面的数据,实现人机交互方式的页面访问。Then, the agent calls http://url?apname=A1208-11, and the operation and maintenance system automatically navigates to the corresponding operation and maintenance management page. The user views the data on the operation and maintenance management page, realizing page access in a human-computer interactive manner.
一个例子中,大语言模型从特征数据库中查询AP信息对应的目标URL地址时,若得到多个目标URL地址,如URL地址http://url?apname=$apname和URL地址http://url2?apname=$apname,则可以得到两个修改后的URL地址,分别为“http://url?apname=A1208-11”和“http://url2?apname=A1208-11”。In an example, when the large language model queries the target URL address corresponding to the AP information from the feature database, if multiple target URL addresses are obtained, such as the URL address http://url?apname=$apname and the URL address http://url2?apname=$apname, two modified URL addresses can be obtained, namely "http://url?apname=A1208-11" and "http://url2?apname=A1208-11".
然后,将这两个修改后的URL地址返回给用户,由用户选择一个修改后的URL地址,即,点击选择的URL地址的链接,这样,可以自动导航到这个URL地址的链接对应的运维管理页面,用户可以查看运维管理页面的数据。Then, the two modified URL addresses are returned to the user, and the user selects one of the modified URL addresses, that is, clicks the link of the selected URL address, so that the user can automatically navigate to the operation and maintenance management page corresponding to the link of this URL address, and the user can view the data on the operation and maintenance management page.
针对Copilot代理支持Text2URL功能的情况,本申请实施例中提出一种基于大语言模型的数据处理方法,该数据处理方法可以包括:In view of the fact that the Copilot agent supports the Text2URL function, an embodiment of the present application proposes a data processing method based on a large language model, which may include:
步骤S21、Copilot代理获取运维系统的私域数据,该私域数据可以包括URL描述信息和多个操作步骤描述信息。其中,URL描述信息可以是与任务有关的信息,是针对该任务的描述信息。针对每个操作步骤描述信息,该操作步骤描述信息可以是针对操作步骤的描述信息,可以是针对该操作步骤的问题描述。Step S21, the Copilot agent obtains private domain data of the operation and maintenance system, and the private domain data may include URL description information and multiple operation step description information. The URL description information may be information related to the task, which is description information for the task. For each operation step description information, the operation step description information may be description information for the operation step, which may be a description of the problem for the operation step.
针对每个操作步骤描述信息,私域数据还可以包括该操作步骤描述信息对应的URL描述信息和URL地址信息。URL地址信息是与URL地址有关的信息,URL地址用于对运维系统进行访问。URL描述信息是与URL描述有关的信息,而URL描述是用于对问题(即操作步骤描述信息对应的问题)进行描述的信息。For each operation step description information, the private domain data may also include URL description information and URL address information corresponding to the operation step description information. URL address information is information related to the URL address, and the URL address is used to access the operation and maintenance system. URL description information is information related to the URL description, and the URL description is information used to describe the problem (i.e., the problem corresponding to the operation step description information).
步骤S22、Copilot代理采用私域数据对大语言模型进行增量训练。Step S22: The Copilot agent uses private domain data to perform incremental training on the large language model.
步骤S23、Copilot代理采用私域数据维护RAG功能的特征数据库。Step S23: The Copilot agent uses private domain data to maintain the feature database of the RAG function.
示例性的,若私域数据包括URL描述信息和多个操作步骤描述信息,则可以对URL描述信息进行处理,得到URL描述,该URL描述可以是针对任务的描述信息。针对每个操作步骤描述信息,可以对该操作步骤描述信息进行处理,得到操作步骤描述,该操作步骤描述是针对操作步骤的问题描述信息。然后,可以在特征数据库中记录URL描述与每个操作步骤描述之间的对应关系。Exemplarily, if the private domain data includes URL description information and multiple operation step description information, the URL description information can be processed to obtain a URL description, which can be description information for the task. For each operation step description information, the operation step description information can be processed to obtain an operation step description, which is a problem description information for the operation step. Then, the correspondence between the URL description and each operation step description can be recorded in the feature database.
针对每个操作步骤描述信息,若私域数据可以包括该操作步骤描述信息对应的URL描述信息和URL地址信息,则还可以对URL描述信息进行处理,得到URL描述,对URL地址信息进行处理,得到URL地址。在得到URL描述和URL地址后,在特征数据库中记录URL描述与URL地址之间的对应关系。For each operation step description information, if the private domain data may include URL description information and URL address information corresponding to the operation step description information, the URL description information may be processed to obtain the URL description, and the URL address information may be processed to obtain the URL address. After obtaining the URL description and the URL address, the corresponding relationship between the URL description and the URL address is recorded in the feature database.
步骤S24、Copilot代理获取原始输入问题。Step S24: The Copilot agent obtains the original input question.
步骤S25、Copilot代理将原始输入问题和提示词输入给大语言模型,通过大语言模型基于该提示词对该原始输入问题进行处理得到目标设备标识和目标URL描述,该提示词可以包括问题样例、问题参数样例和问题描述样例。Step S25, the Copilot agent inputs the original input question and prompt words into the large language model, and the large language model processes the original input question based on the prompt words to obtain the target device identification and target URL description. The prompt words may include question samples, question parameter samples, and question description samples.
步骤S26、Copilot代理从特征数据库中查询目标URL描述对应的目标URL地址,且该目标URL地址可以包括已配置的固定设备标识。Step S26: The Copilot agent queries the target URL address corresponding to the target URL description from the feature database, and the target URL address may include a configured fixed device identifier.
示例性的,若特征数据库中存在与目标URL描述匹配的第一URL描述,且第一URL描述对应有多个操作步骤描述,那么,针对每个操作步骤描述:若特征数据库中存在与该操作步骤描述匹配的第二URL描述,且第二URL描述对应有URL地址,则将该第二URL描述对应的URL地址确定为目标URL描述对应的目标URL地址,且大语言模型可以得到多个目标URL地址。Exemplarily, if there is a first URL description matching the target URL description in the feature database, and the first URL description corresponds to multiple operation step descriptions, then for each operation step description: if there is a second URL description matching the operation step description in the feature database, and the second URL description corresponds to a URL address, then the URL address corresponding to the second URL description is determined as the target URL address corresponding to the target URL description, and the large language model can obtain multiple target URL addresses.
步骤S27、Copilot代理将目标URL地址中的固定设备标识(即URL地址中预先配置的设备标识)替换为目标设备标识得到修改后的URL地址。Step S27: The Copilot agent replaces the fixed device identifier in the target URL address (ie, the device identifier pre-configured in the URL address) with the target device identifier to obtain a modified URL address.
步骤S28、Copilot代理访问运维系统的与修改后的URL地址对应的运维管理页面,并将运维管理页面的目标运维数据返回给用户。Step S28, the Copilot agent accesses the operation and maintenance management page of the operation and maintenance system corresponding to the modified URL address, and returns the target operation and maintenance data of the operation and maintenance management page to the user.
示例性的,由于大语言模型可以得到多个目标URL地址,因此,可以得到多个修改后的URL地址,Copilot代理可以依次访问每个与修改后的URL地址对应的运维管理页面,并依次将每个运维管理页面显示给用户。或者,也可以同时访问多个修改后的URL地址对应的运维管理页面,并同时将多个运维管理页面显示给用户,即在一个展示界面显示多个运维管理页面。For example, since the large language model can obtain multiple target URL addresses, multiple modified URL addresses can be obtained, and the Copilot agent can sequentially access each operation and maintenance management page corresponding to the modified URL address, and sequentially display each operation and maintenance management page to the user. Alternatively, the operation and maintenance management pages corresponding to multiple modified URL addresses can be accessed simultaneously, and multiple operation and maintenance management pages can be displayed to the user at the same time, that is, multiple operation and maintenance management pages can be displayed on one display interface.
针对Copilot代理支持Text2API功能的情况,本申请实施例中提出一种基于大语言模型的数据处理方法,该数据处理方法可以包括:In view of the fact that the Copilot agent supports the Text2API function, a data processing method based on a large language model is proposed in an embodiment of the present application. The data processing method may include:
步骤S31、Copilot代理获取运维系统的私域数据,该私域数据可以包括API描述信息和API信息。其中,API信息可以是与API(API表示一种操作方法)有关的信息,API是用于对运维系统进行访问的参数。API描述信息可以是与API描述有关的信息,而API描述是用于对问题进行描述的信息。Step S31, the Copilot agent obtains private domain data of the operation and maintenance system, and the private domain data may include API description information and API information. The API information may be information related to an API (API represents an operation method), and an API is a parameter used to access the operation and maintenance system. The API description information may be information related to an API description, and the API description is information used to describe the problem.
步骤S32、Copilot代理采用私域数据对大语言模型进行增量训练。Step S32: The Copilot agent uses private domain data to perform incremental training on the large language model.
步骤S33、Copilot代理采用私域数据维护RAG功能的特征数据库。Step S33: The Copilot agent uses private domain data to maintain the feature database of the RAG function.
若私域数据包括API描述信息和API信息,则可以对API描述信息进行处理,得到API描述,API描述可以是用于对问题进行描述的信息,如可以包括API对应功能/操作的描述信息、用于对API功能/操作进行调用的问题的描述信息等,对此不做限制。可以对API信息进行处理得到API,API可以是用于对运维系统进行访问的参数。在特征数据库中记录API描述与API之间的对应关系。If the private domain data includes API description information and API information, the API description information can be processed to obtain the API description, which can be information used to describe the problem, such as description information of the API corresponding function/operation, description information of the problem for calling the API function/operation, etc., without limitation. The API information can be processed to obtain the API, which can be a parameter for accessing the operation and maintenance system. The correspondence between the API description and the API is recorded in the feature database.
步骤S34、Copilot代理获取原始输入问题。Step S34: The Copilot agent obtains the original input question.
步骤S35、Copilot代理将原始输入问题和提示词输入给大语言模型,通过大语言模型基于该提示词对该原始输入问题进行处理得到目标设备标识和目标API描述,该提示词可以包括问题样例、问题参数样例和问题描述样例。Step S35, the Copilot agent inputs the original input question and prompt words into the large language model, and the large language model processes the original input question based on the prompt words to obtain the target device identification and target API description. The prompt words may include question samples, question parameter samples, and question description samples.
步骤S36、Copilot代理从特征数据库中查询目标API描述对应的目标API,且该目标API可以包括已配置的固定设备标识。Step S36: The Copilot agent queries the target API corresponding to the target API description from the feature database, and the target API may include a configured fixed device identifier.
示例性的,若特征数据库存在与目标API描述匹配的第一API描述,且第一API描述对应有API,则将该第一API描述对应的API确定为目标API描述对应的目标API,且大语言模型可以得到这个目标API。Exemplarily, if the feature database has a first API description that matches the target API description, and the first API description corresponds to an API, the API corresponding to the first API description is determined as the target API corresponding to the target API description, and the large language model can obtain this target API.
步骤S37、Copilot代理将目标API中的固定设备标识(即API中预先配置的设备标识)替换为目标设备标识得到修改后的API。Step S37: The Copilot agent replaces the fixed device identifier in the target API (ie, the device identifier pre-configured in the API) with the target device identifier to obtain a modified API.
步骤S38、Copilot代理调用修改后的API对运维系统进行操作得到操作结果数据,并基于该操作结果数据确定目标运维数据(如将该操作结果数据作为目标运维数据),并将该目标运维数据返回给用户。Step S38, the Copilot agent calls the modified API to operate the operation and maintenance system to obtain operation result data, and determines target operation and maintenance data based on the operation result data (such as using the operation result data as target operation and maintenance data), and returns the target operation and maintenance data to the user.
针对Copilot代理支持Text2API功能的情况,本申请实施例中提出一种基于大语言模型的数据处理方法,该数据处理方法可以包括:In view of the fact that the Copilot agent supports the Text2API function, a data processing method based on a large language model is proposed in an embodiment of the present application. The data processing method may include:
步骤S41、Copilot代理获取运维系统的私域数据,该私域数据可以包括API描述信息和多个操作步骤描述信息。其中,API描述信息可以是与任务有关的信息,是针对该任务的描述信息。针对每个操作步骤描述信息,该操作步骤描述信息可以是针对操作步骤的描述信息,可以是针对该操作步骤的问题描述。Step S41, the Copilot agent obtains private domain data of the operation and maintenance system, and the private domain data may include API description information and multiple operation step description information. Among them, the API description information may be information related to the task, which is the description information for the task. For each operation step description information, the operation step description information may be the description information for the operation step, which may be the problem description for the operation step.
针对每个操作步骤描述信息,私域数据还可以包括该操作步骤描述信息对应的API描述信息和API信息。API信息是与API有关的信息,API用于对运维系统进行访问。API描述信息是与API描述有关的信息,而API描述是用于对问题(即操作步骤描述信息对应的问题)进行描述的信息。For each operation step description information, the private domain data may also include API description information and API information corresponding to the operation step description information. API information is information related to the API, and the API is used to access the operation and maintenance system. API description information is information related to the API description, and the API description is information used to describe the problem (i.e., the problem corresponding to the operation step description information).
步骤S42、Copilot代理采用私域数据对大语言模型进行增量训练。Step S42: The Copilot agent uses private domain data to perform incremental training on the large language model.
步骤S43、Copilot代理采用私域数据维护RAG功能的特征数据库。Step S43: The Copilot agent uses private domain data to maintain the feature database of the RAG function.
示例性的,若私域数据包括API描述信息和多个操作步骤描述信息,则可以对API描述信息进行处理,得到API描述。针对每个操作步骤描述信息,可以对该操作步骤描述信息进行处理,得到操作步骤描述。然后,可以在特征数据库中记录API描述与每个操作步骤描述之间的对应关系。针对每个操作步骤描述信息,若私域数据可以包括该操作步骤描述信息对应的API描述信息和API信息,则还可以对API描述信息进行处理,得到API描述,对API信息进行处理,得到API,在特征数据库中记录API描述与API之间的对应关系。Exemplarily, if the private domain data includes API description information and multiple operation step description information, the API description information can be processed to obtain the API description. For each operation step description information, the operation step description information can be processed to obtain the operation step description. Then, the correspondence between the API description and each operation step description can be recorded in the feature database. For each operation step description information, if the private domain data can include the API description information and API information corresponding to the operation step description information, the API description information can also be processed to obtain the API description, the API information can be processed to obtain the API, and the correspondence between the API description and the API can be recorded in the feature database.
步骤S44、Copilot代理获取原始输入问题。Step S44: The Copilot agent obtains the original input question.
步骤S45、Copilot代理将原始输入问题和提示词输入给大语言模型,通过大语言模型基于该提示词对该原始输入问题进行处理得到目标设备标识和目标API描述,该提示词可以包括问题样例、问题参数样例和问题描述样例。Step S45, the Copilot agent inputs the original input question and prompt words into the large language model, and the large language model processes the original input question based on the prompt words to obtain the target device identification and target API description. The prompt words may include question samples, question parameter samples, and question description samples.
步骤S46、Copilot代理从特征数据库中查询目标API描述对应的目标API,且该目标API可以包括已配置的固定设备标识。Step S46: The Copilot agent queries the target API corresponding to the target API description from the feature database, and the target API may include a configured fixed device identifier.
示例性的,若特征数据库中存在与目标API描述匹配的第一API描述,且第一API描述对应有多个操作步骤描述,那么,针对每个操作步骤描述:若特征数据库中存在与该操作步骤描述匹配的第二API描述,且第二API描述对应有API,则将该第二API描述对应的API确定为目标API描述对应的目标API。Exemplarily, if there is a first API description in the feature database that matches the target API description, and the first API description corresponds to multiple operation step descriptions, then for each operation step description: if there is a second API description in the feature database that matches the operation step description, and the second API description corresponds to an API, then the API corresponding to the second API description is determined as the target API corresponding to the target API description.
步骤S47、Copilot代理将目标API中的固定设备标识(即API中预先配置的设备标识)替换为目标设备标识得到修改后的API。Step S47: The Copilot agent replaces the fixed device identifier in the target API (ie, the device identifier pre-configured in the API) with the target device identifier to obtain a modified API.
步骤S48、Copilot代理调用修改后的API对运维系统进行操作得到操作结果数据,并基于该操作结果数据确定目标运维数据(如将该操作结果数据作为目标运维数据),并将该目标运维数据返回给用户。Step S48, the Copilot agent calls the modified API to operate the operation and maintenance system to obtain operation result data, and determines target operation and maintenance data based on the operation result data (such as using the operation result data as target operation and maintenance data), and returns the target operation and maintenance data to the user.
示例性的,由于大语言模型可以得到多个目标API,因此,可以得到多个修改后的API,Copilot代理可以依次调用每个修改后的API对运维系统进行操作。For example, since the large language model can obtain multiple target APIs, multiple modified APIs can be obtained, and the Copilot agent can call each modified API in turn to operate the operation and maintenance system.
以下结合一个多任务流程例子,对步骤S41-步骤S48的处理过程进行说明。The processing of step S41 to step S48 is described below with reference to a multi-task process example.
参见图7所示,为多任务流程的示意图,以诊断AP信道利用率高的原因为例,描述多任务调用的流程,具体问题和格式描述均为样例,不做固定限制。See FIG. 7 , which is a schematic diagram of a multi-task process. Taking the diagnosis of the cause of high AP channel utilization as an example, the multi-task calling process is described. The specific questions and format descriptions are examples and are not fixed.
首先,用户需要查看某AP的信道利用率高是什么原因,用户给Copilot代理发送“名称为A1208-11的AP的信道利用率高是什么原因”的原始输入问题。First, the user needs to check why the channel utilization of a certain AP is high. The user sends the original input question "Why is the channel utilization of the AP named A1208-11 high?" to the Copilot agent.
然后,为了提高知识库的向量匹配度,需要准确提取原始输入问题中的问题参数,为此,Copilot代理需要获取提示词,且Copilot代理可以将原始输入问题和提示词一起发送给大语言模型。这样,可以通过使用大语言模型的one-shot的提示词技巧,通过在提示词中构造一个问题样例、问题参数样例和问题描述样例,能够激发大语言模型的泛化能力,使得大语言模型可以按照问题样例来完成后面的任务要求。在图7中,“将问题按照参数部分和意图部分解析,并按照json返回;示例:…}”表示提示词,而“问题是:名称为A1208-11的AP信道利用率高是什么原因”表示原始输入问题。Then, in order to improve the vector matching degree of the knowledge base, it is necessary to accurately extract the question parameters in the original input question. For this purpose, the Copilot agent needs to obtain the prompt word, and the Copilot agent can send the original input question and the prompt word together to the large language model. In this way, by using the one-shot prompt word technique of the large language model, by constructing a question sample, a question parameter sample, and a question description sample in the prompt word, the generalization ability of the large language model can be stimulated, so that the large language model can complete the subsequent task requirements according to the question sample. In Figure 7, "Parse the question according to the parameter part and the intent part, and return it in json; example: ...}" represents the prompt word, and "The question is: What is the reason for the high channel utilization of the AP named A1208-11" represents the original input question.
然后,大语言模型基于该提示词对该原始输入问题进行结构化处理,得到{apname:A1208-11;desc:AP信道利用率高}的结构化问题。在该结构化问题中,apname后面的A1208-11表示目标设备标识(即设备名称),表示需要对该目标设备标识对应的设备进行操作。AP信道利用率高表示目标API描述,是针对原始输入问题的问题描述,表示需要查询AP信道利用率高的原因。Then, the large language model performs structured processing on the original input question based on the prompt word, and obtains the structured question {apname: A1208-11; desc: AP channel utilization is high}. In this structured question, A1208-11 after apname represents the target device identifier (i.e., the device name), indicating that the device corresponding to the target device identifier needs to be operated. AP channel utilization is high, which represents the target API description, which is the problem description for the original input question, indicating that the reason for the high AP channel utilization needs to be queried.
然后,针对结构化问题,大语言模型将“AP信道利用率高”与特征数据库进行向量匹配,匹配到对应的结果内容为:{desc:AP信道利用率高;alias:诊断原因;content:1、查看AP的同时使用终端;2、查看AP的低速率终端;3、查看AP的大流量终端},大语言模型得到上述结果内容。Then, for the structured problem, the large language model performs vector matching of "AP channel utilization is high" with the feature database, and the corresponding result content is: {desc: AP channel utilization is high; alias: diagnosis reason; content: 1. Check the AP while using the terminal; 2. Check the AP's low-speed terminal; 3. Check the AP's high-traffic terminal}, and the large language model obtains the above result content.
在上述结果内容中,“AP信道利用率高”表示与目标API描述对应的第一API描述,“查看AP的同时使用终端”、“查看AP的低速率终端”和“查看AP的大流量终端”表示第一API描述对应的多个操作步骤描述。In the above results, "AP channel utilization is high" represents the first API description corresponding to the target API description, and "View AP while using the terminal", "View AP's low-speed terminal" and "View AP's high-traffic terminal" represent multiple operation step descriptions corresponding to the first API description.
大语言模型基于返回的结果内容进行思维链拆解,比如说,可以继续使用oneshot提示词激发大语言模型的思维能力,将诊断步骤拆解为3步。The large language model breaks down the thought chain based on the returned result content. For example, you can continue to use oneshot prompt words to stimulate the thinking ability of the large language model and break down the diagnostic steps into three steps.
第一步,大语言模型基于提示词对“查看AP的同时使用终端”进行结构化处理,即,将“查看AP的同时使用终端”这个操作步骤描述作为原始输入问题,对该原始输入问题进行结构化处理,得到“AP的同时使用终端”这个目标API描述,A1208-11继续作为目标设备标识(即设备名称)。大语言模型可以从特征数据库中查询“AP的同时使用终端”对应的目标API,参见表2所示,目标API可以为API-1,以http://API?apname=$apname为例。In the first step, the large language model performs structural processing on "viewing AP while using terminal" based on the prompt word, that is, the operation step description of "viewing AP while using terminal" is used as the original input question, and the original input question is structured to obtain the target API description of "viewing AP while using terminal", and A1208-11 continues to be the target device identifier (i.e., device name). The large language model can query the target API corresponding to "viewing AP while using terminal" from the feature database, as shown in Table 2. The target API can be API-1, taking http://API?apname=$apname as an example.
第二步,大语言模型基于提示词对“查看AP的低速率终端”进行结构化处理,即,将“查看AP的低速率终端”这个操作步骤描述作为原始输入问题,对原始输入问题进行结构化处理,得到“AP的低速率终端”这个目标API描述,A1208-11作为目标设备标识。大语言模型从特征数据库中查询“AP的低速率终端”对应的目标API,参见表2所示,目标API可以为API-2。In the second step, the large language model performs structural processing on "view the low-rate terminal of AP" based on the prompt word, that is, the operation step description of "view the low-rate terminal of AP" is used as the original input question, and the original input question is structured to obtain the target API description of "low-rate terminal of AP", and A1208-11 is used as the target device identifier. The large language model queries the target API corresponding to "low-rate terminal of AP" from the feature database, as shown in Table 2, and the target API can be API-2.
第三步,大语言模型基于提示词对“查看AP的大流量终端”进行结构化处理,即,将“查看AP的大流量终端”这个操作步骤描述作为原始输入问题,对原始输入问题进行结构化处理,得到“AP的大流量终端”这个目标API描述,A1208-11作为目标设备标识。大语言模型从特征数据库中查询“AP的大流量终端”对应的目标API,参见表2所示,目标API可以为API-3。In the third step, the large language model performs structural processing on "view AP's high-traffic terminal" based on the prompt word, that is, the operation step description of "view AP's high-traffic terminal" is used as the original input question, and the original input question is structured to obtain the target API description of "AP's high-traffic terminal", and A1208-11 is used as the target device identifier. The large language model queries the target API corresponding to "AP's high-traffic terminal" from the feature database. As shown in Table 2, the target API can be API-3.
然后,大语言模型基于目标API(即API-1)和结构化的参数(即目标设备标识“A1208-11”)进行替换,如将目标API“http://API?apname=$apname”中的$apname替换为A1208-11,得到修改后的API-1“http://API?apname=A1208-11”。Then, the large language model performs replacement based on the target API (i.e., API-1) and the structured parameters (i.e., the target device identifier "A1208-11"), such as replacing $apname in the target API "http://API?apname=$apname" with A1208-11 to obtain the modified API-1 "http://API?apname=A1208-11".
同理,可以将API-2中的$apname替换为A1208-11,得到修改后的API-2,将API-3中的$apname替换为A1208-11,得到修改后的API-3。Similarly, $apname in API-2 can be replaced with A1208-11 to obtain the modified API-2, and $apname in API-3 can be replaced with A1208-11 to obtain the modified API-3.
在得到修改后的API-1、修改后的API-2、修改后的API-3之后,可以按照单任务流程,调用修改后的API-1对运维系统进行操作,即查看AP的同时使用终端,从而获知是否存在同时使用终端过多的查询结果,将是否存在同时使用终端过多的查询结果(目标运维数据)返回给用户。此外,可以按照单任务流程,调用修改后的API-2对运维系统进行操作,即查看AP是否存在低速率终端,从而获知是否存在低速率终端的查询结果,将是否存在低速率终端的查询结果返回给用户。此外,可以按照单任务流程,调用修改后的API-3对运维系统进行操作,即查看AP是否存在大流量终端,从而获知是否存在大流量终端的查询结果,将是否存在大流量终端的查询结果返回给用户。After obtaining the modified API-1, the modified API-2, and the modified API-3, the modified API-1 can be called according to the single-task process to operate the operation and maintenance system, that is, to check the terminals used at the same time by the AP, so as to obtain the query results of whether there are too many terminals used at the same time, and return the query results of whether there are too many terminals used at the same time (target operation and maintenance data) to the user. In addition, the modified API-2 can be called according to the single-task process to operate the operation and maintenance system, that is, to check whether there are low-speed terminals on the AP, so as to obtain the query results of whether there are low-speed terminals, and return the query results of whether there are low-speed terminals to the user. In addition, the modified API-3 can be called according to the single-task process to operate the operation and maintenance system, that is, to check whether there are high-traffic terminals on the AP, so as to obtain the query results of whether there are high-traffic terminals, and return the query results of whether there are high-traffic terminals to the user.
在将每个步骤的结果返回给用户之后,请示用户是否执行恢复策略。用户可以选择是否基于大语言模型推荐的操作方式来完成问题闭环。After returning the result of each step to the user, the user is asked whether to execute the recovery strategy. The user can choose whether to complete the problem loop based on the operation recommended by the large language model.
针对Copilot代理支持Text2SQL功能的情况,本申请实施例中提出一种基于大语言模型的数据处理方法,该数据处理方法可以包括:In view of the fact that the Copilot agent supports the Text2SQL function, an embodiment of the present application proposes a data processing method based on a large language model, which may include:
步骤S51、Copilot代理获取运维系统的私域数据,该私域数据可以包括SQL描述信息和SQL定义信息。其中,SQL定义信息可以是与SQL语句有关的信息,SQL语句是用于对运维系统进行访问的参数。此外,SQL描述信息可以是与SQL描述有关的信息,而SQL描述是用于对问题进行描述的信息。Step S51, the Copilot agent obtains private domain data of the operation and maintenance system, and the private domain data may include SQL description information and SQL definition information. The SQL definition information may be information related to SQL statements, and SQL statements are parameters used to access the operation and maintenance system. In addition, the SQL description information may be information related to SQL descriptions, and SQL descriptions are information used to describe problems.
步骤S52、Copilot代理采用私域数据对大语言模型进行增量训练。Step S52: The Copilot agent uses private domain data to perform incremental training on the large language model.
步骤S53、Copilot代理采用私域数据维护RAG功能的特征数据库。Step S53: The Copilot agent uses private domain data to maintain the feature database of the RAG function.
示例性的,若私域数据包括SQL描述信息和SQL定义信息,则可以对SQL描述信息进行处理,得到SQL描述,SQL描述可以是用于对问题进行描述的信息。可以对SQL定义信息进行处理得到SQL定义,SQL定义也可以理解为SQL功能,如SQL定义可以为select、insert、update、delete等,对此SQL定义不做限制。select可以表示数据表的查询功能,insert可以表示数据表的增加功能,update可以表示数据表的更新功能,delete可以表示数据表的删除功能。Exemplarily, if the private domain data includes SQL description information and SQL definition information, the SQL description information can be processed to obtain the SQL description, which can be information used to describe the problem. The SQL definition information can be processed to obtain the SQL definition, which can also be understood as the SQL function, such as the SQL definition can be select, insert, update, delete, etc., and there is no restriction on this SQL definition. Select can represent the query function of the data table, insert can represent the addition function of the data table, update can represent the update function of the data table, and delete can represent the deletion function of the data table.
然后,可以在特征数据库中记录SQL描述与SQL定义之间的对应关系。Then, the correspondence between the SQL description and the SQL definition may be recorded in the feature database.
步骤S54、Copilot代理获取原始输入问题。Step S54: The Copilot agent obtains the original input question.
步骤S55、Copilot代理将原始输入问题和提示词输入给大语言模型,通过大语言模型基于该提示词对该原始输入问题进行处理得到目标数据表标识和目标SQL描述,该提示词可以包括问题样例、问题参数样例和问题描述样例。Step S55, the Copilot agent inputs the original input question and prompt words into the large language model, and the large language model processes the original input question based on the prompt words to obtain the target data table identifier and target SQL description. The prompt words may include question examples, question parameter examples, and question description examples.
步骤S56、Copilot代理从特征数据库中查询目标SQL描述对应的目标SQL定义。示例性的,若特征数据库存在与目标SQL描述匹配的第一SQL描述,且第一SQL描述对应有SQL定义,则将该第一SQL描述对应的SQL定义确定为目标SQL描述对应的目标SQL定义,且大语言模型可以得到目标SQL定义。Step S56: The Copilot agent queries the target SQL definition corresponding to the target SQL description from the feature database. Exemplarily, if the feature database has a first SQL description that matches the target SQL description, and the first SQL description corresponds to a SQL definition, then the SQL definition corresponding to the first SQL description is determined as the target SQL definition corresponding to the target SQL description, and the large language model can obtain the target SQL definition.
步骤S57、Copilot代理基于目标SQL定义和目标数据表标识生成目标SQL语句,且目标SQL语句可以包括该目标数据表标识。Step S57: The Copilot agent generates a target SQL statement based on the target SQL definition and the target data table identifier, and the target SQL statement may include the target data table identifier.
比如说,若目标SQL定义为select,则可以生成一个用于实现select的目标SQL语句,且目标SQL语句包括目标数据表标识,该目标SQL语句用于对目标数据表标识对应的数据表进行select操作。若目标SQL定义为insert、update、delete等,目标SQL语句的生成方式和功能类似,在此不再赘述。For example, if the target SQL is defined as select, a target SQL statement for implementing select can be generated, and the target SQL statement includes a target data table identifier, and the target SQL statement is used to perform a select operation on the data table corresponding to the target data table identifier. If the target SQL is defined as insert, update, delete, etc., the generation method and function of the target SQL statement are similar and will not be repeated here.
步骤S58、Copilot代理通过执行目标SQL语句,对运维系统的目标数据表标识对应的数据表进行操作,得到数据表的操作结果数据,基于数据表的操作结果数据确定目标运维数据,并将该目标运维数据返回给用户。Step S58, the Copilot agent operates on the data table corresponding to the target data table identifier of the operation and maintenance system by executing the target SQL statement, obtains the operation result data of the data table, determines the target operation and maintenance data based on the operation result data of the data table, and returns the target operation and maintenance data to the user.
由以上技术方案可见,本申请实施例中,通过一个Copilot代理作为中间层,用户可以与Copilot代理进行自然语言交互,完成运维需求的交互,Copilot代理将自然语言转化为指令传递给运维系统,实现对话式智能运维系统,提升运维体验,实现用户与运维系统的无障碍人机交互体验,提升用户体验。基于大语言模型的能力,解决自然语言理解的障碍,让用户的输入从页面点击方式变为人机语言交互方式。基于大语言模型的能力,针对运维系统的功能,如概览、诊断、自愈闭环、报表等,实现基于语言交互的自动实现能力。It can be seen from the above technical solutions that in the embodiment of the present application, through a Copilot agent as an intermediate layer, the user can interact with the Copilot agent in natural language to complete the interaction of operation and maintenance needs. The Copilot agent converts natural language into instructions and passes them to the operation and maintenance system to realize a conversational intelligent operation and maintenance system, improve the operation and maintenance experience, and realize barrier-free human-computer interaction experience between users and the operation and maintenance system, thereby improving user experience. Based on the capabilities of the large language model, the obstacles to natural language understanding are solved, and the user's input is changed from page clicks to human-computer language interaction. Based on the capabilities of the large language model, the automatic implementation capability based on language interaction is realized for the functions of the operation and maintenance system, such as overview, diagnosis, self-healing closed loop, reports, etc.
Copilot代理通过将用户意图转化为Action与运维系统进行页面、API、数据、知识的交互,实现对话式智能运维系统,提升运维体验。通过大语言模型的增量训练和微调,让大语言模型具备行业知识属性,包括但不限于运维系统的页面描述与页面URL的对应关系,运维系统的功能描述与API的对应关系,运维系统的运维经验以及基础的行业运维知识。为解决大语言模型的幻觉问题,提高准确度,采用RAG知识库,将上述知识通过知识库的方式进行构建。The Copilot agent converts user intent into actions to interact with the operation and maintenance system on pages, APIs, data, and knowledge, thereby implementing a conversational intelligent operation and maintenance system and improving the operation and maintenance experience. Through incremental training and fine-tuning of the large language model, the large language model is endowed with industry knowledge attributes, including but not limited to the correspondence between the page description and page URL of the operation and maintenance system, the correspondence between the function description and API of the operation and maintenance system, the operation and maintenance experience of the operation and maintenance system, and basic industry operation and maintenance knowledge. In order to solve the hallucination problem of the large language model and improve accuracy, the RAG knowledge base is used to construct the above knowledge in the form of a knowledge base.
为了提高用户问题与知识库的准确召回率,将用户的问题通过大语言模型进行理解并结构化,将用户的问题进行主谓宾的语义分解,将具体参数拆解出来,并使用核心的问题部分与知识库进行向量匹配,提高匹配度。In order to improve the accurate recall rate of user questions and the knowledge base, the user's questions are understood and structured through a large language model, the user's questions are semantically decomposed into subject, predicate and object, the specific parameters are disassembled, and the core question part is used for vector matching with the knowledge base to improve the matching degree.
对于多步骤操作,大语言模型进行思维链拆分,针对每一步骤的操作描述进行结构化后与知识库进行匹配,来找到对应的操作URL或者API,并将结构化参数替换到function-call的动作中,并在用户确认后进行逐步操作执行。For multi-step operations, the large language model splits the thought chain, structures the operation description of each step, and matches it with the knowledge base to find the corresponding operation URL or API, replace the structured parameters with the function-call action, and execute the step-by-step operation after the user confirms.
基于与上述方法同样的申请构思,本申请实施例中提出一种基于大语言模型的数据处理装置,参见图8所示,为该装置的结构示意图,该装置包括:Based on the same application concept as the above method, a data processing device based on a large language model is proposed in the embodiment of the present application. See FIG8 , which is a schematic diagram of the structure of the device. The device includes:
处理模块81,用于将原始输入问题和提示词输入给大语言模型,通过大语言模型基于所述提示词对所述原始输入问题进行处理得到目标问题参数和目标问题描述;其中,所述提示词包括问题样例、问题参数样例和问题描述样例;The processing module 81 is used to input the original input question and the prompt word into the large language model, and the large language model processes the original input question based on the prompt word to obtain the target question parameters and the target question description; wherein the prompt word includes a question sample, a question parameter sample and a question description sample;
查询模块82,用于从特征数据库中查询所述目标问题描述对应的目标访问参数;其中,所述特征数据库包括访问描述与访问参数之间的对应关系;A query module 82, configured to query a target access parameter corresponding to the target problem description from a feature database; wherein the feature database includes a correspondence between access descriptions and access parameters;
操作模块83,用于基于所述目标访问参数和所述目标问题参数生成运维系统操作数据,并通过所述运维系统操作数据对运维系统进行操作,得到与所述原始输入问题对应的目标运维数据,将所述目标运维数据返回给用户。The operation module 83 is used to generate operation and maintenance system operation data based on the target access parameters and the target question parameters, and operate the operation and maintenance system through the operation and maintenance system operation data to obtain the target operation and maintenance data corresponding to the original input question, and return the target operation and maintenance data to the user.
示例性的,所述查询模块82从特征数据库中查询所述目标问题描述对应的目标访问参数时具体用于:若所述特征数据库中存在与所述目标问题描述匹配的第一访问描述,则从所述特征数据库中获取所述第一访问描述对应的第一数据结构,所述第一数据结构包括所述第一访问描述和访问参数,将所述第一数据结构中的所述访问参数确定为所述目标问题描述对应的目标访问参数。Exemplarily, when the query module 82 queries the target access parameters corresponding to the target problem description from the feature database, it is specifically used to: if there is a first access description matching the target problem description in the feature database, obtain a first data structure corresponding to the first access description from the feature database, the first data structure including the first access description and access parameters, and determine the access parameters in the first data structure as the target access parameters corresponding to the target problem description.
示例性的,所述处理模块81,还用于在从特征数据库中查询所述目标问题描述对应的目标访问参数之前,获取运维系统的私域数据,所述私域数据包括访问描述信息和访问参数信息;从所述访问描述信息中提取第一描述关键词,基于该第一描述关键词生成访问描述;从所述访问参数信息中提取第一参数关键词,基于该第一参数关键词生成访问参数;在所述特征数据库中记录该访问描述与该访问参数之间的对应关系。Exemplarily, the processing module 81 is also used to obtain private domain data of the operation and maintenance system before querying the target access parameters corresponding to the target problem description from the feature database, the private domain data including access description information and access parameter information; extract the first description keyword from the access description information, and generate an access description based on the first description keyword; extract the first parameter keyword from the access parameter information, and generate an access parameter based on the first parameter keyword; and record the correspondence between the access description and the access parameter in the feature database.
示例性的,所述查询模块82从特征数据库中查询所述目标问题描述对应的目标访问参数时具体用于:若所述特征数据库中存在与所述目标问题描述匹配的第二访问描述,则从所述特征数据库中获取所述第二访问描述对应的第二数据结构,所述第二数据结构包括所述第二访问描述和多个操作步骤描述;针对每个操作步骤描述,若所述特征数据库中存在与该操作步骤描述匹配的第三访问描述,则从所述特征数据库中获取所述第三访问描述对应的第三数据结构,所述第三数据结构包括所述第三访问描述和访问参数,将所述第三数据结构中的访问参数确定为该操作步骤描述对应的访问参数;将每个操作步骤描述对应的访问参数确定为所述目标问题描述对应的目标访问参数。Exemplarily, when the query module 82 queries the target access parameters corresponding to the target problem description from the feature database, it is specifically used to: if there is a second access description matching the target problem description in the feature database, obtain a second data structure corresponding to the second access description from the feature database, the second data structure including the second access description and multiple operation step descriptions; for each operation step description, if there is a third access description matching the operation step description in the feature database, obtain a third data structure corresponding to the third access description from the feature database, the third data structure including the third access description and access parameters, and determine the access parameters in the third data structure as the access parameters corresponding to the operation step description; determine the access parameters corresponding to each operation step description as the target access parameters corresponding to the target problem description.
示例性的,所述处理模块81,还用于在从特征数据库中查询所述目标问题描述对应的目标访问参数之前,获取运维系统的私域数据,所述私域数据包括访问描述信息和多个操作步骤描述信息;针对每个操作步骤描述信息,所述私域数据还包括该操作步骤描述信息对应的访问描述信息和访问参数信息;从访问描述信息中提取第二描述关键词,基于该第二描述关键词生成访问描述;针对每个操作步骤描述信息,从该操作步骤描述信息中提取操作步骤关键词,基于该操作步骤关键词生成操作步骤描述;在特征数据库中记录该访问描述与该操作步骤描述的对应关系;针对每个操作步骤描述信息,从该操作步骤描述信息对应的访问描述信息中提取第三描述关键词,基于该第三描述关键词生成访问描述;从该操作步骤描述信息对应的访问参数信息中提取第二参数关键词,基于该第二参数关键词生成访问参数;在所述特征数据库中记录该访问描述与该访问参数之间的对应关系。Exemplarily, the processing module 81 is also used to obtain private domain data of the operation and maintenance system before querying the target access parameters corresponding to the target problem description from the feature database, the private domain data including access description information and multiple operation step description information; for each operation step description information, the private domain data also includes access description information and access parameter information corresponding to the operation step description information; extract the second description keyword from the access description information, and generate an access description based on the second description keyword; for each operation step description information, extract the operation step keyword from the operation step description information, and generate an operation step description based on the operation step keyword; record the correspondence between the access description and the operation step description in the feature database; for each operation step description information, extract the third description keyword from the access description information corresponding to the operation step description information, and generate an access description based on the third description keyword; extract the second parameter keyword from the access parameter information corresponding to the operation step description information, and generate an access parameter based on the second parameter keyword; and record the correspondence between the access description and the access parameter in the feature database.
示例性的,所述目标问题参数包括目标设备标识,所述目标问题描述包括目标URL描述,所述目标访问参数包括目标URL地址,且所述目标URL地址包括已配置的固定设备标识;所述操作模块83基于所述目标访问参数和所述目标问题参数生成运维系统操作数据时具体用于:将所述目标URL地址中的固定设备标识替换为所述目标设备标识得到修改后的URL地址,作为所述运维系统操作数据;所述操作模块83通过所述运维系统操作数据对运维系统进行操作,得到与所述原始输入问题对应的目标运维数据时具体用于:访问运维系统的与所述修改后的URL地址对应的运维管理页面,所述运维管理页面包括所述目标运维数据。Exemplarily, the target problem parameters include a target device identifier, the target problem description includes a target URL description, the target access parameters include a target URL address, and the target URL address includes a configured fixed device identifier; when the operation module 83 generates the operation and maintenance system operation data based on the target access parameters and the target problem parameters, it is specifically used to: replace the fixed device identifier in the target URL address with the target device identifier to obtain a modified URL address as the operation and maintenance system operation data; when the operation module 83 operates the operation and maintenance system through the operation and maintenance system operation data to obtain the target operation and maintenance data corresponding to the original input problem, it is specifically used to: access the operation and maintenance management page of the operation and maintenance system corresponding to the modified URL address, and the operation and maintenance management page includes the target operation and maintenance data.
示例性的,所述目标问题参数包括目标设备标识,所述目标问题描述包括目标API描述,所述目标访问参数包括目标API,且所述目标API包括已配置的固定设备标识;所述操作模块83基于所述目标访问参数和所述目标问题参数生成运维系统操作数据时具体用于:将所述目标API中的固定设备标识替换为所述目标设备标识得到修改后的API,作为所述运维系统操作数据;所述操作模块83通过所述运维系统操作数据对运维系统进行操作,得到与所述原始输入问题对应的目标运维数据时具体用于:调用所述修改后的API对运维系统进行操作得到操作结果数据,基于操作结果数据确定所述目标运维数据。Exemplarily, the target problem parameters include a target device identifier, the target problem description includes a target API description, the target access parameters include a target API, and the target API includes a configured fixed device identifier; when the operation module 83 generates the operation and maintenance system operation data based on the target access parameters and the target problem parameters, it is specifically used to: replace the fixed device identifier in the target API with the target device identifier to obtain a modified API as the operation and maintenance system operation data; when the operation module 83 operates the operation and maintenance system through the operation and maintenance system operation data to obtain the target operation and maintenance data corresponding to the original input problem, it is specifically used to: call the modified API to operate the operation and maintenance system to obtain operation result data, and determine the target operation and maintenance data based on the operation result data.
示例性的,所述目标问题参数包括目标数据表标识,所述目标问题描述包括目标SQL描述,所述目标访问参数包括目标SQL定义;所述操作模块83基于所述目标访问参数和所述目标问题参数生成运维系统操作数据时具体用于:基于所述目标SQL定义和所述目标数据表标识生成目标SQL语句,作为所述运维系统操作数据,所述目标SQL语句包括所述目标数据表标识;所述操作模块83通过所述运维系统操作数据对运维系统进行操作,得到与所述原始输入问题对应的目标运维数据时具体用于:通过执行所述目标SQL语句,对运维系统的所述目标数据表标识对应的数据表进行操作,得到所述数据表的操作结果数据;基于所述数据表的操作结果数据确定所述目标运维数据。Exemplarily, the target problem parameters include a target data table identifier, the target problem description includes a target SQL description, and the target access parameters include a target SQL definition; when the operation module 83 generates the operation and maintenance system operation data based on the target access parameters and the target problem parameters, it is specifically used to: generate a target SQL statement based on the target SQL definition and the target data table identifier as the operation and maintenance system operation data, and the target SQL statement includes the target data table identifier; when the operation module 83 operates the operation and maintenance system through the operation and maintenance system operation data to obtain the target operation and maintenance data corresponding to the original input problem, it is specifically used to: operate the data table corresponding to the target data table identifier of the operation and maintenance system by executing the target SQL statement to obtain the operation result data of the data table; determine the target operation and maintenance data based on the operation result data of the data table.
基于与上述方法同样的申请构思,本申请实施例中提出一种电子设备,参见图9所示,包括:处理器91和机器可读存储介质92,机器可读存储介质92存储有能够被处理器91执行的机器可执行指令;处理器91用于执行机器可执行指令,以实现本申请上述示例公开的基于大语言模型的数据处理方法。Based on the same application concept as the above method, an electronic device is proposed in an embodiment of the present application, as shown in Figure 9, including: a processor 91 and a machine-readable storage medium 92, the machine-readable storage medium 92 stores machine-executable instructions that can be executed by the processor 91; the processor 91 is used to execute the machine-executable instructions to implement the data processing method based on the large language model disclosed in the above example of the present application.
基于与上述方法同样的申请构思,本申请实施例还提供一种机器可读存储介质,所述机器可读存储介质上存储有若干计算机指令,所述计算机指令被处理器执行时,能够实现本申请上述示例公开的基于大语言模型的数据处理方法。Based on the same application concept as the above method, an embodiment of the present application also provides a machine-readable storage medium, on which a number of computer instructions are stored. When the computer instructions are executed by a processor, the data processing method based on a large language model disclosed in the above example of the present application can be implemented.
其中,上述机器可读存储介质可以是任何电子、磁性、光学或其它物理存储装置,可以包含或存储信息,如可执行指令、数据,等等。例如,机器可读存储介质可以是:RAM(Radom Access Memory,随机存取存储器)、易失存储器、非易失性存储器、闪存、存储驱动器(如硬盘驱动器)、固态硬盘、任何类型的存储盘(如光盘、dvd等),或者类似的存储介质,或者它们的组合。The above-mentioned machine-readable storage medium may be any electronic, magnetic, optical or other physical storage device, which may contain or store information, such as executable instructions, data, etc. For example, the machine-readable storage medium may be: RAM (Radom Access Memory), volatile memory, non-volatile memory, flash memory, storage drive (such as hard disk drive), solid state drive, any type of storage disk (such as CD, DVD, etc.), or similar storage medium, or a combination thereof.
本领域内的技术人员应明白,本申请的实施例可提供为方法、系统、或计算机程序产品。因此,本申请可采用完全硬件实施例、完全软件实施例、或结合软件和硬件方面的实施例的形式。而且,本申请实施例可采用在一个或多个其中包含有计算机可用程序代码的计算机可用存储介质(包括但不限于磁盘存储器、CD-ROM、光学存储器等)上实施的计算机程序产品的形式。Those skilled in the art will appreciate that the embodiments of the present application may be provided as methods, systems, or computer program products. Therefore, the present application may adopt the form of a complete hardware embodiment, a complete software embodiment, or an embodiment combining software and hardware. Moreover, the embodiments of the present application may adopt the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) containing computer-usable program codes.
以上所述仅为本申请的实施例而已,并不用于限制本申请。对于本领域技术人员来说,本申请可以有各种更改和变化。凡在本申请的精神和原理之内所作的任何修改、等同替换、改进等,均应包含在本申请的权利要求范围之内。The above is only an embodiment of the present application and is not intended to limit the present application. For those skilled in the art, the present application may have various changes and variations. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present application should be included in the scope of the claims of the present application.
Claims (15)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202410772348.6A CN118349661B (en) | 2024-06-14 | 2024-06-14 | Data processing method, device and equipment based on large language model |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202410772348.6A CN118349661B (en) | 2024-06-14 | 2024-06-14 | Data processing method, device and equipment based on large language model |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN118349661A true CN118349661A (en) | 2024-07-16 |
| CN118349661B CN118349661B (en) | 2024-09-10 |
Family
ID=91815816
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202410772348.6A Active CN118349661B (en) | 2024-06-14 | 2024-06-14 | Data processing method, device and equipment based on large language model |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN118349661B (en) |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN118964580A (en) * | 2024-08-22 | 2024-11-15 | 浪潮云信息技术股份公司 | A method and system for improving the NL2SQL capability of large models in the rail transit industry based on hybrid cloud multi-mode and RAG framework |
| CN119341815A (en) * | 2024-10-22 | 2025-01-21 | 安徽大学 | Multimodal phishing website detection method based on large model |
Citations (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20170011308A1 (en) * | 2015-07-09 | 2017-01-12 | SunView Software, Inc. | Methods and Systems for Applying Machine Learning to Automatically Solve Problems |
| US11328796B1 (en) * | 2020-02-25 | 2022-05-10 | Vignet Incorporated | Techniques for selecting cohorts for decentralized clinical trials for pharmaceutical research |
| CN116992006A (en) * | 2023-09-26 | 2023-11-03 | 武汉益模科技股份有限公司 | Chain type natural language interaction method and system driven by large language model |
| CN117473037A (en) * | 2023-12-27 | 2024-01-30 | 广州云积软件技术有限公司 | Examination question bank construction method and system based on large language model and electronic equipment |
| CN117556008A (en) * | 2023-11-13 | 2024-02-13 | 网易传媒科技(北京)有限公司 | Interaction method, medium, device and computing equipment based on large language model |
| CN117763079A (en) * | 2023-11-09 | 2024-03-26 | 招银云创信息技术有限公司 | Solution recommendation method, device and computer readable storage medium |
| CN118171646A (en) * | 2024-03-14 | 2024-06-11 | 中国电信股份有限公司 | Parameter calling method, device, electronic equipment and computer program product |
-
2024
- 2024-06-14 CN CN202410772348.6A patent/CN118349661B/en active Active
Patent Citations (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20170011308A1 (en) * | 2015-07-09 | 2017-01-12 | SunView Software, Inc. | Methods and Systems for Applying Machine Learning to Automatically Solve Problems |
| US11328796B1 (en) * | 2020-02-25 | 2022-05-10 | Vignet Incorporated | Techniques for selecting cohorts for decentralized clinical trials for pharmaceutical research |
| CN116992006A (en) * | 2023-09-26 | 2023-11-03 | 武汉益模科技股份有限公司 | Chain type natural language interaction method and system driven by large language model |
| CN117763079A (en) * | 2023-11-09 | 2024-03-26 | 招银云创信息技术有限公司 | Solution recommendation method, device and computer readable storage medium |
| CN117556008A (en) * | 2023-11-13 | 2024-02-13 | 网易传媒科技(北京)有限公司 | Interaction method, medium, device and computing equipment based on large language model |
| CN117473037A (en) * | 2023-12-27 | 2024-01-30 | 广州云积软件技术有限公司 | Examination question bank construction method and system based on large language model and electronic equipment |
| CN118171646A (en) * | 2024-03-14 | 2024-06-11 | 中国电信股份有限公司 | Parameter calling method, device, electronic equipment and computer program product |
Non-Patent Citations (1)
| Title |
|---|
| 裴丹等: ""大语言模型时代的智能运维"", 《中兴通讯技术》, 30 April 2024 (2024-04-30), pages 56 - 62 * |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN118964580A (en) * | 2024-08-22 | 2024-11-15 | 浪潮云信息技术股份公司 | A method and system for improving the NL2SQL capability of large models in the rail transit industry based on hybrid cloud multi-mode and RAG framework |
| CN119341815A (en) * | 2024-10-22 | 2025-01-21 | 安徽大学 | Multimodal phishing website detection method based on large model |
Also Published As
| Publication number | Publication date |
|---|---|
| CN118349661B (en) | 2024-09-10 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US11887597B2 (en) | Voice application platform | |
| US11615791B2 (en) | Voice application platform | |
| US10943589B2 (en) | Voice application platform | |
| WO2025082390A1 (en) | Method and electronic device for generating article content | |
| US11394667B2 (en) | Chatbot skills systems and methods | |
| CN118349661B (en) | Data processing method, device and equipment based on large language model | |
| US11437029B2 (en) | Voice application platform | |
| US20130239006A1 (en) | Aggregator, filter and delivery system for online context dependent interaction, systems and methods | |
| CN109492053B (en) | Method and apparatus for accessing data | |
| CN116974554A (en) | Code data processing methods, devices, computer equipment and storage media | |
| CN119358657B (en) | Method, device and storage medium for selecting knowledge base | |
| EP4582968A1 (en) | Efficient generation of application programming interface calls using language models, data types, and enriched schema | |
| CN120162326A (en) | A personalized interaction enhancement method and system based on large language model | |
| JP7493195B1 (en) | Program, method, information processing device, and system | |
| CN112470216B (en) | Voice Application Platform | |
| CN118132721A (en) | Intelligent question-answering method, intelligent question-answering device, related equipment and computer program product | |
| CN118861560A (en) | A medical task execution method and computer device | |
| US20220075960A1 (en) | Interactive Communication System with Natural Language Adaptive Components | |
| JP2021015460A (en) | Question/answer display server, question/answer display method and question/answer display program | |
| US12321343B1 (en) | Natural language to SQL on custom enterprise data warehouse powered by generative artificial intelligence | |
| CN120011385A (en) | Database query method, device, equipment and medium based on natural language | |
| WO2025241513A1 (en) | Text generation method, apparatus and system, computing device, and readable storage medium | |
| US12505288B1 (en) | Multi-machine learning system for interacting with a large language model | |
| JP2025169153A (en) | Method, system, and program for configuring RAG | |
| CN121233620A (en) | Tool creation, maintenance and calling method, system, device, storage medium and program product |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant |