CN117390336A - A web page process automation method, device, equipment and storage medium - Google Patents
A web page process automation method, device, equipment and storage medium Download PDFInfo
- Publication number
- CN117390336A CN117390336A CN202311498917.4A CN202311498917A CN117390336A CN 117390336 A CN117390336 A CN 117390336A CN 202311498917 A CN202311498917 A CN 202311498917A CN 117390336 A CN117390336 A CN 117390336A
- Authority
- CN
- China
- Prior art keywords
- natural language
- web page
- instruction
- language instructions
- leaf nodes
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 74
- 238000004801 process automation Methods 0.000 title claims abstract description 27
- 230000008569 process Effects 0.000 claims abstract description 29
- 238000012545 processing Methods 0.000 claims abstract description 22
- 238000012549 training Methods 0.000 claims description 10
- 230000009471 action Effects 0.000 claims description 8
- 238000013528 artificial neural network Methods 0.000 claims description 6
- 230000000306 recurrent effect Effects 0.000 claims description 5
- 238000004590 computer program Methods 0.000 claims 4
- 238000011161 development Methods 0.000 abstract description 7
- 238000013473 artificial intelligence Methods 0.000 abstract description 3
- 230000000875 corresponding effect Effects 0.000 description 7
- 230000002787 reinforcement Effects 0.000 description 7
- 238000003058 natural language processing Methods 0.000 description 6
- 241000282414 Homo sapiens Species 0.000 description 5
- 230000003287 optical effect Effects 0.000 description 4
- 238000010586 diagram Methods 0.000 description 3
- 230000006870 function Effects 0.000 description 3
- 230000003993 interaction Effects 0.000 description 3
- 238000011112 process operation Methods 0.000 description 3
- 230000002159 abnormal effect Effects 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 238000004891 communication Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000002093 peripheral effect Effects 0.000 description 2
- 238000004088 simulation Methods 0.000 description 2
- 230000001133 acceleration Effects 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 230000006399 behavior Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 230000005012 migration Effects 0.000 description 1
- 238000013508 migration Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000000750 progressive effect Effects 0.000 description 1
- 230000003252 repetitive effect Effects 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/958—Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/044—Recurrent networks, e.g. Hopfield networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/092—Reinforcement learning
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Biomedical Technology (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Evolutionary Computation (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Information Transfer Between Computers (AREA)
Abstract
Description
技术领域Technical field
本申请涉及人工智能技术领域,尤其涉及一种网页流程自动化方法、装置、设备及存储介质。This application relates to the field of artificial intelligence technology, and in particular to a web page process automation method, device, equipment and storage medium.
背景技术Background technique
随着企业信息化建设日益完善,大量的机器人网页流程自动化(Robotic ProcessAutomation,RPA)产品涌入市场,RPA产品通过模拟人类在电子系统间的操作,不仅可以高效的完成大量重复性的工作,还可以高效高质的实现大量的数据整合、迁移和处理,极大的提高了办公效率。With the increasing improvement of enterprise information construction, a large number of Robotic Process Automation (RPA) products have flooded into the market. By simulating human operations between electronic systems, RPA products can not only efficiently complete a large number of repetitive tasks, but also It can realize a large amount of data integration, migration and processing with high efficiency and high quality, greatly improving office efficiency.
目前,RPA具有泛化能力差的缺点,每一个RPA流程只对应固定的键鼠操作顺序和UI交互位置,因此每个网页作业都具有特定的RPA流程,且无法应用于其他场景,导致泛化能力差;另外,同样的网页如果按钮或输入框的样式发生任意改变,RPA流程就可能定位不到需要操作的UI,从而导致误操作等。Currently, RPA has the disadvantage of poor generalization ability. Each RPA process only corresponds to a fixed keyboard and mouse operation sequence and UI interaction position. Therefore, each web page job has a specific RPA process and cannot be applied to other scenarios, leading to generalization. Poor ability; in addition, if the style of buttons or input boxes on the same web page is arbitrarily changed, the RPA process may not be able to locate the UI that needs to be operated, leading to misoperations.
因此对于更换网页界面或者在不同的应用环境下使用RPA,都需要重新部署RPA,而重新部署RPA需要花费大量的时间,所需成本较高。Therefore, when changing the web interface or using RPA in a different application environment, RPA needs to be redeployed, and redeploying RPA takes a lot of time and requires high costs.
发明内容Contents of the invention
有鉴于此,本申请提供了一种网页流程自动化方法及装置,旨在使得RPA可以适用于多种应用场景,减少在不同应用场景下重新部署RPA,降低开发成本。In view of this, this application provides a web page process automation method and device, aiming to make RPA applicable to a variety of application scenarios, reduce the redeployment of RPA in different application scenarios, and reduce development costs.
第一方面,本申请提供了一种网页流程自动化方法,该方法包括:In the first aspect, this application provides a web page process automation method, which method includes:
获取网页插件接收的自然语言指令,所述自然语言指令中包括多类指令;Obtain the natural language instructions received by the web page plug-in, and the natural language instructions include multiple types of instructions;
对所述自然语言指令进行处理,得到多个指令键值对;Process the natural language instructions to obtain multiple instruction key-value pairs;
根据所述自然语言指令解析所述网页的文档对象模型,得到所述网页中的可操作叶子节点;Parse the document object model of the web page according to the natural language instructions to obtain operable leaf nodes in the web page;
将所述多个指令键值对和所述可操作叶子节点输入预先训练好的智能体,以自动执行所述自然语言指令。The plurality of instruction key-value pairs and the operable leaf nodes are input into a pre-trained agent to automatically execute the natural language instructions.
可选的,对所述智能体进行训练包括:Optionally, training the agent includes:
将循环神经网络作为所述智能体的核心网络,以指令键值对作为输入、可操作叶子节点作为状态空间、点击文本框和输入自然语言文字为动作空间对所述智能体进行训练。The recurrent neural network is used as the core network of the agent, and the agent is trained with instruction key-value pairs as input, operable leaf nodes as state space, clicking text boxes and inputting natural language text as action space.
可选的,所述获取网页插件接收的自然语言指令包括:Optionally, the natural language instructions received by the web page acquisition plug-in include:
获取预设时间段内接收到的自然语言指令,按照接收的时间先后顺序对所述预设时间段内接收到的自然语言指令进行存储。Obtain the natural language instructions received within the preset time period, and store the natural language instructions received within the preset time period in the order of reception time.
可选的,当所述自然语言指令为下载类指令时,所述下载类指令中包括下载完成后的存储地址,所述方法还包括:Optionally, when the natural language instruction is a download instruction, the download instruction includes the storage address after the download is completed, and the method further includes:
完成所述下载类指令后,按照所述存储地址存储所述下载指令中包括的下载内容。After completing the download instruction, the download content included in the download instruction is stored according to the storage address.
可选的,所述可操作叶子节点包括:Optionally, the operable leaf nodes include:
网页中的文本框、按钮、下拉框和复选框中的一种或多种。One or more of text boxes, buttons, drop-down boxes, and check boxes in web pages.
第二方面,本申请提供了一种网页流程自动化装置,该装置包括:In the second aspect, this application provides a web page process automation device, which includes:
获取单元,用于获取网页插件接收的自然语言指令,所述自然语言指令中包括多类指令;An acquisition unit, used to acquire natural language instructions received by the web page plug-in, where the natural language instructions include multiple types of instructions;
第一处理单元,用于对所述自然语言指令进行处理,得到多个指令键值对;A first processing unit, configured to process the natural language instructions to obtain multiple instruction key-value pairs;
第二处理单元,用于根据所述自然语言指令解析所述网页的文档对象模型,得到所述网页中的可操作叶子节点;A second processing unit configured to parse the document object model of the web page according to the natural language instruction to obtain operable leaf nodes in the web page;
输入单元,用于将所述多个指令键值对和所述可操作叶子节点输入预先训练好的智能体,以自动执行所述自然语言指令。An input unit is used to input the plurality of instruction key-value pairs and the operable leaf nodes into a pre-trained agent to automatically execute the natural language instruction.
可选的,所述装置还包括训练单元,用于将循环神经网络作为所述智能体的核心网络,以指令键值对作为输入、可操作叶子节点作为状态空间、点击文本框和输入自然语言文字为动作空间对所述智能体进行训练。Optionally, the device also includes a training unit for using the recurrent neural network as the core network of the agent, using instruction key-value pairs as input, operable leaf nodes as the state space, clicking on the text box and inputting natural language. The text trains the agent in action space.
可选的,所述获取单元具体用于:Optionally, the acquisition unit is specifically used for:
获取预设时间段内接收到的自然语言指令,按照接收的时间先后顺序对所述预设时间段内接收到的自然语言指令进行存储。Obtain the natural language instructions received within the preset time period, and store the natural language instructions received within the preset time period in the order of reception time.
可选的,所述装置还包括存储单元,当所述自然语言指令为下载类指令时,所述下载类指令中包括下载完成后的存储地址,当完成所述下载类指令后,所述存储单元用于按照所述存储地址存储所述下载指令中包括的下载内容。Optionally, the device further includes a storage unit. When the natural language instruction is a download instruction, the download instruction includes a storage address after the download is completed. After the download instruction is completed, the storage The unit is configured to store the download content included in the download instruction according to the storage address.
可选的,所述可操作叶子节点包括:Optionally, the operable leaf nodes include:
网页中的文本框、按钮、下拉框和复选框中的一种或多种。One or more of text boxes, buttons, drop-down boxes, and check boxes in web pages.
第三方面,本申请提供了一种设备,所述设备包括存储器和处理器,所述存储器用于存储指令或代码,所述处理器用于执行所述指令或代码,以使所述设备执行前述第一方面任一项所述的网页流程自动化方法。In a third aspect, the application provides a device. The device includes a memory and a processor. The memory is used to store instructions or codes. The processor is used to execute the instructions or codes, so that the device executes the foregoing. The web page process automation method described in any one of the first aspects.
第四方面,本申请提供了一种计算机可读存储介质,所述计算机可读存储介质中存储有代码,当所述代码被运行时,运行所述代码的设备实现前述第一方面任一项所述的网页流程自动化方法。In a fourth aspect, the present application provides a computer-readable storage medium. Code is stored in the computer-readable storage medium. When the code is run, the device running the code implements any of the foregoing first aspects. The described web page process automation method.
本申请提供了一种网页流程自动化方法。在执行所述方法时,先获取网页插件接收的自然语言指令,所述自然语言指令中包括多类指令,然后对所述自然语言指令进行处理,得到多个指令键值对,根据所述自然语言指令解析所述网页的文档对象模型,得到所述网页中的可操作叶子节点,将所述多个指令键值对和所述可操作叶子节点输入预先训练好的智能体,以自动执行所述自然语言指令。如此,通过对网页获取的自然语言指令进行处理,得到需要智能体进行操作的叶子节点和指令键值对,使得智能体根据叶子节点和指令键值对实现网页流程自动化。由于键值对和叶子节点为网页流程操作中的简单元素,对于更换网页界面或者更换应用环境的情况下,训练好的智能体仍能通过操作对自然语言指令进行解析得到指令键值对和可操作叶子节点进而实现网页流程自动化,减少在不同应用场景下重新部署RPA,降低开发成本。This application provides a web page process automation method. When executing the method, first obtain the natural language instructions received by the web page plug-in. The natural language instructions include multiple types of instructions, and then process the natural language instructions to obtain multiple instruction key-value pairs. According to the natural language instructions, The language instruction parses the document object model of the web page to obtain the operable leaf nodes in the web page, and inputs the multiple instruction key-value pairs and the operable leaf nodes into the pre-trained agent to automatically execute the Describe natural language instructions. In this way, by processing the natural language instructions obtained from the web page, the leaf nodes and instruction key-value pairs that need to be operated by the agent are obtained, so that the agent can realize the automation of the web page process based on the leaf nodes and instruction key-value pairs. Since key-value pairs and leaf nodes are simple elements in web page process operations, when the web interface is changed or the application environment is changed, the trained agent can still parse the natural language instructions through operations to obtain the instruction key-value pairs and available Operate leaf nodes to automate web page processes, reduce redeployment of RPA in different application scenarios, and reduce development costs.
附图说明Description of the drawings
为更清楚地说明本实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to explain this embodiment or the technical solution in the prior art more clearly, the drawings needed to be used in the description of the embodiment or the prior art will be briefly introduced below. Obviously, the drawings in the following description are only For some embodiments of the present application, those of ordinary skill in the art can also obtain other drawings based on these drawings without exerting creative efforts.
图1为本申请实施例提供的一种网页流程自动化方法的流程图;Figure 1 is a flow chart of a web page process automation method provided by an embodiment of the present application;
图2为本申请实施例提供的一种网页流程自动化装置的结构示意图;Figure 2 is a schematic structural diagram of a web page process automation device provided by an embodiment of the present application;
图3为本申请实施例提供的一种计算机设备的结构示意图。FIG. 3 is a schematic structural diagram of a computer device provided by an embodiment of the present application.
具体实施方式Detailed ways
如本申请背景技术所述,现有技术中的网页流程自动化方法中,需要对RPA进行整个操作流程的训练,当网页或应用场景变化时则需要重新进行训练,会花费大量时间,使得成本较高。As mentioned in the background technology of this application, in the web process automation method in the prior art, RPA needs to be trained on the entire operation process. When the web page or application scenario changes, the training needs to be re-trained, which takes a lot of time and makes the cost relatively high. high.
为了解决上述技术问题,本申请实施例提供了一种网页流程自动化方法,该方法包括:In order to solve the above technical problems, embodiments of the present application provide a web page process automation method, which method includes:
先获取网页插件接收的自然语言指令,所述自然语言指令中包括多类指令,然后对所述自然语言指令进行处理,得到多个指令键值对,根据所述自然语言指令解析所述网页的文档对象模型,得到所述网页中的可操作叶子节点,将所述多个指令键值对和所述可操作叶子节点输入预先训练好的智能体,以自动执行所述自然语言指令。如此,通过对网页获取的自然语言指令进行处理,得到需要智能体进行操作的叶子节点和指令键值对,使得智能体根据叶子节点和指令键值对实现网页流程自动化。由于键值对和叶子节点为网页流程操作中的简单元素,对于更换网页界面或者更换应用环境的情况下,训练好的智能体仍能通过操作对自然语言指令进行解析得到指令键值对和可操作叶子节点进而实现网页流程自动化,减少在不同应用场景下重新部署RPA,降低开发成本。First obtain the natural language instructions received by the web page plug-in, which include multiple types of instructions, then process the natural language instructions to obtain multiple instruction key-value pairs, and parse the web page according to the natural language instructions. The document object model obtains the operable leaf nodes in the web page, and inputs the multiple instruction key-value pairs and the operable leaf nodes into the pre-trained agent to automatically execute the natural language instructions. In this way, by processing the natural language instructions obtained from the web page, the leaf nodes and instruction key-value pairs that need to be operated by the agent are obtained, so that the agent can realize the automation of the web page process based on the leaf nodes and instruction key-value pairs. Since key-value pairs and leaf nodes are simple elements in web page process operations, when the web interface is changed or the application environment is changed, the trained agent can still parse the natural language instructions through operations to obtain the instruction key-value pairs and available Operate leaf nodes to automate web page processes, reduce redeployment of RPA in different application scenarios, and reduce development costs.
需要说明的是,本申请所涉及的用户信息(包括但不限于用户设备信息、用户个人信息等)和数据(包括但不限于用于分析的数据、存储的数据、展示的数据等),均为经用户授权或者经过各方充分授权的信息和数据,且相关数据的收集、使用和处理需要遵守相关国家和地区的相关法律法规和标准。It should be noted that the user information (including but not limited to user equipment information, user personal information, etc.) and data (including but not limited to data used for analysis, stored data, displayed data, etc.) involved in this application are all It is information and data authorized by the user or fully authorized by all parties, and the collection, use and processing of relevant data need to comply with the relevant laws, regulations and standards of relevant countries and regions.
为了使本申请实施例的目的、技术方案和优点更加清楚,下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。In order to make the purpose, technical solutions and advantages of the embodiments of the present application clearer, the technical solutions in the embodiments of the present application will be clearly and completely described below in conjunction with the drawings in the embodiments of the present application. Obviously, the described embodiments These are only some of the embodiments of this application, not all of them. Based on the embodiments in this application, all other embodiments obtained by those of ordinary skill in the art without creative efforts fall within the scope of protection of this application.
图1为本申请实施例提供的一种网页流程自动化方法的流程图。结合图1所示,本申请实施例提供的网页流程自动化方法可以包括:Figure 1 is a flow chart of a web page process automation method provided by an embodiment of the present application. As shown in Figure 1, the web page process automation method provided by the embodiment of the present application may include:
S101、获取网页插件接收的自然语言指令,所述自然语言指令中包括多类指令。S101. Obtain the natural language instructions received by the web page plug-in. The natural language instructions include multiple types of instructions.
网页插件又称浏览器插件,用于扩展浏览器的功能、丰富浏览体验、满足不同的功能需求。本申请实施例中所述网页插件在启动后,会在前端网页上显示一个文本框,通过所述文本框可以输入自然语言指令,例如:用户输入的自然语言指令为:“帮我下载最近三个月的流水账单”。Web plug-ins, also known as browser plug-ins, are used to extend browser functions, enrich browsing experience, and meet different functional requirements. After the web page plug-in in the embodiment of this application is started, a text box will be displayed on the front-end web page, and a natural language instruction can be input through the text box. For example: the natural language instruction input by the user is: "Help me download the last three Monthly bills."
自然语言是指汉语、英语、法语等日常使用的语言,是人类社会发展演变而来的语言,而不是人造的语言,它是人类学习生活的重要工具。概括说来,自然语言是指人类社会约定俗成的,区别于如程序设计的语言的人工语言。Natural language refers to languages used daily such as Chinese, English, and French. It is a language that evolved from the development of human society, rather than a man-made language. It is an important tool for human beings to learn and live. In summary, natural language refers to artificial language that is conventional in human society and is different from language such as programming.
其中所述自然语言指令包括多种类型,具体的可以包括下载类指令、填写类指令和搜索类指令。所述下载类指令具体为下载某类文件;所述填写类指令例如填写某类表单;所述搜索类指令指查找某项内容,如数据或文件等。The natural language instructions include multiple types, specifically, they may include download instructions, fill-in instructions, and search instructions. The downloading instruction specifically refers to downloading a certain type of file; the filling instruction refers to filling in a certain form; and the searching instruction refers to searching for a certain content, such as data or files.
S102、对所述自然语言指令进行处理,得到多个指令键值对。S102. Process the natural language instructions to obtain multiple instruction key-value pairs.
在通过前端的网页插件得到自然语言指令后,将所述自然语言指令传输到后端进行处理,所述处理过程具体为通过自然语言模型对所述自然语言指令进行识别。After obtaining the natural language instructions through the front-end web page plug-in, the natural language instructions are transmitted to the back-end for processing. The processing process specifically includes identifying the natural language instructions through a natural language model.
自然语言处理模型是指能够进行自然语言处理的模型,自然语言处理(NaturalLanguage Processing,简称NLP)是一种用计算机来处理、理解以及运用人类语言的手段,属于人工智能的一个分支。自然语言处理模型能够对自然语言的形、音、义等信息进行处理,即对字、词、句、篇章的输入、输出、识别、分析、理解、生成等的操作和加工,实现人机间的信息交流。Natural language processing model refers to a model that can perform natural language processing. Natural Language Processing (NLP) is a method of using computers to process, understand and use human language. It is a branch of artificial intelligence. The natural language processing model can process the form, pronunciation, meaning and other information of natural language, that is, the input, output, recognition, analysis, understanding, generation, etc. of characters, words, sentences, and chapters, to achieve human-computer interaction. exchange of information.
指令键值对是指对所述自然语言指令进行处理后得到的有关于指令的键值对,键值对(key:value)是一种简单的对应关系,键后面对应着相应的值,例如:{时间:三个月},其中“时间”为键,“三个月”为对应的值,组合为一个键值对。具体的,对自然语言指令进行处理可以得到多个键值对,例如:用户输入的自然语言指令为:“帮我下载最近三个月的流水账单”,经过自然语言处理模型可生成“{时间:三个月,文档类型:流水账单,命令:下载}”这样的键值对。The instruction key-value pair refers to the key-value pair related to the instruction obtained after processing the natural language instruction. The key-value pair (key: value) is a simple correspondence relationship, and the key corresponds to the corresponding value, for example : {Time: three months}, where "time" is the key and "three months" is the corresponding value, which is combined into a key-value pair. Specifically, multiple key-value pairs can be obtained by processing natural language instructions. For example, the natural language instruction input by the user is: "Help me download the bills of the last three months." After the natural language processing model, "{time" can be generated. : three months, document type: running bill, command: download}" such key-value pairs.
S103、根据所述自然语言指令解析所述网页的文档对象模型,得到所述网页中的可操作叶子节点。S103. Parse the document object model of the web page according to the natural language instruction to obtain operable leaf nodes in the web page.
文档对象模型(Document Object Model,简称DOM)是一个网络文档的编程接口。DOM将文档表示为节点和对象,以通过编程语言实现与页面之间的交互。一般来说,将页面中所有呈现的内容,作为DOM文档中的节点(node),例如:元素标签是元素节点、注释的内容是注释节点、文本内容是文本节点、document是文档节点等。本申请中将DOM中的可操作控件作为叶子节点,所述可操作叶子节点包括网页中的文本框、按钮、下拉框和复选框中的一种或多种。Document Object Model (DOM for short) is a programming interface for network documents. DOM represents documents as nodes and objects to enable interaction with the page through programming languages. Generally speaking, all the content presented on the page is regarded as a node in the DOM document. For example, element labels are element nodes, annotated content is annotation nodes, text content is text nodes, document is a document node, etc. In this application, operable controls in the DOM are used as leaf nodes, and the operable leaf nodes include one or more of text boxes, buttons, drop-down boxes, and check boxes in web pages.
在不同应用场景下的可操作叶子节点不同,例如在自然语言指令为下载类指令时,所述可操作叶子节点包括网页中的文本框、按钮、下拉框;当所述自然语言指令为填写类指令时,所述可操作叶子节点包括网页中的文本框、按钮、下拉框和复选框。The operable leaf nodes are different in different application scenarios. For example, when the natural language instruction is a download type instruction, the operable leaf nodes include text boxes, buttons, and drop-down boxes in the web page; when the natural language instruction is a fill-in type instruction When commanding, the operable leaf nodes include text boxes, buttons, drop-down boxes and check boxes in the web page.
S104、将所述多个指令键值对和所述可操作叶子节点输入预先训练好的智能体,以自动执行所述自然语言指令。S104. Input the plurality of instruction key-value pairs and the operable leaf nodes into a pre-trained agent to automatically execute the natural language instruction.
预先训练好的智能体在获得指令键值对和可操作叶子节点后,自动执行对应的动作,从而完成网页作业。After obtaining the instruction key-value pair and operable leaf nodes, the pre-trained agent automatically performs the corresponding actions to complete the web page job.
所述智能体为基于强化学习的智能体,所述智能体经过训练后学会了各种原子任务例如点击按钮、点击下拉框、选择复选框、输入文本等操作,而任何网页作业流程,例如下载文件,填报表单等都可以拆分为这些原子任务的组合。只要训练智能体学会处理原子任务,那么就可以通过强化学习泛化到其他复杂作业流程。另外,由于强化学习智能体直接在网页DOM元素上进行操作,因此对UI的位置和形状等属性描述不敏感,从而具有极高的样式泛化能力。The agent is an agent based on reinforcement learning. After training, the agent has learned various atomic tasks such as clicking buttons, clicking drop-down boxes, selecting check boxes, entering text, etc., and any web page operation process, such as Downloading files, filling out forms, etc. can all be broken down into a combination of these atomic tasks. As long as the agent is trained to handle atomic tasks, it can be generalized to other complex workflows through reinforcement learning. In addition, because the reinforcement learning agent operates directly on the DOM elements of the web page, it is not sensitive to attribute descriptions such as the position and shape of the UI, and thus has extremely high style generalization capabilities.
对所述智能体进行训练包括:将循环神经网络(Rerrent Neural Network,RNN)作为所述智能体的核心网络,以指令键值对作为输入、可操作叶子节点作为状态空间、点击文本框和输入自然语言文字为动作空间对所述智能体进行训练。为了能让强化学习智能体识别各种DOM元素对应的UI组件类别(即按钮、文本框、下拉框等),并学会对齐指令键值对与可操作叶子节点,在训练阶段需要收集不同网页关于按钮、下拉框、复选框等UI组件的DOM描述的数据集,并对这些数据集进行标注。例如,一个搜索按钮,DOM既可以表示为“<buttontype="search">搜索</button>”,也可以表示为“<a href="#"class="button">搜索</a>”,它们都被标注为按钮,数据集越丰富训练出的智能体泛化性越强。然后基于这些DOM数据集反向生成仿真环境。在仿真环境中训练智能体直到它能够仅根据DOM描述来判断出UI组件类别。Training the agent includes: using Rerrent Neural Network (RNN) as the core network of the agent, using instruction key-value pairs as input, operable leaf nodes as the state space, clicking on the text box and input Natural language text is used to train the agent in the action space. In order to enable the reinforcement learning agent to identify the UI component categories corresponding to various DOM elements (i.e. buttons, text boxes, drop-down boxes, etc.) and learn to align command key-value pairs with operable leaf nodes, it is necessary to collect information about different web pages during the training phase. Data sets described by the DOM of UI components such as buttons, drop-down boxes, and check boxes, and these data sets are annotated. For example, a search button, the DOM can be represented as "<buttontype="search">Search</button>" or "<a href="#"class="button">Search</a>" , they are all marked as buttons. The richer the data set, the stronger the generalization of the trained agent. Then the simulation environment is generated reversely based on these DOM data sets. The agent is trained in the simulation environment until it can determine the UI component category based only on the DOM description.
由于经过强化学习的智能体是通过判断当前环境的状态(state)来采取动作(action)的,在网页作业流程中,状态(state)就是网页的DOM信息,动作(action)则是点击、输入、拖拽等操作。因此当网页出现异常情况时,智能体通过判断网页DOM元素的变化来决定输出的行为。如果网页异常并未影响到关键的可操作DOM叶子节点,则作业流程不会就此中断,由此达到了可以应对异常网络情况的效果。Since the agent that has undergone reinforcement learning takes action by judging the state of the current environment, in the web page operation process, the state is the DOM information of the web page, and the action is click or input. , drag and drop and other operations. Therefore, when an abnormal situation occurs on the web page, the agent determines the output behavior by judging the changes in the DOM elements of the web page. If the web page exception does not affect the key operable DOM leaf nodes, the job process will not be interrupted, thus achieving the effect of being able to cope with abnormal network conditions.
此外,本申请实施例中所述的基于强化学习训练的智能体可以快速适应新网页并完成相应指令。由于强化学习智能体在训练时是以网页DOM元素为输入,因此并不会受到网页布局、结构、颜色、样式等因素的影响,而仅与可操作叶子节点相关,因此可以快速适应视觉风格不同的网页界面,避免了由于网页中按钮或者输入框样式或位置发生变化后智能体无法识别的情况。In addition, the agent trained based on reinforcement learning described in the embodiments of this application can quickly adapt to new web pages and complete corresponding instructions. Since the reinforcement learning agent uses web page DOM elements as input during training, it will not be affected by factors such as web page layout, structure, color, style, etc., but is only related to operable leaf nodes, so it can quickly adapt to different visual styles. The web interface avoids the situation where the intelligent agent cannot recognize the button or input box style or position due to changes in the web page.
在本申请实施例的一种实现方式中,所述获取网页插件接收的自然语言指令包括:获取预设时间段内接收到的自然语言指令,按照接收的时间先后顺序对所述预设时间段内接收到的自然语言指令进行存储。In an implementation manner of the embodiment of the present application, obtaining the natural language instructions received by the web page plug-in includes: obtaining the natural language instructions received within a preset time period, and processing the preset time period in the order of the time of receipt. The natural language instructions received within are stored.
当有多项任务需要通过流程自动化实现时,可以一次性获取多个任务指令,例如下载a用户一年内的流水账单;下载b用户3个月内的流水账单,然后按照获取指令的时间顺序,将获取的指令存储,按照顺序依次完成指令。为了保证能够按照指令的先后顺序完成,可以按照时间顺序对指令进行存储。When there are multiple tasks that need to be realized through process automation, multiple task instructions can be obtained at one time, such as downloading user a's bills for one year; downloading user b's bills for three months, and then follow the chronological order of obtaining instructions. Store the obtained instructions and complete the instructions in sequence. In order to ensure that the instructions can be completed in the order they appear, instructions can be stored in chronological order.
在本申请实施例的一种实现方式中,当所述自然语言指令为下载类指令时,所述下载类指令中包括下载完成后的存储地址,所述方法还包括:In an implementation manner of the embodiment of the present application, when the natural language instruction is a download instruction, the download instruction includes the storage address after the download is completed, and the method further includes:
完成所述下载类指令后,按照所述存储地址存储所述下载指令中包括的下载内容。After completing the download instruction, the download content included in the download instruction is stored according to the storage address.
所述下载类指令例如:“下载三个月内的流水账单并存储在桌面上”。则通过所述网页流程自动化方法,可以完成所述指令,当存储地址发生变化时,同样可以存储到目标地址。由于本申请中的智能体在进行训练时已经学会了操作网页中的叶子节点即小的组件如点击按钮、点击下拉框、选择复选框、输入文本等动作,因此无论是更换应用场景还是更改网页布局,智能体均可通过点击组件实现,相比于在不同场景下都需重新部署一次RPA,本申请实施例中所述方法可以降低开发成本。The downloading instruction is, for example: "Download the bills within three months and store them on the desktop." Then, through the web page process automation method, the instruction can be completed, and when the storage address changes, it can also be stored to the target address. Since the agent in this application has learned to operate the leaf nodes in the web page, that is, small components, such as clicking buttons, clicking drop-down boxes, selecting check boxes, entering text, etc., whether it is changing the application scenario or changing Web page layout and agents can be realized through click components. Compared with the need to redeploy RPA once in different scenarios, the method described in the embodiment of this application can reduce development costs.
本申请上述例提供了一种网页流程自动化方法,该方法包括:先获取网页插件接收的自然语言指令,所述自然语言指令中包括多类指令,然后对所述自然语言指令进行处理,得到多个指令键值对,根据所述自然语言指令解析所述网页的文档对象模型,得到所述网页中的可操作叶子节点,将所述多个指令键值对和所述可操作叶子节点输入预先训练好的智能体,以自动执行所述自然语言指令。如此,通过对网页获取的自然语言指令进行处理,得到需要智能体进行操作的叶子节点和指令键值对,使得智能体根据叶子节点和指令键值对实现网页流程自动化。由于键值对和叶子节点为网页流程操作中的简单元素,对于更换网页界面或者更换应用环境的情况下,训练好的智能体仍能通过操作对自然语言指令进行解析得到指令键值对和可操作叶子节点进而实现网页流程自动化,减少在不同应用场景下重新部署RPA,降低开发成本。The above examples of this application provide a web page process automation method. The method includes: first obtaining the natural language instructions received by the web page plug-in. The natural language instructions include multiple types of instructions, and then processing the natural language instructions to obtain multiple types of instructions. instruction key-value pairs, parse the document object model of the web page according to the natural language instruction, obtain operable leaf nodes in the web page, and input the plurality of instruction key-value pairs and the operable leaf nodes in advance The intelligent agent is trained to automatically execute the natural language instructions. In this way, by processing the natural language instructions obtained from the web page, the leaf nodes and instruction key-value pairs that need to be operated by the agent are obtained, so that the agent can realize the automation of the web page process based on the leaf nodes and instruction key-value pairs. Since key-value pairs and leaf nodes are simple elements in web page process operations, when the web interface is changed or the application environment is changed, the trained agent can still parse the natural language instructions through operations to obtain the instruction key-value pairs and available Operate leaf nodes to automate web page processes, reduce redeployment of RPA in different application scenarios, and reduce development costs.
以上为本申请实施例提供的一种网页流程自动化方法的一些具体实现方式,基于此,本申请还提供了对应的装置。下面将从功能模块化的角度对本申请实施例提供的装置进行介绍。The above are some specific implementations of a web page process automation method provided by embodiments of the present application. Based on this, the present application also provides corresponding devices. The following will introduce the device provided by the embodiment of the present application from the perspective of functional modularization.
图2为本申请实施例提供的一种网页流程自动化装置的结构示意图。结合图2所示,本申请实施例提供的网页流程自动化装置200,包括:FIG. 2 is a schematic structural diagram of a web page process automation device provided by an embodiment of the present application. As shown in FIG. 2 , the web page process automation device 200 provided by the embodiment of the present application includes:
获取单元210,用于获取网页插件接收的自然语言指令,所述自然语言指令中包括多类指令;The acquisition unit 210 is used to acquire the natural language instructions received by the web page plug-in, where the natural language instructions include multiple types of instructions;
第一处理单元220,用于对所述自然语言指令进行处理,得到多个指令键值对;The first processing unit 220 is used to process the natural language instructions to obtain multiple instruction key-value pairs;
第二处理单元230,用于根据所述自然语言指令解析所述网页的文档对象模型,得到所述网页中的可操作叶子节点;The second processing unit 230 is configured to parse the document object model of the web page according to the natural language instruction to obtain operable leaf nodes in the web page;
输入单元240,用于将所述多个指令键值对和所述可操作叶子节点输入预先训练好的智能体,以自动执行所述自然语言指令。The input unit 240 is used to input the plurality of instruction key-value pairs and the operable leaf nodes into a pre-trained agent to automatically execute the natural language instructions.
在本申请实施例的一种实现方式中,所述装置还包括训练单元,用于将循环神经网络作为所述智能体的核心网络,以指令键值对作为输入、可操作叶子节点作为状态空间、点击文本框和输入自然语言文字为动作空间对所述智能体进行训练。In an implementation manner of the embodiment of the present application, the device further includes a training unit for using a recurrent neural network as the core network of the agent, using instruction key-value pairs as input and operable leaf nodes as the state space. , click the text box and input natural language text to train the intelligent agent in the action space.
在本申请实施例的一种实现方式中,所述获取单元具体用于:In an implementation manner of the embodiment of the present application, the acquisition unit is specifically used to:
获取预设时间段内接收到的自然语言指令,按照接收的时间先后顺序对所述预设时间段内接收到的自然语言指令进行存储。Obtain the natural language instructions received within the preset time period, and store the natural language instructions received within the preset time period in the order of reception time.
在本申请实施例的一种实现方式中,所述装置还包括存储单元,当所述自然语言指令为下载类指令时,所述下载类指令中包括下载完成后的存储地址,当完成所述下载类指令后,所述存储单元用于按照所述存储地址存储所述下载指令中包括的下载内容。In an implementation manner of the embodiment of the present application, the device further includes a storage unit. When the natural language instruction is a download instruction, the download instruction includes the storage address after the download is completed. After downloading the instruction, the storage unit is used to store the download content included in the download instruction according to the storage address.
在本申请实施例的一种实现方式中,所述可操作叶子节点包括:In an implementation manner of the embodiment of this application, the operable leaf nodes include:
网页中的文本框、按钮、下拉框和复选框中的一种或多种。One or more of text boxes, buttons, drop-down boxes, and check boxes in web pages.
本申请实施例还提供了对应的设备以及计算机存储介质,用于实现本申请实施例提供的方案。The embodiments of this application also provide corresponding equipment and computer storage media for implementing the solution provided by the embodiments of this application.
如图3所示,计算机设备01以通用计算设备的形式表现。计算机设备01的组件可以包括但不限于:一个或者多个处理器或者处理单元03,系统存储器08,连接不同系统组件(包括系统存储器08和处理单元03)的总线04。As shown in Figure 3, computer device 01 is embodied in the form of a general computing device. The components of the computer device 01 may include, but are not limited to: one or more processors or processing units 03, a system memory 08, and a bus 04 connecting different system components (including the system memory 08 and the processing unit 03).
总线04表示几类总线结构中的一种或多种,包括存储器总线或者存储器控制器,外围总线,图形加速端口,处理器或者使用多种总线结构中的任意总线结构的局域总线。举例来说,这些体系结构包括但不限于工业标准体系结构(ISA)总线,微通道体系结构(MAC)总线,增强型ISA总线、视频电子标准协会(VESA)局域总线以及外围组件互连(PCI)总线。Bus 04 represents one or more of several types of bus structures, including a memory bus or memory controller, a peripheral bus, a graphics acceleration port, a processor, or a local bus using any of a variety of bus structures. For example, these architectures include, but are not limited to, the Industry Standard Architecture (ISA) bus, the Micro Channel Architecture (MAC) bus, the Enhanced ISA bus, the Video Electronics Standards Association (VESA) local bus, and the Peripheral Component Interconnect ( PCI) bus.
计算机设备01典型地包括多种计算机系统可读介质。这些介质可以是任何能够被计算机设备01访问的可用介质,包括易失性和非易失性介质,可移动的和不可移动的介质。Computer device 01 typically includes a variety of computer system readable media. These media can be any available media that can be accessed by computer device 01, including volatile and non-volatile media, removable and non-removable media.
系统存储器08可以包括易失性存储器形式的计算机系统可读介质,例如随机存取存储器(RAM)09和/或高速缓存存储器10。计算机设备01可以进一步包括其它可移动/不可移动的、易失性/非易失性计算机系统存储介质。仅作为举例,存储系统11可以用于读写不可移动的、非易失性磁介质(图3未显示,通常称为“硬盘驱动器”)。尽管图3中未示出,可以提供用于对可移动非易失性磁盘(例如“软盘”)读写的磁盘驱动器,以及对可移动非易失性光盘(例如CD-ROM,DVD-ROM或者其它光介质)读写的光盘驱动器。在这些情况下,每个驱动器可以通过一个或者多个数据介质接口与总线04相连。存储器08可以包括至少一个程序产品,该程序产品具有一组(例如至少一个)程序模块,这些程序模块被配置以执行本申请各实施例的功能。System memory 08 may include computer system readable media in the form of volatile memory, such as random access memory (RAM) 09 and/or cache memory 10 . Computer device 01 may further include other removable/non-removable, volatile/non-volatile computer system storage media. By way of example only, storage system 11 may be used to read and write to non-removable, non-volatile magnetic media (not shown in Figure 3, commonly referred to as a "hard drive"). Although not shown in FIG. 3, a disk drive may be provided for reading and writing to removable non-volatile disks (e.g., "floppy disks"), and for removable non-volatile optical disks (e.g., CD-ROM, DVD-ROM or other optical media) that can read and write optical disc drives. In these cases, each drive can be connected to bus 04 via one or more data media interfaces. The memory 08 may include at least one program product having a set of (eg, at least one) program modules configured to perform the functions of various embodiments of the present application.
具有一组(至少一个)程序模块13的程序/实用工具12,可以存储在例如存储器08中,这样的程序模块13包括但不限于操作系统、一个或者多个应用程序、其它程序模块以及程序数据,这些示例中的每一个或某种组合中可能包括网络环境的实现。程序模块13通常执行本申请所描述的实施例中的功能和/或方法。A program/utility 12 having a set of (at least one) program modules 13 including, but not limited to, an operating system, one or more application programs, other program modules, and program data, may be stored, for example, in memory 08 , each of these examples or some combination may include the implementation of a network environment. The program module 13 generally performs the functions and/or methods in the embodiments described in this application.
计算机设备01也可以与一个或多个外部设备02(例如键盘、指向设备、显示器07等)通信,还可与一个或者多个使得用户能与该计算机设备01交互的设备通信,和/或与使得该计算机设备01能与一个或多个其它计算设备进行通信的任何设备(例如网卡,调制解调器等等)通信。这种通信可以通过输入/输出(I/O)接口06进行。并且,计算机设备01还可以通过网络适配器05与一个或者多个网络(例如局域网(LAN),广域网(WAN)和/或公共网络,例如因特网)通信。如图3所示,网络适配器05通过总线04与计算机设备01的其它模块通信。应当明白,尽管图3中未示出,可以结合计算机设备01使用其它硬件和/或软件模块,包括但不限于:微代码、设备驱动器、冗余处理单元、外部磁盘驱动阵列、RAID系统、磁带驱动器以及数据备份存储系统等。Computer device 01 may also communicate with one or more external devices 02 (e.g., keyboard, pointing device, display 07, etc.), with one or more devices that enable a user to interact with computer device 01, and/or with Any device (such as a network card, modem, etc.) that enables the computer device 01 to communicate with one or more other computing devices. This communication can occur through the input/output (I/O) interface 06. Furthermore, the computer device 01 may also communicate with one or more networks (eg, a local area network (LAN), a wide area network (WAN), and/or a public network, such as the Internet) through the network adapter 05. As shown in Figure 3, network adapter 05 communicates with other modules of computer device 01 via bus 04. It should be understood that, although not shown in Figure 3, other hardware and/or software modules may be used in conjunction with computer device 01, including but not limited to: microcode, device drivers, redundant processing units, external disk drive arrays, RAID systems, tapes drives and data backup storage systems, etc.
处理器单元03通过运行存储在系统存储器08中的程序,从而执行各种功能应用以及数据处理,例如实现本申请实施例提供的方法。The processor unit 03 executes various functional applications and data processing by running programs stored in the system memory 08 , for example, implementing the methods provided by the embodiments of the present application.
通过以上的实施方式的描述可知,本领域的技术人员可以清楚地了解到上述实施例方法中的全部或部分步骤可借助软件加通用硬件平台的方式来实现。基于这样的理解,本申请的技术方案可以以软件产品的形式体现出来,该计算机软件产品可以存储在存储介质中,如只读存储器(英文:read-only memory,ROM)/RAM、磁碟、光盘等,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者诸如路由器等网络通信设备)执行本申请各个实施例或者实施例的某些部分所述的方法。From the description of the above embodiments, those skilled in the art can clearly understand that all or part of the steps in the methods of the above embodiments can be implemented by means of software plus a general hardware platform. Based on this understanding, the technical solution of the present application can be embodied in the form of a software product. The computer software product can be stored in a storage medium, such as read-only memory (English: read-only memory, ROM)/RAM, disk, Optical disc, etc., including a number of instructions to cause a computer device (which can be a personal computer, a server, or a network communication device such as a router) to execute the methods described in various embodiments or certain parts of the embodiments of this application.
需要说明的是,在本文中,诸如第一和第二等之类的关系术语仅仅用来将一个实体或者操作与另一个实体或操作区分开来,而不一定要求或者暗示这些实体或操作之间存在任何这种实际的关系或者顺序。而且,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者设备所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括所述要素的过程、方法、物品或者设备中还存在另外的相同要素。It should be noted that in this article, relational terms such as first and second are only used to distinguish one entity or operation from another entity or operation, and do not necessarily require or imply that these entities or operations are mutually exclusive. any such actual relationship or sequence exists between them. Furthermore, the terms "comprises," "comprises," or any other variations thereof are intended to cover a non-exclusive inclusion such that a process, method, article, or apparatus that includes a list of elements includes not only those elements, but also those not expressly listed other elements, or elements inherent to the process, method, article or equipment. Without further limitation, an element defined by the statement "comprises a..." does not exclude the presence of additional identical elements in a process, method, article, or apparatus that includes the stated element.
还需要说明的是,本说明书中的各个实施例均采用递进的方式描述,各个实施例之间相同相似的部分互相参见即可,每个实施例重点说明的都是与其他实施例的不同之处。尤其,对于设备及系统实施例而言,由于其基本相似于方法实施例,所以描述得比较简单,相关之处参见方法实施例的部分说明即可。以上所描述的设备及系统实施例仅仅是示意性的,其中作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元提示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部模块来实现本实施例方案的目的。本领域普通技术人员在不付出创造性劳动的情况下,即可以理解并实施。It should also be noted that each embodiment in this specification is described in a progressive manner. The same and similar parts between various embodiments can be referred to each other. Each embodiment focuses on the differences from other embodiments. place. In particular, the device and system embodiments are described simply because they are basically similar to the method embodiments. For relevant details, please refer to the partial description of the method embodiments. The device and system embodiments described above are only illustrative. The units described as separate components may or may not be physically separated. The components indicated as units may or may not be physical units, that is, they may be located in One location, or it can be distributed across multiple network units. Some or all of the modules can be selected according to actual needs to achieve the purpose of the solution of this embodiment. Persons of ordinary skill in the art can understand and implement the method without any creative effort.
以上所述,仅为本申请的一种具体实施方式,但本申请的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本申请揭露的技术范围内,可轻易想到的变化或替换,都应涵盖在本申请的保护范围之内。因此,本申请的保护范围应该以权利要求的保护范围为准。The above is only a specific implementation mode of the present application, but the protection scope of the present application is not limited thereto. Any person familiar with the technical field can easily think of changes or modifications within the technical scope disclosed in the present application. Replacements shall be covered by the protection scope of this application. Therefore, the protection scope of this application should be subject to the protection scope of the claims.
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311498917.4A CN117390336A (en) | 2023-11-10 | 2023-11-10 | A web page process automation method, device, equipment and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311498917.4A CN117390336A (en) | 2023-11-10 | 2023-11-10 | A web page process automation method, device, equipment and storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN117390336A true CN117390336A (en) | 2024-01-12 |
Family
ID=89468265
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202311498917.4A Pending CN117390336A (en) | 2023-11-10 | 2023-11-10 | A web page process automation method, device, equipment and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN117390336A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117634867A (en) * | 2024-01-26 | 2024-03-01 | 杭州实在智能科技有限公司 | RPA flow automatic construction method and system combining large language model and reinforcement learning |
CN118426874A (en) * | 2024-07-03 | 2024-08-02 | 科大讯飞股份有限公司 | Intelligent proxy method, device and system |
-
2023
- 2023-11-10 CN CN202311498917.4A patent/CN117390336A/en active Pending
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117634867A (en) * | 2024-01-26 | 2024-03-01 | 杭州实在智能科技有限公司 | RPA flow automatic construction method and system combining large language model and reinforcement learning |
CN117634867B (en) * | 2024-01-26 | 2024-05-24 | 杭州实在智能科技有限公司 | RPA flow automatic construction method and system combining large language model and reinforcement learning |
CN118426874A (en) * | 2024-07-03 | 2024-08-02 | 科大讯飞股份有限公司 | Intelligent proxy method, device and system |
CN118426874B (en) * | 2024-07-03 | 2024-10-25 | 科大讯飞股份有限公司 | Intelligent proxy method, device and system |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2022062798A1 (en) | Rpa and ai-based table information extraction method and apparatus, device and medium | |
Jones | Programming productivity | |
Abbasnasab Sardareh et al. | Comparing four contemporary statistical software tools for introductory data science and statistics in the social sciences | |
US11232134B2 (en) | Customized visualization based intelligence augmentation | |
CN117390336A (en) | A web page process automation method, device, equipment and storage medium | |
Ahasanuzzaman et al. | CAPS: a supervised technique for classifying Stack Overflow posts concerning API issues | |
US20240345808A1 (en) | Intelligent and predictive modules for software development and coding using artificial intelligence and machine learning | |
US8707250B2 (en) | Automation support for domain modeling | |
Malik et al. | Back to origin: Transformation of business process models to business rules | |
Bao et al. | Inference of development activities from interaction with uninstrumented applications | |
US6931385B1 (en) | Interactive examples for online coding tutorials | |
CN111737472A (en) | Method and system for updating text classification model, electronic device and storage medium | |
WO2025092056A1 (en) | Question-and-answer data generation method and apparatus, and computer device and storage medium | |
CN115033436A (en) | Page testing method and device, electronic equipment and storage medium | |
US20230342383A1 (en) | Method and system for managing workflows for authoring data documents | |
Bordbar et al. | MDA and analysis of web applications | |
AU2020202730A1 (en) | Method and system for personalizing software based on real time tracking of voice-of-customer feedback | |
CN114579461A (en) | Browser compatibility detection method and related equipment | |
CN113887913A (en) | Efficiency evaluation method, device and electronic equipment combining RPA and AI | |
US20240303421A1 (en) | Predicate-guided preparation for llm integrations in spreadsheet environments | |
Hrendus et al. | Developing an Intelligent Online Learning System for Foreign Language Vocabulary Training Based on Gamification. | |
Malik et al. | A rule based approach for business rule generation from business process models | |
CN113050933B (en) | Brain graph data processing method, device, equipment and storage medium | |
JP7416665B2 (en) | Dialogue system and control method for dialogue system | |
George et al. | Enhanced model-driven web application development with code generation using deep learning technique |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |