CN116304098A

CN116304098A - Intelligent generation matching theme and information method based on commodity

Info

Publication number: CN116304098A
Application number: CN202310275371.XA
Authority: CN
Inventors: 周必奎
Original assignee: Hangzhou Tianzhi Information Technology Co ltd
Current assignee: Hangzhou Tianzhi Information Technology Co ltd
Priority date: 2023-03-15
Filing date: 2023-03-15
Publication date: 2023-06-23

Abstract

The invention discloses a method for intelligently generating matched topics and information based on commodities, and relates to the technical field of computers. In order to solve the problems of how to keep news generation fresh and making a search engine feel original in the prior art; a method for intelligently generating matched subjects and information based on commodities comprises the following steps: step one: inputting a theme; step two: outputting a basic article; step three: outputting multiple articles; by combining knowledge graph forms with the information of the existing commodity portraits, corresponding commodity libraries are expanded for each platform to respectively form commodity multi-mode knowledge graphs of each platform, information of the same product on different platforms is obtained, a generation model driven by the knowledge base is trained, the multi-mode knowledge graphs of the commodity of the knowledge base are combined, pictures and texts in details are input into the model at the same time, news of different styles are generated, and therefore the effects of keeping fresh and enabling a search engine to feel original are achieved.

Description

Intelligent generation matching theme and information method based on commodity

Technical Field

The invention relates to the technical field of computers, in particular to an intelligent generation matching theme and information method based on commodities.

Background

In the operation process of each platform, operators need to find pictures and contents related to commodities to write and release the contents every day, and the work is repeated and the effect is not high; regarding a method for intelligently generating matching information, related patents, such as chinese patent publication No. CN115298660a, disclose an artificial intelligence-based information matching method, apparatus, device, medium and program product, wherein the method comprises: acquiring information content; determining an information topic label of the information content based on a preset algorithm model; judging whether a target theme label matched with the information theme label exists in the first gallery or not, wherein the picture in the first gallery is marked with the corresponding theme label; if the first gallery has the target theme label matched with the information theme label, taking the picture corresponding to the target theme label as a cover map of the information content.

The above patent, while improving the efficiency and effectiveness of information mapping, still has the following problems:

in the prior art, the difficulty of how to keep news generation fresh and enable a search engine to feel original often exists, and when a generated model is trained, the simplest training mode is to send marked data into a language model for fine adjustment, but the method is high in cost, and the operation cost of each platform is greatly improved.

Disclosure of Invention

The invention aims to provide a method for intelligently generating matched topics and information based on commodities so as to solve the problems in the background technology.

In order to achieve the above purpose, the present invention provides the following technical solutions:

a method for intelligently generating matched topics and information based on commodities, the method comprising the following steps:

step one: theme input: determining an input theme, determining commodities based on the theme, matching the commodities with an existing product library and commodity images, inputting detailed data corresponding to the commodities into the existing product library and commodity images, matching the detailed data one by one, and drawing a knowledge graph;

acquiring commodity libraries corresponding to all the platforms, respectively forming commodity multi-mode knowledge maps of all the platforms, aligning the commodities in the commodity multi-mode knowledge maps, and establishing a generation model driven by the knowledge libraries;

step two: basic article output: crawling corresponding commodity soft texts from the acquired multi-mode knowledge patterns, and learning a writing method of the commodity soft texts based on the generated model, wherein the number of the commodity soft texts is at least two;

performing migration learning on the trained generative model by adopting a generative pre-training task, generating according to the generative model driven by a knowledge base, training a reward model, classifying the model into abcd grades, scoring the generated soft text, and manually selecting the optimal abcd by human intervention at the moment;

step three: multi-article output: and searching by combining a search engine, comparing the difference degree between generated news to serve as a loss function, extracting corresponding line text style characteristics and emotion characteristics in the commodity soft text through learning of a model layer, and generating news of different styles through prompt learning of Promp.

Further, based on the generated model driven by the knowledge base, when obtaining one commodity detail, combining the multi-mode knowledge graph of the commodity in the knowledge base, and simultaneously inputting the picture and the text in the commodity detail into the generated model.

Further, in the first step, the matching of the commodity with the existing product library and commodity portraits is specifically as follows:

acquiring actively input theme data from each platform terminal, acquiring a data identifier of the theme data, and inputting the data identifier of the theme data into a preset theme database for matching;

outputting basic data types of commodity data of the theme based on the matching result, wherein the preset theme database comprises basic data types of commodity names, commodity pictures and commodity text descriptions;

preprocessing commodity data based on basic data types of the commodity data to obtain corresponding matching results contained in the commodity data, wherein the matching results are not less than one;

establishing a corresponding commodity storage space based on the matching result, binding commodity detailed data of a commodity pre-stored detailed information database with commodities, and respectively acquiring pre-stored detailed commodity information corresponding to each commodity;

inputting the pre-stored detailed commodity information into a corresponding commodity storage space, establishing commodity information, and simultaneously, based on data associated parameters carried by commodities in the commodity storage space;

and extracting detailed commodity data of each commodity in the corresponding commodity storage space, combining corresponding data association parameters, acquiring association relations among different commodity storage spaces, and drawing a knowledge graph corresponding to the commodity based on the association relations.

Further, aiming at the commodity multi-mode knowledge graph formed in the first step, the method comprises the following steps:

acquiring commodity databases corresponding to all platforms, inputting data in the commodity storage space into the commodity databases, and classifying and storing based on basic data types of the data in the commodity storage space;

according to the commodity database corresponding to each platform, after the commodity database is stored, the associated information of the commodity is called, wherein the associated information comprises: similar commodity picture information, similar commodity description information and associated data characteristic information;

and generating a knowledge graph of each commodity according to the data of each commodity in the commodity database and the commodity corresponding to the association relation, and filling the data of the corresponding commodity into the knowledge graph of each commodity based on the association relation to obtain a commodity multi-mode knowledge graph.

Further, in the multi-mode knowledge graph of the commodity, when the data of any commodity is called, detailed information of the corresponding commodity is called according to the multi-mode knowledge graph of the commodity, and meanwhile, the related commodity under the related information of the commodity is independently presented in the form of the knowledge graph.

Further, for learning the writing method of the commodity soft text based on the generated model in the second step, the writing method specifically comprises the following steps:

crawling soft text data and parameter data in a period of time, and determining a writing weight value of each soft text according to node attribute and parameter data of each soft text data and a generated pre-training task;

creating a filtered word stock, and performing word segmentation on the text and the title of the soft text data to obtain a plurality of extracted words, and determining the word characteristics of each extracted word and the sentence characteristics of each extracted word in the text and the title;

determining the similarity between the extracted words based on the constructed synonym dictionary, the word characteristics of each extracted word and the sentence characteristics of each extracted word in the text and the title, and screening the words based on the similarity;

and cleaning words according to the part-of-speech statistical features to obtain a soft text main body, transversely mining soft text data based on the soft text main body, calculating writing features in each soft text by using writing weights, and determining a plurality of writing feature parameters based on the writing features.

Further, aiming at the search combined with the search engine in the third step, the difference degree between the generated news is compared as a loss function, and the specific process comprises the following steps:

text generation: from language model P _lm Sampling in (W), wherein w= [ W ] ₁ ...w _n ]Is a series of discrete words, P _1m (W) is a distribution about word sequences;

controllable text generation: sampling from a conditional distribution p (w|c), where c represents a control attribute.

Further, the controllable text generation further includes: and establishing a diffusion model, and generating controllable text attributes based on the diffusion model.

Compared with the prior art, the invention has the beneficial effects that:

1. the method comprises the steps of combining information of existing commodity portraits in a knowledge graph mode, expanding corresponding commodity libraries for each platform to form commodity multi-mode knowledge graphs of each platform, conducting entity alignment on commodities in the knowledge graphs to obtain information of the same product on different platforms, training a generation model driven by the knowledge bases, combining the multi-mode knowledge graphs of the commodities in the knowledge bases, inputting pictures and texts in details into the model at the same time, learning line style features and emotion features of news through learning of model layers, prompting learning through Promp, and generating news of different styles, so that the effects of keeping freshness and enabling a search engine to feel original are achieved.

2. The basic data type of commodity data of a theme is determined, the commodity data is preprocessed, the commodity detailed data of a commodity pre-stored detailed information database is utilized to bind with the commodity, the detailed information and the corresponding picture information of the commodity can be known, and the association characteristics of the commodity and similar commodities are determined by combining the corresponding data association parameters, so that the commodity and the related commodity are displayed more comprehensively, the basis is made for data analysis of the commodity and soft writing of the commodity, and the comprehensiveness of commodity data is improved.

3. The commodity display has a unified format through the knowledge graph, the commodity multi-mode knowledge graph can be formed only by filling the knowledge graph, the management of detailed data of the commodity is effectively improved, the simplicity of the commodity data in the knowledge graph is improved through displaying any commodity data in the knowledge graph, the sensitivity to main data is improved, people can obtain effective information in the commodity multi-mode knowledge graph more easily, and in addition, the whole and part of conversion is realized by retrieving the data of any commodity, so that the knowledge graph is convenient to display and observe.

4. By calculating the weight value of the soft text writing method for multiple times, different soft text writing method information is accurately mined and learned from mass data, the problems that part of information is easy to filter and the mining range is not comprehensive in the traditional data mining method can be solved, a large number of soft text templates are preset after the soft text writing method is learned, soft text is automatically generated based on the structural information of commodities, the problems that the traditional soft text editing is slow in timeliness and low in originality are solved, and the freshness and originality of the generated soft text are kept.

Drawings

FIG. 1 is a flow chart of a method for intelligently generating matching topics and information based on commodities according to the present invention.

Detailed Description

The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.

In order to solve the technical problem that in the prior art, how to keep news generation fresh and make a search engine feel original is often existed, referring to fig. 1, the present embodiment provides the following technical scheme:

step one: theme input: determining an input theme, determining commodities based on the theme, matching the commodities with an existing product library and commodity images, inputting detailed data corresponding to the commodities into the existing product library and commodity images, matching the detailed data one by one, and drawing a knowledge graph; acquiring commodity libraries corresponding to all the platforms, respectively forming commodity multi-mode knowledge maps of all the platforms, aligning the commodities in the commodity multi-mode knowledge maps, and establishing a generation model driven by the knowledge libraries; based on a generated model driven by a knowledge base, when obtaining a commodity detail, combining a multi-mode knowledge graph of the commodity in the knowledge base, and simultaneously inputting pictures and texts in the commodity detail into the generated model;

step two: basic article output: crawling corresponding commodity soft texts from the acquired multi-mode knowledge patterns, and learning a writing method of the commodity soft texts based on the generated model, wherein the number of the commodity soft texts is at least two; performing migration learning on the trained generative model by adopting a generative pre-training task, generating according to the generative model driven by a knowledge base, training a reward model, classifying the model into abcd grades, scoring the generated soft text, and manually selecting the optimal abcd by human intervention at the moment;

Specifically, the method comprises the steps of adding commodity details into the existing product library and commodity images through the existing product library, combining information of the existing commodity images through a knowledge graph mode, expanding corresponding commodity libraries for each platform, respectively forming commodity multi-mode knowledge graphs of each platform, conducting entity alignment on commodities in the knowledge graphs to obtain information of the same product on different platforms, training a generation model driven by the knowledge base, combining the multi-mode knowledge graphs of the commodities in the knowledge library, inputting pictures and texts in the details into the model at the same time, learning the character and emotion characteristics of news through learning of the model layer, and generating news of different styles through prompt learning, so that the effects of keeping freshness and enabling a search engine to feel original are achieved.

In order to solve the technical problem that in the prior art, since a large amount of commodity information and data need to be acquired, other commodities related to the commodity cannot be judged when the system extracts the commodity information, and writing of corresponding soft texts is affected, referring to fig. 1, the present embodiment provides the following technical scheme:

aiming at matching the commodity with the existing product library and commodity portraits in the first step, the method specifically comprises the following steps:

acquiring actively input theme data from each platform terminal, acquiring a data identifier of the theme data, and inputting the data identifier of the theme data into a preset theme database for matching; outputting basic data types of commodity data of the theme based on the matching result, wherein the preset theme database comprises basic data types of commodity names, commodity pictures and commodity text descriptions;

preprocessing commodity data based on basic data types of the commodity data to obtain corresponding matching results contained in the commodity data, wherein the matching results are not less than one; establishing a corresponding commodity storage space based on the matching result, binding commodity detailed data of a commodity pre-stored detailed information database with commodities, and respectively acquiring pre-stored detailed commodity information corresponding to each commodity;

inputting the pre-stored detailed commodity information into a corresponding commodity storage space, establishing commodity information, and simultaneously, based on data associated parameters carried by commodities in the commodity storage space; and extracting detailed commodity data of each commodity in the corresponding commodity storage space, combining corresponding data association parameters, acquiring association relations among different commodity storage spaces, and drawing a knowledge graph corresponding to the commodity based on the association relations.

Specifically, the basic data type of commodity data of a theme is determined, the commodity data is preprocessed, commodity detailed data of a commodity pre-stored detailed information database is utilized to bind with the commodity, detailed information and corresponding picture information of the commodity can be known, and the association characteristics of the commodity and similar commodities are determined by combining corresponding data association parameters, so that the commodity and the related commodities are displayed more comprehensively, the data analysis of the commodity is carried out subsequently, the basis is based on soft text writing of the commodity, and the comprehensiveness of the commodity data is improved.

In order to solve the technical problem that the structure is huge and the display of commodity center data cannot be performed when the knowledge graph displays a large amount of commodities and detailed data related to the commodities in the prior art, referring to fig. 1, the present embodiment provides the following technical scheme:

aiming at the commodity multi-mode knowledge graph which is formed in the first step and comprises the following steps:

acquiring commodity databases corresponding to all platforms, inputting data in the commodity storage space into the commodity databases, and classifying and storing based on basic data types of the data in the commodity storage space; according to the commodity database corresponding to each platform, after the commodity database is stored, the associated information of the commodity is called, wherein the associated information comprises: similar commodity picture information, similar commodity description information and associated data characteristic information; generating a knowledge graph of each commodity according to the data of each commodity in the commodity database and the commodity corresponding to the association relation, and filling the data of the corresponding commodity into the knowledge graph of each commodity based on the association relation to obtain a multi-mode knowledge graph of the commodity; in the commodity multi-mode knowledge graph, when the data of any commodity is called, the detailed information of the corresponding commodity is called according to the commodity multi-mode knowledge graph, and the related commodity under the commodity related information is independently presented in the form of the knowledge graph.

Specifically, the commodity display can be provided with a uniform format through the knowledge graph, the commodity multi-mode knowledge graph can be formed only by filling the knowledge graph, the management of detailed data of the commodity is effectively improved, the simplicity of the commodity data in the knowledge graph is improved through displaying any commodity data in the knowledge graph, the sensitivity of the main data is improved, people can obtain effective information in the commodity multi-mode knowledge graph more easily, and in addition, the whole and part of conversion is realized through retrieving the data of any commodity, so that the knowledge graph is convenient to present and observe.

In order to solve the technical problems of slow editing aging and low originality in the prior art, please refer to fig. 1, the present embodiment provides the following technical scheme:

aiming at the writing method for learning the commodity soft text based on the generated model in the second step, the method specifically comprises the following steps:

crawling soft text data and parameter data in a period of time, and determining a writing weight value of each soft text according to node attribute and parameter data of each soft text data and a generated pre-training task; creating a filtered word stock, and performing word segmentation on the text and the title of the soft text data to obtain a plurality of extracted words, and determining the word characteristics of each extracted word and the sentence characteristics of each extracted word in the text and the title;

determining the similarity between the extracted words based on the constructed synonym dictionary, the word characteristics of each extracted word and the sentence characteristics of each extracted word in the text and the title, and screening the words based on the similarity; and cleaning words according to the part-of-speech statistical features to obtain a soft text main body, transversely mining soft text data based on the soft text main body, calculating writing features in each soft text by using writing weights, and determining a plurality of writing feature parameters based on the writing features.

Specifically, by extracting keywords from mass data and calculating the weight value of the soft text writing method for multiple times, different soft text writing method information is accurately mined and learned from the mass data, the problem that part of information is easy to filter and the mining range is incomplete in a traditional data mining method can be solved, a large number of soft text templates are preset after the soft text writing method is learned, soft text is automatically generated based on structural information of commodities, the problems that the traditional soft text editing time is slow and the originality is low are solved, and the freshness and originality of the generated soft text are kept.

In order to solve the technical problem that in the prior art, the simplest training method is to send the markup data into the language model for fine tuning, but the cost of the method is expensive, referring to fig. 1, the present embodiment provides the following technical scheme:

aiming at the search combined with a search engine in the third step, comparing the difference degree between generated news as a loss function, wherein the specific process comprises the following steps:

text generation: from language model P _1m Sampling in (W), wherein w= [ W ] ₁ ...w _n ]Is a series of discrete words, P _lm (W) is a distribution about word sequences; controllable text generation: sampling from a conditional distribution p (w|c), where c represents the control property, namely: syntax parsing tree in syntax control or expected emotion in emotion control; the controllable text generation further includes: establishing a diffusion model, and generating controllable text attributes based on the diffusion model;

in this embodiment, a plug-and-play controllable generation: p (w|c) ≡P _lm (W)*p(c|w)。

Specifically, the text is generated by sampling from the language model, the controllable text is generated by sampling from the condition distribution, the attribute control is completed while the fluency is effectively ensured, meanwhile, the plug-and-play controllable generation method is derived, the output of the preview model is controlled, the traditional method of sending the mark data into the language model for fine tuning is avoided, the cost is greatly reduced, the search engine can be quickly combined for searching, and the difference degree between generated news is compared as a loss function.

The foregoing is only a preferred embodiment of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art, who is within the scope of the present invention, should be covered by the protection scope of the present invention by making equivalents and modifications to the technical solution and the inventive concept thereof.

Claims

1. A method for intelligently generating matched subjects and information based on commodities is characterized in that: the method comprises the following steps:

2. The intelligent commodity-based method for generating matching topics and information according to claim 1, wherein: and based on the generated model driven by the knowledge base, when one commodity detail is obtained, combining the multi-mode knowledge graph of the commodity in the knowledge base, and simultaneously inputting the picture and the text in the commodity detail into the generated model.

3. The intelligent commodity-based method for generating matching topics and information as claimed in claim 2, wherein: aiming at matching the commodity with the existing product library and commodity portraits in the first step, the method specifically comprises the following steps:

4. A method for intelligently generating matching topics and information based on commodities as in claim 3, wherein: aiming at the commodity multi-mode knowledge graph which is formed in the first step and comprises the following steps:

5. The intelligent commodity-based method for generating matching topics and information as claimed in claim 4, wherein: in the commodity multi-mode knowledge graph, when the data of any commodity is called, the detailed information of the corresponding commodity is called according to the commodity multi-mode knowledge graph, and the related commodity under the commodity related information is independently presented in the form of the knowledge graph.

6. The intelligent commodity-based method for generating matching topics and information as claimed in claim 5, wherein: aiming at the writing method for learning the commodity soft text based on the generated model in the second step, the method specifically comprises the following steps:

7. The intelligent commodity-based method for generating matching topics and information as claimed in claim 6, wherein: aiming at the search combined with a search engine in the third step, comparing the difference degree between generated news as a loss function, wherein the specific process comprises the following steps:

text generation: from language model P _lm Sampling in (W), wherein w= [ W ] ₁ ...w _n ]Is a series of discrete words, P _lm (W) is a distribution about word sequences;

8. The intelligent commodity-based method for generating matching topics and information as claimed in claim 7, wherein: the controllable text generation further includes: and establishing a diffusion model, and generating controllable text attributes based on the diffusion model.