CN108563765B

CN108563765B - Intelligent image-text matching method and system

Info

Publication number: CN108563765B
Application number: CN201810353977.XA
Authority: CN
Inventors: 刘晓燕; 欧阳波; 杨光启
Original assignee: Aiyouya Information Technology Shenzhen Co ltd
Current assignee: Aiyouya information technology (Shenzhen) Co.,Ltd.
Priority date: 2018-04-19
Filing date: 2018-04-19
Publication date: 2021-11-16
Anticipated expiration: 2038-04-19
Also published as: CN108563765A

Abstract

The invention relates to an intelligent image-text matching method and system, which judges whether character information is matched with an image or not and outputs a matching result; if the matching is carried out, the picture and the characters are displayed, and if the matching is not carried out, the picture is not displayed, and only the character information is displayed. The method and the device can reduce unnecessary picture information to the maximum extent when browsing news or characters, help the user to save traffic, and simultaneously help the user to click on unnecessary characters due to the picture information which is not related to the characters, thereby reducing unnecessary traffic waste caused by a banner party.

Description

Intelligent image-text matching method and system

Technical Field

The invention relates to the field of computers, in particular to an intelligent image-text matching method and system.

Background

In the prior art, with the development of science and technology, the demand of human beings on data processing services is increasing day by day, a mobile phone is used as information browsing and is used as a main channel for people to connect external information, more and more people do not need to know news of all places through televisions or newspapers, however, because the network news brings fast real-time news and various self media are added to acquire flow, the news value is measured through the flow, and more news media are used for producing the news, through pictures or headline parties which are not related to characters or pictures which can attract users, the users are attracted to click on news, the purpose of acquiring more traffic is achieved, this is then unnecessary for users of news browsing, the pictures cause extra traffic loss, and the pictures are completely unrelated to the text, which is annoying to the user to some extent. Therefore, browsing news in the prior art has certain defects for users.

Content of application

In order to solve the technical problems: the application provides a picture and text matching method, which comprises the following steps:

judging whether the character information is matched with the picture or not, and outputting a matching result;

if the matching is carried out, the picture and the characters are displayed, and if the matching is not carried out, the picture is not displayed, and only the character information is displayed.

The image-text matching method specifically comprises the following steps: the step of judging whether the text information is matched with the picture direction specifically comprises the following steps:

searching whether the characters contain characters with non-matching pictures and texts, and if the characters contain characters with non-matching pictures and texts, determining that the pictures are not matched with the character information;

searching picture information and outputting picture keywords;

comparing the picture key words with the text information, and judging whether the text information contains the picture key words;

if yes, outputting a matching result, and if not, outputting no match.

The image-text matching method, wherein the searching for the picture information and the outputting of the picture keywords specifically comprise:

and connecting the picture to a network, searching the picture, determining keywords of the picture, and outputting a plurality of keywords related to the picture in a list mode.

In the image-text matching method, the determining of the keywords of the image specifically includes: detecting titles or character information related to pictures, comparing a plurality of titles, extracting words with frequency exceeding a search threshold, arranging the words exceeding the search threshold according to a frequency sequence, comparing an arrangement result with a keyword threshold, and outputting the words exceeding the keyword threshold.

The image-text matching method comprises the steps that the search threshold comprises a first search threshold and a second search threshold, the first search threshold is larger than the second search threshold, when the frequency of words exceeds the first search threshold, the words are determined as determined keywords, when the frequency of the words is between the first search threshold and the second search threshold, the words are determined as selectable keywords, and when the frequency of the words is smaller than the second search threshold, the words are determined as fuzzy keywords; the keyword threshold comprises a first keyword threshold and a second keyword threshold, when the determined keyword is smaller than the first keyword threshold, a difference value between the first keyword threshold and the determined keyword is selected from the selectable keywords, an output picture keyword is obtained according to the sum of the difference value and the determined keyword, when the determined keyword is larger than the first keyword threshold, the current determined keyword is output as the output picture keyword, and when the determined keyword is larger than the second keyword threshold, a corresponding number of keywords of the second keyword threshold are output as the output picture keyword.

In the image-text matching method, if matching, displaying the image and the text comprises: and labeling the pictures, associating the pictures with the characters according to the picture distribution with different labels, not displaying the pictures which cannot be associated with the characters, and finding out the label associated symbols corresponding to the characters and the pictures with the characters.

The image-text matching method does not display the image if the image-text matching method is not matched, and only displays the text information, and then the method further comprises the following steps: typesetting the text information again, determining the author of the text information, recording the author into a record table, comparing the record with an early warning value, if the recording times of the author exceed the early warning value, performing early warning prompt, and correspondingly recording the early warning prompt times, if the early warning prompt times exceed a preset value, listing the corresponding author into a blacklist, and simultaneously reporting the blacklist in the blacklist; if the author is the first record, filling the author into a fixed position of a record table.

A teletext matching system comprising:

the searching module is used for searching picture keywords and character information;

the matching judgment module is used for judging whether the character information is matched with the picture or not and outputting a matching result;

and the output module is used for outputting a result according to the matching result, if the result is matched, the picture and the characters are displayed, and if the result is not matched, the picture is not displayed, and only the character information is displayed.

The image-text matching system further comprises a threshold setting module, a networking module, a sorting module, a text typesetting module, an author information recording module and an author information early warning module, wherein the threshold setting module is used for setting a search threshold and a keyword threshold, the search threshold comprises a first search threshold and a second search threshold, and the keyword threshold comprises a first keyword threshold and a second keyword threshold; the networking module is used for connecting a network and searching keywords for the pictures; the sorting module is used for sorting the searched picture keywords; the text typesetting module is used for typesetting the text information again and performing associated operation on the pictures and texts; the author information recording module is used for recording author information with unmatched images and texts, and the author information early warning module is used for early warning authors with unmatched images and texts and with records exceeding the early warning value.

Whether this application can match with the characters by automatic identification picture, the flow when furthest's saving user browses the news reduces the puzzlement that unnecessary picture information brought for the user, promotes the user and reads the experience effect of news or browse the characters information.

Drawings

Fig. 1 is a schematic diagram of the image-text matching method of the present invention.

Fig. 2 is a schematic diagram of the image-text matching system of the present invention.

Detailed Description

The present application will now be described in further detail with reference to the drawings, it should be noted that the following detailed description is given for illustrative purposes only and is not to be construed as limiting the scope of the present application, as those skilled in the art will be able to make numerous insubstantial modifications and adaptations to the present application based on the above disclosure.

The first embodiment is as follows:

as shown in fig. 1, the invention provides an intelligent image-text matching method, which comprises the following steps:

searching picture information and outputting picture keywords;

if yes, outputting a matching result, and if not, outputting no match.

Example two:

the invention provides a method for monitoring the flow rate of image-text mismatching, which comprises the following steps:

if the matching is carried out, displaying the pictures and the characters; and if not, monitoring the flow consumption value, judging whether the flow consumption value exceeds a flow threshold value, and if the flow consumption value exceeds the flow threshold value, not displaying the picture and only displaying the text information.

The specific method for monitoring the flow consumption value comprises the following steps:

step 1: carrying out flow monitoring on image-text mismatching in a monitoring period to form a flow loss waveform; selecting one day or half day or a fixed time period as a monitoring period;

step 2: analyzing the flow loss waveform to obtain the amplitude xi of the wave in the monitoring period, wherein i is a natural number from 1 to n, and n is the number of the waves in one monitoring period;

and step 3: calculating a mean calculation value Yn of the amplitude xi of the wave by performing a recursive calculation by the following formula (1): yn ═ 1-k (Yn-1 + kxn (1)

In the formula (1), xn is the amplitude of the wave measured at the nth time; yn is the average calculation value of the amplitude xi when the nth recursion is performed; k is a calculation constant;

and 4, step 4: carrying out recursive calculation again through a formula (2) to obtain a calculated value of a mean square value Zn of the amplitude xi;

Zn＝(1-k)Zn-1+kxn2 (2)

zn is a calculated value of the mean square value of the amplitude xi when the nth recursion is carried out;

and 5: calculating a standard value standard (xi) of the calculated value of xi by a formula (3);

standard(x)＝Yn2-Zn (3)；

step 6: then, performing binary judgment on xi through a formula (4);

S＝xi-Yn*standard(xi) (4)

and 7: when S is larger than zero, the flow rate consumed when the pictures and texts are not matched is considered to be excessive, namely the flow rate exceeds a threshold value.

searching picture information and outputting picture keywords;

if yes, outputting a matching result, and if not, outputting no match.

Example three:

as shown in fig. 2, the image-text matching system of the present invention includes:

Claims

1. An intelligent image-text matching method is characterized by comprising the following steps:

if the matching is carried out, the picture and the characters are displayed, and if the matching is not carried out, the picture is not displayed, and only the character information is displayed;

the step of judging whether the text information is matched with the picture specifically comprises the following steps: searching whether the characters contain characters with non-matching pictures and texts, and if the characters contain characters with non-matching pictures and texts, determining that the pictures are not matched with the character information;

the step of judging whether the text information is matched with the picture specifically comprises the following steps: searching picture information and outputting picture keywords;

if yes, outputting a matching result, and if not, outputting mismatching;

the searching for the picture information and outputting the picture keywords specifically comprise: connecting the picture to a network, searching the picture, determining keywords of the picture, and outputting a plurality of keywords related to the picture in a list mode;

the determining of the keywords of the picture specifically includes: detecting titles or character information related to pictures, comparing a plurality of titles, extracting words with frequency exceeding a search threshold, arranging the words exceeding the search threshold according to a frequency sequence, comparing an arrangement result with a keyword threshold, and outputting the words exceeding the keyword threshold.

2. The teletext matching method according to claim 1, wherein the search threshold comprises a first search threshold and a second search threshold, the first search threshold being greater than the second search threshold, the word being determined as a determined keyword when the frequency of the word exceeds the first search threshold, the word being determined as a selectable keyword when the frequency of the word is between the first search threshold and the second search threshold, the word being determined as a fuzzy keyword when the frequency of the word is less than the second search threshold; the keyword threshold comprises a first keyword threshold and a second keyword threshold, when the number of the determined keywords is smaller than the first keyword threshold, the number of the selectable keywords is the difference between the first keyword threshold and the determined keywords, the output picture keywords are obtained by summing the difference and the determined keywords, when the number of the determined keywords is larger than the first keyword threshold, the current determined keywords are output as the output picture keywords, and when the determined keywords are larger than the second keyword threshold, the keywords with the corresponding number of the second keyword threshold are output as the output picture keywords.

3. The teletext matching method of claim 1, wherein displaying the picture and text if matched comprises: and labeling the pictures, respectively associating the pictures with the characters according to different labels, not displaying the pictures which cannot be associated with the characters, and finding out the label associated symbols corresponding to the characters and the pictures with the characters.

4. The teletext matching method of claim 1, wherein if there is no match, no picture is displayed, only after displaying the textual information, further comprising: typesetting the text information again, determining the author of the text information, recording the author into a record table, comparing the record with an early warning value, if the recording times of the author exceed the early warning value, performing early warning prompt, and correspondingly recording the early warning prompt times, if the early warning prompt times exceed a preset value, listing the corresponding author into a blacklist, and simultaneously reporting the blacklist in the blacklist; if the author is the first record, filling the author into a fixed position of a record table.

5. An intelligent image-text matching system, characterized by comprising:

the output module is used for outputting a result according to the matching result, if the result is matched with the matching result, the picture and the characters are displayed, and if the result is not matched with the matching result, the picture is not displayed, and only the character information is displayed;

the system comprises a threshold setting module, a networking module, a sorting module, a text typesetting module, an author information recording module and an author information early warning module, wherein the threshold setting module is used for setting a search threshold and a keyword threshold, the search threshold comprises a first search threshold and a second search threshold, and the keyword threshold comprises a first keyword threshold and a second keyword threshold; the networking module is used for connecting a network and searching keywords for the pictures; the sorting module is used for sorting the searched picture keywords; the text typesetting module is used for typesetting the text information again and performing associated operation on the pictures and texts; the author information recording module is used for recording author information with unmatched images and the author information early warning module is used for early warning authors with unmatched images and with recorded times exceeding the early warning value.