[go: up one dir, main page]

CN113268981A - Information processing method and device and electronic equipment - Google Patents

Information processing method and device and electronic equipment Download PDF

Info

Publication number
CN113268981A
CN113268981A CN202110587658.7A CN202110587658A CN113268981A CN 113268981 A CN113268981 A CN 113268981A CN 202110587658 A CN202110587658 A CN 202110587658A CN 113268981 A CN113268981 A CN 113268981A
Authority
CN
China
Prior art keywords
text
target
character
recognized
characters
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110587658.7A
Other languages
Chinese (zh)
Other versions
CN113268981B (en
Inventor
谢佳美
王滨宇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Mobile Communications Group Co Ltd
MIGU Music Co Ltd
MIGU Culture Technology Co Ltd
Original Assignee
China Mobile Communications Group Co Ltd
MIGU Music Co Ltd
MIGU Culture Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Mobile Communications Group Co Ltd, MIGU Music Co Ltd, MIGU Culture Technology Co Ltd filed Critical China Mobile Communications Group Co Ltd
Priority to CN202110587658.7A priority Critical patent/CN113268981B/en
Publication of CN113268981A publication Critical patent/CN113268981A/en
Application granted granted Critical
Publication of CN113268981B publication Critical patent/CN113268981B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3343Query execution using phonetics

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Acoustics & Sound (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • User Interface Of Digital Computer (AREA)
  • Document Processing Apparatus (AREA)

Abstract

本发明公开了一种信息处理方法、装置及电子设备,涉及信息处理技术领域,以解决现有文字查询操作较为繁琐,效率较低的问题。该方法包括:获取目标语音,所述目标语音包括对电子设备上显示的第一文本中第一文字的读音;根据所述目标语音,确定所述第一文本中的待识别文字,所述待识别文字与所述第一文字不同;获取所述待识别文字的目标信息,所述目标信息包括发音和注释中的至少一项。这样,用户在利用电子设备阅读时,若遇到不认识的文字,可以对设备上显示的某处文字进行读音,即可触发电子设备根据用户读音定位出待识别文字,进而获取待识别文字的发音或注释等信息,该交互方式无需用户选中复制并查询待识别文字,相比现有方式更为实时便捷。

Figure 202110587658

The invention discloses an information processing method, a device and an electronic device, and relates to the technical field of information processing to solve the problems of complicated and low efficiency of existing text query operations. The method includes: acquiring a target voice, where the target voice includes the pronunciation of a first character in a first text displayed on an electronic device; according to the target voice, determining a character to be recognized in the first text, the to-be-recognized character The text is different from the first text; the target information of the text to be recognized is acquired, and the target information includes at least one of pronunciation and annotation. In this way, if a user encounters an unrecognized text when reading with an electronic device, he can read a text displayed on the device, which can trigger the electronic device to locate the text to be recognized according to the user's pronunciation, and then obtain the text to be recognized. Pronunciation or annotation and other information, this interactive method does not require users to select and copy and query the text to be recognized, which is more real-time and convenient than the existing method.

Figure 202110587658

Description

Information processing method and device and electronic equipment
Technical Field
The present invention relates to the field of information processing technologies, and in particular, to an information processing method and apparatus, and an electronic device.
Background
With the popularization of electronic devices, users often read various information with the electronic devices. During reading, users can encounter characters with unknown pronunciation or uncommon characters with unknown recognizability. In the related art, a user usually needs to operate a cursor with a finger to select and copy an unknown character, then jump to a related search page to search the character, query pronunciation or related paraphrases, and the problem of inaccurate positioning commonly exists in the selection operation on the electronic equipment, so that the user often needs to repeat multiple selection operations to select the character to be queried. Therefore, the existing text query operation is complicated, and the efficiency is low.
Disclosure of Invention
The embodiment of the invention provides an information processing method, an information processing device and electronic equipment, and aims to solve the problems of more complicated operation and lower efficiency of the conventional character query.
In a first aspect, an embodiment of the present invention provides an information processing method, including:
acquiring target voice, wherein the target voice comprises pronunciation of a first word in a first text displayed on electronic equipment;
determining characters to be recognized in the first text according to the target voice, wherein the characters to be recognized are different from the first characters;
acquiring target information of the character to be recognized, wherein the target information comprises at least one of pronunciation and annotation.
Optionally, the target speech further includes pronunciation of a preset cue word;
determining the characters to be recognized in the first text according to the target voice, wherein the determining comprises the following steps:
identifying the target voice to obtain a second text corresponding to the target voice;
determining second characters in the second text except the preset prompt words;
determining a target word sentence matched with the second character in the first text;
and determining characters to be recognized in the target words and sentences, wherein the characters to be recognized are different from the second characters.
Optionally, the determining second words in the second text except for the preset cue word includes:
according to the preset cue word, the second text is segmented to obtain a third character in the second text before the preset cue word and a fourth character after the preset cue word, wherein the second character comprises the third character and the fourth character;
the determining the target words and phrases in the first text that match the second words includes:
determining a fifth word in the first text, which is matched with the third word, and a sixth word which is matched with the fourth word;
and determining a target word sentence comprising the fifth word and the sixth word from the first text.
Optionally, the determining a target word sentence including the fifth word and the sixth word from the first text includes:
under the condition that the number of the fifth words or the sixth words is larger than 1, determining target fifth words and target sixth words with the smallest position intervals in the first text, wherein the target fifth words are before the target sixth words;
and determining a target word and sentence in the first text by taking the target fifth character as a starting word and the target sixth character as an ending word.
Optionally, the determining the characters to be recognized in the target words and sentences includes:
and determining characters positioned between the target fifth character and the target sixth character in the target words and sentences as characters to be recognized.
Optionally, the determining, according to the target speech, a character to be recognized in the first text includes:
identifying the target voice to obtain a third text corresponding to the target voice;
determining a seventh word in the first text that matches the third text;
receiving a first input of a user;
responding to the first input, and determining the word number K of the character to be recognized, wherein K is a positive integer;
and determining K characters positioned after the seventh character in the first text as the characters to be recognized.
Optionally, the receiving a first input of the user includes:
receiving a tap input of a user on a screen of the electronic equipment;
the responding to the first input, and determining the number K of the character words to be recognized comprises the following steps:
and determining the knocking times K of the knocking input as the number of the characters to be recognized.
Optionally, the number of the seventh characters is L, and L is an integer greater than 1;
the determining that K characters located after the seventh character in the first text are the characters to be recognized includes:
determining K characters respectively positioned behind each seventh character in the first text as candidate characters to obtain L groups of candidate characters;
receiving a second input of the user;
and responding to the second input, determining a target candidate character from the L groups of candidate characters, and determining the target candidate character as the character to be recognized.
Optionally, before receiving the second input of the user, the method further includes:
identifying the L groups of candidate words;
the receiving of the second input of the user comprises:
and receiving selection input of the L groups of candidate characters from a user.
In a second aspect, an embodiment of the present invention further provides an information processing apparatus, including:
the device comprises a first acquisition module, a second acquisition module and a third acquisition module, wherein the first acquisition module is used for acquiring target voice, and the target voice comprises pronunciation of a first character in a first text displayed on the electronic equipment;
the determining module is used for determining characters to be recognized in the first text according to the target voice, wherein the characters to be recognized are different from the first characters;
and the second acquisition module is used for acquiring target information of the character to be recognized, wherein the target information comprises at least one of pronunciation and annotation.
Optionally, the target speech further includes pronunciation of a preset cue word;
the determining module comprises:
the first recognition submodule is used for recognizing the target voice to obtain a second text corresponding to the target voice;
the first determining submodule is used for determining second characters except the preset prompt words in the second text;
the second determining submodule is used for determining a target word and sentence matched with the second word in the first text;
and the third determining submodule is used for determining characters to be recognized in the target words and sentences, wherein the characters to be recognized are different from the second characters.
Optionally, the first determining sub-module is configured to segment the second text according to the preset cue word to obtain a third word before the preset cue word and a fourth word after the preset cue word in the second text, where the second word includes the third word and the fourth word;
the second determination submodule includes:
a first determining unit, configured to determine a fifth word in the first text that matches the third word, and a sixth word that matches the fourth word;
a second determining unit, configured to determine a target word sentence including the fifth word and the sixth word from the first text.
Optionally, the second determining unit includes:
a first determining subunit, configured to determine, when the number of the fifth words or the sixth words is greater than 1, a target fifth word and a target sixth word that are located at a minimum interval in the first text, where the target fifth word precedes the target sixth word;
and the second determining subunit is configured to determine a target word and sentence in the first text by using the target fifth character as a start word and the target sixth character as an end word.
Optionally, the third determining submodule is configured to determine, as a character to be recognized, a character in the target sentence, which is located between the target fifth character and the target sixth character.
Optionally, the determining module includes:
the second recognition submodule is used for recognizing the target voice to obtain a third text corresponding to the target voice;
a fourth determining submodule, configured to determine a seventh word in the first text, where the seventh word matches the third text;
the receiving submodule is used for receiving a first input of a user;
the fifth determining submodule is used for responding to the first input and determining the number K of the characters to be recognized, wherein K is a positive integer;
and the sixth determining submodule is used for determining K characters behind the seventh character in the first text as the characters to be recognized.
Optionally, the receiving sub-module is configured to receive a tap input of a user on the screen of the electronic device;
and the fifth determining submodule is used for determining the knocking times K of the knocking input as the character word number to be recognized.
Optionally, the number of the seventh characters is L, and L is an integer greater than 1;
the sixth determination submodule includes:
a third determining unit, configured to determine that K characters respectively located after each seventh character in the first text are candidate characters, so as to obtain L groups of candidate characters;
a receiving unit for receiving a second input of the user;
and the fourth determining unit is used for responding to the second input, determining a target candidate character from the L groups of candidate characters and determining the target candidate character as the character to be recognized.
Optionally, the sixth determining sub-module further includes:
an identification unit, configured to identify the L groups of candidate characters;
the receiving unit is used for receiving the selection input of the L groups of candidate characters by the user.
In a third aspect, an embodiment of the present invention further provides an electronic device, including: a transceiver, a memory, a processor and a computer program stored on the memory and executable on the processor, the processor implementing the steps in the information processing method as described above when executing the computer program.
In a fourth aspect, the embodiment of the present invention further provides a computer-readable storage medium, on which a computer program is stored, where the computer program, when executed by a processor, implements the steps in the information processing method as described above.
In the embodiment of the invention, target voice is obtained, wherein the target voice comprises pronunciation of a first character in a first text displayed on electronic equipment; determining characters to be recognized in the first text according to the target voice, wherein the characters to be recognized are different from the first characters; acquiring target information of the character to be recognized, wherein the target information comprises at least one of pronunciation and annotation. Therefore, when a user reads by using the electronic equipment, if the user encounters unknown characters, the user can read the characters displayed on the equipment, the electronic equipment can be triggered to position the characters to be recognized according to the user's reading, and then information such as pronunciation or comments of the characters to be recognized is acquired.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings needed to be used in the description of the embodiments of the present invention will be briefly introduced below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art that other drawings can be obtained according to these drawings without inventive exercise.
FIG. 1 is a flow chart of an information processing method provided by an embodiment of the invention;
FIG. 2 is a schematic diagram of an interactive interface for a user to query uncommon words through voice according to an embodiment of the present invention;
FIG. 3 is a second schematic view of an interactive interface for a user to query uncommon words through voice according to an embodiment of the present invention;
FIG. 4 is a schematic diagram of an interactive interface for a user to query a uncommon word through voice and tapping operations according to an embodiment of the present invention;
FIG. 5 is a block diagram of an information processing apparatus provided by an embodiment of the present invention;
fig. 6 is a block diagram of an electronic device provided in an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Referring to fig. 1, fig. 1 is a flowchart of an information processing method according to an embodiment of the present invention, and as shown in fig. 1, the method includes the following steps:
step 101, obtaining a target voice, wherein the target voice comprises a pronunciation of a first character in a first text displayed on an electronic device.
The embodiment of the invention can be applied to scenes that a user quickly pronounces and inquires unknown words such as uncommon words, quickly explains and inquires the encountered words with unknown literal meanings, quickly pronounces or inquires the unknown English words and the like in the reading process by utilizing electronic equipment.
In the embodiment of the invention, when the electronic equipment is in a reading page, namely the first text is displayed, the user may encounter unknown characters in reading, and at the moment, the user can read the characters before and after the unknown characters, so that the electronic equipment can collect the voice uttered by the characters read by the user, namely, the target voice is acquired. The first text may refer to a text currently displayed on the electronic device, the first character may refer to a certain section of characters before and after the character to be recognized in the first text, and the target voice is a voice uttered by a user by reading the first character. It should be noted that the first characters may be characters of different languages, such as chinese characters and english.
For example, the electronic device displays the text "ancient human shape and sound meaning is debated from badness", the user does not recognize the pronunciation of " " two characters in reading, the user can read out the characters before and after " " two characters, namely, the words "ancient human shape and sound meaning" and "badness", for " " two characters can be stopped and not read, or other cue words can be used for substitution; alternatively, the user may read only the word "ancient ideographical debate" preceding the word " ".
Optionally, the step 101 includes:
and acquiring the target voice under the condition of receiving preset input of a user.
The preset input may be a preset input for triggering the electronic device to collect a voice of a user, for example, as shown in fig. 2, the preset input may be a character recognition function button 21 displayed on an interface of the touch electronic device 20, or a sound pickup function for waking up the electronic device by voice. That is, in this embodiment, the voice collecting module may be turned on only when an input for triggering the voice collecting function is received from the user, so as to obtain the target voice, so as to ensure that the electronic device starts the voice collecting function at an appropriate time.
Step 102, determining characters to be recognized in the first text according to the target voice, wherein the characters to be recognized are different from the first characters.
After the target voice is obtained, a field currently read by a user can be positioned in the first text according to the target voice, and specific characters to be recognized are determined according to a preset rule.
Specifically, the target speech may be subjected to speech recognition, the target speech is converted into a text, then, based on the converted text, a position of the converted text is found in the first text, so that a reading position of the user may be located, and further, a word at the position where the user does not pronounce or a plurality of words after the position may be determined as a word to be recognized, where a specific word number may be set by default of the system or determined based on a word number parameter further input by the user, that is, the word to be recognized is different from the first word in pronunciation of the user, and the word to be recognized may be located between the first words or may be located immediately after the first word.
For example, when the text corresponding to the target voice is recognized as "ancient pictophonetic ideology and malignance", it may be determined that the user is reading the text "ancient pictophonetic ideology malignance" displayed on the electronic device, and two characters " " in which the user has not read may be determined as the characters to be recognized; or, when the text corresponding to the target voice is recognized as the ancient human form and sound meaning, it can be determined that the user is reading the text of the ancient human form and sound meaning displayed on the electronic equipment, and the next two characters "" "" are determined as the characters to be recognized by default.
Optionally, the target speech further includes pronunciation of a preset cue word;
the step 102 comprises:
identifying the target voice to obtain a second text corresponding to the target voice;
determining second characters in the second text except the preset prompt words;
determining a target word sentence matched with the second character in the first text;
and determining characters to be recognized in the target words and sentences, wherein the characters to be recognized are different from the second characters.
In one embodiment, when a user reads an unknown character, the user may use a specific cue word to replace the unknown character, where the specific cue word may be a preset cue word used to replace the character to be recognized, such as a "shape, sound, meaning, word learning," or a "learning sound," and thus the target speech may further include a pronunciation of the preset cue word, and the electronic device may accurately determine that the word in the first text corresponding to the specific cue word is the character to be recognized based on the preset cue word.
In this embodiment, the electronic device may recognize the target voice, obtain a second text corresponding to the target voice, extract a second word except the preset cue word from the second text, find a target word and sentence matched with the first text based on the second word, and determine a word at a position corresponding to the preset cue word in the target word and sentence, where the word is a word to be recognized.
For example, as shown in FIG. 2, when the user reads the three pieces displayed on the electronic device 20, the user can read the sentence aloud without knowing the four words of the three pieces, and the preset prompt word of shape and sound meaning instead of the three pieces, the electronic device can read out the voice information read out by the user and convert the voice information into text information; the text that converts is distinguished for "other sound will, very mobile", electronic equipment distinguishes "and text" other sound will distinguish through predetermineeing the suggestion word "the shape sound will, very mobile" compares, can confirm the target word and sentence of matching and for "other pieces and pieces, very mobile" stands to and can further confirm in this word and sentence with "shape sound will distinguish" the characters of waiting to discern that the position corresponds are "piece and piece with writings" in this sentence, electronic equipment can carry out the mark corresponding at the demonstration position department of "piece writings" four characters in obtaining "piece writings" four characters with writings.
Therefore, through the embodiment, the user only needs to read the unknown text paragraph and uses the preset cue word to replace the unknown text during reading, so that the electronic equipment can be triggered to accurately position the text to be recognized according to the user's pronunciation, and the target information of the text to be recognized of the user can be fed back in real time.
Optionally, the determining second words in the second text except for the preset cue word includes:
according to the preset cue word, the second text is segmented to obtain a third character in the second text before the preset cue word and a fourth character after the preset cue word, wherein the second character comprises the third character and the fourth character;
the determining the target words and phrases in the first text that match the second words includes:
determining a fifth word in the first text, which is matched with the third word, and a sixth word which is matched with the fourth word;
and determining a target word sentence comprising the fifth word and the sixth word from the first text.
Specifically, when determining a second text in the second text except the preset cue word, the second text may be segmented by using a position of the preset cue word in the second text as a boundary to obtain a third text which is a text in the second text before the preset cue word and a fourth text which is a text after the preset cue word, where the second text includes the third text and the fourth text.
For example, by taking "geometric-phonetic-meaning-distinguishment" as a preset hint word for example, the converted text is "geometric-phonetic-meaning-distinguishment, which is very mobile", the positions of the four words of "geometric-phonetic-meaning-distinguishment" in the converted text can be determined, then, "geometric-phonetic-meaning-distinguishment" can be used as a separator, and the converted text is split into "front section" [ geometric-phonetic-meaning-distinguishment ] "rear section", for example, the converted text is split into "other" [ geometric-phonetic-meaning-distinguishment ] "very mobile", "front section" is "other", and "rear section" is "very mobile".
Then, the third words and the fourth words can be used as matching keywords respectively, and fifth words respectively matched with the third words and sixth words matched with the fourth words can be found in the first text; and finally, determining a target word sentence comprising the fifth word and the sixth word from the first text, specifically, finding the position of the fifth word from the first text, continuing to find the position of the sixth word from the position, and taking words comprising the fifth word and the sixth word as the target word sentence.
Therefore, through the implementation mode, the words and sentences of the characters to be recognized can be accurately and quickly positioned from the first text.
Optionally, the determining a target word sentence including the fifth word and the sixth word from the first text includes:
under the condition that the number of the fifth words or the sixth words is larger than 1, determining target fifth words and target sixth words with the smallest position intervals in the first text, wherein the target fifth words are before the target sixth words;
and determining a target word and sentence in the first text by taking the target fifth character as a starting word and the target sixth character as an ending word.
In this embodiment, when a plurality of fifth characters matching with the third characters are determined from the first text, or a plurality of sixth characters matching with the fourth characters are determined from the first text, it is necessary to further determine words and phrases including the characters to be recognized, which are actually read by the user.
Specifically, the target fifth text and the target sixth text actually read by the user may be determined based on the position relationship and the position interval between each fifth text and each sixth text, for example, the size of the position interval between each fifth text and each sixth text may be compared one by one, and finally, a group of fifth text and sixth text with the smallest position interval and the fifth text located before the sixth text is obtained, where the group of text is the target fifth text and the target sixth text. Then, the target word and sentence may be determined in the first text by using the target fifth word as a start word and the target sixth word as an end word, where the target word and sentence includes words between the target fifth word and the target sixth word.
Therefore, under the condition that a plurality of matched characters exist in the first text, the words and sentences where the characters to be recognized are actually read by the user can be accurately positioned through the implementation mode.
Optionally, the determining the characters to be recognized in the target words and sentences includes:
and determining characters positioned between the target fifth character and the target sixth character in the target words and sentences as characters to be recognized.
When the target word and sentence is determined to be a word and sentence in the first text, which takes the target fifth character as a starting word and takes the target sixth character as an ending word, the character between the target fifth character and the target sixth character in the target word and sentence can be directly determined to be a character to be recognized. Of course, after the target fifth word and the target sixth word are determined, the word between the target fifth word and the target sixth word in the first text may also be directly determined as the word to be recognized.
For example, referring to FIG. 3, assume P represents a text string of the content of the page displayed on electronic device 20, V represents a string of characters after the user speaks the page text and converts the speech of the prompt "our Chinese character recognition" into text, V1 represents a string of characters before "ideograph" in the converted text, i.e., "our Chinese character", and V2 represents a string of characters after "ideograph" in the converted text, i.e., "whine".
As shown in fig. 3, in the content of the page displayed on the electronic device 20, a plurality of fields matching V1, assuming X1, X2, and X3 from front to back, respectively, by the display position, and a plurality of fields matching V2, assuming Y1 and Y2 from front to back, respectively, by the display position, may be determined.
The specific matching steps may be as follows:
1) after the user reads the sentence at the character position to be recognized, the electronic equipment waits for several seconds to convert the current reading sound into a character string V;
2) extracting the prompting words of 'shape, sound and meaning' contained in the V character string, and respectively defining the contents before and after the prompting words as V1 and V2;
3) starting from front to back in P, retrieve V1, resulting in X1, X2, and X3, respectively;
4) searching V2 from back to front (or from front to back) in P to obtain Y1 and Y2 respectively;
5) position interval minimum match calculation: judging the position relations of X1, X2 and X3 and Y1 respectively, and judging the position relations of X1, X2 and X3 and Y2 respectively, thereby excluding the matching relation of X3 and Y2; calculating the position intervals of X1 and Y1 and Y2, X2 and Y1 and Y2, and X3 and Y1 respectively; determining X3 and Y1 with the minimum position interval as a matching pair;
6) the method comprises the steps of determining an string ' waiver 222763,34337 ', azulene 35274and short palace moth ' between X3 and Y1 as characters to be identified, and sending the string to a rear-end interface to inquire pinyin information.
Through the implementation mode, the characters to be recognized really expected by the user can be quickly and accurately positioned.
Optionally, the determining, according to the target speech, a character to be recognized in the first text includes:
identifying the target voice to obtain a third text corresponding to the target voice;
determining a seventh word in the first text that matches the third text;
receiving a first input of a user;
responding to the first input, and determining the word number K of the character to be recognized, wherein K is a positive integer;
and determining K characters positioned after the seventh character in the first text as the characters to be recognized.
In another embodiment, the user may only read a segment of text before the unknown text, and the electronic device locates the text matching the user's pronunciation in the first text by the user's pronunciation, and determines how many characters following the selected matching text are the text to be recognized based on the user's input.
In this embodiment, the electronic device may recognize the target voice, obtain a third text corresponding to the target voice, and find a word, that is, a seventh word, matching with the first text based on the third text, and then the user may perform a first input for determining the word number of the word to be recognized, so that the electronic device may receive the first input, determine the word number K of the word to be recognized based on the first input, and further determine that K words located after the seventh word in the first text are the word to be recognized.
The first input may be inputting a specific number on the display interface of the electronic device, such as directly handwriting a number on the display interface, or inputting a corresponding number in a pop-up window for a user to input a word count, or clicking K times in a blank of the display interface, or tapping K times on a screen, or inputting a number by voice.
Therefore, through the embodiment, the user only needs to read the unknown text paragraph and input the word number of the character to be recognized during reading, and the electronic equipment can be triggered to accurately position the character to be recognized according to the user's reading and the input word number.
Optionally, the receiving a first input of the user includes:
receiving a tap input of a user on a screen of the electronic equipment;
the responding to the first input, and determining the number K of the character words to be recognized comprises the following steps:
and determining the knocking times K of the knocking input as the number of the characters to be recognized.
In an embodiment, the first input may be a tapping input on the screen of the electronic device, after the user pronounces a segment of text before the text to be recognized, the electronic device may locate the reading position of the user based on the pronunciation of the user, and the user may tap the screen of the electronic device for K times to prompt the electronic device to select K texts after the currently located position as the text to be recognized.
For example, for a text displayed on an electronic device, the text "ancient pictophonetic ideological dyscrasia", demon of drought, front , chi, 3957, 3953 ", wherein a user does not recognize demon of drought, front , chi, 3957, and 3953", the user can read the text "ancient pictophonetic ideographic dyscrasia", the electronic device can locate the sentence based on the user's reading, the user can continue to tap 8 times on the screen of the electronic device, and the electronic device can monitor the tap times of the user by tapping a monitoring module, so that 8 characters "demon of drought, front , chi, 57, 39397" after the "ancient pictophonetic ideographic dyscrasia" can be determined as characters to be recognized.
Therefore, the user can trigger the electronic equipment to accurately position the character to be recognized by the user only by matching with the knocking operation with the pronunciation of a section of the character before the character to be recognized, and the target information of the character to be recognized can be quickly acquired.
Optionally, the number of the seventh characters is L, and L is an integer greater than 1;
the determining that K characters located after the seventh character in the first text are the characters to be recognized includes:
determining K characters respectively positioned behind each seventh character in the first text as candidate characters to obtain L groups of candidate characters;
receiving a second input of the user;
and responding to the second input, determining a target candidate character from the L groups of candidate characters, and determining the target candidate character as the character to be recognized.
That is, in one embodiment, when a plurality of seventh characters matching with the pronunciation of the user are determined from the first text, the position of the character that the user really needs to recognize needs to be further determined.
Specifically, in the case that L seventh words are matched, for each seventh word, the part may determine, in the first text, K words located after the seventh word as candidate words, so as to obtain L groups of candidate words, and then the user may perform a second input for determining a target candidate word, so that the electronic device may receive the second input, determine a target candidate word from the L groups of candidate words based on the second input, and may determine the target candidate word as the word to be recognized.
The second input may be tapping on the screen several times to prompt the electronic device that the candidate characters in the group 3 are the target candidate characters, if tapping 3 times, the candidate characters in the group 3 are determined to be the target candidate characters, or clicking several times on a blank of the display interface, or clicking the position of the target candidate characters, or inputting a number by voice.
That is, when the system determines that the character to be recognized that meets the condition is unique, the system can directly acquire the K characters after the seventh character as the target character to be recognized; when the system judges that the characters to be recognized which meet the conditions are not unique, the system can monitor the number M of times of continuous knocking of the user for two times, and selects K characters after the Mth seventh character as the target characters to be recognized.
Therefore, under the condition that a plurality of matched candidate characters exist in the first text, the position of the character which is really needed to be identified by the user can be accurately and conveniently located through the implementation mode.
Optionally, before receiving the second input of the user, the method further includes:
identifying the L groups of candidate words;
the receiving of the second input of the user comprises:
and receiving selection input of the L groups of candidate characters from a user.
That is, in this embodiment, after determining that K characters respectively located after each seventh character in the first text are candidate characters, and obtaining L groups of candidate characters, the L groups of candidate characters may be identified in the first text, for example, the L groups of candidate characters are highlighted, or the L groups of candidate characters are displayed with a specific color, so as to visually prompt a user of a position where the candidate character to be selected is located, and the user may perform selection input on the identified L groups of candidate characters, for example, click on a group of candidate characters to be recognized, or click on a screen of the electronic device for a corresponding number of times after determining that the group of candidate characters to be recognized is a few groups in a front-to-back order, so as to trigger the electronic device to determine that the group of candidate characters is the character to be recognized.
After determining L seventh words in the first text, the L seventh words may also be identified, and after determining the number of words of the word to be recognized based on the user input, candidate words following each seventh word may be further identified.
Therefore, the position of the candidate character of the user can be visually prompted by identifying the L groups of candidate characters, and the user is helped to accurately and conveniently select the target character to be recognized.
For example, referring to fig. 4, the electronic device 20 displays a text "our Chinese characters, written down into a painting leaves a history of five thousand years, which is recognized by the world, our Chinese characters are all story at one time, Qiang honest and glaring at kneeling and lifting fire, our Chinese characters, wale 2227634337azu 35274g, whig 222277g, whig 3433277g, which is not recognized by the user in reading," wale 22276277g, azu 35354g ", then the user can read" our Chinese characters "by tapping the screen 8 times at a light point, the electronic device identifies display positions of three" our Chinese characters "and 8 characters after identifying each" hui yao "as candidate characters, at this time, the user can tap the screen 3 times at a light point, and can determine the candidate characters to be recognized as" wale 35763576353576303, and the electronic device can retrieve the text for the third group 3, i.e. wale 3522235763435763 times, after obtaining the pinyin, the pronunciation can be marked at the character position.
It should be noted that the above process of recognizing the target speech may be executed at the terminal side, or may be executed at the server side, if the process is executed at the terminal side, the technical process of acquiring the target speech and recognizing the target speech may be directly implemented by the terminal, and if the process is executed at the server side, the terminal may acquire the target speech uttered by the user, and send the target speech to the server for speech-to-text recognition and conversion.
Step 103, acquiring target information of the character to be recognized, wherein the target information comprises at least one of pronunciation and annotation.
After the character to be recognized is determined, target information of the character to be recognized, such as pronunciation, word sense annotation, and the like, can be directly acquired to help a user recognize or understand the character to be recognized.
The obtaining of the target information of the character to be recognized may specifically be searching information such as pronunciation, meaning and the like of the character to be recognized from a database, where the database may include data such as a chinese dictionary, an english-chinese dictionary and the like, or may be performing background networking search on the character to be recognized, and extracting information such as pronunciation, meaning and the like of the character to be recognized from a search result.
It should be noted that, when the target information of the character to be recognized is obtained, the target information of the character to be recognized may be directly identified in the first text, or the target information of the character to be recognized may be output in a voice prompt manner.
The information processing method of the embodiment of the invention comprises the steps of obtaining target voice, wherein the target voice comprises pronunciation of a first character in a first text displayed on electronic equipment; determining characters to be recognized in the first text according to the target voice, wherein the characters to be recognized are different from the first characters; acquiring target information of the character to be recognized, wherein the target information comprises at least one of pronunciation and annotation. Therefore, when a user reads by using the electronic equipment, if the user encounters unknown characters, the user can read the characters displayed on the equipment, the electronic equipment can be triggered to position the characters to be recognized according to the user's reading, and then information such as pronunciation or comments of the characters to be recognized is acquired. In addition, the embodiment of the invention is also suitable for the characters which are displayed on the interface and can not be selected and copied.
The embodiment of the invention also provides an information processing device. Referring to fig. 5, fig. 5 is a block diagram of an information processing apparatus according to an embodiment of the present invention. Because the principle of solving the problem of the information processing device is similar to the information processing method in the embodiment of the invention, the implementation of the information processing device can refer to the implementation of the method, and repeated details are not repeated.
As shown in fig. 5, the information processing apparatus 500 includes:
a first obtaining module 501, configured to obtain a target voice, where the target voice includes a pronunciation of a first word in a first text displayed on an electronic device;
a determining module 502, configured to determine, according to the target voice, a to-be-recognized word in the first text, where the to-be-recognized word is different from the first word;
a second obtaining module 503, configured to obtain target information of the text to be recognized, where the target information includes at least one of pronunciation and annotation.
Optionally, the target speech further includes pronunciation of a preset cue word;
the determination module 502 includes:
the first recognition submodule is used for recognizing the target voice to obtain a second text corresponding to the target voice;
the first determining submodule is used for determining second characters except the preset prompt words in the second text;
the second determining submodule is used for determining a target word and sentence matched with the second word in the first text;
and the third determining submodule is used for determining characters to be recognized in the target words and sentences, wherein the characters to be recognized are different from the second characters.
Optionally, the first determining sub-module is configured to segment the second text according to the preset cue word to obtain a third word before the preset cue word and a fourth word after the preset cue word in the second text, where the second word includes the third word and the fourth word;
the second determination submodule includes:
a first determining unit, configured to determine a fifth word in the first text that matches the third word, and a sixth word that matches the fourth word;
a second determining unit, configured to determine a target word sentence including the fifth word and the sixth word from the first text.
Optionally, the second determining unit includes:
a first determining subunit, configured to determine, when the number of the fifth words or the sixth words is greater than 1, a target fifth word and a target sixth word that are located at a minimum interval in the first text, where the target fifth word precedes the target sixth word;
and the second determining subunit is configured to determine a target word and sentence in the first text by using the target fifth character as a start word and the target sixth character as an end word.
Optionally, the third determining submodule is configured to determine, as a character to be recognized, a character in the target sentence, which is located between the target fifth character and the target sixth character.
Optionally, the determining module 502 includes:
the second recognition submodule is used for recognizing the target voice to obtain a third text corresponding to the target voice;
a fourth determining submodule, configured to determine a seventh word in the first text, where the seventh word matches the third text;
the receiving submodule is used for receiving a first input of a user;
the fifth determining submodule is used for responding to the first input and determining the number K of the characters to be recognized, wherein K is a positive integer;
and the sixth determining submodule is used for determining K characters behind the seventh character in the first text as the characters to be recognized.
Optionally, the receiving sub-module is configured to receive a tap input of a user on the screen of the electronic device;
and the fifth determining submodule is used for determining the knocking times K of the knocking input as the character word number to be recognized.
Optionally, the number of the seventh characters is L, and L is an integer greater than 1;
the sixth determination submodule includes:
a third determining unit, configured to determine that K characters respectively located after each seventh character in the first text are candidate characters, so as to obtain L groups of candidate characters;
a receiving unit for receiving a second input of the user;
and the fourth determining unit is used for responding to the second input, determining a target candidate character from the L groups of candidate characters and determining the target candidate character as the character to be recognized.
Optionally, the sixth determining sub-module further includes:
an identification unit, configured to identify the L groups of candidate characters;
the receiving unit is used for receiving the selection input of the L groups of candidate characters by the user.
The information processing apparatus provided in the embodiment of the present invention may implement the method embodiments described above, and the implementation principle and the technical effect are similar, which are not described herein again.
The information processing apparatus 500 of the embodiment of the present invention obtains a target voice, where the target voice includes a pronunciation of a first word in a first text displayed on an electronic device; determining characters to be recognized in the first text according to the target voice, wherein the characters to be recognized are different from the first characters; acquiring target information of the character to be recognized, wherein the target information comprises at least one of pronunciation and annotation. Therefore, when a user reads by using the electronic equipment, if the user encounters unknown characters, the user can read the characters displayed on the equipment, the electronic equipment can be triggered to position the characters to be recognized according to the user's reading, and then information such as pronunciation or comments of the characters to be recognized is acquired.
The embodiment of the invention also provides the electronic equipment. Because the principle of the electronic device for solving the problem is similar to the information processing method in the embodiment of the present invention, the implementation of the electronic device may refer to the implementation of the method, and repeated details are not described again. As shown in fig. 6, the electronic device according to the embodiment of the present invention includes: the processor 600, which is used to read the program in the memory 620, executes the following processes:
acquiring target voice, wherein the target voice comprises pronunciation of a first word in a first text displayed on electronic equipment;
determining characters to be recognized in the first text according to the target voice, wherein the characters to be recognized are different from the first characters;
acquiring target information of the character to be recognized, wherein the target information comprises at least one of pronunciation and annotation.
A transceiver 610 for receiving and transmitting data under the control of the processor 600.
Where in fig. 6, the bus architecture may include any number of interconnected buses and bridges, with various circuits being linked together, particularly one or more processors represented by processor 600 and memory represented by memory 620. The bus architecture may also link together various other circuits such as peripherals, voltage regulators, power management circuits, and the like, which are well known in the art, and therefore, will not be described any further herein. The bus interface provides an interface. The transceiver 610 may be a number of elements including a transmitter and a transceiver providing a means for communicating with various other apparatus over a transmission medium. For different user devices, the user interface 630 may also be an interface capable of interfacing with a desired device externally, including but not limited to a keypad, display, speaker, microphone, joystick, etc. The processor 600 is responsible for managing the bus architecture and general processing, and the memory 620 may store data used by the processor 600 in performing operations.
Optionally, the target speech further includes pronunciation of a preset cue word;
the processor 600 is also used to read the program in the memory 620 and execute the following steps:
identifying the target voice to obtain a second text corresponding to the target voice;
determining second characters in the second text except the preset prompt words;
determining a target word sentence matched with the second character in the first text;
and determining characters to be recognized in the target words and sentences, wherein the characters to be recognized are different from the second characters.
Optionally, the processor 600 is further configured to read the program in the memory 620, and perform the following steps:
according to the preset cue word, the second text is segmented to obtain a third character in the second text before the preset cue word and a fourth character after the preset cue word, wherein the second character comprises the third character and the fourth character;
determining a fifth word in the first text, which is matched with the third word, and a sixth word which is matched with the fourth word;
and determining a target word sentence comprising the fifth word and the sixth word from the first text.
Optionally, the processor 600 is further configured to read the program in the memory 620, and perform the following steps:
under the condition that the number of the fifth words or the sixth words is larger than 1, determining target fifth words and target sixth words with the smallest position intervals in the first text, wherein the target fifth words are before the target sixth words;
and determining a target word and sentence in the first text by taking the target fifth character as a starting word and the target sixth character as an ending word.
Optionally, the processor 600 is further configured to read the program in the memory 620, and perform the following steps:
and determining characters positioned between the target fifth character and the target sixth character in the target words and sentences as characters to be recognized.
Optionally, the processor 600 is further configured to read the program in the memory 620, and perform the following steps:
identifying the target voice to obtain a third text corresponding to the target voice;
determining a seventh word in the first text that matches the third text;
receiving a first input of a user;
responding to the first input, and determining the word number K of the character to be recognized, wherein K is a positive integer;
and determining K characters positioned after the seventh character in the first text as the characters to be recognized.
Optionally, the processor 600 is further configured to read the program in the memory 620, and perform the following steps:
receiving a tap input of a user on a screen of the electronic equipment;
and determining the knocking times K of the knocking input as the number of the characters to be recognized.
Optionally, the number of the seventh characters is L, and L is an integer greater than 1;
the processor 600 is also used to read the program in the memory 620 and execute the following steps:
determining K characters respectively positioned behind each seventh character in the first text as candidate characters to obtain L groups of candidate characters;
receiving a second input of the user;
and responding to the second input, determining a target candidate character from the L groups of candidate characters, and determining the target candidate character as the character to be recognized.
Optionally, the processor 600 is further configured to read the program in the memory 620, and perform the following steps:
identifying the L groups of candidate words;
and receiving selection input of the L groups of candidate characters from a user.
The electronic device provided by the embodiment of the present invention can execute the above method embodiments, and the implementation principle and technical effect are similar, which are not described herein again.
Furthermore, a computer-readable storage medium of an embodiment of the present invention stores a computer program executable by a processor to implement:
acquiring target voice, wherein the target voice comprises pronunciation of a first word in a first text displayed on electronic equipment;
determining characters to be recognized in the first text according to the target voice, wherein the characters to be recognized are different from the first characters;
acquiring target information of the character to be recognized, wherein the target information comprises at least one of pronunciation and annotation.
Optionally, the target speech further includes pronunciation of a preset cue word;
determining the characters to be recognized in the first text according to the target voice, wherein the determining comprises the following steps:
identifying the target voice to obtain a second text corresponding to the target voice;
determining second characters in the second text except the preset prompt words;
determining a target word sentence matched with the second character in the first text;
and determining characters to be recognized in the target words and sentences, wherein the characters to be recognized are different from the second characters.
Optionally, the determining second words in the second text except for the preset cue word includes:
according to the preset cue word, the second text is segmented to obtain a third character in the second text before the preset cue word and a fourth character after the preset cue word, wherein the second character comprises the third character and the fourth character;
the determining the target words and phrases in the first text that match the second words includes:
determining a fifth word in the first text, which is matched with the third word, and a sixth word which is matched with the fourth word;
and determining a target word sentence comprising the fifth word and the sixth word from the first text.
Optionally, the determining a target word sentence including the fifth word and the sixth word from the first text includes:
under the condition that the number of the fifth words or the sixth words is larger than 1, determining target fifth words and target sixth words with the smallest position intervals in the first text, wherein the target fifth words are before the target sixth words;
and determining a target word and sentence in the first text by taking the target fifth character as a starting word and the target sixth character as an ending word.
Optionally, the determining the characters to be recognized in the target words and sentences includes:
and determining characters positioned between the target fifth character and the target sixth character in the target words and sentences as characters to be recognized.
Optionally, the determining, according to the target speech, a character to be recognized in the first text includes:
identifying the target voice to obtain a third text corresponding to the target voice;
determining a seventh word in the first text that matches the third text;
receiving a first input of a user;
responding to the first input, and determining the word number K of the character to be recognized, wherein K is a positive integer;
and determining K characters positioned after the seventh character in the first text as the characters to be recognized.
Optionally, the receiving a first input of the user includes:
receiving a tap input of a user on a screen of the electronic equipment;
the responding to the first input, and determining the number K of the character words to be recognized comprises the following steps:
and determining the knocking times K of the knocking input as the number of the characters to be recognized.
Optionally, the number of the seventh characters is L, and L is an integer greater than 1;
the determining that K characters located after the seventh character in the first text are the characters to be recognized includes:
determining K characters respectively positioned behind each seventh character in the first text as candidate characters to obtain L groups of candidate characters;
receiving a second input of the user;
and responding to the second input, determining a target candidate character from the L groups of candidate characters, and determining the target candidate character as the character to be recognized.
Optionally, before receiving the second input of the user, the method further includes:
identifying the L groups of candidate words;
the receiving of the second input of the user comprises:
and receiving selection input of the L groups of candidate characters from a user.
In the several embodiments provided in the present application, it should be understood that the disclosed method and apparatus may be implemented in other ways. For example, the above-described apparatus embodiments are merely illustrative, and for example, the division of the units is only one logical division, and other divisions may be realized in practice, for example, a plurality of units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, devices or units, and may be in an electrical, mechanical or other form.
In addition, functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may be physically included alone, or two or more units may be integrated into one unit. The integrated unit can be realized in a form of hardware, or in a form of hardware plus a software functional unit.
The integrated unit implemented in the form of a software functional unit may be stored in a computer readable storage medium. The software functional unit is stored in a storage medium and includes several instructions to enable a computer device (which may be a personal computer, a server, or a network device) to execute some steps of the transceiving method according to various embodiments of the present invention. And the aforementioned storage medium includes: various media capable of storing program codes, such as a usb disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk, or an optical disk.
While the foregoing is directed to the preferred embodiment of the present invention, it will be understood by those skilled in the art that various changes and modifications may be made without departing from the spirit and scope of the invention as defined in the appended claims.

Claims (12)

1.一种信息处理方法,其特征在于,包括:1. an information processing method, is characterized in that, comprises: 获取目标语音,其中,所述目标语音包括对电子设备上显示的第一文本中第一文字的读音;obtaining a target voice, wherein the target voice includes the pronunciation of the first character in the first text displayed on the electronic device; 根据所述目标语音,确定所述第一文本中的待识别文字,所述待识别文字与所述第一文字不同;According to the target voice, determine the text to be recognized in the first text, where the text to be recognized is different from the first text; 获取所述待识别文字的目标信息,其中,所述目标信息包括发音和注释中的至少一项。Obtain target information of the text to be recognized, wherein the target information includes at least one of pronunciation and annotation. 2.根据权利要求1所述的方法,其特征在于,所述目标语音中还包括对预设提示词的读音;2. method according to claim 1, is characterized in that, also comprises the pronunciation to preset prompt word in described target speech; 所述根据所述目标语音,确定所述第一文本中的待识别文字,包括:The determining the text to be recognized in the first text according to the target voice includes: 识别所述目标语音,得到所述目标语音对应的第二文本;Recognizing the target voice to obtain a second text corresponding to the target voice; 确定所述第二文本中除所述预设提示词外的第二文字;determining the second text in the second text except the preset prompt word; 确定所述第一文本中与所述第二文字匹配的目标词句;determining a target phrase in the first text that matches the second text; 确定所述目标词句中的待识别文字,其中,所述待识别文字与所述第二文字不同。Determine the text to be recognized in the target phrase, wherein the text to be recognized is different from the second text. 3.根据权利要求2所述的方法,其特征在于,所述确定所述第二文本中除所述预设提示词外的第二文字,包括:3. The method according to claim 2, wherein the determining the second text in the second text except the preset prompt word comprises: 按照所述预设提示词,对所述第二文本进行分割,得到所述第二文本中所述预设提示词之前的第三文字,以及所述预设提示词之后的第四文字,其中,所述第二文字包含所述第三文字和所述第四文字;According to the preset prompt word, the second text is segmented to obtain the third text before the preset prompt word and the fourth text after the preset prompt word in the second text, wherein , the second character includes the third character and the fourth character; 所述确定所述第一文本中与所述第二文字匹配的目标词句,包括:The determining of the target words and sentences in the first text that matches the second text includes: 确定所述第一文本中与所述第三文字匹配的第五文字,以及与所述第四文字匹配的第六文字;determining a fifth character in the first text that matches the third character, and a sixth character that matches the fourth character; 从所述第一文本中确定包括所述第五文字和所述第六文字的目标词句。A target phrase including the fifth script and the sixth script is determined from the first text. 4.根据权利要求3所述的方法,其特征在于,所述从所述第一文本中确定包括所述第五文字和所述第六文字的目标词句,包括:4. The method according to claim 3, wherein the determining the target words and sentences including the fifth character and the sixth character from the first text comprises: 在所述第五文字或所述第六文字的数量大于1的情况下,确定所述第一文本中位置间隔最小的目标第五文字和目标第六文字,其中,所述目标第五文字在所述目标第六文字之前;When the number of the fifth character or the sixth character is greater than 1, determine the target fifth character and the target sixth character with the smallest position interval in the first text, wherein the target fifth character is in before the sixth word of the target; 在所述第一文本中以所述目标第五文字为起始词,以所述目标第六文字为结束词,确定目标词句。In the first text, the target fifth character is used as a starting word, and the target sixth character is used as an ending word to determine a target sentence. 5.根据权利要求4所述的方法,其特征在于,所述确定所述目标词句中的待识别文字,包括:5. The method according to claim 4, wherein the determining the text to be recognized in the target word and sentence comprises: 将所述目标词句中位于所述目标第五文字与所述目标第六文字之间的文字确定为待识别文字。A character in the target phrase located between the target fifth character and the target sixth character is determined as the character to be recognized. 6.根据权利要求1所述的方法,其特征在于,所述根据所述目标语音,确定所述第一文本中的待识别文字,包括:6. The method according to claim 1, wherein the determining the text to be recognized in the first text according to the target voice comprises: 识别所述目标语音,得到所述目标语音对应的第三文本;Recognizing the target voice to obtain a third text corresponding to the target voice; 确定所述第一文本中与所述第三文本匹配的第七文字;determining a seventh text in the first text that matches the third text; 接收用户的第一输入;receiving the first input from the user; 响应于所述第一输入,确定待识别文字字数K,K为正整数;In response to the first input, determine the number of characters K to be recognized, where K is a positive integer; 确定所述第一文本中位于所述第七文字之后的K个文字为所述待识别文字。It is determined that K characters located after the seventh character in the first text are the characters to be recognized. 7.根据权利要求6所述的方法,其特征在于,所述接收用户的第一输入,包括:7. The method according to claim 6, wherein the receiving the first input from the user comprises: 接收用户在所述电子设备屏幕上的敲击输入;receiving a user's tap input on the screen of the electronic device; 所述响应于所述第一输入,确定待识别文字字数K,包括:The determining the number K of characters to be recognized in response to the first input includes: 将所述敲击输入的敲击次数K确定为待识别文字字数。The number of taps K of the tap input is determined as the number of characters to be recognized. 8.根据权利要求6所述的方法,其特征在于,所述第七文字的数量为L,L为大于1的整数;8. The method according to claim 6, wherein the number of the seventh character is L, and L is an integer greater than 1; 所述确定所述第一文本中位于所述第七文字之后的K个文字为所述待识别文字,包括:The determining that the K characters after the seventh character in the first text are the characters to be recognized, including: 确定所述第一文本中分别位于每个第七文字之后的K个文字为候选文字,得到L组候选文字;It is determined that the K characters behind each seventh character in the first text are candidate characters, and L groups of candidate characters are obtained; 接收用户的第二输入;receiving a second input from the user; 响应于所述第二输入,从所述L组候选文字中确定目标候选文字,并确定所述目标候选文字为所述待识别文字。In response to the second input, a target candidate character is determined from the L groups of candidate characters, and the target candidate character is determined as the to-be-recognized character. 9.根据权利要求8所述的方法,其特征在于,所述接收用户的第二输入之前,所述方法还包括:9. The method according to claim 8, wherein before the receiving the second input from the user, the method further comprises: 标识所述L组候选文字;identifying the L groups of candidate characters; 所述接收用户的第二输入,包括:The receiving the user's second input includes: 接收用户对所述L组候选文字的选择输入。A user's selection input for the L groups of candidate characters is received. 10.一种信息处理装置,其特征在于,包括:10. An information processing device, comprising: 第一获取模块,用于获取目标语音,其中,所述目标语音包括对电子设备上显示的第一文本中第一文字的读音;a first acquisition module, configured to acquire a target voice, wherein the target voice includes the pronunciation of the first text in the first text displayed on the electronic device; 确定模块,用于根据所述目标语音,确定所述第一文本中的待识别文字,所述待识别文字与所述第一文字不同;a determining module, configured to determine the text to be recognized in the first text according to the target voice, where the text to be recognized is different from the first text; 第二获取模块,用于获取所述待识别文字的目标信息,其中,所述目标信息包括发音和注释中的至少一项。The second acquiring module is configured to acquire target information of the text to be recognized, wherein the target information includes at least one of pronunciation and annotation. 11.一种电子设备,包括:收发机、存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机程序;其特征在于,所述处理器,用于读取存储器中的程序实现如权利要求1至9中任一项所述的信息处理方法中的步骤。11. An electronic device, comprising: a transceiver, a memory, a processor, and a computer program stored on the memory and running on the processor; characterized in that, the processor is configured to read the memory The program in the implements the steps in the information processing method as claimed in any one of claims 1 to 9. 12.一种计算机可读存储介质,用于存储计算机程序,其特征在于,所述计算机程序被处理器执行时实现如权利要求1至9中任一项所述的信息处理方法中的步骤。12. A computer-readable storage medium for storing a computer program, wherein when the computer program is executed by a processor, the steps in the information processing method according to any one of claims 1 to 9 are implemented.
CN202110587658.7A 2021-05-27 2021-05-27 Information processing method and device and electronic equipment Active CN113268981B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110587658.7A CN113268981B (en) 2021-05-27 2021-05-27 Information processing method and device and electronic equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110587658.7A CN113268981B (en) 2021-05-27 2021-05-27 Information processing method and device and electronic equipment

Publications (2)

Publication Number Publication Date
CN113268981A true CN113268981A (en) 2021-08-17
CN113268981B CN113268981B (en) 2023-04-28

Family

ID=77233505

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110587658.7A Active CN113268981B (en) 2021-05-27 2021-05-27 Information processing method and device and electronic equipment

Country Status (1)

Country Link
CN (1) CN113268981B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114780793A (en) * 2022-04-29 2022-07-22 咪咕数字传媒有限公司 Information labeling method and device, terminal equipment and storage medium
CN115171653A (en) * 2022-05-25 2022-10-11 深圳市世强元件网络有限公司 Rarely-used character pronunciation reminding method and device and computer equipment
CN116052671A (en) * 2022-11-21 2023-05-02 深圳市东象设计有限公司 Intelligent translator and translation method

Citations (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003316384A (en) * 2002-04-24 2003-11-07 Nippon Hoso Kyokai <Nhk> REAL-TIME CHARACTER MODIFICATION DEVICE AND METHOD, PROGRAM, AND STORAGE MEDIUM
JP2013182256A (en) * 2012-03-05 2013-09-12 Toshiba Corp Voice synthesis system and voice conversion support device
CN104123931A (en) * 2013-04-26 2014-10-29 纬创资通股份有限公司 Language learning method and apparatus and computer readable recording medium
CN104346084A (en) * 2013-07-24 2015-02-11 腾讯科技(深圳)有限公司 Word and phrase inputting method and device
KR20160017625A (en) * 2014-08-06 2016-02-16 주식회사 엘지화학 Method for changing contents of character data into transmitter's voice and outputting the transmiter's voice
CN105657146A (en) * 2015-05-28 2016-06-08 宇龙计算机通信科技(深圳)有限公司 Call information reminding method and device
CN108364651A (en) * 2017-01-26 2018-08-03 三星电子株式会社 Audio recognition method and equipment
CN109671309A (en) * 2018-12-12 2019-04-23 广东小天才科技有限公司 Wrong pronunciation identification method and electronic equipment
CN110718226A (en) * 2019-09-19 2020-01-21 厦门快商通科技股份有限公司 Speech recognition result processing method and device, electronic equipment and medium
CN110782885A (en) * 2019-09-29 2020-02-11 深圳和而泰家居在线网络科技有限公司 Voice text correction method and device, computer equipment and computer storage medium
CN111128186A (en) * 2019-12-30 2020-05-08 云知声智能科技股份有限公司 Multi-phonetic-character phonetic transcription method and device
CN111128185A (en) * 2019-12-25 2020-05-08 北京声智科技有限公司 Method, device, terminal and storage medium for converting voice into characters
CN112309389A (en) * 2020-03-02 2021-02-02 北京字节跳动网络技术有限公司 Information interaction method and device
CN112735428A (en) * 2020-12-27 2021-04-30 科大讯飞(上海)科技有限公司 Hot word acquisition method, voice recognition method and related equipment
CN112818089A (en) * 2021-02-23 2021-05-18 掌阅科技股份有限公司 Text phonetic notation method, electronic equipment and storage medium

Patent Citations (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003316384A (en) * 2002-04-24 2003-11-07 Nippon Hoso Kyokai <Nhk> REAL-TIME CHARACTER MODIFICATION DEVICE AND METHOD, PROGRAM, AND STORAGE MEDIUM
JP2013182256A (en) * 2012-03-05 2013-09-12 Toshiba Corp Voice synthesis system and voice conversion support device
CN104123931A (en) * 2013-04-26 2014-10-29 纬创资通股份有限公司 Language learning method and apparatus and computer readable recording medium
CN104346084A (en) * 2013-07-24 2015-02-11 腾讯科技(深圳)有限公司 Word and phrase inputting method and device
KR20160017625A (en) * 2014-08-06 2016-02-16 주식회사 엘지화학 Method for changing contents of character data into transmitter's voice and outputting the transmiter's voice
CN105657146A (en) * 2015-05-28 2016-06-08 宇龙计算机通信科技(深圳)有限公司 Call information reminding method and device
CN108364651A (en) * 2017-01-26 2018-08-03 三星电子株式会社 Audio recognition method and equipment
CN109671309A (en) * 2018-12-12 2019-04-23 广东小天才科技有限公司 Wrong pronunciation identification method and electronic equipment
CN110718226A (en) * 2019-09-19 2020-01-21 厦门快商通科技股份有限公司 Speech recognition result processing method and device, electronic equipment and medium
CN110782885A (en) * 2019-09-29 2020-02-11 深圳和而泰家居在线网络科技有限公司 Voice text correction method and device, computer equipment and computer storage medium
CN111128185A (en) * 2019-12-25 2020-05-08 北京声智科技有限公司 Method, device, terminal and storage medium for converting voice into characters
CN111128186A (en) * 2019-12-30 2020-05-08 云知声智能科技股份有限公司 Multi-phonetic-character phonetic transcription method and device
CN112309389A (en) * 2020-03-02 2021-02-02 北京字节跳动网络技术有限公司 Information interaction method and device
CN112735428A (en) * 2020-12-27 2021-04-30 科大讯飞(上海)科技有限公司 Hot word acquisition method, voice recognition method and related equipment
CN112818089A (en) * 2021-02-23 2021-05-18 掌阅科技股份有限公司 Text phonetic notation method, electronic equipment and storage medium

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114780793A (en) * 2022-04-29 2022-07-22 咪咕数字传媒有限公司 Information labeling method and device, terminal equipment and storage medium
CN114780793B (en) * 2022-04-29 2024-08-27 咪咕数字传媒有限公司 Information labeling method, device, terminal equipment and storage medium
CN115171653A (en) * 2022-05-25 2022-10-11 深圳市世强元件网络有限公司 Rarely-used character pronunciation reminding method and device and computer equipment
CN116052671A (en) * 2022-11-21 2023-05-02 深圳市东象设计有限公司 Intelligent translator and translation method

Also Published As

Publication number Publication date
CN113268981B (en) 2023-04-28

Similar Documents

Publication Publication Date Title
CN107016994B (en) Voice recognition method and device
US8504350B2 (en) User-interactive automatic translation device and method for mobile device
US9484034B2 (en) Voice conversation support apparatus, voice conversation support method, and computer readable medium
JP3962763B2 (en) Dialogue support device
US9548052B2 (en) Ebook interaction using speech recognition
TWI224771B (en) Speech recognition device and method using di-phone model to realize the mixed-multi-lingual global phoneme
US10872207B2 (en) Determining translation similarity of reverse translations for a plurality of languages
CN105869640B (en) Method and device for recognizing voice control instruction aiming at entity in current page
CN113268981B (en) Information processing method and device and electronic equipment
JP2006190006A5 (en)
JP2015026057A (en) Interactive character based foreign language learning device and method
CN1742273A (en) Multimodal speech-to-speech language translation and display
US20150073801A1 (en) Apparatus and method for selecting a control object by voice recognition
US20070288240A1 (en) User interface for text-to-phone conversion and method for correcting the same
CN113743102B (en) Method and device for recognizing characters and electronic equipment
CN112395863B (en) Text processing method and device
WO2014181508A1 (en) Information processing apparatus, information processing method, and program
CN107797676B (en) Single character input method and device
CN103680503A (en) Semantic identification method
CN111914533B (en) Method and system for analyzing English long sentence
US10133920B2 (en) OCR through voice recognition
CN113722447B (en) Voice search method based on multi-strategy matching
TWI782436B (en) Display system and method of interacting with the same
KR20160106363A (en) Smart lecture system and method
CN1275174C (en) Chinese input method and system with voice recognition auxiliary function

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant