WO2019006792A1

WO2019006792A1 - Voice-controlled education method and system, mobile terminal, and storage medium

Info

Publication number: WO2019006792A1
Application number: PCT/CN2017/094486
Authority: WO
Inventors: 袁晖; 李凝华
Original assignee: 深圳市科迈爱康科技有限公司
Priority date: 2017-07-07
Filing date: 2017-07-26
Publication date: 2019-01-10
Also published as: CN107463626A

Abstract

A voice-controlled education method and system, a mobile terminal, and a storage medium. The method comprises: detecting voice in a current environment to obtain current education voice (S10); performing keyword recognition on the current education voice to obtain a target keyword in the current education voice (S20); determining the category of the target keyword (S30); and searching for, when the category of the target keyword is a first category, an audio file corresponding to the target keyword, and playing the found audio file (S40). Whereby, an infant or young child can more intuitively feel a sound scene corresponding to the specific object in education voice content, so that the intelligence of the infant or young child is developed, and the interest in learning is improved.

Description

Voice-activated education method, mobile terminal, system and storage medium

Technical field

The present invention relates to the field of education, and in particular, to a voice-activated educational method, a mobile terminal, a system, and a storage medium.

Background technique

The education and intellectual development of infants and children is a very important part of the growth process, and it is also a part of parents' concern. According to a large number of studies, prematurely letting infants and children watch TV may hinder the intellectual development of infants and young children. Because watching TV too early is one of the important reasons why more and more children can't concentrate for a long time.

Infants and toddlers watch TV differently from adults watching TV. Adults watch TV and can understand the pictures before and after and the logical relationship between the pictures. Each picture constitutes a logical story. Infants under the age of two are completely different. The baby has just been born without the same thinking as an adult. There are only a few unconditional launches, such as foraging reflections, sucking reflections, and gripping reflections. The development of the baby's thinking in this age group is called the “sensory movement period”. In the past two years, infants and young children have mainly learned and recognized the world through their senses of hearing, sight, touch, and hands. The thinking at this stage is intuitive action thinking. That is to say, infants and young children mainly carry out specific and direct thinking in perceptive actions. Therefore, watching TV for infants under the age of two is a fast-flashing picture. They cannot organize these pictures into a continuous and meaningful story. Because the memory and understanding of infants under the age of two are not enough for them to remember and judge the relationship between the pictures before and after, so let them watch TV too much too early, for them, the only information that is accepted is chaotic.

Existing educational methods or educational systems, as well as educational materials for infants and young children's intellectual development, mostly educate infants and young children by listening to adults reading or playing videos, but adult reading or video playback is only flat, not for infants. When children hear stereoscopic sounds, they cannot effectively develop the intelligence of infants and young children.

The above content is only used to assist in understanding the technical solutions of the present invention, and does not constitute an admission that the above is prior art.

Summary of the invention

The main object of the present invention is to provide a voice-activated educational method, a mobile terminal, a system, and a storage medium, which aim to solve the problem that the prior art cannot allow a baby to hear a stereoscopic sound, and cannot effectively develop the intelligence of an infant.

To achieve the above object, the present invention provides a voice-activated educational method, the method comprising the steps of:

Detecting the voice in the current environment and obtaining the current educational voice;

Performing keyword recognition on the current educational voice to obtain a target keyword in the current educational voice;

Determining a category of the target keyword;

When the belonging category of the target keyword is the first category, the audio file corresponding to the target keyword is searched, and the found audio file is played.

Preferably, after the determining the category of the target keyword, the method further includes:

When the category to which the target keyword belongs is the second category, obtaining an adjustment instruction corresponding to the target keyword, and synchronously adjusting the current environment according to the adjustment instruction, so that the current environment and the target keyword are Corresponding.

Preferably, the adjustment instruction comprises at least one of a temperature adjustment instruction, a humidity adjustment instruction, a brightness adjustment instruction or an odor adjustment instruction.

Preferably, before the detecting the voice in the current environment and obtaining the current educational voice, the method further includes: when the education mode selected by the user is the voice playing mode, acquiring the educational voice to be played selected by the user, and It is said that the educational voice is played for playback.

Preferably, before the detecting the voice in the current environment to obtain the current educational voice, the method further includes: when the education mode selected by the user is the personal reading mode, performing the detecting the voice in the current environment, Get the steps to the current educational voice.

Preferably, the method further includes: receiving a play control instruction sent by the user, and performing a corresponding operation according to the play control instruction, where the play control command includes: a volume adjustment instruction, a sound effect adjustment instruction, or a speech speed adjustment instruction at least one.

Preferably, the method further includes: receiving interaction information sent by the user, and searching for corresponding user preference information according to the interaction information, where the user preference information includes: at least one of a favorite sound effect, a favorite educational content, or a favorite scene; The interaction information and the found user preference information adjust the current environment.

In addition, in order to achieve the above object, the present invention further provides a mobile terminal, the mobile terminal comprising: a memory, a processor, and a voice-activated educational program stored on the memory and operable on the processor, The voice-activated educational program is configured to implement the steps of the voice-activated educational method as described above.

In addition, in order to achieve the above object, the present invention further provides a voice-activated education system, the education system comprising: the mobile terminal, the playback device, and the adjustment device described above; wherein the playback device is configured to be in the process The audio and video files are played under control; the adjustment device is configured to adjust the current environment under the control of the processor.

In addition, in order to achieve the above object, the present invention also provides a storage medium on which a voice-activated educational program is stored, and when the voice-activated educational program is executed by a processor, the voice-activated educational method as described above is implemented. step.

The present invention obtains a current educational voice by detecting a voice in a current environment; performing keyword recognition on the current educational voice to obtain a target keyword in the current educational voice; and determining a category of the target keyword; When the belonging category of the target keyword is the first category, the audio file corresponding to the target keyword is searched, and the found audio file is played, so that the educational voice in the current environment is stereoscopically presented. Infants and young children can more intuitively feel the sound scene corresponding to the specific things in the educational voice content, and then effectively develop the intelligence of infants and children and improve their interest in learning.

DRAWINGS

1 is a schematic structural diagram of a mobile terminal in a hardware operating environment according to an embodiment of the present invention;

2 is a schematic flow chart of a first embodiment of a voice-activated education method according to the present invention;

3 is a schematic flow chart of a second embodiment of a voice-activated education method according to the present invention;

4 is a schematic flow chart of a third embodiment of a voice-activated education method according to the present invention;

FIG. 5 is a schematic flow chart of a fourth embodiment of a voice-activated education method according to the present invention.

The implementation, functional features, and advantages of the present invention will be further described in conjunction with the embodiments.

Detailed ways

It is understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.

Referring to FIG. 1, FIG. 1 is a schematic structural diagram of a mobile terminal in a hardware operating environment according to an embodiment of the present invention.

As shown in FIG. 1, the mobile terminal may include a processor 1001, such as a CPU, a communication bus 1002, a user interface 1003, a network interface 1004, a memory 1005, and a sound collector 1006. Among them, the communication bus 1002 is used to implement connection communication between these components. The user interface 1003 can include a display, an input unit such as a keyboard, and the optional user interface 1003 can also include a standard wired interface, a wireless interface. The network interface 1004 can optionally include a standard wired interface, a wireless interface (such as a WI-FI interface). The memory 1005 may be a high speed RAM memory or a stable memory (non-volatile) Memory), such as disk storage. The memory 1005 can also optionally be a storage device independent of the aforementioned processor 1001.

It will be understood by those skilled in the art that the mobile terminal structure shown in FIG. 1 does not constitute a limitation of the mobile terminal, and may include more or less components than those illustrated, or combine some components, or different component arrangements.

As shown in FIG. 1, the memory 1005 as a storage medium may include an operating system, a data storage module, a network communication module, a user interface module, and a voice-activated educational program.

The mobile terminal may be a mobile terminal that can implement voice collection or detection and program running, for example, a smart phone, a tablet computer or a notebook computer, etc., which is not limited in this embodiment.

In the mobile terminal shown in FIG. 1 , the network interface 1004 is mainly used for data communication with the background server; the sound collector 1006 is configured to collect or detect the current voice; the user interface 1003 is mainly used for data interaction with the user; The processor 1001 and the memory 1005 in the mobile terminal may be disposed in the mobile terminal, and the mobile terminal invokes the voice-activated educational program stored in the memory 1005 through the processor 1001, and performs the following operations:

Determining a category of the target keyword;

Further, the processor 1001 can call the voice-activated educational program stored in the memory 1005, and also performs the following operations:

When the education mode selected by the user is the voice play mode, the educational voice to be played selected by the user is acquired, and the educational voice to be played is played.

When the education mode selected by the user is the personal reading mode, the detecting the voice in the current environment is performed to obtain the operation of the current educational voice.

Receiving a play control command sent by the user, and performing a corresponding operation according to the play control command, the play control command comprising: at least one of a volume adjustment instruction, a sound effect adjustment instruction, or a speech rate adjustment instruction.

Receiving the interaction information sent by the user, and searching corresponding user preference information according to the interaction information, where the user preference information includes: at least one of a favorite sound effect, a favorite educational content, or a favorite scene; according to the interaction information and the found user Preferences, adjust the current environment.

The beneficial effects of the embodiment are: obtaining the current educational voice by detecting the voice in the current environment; performing keyword recognition on the current educational voice, obtaining a target keyword in the current educational voice; determining the target The belonging category of the keyword; when the belonging category of the target keyword is the first category, the audio file corresponding to the target keyword is searched, and the found audio file is played. Therefore, the educational voice in the current environment can be stereoscopically presented, so that the infant can more intuitively feel the sound scene corresponding to the specific things in the educational voice content, thereby effectively developing the intelligence of the infant and the child, and improving the interest in learning.

Based on the above hardware structure, an embodiment of the voice-activated education method of the present invention is proposed.

Referring to FIG. 2, FIG. 2 is a schematic flow chart of a first embodiment of a voice-activated education method according to the present invention.

In this embodiment, the method includes the following steps:

S10: detecting the voice in the current environment, and obtaining the current educational voice;

It should be noted that the execution subject of the method in this embodiment is a mobile terminal, and the mobile terminal may be a mobile terminal capable of implementing voice collection or detection and program running, for example, a smart phone, a tablet computer or a notebook computer, etc. This example does not limit this.

It is to be understood that the current environment may be a place where the voice-activated education can be implemented, for example, a children's room, a kindergarten classroom, and the like, which is not limited in this embodiment.

It should be noted that the detecting the voice in the current environment to obtain the current educational voice; mainly detecting and judging the voice in the collected current environment, and removing the noise voice that is not the educational voice, for example, in the current environment. The speech rate is gentle, the rhythm is strong, the volume is moderate, the voice is rich in magnetic voice or sentence coherence, the words are clear, the recognition is high, and the speech with long duration is judged as educational voice. The specific judgment rules can be set according to the actual situation. This embodiment does not limit this.

S20: performing keyword recognition on the current educational voice to obtain a target keyword in the current educational voice;

It should be noted that not all the information contained in the educational speech or the meaning expressed need to be presented to the infants in the current environment. Therefore, in the process of obtaining the current educational speech, the keywords of the current educational speech need to be keywords. Identification, extracting the words that need to be presented as the target keyword.

S30: determining a category of the target keyword;

It should be noted that determining the category of the target keyword may be determining a category to which the target keyword belongs according to a preset keyword classification table, for example, presetting a keyword table, where the keyword classification table includes Various different categories of keywords, for example, the first category keywords "whistle", "water flow", "bird call", etc. representing the sound category, the vocabulary specifically included in the preset keyword classification table and the category to which the vocabulary belongs It can be set according to actual conditions, and this embodiment does not limit this.

In a specific implementation, whether there is a preset keyword corresponding to the target keyword classification table in the preset keyword classification table, if yes, the target keyword category is determined as the corresponding preset keyword. The category, for example, the current target keyword is “Bird Call”, and the preset keyword classification table is used to find whether there is a preset keyword corresponding to “Bird Call”. If it exists, and the category is the first category, then The category to which the currently acquired target keyword "Bird Call" belongs is determined as the first category. It should be noted that the corresponding preset keyword may be a keyword that is similar to or the same as the target keyword, such as “bird call”, “bird song”, etc., and the specific corresponding rule may be set by itself, this embodiment There is no restriction on this.

S40: When the belonging category of the target keyword is the first category, search for an audio file corresponding to the target keyword, and play the found audio file.

It should be noted that, in this embodiment, the keyword category representing the voice is preset to the first category. In the actual situation, the division of the specific category of each keyword may be set according to actual conditions, which is not limited in this embodiment. .

It can be understood that, in order to present the stereoscopic representation of the voice of the child while learning or listening to the educational voice, it is necessary to acquire the audio file corresponding to the target keyword immediately when acquiring the target keyword, and perform the audio file on the audio file. Play. Therefore, in order to realize fast and accurate search of the audio file, a mapping relationship between the preset keyword and the audio file corresponding to the preset keyword may be established in advance, so that when the category of the target keyword is confirmed, the search may be performed by And the preset keyword corresponding to the target keyword, and then immediately obtaining an audio file corresponding to the preset keyword according to the mapping relationship, and playing the audio file to implement synchronous teaching.

In a specific implementation, the embodiment first detects and determines the voice in the current environment according to the preset judgment condition, removes the non-educational voice in the current environment, and then obtains the current educational voice, and then performs the foregoing according to the preset keyword table. The current educational voice performs keyword recognition, obtains the target keyword in the current educational voice and the category to which the target keyword belongs, and after determining the category to which the target keyword belongs, acquires the corresponding audio file according to the mapping relationship, and plays the audio. The file stereoscopically plays the live sound corresponding to the target keyword in the current educational voice. For example, in the kindergarten classroom, the teacher read the following sentence in a slow speech: "In the early morning jungle trail, the green trees obscured the bright sunshine, the breeze blew, the cool air was refreshing, People feel that everything is so quiet and beautiful. When confirming that the text read by the teacher is an educational voice, immediately identify the key words. When the target keyword "bird call" is obtained, immediately obtain and The audio file corresponding to the target keyword "Bird Call" is played and the audio file is played. At this time, the kindergarten students who are listening carefully to the teacher will hear the teacher hear the "bird call" and hear the sound simultaneously. The bird screams, vividly combining the text message “Bird Call” received by the brain with the sound of the bird heard.

The voice-activated education method provided in this embodiment obtains a current educational voice by detecting a voice in a current environment; performing keyword recognition on the current educational voice to obtain a target keyword in the current educational voice; When the category of the target keyword is the first category, the audio file corresponding to the target keyword is searched for, and the found audio file is played. Therefore, the educational voice in the current environment can be stereoscopically presented, so that the infant can more intuitively feel the sound scene corresponding to the specific things in the educational voice content, thereby effectively developing the intelligence of the infant and the child, and improving the interest in learning.

In order to better present a story scene of audio and video education content for infants and toddlers, the method further includes:

It should be noted that the user may be a teaching user who educates the infant, such as a parent or a teacher, or an educated user, such as an infant or a child.

It can be understood that during the synchronous playback of the target keyword in the current educational voice, the user may adjust the playing sound effect or the volume level according to his own needs or preferences, so the mobile terminal receives the playback control command sent by the user. Immediately perform the corresponding operation to improve the user experience. For example, during the teaching process, the user feels that the playing sound is too large, and the volume needs to be lowered. After receiving the volume down command sent by the user, the mobile terminal reduces the playing volume according to the instruction. User target volume.

In order to improve the experience of infants and young children in learning and educating speech, and further improving the interest in learning, the method further includes:

It should be noted that the interaction information may be information that is sent by the user when interacting with the mobile terminal, for example, the communication voice sent by the infant when interacting with the mobile terminal, or the infant to the current environment. When the user feels uncomfortable, the control voice of the current environment is adjusted, or the multimedia file transmitted by the educated user through the personal device, and the specific category of the interactive information may be set according to actual needs, and the embodiment does not limit this. .

It should be noted that, in order to implement the teaching of the educated user, the mobile terminal may pre-establish a personalized account corresponding to the educated user, and the personalized account may include user-friendly sound type information, such as “soft type”; Educational content category information, such as "Zhang Ailing's prose", "Zheng Yuanjie's fairy tale"; favorite scenes, etc., at the same time, the mobile terminal can use the voice features of the educated user or the personal device with the fixed logo used by the user The personalized account is associated, that is, when the educated user sends the interactive voice or the interactive information, the mobile terminal can find the corresponding personalized account according to the interactive voice or the interaction information, obtain the user preference information, and combine the current interaction information, The environment is adjusted.

It can be understood that the mobile terminal can also record and analyze the interaction information sent by the educated user in the process of interacting with the user, and update and store the pre-stored user preferences according to the analysis result, for example, with age. The growth, the educated users like to listen to Zheng Yuanjie's fairy tales for a period of time, then the mobile terminal will give Zheng Yuanjie's fairy tale as an educational content to educated users when the educated user chooses educational content; for example, in a certain section In the time, the mobile terminal found that the educated users like to listen to Zhang Ailing's essays, and adjust the user's favorite educational content accordingly, and preferentially promote Zhang Ailing's essays as educational content to educated users.

In this embodiment, by receiving the playback control command and the interaction information sent by the user, the user's interest or preference is recorded and stored, and the interest or preference is updated according to the preference of the user in different time periods, so that the educated user is growing continuously. Learning knowledge in a pleasant and comfortable educational environment effectively develops the intelligence of educated people and increases their interest in learning.

Referring to FIG. 3, FIG. 3 is a schematic flowchart diagram of a second embodiment of a voice-activated education method according to the present invention.

In order to carry out audio education for infants and young children, in addition to playing the audio file, the infant can personally feel the live sound corresponding to the target keyword representing the sound, and can also use the keyword representing the natural environment in the current educational voice as the target keyword. And adjusting the current environment to create a scene corresponding to the keyword representing the natural environment, thereby allowing the infant to understand and learn the knowledge information contained in the current educational voice from the sense of smell and touch.

Therefore, in order to further effectively develop the intelligence of the infants and children and improve their interest in learning, after the step S30, the method further includes:

Step S50, when the category to which the target keyword belongs is the second category, acquiring an adjustment instruction corresponding to the target keyword, and synchronously adjusting the current environment according to the adjustment instruction, so that the current environment and the The target keyword corresponds.

It should be noted that, in this embodiment, the keyword category representing the natural environment may be preset to the second category, for example, the words “cold” and “hot” representing temperature, and the word “fragrance” representing odor, “Fragrance”, the category of keywords such as “Breeze”, “Daylight” and “Dark” that represent natural phenomena is preset to the second category. In the actual situation, the division of the specific categories of the keywords may be set according to actual conditions, and this embodiment does not limit this.

In order to reproduce a scene corresponding to a keyword representing a natural environment, the adjustment instruction includes at least one of a temperature adjustment instruction, a humidity adjustment instruction, a brightness adjustment instruction, or an odor adjustment instruction. The temperature adjustment command is used to adjust the ambient temperature of the current environment, the humidity adjustment information is used to adjust the ambient humidity of the current environment, the brightness adjustment information is used to adjust the brightness of the current environment, and the odor adjustment command is used to adjust the odor of the current environment.

In a specific implementation, when the acquired target keyword belongs to the second category, the mobile terminal immediately acquires the preset keyword corresponding to the target keyword, and obtains a corresponding adjustment instruction according to the preset keyword, and then according to the preset The adjustment instruction synchronously adjusts the current environment, so that the current environment corresponds to the target keyword, for example, immediately after the current educational voice refers to the target keyword “floral”, the scent adjustment instruction corresponding to “floral” is obtained immediately, And according to the odor adjustment instruction, the odor regulating device is controlled to emit a faint floral fragrance, so that the infant can truly feel the scene corresponding to the target keyword “flower”.

In this embodiment, a keyword representing a natural environment in the current educational voice is used as a target keyword, and an adjustment instruction corresponding to the target keyword is acquired, and the current environment is correspondingly adjusted according to the adjustment instruction to create a key with the target. The scene corresponding to the word, so that the infant can understand and learn the knowledge information contained in the current educational voice from different senses such as smell and touch, and more effectively develop the intelligence of the infant and the child, and improve the learning interest of the infant.

Referring to FIG. 4, FIG. 4 is a schematic flowchart diagram of a third embodiment of a voice-activated education method according to the present invention. Based on the embodiment shown in FIG. 2 or FIG. 3, a third embodiment of the voice-activated education method of the present invention is proposed.

In this embodiment, in order to allow the infant to better feel the story scene and the plot represented by the learning content during the learning process, before step S10, the method further includes:

In step S01, when the education mode selected by the user is the voice play mode, the educational voice to be played selected by the user is acquired, and the educational voice to be played is played.

It should be noted that the education mode is used to obtain an education mode determined by the user according to his/her needs. For example, if the user needs to play a voice through the mobile terminal, the voice play mode may be selected; if the education voice needs to be read aloud, the individual may read aloud. mode. The setting and selection of the specific mode may be determined according to actual conditions, and this embodiment does not limit this.

It can be understood that, when the user selects to perform the teaching in the voice play mode, the user may first send the pre-selected educational audio and video file to be played including the educational voice to the mobile terminal, and the educational voice to be played may be pre-recorded by the user. Educational voice. In real life, each child is often most familiar with his parents' voices. When parents teach stories to them, learning and thinking are relatively active, which is more conducive to intellectual development. Therefore, when parents go out during the day, The above effects can also be achieved by playing pre-recorded audio and video files for children at home.

In this embodiment, by setting different education modes for the user to select according to the actual situation, when the user selects the voice play mode, the mobile terminal selects the educational voice to be played by the user, which satisfies the different types of users and is more effective. Achieve the purpose of synchronous teaching.

Referring to FIG. 5, FIG. 5 is a schematic flowchart diagram of a fourth embodiment of a voice-activated education method according to the present invention. Based on the embodiment shown in FIG. 2 or FIG. 3 above, a fourth embodiment of the voice-activated education method of the present invention is proposed.

In the embodiment, before the step S10, the method further includes:

Step S02: When the education mode selected by the user is the personal reading mode, the step of detecting the voice in the current environment to obtain the current educational voice is performed.

It should be noted that when the user wishes to tell or read the educational content in person, the personal reading mode can be selected for teaching.

In a specific implementation, when the user selects the personal reading mode for teaching, the mobile terminal detects the voice in the current environment, and when the educational voice is detected, starts to perform keyword recognition on the current educational voice, and performs corresponding subsequent steps. For example, when mom told her daughter a story, she read the following sentence slowly: "The winter night comes earlier, the sky is very dark, and the cold north wind blows." When the mobile terminal detects the voice After the education of the voice, the keyword recognition of the current educational voice is immediately performed. After the target keyword "night" is recognized, the brightness of the room is slowly lowered synchronously; after the target keyword "cold" is recognized, the synchronization is slow. Slowly lower the temperature of the room; after identifying the target keyword "North Wind", the effect of blowing is simultaneously produced.

In this embodiment, by setting a personal reading mode, when the user chooses to personally teach the infant, the audience can fully mobilize the senses of the infant in the learning process by reading the educational content, which is a good way to deepen the infant's educational content. The memory has been very active and effective in the development of infant intelligence, which has increased the interest of infants and young children.

In addition, the present invention also provides a voice-activated education system, the education system comprising: a mobile terminal, a playback device, and an adjustment device as shown in FIG. 1; wherein the playback device is configured to be under the control of the processor The audio and video files are played; the adjustment device is configured to adjust the current environment under the control of the processor.

In addition, the present invention further provides a storage medium, wherein the storage medium stores a voice-activated educational program, and when the voice-activated educational program is executed by the processor, the following operations are implemented:

Determining a category of the target keyword;

Further, when the voice-activated educational program is executed by the processor, the following operations are further performed: when the category to which the target keyword belongs is the second category, acquiring an adjustment instruction corresponding to the target keyword, and according to the adjustment instruction The current environment is adjusted synchronously such that the current environment corresponds to the target keyword.

Further, when the voice-activated educational program is executed by the processor, the following operations are also performed: when the educational mode selected by the user is the voice playing mode, the educational voice to be played selected by the user is acquired, and the educational voice to be played is played.

Further, when the voice-activated educational program is executed by the processor, the following operations are further performed: when the education mode selected by the user is the personal reading mode, the detecting the voice in the current environment is performed to obtain the operation of the current educational voice.

Further, when the voice-activated educational program is executed by the processor, the following operations are further performed: receiving a play control command sent by the user, and performing a corresponding operation according to the play control command, where the play control command includes: a volume adjustment command, and a sound effect adjustment At least one of an instruction or a speech rate adjustment instruction.

Further, when the voice-activated educational program is executed by the processor, the following operations are further performed: receiving interaction information sent by the user, and searching for corresponding user preference information according to the interaction information, where the user preference information includes: a favorite sound effect, a favorite educational content, or At least one of the favorite scenes; adjusting the current environment according to the interaction information and the found user preference information.

The beneficial effects of the embodiment are: obtaining the current educational voice by detecting the voice in the current environment; performing keyword recognition on the current educational voice, obtaining a target keyword in the current educational voice; determining the target The belonging category of the keyword; when the belonging category of the target keyword is the first category, the audio file corresponding to the target keyword is searched, and the found audio file is played. Therefore, the educational voice in the current environment is presented in a three-dimensional manner, so that the infant can more intuitively feel the sound scene corresponding to the specific things in the educational voice content, thereby effectively developing the intelligence of the infant and the child, and improving the interest in learning.

It is to be understood that the term "comprises", "comprising", or any other variants thereof, is intended to encompass a non-exclusive inclusion, such that a process, method, article, or It also includes other elements that are not explicitly listed, or elements that are inherent to such a process, method, item, or system. An element defined by the phrase "comprising a ..." does not exclude the presence of additional equivalent elements in a process, method, article, or system that includes the element, without further limitation.

The serial numbers of the embodiments of the present invention are merely for the description, and do not represent the advantages and disadvantages of the embodiments.

Those skilled in the art can clearly understand the above by the description of the above embodiments. The embodiment method can be implemented by means of software plus a necessary general hardware platform, and of course hardware, but in many cases the former is a better implementation. Based on such understanding, the present invention The technical solution in essence or the contribution to the prior art can be embodied in the form of a software product stored in a storage medium (such as ROM/RAM, disk, light). The disc includes a number of instructions for causing a terminal device (which may be a cell phone, a computer, a server, an air conditioner, or a network device, etc.) to perform the methods described in various embodiments of the present invention.

The above are only the preferred embodiments of the present invention, and are not intended to limit the scope of the invention, and the equivalent structure or equivalent process transformations made by the description of the present invention and the drawings are directly or indirectly applied to other related technical fields. The same is included in the scope of patent protection of the present invention.

Claims

A voice-activated education method, characterized in that the method comprises the following steps:

Detecting the voice in the current environment and obtaining the current educational voice;

Performing keyword recognition on the current educational voice to obtain a target keyword in the current educational voice;

Determining a category of the target keyword;

When the belonging category of the target keyword is the first category, the audio file corresponding to the target keyword is searched, and the found audio file is played.
The method of claim 1, wherein after the determining the category of the target keyword, the method further comprises:

When the belonging category of the target keyword is the second category, acquiring an adjustment instruction corresponding to the target keyword, and synchronously adjusting the current environment according to the adjustment instruction, so that the current environment and the target keyword are Corresponding.
The method of claim 2, wherein the adjustment command comprises at least one of a temperature adjustment command, a humidity adjustment command, a brightness adjustment command, or an odor adjustment command.
The method of claim 3, wherein the method further comprises: before detecting the voice in the current environment to obtain the current educational voice, the method further comprising:

When the education mode selected by the user is the voice play mode, the educational voice to be played selected by the user is acquired, and the educational voice to be played is played.
The method of claim 4, wherein the method further comprises: before detecting the voice in the current environment to obtain the current educational voice, the method further comprising:

When the education mode selected by the user is the personal reading mode, the step of detecting the voice in the current environment to obtain the current educational voice is performed.
The method of claim 5, wherein the method further comprises:

Receiving a play control command sent by the user, and performing a corresponding operation according to the play control command, the play control command comprising: at least one of a volume adjustment instruction, a sound effect adjustment instruction, or a speech rate adjustment instruction.
The method of claim 6 wherein the method further comprises:

Receiving the interaction information sent by the user, and searching for corresponding user preference information according to the interaction information, where the user preference information includes: at least one of a favorite sound effect, a favorite educational content, or a favorite scene;

The current environment is adjusted according to the interaction information and the found user preference information.
A mobile terminal, comprising: a memory, a processor, and a voice-activated educational program stored on the memory and operable on the processor, wherein the voice-activated educational program is configured to implement The steps of the voice-activated educational method of claim 1.
A mobile terminal, comprising: a memory, a processor, and a voice-activated educational program stored on the memory and operable on the processor, wherein the voice-activated educational program is configured to implement The steps of the voice-activated educational method of claim 2.
A mobile terminal, comprising: a memory, a processor, and a voice-activated educational program stored on the memory and operable on the processor, wherein the voice-activated educational program is configured to implement The steps of the voice-activated educational method of claim 4.
A mobile terminal, comprising: a memory, a processor, and a voice-activated educational program stored on the memory and operable on the processor, wherein the voice-activated educational program is configured to implement The steps of the voice-activated educational method of claim 5.
A mobile terminal, comprising: a memory, a processor, and a voice-activated educational program stored on the memory and operable on the processor, wherein the voice-activated educational program is configured to implement The steps of the voice-activated educational method of claim 6.
A mobile terminal, comprising: a memory, a processor, and a voice-activated educational program stored on the memory and operable on the processor, wherein the voice-activated educational program is configured to implement The steps of the voice-activated educational method of claim 7.
A voice-activated education system, characterized in that the education system comprises: the mobile terminal, the playback device and the adjustment device of claim 8; wherein the playback device is configured to control audio and video under the control of the processor The file is played; the adjustment device is configured to adjust the current environment under the control of the processor.
A storage medium, characterized in that the storage medium stores a voice-activated educational program, and the voice-activated educational program is executed by a processor to implement the steps of the voice-activated educational method according to claim 1.
A storage medium, characterized in that the storage medium stores a voice-activated educational program, and the voice-activated educational program is executed by a processor to implement the steps of the voice-activated educational method according to claim 2.
A storage medium, characterized in that the storage medium stores a voice-activated educational program, and the voice-activated educational program is executed by a processor to implement the steps of the voice-activated educational method according to claim 4.
A storage medium, characterized in that the storage medium stores a voice-activated educational program, and the voice-activated educational program is executed by a processor to implement the steps of the voice-activated educational method according to claim 5.
A storage medium, characterized in that the storage medium stores a voice-activated educational program, and the voice-activated educational program is executed by a processor to implement the steps of the voice-activated educational method according to claim 6.
A storage medium, characterized in that the storage medium stores a voice-activated educational program, and the voice-activated educational program is executed by a processor to implement the steps of the voice-activated educational method according to claim 7.