[go: up one dir, main page]

WO2019006792A1 - Voice-controlled education method and system, mobile terminal, and storage medium - Google Patents

Voice-controlled education method and system, mobile terminal, and storage medium Download PDF

Info

Publication number
WO2019006792A1
WO2019006792A1 PCT/CN2017/094486 CN2017094486W WO2019006792A1 WO 2019006792 A1 WO2019006792 A1 WO 2019006792A1 CN 2017094486 W CN2017094486 W CN 2017094486W WO 2019006792 A1 WO2019006792 A1 WO 2019006792A1
Authority
WO
WIPO (PCT)
Prior art keywords
voice
educational
activated
processor
target keyword
Prior art date
Application number
PCT/CN2017/094486
Other languages
French (fr)
Chinese (zh)
Inventor
袁晖
李凝华
Original Assignee
深圳市科迈爱康科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳市科迈爱康科技有限公司 filed Critical 深圳市科迈爱康科技有限公司
Publication of WO2019006792A1 publication Critical patent/WO2019006792A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/63Querying
    • G06F16/638Presentation of query results
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/683Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/685Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using automatically derived transcript of audio data, e.g. lyrics
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/20Education
    • G06Q50/205Education administration or guidance
    • GPHYSICS
    • G09EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09BEDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B5/00Electrically-operated educational appliances
    • G09B5/04Electrically-operated educational appliances with audible presentation of the material to be studied
    • GPHYSICS
    • G09EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09BEDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B5/00Electrically-operated educational appliances
    • G09B5/06Electrically-operated educational appliances with both visual and audible presentation of the material to be studied
    • G09B5/065Combinations of audio and video presentations, e.g. videotapes, videodiscs, television systems

Definitions

  • the present invention relates to the field of education, and in particular, to a voice-activated educational method, a mobile terminal, a system, and a storage medium.
  • Infants and toddlers watch TV differently from adults watching TV.
  • Adults watch TV and can understand the pictures before and after and the logical relationship between the pictures.
  • Each picture constitutes a logical story.
  • Infants under the age of two are completely different. The baby has just been born without the same thinking as an adult. There are only a few unconditional launches, such as foraging reflections, sucking reflections, and gripping reflections.
  • the development of the baby's thinking in this age group is called the “sensory movement period”.
  • infants and young children have mainly learned and recognized the world through their senses of hearing, sight, touch, and hands.
  • the thinking at this stage is intuitive action thinking. That is to say, infants and young children mainly carry out specific and direct thinking in perceptive actions.
  • the main object of the present invention is to provide a voice-activated educational method, a mobile terminal, a system, and a storage medium, which aim to solve the problem that the prior art cannot allow a baby to hear a stereoscopic sound, and cannot effectively develop the intelligence of an infant.
  • the present invention provides a voice-activated educational method, the method comprising the steps of:
  • the audio file corresponding to the target keyword is searched, and the found audio file is played.
  • the method further includes:
  • the category to which the target keyword belongs is the second category, obtaining an adjustment instruction corresponding to the target keyword, and synchronously adjusting the current environment according to the adjustment instruction, so that the current environment and the target keyword are Corresponding.
  • the adjustment instruction comprises at least one of a temperature adjustment instruction, a humidity adjustment instruction, a brightness adjustment instruction or an odor adjustment instruction.
  • the method further includes: when the education mode selected by the user is the voice playing mode, acquiring the educational voice to be played selected by the user, and It is said that the educational voice is played for playback.
  • the method further includes: when the education mode selected by the user is the personal reading mode, performing the detecting the voice in the current environment, Get the steps to the current educational voice.
  • the method further includes: receiving a play control instruction sent by the user, and performing a corresponding operation according to the play control instruction, where the play control command includes: a volume adjustment instruction, a sound effect adjustment instruction, or a speech speed adjustment instruction at least one.
  • the method further includes: receiving interaction information sent by the user, and searching for corresponding user preference information according to the interaction information, where the user preference information includes: at least one of a favorite sound effect, a favorite educational content, or a favorite scene; The interaction information and the found user preference information adjust the current environment.
  • the present invention further provides a mobile terminal, the mobile terminal comprising: a memory, a processor, and a voice-activated educational program stored on the memory and operable on the processor,
  • the voice-activated educational program is configured to implement the steps of the voice-activated educational method as described above.
  • the present invention further provides a voice-activated education system, the education system comprising: the mobile terminal, the playback device, and the adjustment device described above; wherein the playback device is configured to be in the process The audio and video files are played under control; the adjustment device is configured to adjust the current environment under the control of the processor.
  • the present invention also provides a storage medium on which a voice-activated educational program is stored, and when the voice-activated educational program is executed by a processor, the voice-activated educational method as described above is implemented. step.
  • the present invention obtains a current educational voice by detecting a voice in a current environment; performing keyword recognition on the current educational voice to obtain a target keyword in the current educational voice; and determining a category of the target keyword;
  • the belonging category of the target keyword is the first category
  • the audio file corresponding to the target keyword is searched, and the found audio file is played, so that the educational voice in the current environment is stereoscopically presented.
  • Infants and young children can more intuitively feel the sound scene corresponding to the specific things in the educational voice content, and then effectively develop the intelligence of infants and children and improve their interest in learning.
  • FIG. 1 is a schematic structural diagram of a mobile terminal in a hardware operating environment according to an embodiment of the present invention
  • FIG. 2 is a schematic flow chart of a first embodiment of a voice-activated education method according to the present invention
  • FIG. 3 is a schematic flow chart of a second embodiment of a voice-activated education method according to the present invention.
  • FIG. 4 is a schematic flow chart of a third embodiment of a voice-activated education method according to the present invention.
  • FIG. 5 is a schematic flow chart of a fourth embodiment of a voice-activated education method according to the present invention.
  • FIG. 1 is a schematic structural diagram of a mobile terminal in a hardware operating environment according to an embodiment of the present invention.
  • the mobile terminal may include a processor 1001, such as a CPU, a communication bus 1002, a user interface 1003, a network interface 1004, a memory 1005, and a sound collector 1006.
  • the communication bus 1002 is used to implement connection communication between these components.
  • the user interface 1003 can include a display, an input unit such as a keyboard, and the optional user interface 1003 can also include a standard wired interface, a wireless interface.
  • the network interface 1004 can optionally include a standard wired interface, a wireless interface (such as a WI-FI interface).
  • the memory 1005 may be a high speed RAM memory or a stable memory (non-volatile) Memory), such as disk storage.
  • the memory 1005 can also optionally be a storage device independent of the aforementioned processor 1001.
  • the mobile terminal structure shown in FIG. 1 does not constitute a limitation of the mobile terminal, and may include more or less components than those illustrated, or combine some components, or different component arrangements.
  • the memory 1005 as a storage medium may include an operating system, a data storage module, a network communication module, a user interface module, and a voice-activated educational program.
  • the mobile terminal may be a mobile terminal that can implement voice collection or detection and program running, for example, a smart phone, a tablet computer or a notebook computer, etc., which is not limited in this embodiment.
  • the network interface 1004 is mainly used for data communication with the background server;
  • the sound collector 1006 is configured to collect or detect the current voice;
  • the user interface 1003 is mainly used for data interaction with the user;
  • the processor 1001 and the memory 1005 in the mobile terminal may be disposed in the mobile terminal, and the mobile terminal invokes the voice-activated educational program stored in the memory 1005 through the processor 1001, and performs the following operations:
  • the audio file corresponding to the target keyword is searched, and the found audio file is played.
  • processor 1001 can call the voice-activated educational program stored in the memory 1005, and also performs the following operations:
  • the category to which the target keyword belongs is the second category, obtaining an adjustment instruction corresponding to the target keyword, and synchronously adjusting the current environment according to the adjustment instruction, so that the current environment and the target keyword are Corresponding.
  • processor 1001 can call the voice-activated educational program stored in the memory 1005, and also performs the following operations:
  • the education mode selected by the user is the voice play mode
  • the educational voice to be played selected by the user is acquired, and the educational voice to be played is played.
  • processor 1001 can call the voice-activated educational program stored in the memory 1005, and also performs the following operations:
  • the detecting the voice in the current environment is performed to obtain the operation of the current educational voice.
  • processor 1001 can call the voice-activated educational program stored in the memory 1005, and also performs the following operations:
  • the play control command comprising: at least one of a volume adjustment instruction, a sound effect adjustment instruction, or a speech rate adjustment instruction.
  • processor 1001 can call the voice-activated educational program stored in the memory 1005, and also performs the following operations:
  • the user preference information includes: at least one of a favorite sound effect, a favorite educational content, or a favorite scene; according to the interaction information and the found user Preferences, adjust the current environment.
  • the beneficial effects of the embodiment are: obtaining the current educational voice by detecting the voice in the current environment; performing keyword recognition on the current educational voice, obtaining a target keyword in the current educational voice; determining the target The belonging category of the keyword; when the belonging category of the target keyword is the first category, the audio file corresponding to the target keyword is searched, and the found audio file is played. Therefore, the educational voice in the current environment can be stereoscopically presented, so that the infant can more intuitively feel the sound scene corresponding to the specific things in the educational voice content, thereby effectively developing the intelligence of the infant and the child, and improving the interest in learning.
  • FIG. 2 is a schematic flow chart of a first embodiment of a voice-activated education method according to the present invention.
  • the method includes the following steps:
  • the execution subject of the method in this embodiment is a mobile terminal
  • the mobile terminal may be a mobile terminal capable of implementing voice collection or detection and program running, for example, a smart phone, a tablet computer or a notebook computer, etc. This example does not limit this.
  • the current environment may be a place where the voice-activated education can be implemented, for example, a children's room, a kindergarten classroom, and the like, which is not limited in this embodiment.
  • the detecting the voice in the current environment to obtain the current educational voice mainly detecting and judging the voice in the collected current environment, and removing the noise voice that is not the educational voice, for example, in the current environment.
  • the speech rate is gentle, the rhythm is strong, the volume is moderate, the voice is rich in magnetic voice or sentence coherence, the words are clear, the recognition is high, and the speech with long duration is judged as educational voice.
  • the specific judgment rules can be set according to the actual situation. This embodiment does not limit this.
  • the keywords of the current educational speech need to be keywords. Identification, extracting the words that need to be presented as the target keyword.
  • determining the category of the target keyword may be determining a category to which the target keyword belongs according to a preset keyword classification table, for example, presetting a keyword table, where the keyword classification table includes Various different categories of keywords, for example, the first category keywords "whistle”, “water flow”, “bird call”, etc. representing the sound category, the vocabulary specifically included in the preset keyword classification table and the category to which the vocabulary belongs It can be set according to actual conditions, and this embodiment does not limit this.
  • the target keyword category is determined as the corresponding preset keyword.
  • the category for example, the current target keyword is “Bird Call”, and the preset keyword classification table is used to find whether there is a preset keyword corresponding to “Bird Call”. If it exists, and the category is the first category, then The category to which the currently acquired target keyword "Bird Call” belongs is determined as the first category.
  • the corresponding preset keyword may be a keyword that is similar to or the same as the target keyword, such as “bird call”, “bird song”, etc., and the specific corresponding rule may be set by itself, this embodiment There is no restriction on this.
  • the keyword category representing the voice is preset to the first category.
  • the division of the specific category of each keyword may be set according to actual conditions, which is not limited in this embodiment. .
  • a mapping relationship between the preset keyword and the audio file corresponding to the preset keyword may be established in advance, so that when the category of the target keyword is confirmed, the search may be performed by And the preset keyword corresponding to the target keyword, and then immediately obtaining an audio file corresponding to the preset keyword according to the mapping relationship, and playing the audio file to implement synchronous teaching.
  • the embodiment first detects and determines the voice in the current environment according to the preset judgment condition, removes the non-educational voice in the current environment, and then obtains the current educational voice, and then performs the foregoing according to the preset keyword table.
  • the current educational voice performs keyword recognition, obtains the target keyword in the current educational voice and the category to which the target keyword belongs, and after determining the category to which the target keyword belongs, acquires the corresponding audio file according to the mapping relationship, and plays the audio.
  • the file stereoscopically plays the live sound corresponding to the target keyword in the current educational voice.
  • the teacher read the following sentence in a slow speech: "In the early morning canyon trail, the green trees obscured the bright sunshine, the breeze blew, the cool air was refreshing, People feel that everything is so quiet and beautiful.
  • the text read by the teacher is an educational voice
  • the target keyword "bird call” is obtained, immediately obtain and The audio file corresponding to the target keyword "Bird Call” is played and the audio file is played.
  • the kindergarten students who are listening carefully to the teacher will hear the teacher hear the "bird call” and hear the sound simultaneously.
  • the bird screams, vividly combining the text message “Bird Call” received by the brain with the sound of the bird heard.
  • the voice-activated education method provided in this embodiment obtains a current educational voice by detecting a voice in a current environment; performing keyword recognition on the current educational voice to obtain a target keyword in the current educational voice;
  • the category of the target keyword is the first category
  • the audio file corresponding to the target keyword is searched for, and the found audio file is played. Therefore, the educational voice in the current environment can be stereoscopically presented, so that the infant can more intuitively feel the sound scene corresponding to the specific things in the educational voice content, thereby effectively developing the intelligence of the infant and the child, and improving the interest in learning.
  • the method further includes:
  • the play control command comprising: at least one of a volume adjustment instruction, a sound effect adjustment instruction, or a speech rate adjustment instruction.
  • the user may be a teaching user who educates the infant, such as a parent or a teacher, or an educated user, such as an infant or a child.
  • the user may adjust the playing sound effect or the volume level according to his own needs or preferences, so the mobile terminal receives the playback control command sent by the user. Immediately perform the corresponding operation to improve the user experience. For example, during the teaching process, the user feels that the playing sound is too large, and the volume needs to be lowered. After receiving the volume down command sent by the user, the mobile terminal reduces the playing volume according to the instruction. User target volume.
  • the method further includes:
  • the user preference information includes: at least one of a favorite sound effect, a favorite educational content, or a favorite scene; according to the interaction information and the found user Preferences, adjust the current environment.
  • the interaction information may be information that is sent by the user when interacting with the mobile terminal, for example, the communication voice sent by the infant when interacting with the mobile terminal, or the infant to the current environment.
  • the control voice of the current environment is adjusted, or the multimedia file transmitted by the educated user through the personal device, and the specific category of the interactive information may be set according to actual needs, and the embodiment does not limit this. .
  • the mobile terminal may pre-establish a personalized account corresponding to the educated user, and the personalized account may include user-friendly sound type information, such as “soft type”; Educational content category information, such as "Zhang Ailing's prose", “Zheng Yuanjie's fairy tale”; favorite scenes, etc., at the same time, the mobile terminal can use the voice features of the educated user or the personal device with the fixed logo used by the user
  • the personalized account is associated, that is, when the educated user sends the interactive voice or the interactive information, the mobile terminal can find the corresponding personalized account according to the interactive voice or the interaction information, obtain the user preference information, and combine the current interaction information, The environment is adjusted.
  • the mobile terminal can also record and analyze the interaction information sent by the educated user in the process of interacting with the user, and update and store the pre-stored user preferences according to the analysis result, for example, with age.
  • the mobile terminal found that the educated users like to listen to Zhang Ailing's essays, and adjust the user's favorite educational content accordingly, and preferentially promote Zhang Ailing's essays as educational content to educated users.
  • the user's interest or preference is recorded and stored, and the interest or preference is updated according to the preference of the user in different time periods, so that the educated user is growing continuously.
  • Learning knowledge in a pleasant and comfortable educational environment effectively develops the intelligence of educated people and increases their interest in learning.
  • FIG. 3 is a schematic flowchart diagram of a second embodiment of a voice-activated education method according to the present invention.
  • the infant can personally feel the live sound corresponding to the target keyword representing the sound, and can also use the keyword representing the natural environment in the current educational voice as the target keyword. And adjusting the current environment to create a scene corresponding to the keyword representing the natural environment, thereby allowing the infant to understand and learn the knowledge information contained in the current educational voice from the sense of smell and touch.
  • the method further includes:
  • Step S50 when the category to which the target keyword belongs is the second category, acquiring an adjustment instruction corresponding to the target keyword, and synchronously adjusting the current environment according to the adjustment instruction, so that the current environment and the The target keyword corresponds.
  • the keyword category representing the natural environment may be preset to the second category, for example, the words “cold” and “hot” representing temperature, and the word “fragrance” representing odor, “Fragrance”, the category of keywords such as “Breeze”, “Daylight” and “Dark” that represent natural phenomena is preset to the second category.
  • the division of the specific categories of the keywords may be set according to actual conditions, and this embodiment does not limit this.
  • the adjustment instruction includes at least one of a temperature adjustment instruction, a humidity adjustment instruction, a brightness adjustment instruction, or an odor adjustment instruction.
  • the temperature adjustment command is used to adjust the ambient temperature of the current environment
  • the humidity adjustment information is used to adjust the ambient humidity of the current environment
  • the brightness adjustment information is used to adjust the brightness of the current environment
  • the odor adjustment command is used to adjust the odor of the current environment.
  • the mobile terminal when the acquired target keyword belongs to the second category, the mobile terminal immediately acquires the preset keyword corresponding to the target keyword, and obtains a corresponding adjustment instruction according to the preset keyword, and then according to the preset
  • the adjustment instruction synchronously adjusts the current environment, so that the current environment corresponds to the target keyword, for example, immediately after the current educational voice refers to the target keyword “floral”, the scent adjustment instruction corresponding to “floral” is obtained immediately, And according to the odor adjustment instruction, the odor regulating device is controlled to emit a faint floral fragrance, so that the infant can truly feel the scene corresponding to the target keyword “flower”.
  • a keyword representing a natural environment in the current educational voice is used as a target keyword, and an adjustment instruction corresponding to the target keyword is acquired, and the current environment is correspondingly adjusted according to the adjustment instruction to create a key with the target.
  • the scene corresponding to the word so that the infant can understand and learn the knowledge information contained in the current educational voice from different senses such as smell and touch, and more effectively develop the intelligence of the infant and the child, and improve the learning interest of the infant.
  • FIG. 4 is a schematic flowchart diagram of a third embodiment of a voice-activated education method according to the present invention. Based on the embodiment shown in FIG. 2 or FIG. 3, a third embodiment of the voice-activated education method of the present invention is proposed.
  • the method further includes:
  • step S01 when the education mode selected by the user is the voice play mode, the educational voice to be played selected by the user is acquired, and the educational voice to be played is played.
  • the education mode is used to obtain an education mode determined by the user according to his/her needs. For example, if the user needs to play a voice through the mobile terminal, the voice play mode may be selected; if the education voice needs to be read aloud, the individual may read aloud. mode.
  • the setting and selection of the specific mode may be determined according to actual conditions, and this embodiment does not limit this.
  • the user may first send the pre-selected educational audio and video file to be played including the educational voice to the mobile terminal, and the educational voice to be played may be pre-recorded by the user.
  • Educational voice In real life, each child is often most familiar with his parents' voices. When parents teach stories to them, learning and thinking are relatively active, which is more conducive to intellectual development. Therefore, when parents go out during the day, The above effects can also be achieved by playing pre-recorded audio and video files for children at home.
  • the mobile terminal selects the educational voice to be played by the user, which satisfies the different types of users and is more effective. Achieve the purpose of synchronous teaching.
  • FIG. 5 is a schematic flowchart diagram of a fourth embodiment of a voice-activated education method according to the present invention. Based on the embodiment shown in FIG. 2 or FIG. 3 above, a fourth embodiment of the voice-activated education method of the present invention is proposed.
  • the method before the step S10, the method further includes:
  • Step S02 When the education mode selected by the user is the personal reading mode, the step of detecting the voice in the current environment to obtain the current educational voice is performed.
  • the personal reading mode can be selected for teaching.
  • the mobile terminal when the user selects the personal reading mode for teaching, the mobile terminal detects the voice in the current environment, and when the educational voice is detected, starts to perform keyword recognition on the current educational voice, and performs corresponding subsequent steps. For example, when mom told her daughter a story, she read the following sentence slowly: "The winter night comes earlier, the sky is very dark, and the cold north wind blows.”
  • the mobile terminal detects the voice After the education of the voice, the keyword recognition of the current educational voice is immediately performed. After the target keyword "night” is recognized, the brightness of the room is slowly lowered synchronously; after the target keyword "cold” is recognized, the synchronization is slow. Slowly lower the temperature of the room; after identifying the target keyword "North Wind", the effect of blowing is simultaneously produced.
  • the audience can fully mobilize the senses of the infant in the learning process by reading the educational content, which is a good way to deepen the infant's educational content.
  • the memory has been very active and effective in the development of infant intelligence, which has increased the interest of infants and young children.
  • the present invention also provides a voice-activated education system, the education system comprising: a mobile terminal, a playback device, and an adjustment device as shown in FIG. 1; wherein the playback device is configured to be under the control of the processor The audio and video files are played; the adjustment device is configured to adjust the current environment under the control of the processor.
  • the present invention further provides a storage medium, wherein the storage medium stores a voice-activated educational program, and when the voice-activated educational program is executed by the processor, the following operations are implemented:
  • the audio file corresponding to the target keyword is searched, and the found audio file is played.
  • the voice-activated educational program when executed by the processor, the following operations are further performed: when the category to which the target keyword belongs is the second category, acquiring an adjustment instruction corresponding to the target keyword, and according to the adjustment instruction The current environment is adjusted synchronously such that the current environment corresponds to the target keyword.
  • the voice-activated educational program when executed by the processor, the following operations are also performed: when the educational mode selected by the user is the voice playing mode, the educational voice to be played selected by the user is acquired, and the educational voice to be played is played.
  • the voice-activated educational program when executed by the processor, the following operations are further performed: when the education mode selected by the user is the personal reading mode, the detecting the voice in the current environment is performed to obtain the operation of the current educational voice.
  • the voice-activated educational program when executed by the processor, the following operations are further performed: receiving a play control command sent by the user, and performing a corresponding operation according to the play control command, where the play control command includes: a volume adjustment command, and a sound effect adjustment At least one of an instruction or a speech rate adjustment instruction.
  • the voice-activated educational program when executed by the processor, the following operations are further performed: receiving interaction information sent by the user, and searching for corresponding user preference information according to the interaction information, where the user preference information includes: a favorite sound effect, a favorite educational content, or At least one of the favorite scenes; adjusting the current environment according to the interaction information and the found user preference information.
  • the beneficial effects of the embodiment are: obtaining the current educational voice by detecting the voice in the current environment; performing keyword recognition on the current educational voice, obtaining a target keyword in the current educational voice; determining the target The belonging category of the keyword; when the belonging category of the target keyword is the first category, the audio file corresponding to the target keyword is searched, and the found audio file is played. Therefore, the educational voice in the current environment is presented in a three-dimensional manner, so that the infant can more intuitively feel the sound scene corresponding to the specific things in the educational voice content, thereby effectively developing the intelligence of the infant and the child, and improving the interest in learning.
  • the embodiment method can be implemented by means of software plus a necessary general hardware platform, and of course hardware, but in many cases the former is a better implementation.
  • the present invention The technical solution in essence or the contribution to the prior art can be embodied in the form of a software product stored in a storage medium (such as ROM/RAM, disk, light).
  • the disc includes a number of instructions for causing a terminal device (which may be a cell phone, a computer, a server, an air conditioner, or a network device, etc.) to perform the methods described in various embodiments of the present invention.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Educational Administration (AREA)
  • Educational Technology (AREA)
  • Databases & Information Systems (AREA)
  • Tourism & Hospitality (AREA)
  • Library & Information Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Strategic Management (AREA)
  • Data Mining & Analysis (AREA)
  • Human Resources & Organizations (AREA)
  • General Business, Economics & Management (AREA)
  • Primary Health Care (AREA)
  • Marketing (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Economics (AREA)
  • Electrically Operated Instructional Devices (AREA)

Abstract

A voice-controlled education method and system, a mobile terminal, and a storage medium. The method comprises: detecting voice in a current environment to obtain current education voice (S10); performing keyword recognition on the current education voice to obtain a target keyword in the current education voice (S20); determining the category of the target keyword (S30); and searching for, when the category of the target keyword is a first category, an audio file corresponding to the target keyword, and playing the found audio file (S40). Whereby, an infant or young child can more intuitively feel a sound scene corresponding to the specific object in education voice content, so that the intelligence of the infant or young child is developed, and the interest in learning is improved.

Description

一种声控式教育方法、移动终端、系统及存储介质  Voice-activated education method, mobile terminal, system and storage medium
技术领域Technical field
本发明涉及教育领域,尤其涉及一种声控式教育方法、移动终端、系统及存储介质。The present invention relates to the field of education, and in particular, to a voice-activated educational method, a mobile terminal, a system, and a storage medium.
背景技术Background technique
婴幼儿及儿童的教育和智力发展是成长过程中非常重要的环节,也是家长非常关注的部分,据大量研究表明,过早地让婴幼儿看电视可能会有碍婴幼儿的智力发展。因为,过早过多地看电视,是越来越多的孩子注意力不能长时间集中的重要原因之一。The education and intellectual development of infants and children is a very important part of the growth process, and it is also a part of parents' concern. According to a large number of studies, prematurely letting infants and children watch TV may hinder the intellectual development of infants and young children. Because watching TV too early is one of the important reasons why more and more children can't concentrate for a long time.
婴幼儿看电视和成人看电视不同。成人看电视,能够理解前后的画面以及画面之间的逻辑关系,一个个画面组成了一个有逻辑的故事。两岁以下的婴幼儿则完全不同。婴儿刚生下来没有像成人一样的思维,只有一些先天的无条件发射,比如觅食反射、吸吮反射、抓握反射等。婴儿在这个年龄段的思维发展过程被称之为“感觉运动时期”。在这两年里,婴幼儿主要是通过听觉、视觉、触觉等感觉和手的动作来学习和认识这个世界。这个阶段的思维是直觉行动思维。也就是说,婴幼儿主要在感知行动中进行具体而直接的思维。所以,两岁以下的婴幼儿看电视,就是一个个快速闪光的画面,他们不能把这些画面组织成一个连续而有意义的故事。由于两岁以下的婴幼儿的记忆力、理解力,都不足以让他们记住并判断前后画面的关系,因此让他们过早过多地看电视,对他们来说,接受的只是杂乱无章的信息。Infants and toddlers watch TV differently from adults watching TV. Adults watch TV and can understand the pictures before and after and the logical relationship between the pictures. Each picture constitutes a logical story. Infants under the age of two are completely different. The baby has just been born without the same thinking as an adult. There are only a few unconditional launches, such as foraging reflections, sucking reflections, and gripping reflections. The development of the baby's thinking in this age group is called the “sensory movement period”. In the past two years, infants and young children have mainly learned and recognized the world through their senses of hearing, sight, touch, and hands. The thinking at this stage is intuitive action thinking. That is to say, infants and young children mainly carry out specific and direct thinking in perceptive actions. Therefore, watching TV for infants under the age of two is a fast-flashing picture. They cannot organize these pictures into a continuous and meaningful story. Because the memory and understanding of infants under the age of two are not enough for them to remember and judge the relationship between the pictures before and after, so let them watch TV too much too early, for them, the only information that is accepted is chaotic.
现有的教育方法或教育系统以及面向婴幼儿智力开发的教育材料,大都通过让婴幼儿听成人朗读或播放视频来对婴幼儿进行教育,但成人朗读或视频播放都只是平面的,不能让婴幼儿听到立体化的声音,无法有效地开发婴幼儿的智力。 Existing educational methods or educational systems, as well as educational materials for infants and young children's intellectual development, mostly educate infants and young children by listening to adults reading or playing videos, but adult reading or video playback is only flat, not for infants. When children hear stereoscopic sounds, they cannot effectively develop the intelligence of infants and young children.
上述内容仅用于辅助理解本发明的技术方案,并不代表承认上述内容是现有技术。The above content is only used to assist in understanding the technical solutions of the present invention, and does not constitute an admission that the above is prior art.
发明内容Summary of the invention
本发明的主要目的在于提供一种声控式教育方法、移动终端、系统及存储介质,旨在解决现有技术不能让婴幼儿听到立体化的声音,无法有效地开发婴幼儿的智力的问题。The main object of the present invention is to provide a voice-activated educational method, a mobile terminal, a system, and a storage medium, which aim to solve the problem that the prior art cannot allow a baby to hear a stereoscopic sound, and cannot effectively develop the intelligence of an infant.
为实现上述目的,本发明提供一种声控式教育方法,所述方法包括以下步骤:To achieve the above object, the present invention provides a voice-activated educational method, the method comprising the steps of:
对当前环境中的语音进行检测,获得当前教育语音;Detecting the voice in the current environment and obtaining the current educational voice;
对所述当前教育语音进行关键词识别,获得所述当前教育语音中的目标关键词;Performing keyword recognition on the current educational voice to obtain a target keyword in the current educational voice;
确定所述目标关键词的所属类别;Determining a category of the target keyword;
在所述目标关键词的所属类别为第一类别时,查找与所述目标关键词对应的音频文件,并对查找到的音频文件进行播放。When the belonging category of the target keyword is the first category, the audio file corresponding to the target keyword is searched, and the found audio file is played.
优选地,所述确定所述目标关键词的所属类别之后,所述方法还包括:Preferably, after the determining the category of the target keyword, the method further includes:
在所述目标关键词所属的类别为第二类别时,获取与所述目标关键词对应的调节指令,并根据所述调节指令同步调节当前环境,以使所述当前环境与所述目标关键词相对应。When the category to which the target keyword belongs is the second category, obtaining an adjustment instruction corresponding to the target keyword, and synchronously adjusting the current environment according to the adjustment instruction, so that the current environment and the target keyword are Corresponding.
优选地,所述调节指令包括:温度调节指令、湿度调节指令、亮度调节指令或气味调节指令中的至少一个。Preferably, the adjustment instruction comprises at least one of a temperature adjustment instruction, a humidity adjustment instruction, a brightness adjustment instruction or an odor adjustment instruction.
优选地,所述对当前环境中的语音进行检测,获得当前教育语音之前,所述方法还包括:在用户选择的教育模式为语音播放模式时,获取用户选择的待播放教育语音,并对所述待播放教育语音进行播放。Preferably, before the detecting the voice in the current environment and obtaining the current educational voice, the method further includes: when the education mode selected by the user is the voice playing mode, acquiring the educational voice to be played selected by the user, and It is said that the educational voice is played for playback.
优选地,所述对当前环境中的语音进行检测,获得当前教育语音之前,所述方法还包括:在用户选择的教育模式为个人朗读模式时,执行所述对当前环境中的语音进行检测,获得当前教育语音的步骤。Preferably, before the detecting the voice in the current environment to obtain the current educational voice, the method further includes: when the education mode selected by the user is the personal reading mode, performing the detecting the voice in the current environment, Get the steps to the current educational voice.
优选地,所述方法还包括:接收用户发送的播放控制指令,并根据所述播放控制指令执行相应的操作,所述播放控制指令包括:音量调节指令、音效调节指令或语速调节指令中的至少一个。Preferably, the method further includes: receiving a play control instruction sent by the user, and performing a corresponding operation according to the play control instruction, where the play control command includes: a volume adjustment instruction, a sound effect adjustment instruction, or a speech speed adjustment instruction at least one.
优选地,所述方法还包括:接收用户发出的交互信息,根据所述交互信息查找对应的用户喜好信息,所述用户喜好信息包括:喜好音效、喜好教育内容或喜好场景中的至少一个;根据所述交互信息以及查找到的用户喜好信息,对当前环境进行调节。Preferably, the method further includes: receiving interaction information sent by the user, and searching for corresponding user preference information according to the interaction information, where the user preference information includes: at least one of a favorite sound effect, a favorite educational content, or a favorite scene; The interaction information and the found user preference information adjust the current environment.
此外,为实现上述目的,本发明还提出一种移动终端,所述移动终端包括:存储器、处理器及存储在所述存储器上并可在所述处理器上运行的声控式教育程序,所述声控式教育程序配置为实现如上文所述的声控式教育方法的步骤。In addition, in order to achieve the above object, the present invention further provides a mobile terminal, the mobile terminal comprising: a memory, a processor, and a voice-activated educational program stored on the memory and operable on the processor, The voice-activated educational program is configured to implement the steps of the voice-activated educational method as described above.
此外,为实现上述目的,本发明还提出一种声控式教育系统,所述教育系统包括:上文所述的移动终端、播放设备和调节设备;其中,所述播放设备配置为在所述处理器控制下对音视频文件进行播放;所述调节设备配置为在所述处理器控制下对当前环境进行调节。In addition, in order to achieve the above object, the present invention further provides a voice-activated education system, the education system comprising: the mobile terminal, the playback device, and the adjustment device described above; wherein the playback device is configured to be in the process The audio and video files are played under control; the adjustment device is configured to adjust the current environment under the control of the processor.
此外,为实现上述目的,本发明还提出一种存储介质,所述存储介质上存储有声控式教育程序,所述声控式教育程序被处理器执行时实现如上文所述的声控式教育方法的步骤。In addition, in order to achieve the above object, the present invention also provides a storage medium on which a voice-activated educational program is stored, and when the voice-activated educational program is executed by a processor, the voice-activated educational method as described above is implemented. step.
本发明通过对当前环境中的语音进行检测,获得当前教育语音;对所述当前教育语音进行关键词识别,获得所述当前教育语音中的目标关键词;确定所述目标关键词的所属类别;在所述目标关键词的所属类别为第一类别时,查找与所述目标关键词对应的音频文件,并对查找到的音频文件进行播放,从而将当前环境中的教育语音立体化的呈现出来,让婴幼儿更直观的感受到教育语音内容中具体事物所对应的声音场景,进而有效地开发婴幼儿的智力,提高学习兴趣。The present invention obtains a current educational voice by detecting a voice in a current environment; performing keyword recognition on the current educational voice to obtain a target keyword in the current educational voice; and determining a category of the target keyword; When the belonging category of the target keyword is the first category, the audio file corresponding to the target keyword is searched, and the found audio file is played, so that the educational voice in the current environment is stereoscopically presented. Infants and young children can more intuitively feel the sound scene corresponding to the specific things in the educational voice content, and then effectively develop the intelligence of infants and children and improve their interest in learning.
附图说明DRAWINGS
图1为本发明实施例方案涉及的硬件运行环境的移动终端结构示意图;1 is a schematic structural diagram of a mobile terminal in a hardware operating environment according to an embodiment of the present invention;
图2为本发明一种声控式教育方法第一实施例的流程示意图;2 is a schematic flow chart of a first embodiment of a voice-activated education method according to the present invention;
图3为本发明一种声控式教育方法第二实施例的流程示意图;3 is a schematic flow chart of a second embodiment of a voice-activated education method according to the present invention;
图4为本发明一种声控式教育方法第三实施例的流程示意图;4 is a schematic flow chart of a third embodiment of a voice-activated education method according to the present invention;
图5为本发明一种声控式教育方法第四实施例的流程示意图。FIG. 5 is a schematic flow chart of a fourth embodiment of a voice-activated education method according to the present invention.
本发明目的的实现、功能特点及优点将结合实施例,参照附图做进一步说明。The implementation, functional features, and advantages of the present invention will be further described in conjunction with the embodiments.
具体实施方式Detailed ways
应当理解,此处所描述的具体实施例仅仅用以解释本发明,并不用于限定本发明。It is understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
参照图1,图1为本发明实施例方案涉及的硬件运行环境的移动终端结构示意图。Referring to FIG. 1, FIG. 1 is a schematic structural diagram of a mobile terminal in a hardware operating environment according to an embodiment of the present invention.
如图1所示,该移动终端可以包括:处理器1001,例如CPU,通信总线1002、用户接口1003,网络接口1004,存储器1005,声音采集器1006。其中,通信总线1002用于实现这些组件之间的连接通信。用户接口1003可以包括显示屏(Display)、输入单元比如键盘(Keyboard),可选用户接口1003还可以包括标准的有线接口、无线接口。网络接口1004可选的可以包括标准的有线接口、无线接口(如WI-FI接口)。存储器1005可以是高速RAM存储器,也可以是稳定的存储器(non-volatile memory),例如磁盘存储器。存储器1005可选的还可以是独立于前述处理器1001的存储装置。As shown in FIG. 1, the mobile terminal may include a processor 1001, such as a CPU, a communication bus 1002, a user interface 1003, a network interface 1004, a memory 1005, and a sound collector 1006. Among them, the communication bus 1002 is used to implement connection communication between these components. The user interface 1003 can include a display, an input unit such as a keyboard, and the optional user interface 1003 can also include a standard wired interface, a wireless interface. The network interface 1004 can optionally include a standard wired interface, a wireless interface (such as a WI-FI interface). The memory 1005 may be a high speed RAM memory or a stable memory (non-volatile) Memory), such as disk storage. The memory 1005 can also optionally be a storage device independent of the aforementioned processor 1001.
本领域技术人员可以理解,图1中示出的移动终端结构并不构成对移动终端的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置。It will be understood by those skilled in the art that the mobile terminal structure shown in FIG. 1 does not constitute a limitation of the mobile terminal, and may include more or less components than those illustrated, or combine some components, or different component arrangements.
如图1所示,作为一种存储介质的存储器1005中可以包括操作系统、数据存储模块、网络通信模块、用户接口模块以及声控式教育程序。As shown in FIG. 1, the memory 1005 as a storage medium may include an operating system, a data storage module, a network communication module, a user interface module, and a voice-activated educational program.
所述移动终端可以是能够实现语音采集或检测及程序运行的移动终端,例如,智能手机、平板电脑或笔记本电脑等,本实施例对此不加以限制。The mobile terminal may be a mobile terminal that can implement voice collection or detection and program running, for example, a smart phone, a tablet computer or a notebook computer, etc., which is not limited in this embodiment.
在图1所示的移动终端中,网络接口1004主要用于与后台服务器进行数据通信;声音采集器1006,配置为采集或检测当前语音;用户接口1003主要用于与用户进行数据交互;本发明移动终端中的处理器1001、存储器1005可以设置在移动终端中,所述移动终端通过处理器1001调用存储器1005中存储的声控式教育程序,并执行以下操作:In the mobile terminal shown in FIG. 1 , the network interface 1004 is mainly used for data communication with the background server; the sound collector 1006 is configured to collect or detect the current voice; the user interface 1003 is mainly used for data interaction with the user; The processor 1001 and the memory 1005 in the mobile terminal may be disposed in the mobile terminal, and the mobile terminal invokes the voice-activated educational program stored in the memory 1005 through the processor 1001, and performs the following operations:
对当前环境中的语音进行检测,获得当前教育语音;Detecting the voice in the current environment and obtaining the current educational voice;
对所述当前教育语音进行关键词识别,获得所述当前教育语音中的目标关键词;Performing keyword recognition on the current educational voice to obtain a target keyword in the current educational voice;
确定所述目标关键词的所属类别;Determining a category of the target keyword;
在所述目标关键词的所属类别为第一类别时,查找与所述目标关键词对应的音频文件,并对查找到的音频文件进行播放。When the belonging category of the target keyword is the first category, the audio file corresponding to the target keyword is searched, and the found audio file is played.
进一步地,处理器1001可以调用存储器1005中存储的声控式教育程序,还执行以下操作:Further, the processor 1001 can call the voice-activated educational program stored in the memory 1005, and also performs the following operations:
在所述目标关键词所属的类别为第二类别时,获取与所述目标关键词对应的调节指令,并根据所述调节指令同步调节当前环境,以使所述当前环境与所述目标关键词相对应。When the category to which the target keyword belongs is the second category, obtaining an adjustment instruction corresponding to the target keyword, and synchronously adjusting the current environment according to the adjustment instruction, so that the current environment and the target keyword are Corresponding.
进一步地,处理器1001可以调用存储器1005中存储的声控式教育程序,还执行以下操作:Further, the processor 1001 can call the voice-activated educational program stored in the memory 1005, and also performs the following operations:
在用户选择的教育模式为语音播放模式时,获取用户选择的待播放教育语音,并对所述待播放教育语音进行播放。When the education mode selected by the user is the voice play mode, the educational voice to be played selected by the user is acquired, and the educational voice to be played is played.
进一步地,处理器1001可以调用存储器1005中存储的声控式教育程序,还执行以下操作:Further, the processor 1001 can call the voice-activated educational program stored in the memory 1005, and also performs the following operations:
在用户选择的教育模式为个人朗读模式时,执行所述对当前环境中的语音进行检测,获得当前教育语音的操作。When the education mode selected by the user is the personal reading mode, the detecting the voice in the current environment is performed to obtain the operation of the current educational voice.
进一步地,处理器1001可以调用存储器1005中存储的声控式教育程序,还执行以下操作:Further, the processor 1001 can call the voice-activated educational program stored in the memory 1005, and also performs the following operations:
接收用户发送的播放控制指令,并根据所述播放控制指令执行相应的操作,所述播放控制指令包括:音量调节指令、音效调节指令或语速调节指令中的至少一个。Receiving a play control command sent by the user, and performing a corresponding operation according to the play control command, the play control command comprising: at least one of a volume adjustment instruction, a sound effect adjustment instruction, or a speech rate adjustment instruction.
进一步地,处理器1001可以调用存储器1005中存储的声控式教育程序,还执行以下操作:Further, the processor 1001 can call the voice-activated educational program stored in the memory 1005, and also performs the following operations:
接收用户发出的交互信息,根据所述交互信息查找对应的用户喜好信息,所述用户喜好信息包括:喜好音效、喜好教育内容或喜好场景中的至少一个;根据所述交互信息以及查找到的用户喜好信息,对当前环境进行调节。Receiving the interaction information sent by the user, and searching corresponding user preference information according to the interaction information, where the user preference information includes: at least one of a favorite sound effect, a favorite educational content, or a favorite scene; according to the interaction information and the found user Preferences, adjust the current environment.
本实施例的有益效果是:通过对当前环境中的语音进行检测,获得当前教育语音;对所述当前教育语音进行关键词识别,获得所述当前教育语音中的目标关键词;确定所述目标关键词的所属类别;在所述目标关键词的所属类别为第一类别时,查找与所述目标关键词对应的音频文件,并对查找到的音频文件进行播放。从而能够将当前环境中的教育语音立体化的呈现出来,让婴幼儿更直观的感受到教育语音内容中具体事物所对应的声音场景,进而有效地开发婴幼儿的智力,提高学习兴趣。The beneficial effects of the embodiment are: obtaining the current educational voice by detecting the voice in the current environment; performing keyword recognition on the current educational voice, obtaining a target keyword in the current educational voice; determining the target The belonging category of the keyword; when the belonging category of the target keyword is the first category, the audio file corresponding to the target keyword is searched, and the found audio file is played. Therefore, the educational voice in the current environment can be stereoscopically presented, so that the infant can more intuitively feel the sound scene corresponding to the specific things in the educational voice content, thereby effectively developing the intelligence of the infant and the child, and improving the interest in learning.
基于上述硬件结构,提出本发明声控式教育方法实施例。Based on the above hardware structure, an embodiment of the voice-activated education method of the present invention is proposed.
参照图2,图2为本发明一种声控式教育方法第一实施例的流程示意图。Referring to FIG. 2, FIG. 2 is a schematic flow chart of a first embodiment of a voice-activated education method according to the present invention.
本实施例中,所述方法包括以下步骤:In this embodiment, the method includes the following steps:
S10:对当前环境中的语音进行检测,获得当前教育语音;S10: detecting the voice in the current environment, and obtaining the current educational voice;
需要说明的是,本实施例的方法的执行主体为移动终端,所述移动终端可以是能够实现语音采集或检测及程序运行的移动终端,例如,智能手机、平板电脑或笔记本电脑等,本实施例对此不加以限制。It should be noted that the execution subject of the method in this embodiment is a mobile terminal, and the mobile terminal may be a mobile terminal capable of implementing voice collection or detection and program running, for example, a smart phone, a tablet computer or a notebook computer, etc. This example does not limit this.
可以理解的是,所述当前环境可以是能够实现所述声控式教育的场所,例如,儿童房、幼儿园教室等其他教育场所,本实施例对此不加以限制。It is to be understood that the current environment may be a place where the voice-activated education can be implemented, for example, a children's room, a kindergarten classroom, and the like, which is not limited in this embodiment.
需要说明的是,所述对当前环境中的语音进行检测,获得当前教育语音;主要是对采集到的当前环境中的语音进行检测判断,去除不属于教育语音的噪声语音,例如将当前环境中语速平缓,节奏感强,音量适中,声音富有磁性的语音或语句连贯,吐词清晰,识别度高,持续时间较长的语音判定为教育语音,具体的判断规则可根据实际情况设定,本实施例对此不加以限制。It should be noted that the detecting the voice in the current environment to obtain the current educational voice; mainly detecting and judging the voice in the collected current environment, and removing the noise voice that is not the educational voice, for example, in the current environment. The speech rate is gentle, the rhythm is strong, the volume is moderate, the voice is rich in magnetic voice or sentence coherence, the words are clear, the recognition is high, and the speech with long duration is judged as educational voice. The specific judgment rules can be set according to the actual situation. This embodiment does not limit this.
S20:对所述当前教育语音进行关键词识别,获得所述当前教育语音中的目标关键词;S20: performing keyword recognition on the current educational voice to obtain a target keyword in the current educational voice;
需要说明的是,并非每段教育语音所包含的信息或所表达的含义都需要呈现给当前环境中的婴幼儿,因此,在获取当前教育语音的过程中,就需要对当前教育语音进行关键词识别,将需要进行呈现的词语提取出来,作为目标关键词。It should be noted that not all the information contained in the educational speech or the meaning expressed need to be presented to the infants in the current environment. Therefore, in the process of obtaining the current educational speech, the keywords of the current educational speech need to be keywords. Identification, extracting the words that need to be presented as the target keyword.
S30:确定所述目标关键词的所属类别;S30: determining a category of the target keyword;
需要说明的是,确定所述目标关键词的所属类别可以是根据预设的关键词分类表来确定目标关键词所属的类别,例如,预设一个关键词表,该关键词分类表中包含有各种不同类别的关键词,例如,代表声音类别的第一类别关键词“汽笛”,“水流”,“鸟叫”等,该预设关键词分类表中具体包括的词汇以及词汇所属的类别均可根据实际情况设定,本实施例对此不加以限制。It should be noted that determining the category of the target keyword may be determining a category to which the target keyword belongs according to a preset keyword classification table, for example, presetting a keyword table, where the keyword classification table includes Various different categories of keywords, for example, the first category keywords "whistle", "water flow", "bird call", etc. representing the sound category, the vocabulary specifically included in the preset keyword classification table and the category to which the vocabulary belongs It can be set according to actual conditions, and this embodiment does not limit this.
在具体实现中,可通过查找目标关键词在预设的关键词分类表中是否存在与之对应的预设关键词,若存在,则将目标关键词类别确定为对应的预设关键词所在的类别,例如,当前目标关键词为“鸟叫”,则在预设的关键词分类表查找是否存在与“鸟叫”对应的预设关键词,若存在,且类别为第一类别,则将当前获取到的目标关键词“鸟叫”所属的类别确定为第一类别。需要说明的是,所述对应的预设关键词可以为与目标关键词意思相近或相同的关键词,例如“鸟叫”,“鸟鸣”等,具体对应规则可以自行设定,本实施例对此不加以限制。In a specific implementation, whether there is a preset keyword corresponding to the target keyword classification table in the preset keyword classification table, if yes, the target keyword category is determined as the corresponding preset keyword. The category, for example, the current target keyword is “Bird Call”, and the preset keyword classification table is used to find whether there is a preset keyword corresponding to “Bird Call”. If it exists, and the category is the first category, then The category to which the currently acquired target keyword "Bird Call" belongs is determined as the first category. It should be noted that the corresponding preset keyword may be a keyword that is similar to or the same as the target keyword, such as “bird call”, “bird song”, etc., and the specific corresponding rule may be set by itself, this embodiment There is no restriction on this.
S40:在所述目标关键词的所属类别为第一类别时,查找与所述目标关键词对应的音频文件,并对查找到的音频文件进行播放。S40: When the belonging category of the target keyword is the first category, search for an audio file corresponding to the target keyword, and play the found audio file.
需要说明的是,本实施例中,将代表声音的关键词类别预设为第一类别;实际情况中,各关键词具体类别的划分可根据实际情况设定,本实施例对此不加以限制。It should be noted that, in this embodiment, the keyword category representing the voice is preset to the first category. In the actual situation, the division of the specific category of each keyword may be set according to actual conditions, which is not limited in this embodiment. .
可以理解的是,为在婴幼儿学习或聆听教育语音的同时将代表声音的关键词立体的呈现出来,需要在获取目标关键词时立即获取目标关键词对应的音频文件,并对该音频文件进行播放。因此,为实现音频文件快速精准的查找,可预先建立一个预设关键词与该预设关键词对应音频文件之间对应的映射关系,使得在目标关键词的类别被确认时,可通过查找与所述目标关键词对应的预设关键词,然后根据所述映射关系立即获得预设关键词对应的音频文件,并对该音频文件进行播放,实现同步教学。It can be understood that, in order to present the stereoscopic representation of the voice of the child while learning or listening to the educational voice, it is necessary to acquire the audio file corresponding to the target keyword immediately when acquiring the target keyword, and perform the audio file on the audio file. Play. Therefore, in order to realize fast and accurate search of the audio file, a mapping relationship between the preset keyword and the audio file corresponding to the preset keyword may be established in advance, so that when the category of the target keyword is confirmed, the search may be performed by And the preset keyword corresponding to the target keyword, and then immediately obtaining an audio file corresponding to the preset keyword according to the mapping relationship, and playing the audio file to implement synchronous teaching.
在具体实现中,本实施例先根据预设的判断条件对当前环境中的语音进行检测判断,去除当前环境中的非教育语音进而获取当前教育语音,然后根据预设的关键词表对所述当前教育语音进行关键词识别,获得所述当前教育语音中的目标关键词以及目标关键词所属的类别,在确定目标关键词所属的类别后,根据映射关系获取对应的音频文件,并播放该音频文件,立体地将当前教育语音中目标关键词对应的现场声音播放出来。例如,在幼儿园教室里,老师以缓慢的语速朗读以下句子:“清晨的丛林小道上,苍翠的树木遮挡住了明媚的阳光,微风拂面,微凉的空气令人神清气爽,,让人觉得一切是那么的宁静与美好”,当确认老师朗读的这段文字为教育语音时,立即对这段语音进行关键词识别,当获取到目标关键词“鸟叫”时,立即获取与目标关键词“鸟叫”对应的音频文件,并对该音频文件进行播放,此时正在认真聆听老师朗读的幼儿园学生们就会听到在老师念及“鸟叫”时候,同步听到动听的鸟叫声,将大脑接收的文字信息“鸟叫”与听到的鸟叫声生动地结合了起来。In a specific implementation, the embodiment first detects and determines the voice in the current environment according to the preset judgment condition, removes the non-educational voice in the current environment, and then obtains the current educational voice, and then performs the foregoing according to the preset keyword table. The current educational voice performs keyword recognition, obtains the target keyword in the current educational voice and the category to which the target keyword belongs, and after determining the category to which the target keyword belongs, acquires the corresponding audio file according to the mapping relationship, and plays the audio. The file stereoscopically plays the live sound corresponding to the target keyword in the current educational voice. For example, in the kindergarten classroom, the teacher read the following sentence in a slow speech: "In the early morning jungle trail, the green trees obscured the bright sunshine, the breeze blew, the cool air was refreshing, People feel that everything is so quiet and beautiful. When confirming that the text read by the teacher is an educational voice, immediately identify the key words. When the target keyword "bird call" is obtained, immediately obtain and The audio file corresponding to the target keyword "Bird Call" is played and the audio file is played. At this time, the kindergarten students who are listening carefully to the teacher will hear the teacher hear the "bird call" and hear the sound simultaneously. The bird screams, vividly combining the text message “Bird Call” received by the brain with the sound of the bird heard.
本实施例提供的声控式教育方法,通过对当前环境中的语音进行检测,获得当前教育语音;对所述当前教育语音进行关键词识别,获得所述当前教育语音中的目标关键词;确定所述目标关键词的所属类别;在所述目标关键词的所属类别为第一类别时,查找与所述目标关键词对应的音频文件,并对查找到的音频文件进行播放。从而能够将当前环境中的教育语音立体化的呈现出来,让婴幼儿更直观的感受到教育语音内容中具体事物所对应的声音场景,进而有效地开发婴幼儿的智力,提高学习兴趣。The voice-activated education method provided in this embodiment obtains a current educational voice by detecting a voice in a current environment; performing keyword recognition on the current educational voice to obtain a target keyword in the current educational voice; When the category of the target keyword is the first category, the audio file corresponding to the target keyword is searched for, and the found audio file is played. Therefore, the educational voice in the current environment can be stereoscopically presented, so that the infant can more intuitively feel the sound scene corresponding to the specific things in the educational voice content, thereby effectively developing the intelligence of the infant and the child, and improving the interest in learning.
为了更好的为婴幼儿呈现出音视频教育内容的故事场景,所述方法还包括:In order to better present a story scene of audio and video education content for infants and toddlers, the method further includes:
接收用户发送的播放控制指令,并根据所述播放控制指令执行相应的操作,所述播放控制指令包括:音量调节指令、音效调节指令或语速调节指令中的至少一个。Receiving a play control command sent by the user, and performing a corresponding operation according to the play control command, the play control command comprising: at least one of a volume adjustment instruction, a sound effect adjustment instruction, or a speech rate adjustment instruction.
需要说明的是,所述用户既可以是对婴幼儿进行教育的施教用户,例如家长或教师,也可以是受教育用户,例如婴幼儿或儿童。It should be noted that the user may be a teaching user who educates the infant, such as a parent or a teacher, or an educated user, such as an infant or a child.
可以理解的是,在对当前教育语音中目标关键词同步播放过程中,用户可能会根据自身需要或喜好需要调节播放音效或音量大小,因此所述移动终端在接收到用户发送的播放控制指令时,立即执行相应操作,以提高用户体验,例如,在教学过程中,用户觉得播放声音过大,需要降低音量,移动终端在接收到用户发送的降低音量指令后,根据该指令将播放音量降低到用户目标音量。It can be understood that during the synchronous playback of the target keyword in the current educational voice, the user may adjust the playing sound effect or the volume level according to his own needs or preferences, so the mobile terminal receives the playback control command sent by the user. Immediately perform the corresponding operation to improve the user experience. For example, during the teaching process, the user feels that the playing sound is too large, and the volume needs to be lowered. After receiving the volume down command sent by the user, the mobile terminal reduces the playing volume according to the instruction. User target volume.
为了提高婴幼儿在学习教育语音中的体验感,进一步提高学习兴趣,所述方法还包括:In order to improve the experience of infants and young children in learning and educating speech, and further improving the interest in learning, the method further includes:
接收用户发出的交互信息,根据所述交互信息查找对应的用户喜好信息,所述用户喜好信息包括:喜好音效、喜好教育内容或喜好场景中的至少一个;根据所述交互信息以及查找到的用户喜好信息,对当前环境进行调节。Receiving the interaction information sent by the user, and searching corresponding user preference information according to the interaction information, where the user preference information includes: at least one of a favorite sound effect, a favorite educational content, or a favorite scene; according to the interaction information and the found user Preferences, adjust the current environment.
需要说明的是,所述交互信息可以是用户在与移动终端进行交流互动时发送的信息,例如,婴幼儿在与移动终端进行互动沟通时,发出的交流语音,又或是婴幼儿对当前环境感到不适时,发出的调节当前环境的控制语音,又或是受教育用户通过个人设备传输的多媒体文件等,所述交互信息的具体类别可根据实际需要设定,本实施例对此不加以限制。It should be noted that the interaction information may be information that is sent by the user when interacting with the mobile terminal, for example, the communication voice sent by the infant when interacting with the mobile terminal, or the infant to the current environment. When the user feels uncomfortable, the control voice of the current environment is adjusted, or the multimedia file transmitted by the educated user through the personal device, and the specific category of the interactive information may be set according to actual needs, and the embodiment does not limit this. .
需要说明的是,为实现对受教育用户的因材施教,移动终端可预先建立与受教育用户对应的个性化账户,该个性化账户中可以包括用户喜好的音效类型信息,例如“柔和型”;喜好的教育内容类别信息,例如“张爱玲的散文”,“郑渊洁的童话故事”;喜好的场景等,同时,移动终端可将受教育用户的语音特征或其使用的带有固定标识的个人设备与该个性化账户进行关联,即当受教育用户发出交互语音或交互信息时,移动终端可根据所述交互语音或交互信息查找到对应的个性化账户,获取用户喜好信息,并结合当前交互信息,对环境进行调节。It should be noted that, in order to implement the teaching of the educated user, the mobile terminal may pre-establish a personalized account corresponding to the educated user, and the personalized account may include user-friendly sound type information, such as “soft type”; Educational content category information, such as "Zhang Ailing's prose", "Zheng Yuanjie's fairy tale"; favorite scenes, etc., at the same time, the mobile terminal can use the voice features of the educated user or the personal device with the fixed logo used by the user The personalized account is associated, that is, when the educated user sends the interactive voice or the interactive information, the mobile terminal can find the corresponding personalized account according to the interactive voice or the interaction information, obtain the user preference information, and combine the current interaction information, The environment is adjusted.
可以理解的是,移动终端还可以在与用户进行交流互动的过程中,对受教育用户发送的交互信息进行记录分析,并根据分析结果对预存的用户喜好进行更新并储存,例如,随着年龄的增长,受教育用户在一段时间内,喜欢收听郑渊洁的童话,那么移动终端会在受教育用户选择教育内容时,优先将郑渊洁的童话作为教育内容推送给受教育用户;又如,在某一段时间内,移动终端又发现受教育用户喜欢收听张爱玲的散文,就会对用户喜好教育内容作相应调整,优先将张爱玲的散文作为教育内容推送给受教育用户。It can be understood that the mobile terminal can also record and analyze the interaction information sent by the educated user in the process of interacting with the user, and update and store the pre-stored user preferences according to the analysis result, for example, with age. The growth, the educated users like to listen to Zheng Yuanjie's fairy tales for a period of time, then the mobile terminal will give Zheng Yuanjie's fairy tale as an educational content to educated users when the educated user chooses educational content; for example, in a certain section In the time, the mobile terminal found that the educated users like to listen to Zhang Ailing's essays, and adjust the user's favorite educational content accordingly, and preferentially promote Zhang Ailing's essays as educational content to educated users.
本实施例通过接收用户发送的播放控制指令和交互信息,对用户的兴趣或喜好进行记录储存,并根据不同时段用户的偏好对兴趣或喜好进行更新,让受教育用户在不断成长的过程中,在一种愉快舒心的教育环境中学习知识,有效地开发了受教育者的智力,提高了学习兴趣。In this embodiment, by receiving the playback control command and the interaction information sent by the user, the user's interest or preference is recorded and stored, and the interest or preference is updated according to the preference of the user in different time periods, so that the educated user is growing continuously. Learning knowledge in a pleasant and comfortable educational environment effectively develops the intelligence of educated people and increases their interest in learning.
参照图3,图3为本发明一种声控式教育方法第二实施例的流程示意图。Referring to FIG. 3, FIG. 3 is a schematic flowchart diagram of a second embodiment of a voice-activated education method according to the present invention.
为了在对婴幼儿进行音频教育时,除了可以通过播放音频文件让婴幼儿亲身感受到代表声音的目标关键词对应的现场声音,还可以将当前教育语音中代表自然环境的关键词作为目标关键词,并通过调节当前环境来营造出所述代表自然环境的关键词对应的场景,进而让婴幼儿从嗅觉和触觉来理解和学习当前教育语音所包含的知识信息。In order to carry out audio education for infants and young children, in addition to playing the audio file, the infant can personally feel the live sound corresponding to the target keyword representing the sound, and can also use the keyword representing the natural environment in the current educational voice as the target keyword. And adjusting the current environment to create a scene corresponding to the keyword representing the natural environment, thereby allowing the infant to understand and learn the knowledge information contained in the current educational voice from the sense of smell and touch.
因此,为了进一步有效地开发婴幼儿的智力,提高他们的学习兴趣,在所述步骤S30之后,还包括:Therefore, in order to further effectively develop the intelligence of the infants and children and improve their interest in learning, after the step S30, the method further includes:
步骤S50,在所述目标关键词所属的类别为第二类别时,获取与所述目标关键词对应的调节指令,并根据所述调节指令同步调节当前环境,以使所述当前环境与所述目标关键词相对应。Step S50, when the category to which the target keyword belongs is the second category, acquiring an adjustment instruction corresponding to the target keyword, and synchronously adjusting the current environment according to the adjustment instruction, so that the current environment and the The target keyword corresponds.
需要说明的是,在本实施例中,可将代表自然环境的关键词类别预设为第二类别,例如,将代表温度的词“寒冷”、“炎热”,代表气味的词“芳香”、“清香”,代表自然现象的词“微风”、“天亮了”、“天黑了”等关键词的类别预设为第二类类别。在实际情况中各关键词具体类别的划分可根据实际情况设定,本实施例对此不加以限制。It should be noted that, in this embodiment, the keyword category representing the natural environment may be preset to the second category, for example, the words “cold” and “hot” representing temperature, and the word “fragrance” representing odor, “Fragrance”, the category of keywords such as “Breeze”, “Daylight” and “Dark” that represent natural phenomena is preset to the second category. In the actual situation, the division of the specific categories of the keywords may be set according to actual conditions, and this embodiment does not limit this.
为了再现代表自然环境的关键词对应的场景,所述调节指令包括:温度调节指令、湿度调节指令、亮度调节指令或气味调节指令中的至少一个。其中,所述温度调节指令用于调节当前环境的环境温度,湿度调节信息用于调节当前环境的环境湿度,亮度调节信息用于调节当前环境的亮度,气味调节指令用于调节当前环境的气味。In order to reproduce a scene corresponding to a keyword representing a natural environment, the adjustment instruction includes at least one of a temperature adjustment instruction, a humidity adjustment instruction, a brightness adjustment instruction, or an odor adjustment instruction. The temperature adjustment command is used to adjust the ambient temperature of the current environment, the humidity adjustment information is used to adjust the ambient humidity of the current environment, the brightness adjustment information is used to adjust the brightness of the current environment, and the odor adjustment command is used to adjust the odor of the current environment.
在具体实现中,移动终端在获取到的目标关键词所属的类别为第二类别时,立即获取目标关键词对应的预设关键词,并根据预设关键词获取对应的调节指令,然后根据所述调节指令同步调节当前环境,以使所述当前环境与所述目标关键词相对应,例如,在当前教育语音提及目标关键词“花香”之后,立即获取“花香”对应的气味调节指令,并根据所述气味调节指令,控制气味调节装置同步散发出淡淡的花香,让婴幼儿真切地感受到目标关键词“花香”对应的场景。In a specific implementation, when the acquired target keyword belongs to the second category, the mobile terminal immediately acquires the preset keyword corresponding to the target keyword, and obtains a corresponding adjustment instruction according to the preset keyword, and then according to the preset The adjustment instruction synchronously adjusts the current environment, so that the current environment corresponds to the target keyword, for example, immediately after the current educational voice refers to the target keyword “floral”, the scent adjustment instruction corresponding to “floral” is obtained immediately, And according to the odor adjustment instruction, the odor regulating device is controlled to emit a faint floral fragrance, so that the infant can truly feel the scene corresponding to the target keyword “flower”.
本实施例将当前教育语音中代表自然环境的关键词作为目标关键词,并通过获取所述目标关键词对应的调节指令,根据所述调节指令来对应调节当前环境,营造出与所述目标关键词对应的场景,从而让婴幼儿可以从嗅觉和触觉等不同的感官来理解和学习当前教育语音所包含的知识信息,更加有效地开发了婴幼儿的智力,提高了婴幼儿的学习兴趣。In this embodiment, a keyword representing a natural environment in the current educational voice is used as a target keyword, and an adjustment instruction corresponding to the target keyword is acquired, and the current environment is correspondingly adjusted according to the adjustment instruction to create a key with the target. The scene corresponding to the word, so that the infant can understand and learn the knowledge information contained in the current educational voice from different senses such as smell and touch, and more effectively develop the intelligence of the infant and the child, and improve the learning interest of the infant.
参照图4,图4为本发明一种声控式教育方法第三实施例的流程示意图,基于上述图2或图3所示的实施例,提出本发明声控式教育方法的第三实施例。Referring to FIG. 4, FIG. 4 is a schematic flowchart diagram of a third embodiment of a voice-activated education method according to the present invention. Based on the embodiment shown in FIG. 2 or FIG. 3, a third embodiment of the voice-activated education method of the present invention is proposed.
在本实施例中,为让婴幼儿在学习过程中,更好地同步感受到学习内容所代表的故事场景与情节,在步骤S10之前,还包括:In this embodiment, in order to allow the infant to better feel the story scene and the plot represented by the learning content during the learning process, before step S10, the method further includes:
步骤S01,在用户选择的教育模式为语音播放模式时,获取用户选择的待播放教育语音,并对所述待播放教育语音进行播放。In step S01, when the education mode selected by the user is the voice play mode, the educational voice to be played selected by the user is acquired, and the educational voice to be played is played.
需要说明的是,所述教育模式用于获取用户根据自身需要而确定的教育方式,例如,用户需要通过移动终端播放语音,则可选择语音播放模式;需要亲自朗读教育语音,则可选择个人朗读模式。具体模式的设定与选择均可根据实际情况而定,本实施例对此不加以限制。It should be noted that the education mode is used to obtain an education mode determined by the user according to his/her needs. For example, if the user needs to play a voice through the mobile terminal, the voice play mode may be selected; if the education voice needs to be read aloud, the individual may read aloud. mode. The setting and selection of the specific mode may be determined according to actual conditions, and this embodiment does not limit this.
可以理解的是,在用户选择以语音播放模式进行教学时,用户可先向移动终端发送预先选择的包含所述教育语音的待播放教育音视频文件,所述待播放教育语音可以是用户预先录制的教育语音。现实生活中,每个孩子对自己父母的声音往往最为熟悉认同感也较高,当父母为他们讲故事传授知识时,学习思维也相对活跃,更有利于智力开发,因此,当父母白天外出不在家时为孩子播放预先录制的音视频文件,也同样可以达到上述效果。It can be understood that, when the user selects to perform the teaching in the voice play mode, the user may first send the pre-selected educational audio and video file to be played including the educational voice to the mobile terminal, and the educational voice to be played may be pre-recorded by the user. Educational voice. In real life, each child is often most familiar with his parents' voices. When parents teach stories to them, learning and thinking are relatively active, which is more conducive to intellectual development. Therefore, when parents go out during the day, The above effects can also be achieved by playing pre-recorded audio and video files for children at home.
本实施例通过设置不同教育模式以供用户根据实际情况进行选择,在用户选择语音播放模式时,通过移动终端获取用户选择的待播放教育语音,既满足了用户不同种类的需求,也更加有效地实现了同步教学的目的。In this embodiment, by setting different education modes for the user to select according to the actual situation, when the user selects the voice play mode, the mobile terminal selects the educational voice to be played by the user, which satisfies the different types of users and is more effective. Achieve the purpose of synchronous teaching.
参照图5,图5为本发明一种声控式教育方法第四实施例的流程示意图。基于上述图2或图3所示的实施例,提出本发明声控式教育方法的第四实施例Referring to FIG. 5, FIG. 5 is a schematic flowchart diagram of a fourth embodiment of a voice-activated education method according to the present invention. Based on the embodiment shown in FIG. 2 or FIG. 3 above, a fourth embodiment of the voice-activated education method of the present invention is proposed.
为满足父母亲自对婴幼儿进行现场教学,本实施例中,在步骤S10之前,还包括:In the embodiment, before the step S10, the method further includes:
步骤S02,在用户选择的教育模式为个人朗读模式时,执行所述对当前环境中的语音进行检测,获得当前教育语音的步骤。Step S02: When the education mode selected by the user is the personal reading mode, the step of detecting the voice in the current environment to obtain the current educational voice is performed.
需要说明的是,在用户希望通过亲自讲述或朗读教育内容时,可选择个人朗读模式来进行教学。It should be noted that when the user wishes to tell or read the educational content in person, the personal reading mode can be selected for teaching.
在具体实现中,用户选择个人朗读模式进行教学时,移动终端会对当前环境中的语音进行检测,当检测到教育语音时,开始对当前教育语音进行关键词识别,并执行相应的后续步骤,例如,妈妈在为女儿讲故事时,以缓慢语速朗读以下句子:“冬天的夜晚来得比较早,天很快就黑了,寒冷的北风呼呼地吹。”当移动终端检测到上述语音为教育语音后,立即对当前教育语音进行关键词识别,在识别到目标关键词“夜晚”以后,同步慢慢地调低所在房间的亮度;在识别到目标关键词“寒冷”以后,同步慢慢地调低所在房间的温度;在识别到目标关键词“北风”以后,同步制造出吹风的效果。In a specific implementation, when the user selects the personal reading mode for teaching, the mobile terminal detects the voice in the current environment, and when the educational voice is detected, starts to perform keyword recognition on the current educational voice, and performs corresponding subsequent steps. For example, when mom told her daughter a story, she read the following sentence slowly: "The winter night comes earlier, the sky is very dark, and the cold north wind blows." When the mobile terminal detects the voice After the education of the voice, the keyword recognition of the current educational voice is immediately performed. After the target keyword "night" is recognized, the brightness of the room is slowly lowered synchronously; after the target keyword "cold" is recognized, the synchronization is slow. Slowly lower the temperature of the room; after identifying the target keyword "North Wind", the effect of blowing is simultaneously produced.
本实施例,通过设置个人朗读模式,在用户选择想要亲自对婴幼儿进行教学时,通过朗读教育内容,充分地调动婴幼儿在学习过程中的感官,很好地加深了婴幼儿对教育内容的记忆,十分积极而有效地对对婴幼儿智力进行了开发,提高了婴幼儿的学习兴趣。In this embodiment, by setting a personal reading mode, when the user chooses to personally teach the infant, the audience can fully mobilize the senses of the infant in the learning process by reading the educational content, which is a good way to deepen the infant's educational content. The memory has been very active and effective in the development of infant intelligence, which has increased the interest of infants and young children.
此外,本发明还提供一种声控式教育系统,所述教育系统包括:如图1所示的移动终端、播放设备和调节设备;其中,所述播放设备配置为在所述处理器控制下对音视频文件进行播放;所述调节设备配置为在所述处理器控制下对当前环境进行调节。In addition, the present invention also provides a voice-activated education system, the education system comprising: a mobile terminal, a playback device, and an adjustment device as shown in FIG. 1; wherein the playback device is configured to be under the control of the processor The audio and video files are played; the adjustment device is configured to adjust the current environment under the control of the processor.
此外,本发明还提供一种存储介质,其特征在于,所述存储介质上存储有声控式教育程序,所述声控式教育程序被处理器执行时实现如下操作:In addition, the present invention further provides a storage medium, wherein the storage medium stores a voice-activated educational program, and when the voice-activated educational program is executed by the processor, the following operations are implemented:
对当前环境中的语音进行检测,获得当前教育语音;Detecting the voice in the current environment and obtaining the current educational voice;
对所述当前教育语音进行关键词识别,获得所述当前教育语音中的目标关键词;Performing keyword recognition on the current educational voice to obtain a target keyword in the current educational voice;
确定所述目标关键词的所属类别;Determining a category of the target keyword;
在所述目标关键词的所属类别为第一类别时,查找与所述目标关键词对应的音频文件,并对查找到的音频文件进行播放。When the belonging category of the target keyword is the first category, the audio file corresponding to the target keyword is searched, and the found audio file is played.
进一步地,声控式教育程序被处理器执行时还实现如下操作:在所述目标关键词所属的类别为第二类别时,获取与所述目标关键词对应的调节指令,并根据所述调节指令同步调节当前环境,以使所述当前环境与所述目标关键词相对应。Further, when the voice-activated educational program is executed by the processor, the following operations are further performed: when the category to which the target keyword belongs is the second category, acquiring an adjustment instruction corresponding to the target keyword, and according to the adjustment instruction The current environment is adjusted synchronously such that the current environment corresponds to the target keyword.
进一步地,声控式教育程序被处理器执行时还实现如下操作:在用户选择的教育模式为语音播放模式时,获取用户选择的待播放教育语音,并对所述待播放教育语音进行播放。Further, when the voice-activated educational program is executed by the processor, the following operations are also performed: when the educational mode selected by the user is the voice playing mode, the educational voice to be played selected by the user is acquired, and the educational voice to be played is played.
进一步地,声控式教育程序被处理器执行时还实现如下操作:在用户选择的教育模式为个人朗读模式时,执行所述对当前环境中的语音进行检测,获得当前教育语音的操作。Further, when the voice-activated educational program is executed by the processor, the following operations are further performed: when the education mode selected by the user is the personal reading mode, the detecting the voice in the current environment is performed to obtain the operation of the current educational voice.
进一步地,声控式教育程序被处理器执行时还实现如下操作:接收用户发送的播放控制指令,并根据所述播放控制指令执行相应的操作,所述播放控制指令包括:音量调节指令、音效调节指令或语速调节指令中的至少一个。Further, when the voice-activated educational program is executed by the processor, the following operations are further performed: receiving a play control command sent by the user, and performing a corresponding operation according to the play control command, where the play control command includes: a volume adjustment command, and a sound effect adjustment At least one of an instruction or a speech rate adjustment instruction.
进一步地,声控式教育程序被处理器执行时还实现如下操作:接收用户发出的交互信息,根据所述交互信息查找对应的用户喜好信息,所述用户喜好信息包括:喜好音效、喜好教育内容或喜好场景中的至少一个;根据所述交互信息以及查找到的用户喜好信息,对当前环境进行调节。Further, when the voice-activated educational program is executed by the processor, the following operations are further performed: receiving interaction information sent by the user, and searching for corresponding user preference information according to the interaction information, where the user preference information includes: a favorite sound effect, a favorite educational content, or At least one of the favorite scenes; adjusting the current environment according to the interaction information and the found user preference information.
本实施例的有益效果是:通过对当前环境中的语音进行检测,获得当前教育语音;对所述当前教育语音进行关键词识别,获得所述当前教育语音中的目标关键词;确定所述目标关键词的所属类别;在所述目标关键词的所属类别为第一类别时,查找与所述目标关键词对应的音频文件,并对查找到的音频文件进行播放。从而将当前环境中的教育语音立体化的呈现出来,让婴幼儿更直观的感受到教育语音内容中具体事物所对应的声音场景,进而有效地开发婴幼儿的智力,提高学习兴趣。The beneficial effects of the embodiment are: obtaining the current educational voice by detecting the voice in the current environment; performing keyword recognition on the current educational voice, obtaining a target keyword in the current educational voice; determining the target The belonging category of the keyword; when the belonging category of the target keyword is the first category, the audio file corresponding to the target keyword is searched, and the found audio file is played. Therefore, the educational voice in the current environment is presented in a three-dimensional manner, so that the infant can more intuitively feel the sound scene corresponding to the specific things in the educational voice content, thereby effectively developing the intelligence of the infant and the child, and improving the interest in learning.
需要说明的是,在本文中,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者系统不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者系统所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括该要素的过程、方法、物品或者系统中还存在另外的相同要素。It is to be understood that the term "comprises", "comprising", or any other variants thereof, is intended to encompass a non-exclusive inclusion, such that a process, method, article, or It also includes other elements that are not explicitly listed, or elements that are inherent to such a process, method, item, or system. An element defined by the phrase "comprising a ..." does not exclude the presence of additional equivalent elements in a process, method, article, or system that includes the element, without further limitation.
上述本发明实施例序号仅仅为了描述,不代表实施例的优劣。The serial numbers of the embodiments of the present invention are merely for the description, and do not represent the advantages and disadvantages of the embodiments.
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到上述 实施例方法可借助软件加必需的通用硬件平台的方式来实现,当然也可以通 过硬件,但很多情况下前者是更佳的实施方式。基于这样的理解,本发明的 技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体 现出来,该计算机软件产品存储在一个存储介质(如ROM/RAM、磁碟、光 盘)中,包括若干指令用以使得一台终端设备(可以是手机,计算机,服务器,空调器,或者网络设备等)执行本发明各个实施例所述的方法。Those skilled in the art can clearly understand the above by the description of the above embodiments. The embodiment method can be implemented by means of software plus a necessary general hardware platform, and of course hardware, but in many cases the former is a better implementation. Based on such understanding, the present invention The technical solution in essence or the contribution to the prior art can be embodied in the form of a software product stored in a storage medium (such as ROM/RAM, disk, light). The disc includes a number of instructions for causing a terminal device (which may be a cell phone, a computer, a server, an air conditioner, or a network device, etc.) to perform the methods described in various embodiments of the present invention.
以上仅为本发明的优选实施例,并非因此限制本发明的专利范围,凡是利用本发明说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,均同理包括在本发明的专利保护范围内。The above are only the preferred embodiments of the present invention, and are not intended to limit the scope of the invention, and the equivalent structure or equivalent process transformations made by the description of the present invention and the drawings are directly or indirectly applied to other related technical fields. The same is included in the scope of patent protection of the present invention.

Claims (20)

  1. 一种声控式教育方法,其特征在于,所述方法包括以下步骤: A voice-activated education method, characterized in that the method comprises the following steps:
    对当前环境中的语音进行检测,获得当前教育语音;Detecting the voice in the current environment and obtaining the current educational voice;
    对所述当前教育语音进行关键词识别,获得所述当前教育语音中的目标关键词;Performing keyword recognition on the current educational voice to obtain a target keyword in the current educational voice;
    确定所述目标关键词的所属类别;Determining a category of the target keyword;
    在所述目标关键词的所属类别为第一类别时,查找与所述目标关键词对应的音频文件,并对查找到的音频文件进行播放。When the belonging category of the target keyword is the first category, the audio file corresponding to the target keyword is searched, and the found audio file is played.
  2. 如权利要求1所述的方法,其特征在于,所述确定所述目标关键词的所属类别之后,所述方法还包括:The method of claim 1, wherein after the determining the category of the target keyword, the method further comprises:
    在所述目标关键词的所属类别为第二类别时,获取与所述目标关键词对应的调节指令,并根据所述调节指令同步调节当前环境,以使所述当前环境与所述目标关键词相对应。When the belonging category of the target keyword is the second category, acquiring an adjustment instruction corresponding to the target keyword, and synchronously adjusting the current environment according to the adjustment instruction, so that the current environment and the target keyword are Corresponding.
  3. 如权利要求2所述的方法,其特征在于,所述调节指令包括:温度调节指令、湿度调节指令、亮度调节指令或气味调节指令中的至少一个。The method of claim 2, wherein the adjustment command comprises at least one of a temperature adjustment command, a humidity adjustment command, a brightness adjustment command, or an odor adjustment command.
  4. 如权利要求3所述的方法,其特征在于,所述对当前环境中的语音进行检测,获得当前教育语音之前,所述方法还包括:The method of claim 3, wherein the method further comprises: before detecting the voice in the current environment to obtain the current educational voice, the method further comprising:
    在用户选择的教育模式为语音播放模式时,获取用户选择的待播放教育语音,并对所述待播放教育语音进行播放。When the education mode selected by the user is the voice play mode, the educational voice to be played selected by the user is acquired, and the educational voice to be played is played.
  5. 如权利要求4所述的方法,其特征在于,所述对当前环境中的语音进行检测,获得当前教育语音之前,所述方法还包括:The method of claim 4, wherein the method further comprises: before detecting the voice in the current environment to obtain the current educational voice, the method further comprising:
    在用户选择的教育模式为个人朗读模式时,执行所述对当前环境中的语音进行检测,获得当前教育语音的步骤。When the education mode selected by the user is the personal reading mode, the step of detecting the voice in the current environment to obtain the current educational voice is performed.
  6. 如权利要求5所述的方法,其特征在于,所述方法还包括:The method of claim 5, wherein the method further comprises:
    接收用户发送的播放控制指令,并根据所述播放控制指令执行相应的操作,所述播放控制指令包括:音量调节指令、音效调节指令或语速调节指令中的至少一个。Receiving a play control command sent by the user, and performing a corresponding operation according to the play control command, the play control command comprising: at least one of a volume adjustment instruction, a sound effect adjustment instruction, or a speech rate adjustment instruction.
  7. 如权利要求6所述的方法,其特征在于,所述方法还包括:The method of claim 6 wherein the method further comprises:
    接收用户发出的交互信息,根据所述交互信息查找对应的用户喜好信息,所述用户喜好信息包括:喜好音效、喜好教育内容或喜好场景中的至少一个;Receiving the interaction information sent by the user, and searching for corresponding user preference information according to the interaction information, where the user preference information includes: at least one of a favorite sound effect, a favorite educational content, or a favorite scene;
    根据所述交互信息以及查找到的用户喜好信息,对当前环境进行调节。 The current environment is adjusted according to the interaction information and the found user preference information.
  8. 一种移动终端,其特征在于,所述移动终端包括:存储器、处理器及存储在所述存储器上并可在所述处理器上运行的声控式教育程序,所述声控式教育程序配置为实现如权利要求1所述的声控式教育方法的步骤。A mobile terminal, comprising: a memory, a processor, and a voice-activated educational program stored on the memory and operable on the processor, wherein the voice-activated educational program is configured to implement The steps of the voice-activated educational method of claim 1.
  9. 一种移动终端,其特征在于,所述移动终端包括:存储器、处理器及存储在所述存储器上并可在所述处理器上运行的声控式教育程序,所述声控式教育程序配置为实现如权利要求2所述的声控式教育方法的步骤。A mobile terminal, comprising: a memory, a processor, and a voice-activated educational program stored on the memory and operable on the processor, wherein the voice-activated educational program is configured to implement The steps of the voice-activated educational method of claim 2.
  10. 一种移动终端,其特征在于,所述移动终端包括:存储器、处理器及存储在所述存储器上并可在所述处理器上运行的声控式教育程序,所述声控式教育程序配置为实现如权利要求4所述的声控式教育方法的步骤。A mobile terminal, comprising: a memory, a processor, and a voice-activated educational program stored on the memory and operable on the processor, wherein the voice-activated educational program is configured to implement The steps of the voice-activated educational method of claim 4.
  11. 一种移动终端,其特征在于,所述移动终端包括:存储器、处理器及存储在所述存储器上并可在所述处理器上运行的声控式教育程序,所述声控式教育程序配置为实现如权利要求5所述的声控式教育方法的步骤。A mobile terminal, comprising: a memory, a processor, and a voice-activated educational program stored on the memory and operable on the processor, wherein the voice-activated educational program is configured to implement The steps of the voice-activated educational method of claim 5.
  12. 一种移动终端,其特征在于,所述移动终端包括:存储器、处理器及存储在所述存储器上并可在所述处理器上运行的声控式教育程序,所述声控式教育程序配置为实现如权利要求6所述的声控式教育方法的步骤。A mobile terminal, comprising: a memory, a processor, and a voice-activated educational program stored on the memory and operable on the processor, wherein the voice-activated educational program is configured to implement The steps of the voice-activated educational method of claim 6.
  13. 一种移动终端,其特征在于,所述移动终端包括:存储器、处理器及存储在所述存储器上并可在所述处理器上运行的声控式教育程序,所述声控式教育程序配置为实现如权利要求7所述的声控式教育方法的步骤。A mobile terminal, comprising: a memory, a processor, and a voice-activated educational program stored on the memory and operable on the processor, wherein the voice-activated educational program is configured to implement The steps of the voice-activated educational method of claim 7.
  14. 一种声控式教育系统,其特征在于,所述教育系统包括:权利要求8所述的移动终端、播放设备和调节设备;其中,所述播放设备配置为在所述处理器控制下对音视频文件进行播放;所述调节设备配置为在所述处理器控制下对当前环境进行调节。A voice-activated education system, characterized in that the education system comprises: the mobile terminal, the playback device and the adjustment device of claim 8; wherein the playback device is configured to control audio and video under the control of the processor The file is played; the adjustment device is configured to adjust the current environment under the control of the processor.
  15. 一种存储介质,其特征在于,所述存储介质上存储有声控式教育程序,所述声控式教育程序被处理器执行时实现如权利要求1所述的声控式教育方法的步骤。A storage medium, characterized in that the storage medium stores a voice-activated educational program, and the voice-activated educational program is executed by a processor to implement the steps of the voice-activated educational method according to claim 1.
  16. 一种存储介质,其特征在于,所述存储介质上存储有声控式教育程序,所述声控式教育程序被处理器执行时实现如权利要求2所述的声控式教育方法的步骤。A storage medium, characterized in that the storage medium stores a voice-activated educational program, and the voice-activated educational program is executed by a processor to implement the steps of the voice-activated educational method according to claim 2.
  17. 一种存储介质,其特征在于,所述存储介质上存储有声控式教育程序,所述声控式教育程序被处理器执行时实现如权利要求4所述的声控式教育方法的步骤。A storage medium, characterized in that the storage medium stores a voice-activated educational program, and the voice-activated educational program is executed by a processor to implement the steps of the voice-activated educational method according to claim 4.
  18. 一种存储介质,其特征在于,所述存储介质上存储有声控式教育程序,所述声控式教育程序被处理器执行时实现如权利要求5所述的声控式教育方法的步骤。A storage medium, characterized in that the storage medium stores a voice-activated educational program, and the voice-activated educational program is executed by a processor to implement the steps of the voice-activated educational method according to claim 5.
  19. 一种存储介质,其特征在于,所述存储介质上存储有声控式教育程序,所述声控式教育程序被处理器执行时实现如权利要求6所述的声控式教育方法的步骤。A storage medium, characterized in that the storage medium stores a voice-activated educational program, and the voice-activated educational program is executed by a processor to implement the steps of the voice-activated educational method according to claim 6.
  20. 一种存储介质,其特征在于,所述存储介质上存储有声控式教育程序,所述声控式教育程序被处理器执行时实现如权利要求7所述的声控式教育方法的步骤。 A storage medium, characterized in that the storage medium stores a voice-activated educational program, and the voice-activated educational program is executed by a processor to implement the steps of the voice-activated educational method according to claim 7.
PCT/CN2017/094486 2017-07-07 2017-07-26 Voice-controlled education method and system, mobile terminal, and storage medium WO2019006792A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201710554361.4A CN107463626A (en) 2017-07-07 2017-07-07 A kind of voice-control educational method, mobile terminal, system and storage medium
CN201710554361.4 2017-07-07

Publications (1)

Publication Number Publication Date
WO2019006792A1 true WO2019006792A1 (en) 2019-01-10

Family

ID=60546727

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/094486 WO2019006792A1 (en) 2017-07-07 2017-07-26 Voice-controlled education method and system, mobile terminal, and storage medium

Country Status (2)

Country Link
CN (1) CN107463626A (en)
WO (1) WO2019006792A1 (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107909871A (en) * 2017-12-26 2018-04-13 安徽声讯信息技术有限公司 A kind of tablet computer of intelligent sound teaching
CN108806360A (en) * 2018-05-31 2018-11-13 北京智能管家科技有限公司 Reading partner method, apparatus, equipment and storage medium
CN109872722B (en) * 2019-01-17 2021-08-31 珠海格力电器股份有限公司 Voice interaction method and device, storage medium and air conditioner
CN110534094B (en) * 2019-07-31 2022-05-31 大众问问(北京)信息科技有限公司 Voice interaction method, device and equipment
CN112580593A (en) * 2020-12-28 2021-03-30 深圳创维-Rgb电子有限公司 Behavior monitoring method and apparatus, behavior monitoring device, and computer storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2006338476A (en) * 2005-06-03 2006-12-14 Casio Comput Co Ltd Comfort improvement support device and program
CN101414412A (en) * 2007-10-19 2009-04-22 陈修志 Interaction type acoustic control children education studying device
CN203596113U (en) * 2013-07-11 2014-05-14 安徽科大讯飞信息科技股份有限公司 Playing device
CN104538030A (en) * 2014-12-11 2015-04-22 科大讯飞股份有限公司 Control system and method for controlling household appliances through voice
CN106823096A (en) * 2017-02-04 2017-06-13 张星星 Nurse a baby system

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100538823C (en) * 2006-07-13 2009-09-09 英业达股份有限公司 Language auxiliary expression system and method
CN101915448A (en) * 2010-07-30 2010-12-15 中山大学 Smart Home Atmosphere Adjustment Method and System
CN101989086A (en) * 2010-09-10 2011-03-23 李隆 Music and color-light environmental control centre based on internet
CN105808733B (en) * 2016-03-10 2019-06-21 深圳创维-Rgb电子有限公司 Display method and device
CN106027752A (en) * 2016-04-28 2016-10-12 努比亚技术有限公司 Self-adaption method and device for mobile terminal call background sounds
CN106557298A (en) * 2016-11-08 2017-04-05 北京光年无限科技有限公司 Background towards intelligent robot matches somebody with somebody sound outputting method and device
CN106873773B (en) * 2017-01-09 2021-02-05 北京奇虎科技有限公司 Robot interaction control method, server and robot

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2006338476A (en) * 2005-06-03 2006-12-14 Casio Comput Co Ltd Comfort improvement support device and program
CN101414412A (en) * 2007-10-19 2009-04-22 陈修志 Interaction type acoustic control children education studying device
CN203596113U (en) * 2013-07-11 2014-05-14 安徽科大讯飞信息科技股份有限公司 Playing device
CN104538030A (en) * 2014-12-11 2015-04-22 科大讯飞股份有限公司 Control system and method for controlling household appliances through voice
CN106823096A (en) * 2017-02-04 2017-06-13 张星星 Nurse a baby system

Also Published As

Publication number Publication date
CN107463626A (en) 2017-12-12

Similar Documents

Publication Publication Date Title
WO2019006792A1 (en) Voice-controlled education method and system, mobile terminal, and storage medium
US10957325B2 (en) Method and apparatus for speech interaction with children
CN107633719B (en) Anthropomorphic image artificial intelligence teaching system and method based on multi-language human-computer interaction
WO2020192400A1 (en) Playback terminal playback control method, apparatus, and device, and computer readable storage medium
WO2019156332A1 (en) Device for producing artificial intelligence character for augmented reality and service system using same
WO2012046901A1 (en) Music-based language-learning method, and learning device using same
JP2020056996A (en) Tone color selectable voice reproduction system, its reproduction method, and computer readable storage medium
JP2011239141A (en) Information processing method, information processor, scenery metadata extraction device, lack complementary information generating device and program
WO2016060296A1 (en) Apparatus for recording audio information and method for controlling same
US20210295836A1 (en) Information processing apparatus, information processing method, and program
JP2016100033A (en) Reproduction control apparatus
JP6889597B2 (en) robot
Kertz-Welzel “Kim had the Same Idea as Haydn”: International Perspectives on Classical Music and Music Education
WO2019190817A1 (en) Method and apparatus for speech interaction with children
Eardley-Weaver Lifting the Curtain on Opera Translation and Accessibility: Translating Opera for Audiences with Varying Sensory Ability
WO2023185007A1 (en) Sleep scene setting method and apparatus
CN214587406U (en) Story teller for children
KR100418528B1 (en) Fable book educating method and system for children on the network
KR102346158B1 (en) Emotional Intelligence Education AI Speaker System
TW201120834A (en) Audio-visual synthesis interaction system, its method, and its computer program product.
JP2006337490A (en) Content distribution system
CN107154173B (en) Language learning method and system
JP2003295749A (en) Method and device for image processing in remote learning system
JPH0822238A (en) Language training system effectively utilizing quadruple time characteristic of english
JP2002229440A (en) System for learning foreign language using dvd video

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17916494

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205 DATED 12/05/2020)

122 Ep: pct application non-entry in european phase

Ref document number: 17916494

Country of ref document: EP

Kind code of ref document: A1