WO2019006792A1 - Voice-activated educational method and system, mobile terminal, and storage medium
- Publication number
- WO2019006792A1 (application PCT/CN2017/094486)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- voice
- educational
- activated
- processor
- target keyword
- Prior art date
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/60—Information retrieval; Database structures therefor; File system structures therefor of audio data
- G06F16/63—Querying
- G06F16/638—Presentation of query results
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/60—Information retrieval; Database structures therefor; File system structures therefor of audio data
- G06F16/68—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/683—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
- G06F16/685—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using automatically derived transcript of audio data, e.g. lyrics
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/10—Services
- G06Q50/20—Education
- G06Q50/205—Education administration or guidance
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09B—EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
- G09B5/00—Electrically-operated educational appliances
- G09B5/04—Electrically-operated educational appliances with audible presentation of the material to be studied
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09B—EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
- G09B5/00—Electrically-operated educational appliances
- G09B5/06—Electrically-operated educational appliances with both visual and audible presentation of the material to be studied
- G09B5/065—Combinations of audio and video presentations, e.g. videotapes, videodiscs, television systems
Definitions
- the present invention relates to the field of education, and in particular, to a voice-activated educational method, a mobile terminal, a system, and a storage medium.
- Infants and toddlers watch TV differently from adults.
- Adults can follow the sequence of pictures and understand the logical relationships between them.
- Together, the pictures form a coherent story.
- Infants under the age of two are completely different. A newborn does not think as an adult does and has only a few unconditioned reflexes, such as the rooting reflex, the sucking reflex, and the grasping reflex.
- The development of a baby's thinking in this age group is called the "sensorimotor period".
- Infants and young children learn about and recognize the world mainly through hearing, sight, and touch.
- Thinking at this stage is intuitive action thinking; that is, infants and young children think mainly in concrete, direct terms through perceptual actions.
- The main object of the present invention is to provide a voice-activated educational method, a mobile terminal, a system, and a storage medium, which aim to solve the problem that the prior art cannot present sound to a baby stereoscopically and therefore cannot effectively develop an infant's intelligence.
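- The present invention provides a voice-activated educational method, the method comprising the steps of:
- detecting the voice in the current environment to obtain the current educational voice;
- performing keyword recognition on the current educational voice to obtain a target keyword in the current educational voice;
- determining the category to which the target keyword belongs;
- when the category to which the target keyword belongs is the first category, searching for the audio file corresponding to the target keyword and playing the found audio file.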
- the method further includes:
- when the category to which the target keyword belongs is the second category, obtaining an adjustment instruction corresponding to the target keyword, and synchronously adjusting the current environment according to the adjustment instruction, so that the current environment corresponds to the target keyword.
- the adjustment instruction comprises at least one of a temperature adjustment instruction, a humidity adjustment instruction, a brightness adjustment instruction or an odor adjustment instruction.
- The method further includes: when the education mode selected by the user is the voice play mode, acquiring the educational voice to be played that the user has selected, and playing it.
- The method further includes: when the education mode selected by the user is the personal reading mode, performing the step of detecting the voice in the current environment to obtain the current educational voice.
- The method further includes: receiving a play control instruction sent by the user, and performing the corresponding operation according to the play control instruction, where the play control instruction includes at least one of a volume adjustment instruction, a sound effect adjustment instruction, or a speech rate adjustment instruction.
- The method further includes: receiving interaction information sent by the user, and searching for the corresponding user preference information according to the interaction information, where the user preference information includes at least one of a favorite sound effect, favorite educational content, or a favorite scene; and adjusting the current environment according to the interaction information and the found user preference information.
- the present invention further provides a mobile terminal, the mobile terminal comprising: a memory, a processor, and a voice-activated educational program stored on the memory and operable on the processor,
- the voice-activated educational program is configured to implement the steps of the voice-activated educational method as described above.
- The present invention further provides a voice-activated education system, the education system comprising: the mobile terminal described above, a playback device, and an adjustment device; wherein the playback device is configured to play audio and video files under the control of the processor, and the adjustment device is configured to adjust the current environment under the control of the processor.
- The present invention also provides a storage medium on which a voice-activated educational program is stored; when the voice-activated educational program is executed by a processor, the steps of the voice-activated educational method described above are implemented.
- The present invention obtains the current educational voice by detecting the voice in the current environment, performs keyword recognition on the current educational voice to obtain a target keyword, and determines the category to which the target keyword belongs. When that category is the first category, the audio file corresponding to the target keyword is searched for and the found audio file is played, so that the educational voice in the current environment is presented stereoscopically.
- Infants and young children can thus perceive more intuitively the sound scene corresponding to the specific things mentioned in the educational voice, which effectively develops their intelligence and improves their interest in learning.
- FIG. 1 is a schematic structural diagram of a mobile terminal in a hardware operating environment according to an embodiment of the present invention
- FIG. 2 is a schematic flow chart of a first embodiment of a voice-activated education method according to the present invention
- FIG. 3 is a schematic flow chart of a second embodiment of a voice-activated education method according to the present invention.
- FIG. 4 is a schematic flow chart of a third embodiment of a voice-activated education method according to the present invention.
- FIG. 5 is a schematic flow chart of a fourth embodiment of a voice-activated education method according to the present invention.
- FIG. 1 is a schematic structural diagram of a mobile terminal in a hardware operating environment according to an embodiment of the present invention.
- the mobile terminal may include a processor 1001, such as a CPU, a communication bus 1002, a user interface 1003, a network interface 1004, a memory 1005, and a sound collector 1006.
- the communication bus 1002 is used to implement connection communication between these components.
- The user interface 1003 can include a display and an input unit such as a keyboard; optionally, the user interface 1003 can also include a standard wired interface and a wireless interface.
- The network interface 1004 can optionally include a standard wired interface and a wireless interface (such as a Wi-Fi interface).
- The memory 1005 may be a high-speed RAM or a non-volatile memory, such as disk storage.
- The memory 1005 can also optionally be a storage device independent of the aforementioned processor 1001.
- The mobile terminal structure shown in FIG. 1 does not constitute a limitation of the mobile terminal, which may include more or fewer components than those illustrated, combine some components, or arrange the components differently.
- the memory 1005 as a storage medium may include an operating system, a data storage module, a network communication module, a user interface module, and a voice-activated educational program.
- the mobile terminal may be a mobile terminal that can implement voice collection or detection and program running, for example, a smart phone, a tablet computer or a notebook computer, etc., which is not limited in this embodiment.
- the network interface 1004 is mainly used for data communication with the background server;
- the sound collector 1006 is configured to collect or detect the current voice;
- the user interface 1003 is mainly used for data interaction with the user;
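- The processor 1001 and the memory 1005 may be disposed in the mobile terminal, and the mobile terminal invokes the voice-activated educational program stored in the memory 1005 through the processor 1001 and performs the following operations:
- detecting the voice in the current environment to obtain the current educational voice;
- performing keyword recognition on the current educational voice to obtain a target keyword in the current educational voice;
- determining the category to which the target keyword belongs;
- when the category to which the target keyword belongs is the first category, searching for the audio file corresponding to the target keyword and playing the found audio file.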
- processor 1001 can call the voice-activated educational program stored in the memory 1005, and also performs the following operations:
- when the category to which the target keyword belongs is the second category, obtaining an adjustment instruction corresponding to the target keyword, and synchronously adjusting the current environment according to the adjustment instruction, so that the current environment corresponds to the target keyword.
- processor 1001 can call the voice-activated educational program stored in the memory 1005, and also performs the following operations:
- when the education mode selected by the user is the voice play mode, acquiring the educational voice to be played that the user has selected, and playing it.
- processor 1001 can call the voice-activated educational program stored in the memory 1005, and also performs the following operations:
- when the education mode selected by the user is the personal reading mode, performing the operation of detecting the voice in the current environment to obtain the current educational voice.
- processor 1001 can call the voice-activated educational program stored in the memory 1005, and also performs the following operations:
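- receiving a play control instruction sent by the user, and performing the corresponding operation according to the play control instruction, the play control instruction comprising at least one of a volume adjustment instruction, a sound effect adjustment instruction, or a speech rate adjustment instruction.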
- processor 1001 can call the voice-activated educational program stored in the memory 1005, and also performs the following operations:
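- receiving interaction information sent by the user, and searching for the corresponding user preference information according to the interaction information, the user preference information including at least one of a favorite sound effect, favorite educational content, or a favorite scene; and adjusting the current environment according to the interaction information and the found user preference information.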
- The beneficial effects of the embodiment are as follows: the current educational voice is obtained by detecting the voice in the current environment; keyword recognition is performed on the current educational voice to obtain a target keyword; the category to which the target keyword belongs is determined; and when that category is the first category, the audio file corresponding to the target keyword is searched for and played. The educational voice in the current environment can therefore be presented stereoscopically, so that the infant can perceive more intuitively the sound scene corresponding to the specific things mentioned in the educational voice, which effectively develops the intelligence of infants and young children and improves their interest in learning.
- FIG. 2 is a schematic flow chart of a first embodiment of a voice-activated education method according to the present invention.
- the method includes the following steps:
- The execution subject of the method in this embodiment is a mobile terminal.
- The mobile terminal may be any terminal capable of voice collection or detection and of running programs, for example, a smartphone, a tablet computer, or a notebook computer; this embodiment does not limit this.
- the current environment may be a place where the voice-activated education can be implemented, for example, a children's room, a kindergarten classroom, and the like, which is not limited in this embodiment.
- Detecting the voice in the current environment to obtain the current educational voice mainly means examining the voice collected in the current environment and removing noise that is not educational voice.
- For example, speech in the current environment that has a gentle rate, a strong rhythm, a moderate volume, a rich and resonant tone, coherent sentences, clear articulation, high recognizability, and a long duration is judged to be educational voice.
- The specific judgment rules can be set according to the actual situation; this embodiment does not limit this.
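- As a rough illustration of such judgment rules, the sketch below filters speech segments by rate, volume, duration, and clarity. The `SpeechSegment` fields and every threshold are assumptions made for this example; the source leaves the rules to be set according to the actual situation.

```python
from dataclasses import dataclass


@dataclass
class SpeechSegment:
    """Hypothetical per-segment features; the source does not define them."""
    words_per_minute: float
    volume_db: float
    duration_s: float
    clarity: float  # 0.0 (unintelligible) to 1.0 (perfectly clear)


def is_educational_voice(seg: SpeechSegment) -> bool:
    """Return True when a segment looks like slow, clear, sustained speech."""
    gentle_rate = 60 <= seg.words_per_minute <= 140   # assumed range
    moderate_volume = 45 <= seg.volume_db <= 70       # assumed range
    long_enough = seg.duration_s >= 5.0               # assumed minimum
    clear = seg.clarity >= 0.8                        # assumed minimum
    return gentle_rate and moderate_volume and long_enough and clear


def filter_educational(segments):
    """Keep only the segments judged to be educational voice."""
    return [s for s in segments if is_educational_voice(s)]
```

In practice the features would come from an audio front end, which is outside the scope of this sketch.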
- Keyword recognition needs to be performed on the current educational voice, and the words that need to be presented are extracted as target keywords.
- Determining the category of the target keyword may mean determining the category to which the target keyword belongs according to a preset keyword classification table. The keyword classification table contains keywords of various categories, for example, the first-category keywords "whistle", "water flow", and "bird call", which represent sounds. The vocabulary included in the preset keyword classification table and the category to which each word belongs can be set according to actual conditions; this embodiment does not limit this.
- The preset keyword corresponding to the target keyword is looked up in the table, and the category of the target keyword is determined to be the category of that preset keyword. For example, if the current target keyword is "bird call", the preset keyword classification table is searched for a preset keyword corresponding to "bird call". If one exists and its category is the first category, the category of the currently acquired target keyword "bird call" is determined to be the first category.
- The corresponding preset keyword may be a keyword that is the same as or similar to the target keyword, such as "bird call" or "bird song"; the specific correspondence rule may be set as needed, and this embodiment does not limit this.
- In this embodiment, the category of keywords representing sounds is preset as the first category.
- The assignment of each keyword to a specific category may be set according to actual conditions, which is not limited in this embodiment.
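- The table lookup described above can be sketched as follows. The table contents, the `ALIASES` rule for similar keywords, and the function names are assumptions for illustration only; the source does not fix any of them.

```python
FIRST_CATEGORY = "first"    # keywords that name a sound
SECOND_CATEGORY = "second"  # keywords that name a natural-environment condition

# Hypothetical preset keyword classification table.
KEYWORD_TABLE = {
    "whistle": FIRST_CATEGORY,
    "water flow": FIRST_CATEGORY,
    "bird call": FIRST_CATEGORY,
    "cold": SECOND_CATEGORY,
    "fragrance": SECOND_CATEGORY,
}

# Hypothetical correspondence rule: similar wording maps to a preset keyword.
ALIASES = {"bird song": "bird call"}


def to_preset_keyword(target):
    """Return the preset keyword matching the target keyword, or None."""
    key = target.strip().lower()
    key = ALIASES.get(key, key)
    return key if key in KEYWORD_TABLE else None


def classify(target):
    """Return the category of the target keyword, or None if it is not preset."""
    preset = to_preset_keyword(target)
    return KEYWORD_TABLE[preset] if preset is not None else None
```

A plain dictionary keeps the lookup constant-time and makes it easy to extend the table per deployment.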
- A mapping relationship between each preset keyword and its corresponding audio file may be established in advance. Once the category of the target keyword is confirmed, the preset keyword corresponding to the target keyword is looked up, the audio file corresponding to that preset keyword is obtained immediately according to the mapping relationship, and the audio file is played to achieve synchronous teaching.
- This embodiment first examines the voice in the current environment according to the preset judgment conditions, removes the non-educational voice, and obtains the current educational voice. It then performs keyword recognition on the current educational voice according to the preset keyword table to obtain the target keyword and the category to which it belongs. After determining the category, it acquires the corresponding audio file according to the mapping relationship and plays the audio file, stereoscopically presenting the live sound corresponding to the target keyword in the current educational voice.
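- A minimal sketch of the keyword-to-audio mapping and playback step follows. It assumes the educational voice has already been transcribed to text by some speech recognizer (not shown); the file paths, the substring matching, and the `play` callback are hypothetical.

```python
# Hypothetical mapping from preset first-category keywords to audio files.
AUDIO_FILES = {
    "bird call": "sounds/bird_call.wav",
    "water flow": "sounds/water_flow.wav",
}


def find_target_keywords(transcript):
    """Return preset keywords in the order they first occur in the transcript."""
    text = transcript.lower()
    hits = [(text.find(k), k) for k in AUDIO_FILES if k in text]
    return [k for _, k in sorted(hits)]


def handle_educational_voice(transcript, play):
    """Look up and play the audio file for each target keyword found.

    `play` is any callable that accepts a file path, such as a wrapper
    around the playback device.
    """
    played = []
    for keyword in find_target_keywords(transcript):
        path = AUDIO_FILES[keyword]
        play(path)
        played.append(path)
    return played
```

Passing the player in as a callable keeps the lookup logic testable without real audio hardware.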
- For example, the teacher slowly reads the following sentence: "In the early morning canyon trail, the green trees obscured the bright sunshine, the breeze blew, the cool air was refreshing, and people felt that everything was so quiet and beautiful."
- The text read by the teacher is an educational voice.
- When the target keyword "bird call" is obtained, the audio file corresponding to the target keyword "bird call" is immediately obtained and played.
- The kindergarten students listening attentively thus hear the teacher say "bird call" and simultaneously hear the sound of birds calling.
- This vividly links the word "bird call" received by the brain with the bird sound they hear.
- The voice-activated education method provided in this embodiment obtains the current educational voice by detecting the voice in the current environment, performs keyword recognition on the current educational voice to obtain a target keyword, and determines the category to which the target keyword belongs.
- When the category of the target keyword is the first category, the audio file corresponding to the target keyword is searched for and the found audio file is played.
- The educational voice in the current environment can therefore be presented stereoscopically, so that the infant can perceive more intuitively the sound scene corresponding to the specific things mentioned in the educational voice, which effectively develops the intelligence of infants and young children and improves their interest in learning.
- the method further includes:
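- receiving a play control instruction sent by the user, and performing the corresponding operation according to the play control instruction, the play control instruction comprising at least one of a volume adjustment instruction, a sound effect adjustment instruction, or a speech rate adjustment instruction.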
- the user may be a teaching user who educates the infant, such as a parent or a teacher, or an educated user, such as an infant or a child.
- The user may adjust the sound effect or the volume according to his or her own needs or preferences, so the mobile terminal performs the corresponding operation immediately on receiving a play control instruction from the user, which improves the user experience. For example, if during teaching the user finds the playback too loud and wants to lower the volume, the mobile terminal, after receiving the volume-down instruction sent by the user, lowers the playback volume to the user's target volume.
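- The handling of play control instructions could look like the sketch below. The `PlaybackState` fields, the instruction names, and the clamping ranges are assumptions for this example rather than anything specified in the source.

```python
from dataclasses import dataclass


@dataclass
class PlaybackState:
    volume: int = 50             # percent, 0 to 100 (assumed scale)
    sound_effect: str = "default"
    speech_rate: float = 1.0     # 1.0 means normal speed (assumed scale)


def apply_play_control(state, instruction, value):
    """Apply one play control instruction and return the updated state."""
    if instruction == "volume":
        state.volume = max(0, min(100, int(value)))
    elif instruction == "sound_effect":
        state.sound_effect = str(value)
    elif instruction == "speech_rate":
        state.speech_rate = max(0.5, min(2.0, float(value)))
    else:
        raise ValueError(f"unknown play control instruction: {instruction}")
    return state
```

Clamping out-of-range values, instead of rejecting them, is one possible design choice; a stricter implementation might report an error to the user.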
- the method further includes:
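- receiving interaction information sent by the user, and searching for the corresponding user preference information according to the interaction information, the user preference information including at least one of a favorite sound effect, favorite educational content, or a favorite scene; and adjusting the current environment according to the interaction information and the found user preference information.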
- The interaction information may be information sent by the user when interacting with the mobile terminal, for example, communication voice uttered by the infant when interacting with the mobile terminal, control voice by which the infant adjusts the current environment, or a multimedia file transmitted by the educated user through a personal device.
- The specific categories of interaction information may be set according to actual needs; this embodiment does not limit this.
- The mobile terminal may pre-establish a personalized account for the educated user. The personalized account may include the user's favorite sound type, such as "soft type"; favorite educational content categories, such as "Zhang Ailing's prose" or "Zheng Yuanjie's fairy tales"; favorite scenes; and so on. The mobile terminal can also associate the educated user's voice features, or a personal device with a fixed identifier used by the user, with the personalized account.
- That is, when the educated user sends interactive voice or other interaction information, the mobile terminal can find the corresponding personalized account according to it, obtain the user preference information, and adjust the environment by combining the preference information with the current interaction information.
- The mobile terminal can also record and analyze the interaction information sent by the educated user while interacting with the user, and update the pre-stored user preferences according to the analysis result.
- For example, if the mobile terminal finds that, as the educated user grows older, the user has come to like listening to Zhang Ailing's essays, it adjusts the user's favorite educational content accordingly and preferentially recommends Zhang Ailing's essays as educational content.
- The user's interests or preferences are thus recorded and stored, and are updated as they change over time, so that the educated user keeps learning in a pleasant and comfortable educational environment while growing up, which effectively develops the user's intelligence and increases his or her interest in learning.
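- One way to picture the personalized account and its update is the sketch below. It assumes the user has already been identified (for instance by voice features or a device identifier) and reduces "analysis" to counting which content category is requested most often; the names and the counting rule are hypothetical.

```python
from collections import Counter
from dataclasses import dataclass, field


@dataclass
class PersonalAccount:
    favorite_sound: str = "soft"   # assumed default sound type
    favorite_content: str = ""
    listen_counts: Counter = field(default_factory=Counter)


# Hypothetical store keyed by an identifier derived from voice or device.
ACCOUNTS = {}


def find_account(user_id):
    """Return the account for this user, creating one if none exists."""
    return ACCOUNTS.setdefault(user_id, PersonalAccount())


def record_interaction(user_id, content_category):
    """Record one interaction and refresh the favorite content category."""
    account = find_account(user_id)
    account.listen_counts[content_category] += 1
    account.favorite_content = account.listen_counts.most_common(1)[0][0]
    return account
```

A real system would likely weight recent interactions more heavily so that preferences can shift as the child grows.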
- FIG. 3 is a schematic flowchart diagram of a second embodiment of a voice-activated education method according to the present invention.
- Besides letting the infant personally experience the live sound corresponding to a target keyword that represents a sound, keywords representing the natural environment in the current educational voice can also be used as target keywords, and the current environment can be adjusted to create a scene corresponding to such a keyword, allowing the infant to understand and learn the knowledge contained in the current educational voice through senses such as smell and touch.
- the method further includes:
- Step S50: when the category to which the target keyword belongs is the second category, acquiring an adjustment instruction corresponding to the target keyword, and synchronously adjusting the current environment according to the adjustment instruction, so that the current environment corresponds to the target keyword.
- The category of keywords representing the natural environment may be preset as the second category; for example, keywords such as "cold" and "hot" representing temperature, "fragrance" representing odor, and "breeze", "daylight", and "dark" representing natural phenomena are preset as the second category.
- the division of the specific categories of the keywords may be set according to actual conditions, and this embodiment does not limit this.
- the adjustment instruction includes at least one of a temperature adjustment instruction, a humidity adjustment instruction, a brightness adjustment instruction, or an odor adjustment instruction.
- The temperature adjustment instruction is used to adjust the ambient temperature of the current environment.
- The humidity adjustment instruction is used to adjust the ambient humidity of the current environment.
- The brightness adjustment instruction is used to adjust the brightness of the current environment.
- The odor adjustment instruction is used to adjust the odor of the current environment.
- When the acquired target keyword belongs to the second category, the mobile terminal immediately acquires the preset keyword corresponding to the target keyword, obtains the corresponding adjustment instruction according to the preset keyword, and then synchronously adjusts the current environment according to the adjustment instruction, so that the current environment corresponds to the target keyword.
- For example, as soon as the current educational voice mentions the target keyword "floral", the odor adjustment instruction corresponding to "floral" is obtained, and the odor adjustment device is controlled according to that instruction to emit a faint floral fragrance, so that the infant can truly experience the scene corresponding to the target keyword "floral".
- In this embodiment, a keyword representing the natural environment in the current educational voice is used as a target keyword, the adjustment instruction corresponding to the target keyword is acquired, and the current environment is adjusted accordingly to create the scene corresponding to the target keyword.
- The infant can thus understand and learn the knowledge contained in the current educational voice through different senses such as smell and touch, which develops the intelligence of infants and young children more effectively and improves their interest in learning.
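- The dispatch from a second-category keyword to an adjustment instruction can be sketched as below. The `ADJUSTMENTS` table and the `AdjustmentDevice` stand-in, which merely records commands, are assumptions; a real adjustment device would drive heating, lighting, or scent hardware.

```python
# Hypothetical mapping from second-category keywords to (kind, action).
ADJUSTMENTS = {
    "cold": ("temperature", "lower"),
    "hot": ("temperature", "raise"),
    "dark": ("brightness", "lower"),
    "daylight": ("brightness", "raise"),
    "floral": ("odor", "emit floral scent"),
}


class AdjustmentDevice:
    """Stand-in for the adjustment device; it only records commands."""

    def __init__(self):
        self.log = []

    def adjust(self, kind, action):
        self.log.append((kind, action))


def adjust_environment(keyword, device):
    """Send the adjustment instruction for a keyword; return False if none."""
    instruction = ADJUSTMENTS.get(keyword.lower())
    if instruction is None:
        return False
    device.adjust(*instruction)
    return True
```

Keywords without an entry, such as first-category sound keywords, are simply ignored by this step.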
- FIG. 4 is a schematic flowchart diagram of a third embodiment of a voice-activated education method according to the present invention. Based on the embodiment shown in FIG. 2 or FIG. 3, a third embodiment of the voice-activated education method of the present invention is proposed.
- the method further includes:
- Step S01: when the education mode selected by the user is the voice play mode, the educational voice to be played that the user has selected is acquired and played.
- The education mode is the mode chosen by the user according to his or her needs. For example, if the user needs to play a voice through the mobile terminal, the voice play mode may be selected; if the user wants to read the educational content aloud in person, the personal reading mode may be selected.
- the setting and selection of the specific mode may be determined according to actual conditions, and this embodiment does not limit this.
- The user may first send the pre-selected audio or video file containing the educational voice to be played to the mobile terminal; the educational voice to be played may be pre-recorded by the user.
- In real life, each child is usually most familiar with the voices of his or her parents. When parents tell the child stories, the child's learning and thinking are relatively active, which is more conducive to intellectual development. When parents are out during the day, the same effect can therefore be achieved by playing pre-recorded audio and video files for the child at home.
- By playing the educational voice selected by the user, the mobile terminal meets the needs of different types of users and achieves the purpose of synchronous teaching more effectively.
- FIG. 5 is a schematic flowchart diagram of a fourth embodiment of a voice-activated education method according to the present invention. Based on the embodiment shown in FIG. 2 or FIG. 3 above, a fourth embodiment of the voice-activated education method of the present invention is proposed.
- Before step S10, the method further includes:
- Step S02: when the education mode selected by the user is the personal reading mode, the step of detecting the voice in the current environment to obtain the current educational voice is performed.
- When the user wants to read aloud in person, the personal reading mode can be selected for teaching.
- When the user selects the personal reading mode for teaching, the mobile terminal detects the voice in the current environment and, when educational voice is detected, starts keyword recognition on the current educational voice and performs the corresponding subsequent steps. For example, when a mother tells her daughter a story, she slowly reads the following sentence: "The winter night comes earlier, the sky is very dark, and the cold north wind blows."
- After detecting that the voice is educational voice, the mobile terminal immediately performs keyword recognition on it. After the target keyword "night" is recognized, the brightness of the room is slowly lowered in sync; after the target keyword "cold" is recognized, the temperature of the room is slowly lowered in sync; after the target keyword "north wind" is recognized, a blowing-wind effect is produced at the same time.
- By reading the educational content in person, the user can fully mobilize the infant's senses during learning. This deepens the infant's memory of the educational content, plays a very positive and effective role in developing the infant's intelligence, and increases the interest of infants and young children in learning.
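The winter-night example above can be read as a lookup from recognized keywords to environment adjustments. The following is a minimal Python sketch under that reading, not the patent's implementation: the `Adjustment` class, the `ADJUSTMENTS` table, the device names, and `adjustments_for` are all hypothetical, and keyword recognition is approximated by substring matching on a sentence that is assumed to be already transcribed.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Adjustment:
    device: str  # hypothetical device name
    action: str  # hypothetical human-readable instruction


# Hypothetical table for environment-related keywords from the example.
ADJUSTMENTS = {
    "night": Adjustment("light", "dim slowly"),
    "cold": Adjustment("air_conditioner", "lower temperature slowly"),
    "north wind": Adjustment("fan", "start blowing"),
}


def adjustments_for(sentence: str) -> list[Adjustment]:
    """Return adjustments in the order their keywords occur in the sentence."""
    text = sentence.lower()
    hits = []
    for keyword, adjustment in ADJUSTMENTS.items():
        position = text.find(keyword)
        if position != -1:
            hits.append((position, adjustment))
    hits.sort(key=lambda hit: hit[0])
    return [adjustment for _, adjustment in hits]


sentence = ("The winter night comes earlier, the sky is very dark, "
            "and the cold north wind blows.")
for adjustment in adjustments_for(sentence):
    print(adjustment.device, "->", adjustment.action)
```

Ordering the hits by position keeps each adjustment in step with the spoken sentence, which is one way to approximate the synchronous behavior the embodiment describes.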
- The present invention also provides a voice-activated education system, the education system comprising: a mobile terminal, a playback device, and an adjustment device as shown in FIG. 1. The playback device is configured to play audio and video files under the control of the processor; the adjustment device is configured to adjust the current environment under the control of the processor.
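- The present invention further provides a storage medium storing a voice-activated educational program. When the voice-activated educational program is executed by the processor, the following operations are implemented:
- detecting the voice in the current environment to obtain the current educational voice; performing keyword recognition on the current educational voice to obtain a target keyword in the current educational voice; determining the category to which the target keyword belongs; and, when the category to which the target keyword belongs is the first category, searching for the audio file corresponding to the target keyword and playing the found audio file.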
- When the voice-activated educational program is executed by the processor, the following operations are further performed: when the category to which the target keyword belongs is the second category, acquiring an adjustment instruction corresponding to the target keyword, and synchronously adjusting the current environment according to the adjustment instruction so that the current environment corresponds to the target keyword.
- When the voice-activated educational program is executed by the processor, the following operations are further performed: when the education mode selected by the user is the voice playing mode, acquiring the to-be-played educational voice selected by the user and playing it.
- When the voice-activated educational program is executed by the processor, the following operations are further performed: when the education mode selected by the user is the personal reading mode, performing the operation of detecting the voice in the current environment to obtain the current educational voice.
- When the voice-activated educational program is executed by the processor, the following operations are further performed: receiving a play control command sent by the user and performing the corresponding operation according to the play control command, where the play control command includes at least one of a volume adjustment command, a sound effect adjustment command, or a speech rate adjustment command.
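As an illustration of the play control operation, the sketch below applies the three command types named above to a playback state. It is only a sketch under stated assumptions: the `PlaybackState` fields, the dictionary command format, and the value ranges (0 to 100 for volume, 0.5 to 2.0 for speech rate) are hypothetical and do not come from the patent text.

```python
from dataclasses import dataclass


@dataclass
class PlaybackState:
    volume: int = 50           # percent, assumed range 0-100
    sound_effect: str = "none"
    speech_rate: float = 1.0   # assumed scale, 1.0 = normal speed


def apply_command(state: PlaybackState, command: dict) -> PlaybackState:
    """Apply one play control command; unknown command types are rejected."""
    kind = command.get("type")
    if kind == "volume":
        state.volume = max(0, min(100, int(command["value"])))
    elif kind == "sound_effect":
        state.sound_effect = str(command["value"])
    elif kind == "speech_rate":
        state.speech_rate = max(0.5, min(2.0, float(command["value"])))
    else:
        raise ValueError(f"unsupported play control command: {kind!r}")
    return state


state = PlaybackState()
apply_command(state, {"type": "volume", "value": 130})  # clamped to 100
apply_command(state, {"type": "speech_rate", "value": 0.8})
print(state)
```

Clamping out-of-range values rather than rejecting them is a design choice made for this sketch; a real device could equally report an error to the user.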
- When the voice-activated educational program is executed by the processor, the following operations are further performed: receiving interaction information sent by the user and searching for the corresponding user preference information according to the interaction information, where the user preference information includes at least one of a favorite sound effect, favorite educational content, or a favorite scene; and adjusting the current environment according to the interaction information and the found user preference information.
- The beneficial effects of this embodiment are as follows. The current educational voice is obtained by detecting the voice in the current environment; keyword recognition is performed on the current educational voice to obtain a target keyword in it; the category to which the target keyword belongs is determined; and when that category is the first category, the audio file corresponding to the target keyword is searched for and the found audio file is played. The educational voice in the current environment is thereby presented in a three-dimensional manner, so that infants and young children can more intuitively experience the sound scene corresponding to specific things in the educational voice content, which effectively develops their intelligence and increases their interest in learning.
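The category-based flow summarized above (recognize keywords, determine the category, then either play an audio file or adjust the environment) can be sketched in a few lines of Python. This is a hedged illustration rather than the patent's implementation: speech detection is assumed to have already produced a text transcript, and the `Category` names, the lookup tables, the file names, and the instruction strings are all hypothetical.

```python
from enum import Enum


class Category(Enum):
    SOUND = 1        # "first category": keyword has an associated audio file
    ENVIRONMENT = 2  # "second category": keyword has an adjustment instruction


# Hypothetical lookup tables; keywords, file names, and instructions are made up.
KEYWORD_CATEGORY = {
    "dog": Category.SOUND,
    "rain": Category.SOUND,
    "night": Category.ENVIRONMENT,
}
AUDIO_FILES = {"dog": "dog_bark.mp3", "rain": "rain.mp3"}
ADJUSTMENT_INSTRUCTIONS = {"night": "dim_lights"}


def recognize_keywords(transcript: str) -> list[str]:
    """Return known target keywords found in the transcript, in spoken order."""
    words = [word.strip('.,!?;:"').lower() for word in transcript.split()]
    return [word for word in words if word in KEYWORD_CATEGORY]


def handle(transcript: str) -> list[tuple[str, str]]:
    """Map each target keyword to a (kind, payload) action by its category."""
    actions = []
    for keyword in recognize_keywords(transcript):
        category = KEYWORD_CATEGORY[keyword]
        if category is Category.SOUND:
            actions.append(("play", AUDIO_FILES[keyword]))
        else:
            actions.append(("adjust", ADJUSTMENT_INSTRUCTIONS[keyword]))
    return actions


print(handle("At night the dog barked at the rain."))
```

Returning a list of actions instead of calling a player or a device directly keeps the sketch testable without hardware; the playback device and the adjustment device would consume these actions.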
- The methods of the above embodiments can be implemented by software plus a necessary general-purpose hardware platform, or of course by hardware, but in many cases the former is the better implementation.
- The technical solution of the present invention, in essence or in the part that contributes over the prior art, can be embodied in the form of a software product stored in a storage medium (such as ROM/RAM, a magnetic disk, or an optical disc) and including a number of instructions for causing a terminal device (which may be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.) to perform the methods described in the various embodiments of the present invention.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Business, Economics & Management (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Educational Administration (AREA)
- Educational Technology (AREA)
- Databases & Information Systems (AREA)
- Tourism & Hospitality (AREA)
- Library & Information Science (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Strategic Management (AREA)
- Data Mining & Analysis (AREA)
- Human Resources & Organizations (AREA)
- General Business, Economics & Management (AREA)
- Primary Health Care (AREA)
- Marketing (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Economics (AREA)
- Electrically Operated Instructional Devices (AREA)
Abstract
The invention relates to a voice-activated education method and system, a mobile terminal, and a storage medium. The method comprises: detecting a voice in the current environment to obtain a current educational voice (S10); performing keyword recognition on the current educational voice to obtain a target keyword in the current educational voice (S20); determining the category of the target keyword (S30); and, when the category of the target keyword is a first category, searching for an audio file corresponding to the target keyword and playing the found audio file (S40). An infant or young child can thus more intuitively experience a sound scene corresponding to a specific object in the educational voice content, which develops the child's intelligence and increases the child's interest in learning.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710554361.4 | 2017-07-07 | ||
CN201710554361.4A CN107463626A (zh) | 2017-07-07 | 2017-07-07 | Voice-activated education method, mobile terminal, system, and storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2019006792A1 (fr) | 2019-01-10 |
Family
ID=60546727
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2017/094486 WO2019006792A1 (fr) | 2017-07-07 | 2017-07-26 | Voice-activated education method and system, mobile terminal, and storage medium |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN107463626A (fr) |
WO (1) | WO2019006792A1 (fr) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
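CN107909871A (zh) * | 2017-12-26 | 2018-04-13 | 安徽声讯信息技术有限公司 | Tablet computer for intelligent voice teaching |
CN108806360A (zh) * | 2018-05-31 | 2018-11-13 | 北京智能管家科技有限公司 | Reading companion method, apparatus, device, and storage medium |
CN109872722B (zh) * | 2019-01-17 | 2021-08-31 | 珠海格力电器股份有限公司 | Voice interaction method and apparatus, storage medium, and air conditioner |
CN110534094B (zh) * | 2019-07-31 | 2022-05-31 | 大众问问(北京)信息科技有限公司 | Voice interaction method, apparatus, and device |
CN112580593A (zh) * | 2020-12-28 | 2021-03-30 | 深圳创维-Rgb电子有限公司 | Behavior monitoring method and apparatus, behavior monitoring device, and computer storage medium |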
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2006338476A (ja) * | 2005-06-03 | 2006-12-14 | Casio Comput Co Ltd | Comfort improvement support device and program |
CN101414412A (zh) * | 2007-10-19 | 2009-04-22 | 陈修志 | Interactive voice-controlled children's education and learning device |
CN203596113U (zh) * | 2013-07-11 | 2014-05-14 | 安徽科大讯飞信息科技股份有限公司 | Playback device |
CN104538030A (zh) * | 2014-12-11 | 2015-04-22 | 科大讯飞股份有限公司 | Control system and method capable of controlling household appliances by voice |
CN106823096A (zh) * | 2017-02-04 | 2017-06-13 | 张星星 | Infant care system |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
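CN100538823C (zh) * | 2006-07-13 | 2009-09-09 | 英业达股份有限公司 | Language-assisted expression system and method |
CN101915448A (zh) * | 2010-07-30 | 2010-12-15 | 中山大学 | Smart home atmosphere adjustment method and system |
CN101989086A (zh) * | 2010-09-10 | 2011-03-23 | 李隆 | Internet-based music, color, and light environment control center |
CN105808733B (zh) * | 2016-03-10 | 2019-06-21 | 深圳创维-Rgb电子有限公司 | Display method and device |
CN106027752A (zh) * | 2016-04-28 | 2016-10-12 | 努比亚技术有限公司 | Adaptive method and device for call background sound of a mobile terminal |
CN106557298A (zh) * | 2016-11-08 | 2017-04-05 | 北京光年无限科技有限公司 | Background dubbing output method and device for an intelligent robot |
CN106873773B (zh) * | 2017-01-09 | 2021-02-05 | 北京奇虎科技有限公司 | Robot interaction control method, server, and robot |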
-
2017
- 2017-07-07 CN CN201710554361.4A patent/CN107463626A/zh active Pending
- 2017-07-26 WO PCT/CN2017/094486 patent/WO2019006792A1/fr active Application Filing
Also Published As
Publication number | Publication date |
---|---|
CN107463626A (zh) | 2017-12-12 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
WO2019006792A1 (fr) | Voice-activated education method and system, mobile terminal, and storage medium | |
US10957325B2 (en) | Method and apparatus for speech interaction with children | |
CN107633719B (zh) | Anthropomorphic-avatar artificial intelligence teaching system and method based on multilingual human-computer interaction | |
WO2020192400A1 (fr) | Method, apparatus, and device for controlling playback of a playback terminal, and computer-readable storage medium | |
Marschark | Raising and educating a deaf child: A comprehensive guide to the choices, controversies, and decisions faced by parents and educators | |
WO2012046901A1 (fr) | Music-based language learning method and learning device therefor | |
JP2020056996A (ja) | Voice playback system with selectable timbre, playback method thereof, and computer-readable recording medium | |
WO2016060296A1 (fr) | Apparatus for recording audio information and control method thereof | |
JP2011239141A (ja) | Information processing method, information processing device, scene metadata extraction device, loss recovery information generation device, and program | |
JP2016100033A (ja) | Playback control device | |
CN112270768A (zh) | Ancient book reading method and system based on virtual reality technology, and construction method thereof | |
US20210295836A1 (en) | Information processing apparatus, information processing method, and program | |
JP2006337490A (ja) | Content distribution system | |
WO2023185007A1 (fr) | Sleep scene setting method and apparatus | |
WO2019190817A1 (fr) | Method and apparatus for voice interaction with children | |
Eardley-Weaver | Lifting the Curtain on Opera Translation and Accessibility: Translating Opera for Audiences with Varying Sensory Ability | |
JP6889597B2 (ja) | Robot | |
KR100393122B1 (ko) | Language learning device and language learning method for the deaf using bone-conduction hearing, and storage medium | |
KR102346158B1 (ko) | AI speaker system for emotional intelligence education | |
TW201120834A (en) | Audio-visual synthesis interaction system, its method, and its computer program product. | |
CN107154173B (zh) | Language learning method and system | |
KR20010044310A (ko) | Fairy tale data service method and system using a network | |
WO2023214740A1 (fr) | Audio output system and method | |
WO2023214739A1 (fr) | Audio output system and method | |
JP2003295749A (ja) | Image processing method and image processing apparatus in a remote learning system | |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application | Ref document number: 17916494; Country of ref document: EP; Kind code of ref document: A1 |
| NENP | Non-entry into the national phase | Ref country code: DE |
| 32PN | Ep: public notification in the ep bulletin as address of the addressee cannot be established | Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205 DATED 12/05/2020) |
| 122 | Ep: pct application non-entry in european phase | Ref document number: 17916494; Country of ref document: EP; Kind code of ref document: A1 |