CN112397068B - Voice instruction execution method and storage device - Google Patents

Voice instruction execution method and storage device Download PDF

Info

Publication number
CN112397068B
CN112397068B CN202011277363.1A CN202011277363A CN112397068B CN 112397068 B CN112397068 B CN 112397068B CN 202011277363 A CN202011277363 A CN 202011277363A CN 112397068 B CN112397068 B CN 112397068B
Authority
CN
China
Prior art keywords
voice
information
voice operation
text information
intelligent terminal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202011277363.1A
Other languages
Chinese (zh)
Other versions
CN112397068A (en
Inventor
杜铁军
何伟龙
王芬
朱治钢
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Netac Technology Co Ltd
Original Assignee
Netac Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Netac Technology Co Ltd filed Critical Netac Technology Co Ltd
Priority to CN202011277363.1A priority Critical patent/CN112397068B/en
Publication of CN112397068A publication Critical patent/CN112397068A/en
Application granted granted Critical
Publication of CN112397068B publication Critical patent/CN112397068B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/30Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Telephone Function (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The invention discloses a voice instruction execution method and storage equipment, wherein the method comprises the following steps: acquiring text information generated based on a voice instruction; the text information is sent to the intelligent terminal through the wireless network module, and the intelligent terminal is controlled to execute a voice operation instruction corresponding to the text information; acquiring voice operation data generated when the intelligent terminal executes a voice operation instruction, and storing the voice operation data; the embodiment of the invention can conveniently carry out subsequent offline playing on some audio data by the method, brings convenience to users, and is portable.

Description

Voice instruction execution method and storage device
Technical Field
The present invention relates to the field of communications technologies, and in particular, to a method for executing a voice command and a storage device.
Background
Intelligent speech is a communication that enables people to communicate with machines in language. In the daily information processed by the human cerebral cortex, the voice information accounts for 20%, which is the most important link for communication, and the man-machine conversation is convenient for people to work and live. The complete man-machine dialogue includes front-end processing of sound signals, converting sound into text for machine processing, and after machine-generated language, converting text language into sound waves by using a speech synthesis technology, thereby forming complete man-machine speech interaction. Consumer-level intelligent hardware is the racetrack that shows market potential at the earliest, and all parties to the market are aiming at consumer-level intelligent interactive terminals. However, in the prior art, no portable small-sized device can download data from the cloud and store the data in an intelligent voice mode, so that the subsequent multiplexing in an offline state is convenient, and if some audio and video resources liked by a user cannot be stored, the offline playing can be realized under the condition of no network.
Accordingly, there is a need for improvement and development in the art.
Disclosure of Invention
The invention aims to solve the technical problems that aiming at the defects in the prior art, a voice instruction execution method is provided, and aims to solve the problems that no portable small storage device in the prior art can download data from a cloud and store the data in an intelligent voice mode, so that the follow-up multiplexing in an off-line state is convenient.
The technical scheme adopted by the invention for solving the problems is as follows:
in a first aspect, an embodiment of the present invention provides a method for executing a voice instruction, where the method includes:
acquiring text information generated based on a voice instruction;
the text information is sent to an intelligent terminal through a wireless network module, and the intelligent terminal is controlled to execute a voice operation instruction corresponding to the text information;
and acquiring voice operation data generated when the intelligent terminal executes the voice operation instruction, and storing the voice operation data.
In one implementation manner, the generation manner of the text information is as follows:
acquiring a voice instruction of a user;
and converting the voice instruction into text information.
In one implementation, the converting the voice instruction into text information includes:
performing voice preprocessing, feature extraction and voice decoding on the voice command to obtain voice processing information;
and obtaining text information matched with the sound processing information according to the mapping relation between the sound processing information and the text information.
In one implementation manner, the specific steps of executing the voice operation instruction corresponding to the text information by the intelligent terminal are as follows:
analyzing the text information to obtain behavior information and name information corresponding to the text information;
determining whether the cloud application has the name information according to the name information;
and if the cloud application has the name information, executing a voice operation instruction corresponding to the behavior information.
In one implementation manner, the determining whether the cloud application has the name information according to the name information includes:
determining whether the name information exists in entry information of the cloud application according to the name information, wherein the entry information is name association information generated by the cloud application according to the name information;
and if the name information exists in the entry information of the cloud application, determining that the cloud application has the name information.
In one implementation manner, the obtaining the voice operation data generated when the intelligent terminal executes the voice operation instruction, and storing the voice operation data includes:
obtaining voice operation data generated when the intelligent terminal executes the voice operation instruction,
analyzing the voice operation data to obtain the type of the voice operation data, wherein the type comprises audio, video and text files;
and storing the voice operation data in a storage area corresponding to the type of the voice operation data in a storage module.
In one implementation manner, the obtaining the voice operation data generated when the intelligent terminal executes the voice operation instruction, and storing the voice operation data further includes:
and carrying out different encryption processing on the storage area according to the priority of the storage area.
In a second aspect, an embodiment of the present invention further provides a voice instruction execution apparatus, where the apparatus includes:
the text information sending unit is used for sending the text information to the intelligent terminal through the wireless network module and controlling the intelligent terminal to execute a voice operation instruction corresponding to the text information through the main control micro control module;
the voice operation instruction response unit is used for acquiring voice operation data generated when the intelligent terminal executes the voice operation instruction and storing the voice operation data;
the data transmission unit is used for realizing data transmission between a storage module and a functional module in the storage equipment according to the USB interface;
and the data processing control unit is used for controlling the intelligent terminal to send the voice operation data generated when the voice operation instruction is executed to the storage device through the main control micro control module.
In a third aspect, an embodiment of the present invention further provides an intelligent terminal, including a memory, and one or more programs, where the one or more programs are stored in the memory, and configured to be executed by the one or more processors, where the one or more programs include a method for executing a voice instruction according to any one of the above.
In a fourth aspect, embodiments of the present invention further provide a non-transitory computer-readable storage medium, which when executed by a processor of an electronic device, enables the electronic device to perform a voice instruction execution method as set forth in any one of the above.
The invention has the beneficial effects that: firstly, acquiring text information generated based on a voice instruction; then the text information is sent to an intelligent terminal through a wireless network module, and the intelligent terminal is controlled to execute a voice operation instruction corresponding to the text information; finally, voice operation data generated when the intelligent terminal executes the voice operation instruction are obtained, and the voice operation data are stored; therefore, in the embodiment of the invention, the voice is recognized through the storage device, then the intelligent terminal is controlled to execute the voice operation instruction corresponding to the voice, and finally the data of the intelligent terminal after the voice operation instruction is executed is stored to realize that the storage device downloads and stores the data from the cloud in an intelligent voice mode, so that the follow-up offline playing of some audio data can be conveniently carried out, convenience is brought to a user, and the storage device is portable.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings that are required to be used in the embodiments or the description of the prior art will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments described in the present invention, and other drawings may be obtained according to the drawings without inventive effort to those skilled in the art.
Fig. 1 is a schematic flow chart of a voice command execution method according to an embodiment of the present invention.
Fig. 2 is a schematic diagram of a voice command execution system according to an embodiment of the present invention.
FIG. 3 is a schematic block diagram of a voice instruction execution device according to an embodiment of the present invention.
Fig. 4 is a schematic block diagram of an internal structure of an intelligent terminal according to an embodiment of the present invention.
Detailed Description
The invention discloses a voice instruction execution method, which is used for making the purpose, the technical scheme and the effect of the invention clearer and more definite, and the invention is further described in detail below by referring to the accompanying drawings and the embodiment. It should be understood that the specific embodiments described herein are for purposes of illustration only and are not intended to limit the scope of the invention.
As used herein, the singular forms "a", "an", "the" and "the" are intended to include the plural forms as well, unless expressly stated otherwise, as understood by those skilled in the art. It will be further understood that the terms "comprises" and/or "comprising," when used in this specification, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof. It will be understood that when an element is referred to as being "connected" or "coupled" to another element, it can be directly connected or coupled to the other element or intervening elements may also be present. Further, "connected" or "coupled" as used herein may include wirelessly connected or wirelessly coupled. The term "and/or" as used herein includes all or any element and all combination of one or more of the associated listed items.
It will be understood by those skilled in the art that all terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs unless defined otherwise. It will be further understood that terms, such as those defined in commonly used dictionaries, should be interpreted as having a meaning that is consistent with their meaning in the context of the prior art and will not be interpreted in an idealized or overly formal sense unless expressly so defined herein.
In the prior art, no portable small storage device can download data from the cloud and store the data in an intelligent voice mode, so that the subsequent multiplexing in an offline state is convenient, audio and video resources which are liked by some users cannot be stored, and offline playing is realized under the condition of no network.
In order to solve the problems in the prior art, the embodiment provides a voice instruction execution method, which receives a voice signal sent by a user through an intelligent voice module in a storage device. The intelligent voice module converts the voice signal into text information which can be recognized by a machine; and then the text information is sent to the intelligent terminal through the wireless network module, the intelligent terminal is controlled to execute and return an execution result, finally, the data of the intelligent terminal after the voice operation instruction is executed is stored to realize that the storage device downloads data from the cloud through an intelligent voice mode and stores the data, so that the follow-up offline playing of some audio data can be conveniently carried out, convenience is brought to a user, and the storage device is portable. In this embodiment, the smart terminal is a mobile phone, and is controlled to execute a voice operation instruction matched with the text information sent by the storage device, after the smart terminal executes the voice operation instruction, voice operation data is generated, at this time, the smart terminal returns the voice operation data to the storage device, the storage device receives the voice operation data, and the voice operation data is stored in the storage device, in this embodiment, namely, the voice intelligent U disc. In the embodiment, the data is downloaded from the cloud and stored in an intelligent voice mode, so that the follow-up multiplexing in an offline state is facilitated.
Illustrative examples
When a user is driving, he suddenly wants to listen to a song, in the prior art, for example, the post-loading vehicle-mounted system is generally connected with the mobile phone in a Bluetooth connection or USB connection mode, the post-loading vehicle-mounted system is connected with the mobile phone in a wired or wireless mode, then the vehicle-mounted system can read song information downloaded on the mobile phone or played on line and play the song through the vehicle-mounted system, but if the user hears a song, the song is obtained and listened in an on-line mode, and when the user is in an environment without network connection, the user cannot hear the song if he wants to hear the song. The method of the embodiment of the invention can solve the actual requirement of the user, when the user inserts the voice intelligent USB flash disk on the vehicle-mounted system, and then when the user wants to listen to a song, the user does not need to stop driving to search manually, only needs to speak the requirement of the user, "downloading the legend song", and then the intelligent voice module in the voice intelligent USB flash disk can send the voice instruction of the user: the method comprises the steps that the song of a legend is downloaded and converted into text information, namely, the text information can be identified by a system, then a main control micro control module (main control MCU) in a storage device sends the text information to an intelligent terminal, and the intelligent terminal is controlled to execute voice operation instructions corresponding to the text information, and if the intelligent terminal identifies the text information: the method comprises the steps that a 'legend' is downloaded, the 'legend' is downloaded at an intelligent terminal, the 'legend' is downloaded at the intelligent terminal, the 'legend' is transmitted to a voice intelligent USB flash disk in cloud application, the voice intelligent USB flash disk receives voice operation data 'legend' through a main control micro control module, the voice operation data 'legend' is stored in the voice intelligent USB flash disk, at the moment, the voice intelligent USB flash disk of a user is inserted into vehicle-mounted equipment, namely, the voice intelligent USB flash disk is connected with the vehicle-mounted equipment through a USB interface, at the moment, the vehicle-mounted equipment can charge the voice intelligent USB flash disk, can read the voice operation data in the voice intelligent USB flash disk, namely, play the 'legend' in the voice intelligent USB flash disk, and can also circularly play favorite songs 'legend' when the user is in an environment without a network, and user experience is improved.
Exemplary method
The embodiment provides a voice instruction execution method, which can be applied to a communication intelligent terminal. As shown in fig. 1, the method includes:
step S100, acquiring text information generated based on a voice instruction;
specifically, after the user sends out voice, the intelligent voice module in the storage device converts the received voice of the user into text information through voice recognition, wherein the voice recognition is to take the voice as a research object, and the machine automatically recognizes and understands the language dictated by the human through voice signal processing and pattern recognition. In this embodiment, the storage device is a voice smart usb disk, as shown in fig. 2, and the voice recognition technology is a technology that enables a machine to convert a voice signal into a corresponding text or command through a recognition and understanding process, and the text information generated in the above manner can prepare for a subsequent smart terminal to execute an operation corresponding to the text information.
In order to obtain text information, the generation mode of the text information is as follows:
step S101, acquiring a voice instruction of a user;
step S102, converting the voice instruction into text information.
Specifically, people can communicate with each other through communication, but the machine is not capable of directly communicating with spoken language like people, so that the language of people needs to be converted into a language recognizable by the machine. In this embodiment, the storage device is a voice intelligent usb disk, and when a user sends a voice to the voice intelligent usb disk, an intelligent voice module in the storage device first obtains a voice instruction of the user, then recognizes the input voice instruction, and then converts the voice instruction into text information through voice recognition.
In order to obtain accurate text information, the converting the voice instruction into text information includes:
performing voice preprocessing, feature extraction and voice decoding on the voice command to obtain voice processing information; and obtaining text information matched with the sound processing information according to the mapping relation between the sound processing information and the text information.
Specifically, after the intelligent voice module in the storage device receives the voice command, the voice command is preprocessed, for example, preprocessing operations such as pre-emphasis, framing, windowing and the like are performed on the voice command, so as to eliminate the influence on the quality of the voice signal due to the human sounding organ and the factors such as aliasing, higher harmonic distortion, high frequency and the like caused by the storage device for collecting the voice signal. The method ensures that the signals obtained by the subsequent voice processing are more uniform and smoother as far as possible, provides high-quality parameters for signal parameter extraction, and improves the voice processing quality. The voice signal contains very rich characteristic parameters, different characteristic vectors represent different physical and acoustic meanings, so that the voice instruction is preprocessed and then is required to be subjected to characteristic extraction, the characteristic extraction is to take out or cut down information influence factors irrelevant to recognition in the voice signal as much as possible, the data quantity required to be processed in the subsequent recognition stage is reduced, and the characteristic parameters representing speaker information carried in the voice signal are generated. According to different uses of the voice feature, different feature parameters need to be extracted, so that the accuracy of recognition is guaranteed, and the LPCC and MFCC feature parameters can be adopted. Because the voice received by the hardware is an analog signal and needs to be converted into digital pulses, the voice is decoded, namely, the audio data after the characteristics are extracted are reconstructed and compressed through an acoustic model, so that sound processing information is obtained. And then mapping the sound processing information to the text information according to the mapping relation between the sound processing information and the text information, wherein in practice, a dictionary is arranged in the system, and contains rich text information and pinyin information corresponding to pronunciation, namely, the sound processing information and the text information are mapped well, and in practical use, the matched text information can be obtained according to the sound processing information.
The embodiment provides a voice instruction execution method, which can be applied to a communication intelligent terminal. As shown in fig. 1, the method includes:
step 200, the text information is sent to the intelligent terminal through the wireless network module, and the intelligent terminal is controlled to execute a voice operation instruction corresponding to the text information.
Specifically, the storage device comprises an intelligent voice module and a wireless network module, and the voice operation instruction is executed in the intelligent terminal, so that the text information is sent to the intelligent terminal through the wireless network module, and the intelligent terminal is controlled to execute the voice operation instruction corresponding to the text information through the main control micro-control module. For example, when the intelligent voice module recognizes that the text information is: the intelligent terminal is controlled by the main control micro-control module to execute a downloading task, the "legend" is downloaded, and preparation is made for the subsequent data transmission back to the storage device. In one implementation, an indicator light is arranged in the storage device, when the intelligent terminal executes a downloading task, the indicator light is controlled to be on by the main control micro control module, and when the intelligent terminal executes the downloading task, the indicator light is controlled to be off by the main control micro control module; by the mode, the user is reminded, and the user can know the working state of the storage device in time.
In order to obtain the voice operation instruction, the specific steps of the intelligent terminal executing the voice operation instruction corresponding to the text information are as follows:
step S201, analyzing the text information to obtain behavior information and name information corresponding to the text information;
step S202, determining whether a cloud application has the name information according to the name information;
step S203, if the cloud application has the name information, executing a voice operation instruction corresponding to the behavior information.
In particular, text information is composed of a plurality of parts, and each part of information represents different contents and works differently. After receiving the text information, the intelligent terminal analyzes the text information to obtain behavior information and name information corresponding to the text information. The intelligent terminal executes the voice operation instruction, which is represented by the execution behavior information, and the operation object is name information, and the intelligent terminal searches the cloud application according to the name information, so that whether the cloud application has the name information or not needs to be determined according to the name information, and the voice operation instruction corresponding to the behavior information is executed only when the cloud application has the name information. For example, when the text information is analyzed to be downloaded and the name information is "legend", the "legend" is queried in the cloud application, and the downloading is executed after the query.
In order to determine whether the cloud application has name information, the determining whether the cloud application has the name information according to the name information includes the following steps: determining whether the name information exists in entry information of the cloud application according to the name information, wherein the entry information is name association information generated by the cloud application according to the name information; and if the name information exists in the entry information of the cloud application, determining that the cloud application has the name information.
In the embodiment, after name information is acquired by an intelligent terminal, when a cloud application inquires, the cloud application acquires the name information, and then the cloud application generates serial entry information, wherein the entry information is name association information generated by the cloud application according to the name information; when the name information exists in the entry information of the cloud application, determining that the cloud application has the name information. For example, when the intelligent terminal acquires name information: the "legend" appears in the cloud, and also includes the legends such as legends, life, story and the like, namely name related information, and when the "legend" exists in the term information, the cloud application is determined to have the name information "legend".
The embodiment provides a voice instruction execution method, which can be applied to a communication intelligent terminal. As shown in fig. 1, the method includes:
step S300, voice operation data generated when the intelligent terminal executes the voice operation instruction are obtained, and the voice operation data are stored.
In practice, after the intelligent terminal executes the voice operation instruction, the intelligent terminal sends the voice operation data generated by the voice operation instruction to the storage device, the main control micro control module of the storage device can receive the voice operation data and store the voice operation data, and the stored data can be further prepared for subsequent cyclic multiplexing.
In order to obtain voice operation data, the obtaining the voice operation data generated when the intelligent terminal executes the voice operation instruction, and storing the voice operation data includes:
step 301, obtaining voice operation data generated when the intelligent terminal executes the voice operation instruction,
step S302, analyzing the voice operation data to obtain the type of the voice operation data, wherein the type comprises audio, video and text files;
step S303, the voice operation data is stored in a storage area corresponding to the type of the voice operation data in a storage module.
Specifically, the intelligent terminal executes the voice operation instruction to generate voice operation data, the storage device firstly acquires the voice operation data, and the voice operation data is analyzed to obtain the type of the voice operation data because the voice operation data has multiple types such as audio, video and text files, and finally the voice operation data is stored in a storage area corresponding to the type of the voice operation data in the storage module. For example, the storage module of the storage device stores an audio storage area, a video storage area and a text storage area, the video is stored in the video storage area after being downloaded to the video, the audio is stored in the audio storage area after being downloaded to the audio, and the text file is stored in the text file storage area after being downloaded to the text file, so that a user can conveniently and quickly search for needed contents.
In addition, different encryption processes are performed on the storage area according to the priority of the storage area. In practice, for a user, the importance degree of each storage area is different, and the authority of each storage area to different users is also possible to be different, so that the user with different authorities can operate different storage areas to obtain different file contents by performing different levels of protection according to the importance degree of the storage area, and different encryption methods are adopted for the audio storage area, the video storage area and the text storage area in the storage module. In one implementation, for example, the audio storage area is encrypted using an EFS (File Encryption Key ) algorithm, the video storage area is encrypted using an AES (Advanced Encryption Standard advanced encryption Standard) algorithm, and the text storage area is encrypted using a Rijndael (block cipher Algorithm) symmetric algorithm. In practice, the voice intelligent USB flash disk can be common data of a company, which stores important data of different businesses of the company, only a few staff in the company have the operation right to the voice intelligent USB flash disk, and for each staff, the corresponding area of the operation right is different, and only the password with the operation right of the audio storage area is disclosed to the staff A; the password with the video storage area operation right is disclosed for the staff B, and the password with the text storage area operation right is disclosed for the staff C, so that different staff can use one voice intelligent USB flash disk simultaneously, and the staff without the operation right can not acquire important files, thereby avoiding leakage, ensuring the security of company data and saving resources. In another embodiment, the voice storage USB flash disk is subjected to copy prevention processing, and the copy prevention processing is controlled by a code rate mode, so that the safety of user data can be better improved.
Exemplary apparatus
As shown in fig. 3, an embodiment of the present invention provides a voice instruction execution storage device including a text information generation unit 401, a text information transmission unit 402, a voice operation instruction response unit 403, a data transmission unit 404, a data processing control unit 405; wherein:
a text information generating unit 401, configured to obtain text information generated based on a voice instruction of the intelligent voice module;
a text information sending unit 402, configured to send the text information to an intelligent terminal through a wireless network module, and control the intelligent terminal to execute a voice operation instruction corresponding to the text information through a master control micro control module;
a voice operation instruction response unit 403, configured to obtain voice operation data generated when the intelligent terminal executes the voice operation instruction, and store the voice operation data;
a data transmission unit 404, configured to implement data transmission between a storage module and a functional module in the storage device according to the USB interface;
and the data processing control unit 405 is configured to control, by using the master control micro control module, the intelligent terminal to send the voice operation data generated when the voice operation instruction is executed to the storage device.
The present embodiment also provides a voice instruction execution storage device, including a text information generating unit 401, which is used for receiving a voice instruction by a text information generating unit in the storage device through an intelligent voice module on the storage device, then generating a text information by the voice instruction, and further including a text information transmitting unit 402 connected with the text information generating unit, the text information transmitting unit 402 transmitting the text information, and transmitting the text information to an intelligent terminal through a wireless network module, wherein the intelligent terminal can execute a voice operation instruction only, so that the intelligent terminal is controlled by a master control micro control module to execute the voice operation instruction corresponding to the text information; the text information sending unit 402 is connected with the voice operation instruction response unit 403, and the voice operation instruction response unit 403 is used for executing the voice operation instruction on the intelligent terminal to generate voice operation data, and then the storage device can acquire the voice operation data and store the voice operation data; the storage device further includes a data transmission unit 404, configured to implement data transmission between a storage module and other functional modules or intelligent terminals in the storage device according to the USB interface; in one implementation, the storage device, i.e., the voice smart USB disk, is connected to the power supply module through the USB interface, and the voice smart USB disk is charged through the power supply module. In another implementation manner, the storage device, that is, the voice intelligent USB flash disk, is connected with the playing device through the USB interface, so that files in the voice intelligent USB flash disk are played. In order to realize control, a voice command execution storage device further includes a data processing control unit 405, which is used to control, through a master micro control module, the intelligent terminal to send voice operation data generated when the voice operation command is executed to the storage device.
Based on the above embodiment, the present invention further provides an intelligent terminal, and a functional block diagram thereof may be shown in fig. 4. The intelligent terminal comprises a processor, a memory, a network interface, a display screen and a temperature sensor which are connected through a system bus. The processor of the intelligent terminal is used for providing computing and control capabilities. The memory of the intelligent terminal comprises a nonvolatile storage medium and an internal memory. The non-volatile storage medium stores an operating system and a computer program. The internal memory provides an environment for the operation of the operating system and computer programs in the non-volatile storage media. The network interface of the intelligent terminal is used for communicating with an external terminal through network connection. The computer program is executed by a processor to implement a method of executing voice instructions. The display screen of the intelligent terminal can be a liquid crystal display screen or an electronic ink display screen, and a temperature sensor of the intelligent terminal is arranged in the intelligent terminal in advance and used for detecting the running temperature of internal equipment.
It will be appreciated by those skilled in the art that the schematic diagram in fig. 4 is merely a block diagram of a portion of the structure related to the present invention and does not constitute a limitation of the smart terminal to which the present invention is applied, and that a specific smart terminal may include more or less components than those shown in the drawings, or may combine some components, or have different arrangements of components.
In one embodiment, a smart terminal is provided that includes a memory, and one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by one or more processors, the one or more programs comprising instructions for:
acquiring text information generated based on a voice instruction;
the text information is sent to an intelligent terminal through a wireless network module, and the intelligent terminal is controlled to execute a voice operation instruction corresponding to the text information;
and acquiring voice operation data generated when the intelligent terminal executes the voice operation instruction, and storing the voice operation data.
Those skilled in the art will appreciate that implementing all or part of the above described methods may be accomplished by way of a computer program stored on a non-transitory computer readable storage medium, which when executed, may comprise the steps of the embodiments of the methods described above. Any reference to memory, storage, database, or other medium used in embodiments provided herein may include non-volatile and/or volatile memory. The nonvolatile memory can include Read Only Memory (ROM), programmable ROM (PROM), electrically Programmable ROM (EPROM), electrically Erasable Programmable ROM (EEPROM), or flash memory. Volatile memory can include Random Access Memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in a variety of forms such as Static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double Data Rate SDRAM (DDRSDRAM), enhanced SDRAM (ESDRAM), synchronous Link DRAM (SLDRAM), memory bus direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM), among others.
In summary, the invention discloses a voice instruction execution method, an intelligent terminal and a storage medium, wherein the method comprises the following steps:
firstly, acquiring text information generated based on a voice instruction; then the text information is sent to an intelligent terminal through a wireless network module, and the intelligent terminal is controlled to execute a voice operation instruction corresponding to the text information; finally, voice operation data generated when the intelligent terminal executes the voice operation instruction are obtained, and the voice operation data are stored; therefore, in the embodiment of the invention, the voice is recognized through the storage device, then the intelligent terminal is controlled to execute the voice operation instruction corresponding to the voice, and finally the data of the intelligent terminal after the voice operation instruction is executed is stored to realize that the storage device downloads and stores the data from the cloud in an intelligent voice mode, so that the follow-up offline playing is convenient for the user.
It is to be understood that the present invention discloses a method for executing voice instructions, and it is to be understood that the invention is not limited to the above examples and that modifications and variations may be made by those skilled in the art in light of the above teachings, and all such modifications and variations are intended to be included within the scope of the appended claims.

Claims (6)

1. A method for executing a voice command, applied to a storage device, the method comprising:
acquiring text information generated by the storage device based on voice instructions;
the text information is sent to an intelligent terminal through a wireless network module, and the intelligent terminal is controlled to execute a voice operation instruction corresponding to the text information;
acquiring voice operation data generated when the intelligent terminal executes the voice operation instruction, and storing the voice operation data into the storage device so as to multiplex the voice operation data in an offline state;
the generation mode of the text information is as follows:
acquiring a voice instruction of a user; converting the voice instruction into text information;
the converting the voice instruction into text information includes:
performing voice preprocessing, feature extraction and voice decoding on the voice command to obtain voice processing information; according to the mapping relation between the sound processing information and the text information, obtaining the text information matched with the sound processing information;
the specific steps of the intelligent terminal executing the voice operation instruction corresponding to the text information are as follows:
analyzing the text information to obtain behavior information and name information corresponding to the text information; determining whether the cloud application has the name information according to the name information; if the cloud application has the name information, executing a voice operation instruction corresponding to the behavior information;
the determining whether the cloud application has the name information according to the name information comprises:
determining whether the name information exists in entry information of the cloud application according to the name information, wherein the entry information is name association information generated by the cloud application according to the name information; and if the name information exists in the entry information of the cloud application, determining that the cloud application has the name information.
2. The voice command execution method according to claim 1, wherein the acquiring voice operation data generated when the intelligent terminal executes the voice operation command and storing the voice operation data comprises:
obtaining voice operation data generated when the intelligent terminal executes the voice operation instruction,
analyzing the voice operation data to obtain the type of the voice operation data, wherein the type comprises audio, video and text files;
and storing the voice operation data in a storage area corresponding to the type of the voice operation data in a storage module.
3. The voice command execution method according to claim 2, wherein the acquiring voice operation data generated when the intelligent terminal executes the voice operation command and storing the voice operation data further comprises:
and carrying out different encryption processing on the storage area according to the priority of the storage area.
4. A voice instruction execution storage device, the storage device comprising:
the text information generating unit is used for acquiring text information generated based on the voice instruction of the intelligent voice module; the generation mode of the text information is as follows: acquiring a voice instruction of a user; converting the voice instruction into text information; the converting the voice instruction into text information includes: performing voice preprocessing, feature extraction and voice decoding on the voice command to obtain voice processing information; according to the mapping relation between the sound processing information and the text information, obtaining the text information matched with the sound processing information;
the text information sending unit is used for sending the text information to the intelligent terminal through the wireless network module and controlling the intelligent terminal to execute a voice operation instruction corresponding to the text information through the main control micro control module; the specific steps of the intelligent terminal executing the voice operation instruction corresponding to the text information are as follows: analyzing the text information to obtain behavior information and name information corresponding to the text information; determining whether the cloud application has the name information according to the name information; if the cloud application has the name information, executing a voice operation instruction corresponding to the behavior information; the determining whether the cloud application has the name information according to the name information comprises: determining whether the name information exists in entry information of the cloud application according to the name information, wherein the entry information is name association information generated by the cloud application according to the name information; if the name information exists in the entry information of the cloud application, determining that the cloud application has the name information;
the voice operation instruction response unit is used for acquiring voice operation data generated when the intelligent terminal executes the voice operation instruction and storing the voice operation data;
the data transmission unit is used for realizing data transmission between a storage module and a functional module in the storage equipment according to the USB interface;
and the data processing control unit is used for controlling the intelligent terminal to send the voice operation data generated when the voice operation instruction is executed to the storage device through the main control micro control module.
5. An intelligent terminal comprising a memory, and one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by one or more processors, the one or more programs comprising instructions for performing the method of any of claims 1-3.
6. A non-transitory computer readable storage medium, wherein instructions in the storage medium, when executed by a processor of an electronic device, enable the electronic device to perform the method of any one of claims 1-3.
CN202011277363.1A 2020-11-16 2020-11-16 Voice instruction execution method and storage device Active CN112397068B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011277363.1A CN112397068B (en) 2020-11-16 2020-11-16 Voice instruction execution method and storage device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011277363.1A CN112397068B (en) 2020-11-16 2020-11-16 Voice instruction execution method and storage device

Publications (2)

Publication Number Publication Date
CN112397068A CN112397068A (en) 2021-02-23
CN112397068B true CN112397068B (en) 2024-03-26

Family

ID=74599885

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011277363.1A Active CN112397068B (en) 2020-11-16 2020-11-16 Voice instruction execution method and storage device

Country Status (1)

Country Link
CN (1) CN112397068B (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103187076A (en) * 2011-12-28 2013-07-03 上海博泰悦臻电子设备制造有限公司 Voice music control device
CN103685393A (en) * 2012-09-13 2014-03-26 大陆汽车投资(上海)有限公司 Vehicle-borne voice control terminal, voice control system and data processing system
CN108366319A (en) * 2018-03-30 2018-08-03 京东方科技集团股份有限公司 Intelligent sound box and its sound control method
CN108495160A (en) * 2018-02-08 2018-09-04 百度在线网络技术(北京)有限公司 Intelligent control method, system, equipment and storage medium
CN109979036A (en) * 2019-04-03 2019-07-05 深圳市海圳汽车技术有限公司 With recorder control and the system and control method of speech recognition controlled, recorder
CN110992955A (en) * 2019-12-25 2020-04-10 苏州思必驰信息科技有限公司 Voice operation method, device, equipment and storage medium of intelligent equipment
WO2020133946A1 (en) * 2018-12-24 2020-07-02 深圳创维-Rgb电子有限公司 Device control method, device, apparatus and medium
CN111681658A (en) * 2020-06-05 2020-09-18 苏州思必驰信息科技有限公司 Voice control method and device for vehicle-mounted APP

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107943405A (en) * 2016-10-13 2018-04-20 广州市动景计算机科技有限公司 Sound broadcasting device, method, browser and user terminal
CN109474843B (en) * 2017-09-08 2021-09-03 腾讯科技(深圳)有限公司 Method for voice control of terminal, client and server

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103187076A (en) * 2011-12-28 2013-07-03 上海博泰悦臻电子设备制造有限公司 Voice music control device
CN103685393A (en) * 2012-09-13 2014-03-26 大陆汽车投资(上海)有限公司 Vehicle-borne voice control terminal, voice control system and data processing system
CN108495160A (en) * 2018-02-08 2018-09-04 百度在线网络技术(北京)有限公司 Intelligent control method, system, equipment and storage medium
CN108366319A (en) * 2018-03-30 2018-08-03 京东方科技集团股份有限公司 Intelligent sound box and its sound control method
WO2020133946A1 (en) * 2018-12-24 2020-07-02 深圳创维-Rgb电子有限公司 Device control method, device, apparatus and medium
CN109979036A (en) * 2019-04-03 2019-07-05 深圳市海圳汽车技术有限公司 With recorder control and the system and control method of speech recognition controlled, recorder
CN110992955A (en) * 2019-12-25 2020-04-10 苏州思必驰信息科技有限公司 Voice operation method, device, equipment and storage medium of intelligent equipment
CN111681658A (en) * 2020-06-05 2020-09-18 苏州思必驰信息科技有限公司 Voice control method and device for vehicle-mounted APP

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
一种基于云平台的智能机器人语音交互系统设计;林枫亭等;电子测试;20180305(第Z1期);40-42 *
基于USB多路语音信号实时采集系统的设计与实现;吕钊;吴小培;李密;;电子测量技术;20080215(第02期);17-19 *

Also Published As

Publication number Publication date
CN112397068A (en) 2021-02-23

Similar Documents

Publication Publication Date Title
CN111667814B (en) Multilingual speech synthesis method and device
CN110970014B (en) Voice conversion, file generation, broadcasting and voice processing method, equipment and medium
CN108831437B (en) Singing voice generation method, singing voice generation device, terminal and storage medium
CN110661927A (en) Voice interaction method and device, computer equipment and storage medium
CN111261151B (en) Voice processing method and device, electronic equipment and storage medium
CN109599092B (en) Audio synthesis method and device
CN114333865B (en) Model training and tone conversion method, device, equipment and medium
US9009050B2 (en) System and method for cloud-based text-to-speech web services
CN109376363A (en) A kind of real-time voice interpretation method and device based on earphone
CN107808007A (en) Information processing method and device
CN110992955A (en) Voice operation method, device, equipment and storage medium of intelligent equipment
CN112163084B (en) Problem feedback method, device, medium and electronic equipment
CN109346057A (en) A kind of speech processing system of intelligence toy for children
CN111640434A (en) Method and apparatus for controlling voice device
KR20200011198A (en) Method, apparatus and computer program for providing interaction message
CN110310642A (en) Method of speech processing, system, client, equipment and storage medium
CN110503960A (en) Uploaded in real time method, apparatus, equipment and the storage medium of speech recognition result
CN118098199B (en) Personalized speech synthesis method, electronic device, server and storage medium
CN114945110B (en) Method and device for synthesizing voice head video, terminal equipment and readable storage medium
CN111563182A (en) Voice conference record storage processing method and device
CN113256133B (en) Conference summary management method, device, computer equipment and storage medium
CN112397068B (en) Voice instruction execution method and storage device
CN112712793A (en) ASR (error correction) method based on pre-training model under voice interaction and related equipment
CN109213466B (en) Court trial information display method and device
CN113421571A (en) Voice conversion method and device, electronic equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant