CN111724791A - Recognition control method based on intelligent voice equipment - Google Patents

Recognition control method based on intelligent voice equipment Download PDF

Info

Publication number
CN111724791A
CN111724791A CN202010444071.6A CN202010444071A CN111724791A CN 111724791 A CN111724791 A CN 111724791A CN 202010444071 A CN202010444071 A CN 202010444071A CN 111724791 A CN111724791 A CN 111724791A
Authority
CN
China
Prior art keywords
voice
intelligent
intelligent voice
equipment
control method
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010444071.6A
Other languages
Chinese (zh)
Inventor
陈超
潘叶江
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Vatti Co Ltd
Original Assignee
Vatti Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Vatti Co Ltd filed Critical Vatti Co Ltd
Priority to CN202010444071.6A priority Critical patent/CN111724791A/en
Publication of CN111724791A publication Critical patent/CN111724791A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/30Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/1815Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/32Multiple recognisers used in sequence or in parallel; Score combination systems therefor, e.g. voting systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/34Adaptation of a single recogniser for parallel processing, e.g. by use of multiple processors or cloud computing

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Theoretical Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The invention discloses an identification control method based on intelligent voice equipment, which comprises the following steps: presetting a plurality of voice recognition service providers and application scenes corresponding to the voice recognition service providers on a cloud platform; the intelligent voice equipment automatically uploads the voice file to a corresponding voice recognition service provider according to the selected application scene; and the corresponding voice recognition service provider analyzes the voice file and feeds back an analysis result to the intelligent voice equipment. According to the recognition control method based on the intelligent voice equipment, the multiple voice recognition service providers and the corresponding application scenes are preset, so that each voice recognition service provider can process the good application scene and analyze the voice files sent by the intelligent voice equipment in the application scene, the accuracy of semantic analysis is integrally improved, and the intelligent voice equipment is better controlled.

Description

Recognition control method based on intelligent voice equipment
Technical Field
The invention belongs to the technical field of recognition of intelligent voice equipment, and particularly relates to a recognition control method based on the intelligent voice equipment.
Background
The speech recognition technology is a high-tech technology for converting a speech signal into a corresponding text or command by a machine through recognition and understanding, and is widely applied to various fields such as industry, home appliances, communication, automotive electronics, medical care, home services, consumer electronics, and the like. Along with popularization and application of AI intelligent voice recognition, many intelligent devices have an AI voice recognition function.
However, when a plurality of devices work together in a fixed environment for human-computer interaction, the following problems exist: 1. the phenomenon that the same voice resolves a plurality of semantic results because the voice AI identification and the AI equipment come from a plurality of voice identification service providers; 2. the accuracy of speech recognition is low due to the difference of the precision and the algorithm of different intelligent devices, so that the intelligent devices are abnormally awakened, wrong speech is played, and abnormal actions are executed.
Disclosure of Invention
In order to solve the problems, the invention provides an identification control method based on intelligent voice equipment, which can process an application scene which is good at self and a voice file sent by the corresponding intelligent voice equipment by a plurality of voice identification service providers, thereby integrally improving the accuracy of semantic analysis and better controlling the intelligent voice equipment.
The technical scheme adopted by the invention is as follows:
a recognition control method based on intelligent voice equipment comprises the following steps:
s1, presetting a plurality of voice recognition service providers and corresponding application scenes on a cloud platform;
s2, the intelligent voice equipment automatically uploads the voice file to a corresponding voice recognition service provider according to the selected application scene;
and S3, the corresponding voice recognition service provider analyzes the voice file and feeds back the analysis result to the intelligent voice equipment.
Preferably, the S1 is specifically:
the method comprises the steps that service addresses of a plurality of voice recognition service providers and application scenes corresponding to the voice recognition service providers are preset on a cloud platform.
Preferably, the service address of the voice recognition service provider can be modified, deleted or added.
Preferably, the application scenarios corresponding to the voice recognition service providers are all good application scenarios.
Preferably, the S2 is specifically:
and the user freely selects the application scene, and the intelligent voice equipment automatically uploads the voice file to the corresponding voice recognition service provider according to the application scene selected by the user.
Preferably, the S3 is specifically:
and the corresponding voice recognition service provider receives the voice file, analyzes the voice file by combining the application scene of the voice file, and feeds back the analysis result to the intelligent voice equipment.
Preferably, the application scenario includes one of a smart home scenario, a smart office scenario, and a smart medical scenario.
Preferably, the method further comprises the following steps:
the intelligent voice equipment is main intelligent voice equipment, and the main intelligent voice equipment controls other auxiliary intelligent voice equipment according to a feedback result.
Preferably, the master intelligent voice device and the slave intelligent voice device are internally provided with voice execution modules for unifying the specification parameters.
Preferably, the voice execution module is a plurality of microphones and loudspeakers with the same parameter standard, and the parameters comprise sampling frequency and volume.
Compared with the prior art, the recognition control method based on the intelligent voice equipment, disclosed by the invention, has the advantages that a plurality of voice recognition service providers and corresponding application scenes are preset, so that each voice recognition service provider can process the good application scene and analyze the voice file sent by the intelligent voice equipment in the application scene, the semantic analysis accuracy is integrally improved, and the intelligent voice equipment is better controlled.
Drawings
Fig. 1 is a schematic flowchart of a recognition control method based on an intelligent speech device according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
The embodiment of the invention provides an identification control method based on intelligent voice equipment, which comprises the following steps as shown in figure 1:
s1, presetting a plurality of voice recognition service providers and corresponding application scenes on a cloud platform;
s2, the intelligent voice equipment automatically uploads the voice file to a corresponding voice recognition service provider according to the selected application scene;
and S3, the voice recognition service provider analyzes the voice file and feeds back the analysis result to the intelligent voice equipment.
Therefore, a plurality of voice recognition service providers and application scenes corresponding to the voice recognition service providers are preset on the cloud platform, so that after the application scenes are selected, the intelligent voice equipment uploads the voice files to the corresponding voice recognition providers, and the corresponding voice recognition providers analyze the voice files and feed back analysis results to the intelligent voice equipment.
Meanwhile, as the industry and the field where each family is adept are different, the user can select different industry voice recognition providers according to needs, for example, a smart home scene selects a science news communication voice recognition provider, a smart office scene selects an civic voice recognition provider, a smart medical scene selects a cloud known voice recognition provider, and the like.
The S1 specifically includes:
the method comprises the steps that service addresses of a plurality of voice recognition service providers and application scenes corresponding to the voice recognition service providers are preset on a cloud platform.
Therefore, the service addresses of a plurality of voice recognition service providers and the corresponding application scenes are preset on the platform, so that the intelligent voice equipment can upload the voice files under the specific application scenes to the corresponding voice recognition service providers.
The service address of the speech recognition service provider can be modified, deleted or added.
Therefore, the voice recognition service providers and the corresponding application scenes can be modified, deleted or added according to the needs.
The application scenes corresponding to the voice recognition service providers are all good application scenes.
Therefore, each voice recognition service provider can process the best application scene, and the voice analysis accuracy is improved.
The S2 specifically includes:
and the user freely selects the application scene, and the intelligent voice equipment automatically uploads the voice file to the corresponding voice recognition service provider according to the application scene selected by the user.
Therefore, the user can freely select the required application scene in advance to adapt to the self requirement, and then the intelligent voice equipment in the application scene can automatically upload the received voice file to the corresponding voice recognition service provider.
The S3 specifically includes:
and the corresponding voice recognition service provider receives the voice file, analyzes the voice file by combining the application scene of the voice file, and feeds back the analysis result to the intelligent voice equipment.
Therefore, after receiving the voice file uploaded by the intelligent voice equipment, the corresponding voice recognition service provider can analyze the voice file by combining the selected application scene and feed back the analysis result to the intelligent voice equipment, so that the voice analysis accuracy is improved.
The application scenario includes one of a smart home scenario, a smart office scenario, and a smart medical scenario.
Therefore, the user can select various application scenes such as a smart home scene, a smart office scene or a smart medical scene according to the self requirement, and the application scenes can be improved according to time lapse and science and technology, so that the adaptability is modified or added or deleted.
Further comprising:
the intelligent voice equipment is main intelligent voice equipment, and the main intelligent voice equipment controls other auxiliary intelligent voice equipment according to a feedback result.
In this way, since the smart speech device is set as the master smart speech device, the master smart speech device controls the other slave smart speech devices according to the analysis result fed back by the speech recognition service provider.
And voice execution modules are arranged in the master intelligent voice device and the slave intelligent voice devices so as to be used for unifying the standard parameters.
Like this, set up the pronunciation execution module of unified standard parameter in main intelligent speech equipment and from intelligent speech equipment, unified hardware in standardizing intelligent speech equipment to be convenient for unified management main intelligent speech equipment and follow intelligent speech equipment.
The voice execution module is a microphone and a loudspeaker with a plurality of same parameter standards, and the parameters comprise sampling frequency and volume.
Therefore, the microphones and the loudspeakers with the same parameter standards are arranged in the intelligent voice devices, and the sampling frequency and the volume are unified and standardized, so that the master intelligent voice device can conveniently and uniformly control the other slave intelligent voice devices according to the same hardware parameter standards.
According to the recognition control method based on the intelligent voice equipment, disclosed by the invention, through presetting a plurality of voice recognition service providers and application scenes corresponding to the voice recognition service providers, each voice recognition service provider can process the good application scene, and analyze and feed back voice files sent by the intelligent voice equipment in the application scenes, so that the accuracy of semantic analysis is integrally improved, and the intelligent voice equipment is better controlled.
The above description is only for the preferred embodiment of the present invention, but the scope of the present invention is not limited thereto, and any changes or substitutions that can be easily conceived by those skilled in the art within the technical scope of the present invention are included in the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (10)

1. A recognition control method based on intelligent voice equipment is characterized by comprising the following steps:
s1, presetting a plurality of voice recognition service providers and corresponding application scenes on a cloud platform;
s2, the intelligent voice equipment automatically uploads the voice file to a corresponding voice recognition service provider according to the selected application scene;
and S3, the corresponding voice recognition service provider analyzes the voice file and feeds back the analysis result to the intelligent voice equipment.
2. The recognition control method based on the intelligent voice device according to claim 1, wherein the S1 specifically is:
the method comprises the steps that service addresses of a plurality of voice recognition service providers and application scenes corresponding to the voice recognition service providers are preset on a cloud platform.
3. The intelligent voice device-based recognition control method of claim 2, wherein the service address of the voice recognition service provider can be modified, deleted or added.
4. The intelligent voice device-based recognition control method according to claim 3, wherein the application scenarios corresponding to the voice recognition facilitator are all application scenarios that are good at it.
5. The recognition control method based on the intelligent voice device according to claim 4, wherein the S2 specifically is:
and the user freely selects the application scene, and the intelligent voice equipment automatically uploads the voice file to the corresponding voice recognition service provider according to the application scene selected by the user.
6. The recognition control method based on the intelligent voice device according to claim 5, wherein the S3 specifically is:
and the corresponding voice recognition service provider receives the voice file, analyzes the voice file by combining the application scene of the voice file, and feeds back the analysis result to the intelligent voice equipment.
7. The intelligent voice device-based recognition control method of any one of claims 1-6, wherein the application scenario includes one of a smart home scenario, a smart office scenario, and a smart medical scenario.
8. The recognition control method based on intelligent voice equipment according to claim 7, further comprising:
the intelligent voice equipment is main intelligent voice equipment, and the main intelligent voice equipment controls other auxiliary intelligent voice equipment according to a feedback result.
9. The recognition control method based on intelligent voice equipment as claimed in claim 8, wherein voice execution modules are arranged in the master intelligent voice equipment and the slave intelligent voice equipment so as to unify the specification parameters.
10. The intelligent voice device-based recognition control method of claim 9, wherein the voice execution module comprises a plurality of microphones and loudspeakers with the same parameter standard, and the parameters comprise sampling frequency and volume.
CN202010444071.6A 2020-05-22 2020-05-22 Recognition control method based on intelligent voice equipment Pending CN111724791A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010444071.6A CN111724791A (en) 2020-05-22 2020-05-22 Recognition control method based on intelligent voice equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010444071.6A CN111724791A (en) 2020-05-22 2020-05-22 Recognition control method based on intelligent voice equipment

Publications (1)

Publication Number Publication Date
CN111724791A true CN111724791A (en) 2020-09-29

Family

ID=72564978

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010444071.6A Pending CN111724791A (en) 2020-05-22 2020-05-22 Recognition control method based on intelligent voice equipment

Country Status (1)

Country Link
CN (1) CN111724791A (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104199810A (en) * 2014-08-29 2014-12-10 科大讯飞股份有限公司 Intelligent service method and system based on natural language interaction
CN108415683A (en) * 2018-03-07 2018-08-17 深圳车盒子科技有限公司 More scene voice householder methods, intelligent voice system, equipment and storage medium
CN109087639A (en) * 2018-08-02 2018-12-25 泰康保险集团股份有限公司 Method for voice recognition, device, electronic equipment and computer-readable medium
CN109509473A (en) * 2019-01-28 2019-03-22 维沃移动通信有限公司 Sound control method and terminal device
CN109859761A (en) * 2019-02-22 2019-06-07 安徽卓上智能科技有限公司 A kind of intelligent sound interaction control method
CN111049996A (en) * 2019-12-26 2020-04-21 苏州思必驰信息科技有限公司 Multi-scene voice recognition method and device and intelligent customer service system applying same

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104199810A (en) * 2014-08-29 2014-12-10 科大讯飞股份有限公司 Intelligent service method and system based on natural language interaction
CN108415683A (en) * 2018-03-07 2018-08-17 深圳车盒子科技有限公司 More scene voice householder methods, intelligent voice system, equipment and storage medium
CN109087639A (en) * 2018-08-02 2018-12-25 泰康保险集团股份有限公司 Method for voice recognition, device, electronic equipment and computer-readable medium
CN109509473A (en) * 2019-01-28 2019-03-22 维沃移动通信有限公司 Sound control method and terminal device
CN109859761A (en) * 2019-02-22 2019-06-07 安徽卓上智能科技有限公司 A kind of intelligent sound interaction control method
CN111049996A (en) * 2019-12-26 2020-04-21 苏州思必驰信息科技有限公司 Multi-scene voice recognition method and device and intelligent customer service system applying same

Similar Documents

Publication Publication Date Title
CN107452386B (en) Voice data processing method and system
CN105913847B (en) Voice control system, user end equipment, server and central control unit
CN111800443B (en) Data processing system and method, device and electronic equipment
US20140372109A1 (en) Smart volume control of device audio output based on received audio input
CN108470568B (en) Intelligent device control method and device, storage medium and electronic device
US10495336B2 (en) Energy operations across domains
CN109285555A (en) A kind of change of voice method, device and mobile terminal
CN111312253A (en) Voice control method, cloud server and terminal equipment
US10048713B2 (en) Energy operations across domains
WO2020119437A1 (en) Voice control method, cloud server and terminal device
EP3111738A1 (en) Method for controlling operation of an agricultural machine and system
CN106302997A (en) A kind of output control method, electronic equipment and system
CN110531632B (en) Control method and system
CN112035086A (en) Audio playing method and device
CN114257658A (en) Communication protocol conversion configuration method, communication protocol conversion method and related equipment
CN113921004A (en) Intelligent device control method and device, storage medium and electronic device
CN113053369A (en) Voice control method and device of intelligent household appliance and intelligent household appliance
CN111724791A (en) Recognition control method based on intelligent voice equipment
CN109545209A (en) Operation executes method, apparatus and storage medium
CN112004154A (en) Recording method and system of intelligent terminal and intelligent terminal
CN105825854A (en) Voice signal processing method, device, and mobile terminal
CN109582114A (en) A kind of mobile terminal and its start-up control method
CN111128177B (en) Dynamic loading system and method for voice control command words
CN102325283B (en) Earphone, user equipment and audio data output method
CN104376846A (en) Voice adjusting method and device and electronic devices

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20200929