CN103500578B

CN103500578B - Speech control method and apparatus

Info

Publication number: CN103500578B
Application number: CN201310500587.8A
Authority: CN
Inventors: 张毅军; 徐征
Original assignee: Shanghai Cloud Vision Technology Co Ltd
Current assignee: Shanghai Cloud Vision Technology Co Ltd
Priority date: 2013-10-22
Filing date: 2013-10-22
Publication date: 2016-05-11
Anticipated expiration: 2033-10-22
Also published as: CN103500578A

Abstract

The invention discloses a speech control method and apparatus that make the various applications of the mobile Internet and of triple-play (three-network convergence) services easier to operate. The technical scheme is as follows: speech is sampled and transmitted to a set-top box; the set-top box recognizes the sampled speech and converts it into text; the set-top box performs semantic recognition on the converted text and decomposes it into three parts: a control command, an application name, and a parameter; based on the decomposed application name, the set-top box queries whether the application is in the activated state, controls the state of the application according to the control command, and passes the parameter to the application for operation.

Description

Speech control method and apparatus

Technical field

The present invention relates to speech control technology, and in particular to a speech control method and apparatus for the various applications of the mobile Internet and of triple-play (three-network convergence) services.

Background technology

The development of mobile Internet technology and triple-play technology has produced an explosion of services and content. Once these services and content reach the end user, making them convenient to control is a major difficulty. Control methods such as remote controls, mice, finger swipes, and gravity sensing keep emerging, but the most natural method remains voice control. As speech recognition technology continues to mature, voice-based control is appearing on more and more kinds of terminals.

How to control a set-top box effectively by voice is a problem in urgent need of a solution.

Summary of the invention

The object of the invention is to solve the above problem by providing a speech control method and apparatus that make the various applications of the mobile Internet and of triple-play services easier to operate.

The technical scheme of the invention is as follows. The invention discloses a speech control method, comprising:

sampling speech and transmitting it to a set-top box;

the set-top box recognizing the sampled speech and converting it into text;

the set-top box performing semantic recognition on the converted text and decomposing the text into three parts: a control command, an application name, and a parameter;

the set-top box querying, based on the decomposed application name, whether the application is in the activated state, controlling the state of the application according to the control command, and passing the parameter to the application for operation.

According to an embodiment of the speech control method of the invention, speech recognition and semantic recognition are implemented by the speech recognition layer of the set-top box.

According to an embodiment of the speech control method of the invention, the application is operated by the application manager of the set-top box calling the corresponding application, and the application manager is called by the voice control layer of the set-top box.

According to an embodiment of the speech control method of the invention, the application states controlled by the control command comprise open/activate, close, deactivate, upgrade, and uninstall.

The invention also discloses a speech control device comprising a client device and a set-top box, wherein:

the client device comprises a sampling module that samples speech and transmits it to the set-top box;

the set-top box comprises a speech recognition module, a semantic recognition module, and an application invocation module, wherein:

the speech recognition module recognizes the sampled speech and converts it into text;

the semantic recognition module performs semantic recognition on the converted text and decomposes the text into three parts: a control command, an application name, and a parameter;

the application invocation module queries, based on the decomposed application name, whether the application is in the activated state, controls the state of the application according to the control command, and passes the parameter to the application for operation.

According to an embodiment of the speech control device of the invention, the application states controlled by the control command comprise open/activate, close, deactivate, upgrade, and uninstall.

Compared with the prior art, the invention has the following beneficial effects: the scheme performs semantic recognition after speech recognition, decomposes the voice command into a control command, an application name, and a parameter, and then calls the corresponding application to process it. Compared with conventional technology, the invention can be extended to any digital terminal that accepts speech control, including but not limited to set-top boxes, OTT devices, tablets (PADs), in-vehicle GPS units, and other user terminals that accept voice input.

Brief description of the drawings

Fig. 1 shows the flow chart of the preferred embodiment of speech control method of the present invention.

Fig. 2 shows the schematic diagram of the preferred embodiment of speech control device of the present invention.

Detailed description of the invention

The invention is further described below with reference to the drawings and embodiments.

Fig. 1 shows the flow of the preferred embodiment of the speech control method of the invention. Referring to Fig. 1, the steps of the speech control method of this embodiment are detailed below.

Step S10: speech is sampled and transmitted to the set-top box.

Step S12: the speech recognition layer of the set-top box recognizes the sampled speech and converts it into text.

Step S14: the speech recognition layer of the set-top box performs semantic recognition on the converted text and decomposes the text into three parts: a control command, an application name, and a parameter.

For example, in "turn on TV to CCTV-1", "turn on" is the control command, "TV" is the TV application, and "CCTV-1" is the parameter passed to the TV application. So that the number of applications can grow dynamically, this process needs to consult the application registration database.
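The decomposition in step S14 can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration: the in-memory `APP_REGISTRY` stands in for the application registration database, `CONTROL_PHRASES` is a hypothetical phrase table, and the English word "to" is assumed to separate the application name from the parameter. The patent does not specify any of these details.

```python
from typing import NamedTuple, Optional

# Hypothetical stand-in for the application registration database.
APP_REGISTRY = {"tv": "TV application", "vod": "video-on-demand application"}

# Hypothetical table mapping spoken phrases to control commands.
CONTROL_PHRASES = {"turn on": "open", "open": "open", "close": "close"}


class VoiceCommand(NamedTuple):
    control: str
    app_name: Optional[str]
    parameter: Optional[str]


def parse_command(text: str) -> Optional[VoiceCommand]:
    """Split recognized text into control command, application name, parameter."""
    text = text.strip()
    lowered = text.lower()
    # Try longer phrases first so that "turn on" is matched as a whole.
    for phrase in sorted(CONTROL_PHRASES, key=len, reverse=True):
        if lowered.startswith(phrase + " "):
            rest = text[len(phrase):].strip()
            break
    else:
        return None  # no known control phrase at the start of the text
    # Assumed convention: "<application> to <parameter>".
    app_part, _, param = rest.partition(" to ")
    app_key = app_part.strip().lower()
    # Only names found in the registry count as application names.
    app_name = app_key if app_key in APP_REGISTRY else None
    return VoiceCommand(CONTROL_PHRASES[phrase], app_name, param.strip() or None)


print(parse_command("Turn on TV to CCTV-1"))
```

Because the application name is looked up in the registry rather than hard-coded, registering a new application only requires a new registry entry, which is one way to read the requirement that the number of applications can grow dynamically.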

For an application, the control states are as follows. Open/activate: an application that is not running is loaded into memory and enters the activated state once opened, or an application in the deactivated (suspended) state is switched back to being the current application. Close: the specified application is moved to the closed state and its resources are released. Deactivate: the application changes from the current running state to a non-current state but remains in memory. Upgrade: the application is updated. Uninstall: the application is deleted. Suspend and background also exist, but ordinary users essentially never use these two, so only the first five states need to be considered.
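The five states considered above can be sketched as a small transition function. This is an illustration only: the enum names are hypothetical, and two behaviors are assumptions not stated in the text, namely that an upgrade leaves the run state unchanged and that deactivating an application that is not active has no effect.

```python
from enum import Enum


class AppState(Enum):
    CLOSED = "closed"            # not running, resources released
    ACTIVE = "active"            # current running application
    INACTIVE = "inactive"        # not current, but still in memory
    UNINSTALLED = "uninstalled"  # deleted from the device


class Control(Enum):
    OPEN = "open/activate"
    CLOSE = "close"
    DEACTIVATE = "deactivate"
    UPGRADE = "upgrade"
    UNINSTALL = "uninstall"


def apply_control(state: AppState, control: Control) -> AppState:
    """Return the application state after a control command is applied."""
    if state is AppState.UNINSTALLED:
        raise ValueError("an uninstalled application cannot be controlled")
    if control is Control.OPEN:
        # Both a closed and an inactive application become the current one.
        return AppState.ACTIVE
    if control is Control.CLOSE:
        return AppState.CLOSED
    if control is Control.DEACTIVATE:
        # Assumption: only an active application can be deactivated.
        return AppState.INACTIVE if state is AppState.ACTIVE else state
    if control is Control.UPGRADE:
        # Assumption: upgrading does not change the run state.
        return state
    return AppState.UNINSTALLED  # Control.UNINSTALL


print(apply_control(AppState.INACTIVE, Control.OPEN))
```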

From the user's point of view, typical control commands include: "watch" TV, "play" a game, "check" the weather, "look up" information, "make" a phone call, "send" a text message, "enter" a name, and "save" a contact list. These control words must be handled during speech recognition and during the conversion into "control", "application", and "parameter". Essentially all of them mean "open/activate".
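One simple way to picture this normalization is a lookup table from everyday verbs to the "open/activate" control command. The table below is a hypothetical sketch using the verbs listed above; a real system would presumably rely on the semantic recognition step rather than exact string matches.

```python
from typing import Optional

OPEN_ACTIVATE = "open/activate"

# Hypothetical verb table built from the examples in the text.
VERB_TO_CONTROL = {
    "watch": OPEN_ACTIVATE,
    "play": OPEN_ACTIVATE,
    "check": OPEN_ACTIVATE,
    "look up": OPEN_ACTIVATE,
    "make": OPEN_ACTIVATE,
    "send": OPEN_ACTIVATE,
    "enter": OPEN_ACTIVATE,
    "save": OPEN_ACTIVATE,
}


def control_for(verb: str) -> Optional[str]:
    """Map a spoken verb to a control command, or None if it is unknown."""
    return VERB_TO_CONTROL.get(verb.strip().lower())


print(control_for("Watch"))
```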

The application name is the second part of the recognized speech. For example, in "watch TV on CCTV1", "TV" is the application name. The application name may be omitted from a voice control command: the user may say "watch CCTV1" instead of "watch TV on CCTV1". In that case the application name must be inferred by fuzzy matching on the control parameter. When no application name can be found, the browser's search engine can be brought up and the command handled directly as an information query, as with a command such as "What is the capital of Switzerland?".
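The fallback logic described above can be sketched as follows. The `KNOWN_PARAMETERS` table, the application identifiers, and the `browser_search` fallback name are all hypothetical, and the exact-match lookup is a simplification of whatever fuzzy matching an implementation would actually use.

```python
from typing import Optional

# Hypothetical table of parameters each registered application is known to accept.
KNOWN_PARAMETERS = {
    "tv": {"cctv1", "cctv-1"},
    "messaging": {"zhang san"},
}

# Hypothetical identifier for the browser search fallback.
FALLBACK_APP = "browser_search"


def resolve_app(app_name: Optional[str], parameter: str) -> str:
    """Pick the target application, inferring it from the parameter if needed."""
    if app_name and app_name.lower() in KNOWN_PARAMETERS:
        return app_name.lower()
    key = parameter.strip().lower()
    # Simplified matching: find an application that recognizes the parameter.
    for app, params in KNOWN_PARAMETERS.items():
        if key in params:
            return app
    # No application found: treat the utterance as a search query.
    return FALLBACK_APP


print(resolve_app(None, "CCTV1"))
```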

For some applications the parameter passed in is a character string, and the specific application processes that string. For example, "CCTV1" in "watch TV on CCTV1" is an application parameter, and in "send a WeChat message to Zhang San", "Zhang San" is the parameter. Another kind of parameter applies to the current application. For example, when the current application is VOD and the user says "find The Lord of the Rings 3", the application itself should enter "The Lord of the Rings 3" in its film search and then start the search.

Step S16: the set-top box queries, based on the decomposed application name, whether the application is in the activated state, controls the state of the application according to the control command, and passes the parameter to the application for operation.

After the voice control layer of the set-top box obtains the control command, application name, and parameter from the speech recognition layer, it calls the application manager. The application manager controls the state of the application according to whether the application is in the activated state, and passes the parameter to the application for operation. How the parameter is processed, and how the result is presented, is left to each application.
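A minimal sketch of step S16 follows, under stated assumptions. The `App` and `ApplicationManager` classes and their methods are hypothetical names, only the "open" and "close" commands are modeled, and the rule that opening one application deactivates the previously current one is an assumption drawn from the description of the deactivated state. A command without an application name is routed to the current application, as in the VOD example.

```python
from typing import Dict, Iterable, List, Optional


class App:
    """Hypothetical application that records the parameters it receives."""

    def __init__(self, name: str) -> None:
        self.name = name
        self.active = False
        self.received: List[str] = []

    def handle(self, parameter: str) -> None:
        # Each application decides for itself how to process the parameter.
        self.received.append(parameter)


class ApplicationManager:
    def __init__(self, apps: Iterable[App]) -> None:
        self.apps: Dict[str, App] = {app.name: app for app in apps}
        self.current: Optional[App] = None

    def dispatch(self, control: str, app_name: Optional[str],
                 parameter: Optional[str]) -> App:
        # Without an application name, the command targets the current app.
        app = self.apps[app_name] if app_name else self.current
        if app is None:
            raise LookupError("no target application")
        if control == "open":
            # Assumption: the previously current app is deactivated.
            if self.current is not None and self.current is not app:
                self.current.active = False
            app.active = True  # activate only changes state if needed
            self.current = app
            if parameter is not None:
                app.handle(parameter)
        elif control == "close":
            app.active = False
            if self.current is app:
                self.current = None
        else:
            raise ValueError(f"unsupported control command: {control}")
        return app


manager = ApplicationManager([App("tv"), App("vod")])
manager.dispatch("open", "tv", "CCTV-1")
print(manager.apps["tv"].received)
```

Keeping the parameter opaque to the manager matches the text: the manager only controls state and forwards the string, and the application interprets it.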

Fig. 2 shows the principle of the preferred embodiment of the speech control device of the invention. Referring to Fig. 2, the speech control device of this embodiment comprises a client device 1 and a set-top box 2.

The client device 1 comprises a sampling module 10, which samples speech and transmits it to the set-top box 2. The set-top box 2 comprises a speech recognition module 21, a semantic recognition module 22, and an application invocation module 23. The speech recognition module 21 recognizes the sampled speech and converts it into text. The semantic recognition module 22 performs semantic recognition on the converted text and decomposes the text into three parts: a control command, an application name, and a parameter, wherein the application states controlled by the control command comprise open/activate, close, deactivate, upgrade, and uninstall. The application invocation module 23 queries, based on the decomposed application name, whether the application is in the activated state, controls the state of the application according to the control command, and passes the parameter to the application for operation.

The above embodiments are provided to enable those of ordinary skill in the art to make and use the invention. Those of ordinary skill in the art may make various modifications or changes to the above embodiments without departing from the inventive concept. The scope of protection of the invention is therefore not limited by the above embodiments, but should be the widest scope consistent with the inventive features set out in the claims.

Claims

1. A speech control method, comprising:

sampling speech and transmitting it to a set-top box;

the set-top box recognizing the sampled speech and converting it into text;

the set-top box performing semantic recognition on the converted text and decomposing the text into three parts: a control command, an application name, and a parameter, wherein the speech recognition and the semantic recognition are implemented by the speech recognition layer of the set-top box;

the set-top box querying, based on the decomposed application name, whether the application is in the activated state, controlling the state of the application according to the control command, and passing the parameter to the application for operation, wherein the application is operated by the application manager of the set-top box calling the corresponding application, and the application manager is called by the voice control layer of the set-top box.

2. The speech control method according to claim 1, characterized in that the application states controlled by the control command comprise open/activate, close, deactivate, upgrade, and uninstall.

3. A speech control device, comprising a client device and a set-top box, wherein:

the semantic recognition module performs semantic recognition on the converted text and decomposes the text into three parts: a control command, an application name, and a parameter, wherein the speech recognition module and the semantic recognition module are implemented by the speech recognition layer of the set-top box;

the application invocation module queries, based on the decomposed application name, whether the application is in the activated state, controls the state of the application according to the control command, and passes the parameter to the application for operation, wherein the application is operated by the application manager of the set-top box calling the corresponding application, and the application manager is called by the voice control layer of the set-top box.

4. The speech control device according to claim 3, characterized in that the application states controlled by the control command comprise open/activate, close, deactivate, upgrade, and uninstall.