CN105096936A - Push-to-talk service control method and apparatus - Google Patents

Push-to-talk service control method and apparatus

Info

Publication number
CN105096936A
Authority
CN
China
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201410205777.1A
Other languages
Chinese (zh)
Inventor
秦瑞伦
杨晨
张斐聪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
HARBIN HAINENGDA TECHNOLOGY Co Ltd
Original Assignee
HARBIN HAINENGDA TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by HARBIN HAINENGDA TECHNOLOGY Co Ltd filed Critical HARBIN HAINENGDA TECHNOLOGY Co Ltd
Priority to CN201410205777.1A priority Critical patent/CN105096936A/en
Publication of CN105096936A publication Critical patent/CN105096936A/en
Pending legal-status Critical Current

Landscapes

  • Telephonic Communication Services (AREA)

Abstract

The embodiments of the invention disclose a push-to-talk service control method and apparatus. The method includes: receiving speech and analyzing the phonetic characteristic of the received speech; searching pre-saved phonetic characteristics for the analyzed phonetic characteristic, where the pre-saved phonetic characteristics are phonetic characteristics, collected and saved in advance, of users allowed by the equipment; and, if the analyzed phonetic characteristic is found, triggering the push-to-talk service. Because the technical scheme uses the users' phonetic characteristics to trigger the push-to-talk service, it avoids erroneous operation of the service caused by loud external noise or by the voices of other users.

Description

A push-to-talk service control method and device
Technical field
The present invention relates to the field of mobile communication technology, and in particular to a push-to-talk service control method and device.
Background
A push-to-talk (PTT) service is a service in which a call can be made by pressing a single button. To meet the demand for immediate communication, both public and private networks provide a variety of products and application scenarios that support PTT service, for example smartphones, intercoms, and dispatching desks.
To use PTT service, a user must first trigger it in some way. Two PTT service control methods exist in the prior art. The first decides whether to trigger PTT service by detecting the volume of the sound. The second presets a control statement and then detects whether the received voice is that control statement; if it is, PTT service is triggered. In practice, places where crowds gather have strong ambient noise and many interfering sound sources, including the voices of nearby users, so both existing methods are prone to erroneous operation of PTT service.
Summary of the invention
To solve the above technical problem, the present invention provides a push-to-talk service control method and device that control the PTT service by recognizing the phonetic feature of the user. This approach avoids interference from ambient noise and from nearby users, so the PTT service is controlled more accurately.
In a first aspect, the present invention provides a push-to-talk service control method, comprising:
Receiving voice, and parsing the phonetic feature of the received voice;
Searching the pre-saved phonetic features for the parsed phonetic feature, where the pre-saved phonetic features are phonetic features, collected and saved in advance, of the users the device allows;
If the parsed phonetic feature is found, triggering the PTT service.
Preferably, before searching the pre-saved phonetic features for the parsed phonetic feature, the method further comprises:
Judging whether the volume of the received voice is greater than a preset threshold and, if so, searching the pre-saved phonetic features for the parsed phonetic feature.
Preferably, before searching the pre-saved phonetic features for the parsed phonetic feature, the method further comprises:
Identifying whether the received voice contains a preset control statement and, if so, searching the pre-saved phonetic features for the parsed phonetic feature.
Preferably, before searching the pre-saved phonetic features for the parsed phonetic feature, the method further comprises:
Judging whether the volume of the received voice is greater than a preset threshold; if it is, identifying whether the received voice contains a preset control statement; and, if it does, searching the pre-saved phonetic features for the parsed phonetic feature.
Preferably, the pre-saved phonetic features are phonetic features, collected and saved in advance, of the users the device allows, where each user's phonetic feature is saved in one-to-one correspondence with that user's user name. In this case, searching the pre-saved phonetic features for the parsed phonetic feature and, if it is found, triggering the PTT service comprises:
Extracting, according to the user name selected by the user, the phonetic feature corresponding to that user name from the pre-saved phonetic features;
Comparing whether the parsed phonetic feature is identical to the extracted phonetic feature;
If they are identical, triggering the PTT service.
In a second aspect, the present invention further provides a push-to-talk service control device, comprising:
A receiving unit, configured to receive voice;
A resolution unit, configured to parse the phonetic feature of the received voice;
A search unit, configured to search the pre-saved phonetic features for the parsed phonetic feature and, when it is found, to start the trigger unit, where the pre-saved phonetic features are phonetic features, collected and saved in advance, of the users the device allows;
A trigger unit, configured to trigger the PTT service.
Preferably, the device further comprises:
A first judging unit, configured to judge whether the volume of the received voice is greater than a preset threshold and, if so, to trigger the search unit to perform the search operation.
Preferably, the device further comprises:
A first recognition unit, configured to identify whether the received voice contains a preset control statement and, if so, to trigger the search unit to perform the search operation.
Preferably, the device further comprises:
A second judging unit, configured to judge whether the volume of the received voice is greater than a preset threshold and, if it is, to trigger a second recognition unit to perform the identification operation;
The second recognition unit, configured to identify whether the received voice contains a preset control statement and, if it does, to trigger the search unit to perform the search operation.
Preferably, the pre-saved phonetic features are phonetic features, collected and saved in advance, of the users the device allows, where each user's phonetic feature is saved in one-to-one correspondence with that user's user name. In this case, the search unit comprises:
An extraction module, configured to extract, according to the user name selected by the user, the phonetic feature corresponding to that user name from the pre-saved phonetic features;
A comparison module, configured to compare whether the parsed phonetic feature is identical to the extracted phonetic feature and, if they are identical, to start the trigger unit.
As the above description shows, the beneficial effects of the present invention are as follows:
To reduce the rate of erroneous operation and control the PTT service more accurately, the push-to-talk service control method and device of the present invention first receive voice and parse the phonetic feature of the received voice; then search the pre-saved phonetic features for the parsed phonetic feature, where the pre-saved phonetic features are phonetic features of the users the device allows, collected in advance and saved in a phonetic feature library; and, if the parsed phonetic feature is found, trigger the PTT service. Because the control method and device of the present invention control PTT service on the basis of the user's phonetic feature, they avoid the erroneous operation caused by external interfering sound.
Brief description of the drawings
To illustrate the technical solutions of the embodiments of the present invention more clearly, the accompanying drawings used in the description of the embodiments are briefly introduced below. The drawings described below are only some embodiments of the present invention; a person of ordinary skill in the art can obtain other drawings from them without creative effort.
Fig. 1 is a flowchart of Embodiment 1 of the push-to-talk service control method according to the embodiments of the present invention;
Fig. 2 is a flowchart of Embodiment 2 of the push-to-talk service control method according to the embodiments of the present invention;
Fig. 3 is a flowchart of Embodiment 3 of the push-to-talk service control method according to the embodiments of the present invention;
Fig. 4 is a flowchart of Embodiment 4 of the push-to-talk service control method according to the embodiments of the present invention;
Fig. 5 is a flowchart of Embodiment 5 of the push-to-talk service control method according to the embodiments of the present invention;
Fig. 6 is a flowchart of Embodiment 6 of the push-to-talk service control method according to the embodiments of the present invention;
Fig. 7 is a structural diagram of Embodiment 1 of the push-to-talk service control device according to the embodiments of the present invention;
Fig. 8 is a structural diagram of Embodiment 2 of the push-to-talk service control device according to the embodiments of the present invention;
Fig. 9 is a structural diagram of Embodiment 3 of the push-to-talk service control device according to the embodiments of the present invention;
Fig. 10 is a structural diagram of Embodiment 4 of the push-to-talk service control device according to the embodiments of the present invention;
Fig. 11 is a structural diagram of Embodiment 5 of the push-to-talk service control device according to the embodiments of the present invention.
Detailed description of the embodiments
To help those skilled in the art better understand the solutions of the embodiments of the present invention, the embodiments are described in further detail below with reference to the drawings and specific implementations.
Refer to Fig. 1, a flowchart of Embodiment 1 of the push-to-talk service control method disclosed in the embodiments of the present invention. The method comprises:
Step 101: receive voice, and parse the phonetic feature of the received voice.
In practice, a user may use many different terminal devices, for example smartphones, intercoms, and dispatching desks. These terminals have a physical or software PTT button and can support PTT service. They can also all receive voice: whatever environment the user is in, the device receives it. This embodiment is explained below using the intercom only as an example.
When a user is busy at work and both hands are occupied, the user cannot free a hand to keep pressing the PTT button, and only voice control is available. If control relies solely on the volume of the voice, erroneous operation occurs easily: for example, someone near the user speaking too loudly, or loud ambient noise, can activate the PTT service on the user's device, so erroneous operation occurs frequently. If a control statement is used and it is simple, for example the preset control statement is "begin", PTT service is triggered whenever the user or someone nearby happens to say "begin" in conversation, which likewise causes frequent erroneous operation. If the control statement is complex, the user has difficulty remembering it and often cannot operate the device correctly; and once other people learn the control statement, erroneous operation again occurs easily.
To ensure that the user operates the device's PTT service correctly, this embodiment uses the user's phonetic feature to trigger PTT service. Every person's phonetic feature is different: even if two people speak at the same volume and say the same statement, their individual phonetic features remain distinct.
A phonetic feature can be represented by the main characteristic parameters of the voice. These parameters may include frequency, bandwidth, amplitude, fundamental frequency, average energy, average zero-crossing count or zero-crossing rate, formants, LP cepstral parameters, and critical-band cepstra. Phonetic feature analysis can be implemented with a number of different algorithms, for example linear prediction analysis (LPC).
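As a minimal sketch of what a few of these parameters look like in practice, the following Python computes average energy, zero-crossing rate, and peak amplitude for one frame of samples. The function names, the synthetic 200 Hz test tone, and the 8 kHz sample rate are illustrative assumptions and do not come from the patent, which does not prescribe any particular computation.

```python
import math


def average_energy(samples):
    """Mean of the squared sample values over the frame."""
    return sum(s * s for s in samples) / len(samples)


def zero_crossing_rate(samples):
    """Fraction of adjacent sample pairs whose signs differ.

    Assumes the frame holds at least two samples.
    """
    crossings = sum(
        1 for a, b in zip(samples, samples[1:]) if (a >= 0) != (b >= 0)
    )
    return crossings / (len(samples) - 1)


def peak_amplitude(samples):
    """Largest absolute sample value in the frame."""
    return max(abs(s) for s in samples)


# Hypothetical test frame: a 200 Hz sine sampled at 8 kHz for 40 ms.
frame = [math.sin(2 * math.pi * 200 * n / 8000) for n in range(320)]
feature_vector = (
    average_energy(frame),
    zero_crossing_rate(frame),
    peak_amplitude(frame),
)
```

A real device would likely combine many more parameters (for example formants or cepstra) per frame; the three shown here are only the simplest ones to compute by hand.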
Intercom users often gather in one place in daily use. When a user controls his or her own intercom by voice, the intercom is easily disturbed by other sounds, such as other users' voices or ambient noise. To reduce or eliminate this interference, the intercom can filter the sound after receiving the user's voice, extracting the user's voice as completely as possible from all the sound received. It then uses a phonetic feature analysis method to parse the phonetic feature of that voice. The parsed phonetic feature lays the technical foundation for the subsequent triggering of PTT service.
Step 102: search the pre-saved phonetic features for the parsed phonetic feature, where the pre-saved phonetic features are phonetic features, collected and saved in advance, of the users the device allows. If the parsed phonetic feature is found, perform step 103: trigger the PTT service.
To ensure that the user triggers the device's PTT service accurately, the user's phonetic feature must be saved in advance. If the device allows only one person to use it, only that user's phonetic feature needs to be collected and saved in advance. If the device allows several people to use it, the phonetic features of all the allowed users must be collected and saved in advance. From the device's side, when a user uses the device for the first time, the device can automatically extract and save the user's phonetic feature, so that the next time the user can trigger PTT service with it. From the user's side, anyone who wants to trigger PTT service by phonetic feature must first record his or her phonetic feature information into the device.
When the device receives a user's voice, it parses the phonetic feature and then searches the pre-saved phonetic features for it. If the parsed feature is found, the current user's voice control is valid and can be executed, so PTT service is triggered. If it is not found, the current user's voice control is invalid and cannot be executed, so the device ignores it, which avoids erroneous operation.
As the above embodiment shows, using the user's phonetic feature as the criterion for whether to trigger PTT service avoids erroneous operation caused by other users' voices or ambient noise, so the PTT service is controlled accurately and the user experience is improved.
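The search-and-trigger decision of steps 102 and 103 can be sketched as follows. The patent only says the parsed feature must "exist" among the pre-saved ones; representing features as numeric vectors and matching them by cosine similarity against a threshold are assumptions made here for illustration, as are all names and values.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def should_trigger_ptt(parsed_feature, saved_features, threshold=0.95):
    """Step 102: look for the parsed feature among the pre-saved ones.

    Returns True (step 103 should run) if any saved feature is similar
    enough. The 0.95 threshold is a hypothetical value.
    """
    return any(
        cosine_similarity(parsed_feature, saved) >= threshold
        for saved in saved_features
    )


# Hypothetical enrolled features for two allowed users.
saved = [[0.9, 0.1, 0.3], [0.2, 0.8, 0.5]]
```

With these values, a feature close to the first enrolled user, such as `[0.88, 0.12, 0.31]`, passes the check, while an unrelated one, such as `[0.1, 0.1, 0.9]`, is ignored.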
In practice, when several users gather together or external interfering sound is loud, the device does not know which sound comes from a user it allows, which comes from other users, and which is noise. Whenever it receives voice, it must first parse the phonetic feature and then search the pre-saved phonetic features for it, triggering PTT service if it is found. If the device's user stays silent and does not want to trigger PTT service while people nearby are loud, the device still receives those sounds and runs the whole series of operations, which seriously wastes device resources. To address this technical problem, the embodiments of the present invention provide the following three preferred solutions, Embodiments 2, 3, and 4, which are explained in turn below.
First, refer to Fig. 2, a flowchart of Embodiment 2 of the push-to-talk service control method according to the embodiments of the present invention. The method comprises:
Step 201: receive voice, and parse the phonetic feature of the received voice;
Step 202: judge whether the volume of the received voice is greater than a preset threshold; if so, perform step 203.
While the user is using the device, the environment contains many interfering sounds, such as ambient noise and other users' voices, all of which may be received by the device. However, the sources of these sounds are likely to be farther from the device, or the sounds quieter than the user's normal voice. This step therefore uses the volume of the received voice to make a preliminary judgment of whether the sound was produced by a user the device allows. If not, the subsequent search operation is skipped so that no device resources are wasted on meaningless work; if so, the subsequent search operation is carried out.
Alternatively, after receiving the voice, the device can first judge whether its volume is greater than the preset threshold. If so, it parses the phonetic feature of the received voice and then performs step 203; otherwise it performs neither the parse operation nor the search operation. This avoids parsing immediately after receiving the voice and only then judging the volume: if the judgment is negative, no search is needed and the parsed phonetic feature has no value. Because the parse operation is relatively resource-intensive, this ordering saves further device resources.
Step 203: search the pre-saved phonetic features for the parsed phonetic feature, where the pre-saved phonetic features are phonetic features, collected and saved in advance, of the users the device allows; if the parsed phonetic feature is found, perform step 204.
Step 204: trigger the PTT service.
As the above embodiment shows, the volume of the received voice determines whether the subsequent search operation is carried out. The search operation compares phonetic feature information, which is a relatively complex process that consumes considerable device resources, so judging the volume first avoids the resource waste caused by unnecessary search operations.
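The resource-saving variant of Embodiment 2, in which the volume check runs before any parsing, might look like the sketch below. Using root-mean-square amplitude as the "volume", the 0.1 threshold, and the `parse` and `match` callables are all assumptions; the patent leaves the volume measure and the matcher unspecified.

```python
import math


def rms_volume(samples):
    """Root-mean-square amplitude, used here as a stand-in for 'volume'."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def handle_voice(samples, saved_features, parse, match, volume_threshold=0.1):
    """Gate on volume first, so quiet input never reaches parse or search.

    parse(samples) -> feature and match(feature, saved) -> bool are
    hypothetical callables supplied by the caller.
    """
    if rms_volume(samples) <= volume_threshold:
        return False  # too quiet: skip the costly parse and search
    feature = parse(samples)  # parse only after the volume check passes
    return any(match(feature, saved) for saved in saved_features)
```

Passing the parser and matcher in as arguments keeps the gate independent of how features are actually extracted and compared.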
Next, refer to Fig. 3, a flowchart of Embodiment 3 of the push-to-talk service control method according to the embodiments of the present invention. The method comprises:
Step 301: receive voice, and parse the phonetic feature of the received voice;
Step 302: identify whether the received voice contains a preset control statement; if so, perform step 303.
During any given period a device belongs to only one user, so it should act only on the current user's instructions. Parsing and searching phonetic features are relatively complex and consume device resources, so the device should not perform a search for every voice it receives. To that end, before using the device the user can preset a control statement for controlling PTT service, for example "start PTT service", "begin PTT service", "begin", or "start". The control statement can be in any language the device can recognize and can take any form, such as a word, a sentence, or a letter.
With this arrangement, only the user knows the preset control statement. Other users do not know it, so their voices have no effect on the device. This step can therefore first judge whether the received voice is the trigger statement for PTT service; if it is not, the subsequent operations such as parsing and searching phonetic features are unnecessary, which avoids wasting device resources. In addition, even if another user learns the device's control statement and says it, the subsequent search operation still provides a more accurate judgment and so avoids erroneous operation.
Alternatively, after receiving the voice, the device can first identify whether it contains the preset control statement. If so, it parses the phonetic feature of the received voice and then performs step 303; otherwise it performs neither the parse operation nor the search operation. This avoids parsing immediately after receiving the voice and only then identifying the statement: if the identification is negative, no search is needed and the parsed phonetic feature has no value. Because the parse operation is relatively resource-intensive, this ordering saves further device resources.
Step 303: search the pre-saved phonetic features for the parsed phonetic feature, where the pre-saved phonetic features are phonetic features, collected and saved in advance, of the users the device allows; if the parsed phonetic feature is found, perform step 304.
Step 304: trigger the PTT service.
As the above embodiment shows, identifying whether the received voice contains the preset control statement determines whether the search operation is carried out. This both avoids the resource waste caused by unnecessary search operations and makes the triggering of PTT service more reliable.
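A rough sketch of the control-statement gate in Embodiment 3 follows. It assumes that some speech recognizer, not shown, has already produced a punctuation-free text transcript, and that exact membership in a set stands in for a real phonetic feature matcher. The statements and names are hypothetical.

```python
def contains_control_statement(transcript, statements):
    """True if any statement occurs in the transcript as whole words."""
    padded = " " + " ".join(transcript.lower().split()) + " "
    return any(" " + s.lower() + " " in padded for s in statements)


def handle_voice(transcript, parse_feature, saved_features,
                 statements=("begin", "start ptt service")):
    """Check the control statement first, then parse and search.

    parse_feature is a zero-argument callable so that parsing is deferred
    until the statement check has passed.
    """
    if not contains_control_statement(transcript, statements):
        return False  # no control statement: skip parse and search
    return parse_feature() in saved_features
```

Matching on whole words keeps an utterance such as "beginning now" from being mistaken for the statement "begin". Even when the statement is spoken, the feature search still rejects a speaker who is not enrolled.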
Finally, refer to Fig. 4, a flowchart of Embodiment 4 of the push-to-talk service control method according to the embodiments of the present invention. The method comprises:
Step 401: receive voice, and parse the phonetic feature of the received voice;
Step 402: judge whether the volume of the received voice is greater than a preset threshold; if it is, perform step 403;
Step 403: identify whether the received voice contains a preset control statement; if it does, perform step 404;
Alternatively, after receiving the voice, the device can first judge whether its volume is greater than the preset threshold; if it is, identify whether the received voice contains the preset control statement; and, if it does, parse the phonetic feature of the received voice and then perform step 404. Otherwise it performs neither the parse operation nor the search operation.
Step 404: search the pre-saved phonetic features for the parsed phonetic feature, where the pre-saved phonetic features are phonetic features, collected and saved in advance, of the users the device allows; if the parsed phonetic feature is found, perform step 405.
Step 405: trigger the PTT service.
As the three preferred solutions above show, judging whether the volume of the received voice is greater than the preset threshold and identifying whether the received voice contains the preset control statement both make the subsequent search operation more reliable and keep unnecessary search operations from wasting device resources.
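The ordering of checks in Embodiment 4 can be sketched as a chain of gates that stops at the first failure. The dictionary layout of `voice`, the threshold, the control statement, and the set-membership matcher are illustrative assumptions; in this sketch the feature is taken as already parsed, as in step 401.

```python
def first_failed_gate(voice, gates):
    """Run (name, predicate) gates in order; return the first failing name.

    Returns None when every gate passes.
    """
    for name, passes in gates:
        if not passes(voice):
            return name
    return None


# Hypothetical values; the patent fixes neither the threshold nor the statement.
VOLUME_THRESHOLD = 0.1
CONTROL_STATEMENT = "begin"
SAVED_FEATURES = {"alice-feature"}

GATES = [
    # Step 402: volume must exceed the preset threshold.
    ("volume", lambda v: v["volume"] > VOLUME_THRESHOLD),
    # Step 403: the transcript must contain the preset control statement.
    ("statement", lambda v: CONTROL_STATEMENT in v["transcript"].lower().split()),
    # Step 404: the parsed feature must be among the pre-saved ones.
    ("feature", lambda v: v["feature"] in SAVED_FEATURES),
]


def should_trigger_ptt(voice):
    """Step 405 runs only if no gate failed."""
    return first_failed_gate(voice, GATES) is None
```

Returning the name of the failed gate rather than a bare boolean makes it easy to see which check rejected a given input when tuning thresholds.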
In practice, the most common way to trigger PTT service is still to press the physical PTT button or tap the software PTT button. To remain compatible with button pressing on existing devices, the embodiments of the present invention provide a preferred solution: on the basis of any of the above embodiments, the device additionally monitors the press signal of the PTT button and triggers the PTT service when that signal is detected. The device can thus trigger PTT service by phonetic feature and also by button. This preferred solution is explained below on the basis of Embodiment 1 only.
Refer to Fig. 5, a flowchart of Embodiment 5 of the push-to-talk service control method according to the embodiments of the present invention. The method may comprise:
Step 501: receive voice, and parse the phonetic feature of the received voice;
Step 502: search the pre-saved phonetic features for the parsed phonetic feature, where the pre-saved phonetic features are phonetic features, collected and saved in advance, of the users the device allows; if the parsed phonetic feature is found, perform step 504.
Step 503: monitor the press signal of the PTT button; when the press signal is detected, perform step 504.
Step 504: trigger the PTT service.
Note that step 503 has no ordering relationship with steps 501 and 502. The two paths run in parallel, and step 504 is performed as soon as either one meets its condition.
In practice, the control logic can also be set so that triggering by the physical or software PTT button is primary and triggering by phonetic feature is auxiliary, or so that triggering by phonetic feature is primary and triggering by the physical or software PTT button is auxiliary.
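The two independent trigger paths of Embodiment 5 can be sketched as a small controller with one handler per path. The class and method names are hypothetical, and the sketch does not model the primary/auxiliary priority mentioned above, which the patent does not detail.

```python
class PttController:
    """Tracks whether PTT is active and which path triggered it."""

    def __init__(self):
        self.active = False
        self.trigger_sources = []

    def _trigger(self, source):
        # Step 504: trigger the PTT service, recording where it came from.
        self.trigger_sources.append(source)
        self.active = True

    def on_button_pressed(self):
        # Step 503: a press signal triggers PTT regardless of any voice input.
        self._trigger("button")

    def on_voice(self, feature_matched):
        # Steps 501-502: voice triggers PTT only if the feature was found.
        if feature_matched:
            self._trigger("voice")
```

Because neither handler consults the other, either path alone is enough to start PTT, which mirrors the lack of ordering between step 503 and steps 501 and 502.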
In practice, controlling PTT service well also requires considering how to close it. On the basis of the above embodiments, the present invention therefore also provides the following implementations:
Close PTT service when the release signal of the physical or software PTT button is detected. Or close PTT service when the interruption in the received voice lasts longer than a preset time: for example, while the device is continuously receiving the user's voice, it closes PTT service as soon as the time without voice exceeds the preset value. Or preset a closing control statement for closing PTT service, for example "over", "stop the call", "byebye", or "close PTT service", and close PTT service when the received voice is the preset closing control statement.
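The three closing conditions can be sketched as a single check that reports which condition fired. The 3-second silence limit, the parameter names, and the assumption that a punctuation-free text transcript is available are all illustrative; the patent specifies only that the preset time and closing statements are configurable.

```python
def should_close_ptt(button_released, silence_seconds, transcript,
                     silence_limit=3.0, close_statements=("over", "byebye")):
    """Return the reason PTT should close, or None to keep it open."""
    if button_released:
        return "button_released"
    if silence_seconds > silence_limit:
        return "silence_timeout"
    # Match closing statements as whole words in the normalized transcript.
    padded = " " + " ".join(transcript.lower().split()) + " "
    if any(" " + s + " " in padded for s in close_statements):
        return "close_statement"
    return None
```

For example, `should_close_ptt(False, 0.5, "roger over")` reports `"close_statement"`, while a transcript such as "moreover we continue" does not close the session because "over" is not present as a separate word.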
Some devices do not belong to one specific user. For example, when a team uses intercoms, everyone is issued one, but which user gets which intercom is not fixed. To implement the push-to-talk service control method of the present invention, the phonetic features of all the users a device allows must therefore be saved in the device in advance. When the device allows many users, the volume of pre-saved phonetic feature data is large, and each time a phonetic feature is received, all saved features must be traversed to guarantee a reliable search. The search operation then consumes considerable resources and takes longer, which in serious cases causes delay and degrades the user experience. In addition, when the users a device allows gather together, the voices of the other users can cause erroneous operation on the device the current user is using. The embodiments of the present invention provide a preferred solution to this problem.
Refer to Fig. 6, a flowchart of Embodiment 6 of the push-to-talk service control method according to the embodiments of the present invention. The method may comprise:
Step 601: receive voice, and parse the phonetic feature of the received voice;
Step 602: according to the user name selected by the user, extract the phonetic feature corresponding to that user name from the pre-saved phonetic features, where the pre-saved phonetic features are phonetic features, collected and saved in advance, of the users the device allows, and each user's phonetic feature is saved in one-to-one correspondence with that user's user name;
Step 603: compare whether the parsed phonetic feature is identical to the extracted phonetic feature; if they are identical, perform step 604: trigger the PTT service.
As the above embodiment shows, saving user names and phonetic features in one-to-one correspondence means that when a user uses the device for PTT service, the device only needs to look up the phonetic feature associated with the current user's name. A single comparison is enough to control PTT service accurately, which avoids the resource waste of traversing the phonetic features of all users.
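A minimal sketch of the user-name lookup in Embodiment 6 is shown below, assuming the pre-saved features are held in a dictionary keyed by user name and that a caller-supplied `matches` function decides whether two features are identical. Both are illustrative choices, not details from the patent.

```python
def trigger_for_selected_user(user_name, parsed_feature, features_by_user, matches):
    """Steps 602-603: fetch one saved feature by user name and compare once."""
    saved = features_by_user.get(user_name)  # single lookup, no traversal
    if saved is None:
        return False  # the selected user has no enrolled feature
    return matches(parsed_feature, saved)


# Hypothetical enrolled features for two team members.
features_by_user = {"alice": (0.9, 0.1), "bob": (0.2, 0.8)}
```

With "alice" selected on the device, Bob's voice does not trigger PTT even though Bob is also an allowed user, which addresses the case of allowed users gathered in one place.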
Corresponding to control method embodiment 1 above, the embodiment of the present invention provides an instant push-to-talk service control device. Referring to Fig. 7, a structural diagram of embodiment 1 of the instant push-to-talk service control device of the present invention, the device may comprise a receiving unit 701, a parsing unit 702, a search unit 703 and a trigger unit 704. The connections between these units and their functions are explained below in conjunction with the working principle of the device.
Receiving unit 701, configured to receive voice;
Parsing unit 702, configured to parse the phonetic feature of the received voice;
Search unit 703, configured to search the phonetic features saved in advance for the parsed phonetic feature and, when it is found, to start the trigger unit, where the phonetic features saved in advance are the phonetic features, collected and saved beforehand, of the users permitted by the device;
Trigger unit 704, configured to trigger the instant push-to-talk service.
The instant push-to-talk service control device of the embodiment of the present invention uses the user's phonetic feature as the criterion for deciding whether to trigger the PTT service. This avoids maloperation caused by interference from other users' voices or external noise, so that the instant push-to-talk service is controlled accurately and the user experience is improved.
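The four units of device embodiment 1 can be pictured as one small class. This is a sketch under stated assumptions: the class name `PttControlDevice`, the tuple representation of a feature, the exact-match set lookup, and the `toy_parse` function are hypothetical and only stand in for whatever feature extraction and matching a real device would use.

```python
from typing import Callable, Iterable, Tuple

Feature = Tuple[float, ...]  # assumed representation of a phonetic feature


class PttControlDevice:
    """Receiving, parsing, search and trigger units (701 to 704) in one object."""

    def __init__(self, parse: Callable[[bytes], Feature],
                 saved_features: Iterable[Feature]) -> None:
        self._parse = parse                # parsing unit 702
        self._saved = set(saved_features)  # features saved in advance
        self.triggered = 0                 # number of PTT triggers so far

    def receive(self, voice: bytes) -> bool:
        """Receiving unit 701: entry point for incoming voice."""
        feature = self._parse(voice)
        if self._search(feature):
            self._trigger()
            return True
        return False

    def _search(self, feature: Feature) -> bool:
        """Search unit 703: is the parsed feature among the saved ones?"""
        return feature in self._saved

    def _trigger(self) -> None:
        """Trigger unit 704: start the PTT service (counted here)."""
        self.triggered += 1


def toy_parse(voice: bytes) -> Feature:
    """Hypothetical parser: the first two bytes act as the 'feature'."""
    return tuple(float(b) for b in voice[:2])
```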
Corresponding to control method embodiment 2 above, the embodiment of the present invention provides control device embodiment 2, which adds a first judging unit to control device embodiment 1. Whether the subsequent search operation is performed is decided by judging the volume of the received voice. Because the search operation compares phonetic feature information, it is a relatively complex process that consumes considerable device resources; judging the volume first avoids the waste of resources caused by unnecessary search operations.
Referring to Fig. 8, a structural diagram of embodiment 2 of the instant push-to-talk service control device of the present invention, the device comprises five units. The receiving unit 801, parsing unit 802, search unit 803 and trigger unit 804 are identical to the corresponding units in control device embodiment 1 and are not described again here.
The first judging unit 805 is configured to judge whether the volume of the received voice is greater than a preset threshold and, if so, to trigger the search unit to perform the search operation.
When the judgment result of the first judging unit is no, neither the search unit nor the trigger unit performs any further operation, so the phonetic feature parsed by the parsing unit serves no purpose, even though the parsing operation consumes hardware resources and processing time. Therefore, to prevent redundant operations of the device from wasting resources, the units of the device may also work in the following connection mode.
First, the receiving unit receives voice; the first judging unit 805 then performs the judging operation. When it judges that the volume of the received voice is greater than the preset threshold, it triggers the parsing unit to perform the parsing operation, after which the search unit is triggered to perform the search operation. In other words, the control device first receives voice with the receiving unit and then judges the volume with the first judging unit; when the judgment result is yes, the parsing unit is triggered to parse; after the phonetic feature has been parsed, the search unit searches; and if the search finds the feature, the trigger unit is started to perform the trigger operation. This connection structure prevents the parsing unit from performing meaningless operations, avoids wasting resources on redundant operations, and thus saves further resources.
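The reordered connection mode, in which the volume is judged before any parsing happens, might look like the sketch below. The patent does not say how volume is measured; root-mean-square amplitude over the samples is an assumption made here, and `parse`, `search` and `trigger` are hypothetical callables standing in for the corresponding units.

```python
import math
from typing import Any, Callable, Sequence


def rms_volume(samples: Sequence[float]) -> float:
    """Root-mean-square amplitude, used here as an assumed volume measure."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def handle_voice(
    samples: Sequence[float],
    threshold: float,
    parse: Callable[[Sequence[float]], Any],
    search: Callable[[Any], bool],
    trigger: Callable[[], None],
) -> bool:
    # First judging unit: reject quiet input before spending time on parsing.
    if rms_volume(samples) <= threshold:
        return False
    feature = parse(samples)  # parsing unit runs only for loud-enough voice
    if search(feature):       # search unit
        trigger()             # trigger unit
        return True
    return False
```

When the volume check fails, `parse` is never called, which is exactly the saving the paragraph above describes.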
Corresponding to control method embodiment 3 above, the embodiment of the present invention provides control device embodiment 3, which adds a first recognition unit to control device embodiment 1. Referring to Fig. 9, a structural diagram of embodiment 3 of the instant push-to-talk service control device of the present invention, the device comprises five units. The receiving unit 901, parsing unit 902, search unit 903 and trigger unit 904 are identical to the corresponding units in control device embodiment 1 and are not described again here.
The first recognition unit 905 is configured to recognize whether a preset control statement is present in the received voice and, if so, to trigger the search unit to perform the search operation.
When the recognition result of the first recognition unit is that no control statement is present, neither the search unit nor the trigger unit performs any further operation, so the phonetic feature parsed by the parsing unit serves no purpose, even though the parsing operation consumes hardware resources and processing time. Therefore, to avoid the waste of resources caused by redundant operations of the device, the units of the device may also work in the following connection mode.
First, the receiving unit receives voice; the first recognition unit 905 then performs the recognition operation. When it recognizes that a preset control statement is present in the received voice, it triggers the parsing unit to perform the parsing operation, after which the search unit is triggered to perform the search operation. In other words, the control device first receives voice with the receiving unit and then performs recognition with the recognition unit; when the recognition result is that the control statement is present, the parsing unit is triggered to parse; after the phonetic feature has been parsed, the search unit searches; and if the search finds the feature, the trigger unit is started to perform the trigger operation. This connection structure further saves device resources and avoids wasting resources on redundant operations.
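A control-statement gate placed ahead of parsing could be sketched as follows. Recognizing a statement in audio requires a speech recognizer, which the patent does not describe; the `transcribe` callable, the substring match in `contains_control_statement`, and the other callables are hypothetical placeholders.

```python
from typing import Any, Callable, Iterable


def contains_control_statement(transcript: str, statements: Iterable[str]) -> bool:
    """Case-insensitive substring check for any preset control statement."""
    text = transcript.lower()
    return any(s.lower() in text for s in statements)


def handle_with_statement_gate(
    voice: Any,
    transcribe: Callable[[Any], str],  # assumed speech-to-text step
    statements: Iterable[str],
    parse: Callable[[Any], Any],
    search: Callable[[Any], bool],
    trigger: Callable[[], None],
) -> bool:
    # First recognition unit: stop early when no control statement is present.
    if not contains_control_statement(transcribe(voice), statements):
        return False
    feature = parse(voice)  # parsing unit runs only after recognition succeeds
    if search(feature):     # search unit
        trigger()           # trigger unit
        return True
    return False
```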
Corresponding to control method embodiment 4 above, the embodiment of the present invention provides control device embodiment 4, which adds a second judging unit and a second recognition unit to control device embodiment 1. Referring to Fig. 10, a structural diagram of embodiment 4 of the instant push-to-talk service control device of the present invention, the receiving unit 1001, parsing unit 1002, search unit 1003 and trigger unit 1004 are identical to the corresponding units in control device embodiment 1 and are not described again here.
The second judging unit 1005 is configured to judge whether the volume of the received voice is greater than a preset threshold and, if so, to trigger the second recognition unit to perform the recognition operation;
The second recognition unit 1006 is configured to recognize whether a preset control statement is present in the received voice and, if so, to trigger the search unit to perform the search operation.
When the recognition result of the second recognition unit is that no control statement is present, the search unit and the trigger unit do no further work and the parsed phonetic feature is no longer needed. In that case the phonetic feature parsed beforehand has no use, and the parsing operation is a redundant operation that wastes resources. To avoid this waste, the units of the device of this embodiment may also work in the following connection mode:
After the receiving unit receives voice, the second judging unit performs the judging operation. When the judgment result is yes, the second recognition unit performs the recognition operation. When the recognition result is that the control statement is present, the parsing unit is triggered to perform the parsing operation, and the search unit is then started to perform the search operation. When the search finds the feature, the trigger unit is started to perform the trigger operation. In other words, the parsing unit does not parse immediately after the receiving unit receives voice; it parses only once the second recognition unit has recognized the control statement. This connection mode prevents the parsing unit from wasting resources on meaningless operations.
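The chained arrangement of embodiment 4 generalizes to an ordered list of inexpensive checks that must all pass before parsing starts. The sketch below assumes each check is a plain predicate over the received voice; the function name `run_gated_pipeline` and the callables are illustrative and do not come from the patent.

```python
from typing import Any, Callable, Iterable


def run_gated_pipeline(
    voice: Any,
    gates: Iterable[Callable[[Any], bool]],
    parse: Callable[[Any], Any],
    search: Callable[[Any], bool],
    trigger: Callable[[], None],
) -> bool:
    # Gates run in order (for example volume first, then control statement);
    # the first failure stops everything, so later gates and parsing are skipped.
    for gate in gates:
        if not gate(voice):
            return False
    feature = parse(voice)  # parsing unit: reached only if every gate passed
    if search(feature):     # search unit
        trigger()           # trigger unit
        return True
    return False
```

Putting the cheaper check (volume) before the more expensive one (statement recognition) keeps the average cost per rejected input low.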
Corresponding to control method embodiment 5 above, the embodiment of the present invention provides control device embodiment 5, which adds a monitoring unit to control device embodiment 1. Referring to Fig. 11, a structural diagram of embodiment 5 of the instant push-to-talk service control device of the present invention, the device comprises five units. The receiving unit 1101, parsing unit 1102, search unit 1103 and trigger unit 1104 are identical to the corresponding units in control device embodiment 1 and are not described again here.
The monitoring unit 1105 is configured to monitor the press signal of the instant push-to-talk service button and, when a press signal is detected, to start the trigger unit.
The instant push-to-talk service control device of the embodiment of the present invention gives the user a control mode in which the PTT service can be triggered either by the physical PTT button or by the user's phonetic feature. This control mode improves both the reliability of PTT service control and the user experience.
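The dual control mode amounts to a logical OR of the two trigger sources. In this minimal sketch the voice check is passed as a callable so that it is evaluated only when the button is not pressed; the function name and parameters are assumptions for illustration.

```python
from typing import Callable


def ptt_should_start(button_pressed: bool, voice_matches: Callable[[], bool]) -> bool:
    """Start PTT on a button press (monitoring unit) or on a voice match.

    `or` short-circuits, so the voice path is skipped when the button is pressed.
    """
    return button_pressed or voice_matches()
```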
Although a device may permit multiple users to use it, it does not permit them to use it simultaneously: during any given period, the device belongs to and is used by only one user. To let the current user control the instant push-to-talk service more accurately and conveniently, the embodiment of the present invention provides the following preferred scheme on the basis of any one of the control devices above. The phonetic features saved in advance, on which the search unit relies, are the phonetic features, collected and saved beforehand, of the users permitted by the device, where the phonetic feature of each user is saved in one-to-one correspondence with that user's user name. The search unit then comprises:
Extraction module, configured to extract, according to the user name selected by the user, the phonetic feature corresponding to that user name from the phonetic features saved in advance;
Comparison module, configured to compare whether the parsed phonetic feature is identical to the extracted phonetic feature and, if so, to start the trigger unit.
For the control device, the phonetic features of the users permitted by the device must be collected and saved in advance, with the phonetic feature of each user saved in one-to-one correspondence with that user's user name. When a user uses the device, the user first selects his or her own user name so that the device knows who the current user is. When the device receives voice, it only needs to compare against the phonetic feature corresponding to this user name to decide whether to trigger the PTT service. This avoids the waste of device resources caused by redundant comparisons with the other phonetic features in the system. It also avoids interference with the current user's device from the other users permitted by the device.
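The difference between traversing every saved feature and comparing against the selected user's feature only can be made concrete by counting comparisons. The sketch below assumes features are tuples compared by equality; both function names are hypothetical.

```python
from typing import Dict, Tuple

Feature = Tuple[float, ...]  # assumed representation of a phonetic feature


def search_by_traversal(parsed: Feature,
                        saved: Dict[str, Feature]) -> Tuple[bool, int]:
    """Check every saved feature in turn; returns (found, comparisons made)."""
    comparisons = 0
    for feature in saved.values():
        comparisons += 1
        if feature == parsed:
            return True, comparisons
    return False, comparisons


def search_by_selected_user(parsed: Feature, selected_user: str,
                            saved: Dict[str, Feature]) -> Tuple[bool, int]:
    """Extraction module plus comparison module: at most one comparison."""
    expected = saved.get(selected_user)  # extraction module
    if expected is None:
        return False, 0
    return expected == parsed, 1         # comparison module
```

With three saved users, a second user's voice is accepted by traversal after two comparisons, whereas a device on which the first user is selected rejects it after a single comparison, which also illustrates the interference point made above.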
It should be noted that relational terms such as "first" and "second" are used herein only to distinguish one entity or operation from another, and do not necessarily require or imply any actual relationship or order between those entities or operations. Moreover, the terms "comprise", "include" and any variants thereof are intended to be non-exclusive, so that a process, method, article or device comprising a series of elements comprises not only those elements but also other elements not explicitly listed, or elements inherent to the process, method, article or device. Without further limitation, an element defined by the phrase "comprising a ..." does not exclude the presence of additional identical elements in the process, method, article or device comprising that element.
It should also be noted that a person of ordinary skill in the art will understand that all or part of the flows of the method embodiments above can be carried out by hardware under the instruction of a program executable by a terminal device. The program may be stored in a device-readable storage medium and, when executed, may include the flows of the method embodiments above. The storage medium may be a magnetic disk, an optical disc, a read-only memory (ROM), a random access memory (RAM), or the like.
The instant push-to-talk service control method and device provided by the present invention have been described in detail above. Specific embodiments are used herein to set forth the principle and implementation of the present invention, and the description of the embodiments is intended only to help in understanding the method of the present invention and its core idea. A person of ordinary skill in the art may, in accordance with the idea of the present invention, make changes to the specific implementations and the scope of application. In summary, this description should not be construed as limiting the present invention.

Claims (10)

1. An instant push-to-talk service control method, characterized by comprising:
receiving voice, and parsing the phonetic feature of the received voice;
searching the phonetic features saved in advance for the parsed phonetic feature, wherein the phonetic features saved in advance are the phonetic features, collected and saved beforehand, of the users permitted by the device;
if the parsed phonetic feature is found, triggering the instant push-to-talk service.
2. The method according to claim 1, characterized in that, before searching the phonetic features saved in advance for the parsed phonetic feature, the method further comprises:
judging whether the volume of the received voice is greater than a preset threshold and, if so, performing the operation of searching the phonetic features saved in advance for the parsed phonetic feature.
3. The method according to claim 1, characterized in that, before searching the phonetic features saved in advance for the parsed phonetic feature, the method further comprises:
recognizing whether a preset control statement is present in the received voice and, if so, performing the operation of searching the phonetic features saved in advance for the parsed phonetic feature.
4. The method according to claim 1, characterized in that, before searching the phonetic features saved in advance for the parsed phonetic feature, the method further comprises:
judging whether the volume of the received voice is greater than a preset threshold; if so, recognizing whether a preset control statement is present in the received voice; and if so, performing the operation of searching the phonetic features saved in advance for the parsed phonetic feature.
5. The method according to any one of claims 1 to 4, characterized in that the phonetic features saved in advance are the phonetic features, collected and saved beforehand, of the users permitted by the device, wherein the phonetic feature of each user is saved in one-to-one correspondence with that user's user name, and the searching of the phonetic features saved in advance for the parsed phonetic feature and, if it is found, triggering of the instant push-to-talk service comprises:
extracting, according to the user name selected by the user, the phonetic feature corresponding to the user name from the phonetic features saved in advance;
comparing whether the parsed phonetic feature is identical to the extracted phonetic feature;
if identical, triggering the instant push-to-talk service.
6. An instant push-to-talk service control device, characterized by comprising:
a receiving unit, configured to receive voice;
a parsing unit, configured to parse the phonetic feature of the received voice;
a search unit, configured to search the phonetic features saved in advance for the parsed phonetic feature and, when it is found, to start a trigger unit, wherein the phonetic features saved in advance are the phonetic features, collected and saved beforehand, of the users permitted by the device;
the trigger unit, configured to trigger the instant push-to-talk service.
7. The device according to claim 6, characterized in that the device further comprises:
a first judging unit, configured to judge whether the volume of the received voice is greater than a preset threshold and, if so, to trigger the search unit to perform the search operation.
8. The device according to claim 6, characterized in that the device further comprises:
a first recognition unit, configured to recognize whether a preset control statement is present in the received voice and, if so, to trigger the search unit to perform the search operation.
9. The device according to claim 6, characterized in that the device further comprises:
a second judging unit, configured to judge whether the volume of the received voice is greater than a preset threshold and, if so, to trigger a second recognition unit to perform the recognition operation;
the second recognition unit, configured to recognize whether a preset control statement is present in the received voice and, if so, to trigger the search unit to perform the search operation.
10. The device according to any one of claims 6 to 9, characterized in that the phonetic features saved in advance are the phonetic features, collected and saved beforehand, of the users permitted by the device, wherein the phonetic feature of each user is saved in one-to-one correspondence with that user's user name, and the search unit comprises:
an extraction module, configured to extract, according to the user name selected by the user, the phonetic feature corresponding to the user name from the phonetic features saved in advance;
a comparison module, configured to compare whether the parsed phonetic feature is identical to the extracted phonetic feature and, if so, to start the trigger unit.
CN201410205777.1A 2014-05-15 2014-05-15 Push-to-talk service control method and apparatus Pending CN105096936A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410205777.1A CN105096936A (en) 2014-05-15 2014-05-15 Push-to-talk service control method and apparatus


Publications (1)

Publication Number Publication Date
CN105096936A true CN105096936A (en) 2015-11-25

Family

ID=54577222

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410205777.1A Pending CN105096936A (en) 2014-05-15 2014-05-15 Push-to-talk service control method and apparatus

Country Status (1)

Country Link
CN (1) CN105096936A (en)


Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1656366A (en) * 2002-05-29 2005-08-17 诺基亚有限公司 Method in a digital network system for controlling the transmission of terminal equipment
DE102005014520A1 (en) * 2005-03-30 2006-10-05 Siemens Ag Push-to-talk-over-cellular capable subscriber station e.g. mobile radio device, for cellular communication system, has control device and/or language analysis module producing functionality to control connection if password is identified
WO2009075211A1 (en) * 2007-12-10 2009-06-18 Sharp Kabushiki Kaisha Automatic utterer judgment-recording device and automatic utterer judgment-recording system
CN102054481A (en) * 2009-10-30 2011-05-11 大陆汽车有限责任公司 Device, system and method for activating and/or managing spoken dialogue
CN102881287A (en) * 2012-09-20 2013-01-16 熊猫电子集团有限公司 Voice triggering method and circuit
CN103559883A (en) * 2013-08-24 2014-02-05 郑静晨 Cabin interphone starting method based voice frequency domain fingerprint


Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105338429A (en) * 2015-12-10 2016-02-17 南京正泽科技有限公司 Interphone voice frequency signal relay system
CN105338429B (en) * 2015-12-10 2022-05-20 南京正泽科技股份有限公司 Intercom audio signal relay system
CN106331556A (en) * 2016-09-20 2017-01-11 深圳市同行者科技有限公司 Traffic violation snapshot control method and device based on voice recognition
CN110752973A (en) * 2018-07-24 2020-02-04 Tcl集团股份有限公司 Terminal equipment control method and device and terminal equipment
CN110752973B (en) * 2018-07-24 2020-12-25 Tcl科技集团股份有限公司 Terminal equipment control method and device and terminal equipment
CN109243447A (en) * 2018-10-12 2019-01-18 西安蜂语信息科技有限公司 Voice sends triggering method and device
CN109936814A (en) * 2019-01-16 2019-06-25 深圳市北斗智能科技有限公司 A kind of intercommunication terminal, speech talkback coordinated dispatching method and its system


Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20151125
