WO2019127057A1

WO2019127057A1 - Method for processing voice signal for group call, communication terminal and computer storage medium

Info

Publication number: WO2019127057A1
Application number: PCT/CN2017/118766
Authority: WO
Inventors: 邓智; 于海洋; 陈芬; 杜湘洋
Original assignee: 海能达通信股份有限公司
Priority date: 2017-12-26
Filing date: 2017-12-26
Publication date: 2019-07-04

Abstract

The present application relates to the field of wireless terminal communications, and provided thereby are a method for processing a voice signal for a group call, a communication terminal, and a computer storage medium. Provided by the present application is a method for processing a voice signal for a group call, the method comprising: receiving a voice signal; extracting key information from the voice signal; determining whether the key information matches pre-stored template information; if so, running a preset function command so as to respond to a group call. The method of the present application may solve the existing problem wherein key information cannot be followed promptly or is missed.

Description

Voice signal processing method for group call, communication terminal and computer storage medium

【技术领域】[Technical Field]

The present application relates to the field of wireless terminal communication, and in particular, to a voice signal processing method for a group call, a communication terminal, and a computer storage medium.

【背景技术】【Background technique】

As a two-way mobile communication tool, the walkie-talkie has many advantages, such as making a call without any network, so that no call charges are incurred, thereby reducing economic costs, and it is suitable for applications where relatively fixed and frequent calls are made.

As the frequency of use of walkie-talkies in group professional services is increasing, the application of walkie-talkies in group calls is receiving more and more attention. However, in the group call application scenario, it is inevitable that the received information contains many useless information. As a result of receiving information redundancy, it is often impossible to pay attention to or miss key information in time, resulting in poor user experience.

【发明内容】 [Summary of the Invention]

The application provides a voice signal processing method for a group call, a communication terminal, and a computer storage medium, so that the user can pay attention to or avoid missing key information in time.

To solve the above technical problem, the present application provides a method for processing a voice signal of a group call, the method comprising: receiving a voice signal; extracting key information in the voice signal; determining whether the key information matches the pre-stored template information; if yes, running the pre- Set the function command to respond to the group call.

To solve the above technical problem, the present application further provides a communication terminal, the communication terminal is configured to receive a voice signal of a group call, the communication device includes a processor and a memory coupled to each other, and the template information is pre-stored in the memory, and the processor is used to Extract key information in the voice signal; determine whether the key information matches the template information; if yes, run the preset function command to respond to the group call.

In order to solve the above technical problems, the present application further provides a computer storage medium having stored thereon a computer program capable of being executed to implement the method of any of the above methods.

The utility model has the beneficial effects of: receiving a voice signal; extracting key information in the voice signal; determining whether the key information matches the pre-stored template information; if yes, running a preset function instruction to respond to the group call, the application can The key information matching the preset template information is extracted from the language signal of the group call, and the corresponding function is run to respond to the group call, which can solve the problem that the user cannot pay attention to or miss the key information in time.

【附图说明】 [Description of the Drawings]

In order to more clearly illustrate the embodiments of the present application or the technical solutions in the prior art, the drawings to be used in the embodiments or the prior art description will be briefly described below. Obviously, the drawings in the following description are only It is a certain embodiment of the present application, and other drawings can be obtained according to the drawings without any creative work for those skilled in the art.

1 is a schematic flow chart of an embodiment of a method for processing a voice signal of a group call according to the present application;

2 is a schematic flow chart of still another embodiment of a method for processing a voice signal of a group call according to the present application;

3 is a schematic diagram of a set of call scenes of the present application;

4 is a schematic structural diagram of an embodiment of a computer storage medium of the present application;

FIG. 5 is a schematic structural diagram of an embodiment of a communication terminal according to the present application; FIG.

FIG. 6 is a schematic structural diagram of still another embodiment of a communication terminal according to the present application.

【具体实施方式】【Detailed ways】

The technical solutions in the embodiments of the present application will be clearly and completely described in the following with reference to the accompanying drawings in the embodiments. It is understood that the specific embodiments described herein are merely illustrative of the application and are not intended to be limiting. In addition, it should be noted that, for the convenience of description, only some but not all of the structures related to the present application are shown in the drawings. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present application without departing from the inventive scope are the scope of the present application.

The communication terminal provided by the embodiment of the present application includes an electronic device such as a smart phone, a tablet computer, a smart wearable device, a digital audio and video player, an electronic reader, and a handheld game machine.

The terms "first", "second", and "third" in this application are used for descriptive purposes only, and are not to be construed as indicating or implying a relative importance or implicitly indicating the number of technical features indicated. Thus, features defining "first", "second", and "third" may include at least one of the features, either explicitly or implicitly. In the description of the present application, the meaning of "plurality" is at least two, for example two, three, and the like. Furthermore, the terms "comprises" and "comprising" and "comprising" are intended to cover a non-exclusive inclusion. For example, a process, method, system, product, or device that comprises a series of steps or units is not limited to the listed steps or units, but optionally also includes steps or units not listed, or alternatively Other steps or units inherent to these processes, methods, products or equipment.

References to "an embodiment" herein mean that a particular feature, structure, or characteristic described in connection with the embodiments can be included in at least one embodiment of the present application. The appearances of the phrases in various places in the specification are not necessarily referring to the same embodiments, and are not exclusive or alternative embodiments that are mutually exclusive. Those skilled in the art will understand and implicitly understand that the embodiments described herein can be combined with other embodiments.

Please refer to FIG. 1. FIG. 1 is a schematic flowchart diagram of an embodiment of a method for processing a voice signal of a group call according to the present application. In this embodiment, the voice signal processing method of the group call may include the following steps:

S11: Receive a voice signal.

In this step S11, the communication terminal first receives the voice signal. The voice signal is obtained by the communication terminal as the transmitting end, and the voice information may be a piece of music, a sentence, etc., and then the voice information is converted into a voice signal and sent to the communication terminal as the receiving end, as the communication terminal of the receiving end. The voice signal of the transmitting end is received.

The number of the communication terminals on the transmitting end and the number of the receiving terminal terminals may be one-to-one or one-to-many. In this embodiment, one communication terminal as the transmitting end corresponds to a plurality of communication terminals as the receiving end, for example, In a group call scenario, the base station, the control center, or the LEADER walkie-talkie can serve as the transmitting end. When at least two walkie-talkies that communicate with the base station, the control center, or the LEADER walkie-talkie are used as the receiving end, they can be regarded as one-to-many. When the number of the radios is one, it can be regarded as one-to-one. In this embodiment, it is not limited.

The voice signal can be transmitted by wired technology or by wireless technology. For example, the wired transmission can be twisted pair transmission, coaxial cable transmission, optical fiber transmission, etc., and the wireless transmission can be video baseband transmission, optical fiber transmission, and network. Transmission, microwave transmission, broadband common cable transmission, and so on. Generally, when the communication terminal of the receiving end receives the voice signal, the voice signal is generally converted into voice for playback. However, in this embodiment, after the voice signal is received, the playback operation is not performed, but The received speech signal is analyzed, and the operation of extracting the key information in the following step S12 is performed.

S12: Extract key information in the voice signal.

After receiving the speech signal in the above step S11, the key information in the speech signal is extracted in this step S12. The voice signal may include information such as frequency, loudness, pitch, text pronunciation, etc., wherein some information in the voice signal may be extracted as key information. The key information may be information including the number of words, the pronunciation of the text, etc. The content of the key information may be preset or may follow the default setting. It can be known that a plurality of key information may be included in one voice signal.

When the voice signal received by the communication terminal at the receiving end, generally speaking, the operation of playing the voice signal is directly performed, but in this embodiment, the voice signal is not directly played, but the key information extracted is used and used in the following In step S13, it is judged whether or not matching with the pre-stored template information.

S13: Determine whether the key information matches the pre-stored template information.

After the key information of the voice signal is extracted in the above step S12, the pre-stored template information is matched in this step S13, and if it matches, the following step S14 is performed.

The template information is fixed information used for comparison with key information. Specifically, before using the communication terminal to carry out activities, the user adjusts the template information according to the current personnel and activity content, such as activity code, password, and the like. At this time, the corresponding template information is set. Generally speaking, the template information is not modified before the end of the activity, and the template information is set on the communication terminal of the receiving end. When the communication terminal of the receiving end receives the voice information, step S12 and step S13 are repeated, that is, key information is extracted and matched with the pre-stored template information for each received voice signal. For example, when the communication terminal at the transmitting end makes a group call to the communication terminal at the receiving end, the communication terminal at the receiving end receives a voice signal whose content is “Zhang San, Li Si, please reply to your location”, according to the rules for extracting key information. Extracting key information from the content of the received voice signal, wherein the template information of the communication terminal of the first receiving end is set to the name of the owner, "Zhang San", and the template information of the communication terminal of the second receiving end is set as the owner The name "Li Si", the template information of the communication terminal of the third receiving end is set to the name of the owner "Wang Wu", at this time, for the communication terminals of the first and third receiving ends, the voice signal is extracted. The key information matches the template information. At this time, the following step S14 is performed, and for the communication device of the second receiving end, the key information extracted in the voice signal does not match the template information, and no operation is performed at this time.

S14: Run a preset function command to respond to the group call.

When it is determined in the above step S13 that the key information matches the pre-stored template information, the step S14 is executed to execute the preset function instruction in response to the group call. A function instruction is an instruction that implements a certain function. Generally speaking, it may be a voice information that illuminates a display screen, vibrates, and plays a voice signal, and the function instruction may follow a default setting or a preset, for example, When the communication terminal receives the voice signal, the voice information converted by the voice signal is played after the default vibration, and the user may preset the reminder ringtone according to the personal preference, that is, the voice message is converted by playing the ringtone and then the voice signal is converted. The function command is preset. When the key information matches the pre-stored template information, the preset function command will be run in response to the group call.

In this embodiment, by receiving a voice signal, and then extracting key information in the voice signal, the key information is matched with the pre-stored template information to determine whether the received voice signal is important information to be concerned, and if matched, the preset is run. The function command, in response to the group call, enables the user to pay attention to or avoid missing key information in time to improve the user experience.

Please refer to FIG. 2. FIG. 2 is a schematic flowchart diagram of still another embodiment of a method for processing a voice signal of a group call according to the present application. In this embodiment, the voice signal processing method of the group call may include the following steps:

S21: Acquire voice data.

S22: Extract keyword language information in the voice data.

S23: Save the keyword language information as template information.

In this embodiment, the template information is stored in advance before the voice signal processing of the group call is performed, and the step of storing the template information may include acquiring voice data, and extracting keyword voice information in the voice data, where the method further includes: according to the preset keyword. The keyword voice information in the voice data is extracted, and the keyword voice information is saved as a voice template. Steps S21 to S23 are put together for explanation.

For example, in a group call scenario, a communication terminal held by the group leader as a transmitting end and a communication terminal held by the group member as a receiving end of the subsystem form a contact system, and the voice signal transmits a radio frequency signal through the communication terminal of the transmitting end. After receiving the communication terminal, the receiving terminal sends the RF subsystem to perform related processing.

In this embodiment, in order to obtain key information, a voice template containing key information is also established in the communication terminal of each receiving end. Specifically, the keyword voice information is first recorded by an external microphone to obtain voice data, and then the voice data is converted into a voice analog signal, and then the voice analog signal is sent to the CODEC (supporting video and audio compression (CO) and solution. compression( DEC) The codec or software) chip module performs analog conversion and amplification related processing, and the processed speech signal enters the storage mode for correlation processing. The storage mode may include the following steps: detecting the keyword voice signal, and if the detection is successful, extracting the keyword voice information, and extracting the keyword voice information to store the final voice template.

There are a plurality of ways for the voice information to be recorded by the external microphone to obtain the voice data. In this embodiment, the following three methods are available:

In the first method, the keyword voice information that the communication terminal of the receiving end receives through the microphone is from the group leader, and is not a member of the group holding the communication terminal. Specifically, when the communication terminal records the keyword voice information, the voice recognition technology is used for recognition. In addition to the basic voice information, other conditions may be added. For example, the voice of the group leader may also include tone, sound color, and accent. And so on, so it can be used as a limit when identifying, in case other people know that the keyword content interferes with the information of the group members.

In the second mode, the keyword voice information that the communication terminal of the receiving end receives through the microphone is from a group member holding the communication terminal. Specifically, when the communication terminal records the keyword voice information, the voice recognition technology recognizes, that is, the voice information content is recognized.

In the third mode, the keyword voice information received by the communication terminal of the receiving end through the microphone is from the group leader, but the keyword voice information is pre-recorded group length voice information, and is stored in the communication terminal of the receiving end, and is used in use. Check the programming options. Specifically, when the group is active, there may be more than one common keyword voice information. To perform related settings, it is possible to avoid resetting the keyword voice information before each activity, and reduce the operation steps, for example, according to The different time of the activity time uses different keyword voice information, which is divided into three keyword voice information at the beginning of the month, the middle of the month and the end of the month, that is, the three key words of the month at the beginning of the month, the middle of the month and the end of the month, so at each event , the corresponding keyword information will be selected according to the activity time.

In the above storage mode, the voice signal is detected, and the keyword voice information that is successfully detected is extracted, and the extracted keyword voice information is stored as a final voice template. Specifically, the user can use the microphone to record the keyword voice information in multiple ways. In this embodiment, the user can have the following two types. The first one can only say the keyword that is desired to be recorded, and the key is recognized by the voice recognition technology. Word information, for example, when the keyword is set to "Zhang San", only the word "Zhang San" is recorded when recording the voice message; the second is that the text information of the keyword can be input in advance, and then recorded by voice, only If the keyword that is desired to be recorded is said to contain a keyword, the keyword voice information is extracted according to the text information of the keyword input in advance. For example, when the keyword is set to "Zhang San", the keyword input in advance is input. The text message is "Zhang San". When recording voice messages, say "Zhang San please answer". At this time, the keyword voice information will be extracted based on the text information. After extracting the keyword voice information, the extracted keyword voice information is stored to form a voice template, wherein the voice template is stored in the communication terminal at the receiving end, and is used to receive the voice signal in step S26 described below. The key information in the comparison.

For the content of the keyword in the voice template, in general, it may be an activity code, a secret number, a name of the owner, and the like. In this embodiment, in order to prevent the user at the receiving end from receiving unnecessary information or missing key information, the voice template is used. Set to the owner's name. Generally speaking, in an activity, when the group leader A wants to release the task to the member C, he will first call the name of the member C, and then publish the task, so the voice template is set to the machine. The main name can effectively filter out what information the owner needs to pay attention to.

S24: Receive a voice signal.

In this step S24, the communication terminal first receives the voice signal. The voice signal is collected by the communication terminal of the sending end first, wherein the voice information may be a piece of music, a sentence, etc., but in the embodiment, the voice information is a paragraph, and then the voice information is converted into a voice signal. The communication terminal sent to the receiving end receives the voice signal of the transmitting end on the communication terminal of the receiving end. The voice signal can be transmitted by using a wired technology or by using a wireless technology. In this embodiment, the voice signal is transmitted by using a wireless technology, and the radio terminal is used, that is, the communication terminal at the receiving end receives the voice based on the radio frequency technology. Signal.

Generally, when the communication terminal of the receiving end receives the voice signal, the voice signal is generally converted into voice information for playing, but in this embodiment, after the voice signal is received, the playback operation is not performed, but The received speech signal is analyzed, and the operation of extracting the key information in the following step S25 is performed.

S25: Extract key information in the voice signal.

After receiving the speech signal in the above step S24, the key information in the speech signal is extracted in this step S25. Specifically, the keyword information may be information including the number of words, the pronunciation of the text, etc., and the content of the key information may be preset or may follow a default setting, for example, when the number of words of the content of the default key information is set to three words. When the received speech signal is extracted one by one, the adjacent three words are combined to form a key information, for example, as shown in FIG. As shown in the figure, FIG. 3 is a schematic diagram of a set of call scenes of the present application. When the content of the voice signal sent by the LEADER walkie-talkie is “group member C, group member C, here is LEADER, please move the window on the south side of the third floor, please answer. ", then each of the three words in the content constitutes a key message, that is, extracting "Group C", "C-C", "C-member", "C-C", "C here", etc. The word composed of words is the key information in the speech signal, that is, a speech signal can contain multiple key information.

When the voice signal received by the communication terminal at the receiving end, generally speaking, the operation of playing the voice signal is directly performed, but in this embodiment, the voice signal is not directly played, but the key information extracted is used and used in the following In step S26, it is judged whether or not matching with the pre-stored template information.

S26: Determine whether the key information matches the pre-stored template information.

In this step S26, the voice template pre-stored in the above step S23 is matched with the key information extracted in the above step S25. If it matches, the following step S27 is performed, and if it does not match, no operation is performed.

Specifically, as shown in FIG. 3, in a group call scenario, the voice templates of the communication terminals of each group member are the respective names recorded by the LEADER, for example, "group member A" and "group member B". , "group member C", "group member D", "group member E", etc., the content of the key information set by the communication terminal of each receiving end is the number of words is three, when the communication terminal of the transmitting end held by the group leader is to the receiving end When the communication terminal performs a group call, all the communication terminals of the receiving end that all the group members hold receive the same voice signal, and the content thereof is “group member C, group member E, please reply to your position”, at this time, each receiving end The communication terminal extracts the key information in the voice signal and matches the voice template pre-stored in each communication terminal, and the matching result of the group member A, the group member B, and the group member D is no, that is, the matching fails, then their communication terminal The original state is maintained, and no operation is performed, and the result of the match between the member C and the member E is YES, that is, the matching is successful, and the following step S27 is performed.

S27: Run the preset function command to respond to the group call.

After the key information is matched with the pre-stored template information in the above step S26, the preset function command is executed to respond to the group call. The preset function commands can perform related expansion functions, that is, there can be multiple function commands to meet various needs of the user's various group call scenes, for example, voice information for realizing lighting display, vibration, and playback of voice signal conversion, etc. The function command may be a default setting or a preset. In this embodiment, the function command for running the preset includes an instruction to operate the volume, vibration or flash, and an instruction to save the voice signal to the voice signal. The sender transmits location information.

For example, in the group call scenario shown in FIG. 3, when the LEADER walkie-talkie initiates a group call, the LEADER communicates the instructions for assigning tasks to each group member, and calls the group member C by voice. Generally speaking, when team member C is busy and has insufficient concentration, the important voice information sent by the LEADER walkie-talkie will be missed. LEADER will repeatedly call group member C because he has not received the response, until group member C replies. Voice message; however, when LEADER repeatedly calls team member C, but team member C does not reply, LEADER cannot distribute the task normally, then there will be cases where team member C cannot know the task content of LEADER before returning; When the team is in a dangerous field environment, if LEADER cannot confirm the exact location of team member C, LEADER will worry about whether team C is dangerous and will also affect the smooth progress of the work.

However, in this embodiment, the voice template set by the walkie-talkie held by the group member C is the owner name. When the LEADER calls the group member C, the intercommunication opportunity of the group member C matches the received voice signal pre-stored voice template voice. After the matching is successful, the walkie-talkie of the member C will automatically switch to the LED flash mode or the vibration mode. The form of the LED flash mode can be various, for example, the setting flashing time or the number of flashing times, when the intercom includes When multiple LED lights are used, the number of LED flashes can be set, etc. The vibration mode can be in the form of setting the duration or frequency of the motor vibration. The duration or frequency can be preset or follow the default, in this embodiment. Set the duration of the vibration to 3 seconds, the frequency can be 1 second after each vibration for 5 seconds, the vibration or flashing light of the member C intercom will cause the attention of the member C, so that the member C finds that the walkie-talkie has received the voice information. Then, the voice information is smoothly received; the walkie-talkie of the member C will enlarge the volume when playing the voice information, so that the member C can hear the contents of the task assigned by the LEADER; however, When the member C does not answer for a certain period of time, the intercommunication opportunity of the member C turns on the automatic recording mode, that is, the walkie-talkie of the member C automatically saves the voice information after the keyword matching and supports playback, for example, setting the time to 30 seconds, which can avoid If the member C misses the voice information that needs attention, it cannot be retrieved. The time of the missed call can be preset or the default setting can be followed. It is not limited here; if the member C does not answer for a long time, for example, 5 Minutes, the default owner of the crew of the crew C is in a dangerous situation. At this time, the intercom will automatically send the location information of the crew C to the LEADER, which can enable the LEADER to confirm the rescue location when the crew C is in danger, saving the rescue time. The time of the unanswered time can be preset or the default setting, which is not limited here.

In this embodiment, by receiving a voice signal, and then extracting key information in the voice signal, the key information is matched with the pre-stored template information to determine whether the received voice signal is important information to be concerned, and if matched, the preset is run. The function instruction is used to remind the user to respond to the group call. In this embodiment, the key information matching the preset template information can be extracted from the language signal of the group call, and the corresponding function is executed to respond to the group call, so that the user can timely pay attention. Or avoid missing key information and improve the user experience.

The above method is applied to a communication terminal, and the logic process thereof is represented by a computer program, and is specifically implemented by a communication terminal.

When the computer program is implemented in software and sold or used as a stand-alone product, it can be stored in an electronic device readable storage medium, that is, the application further provides a computer storage medium on which the program data is stored. The steps of the above method are implemented when the program data is executed by the processor. Please refer to FIG. 4. FIG. 4 is a schematic structural diagram of an embodiment of a computer storage medium according to the present application. Program data in the computer storage medium 100 can be executed to implement the method of the foregoing embodiment. The computer storage medium can be, for example, a USB flash drive or an optical disk. , server, etc.

For the hardware structure of the communication terminal, please refer to FIG. 5. FIG. 5 is a schematic structural diagram of an embodiment of a communication terminal according to the present application. The communication terminal 200 of the embodiment includes a processor 21, a memory 22, a microphone 23, and a radio frequency module 24. The processor 21 is coupled to the memory 22, the microphone 23, and the radio frequency module 24. The program data is stored in the memory 22, and the processor 21 can load the program. The data is executed and implemented to implement the voice signal processing method of the group call, and the radio frequency module 24 is configured to receive the voice signal.

Specifically, the processor 21 is configured to extract key information in the voice signal, determine whether the key information matches the template information, and if the key information matches the template information, run the preset function instruction to respond to the group call.

For the voice signal processing method for the group terminal to implement the group call, the method for processing the voice signal of the group call in the communication terminal of the embodiment is similar to the embodiment of the foregoing embodiment. For the specific implementation steps, refer to FIG. 1 or FIG. 2, and details are not described herein. .

It should be noted that the communication terminal that transmits the voice information and the communication terminal that receives the voice information may be two different communication terminals. Specifically, in a group call scene, the communication terminal as the transmitting end sends a voice signal to the receiving end. The communication terminal receives the key information in the voice signal, and determines whether the key information matches the template information; if yes, runs the preset function command to respond to the communication terminal of the sender.

The communication terminal of this embodiment enables the user to pay attention to or avoid missing key information in time to improve the user experience.

Please refer to FIG. 6. FIG. 6 is a schematic structural diagram of still another embodiment of a communication terminal according to the present application. In this embodiment, the communication terminal 200 is the communication terminal in the above embodiment, and the communication terminal 200 includes a receiving module 31, an extracting module 32, a determining module 33, and an operating module 34.

The receiving module 31 is configured to receive a voice signal.

The extraction module 32 is configured to extract key information in the voice signal.

The determining module 33 is configured to determine whether the key information matches the pre-stored template information.

The running module 34 is configured to: when the key information matches the pre-stored template information, run the preset function instruction to respond to the group call.

The above is only the embodiment of the present application, and thus does not limit the scope of patents of the present application, and the equivalent structure or equivalent process transformation made by using the specification and the contents of the drawings, or directly or indirectly applied to other related technical fields, The same is included in the scope of patent protection of this application.

Claims

A method for processing a voice signal of a group call, characterized in that the method comprises:

Receiving the voice signal;

Extracting key information in the voice signal;

Determining whether the key information matches the pre-stored template information;

If so, a preset function command is run in response to the group call.
The method of claim 1 wherein the method further comprises:

Acquire voice data;

Extracting keyword voice information in the voice data;

The keyword voice information is saved as the template information.
The method according to claim 2, wherein the extracting the keyword voice information in the voice data comprises:

The keyword voice information in the voice data is extracted according to a preset keyword.
The method according to claim 1, wherein the running the preset function instruction comprises:

Run the command to amplify the volume, vibration or flash.
The method according to claim 1, wherein the running the preset function instruction comprises:

Run an instruction to save the voice signal.
The method according to claim 1, wherein the running the preset function instruction comprises:

The location information is transmitted to the sender of the voice signal.
The method of claim 1, wherein the receiving the voice signal comprises:

The speech signal is received based on radio frequency technology.
A communication terminal, the communication terminal is configured to receive a voice signal of a group call, wherein the communication device comprises a processor and a memory coupled to each other, wherein the memory pre-stores template information, and the processor is used for :

Extracting key information in the voice signal;

Determining whether the key information matches the template information;

If so, a preset function command is run in response to the group call.
The communication terminal according to claim 8, wherein the communication terminal further comprises a microphone and a radio frequency module, wherein the microphone and the radio frequency module are respectively coupled to the processor, and the radio frequency module is used for receiving The speech signal.
A computer storage medium having stored thereon a computer program, characterized in that the computer program can be executed to carry out the steps of the method of any of claims 1-7.