WO2019127057A1 - Method for processing voice signal for group call, communication terminal and computer storage medium - Google Patents

Method for processing voice signal for group call, communication terminal and computer storage medium Download PDF

Info

Publication number
WO2019127057A1
WO2019127057A1 PCT/CN2017/118766 CN2017118766W WO2019127057A1 WO 2019127057 A1 WO2019127057 A1 WO 2019127057A1 CN 2017118766 W CN2017118766 W CN 2017118766W WO 2019127057 A1 WO2019127057 A1 WO 2019127057A1
Authority
WO
WIPO (PCT)
Prior art keywords
information
voice
voice signal
communication terminal
group call
Prior art date
Application number
PCT/CN2017/118766
Other languages
French (fr)
Chinese (zh)
Inventor
邓智
于海洋
陈芬
杜湘洋
Original Assignee
海能达通信股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 海能达通信股份有限公司 filed Critical 海能达通信股份有限公司
Priority to PCT/CN2017/118766 priority Critical patent/WO2019127057A1/en
Publication of WO2019127057A1 publication Critical patent/WO2019127057A1/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/725Cordless telephones

Definitions

  • the present application relates to the field of wireless terminal communication, and in particular, to a voice signal processing method for a group call, a communication terminal, and a computer storage medium.
  • the walkie-talkie As a two-way mobile communication tool, the walkie-talkie has many advantages, such as making a call without any network, so that no call charges are incurred, thereby reducing economic costs, and it is suitable for applications where relatively fixed and frequent calls are made.
  • the application provides a voice signal processing method for a group call, a communication terminal, and a computer storage medium, so that the user can pay attention to or avoid missing key information in time.
  • the present application provides a method for processing a voice signal of a group call, the method comprising: receiving a voice signal; extracting key information in the voice signal; determining whether the key information matches the pre-stored template information; if yes, running the pre- Set the function command to respond to the group call.
  • the present application further provides a communication terminal, the communication terminal is configured to receive a voice signal of a group call, the communication device includes a processor and a memory coupled to each other, and the template information is pre-stored in the memory, and the processor is used to Extract key information in the voice signal; determine whether the key information matches the template information; if yes, run the preset function command to respond to the group call.
  • the present application further provides a computer storage medium having stored thereon a computer program capable of being executed to implement the method of any of the above methods.
  • the utility model has the beneficial effects of: receiving a voice signal; extracting key information in the voice signal; determining whether the key information matches the pre-stored template information; if yes, running a preset function instruction to respond to the group call, the application can The key information matching the preset template information is extracted from the language signal of the group call, and the corresponding function is run to respond to the group call, which can solve the problem that the user cannot pay attention to or miss the key information in time.
  • FIG. 1 is a schematic flow chart of an embodiment of a method for processing a voice signal of a group call according to the present application
  • FIG. 2 is a schematic flow chart of still another embodiment of a method for processing a voice signal of a group call according to the present application
  • FIG. 3 is a schematic diagram of a set of call scenes of the present application.
  • FIG. 4 is a schematic structural diagram of an embodiment of a computer storage medium of the present application.
  • FIG. 5 is a schematic structural diagram of an embodiment of a communication terminal according to the present application.
  • FIG. 6 is a schematic structural diagram of still another embodiment of a communication terminal according to the present application.
  • the communication terminal provided by the embodiment of the present application includes an electronic device such as a smart phone, a tablet computer, a smart wearable device, a digital audio and video player, an electronic reader, and a handheld game machine.
  • an electronic device such as a smart phone, a tablet computer, a smart wearable device, a digital audio and video player, an electronic reader, and a handheld game machine.
  • first”, “second”, and “third” in this application are used for descriptive purposes only, and are not to be construed as indicating or implying a relative importance or implicitly indicating the number of technical features indicated. Thus, features defining “first”, “second”, and “third” may include at least one of the features, either explicitly or implicitly.
  • the meaning of “plurality” is at least two, for example two, three, and the like.
  • the terms “comprises” and “comprising” and “comprising” are intended to cover a non-exclusive inclusion. For example, a process, method, system, product, or device that comprises a series of steps or units is not limited to the listed steps or units, but optionally also includes steps or units not listed, or alternatively Other steps or units inherent to these processes, methods, products or equipment.
  • references to "an embodiment” herein mean that a particular feature, structure, or characteristic described in connection with the embodiments can be included in at least one embodiment of the present application.
  • the appearances of the phrases in various places in the specification are not necessarily referring to the same embodiments, and are not exclusive or alternative embodiments that are mutually exclusive. Those skilled in the art will understand and implicitly understand that the embodiments described herein can be combined with other embodiments.
  • FIG. 1 is a schematic flowchart diagram of an embodiment of a method for processing a voice signal of a group call according to the present application.
  • the voice signal processing method of the group call may include the following steps:
  • the communication terminal first receives the voice signal.
  • the voice signal is obtained by the communication terminal as the transmitting end, and the voice information may be a piece of music, a sentence, etc., and then the voice information is converted into a voice signal and sent to the communication terminal as the receiving end, as the communication terminal of the receiving end.
  • the voice signal of the transmitting end is received.
  • the number of the communication terminals on the transmitting end and the number of the receiving terminal terminals may be one-to-one or one-to-many.
  • one communication terminal as the transmitting end corresponds to a plurality of communication terminals as the receiving end, for example,
  • the base station, the control center, or the LEADER walkie-talkie can serve as the transmitting end.
  • the base station, the control center, or the LEADER walkie-talkie can serve as the transmitting end.
  • the number of the radios is one, it can be regarded as one-to-one. In this embodiment, it is not limited.
  • the voice signal can be transmitted by wired technology or by wireless technology.
  • the wired transmission can be twisted pair transmission, coaxial cable transmission, optical fiber transmission, etc.
  • the wireless transmission can be video baseband transmission, optical fiber transmission, and network. Transmission, microwave transmission, broadband common cable transmission, and so on.
  • the voice signal is generally converted into voice for playback.
  • the playback operation is not performed, but The received speech signal is analyzed, and the operation of extracting the key information in the following step S12 is performed.
  • the voice signal may include information such as frequency, loudness, pitch, text pronunciation, etc., wherein some information in the voice signal may be extracted as key information.
  • the key information may be information including the number of words, the pronunciation of the text, etc.
  • the content of the key information may be preset or may follow the default setting. It can be known that a plurality of key information may be included in one voice signal.
  • step S13 it is judged whether or not matching with the pre-stored template information.
  • step S12 After the key information of the voice signal is extracted in the above step S12, the pre-stored template information is matched in this step S13, and if it matches, the following step S14 is performed.
  • the template information is fixed information used for comparison with key information. Specifically, before using the communication terminal to carry out activities, the user adjusts the template information according to the current personnel and activity content, such as activity code, password, and the like. At this time, the corresponding template information is set. Generally speaking, the template information is not modified before the end of the activity, and the template information is set on the communication terminal of the receiving end.
  • step S12 and step S13 are repeated, that is, key information is extracted and matched with the pre-stored template information for each received voice signal.
  • the communication terminal at the transmitting end makes a group call to the communication terminal at the receiving end
  • the communication terminal at the receiving end receives a voice signal whose content is “Zhang San, Li Si, please reply to your location”, according to the rules for extracting key information. Extracting key information from the content of the received voice signal, wherein the template information of the communication terminal of the first receiving end is set to the name of the owner, "Zhang San", and the template information of the communication terminal of the second receiving end is set as the owner
  • the name "Li Si” the template information of the communication terminal of the third receiving end is set to the name of the owner "Wang Wu", at this time, for the communication terminals of the first and third receiving ends, the voice signal is extracted.
  • the key information matches the template information.
  • the following step S14 is performed, and for the communication device of the second receiving end, the key information extracted in the voice signal does not match the template information, and no operation is performed at this time.
  • a function instruction is an instruction that implements a certain function. Generally speaking, it may be a voice information that illuminates a display screen, vibrates, and plays a voice signal, and the function instruction may follow a default setting or a preset, for example,
  • the communication terminal receives the voice signal
  • the voice information converted by the voice signal is played after the default vibration, and the user may preset the reminder ringtone according to the personal preference, that is, the voice message is converted by playing the ringtone and then the voice signal is converted.
  • the function command is preset. When the key information matches the pre-stored template information, the preset function command will be run in response to the group call.
  • the key information is matched with the pre-stored template information to determine whether the received voice signal is important information to be concerned, and if matched, the preset is run.
  • the function command in response to the group call, enables the user to pay attention to or avoid missing key information in time to improve the user experience.
  • FIG. 2 is a schematic flowchart diagram of still another embodiment of a method for processing a voice signal of a group call according to the present application.
  • the voice signal processing method of the group call may include the following steps:
  • the template information is stored in advance before the voice signal processing of the group call is performed, and the step of storing the template information may include acquiring voice data, and extracting keyword voice information in the voice data, where the method further includes: according to the preset keyword.
  • the keyword voice information in the voice data is extracted, and the keyword voice information is saved as a voice template. Steps S21 to S23 are put together for explanation.
  • a communication terminal held by the group leader as a transmitting end and a communication terminal held by the group member as a receiving end of the subsystem form a contact system, and the voice signal transmits a radio frequency signal through the communication terminal of the transmitting end.
  • the receiving terminal After receiving the communication terminal, the receiving terminal sends the RF subsystem to perform related processing.
  • a voice template containing key information is also established in the communication terminal of each receiving end.
  • the keyword voice information is first recorded by an external microphone to obtain voice data, and then the voice data is converted into a voice analog signal, and then the voice analog signal is sent to the CODEC (supporting video and audio compression (CO) and solution.
  • compression( DEC) The codec or software) chip module performs analog conversion and amplification related processing, and the processed speech signal enters the storage mode for correlation processing.
  • the storage mode may include the following steps: detecting the keyword voice signal, and if the detection is successful, extracting the keyword voice information, and extracting the keyword voice information to store the final voice template.
  • the keyword voice information that the communication terminal of the receiving end receives through the microphone is from the group leader, and is not a member of the group holding the communication terminal.
  • the voice recognition technology is used for recognition.
  • the voice of the group leader may also include tone, sound color, and accent. And so on, so it can be used as a limit when identifying, in case other people know that the keyword content interferes with the information of the group members.
  • the keyword voice information that the communication terminal of the receiving end receives through the microphone is from a group member holding the communication terminal. Specifically, when the communication terminal records the keyword voice information, the voice recognition technology recognizes, that is, the voice information content is recognized.
  • the keyword voice information received by the communication terminal of the receiving end through the microphone is from the group leader, but the keyword voice information is pre-recorded group length voice information, and is stored in the communication terminal of the receiving end, and is used in use.
  • the different time of the activity time uses different keyword voice information, which is divided into three keyword voice information at the beginning of the month, the middle of the month and the end of the month, that is, the three key words of the month at the beginning of the month, the middle of the month and the end of the month, so at each event , the corresponding keyword information will be selected according to the activity time.
  • the voice signal is detected, and the keyword voice information that is successfully detected is extracted, and the extracted keyword voice information is stored as a final voice template.
  • the user can use the microphone to record the keyword voice information in multiple ways.
  • the user can have the following two types. The first one can only say the keyword that is desired to be recorded, and the key is recognized by the voice recognition technology. Word information, for example, when the keyword is set to "Zhang San", only the word "Zhang San” is recorded when recording the voice message; the second is that the text information of the keyword can be input in advance, and then recorded by voice, only If the keyword that is desired to be recorded is said to contain a keyword, the keyword voice information is extracted according to the text information of the keyword input in advance.
  • the keyword when the keyword is set to "Zhang San", the keyword input in advance is input.
  • the text message is "Zhang San”.
  • the keyword voice information When recording voice messages, say “Zhang San please answer”.
  • the keyword voice information will be extracted based on the text information.
  • the extracted keyword voice information is stored to form a voice template, wherein the voice template is stored in the communication terminal at the receiving end, and is used to receive the voice signal in step S26 described below.
  • the key information in the comparison.
  • the content of the keyword in the voice template in general, it may be an activity code, a secret number, a name of the owner, and the like.
  • the voice template is used. Set to the owner's name.
  • the group leader A wants to release the task to the member C, he will first call the name of the member C, and then publish the task, so the voice template is set to the machine.
  • the main name can effectively filter out what information the owner needs to pay attention to.
  • the communication terminal first receives the voice signal.
  • the voice signal is collected by the communication terminal of the sending end first, wherein the voice information may be a piece of music, a sentence, etc., but in the embodiment, the voice information is a paragraph, and then the voice information is converted into a voice signal.
  • the communication terminal sent to the receiving end receives the voice signal of the transmitting end on the communication terminal of the receiving end.
  • the voice signal can be transmitted by using a wired technology or by using a wireless technology.
  • the voice signal is transmitted by using a wireless technology, and the radio terminal is used, that is, the communication terminal at the receiving end receives the voice based on the radio frequency technology. Signal.
  • the voice signal is generally converted into voice information for playing, but in this embodiment, after the voice signal is received, the playback operation is not performed, but The received speech signal is analyzed, and the operation of extracting the key information in the following step S25 is performed.
  • the key information in the speech signal is extracted in this step S25.
  • the keyword information may be information including the number of words, the pronunciation of the text, etc.
  • the content of the key information may be preset or may follow a default setting, for example, when the number of words of the content of the default key information is set to three words.
  • the received speech signal is extracted one by one, the adjacent three words are combined to form a key information, for example, as shown in FIG.
  • FIG. 3 is a schematic diagram of a set of call scenes of the present application.
  • each of the three words in the content constitutes a key message, that is, extracting "Group C", “C-C”, “C-member”, “C-C”, “C here”, etc.
  • the word composed of words is the key information in the speech signal, that is, a speech signal can contain multiple key information.
  • step S26 it is judged whether or not matching with the pre-stored template information.
  • step S26 the voice template pre-stored in the above step S23 is matched with the key information extracted in the above step S25. If it matches, the following step S27 is performed, and if it does not match, no operation is performed.
  • the voice templates of the communication terminals of each group member are the respective names recorded by the LEADER, for example, "group member A” and “group member B”. , “group member C”, “group member D”, “group member E”, etc., the content of the key information set by the communication terminal of each receiving end is the number of words is three, when the communication terminal of the transmitting end held by the group leader is to the receiving end.
  • the communication terminal performs a group call, all the communication terminals of the receiving end that all the group members hold receive the same voice signal, and the content thereof is “group member C, group member E, please reply to your position”, at this time, each receiving end
  • the communication terminal extracts the key information in the voice signal and matches the voice template pre-stored in each communication terminal, and the matching result of the group member A, the group member B, and the group member D is no, that is, the matching fails, then their communication terminal The original state is maintained, and no operation is performed, and the result of the match between the member
  • the preset function command is executed to respond to the group call.
  • the preset function commands can perform related expansion functions, that is, there can be multiple function commands to meet various needs of the user's various group call scenes, for example, voice information for realizing lighting display, vibration, and playback of voice signal conversion, etc.
  • the function command may be a default setting or a preset.
  • the function command for running the preset includes an instruction to operate the volume, vibration or flash, and an instruction to save the voice signal to the voice signal.
  • the sender transmits location information.
  • the LEADER when the LEADER walkie-talkie initiates a group call, the LEADER communicates the instructions for assigning tasks to each group member, and calls the group member C by voice.
  • the important voice information sent by the LEADER walkie-talkie will be missed.
  • LEADER will repeatedly call group member C because he has not received the response, until group member C replies.
  • the voice template set by the walkie-talkie held by the group member C is the owner name.
  • the intercommunication opportunity of the group member C matches the received voice signal pre-stored voice template voice. After the matching is successful, the walkie-talkie of the member C will automatically switch to the LED flash mode or the vibration mode.
  • the form of the LED flash mode can be various, for example, the setting flashing time or the number of flashing times, when the intercom includes When multiple LED lights are used, the number of LED flashes can be set, etc.
  • the vibration mode can be in the form of setting the duration or frequency of the motor vibration. The duration or frequency can be preset or follow the default, in this embodiment.
  • the frequency can be 1 second after each vibration for 5 seconds, the vibration or flashing light of the member C intercom will cause the attention of the member C, so that the member C finds that the walkie-talkie has received the voice information.
  • the voice information is smoothly received; the walkie-talkie of the member C will enlarge the volume when playing the voice information, so that the member C can hear the contents of the task assigned by the LEADER; however,
  • the intercommunication opportunity of the member C turns on the automatic recording mode, that is, the walkie-talkie of the member C automatically saves the voice information after the keyword matching and supports playback, for example, setting the time to 30 seconds, which can avoid If the member C misses the voice information that needs attention, it cannot be retrieved.
  • the time of the missed call can be preset or the default setting can be followed.
  • the intercom will automatically send the location information of the crew C to the LEADER, which can enable the LEADER to confirm the rescue location when the crew C is in danger, saving the rescue time.
  • the time of the unanswered time can be preset or the default setting, which is not limited here.
  • the key information is matched with the pre-stored template information to determine whether the received voice signal is important information to be concerned, and if matched, the preset is run.
  • the function instruction is used to remind the user to respond to the group call.
  • the key information matching the preset template information can be extracted from the language signal of the group call, and the corresponding function is executed to respond to the group call, so that the user can timely pay attention. Or avoid missing key information and improve the user experience.
  • the above method is applied to a communication terminal, and the logic process thereof is represented by a computer program, and is specifically implemented by a communication terminal.
  • FIG. 4 is a schematic structural diagram of an embodiment of a computer storage medium according to the present application.
  • Program data in the computer storage medium 100 can be executed to implement the method of the foregoing embodiment.
  • the computer storage medium can be, for example, a USB flash drive or an optical disk. , server, etc.
  • FIG. 5 is a schematic structural diagram of an embodiment of a communication terminal according to the present application.
  • the communication terminal 200 of the embodiment includes a processor 21, a memory 22, a microphone 23, and a radio frequency module 24.
  • the processor 21 is coupled to the memory 22, the microphone 23, and the radio frequency module 24.
  • the program data is stored in the memory 22, and the processor 21 can load the program.
  • the data is executed and implemented to implement the voice signal processing method of the group call, and the radio frequency module 24 is configured to receive the voice signal.
  • the processor 21 is configured to extract key information in the voice signal, determine whether the key information matches the template information, and if the key information matches the template information, run the preset function instruction to respond to the group call.
  • the method for processing the voice signal of the group call in the communication terminal of the embodiment is similar to the embodiment of the foregoing embodiment.
  • the specific implementation steps refer to FIG. 1 or FIG. 2, and details are not described herein. .
  • the communication terminal that transmits the voice information and the communication terminal that receives the voice information may be two different communication terminals. Specifically, in a group call scene, the communication terminal as the transmitting end sends a voice signal to the receiving end. The communication terminal receives the key information in the voice signal, and determines whether the key information matches the template information; if yes, runs the preset function command to respond to the communication terminal of the sender.
  • the communication terminal of this embodiment enables the user to pay attention to or avoid missing key information in time to improve the user experience.
  • FIG. 6 is a schematic structural diagram of still another embodiment of a communication terminal according to the present application.
  • the communication terminal 200 is the communication terminal in the above embodiment, and the communication terminal 200 includes a receiving module 31, an extracting module 32, a determining module 33, and an operating module 34.
  • the receiving module 31 is configured to receive a voice signal.
  • the extraction module 32 is configured to extract key information in the voice signal.
  • the determining module 33 is configured to determine whether the key information matches the pre-stored template information.
  • the running module 34 is configured to: when the key information matches the pre-stored template information, run the preset function instruction to respond to the group call.
  • the communication terminal of this embodiment enables the user to pay attention to or avoid missing key information in time to improve the user experience.

Abstract

The present application relates to the field of wireless terminal communications, and provided thereby are a method for processing a voice signal for a group call, a communication terminal, and a computer storage medium. Provided by the present application is a method for processing a voice signal for a group call, the method comprising: receiving a voice signal; extracting key information from the voice signal; determining whether the key information matches pre-stored template information; if so, running a preset function command so as to respond to a group call. The method of the present application may solve the existing problem wherein key information cannot be followed promptly or is missed.

Description

组呼的语音信号处理方法、通讯终端以及计算机存储介质 Voice signal processing method for group call, communication terminal and computer storage medium
【技术领域】[Technical Field]
本申请涉及无线终端通信领域,特别是涉及一种组呼的语音信号处理方法、通讯终端以及计算机存储介质。The present application relates to the field of wireless terminal communication, and in particular, to a voice signal processing method for a group call, a communication terminal, and a computer storage medium.
【背景技术】 【Background technique】
对讲机作为一种双向移动通信工具,具有诸多优点,例如不需要任何网络的情况下就可以进行通话,因此不会产生话费,从而减少经济成本,适合应用于相对固定且频繁通话的场合等等。As a two-way mobile communication tool, the walkie-talkie has many advantages, such as making a call without any network, so that no call charges are incurred, thereby reducing economic costs, and it is suitable for applications where relatively fixed and frequent calls are made.
随着对讲机在群体专业业务中使用的频率逐渐增多,对讲机在组呼方面的应用越来越受到广泛的关注,然而在组呼应用场景中,难免会出现接收到的信息包含许多无用信息的情况,造成接受信息冗余,往往会不能及时关注或者漏掉关键信息,导致用户体验不佳。As the frequency of use of walkie-talkies in group professional services is increasing, the application of walkie-talkies in group calls is receiving more and more attention. However, in the group call application scenario, it is inevitable that the received information contains many useless information. As a result of receiving information redundancy, it is often impossible to pay attention to or miss key information in time, resulting in poor user experience.
【发明内容】 [Summary of the Invention]
本申请提供一种组呼的语音信号处理方法、通讯终端、计算机存储介质,以使用户能够及时关注或者避免漏掉关键信息。The application provides a voice signal processing method for a group call, a communication terminal, and a computer storage medium, so that the user can pay attention to or avoid missing key information in time.
为解决上述技术问题,本申请提供一种组呼的语音信号处理方法,该方法包括接收语音信号;提取语音信号中的关键信息;判断关键信息与预存的模板信息是否匹配;若是,则运行预设的功能指令,以响应组呼。To solve the above technical problem, the present application provides a method for processing a voice signal of a group call, the method comprising: receiving a voice signal; extracting key information in the voice signal; determining whether the key information matches the pre-stored template information; if yes, running the pre- Set the function command to respond to the group call.
为解决上述技术问题,本申请又提供一种通讯终端,通讯终端用于接收组呼的语音信号,该通讯设备包括相互耦接的处理器和存储器,存储器中预存有模板信息,该处理器用于提取语音信号中的关键信息;判断关键信息与模板信息是否匹配;若是,则运行预设的功能指令,以响应组呼。To solve the above technical problem, the present application further provides a communication terminal, the communication terminal is configured to receive a voice signal of a group call, the communication device includes a processor and a memory coupled to each other, and the template information is pre-stored in the memory, and the processor is used to Extract key information in the voice signal; determine whether the key information matches the template information; if yes, run the preset function command to respond to the group call.
为解决上述技术问题,本申请另提供一种计算机存储介质,其上存储有计算机程序,该计算机程序能够被执行以实现上述方法中任一项的方法。In order to solve the above technical problems, the present application further provides a computer storage medium having stored thereon a computer program capable of being executed to implement the method of any of the above methods.
本申请的有益效果是:通过接收语音信号;提取语音信号中的关键信息;判断关键信息与预存的模板信息是否匹配;若是,则运行预设的功能指令,以响应组呼,本申请能够从组呼的语言信号中提取出匹配预设模板信息的关键信息,并且运行相应的功能来响应该组呼,可以解决用户无法及时关注或者漏掉关键信息的问题。The utility model has the beneficial effects of: receiving a voice signal; extracting key information in the voice signal; determining whether the key information matches the pre-stored template information; if yes, running a preset function instruction to respond to the group call, the application can The key information matching the preset template information is extracted from the language signal of the group call, and the corresponding function is run to respond to the group call, which can solve the problem that the user cannot pay attention to or miss the key information in time.
【附图说明】 [Description of the Drawings]
为了更清楚地说明本申请实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the embodiments of the present application or the technical solutions in the prior art, the drawings to be used in the embodiments or the prior art description will be briefly described below. Obviously, the drawings in the following description are only It is a certain embodiment of the present application, and other drawings can be obtained according to the drawings without any creative work for those skilled in the art.
图1 是本申请一种组呼的语音信号处理方法一实施例的流程示意图;1 is a schematic flow chart of an embodiment of a method for processing a voice signal of a group call according to the present application;
图2是本申请一种组呼的语音信号处理方法又一实施例的流程示意图;2 is a schematic flow chart of still another embodiment of a method for processing a voice signal of a group call according to the present application;
图3是本申请一组呼场景示意图;3 is a schematic diagram of a set of call scenes of the present application;
图4是本申请计算机存储介质一实施例的结构示意图;4 is a schematic structural diagram of an embodiment of a computer storage medium of the present application;
图5是本申请一种通讯终端一实施例的结构示意图;FIG. 5 is a schematic structural diagram of an embodiment of a communication terminal according to the present application; FIG.
图6是本申请一种通讯终端又一实施例的结构示意图。FIG. 6 is a schematic structural diagram of still another embodiment of a communication terminal according to the present application.
【具体实施方式】【Detailed ways】
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述。可以理解的是,此处所描述的具体实施例仅用于解释本申请,而非对本申请的限定。另外还需要说明的是,为了便于描述,附图中仅示出了与本申请相关的部分而非全部结构。基于本申请中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。The technical solutions in the embodiments of the present application will be clearly and completely described in the following with reference to the accompanying drawings in the embodiments. It is understood that the specific embodiments described herein are merely illustrative of the application and are not intended to be limiting. In addition, it should be noted that, for the convenience of description, only some but not all of the structures related to the present application are shown in the drawings. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present application without departing from the inventive scope are the scope of the present application.
本申请实施例所提供的通讯终端,包括智能手机、平板电脑、智能穿戴设备、数字音视频播放器、电子阅读器、手持游戏机等电子设备。The communication terminal provided by the embodiment of the present application includes an electronic device such as a smart phone, a tablet computer, a smart wearable device, a digital audio and video player, an electronic reader, and a handheld game machine.
本申请中的术语“第一”、“第二”、“第三”仅用于描述目的,而不能理解为指示或暗示相对重要性或者隐含指明所指示的技术特征的数量。由此,限定有“第一”、“第二”、“第三”的特征可以明示或者隐含地包括至少一个该特征。本申请的描述中,“多个”的含义是至少两个,例如两个,三个等。此外,术语“包括”和“具有”以及它们任何变形,意图在于覆盖不排他的包含。例如包含了一系列步骤或单元的过程、方法、系统、产品或设备没有限定于已列出的步骤或单元,而是可选地还包括没有列出的步骤或单元,或可选地还包括对于这些过程、方法、产品或设备固有的其它步骤或单元。The terms "first", "second", and "third" in this application are used for descriptive purposes only, and are not to be construed as indicating or implying a relative importance or implicitly indicating the number of technical features indicated. Thus, features defining "first", "second", and "third" may include at least one of the features, either explicitly or implicitly. In the description of the present application, the meaning of "plurality" is at least two, for example two, three, and the like. Furthermore, the terms "comprises" and "comprising" and "comprising" are intended to cover a non-exclusive inclusion. For example, a process, method, system, product, or device that comprises a series of steps or units is not limited to the listed steps or units, but optionally also includes steps or units not listed, or alternatively Other steps or units inherent to these processes, methods, products or equipment.
在本文中提及“实施例”意味着,结合实施例描述的特定特征、结构或特性可以包含在本申请的至少一个实施例中。在说明书中的各个位置出现该短语并不一定均是指相同的实施例,也不是与其它实施例互斥的独立的或备选的实施例。本领域技术人员显式地和隐式地理解的是,本文所描述的实施例可以与其它实施例相结合。References to "an embodiment" herein mean that a particular feature, structure, or characteristic described in connection with the embodiments can be included in at least one embodiment of the present application. The appearances of the phrases in various places in the specification are not necessarily referring to the same embodiments, and are not exclusive or alternative embodiments that are mutually exclusive. Those skilled in the art will understand and implicitly understand that the embodiments described herein can be combined with other embodiments.
请参阅图1,图1是本申请一种组呼的语音信号处理方法一实施例的流程示意图。在本实施例中,组呼的语音信号处理方法可以包括以下步骤:Please refer to FIG. 1. FIG. 1 is a schematic flowchart diagram of an embodiment of a method for processing a voice signal of a group call according to the present application. In this embodiment, the voice signal processing method of the group call may include the following steps:
S11:接收语音信号。S11: Receive a voice signal.
在本步骤S11中,通讯终端首先接收语音信号。其中,语音信号是由作为发送端的通讯终端先采集到语音信息,语音信息可以是一段音乐、一句话等等,然后将语音信息转换为语音信号发送给作为接收端的通讯终端,作为接收端的通讯终端上接收到发送端的语音信号。In this step S11, the communication terminal first receives the voice signal. The voice signal is obtained by the communication terminal as the transmitting end, and the voice information may be a piece of music, a sentence, etc., and then the voice information is converted into a voice signal and sent to the communication terminal as the receiving end, as the communication terminal of the receiving end. The voice signal of the transmitting end is received.
发送端的通讯终端的个数与接收端终端的个数可以是一对一的,也可以是一对多,在本实施例中作为发送端的一个通讯终端对应多个作为接收端的通讯终端,例如,在一个组呼场景中,基站、控制中心或LEADER对讲机可以作为发送端,与基站、控制中心或LEADER对讲机进行通信的至少两个对讲机作为接收端时,即可以看作是一对多,而当对讲机的个数为一个时,则可以看作是一对一,在本实施例中,具体不做限定。The number of the communication terminals on the transmitting end and the number of the receiving terminal terminals may be one-to-one or one-to-many. In this embodiment, one communication terminal as the transmitting end corresponds to a plurality of communication terminals as the receiving end, for example, In a group call scenario, the base station, the control center, or the LEADER walkie-talkie can serve as the transmitting end. When at least two walkie-talkies that communicate with the base station, the control center, or the LEADER walkie-talkie are used as the receiving end, they can be regarded as one-to-many. When the number of the radios is one, it can be regarded as one-to-one. In this embodiment, it is not limited.
语音信号可以通过有线技术进行传输,也可以通过无线技术进行传输,例如,有线传输可以是双绞线传输、同轴电缆传输、光纤传输等等,无线传输可以是视频基带传输、光纤传输、网络传输、微波传输、宽频共缆传输等等。一般来说,当接收端的通讯终端接收到语音信号时,一般会直接将语音信号转换为语音进行播放,但在本实施例中,在接收到语音信号后并不进行播放的操作,而是对接收到的语音信号进行分析,执行下述步骤S12中提取关键信息的操作。The voice signal can be transmitted by wired technology or by wireless technology. For example, the wired transmission can be twisted pair transmission, coaxial cable transmission, optical fiber transmission, etc., and the wireless transmission can be video baseband transmission, optical fiber transmission, and network. Transmission, microwave transmission, broadband common cable transmission, and so on. Generally, when the communication terminal of the receiving end receives the voice signal, the voice signal is generally converted into voice for playback. However, in this embodiment, after the voice signal is received, the playback operation is not performed, but The received speech signal is analyzed, and the operation of extracting the key information in the following step S12 is performed.
S12:提取语音信号中的关键信息。S12: Extract key information in the voice signal.
在上述步骤S11中接收语音信号后,在本步骤S12中提取语音信号中的关键信息。语音信号中可以包含频率、响度、音调、文字发音等信息,其中可以将语音信号中的一些信息提取出来作为关键信息。关键信息可以是包含的字数、文字发音等的信息,关键信息的内容既可以预先设定也可以遵循默认设置,可以知道的是,在一条语音信号中可以包含多个关键信息。After receiving the speech signal in the above step S11, the key information in the speech signal is extracted in this step S12. The voice signal may include information such as frequency, loudness, pitch, text pronunciation, etc., wherein some information in the voice signal may be extracted as key information. The key information may be information including the number of words, the pronunciation of the text, etc. The content of the key information may be preset or may follow the default setting. It can be known that a plurality of key information may be included in one voice signal.
当接收端的通讯终端接收到的语音信号时,一般来说,会直接进行会对语音信号播放的操作,但本实施例中并不直接播放语音信号,而是提取到的关键信息并用于在下述步骤S13中判断是否与预存的模板信息进行匹配。When the voice signal received by the communication terminal at the receiving end, generally speaking, the operation of playing the voice signal is directly performed, but in this embodiment, the voice signal is not directly played, but the key information extracted is used and used in the following In step S13, it is judged whether or not matching with the pre-stored template information.
S13:判断关键信息与预存的模板信息是否匹配。 S13: Determine whether the key information matches the pre-stored template information.
在上述步骤S12中提取到语音信号的关键信息后,在本步骤S13中与预存的模板信息进行匹配,若匹配则执行下述步骤S14。After the key information of the voice signal is extracted in the above step S12, the pre-stored template information is matched in this step S13, and if it matches, the following step S14 is performed.
模板信息即为用于与关键信息进行比对的固定信息,具体来说,用户在使用通讯终端开展活动之前,会根据当前人员及活动内容对模板信息进行调整,例如活动代号、暗号等等,此时相对应的设置模板信息,一般来说,在该次活动结束之前都不会对模板信息进行修改,其中模板信息是设置在接收端的通讯终端上的。当接收端的通讯终端接收到语音信息时,则会重复步骤S12和步骤S13,即对每一条接收到的语音信号都会进行关键信息的提取以及与预存的模板信息进行匹配。例如,当发送端的通讯终端对接收端的通讯终端进行组呼时,接收端的通讯终端接收到一条语音信号,其内容为“张三、李四请回复你们的位置”,会根据提取关键信息的规则对接收到的语音信号中的内容进行关键信息的提取,其中,第一接收端的通讯终端的模板信息设为机主的名字“张三”,第二接收端的通讯终端的模板信息设为机主的名字“李四”,第三接收端的通讯终端的模板信息设为机主的名字“王五”,此时,对于第一、第三接收端的通讯终端来说,在语音信号中提取出来的关键信息匹配模板信息,此时执行下述步骤S14,而对于第二接收端的通讯装置来说,在语音信号中提取出来的关键信息不匹配模板信息,此时不做任何操作。The template information is fixed information used for comparison with key information. Specifically, before using the communication terminal to carry out activities, the user adjusts the template information according to the current personnel and activity content, such as activity code, password, and the like. At this time, the corresponding template information is set. Generally speaking, the template information is not modified before the end of the activity, and the template information is set on the communication terminal of the receiving end. When the communication terminal of the receiving end receives the voice information, step S12 and step S13 are repeated, that is, key information is extracted and matched with the pre-stored template information for each received voice signal. For example, when the communication terminal at the transmitting end makes a group call to the communication terminal at the receiving end, the communication terminal at the receiving end receives a voice signal whose content is “Zhang San, Li Si, please reply to your location”, according to the rules for extracting key information. Extracting key information from the content of the received voice signal, wherein the template information of the communication terminal of the first receiving end is set to the name of the owner, "Zhang San", and the template information of the communication terminal of the second receiving end is set as the owner The name "Li Si", the template information of the communication terminal of the third receiving end is set to the name of the owner "Wang Wu", at this time, for the communication terminals of the first and third receiving ends, the voice signal is extracted. The key information matches the template information. At this time, the following step S14 is performed, and for the communication device of the second receiving end, the key information extracted in the voice signal does not match the template information, and no operation is performed at this time.
S14:运行预设的功能指令,以响应组呼。S14: Run a preset function command to respond to the group call.
在上述步骤S13中判断出关键信息与预存的模板信息相匹配时,执行本步骤S14运行预设的功能指令,以响应组呼。功能指令是指实现某项功能的指令,一般来说,可以是实现点亮显示屏、振动、播放语音信号转换的语音信息等等,功能指令可以是遵循默认设置也可以是预设的,例如,当通讯终端收到语音信号时,默认振动后再播放语音信号转换的语音信息,还可以是用户根据个人喜好预设提醒铃声,即先播放铃声再播放语音信号转换的语音信息,本实施例中,功能指令是预先设置好的,当关键信息与预存的模板信息相匹配时,预设的功能指令会被运行,以响应组呼。When it is determined in the above step S13 that the key information matches the pre-stored template information, the step S14 is executed to execute the preset function instruction in response to the group call. A function instruction is an instruction that implements a certain function. Generally speaking, it may be a voice information that illuminates a display screen, vibrates, and plays a voice signal, and the function instruction may follow a default setting or a preset, for example, When the communication terminal receives the voice signal, the voice information converted by the voice signal is played after the default vibration, and the user may preset the reminder ringtone according to the personal preference, that is, the voice message is converted by playing the ringtone and then the voice signal is converted. The function command is preset. When the key information matches the pre-stored template information, the preset function command will be run in response to the group call.
本实施例通过接收语音信号,然后提取语音信号中的关键信息,将关键信息与预存的模板信息进行匹配来判断接收到的语音信号是否为要关注的重要信息,若匹配,则运行预设的功能指令,以响应组呼,可以使用户能够及时关注或者避免漏掉关键信息,提高用户体验。In this embodiment, by receiving a voice signal, and then extracting key information in the voice signal, the key information is matched with the pre-stored template information to determine whether the received voice signal is important information to be concerned, and if matched, the preset is run. The function command, in response to the group call, enables the user to pay attention to or avoid missing key information in time to improve the user experience.
请参阅图2,图2是本申请一种组呼的语音信号处理方法又一实施例的流程示意图。在本实施例中,组呼的语音信号处理方法可以包括以下步骤:Please refer to FIG. 2. FIG. 2 is a schematic flowchart diagram of still another embodiment of a method for processing a voice signal of a group call according to the present application. In this embodiment, the voice signal processing method of the group call may include the following steps:
S21:获取语音数据。S21: Acquire voice data.
S22:提取语音数据中的关键字语言信息。S22: Extract keyword language information in the voice data.
S23:将关键字语言信息保存为模板信息。S23: Save the keyword language information as template information.
在本实施例中,在进行组呼的语音信号处理之前会预先存储模板信息,存储模板信息的步骤可以包括获取语音数据,提取语音数据中的关键字语音信息,其中还包括根据预设关键字提取语音数据中的关键字语音信息,将关键字语音信息保存为语音模板,下面将步骤S21~S23放到一起进行说明。In this embodiment, the template information is stored in advance before the voice signal processing of the group call is performed, and the step of storing the template information may include acquiring voice data, and extracting keyword voice information in the voice data, where the method further includes: according to the preset keyword. The keyword voice information in the voice data is extracted, and the keyword voice information is saved as a voice template. Steps S21 to S23 are put together for explanation.
例如,在一个组呼场景中,一般是由组长持有作为发送端的通讯终端和组员们持有作为子系统接收端的通讯终端组成一个联络系统,语音信号通过发送端的通讯终端发送射频信号,接收端的通讯终端接收到后送入射频子系统进行相关的处理。For example, in a group call scenario, a communication terminal held by the group leader as a transmitting end and a communication terminal held by the group member as a receiving end of the subsystem form a contact system, and the voice signal transmits a radio frequency signal through the communication terminal of the transmitting end. After receiving the communication terminal, the receiving terminal sends the RF subsystem to perform related processing.
在本实施例中,为了能够得到关键信息,还在每一个接收端的通讯终端中建立包含关键信息的语音模板。具体来说,首先通过外部的麦克风进行关键字语音信息录制以获取语音数据,然后将语音数据转换为语音模拟信号后,再将语音模拟信号送入CODEC(支持视频和音频压缩(CO)与解压缩( DEC ) 的编解码器或软件)芯片模块进行模拟转换和放大相关处理,处理后的语音信号进入到存储模式中进行相关处理。其中,存储模式可以包括以下步骤,将关键字语音信号进行检测,若检测成功,则对关键字语音信息进行提取,提取后的关键字语音信息进行存储以形成最终的语音模板。In this embodiment, in order to obtain key information, a voice template containing key information is also established in the communication terminal of each receiving end. Specifically, the keyword voice information is first recorded by an external microphone to obtain voice data, and then the voice data is converted into a voice analog signal, and then the voice analog signal is sent to the CODEC (supporting video and audio compression (CO) and solution. compression( DEC) The codec or software) chip module performs analog conversion and amplification related processing, and the processed speech signal enters the storage mode for correlation processing. The storage mode may include the following steps: detecting the keyword voice signal, and if the detection is successful, extracting the keyword voice information, and extracting the keyword voice information to store the final voice template.
对于上述通过外部的麦克风进行关键字语音信息录制以获取语音数据的方式可以有多种,在本实施例中,可以有如下三种方式:There are a plurality of ways for the voice information to be recorded by the external microphone to obtain the voice data. In this embodiment, the following three methods are available:
第一种方式,接收端的通讯终端通过麦克风录取的关键字语音信息是来自于组长的,并非持有该通讯终端的组员。具体来说,通讯终端录制到关键字语音信息时,通过语音识别技术进行识别,除了基本的识别语音信息,还可以增加其他条件,例如,组长发出的声音中还可以包含声调、声色、口音等特点,因此在识别时可以作为限定,以防其他人得知关键字内容对组员进行信息干扰。In the first method, the keyword voice information that the communication terminal of the receiving end receives through the microphone is from the group leader, and is not a member of the group holding the communication terminal. Specifically, when the communication terminal records the keyword voice information, the voice recognition technology is used for recognition. In addition to the basic voice information, other conditions may be added. For example, the voice of the group leader may also include tone, sound color, and accent. And so on, so it can be used as a limit when identifying, in case other people know that the keyword content interferes with the information of the group members.
第二种方式,接收端的通讯终端通过麦克风录取的关键字语音信息是来自于持有该通讯终端的组员。具体来说,通讯终端录制到关键字语音信息时,通过语音识别技术进行识别,即识别语音信息内容即可。In the second mode, the keyword voice information that the communication terminal of the receiving end receives through the microphone is from a group member holding the communication terminal. Specifically, when the communication terminal records the keyword voice information, the voice recognition technology recognizes, that is, the voice information content is recognized.
第三种方式,接收端的通讯终端通过麦克风录取的关键字语音信息是来自于组长的,但是该关键字语音信息是预先录制组长语音信息,存储到接收端的通讯终端中,在使用时通过编程选项进行勾选。具体来说,在小组在活动时,常用的关键字语音信息可以有多个,进行相关的设定就可以避免每次活动前都要重新设定关键字语音信息,减少操作步骤,例如,根据活动时间的不同使用不同的关键字语音信息,共分为月初、月中、月末三个关键字语音信息,即月初、月中、月末分别对应三个关键字语音信息,因此在每次活动时,会根据活动时间选择相对应的关键词信息即可。In the third mode, the keyword voice information received by the communication terminal of the receiving end through the microphone is from the group leader, but the keyword voice information is pre-recorded group length voice information, and is stored in the communication terminal of the receiving end, and is used in use. Check the programming options. Specifically, when the group is active, there may be more than one common keyword voice information. To perform related settings, it is possible to avoid resetting the keyword voice information before each activity, and reduce the operation steps, for example, according to The different time of the activity time uses different keyword voice information, which is divided into three keyword voice information at the beginning of the month, the middle of the month and the end of the month, that is, the three key words of the month at the beginning of the month, the middle of the month and the end of the month, so at each event , the corresponding keyword information will be selected according to the activity time.
在上述存储模式中,会将语音信号进行检测,并对检测成功的关键字语音信息进行提取,将提取后的关键字语音信息存储成为最终的语音模板。具体来说,用户在使用麦克风录制关键字语音信息时可以有多种方式,在本实施例中可以有如下两种,第一种是可以只说期望录制的关键字,通过语音识别技术识别关键字信息,例如,设置关键字为“张三”时,录制语音信息时只说“张三”二字;第二种是可以预先输入关键字的文字信息,然后在通过语音录制,既可以只说期望录制的关键字,也可以说一句包含关键字的话,根据预先输入的关键字的文字信息对关键字语音信息进行提取,例如,设置关键字为“张三”时,预先输入的关键字的文字信息为“张三”,录制语音信息时则说“张三请回答”,此时则会根据文字信息进行提取关键字语音信息。在提取到关键词语音信息后,将该提取出的关键字语音信息进行存储以形成语音模板,其中,语音模板是存储在接收端的通讯终端中,用于在下述步骤S26中与接收到语音信号中的关键信息进行对比的。In the above storage mode, the voice signal is detected, and the keyword voice information that is successfully detected is extracted, and the extracted keyword voice information is stored as a final voice template. Specifically, the user can use the microphone to record the keyword voice information in multiple ways. In this embodiment, the user can have the following two types. The first one can only say the keyword that is desired to be recorded, and the key is recognized by the voice recognition technology. Word information, for example, when the keyword is set to "Zhang San", only the word "Zhang San" is recorded when recording the voice message; the second is that the text information of the keyword can be input in advance, and then recorded by voice, only If the keyword that is desired to be recorded is said to contain a keyword, the keyword voice information is extracted according to the text information of the keyword input in advance. For example, when the keyword is set to "Zhang San", the keyword input in advance is input. The text message is "Zhang San". When recording voice messages, say "Zhang San please answer". At this time, the keyword voice information will be extracted based on the text information. After extracting the keyword voice information, the extracted keyword voice information is stored to form a voice template, wherein the voice template is stored in the communication terminal at the receiving end, and is used to receive the voice signal in step S26 described below. The key information in the comparison.
对于语音模板中关键字的内容,一般来说,可以是活动代号、暗号、机主姓名等等,在本实施例中,为了使接收端的用户避免接收到无用信息或者漏掉关键信息,语音模板设置为机主姓名,一般来说,在一个活动中,组长A想要给组员C发布任务的时候,会首先呼叫组员C的名字,然后再发布任务,因此将语音模板设置为机主姓名可以有效的筛选出哪些信息是机主需要关注的。For the content of the keyword in the voice template, in general, it may be an activity code, a secret number, a name of the owner, and the like. In this embodiment, in order to prevent the user at the receiving end from receiving unnecessary information or missing key information, the voice template is used. Set to the owner's name. Generally speaking, in an activity, when the group leader A wants to release the task to the member C, he will first call the name of the member C, and then publish the task, so the voice template is set to the machine. The main name can effectively filter out what information the owner needs to pay attention to.
S24:接收语音信号。S24: Receive a voice signal.
在本步骤S24中,通讯终端首先接收语音信号。其中,语音信号是由发送端的通讯终端先采集到语音信息,其中,语音信息可以是一段音乐、一句话等等,而在本实施例中语音信息是一段话,然后将语音信息转换为语音信号发送给接收端的通讯终端,接收端的通讯终端上接收到发送端的语音信号。语音信号可以通过有线技术进行传输,也可以通过无线技术进行传输,在本实施例中,语音信号是通过无线技术进行传输,使用的是射频技术,即接收端的通讯终端是基于射频技术接收到语音信号的。In this step S24, the communication terminal first receives the voice signal. The voice signal is collected by the communication terminal of the sending end first, wherein the voice information may be a piece of music, a sentence, etc., but in the embodiment, the voice information is a paragraph, and then the voice information is converted into a voice signal. The communication terminal sent to the receiving end receives the voice signal of the transmitting end on the communication terminal of the receiving end. The voice signal can be transmitted by using a wired technology or by using a wireless technology. In this embodiment, the voice signal is transmitted by using a wireless technology, and the radio terminal is used, that is, the communication terminal at the receiving end receives the voice based on the radio frequency technology. Signal.
一般来说,当接收端的通讯终端接收到语音信号时,一般会直接将语音信号转换为语音信息进行播放,但在本实施例中,在接收到语音信号后并不进行播放的操作,而是对接收到的语音信号进行分析,执行下述步骤S25中提取关键信息的操作。Generally, when the communication terminal of the receiving end receives the voice signal, the voice signal is generally converted into voice information for playing, but in this embodiment, after the voice signal is received, the playback operation is not performed, but The received speech signal is analyzed, and the operation of extracting the key information in the following step S25 is performed.
S25:提取语音信号中的关键信息。S25: Extract key information in the voice signal.
在上述步骤S24中接收语音信号后,在本步骤S25中提取语音信号中的关键信息。具体来说,关键字信息可以是包含的字数、文字发音等的信息,关键信息的内容既可以预先设定也可以遵循默认设置,例如,当默认关键信息的内容的字数设定为三个字时,则在接收到的语音信号中逐一提取相邻三个字组成一个关键信息,例如,如图3 所示,图3是本申请一组呼场景示意图,当LEADER对讲机发送的语音信号的内容为“组员C,组员C,这里是LEADER,请移动三楼南侧窗户处,收到请回答“,则将该内容中每三个字组成一个关键信息,即提取“组员C”、“员C组”、“C组员”、“员C这”、“C这里”等每三个字组成的词为语音信号中的关键信息,即一条语音信号中可以包含多个关键信息。After receiving the speech signal in the above step S24, the key information in the speech signal is extracted in this step S25. Specifically, the keyword information may be information including the number of words, the pronunciation of the text, etc., and the content of the key information may be preset or may follow a default setting, for example, when the number of words of the content of the default key information is set to three words. When the received speech signal is extracted one by one, the adjacent three words are combined to form a key information, for example, as shown in FIG. As shown in the figure, FIG. 3 is a schematic diagram of a set of call scenes of the present application. When the content of the voice signal sent by the LEADER walkie-talkie is “group member C, group member C, here is LEADER, please move the window on the south side of the third floor, please answer. ", then each of the three words in the content constitutes a key message, that is, extracting "Group C", "C-C", "C-member", "C-C", "C here", etc. The word composed of words is the key information in the speech signal, that is, a speech signal can contain multiple key information.
当接收端的通讯终端接收到的语音信号时,一般来说,会直接进行会对语音信号播放的操作,但本实施例中并不直接播放语音信号,而是提取到的关键信息并用于在下述步骤S26中判断是否与预存的模板信息进行匹配。When the voice signal received by the communication terminal at the receiving end, generally speaking, the operation of playing the voice signal is directly performed, but in this embodiment, the voice signal is not directly played, but the key information extracted is used and used in the following In step S26, it is judged whether or not matching with the pre-stored template information.
S26:判断关键信息与预存的模板信息是否匹配。S26: Determine whether the key information matches the pre-stored template information.
在本步骤S26中将上述步骤S23中预存的语音模板与上述步骤S25中提取出的关键信息进行匹配,若匹配,则执行下述步骤S27,若不匹配,则不进行任何操作。In this step S26, the voice template pre-stored in the above step S23 is matched with the key information extracted in the above step S25. If it matches, the following step S27 is performed, and if it does not match, no operation is performed.
具体来说,如图3中所示,在一个组呼场景中,每个组员的通讯终端的语音模板均为LEADER录制的各自的名称,例如,“组员A”、“组员B”、“组员C”、“组员D”、“组员E”等等,各个接收端的通讯终端设置的关键信息的内容为字数为三,当组长持有的发送端的通讯终端对接收端的通讯终端进行组呼时,所有组员持有的多个接收端的通讯终端都接收到同一条语音信号,其内容为“组员C、组员E请回复你们的位置”,此时各个接收端的通讯终端都提取该条语音信号中的关键信息与各通讯终端中预存的语音模板进行匹配,组员A、组员B以及组员D的匹配结果为否,即匹配失败,那么他们的通讯终端保持原来的状态,不进行任何操作,而组员C、组员E两人的匹配结果为是,即匹配成功,此时执行下述步骤S27。Specifically, as shown in FIG. 3, in a group call scenario, the voice templates of the communication terminals of each group member are the respective names recorded by the LEADER, for example, "group member A" and "group member B". , "group member C", "group member D", "group member E", etc., the content of the key information set by the communication terminal of each receiving end is the number of words is three, when the communication terminal of the transmitting end held by the group leader is to the receiving end When the communication terminal performs a group call, all the communication terminals of the receiving end that all the group members hold receive the same voice signal, and the content thereof is “group member C, group member E, please reply to your position”, at this time, each receiving end The communication terminal extracts the key information in the voice signal and matches the voice template pre-stored in each communication terminal, and the matching result of the group member A, the group member B, and the group member D is no, that is, the matching fails, then their communication terminal The original state is maintained, and no operation is performed, and the result of the match between the member C and the member E is YES, that is, the matching is successful, and the following step S27 is performed.
S27:运行预设的功能指令,以响应组呼。S27: Run the preset function command to respond to the group call.
在上述步骤S26中判断关键信息与预存的模板信息匹配后,即运行预设的功能指令,以响应组呼。预设的功能指令可以进行相关扩展功能,即可以有多种功能指令来满足用户各种组呼场景多种需求,例如,可以是实现点亮显示屏、振动、播放语音信号转换的语音信息等等,功能指令可以是遵循默认设置也可以是预设的,在本实施例中,运行预设的功能指令包括运行放大音量、振动或闪灯的指令,运行保存语音信号的指令,向语音信号的发送方传输位置信息。After the key information is matched with the pre-stored template information in the above step S26, the preset function command is executed to respond to the group call. The preset function commands can perform related expansion functions, that is, there can be multiple function commands to meet various needs of the user's various group call scenes, for example, voice information for realizing lighting display, vibration, and playback of voice signal conversion, etc. The function command may be a default setting or a preset. In this embodiment, the function command for running the preset includes an instruction to operate the volume, vibration or flash, and an instruction to save the voice signal to the voice signal. The sender transmits location information.
例如,如图3所示的组呼场景,当LEADER对讲机发起组呼时,LEADER对各组员传达分配任务的指令,通过语音呼叫组员C。一般来说,当组员C由于工作繁忙和注意力不集中时,则会漏掉LEADER对讲机发送过来的重要语音信息,LEADER由于没有收到回应就会反复呼叫组员C,直至组员C回复语音信息;然而,当LEADER反复呼叫组员C,但组员C一直不回复时,LEADER就无法将任务正常派发,那么就会出现组员C在返回之前无法知道LEADER派发的任务内容的情况;当小组处于危险的野外作业环境时,若LEADER无法确认组员C的确切位置,LEADER则会担心组员C是否出现危险,同时也会影响工作的顺利进行。For example, in the group call scenario shown in FIG. 3, when the LEADER walkie-talkie initiates a group call, the LEADER communicates the instructions for assigning tasks to each group member, and calls the group member C by voice. Generally speaking, when team member C is busy and has insufficient concentration, the important voice information sent by the LEADER walkie-talkie will be missed. LEADER will repeatedly call group member C because he has not received the response, until group member C replies. Voice message; however, when LEADER repeatedly calls team member C, but team member C does not reply, LEADER cannot distribute the task normally, then there will be cases where team member C cannot know the task content of LEADER before returning; When the team is in a dangerous field environment, if LEADER cannot confirm the exact location of team member C, LEADER will worry about whether team C is dangerous and will also affect the smooth progress of the work.
但在本实施例中,组员C持有的对讲机设置的语音模板为机主姓名,当LEADER呼叫组员C时,组员C的对讲机会将接收到的语音信号预存语音模板语音进行匹配,在匹配成功后,组员C的对讲机则会自动切换到LED闪灯模式或者振动模式,LED闪灯模式的形式可以有多种,例如,可以是设定闪烁时间也可以闪烁次数,当对讲机包含多个LED灯时,可以设定LED闪灯个数等等,振动模式的形式可以是设定马达振动的持续时间或频率,持续时间或频率可预先设置也可遵循默认,在本实施例中将振动的持续时间设置为3秒,频率可以是每振动5秒后停歇1秒,组员C对讲机的振动或闪灯会引起组员C的注意,从而使组员C发现对讲机有接收到语音信息,继而顺利接听到语音信息;组员C的对讲机在播放语音信息时会将音量放大,方便组员C能够听清LEADER派发的任务内容是什么;然而当组员C一段时间没有接听时,组员C的对讲机会开启自动录音模式,即组员C的对讲机自动保存关键字匹配后的语音信息并支持回放,例如将时间设置为30秒,这样可以避免组员C漏掉需要关注的语音信息时无法找回的情况,其中未接听的时间可以预先设定也可遵循默认设置,此处不做限定;若组员C的长时间没有应答,例如5分钟,则组员C的对讲机默认机主处于危险状况,此时对讲机会自动将组员C的位置信息发送给LEADER,可以使LEADER可以确认组员C发生危险时及时确认救援位置,节省救援时间,其中未接听的时间可以预先设定也可遵循默认设置,此处不做限定。However, in this embodiment, the voice template set by the walkie-talkie held by the group member C is the owner name. When the LEADER calls the group member C, the intercommunication opportunity of the group member C matches the received voice signal pre-stored voice template voice. After the matching is successful, the walkie-talkie of the member C will automatically switch to the LED flash mode or the vibration mode. The form of the LED flash mode can be various, for example, the setting flashing time or the number of flashing times, when the intercom includes When multiple LED lights are used, the number of LED flashes can be set, etc. The vibration mode can be in the form of setting the duration or frequency of the motor vibration. The duration or frequency can be preset or follow the default, in this embodiment. Set the duration of the vibration to 3 seconds, the frequency can be 1 second after each vibration for 5 seconds, the vibration or flashing light of the member C intercom will cause the attention of the member C, so that the member C finds that the walkie-talkie has received the voice information. Then, the voice information is smoothly received; the walkie-talkie of the member C will enlarge the volume when playing the voice information, so that the member C can hear the contents of the task assigned by the LEADER; however, When the member C does not answer for a certain period of time, the intercommunication opportunity of the member C turns on the automatic recording mode, that is, the walkie-talkie of the member C automatically saves the voice information after the keyword matching and supports playback, for example, setting the time to 30 seconds, which can avoid If the member C misses the voice information that needs attention, it cannot be retrieved. The time of the missed call can be preset or the default setting can be followed. It is not limited here; if the member C does not answer for a long time, for example, 5 Minutes, the default owner of the crew of the crew C is in a dangerous situation. At this time, the intercom will automatically send the location information of the crew C to the LEADER, which can enable the LEADER to confirm the rescue location when the crew C is in danger, saving the rescue time. The time of the unanswered time can be preset or the default setting, which is not limited here.
本实施例通过接收语音信号,然后提取语音信号中的关键信息,将关键信息与预存的模板信息进行匹配来判断接收到的语音信号是否为要关注的重要信息,若匹配,则运行预设的功能指令来提醒用户,以响应组呼,本实施例能够从组呼的语言信号中提取出匹配预设模板信息的关键信息,并且运行相应的功能来响应该组呼,可以使用户能够及时关注或者避免漏掉关键信息,提高用户体验。In this embodiment, by receiving a voice signal, and then extracting key information in the voice signal, the key information is matched with the pre-stored template information to determine whether the received voice signal is important information to be concerned, and if matched, the preset is run. The function instruction is used to remind the user to respond to the group call. In this embodiment, the key information matching the preset template information can be extracted from the language signal of the group call, and the corresponding function is executed to respond to the group call, so that the user can timely pay attention. Or avoid missing key information and improve the user experience.
上述方法应用于通讯终端中,其逻辑过程通过计算机程序来表示,并具体通过通讯终端实现。The above method is applied to a communication terminal, and the logic process thereof is represented by a computer program, and is specifically implemented by a communication terminal.
对于计算机程序,以软件形式实现并作为独立的产品销售或使用时,可存储在一个电子设备可读取存储介质中,即,本申请还提供一种计算机存储介质,设备上存储有程序数据,程序数据被处理器执行时实现上述方法的步骤。请参阅图4,图4是本申请计算机存储介质一实施例的结构示意图,计算机存储介质100中有程序数据能够被执行以实现上述实施例的方法,该计算机存储介质可以为如U盘、光盘、服务器等。When the computer program is implemented in software and sold or used as a stand-alone product, it can be stored in an electronic device readable storage medium, that is, the application further provides a computer storage medium on which the program data is stored. The steps of the above method are implemented when the program data is executed by the processor. Please refer to FIG. 4. FIG. 4 is a schematic structural diagram of an embodiment of a computer storage medium according to the present application. Program data in the computer storage medium 100 can be executed to implement the method of the foregoing embodiment. The computer storage medium can be, for example, a USB flash drive or an optical disk. , server, etc.
对于通讯终端的硬件结构,请参阅图5,图5是本申请一种通讯终端一实施例的结构示意图。本实施例通讯终端200包括处理器21、存储器22、麦克风23以及射频模块24,处理器21耦接存储器22、麦克风23以及射频模块24,存储器22中存储有程序数据,处理器21能够加载程序数据并执行,以实现上述组呼的语音信号处理方法,射频模块24用于接收语音信号。For the hardware structure of the communication terminal, please refer to FIG. 5. FIG. 5 is a schematic structural diagram of an embodiment of a communication terminal according to the present application. The communication terminal 200 of the embodiment includes a processor 21, a memory 22, a microphone 23, and a radio frequency module 24. The processor 21 is coupled to the memory 22, the microphone 23, and the radio frequency module 24. The program data is stored in the memory 22, and the processor 21 can load the program. The data is executed and implemented to implement the voice signal processing method of the group call, and the radio frequency module 24 is configured to receive the voice signal.
具体来说,处理器21用于提取语音信号中的关键信息,判断关键信息与模板信息是否匹配;若关键信息与模板信息匹配,则运行预设的功能指令,以响应组呼。Specifically, the processor 21 is configured to extract key information in the voice signal, determine whether the key information matches the template information, and if the key information matches the template information, run the preset function instruction to respond to the group call.
对于通讯终端实现组呼的语音信号处理方法,本实施例通讯终端实现组呼的语音信号处理方法与上述实施例的实施方式类似,具体实施步骤可参考图1或图2,此处不做赘述。For the voice signal processing method for the group terminal to implement the group call, the method for processing the voice signal of the group call in the communication terminal of the embodiment is similar to the embodiment of the foregoing embodiment. For the specific implementation steps, refer to FIG. 1 or FIG. 2, and details are not described herein. .
需要说明的是,发送语音信息的通讯终端与接收语音信息的通讯终端可以为两个不同的通讯终端,具体来说,在一个组呼场景中,作为发送端的通讯终端发送语音信号给作为接收端的通信终端,接收端的通讯终端提取语音信号中的关键信息,判断关键信息与模板信息是否匹配;若是,则运行预设的功能指令,以响应发送端的通讯终端。It should be noted that the communication terminal that transmits the voice information and the communication terminal that receives the voice information may be two different communication terminals. Specifically, in a group call scene, the communication terminal as the transmitting end sends a voice signal to the receiving end. The communication terminal receives the key information in the voice signal, and determines whether the key information matches the template information; if yes, runs the preset function command to respond to the communication terminal of the sender.
本实施例通讯终端能够使用户及时关注或者避免漏掉关键信息,提高用户体验。The communication terminal of this embodiment enables the user to pay attention to or avoid missing key information in time to improve the user experience.
请参阅图6,图6是本申请一种通讯终端又一实施例的结构示意图。本实施例中,该通讯终端200为上述实施例中的通信终端,该通信终端200包括接收模块31、提取模块32、判断模块33以及运行模块34。Please refer to FIG. 6. FIG. 6 is a schematic structural diagram of still another embodiment of a communication terminal according to the present application. In this embodiment, the communication terminal 200 is the communication terminal in the above embodiment, and the communication terminal 200 includes a receiving module 31, an extracting module 32, a determining module 33, and an operating module 34.
接收模块31用于接收语音信号。The receiving module 31 is configured to receive a voice signal.
提取模块32用于提取语音信号中的关键信息。The extraction module 32 is configured to extract key information in the voice signal.
判断模块33用于判断关键信息与预存的模板信息是否匹配。The determining module 33 is configured to determine whether the key information matches the pre-stored template information.
运行模块34用于判断到关键信息与预存的模板信息匹配时,运行预设的功能指令,以响应组呼。The running module 34 is configured to: when the key information matches the pre-stored template information, run the preset function instruction to respond to the group call.
本实施例通讯终端能够使用户及时关注或者避免漏掉关键信息,提高用户体验。The communication terminal of this embodiment enables the user to pay attention to or avoid missing key information in time to improve the user experience.
以上仅为本申请的实施方式,并非因此限制本申请的专利范围,凡是利用本申请说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,均同理包括在本申请的专利保护范围内。 The above is only the embodiment of the present application, and thus does not limit the scope of patents of the present application, and the equivalent structure or equivalent process transformation made by using the specification and the contents of the drawings, or directly or indirectly applied to other related technical fields, The same is included in the scope of patent protection of this application.

Claims (10)

  1. 一种组呼的语音信号处理方法,其特征在于,所述方法包括:A method for processing a voice signal of a group call, characterized in that the method comprises:
    接收所述语音信号;Receiving the voice signal;
    提取所述语音信号中的关键信息;Extracting key information in the voice signal;
    判断所述关键信息与预存的模板信息是否匹配;Determining whether the key information matches the pre-stored template information;
    若是,则运行预设的功能指令,以响应所述组呼。If so, a preset function command is run in response to the group call.
  2. 根据权利要求1所述的方法,其特征在于,所述方法进一步包括:The method of claim 1 wherein the method further comprises:
    获取语音数据;Acquire voice data;
    提取所述语音数据中的关键字语音信息;Extracting keyword voice information in the voice data;
    将所述关键字语音信息保存为所述模板信息。The keyword voice information is saved as the template information.
  3. 根据权利要求2所述的方法,其特征在于,所述提取所述语音数据中的关键字语音信息,包括:The method according to claim 2, wherein the extracting the keyword voice information in the voice data comprises:
    根据预设关键字提取所述语音数据中的关键字语音信息。The keyword voice information in the voice data is extracted according to a preset keyword.
  4. 根据权利要求1所述的方法,其特征在于,所述运行预设的功能指令,包括:The method according to claim 1, wherein the running the preset function instruction comprises:
    运行放大音量、振动或闪灯的指令。Run the command to amplify the volume, vibration or flash.
  5. 根据权利要求1所述的方法,其特征在于,所述运行预设的功能指令,包括:The method according to claim 1, wherein the running the preset function instruction comprises:
    运行保存所述语音信号的指令。Run an instruction to save the voice signal.
  6. 根据权利要求1所述的方法,其特征在于,所述运行预设的功能指令,包括:The method according to claim 1, wherein the running the preset function instruction comprises:
    向所述语音信号的发送方传输位置信息。The location information is transmitted to the sender of the voice signal.
  7. 根据权利要求1所述的方法,其特征在于,所述接收所述语音信号,包括:The method of claim 1, wherein the receiving the voice signal comprises:
    基于射频技术接收所述语音信号。The speech signal is received based on radio frequency technology.
  8. 一种通讯终端,所述通讯终端用于接收组呼的语音信号,其特征在于,所述通讯设备包括相互耦接的处理器和存储器,所述存储器中预存有模板信息,所述处理器用于:A communication terminal, the communication terminal is configured to receive a voice signal of a group call, wherein the communication device comprises a processor and a memory coupled to each other, wherein the memory pre-stores template information, and the processor is used for :
    提取所述语音信号中的关键信息;Extracting key information in the voice signal;
    判断所述关键信息与所述模板信息是否匹配;Determining whether the key information matches the template information;
    若是,则运行预设的功能指令,以响应所述组呼。If so, a preset function command is run in response to the group call.
  9. 根据权利要求8所述的通讯终端,其特征在于,所述通讯终端进一步包括麦克风和射频模块,所述麦克风和所述射频模块分别耦接于所述处理器,所述射频模块用于接收所述语音信号。The communication terminal according to claim 8, wherein the communication terminal further comprises a microphone and a radio frequency module, wherein the microphone and the radio frequency module are respectively coupled to the processor, and the radio frequency module is used for receiving The speech signal.
  10. 一种计算机存储介质,其上存储有计算机程序,其特征在于,所述计算机程序能够被执行以实现权利要求1-7中任一项所述方法的步骤。A computer storage medium having stored thereon a computer program, characterized in that the computer program can be executed to carry out the steps of the method of any of claims 1-7.
PCT/CN2017/118766 2017-12-26 2017-12-26 Method for processing voice signal for group call, communication terminal and computer storage medium WO2019127057A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2017/118766 WO2019127057A1 (en) 2017-12-26 2017-12-26 Method for processing voice signal for group call, communication terminal and computer storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2017/118766 WO2019127057A1 (en) 2017-12-26 2017-12-26 Method for processing voice signal for group call, communication terminal and computer storage medium

Publications (1)

Publication Number Publication Date
WO2019127057A1 true WO2019127057A1 (en) 2019-07-04

Family

ID=67062947

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/118766 WO2019127057A1 (en) 2017-12-26 2017-12-26 Method for processing voice signal for group call, communication terminal and computer storage medium

Country Status (1)

Country Link
WO (1) WO2019127057A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112532691A (en) * 2020-11-06 2021-03-19 问问智能信息科技有限公司 Information processing method and device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110145000A1 (en) * 2009-10-30 2011-06-16 Continental Automotive Gmbh Apparatus, System and Method for Voice Dialogue Activation and/or Conduct
CN104978957A (en) * 2014-04-14 2015-10-14 美的集团股份有限公司 Voice control method and system based on voiceprint identification
CN105096937A (en) * 2015-05-26 2015-11-25 努比亚技术有限公司 Voice data processing method and terminal
CN106161745A (en) * 2015-04-07 2016-11-23 中兴通讯股份有限公司 Call control method of terminal and device

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110145000A1 (en) * 2009-10-30 2011-06-16 Continental Automotive Gmbh Apparatus, System and Method for Voice Dialogue Activation and/or Conduct
CN104978957A (en) * 2014-04-14 2015-10-14 美的集团股份有限公司 Voice control method and system based on voiceprint identification
CN106161745A (en) * 2015-04-07 2016-11-23 中兴通讯股份有限公司 Call control method of terminal and device
CN105096937A (en) * 2015-05-26 2015-11-25 努比亚技术有限公司 Voice data processing method and terminal

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112532691A (en) * 2020-11-06 2021-03-19 问问智能信息科技有限公司 Information processing method and device

Similar Documents

Publication Publication Date Title
US10820098B2 (en) Wireless microphone system, control method and audio-video conference system
CN101569214A (en) Method and device for data capture for push over cellular
WO2017166764A1 (en) Bluetooth beacon-based arrival prompting method and arrival prompting device
CN210986246U (en) Conference terminal and conference terminal system
JPH11505981A (en) Logging recording system for wireless trunk
WO2015131451A1 (en) Mobile terminal and method for playing ring tone thereof
US20160366528A1 (en) Communication system, audio server, and method for operating a communication system
US20170034330A1 (en) Method and Device to Operate Phone with a Single Key
WO2023151526A1 (en) Audio acquisition method and apparatus, electronic device and peripheral component
CN203340289U (en) Voice communication terminal and voice communication system
US20180255163A1 (en) Automatically delaying playback of a message
CN111182139A (en) Bluetooth sound box mobile phone control system based on Internet of things
WO2019127057A1 (en) Method for processing voice signal for group call, communication terminal and computer storage medium
JP2006140542A (en) Multipoint speech system, voice volume adjustment unit, mobile terminal and voice volume adjustment method used for them, and program therefor
JP2015002394A (en) Information processing apparatus and computer program
WO2015034174A1 (en) System for switching and outputting sender-controlled incoming ringtone and method therefor
US11516346B2 (en) Three-way calling terminal for mobile human-machine coordination calling robot
JP2008147975A (en) Content recorder, communication system, control method, control program and computer-readable recording medium
CN101212529A (en) System, device, and method for controlling automobile sound system by telephone
CN109660914A (en) A kind of distributed bluetooth audible control system
WO2021049683A1 (en) Digital radio system based on mobile terminal
JP2005102033A (en) Simulcast call system by portable telephone
CN111556406A (en) Audio processing method, audio processing device and earphone
WO2020041973A1 (en) Intelligent control system and method for mobile phone message
CN215187399U (en) Intercom system based on TWS bluetooth headset

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17936897

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17936897

Country of ref document: EP

Kind code of ref document: A1