WO2017161724A1 - Voice processing method and device, and terminal - Google Patents

Voice processing method and device, and terminal Download PDF

Info

Publication number
WO2017161724A1
WO2017161724A1 PCT/CN2016/087609 CN2016087609W WO2017161724A1 WO 2017161724 A1 WO2017161724 A1 WO 2017161724A1 CN 2016087609 W CN2016087609 W CN 2016087609W WO 2017161724 A1 WO2017161724 A1 WO 2017161724A1
Authority
WO
WIPO (PCT)
Prior art keywords
audio data
recording
audio
permission
module
Prior art date
Application number
PCT/CN2016/087609
Other languages
French (fr)
Chinese (zh)
Inventor
朱峰结
Original Assignee
宇龙计算机通信科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 宇龙计算机通信科技(深圳)有限公司 filed Critical 宇龙计算机通信科技(深圳)有限公司
Publication of WO2017161724A1 publication Critical patent/WO2017161724A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L9/00Cryptographic mechanisms or cryptographic arrangements for secret or secure communications; Network security protocols
    • H04L9/40Network security protocols
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L63/00Network architectures or network communication protocols for network security
    • H04L63/04Network architectures or network communication protocols for network security for providing a confidential data exchange among entities communicating through data packet networks
    • H04L63/0428Network architectures or network communication protocols for network security for providing a confidential data exchange among entities communicating through data packet networks wherein the data content is protected, e.g. by encrypting or encapsulating the payload
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L63/00Network architectures or network communication protocols for network security
    • H04L63/08Network architectures or network communication protocols for network security for authentication of entities
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/68Circuit arrangements for preventing eavesdropping

Definitions

  • the present invention relates to the field of terminal technologies, and in particular, to a method, an apparatus, and a terminal for voice processing.
  • voice encryption schemes such as encryption algorithms, encrypted data transmission methods, and key management methods, which are mainly used to prevent voice/signaling data from being monitored or acquired during network transmission.
  • the communication terminal can decrypt the encrypted voice data after receiving the encrypted voice data sent through the network transmission layer; when the decrypted voice data passes through the operating system, the driver or even the application of the communication terminal At that time, it may be captured by the lurking Trojan program, which causes the leakage of voice data of both parties and reduces the security of the voice data in the terminal.
  • the technical problem to be solved by the embodiments of the present invention is to provide a method, a device and a terminal for voice processing, which can improve the security, convenience and speed of voice processing.
  • the embodiment of the present invention provides a method for voice processing, where the method includes:
  • the terminal When the terminal performs the encrypted call service, if the recording request of the audio data of the encrypted call service is received, the authority of the initiator that initiates the recording request is detected;
  • the permission is a recording permission for indicating that the audio data of the encrypted call service is recorded, acquiring audio data obtained by the terminal performing audio coding on the call voice;
  • the obtained audio data is stored to a preset recording storage area.
  • the method further includes: when the terminal performs the recording request of the audio data of the encrypted call service, and before detecting the recording permission of the initiator that initiates the recording request, the method further includes:
  • the recording permission, the set initiator having the recording permission includes: a call UI, a communication frame, or an audio coding driver.
  • the method further includes:
  • the security prompt information is used to prompt the sender that the audio data of the encrypted call service is not recorded.
  • the obtaining, by the acquiring terminal, the audio data obtained by performing audio coding on the call voice includes:
  • the audio encoding driver having recording authority is called to acquire audio data obtained by audio encoding the call voice.
  • the method further includes:
  • an embodiment of the present invention further provides an apparatus for voice processing, where the apparatus includes:
  • a detecting module configured to: when receiving the recording request of the audio data of the encrypted call service, when the terminal performs the encrypted call service, detecting the authority of the initiator that initiates the recording request;
  • an obtaining module configured to: if the permission detected by the detecting module is a recording permission for indicating that the audio data of the encrypted calling service is recorded, acquiring audio data obtained by the terminal performing audio encoding on the call voice;
  • a storage module configured to store the audio data acquired by the acquiring module to a preset recording storage area.
  • the device further includes:
  • a setting module configured to set a recording permission for recording audio data of the encrypted call service in an audio management module of the terminal, where the set initiator having the recording permission includes: a call UI, a communication frame, or an audio Code driver.
  • the device further includes:
  • a sending module configured to send security prompt information if the right detected by the detecting module is used to indicate that the audio data of the encrypted calling service is not authorized to be recorded;
  • the security prompt information is used to prompt the sender that the audio data of the encrypted call service is not recorded.
  • the obtaining module is specifically configured to invoke the audio encoding driver with recording permission to obtain audio data obtained by audio encoding the call voice.
  • the detecting module is further configured to detect, when the access request for the audio data stored in the recording storage area is received, the authority of the initiator that initiates the access request;
  • the sending module is further configured to: if the permission detected by the detecting module is used to indicate that the right to access the audio data stored in the recording storage area, respond to the access request detected by the detecting module To return the audio data stored in the recording storage area.
  • an embodiment of the present invention further provides a terminal, where the terminal includes the voice processing device.
  • the terminal when the terminal performs the encrypted call service, if the recording request of the audio data of the encrypted call service is received, the right of the initiator that initiates the recording request is detected; if the permission is used for the indication Obtaining the recording permission for recording the audio data of the encrypted call service, acquiring audio data obtained by the terminal to audio-encode the call voice, and further storing the obtained audio data into a preset recording storage area; This can improve the security and convenience of voice processing.
  • FIG. 1 is a schematic structural diagram of an encrypted call frame according to an embodiment of the present invention.
  • FIG. 2 is a schematic flow chart of a voice processing method according to an embodiment of the present invention.
  • FIG. 3 is a schematic structural diagram of voice recording of an encrypted call according to an embodiment of the present invention.
  • FIG. 4 is a schematic flowchart diagram of another voice processing method according to an embodiment of the present invention.
  • FIG. 5 is a schematic flowchart diagram of another voice processing method according to an embodiment of the present invention.
  • FIG. 6 is a schematic structural diagram of a voice processing apparatus according to an embodiment of the present invention.
  • FIG. 7 is a schematic structural diagram of another voice processing apparatus according to an embodiment of the present invention.
  • FIG. 8 is a schematic structural diagram of a terminal according to an embodiment of the present invention.
  • the embodiment of the invention provides a method, a device and a terminal for voice processing, in order to quickly search for keywords and obtain the result information that the user most wants, which is simple in operation and high in efficiency.
  • FIG. 1 is a schematic structural diagram of an encrypted call frame disclosed in an embodiment of the present invention.
  • the encrypted call frame diagram may include an application processor CPU, a communication module, a MIC/Speaker module, an audio code (that is, an audio codec), and an ADSP (Audio digital signal processor). Referred to as ADSP).
  • the application processor CPU may include a call UI (User Interface, user interface, Referred to as UI), it is a call interface, a communication framework, an audio management module, an Audio driver, a Codec driver, and an audio data storage area.
  • the call UI is mainly used for displaying a call interface and a logical processing part related to a call, such as a call.
  • the communication framework may be logic for maintaining communication work and a control center;
  • the audio management module may be for managing audio related processing units, such as audio on/ Off, call recording, audio mode management (such as hands-free mode management), etc.
  • the audio data storage area may be a storage center of audio data, such as call audio, normal audio and other audio data storage center;
  • the Audio driver is mainly used It is responsible for managing the operation of modules such as MIC/Speaker;
  • the Codec driver is mainly responsible for managing the operation of the driver audio Codec module.
  • the communication module may include a call control module, an Rx decryption module, and a Tx encryption module.
  • the call control module is mainly a module for managing call related in the communication module, such as link, audio data management, etc.; the Tx encryption The module and the Rx decryption module are mainly used for encrypting/decrypting the generated/received call audio data to ensure the security of the audio data transmission.
  • the ADSP audio data signal processor is mainly used for performing digital-to-analog conversion and operation of audio data.
  • the audio Codec may be a hardware device, mainly used for encoding/decoding a sound signal.
  • the MIC/Speaker module is mainly used to implement a mutual conversion module of electrical signals and sound signals, and may also be referred to as a microphone or a microphone.
  • the MIC module may be an energy conversion device for converting a sound signal into an electrical signal;
  • the Speaker module can be an energy conversion device used to convert electrical signals into sound signals.
  • each module in the encrypted call frame structure diagram shown in FIG. 1 can be included in the communication terminal.
  • the user can make a call request for the voice call service by calling a call UI (such as a dial pad) in the communication terminal, and the call request is sent to the base station through the call module through the interaction between the communication frame and the call module, and waits until the call partner answers the call.
  • a call UI such as a dial pad
  • the base station Transmitting, by the base station, the response information to the communication terminal, the communication module in the communication terminal receiving the response information, and transmitting the response information to the call frame and the call UI, where the call UI of the communication terminal makes a call Presentation (such as display of the call answering interface); after the communication module receives the response information, that is, after the communication module knows that the call partner has received the information, the audio manager can perform corresponding audio mode setting (such as setting to audio). In the call mode, the audio management module is notified to enter the voice call state; at this time, the audio management module also switches the audio Codec, MIC/Speaker module, etc. to the call working state through the Codec driver and the Audio driver, and the user can communicate with the call partner. Make an encrypted call.
  • a call Presentation such as display of the call answering interface
  • the specific workflow includes: the MIC/Speaker module can collect the voice data of the user for the encrypted call and send it to The audio Codec performs audio encoding to obtain audio data, and then sends the audio data to the ADSP for analog-to-digital AD conversion to convert the audio data in the form of an analog signal into audio data in the form of a digital signal, and then transmits the audio data in the form of a digital signal.
  • the audio Codec performs audio encoding to obtain audio data, and then sends the audio data to the ADSP for analog-to-digital AD conversion to convert the audio data in the form of an analog signal into audio data in the form of a digital signal, and then transmits the audio data in the form of a digital signal.
  • the call control module Encrypting the audio data in the form of the digital signal to the Tx in the communication module, and finally transmitting it to the base station (which may be referred to as uplink audio) through the call control module in the communication module; in the opposite sense, in the communication module
  • the call control module receives the encrypted audio data sent by the caller, the encrypted audio data is decrypted by Rx decryption, and then the decrypted audio data is sent to the ADSP for digital-analog DA conversion to become an analog signal.
  • the audio data of the form is then sent to the audio codec for audio decoding, that is, the inverse process of the audio encoding, and finally sent to the MIC/Speaker module in the communication terminal for voice playback (referred to as downlink audio);
  • the process of working on each module of the terminal during the entire encrypted call is: MIC/Spea Ker module ⁇ audio Codec ⁇ ADSP ⁇ Rx decryption/Tx encryption in the communication module ⁇ Call control module in the communication module.
  • the user can also perform call recording through the call UI in the communication terminal, and then the uplink audio data and the downlink audio data will all be included in the uplink through the audio codec in the communication terminal. All audio data of the audio data and the downlink audio data are sent to the Codec driver, and then sent to the audio management module, and finally the audio management module stores the audio data in a playable format in the audio data storage area of the communication terminal. That is, the call recording process is: audio Coedc ⁇ Codec driver ⁇ audio management module ⁇ audio data storage area.
  • the thick solid line in FIG. 1 represents a data transmission stream
  • the thin solid line represents a control flow (which may also be a signaling control flow).
  • FIG. 2 is a schematic flowchart of a voice processing method according to an embodiment of the present invention. The method in the embodiment of the present invention further includes the following steps.
  • the user when the communication terminal performs the processing of the encrypted call service, the user may use the communication terminal to record the audio data of the encrypted call service (for example, the user selects to record the call on the recording interface of the communication terminal).
  • the communication terminal may detect and acquire a recording request for the user to record the audio data of the encrypted call service, and the communication terminal may further send the recording request to a function module related to the communication terminal to perform a corresponding call.
  • Voice recording processing When said When the application processor CPU (CPU) of the communication terminal receives the recording request for the audio data of the encrypted call service, the CPU may detect the authority of the initiator that initiated the recording request.
  • the method further includes: when the terminal performs the recording request of the audio data of the encrypted call service, and before detecting the recording permission of the initiator that initiates the recording request, the method further includes:
  • Recording permission for recording the audio data of the encrypted call service is set in the audio management module of the terminal, and the set initiator having the recording permission includes: a call UI, a communication frame, or an audio coding driver.
  • the CPU may set a recording permission according to the initiator of the user or the system custom setting, and correspondingly set a recording permission for recording the audio data of the encrypted call service in the audio management module of the communication terminal;
  • the CPU may set a call-only UI, and/or a communication framework, and/or an audio encoding driver (ie, a Codec driver) in the audio management module, the initiator having audio data for the encrypted call service. Recorded recording rights.
  • the communication terminal may include an Internet device such as a smart phone (such as an Android mobile phone, an IOS mobile phone, etc.), a personal computer, a tablet computer, a palmtop computer, a mobile Internet device (MID), or a wearable smart device, and the embodiment of the present invention Not limited.
  • a smart phone such as an Android mobile phone, an IOS mobile phone, etc.
  • a personal computer such as an Android mobile phone, an IOS mobile phone, etc.
  • a tablet computer such as a tablet computer, a palmtop computer, a mobile Internet device (MID), or a wearable smart device
  • MID mobile Internet device
  • the permission is a recording permission for indicating that the audio data of the encrypted call service is recorded, acquiring audio data obtained by the terminal performing audio coding on the call voice.
  • the CPU may acquire the audio coding of the call voice in the communication terminal. The resulting audio data.
  • the obtaining, by the acquiring terminal, the audio data obtained by performing audio coding on the call voice includes:
  • the audio encoding driver having recording authority is called to acquire audio data obtained by audio encoding the call voice.
  • the CPU When the CPU detects that the authority of the initiator that initiates the recording request is the recording authority for indicating that the audio data of the encrypted call service is recorded, the CPU may invoke the location with the recording authority.
  • the audio encoding driver (that is, the audio Codec) is described to obtain audio data obtained by audio encoding the call voice.
  • the method further includes:
  • the security prompt information is used to prompt the sender that the audio data of the encrypted call service is not recorded.
  • the CPU may send one or more to prompt the sender to have no right to The security prompt information of the audio data of the encrypted call service is recorded, such as "**The application has a virus/trojan cannot record the call".
  • the CPU may store the audio data acquired in S102 in a recording storage area preset by a user or a system.
  • the method further includes:
  • an access request for accessing audio data stored in the recording storage area may be sent to the CPU;
  • the CPU may detect the authority of the initiator that initiates the access request, if the CPU detects that the initiator that initiated the access request has stored in the recording storage area.
  • the audio data is accessed, and the CPU transmits the audio data stored in the recording storage area to the corresponding initiator in response to the access request (that is, the CPU sends the audio data to the Said other functional modules of the communication terminal).
  • FIG. 3 a schematic structural diagram of an encrypted call voice recording as shown in FIG. 3 is given, wherein a thick solid line indicates a data transmission stream, and The solid line represents the control flow (which can also be a signaling control flow) and the dashed line represents the audio data stream.
  • the communication frame in the communication terminal can acquire and judge the current progress of the communication terminal in time.
  • Whether the voice call is an encrypted call when the communication architecture determines that the communication terminal is currently in an encrypted call, the communication framework notifies the audio management module to perform rights control on the audio data generated by the encrypted call, that is, Controlling the flow of the audio data stream as shown in FIG. 3 and/or controlling the access rights of the audio data includes the following three aspects of access control:
  • the access permission setting is performed on the audio data storage area for storing the audio data, and only the call UI or the communication frame can be accessed.
  • the audio management module notifies the Codec driver that it is in the process of encrypting the call (that is, when the audio management module notifies the Codec driver to switch the audio Codec to the working state), and interfaces the Codec driver.
  • the permission settings only allow the audio management module to operate or block all interfaces to the audio data related operations.
  • the terminal when the terminal performs the encrypted call service, if the recording request of the audio data of the encrypted call service is received, the right of the initiator that initiates the recording request is detected; if the permission is used for the indication Obtaining the recording permission for recording the audio data of the encrypted call service, acquiring audio data obtained by the terminal to audio-encode the call voice, and further storing the obtained audio data into a preset recording storage area; This can improve the security and convenience of voice processing.
  • FIG. 4 is a schematic flowchart of another voice processing method according to an embodiment of the present invention.
  • the method of the embodiment of the present invention further includes the following steps.
  • S201 Set a recording permission for recording audio data of the encrypted call service in the audio management module of the terminal, where the set initiator having the recording permission includes: a call UI, a communication frame, or an audio coding driver.
  • S203 Determine whether the permission to initiate the initiation of the recording request is a recording permission for indicating that the audio data of the encrypted call service is recorded.
  • the CPU may determine whether the initiator that initiates the recording request detected in S202 has the right to record the audio data of the encrypted call service; if yes, the CPU continues to perform step S204; otherwise, Execute S206.
  • S206 Send security prompt information, where the security prompt information is used to prompt the sender to have no right to record audio data of the encrypted call service.
  • FIG. 5 is a schematic flowchart of another voice processing method according to an embodiment of the present invention.
  • the method in the embodiment of the present invention includes the steps S201 to S206 as described above, and may further include:
  • S302. Determine whether the authority of the initiator that initiates the access request is a right for indicating that the right to access the audio data stored in the recording storage area.
  • the CPU may determine whether the initiator that initiated the access request detected in S301 has the right to access the audio data of the encrypted call service; if yes, the CPU proceeds to step S303; Otherwise, the process is terminated or the CPU may send one or more prompt information for prompting the originator not to access the audio data.
  • the terminal when the terminal performs the encrypted call service, if the recording request of the audio data of the encrypted call service is received, the right of the initiator that initiates the recording request is detected; if the permission is used for the indication Obtaining the recording permission for recording the audio data of the encrypted call service, acquiring audio data obtained by the terminal to audio-encode the call voice, and further storing the obtained audio data into a preset recording storage area; This can improve the security and convenience of voice processing.
  • FIG. 6 is a schematic structural diagram of a voice processing apparatus according to an embodiment of the present invention.
  • the apparatus 6 of the embodiment of the present invention includes:
  • the detecting module 60 is configured to: when the terminal performs the encrypted call service, if the recording request of the audio data of the encrypted call service is received, detecting the permission of the initiator that initiates the recording request;
  • the obtaining module 61 is configured to: if the permission detected by the detecting module 60 is a recording permission for indicating that the audio data of the encrypted calling service is recorded, acquiring the audio data obtained by the terminal performing audio encoding on the call voice ;
  • the storage module 62 is configured to store the audio data acquired by the obtaining module 61 to a preset recording storage area.
  • the terminal when the terminal performs the encrypted call service, if the recording request of the audio data of the encrypted call service is received, the right of the initiator that initiates the recording request is detected; if the permission is used for the indication Obtaining the recording permission for recording the audio data of the encrypted call service, acquiring audio data obtained by the terminal to audio-encode the call voice, and further storing the obtained audio data into a preset recording storage area; This can improve the security and convenience of voice processing.
  • FIG. 7 is a schematic structural diagram of another voice processing apparatus according to an embodiment of the present invention.
  • the apparatus 7 of the embodiment of the present invention includes the detecting module 60, the obtaining module 61, and the storage module 62 as described above. include:
  • a setting module 63 configured to set a recording permission for recording audio data of the encrypted call service in an audio management module of the terminal, where the set initiator having the recording permission includes: a call UI, a communication frame, or Audio code driver.
  • the device further includes:
  • the sending module 64 is configured to send the security prompt information if the right detected by the detecting module 60 is used to indicate that the right is not authorized to record the audio data of the encrypted calling service;
  • the security prompt information is used to prompt the sender that the audio data of the encrypted call service is not recorded.
  • the obtaining module 61 is specifically configured to invoke the audio encoding driver with recording permission to obtain audio data obtained by audio encoding the call voice.
  • the detecting module 60 is further configured to: if the audio data stored in the recording storage area is received When accessing the request, detecting the authority of the initiator that initiated the access request;
  • the sending module 64 is further configured to: if the permission detected by the detecting module 60 is used to indicate that the right to access the audio data stored in the recording storage area, the response module detects The request is accessed to return the audio data stored in the recording storage area.
  • the terminal when the terminal performs the encrypted call service, if the recording request of the audio data of the encrypted call service is received, the right of the initiator that initiates the recording request is detected; if the permission is used for the indication Obtaining the recording permission for recording the audio data of the encrypted call service, acquiring audio data obtained by the terminal to audio-encode the call voice, and further storing the obtained audio data into a preset recording storage area; This can improve the security and convenience of voice processing.
  • FIG. 8 is a schematic structural diagram of a terminal according to an embodiment of the present invention.
  • the terminal may be a device with a communication network function, such as a smart phone, a tablet computer, or a smart wearable device.
  • the terminal in the embodiment of the present invention may include a display screen, a button, a speaker, a pickup, and the like. And further comprising: at least one bus 501, at least one processor 502 connected to the bus 501, and at least one memory 503 connected to the bus 501, a communication device 505 implementing a communication function, and a power supply device 504 for powering each power consumption module of the communication terminal. .
  • the processor 502 can call the code stored in the memory 503 via the bus 501 to perform related functions.
  • the processor 502 is configured to: when the terminal performs the encrypted call service, if the recording request of the audio data of the encrypted call service is received, detecting the permission of the initiator that initiates the recording request; if the permission is used And the recording permission for recording the audio data of the encrypted call service is obtained, the audio data obtained by the terminal to audio-encode the call voice is obtained; and the obtained audio data is stored in the preset recording storage area.
  • the processor 502 is further configured to set a recording permission for recording the audio data of the encrypted call service in the audio management module of the terminal, where the set initiator with the recording permission includes: Call UI, communication framework, or audio coding driver.
  • the processor 502 is further configured to: if the permission is used to indicate that the right is not The security prompt information is sent to the audio data of the encrypted call service, and the security prompt information is used to prompt the sender to have no right to record the audio data of the encrypted call service.
  • the processor 502 is further configured to invoke the audio encoding driver with recording permission to obtain audio data obtained by audio encoding the call voice.
  • the processor 502 is further configured to: if the access request for the audio data stored in the recording storage area is received, detect the authority of the initiator that initiates the access request; if the permission is used In response to the right to access the audio data stored in the recording storage area, the access request is returned in response to the audio data stored in the recording storage area.
  • the terminal when the terminal performs the encrypted call service, if the recording request of the audio data of the encrypted call service is received, the right of the initiator that initiates the recording request is detected; if the permission is used for the indication Obtaining the recording permission for recording the audio data of the encrypted call service, acquiring audio data obtained by the terminal to audio-encode the call voice, and further storing the obtained audio data into a preset recording storage area; This can improve the security and convenience of voice processing.
  • the embodiment of the present invention further provides a computer storage medium, wherein the computer storage medium can store a program, and the program includes some or all of the steps of the operation method of any of the audio playback applications described in the foregoing method embodiments.
  • the disclosed apparatus may be implemented in other ways.
  • the device embodiments described above are merely illustrative.
  • the division of the unit is only a logical function division.
  • there may be another division manner for example, multiple units or components may be combined or may be Integrate into another system, or some features can be ignored or not executed.
  • Another point, the mutual coupling or direct coupling or communication connection shown or discussed may be The indirect coupling or communication connection through some interfaces, devices or units may be in electrical or other form.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
  • each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
  • the above integrated unit can be implemented in the form of hardware or in the form of a software functional unit.
  • the integrated unit if implemented in the form of a software functional unit and sold or used as a standalone product, may be stored in a computer readable storage medium.
  • the technical solution of the present invention which is essential or contributes to the prior art, or all or part of the technical solution, may be embodied in the form of a software product stored in a storage medium.
  • a number of instructions are included to cause a computer device (which may be a personal computer, server or network device, etc.) to perform all or part of the steps of the methods described in various embodiments of the present invention.
  • the foregoing storage medium includes: a U disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a removable hard disk, a magnetic disk, or an optical disk, and the like. .

Abstract

Embodiments of the present invention provide a voice processing method and device, and a terminal. The method comprises: checking, if a request of recording audio data of an encrypted call service is received when a terminal is executing the encrypted call service, a permission of an initiator initiating the recording request; if the permission is a recoding permission used for indicating the right to record audio data of the encrypted call service, obtaining audio data obtained after the terminal performs audio encoding on the call voice; and storing the obtained audio data to a preset record storage area. The present invention can improve the security and convenience of voice processing.

Description

一种语音处理的方法、装置以及终端Method, device and terminal for speech processing 技术领域Technical field
本发明涉及终端技术领域,尤其涉及一种语音处理的方法、装置以及终端。The present invention relates to the field of terminal technologies, and in particular, to a method, an apparatus, and a terminal for voice processing.
背景技术Background technique
随着终端技术的不断发展,通信终端在移动互联网环境下带给用户很大地便捷和快乐,但同时对隐私要求也很高。目前有很多语音加密的方案,比如加密算法、加密数据传输方式、密钥管理方式等,主要用于防止语音/信令数据在网络传输过程中被监听或获取。With the continuous development of terminal technologies, communication terminals bring users a lot of convenience and happiness in the mobile Internet environment, but at the same time, the privacy requirements are also high. At present, there are many voice encryption schemes, such as encryption algorithms, encrypted data transmission methods, and key management methods, which are mainly used to prevent voice/signaling data from being monitored or acquired during network transmission.
在实践中发现,通信终端在接收通过网络传输层发送过来的加密语音数据后可以对所述加密语音数据进行解密;当解密后的语音数据通过所述通信终端的操作系统、驱动甚至是应用程序时,可能被早就潜伏的木马程序捕获,这样就造成通话双方语音数据的泄露,降低了终端中语音数据的安全性。It is found in practice that the communication terminal can decrypt the encrypted voice data after receiving the encrypted voice data sent through the network transmission layer; when the decrypted voice data passes through the operating system, the driver or even the application of the communication terminal At that time, it may be captured by the lurking Trojan program, which causes the leakage of voice data of both parties and reduces the security of the voice data in the terminal.
发明内容Summary of the invention
本发明实施例所要解决的技术问题在于,提供一种语音处理的方法、装置以及终端,可提升语音处理的安全性和方便快捷性。The technical problem to be solved by the embodiments of the present invention is to provide a method, a device and a terminal for voice processing, which can improve the security, convenience and speed of voice processing.
一方面,本发明实施例公开提供了一种语音处理的方法,所述方法包括:In one aspect, the embodiment of the present invention provides a method for voice processing, where the method includes:
当终端执行加密通话业务时,如果接收到所述加密通话业务的音频数据的录制请求时,检测发起所述录制请求的发起方的权限;When the terminal performs the encrypted call service, if the recording request of the audio data of the encrypted call service is received, the authority of the initiator that initiates the recording request is detected;
若所述权限为用于指示有权对所述加密通话业务的音频数据进行录制的录制权限,则获取终端对通话语音进行音频编码得到的音频数据;If the permission is a recording permission for indicating that the audio data of the encrypted call service is recorded, acquiring audio data obtained by the terminal performing audio coding on the call voice;
将获取到的所述音频数据存储至预置的录音存储区。The obtained audio data is stored to a preset recording storage area.
其中可选地,所述当终端执行加密通话业务时,如果接收到所述加密通话业务的音频数据的录制请求时,检测发起所述录制请求的发起方的录制权限之前,还包括:Optionally, the method further includes: when the terminal performs the recording request of the audio data of the encrypted call service, and before detecting the recording permission of the initiator that initiates the recording request, the method further includes:
在所述终端的音频管理模块中设置对所述加密通话业务的音频数据进行录 制的录制权限,设置的具有所述录制权限的发起方包括:通话UI、通信框架、或者音频编码驱动。Setting audio data of the encrypted call service in an audio management module of the terminal The recording permission, the set initiator having the recording permission includes: a call UI, a communication frame, or an audio coding driver.
其中可选地,所述方法还包括:Optionally, the method further includes:
若所述权限为用于指示无权对所述加密通话业务的音频数据进行录制的权限,则发送安全提示信息;Sending the security prompt information if the permission is for indicating that the audio data of the encrypted call service is not authorized to be recorded;
其中,所述安全提示信息用于提示所述发送方无权对所述加密通话业务的音频数据进行录制。The security prompt information is used to prompt the sender that the audio data of the encrypted call service is not recorded.
其中可选地,所述获取终端对通话语音进行音频编码得到的音频数据,包括:Optionally, the obtaining, by the acquiring terminal, the audio data obtained by performing audio coding on the call voice, includes:
调用具有录制权限的所述音频编码驱动来获取对通话语音进行音频编码得到的音频数据。The audio encoding driver having recording authority is called to acquire audio data obtained by audio encoding the call voice.
其中可选地,所述方法还包括:Optionally, the method further includes:
如果接收到对所述录音存储区中存储的音频数据的访问请求时,检测发起所述访问请求的发起方的权限;If the access request for the audio data stored in the recording storage area is received, detecting the authority of the initiator that initiated the access request;
若所述权限为用于指示有权对所述录音存储区中存储的音频数据进行访问的权限,则响应所述访问请求以返回所述录音存储区中存储的音频数据。And if the permission is for indicating the right to access the audio data stored in the recording storage area, responding to the access request to return the audio data stored in the recording storage area.
另一方面,本发明实施例还公开提供了一种语音处理的装置,所述装置包括:In another aspect, an embodiment of the present invention further provides an apparatus for voice processing, where the apparatus includes:
检测模块,用于当终端执行加密通话业务时,如果接收到所述加密通话业务的音频数据的录制请求时,检测发起所述录制请求的发起方的权限;a detecting module, configured to: when receiving the recording request of the audio data of the encrypted call service, when the terminal performs the encrypted call service, detecting the authority of the initiator that initiates the recording request;
获取模块,用于若所述检测模块检测到的权限为用于指示有权对所述加密通话业务的音频数据进行录制的录制权限,则获取终端对通话语音进行音频编码得到的音频数据;And an obtaining module, configured to: if the permission detected by the detecting module is a recording permission for indicating that the audio data of the encrypted calling service is recorded, acquiring audio data obtained by the terminal performing audio encoding on the call voice;
存储模块,用于将所述获取模块获取到的所述音频数据存储至预置的录音存储区。And a storage module, configured to store the audio data acquired by the acquiring module to a preset recording storage area.
其中可选地,所述装置还包括:Optionally, the device further includes:
设置模块,用于在所述终端的音频管理模块中设置对所述加密通话业务的音频数据进行录制的录制权限,设置的具有所述录制权限的发起方包括:通话UI、通信框架、或者音频编码驱动。 a setting module, configured to set a recording permission for recording audio data of the encrypted call service in an audio management module of the terminal, where the set initiator having the recording permission includes: a call UI, a communication frame, or an audio Code driver.
其中可选地,所述装置还包括:Optionally, the device further includes:
发送模块,用于若所述检测模块检测到的权限为用于指示无权对所述加密通话业务的音频数据进行录制的权限,则发送安全提示信息;a sending module, configured to send security prompt information if the right detected by the detecting module is used to indicate that the audio data of the encrypted calling service is not authorized to be recorded;
其中,所述安全提示信息用于提示所述发送方无权对所述加密通话业务的音频数据进行录制。The security prompt information is used to prompt the sender that the audio data of the encrypted call service is not recorded.
其中可选地,Optionally,
所述获取模块,具体用于调用具有录制权限的所述音频编码驱动来获取对通话语音进行音频编码得到的音频数据。The obtaining module is specifically configured to invoke the audio encoding driver with recording permission to obtain audio data obtained by audio encoding the call voice.
其中可选地,Optionally,
所述检测模块,还用于如果接收到对所述录音存储区中存储的音频数据的访问请求时,检测发起所述访问请求的发起方的权限;The detecting module is further configured to detect, when the access request for the audio data stored in the recording storage area is received, the authority of the initiator that initiates the access request;
所述发送模块,还用于若所述检测模块检测到的权限为用于指示有权对所述录音存储区中存储的音频数据进行访问的权限,则响应所述检测模块检测到的访问请求以返回所述录音存储区中存储的音频数据。The sending module is further configured to: if the permission detected by the detecting module is used to indicate that the right to access the audio data stored in the recording storage area, respond to the access request detected by the detecting module To return the audio data stored in the recording storage area.
再一方面,本发明实施例还公开提供了一种终端,所述终端包括所述的语音处理装置。In still another aspect, an embodiment of the present invention further provides a terminal, where the terminal includes the voice processing device.
本发明实施例可通过在终端执行加密通话业务时,如果接收到所述加密通话业务的音频数据的录制请求时,检测发起所述录制请求的发起方的权限;若所述权限为用于指示有权对所述加密通话业务的音频数据进行录制的录制权限,则获取终端对通话语音进行音频编码得到的音频数据,进一步地将获取到的所述音频数据存储至预置的录音存储区;这样可提升语音处理的安全性和方便快捷性。In the embodiment of the present invention, when the terminal performs the encrypted call service, if the recording request of the audio data of the encrypted call service is received, the right of the initiator that initiates the recording request is detected; if the permission is used for the indication Obtaining the recording permission for recording the audio data of the encrypted call service, acquiring audio data obtained by the terminal to audio-encode the call voice, and further storing the obtained audio data into a preset recording storage area; This can improve the security and convenience of voice processing.
附图说明DRAWINGS
为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the embodiments or the description of the prior art will be briefly described below. Obviously, the drawings in the following description are only It is a certain embodiment of the present invention, and other drawings can be obtained from those skilled in the art without any creative work.
图1是本发明实施例的一种加密通话框架结构示意图; 1 is a schematic structural diagram of an encrypted call frame according to an embodiment of the present invention;
图2是本发明实施例的一种语音处理方法的流程示意图;2 is a schematic flow chart of a voice processing method according to an embodiment of the present invention;
图3是本发明实施例的一种加密通话语音录制的结构示意图;3 is a schematic structural diagram of voice recording of an encrypted call according to an embodiment of the present invention;
图4是本发明实施例的另一种语音处理方法的流程示意图;4 is a schematic flowchart diagram of another voice processing method according to an embodiment of the present invention;
图5是本发明实施例的另一种语音处理方法的流程示意图;FIG. 5 is a schematic flowchart diagram of another voice processing method according to an embodiment of the present invention; FIG.
图6是本发明实施例的一种语音处理装置的结构示意图;FIG. 6 is a schematic structural diagram of a voice processing apparatus according to an embodiment of the present invention; FIG.
图7是本发明实施例的另一种语音处理装置的结构示意图;FIG. 7 is a schematic structural diagram of another voice processing apparatus according to an embodiment of the present invention; FIG.
图8是本发明实施例的一种终端的结构示意图。FIG. 8 is a schematic structural diagram of a terminal according to an embodiment of the present invention.
具体实施方式detailed description
本发明实施例提供了一种语音处理的方法、装置及终端,以期可以对关键词进行快速搜索,获取用户最想得到的结果信息,操作简单,效率高。The embodiment of the invention provides a method, a device and a terminal for voice processing, in order to quickly search for keywords and obtain the result information that the user most wants, which is simple in operation and high in efficiency.
为了使本技术领域的人员更好地理解本发明方案,下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分的实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都应当属于本发明保护的范围。The technical solutions in the embodiments of the present invention are clearly and completely described in the following with reference to the accompanying drawings in the embodiments of the present invention. It is an embodiment of the invention, but not all of the embodiments. All other embodiments obtained by those skilled in the art based on the embodiments of the present invention without creative efforts shall fall within the scope of the present invention.
本发明的说明书和权利要求书及上述附图中的术语“第一”、“第二”和“第三”等是用于区别不同对象,而非用于描述特定顺序。此外,术语“包括”以及它们任何变形,意图在于覆盖不排他的包含。例如包含了一系列步骤或单元的过程、方法、系统、产品或设备没有限定于已列出的步骤或单元,而是可选地还包括没有列出的步骤或单元,或可选地还包括对于这些过程、方法、产品或设备固有的其它步骤或单元。The terms "first", "second" and "third" and the like in the specification and claims of the present invention and the above drawings are used to distinguish different objects, and are not intended to describe a specific order. Moreover, the term "comprise" and any variants thereof are intended to cover a non-exclusive inclusion. For example, a process, method, system, product, or device that comprises a series of steps or units is not limited to the listed steps or units, but optionally also includes steps or units not listed, or alternatively Other steps or units inherent to these processes, methods, products or equipment.
为了更好理解本发明实施例提供的一种语音处理的方法、装置及终端,下面先对本发明实施例适用的加密通话构架进行描述。请参阅图1,图1是本发明实施例公开的一种加密通话构架的结构示意图。如图1所示,该加密通话框架示意图中可以包括应用处理器CPU、通信模块、MIC/Speaker模块、音频编码(也即是音频Codec)、ADSP(Audio digital signal processor,音频数据信号处理器,简称ADSP)。所述应用处理器CPU可以包括通话UI(User Interface,用户界面, 简称UI)也即是通话界面、通信框架、音频管理模块、Audio驱动、Codec驱动、音频数据存储区;其中,所述通话UI主要用于通话界面的显示以及通话相关的逻辑处理部分,如通话显示界面的显示和界面上的一些按钮;所述通信框架可以为用于维护通信功能够正常工作的逻辑以及控制中心;所述音频管理模块可以为管理音频相关的处理单元,如音频的开启/关闭、通话录音、音频模式管理(如免提模式管理)等;所述音频数据存储区可以是音频数据的存放中心,如通话音频、普通音频等音频数据的存放中心;所述Audio驱动主要用于负责管理驱动MIC/Speaker等模块工作;所述Codec驱动主要负责管理驱动音频Codec模块工作。所述通信模块可以包括呼叫控制模块、Rx解密模块、Tx加密模块;其中,所述呼叫控制模块主要为通信模块中管理通话相关的模块,如链路、音频数据的管理等;所述Tx加密模块和所述Rx解密模块主要用来将发生/接收到的通话音频数据进行加密/解密处理,以保障音频数据传输的安全性。所述ADSP音频数据信号处理器主要用于进行音频数据的数模转化和运算。所述音频Codec可以为硬件设备,主要用于对声音信号的编码/解码。所述MIC/Speaker模块主要用来实现电信号和声音信号的相互转换模块,也可以将其称为话筒、微音器,MIC模块可以为用来将声音信号转换为电信号的能量转换器件;Speaker模块可以为用来将电信号转化为声音信号的能量转换器件。In order to better understand a method, device, and terminal for voice processing provided by the embodiments of the present invention, an encrypted call frame to which the embodiments of the present invention are applied is described below. Please refer to FIG. 1. FIG. 1 is a schematic structural diagram of an encrypted call frame disclosed in an embodiment of the present invention. As shown in FIG. 1 , the encrypted call frame diagram may include an application processor CPU, a communication module, a MIC/Speaker module, an audio code (that is, an audio codec), and an ADSP (Audio digital signal processor). Referred to as ADSP). The application processor CPU may include a call UI (User Interface, user interface, Referred to as UI), it is a call interface, a communication framework, an audio management module, an Audio driver, a Codec driver, and an audio data storage area. The call UI is mainly used for displaying a call interface and a logical processing part related to a call, such as a call. Displaying the display of the interface and some buttons on the interface; the communication framework may be logic for maintaining communication work and a control center; the audio management module may be for managing audio related processing units, such as audio on/ Off, call recording, audio mode management (such as hands-free mode management), etc.; the audio data storage area may be a storage center of audio data, such as call audio, normal audio and other audio data storage center; the Audio driver is mainly used It is responsible for managing the operation of modules such as MIC/Speaker; the Codec driver is mainly responsible for managing the operation of the driver audio Codec module. The communication module may include a call control module, an Rx decryption module, and a Tx encryption module. The call control module is mainly a module for managing call related in the communication module, such as link, audio data management, etc.; the Tx encryption The module and the Rx decryption module are mainly used for encrypting/decrypting the generated/received call audio data to ensure the security of the audio data transmission. The ADSP audio data signal processor is mainly used for performing digital-to-analog conversion and operation of audio data. The audio Codec may be a hardware device, mainly used for encoding/decoding a sound signal. The MIC/Speaker module is mainly used to implement a mutual conversion module of electrical signals and sound signals, and may also be referred to as a microphone or a microphone. The MIC module may be an energy conversion device for converting a sound signal into an electrical signal; The Speaker module can be an energy conversion device used to convert electrical signals into sound signals.
可以理解的是,通信终端中可以包括如图1所示的加密通话框架结构图中各个模块。用户可以通过通信终端中的通话UI(如拨号盘)拨打电话发起语音通话业务的呼叫请求,通过通信框架和通话模块的交互将所述呼叫请求通过通话模块发送给基站,等到通话对方接听电话之后将响应信息通过基站发送给所述通信终端,所述通信终端中的通信模块接收所述响应信息,并将所述响应信息发送给通话框架和通话UI,所述通信终端的通话UI进行通话的呈现(如通话接听界面的显示);在通信模块接收到所述响应信息之后,也即是通信模块获知通话对方已接听信息之后,可以通过音频管理器进行相应的音频模式设置(如设置为音频通话模式),通知音频管理模块进入语音通话状态中;此时音频管理模块还会通过Codec驱动和Audio驱动将音频Codec、MIC/Speaker模块等切换到通话工作状态,这时用户就可以和通话对方进行加密通话。具体工作流程包括:MIC/Speaker模块可以采集用户进行加密通话的语音数据,并将其发送给 音频Codec进行音频编码得到音频数据,再将所述音频数据发送给ADSP进行模数AD转换使其将模拟信号形式的音频数据转化为数字信号形式的音频数据,然后将数字信号形式的音频数据发送给通信模块中的Tx加密对所述数字信号形式的音频数据进行加密,最后通过通信模块中的呼叫控制模块将其发送给基站(可以简称为上行音频);同理相反的,在通信模块中的呼叫控制模块接收到通话对方发送过来的加密音频数据时,通过Rx解密对所述加密音频数据进行解密处理,再将解密之后的音频数据发送给ADSP进行数模DA转化使其变成模拟信号形式的音频数据,然后将其发送给音频Codec进行音频解码也即是音频编码的逆过程,最后发送给所述通信终端中的MIC/Speaker模块进行语音播放(简称为下行音频);也即是,在整个加密通话过程中终端的各个模块工作的流程为:MIC/Speaker模块←→音频Codec←→ADSP←→通信模块中的Rx解密/Tx加密←→通信模块中的呼叫控制模块。It can be understood that each module in the encrypted call frame structure diagram shown in FIG. 1 can be included in the communication terminal. The user can make a call request for the voice call service by calling a call UI (such as a dial pad) in the communication terminal, and the call request is sent to the base station through the call module through the interaction between the communication frame and the call module, and waits until the call partner answers the call. Transmitting, by the base station, the response information to the communication terminal, the communication module in the communication terminal receiving the response information, and transmitting the response information to the call frame and the call UI, where the call UI of the communication terminal makes a call Presentation (such as display of the call answering interface); after the communication module receives the response information, that is, after the communication module knows that the call partner has received the information, the audio manager can perform corresponding audio mode setting (such as setting to audio). In the call mode, the audio management module is notified to enter the voice call state; at this time, the audio management module also switches the audio Codec, MIC/Speaker module, etc. to the call working state through the Codec driver and the Audio driver, and the user can communicate with the call partner. Make an encrypted call. The specific workflow includes: the MIC/Speaker module can collect the voice data of the user for the encrypted call and send it to The audio Codec performs audio encoding to obtain audio data, and then sends the audio data to the ADSP for analog-to-digital AD conversion to convert the audio data in the form of an analog signal into audio data in the form of a digital signal, and then transmits the audio data in the form of a digital signal. Encrypting the audio data in the form of the digital signal to the Tx in the communication module, and finally transmitting it to the base station (which may be referred to as uplink audio) through the call control module in the communication module; in the opposite sense, in the communication module When the call control module receives the encrypted audio data sent by the caller, the encrypted audio data is decrypted by Rx decryption, and then the decrypted audio data is sent to the ADSP for digital-analog DA conversion to become an analog signal. The audio data of the form is then sent to the audio codec for audio decoding, that is, the inverse process of the audio encoding, and finally sent to the MIC/Speaker module in the communication terminal for voice playback (referred to as downlink audio); The process of working on each module of the terminal during the entire encrypted call is: MIC/Spea Ker module ←→audio Codec←→ADSP←→Rx decryption/Tx encryption in the communication module ←→Call control module in the communication module.
可以理解的是,在加密通话过程中,用户还可以通过所述通信终端中的通话UI选择进行通话录音,那么上行音频数据和下行音频数据都将通过所述通信终端中的音频Codec将包括上行音频数据和下行音频数据的所有音频数据发送给Codec驱动,再将其发送给音频管理模块,最后音频管理模块将所述音频数据以可播放的格式存储在所述通信终端中的音频数据存储区;也即是通话录音过程为:音频Coedc→Codec驱动→音频管理模块→音频数据存储区。It can be understood that during the encrypted call, the user can also perform call recording through the call UI in the communication terminal, and then the uplink audio data and the downlink audio data will all be included in the uplink through the audio codec in the communication terminal. All audio data of the audio data and the downlink audio data are sent to the Codec driver, and then sent to the audio management module, and finally the audio management module stores the audio data in a playable format in the audio data storage area of the communication terminal. That is, the call recording process is: audio Coedc→Codec driver→audio management module→audio data storage area.
需要说明的是,图1中的粗实线表示数据传输流,细实线表示控制流(也可以为信令控制流)。It should be noted that the thick solid line in FIG. 1 represents a data transmission stream, and the thin solid line represents a control flow (which may also be a signaling control flow).
基于图1所示的加密通话架构,请参阅图2是本发明实施例的一种语音处理方法的流程示意图,本发明实施例的所述方法还包括如下步骤。Based on the encrypted call architecture shown in FIG. 1, FIG. 2 is a schematic flowchart of a voice processing method according to an embodiment of the present invention. The method in the embodiment of the present invention further includes the following steps.
S101、当终端执行加密通话业务时,如果接收到所述加密通话业务的音频数据的录制请求时,检测发起所述录制请求的发起方的权限。S101. When the terminal performs the encrypted call service, if the recording request of the audio data of the encrypted call service is received, the authority of the initiator that initiates the recording request is detected.
本发明实施例中,当通信终端进行加密通话业务处理时,用户可以利用所述通信终端对所述加密通话业务的音频数据进行录制(如用户在所述通信终端的录音界面上选择进行通话录音),所述通信终端可以检测并获取用户对所述加密通话业务的音频数据进行录制的录制请求,所述通信终端还可以将所述录制请求发送给本通信终端相关的功能模块进行相应地通话语音录制处理。当所述 通信终端的应用处理器CPU(Central Processing Unit,CPU)接收到对所述加密通话业务的音频数据的录制请求时,所述CPU可以检测发起所述录制请求的发起方的权限。In the embodiment of the present invention, when the communication terminal performs the processing of the encrypted call service, the user may use the communication terminal to record the audio data of the encrypted call service (for example, the user selects to record the call on the recording interface of the communication terminal). The communication terminal may detect and acquire a recording request for the user to record the audio data of the encrypted call service, and the communication terminal may further send the recording request to a function module related to the communication terminal to perform a corresponding call. Voice recording processing. When said When the application processor CPU (CPU) of the communication terminal receives the recording request for the audio data of the encrypted call service, the CPU may detect the authority of the initiator that initiated the recording request.
其中可选地,所述当终端执行加密通话业务时,如果接收到所述加密通话业务的音频数据的录制请求时,检测发起所述录制请求的发起方的录制权限之前,还包括:Optionally, the method further includes: when the terminal performs the recording request of the audio data of the encrypted call service, and before detecting the recording permission of the initiator that initiates the recording request, the method further includes:
在所述终端的音频管理模块中设置对所述加密通话业务的音频数据进行录制的录制权限,设置的具有所述录制权限的发起方包括:通话UI、通信框架、或者音频编码驱动。Recording permission for recording the audio data of the encrypted call service is set in the audio management module of the terminal, and the set initiator having the recording permission includes: a call UI, a communication frame, or an audio coding driver.
所述CPU可以根据用户或者系统自定义设置的发起方录制权限,相应地在所述通信终端的音频管理模块中设置用于对所述加密通话业务的音频数据进行录制的录制权限;如所述CPU可以在所述音频管理模块中设置仅有通话UI,和/或通信框架,和/或音频编码驱动(也即是Codec驱动)这些发起方具有用于对所述加密通话业务的音频数据进行录制的录制权限。The CPU may set a recording permission according to the initiator of the user or the system custom setting, and correspondingly set a recording permission for recording the audio data of the encrypted call service in the audio management module of the communication terminal; The CPU may set a call-only UI, and/or a communication framework, and/or an audio encoding driver (ie, a Codec driver) in the audio management module, the initiator having audio data for the encrypted call service. Recorded recording rights.
所述通信终端可以包括智能手机(如Android手机、IOS手机等)、个人电脑、平板电脑、掌上电脑、移动互联网设备(MID,Mobile Internet Devices)或穿戴式智能设备等互联网设备,本发明实施例不作限定。The communication terminal may include an Internet device such as a smart phone (such as an Android mobile phone, an IOS mobile phone, etc.), a personal computer, a tablet computer, a palmtop computer, a mobile Internet device (MID), or a wearable smart device, and the embodiment of the present invention Not limited.
S102、若所述权限为用于指示有权对所述加密通话业务的音频数据进行录制的录制权限,则获取终端对通话语音进行音频编码得到的音频数据。S102. If the permission is a recording permission for indicating that the audio data of the encrypted call service is recorded, acquiring audio data obtained by the terminal performing audio coding on the call voice.
本发明实施例中,若CPU在S101中检测到发起所述录制请求的发起方有权对所述加密通话业务的音频数据进行录制,则所述CPU可以获取通信终端中对通话语音进行音频编码得到的音频数据。In the embodiment of the present invention, if the CPU detects in S101 that the initiator that initiated the recording request has the right to record the audio data of the encrypted call service, the CPU may acquire the audio coding of the call voice in the communication terminal. The resulting audio data.
可选地,所述获取终端对通话语音进行音频编码得到的音频数据,包括:Optionally, the obtaining, by the acquiring terminal, the audio data obtained by performing audio coding on the call voice, includes:
调用具有录制权限的所述音频编码驱动来获取对通话语音进行音频编码得到的音频数据。The audio encoding driver having recording authority is called to acquire audio data obtained by audio encoding the call voice.
所述CPU在S101中检测到发起所述录制请求的发起方的权限为用于指示有权对所述加密通话业务的音频数据进行录制的录制权限时,所述CPU可以调用具有录制权限的所述音频编码驱动(也即是所述音频Codec)来获取对通话语音进行音频编码得到的音频数据。 When the CPU detects that the authority of the initiator that initiates the recording request is the recording authority for indicating that the audio data of the encrypted call service is recorded, the CPU may invoke the location with the recording authority. The audio encoding driver (that is, the audio Codec) is described to obtain audio data obtained by audio encoding the call voice.
可选地,所述方法还包括:Optionally, the method further includes:
若所述权限为用于指示无权对所述加密通话业务的音频数据进行录制的权限,则发送安全提示信息;Sending the security prompt information if the permission is for indicating that the audio data of the encrypted call service is not authorized to be recorded;
其中,所述安全提示信息用于提示所述发送方无权对所述加密通话业务的音频数据进行录制。The security prompt information is used to prompt the sender that the audio data of the encrypted call service is not recorded.
所述CPU在S101中检测到发起所述录制请求的发起方无权对所述加密通话业务的音频数据进行录制时,所述CPU可以发送一个或者多个用于提示所述发送方无权对所述加密通话业务的音频数据进行录制的安全提示信息,如“**应用存在病毒/木马不能进行通话录音”。When the CPU detects that the initiator that initiates the recording request does not have the right to record the audio data of the encrypted call service, the CPU may send one or more to prompt the sender to have no right to The security prompt information of the audio data of the encrypted call service is recorded, such as "**The application has a virus/trojan cannot record the call".
S103、将获取到的所述音频数据存储至预置的录音存储区。S103. Store the acquired audio data into a preset recording storage area.
本发明实施例中,CPU可以将S102中获取到的所述音频数据存储在用户或者系统预先自定义设置的录音存储区中。In the embodiment of the present invention, the CPU may store the audio data acquired in S102 in a recording storage area preset by a user or a system.
其中可选地,所述方法还包括:Optionally, the method further includes:
如果接收到对所述录音存储区中存储的音频数据的访问请求时,检测发起所述访问请求的发起方的权限;If the access request for the audio data stored in the recording storage area is received, detecting the authority of the initiator that initiated the access request;
若所述权限为用于指示有权对所述录音存储区中存储的音频数据进行访问的权限,则响应所述访问请求以返回所述录音存储区中存储的音频数据。And if the permission is for indicating the right to access the audio data stored in the recording storage area, responding to the access request to return the audio data stored in the recording storage area.
如果通信终端中的其他功能模块需要调用所述录音存储区中存储的所述音频数据时,可以向所述CPU发送用于对所述录音存储区中存储的音频数据进行访问的访问请求;在所述CPU接收到所述访问请求时,所述CPU可以检测发起所述访问请求的发起方的权限,如果所述CPU检测到发起所述访问请求的发起方具有对所述录音存储区中存储的音频数据进行访问的权限,则所述CPU响应所述访问请求将所述录音存储区中存储的音频数据发送给对应的发起方(也即是,所述CPU将所述音频数据发送给所述通信终端的其他功能模块)。If another function module in the communication terminal needs to invoke the audio data stored in the recording storage area, an access request for accessing audio data stored in the recording storage area may be sent to the CPU; When the CPU receives the access request, the CPU may detect the authority of the initiator that initiates the access request, if the CPU detects that the initiator that initiated the access request has stored in the recording storage area. The audio data is accessed, and the CPU transmits the audio data stored in the recording storage area to the corresponding initiator in response to the access request (that is, the CPU sends the audio data to the Said other functional modules of the communication terminal).
为了便于对上述实施例的理解,基于图1所示的加密通话框架之上,给出了如下图3所示的一种加密通话语音录制的结构示意图,其中粗实线表示数据传输流,细实线表示控制流(也可以为信令控制流),虚线表示音频数据流。当用户利用包括有如图1中所有通信相关功能模块的通信终端机建立语音通话之后,所述通信终端中的通信框架可以及时获取并判断所述通信终端当前进行的 语音通话是否为加密通话;当所述通信架构判断到所述通信终端当前处于加密通话时,所述通信框架会通知音频管理模块对所述加密通话所产生的音频数据进行权限控制,也即是控制如图3所示的音频数据流的流向和/或控制所述音频数据的访问权限,具体包括如下三方面的权限控制:In order to facilitate the understanding of the above embodiment, based on the encrypted call frame shown in FIG. 1, a schematic structural diagram of an encrypted call voice recording as shown in FIG. 3 is given, wherein a thick solid line indicates a data transmission stream, and The solid line represents the control flow (which can also be a signaling control flow) and the dashed line represents the audio data stream. After the user establishes a voice call by using a communication terminal including all the communication related function modules as in FIG. 1, the communication frame in the communication terminal can acquire and judge the current progress of the communication terminal in time. Whether the voice call is an encrypted call; when the communication architecture determines that the communication terminal is currently in an encrypted call, the communication framework notifies the audio management module to perform rights control on the audio data generated by the encrypted call, that is, Controlling the flow of the audio data stream as shown in FIG. 3 and/or controlling the access rights of the audio data includes the following three aspects of access control:
一、对所述音频管理模块进行权限设置,此时只有通话UI或者通信框架才能够调用所述音频管理模块的接口和功能,或者将所有上层的音频相关操作的接口全部屏蔽掉(如屏蔽掉所述音频管理模块中用于读取音频Codec数据的接口)。1. Perform permission setting on the audio management module. At this time, only the call UI or the communication framework can call the interface and function of the audio management module, or block all the interfaces of the audio-related operations of the upper layer (such as masking off). An interface for reading audio Codec data in the audio management module).
二、对用于存储所述音频数据的音频数据存储区进行访问权限设置,此时只有通话UI或者通信框架才可以访问。2. The access permission setting is performed on the audio data storage area for storing the audio data, and only the call UI or the communication frame can be accessed.
三、在所述音频管理模块通知Codec驱动此时处于加密通话过程中(也即是在所述音频管理模块通知所述Codec驱动将音频Codec切换到工作状态中),对所述Codec驱动进行接口权限设置,只允许音频管理模块进行操作或者屏蔽所有对所述音频数据相关操作的接口。3. The audio management module notifies the Codec driver that it is in the process of encrypting the call (that is, when the audio management module notifies the Codec driver to switch the audio Codec to the working state), and interfaces the Codec driver. The permission settings only allow the audio management module to operate or block all interfaces to the audio data related operations.
本发明实施例可通过在终端执行加密通话业务时,如果接收到所述加密通话业务的音频数据的录制请求时,检测发起所述录制请求的发起方的权限;若所述权限为用于指示有权对所述加密通话业务的音频数据进行录制的录制权限,则获取终端对通话语音进行音频编码得到的音频数据,进一步地将获取到的所述音频数据存储至预置的录音存储区;这样可提升语音处理的安全性和方便快捷性。In the embodiment of the present invention, when the terminal performs the encrypted call service, if the recording request of the audio data of the encrypted call service is received, the right of the initiator that initiates the recording request is detected; if the permission is used for the indication Obtaining the recording permission for recording the audio data of the encrypted call service, acquiring audio data obtained by the terminal to audio-encode the call voice, and further storing the obtained audio data into a preset recording storage area; This can improve the security and convenience of voice processing.
请一并参阅图4是本发明实施例的另一种语音处理方法的流程示意图,本发明实施例的所述方法还包括如下步骤。FIG. 4 is a schematic flowchart of another voice processing method according to an embodiment of the present invention. The method of the embodiment of the present invention further includes the following steps.
S201、在终端的音频管理模块中设置对加密通话业务的音频数据进行录制的录制权限,设置的具有所述录制权限的发起方包括:通话UI、通信框架、或者音频编码驱动。S201: Set a recording permission for recording audio data of the encrypted call service in the audio management module of the terminal, where the set initiator having the recording permission includes: a call UI, a communication frame, or an audio coding driver.
S202、当所述终端执行所述加密通话业务时,如果接收到所述加密通话业务的音频数据的录制请求时,检测发起所述录制请求的发起方的权限。S202. When the terminal performs the encrypted call service, if the recording request of the audio data of the encrypted call service is received, detecting the authority of the initiator that initiates the recording request.
S203、判断发起所述录制请求的发起的权限是否为用于指示有权对所述加密通话业务的音频数据进行录制的录制权限。 S203. Determine whether the permission to initiate the initiation of the recording request is a recording permission for indicating that the audio data of the encrypted call service is recorded.
本发明实施例中,CPU可以判断S202中检测到的发起所述录制请求的发起方是否有权对所述加密通话业务的音频数据进行录制;若是,则所述CPU继续执行步骤S204;否则,执行S206。In the embodiment of the present invention, the CPU may determine whether the initiator that initiates the recording request detected in S202 has the right to record the audio data of the encrypted call service; if yes, the CPU continues to perform step S204; otherwise, Execute S206.
S204、调用具有录制权限的所述音频编码驱动来获取对通话语音进行音频编码得到的音频数据。S204. Call the audio encoding driver with recording permission to obtain audio data obtained by audio encoding the call voice.
S205、将获取到的所述音频数据存储至预置的录音存储区。S205. Store the acquired audio data into a preset recording storage area.
S206、发送安全提示信息;其中,所述安全提示信息用于提示所述发送方无权对所述加密通话业务的音频数据进行录制。S206: Send security prompt information, where the security prompt information is used to prompt the sender to have no right to record audio data of the encrypted call service.
请一并参阅图5是本发明实施例的另一种语音处理方法的流程示意图,本发明实施例的所述方法包括如上所述的步骤S201至步骤S206,还可以包括:5 is a schematic flowchart of another voice processing method according to an embodiment of the present invention. The method in the embodiment of the present invention includes the steps S201 to S206 as described above, and may further include:
S301、如果接收到对所述录音存储区中存储的音频数据的访问请求时,检测发起所述访问请求的发起方的权限;S301. If an access request for the audio data stored in the recording storage area is received, detecting an authority of an initiator that initiates the access request;
S302、判断发起所述访问请求的发起方的权限是否为用于指示有权对所述录音存储区中存储的音频数据进行访问的权限。S302. Determine whether the authority of the initiator that initiates the access request is a right for indicating that the right to access the audio data stored in the recording storage area.
本发明实施例中,CPU可以判断S301中检测到的发起所述访问请求的发起方是否有权对所述加密通话业务的音频数据进行访问的权限;若是,则所述CPU继续执行步骤S303;否则,结束流程或者所述CPU可以发送一个或者多个用于提示所述发起方无权访问所述音频数据的提示信息。In the embodiment of the present invention, the CPU may determine whether the initiator that initiated the access request detected in S301 has the right to access the audio data of the encrypted call service; if yes, the CPU proceeds to step S303; Otherwise, the process is terminated or the CPU may send one or more prompt information for prompting the originator not to access the audio data.
S303、响应所述访问请求以返回所述录音存储区中存储的音频数据。S303. Respond to the access request to return audio data stored in the recording storage area.
本发明实施例可通过在终端执行加密通话业务时,如果接收到所述加密通话业务的音频数据的录制请求时,检测发起所述录制请求的发起方的权限;若所述权限为用于指示有权对所述加密通话业务的音频数据进行录制的录制权限,则获取终端对通话语音进行音频编码得到的音频数据,进一步地将获取到的所述音频数据存储至预置的录音存储区;这样可提升语音处理的安全性和方便快捷性。In the embodiment of the present invention, when the terminal performs the encrypted call service, if the recording request of the audio data of the encrypted call service is received, the right of the initiator that initiates the recording request is detected; if the permission is used for the indication Obtaining the recording permission for recording the audio data of the encrypted call service, acquiring audio data obtained by the terminal to audio-encode the call voice, and further storing the obtained audio data into a preset recording storage area; This can improve the security and convenience of voice processing.
请参见图6,是本发明实施例的一种语音处理装置的结构示意图,本发明实施例的所述装置6包括:FIG. 6 is a schematic structural diagram of a voice processing apparatus according to an embodiment of the present invention. The apparatus 6 of the embodiment of the present invention includes:
检测模块60,用于当终端执行加密通话业务时,如果接收到所述加密通话业务的音频数据的录制请求时,检测发起所述录制请求的发起方的权限; The detecting module 60 is configured to: when the terminal performs the encrypted call service, if the recording request of the audio data of the encrypted call service is received, detecting the permission of the initiator that initiates the recording request;
获取模块61,用于若所述检测模块60检测到的权限为用于指示有权对所述加密通话业务的音频数据进行录制的录制权限,则获取终端对通话语音进行音频编码得到的音频数据;The obtaining module 61 is configured to: if the permission detected by the detecting module 60 is a recording permission for indicating that the audio data of the encrypted calling service is recorded, acquiring the audio data obtained by the terminal performing audio encoding on the call voice ;
存储模块62,用于将所述获取模块61获取到的所述音频数据存储至预置的录音存储区。The storage module 62 is configured to store the audio data acquired by the obtaining module 61 to a preset recording storage area.
本发明实施例可通过在终端执行加密通话业务时,如果接收到所述加密通话业务的音频数据的录制请求时,检测发起所述录制请求的发起方的权限;若所述权限为用于指示有权对所述加密通话业务的音频数据进行录制的录制权限,则获取终端对通话语音进行音频编码得到的音频数据,进一步地将获取到的所述音频数据存储至预置的录音存储区;这样可提升语音处理的安全性和方便快捷性。In the embodiment of the present invention, when the terminal performs the encrypted call service, if the recording request of the audio data of the encrypted call service is received, the right of the initiator that initiates the recording request is detected; if the permission is used for the indication Obtaining the recording permission for recording the audio data of the encrypted call service, acquiring audio data obtained by the terminal to audio-encode the call voice, and further storing the obtained audio data into a preset recording storage area; This can improve the security and convenience of voice processing.
本发明实施例中涉及的各个模块的具体实现可参考图1至图5对应实施例中相关功能模块或者实施步骤的描述,在此不赘述。For specific implementations of the various modules involved in the embodiments of the present invention, reference may be made to the description of related functional modules or implementation steps in the corresponding embodiments in FIG. 1 to FIG. 5, and details are not described herein.
请一并参阅图7,是本发明实施例的另一种语音处理装置的结构示意图,本发明实施例的所述装置7包括如上所述的检测模块60、获取模块61、存储模块62,还包括:FIG. 7 is a schematic structural diagram of another voice processing apparatus according to an embodiment of the present invention. The apparatus 7 of the embodiment of the present invention includes the detecting module 60, the obtaining module 61, and the storage module 62 as described above. include:
设置模块63,用于在所述终端的音频管理模块中设置对所述加密通话业务的音频数据进行录制的录制权限,设置的具有所述录制权限的发起方包括:通话UI、通信框架、或者音频编码驱动。a setting module 63, configured to set a recording permission for recording audio data of the encrypted call service in an audio management module of the terminal, where the set initiator having the recording permission includes: a call UI, a communication frame, or Audio code driver.
其中可选地,在本发明实施例中,所述装置还包括:Optionally, in the embodiment of the present invention, the device further includes:
发送模块64,用于若所述检测模块60检测到的权限为用于指示无权对所述加密通话业务的音频数据进行录制的权限,则发送安全提示信息;The sending module 64 is configured to send the security prompt information if the right detected by the detecting module 60 is used to indicate that the right is not authorized to record the audio data of the encrypted calling service;
其中,所述安全提示信息用于提示所述发送方无权对所述加密通话业务的音频数据进行录制。The security prompt information is used to prompt the sender that the audio data of the encrypted call service is not recorded.
其中可选地,在本发明实施例中,Optionally, in the embodiment of the present invention,
所述获取模块61,具体用于调用具有录制权限的所述音频编码驱动来获取对通话语音进行音频编码得到的音频数据。The obtaining module 61 is specifically configured to invoke the audio encoding driver with recording permission to obtain audio data obtained by audio encoding the call voice.
其中可选地,在本发明实施例中,Optionally, in the embodiment of the present invention,
所述检测模块60,还用于如果接收到对所述录音存储区中存储的音频数据 的访问请求时,检测发起所述访问请求的发起方的权限;The detecting module 60 is further configured to: if the audio data stored in the recording storage area is received When accessing the request, detecting the authority of the initiator that initiated the access request;
所述发送模块64,还用于若所述检测模块60检测到的权限为用于指示有权对所述录音存储区中存储的音频数据进行访问的权限,则响应所述检测模块检测到的访问请求以返回所述录音存储区中存储的音频数据。The sending module 64 is further configured to: if the permission detected by the detecting module 60 is used to indicate that the right to access the audio data stored in the recording storage area, the response module detects The request is accessed to return the audio data stored in the recording storage area.
本发明实施例可通过在终端执行加密通话业务时,如果接收到所述加密通话业务的音频数据的录制请求时,检测发起所述录制请求的发起方的权限;若所述权限为用于指示有权对所述加密通话业务的音频数据进行录制的录制权限,则获取终端对通话语音进行音频编码得到的音频数据,进一步地将获取到的所述音频数据存储至预置的录音存储区;这样可提升语音处理的安全性和方便快捷性。In the embodiment of the present invention, when the terminal performs the encrypted call service, if the recording request of the audio data of the encrypted call service is received, the right of the initiator that initiates the recording request is detected; if the permission is used for the indication Obtaining the recording permission for recording the audio data of the encrypted call service, acquiring audio data obtained by the terminal to audio-encode the call voice, and further storing the obtained audio data into a preset recording storage area; This can improve the security and convenience of voice processing.
本发明实施例中涉及的各个模块的具体实现可参考图1至图5对应实施例中相关功能模块或者实施步骤的描述,在此不赘述。For specific implementations of the various modules involved in the embodiments of the present invention, reference may be made to the description of related functional modules or implementation steps in the corresponding embodiments in FIG. 1 to FIG. 5, and details are not described herein.
再请参见图8,是本发明实施例的一种终端的结构示意图。所述终端可以为智能手机、平板电脑、智能可穿戴设备等带通信网络功能的设备,如图8所示,本发明实施例的所述终端可以包括显示屏、按键、扬声器、拾音器等模块,并且还包括:至少一个总线501、与总线501相连的至少一个处理器502以及与总线501相连的至少一个存储器503,实现通信功能的通信装置505,为通信终端各耗电模块供电的电源装置504。Referring to FIG. 8, FIG. 8 is a schematic structural diagram of a terminal according to an embodiment of the present invention. The terminal may be a device with a communication network function, such as a smart phone, a tablet computer, or a smart wearable device. As shown in FIG. 8 , the terminal in the embodiment of the present invention may include a display screen, a button, a speaker, a pickup, and the like. And further comprising: at least one bus 501, at least one processor 502 connected to the bus 501, and at least one memory 503 connected to the bus 501, a communication device 505 implementing a communication function, and a power supply device 504 for powering each power consumption module of the communication terminal. .
所述处理器502可通过总线501,调用存储器503中存储的代码以执行相关的功能。The processor 502 can call the code stored in the memory 503 via the bus 501 to perform related functions.
所述处理器502,用于当终端执行加密通话业务时,如果接收到所述加密通话业务的音频数据的录制请求时,检测发起所述录制请求的发起方的权限;若所述权限为用于指示有权对所述加密通话业务的音频数据进行录制的录制权限,则获取终端对通话语音进行音频编码得到的音频数据;将获取到的所述音频数据存储至预置的录音存储区。The processor 502 is configured to: when the terminal performs the encrypted call service, if the recording request of the audio data of the encrypted call service is received, detecting the permission of the initiator that initiates the recording request; if the permission is used And the recording permission for recording the audio data of the encrypted call service is obtained, the audio data obtained by the terminal to audio-encode the call voice is obtained; and the obtained audio data is stored in the preset recording storage area.
进一步可选地,所述处理器502还用于在所述终端的音频管理模块中设置对所述加密通话业务的音频数据进行录制的录制权限,设置的具有所述录制权限的发起方包括:通话UI、通信框架、或者音频编码驱动。Further, optionally, the processor 502 is further configured to set a recording permission for recording the audio data of the encrypted call service in the audio management module of the terminal, where the set initiator with the recording permission includes: Call UI, communication framework, or audio coding driver.
进一步可选地,所述处理器502还用于若所述权限为用于指示无权对所述 加密通话业务的音频数据进行录制的权限,则发送安全提示信息;其中,所述安全提示信息用于提示所述发送方无权对所述加密通话业务的音频数据进行录制。Further optionally, the processor 502 is further configured to: if the permission is used to indicate that the right is not The security prompt information is sent to the audio data of the encrypted call service, and the security prompt information is used to prompt the sender to have no right to record the audio data of the encrypted call service.
进一步可选地,所述处理器502还用于调用具有录制权限的所述音频编码驱动来获取对通话语音进行音频编码得到的音频数据。Further optionally, the processor 502 is further configured to invoke the audio encoding driver with recording permission to obtain audio data obtained by audio encoding the call voice.
进一步可选地,所述处理器502还用于如果接收到对所述录音存储区中存储的音频数据的访问请求时,检测发起所述访问请求的发起方的权限;若所述权限为用于指示有权对所述录音存储区中存储的音频数据进行访问的权限,则响应所述访问请求以返回所述录音存储区中存储的音频数据。Further, optionally, the processor 502 is further configured to: if the access request for the audio data stored in the recording storage area is received, detect the authority of the initiator that initiates the access request; if the permission is used In response to the right to access the audio data stored in the recording storage area, the access request is returned in response to the audio data stored in the recording storage area.
本发明实施例可通过在终端执行加密通话业务时,如果接收到所述加密通话业务的音频数据的录制请求时,检测发起所述录制请求的发起方的权限;若所述权限为用于指示有权对所述加密通话业务的音频数据进行录制的录制权限,则获取终端对通话语音进行音频编码得到的音频数据,进一步地将获取到的所述音频数据存储至预置的录音存储区;这样可提升语音处理的安全性和方便快捷性。In the embodiment of the present invention, when the terminal performs the encrypted call service, if the recording request of the audio data of the encrypted call service is received, the right of the initiator that initiates the recording request is detected; if the permission is used for the indication Obtaining the recording permission for recording the audio data of the encrypted call service, acquiring audio data obtained by the terminal to audio-encode the call voice, and further storing the obtained audio data into a preset recording storage area; This can improve the security and convenience of voice processing.
本发明实施例还提供一种计算机存储介质,其中,该计算机存储介质可存储有程序,该程序执行时包括上述方法实施例中记载的任何音频播放应用的操作方法的部分或全部步骤。The embodiment of the present invention further provides a computer storage medium, wherein the computer storage medium can store a program, and the program includes some or all of the steps of the operation method of any of the audio playback applications described in the foregoing method embodiments.
需要说明的是,对于前述的各方法实施例,为了简单描述,故将其都表述为一系列的动作组合,但是本领域技术人员应该知悉,本发明并不受所描述的动作顺序的限制,因为依据本发明,某些步骤可以采用其他顺序或者同时进行。其次,本领域技术人员也应该知悉,说明书中所描述的实施例均属于优选实施例,所涉及的动作和模块并不一定是本发明所必须的。It should be noted that, for the foregoing method embodiments, for the sake of simple description, they are all expressed as a series of action combinations, but those skilled in the art should understand that the present invention is not limited by the described action sequence. Because certain steps may be performed in other sequences or concurrently in accordance with the present invention. In addition, those skilled in the art should also understand that the embodiments described in the specification are all preferred embodiments, and the actions and modules involved are not necessarily required by the present invention.
在上述实施例中,对各个实施例的描述都各有侧重,某个实施例中没有详述的部分,可以参见其他实施例的相关描述。In the above embodiments, the descriptions of the various embodiments are different, and the details that are not detailed in a certain embodiment can be referred to the related descriptions of other embodiments.
在本申请所提供的几个实施例中,应该理解到,所揭露的装置,可通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如所述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可 以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性或其它的形式。In the several embodiments provided herein, it should be understood that the disclosed apparatus may be implemented in other ways. For example, the device embodiments described above are merely illustrative. For example, the division of the unit is only a logical function division. In actual implementation, there may be another division manner, for example, multiple units or components may be combined or may be Integrate into another system, or some features can be ignored or not executed. Another point, the mutual coupling or direct coupling or communication connection shown or discussed may be The indirect coupling or communication connection through some interfaces, devices or units may be in electrical or other form.
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
另外,在本发明的各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。In addition, each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit. The above integrated unit can be implemented in the form of hardware or in the form of a software functional unit.
所述集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的全部或部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可为个人计算机、服务器或者网络设备等)执行本发明各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、移动硬盘、磁碟或者光盘等各种可以存储程序代码的介质。The integrated unit, if implemented in the form of a software functional unit and sold or used as a standalone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention, which is essential or contributes to the prior art, or all or part of the technical solution, may be embodied in the form of a software product stored in a storage medium. A number of instructions are included to cause a computer device (which may be a personal computer, server or network device, etc.) to perform all or part of the steps of the methods described in various embodiments of the present invention. The foregoing storage medium includes: a U disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a removable hard disk, a magnetic disk, or an optical disk, and the like. .
以上所述,以上实施例仅用以说明本发明的技术方案,而非对其限制;尽管参照前述实施例对本发明进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本发明各实施例技术方案的范围。 The above embodiments are only used to illustrate the technical solutions of the present invention, and are not intended to be limiting; although the present invention has been described in detail with reference to the foregoing embodiments, those skilled in the art will understand that The technical solutions described in the embodiments are modified, or some of the technical features are replaced by equivalents; and the modifications or substitutions do not deviate from the scope of the technical solutions of the embodiments of the present invention.

Claims (11)

  1. 一种语音处理的方法,其特征在于,所述方法包括:A method of voice processing, the method comprising:
    当终端执行加密通话业务时,如果接收到所述加密通话业务的音频数据的录制请求时,检测发起所述录制请求的发起方的权限;When the terminal performs the encrypted call service, if the recording request of the audio data of the encrypted call service is received, the authority of the initiator that initiates the recording request is detected;
    若所述权限为用于指示有权对所述加密通话业务的音频数据进行录制的录制权限,则获取终端对通话语音进行音频编码得到的音频数据;If the permission is a recording permission for indicating that the audio data of the encrypted call service is recorded, acquiring audio data obtained by the terminal performing audio coding on the call voice;
    将获取到的所述音频数据存储至预置的录音存储区。The obtained audio data is stored to a preset recording storage area.
  2. 如权利要求1所述的方法,其特征在于,所述当终端执行加密通话业务时,如果接收到所述加密通话业务的音频数据的录制请求时,检测发起所述录制请求的发起方的录制权限之前,还包括:The method according to claim 1, wherein when the terminal performs the encrypted call service, if the recording request of the audio data of the encrypted call service is received, detecting the recording of the initiator that initiated the recording request Before the permissions, it also includes:
    在所述终端的音频管理模块中设置对所述加密通话业务的音频数据进行录制的录制权限,设置的具有所述录制权限的发起方包括:通话UI、通信框架、或者音频编码驱动。Recording permission for recording the audio data of the encrypted call service is set in the audio management module of the terminal, and the set initiator having the recording permission includes: a call UI, a communication frame, or an audio coding driver.
  3. 如权利要求2所述的方法,其特征在于,所述获取终端对通话语音进行音频编码得到的音频数据,包括:The method of claim 2, wherein the obtaining audio data obtained by the terminal for audio encoding the call voice comprises:
    调用具有录制权限的所述音频编码驱动来获取对通话语音进行音频编码得到的音频数据。The audio encoding driver having recording authority is called to acquire audio data obtained by audio encoding the call voice.
  4. 如权利要求1所述的方法,其特征在于,还包括:The method of claim 1 further comprising:
    若所述权限为用于指示无权对所述加密通话业务的音频数据进行录制的权限,则发送安全提示信息;Sending the security prompt information if the permission is for indicating that the audio data of the encrypted call service is not authorized to be recorded;
    其中,所述安全提示信息用于提示所述发送方无权对所述加密通话业务的音频数据进行录制。The security prompt information is used to prompt the sender that the audio data of the encrypted call service is not recorded.
  5. 如权利要求1所述的方法,其特征在于,还包括:The method of claim 1 further comprising:
    如果接收到对所述录音存储区中存储的音频数据的访问请求时,检测发起所述访问请求的发起方的权限; If the access request for the audio data stored in the recording storage area is received, detecting the authority of the initiator that initiated the access request;
    若所述权限为用于指示有权对所述录音存储区中存储的音频数据进行访问的权限,则响应所述访问请求以返回所述录音存储区中存储的音频数据。And if the permission is for indicating the right to access the audio data stored in the recording storage area, responding to the access request to return the audio data stored in the recording storage area.
  6. 一种语音处理的装置,其特征在于,所述装置包括:A device for voice processing, characterized in that the device comprises:
    检测模块,用于当终端执行加密通话业务时,如果接收到所述加密通话业务的音频数据的录制请求时,检测发起所述录制请求的发起方的权限;a detecting module, configured to: when receiving the recording request of the audio data of the encrypted call service, when the terminal performs the encrypted call service, detecting the authority of the initiator that initiates the recording request;
    获取模块,用于若所述检测模块检测到的权限为用于指示有权对所述加密通话业务的音频数据进行录制的录制权限,则获取终端对通话语音进行音频编码得到的音频数据;And an obtaining module, configured to: if the permission detected by the detecting module is a recording permission for indicating that the audio data of the encrypted calling service is recorded, acquiring audio data obtained by the terminal performing audio encoding on the call voice;
    存储模块,用于将所述获取模块获取到的所述音频数据存储至预置的录音存储区。And a storage module, configured to store the audio data acquired by the acquiring module to a preset recording storage area.
  7. 如权利要求6所述的装置,其特征在于,所述装置还包括:The device of claim 6 wherein said device further comprises:
    设置模块,用于在所述终端的音频管理模块中设置对所述加密通话业务的音频数据进行录制的录制权限,设置的具有所述录制权限的发起方包括:通话UI、通信框架、或者音频编码驱动。a setting module, configured to set a recording permission for recording audio data of the encrypted call service in an audio management module of the terminal, where the set initiator having the recording permission includes: a call UI, a communication frame, or an audio Code driver.
  8. 如权利要求7所述的装置,其特征在于,The device of claim 7 wherein:
    所述获取模块,具体用于调用具有录制权限的所述音频编码驱动来获取对通话语音进行音频编码得到的音频数据。The obtaining module is specifically configured to invoke the audio encoding driver with recording permission to obtain audio data obtained by audio encoding the call voice.
  9. 如权利要求6所述的装置,其特征在于,所述装置还包括:The device of claim 6 wherein said device further comprises:
    发送模块,用于若所述检测模块检测到的权限为用于指示无权对所述加密通话业务的音频数据进行录制的权限,则发送安全提示信息;a sending module, configured to send security prompt information if the right detected by the detecting module is used to indicate that the audio data of the encrypted calling service is not authorized to be recorded;
    其中,所述安全提示信息用于提示所述发送方无权对所述加密通话业务的音频数据进行录制。The security prompt information is used to prompt the sender that the audio data of the encrypted call service is not recorded.
  10. 如权利要求6所述的装置,其特征在于,The device of claim 6 wherein:
    所述检测模块,还用于如果接收到对所述录音存储区中存储的音频数据的访问请求时,检测发起所述访问请求的发起方的权限;The detecting module is further configured to detect, when the access request for the audio data stored in the recording storage area is received, the authority of the initiator that initiates the access request;
    所述发送模块,还用于若所述检测模块检测到的权限为用于指示有权对所述录音存储区中存储的音频数据进行访问的权限,则响应所述检测模块检测到的访问请求以返回所述录音存储区中存储的音频数据。The sending module is further configured to: if the permission detected by the detecting module is used to indicate that the right to access the audio data stored in the recording storage area, respond to the access request detected by the detecting module To return the audio data stored in the recording storage area.
  11. 一种终端,其特征在于,所述终端如权利要求6至10中任意一项所述 的语音处理装置。 A terminal, characterized in that the terminal is as claimed in any one of claims 6 to 10. Voice processing device.
PCT/CN2016/087609 2016-03-25 2016-06-29 Voice processing method and device, and terminal WO2017161724A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201610177451.1A CN105721492B (en) 2016-03-25 2016-03-25 A kind of method, apparatus and terminal of speech processes
CN201610177451.1 2016-03-25

Publications (1)

Publication Number Publication Date
WO2017161724A1 true WO2017161724A1 (en) 2017-09-28

Family

ID=56158315

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/087609 WO2017161724A1 (en) 2016-03-25 2016-06-29 Voice processing method and device, and terminal

Country Status (2)

Country Link
CN (1) CN105721492B (en)
WO (1) WO2017161724A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113779546A (en) * 2021-06-01 2021-12-10 武汉深之度科技有限公司 Recording permission management method, computing device and storage medium

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106301784B (en) * 2016-08-03 2020-06-16 南昌欧菲生物识别技术有限公司 Data acquisition method and terminal
CN107786753A (en) * 2017-11-08 2018-03-09 西安中科创达软件有限公司 A kind of mobile device and its method of controlling security
CN111462785B (en) * 2020-04-03 2021-09-28 惠州Tcl移动通信有限公司 Recording control method, recording control device, storage medium and mobile terminal
CN113938565A (en) * 2021-10-18 2022-01-14 北京博瑞彤芸科技股份有限公司 Data processing method and equipment based on telephone call
CN117135266B (en) * 2023-10-25 2024-03-22 Tcl通讯科技(成都)有限公司 Information processing method, device and computer readable storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101005534A (en) * 2006-01-18 2007-07-25 西安大唐电信有限公司 Method for realizing telephone recording function of telephone terminal
CN101127949A (en) * 2007-08-30 2008-02-20 中国移动通信集团重庆有限公司 A method for realizing instant recording service based on mobile communication network
CN101335585A (en) * 2008-07-24 2008-12-31 中兴通讯股份有限公司 Audio and video program separating method based on mobile terminal
CN102170617A (en) * 2011-04-07 2011-08-31 中兴通讯股份有限公司 Mobile terminal and remote control method thereof
CN103095752A (en) * 2011-10-31 2013-05-08 中兴通讯股份有限公司 Transcribing method, device and system of voice and video
CN105049592A (en) * 2015-05-27 2015-11-11 中国科学院信息工程研究所 Voice safety protection method and system for mobile intelligent terminal

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1414190B1 (en) * 2002-10-22 2007-03-07 Alcatel Method and system for informing a user about a WLAN accessibility
CN102572123B (en) * 2011-12-21 2014-10-22 成都三零瑞通移动通信有限公司 Method for monitoring call record uploading of eavesdropping software X undercover
CN104980406B (en) * 2014-04-11 2018-11-20 华为技术有限公司 Call recording method, recording server, user class interchanger and recording system
CN104113625B (en) * 2014-07-28 2016-02-17 努比亚技术有限公司 Talking recording system, method, device and mobile terminal

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101005534A (en) * 2006-01-18 2007-07-25 西安大唐电信有限公司 Method for realizing telephone recording function of telephone terminal
CN101127949A (en) * 2007-08-30 2008-02-20 中国移动通信集团重庆有限公司 A method for realizing instant recording service based on mobile communication network
CN101335585A (en) * 2008-07-24 2008-12-31 中兴通讯股份有限公司 Audio and video program separating method based on mobile terminal
CN102170617A (en) * 2011-04-07 2011-08-31 中兴通讯股份有限公司 Mobile terminal and remote control method thereof
CN103095752A (en) * 2011-10-31 2013-05-08 中兴通讯股份有限公司 Transcribing method, device and system of voice and video
CN105049592A (en) * 2015-05-27 2015-11-11 中国科学院信息工程研究所 Voice safety protection method and system for mobile intelligent terminal

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113779546A (en) * 2021-06-01 2021-12-10 武汉深之度科技有限公司 Recording permission management method, computing device and storage medium
CN113779546B (en) * 2021-06-01 2024-03-26 武汉深之度科技有限公司 Recording authority management method, computing device and storage medium

Also Published As

Publication number Publication date
CN105721492B (en) 2019-10-11
CN105721492A (en) 2016-06-29

Similar Documents

Publication Publication Date Title
WO2017161724A1 (en) Voice processing method and device, and terminal
US9031226B2 (en) Multi-stream-multipoint-jack audio streaming
WO2018054356A1 (en) Method for displaying information and terminal
US9571475B2 (en) Call encryption systems and methods
CN113411793A (en) Bluetooth communication method and terminal
WO2010080902A1 (en) System and method for recording calls in a communication system
CN104393994B (en) Audio data secure transmission method, system and terminal
JPWO2013014734A1 (en) ENCRYPTION DEVICE, ENCRYPTION METHOD, AND ENCRYPTION PROGRAM
WO2021031290A1 (en) Translation method and device for earphone pair, earphone pair and translation system
WO2020233218A1 (en) Information encryption method, information decryption method, and terminal
TW201328309A (en) Handling incoming calls systems and methods and accessing data method
US20170195817A1 (en) Simultaneous Binaural Presentation of Multiple Audio Streams
US20130202097A1 (en) Priority telephonic communications
CN103973696A (en) Data processing method of voice communication
TW201931814A (en) System and method for secure communication
CN107667553A (en) For the method and system for the audio session for establishing encryption
CN103974242B (en) A kind of data processing method of voice call
US20140179294A1 (en) Electronic device and method for transferring communication session
KR20110047390A (en) Method, apparatus and system for managing drm contents
WO2012024904A1 (en) Method and system for pre-accessing conference telephone and network side device
WO2021197235A1 (en) Hotspot sharing method and electronic device
JP5103950B2 (en) Call control server, personal information database, and voice data generation method
WO2012163127A1 (en) Speech processing method and system
KR20120109812A (en) Apparatus and method for sharing out data in portable terminal
KR101945174B1 (en) Program Stored in Recording Medium for Supporting Automatic Response Service

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16895075

Country of ref document: EP

Kind code of ref document: A1

122 Ep: pct application non-entry in european phase

Ref document number: 16895075

Country of ref document: EP

Kind code of ref document: A1