WO2020232631A1 - Voice frequency division transmission method, source terminal, playback terminal, source terminal circuit and playback terminal circuit - Google Patents

Voice frequency division transmission method, source terminal, playback terminal, source terminal circuit and playback terminal circuit Download PDF

Info

Publication number
WO2020232631A1
WO2020232631A1 PCT/CN2019/087811 CN2019087811W WO2020232631A1 WO 2020232631 A1 WO2020232631 A1 WO 2020232631A1 CN 2019087811 W CN2019087811 W CN 2019087811W WO 2020232631 A1 WO2020232631 A1 WO 2020232631A1
Authority
WO
WIPO (PCT)
Prior art keywords
frequency band
voice
source
link
voice signal
Prior art date
Application number
PCT/CN2019/087811
Other languages
French (fr)
Chinese (zh)
Inventor
郭仕林
Original Assignee
深圳市汇顶科技股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳市汇顶科技股份有限公司 filed Critical 深圳市汇顶科技股份有限公司
Priority to PCT/CN2019/087811 priority Critical patent/WO2020232631A1/en
Priority to CN201980000976.XA priority patent/CN110366752B/en
Publication of WO2020232631A1 publication Critical patent/WO2020232631A1/en

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/0017Lossless audio signal coding; Perfect reconstruction of coded audio signal by transmission of coding error
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing

Definitions

  • This application relates to the field of communications, in particular to a voice frequency division transmission method, a source end, a playback end, a source end circuit, and a playback end circuit.
  • lossy encoding with ultra-high compression rates is usually used for transmission, at the expense of sound quality in exchange for longer battery life.
  • Commonly used lossy coding such as: AAC (Advanced Audio Coding, advanced audio coding) coding method, SBC (Sub-band coding, sub-band coding) coding method or MP3 (Moving Picture Experts Group Audio Layer III, motion picture experts compress standard audio Level 3) Encoding methods, etc., usually the high frequency part is directly discarded to save bandwidth, but when the encoded data is received in this way, due to the loss of the high frequency part, the original voice signal cannot be obtained, so that it cannot provide users with more information. Good sound quality experience.
  • this application provides a voice frequency division transmission method, source end, playback end, Source end circuit and playback end circuit.
  • the first aspect of the embodiments of the present application provides a voice frequency division transmission method, including:
  • the source end encodes the first frequency band speech signal and the second frequency band speech signal; the source end marks the frame synchronization information into the encoded first frequency band speech signal and the encoded second frequency band speech signal; the source end passes the first synchronization
  • the link and the second synchronization link respectively send the encoded first frequency band speech signal with frame synchronization information and the encoded second frequency band speech signal with frame synchronization information to the playback terminal.
  • an implementation of the first aspect includes: the compression rate of the encoding method of the second frequency band speech signal is higher than the compression rate of the encoding method of the first frequency band speech signal; or the second frequency band speech The compression rate of the encoding method of the signal is lower than the compression rate of the encoding method of the voice signal in the first frequency band.
  • the source end encoding the second frequency band voice signal includes:
  • the source end encodes the high-frequency speech signal, and the source end encodes the high-frequency speech signal including CELT encoding or SBR encoding; or
  • the source end encodes the non-high frequency speech signal, and the source end encodes the non-high frequency speech signal including SILK encoding, SBC encoding, AAC encoding or MP3 encoding.
  • the source end marks the frame synchronization information into the encoded first frequency band speech signal and the encoded second frequency band speech signal include:
  • the source marks the encoded voice signal in any frequency band detected within a preset delay as a frame of data, and the frame synchronization information includes the start time and end time of a frame of data.
  • the source end sends the encoded second frequency band voice signal with frame synchronization information to the playback end through the second synchronization link Before, including:
  • the source end establishes a second synchronization link with the playback end according to the used link parameters.
  • the source end sends a second synchronization link request to the playback end
  • the source receives the reply to the second synchronization link request
  • the source terminal determines whether to establish the second synchronization link according to the response to the second synchronization link request.
  • the method before the source end encodes the voice signal in the second frequency band, the method further includes:
  • the source terminal judges whether the playback terminal supports voice frequency division transmission according to the control data stream, and the control data stream is transmitted through the asynchronous link.
  • the source controls the data flow Judging whether the playback end supports voice frequency division transmission includes:
  • the control data stream includes the value of the custom UUID. If the custom UUID value of the player received by the source is equal to the preset UUID value, the source determines that the player supports voice frequency division transmission.
  • the source terminal determines that the playback terminal supports voice frequency division transmission according to the control data stream, the source terminal sends an audio configuration parameter request to the playback terminal ;
  • the source terminal receives the audio configuration parameters corresponding to the voice signal in the second frequency band supported by the playback terminal.
  • the audio configuration parameters include coding and decoding parameters and bit rates, and the coding and decoding parameters include one or both of coding and decoding methods;
  • the source terminal determines the used audio configuration parameters according to the audio configuration parameters corresponding to the second frequency band voice signal supported by the playback terminal and sends the used audio configuration parameters to the playback terminal.
  • the number of second synchronization links is less than or equal to the number of frequency bands of the second frequency band voice signal
  • the source disconnects one or more second synchronization links according to the power situation or the link quality of the second synchronization link;
  • the source establishes one or more second synchronization links according to the power condition or the link quality of the second synchronization link.
  • the second aspect of the embodiments of the present application provides a voice frequency division transmission method, including:
  • the playback terminal receives the encoded first frequency band speech signal with frame synchronization information and the encoded second frequency band speech signal with frame synchronization information through the first synchronization link and the second synchronization link, respectively;
  • the playback terminal decodes the received voice signal in the first frequency band and the voice signal in the second frequency band;
  • the player uses the frame synchronization information to synchronize the decoded first frequency band speech signal and the decoded second frequency band speech signal.
  • the method includes:
  • the playback terminal performs digital-to-analog conversion on the synchronized voice signal in the first frequency band and the synchronized voice signal in the second frequency band through one or more different digital-to-analog converters;
  • the playback terminal amplifies the first frequency band voice signal after digital-to-analog conversion and the second frequency band voice signal after digital-to-analog conversion through one or more different amplifiers respectively;
  • the playback terminal performs electroacoustic conversion on the amplified first frequency band speech signal and the amplified second frequency band speech signal through one or more different electroacoustic converters.
  • the playback end sends the control data stream to the source end, so that the source end determines whether the playback end supports voice frequency division transmission according to the control data stream.
  • the playback terminal receives the second synchronization sent by the source terminal. Link request and send a reply to the second synchronization link request;
  • the playback end If the playback end supports voice frequency division transmission, the playback end sends the link parameters supported by the playback end to the source end.
  • the method before the playback end receives the second synchronization link request sent by the source end, the method includes:
  • the player receives the audio configuration parameter request sent by the source;
  • the playback terminal sends the audio configuration parameters corresponding to the second frequency band voice signal supported by the playback terminal to the source terminal;
  • the player receives the used audio configuration parameters sent by the source and configures the used audio configuration parameters.
  • the player disconnects one or more second synchronization links according to the power condition or the link quality of the second synchronization link ;
  • the playback end requests the source end to establish one or more second synchronization links according to the power condition or the link quality of the second synchronization link.
  • the third aspect of the embodiments of the present application provides a source terminal for voice frequency division transmission, and the source terminal includes:
  • Encoding module used to encode the first frequency band speech signal and the second frequency band speech signal
  • the pre-synchronization module is used to mark the frame synchronization information into the encoded first frequency band speech signal and the encoded second frequency band speech signal;
  • the first sending module is configured to send the encoded first frequency band speech signal with frame synchronization information and the encoded second frequency band speech signal with frame synchronization information through the first synchronization link and the second synchronization link, respectively To the player.
  • the compression rate of the encoding method of the second frequency band speech signal is higher than the compression rate of the encoding method of the first frequency band speech signal
  • the compression rate of the encoding method of the speech signal in the second frequency band is lower than the compression rate of the encoding method of the speech signal in the first frequency band.
  • the encoding module includes:
  • the high-frequency encoding module is used to encode high-frequency speech signals, and the encoding methods for high-frequency speech signals include CELT encoding or SBR encoding; or
  • the non-high frequency encoding module is used to encode non-high frequency speech signals.
  • the encoding methods for non-high frequency speech signals include SILK encoding, SBC encoding, AAC encoding or MP3 encoding.
  • the pre-synchronization module includes:
  • the data frame marking module is used to mark the encoded voice signal of any frequency band detected within a preset delay as one frame of data, and the frame synchronization information includes the start time and end time of a frame of data.
  • the source further includes:
  • the first parameter determination module the first sending module sends the encoded second frequency band voice signal with frame synchronization information to the playback terminal through the second synchronization link, and is used to send the playback terminal according to the link parameters supported by the playback terminal. Determine the link parameters used;
  • the link establishment module is used to establish a second synchronization link with the playback terminal according to the used link parameters.
  • the source further includes:
  • the second sending module, the first parameter determining module is used to send a second synchronization link request to the playback terminal before determining the link parameter to be used according to the link parameters supported by the playback terminal sent by the playback terminal;
  • the first receiving module is configured to receive a reply to the second synchronization link request
  • the first parameter determination module is further configured to determine whether to establish the second synchronization link according to the reply to the second synchronization link request.
  • the source further includes:
  • the first judging module before the encoding module encodes the voice signal in the second frequency band, is used to judge whether the playback end supports voice frequency division transmission according to the control data stream, and control the data flow through asynchronous link transmission.
  • the source also includes a UUID module for identifying the voice frequency division transmission service through a custom universally unique identification code (UUID);
  • UUID custom universally unique identification code
  • the first judgment module includes:
  • the second judgment module, the control data stream includes the value of the custom UUID, and if the received custom UUID value of the player is equal to the preset UUID value, it is used to determine that the player supports voice frequency division transmission.
  • the source further includes:
  • the third sending module if the first judging module determines that the playback end supports voice frequency division transmission according to the control data stream, it is used to send an audio configuration parameter request to the playback end;
  • the second receiving module used to receive the audio configuration parameters corresponding to the voice signal in the second frequency band supported by the player.
  • the audio configuration parameters include codec parameters and bit rates, and the codec parameters include one or two of the coding mode and the decoding mode ;as well as
  • the second parameter determination module is configured to determine the audio configuration parameters to be used according to the audio configuration parameters corresponding to the second frequency band voice signal supported by the playback terminal;
  • the third sending module is also used to send the used audio configuration parameters to the playback terminal.
  • the source further includes:
  • the first transmission control module is configured to disconnect one or more second synchronization links according to the power condition or the link quality of the second synchronization link, and the number of the second synchronization links is less than or equal to that of the second frequency band voice signal Number of frequency bands; or
  • the first transmission control module is also used to establish one or more second synchronization links according to the power condition or the link quality of the second synchronization link, and the number of the second synchronization links is less than or equal to the frequency band of the second frequency band voice signal number.
  • the fourth aspect of the embodiments of the present application provides a playback terminal for voice frequency division transmission.
  • the playback terminal includes:
  • the third receiving module is used to receive the encoded first frequency band speech signal with frame synchronization information and the encoded second frequency band speech signal with frame synchronization information through the first synchronization link and the second synchronization link, respectively ;
  • the decoding module is used to decode the received coded first frequency band speech signal and the coded second frequency band speech signal;
  • the synchronization module is used to synchronize the decoded first frequency band speech signal and the decoded second frequency band speech signal through frame synchronization information.
  • the playback terminal further includes:
  • One or more different digital-to-analog conversion modules for performing digital-to-analog conversion on the synchronized voice signal in the first frequency band and the voice signal in the second frequency band respectively;
  • One or more different amplifying modules for respectively amplifying the first frequency band voice signal and the second frequency band voice signal after digital-to-analog conversion
  • One or more different electro-acoustic conversion modules are used to perform electro-acoustic conversion on the amplified first frequency band voice signal and the second frequency band voice signal respectively.
  • the playback terminal further includes:
  • the fourth sending module, the third receiving module is used to send the control data stream to the source end before receiving the encoded second frequency band speech signal with frame synchronization information through the second synchronization link, so that the source end can control the data stream according to the Determine whether the playback terminal supports voice frequency division transmission.
  • the playback terminal further includes:
  • the fourth receiving module if the source terminal determines that the playback terminal supports voice frequency division transmission according to the control data stream, it is used to receive the second synchronization link request sent by the source terminal;
  • the fifth sending module is used to send a reply to the second synchronization link request
  • the fifth sending module is also used to send link parameters supported by the playback end to the source end.
  • the playback terminal further includes:
  • the fifth receiving module, the fourth receiving module is used to receive the audio configuration parameter request sent by the source end before receiving the second synchronization link request sent by the source end;
  • the sixth sending module is used to send the audio configuration parameters corresponding to the second frequency band voice signal supported by the playback terminal to the source terminal;
  • the fifth receiving module is also used to receive the audio configuration parameters used by the source end.
  • the parameter configuration module is used to configure the audio configuration parameters used.
  • the playback terminal further includes:
  • the second transmission control module is configured to disconnect one or more second synchronization links according to the power condition or the link quality of the second synchronization link;
  • the second transmission control module is further configured to request the source end to establish one or more second synchronization links according to the power condition or the link quality of the second synchronization link.
  • the fifth aspect of the embodiments of the present application provides a source terminal for voice frequency division transmission, including: a memory and a processor;
  • the memory is coupled to the processor
  • Memory used to store program instructions
  • the processor is configured to call the program instructions stored in the memory to enable the source to execute the voice frequency division transmission method described in the first aspect.
  • the sixth aspect of the embodiments of the present application provides a playback terminal for voice frequency division transmission, including: a memory and a processor;
  • the memory is coupled to the processor
  • Memory used to store program instructions
  • the processor is configured to call the program instructions stored in the memory to make the playback terminal execute the voice frequency division transmission method described in the second aspect.
  • the seventh aspect of the embodiments of the present application provides a computer-readable storage medium, including a computer program stored thereon, and the computer program is executed by a processor to implement the voice frequency division transmission method described in the first aspect.
  • An eighth aspect of the embodiments of the present application provides a computer-readable storage medium, including: a computer program stored thereon, wherein the computer program is executed by a processor to implement the voice frequency division described in the second aspect. Transmission method.
  • a ninth aspect of the embodiments of the present application provides a source circuit, including:
  • An encoder for encoding the first frequency band speech signal and the second frequency band speech signal
  • the source controller connected to the encoder, is used to send the encoded first frequency band speech signal with frame synchronization information and the encoded voice signal with frame synchronization information through the first synchronization link and the second synchronization link, respectively
  • the voice signal of the second frequency band is sent to the playback end circuit.
  • the source circuit further includes a filter, which is connected to the encoder and is used to separate the voice signal in the first frequency band from the voice signal in the second frequency band.
  • a tenth aspect of the embodiments of the present application provides a playback end circuit, including:
  • the player controller is used to receive the encoded first frequency band speech signal with frame synchronization information and the encoded second frequency band speech signal with frame synchronization information through the first synchronization link and the second synchronization link, respectively ;as well as
  • the decoder is connected to the controller of the playback terminal and is used to decode the received voice signal in the first frequency band and the voice signal in the second frequency band.
  • the playback end circuit further includes:
  • One or more different digital-to-analog converters respectively connected to the decoder, for performing digital-to-analog conversion on the decoded first frequency band speech signal and the decoded second frequency band speech signal respectively;
  • One or more different amplifiers respectively connected to one or more digital-to-analog converters for respectively amplifying the first-band voice signal after digital-to-analog conversion and the second-band voice signal after digital-to-analog conversion;
  • One or more different electro-acoustic converters are respectively connected to one or more amplifiers, and are used to perform electro-acoustic conversion on the amplified first frequency band speech signal and the amplified second frequency band speech signal respectively.
  • the advantageous effect of the embodiments of the present application is that: the embodiments of the present application provide a voice frequency division transmission method, a source end, a playback end, a source end circuit, and a playback end circuit, through a first synchronization link And the second synchronization link respectively send the encoded first frequency band speech signal with frame synchronization information and the encoded second frequency band speech signal with frame synchronization information to the playback end, which solves the problem caused by the limitation of transmission bandwidth The problem of sound quality degradation and the problem of affecting the audio being played when the sound quality is improved.
  • FIG. 1 is a flowchart of a voice frequency division transmission method according to an embodiment of the application
  • FIG. 2 is a flowchart of an embodiment of the application in which the source marks a voice signal in any frequency band after encoding detected within a preset delay as one frame of data;
  • FIG. 3 is a schematic diagram of a source terminal acquiring frame synchronization information and a playback terminal performing synchronization according to an embodiment of the application;
  • Fig. 4 is a schematic diagram of a UUID setting method according to an embodiment of the application.
  • FIG. 5 is a flowchart of another voice frequency division transmission method according to an embodiment of the application.
  • FIG. 6 is a schematic diagram of the structure of the source end of an embodiment of the application.
  • FIG. 7 is a schematic diagram of the structure of the playback terminal according to an embodiment of the application.
  • FIG. 8 is a schematic diagram of another source structure according to an embodiment of the application.
  • FIG. 9 is a schematic structural diagram of another playback terminal according to an embodiment of the application.
  • FIG. 10 is a schematic diagram of the structure of the source circuit and the playback circuit of an embodiment of the application.
  • FIG. 11 is a schematic structural diagram of another source-end circuit and a playback-end circuit according to an embodiment of the application.
  • the embodiment of the present application provides a voice frequency division transmission method, which can be applied to various audio electronic devices.
  • This embodiment describes a voice frequency division transmission method provided in the embodiment of the present application from the perspective of the source end.
  • the source end is the transmitting end of the voice signal
  • the playback end is the receiving end of the voice signal.
  • the source end can be an electronic device such as a mobile phone, TV, computer, tablet or mp3 that stores the voice signal, or one of the electronic devices or Multiple chips
  • the playback end may be an electronic device such as a speaker or earphone that plays the voice signal, or one or more chips in the electronic device.
  • the source terminal and the playback terminal in this embodiment can support various transmission protocols, such as Bluetooth low energy protocol, classic Bluetooth protocol, or wifi, which is not limited in this embodiment.
  • FIG. 1 is a flowchart of a voice frequency division transmission method according to an embodiment of the present application. The method includes the following steps:
  • the source end encodes the voice signal in the first frequency band and the voice signal in the second frequency band;
  • the number of frequency bands of the first frequency band speech signal and the second frequency band speech signal is not limited, and the first frequency band speech signal or the second frequency band speech signal may be a speech signal in one frequency band or multiple sub-band speech signals.
  • the first frequency band speech signal and the second frequency band speech signal can be obtained through filtering.
  • This embodiment does not limit the type of filter, which can be a low-pass, high-pass, or band-pass filter and a combination of multiple filters; in this embodiment, the first frequency band speech signal and the second frequency band speech signal can also be Obtained by other methods besides filtering.
  • the encoding method only encodes the voice signal of a specific frequency band, then there is no need to filter to obtain the voice signal of the specific frequency band, and the voice signal of all frequency bands can be directly transmitted to the encoder.
  • the coding of this frequency band taking alphabet coding as an example, syntax can realize the coding of voice signals in a specific frequency band without the need to obtain the specific frequency band by means of filtering or the like.
  • all frequency bands of the voice signal are 20Hz-20kHz.
  • the voice signal of the first frequency band and the voice signal of the second frequency band in this embodiment can be any frequency band in 20Hz-20kHz, or any frequency other than 20Hz-20kHz. Frequency band.
  • the source terminal marks the frame synchronization information into the encoded first frequency band speech signal and the encoded second frequency band speech signal;
  • the source end Since the source end sends the encoded first frequency band speech signal and the encoded second frequency band speech signal to the playback end through the first synchronization link and the second synchronization link, respectively, synchronization needs to be performed on the playback end to improve audio quality , And the playback end needs frame synchronization information when performing synchronization. Therefore, the source end can mark the frame synchronization information into the encoded first frequency band speech signal and the encoded second frequency band speech signal.
  • the source end sends the encoded first frequency band speech signal with frame synchronization information and the encoded second frequency band speech signal with frame synchronization information to the playback end through the first synchronization link and the second synchronization link, respectively .
  • the synchronization link can be used to transmit voice signals.
  • the synchronization link only allows a limited number of retransmissions. It is assumed that the original audio transmission method of the system uses the first synchronization link to transmit voice signals. This embodiment of the application It can be fully compatible with the original audio transmission method of the system. When the source end and the playback end use the original audio transmission method for audio transmission, this solution only needs to send the data with the first synchronization link and the second synchronization link on this basis.
  • the encoded first frequency band speech signal and second frequency band speech signal of the frame synchronization information are sent to the playback end without affecting the original audio transmission of the system.
  • the second frequency band speech signal is divided into multiple sub-band speech signals
  • multiple second synchronization links can be established to respectively transmit the multiple sub-band voice signals.
  • the number of second synchronization links is not limited, and it can be one or more.
  • the source end It is possible to establish only a second synchronization link with the playback terminal, and send the encoded second frequency band voice signal with frame synchronization information to the playback terminal through the second synchronization link.
  • the second frequency band voice signal The signal is divided into multiple sub-band voice signals, and the multiple sub-band voice signals can also be transmitted through the second synchronization link.
  • the multiple sub-band voice signals are transmitted through the second synchronization link, time division multiplexing the way.
  • only one second synchronization link may be established to transmit the second frequency band voice signal. If the second frequency band voice signal is divided into two sub-band voice signals, only one or two second synchronization links may be established To transmit these two sub-band voice signals.
  • the source end transmits the encoded first-band speech signal and second-band speech signal with frame synchronization information to the playback end through the first synchronization link and the second synchronization link, so that the playback end can further verify the received first frequency
  • the frequency band speech signal and the second frequency band speech signal are processed.
  • the original audio transmission method of the system uses a first synchronization link to transmit the voice signal of the main frequency band
  • the main frequency band is 20Hz-18kHz speech signal
  • the original audio transmission method of the system can use a first synchronization link to transmit
  • the voice signal of 20Hz-18kHz, and the voice signal of 18kHz-20kHz are usually discarded directly to save bandwidth, but this will cause the high frequency part of the voice signal received by the playback end to be lost, and the sound quality is reduced.
  • this embodiment can The encoded 18kHz-20kHz voice signal is transmitted to the playback end through the second synchronization link, and the integrity of the voice signal can be ensured at the source end, so that all or most of the frequency band audio signals are encoded and transmitted to improve
  • the first synchronization link and the second synchronization link can transmit voice signals in any frequency band, which is not limited in this embodiment.
  • the first synchronization link and the second synchronization link are used to respectively transmit the first frequency band voice signal and the second frequency band voice signal.
  • the system only needs to establish a second synchronization link to transmit the encoded second frequency band speech signal to improve the sound quality. Moreover, when establishing the second synchronization link or transmitting the encoded first frequency signal In the case of a two-band voice signal, it will not affect the first synchronization link to transmit the encoded first-band voice signal. Therefore, the user will not hear a short pause in the audio being played.
  • the embodiment of the present application provides a voice frequency division transmission method, which transmits an encoded first frequency band voice signal with frame synchronization information and an encoded voice signal with frame synchronization information through a first synchronization link and a second synchronization link, respectively
  • the second frequency band voice signal is sent to the playback end, which solves the problem of sound quality degradation caused by transmission bandwidth limitation and the problem of affecting the audio being played when the sound quality is improved.
  • the compression rate of the encoding method of the voice signal in the second frequency band is higher than the compression rate of the encoding method of the voice signal in the first frequency band;
  • the compression rate of the encoding method of the speech signal in the second frequency band is lower than the compression rate of the encoding method of the speech signal in the first frequency band.
  • the speech signal is divided into the first frequency band speech signal and the second frequency band speech signal.
  • the compression rate of the encoding method of the first frequency band speech signal and the second frequency band speech signal can be different.
  • the speech signal is mainly composed of the first frequency band speech signal
  • the compression rate of the encoding method of the second frequency band voice signal can be set to be lower than the compression rate of the encoding method of the first frequency band voice signal.
  • the source end encoding the second frequency band voice signal includes: the source end encoding the high-frequency voice signal, or the source end encoding the non-high-frequency voice signal.
  • the voice signal in the second frequency band can be a high-frequency voice signal or a non-high-frequency voice signal.
  • For high-frequency voice signals because they occupy more bandwidth, when encoding high-frequency voice signals, you can select those suitable for high-frequency voice signals. Encoding method to improve sound quality while saving some bandwidth. Due to the different characteristics of high-frequency voice signals and non-high-frequency voice signals, different encoding methods can be adopted for high-frequency voice signals and non-high-frequency voice signals.
  • the encoding methods of high-frequency voice signals of 18kHz-20kHz can include CELT Encoding method or SBR (Spectral Band Replication) encoding method, where CELT encoding method is a core encoding algorithm built into the alphabet encoder; encoding methods for non-high frequency speech signals can include SILK encoding method and SBC encoding method , AAC encoding method or MP3 encoding method, where the SILK encoding method is also a core encoding algorithm built in the alphabet encoder; it should be noted that in this embodiment, the second frequency band speech signal is a high-frequency speech signal or a non-high-frequency speech signal. Take the voice signal as an example.
  • CELT encoding method is a core encoding algorithm built into the alphabet encoder
  • encoding methods for non-high frequency speech signals can include SILK encoding method and SBC encoding method , AAC encoding method or MP3 encoding method, where the SILK encoding method is also a core
  • the user can divide the voice signal in the second frequency band into multiple sub-band voice signals according to requirements and then encode them separately, which is not limited in this embodiment.
  • the second synchronization link may be used to transmit the encoded high-frequency voice signal or the non-high-frequency voice signal.
  • the source end encoding the first frequency band speech signal and the second frequency band speech signal may be the same or different.
  • the source end encodes the first frequency band speech signal and the second frequency band speech signal in a different way, Save some bandwidth while improving sound quality.
  • the selection of encoding methods in this embodiment can be diversified. Taking the voice signal in the second frequency band as an example, if the voice signal in the second frequency band is divided into multiple sub-band voice signals, the multiple sub-band voice signals can also be the same or different. Encoding method. Choosing different encoding methods or setting different code rates or compression rates for voice signals of different frequency bands can provide users with a better sound quality experience by adjusting the data compression ratio and only increasing a small amount of bandwidth occupation.
  • the source terminal marks the frame synchronization information into the encoded first frequency band speech signal and the encoded second frequency band speech signal includes the source marking detected within a preset delay
  • the encoded voice signal in any frequency band is a frame of data
  • the frame synchronization information includes the start time and end time of a frame of data.
  • the source can obtain the frame synchronization information and send it to the playback end, so that the playback end can obtain the frame synchronization information to perform synchronization.
  • FIG. 2 is an embodiment of the present application marking the coded voice signal in any frequency band detected within a preset delay time as a frame of data
  • the flow chart can specifically include the following steps:
  • the source detects a voice signal in any frequency band among the encoded voice signals in multiple frequency bands;
  • the source starts timing the first delay until the first delay is equal to the preset delay
  • the source marks the detected voice signals in multiple frequency bands within the first delay as the first frame of data;
  • the source After the first delay, the source again detects a voice signal in any frequency band among the encoded voice signals in multiple frequency bands;
  • the source starts timing the second delay until the second delay is equal to the preset delay
  • the source marks the voice signals of multiple frequency bands detected within the second delay as the second frame of data
  • the source After the second delay, the source again detects a voice signal in any frequency band among the encoded voice signals in multiple frequency bands;
  • the source starts timing the Nth delay until the Nth delay is equal to the preset delay, N>2 and an integer;
  • the source marks the voice signals of multiple frequency bands detected within the Nth delay as the Nth frame of data.
  • the source end After the source end marks the encoded speech signal of any frequency band detected within the preset delay as one frame of data, the speech signal of any frequency band contains the marking information, so the player can recognize the same frame of speech signal.
  • the source terminal obtains the frame synchronization information, and the frame synchronization information includes the start time and end time of a frame of data.
  • the value of N can be set by the user according to the data volume of the voice signal, which is not limited in this embodiment.
  • the source end sends the frame synchronization information to the playback end.
  • the playback end can use the frame synchronization information to synchronize the voice signal received by the playback end to further improve audio quality . You can choose to mark the start time and end time of the first frame data, the second frame data, and the Nth frame data into the corresponding frame of data, so that the player can synchronize according to the start time and end time.
  • the second synchronization link may also It is used to transmit the encoded non-high frequency voice signal, and this embodiment does not restrict it;
  • Figure 3(a) shows the source end processing the voice signal in the first frequency band and the voice signal in the second frequency band, as shown in Figure 3(a) ), this embodiment only takes the first frequency band speech signal and the second frequency band speech signal each having only one sub-band speech signal as an example, but the source end of this embodiment marks the encoded code detected within the preset delay
  • the method in which the voice signal in any frequency band is one frame of data can also be applied to an application scenario where the voice signal in the first frequency band and the voice signal in the second frequency band respectively have multiple sub-band voice signals.
  • the unencoded speech signal is divided into data frames.
  • the first frame can be divided into high-frequency speech signals and non-frequency speech signals through filters.
  • High-frequency voice signal and then encode the high-frequency voice signal and the non-high-frequency voice signal separately.
  • the high-frequency voice signal is encoded, it is 01DATA1, and the non-high-frequency voice signal is encoded as 01DATA2.
  • the voice signal 01DATA2 is transmitted to the playback end through the first synchronization link, and the encoded high-frequency voice signal 01DATA1 is transmitted to the playback end through the second synchronization link.
  • the encoded voice signal in the first frequency band and the voice signal in the second frequency band are also transmitted to the playback terminal according to the processing manner of the first frame.
  • Figure 3(c) shows the process of marking the encoded voice signal in any frequency band detected within the preset delay time as one frame of data at the source, as shown in Figure 3(c) .
  • the source sends a frame of data at an interval Interval.
  • the source starts to count the first delay when it detects a voice signal in any one of the encoded voice signals in multiple frequency bands
  • the source detects the encoded high-frequency voice signal 01DATA1 in the first frame of data, it starts timing the first delay Delay1 until the first delay is equal to the preset delay, which can be greater than , Is less than or equal to the interval time Interval, this embodiment does not limit the specific length of the preset delay.
  • the source marks the detected voice signals in multiple frequency bands within the first delay as the first frame of data.
  • the first frame of data is 01DATA1 and 01DATA2
  • the frame synchronization information includes the start time and end time of 01DATA1 and 01DATA2;
  • the source After the first delay Delay1, the source again detects the voice signal 02DATA1 in any frequency band among the encoded voice signals in multiple frequency bands, and then starts timing the second delay Delay2 until the second delay is equal to the preset delay ;
  • the source marks the 02DATA1 and 02DATA2 detected within the second delay as the second frame of data, and the frame synchronization information includes the start and end times of 01DATA1 and 01DATA2, and so on, until the source marks all the data.
  • the frame synchronization information can also include the end moments End1 and End2 of the first delay Delay1 and the second delay Delay2.
  • the frame synchronization information of the first frame of data can be directly written into each frame of data, for example, written to 01DATA1 and 01DATA2. , So that the player can obtain the frame synchronization information and perform synchronization.
  • synchronization can be performed. If the playback end obtains the frame synchronization information, the end of the first delay time End1 pair The first frame of data received after encoding starts to be decoded, and at the end of the second delay, End2 starts to decode the received second frame of data after encoding, and so on, to achieve synchronization; or one frame of data Start decoding to achieve synchronization; after the player obtains the frame synchronization information, since the frame synchronization information has been written into 01DATA1 and 01DATA2, the player can also perform synchronization based on the specific data in 01DATA1 and 01DATA2.
  • the method in which the source terminal marks the encoded voice signal in any frequency band detected within the preset delay time as one frame of data is only an exemplary description.
  • the technology in the art Personnel can refer to the solution of the embodiment of the present application, and without creative work, can also obtain other methods of obtaining frame synchronization information according to this embodiment.
  • the source end establishes a second synchronization link with the playback end according to the used link parameters.
  • the source end Before the source end sends the encoded second frequency band voice signal with frame synchronization information to the playback end through the second synchronization link, it needs to establish a second synchronization link with the playback end according to the link parameters used, and the source The terminal can determine the link parameters used according to the link parameters supported by the playback terminal sent by the playback terminal.
  • the source terminal can also send the link parameters supported by the source terminal to the playback terminal, so that the playback terminal can according to the link parameters supported by the source terminal and the link parameters supported by the playback terminal.
  • the link parameter selects one or more link parameters supported by both the source and the playback end and sends it to the source so that the source can determine the link parameters used.
  • the source end sends the link parameters supported by the source end to the playback end, which enables the playback end to determine the link parameters used.
  • the source end sends the link parameters supported by the source end to the playback end.
  • the link parameter selected by the player is the link parameter used, that is, the player can also determine
  • the player can determine the link parameters to be used according to the link parameters supported by the source sent by the source.
  • the link parameters used are determined to facilitate the establishment of the second synchronization link.
  • the link parameters include data frame size and data frame interval, PHY (PhysicalLayer, physical layer) communication information, etc.
  • the source end sends a second synchronization link request to the playback end
  • the source receives the reply to the second synchronization link request
  • the source terminal determines whether to establish the second synchronization link according to the reply of the second synchronization link request.
  • the playback end After the playback end receives the second synchronization link request sent by the source end, the playback end sends a reply to the second synchronization link request to the source end, and the source end determines whether to establish the second synchronization link according to the response of the second synchronization link request road.
  • the source sends a second synchronization link request to the playback end, so that the playback end can choose whether to accept the establishment of the second synchronization link.
  • the playback end chooses to accept the establishment of the second synchronization link, it sends the link parameters supported by the playback end to the source. So that the source can determine the link parameters used.
  • the playback end After the playback end receives the second synchronization link request sent by the source end, the playback end sends a reply to the second synchronization link request so that the source end determines whether to establish the second synchronization link, and the source end determines to establish the second synchronization link Later, the source terminal determines the link parameters used according to the link parameters supported by the playback terminal. It should be noted that the link parameters used need to be supported by both the playback terminal and the source terminal.
  • the source end before the source end encodes the voice signal in the second frequency band, the source end determines whether the playback end supports voice frequency division transmission according to the control data stream.
  • the source can send a request to control the data stream to the player through the asynchronous link.
  • the control data stream can be transmitted through the asynchronous link between the source and the player.
  • the asynchronous link allows unlimited Retransmission times, so no data packets will be lost; after the player receives the request for the control data stream sent by the source, the player sends the control data stream to the source, and the source can judge by the control data stream sent by the player Whether the playback end supports the voice frequency division transmission, perform a judgment before starting the voice frequency division transmission, so as not to increase energy consumption when the playback end does not support voice frequency division transmission, when the source end judges that the playback end does not support voice frequency division transmission , There is no need for voice frequency division transmission, and the source end will use the original audio transmission method of the system for audio transmission.
  • the source end judges whether the playback end supports voice frequency division transmission according to the control data stream includes:
  • the control data stream includes the value of the custom UUID. If the custom UUID value of the player received by the source is equal to the preset UUID value, the source determines that the player supports voice frequency division transmission. The source uses the value of a custom UUID to determine whether the player supports voice frequency division transmission, which can improve the device compatibility between the source and the player. If the value of the custom UUID of the player received by the source is equal to the preset UUID value, the source determines that the player supports voice frequency division transmission. Take the Bluetooth connection between the source and the player as an example. In the Bluetooth protocol, UUID is used to identify the service provided by the Bluetooth device.
  • the UUID type can be the primary service (Primary Service), characteristic (Characteristic), etc., and the user can customize The Universal Unique Identifier (UUID) identifies the voice frequency division transmission service.
  • the UUID setting method can refer to Figure 4, in Figure 4, Enhance Audio Value can represent the voice frequency division transmission service, the user can set the preset UUID value corresponding to Enhance Audio Value, and the preset UUID value of the main service and the enhanced service can be The same or different; in Figure 4, handle represents an index, which can help find the address of the UUID in the memory.
  • the specific value of OxXXXX is determined by the specific address of the UUID in the memory.
  • This embodiment does not limit the value of 0xXXXX
  • the values of Y and Z are determined by the size of the data corresponding to the UUID, and this embodiment does not limit this; in Figure 4, 0xOPQ is a user-defined UUID, and the user can refer to the format of the UUID value in Figure 4
  • Fig. 4 is only an exemplary description. In actual use, those skilled in the art can refer to the solutions of the embodiments of the present application, and without creative work, Other ways of setting the UUID value can also be obtained according to this embodiment.
  • the source terminal determines that the playback terminal supports voice frequency division transmission according to the control data stream, the following steps are performed:
  • the source sends an audio configuration parameter request to the player:
  • the source terminal receives the audio configuration parameters corresponding to the voice signal in the second frequency band supported by the playback terminal.
  • the audio configuration parameters include coding and decoding parameters and bit rates, and the coding and decoding parameters include one or both of coding and decoding methods;
  • the source terminal determines the used audio configuration parameters according to the audio configuration parameters corresponding to the second frequency band voice signal supported by the playback terminal and sends the used audio configuration parameters to the playback terminal.
  • the player will receive the audio configuration parameter request sent by the source. After the player receives the audio configuration parameter request from the source, the player will send the audio configuration parameters corresponding to the second frequency band voice signal supported by the player to At the source end, after the source end receives the audio configuration parameters corresponding to the second frequency band voice signal supported by the player, the source end determines the audio configuration parameters to be used according to the audio configuration parameters corresponding to the second frequency band voice signal supported by the player The audio configuration parameters are sent to the player. In addition, the audio configuration parameters used need to be audio configuration parameters supported by the source. The playback terminal receives the used audio configuration parameters sent by the source terminal and configures the used audio configuration parameters. After determining the used audio configuration parameters, the playback terminal can also configure the used audio configuration parameters.
  • the source end can also send the audio configuration parameters supported by the source end to the playback end, so that the playback end can according to the audio configuration parameters supported by the source end and the playback end support Select one or more audio configuration parameters that are supported by both the source and the player to send the audio configuration parameters to the source so that the source can determine the audio configuration parameters used.
  • the source end can send the audio configuration parameters supported by the source end to the playback end, so that the playback end can also determine the audio configuration parameters used.
  • the playback end selects one according to the audio configuration parameters supported by the source end and the audio configuration parameters supported by the playback end
  • the audio configuration parameter selected by the player is the audio configuration parameter used;
  • the audio configuration parameters include encoding and decoding parameters and bit rates
  • the encoding and decoding parameters include one or two of encoding and decoding
  • the source receives one of the encoding and decoding supported by the playback end.
  • the source can determine the encoding method used, so that the player can decode it after receiving the voice signal; after the player receives one or two of the encoding and decoding methods supported by the source, the player
  • the decoding method used can be determined so that the player can decode the received encoded first-band voice signal and second-band voice signal;
  • the encoding and decoding parameters include one or two of the encoding method or the decoding method, and It can include sampling depth and sampling rate.
  • the source can perform corresponding encoding according to the encoding method, and the playback end can correspond according to the decoding method.
  • the audio configuration parameters can also include the device number, device address, etc., to facilitate the mutual recognition of the source and the player.
  • the audio configuration parameters can also include the bit rate.
  • the bit rate can be set according to the data frame size. Setting the bit rate can further save bandwidth. You can set the bit rate of the voice signal in the second frequency band to be higher or lower than that of the voice signal in the first frequency band.
  • the bit rate of the high-frequency voice signal can be set to be lower than the bit rate of the non-high-frequency voice signal.
  • the bit rate of the high-frequency voice signal can be set to be lower than or equal to a percentage of the bit rate of the non-high-frequency voice signal. Twenty percent, this embodiment does not limit the specific value of the code rate.
  • the number of second synchronization links is less than or equal to the number of frequency bands of the second frequency band voice signal.
  • the number of second synchronization links may be equal to the number of frequency bands of the second frequency band voice signal, that is, each second synchronization link may only transmit one sub-band voice signal in the second frequency band voice signal;
  • the number of synchronization links may be less than the number of frequency bands of the second frequency band voice signal, that is, each second synchronization link may transmit two or two sub-band voice signals in the second frequency band voice signal.
  • the source can suspend the voice frequency division transmission according to the power situation or the link quality of the second synchronization link.
  • disconnecting one or more second synchronization links includes disconnecting part or all of the second synchronization links.
  • the source can disconnect one or more second synchronization links to partially or completely suspend the voice frequency division transmission to extend the battery life, so as not to occupy the bandwidth and reach the limit.
  • the source can re-establish the one or more second synchronization links according to the power situation or the link quality of the second synchronization link
  • the source can re-establish one or more The second synchronization link is to restart or enhance voice frequency division transmission.
  • one or more second synchronization links can be disconnected or one or more second synchronization links can be re-established according to the power situation or the link quality of the second synchronization link, whether it is the source end or
  • the playback end initiates the disconnection of one or more second synchronization links, and the initiated end should unconditionally accept the disconnection of one or more second synchronization links.
  • FIG. 5 is a flowchart of a voice frequency division transmission method provided by an embodiment of the application. Methods include:
  • the playback terminal receives the encoded first frequency band speech signal with frame synchronization information and the encoded second frequency band speech signal with frame synchronization information through the first synchronization link and the second synchronization link, respectively;
  • the playback terminal decodes the received voice signal in the first frequency band and the voice signal in the second frequency band;
  • the player uses the frame synchronization information to synchronize the decoded first frequency band speech signal and the decoded second frequency band speech signal.
  • the source end transmits the encoded first frequency band speech with frame synchronization information through the first synchronization link and the second synchronization link, respectively
  • the signal and the second frequency band voice signal are sent to the playback terminal.
  • the playback terminal receives the encoded first frequency band voice signal and the second frequency band voice signal with frame synchronization information, the encoded voice signal with frame synchronization information
  • the first frequency band speech signal and the second frequency band speech signal are decoded separately, and this embodiment does not limit the type of decoding mode.
  • the playback terminal can use the same decoding method for the received voice signal in the first frequency band and the voice signal in the second frequency band, or different decoding methods.
  • the playback terminal pair receives
  • the decoding methods of the encoded multiple sub-band voice signals can be the same or different. For example, if the player decodes the received encoded four sub-band voice signals, you can choose one, two, or three Or four decoding methods.
  • the playback terminal uses the frame synchronization information to synchronize the decoded voice signal in the first frequency band and the second frequency band, which can further improve the audio quality
  • the specific synchronization method is not limited.
  • the first synchronization link and the second synchronization link respectively receive the encoded first frequency band speech signal with frame synchronization information and the encoded second frequency band speech signal with frame synchronization information, which solves the problem The problem of sound quality degradation caused by the limitation of transmission bandwidth and the problem of affecting the audio being played when the sound quality is improved.
  • the user can Hear the sound.
  • the player uses the frame synchronization information to synchronize the decoded first frequency band speech signal and the decoded second frequency band speech signal, the following steps are included:
  • the playback terminal performs digital-to-analog conversion on the synchronized voice signal in the first frequency band and the synchronized voice signal in the second frequency band through one or more different digital-to-analog converters;
  • the playback terminal amplifies the voice signal in the first frequency band after digital-to-analog conversion and the voice signal in the second frequency band after digital-to-analog conversion through different one or more amplifiers;
  • the playback terminal performs electroacoustic conversion on the amplified first frequency band speech signal and the amplified second frequency band speech signal through one or more different electroacoustic converters.
  • the playback terminal After the playback terminal synchronizes the decoded voice signal in the first frequency band with the decoded voice signal in the second frequency band through the frame synchronization information sent by the source, the playback terminal can perform digital-to-analog conversion through a digital-to-analog converter, and then through a The amplifier is amplified, and then an electro-acoustic converter is used for electro-acoustic conversion; after the playback end synchronizes the decoded first frequency band speech signal and the decoded second frequency band speech signal, it can also synchronize the first frequency band
  • the voice signal and the voice signal in the second frequency band are respectively subjected to digital-to-analog conversion through different digital-to-analog converters.
  • two or more different digital-to-analog converters may be used to adapt to the characteristics of voice signals in different frequency bands.
  • the player can amplify the voice signal in the first frequency band and the voice signal in the second frequency band after digital-to-analog conversion through different amplifiers.
  • two or more different amplifiers can be used to adapt to voices in different frequency bands.
  • the playback terminal performs electro-acoustic conversion on the amplified voice signal in the first frequency band and the voice signal in the second frequency band through different electro-acoustic converters.
  • two or more different The electro-acoustic converter is adapted to the characteristics of voice signals in different frequency bands.
  • the electro-acoustic converter may be a device that converts electrical energy into sound energy, such as a speaker or a horn. The present embodiment does not limit the type of the electro-acoustic converter.
  • the playback terminal performs digital-to-analog conversion on the synchronized voice signal in the first frequency band and the voice signal in the second frequency band respectively, and different digital-to-analog converters can be used according to the characteristics of the voice signals passing through different frequency bands to compare the synchronized first frequency band.
  • the voice signal of the first frequency band and the voice signal of the second frequency band are respectively subjected to digital-to-analog conversion to further improve the audio quality.
  • the player amplifies the first-band voice signal and the second-band voice signal after the digital-to-analog conversion.
  • the playback terminal performs digital-to-analog conversion on the synchronized voice signal in the first frequency band and the voice signal in the second frequency band
  • the playback terminal performs digital-to-analog conversion on the voice signal in the first frequency band and the voice signal in the second frequency band.
  • Amplification according to the characteristics of voice signals in different frequency bands, different amplifiers can be used to amplify the synchronized voice signals in the first frequency band and the voice signals in the second frequency band to further improve the audio quality.
  • the voice signal in the first frequency band and the voice signal in the second frequency band are converted into electro-acoustics.
  • the playback terminal only performs digital-to-analog conversion on the synchronized voice signal in the first frequency band and the voice signal in the second frequency band through different digital-to-analog converters to further improve the audio quality;
  • the audio quality can be further improved by separately amplifying the voice signal in the first frequency band and the voice signal in the second frequency band after digital-to-analog conversion through different amplifiers; the playback end only passes the amplified voice signal in the first frequency band and the voice signal in the second frequency band.
  • Different electro-acoustic converters can also further improve the audio quality.
  • the playback end before the playback end receives the encoded second frequency band voice signal with frame synchronization information through the second synchronization link, the playback end can send a control data stream to the source end to Enable the source to judge whether the playback end supports voice frequency division transmission according to the control data stream, and determine whether the playback end supports voice frequency division transmission before starting voice frequency division transmission, so as to avoid increasing energy consumption when the playback end does not support voice frequency division transmission , Assuming that the playback end does not support voice frequency division transmission, and the source end does not determine whether the playback end supports voice frequency division transmission according to the control data stream, when the source end starts voice frequency division transmission, the source end will When the two-band speech signal is encoded, energy consumption begins to increase. When the source terminal determines that the playback terminal does not support voice frequency division transmission, the source terminal will continue to use the original audio transmission method of the system for audio transmission, instead of encoding both the first frequency band voice signal and the second frequency band voice signal.
  • the playback terminal if the source terminal determines that the playback terminal supports voice frequency division transmission according to the control data stream, the playback terminal will receive the second synchronization link request sent by the source terminal, and the playback terminal will receive After the second synchronization link request sent by the source end, the reply of the second synchronization link request is sent to the source end. If the playback end supports voice frequency division transmission, the playback end sends the link parameters supported by the playback end to the source end for the source end. Establish a second synchronization link.
  • the player receives the audio configuration parameter request sent by the source;
  • the playback terminal sends the audio configuration parameters corresponding to the second frequency band voice signal supported by the playback terminal to the source terminal;
  • the player receives the used audio configuration parameters sent by the source and configures the used audio configuration parameters.
  • the source sends an audio configuration parameter request to the player. After the player receives the audio configuration parameter request from the source, the player sends the audio configuration parameters corresponding to the second frequency band voice signal supported by the player to the source, and the source receives the playback After the audio configuration parameters corresponding to the second-band voice signal supported by the player, the source determines the audio configuration parameters used according to the audio configuration parameters corresponding to the second-band voice signal supported by the player and sends the used audio configuration parameters to the player. The player receives the used audio configuration parameters sent by the source and configures the used audio configuration parameters. At the same time, the source must also configure the used audio configuration parameters.
  • the playback end can disconnect one or more second synchronization links according to the power condition or the link quality of the second synchronization link.
  • the playback end can disconnect one or more second synchronization links according to the power condition or the link quality of the second synchronization link.
  • the playback end One or more second synchronization links can be disconnected to partially or completely stop the voice frequency division transmission to extend the endurance time.
  • the disconnected one or more The voice signal of the corresponding frequency band on the second synchronization link is not encoded and decoded; or when the link quality of the second synchronization link is poor, the player can disconnect one or more second synchronization links to avoid occupying bandwidth The effect of improving the sound quality is not achieved; in addition, after the voice frequency division transmission is partially or completely suspended, the player can request the source to re-establish one or more pieces of data according to the power situation or the link quality of the second synchronization link.
  • the second synchronization link is used to restart or enhance voice frequency division transmission.
  • the playback end can request the source end
  • the disconnected one or more second synchronization links are re-established to restart or enhance voice frequency division transmission.
  • one or more second synchronization links can be disconnected or one or more second synchronization links can be re-established according to the power situation or the link quality of the second synchronization link, whether it is the source end or
  • the playback end initiates the disconnection of one or more second synchronization links, and the initiated end should unconditionally accept the disconnection of one or more second synchronization links.
  • the embodiment of the present application provides a voice frequency division transmission method.
  • the first synchronization link and the second synchronization link respectively receive the first frequency band speech signal and the second frequency band speech signal after encoding with frame synchronization information.
  • FIG. 6 is a schematic structural diagram of the source provided in this embodiment. As shown in FIG. 6, the source End 50 includes:
  • the encoding module 51 is configured to encode the first frequency band speech signal and the second frequency band speech signal;
  • the pre-synchronization module 52 is used to mark the frame synchronization information into the encoded first frequency band speech signal and the encoded second frequency band speech signal;
  • the first sending module 53 is configured to send the encoded first frequency band speech signal with frame synchronization information and the encoded second frequency band speech signal with frame synchronization information through the first synchronization link and the second synchronization link, respectively Signal to the playback terminal.
  • the compression rate of the encoding method of the voice signal in the second frequency band is higher than the compression rate of the encoding method of the voice signal in the first frequency band;
  • the compression rate of the encoding method of the speech signal in the second frequency band is lower than the compression rate of the encoding method of the speech signal in the first frequency band.
  • the encoding module includes:
  • the high-frequency encoding module is used to encode high-frequency speech signals, and the encoding methods for high-frequency speech signals include CELT encoding or SBR encoding; or
  • the non-high frequency encoding module is used to encode non-high frequency speech signals.
  • the encoding methods for non-high frequency speech signals include SILK encoding, SBC encoding, AAC encoding or MP3 encoding.
  • the pre-synchronization module includes:
  • the data frame marking module is used to mark the encoded voice signal of any frequency band detected within a preset delay as one frame of data, and the frame synchronization information includes the start time and end time of a frame of data.
  • the source also includes:
  • the first parameter determination module the first sending module sends the encoded second frequency band voice signal with frame synchronization information to the playback terminal through the second synchronization link, and is used to send the playback terminal according to the link parameters supported by the playback terminal. Determine the link parameters used;
  • the link establishment module is used to establish a second synchronization link with the playback terminal according to the used link parameters.
  • the source also includes:
  • the second sending module, the first parameter determining module is used to send a second synchronization link request to the playback terminal before determining the link parameter to be used according to the link parameters supported by the playback terminal sent by the playback terminal;
  • the first receiving module is configured to receive a reply to the second synchronization link request
  • the first parameter determination module is further configured to determine whether to establish the second synchronization link according to the reply to the second synchronization link request.
  • the source also includes:
  • the first judgment module is used for judging whether the playback end supports voice frequency division transmission according to the control data stream before the encoding module encodes the voice signal in the second frequency band, and the control data stream is transmitted through the asynchronous link.
  • the source terminal also includes a UUID module, which is used to identify the voice frequency division transmission service through a custom universally unique identifier (UUID);
  • UUID universally unique identifier
  • the first judgment module includes:
  • the second judgment module, the control data stream includes the value of the custom UUID, and if the received custom UUID value of the player is equal to the preset UUID value, it is used to determine that the player supports voice frequency division transmission.
  • the source also includes:
  • the third sending module if the first judging module determines that the playback end supports voice frequency division transmission according to the control data stream, it is used to send an audio configuration parameter request to the playback end;
  • the second receiving module used to receive the audio configuration parameters corresponding to the voice signal in the second frequency band supported by the player.
  • the audio configuration parameters include codec parameters and bit rates, and the codec parameters include one or two of the coding mode and the decoding mode ;as well as
  • the second parameter determination module is configured to determine the audio configuration parameters to be used according to the audio configuration parameters corresponding to the second frequency band voice signal supported by the playback terminal;
  • the third sending module is also used to send the used audio configuration parameters to the playback terminal.
  • the source also includes:
  • the first transmission control module is configured to disconnect one or more second synchronization links according to the power condition or the link quality of the second synchronization link, and the number of the second synchronization links is less than or equal to that of the second frequency band voice signal Number of frequency bands; or
  • the first transmission control module is also used to establish one or more second synchronization links according to the power condition or the link quality of the second synchronization link, and the number of the second synchronization links is less than or equal to the frequency band of the second frequency band voice signal number.
  • the embodiment of the application provides a source end for executing the voice frequency division transmission method proposed in the foregoing embodiment, and transmits the encoded post-synchronization information with frame synchronization information through the first synchronization link and the second synchronization link.
  • the first frequency band speech signal and the encoded second frequency band speech signal with frame synchronization information are sent to the playback end, which solves the problem of sound quality degradation caused by transmission bandwidth limitation and the problem of affecting the audio being played when the sound quality is improved.
  • FIG. 7 is a schematic structural diagram of the playback terminal provided in this embodiment. As shown in FIG. End 60 includes:
  • the third receiving module 61 is configured to receive the encoded first frequency band speech signal with frame synchronization information and the encoded second frequency band speech signal with frame synchronization information through the first synchronization link and the second synchronization link, respectively signal;
  • the decoding module 62 is configured to decode the received coded first frequency band speech signal and the coded second frequency band speech signal;
  • the synchronization module 63 is used to synchronize the decoded first frequency band speech signal and the decoded second frequency band speech signal through frame synchronization information.
  • the playback terminal also includes:
  • One or more different digital-to-analog conversion modules for performing digital-to-analog conversion on the synchronized voice signal in the first frequency band and the voice signal in the second frequency band respectively;
  • One or more different amplifying modules for respectively amplifying the first frequency band voice signal and the second frequency band voice signal after digital-to-analog conversion
  • One or more different electro-acoustic conversion modules are used to perform electro-acoustic conversion on the amplified first frequency band voice signal and the second frequency band voice signal respectively.
  • the playback terminal also includes:
  • the fourth sending module, the third receiving module is used to send the control data stream to the source end before receiving the encoded second frequency band speech signal with frame synchronization information through the second synchronization link, so that the source end can control the data stream according to the Determine whether the playback terminal supports voice frequency division transmission.
  • the playback terminal also includes:
  • the fourth receiving module if the source terminal determines that the playback terminal supports voice frequency division transmission according to the control data stream, it is used to receive the second synchronization link request sent by the source terminal;
  • the fifth sending module is used to send a reply to the second synchronization link request
  • the fifth sending module is also used to send link parameters supported by the playback end to the source end.
  • the playback terminal also includes:
  • the fifth receiving module, the fourth receiving module is used to receive the audio configuration parameter request sent by the source end before receiving the second synchronization link request sent by the source end;
  • the sixth sending module is used to send the audio configuration parameters corresponding to the second frequency band voice signal supported by the playback terminal to the source terminal;
  • the fifth receiving module is also used to receive the audio configuration parameters used by the source end.
  • the parameter configuration module is used to configure the audio configuration parameters used.
  • the playback terminal also includes:
  • the second transmission control module is configured to disconnect one or more second synchronization links according to the power condition or the link quality of the second synchronization link;
  • the second transmission control module is further configured to request the source end to establish one or more second synchronization links according to the power condition or the link quality of the second synchronization link.
  • the embodiment of the present application provides a playback terminal that receives the encoded first frequency band speech signal and the second frequency band speech signal with frame synchronization information through the first synchronization link and the second synchronization link, respectively, which solves the problem of transmission
  • the embodiment of the present application may also provide a source end for executing the voice frequency division transmission method proposed in the embodiment.
  • the source end 70 includes a memory 71 and a processor 72;
  • the memory 71 is coupled with the processor 72;
  • the memory 71 is used to store program instructions
  • the processor 72 is configured to call the program instructions stored in the memory to make the source end execute the voice frequency division transmission method.
  • the source provided in the embodiment of the present application can execute the voice frequency division transmission method provided in any of the above-mentioned embodiments.
  • voice frequency division transmission method provided in any of the above-mentioned embodiments.
  • the embodiment of the present application may also provide a playback terminal for executing the voice frequency division transmission method proposed in the embodiment.
  • the playback terminal 80 includes a memory 81 and a processor 82;
  • the memory 81 is coupled with the processor 82;
  • the memory 81 is used to store program instructions
  • the processor 82 is configured to call the program instructions stored in the memory to make the playback terminal execute the voice frequency division transmission method.
  • the playback terminal provided in the embodiment of the present application can execute the voice frequency division transmission method provided in any of the above-mentioned embodiments.
  • voice frequency division transmission method provided in any of the above-mentioned embodiments.
  • the embodiment of the present application may also provide a computer-readable storage medium on which a computer program is stored.
  • the computer program is executed by the processor 72, the voice frequency division transmission method executed by the source is executed.
  • the computer-readable storage medium provided by the embodiment of the present application can execute the voice frequency division transmission method executed by the source provided in any of the above-mentioned embodiments.
  • voice frequency division transmission method executed by the source provided in any of the above-mentioned embodiments.
  • the embodiment of the present application may also provide a computer-readable storage medium on which a computer program is stored, and when the computer program is executed by the processor 82, the voice frequency division transmission method executed by the player is executed.
  • the computer-readable storage medium provided by the embodiment of the present application can execute the voice frequency division transmission method executed by the player provided in any of the above-mentioned embodiments.
  • voice frequency division transmission method executed by the player provided in any of the above-mentioned embodiments.
  • the embodiment of the present application may also provide a source-end circuit, which may be used to implement the voice frequency division transmission method proposed in the foregoing embodiment.
  • FIG. 10 is a schematic structural diagram of the source-end circuit provided in this embodiment.
  • the source circuit includes:
  • An encoder for encoding the first frequency band speech signal and the second frequency band speech signal
  • the source controller connected to the encoder, is used to send the encoded first frequency band speech signal with frame synchronization information and the encoded voice signal with frame synchronization information through the first synchronization link and the second synchronization link, respectively
  • the voice signal of the second frequency band is sent to the playback end circuit.
  • encoder 0 is the original encoder of the system.
  • the first synchronization link 0 of the system is used to transmit the encoded first frequency band speech signal.
  • the speech signal is separated into a first frequency band speech signal and a second frequency band speech signal.
  • the second frequency band speech signal is a high frequency speech signal
  • the first frequency band speech signal is a non-high frequency speech signal;
  • encoder 0 can Using alphabet encoder, there are two core encoding algorithms built in alphabet encoder: CELT encoding method and SILK encoding method.
  • the encoder uses CELT encoding to encode high-frequency speech signals; this embodiment can also use multiple encoders, this embodiment does not limit the number of encoders, the source end has encoder 0 and encoder 1 as For example, for example, encoder 0 can encode non-high-frequency speech signals of speech signals, while encoder 1 can encode high-frequency speech signals. Due to the different characteristics of high-frequency speech signals and non-high-frequency speech signals, Therefore, if different encoding methods are used for encoding separately, the audio quality can be improved by adding a small amount of bandwidth.
  • the encoder 0 encodes it and transmits it to the playback end through the second synchronization link 1.
  • the source controller is connected to the encoder 0 for passing through the first synchronization link 0 and the second synchronization link 1 transmit the encoded first frequency band speech signal and the second frequency band speech signal with frame synchronization information.
  • m>1 is an integer, which can correspond to m second synchronization links respectively.
  • the number of second synchronization links here is m
  • the bars are merely illustrative. In this embodiment, the number of second synchronization links less than m may also be used to transmit m sub-band voice signals, and this embodiment does not limit the number of second synchronization links.
  • the source-end circuit may further include a filter, which is connected to the encoder and is used to separate the voice signal in the first frequency band and the voice signal in the second frequency band.
  • a filter which is connected to the encoder and is used to separate the voice signal in the first frequency band and the voice signal in the second frequency band.
  • the voice signal is encoded by the filter 1 and then transmitted to the playback end through the corresponding second synchronization link 1.
  • the voice signal of the first frequency band can also be obtained It is implemented by a filter, or no filter is needed. Only encoder 0 is used to encode the voice signal of a specific frequency band.
  • the number of filters is not limited in this embodiment, or only A filter, especially if the encoding method used by some encoders only encodes the voice signal of a specific frequency band, the encoder may not need the corresponding filter, for example, the filter may not be required before the alphabet encoder.
  • the embodiment does not limit the types of filters. Low-pass filters, high-pass filters, band-pass filters, or other filters and any combination thereof can be used.
  • the filter can also be used to separate multiple sub-band voice signals in the first frequency band voice signal, and can also be used to separate multiple sub-band voice signals in the second frequency band voice signal.
  • the embodiment of the application provides a source-end circuit, which encodes a voice signal in a first frequency band and a voice signal in a second frequency band, and transmits the encoding with frame synchronization information through the first synchronization link and the second synchronization link.
  • the subsequent first frequency band voice signal and second frequency band voice signal solve the problem of sound quality degradation caused by transmission bandwidth limitation and the problem of affecting the audio being played when the sound quality is improved.
  • the embodiment of the present application may also provide a playback end circuit, which can be used to implement the voice frequency division transmission method proposed in the foregoing embodiment. Please refer to FIG. 10, which is the source end provided by this embodiment. Schematic diagram of the circuit and the structure of the playback end circuit.
  • the playback terminal circuit includes:
  • the player controller is configured to receive the encoded first frequency band speech signal with frame synchronization information and the encoded second frequency band speech signal with frame synchronization information through the first synchronization link and the second synchronization link, respectively; as well as
  • a plurality of decoders are connected with the controller of the playback end, and are used for decoding the received voice signal of the first frequency band and the voice signal of the second frequency band.
  • decoder 0 can be the original decoder of the system, and the first synchronization link 0 of the system is used to transmit the encoded first frequency band speech signal.
  • This embodiment can use one or more decoders. This embodiment does not limit the number of decoders.
  • one decoder can use different decoding methods to decode the encoded first frequency band speech signal and second frequency band speech signal with frame synchronization information. ; Take decoder 0 and decoder 1 on the playback side as an example. For example, decoder 0 can decode non-high-frequency voice signals, while decoder 1 can decode high-frequency voice signals.
  • the number of decoders may be equal to the number of encoders, or may not be equal to the number of encoders.
  • the decoded voice signal in the first frequency band and the voice signal in the second frequency band pass through a digital-to-analog converter and an amplifier, and then undergo electro-acoustic conversion through an electro-acoustic converter.
  • the playback end circuit further includes:
  • One or more different digital-to-analog converters respectively connected to the decoder, for performing digital-to-analog conversion on the decoded first frequency band speech signal and the decoded second frequency band speech signal respectively;
  • One or more different amplifiers respectively connected to one or more digital-to-analog converters for respectively amplifying the first-band voice signal after digital-to-analog conversion and the second-band voice signal after digital-to-analog conversion;
  • One or more different electro-acoustic converters are respectively connected to one or more amplifiers, and are used to perform electro-acoustic conversion on the amplified first frequency band speech signal and the amplified second frequency band speech signal respectively.
  • FIG. 11 is a schematic diagram of the structure of the source-end circuit and the playback-end circuit of an embodiment of this application.
  • the decoded voice signal in the first frequency band and the voice signal in the second frequency band can pass through a digital-to-analog converter.
  • the digital-to-analog converter 1 perform digital-to-analog conversion separately.
  • different digital-to-analog converters can be selected according to the characteristics of the voice signals in different frequency bands to further improve the sound quality.
  • the first frequency band speech signal and the second frequency band speech signal can be amplified in an amplifier, and then an electroacoustic converter is used for electroacoustic conversion; the first frequency band speech signal and the second frequency band speech signal after digital-to-analog conversion can also be separately Through amplifier 0 and amplifier 1, respectively, for the voice signals of different frequency bands, different amplifiers can be selected according to the characteristics of the voice signals of different frequency bands to further improve the sound quality.
  • the amplified voice signals of the first frequency band and the second frequency band can be An electroacoustic converter is used for electroacoustic conversion; the amplified voice signal of the first frequency band and the voice signal of the second frequency band can also be converted by multiple electroacoustic converters, which can be selected according to the characteristics of the voice signal of different frequency bands Electroacoustic converters with different frequency responses are used to perform electroacoustic conversion on the amplified first-band voice signal and the second-band voice signal to further improve the audio quality.
  • speaker 0 and speaker 1 are used for electroacoustic conversion.
  • each decoder corresponds to a digital-to-analog converter, an amplifier, and a speaker.
  • Figure 11 is only an exemplary illustration, and multiple decoders can also share one or more digital-analogs.
  • the converter, one or more amplifiers or one or more speakers are not limited in this embodiment.
  • the voice signal in the second frequency band contains m sub-band voice signals, it can correspond to m second synchronization links respectively.
  • the number of corresponding decoders may be less than or equal to m
  • the number of corresponding digital-to-analog converters, amplifiers, and electro-acoustic converters may be less than or equal to m
  • the specific value of m is not limited in this embodiment.
  • the embodiment of the present application provides a playback terminal circuit, which receives the encoded first frequency band speech signal and the second frequency band speech signal with frame synchronization information through the first synchronization link and the second synchronization link respectively, which solves the problem of The problem of sound quality degradation caused by transmission bandwidth limitation and the problem of affecting the audio being played when the sound quality is improved.
  • the processor may be an integrated circuit chip with signal processing capabilities.
  • the steps of the foregoing method embodiments can be completed by hardware integrated logic circuits in the processor or instructions in the form of software.
  • the above-mentioned processor may be a general-purpose processor, a digital signal processor (DSP), an application specific integrated circuit (ASIC), a ready-made programmable gate array (field programmable gate array, FPGA) or other Programming logic devices, discrete gates or transistor logic devices, discrete hardware components.
  • DSP digital signal processor
  • ASIC application specific integrated circuit
  • FPGA field programmable gate array
  • Programming logic devices discrete gates or transistor logic devices, discrete hardware components.
  • the general-purpose processor may be a microprocessor or the processor may also be any conventional processor or the like.
  • the steps of the method disclosed in the embodiments of the present application may be directly embodied as being executed and completed by a hardware decoding processor, or executed and completed by a combination of hardware and software modules in the decoding processor.
  • the software module can be located in a mature storage medium in the field such as random access memory, flash memory, read-only memory, programmable read-only memory, or electrically erasable programmable memory, registers.
  • the storage medium is located in the memory, and the processor reads the information in the memory and completes the steps of the above method in combination with its hardware.
  • the memory in the embodiment of the present application may be a volatile memory or a non-volatile memory, or may include both volatile and non-volatile memory.
  • the non-volatile memory can be read-only memory (ROM), programmable read-only memory (programmable rom, PROM), erasable programmable read-only memory (erasable PROM, EPROM), and electrically available Erase programmable read-only memory (electrically EPROM, EEPROM) or flash memory.
  • the volatile memory may be random access memory (RAM), which is used as an external cache.
  • RAM random access memory
  • static random access memory static random access memory
  • dynamic RAM dynamic random access memory
  • DRAM dynamic random access memory
  • SDRAM synchronous dynamic random access memory
  • double data rate synchronous dynamic random access memory double data rate SDRAM, DDR SDRAM
  • enhanced synchronous dynamic random access memory enhanced SDRAM, ESDRAM
  • serial link DRAM SLDRAM
  • direct rambus RAM direct rambus RAM
  • B corresponding to A means that B is associated with A, and B can be determined according to A.
  • determining B according to A does not mean that B is determined only according to A, and B can also be determined according to A and/or other information.
  • the disclosed system, device, and method may be implemented in other ways.
  • the device embodiments described above are only illustrative.
  • the division of the units is only a logical function division, and there may be other divisions in actual implementation, for example, multiple units or components can be combined or It can be integrated into another system, or some features can be ignored or not implemented.
  • the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, devices or units, and may be in electrical, mechanical or other forms.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or they may be distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions of the embodiments.
  • each unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit.
  • the function is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a computer readable storage medium.
  • the technical solution of this application essentially or the part that contributes to the existing technology or the part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium, including Several instructions are used to make a computer device (which may be a personal computer, a server, or a network device, etc.) execute all or part of the steps of the method described in each embodiment of the present application.
  • the aforementioned storage media include: U disk, mobile hard disk, read-only memory (read-only memory, ROM), random access memory (random access memory, RAM), magnetic disk or optical disk and other media that can store program code .

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Mathematical Physics (AREA)
  • Telephonic Communication Services (AREA)
  • Telephone Function (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

The present application relates to the field of communications, especially relates to a voice frequency division transmission method, a source terminal, a playback terminal, a source terminal circuit and a playback terminal circuit; wherein the voice frequency division transmission method comprises: the source terminal encodes a first frequency band voice signal and a second frequency band voice signal; the source terminal marks frame synchronization information into the encoded first frequency band voice signal and the encoded second frequency band voice signal; and the source terminal sends the encoded first frequency band voice signal with the frame synchronization information and the encoded second frequency band voice signal with the frame synchronization information to the playback terminal through a first synchronization link and a second synchronization link respectively. According to the present application, the encoded first frequency band voice signal with the frame synchronization information and the encoded second frequency band voice signal with the frame synchronization information are sent to the playback terminal through the first synchronization link and the second synchronization link respectively, thereby solving the problem of sound quality degradation caused by the transmission bandwidth limitation and the problem of affecting the audio being played when improving the sound quality.

Description

一种语音分频传输方法、源端、播放端、源端电路和播放端电路Voice frequency division transmission method, source end, playback end, source end circuit and playback end circuit 技术领域Technical field
本申请涉及通信领域,尤其涉及一种语音分频传输方法、源端、播放端、源端电路和播放端电路。This application relates to the field of communications, in particular to a voice frequency division transmission method, a source end, a playback end, a source end circuit, and a playback end circuit.
背景技术Background technique
在无线音频传输时,由于传输带宽限制,在兼顾续航和带宽的同时,通常会使用超高压缩率的有损编码进行传输,以牺牲音质为代价换取更长的续航时间。常用的有损编码,例如:AAC(Advanced Audio Coding,高级音频编码)编码方式、SBC(Sub-bandcoding,子带编码)编码方式或者MP3(Moving Picture Experts Group Audio Layer III,动态影像专家压缩标准音频层面3)编码方式等,通常会将高频部分直接舍弃,以节省带宽,但是这样接收到编码后的数据时,由于高频部分丢失,无法得到最初的语音信号,以至于不能为用户提供更好的音质体验,另外,使用一条同步链路传输语音信号时,一般需要对该同步链路进行操作以达到改善音质的目的,但是这种方式往往会对正在播放的音频产生影响,例如,用户可以听到正在播放的音频出现短时停顿现象。In wireless audio transmission, due to transmission bandwidth limitations, while taking into account battery life and bandwidth, lossy encoding with ultra-high compression rates is usually used for transmission, at the expense of sound quality in exchange for longer battery life. Commonly used lossy coding, such as: AAC (Advanced Audio Coding, advanced audio coding) coding method, SBC (Sub-band coding, sub-band coding) coding method or MP3 (Moving Picture Experts Group Audio Layer III, motion picture experts compress standard audio Level 3) Encoding methods, etc., usually the high frequency part is directly discarded to save bandwidth, but when the encoded data is received in this way, due to the loss of the high frequency part, the original voice signal cannot be obtained, so that it cannot provide users with more information. Good sound quality experience. In addition, when a synchronous link is used to transmit voice signals, it is generally necessary to operate the synchronous link to achieve the purpose of improving the sound quality, but this method often affects the audio being played, for example, the user You can hear a short pause in the audio being played.
发明内容Summary of the invention
针对现有技术中无线音频传输时由于传输带宽限制带来的音质下降的问题和改善音质时影响正在播放的音频的问题,本申请提供了 一种语音分频传输方法、源端、播放端、源端电路和播放端电路。Aiming at the problem of sound quality degradation caused by the limitation of transmission bandwidth during wireless audio transmission in the prior art and the problem of affecting the audio being played when the sound quality is improved, this application provides a voice frequency division transmission method, source end, playback end, Source end circuit and playback end circuit.
本申请的实施例的第一方面提供了一种语音分频传输方法,包括:The first aspect of the embodiments of the present application provides a voice frequency division transmission method, including:
源端对第一频段语音信号和第二频段语音信号进行编码;源端将帧同步信息标记到编码后的第一频段语音信号和编码后的第二频段语音信号中;源端通过第一同步链路和第二同步链路分别发送带有帧同步信息的编码后的第一频段语音信号和带有帧同步信息的编码后的第二频段语音信号给播放端。The source end encodes the first frequency band speech signal and the second frequency band speech signal; the source end marks the frame synchronization information into the encoded first frequency band speech signal and the encoded second frequency band speech signal; the source end passes the first synchronization The link and the second synchronization link respectively send the encoded first frequency band speech signal with frame synchronization information and the encoded second frequency band speech signal with frame synchronization information to the playback terminal.
另外,结合第一方面,在第一方面的一种实现方式中,包括:第二频段语音信号的编码方式的压缩率高于第一频段语音信号的编码方式的压缩率;或者第二频段语音信号的编码方式的压缩率低于第一频段语音信号的编码方式的压缩率。In addition, with reference to the first aspect, an implementation of the first aspect includes: the compression rate of the encoding method of the second frequency band speech signal is higher than the compression rate of the encoding method of the first frequency band speech signal; or the second frequency band speech The compression rate of the encoding method of the signal is lower than the compression rate of the encoding method of the voice signal in the first frequency band.
另外,结合第一方面及其上述实现方式,在第一方面的另一种实现方式中,源端对第二频段语音信号进行编码包括:In addition, in combination with the first aspect and the foregoing implementation manners, in another implementation manner of the first aspect, the source end encoding the second frequency band voice signal includes:
源端对高频语音信号进行编码,源端对高频语音信号的编码方式包括CELT编码方式或SBR编码方式;或者The source end encodes the high-frequency speech signal, and the source end encodes the high-frequency speech signal including CELT encoding or SBR encoding; or
源端对非高频语音信号进行编码,源端对非高频语音信号的编码方式包括SILK编码方式、SBC编码方式、AAC编码方式或MP3编码方式。The source end encodes the non-high frequency speech signal, and the source end encodes the non-high frequency speech signal including SILK encoding, SBC encoding, AAC encoding or MP3 encoding.
另外,结合第一方面及其上述实现方式,在第一方面的另一种实现方式中,源端将帧同步信息标记到编码后的第一频段语音信号和编码后的第二频段语音信号中包括:In addition, in combination with the first aspect and the foregoing implementation manners, in another implementation manner of the first aspect, the source end marks the frame synchronization information into the encoded first frequency band speech signal and the encoded second frequency band speech signal include:
源端标记预设延时内检测到的编码后的任一频段的语音信号为一帧数据,帧同步信息包括一帧数据的开始时刻和结束时刻。The source marks the encoded voice signal in any frequency band detected within a preset delay as a frame of data, and the frame synchronization information includes the start time and end time of a frame of data.
另外,结合第一方面及其上述实现方式,在第一方面的另一种实现方式中,源端通过第二同步链路发送带有帧同步信息的编码后的第二频段语音信号给播放端之前,包括:In addition, in combination with the first aspect and the foregoing implementation manners, in another implementation manner of the first aspect, the source end sends the encoded second frequency band voice signal with frame synchronization information to the playback end through the second synchronization link Before, including:
根据播放端发送的播放端支持的链路参数确定使用的链路参数;Determine the link parameters used according to the link parameters supported by the player sent by the player;
源端根据使用的链路参数建立与播放端之间的第二同步链路。The source end establishes a second synchronization link with the playback end according to the used link parameters.
另外,结合第一方面及其上述实现方式,在第一方面的另一种实 现方式中,根据播放端发送的播放端支持的链路参数确定使用的链路参数之前,包括:In addition, in combination with the first aspect and the foregoing implementation manners, in another implementation manner of the first aspect, before determining the link parameters used according to the link parameters supported by the playback end sent by the playback end, it includes:
源端发送第二同步链路请求给播放端;The source end sends a second synchronization link request to the playback end;
源端接收第二同步链路请求的回复;The source receives the reply to the second synchronization link request;
源端根据第二同步链路请求的回复确定是否建立第二同步链路。The source terminal determines whether to establish the second synchronization link according to the response to the second synchronization link request.
另外,结合第一方面及其上述实现方式,在第一方面的另一种实现方式中,源端对第二频段语音信号进行编码之前,还包括:In addition, with reference to the first aspect and the foregoing implementation manners, in another implementation manner of the first aspect, before the source end encodes the voice signal in the second frequency band, the method further includes:
源端根据控制数据流判断播放端是否支持语音分频传输,控制数据流通过异步链路传输。The source terminal judges whether the playback terminal supports voice frequency division transmission according to the control data stream, and the control data stream is transmitted through the asynchronous link.
另外,结合第一方面及其上述实现方式,在第一方面的另一种实现方式中,若源端通过自定义通用唯一识别码(UUID)标识语音分频传输服务,源端根据控制数据流判断播放端是否支持语音分频传输包括:In addition, in combination with the first aspect and the foregoing implementation manners, in another implementation manner of the first aspect, if the source uses a custom universally unique identifier (UUID) to identify the voice frequency division transmission service, the source controls the data flow Judging whether the playback end supports voice frequency division transmission includes:
控制数据流包括自定义UUID的值,若源端接收到的播放端的自定义UUID的值等于预设UUID值,源端确定播放端支持语音分频传输。The control data stream includes the value of the custom UUID. If the custom UUID value of the player received by the source is equal to the preset UUID value, the source determines that the player supports voice frequency division transmission.
另外,结合第一方面及其上述实现方式,在第一方面的另一种实现方式中,若源端根据控制数据流确定播放端支持语音分频传输,源端发送音频配置参数请求给播放端;In addition, in combination with the first aspect and the foregoing implementation manners, in another implementation manner of the first aspect, if the source terminal determines that the playback terminal supports voice frequency division transmission according to the control data stream, the source terminal sends an audio configuration parameter request to the playback terminal ;
源端接收播放端支持的第二频段语音信号对应的音频配置参数,音频配置参数包括编解码参数和码率,编解码参数包括编码方式和解码方式其中的一种或两种;The source terminal receives the audio configuration parameters corresponding to the voice signal in the second frequency band supported by the playback terminal. The audio configuration parameters include coding and decoding parameters and bit rates, and the coding and decoding parameters include one or both of coding and decoding methods;
源端根据播放端支持的第二频段语音信号对应的音频配置参数确定使用的音频配置参数并将使用的音频配置参数发送给播放端。The source terminal determines the used audio configuration parameters according to the audio configuration parameters corresponding to the second frequency band voice signal supported by the playback terminal and sends the used audio configuration parameters to the playback terminal.
另外,结合第一方面及其上述实现方式,在第一方面的另一种实现方式中,第二同步链路的条数小于或等于第二频段语音信号的频段数;In addition, with reference to the first aspect and the foregoing implementation manners, in another implementation manner of the first aspect, the number of second synchronization links is less than or equal to the number of frequency bands of the second frequency band voice signal;
源端根据电量情况或者第二同步链路的链路质量断开一条或多条第二同步链路;或者The source disconnects one or more second synchronization links according to the power situation or the link quality of the second synchronization link; or
源端根据电量情况或者第二同步链路的链路质量建立一条或多条第二同步链路。The source establishes one or more second synchronization links according to the power condition or the link quality of the second synchronization link.
本申请的实施例的第二方面提供了一种语音分频传输方法,包括:The second aspect of the embodiments of the present application provides a voice frequency division transmission method, including:
播放端通过第一同步链路和第二同步链路分别接收带有帧同步信息的编码后的第一频段语音信号和带有帧同步信息的编码后的第二频段语音信号;The playback terminal receives the encoded first frequency band speech signal with frame synchronization information and the encoded second frequency band speech signal with frame synchronization information through the first synchronization link and the second synchronization link, respectively;
播放端对接收到的第一频段语音信号和第二频段语音信号进行解码;The playback terminal decodes the received voice signal in the first frequency band and the voice signal in the second frequency band;
播放端通过帧同步信息对解码后的第一频段语音信号和解码后的第二频段语音信号进行同步。The player uses the frame synchronization information to synchronize the decoded first frequency band speech signal and the decoded second frequency band speech signal.
另外,结合第二方面,在第二方面的一种实现方式中,播放端通过帧同步信息对解码后的第一频段语音信号和解码后的第二频段语音信号进行同步之后,包括:In addition, with reference to the second aspect, in an implementation of the second aspect, after the player uses frame synchronization information to synchronize the decoded first frequency band speech signal and the decoded second frequency band speech signal, the method includes:
播放端对同步后的第一频段语音信号和同步后的第二频段语音信号通过一个或多个不同的数模转换器分别进行数模转换;The playback terminal performs digital-to-analog conversion on the synchronized voice signal in the first frequency band and the synchronized voice signal in the second frequency band through one or more different digital-to-analog converters;
播放端对数模转换后的第一频段语音信号和数模转换后的第二频段语音信号通过一个或多个不同的放大器分别进行放大;The playback terminal amplifies the first frequency band voice signal after digital-to-analog conversion and the second frequency band voice signal after digital-to-analog conversion through one or more different amplifiers respectively;
播放端对放大后的第一频段语音信号和放大后的第二频段语音信号通过一个或多个不同的电声转换器分别进行电声转换。The playback terminal performs electroacoustic conversion on the amplified first frequency band speech signal and the amplified second frequency band speech signal through one or more different electroacoustic converters.
另外,结合第二方面及其上述实现方式,在第二方面的另一种实现方式中,播放端通过第二同步链路接收带有帧同步信息的编码后的第二频段语音信号之前包括:In addition, in combination with the second aspect and the foregoing implementation manners of the second aspect, in another implementation manner of the second aspect, before the player receives the encoded second frequency band voice signal with frame synchronization information through the second synchronization link, the following includes:
播放端发送控制数据流给源端,以使源端根据控制数据流判断播放端是否支持语音分频传输。The playback end sends the control data stream to the source end, so that the source end determines whether the playback end supports voice frequency division transmission according to the control data stream.
另外,结合第二方面及其上述实现方式,在第二方面的另一种实现方式中,若源端根据控制数据流确定播放端支持语音分频传输,播放端接收源端发送的第二同步链路请求并发送第二同步链路请求的回复;In addition, in combination with the second aspect and the foregoing implementation manners, in another implementation manner of the second aspect, if the source terminal determines that the playback terminal supports voice frequency division transmission according to the control data stream, the playback terminal receives the second synchronization sent by the source terminal. Link request and send a reply to the second synchronization link request;
若播放端支持语音分频传输,播放端发送播放端支持的链路参数 给源端。If the playback end supports voice frequency division transmission, the playback end sends the link parameters supported by the playback end to the source end.
另外,结合第二方面及其上述实现方式,在第二方面的另一种实现方式中,播放端接收源端发送的第二同步链路请求之前,包括:In addition, in combination with the second aspect and the foregoing implementation manners, in another implementation manner of the second aspect, before the playback end receives the second synchronization link request sent by the source end, the method includes:
播放端接收源端发送的音频配置参数请求;The player receives the audio configuration parameter request sent by the source;
播放端发送播放端支持的第二频段语音信号对应的音频配置参数给源端;The playback terminal sends the audio configuration parameters corresponding to the second frequency band voice signal supported by the playback terminal to the source terminal;
播放端接收源端发送的使用的音频配置参数并配置使用的音频配置参数。The player receives the used audio configuration parameters sent by the source and configures the used audio configuration parameters.
另外,结合第二方面及其上述实现方式,在第二方面的另一种实现方式中,播放端根据电量情况或者第二同步链路的链路质量断开一条或多条第二同步链路;或者In addition, in combination with the second aspect and the foregoing implementation manners of the second aspect, in another implementation manner of the second aspect, the player disconnects one or more second synchronization links according to the power condition or the link quality of the second synchronization link ;or
播放端根据电量情况或者第二同步链路的链路质量请求源端建立一条或多条第二同步链路。The playback end requests the source end to establish one or more second synchronization links according to the power condition or the link quality of the second synchronization link.
本申请的实施例的第三方面提供了一种源端,用于语音分频传输,源端包括:The third aspect of the embodiments of the present application provides a source terminal for voice frequency division transmission, and the source terminal includes:
编码模块,用于对第一频段语音信号和第二频段语音信号进行编码;Encoding module, used to encode the first frequency band speech signal and the second frequency band speech signal;
预同步模块,用于将帧同步信息标记到编码后的第一频段语音信号和编码后的第二频段语音信号中;以及The pre-synchronization module is used to mark the frame synchronization information into the encoded first frequency band speech signal and the encoded second frequency band speech signal; and
第一发送模块,用于通过第一同步链路和第二同步链路分别发送带有帧同步信息的编码后的第一频段语音信号和带有帧同步信息的编码后的第二频段语音信号给播放端。The first sending module is configured to send the encoded first frequency band speech signal with frame synchronization information and the encoded second frequency band speech signal with frame synchronization information through the first synchronization link and the second synchronization link, respectively To the player.
另外,结合第三方面,在第三方面的一种实现方式中,第二频段语音信号的编码方式的压缩率高于第一频段语音信号的编码方式的压缩率;或者In addition, with reference to the third aspect, in an implementation of the third aspect, the compression rate of the encoding method of the second frequency band speech signal is higher than the compression rate of the encoding method of the first frequency band speech signal; or
第二频段语音信号的编码方式的压缩率低于第一频段语音信号的编码方式的压缩率。The compression rate of the encoding method of the speech signal in the second frequency band is lower than the compression rate of the encoding method of the speech signal in the first frequency band.
另外,结合第三方面及其上述实现方式,在第三方面的另一种实现方式中,编码模块包括:In addition, in combination with the third aspect and the foregoing implementation manners, in another implementation manner of the third aspect, the encoding module includes:
高频编码模块,用于对高频语音信号进行编码,对高频语音信号的编码方式包括CELT编码方式或SBR编码方式;或者The high-frequency encoding module is used to encode high-frequency speech signals, and the encoding methods for high-frequency speech signals include CELT encoding or SBR encoding; or
非高频编码模块,用于对非高频语音信号进行编码,对非高频语音信号的编码方式包括SILK编码方式、SBC编码方式、AAC编码方式或MP3编码方式。The non-high frequency encoding module is used to encode non-high frequency speech signals. The encoding methods for non-high frequency speech signals include SILK encoding, SBC encoding, AAC encoding or MP3 encoding.
另外,结合第三方面及其上述实现方式,在第三方面的另一种实现方式中,预同步模块包括:In addition, in combination with the third aspect and the foregoing implementation manners, in another implementation manner of the third aspect, the pre-synchronization module includes:
数据帧标记模块,用于标记预设延时内检测到的编码后的任一频段的语音信号为一帧数据,帧同步信息包括一帧数据的开始时刻和结束时刻。The data frame marking module is used to mark the encoded voice signal of any frequency band detected within a preset delay as one frame of data, and the frame synchronization information includes the start time and end time of a frame of data.
另外,结合第三方面及其上述实现方式,在第三方面的另一种实现方式中,源端还包括:In addition, with reference to the third aspect and the foregoing implementation manners, in another implementation manner of the third aspect, the source further includes:
第一参数确定模块,第一发送模块通过第二同步链路发送带有帧同步信息的编码后的第二频段语音信号给播放端之前,用于根据播放端发送的播放端支持的链路参数确定使用的链路参数;The first parameter determination module, the first sending module sends the encoded second frequency band voice signal with frame synchronization information to the playback terminal through the second synchronization link, and is used to send the playback terminal according to the link parameters supported by the playback terminal. Determine the link parameters used;
链路建立模块,用于根据使用的链路参数建立与播放端之间的第二同步链路。The link establishment module is used to establish a second synchronization link with the playback terminal according to the used link parameters.
另外,结合第三方面及其上述实现方式,在第三方面的另一种实现方式中,源端还包括:In addition, with reference to the third aspect and the foregoing implementation manners, in another implementation manner of the third aspect, the source further includes:
第二发送模块,第一参数确定模块根据播放端发送的播放端支持的链路参数确定使用的链路参数之前,用于发送第二同步链路请求给播放端;以及The second sending module, the first parameter determining module is used to send a second synchronization link request to the playback terminal before determining the link parameter to be used according to the link parameters supported by the playback terminal sent by the playback terminal; and
第一接收模块,用于接收第二同步链路请求的回复;The first receiving module is configured to receive a reply to the second synchronization link request;
第一参数确定模块还用于根据第二同步链路请求的回复确定是否建立第二同步链路。The first parameter determination module is further configured to determine whether to establish the second synchronization link according to the reply to the second synchronization link request.
另外,结合第三方面及其上述实现方式,在第三方面的另一种实现方式中,源端还包括:In addition, with reference to the third aspect and the foregoing implementation manners, in another implementation manner of the third aspect, the source further includes:
第一判断模块,编码模块对第二频段语音信号进行编码之前,用于根据控制数据流判断播放端是否支持语音分频传输,控制数据流通 过异步链路传输。The first judging module, before the encoding module encodes the voice signal in the second frequency band, is used to judge whether the playback end supports voice frequency division transmission according to the control data stream, and control the data flow through asynchronous link transmission.
另外,结合第三方面及其上述实现方式,在第三方面的另一种实现方式中,源端还包括UUID模块,用于通过自定义通用唯一识别码(UUID)标识语音分频传输服务;In addition, in combination with the third aspect and the foregoing implementation manners, in another implementation manner of the third aspect, the source also includes a UUID module for identifying the voice frequency division transmission service through a custom universally unique identification code (UUID);
第一判断模块包括:The first judgment module includes:
第二判断模块,控制数据流包括自定义UUID的值,若接收到的播放端的自定义UUID的值等于预设UUID值,用于确定播放端支持语音分频传输。The second judgment module, the control data stream includes the value of the custom UUID, and if the received custom UUID value of the player is equal to the preset UUID value, it is used to determine that the player supports voice frequency division transmission.
另外,结合第三方面及其上述实现方式,在第三方面的另一种实现方式中,源端还包括:In addition, with reference to the third aspect and the foregoing implementation manners, in another implementation manner of the third aspect, the source further includes:
第三发送模块:若第一判断模块根据控制数据流确定播放端支持语音分频传输,用于发送音频配置参数请求给播放端;The third sending module: if the first judging module determines that the playback end supports voice frequency division transmission according to the control data stream, it is used to send an audio configuration parameter request to the playback end;
第二接收模块:用于接收播放端支持的第二频段语音信号对应的音频配置参数,音频配置参数包括编解码参数和码率,编解码参数包括编码方式和解码方式中的一种或两种;以及The second receiving module: used to receive the audio configuration parameters corresponding to the voice signal in the second frequency band supported by the player. The audio configuration parameters include codec parameters and bit rates, and the codec parameters include one or two of the coding mode and the decoding mode ;as well as
第二参数确定模块,用于根据播放端支持的第二频段语音信号对应的音频配置参数确定使用的音频配置参数;The second parameter determination module is configured to determine the audio configuration parameters to be used according to the audio configuration parameters corresponding to the second frequency band voice signal supported by the playback terminal;
第三发送模块还用于将使用的音频配置参数发送给播放端。The third sending module is also used to send the used audio configuration parameters to the playback terminal.
另外,结合第三方面及其上述实现方式,在第三方面的另一种实现方式中,源端还包括:In addition, with reference to the third aspect and the foregoing implementation manners, in another implementation manner of the third aspect, the source further includes:
第一传输控制模块,用于根据电量情况或者第二同步链路的链路质量断开一条或多条第二同步链路,第二同步链路的条数小于或等于第二频段语音信号的频段数;或者The first transmission control module is configured to disconnect one or more second synchronization links according to the power condition or the link quality of the second synchronization link, and the number of the second synchronization links is less than or equal to that of the second frequency band voice signal Number of frequency bands; or
第一传输控制模块还用于根据电量情况或者第二同步链路的链路质量建立一条或多条第二同步链路,第二同步链路的条数小于或等于第二频段语音信号的频段数。The first transmission control module is also used to establish one or more second synchronization links according to the power condition or the link quality of the second synchronization link, and the number of the second synchronization links is less than or equal to the frequency band of the second frequency band voice signal number.
本申请的实施例的第四方面提供了一种播放端,用于语音分频传输,播放端包括:The fourth aspect of the embodiments of the present application provides a playback terminal for voice frequency division transmission. The playback terminal includes:
第三接收模块,用于通过第一同步链路和第二同步链路分别接收 带有帧同步信息的编码后的第一频段语音信号和带有帧同步信息的编码后的第二频段语音信号;The third receiving module is used to receive the encoded first frequency band speech signal with frame synchronization information and the encoded second frequency band speech signal with frame synchronization information through the first synchronization link and the second synchronization link, respectively ;
解码模块,用于对接收到的编码后的第一频段语音信号和编码后的第二频段语音信号进行解码;The decoding module is used to decode the received coded first frequency band speech signal and the coded second frequency band speech signal;
同步模块,用于通过帧同步信息对解码后的第一频段语音信号和解码后的第二频段语音信号进行同步。The synchronization module is used to synchronize the decoded first frequency band speech signal and the decoded second frequency band speech signal through frame synchronization information.
另外,结合第四方面,在第四方面的一种实现方式中,播放端还包括:In addition, with reference to the fourth aspect, in an implementation manner of the fourth aspect, the playback terminal further includes:
一个或多个不同的数模转换模块,用于对同步后的第一频段语音信号和第二频段语音信号分别进行数模转换;One or more different digital-to-analog conversion modules for performing digital-to-analog conversion on the synchronized voice signal in the first frequency band and the voice signal in the second frequency band respectively;
一个或多个不同的放大模块,用于对数模转换后的第一频段语音信号和第二频段语音信号分别进行放大;以及One or more different amplifying modules for respectively amplifying the first frequency band voice signal and the second frequency band voice signal after digital-to-analog conversion; and
一个或多个不同的电声转换模块,用于对放大后的第一频段语音信号和第二频段语音信号分别进行电声转换。One or more different electro-acoustic conversion modules are used to perform electro-acoustic conversion on the amplified first frequency band voice signal and the second frequency band voice signal respectively.
另外,结合第四方面及其上述实现方式,在第四方面的另一种实现方式中,播放端还包括:In addition, in combination with the fourth aspect and the foregoing implementation manners, in another implementation manner of the fourth aspect, the playback terminal further includes:
第四发送模块,第三接收模块通过第二同步链路接收带有帧同步信息的编码后的第二频段语音信号之前,用于发送控制数据流给源端,以使源端根据控制数据流判断播放端是否支持语音分频传输。The fourth sending module, the third receiving module is used to send the control data stream to the source end before receiving the encoded second frequency band speech signal with frame synchronization information through the second synchronization link, so that the source end can control the data stream according to the Determine whether the playback terminal supports voice frequency division transmission.
另外,结合第四方面及其上述实现方式,在第四方面的另一种实现方式中,播放端还包括:In addition, in combination with the fourth aspect and the foregoing implementation manners, in another implementation manner of the fourth aspect, the playback terminal further includes:
第四接收模块,若源端根据控制数据流确定播放端支持语音分频传输,用于接收源端发送的第二同步链路请求;以及The fourth receiving module, if the source terminal determines that the playback terminal supports voice frequency division transmission according to the control data stream, it is used to receive the second synchronization link request sent by the source terminal; and
第五发送模块,用于发送第二同步链路请求的回复;The fifth sending module is used to send a reply to the second synchronization link request;
若播放端支持语音分频传输,第五发送模块还用于发送播放端支持的链路参数给源端。If the playback end supports voice frequency division transmission, the fifth sending module is also used to send link parameters supported by the playback end to the source end.
另外,结合第四方面及其上述实现方式,在第四方面的另一种实现方式中,播放端还包括:In addition, in combination with the fourth aspect and the foregoing implementation manners, in another implementation manner of the fourth aspect, the playback terminal further includes:
第五接收模块,第四接收模块接收源端发送的第二同步链路请求 之前,用于接收源端发送的音频配置参数请求;The fifth receiving module, the fourth receiving module is used to receive the audio configuration parameter request sent by the source end before receiving the second synchronization link request sent by the source end;
第六发送模块,用于发送播放端支持的第二频段语音信号对应的音频配置参数给源端;The sixth sending module is used to send the audio configuration parameters corresponding to the second frequency band voice signal supported by the playback terminal to the source terminal;
第五接收模块还用于接收源端发送的使用的音频配置参数;以及The fifth receiving module is also used to receive the audio configuration parameters used by the source end; and
参数配置模块,用于配置使用的音频配置参数。The parameter configuration module is used to configure the audio configuration parameters used.
另外,结合第四方面及其上述实现方式,在第四方面的另一种实现方式中,播放端还包括:In addition, in combination with the fourth aspect and the foregoing implementation manners, in another implementation manner of the fourth aspect, the playback terminal further includes:
第二传输控制模块,用于根据电量情况或者第二同步链路的链路质量断开一条或多条第二同步链路;或者The second transmission control module is configured to disconnect one or more second synchronization links according to the power condition or the link quality of the second synchronization link; or
第二传输控制模块还用于根据电量情况或者第二同步链路的链路质量请求源端建立一条或多条第二同步链路。The second transmission control module is further configured to request the source end to establish one or more second synchronization links according to the power condition or the link quality of the second synchronization link.
本申请的实施例的第五方面提供了一种源端,用于语音分频传输,包括:存储器和处理器;The fifth aspect of the embodiments of the present application provides a source terminal for voice frequency division transmission, including: a memory and a processor;
存储器与处理器耦合;The memory is coupled to the processor;
存储器,用于存储程序指令;Memory, used to store program instructions;
处理器,用于调用存储器存储的程序指令,使得源端执行上述第一方面所述的语音分频传输方法。The processor is configured to call the program instructions stored in the memory to enable the source to execute the voice frequency division transmission method described in the first aspect.
本申请的实施例的第六方面提供了一种播放端,用于语音分频传输,包括:存储器和处理器;The sixth aspect of the embodiments of the present application provides a playback terminal for voice frequency division transmission, including: a memory and a processor;
存储器与处理器耦合;The memory is coupled to the processor;
存储器,用于存储程序指令;Memory, used to store program instructions;
处理器,用于调用存储器存储的程序指令,使得播放端执行上述第二方面所述的语音分频传输方法。The processor is configured to call the program instructions stored in the memory to make the playback terminal execute the voice frequency division transmission method described in the second aspect.
本申请的实施例的第七方面提供了一种计算机可读存储介质,包括:其上存储有计算机程序,计算机程序被处理器执行时实现上述第一方面所述的语音分频传输方法。The seventh aspect of the embodiments of the present application provides a computer-readable storage medium, including a computer program stored thereon, and the computer program is executed by a processor to implement the voice frequency division transmission method described in the first aspect.
本申请的实施例的第八方面提供了一种计算机可读存储介质,包括:其上存储有计算机程序,其特征在于,计算机程序被处理器执行时实现上述第二方面所述的语音分频传输方法。An eighth aspect of the embodiments of the present application provides a computer-readable storage medium, including: a computer program stored thereon, wherein the computer program is executed by a processor to implement the voice frequency division described in the second aspect. Transmission method.
本申请的实施例的第九方面提供了一种源端电路,包括:A ninth aspect of the embodiments of the present application provides a source circuit, including:
编码器,用于对第一频段语音信号和第二频段语音信号进行编码;以及An encoder for encoding the first frequency band speech signal and the second frequency band speech signal; and
源端控制器,与编码器连接,用于通过第一同步链路和第二同步链路分别发送带有帧同步信息的编码后的第一频段语音信号和带有帧同步信息的编码后的第二频段语音信号给播放端电路。The source controller, connected to the encoder, is used to send the encoded first frequency band speech signal with frame synchronization information and the encoded voice signal with frame synchronization information through the first synchronization link and the second synchronization link, respectively The voice signal of the second frequency band is sent to the playback end circuit.
另外,结合第九方面,在第九方面的一种实现方式中,源端电路还包括滤波器,滤波器与编码器连接,用于分离第一频段语音信号和第二频段语音信号。In addition, with reference to the ninth aspect, in an implementation of the ninth aspect, the source circuit further includes a filter, which is connected to the encoder and is used to separate the voice signal in the first frequency band from the voice signal in the second frequency band.
本申请的实施例的第十方面提供了一种播放端电路,包括:A tenth aspect of the embodiments of the present application provides a playback end circuit, including:
播放端控制器,用于通过第一同步链路和第二同步链路分别接收带有帧同步信息的编码后的第一频段语音信号和带有帧同步信息的编码后的第二频段语音信号;以及The player controller is used to receive the encoded first frequency band speech signal with frame synchronization information and the encoded second frequency band speech signal with frame synchronization information through the first synchronization link and the second synchronization link, respectively ;as well as
解码器,与播放端控制器连接,用于对接收到的第一频段语音信号和第二频段语音信号进行解码。The decoder is connected to the controller of the playback terminal and is used to decode the received voice signal in the first frequency band and the voice signal in the second frequency band.
另外,结合第十方面,在第十方面的一种实现方式中,播放端电路还包括:In addition, with reference to the tenth aspect, in an implementation manner of the tenth aspect, the playback end circuit further includes:
一个或多个不同的数模转换器,分别与解码器连接,用于对解码后的第一频段语音信号和解码后的第二频段语音信号分别进行数模转换;One or more different digital-to-analog converters, respectively connected to the decoder, for performing digital-to-analog conversion on the decoded first frequency band speech signal and the decoded second frequency band speech signal respectively;
一个或多个不同的放大器,分别与一个或多个数模转换器连接,用于对数模转换后的第一频段语音信号和数模转换后的第二频段语音信号分别进行放大;以及One or more different amplifiers respectively connected to one or more digital-to-analog converters for respectively amplifying the first-band voice signal after digital-to-analog conversion and the second-band voice signal after digital-to-analog conversion; and
一个或多个不同的电声转换器,分别与一个或多个放大器连接,用于对放大后的第一频段语音信号和放大后的第二频段语音信号分别进行电声转换。One or more different electro-acoustic converters are respectively connected to one or more amplifiers, and are used to perform electro-acoustic conversion on the amplified first frequency band speech signal and the amplified second frequency band speech signal respectively.
与现有技术相比,本申请实施例的有益效果在于:本申请实施例提供了一种语音分频传输方法、源端、播放端、源端电路和播放端电路,通过第一同步链路和第二同步链路分别发送带有帧同步信息的编 码后的第一频段语音信号和带有帧同步信息的编码后的第二频段语音信号给播放端,解决了由于传输带宽限制带来的音质下降的问题和改善音质时影响正在播放的音频的问题。Compared with the prior art, the advantageous effect of the embodiments of the present application is that: the embodiments of the present application provide a voice frequency division transmission method, a source end, a playback end, a source end circuit, and a playback end circuit, through a first synchronization link And the second synchronization link respectively send the encoded first frequency band speech signal with frame synchronization information and the encoded second frequency band speech signal with frame synchronization information to the playback end, which solves the problem caused by the limitation of transmission bandwidth The problem of sound quality degradation and the problem of affecting the audio being played when the sound quality is improved.
附图说明Description of the drawings
为了更清楚地说明本申请实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域技术人员来讲,在不付出创造性劳动性的前提下,还可以根据这些附图获得其他的附图。In order to more clearly describe the technical solutions in the embodiments of the present application or the prior art, the following will briefly introduce the drawings that need to be used in the description of the embodiments or the prior art. Obviously, the drawings in the following description are only These are some embodiments of the present application. For those skilled in the art, other drawings can be obtained based on these drawings without creative labor.
图1为本申请实施例的一种语音分频传输方法的流程图;FIG. 1 is a flowchart of a voice frequency division transmission method according to an embodiment of the application;
图2为本申请实施例的源端标记预设延时内检测到的编码后的任一频段的语音信号为一帧数据的流程图;FIG. 2 is a flowchart of an embodiment of the application in which the source marks a voice signal in any frequency band after encoding detected within a preset delay as one frame of data;
图3为本申请实施例的源端获取帧同步信息和播放端进行同步的示意图;FIG. 3 is a schematic diagram of a source terminal acquiring frame synchronization information and a playback terminal performing synchronization according to an embodiment of the application;
图4为本申请实施例的UUID的设置方式示意图。Fig. 4 is a schematic diagram of a UUID setting method according to an embodiment of the application.
图5为本申请实施例的另一种语音分频传输方法的流程图;FIG. 5 is a flowchart of another voice frequency division transmission method according to an embodiment of the application;
图6为本申请实施例的源端的结构示意图;FIG. 6 is a schematic diagram of the structure of the source end of an embodiment of the application;
图7为本申请实施例的播放端的结构示意图;FIG. 7 is a schematic diagram of the structure of the playback terminal according to an embodiment of the application;
图8为本申请实施例的另一种源端的结构示意图;FIG. 8 is a schematic diagram of another source structure according to an embodiment of the application;
图9为本申请实施例的另一种播放端的结构示意图;FIG. 9 is a schematic structural diagram of another playback terminal according to an embodiment of the application;
图10为本申请实施例的源端电路和播放端电路的结构示意图;10 is a schematic diagram of the structure of the source circuit and the playback circuit of an embodiment of the application;
图11为本申请实施例的另一种源端电路和播放端电路的结构示意图。FIG. 11 is a schematic structural diagram of another source-end circuit and a playback-end circuit according to an embodiment of the application.
具体实施方式Detailed ways
为使本申请的目的、技术方案和优点更加清楚,下面将结合附图对本申请的部分实施例采用举例的方式进行详细的阐述。然而,本领域的普通技术人员可以理解,在各例子中,为了使读者更好地理解本申请而提出了许多技术细节。但是,即使没有这些技术细节和基于以下各实施例的种种变化和修改,也可以实现本申请所要求保护的技术方案。In order to make the purpose, technical solutions, and advantages of the present application clearer, some embodiments of the present application will be described in detail below with reference to the accompanying drawings. However, a person of ordinary skill in the art can understand that, in each example, many technical details are proposed for the reader to better understand the application. However, even without these technical details and various changes and modifications based on the following embodiments, the technical solution claimed in this application can be realized.
本申请实施例提供了一种语音分频传输方法,该方法可以应用于各种音频电子设备中。本实施例从源端的角度对本申请实施例提供的一种语音分频传输方法进行说明。源端为语音信号的发射端,播放端为语音信号的接收端,一般来说,源端可以是储存语音信号的手机、电视、电脑、平板或mp3等电子设备或该电子设备中的一个或多个芯片,播放端可以是播放该语音信号的扬声器或耳机等电子设备或该电子设备中的一个或多个芯片。本实施例中的源端和播放端可以支持各种传输协议,例如低功耗蓝牙协议、经典蓝牙协议或者wifi等,本实施例对此不做限定。请参考图1,图1是本申请实施例的语音分频传输方法的流程图,该方法包括以下步骤:The embodiment of the present application provides a voice frequency division transmission method, which can be applied to various audio electronic devices. This embodiment describes a voice frequency division transmission method provided in the embodiment of the present application from the perspective of the source end. The source end is the transmitting end of the voice signal, and the playback end is the receiving end of the voice signal. Generally speaking, the source end can be an electronic device such as a mobile phone, TV, computer, tablet or mp3 that stores the voice signal, or one of the electronic devices or Multiple chips, and the playback end may be an electronic device such as a speaker or earphone that plays the voice signal, or one or more chips in the electronic device. The source terminal and the playback terminal in this embodiment can support various transmission protocols, such as Bluetooth low energy protocol, classic Bluetooth protocol, or wifi, which is not limited in this embodiment. Please refer to FIG. 1, which is a flowchart of a voice frequency division transmission method according to an embodiment of the present application. The method includes the following steps:
100:源端对第一频段语音信号和第二频段语音信号进行编码;100: The source end encodes the voice signal in the first frequency band and the voice signal in the second frequency band;
本实施例中,对第一频段语音信号和第二频段语音信号的频段数不做限制,第一频段语音信号或第二频段语音信号可以是一个频段的语音信号或者多个子频段语音信号。本实施例中,对源端的具体编码方式不做限制,用户可以根据各个频段语音信号的特点选取对应的编码方式,本实施例中,第一频段语音信号和第二频段语音信号可以通过滤波得到,本实施例对滤波器种类不做限制,可以是低通、高通或者带通滤波器及多种滤波器的组合等;本实施例中,第一频段语音信号和第二频段语音信号也可以通过滤波以外的其他方式得到,例如,如果编码方式仅仅针对特定频段的语音信号进行编码,那么不需要进行滤波得到该特定频段的语音信号,直接将所有频段的语音信号传输 到编码器即可实现在该频段的编码,以opus编码为例,opus可以实现对特定频段的语音信号的编码,而不需要通过滤波等方式得到该特定频段。一般来说,语音信号的全部频段为20Hz-20kHz,本实施例中的第一频段语音信号和第二频段语音信号可以是20Hz-20kHz中的任意频段,也可以是20Hz-20kHz之外的任意频段。In this embodiment, the number of frequency bands of the first frequency band speech signal and the second frequency band speech signal is not limited, and the first frequency band speech signal or the second frequency band speech signal may be a speech signal in one frequency band or multiple sub-band speech signals. In this embodiment, there is no restriction on the specific encoding method of the source end. The user can select the corresponding encoding method according to the characteristics of the voice signal of each frequency band. In this embodiment, the first frequency band speech signal and the second frequency band speech signal can be obtained through filtering. This embodiment does not limit the type of filter, which can be a low-pass, high-pass, or band-pass filter and a combination of multiple filters; in this embodiment, the first frequency band speech signal and the second frequency band speech signal can also be Obtained by other methods besides filtering. For example, if the encoding method only encodes the voice signal of a specific frequency band, then there is no need to filter to obtain the voice signal of the specific frequency band, and the voice signal of all frequency bands can be directly transmitted to the encoder. In the coding of this frequency band, taking opus coding as an example, opus can realize the coding of voice signals in a specific frequency band without the need to obtain the specific frequency band by means of filtering or the like. Generally speaking, all frequency bands of the voice signal are 20Hz-20kHz. The voice signal of the first frequency band and the voice signal of the second frequency band in this embodiment can be any frequency band in 20Hz-20kHz, or any frequency other than 20Hz-20kHz. Frequency band.
101:源端将帧同步信息标记到编码后的第一频段语音信号和编码后的第二频段语音信号中;101: The source terminal marks the frame synchronization information into the encoded first frequency band speech signal and the encoded second frequency band speech signal;
由于源端通过第一同步链路和第二同步链路分别发送编码后的第一频段语音信号和编码后的第二频段语音信号给播放端,因此,在播放端需要执行同步才能提高音频质量,而播放端执行同步时需要帧同步信息,因此,源端可以将帧同步信息标记到编码后的第一频段语音信号和编码后的第二频段语音信号中。Since the source end sends the encoded first frequency band speech signal and the encoded second frequency band speech signal to the playback end through the first synchronization link and the second synchronization link, respectively, synchronization needs to be performed on the playback end to improve audio quality , And the playback end needs frame synchronization information when performing synchronization. Therefore, the source end can mark the frame synchronization information into the encoded first frequency band speech signal and the encoded second frequency band speech signal.
102:源端通过第一同步链路和第二同步链路分别发送带有帧同步信息的编码后的第一频段语音信号和带有帧同步信息的编码后的第二频段语音信号给播放端。102: The source end sends the encoded first frequency band speech signal with frame synchronization information and the encoded second frequency band speech signal with frame synchronization information to the playback end through the first synchronization link and the second synchronization link, respectively .
同步链路可以用来传输语音信号,同步链路为保证数据的同步性以及实时性只允许有限次重传,假设系统原有音频传输方法使用第一同步链路传输语音信号,本申请实施例可以完全兼容系统原有音频传输方法,源端和播放端通过原有音频传输方法进行音频传输时,本方案只需要在此基础上通过第一同步链路和第二同步链路分别发送带有帧同步信息的编码后的第一频段语音信号和第二频段语音信号给播放端,不会影响系统原有音频传输,本实施例中,如果第二频段语音信号分为多个子频段语音信号,则可以建立多条第二同步链路分别传输该多个子频段语音信号,本实施例对第二同步链路的条数不做限制,可以是一条或者是多条,本实施例中,源端可以仅建立一条与播放端之间的第二同步链路,并且通过该第二同步链路发送带有帧同步信息的编码后的第二频段语音信号给播放端,另外,如果第二频段语音信号分为多个子频段语音信号,也可以通过该第二同步链路传输该多个子频段语音信号,当通过该第二同步链路传输该多个子频段语音 信号时,可以通过分时复用的方式。本实施例中,可以只建立一条第二同步链路来传输第二频段语音信号,如果将第二频段语音信号分为两个子频段语音信号,则可以只建立一条或两条第二同步链路来传输这两个子频段语音信号。The synchronization link can be used to transmit voice signals. In order to ensure data synchronization and real-time performance, the synchronization link only allows a limited number of retransmissions. It is assumed that the original audio transmission method of the system uses the first synchronization link to transmit voice signals. This embodiment of the application It can be fully compatible with the original audio transmission method of the system. When the source end and the playback end use the original audio transmission method for audio transmission, this solution only needs to send the data with the first synchronization link and the second synchronization link on this basis. The encoded first frequency band speech signal and second frequency band speech signal of the frame synchronization information are sent to the playback end without affecting the original audio transmission of the system. In this embodiment, if the second frequency band speech signal is divided into multiple sub-band speech signals, Then, multiple second synchronization links can be established to respectively transmit the multiple sub-band voice signals. In this embodiment, the number of second synchronization links is not limited, and it can be one or more. In this embodiment, the source end It is possible to establish only a second synchronization link with the playback terminal, and send the encoded second frequency band voice signal with frame synchronization information to the playback terminal through the second synchronization link. In addition, if the second frequency band voice signal The signal is divided into multiple sub-band voice signals, and the multiple sub-band voice signals can also be transmitted through the second synchronization link. When the multiple sub-band voice signals are transmitted through the second synchronization link, time division multiplexing the way. In this embodiment, only one second synchronization link may be established to transmit the second frequency band voice signal. If the second frequency band voice signal is divided into two sub-band voice signals, only one or two second synchronization links may be established To transmit these two sub-band voice signals.
源端通过第一同步链路和第二同步链路分别传输带有帧同步信息的编码后的第一频段语音信号和第二频段语音信号给播放端,以便播放端进一步对接收到的第一频段语音信号和第二频段语音信号进行处理。当系统原有音频传输方法使用一条第一同步链路传输主频段的语音信号时,例如,当主频段为20Hz-18kHz的语音信号时,系统原有音频传输方法可以使用一条第一同步链路传输20Hz-18kHz的语音信号,而18kHz-20kHz的语音信号通常会直接舍弃,以节省带宽,但是这会使得播放端接收到的语音信号中的高频部分丢失,音质下降,因此,本实施例可以通过第二同步链路传输编码后的18kHz-20kHz的语音信号给播放端,可以在源端确保语音信号的完整性,使得全部或者大部分频段的音频信号都进行编码后传输,以起到提高音质的作用,需要说明的是,第一同步链路和第二同步链路可以传输任意频段的语音信号,本实施例对此不做限制。当需要改善音质时,利用第一同步链路和第二同步链路来分别传输第一频段语音信号和第二频段语音信号,若系统原有音频传输方法使用第一同步链路传输编码后的第一频段语音信号,则系统只需要建立第二同步链路来传输编码后的第二频段语音信号就可以起到提高音质的作用,并且,在建立第二同步链路或者传输编码后的第二频段语音信号时,不会影响第一同步链路传输编码后的第一频段语音信号,因此,用户不会听到正在播放的音频出现短时停顿现象。The source end transmits the encoded first-band speech signal and second-band speech signal with frame synchronization information to the playback end through the first synchronization link and the second synchronization link, so that the playback end can further verify the received first frequency The frequency band speech signal and the second frequency band speech signal are processed. When the original audio transmission method of the system uses a first synchronization link to transmit the voice signal of the main frequency band, for example, when the main frequency band is 20Hz-18kHz speech signal, the original audio transmission method of the system can use a first synchronization link to transmit The voice signal of 20Hz-18kHz, and the voice signal of 18kHz-20kHz are usually discarded directly to save bandwidth, but this will cause the high frequency part of the voice signal received by the playback end to be lost, and the sound quality is reduced. Therefore, this embodiment can The encoded 18kHz-20kHz voice signal is transmitted to the playback end through the second synchronization link, and the integrity of the voice signal can be ensured at the source end, so that all or most of the frequency band audio signals are encoded and transmitted to improve For the effect of sound quality, it should be noted that the first synchronization link and the second synchronization link can transmit voice signals in any frequency band, which is not limited in this embodiment. When the sound quality needs to be improved, the first synchronization link and the second synchronization link are used to respectively transmit the first frequency band voice signal and the second frequency band voice signal. If the original audio transmission method of the system uses the first synchronization link to transmit the encoded For the first frequency band speech signal, the system only needs to establish a second synchronization link to transmit the encoded second frequency band speech signal to improve the sound quality. Moreover, when establishing the second synchronization link or transmitting the encoded first frequency signal In the case of a two-band voice signal, it will not affect the first synchronization link to transmit the encoded first-band voice signal. Therefore, the user will not hear a short pause in the audio being played.
本申请实施例提供了一种语音分频传输方法,通过第一同步链路和第二同步链路分别发送带有帧同步信息的编码后的第一频段语音信号和带有帧同步信息的编码后的第二频段语音信号给播放端,解决了由于传输带宽限制带来的音质下降的问题和改善音质时影响正在播放的音频的问题。The embodiment of the present application provides a voice frequency division transmission method, which transmits an encoded first frequency band voice signal with frame synchronization information and an encoded voice signal with frame synchronization information through a first synchronization link and a second synchronization link, respectively The second frequency band voice signal is sent to the playback end, which solves the problem of sound quality degradation caused by transmission bandwidth limitation and the problem of affecting the audio being played when the sound quality is improved.
基于上述实施例公开的内容,本实施例中,第二频段语音信号的编码方式的压缩率高于第一频段语音信号的编码方式的压缩率;或者Based on the content disclosed in the foregoing embodiment, in this embodiment, the compression rate of the encoding method of the voice signal in the second frequency band is higher than the compression rate of the encoding method of the voice signal in the first frequency band; or
第二频段语音信号的编码方式的压缩率低于第一频段语音信号的编码方式的压缩率。The compression rate of the encoding method of the speech signal in the second frequency band is lower than the compression rate of the encoding method of the speech signal in the first frequency band.
将语音信号分为了第一频段语音信号和第二频段语音信号,第一频段语音信号和第二频段语音信号的编码方式的压缩率可以不同,当语音信号以第一频段语音信号为主要部分时,可以设置第二频段语音信号的编码方式的压缩率高于第一频段语音信号的编码方式的压缩率,以此,第一频段语音信号被压缩后仍然保留有大部分的原始语音信号,在播放端进行播放时,不影响用户体验,而对于第二频段语音信号,虽然压缩率较高,但是带宽占用较少,而且由于该第二频段语音信号依然发送到了播放端,播放端播放该第二频段语音信号后,还是可以起到提高音质的作用,这种压缩率的设置方式仅仅增加少量带宽占用就可以为用户提供更好的音质体验;同样地,当语音信号以第二频段语音信号为主要部分时,可以设置第二频段语音信号的编码方式的压缩率低于第一频段语音信号的编码方式的压缩率。The speech signal is divided into the first frequency band speech signal and the second frequency band speech signal. The compression rate of the encoding method of the first frequency band speech signal and the second frequency band speech signal can be different. When the speech signal is mainly composed of the first frequency band speech signal , You can set the compression rate of the encoding method of the voice signal in the second frequency band to be higher than the compression rate of the encoding method of the voice signal in the first frequency band. In this way, most of the original voice signal remains after the voice signal in the first frequency band is compressed. When the player is playing, it does not affect the user experience. For the voice signal of the second frequency band, although the compression rate is higher, the bandwidth is less occupied, and because the voice signal of the second frequency band is still sent to the player, the player plays the first After the voice signal in the second frequency band, it can still improve the sound quality. This compression ratio setting method can provide users with a better sound quality experience by only increasing a small amount of bandwidth. Similarly, when the voice signal is in the second frequency band When it is the main part, the compression rate of the encoding method of the second frequency band voice signal can be set to be lower than the compression rate of the encoding method of the first frequency band voice signal.
基于上述实施例公开的内容,本实施例中,源端对第二频段语音信号进行编码包括:源端对高频语音信号进行编码,或者源端对非高频语音信号进行编码。第二频段语音信号可以为高频语音信号或者非高频语音信号,对于高频语音信号,由于其占用带宽较多,因此,对高频语音信号进行编码时,可以选取适合高频语音信号的编码方式,以提高音质的同时节省部分带宽。由于高频语音信号和非高频语音信号的特性不同,因此对于高频语音信号和非高频语音信号可以采取不同的编码方式,例如,18kHz-20kHz的高频语音信号的编码方式可以包括CELT编码方式或SBR(Spectral Band Replication,频段复制)编码方式,其中,CELT编码方式为opus编码器中内置的一种核心编码算法;非高频语音信号的编码方式可以包括SILK编码方式、SBC编码方式、AAC编码方式或MP3编码方式,其中,SILK编码方式也是opus编码器中内置的一种核心编码算法;需要说明的是,本实 施例以第二频段语音信号为高频语音信号或者非高频语音信号为例,用户可以根据需求将第二频段语音信号划分为其他多个子频段语音信号然后分别进行编码,本实施例对此不做限制。本实施例中,对高频语音信号或者非高频语音信号分别进行编码后,可以使用第二同步链路传输编码后的高频语音信号或者非高频语音信号。Based on the content disclosed in the foregoing embodiment, in this embodiment, the source end encoding the second frequency band voice signal includes: the source end encoding the high-frequency voice signal, or the source end encoding the non-high-frequency voice signal. The voice signal in the second frequency band can be a high-frequency voice signal or a non-high-frequency voice signal. For high-frequency voice signals, because they occupy more bandwidth, when encoding high-frequency voice signals, you can select those suitable for high-frequency voice signals. Encoding method to improve sound quality while saving some bandwidth. Due to the different characteristics of high-frequency voice signals and non-high-frequency voice signals, different encoding methods can be adopted for high-frequency voice signals and non-high-frequency voice signals. For example, the encoding methods of high-frequency voice signals of 18kHz-20kHz can include CELT Encoding method or SBR (Spectral Band Replication) encoding method, where CELT encoding method is a core encoding algorithm built into the opus encoder; encoding methods for non-high frequency speech signals can include SILK encoding method and SBC encoding method , AAC encoding method or MP3 encoding method, where the SILK encoding method is also a core encoding algorithm built in the opus encoder; it should be noted that in this embodiment, the second frequency band speech signal is a high-frequency speech signal or a non-high-frequency speech signal. Take the voice signal as an example. The user can divide the voice signal in the second frequency band into multiple sub-band voice signals according to requirements and then encode them separately, which is not limited in this embodiment. In this embodiment, after encoding the high-frequency voice signal or the non-high-frequency voice signal respectively, the second synchronization link may be used to transmit the encoded high-frequency voice signal or the non-high-frequency voice signal.
本实施例中,源端对第一频段语音信号和第二频段语音信号的编码方式可以相同也可以不同,当源端对第一频段语音信号和第二频段语音信号的编码方式不同时,可以在提高音质的同时节省部分带宽。本实施例的编码方式的选取可以多样化,以第二频段语音信号为例,若第二频段语音信号分为多个子频段语音信号,则对该多个子频段语音信号,也可以采取相同或者不同的编码方式。对不同频段的语音信号选择不同的编码方式或者是设置不同的码率或者压缩率,都可以通过调整数据压缩比,仅仅增加少量带宽占用就可以为用户提供更好的音质体验。In this embodiment, the source end encoding the first frequency band speech signal and the second frequency band speech signal may be the same or different. When the source end encodes the first frequency band speech signal and the second frequency band speech signal in a different way, Save some bandwidth while improving sound quality. The selection of encoding methods in this embodiment can be diversified. Taking the voice signal in the second frequency band as an example, if the voice signal in the second frequency band is divided into multiple sub-band voice signals, the multiple sub-band voice signals can also be the same or different. Encoding method. Choosing different encoding methods or setting different code rates or compression rates for voice signals of different frequency bands can provide users with a better sound quality experience by adjusting the data compression ratio and only increasing a small amount of bandwidth occupation.
基于上述实施例公开的内容,本实施例中,源端将帧同步信息标记到编码后的第一频段语音信号和编码后的第二频段语音信号中包括源端标记预设延时内检测到的编码后的任一频段的语音信号为一帧数据,帧同步信息包括一帧数据的开始时刻和结束时刻。Based on the content disclosed in the above-mentioned embodiment, in this embodiment, the source terminal marks the frame synchronization information into the encoded first frequency band speech signal and the encoded second frequency band speech signal includes the source marking detected within a preset delay The encoded voice signal in any frequency band is a frame of data, and the frame synchronization information includes the start time and end time of a frame of data.
为了能正确分离每一帧数据,以保证音频质量,源端可以获取帧同步信息后发送给播放端,以便播放端获取该帧同步信息以执行同步。In order to correctly separate each frame of data to ensure audio quality, the source can obtain the frame synchronization information and send it to the playback end, so that the playback end can obtain the frame synchronization information to perform synchronization.
基于上述实施例公开的内容,本实施例中,请参考图2,图2是本申请实施例的源端标记预设延时内检测到的编码后的任一频段的语音信号为一帧数据的流程图,具体可以包括以下步骤:Based on the content disclosed in the foregoing embodiment, in this embodiment, please refer to FIG. 2. FIG. 2 is an embodiment of the present application marking the coded voice signal in any frequency band detected within a preset delay time as a frame of data The flow chart can specifically include the following steps:
200:源端检测到编码后的多个频段的语音信号中的任一频段的语音信号;200: The source detects a voice signal in any frequency band among the encoded voice signals in multiple frequency bands;
201:源端开始计时第一延时直到第一延时等于预设延时;201: The source starts timing the first delay until the first delay is equal to the preset delay;
202:源端标记第一延时内检测到的多个频段的语音信号为第一帧数据;202: The source marks the detected voice signals in multiple frequency bands within the first delay as the first frame of data;
203:第一延时之后,源端再次检测到编码后的多个频段的语音 信号中的任一频段的语音信号;203: After the first delay, the source again detects a voice signal in any frequency band among the encoded voice signals in multiple frequency bands;
204:源端开始计时第二延时直到第二延时等于预设延时;204: The source starts timing the second delay until the second delay is equal to the preset delay;
205:源端标记第二延时内检测到的多个频段的语音信号为第二帧数据;205: The source marks the voice signals of multiple frequency bands detected within the second delay as the second frame of data;
206:第二延时之后,源端再次检测到编码后的多个频段的语音信号中的任一频段的语音信号;206: After the second delay, the source again detects a voice signal in any frequency band among the encoded voice signals in multiple frequency bands;
207:源端开始计时第N延时直到第N延时等于预设延时,N>2且为整数;207: The source starts timing the Nth delay until the Nth delay is equal to the preset delay, N>2 and an integer;
208:源端标记第N延时内检测到的多个频段的语音信号为第N帧数据。208: The source marks the voice signals of multiple frequency bands detected within the Nth delay as the Nth frame of data.
源端标记预设延时内检测到的编码后的任一频段的语音信号为一帧数据后,任一频段的语音信号都包含了标记信息,因此播放端可以识别出同一帧语音信号。源端获取到该帧同步信息,帧同步信息包括一帧数据的开始时刻和结束时刻。N的值可以由用户根据语音信号的数据量大小来设置,本实施例对此不做限制。该帧同步信息获取之后,源端将该帧同步信息发送给播放端,播放端收到该帧同步信息就可以利用该帧同步信息对播放端接收到的语音信号进行同步,以进一步提高音频质量。可以选择将该第一帧数据、第二帧数据和第N帧数据的开始时刻和结束时刻标记到对应的一帧数据中,以便播放端根据该开始时刻和结束时刻进行同步。After the source end marks the encoded speech signal of any frequency band detected within the preset delay as one frame of data, the speech signal of any frequency band contains the marking information, so the player can recognize the same frame of speech signal. The source terminal obtains the frame synchronization information, and the frame synchronization information includes the start time and end time of a frame of data. The value of N can be set by the user according to the data volume of the voice signal, which is not limited in this embodiment. After the frame synchronization information is obtained, the source end sends the frame synchronization information to the playback end. Upon receiving the frame synchronization information, the playback end can use the frame synchronization information to synchronize the voice signal received by the playback end to further improve audio quality . You can choose to mark the start time and end time of the first frame data, the second frame data, and the Nth frame data into the corresponding frame of data, so that the player can synchronize according to the start time and end time.
以源端仅建立一条第二同步链路,并且该第二同步链路用于传输编码后的高频语音信号为例,需要说明的是,本实施例中,该第二同步链路也可以用于传输编码后的非高频语音信号,本实施例对此不做限制;图3(a)表示源端对第一频段语音信号和第二频段语音信号的处理过程,如图3(a)所示,此实施例仅以第一频段语音信号和第二频段语音信号分别只有一个子频段语音信号为例进行说明,但是本实施的源端标记预设延时内检测到的编码后的任一频段的语音信号为一帧数据的方法也可以应用于第一频段语音信号和第二频段语音信号分别有多个子频段语音信号的应用场景。Taking the source end establishes only one second synchronization link, and the second synchronization link is used to transmit the encoded high-frequency voice signal as an example, it should be noted that in this embodiment, the second synchronization link may also It is used to transmit the encoded non-high frequency voice signal, and this embodiment does not restrict it; Figure 3(a) shows the source end processing the voice signal in the first frequency band and the voice signal in the second frequency band, as shown in Figure 3(a) ), this embodiment only takes the first frequency band speech signal and the second frequency band speech signal each having only one sub-band speech signal as an example, but the source end of this embodiment marks the encoded code detected within the preset delay The method in which the voice signal in any frequency band is one frame of data can also be applied to an application scenario where the voice signal in the first frequency band and the voice signal in the second frequency band respectively have multiple sub-band voice signals.
如图3(a)所示,在源端,对未编码的语音信号按照数据帧进行划分,以第一帧为例进行说明,可以通过滤波器将第一帧分为高频语音信号和非高频语音信号,然后对高频语音信号和非高频语音信号分别进行编码,对高频语音信号进行编码后为01DATA1,对非高频语音信号进行编码后为01DATA2,编码后的非高频语音信号01DATA2通过第一同步链路传输到播放端,编码后的高频语音信号01DATA1通过第二同步链路传输到播放端。对第二帧、第三帧以及第N帧也按照第一帧的处理方式将编码后的第一频段语音信号和第二频段语音信号传输到播放端。As shown in Figure 3(a), at the source end, the unencoded speech signal is divided into data frames. Taking the first frame as an example, the first frame can be divided into high-frequency speech signals and non-frequency speech signals through filters. High-frequency voice signal, and then encode the high-frequency voice signal and the non-high-frequency voice signal separately. After the high-frequency voice signal is encoded, it is 01DATA1, and the non-high-frequency voice signal is encoded as 01DATA2. The voice signal 01DATA2 is transmitted to the playback end through the first synchronization link, and the encoded high-frequency voice signal 01DATA1 is transmitted to the playback end through the second synchronization link. For the second frame, the third frame, and the Nth frame, the encoded voice signal in the first frequency band and the voice signal in the second frequency band are also transmitted to the playback terminal according to the processing manner of the first frame.
以分时传输为例进行说明,图3(c)表示源端标记预设延时内检测到的编码后的任一频段的语音信号为一帧数据的过程,如图3(c)所示,源端每隔一个间隔时间Interval发送一帧数据,当源端发送语音信号时,源端检测到编码后的多个频段的语音信号中的任一频段的语音信号时开始计时第一延时,本实施例中,源端检测到第一帧数据中的编码后的高频语音信号01DATA1则开始计时第一延时Delay1,直到第一延时等于预设延时,预设延时可以大于、小于或等于间隔时间Interval,本实施例对预设延时的具体长度不做限制。源端标记第一延时内检测到的多个频段的语音信号为第一帧数据,本实施例中第一帧数据为01DATA1和01DATA2,帧同步信息包括01DATA1和01DATA2的开始时刻和结束时刻;第一延时Delay1之后,源端再次检测到编码后的多个频段的语音信号中的任一频段的语音信号02DATA1,则开始计时第二延时Delay2,直到第二延时等于预设延时;源端标记第二延时内检测到的02DATA1和02DATA2为第二帧数据,帧同步信息包括01DATA1和01DATA2的开始时刻和结束时刻,依此类推,直到源端标记完全部数据。该帧同步信息也可以包括第一延时Delay1和第二延时Delay2的结束时刻End1和End2,第一帧数据的帧同步信息可以直接写入每一帧数据中,例如写入到01DATA1和01DATA2中,以便播放端获取到该帧同步信息执行同步。Taking time-sharing transmission as an example, Figure 3(c) shows the process of marking the encoded voice signal in any frequency band detected within the preset delay time as one frame of data at the source, as shown in Figure 3(c) , The source sends a frame of data at an interval Interval. When the source sends a voice signal, the source starts to count the first delay when it detects a voice signal in any one of the encoded voice signals in multiple frequency bands In this embodiment, when the source detects the encoded high-frequency voice signal 01DATA1 in the first frame of data, it starts timing the first delay Delay1 until the first delay is equal to the preset delay, which can be greater than , Is less than or equal to the interval time Interval, this embodiment does not limit the specific length of the preset delay. The source marks the detected voice signals in multiple frequency bands within the first delay as the first frame of data. In this embodiment, the first frame of data is 01DATA1 and 01DATA2, and the frame synchronization information includes the start time and end time of 01DATA1 and 01DATA2; After the first delay Delay1, the source again detects the voice signal 02DATA1 in any frequency band among the encoded voice signals in multiple frequency bands, and then starts timing the second delay Delay2 until the second delay is equal to the preset delay ; The source marks the 02DATA1 and 02DATA2 detected within the second delay as the second frame of data, and the frame synchronization information includes the start and end times of 01DATA1 and 01DATA2, and so on, until the source marks all the data. The frame synchronization information can also include the end moments End1 and End2 of the first delay Delay1 and the second delay Delay2. The frame synchronization information of the first frame of data can be directly written into each frame of data, for example, written to 01DATA1 and 01DATA2. , So that the player can obtain the frame synchronization information and perform synchronization.
如图3(b)所示,在播放端,播放端获取到该帧同步信息之后, 可以执行同步,如果播放端获取到该帧同步信息,则在该第一延时的结束时刻End1对接收到的编码后的第一帧数据开始解码,在该第二延时的结束时刻End2对接收到的编码后的第二帧数据开始解码,依此类推,以实现同步;或者是在一帧数据的结束时刻开始解码以实现同步;播放端获取到帧同步信息后,由于该帧同步信息已经写入到01DATA1和01DATA2中,因此,播放端也可以根据01DATA1和01DATA2中的具体数据来执行同步。As shown in Figure 3(b), at the playback end, after the playback end obtains the frame synchronization information, synchronization can be performed. If the playback end obtains the frame synchronization information, the end of the first delay time End1 pair The first frame of data received after encoding starts to be decoded, and at the end of the second delay, End2 starts to decode the received second frame of data after encoding, and so on, to achieve synchronization; or one frame of data Start decoding to achieve synchronization; after the player obtains the frame synchronization information, since the frame synchronization information has been written into 01DATA1 and 01DATA2, the player can also perform synchronization based on the specific data in 01DATA1 and 01DATA2.
需要说明的是,本实施例中源端标记预设延时内检测到的编码后的任一频段的语音信号为一帧数据的方法仅为示例性说明,在实际使用中,本领域的技术人员可以参照本申请实施例的方案,在不付出创造性劳动的前提下,还可以根据此实施例获得其他获取帧同步信息的方法。It should be noted that, in this embodiment, the method in which the source terminal marks the encoded voice signal in any frequency band detected within the preset delay time as one frame of data is only an exemplary description. In actual use, the technology in the art Personnel can refer to the solution of the embodiment of the present application, and without creative work, can also obtain other methods of obtaining frame synchronization information according to this embodiment.
基于上述实施例公开的内容,本实施例中,源端通过第二同步链路发送带有帧同步信息的编码后的第二频段语音信号给播放端之前,包括以下步骤:Based on the content disclosed in the foregoing embodiment, in this embodiment, before the source end sends the encoded second frequency band voice signal with frame synchronization information to the playback end through the second synchronization link, the following steps are included:
300:根据播放端发送的播放端支持的链路参数确定使用的链路参数;300: Determine the link parameters used according to the link parameters supported by the playback end sent by the playback end;
301:源端根据使用的链路参数建立与播放端之间的第二同步链路。301: The source end establishes a second synchronization link with the playback end according to the used link parameters.
源端通过第二同步链路发送带有帧同步信息的编码后的第二频段语音信号给播放端之前,需要根据使用的链路参数建立与播放端之间的第二同步链路,而源端可以根据播放端发送的播放端支持的链路参数确定使用的链路参数。Before the source end sends the encoded second frequency band voice signal with frame synchronization information to the playback end through the second synchronization link, it needs to establish a second synchronization link with the playback end according to the link parameters used, and the source The terminal can determine the link parameters used according to the link parameters supported by the playback terminal sent by the playback terminal.
在本实施例中,源端确定建立第二同步链路后,源端也可以将源端支持的链路参数发送给播放端,以便播放端根据源端支持的链路参数和播放端支持的链路参数选择某一个或多个源端和播放端均支持的链路参数发送给源端,以便源端确定使用的链路参数。源端将源端支持的链路参数发送给播放端,可以使得播放端确定使用的链路参数,源端发送源端支持的链路参数给播放端,当播放端根据源端支持的链 路参数和播放端支持的链路参数选择某一个源端和播放端均支持的链路参数发送给源端时,播放端选择的这个链路参数就是使用的链路参数,即播放端也可以确定使用的链路参数,播放端可以根据源端发送的源端支持的链路参数确定使用的链路参数。In this embodiment, after the source terminal determines to establish the second synchronization link, the source terminal can also send the link parameters supported by the source terminal to the playback terminal, so that the playback terminal can according to the link parameters supported by the source terminal and the link parameters supported by the playback terminal. The link parameter selects one or more link parameters supported by both the source and the playback end and sends it to the source so that the source can determine the link parameters used. The source end sends the link parameters supported by the source end to the playback end, which enables the playback end to determine the link parameters used. The source end sends the link parameters supported by the source end to the playback end. When the playback end is based on the link supported by the source end, Parameters and Link Parameters Supported by the Player When a link parameter supported by both the source and the player is selected and sent to the source, the link parameter selected by the player is the link parameter used, that is, the player can also determine For the link parameters used, the player can determine the link parameters to be used according to the link parameters supported by the source sent by the source.
经过上述步骤之后,使用的链路参数就确定了,以便于第二同步链路的建立,该链路参数包括数据帧大小和数据帧间隔,PHY(PhysicalLayer,物理层)的通信信息等。After the above steps, the link parameters used are determined to facilitate the establishment of the second synchronization link. The link parameters include data frame size and data frame interval, PHY (PhysicalLayer, physical layer) communication information, etc.
基于上述实施例公开的内容,本实施例中,根据播放端发送的播放端支持的链路参数确定使用的链路参数之前,包括以下步骤:Based on the content disclosed in the foregoing embodiment, in this embodiment, before determining the link parameters used according to the link parameters supported by the playback terminal sent by the playback terminal, the following steps are included:
400:源端发送第二同步链路请求给播放端;400: The source end sends a second synchronization link request to the playback end;
401:源端接收第二同步链路请求的回复;401: The source receives the reply to the second synchronization link request;
402:源端根据第二同步链路请求的回复确定是否建立第二同步链路。402: The source terminal determines whether to establish the second synchronization link according to the reply of the second synchronization link request.
当播放端收到源端发送的第二同步链路请求之后,播放端发送第二同步链路请求的回复给源端,源端根据第二同步链路请求的回复确定是否建立第二同步链路。After the playback end receives the second synchronization link request sent by the source end, the playback end sends a reply to the second synchronization link request to the source end, and the source end determines whether to establish the second synchronization link according to the response of the second synchronization link request road.
源端发送第二同步链路请求给播放端,以便播放端选择是否接受建立第二同步链路,当播放端选择接受第二同步链路的建立,则发送播放端支持的链路参数给源端,以便源端确定使用的链路参数。播放端收到源端发送的第二同步链路请求之后,播放端发送第二同步链路请求的回复,以便源端确定是否建立第二同步链路,在源端确定建立第二同步链路后,源端根据播放端支持的链路参数确定使用的链路参数,需要说明的是,该使用的链路参数需要播放端和源端同时支持。The source sends a second synchronization link request to the playback end, so that the playback end can choose whether to accept the establishment of the second synchronization link. When the playback end chooses to accept the establishment of the second synchronization link, it sends the link parameters supported by the playback end to the source. So that the source can determine the link parameters used. After the playback end receives the second synchronization link request sent by the source end, the playback end sends a reply to the second synchronization link request so that the source end determines whether to establish the second synchronization link, and the source end determines to establish the second synchronization link Later, the source terminal determines the link parameters used according to the link parameters supported by the playback terminal. It should be noted that the link parameters used need to be supported by both the playback terminal and the source terminal.
基于上述实施例公开的内容,本实施例中,源端对第二频段语音信号进行编码之前,源端根据控制数据流判断播放端是否支持语音分频传输。当源端与播放端建立连接后,源端可以通过异步链路向播放端发送控制数据流的请求,控制数据流可以通过源端与播放端之间的异步链路传输,异步链路允许无限次重传,因此,不会丢失数据包;播放端收到源端发送的控制数据流的请求之后,播放端发送控制数据 流给源端,源端可以通过播放端发送的控制数据流来判断播放端是否支持该语音分频传输,在开始语音分频传输前执行判断,以免在播放端不支持语音分频传输的情况下增加耗能,当源端判断播放端不支持语音分频传输时,则不需要进行语音分频传输,源端会沿用系统原有音频传输方法进行音频传输。Based on the content disclosed in the foregoing embodiment, in this embodiment, before the source end encodes the voice signal in the second frequency band, the source end determines whether the playback end supports voice frequency division transmission according to the control data stream. When a connection between the source and the player is established, the source can send a request to control the data stream to the player through the asynchronous link. The control data stream can be transmitted through the asynchronous link between the source and the player. The asynchronous link allows unlimited Retransmission times, so no data packets will be lost; after the player receives the request for the control data stream sent by the source, the player sends the control data stream to the source, and the source can judge by the control data stream sent by the player Whether the playback end supports the voice frequency division transmission, perform a judgment before starting the voice frequency division transmission, so as not to increase energy consumption when the playback end does not support voice frequency division transmission, when the source end judges that the playback end does not support voice frequency division transmission , There is no need for voice frequency division transmission, and the source end will use the original audio transmission method of the system for audio transmission.
基于上述实施例公开的内容,本实施例中,若源端通过自定义通用唯一识别码(UUID)标识语音分频传输服务,源端根据控制数据流判断播放端是否支持语音分频传输包括:Based on the content disclosed in the foregoing embodiment, in this embodiment, if the source uses a custom universally unique identification code (UUID) to identify the voice frequency division transmission service, the source end judges whether the playback end supports voice frequency division transmission according to the control data stream includes:
控制数据流包括自定义UUID的值,若源端接收到的播放端的自定义UUID的值等于预设UUID值,源端确定播放端支持语音分频传输。源端使用自定义UUID的值来判断播放端是否支持语音分频传输,可以提高源端和播放端的设备兼容性。若源端接收到的播放端的自定义UUID的值等于预设UUID值,则源端确定播放端支持语音分频传输。以源端和播放端通过蓝牙连接为例,在蓝牙协议中,UUID被用来标识蓝牙设备所提供的服务,UUID类型可以为主服务(Primary Service)、特性(Characteristic)等,用户可以自定义通用唯一识别码(UUID)标识语音分频传输服务。该UUID的设置方式可以参考图4,图4中,Enhance Audio Value可以表示语音分频传输服务,Enhance Audio Value对应的预设UUID值用户可以自己设置,主服务和增强服务的预设UUID值可以相同,也可以不同;图4中,handle表示索引,该索引可以帮助找到UUID在内存中的地址,OxXXXX的具体值由UUID在内存中的具体地址确定,本实施例对0xXXXX的值不做限定,Y和Z的值由UUID对应的数据量大小确定,本实施例对此不做限制;图4中,0xOPQ为用户自定义的UUID,用户可以根据图4中的UUID的值的格式作为参考来设置预设UUID,需要说明的是,图4中的内容仅仅为示例性说明,在实际使用中,本领域的技术人员可以参照本申请实施例的方案,在不付出创造性劳动的前提下,还可以根据此实施例获得其他的UUID的值的设置方式。The control data stream includes the value of the custom UUID. If the custom UUID value of the player received by the source is equal to the preset UUID value, the source determines that the player supports voice frequency division transmission. The source uses the value of a custom UUID to determine whether the player supports voice frequency division transmission, which can improve the device compatibility between the source and the player. If the value of the custom UUID of the player received by the source is equal to the preset UUID value, the source determines that the player supports voice frequency division transmission. Take the Bluetooth connection between the source and the player as an example. In the Bluetooth protocol, UUID is used to identify the service provided by the Bluetooth device. The UUID type can be the primary service (Primary Service), characteristic (Characteristic), etc., and the user can customize The Universal Unique Identifier (UUID) identifies the voice frequency division transmission service. The UUID setting method can refer to Figure 4, in Figure 4, Enhance Audio Value can represent the voice frequency division transmission service, the user can set the preset UUID value corresponding to Enhance Audio Value, and the preset UUID value of the main service and the enhanced service can be The same or different; in Figure 4, handle represents an index, which can help find the address of the UUID in the memory. The specific value of OxXXXX is determined by the specific address of the UUID in the memory. This embodiment does not limit the value of 0xXXXX The values of Y and Z are determined by the size of the data corresponding to the UUID, and this embodiment does not limit this; in Figure 4, 0xOPQ is a user-defined UUID, and the user can refer to the format of the UUID value in Figure 4 To set the preset UUID, it should be noted that the content in Fig. 4 is only an exemplary description. In actual use, those skilled in the art can refer to the solutions of the embodiments of the present application, and without creative work, Other ways of setting the UUID value can also be obtained according to this embodiment.
基于上述实施例公开的内容,本实施例中,若源端根据控制数据 流确定播放端支持语音分频传输,则执行以下步骤:Based on the content disclosed in the foregoing embodiment, in this embodiment, if the source terminal determines that the playback terminal supports voice frequency division transmission according to the control data stream, the following steps are performed:
500:源端发送音频配置参数请求给播放端:500: The source sends an audio configuration parameter request to the player:
501:源端接收播放端支持的第二频段语音信号对应的音频配置参数,音频配置参数包括编解码参数和码率,编解码参数包括编码方式和解码方式其中的一种或两种;501: The source terminal receives the audio configuration parameters corresponding to the voice signal in the second frequency band supported by the playback terminal. The audio configuration parameters include coding and decoding parameters and bit rates, and the coding and decoding parameters include one or both of coding and decoding methods;
502:源端根据播放端支持的第二频段语音信号对应的音频配置参数确定使用的音频配置参数并将使用的音频配置参数发送给播放端。502: The source terminal determines the used audio configuration parameters according to the audio configuration parameters corresponding to the second frequency band voice signal supported by the playback terminal and sends the used audio configuration parameters to the playback terminal.
步骤500之后,播放端会收到源端发送的音频配置参数请求,播放端收到源端发送的音频配置参数请求之后,播放端发送播放端支持的第二频段语音信号对应的音频配置参数给源端,源端收到播放端支持的第二频段语音信号对应的音频配置参数之后,源端根据播放端支持的第二频段语音信号对应的音频配置参数确定使用的音频配置参数并将使用的音频配置参数发送给播放端,另外,使用的音频配置参数也需要是源端支持的音频配置参数。播放端收到源端发送的使用的音频配置参数并配置使用的音频配置参数,确定了使用的音频配置参数之后,播放端也可以配置使用的音频配置参数。After step 500, the player will receive the audio configuration parameter request sent by the source. After the player receives the audio configuration parameter request from the source, the player will send the audio configuration parameters corresponding to the second frequency band voice signal supported by the player to At the source end, after the source end receives the audio configuration parameters corresponding to the second frequency band voice signal supported by the player, the source end determines the audio configuration parameters to be used according to the audio configuration parameters corresponding to the second frequency band voice signal supported by the player The audio configuration parameters are sent to the player. In addition, the audio configuration parameters used need to be audio configuration parameters supported by the source. The playback terminal receives the used audio configuration parameters sent by the source terminal and configures the used audio configuration parameters. After determining the used audio configuration parameters, the playback terminal can also configure the used audio configuration parameters.
在本实施例中,源端发送音频配置参数请求给播放端后,源端也可以将源端支持的音频配置参数发送给播放端,以便播放端根据源端支持的音频配置参数和播放端支持的音频配置参数选择一种或多种源端和播放端均支持的音频配置参数发送给源端,以便源端确定使用的音频配置参数。源端可以将源端支持的音频配置参数发送给播放端,使得播放端也可以确定使用的音频配置参数,当播放端根据源端支持的音频配置参数和播放端支持的音频配置参数选择一种源端和播放端均支持的音频配置参数发送给源端时,播放端选择的这个音频配置参数就是使用的音频配置参数;In this embodiment, after the source end sends an audio configuration parameter request to the playback end, the source end can also send the audio configuration parameters supported by the source end to the playback end, so that the playback end can according to the audio configuration parameters supported by the source end and the playback end support Select one or more audio configuration parameters that are supported by both the source and the player to send the audio configuration parameters to the source so that the source can determine the audio configuration parameters used. The source end can send the audio configuration parameters supported by the source end to the playback end, so that the playback end can also determine the audio configuration parameters used. When the playback end selects one according to the audio configuration parameters supported by the source end and the audio configuration parameters supported by the playback end When the audio configuration parameters supported by the source and the player are sent to the source, the audio configuration parameter selected by the player is the audio configuration parameter used;
本实施例中,音频配置参数包括编解码参数和码率,编解码参数包括编码方式和解码方式中的一种或两种,源端收到播放端支持的编码方式和解码方式中的一种或两种后,源端可以确定使用的编码方式, 以便于播放端收到语音信号后可以解码;播放端收到源端支持的编码方式和解码方式中的一种或两种后,播放端可以确定使用的解码方式,以便于播放端对接收到的编码后的第一频段语音信号和第二频段语音信号进行解码;编解码参数包括编码方式或解码方式中的一种或两种,还可以包括采样深度和采样率,用户可以根据需求设置第二同步链路上的采样率和采样深度,以进一步提高音质,源端可以根据编码方式进行对应的编码,播放端可以根据解码方式进行对应的解码,另外,音频配置参数还可以包括设备号、设备地址等,便于源端和播放端的相互识别。另外,音频配置参数也可以包括码率,码率可以根据数据帧大小设置,设置码率可以进一步节省带宽,可以设置第二频段语音信号的码率高于或者低于第一频段语音信号的码率,例如,可以设置高频语音信号的码率低于非高频语音信号的码率,具体地,可以设置高频语音信号的码率低于或等于非高频语音信号的码率的百分之二十,本实施例对码率的具体数值不做限制。In this embodiment, the audio configuration parameters include encoding and decoding parameters and bit rates, the encoding and decoding parameters include one or two of encoding and decoding, and the source receives one of the encoding and decoding supported by the playback end. Or after both, the source can determine the encoding method used, so that the player can decode it after receiving the voice signal; after the player receives one or two of the encoding and decoding methods supported by the source, the player The decoding method used can be determined so that the player can decode the received encoded first-band voice signal and second-band voice signal; the encoding and decoding parameters include one or two of the encoding method or the decoding method, and It can include sampling depth and sampling rate. Users can set the sampling rate and sampling depth on the second synchronization link according to their needs to further improve the sound quality. The source can perform corresponding encoding according to the encoding method, and the playback end can correspond according to the decoding method. In addition, the audio configuration parameters can also include the device number, device address, etc., to facilitate the mutual recognition of the source and the player. In addition, the audio configuration parameters can also include the bit rate. The bit rate can be set according to the data frame size. Setting the bit rate can further save bandwidth. You can set the bit rate of the voice signal in the second frequency band to be higher or lower than that of the voice signal in the first frequency band. For example, the bit rate of the high-frequency voice signal can be set to be lower than the bit rate of the non-high-frequency voice signal. Specifically, the bit rate of the high-frequency voice signal can be set to be lower than or equal to a percentage of the bit rate of the non-high-frequency voice signal. Twenty percent, this embodiment does not limit the specific value of the code rate.
基于上述实施例公开的内容,本实施例中,第二同步链路的条数小于或等于第二频段语音信号的频段数。本实施例中,第二同步链路的条数可以等于第二频段语音信号的频段数,即每一条第二同步链路可以仅仅传输第二频段语音信号中的一个子频段语音信号;第二同步链路的条数可以小于第二频段语音信号的频段数,即每一条第二同步链路可以传输第二频段语音信号中的两个或两个的子频段语音信号。另外,源端可以根据电量情况或者第二同步链路的链路质量中止语音分频传输,在源端电量不足时,源端可以断开一条或多条第二同步链路以部分中止或者全部中止语音分频传输来延长续航时间,另外,断开一条或多条第二同步链路包括断开部分或者断开全部第二同步链路。在第二同步链路的链路质量较差时,源端可以断开一条或多条第二同步链路以部分中止或者全部中止语音分频传输来延长续航时间,以免既占用带宽也达不到提高音质的效果;另外,在一条或多条第二同步链路被断开后,源端可以根据电量情况或者第二同步链路的链路质量重新建立该一条或多条第二同步链路以重启或者增强语音分频 传输,例如,在源端电量达到电量预设值或者第二同步链路的链路质量提高到链路质量预设值时,源端可以重新建立一条或多条第二同步链路以重启或者增强语音分频传输。另外,对于播放端,也可以根据电量情况或者第二同步链路的链路质量断开一条或多条第二同步链路或者重新建立一条或多条第二同步链路,无论是源端还是播放端发起断开一条或多条第二同步链路,被发起端应该无条件地接受断开一条或多条第二同步链路。Based on the content disclosed in the foregoing embodiment, in this embodiment, the number of second synchronization links is less than or equal to the number of frequency bands of the second frequency band voice signal. In this embodiment, the number of second synchronization links may be equal to the number of frequency bands of the second frequency band voice signal, that is, each second synchronization link may only transmit one sub-band voice signal in the second frequency band voice signal; second The number of synchronization links may be less than the number of frequency bands of the second frequency band voice signal, that is, each second synchronization link may transmit two or two sub-band voice signals in the second frequency band voice signal. In addition, the source can suspend the voice frequency division transmission according to the power situation or the link quality of the second synchronization link. When the power of the source is insufficient, the source can disconnect one or more second synchronization links to partially or completely suspend The voice frequency division transmission is suspended to extend the battery life. In addition, disconnecting one or more second synchronization links includes disconnecting part or all of the second synchronization links. When the link quality of the second synchronization link is poor, the source can disconnect one or more second synchronization links to partially or completely suspend the voice frequency division transmission to extend the battery life, so as not to occupy the bandwidth and reach the limit. To improve the sound quality; in addition, after one or more second synchronization links are disconnected, the source can re-establish the one or more second synchronization links according to the power situation or the link quality of the second synchronization link To restart or enhance voice frequency division transmission, for example, when the power of the source reaches the preset power value or the link quality of the second synchronization link improves to the preset link quality, the source can re-establish one or more The second synchronization link is to restart or enhance voice frequency division transmission. In addition, for the playback end, one or more second synchronization links can be disconnected or one or more second synchronization links can be re-established according to the power situation or the link quality of the second synchronization link, whether it is the source end or The playback end initiates the disconnection of one or more second synchronization links, and the initiated end should unconditionally accept the disconnection of one or more second synchronization links.
本申请实施例提供了一种语音分频传输方法,本实施例从播放端的角度进行说明,请参考图5,图5为本申请实施例提供的一种语音分频传输方法的流程图,该方法包括:This embodiment of the application provides a voice frequency division transmission method. This embodiment is described from the perspective of the player. Please refer to FIG. 5. FIG. 5 is a flowchart of a voice frequency division transmission method provided by an embodiment of the application. Methods include:
600:播放端通过第一同步链路和第二同步链路分别接收带有帧同步信息的编码后的第一频段语音信号和带有帧同步信息的编码后的第二频段语音信号;600: The playback terminal receives the encoded first frequency band speech signal with frame synchronization information and the encoded second frequency band speech signal with frame synchronization information through the first synchronization link and the second synchronization link, respectively;
601:播放端对接收到的第一频段语音信号和第二频段语音信号进行解码;601: The playback terminal decodes the received voice signal in the first frequency band and the voice signal in the second frequency band;
602:播放端通过帧同步信息对解码后的第一频段语音信号和解码后的第二频段语音信号进行同步。602: The player uses the frame synchronization information to synchronize the decoded first frequency band speech signal and the decoded second frequency band speech signal.
本实施例中,源端建立与播放端之间的第二同步链路之后,源端通过第一同步链路和第二同步链路分别传输带有帧同步信息的编码后的第一频段语音信号和第二频段语音信号给播放端,播放端收到带有帧同步信息的编码后的第一频段语音信号和第二频段语音信号频段语音信号后,对该带有帧同步信息的编码后的第一频段语音信号和第二频段语音信号分别进行解码,本实施例对解码方式的种类不做限制。播放端对接收到的第一频段语音信号和第二频段语音信号可以采用相同的解码方式,也可以采用不同的解码方式,若第二频段语音信号包含多个子频段语音信号,播放端对接收到的编码后的多个子频段语音信号的解码方式可以相同,也可以不同,例如,若播放端对接收到的编码后的四个子频段语音信号进行解码,则可以选择一种、两种、三种或者四种解码方式。播放端对接收到的第一频段语音信号和第二 频段语音信号进行解码后,播放端通过帧同步信息对解码后的第一频段语音信号和第二频段语音信号进行同步,可以进一步提高音频质量,本实施例对具体的同步方法不做限制。In this embodiment, after the source end establishes the second synchronization link with the playback end, the source end transmits the encoded first frequency band speech with frame synchronization information through the first synchronization link and the second synchronization link, respectively The signal and the second frequency band voice signal are sent to the playback terminal. After the playback terminal receives the encoded first frequency band voice signal and the second frequency band voice signal with frame synchronization information, the encoded voice signal with frame synchronization information The first frequency band speech signal and the second frequency band speech signal are decoded separately, and this embodiment does not limit the type of decoding mode. The playback terminal can use the same decoding method for the received voice signal in the first frequency band and the voice signal in the second frequency band, or different decoding methods. If the voice signal in the second frequency band contains multiple sub-band voice signals, the playback terminal pair receives The decoding methods of the encoded multiple sub-band voice signals can be the same or different. For example, if the player decodes the received encoded four sub-band voice signals, you can choose one, two, or three Or four decoding methods. After the playback terminal decodes the received voice signal in the first frequency band and the voice signal in the second frequency band, the playback terminal uses the frame synchronization information to synchronize the decoded voice signal in the first frequency band and the second frequency band, which can further improve the audio quality In this embodiment, the specific synchronization method is not limited.
本申请实施例通过第一同步链路和第二同步链路分别接收带有帧同步信息的编码后的第一频段语音信号和带有帧同步信息的编码后的第二频段语音信号,解决了由于传输带宽限制带来的音质下降的问题和改善音质时影响正在播放的音频的问题。In this embodiment of the application, the first synchronization link and the second synchronization link respectively receive the encoded first frequency band speech signal with frame synchronization information and the encoded second frequency band speech signal with frame synchronization information, which solves the problem The problem of sound quality degradation caused by the limitation of transmission bandwidth and the problem of affecting the audio being played when the sound quality is improved.
基于上述实施例公开的内容,本实施例中,对解码后的第一频段语音信号和解码后的第二频段语音信号进行同步之后,经过数模转换、放大后进行电声转换,用户就可以听到声音。播放端通过帧同步信息对解码后的第一频段语音信号和解码后的第二频段语音信号进行同步之后,包括以下步骤:Based on the content disclosed in the above embodiment, in this embodiment, after synchronizing the decoded first frequency band speech signal and the decoded second frequency band speech signal, after digital-to-analog conversion, amplification and electro-acoustic conversion, the user can Hear the sound. After the player uses the frame synchronization information to synchronize the decoded first frequency band speech signal and the decoded second frequency band speech signal, the following steps are included:
700:播放端对同步后的第一频段语音信号和同步后的第二频段语音信号通过一个或多个不同的数模转换器分别进行数模转换;700: The playback terminal performs digital-to-analog conversion on the synchronized voice signal in the first frequency band and the synchronized voice signal in the second frequency band through one or more different digital-to-analog converters;
701:播放端对数模转换后的第一频段语音信号和数模转换后的第二频段语音信号通过不同的一个或多个放大器分别进行放大;701: The playback terminal amplifies the voice signal in the first frequency band after digital-to-analog conversion and the voice signal in the second frequency band after digital-to-analog conversion through different one or more amplifiers;
702:播放端对放大后的第一频段语音信号和放大后的第二频段语音信号通过一个或多个不同的电声转换器分别进行电声转换。702: The playback terminal performs electroacoustic conversion on the amplified first frequency band speech signal and the amplified second frequency band speech signal through one or more different electroacoustic converters.
播放端通过源端发送的帧同步信息对解码后的第一频段语音信号和解码后的第二频段语音信号进行同步之后,播放端可以通过一个数模转换器进行数模转换,然后,通过一个放大器进行放大,然后,通过一个电声转换器进行电声转换;播放端对解码后的第一频段语音信号和解码后的第二频段语音信号进行同步之后,也可以对同步后的第一频段语音信号和第二频段语音信号通过不同的数模转换器分别进行数模转换,本实施例可以采用两个或两个以上的不同的数模转换器以适应不同频段的语音信号的特性,然后,播放端可以对数模转换后的第一频段语音信号和第二频段语音信号通过不同的放大器分别进行放大,本实施例可以采用两个或两个以上的不同的放大器以适应不同频段的语音信号的特性,然后,播放端对放大后的第一频段语音 信号和第二频段语音信号通过不同的电声转换器分别进行电声转换,本实施例可以采用两个或两个以上的不同的电声转换器以适应不同频段的语音信号的特性,电声转换器可以是扬声器或者喇叭等将电能转换成声能的器件,本实施例对电声转换器的种类不做限制。After the playback terminal synchronizes the decoded voice signal in the first frequency band with the decoded voice signal in the second frequency band through the frame synchronization information sent by the source, the playback terminal can perform digital-to-analog conversion through a digital-to-analog converter, and then through a The amplifier is amplified, and then an electro-acoustic converter is used for electro-acoustic conversion; after the playback end synchronizes the decoded first frequency band speech signal and the decoded second frequency band speech signal, it can also synchronize the first frequency band The voice signal and the voice signal in the second frequency band are respectively subjected to digital-to-analog conversion through different digital-to-analog converters. In this embodiment, two or more different digital-to-analog converters may be used to adapt to the characteristics of voice signals in different frequency bands. , The player can amplify the voice signal in the first frequency band and the voice signal in the second frequency band after digital-to-analog conversion through different amplifiers. In this embodiment, two or more different amplifiers can be used to adapt to voices in different frequency bands. Then, the playback terminal performs electro-acoustic conversion on the amplified voice signal in the first frequency band and the voice signal in the second frequency band through different electro-acoustic converters. In this embodiment, two or more different The electro-acoustic converter is adapted to the characteristics of voice signals in different frequency bands. The electro-acoustic converter may be a device that converts electrical energy into sound energy, such as a speaker or a horn. The present embodiment does not limit the type of the electro-acoustic converter.
本实施例中,播放端对同步后的第一频段语音信号和第二频段语音信号分别进行数模转换,可以根据通过不同频段的语音信号的特性使用不同的数模转换器对同步后的第一频段语音信号和第二频段语音信号分别进行数模转换,以进一步提高音频质量,数模转换后,播放端对数模转换后的第一频段语音信号和第二频段语音信号进行放大后再进行电声转换;播放端对同步后的第一频段语音信号和第二频段语音信号分别进行数模转换后,播放端对数模转换后的第一频段语音信号和第二频段语音信号分别进行放大,可以根据不同频段的语音信号的特性使用不同的放大器对同步后的第一频段语音信号和第二频段语音信号分别进行放大,以进一步提高音频质量,放大后,播放端对放大后的第一频段语音信号和第二频段语音信号进行电声转换。需要说明的是,本实施例中,播放端仅仅对同步后的第一频段语音信号和第二频段语音信号通过不同的数模转换器分别进行数模转换就可以进一步提高音频质量;播放端仅仅对数模转换后的第一频段语音信号和第二频段语音信号通过不同的放大器分别进行放大也可以进一步提高音频质量;播放端仅仅对放大后的第一频段语音信号和第二频段语音信号通过不同的电声转换器分别进行电声转换也可以进一步提高音频质量。In this embodiment, the playback terminal performs digital-to-analog conversion on the synchronized voice signal in the first frequency band and the voice signal in the second frequency band respectively, and different digital-to-analog converters can be used according to the characteristics of the voice signals passing through different frequency bands to compare the synchronized first frequency band. The voice signal of the first frequency band and the voice signal of the second frequency band are respectively subjected to digital-to-analog conversion to further improve the audio quality. After the digital-to-analog conversion, the player amplifies the first-band voice signal and the second-band voice signal after the digital-to-analog conversion. Perform electro-acoustic conversion; after the playback terminal performs digital-to-analog conversion on the synchronized voice signal in the first frequency band and the voice signal in the second frequency band, the playback terminal performs digital-to-analog conversion on the voice signal in the first frequency band and the voice signal in the second frequency band. Amplification, according to the characteristics of voice signals in different frequency bands, different amplifiers can be used to amplify the synchronized voice signals in the first frequency band and the voice signals in the second frequency band to further improve the audio quality. The voice signal in the first frequency band and the voice signal in the second frequency band are converted into electro-acoustics. It should be noted that, in this embodiment, the playback terminal only performs digital-to-analog conversion on the synchronized voice signal in the first frequency band and the voice signal in the second frequency band through different digital-to-analog converters to further improve the audio quality; The audio quality can be further improved by separately amplifying the voice signal in the first frequency band and the voice signal in the second frequency band after digital-to-analog conversion through different amplifiers; the playback end only passes the amplified voice signal in the first frequency band and the voice signal in the second frequency band. Different electro-acoustic converters can also further improve the audio quality.
基于上述实施例公开的内容,本实施例中,播放端通过第二同步链路接收带有帧同步信息的编码后的第二频段语音信号之前,播放端可以发送控制数据流给源端,以使源端根据控制数据流判断播放端是否支持语音分频传输,在开始语音分频传输前判断播放端是否支持语音分频传输,以免在播放端不支持语音分频传输的情况下增加耗能,假设播放端不支持语音分频传输,而源端不根据控制数据流判断播放端是否支持语音分频传输,则当源端开始进行语音分频传输,源端对 第一频段语音信号和第二频段语音信号进行编码时就开始增加耗能了。当源端判断播放端不支持语音分频传输时,源端会沿用系统原有音频传输方法进行音频传输,而不再进行对第一频段语音信号和第二频段语音信号都进行编码。Based on the content disclosed in the above embodiment, in this embodiment, before the playback end receives the encoded second frequency band voice signal with frame synchronization information through the second synchronization link, the playback end can send a control data stream to the source end to Enable the source to judge whether the playback end supports voice frequency division transmission according to the control data stream, and determine whether the playback end supports voice frequency division transmission before starting voice frequency division transmission, so as to avoid increasing energy consumption when the playback end does not support voice frequency division transmission , Assuming that the playback end does not support voice frequency division transmission, and the source end does not determine whether the playback end supports voice frequency division transmission according to the control data stream, when the source end starts voice frequency division transmission, the source end will When the two-band speech signal is encoded, energy consumption begins to increase. When the source terminal determines that the playback terminal does not support voice frequency division transmission, the source terminal will continue to use the original audio transmission method of the system for audio transmission, instead of encoding both the first frequency band voice signal and the second frequency band voice signal.
基于上述实施例公开的内容,本实施例中,若源端根据控制数据流确定播放端支持语音分频传输,则播放端会收到源端发送的第二同步链路请求,播放端收到源端发送的第二同步链路请求后,发送第二同步链路请求的回复给源端,若播放端支持语音分频传输,播放端发送播放端支持的链路参数给源端以便源端建立第二同步链路。Based on the content disclosed in the foregoing embodiment, in this embodiment, if the source terminal determines that the playback terminal supports voice frequency division transmission according to the control data stream, the playback terminal will receive the second synchronization link request sent by the source terminal, and the playback terminal will receive After the second synchronization link request sent by the source end, the reply of the second synchronization link request is sent to the source end. If the playback end supports voice frequency division transmission, the playback end sends the link parameters supported by the playback end to the source end for the source end. Establish a second synchronization link.
基于上述实施例公开的内容,本实施例中,播放端接收源端发送的第二同步链路请求之前,包括以下步骤:Based on the content disclosed in the foregoing embodiment, in this embodiment, before the playback end receives the second synchronization link request sent by the source end, the following steps are included:
701:播放端接收源端发送的音频配置参数请求;701: The player receives the audio configuration parameter request sent by the source;
702:播放端发送播放端支持的第二频段语音信号对应的音频配置参数给源端;702: The playback terminal sends the audio configuration parameters corresponding to the second frequency band voice signal supported by the playback terminal to the source terminal;
703:播放端接收源端发送的使用的音频配置参数并配置使用的音频配置参数。703: The player receives the used audio configuration parameters sent by the source and configures the used audio configuration parameters.
源端发送音频配置参数请求给播放端,播放端接收源端发送的音频配置参数请求之后,播放端发送播放端支持的第二频段语音信号对应的音频配置参数给源端,源端收到播放端支持的第二频段语音信号对应的音频配置参数之后,源端根据播放端支持的第二频段语音信号对应的音频配置参数确定使用的音频配置参数并将使用的音频配置参数发送给播放端,播放端收到源端发送的使用的音频配置参数并配置使用的音频配置参数,同时,源端也要配置使用的音频配置参数。The source sends an audio configuration parameter request to the player. After the player receives the audio configuration parameter request from the source, the player sends the audio configuration parameters corresponding to the second frequency band voice signal supported by the player to the source, and the source receives the playback After the audio configuration parameters corresponding to the second-band voice signal supported by the player, the source determines the audio configuration parameters used according to the audio configuration parameters corresponding to the second-band voice signal supported by the player and sends the used audio configuration parameters to the player. The player receives the used audio configuration parameters sent by the source and configures the used audio configuration parameters. At the same time, the source must also configure the used audio configuration parameters.
基于上述实施例公开的内容,本实施例中,播放端可以根据电量情况或者第二同步链路的链路质量断开一条或多条第二同步链路,在播放端电量不足时,播放端可以断开一条或多条第二同步链路以部分中止或者全部中止语音分频传输来延长续航时间,当一条或多条第二同步链路被断开时,断开的一条或多条第二同步链路上对应频段的语音信号不进行编码和解码;或者在第二同步链路的链路质量较差时, 播放端可以断开一条或多条第二同步链路,以免既占用带宽也达不到提高音质的效果;另外,在语音分频传输被部分中止或者全部中止后,播放端可以根据电量情况或者第二同步链路的链路质量请求源端重新建立一条或多条第二同步链路以重启或者增强语音分频传输,例如,在播放端电量达到电量预设值或者第二同步链路的链路质量提高到链路质量预设值时,播放端可以请求源端重新建立断开的一条或多条第二同步链路以重启或者增强语音分频传输。另外,对于源端,也可以根据电量情况或者第二同步链路的链路质量断开一条或多条第二同步链路或者重新建立一条或多条第二同步链路,无论是源端还是播放端发起断开一条或多条第二同步链路,被发起端应该无条件地接受断开一条或多条第二同步链路。Based on the content disclosed in the above embodiment, in this embodiment, the playback end can disconnect one or more second synchronization links according to the power condition or the link quality of the second synchronization link. When the power of the playback end is insufficient, the playback end One or more second synchronization links can be disconnected to partially or completely stop the voice frequency division transmission to extend the endurance time. When one or more second synchronization links are disconnected, the disconnected one or more The voice signal of the corresponding frequency band on the second synchronization link is not encoded and decoded; or when the link quality of the second synchronization link is poor, the player can disconnect one or more second synchronization links to avoid occupying bandwidth The effect of improving the sound quality is not achieved; in addition, after the voice frequency division transmission is partially or completely suspended, the player can request the source to re-establish one or more pieces of data according to the power situation or the link quality of the second synchronization link. The second synchronization link is used to restart or enhance voice frequency division transmission. For example, when the power of the playback end reaches the preset value of power or the link quality of the second synchronization link improves to the preset value of link quality, the playback end can request the source end The disconnected one or more second synchronization links are re-established to restart or enhance voice frequency division transmission. In addition, for the source end, one or more second synchronization links can be disconnected or one or more second synchronization links can be re-established according to the power situation or the link quality of the second synchronization link, whether it is the source end or The playback end initiates the disconnection of one or more second synchronization links, and the initiated end should unconditionally accept the disconnection of one or more second synchronization links.
本申请实施例提供了一种语音分频传输方法,通过第一同步链路和第二同步链路分别接收带有帧同步信息的编码后的第一频段语音信号和第二频段语音信号,解决了由于传输带宽限制带来的音质下降的问题和改善音质时影响正在播放的音频的问题The embodiment of the present application provides a voice frequency division transmission method. The first synchronization link and the second synchronization link respectively receive the first frequency band speech signal and the second frequency band speech signal after encoding with frame synchronization information. The problem of sound quality degradation caused by transmission bandwidth limitation and the problem of affecting the audio being played when the sound quality is improved
本申请实施例还可提供一种源端,用于执行前述实施例中提出的一种语音分频传输方法,图6为本实施例提供的源端的结构示意图,如图6所示,该源端50包括:The embodiment of the present application may also provide a source to execute the voice frequency division transmission method proposed in the foregoing embodiment. FIG. 6 is a schematic structural diagram of the source provided in this embodiment. As shown in FIG. 6, the source End 50 includes:
编码模块51,用于对第一频段语音信号和第二频段语音信号进行编码;The encoding module 51 is configured to encode the first frequency band speech signal and the second frequency band speech signal;
预同步模块52,用于将帧同步信息标记到编码后的第一频段语音信号和编码后的第二频段语音信号中;以及The pre-synchronization module 52 is used to mark the frame synchronization information into the encoded first frequency band speech signal and the encoded second frequency band speech signal; and
第一发送模块53,用于通过第一同步链路和第二同步链路分别发送带有帧同步信息的编码后的第一频段语音信号和带有帧同步信息的编码后的第二频段语音信号给播放端。The first sending module 53 is configured to send the encoded first frequency band speech signal with frame synchronization information and the encoded second frequency band speech signal with frame synchronization information through the first synchronization link and the second synchronization link, respectively Signal to the playback terminal.
可选的,第二频段语音信号的编码方式的压缩率高于第一频段语音信号的编码方式的压缩率;或者Optionally, the compression rate of the encoding method of the voice signal in the second frequency band is higher than the compression rate of the encoding method of the voice signal in the first frequency band; or
第二频段语音信号的编码方式的压缩率低于第一频段语音信号的编码方式的压缩率。The compression rate of the encoding method of the speech signal in the second frequency band is lower than the compression rate of the encoding method of the speech signal in the first frequency band.
可选的,编码模块包括:Optionally, the encoding module includes:
高频编码模块,用于对高频语音信号进行编码,对高频语音信号的编码方式包括CELT编码方式或SBR编码方式;或者The high-frequency encoding module is used to encode high-frequency speech signals, and the encoding methods for high-frequency speech signals include CELT encoding or SBR encoding; or
非高频编码模块,用于对非高频语音信号进行编码,对非高频语音信号的编码方式包括SILK编码方式、SBC编码方式、AAC编码方式或MP3编码方式。The non-high frequency encoding module is used to encode non-high frequency speech signals. The encoding methods for non-high frequency speech signals include SILK encoding, SBC encoding, AAC encoding or MP3 encoding.
可选的,预同步模块包括:Optionally, the pre-synchronization module includes:
数据帧标记模块,用于标记预设延时内检测到的编码后的任一频段的语音信号为一帧数据,帧同步信息包括一帧数据的开始时刻和结束时刻。The data frame marking module is used to mark the encoded voice signal of any frequency band detected within a preset delay as one frame of data, and the frame synchronization information includes the start time and end time of a frame of data.
可选的,源端还包括:Optionally, the source also includes:
第一参数确定模块,第一发送模块通过第二同步链路发送带有帧同步信息的编码后的第二频段语音信号给播放端之前,用于根据播放端发送的播放端支持的链路参数确定使用的链路参数;The first parameter determination module, the first sending module sends the encoded second frequency band voice signal with frame synchronization information to the playback terminal through the second synchronization link, and is used to send the playback terminal according to the link parameters supported by the playback terminal. Determine the link parameters used;
链路建立模块,用于根据使用的链路参数建立与播放端之间的第二同步链路。The link establishment module is used to establish a second synchronization link with the playback terminal according to the used link parameters.
可选的,源端还包括:Optionally, the source also includes:
第二发送模块,第一参数确定模块根据播放端发送的播放端支持的链路参数确定使用的链路参数之前,用于发送第二同步链路请求给播放端;以及The second sending module, the first parameter determining module is used to send a second synchronization link request to the playback terminal before determining the link parameter to be used according to the link parameters supported by the playback terminal sent by the playback terminal; and
第一接收模块,用于接收第二同步链路请求的回复;The first receiving module is configured to receive a reply to the second synchronization link request;
第一参数确定模块还用于根据第二同步链路请求的回复确定是否建立第二同步链路。The first parameter determination module is further configured to determine whether to establish the second synchronization link according to the reply to the second synchronization link request.
可选的,源端还包括:Optionally, the source also includes:
第一判断模块,编码模块对第二频段语音信号进行编码之前,用于根据控制数据流判断播放端是否支持语音分频传输,控制数据流通过异步链路传输。The first judgment module is used for judging whether the playback end supports voice frequency division transmission according to the control data stream before the encoding module encodes the voice signal in the second frequency band, and the control data stream is transmitted through the asynchronous link.
可选的,源端还包括UUID模块,用于通过自定义通用唯一识别码(UUID)标识语音分频传输服务;Optionally, the source terminal also includes a UUID module, which is used to identify the voice frequency division transmission service through a custom universally unique identifier (UUID);
第一判断模块包括:The first judgment module includes:
第二判断模块,控制数据流包括自定义UUID的值,若接收到的播放端的自定义UUID的值等于预设UUID值,用于确定播放端支持语音分频传输。The second judgment module, the control data stream includes the value of the custom UUID, and if the received custom UUID value of the player is equal to the preset UUID value, it is used to determine that the player supports voice frequency division transmission.
可选的,源端还包括:Optionally, the source also includes:
第三发送模块:若第一判断模块根据控制数据流确定播放端支持语音分频传输,用于发送音频配置参数请求给播放端;The third sending module: if the first judging module determines that the playback end supports voice frequency division transmission according to the control data stream, it is used to send an audio configuration parameter request to the playback end;
第二接收模块:用于接收播放端支持的第二频段语音信号对应的音频配置参数,音频配置参数包括编解码参数和码率,编解码参数包括编码方式和解码方式中的一种或两种;以及The second receiving module: used to receive the audio configuration parameters corresponding to the voice signal in the second frequency band supported by the player. The audio configuration parameters include codec parameters and bit rates, and the codec parameters include one or two of the coding mode and the decoding mode ;as well as
第二参数确定模块,用于根据播放端支持的第二频段语音信号对应的音频配置参数确定使用的音频配置参数;The second parameter determination module is configured to determine the audio configuration parameters to be used according to the audio configuration parameters corresponding to the second frequency band voice signal supported by the playback terminal;
第三发送模块还用于将使用的音频配置参数发送给播放端。The third sending module is also used to send the used audio configuration parameters to the playback terminal.
可选的,源端还包括:Optionally, the source also includes:
第一传输控制模块,用于根据电量情况或者第二同步链路的链路质量断开一条或多条第二同步链路,第二同步链路的条数小于或等于第二频段语音信号的频段数;或者The first transmission control module is configured to disconnect one or more second synchronization links according to the power condition or the link quality of the second synchronization link, and the number of the second synchronization links is less than or equal to that of the second frequency band voice signal Number of frequency bands; or
第一传输控制模块还用于根据电量情况或者第二同步链路的链路质量建立一条或多条第二同步链路,第二同步链路的条数小于或等于第二频段语音信号的频段数。The first transmission control module is also used to establish one or more second synchronization links according to the power condition or the link quality of the second synchronization link, and the number of the second synchronization links is less than or equal to the frequency band of the second frequency band voice signal number.
本申请实施例提供了一种源端,用于执行前述实施例中提出的一种语音分频传输方法,通过第一同步链路和第二同步链路分别发送带有帧同步信息的编码后的第一频段语音信号和带有帧同步信息的编码后的第二频段语音信号给播放端,解决了由于传输带宽限制带来的音质下降的问题和改善音质时影响正在播放的音频的问题。The embodiment of the application provides a source end for executing the voice frequency division transmission method proposed in the foregoing embodiment, and transmits the encoded post-synchronization information with frame synchronization information through the first synchronization link and the second synchronization link. The first frequency band speech signal and the encoded second frequency band speech signal with frame synchronization information are sent to the playback end, which solves the problem of sound quality degradation caused by transmission bandwidth limitation and the problem of affecting the audio being played when the sound quality is improved.
本申请实施例还可提供一种播放端,用于执行前述实施例中提出的一种语音分频传输方法,图7为本实施例提供的播放端的结构示意图,如图7所示,该播放端60包括:The embodiment of the present application may also provide a playback terminal for executing the voice frequency division transmission method proposed in the foregoing embodiment. FIG. 7 is a schematic structural diagram of the playback terminal provided in this embodiment. As shown in FIG. End 60 includes:
第三接收模块61,用于通过第一同步链路和第二同步链路分别 接收带有帧同步信息的编码后的第一频段语音信号和带有帧同步信息的编码后的第二频段语音信号;The third receiving module 61 is configured to receive the encoded first frequency band speech signal with frame synchronization information and the encoded second frequency band speech signal with frame synchronization information through the first synchronization link and the second synchronization link, respectively signal;
解码模块62,用于对接收到的编码后的第一频段语音信号和编码后的第二频段语音信号进行解码;The decoding module 62 is configured to decode the received coded first frequency band speech signal and the coded second frequency band speech signal;
同步模块63,用于通过帧同步信息对解码后的第一频段语音信号和解码后的第二频段语音信号进行同步。The synchronization module 63 is used to synchronize the decoded first frequency band speech signal and the decoded second frequency band speech signal through frame synchronization information.
可选的,播放端还包括:Optionally, the playback terminal also includes:
一个或多个不同的数模转换模块,用于对同步后的第一频段语音信号和第二频段语音信号分别进行数模转换;One or more different digital-to-analog conversion modules for performing digital-to-analog conversion on the synchronized voice signal in the first frequency band and the voice signal in the second frequency band respectively;
一个或多个不同的放大模块,用于对数模转换后的第一频段语音信号和第二频段语音信号分别进行放大;以及One or more different amplifying modules for respectively amplifying the first frequency band voice signal and the second frequency band voice signal after digital-to-analog conversion; and
一个或多个不同的电声转换模块,用于对放大后的第一频段语音信号和第二频段语音信号分别进行电声转换。One or more different electro-acoustic conversion modules are used to perform electro-acoustic conversion on the amplified first frequency band voice signal and the second frequency band voice signal respectively.
可选的,播放端还包括:Optionally, the playback terminal also includes:
第四发送模块,第三接收模块通过第二同步链路接收带有帧同步信息的编码后的第二频段语音信号之前,用于发送控制数据流给源端,以使源端根据控制数据流判断播放端是否支持语音分频传输。The fourth sending module, the third receiving module is used to send the control data stream to the source end before receiving the encoded second frequency band speech signal with frame synchronization information through the second synchronization link, so that the source end can control the data stream according to the Determine whether the playback terminal supports voice frequency division transmission.
可选的,播放端还包括:Optionally, the playback terminal also includes:
第四接收模块,若源端根据控制数据流确定播放端支持语音分频传输,用于接收源端发送的第二同步链路请求;以及The fourth receiving module, if the source terminal determines that the playback terminal supports voice frequency division transmission according to the control data stream, it is used to receive the second synchronization link request sent by the source terminal; and
第五发送模块,用于发送第二同步链路请求的回复;The fifth sending module is used to send a reply to the second synchronization link request;
若播放端支持语音分频传输,第五发送模块还用于发送播放端支持的链路参数给源端。If the playback end supports voice frequency division transmission, the fifth sending module is also used to send link parameters supported by the playback end to the source end.
可选的,播放端还包括:Optionally, the playback terminal also includes:
第五接收模块,第四接收模块接收源端发送的第二同步链路请求之前,用于接收源端发送的音频配置参数请求;The fifth receiving module, the fourth receiving module is used to receive the audio configuration parameter request sent by the source end before receiving the second synchronization link request sent by the source end;
第六发送模块,用于发送播放端支持的第二频段语音信号对应的音频配置参数给源端;The sixth sending module is used to send the audio configuration parameters corresponding to the second frequency band voice signal supported by the playback terminal to the source terminal;
第五接收模块还用于接收源端发送的使用的音频配置参数;以及The fifth receiving module is also used to receive the audio configuration parameters used by the source end; and
参数配置模块,用于配置使用的音频配置参数。The parameter configuration module is used to configure the audio configuration parameters used.
可选的,播放端还包括:Optionally, the playback terminal also includes:
第二传输控制模块,用于根据电量情况或者第二同步链路的链路质量断开一条或多条第二同步链路;或者The second transmission control module is configured to disconnect one or more second synchronization links according to the power condition or the link quality of the second synchronization link; or
第二传输控制模块还用于根据电量情况或者第二同步链路的链路质量请求源端建立一条或多条第二同步链路。The second transmission control module is further configured to request the source end to establish one or more second synchronization links according to the power condition or the link quality of the second synchronization link.
本申请实施例提供了一种播放端,通过第一同步链路和第二同步链路分别接收带有帧同步信息的编码后的第一频段语音信号和第二频段语音信号,解决了由于传输带宽限制带来的音质下降的问题和改善音质时影响正在播放的音频的问题。The embodiment of the present application provides a playback terminal that receives the encoded first frequency band speech signal and the second frequency band speech signal with frame synchronization information through the first synchronization link and the second synchronization link, respectively, which solves the problem of transmission The problem of reduced sound quality caused by bandwidth limitation and the problem of affecting the audio being played when the sound quality is improved.
本申请实施例还可提供一种源端,用于执行实施例提出的一种语音分频传输方法,如图8所示,该源端70包括:存储器71和处理器72;The embodiment of the present application may also provide a source end for executing the voice frequency division transmission method proposed in the embodiment. As shown in FIG. 8, the source end 70 includes a memory 71 and a processor 72;
存储器71与处理器72耦合;The memory 71 is coupled with the processor 72;
存储器71,用于存储程序指令;The memory 71 is used to store program instructions;
处理器72,用于调用存储器存储的程序指令,使得源端执行语音分频传输方法。The processor 72 is configured to call the program instructions stored in the memory to make the source end execute the voice frequency division transmission method.
本申请实施例提供的源端,可执行上述任一所述实施例提供的语音分频传输方法方法,其具体的实现过程及有益效果参见上述,在此不再赘述。The source provided in the embodiment of the present application can execute the voice frequency division transmission method provided in any of the above-mentioned embodiments. For the specific implementation process and beneficial effects, refer to the above, and will not be repeated here.
本申请实施例还可提供一种播放端,用于执行实施例提出的一种语音分频传输方法,如图9所示,该播放端80包括:存储器81和处理器82;The embodiment of the present application may also provide a playback terminal for executing the voice frequency division transmission method proposed in the embodiment. As shown in FIG. 9, the playback terminal 80 includes a memory 81 and a processor 82;
存储器81与处理器82耦合;The memory 81 is coupled with the processor 82;
存储器81,用于存储程序指令;The memory 81 is used to store program instructions;
处理器82,用于调用存储器存储的程序指令,使得播放端执行语音分频传输方法。The processor 82 is configured to call the program instructions stored in the memory to make the playback terminal execute the voice frequency division transmission method.
本申请实施例提供的播放端,可执行上述任一所述实施例提供的语音分频传输方法,其具体的实现过程及有益效果参见上述,在此不 再赘述。The playback terminal provided in the embodiment of the present application can execute the voice frequency division transmission method provided in any of the above-mentioned embodiments. For the specific implementation process and beneficial effects, refer to the above, and will not be repeated here.
本申请实施例还可提供一种计算机可读存储介质,其上存储有计算机程序,该计算机程序被处理器72执行时实现执行源端执行的语音分频传输方法。The embodiment of the present application may also provide a computer-readable storage medium on which a computer program is stored. When the computer program is executed by the processor 72, the voice frequency division transmission method executed by the source is executed.
本申请实施例提供的计算机可读存储介质,可执行上述任一所述实施例提供的源端执行的语音分频传输方法,其具体的实现过程及有益效果参见上述,在此不再赘述。The computer-readable storage medium provided by the embodiment of the present application can execute the voice frequency division transmission method executed by the source provided in any of the above-mentioned embodiments. For the specific implementation process and beneficial effects, refer to the above, and will not be repeated here.
本申请实施例还可提供一种计算机可读存储介质,其上存储有计算机程序,该计算机程序被处理器82执行时实现执行播放端执行的语音分频传输方法。The embodiment of the present application may also provide a computer-readable storage medium on which a computer program is stored, and when the computer program is executed by the processor 82, the voice frequency division transmission method executed by the player is executed.
本申请实施例提供的计算机可读存储介质,可执行上述任一所述实施例提供的播放端执行的语音分频传输方法,其具体的实现过程及有益效果参见上述,在此不再赘述。The computer-readable storage medium provided by the embodiment of the present application can execute the voice frequency division transmission method executed by the player provided in any of the above-mentioned embodiments. For the specific implementation process and beneficial effects, refer to the above, and will not be repeated here.
本申请实施例还可提供一种源端电路,该源端电路可以用于实现前述实施例中提出的一种语音分频传输方法,图10为本实施例提供的源端电路的结构示意图。该源端电路包括:The embodiment of the present application may also provide a source-end circuit, which may be used to implement the voice frequency division transmission method proposed in the foregoing embodiment. FIG. 10 is a schematic structural diagram of the source-end circuit provided in this embodiment. The source circuit includes:
编码器,用于对第一频段语音信号和第二频段语音信号进行编码;以及An encoder for encoding the first frequency band speech signal and the second frequency band speech signal; and
源端控制器,与编码器连接,用于通过第一同步链路和第二同步链路分别发送带有帧同步信息的编码后的第一频段语音信号和带有帧同步信息的编码后的第二频段语音信号给播放端电路。The source controller, connected to the encoder, is used to send the encoded first frequency band speech signal with frame synchronization information and the encoded voice signal with frame synchronization information through the first synchronization link and the second synchronization link, respectively The voice signal of the second frequency band is sent to the playback end circuit.
如图10所示,编码器0是系统原有的编码器,系统第一同步链路0用来传输编码后的第一频段语音信号,以源端只有一个编码器0为例进行说明,本实施例中,将语音信号分离为第一频段语音信号和第二频段语音信号,例如,第二频段语音信号为高频语音信号,第一频段语音信号为非高频语音信号;编码器0可以采用opus编码器,opus编码器中内置有两种核心编码算法:CELT编码方式和SILK编码方式,假设原有的音频传输方法使用SILK编码方式对非高频语音信号进行编码,则可以依旧使用opus编码器使用CELT方式编码对 高频语音信号进行编码;本实施例也可以采用多个编码器,本实施例对编码器的个数不做限制,以源端有编码器0和编码器1为例进行说明,例如,编码器0可以对语音信号的非高频语音信号进行编码,而编码器1可以对高频语音信号进行编码,由于高频语音信号和非高频语音信号的特性不同,因此,如果使用不同的编码方式分别进行编码,则可以达到增加少量带宽就可以提高音频质量的目的。如图10,对于第二频段语音信号,编码器0对其进行编码后,通过第二同步链路1传输给播放端,源端控制器与编码器0连接,用于通过第一同步链路0和第二同步链路1传输带有帧同步信息的编码后的第一频段语音信号和第二频段语音信号。当第二频段语音信号包含m个子频段语音信号时,m>1且为整数,可以分别对应m条第二同步链路,需要说明的是,此处的第二同步链路的条数为m条仅仅为示例性说明,本实施例也可以使用小于m的条数的第二同步链路来传输m个子频段语音信号,本实施例对第二同步链路的条数不做限制。As shown in Figure 10, encoder 0 is the original encoder of the system. The first synchronization link 0 of the system is used to transmit the encoded first frequency band speech signal. Take only one encoder 0 at the source as an example. In the embodiment, the speech signal is separated into a first frequency band speech signal and a second frequency band speech signal. For example, the second frequency band speech signal is a high frequency speech signal, and the first frequency band speech signal is a non-high frequency speech signal; encoder 0 can Using opus encoder, there are two core encoding algorithms built in opus encoder: CELT encoding method and SILK encoding method. Assuming that the original audio transmission method uses SILK encoding method to encode non-high frequency speech signals, opus can still be used The encoder uses CELT encoding to encode high-frequency speech signals; this embodiment can also use multiple encoders, this embodiment does not limit the number of encoders, the source end has encoder 0 and encoder 1 as For example, for example, encoder 0 can encode non-high-frequency speech signals of speech signals, while encoder 1 can encode high-frequency speech signals. Due to the different characteristics of high-frequency speech signals and non-high-frequency speech signals, Therefore, if different encoding methods are used for encoding separately, the audio quality can be improved by adding a small amount of bandwidth. As shown in Figure 10, for the voice signal in the second frequency band, the encoder 0 encodes it and transmits it to the playback end through the second synchronization link 1. The source controller is connected to the encoder 0 for passing through the first synchronization link 0 and the second synchronization link 1 transmit the encoded first frequency band speech signal and the second frequency band speech signal with frame synchronization information. When the voice signal of the second frequency band contains m sub-band voice signals, m>1 is an integer, which can correspond to m second synchronization links respectively. It should be noted that the number of second synchronization links here is m The bars are merely illustrative. In this embodiment, the number of second synchronization links less than m may also be used to transmit m sub-band voice signals, and this embodiment does not limit the number of second synchronization links.
基于上述实施例公开的内容,本实施例中,源端电路还可以包括滤波器,滤波器与编码器连接,用于分离第一频段语音信号和第二频段语音信号。请参考图11所示的源端电路,如图11所示,语音信号经过滤波器1进行编码后再经过对应的第二同步链路1传输到播放端,第一频段语音信号的获取也可以通过一个滤波器实现,也可以不需要滤波器,仅通过编码器0以实现对特定频段的语音信号进行编码,需要说明的是,本实施例对滤波器的数量不做限制,也可以仅仅只有一个滤波器,特别是如果某些编码器采用的编码方式仅仅只对特定频段的语音信号进行编码,则该编码器可以不需要对应的滤波器,例如opus编码器前可以不需要滤波器,本实施例对滤波器的种类也不做限制,可以采用低通滤波器、高通滤波器、带通滤波器或者其他滤波器及其任意组合,本实施例中,若第一频段语音信号或第二频段语音信号为多个子频段语音信号,则滤波器还可以用于分离第一频段语音信号中的多个子频段语音信号,也可以用于分离第二频段语音信号中的多个子频段语音信号。Based on the content disclosed in the foregoing embodiment, in this embodiment, the source-end circuit may further include a filter, which is connected to the encoder and is used to separate the voice signal in the first frequency band and the voice signal in the second frequency band. Please refer to the source circuit shown in Figure 11. As shown in Figure 11, the voice signal is encoded by the filter 1 and then transmitted to the playback end through the corresponding second synchronization link 1. The voice signal of the first frequency band can also be obtained It is implemented by a filter, or no filter is needed. Only encoder 0 is used to encode the voice signal of a specific frequency band. It should be noted that the number of filters is not limited in this embodiment, or only A filter, especially if the encoding method used by some encoders only encodes the voice signal of a specific frequency band, the encoder may not need the corresponding filter, for example, the filter may not be required before the opus encoder. The embodiment does not limit the types of filters. Low-pass filters, high-pass filters, band-pass filters, or other filters and any combination thereof can be used. In this embodiment, if the voice signal in the first frequency band or the second frequency band If the frequency band voice signal is multiple sub-band voice signals, the filter can also be used to separate multiple sub-band voice signals in the first frequency band voice signal, and can also be used to separate multiple sub-band voice signals in the second frequency band voice signal.
本申请实施例提供了一种源端电路,通过对第一频段语音信号和第二频段语音信号进行编码,并通过第一同步链路和第二同步链路分别进行传输有帧同步信息的编码后的第一频段语音信号和第二频段语音信号,解决了由于传输带宽限制带来的音质下降的问题和改善音质时影响正在播放的音频的问题。The embodiment of the application provides a source-end circuit, which encodes a voice signal in a first frequency band and a voice signal in a second frequency band, and transmits the encoding with frame synchronization information through the first synchronization link and the second synchronization link. The subsequent first frequency band voice signal and second frequency band voice signal solve the problem of sound quality degradation caused by transmission bandwidth limitation and the problem of affecting the audio being played when the sound quality is improved.
本申请实施例还可提供一种播放端电路,该播放端电路可以用于实现前述实施例中提出的一种语音分频传输方法,请参考图10,图10为本实施例提供的源端电路和播放端电路的结构示意图。The embodiment of the present application may also provide a playback end circuit, which can be used to implement the voice frequency division transmission method proposed in the foregoing embodiment. Please refer to FIG. 10, which is the source end provided by this embodiment. Schematic diagram of the circuit and the structure of the playback end circuit.
该播放端电路包括:The playback terminal circuit includes:
播放端控制器,用于通过第一同步链路和第二同步链路分别接收有帧同步信息的编码后的第一频段语音信号和带有帧同步信息的编码后的第二频段语音信号;以及The player controller is configured to receive the encoded first frequency band speech signal with frame synchronization information and the encoded second frequency band speech signal with frame synchronization information through the first synchronization link and the second synchronization link, respectively; as well as
多个解码器,与播放端控制器连接,用于对接收到的第一频段语音信号和第二频段语音信号进行解码。A plurality of decoders are connected with the controller of the playback end, and are used for decoding the received voice signal of the first frequency band and the voice signal of the second frequency band.
如图10所示,解码器0可以是系统原有的解码器,系统第一同步链路0用来传输编码后的第一频段语音信号,本实施例可以采用1个或多个解码器,本实施例对解码器的个数不做限制,使用一个解码器时,一个解码器可以采用不同的解码方式对有帧同步信息的编码后的第一频段语音信号和第二频段语音信号进行解码;以播放端有解码器0和解码器1为例进行说明,例如,解码器0可以对非高频语音信号进行解码,而解码器1可以对高频语音信号进行解码,由于高频语音信号和非高频语音信号的特性不同,因此,采用不同的解码器,以达到增加少量带宽就可以提高音频质量的目的。如图10所示,编码器0对第二频段语音信号进行编码后,通过第二同步链路1传输给播放端,播放端控制器与解码器0连接,用于通过第一同步链路0和第二同步链路1分别接收带有帧同步信息的编码后的第一频段语音信号和第二频段语音信号。本实施例中,解码器的个数可以等于编码器的个数,也可以不等于编码器的个数。解码后的第一频段语音信号和第二频段语音信号经过数模转换器、放大器后通过电声转换器进行电 声转换。As shown in Figure 10, decoder 0 can be the original decoder of the system, and the first synchronization link 0 of the system is used to transmit the encoded first frequency band speech signal. This embodiment can use one or more decoders. This embodiment does not limit the number of decoders. When one decoder is used, one decoder can use different decoding methods to decode the encoded first frequency band speech signal and second frequency band speech signal with frame synchronization information. ; Take decoder 0 and decoder 1 on the playback side as an example. For example, decoder 0 can decode non-high-frequency voice signals, while decoder 1 can decode high-frequency voice signals. It has different characteristics from non-high frequency voice signals, so different decoders are used to achieve the purpose of increasing audio quality by adding a small amount of bandwidth. As shown in Figure 10, after encoder 0 encodes the voice signal in the second frequency band, it is transmitted to the playback end through the second synchronization link 1, and the playback end controller is connected to the decoder 0 for passing through the first synchronization link 0. And the second synchronization link 1 respectively receive the first frequency band speech signal and the second frequency band speech signal after encoding with frame synchronization information. In this embodiment, the number of decoders may be equal to the number of encoders, or may not be equal to the number of encoders. The decoded voice signal in the first frequency band and the voice signal in the second frequency band pass through a digital-to-analog converter and an amplifier, and then undergo electro-acoustic conversion through an electro-acoustic converter.
基于上述实施例公开的内容,本实施例中,播放端电路还包括:Based on the content disclosed in the foregoing embodiment, in this embodiment, the playback end circuit further includes:
一个或多个不同的数模转换器,分别与解码器连接,用于对解码后的第一频段语音信号和解码后的第二频段语音信号分别进行数模转换;One or more different digital-to-analog converters, respectively connected to the decoder, for performing digital-to-analog conversion on the decoded first frequency band speech signal and the decoded second frequency band speech signal respectively;
一个或多个不同的放大器,分别与一个或多个数模转换器连接,用于对数模转换后的第一频段语音信号和数模转换后的第二频段语音信号分别进行放大;以及One or more different amplifiers respectively connected to one or more digital-to-analog converters for respectively amplifying the first-band voice signal after digital-to-analog conversion and the second-band voice signal after digital-to-analog conversion; and
一个或多个不同的电声转换器,分别与一个或多个放大器连接,用于对放大后的第一频段语音信号和放大后的第二频段语音信号分别进行电声转换。One or more different electro-acoustic converters are respectively connected to one or more amplifiers, and are used to perform electro-acoustic conversion on the amplified first frequency band speech signal and the amplified second frequency band speech signal respectively.
请参考图11,图11为本申请实施例的源端电路和播放端电路的结构示意图,本实施例中,解码后的第一频段语音信号和第二频段语音信号可以通过数模转换器0和数模转换器1分别进行数模转换,对于不同频段的语音信号,可以根据不同频段的语音信号的特性选择不同的数模转换器,以进一步提高音质,本实施例中,数模转换后第一频段语音信号和第二频段语音信号可以在一个放大器中进行放大,然后用一个电声转换器进行电声转换;数模转换后的第一频段语音信号和第二频段语音信号也可以分别通过放大器0和放大器1分别放大,对于不同频段的语音信号,可以根据不同频段的语音信号的特性选择不同的放大器,以进一步提高音质,放大后的第一频段语音信号和第二频段语音信号可以用一个电声转换器进行电声转换;放大后的第一频段语音信号和第二频段语音信号也可以通过多个电声转换器进行电声转换,可以根据不同频段的语音信号的特性,选择使用不同频率响应的电声转换器对放大后的第一频段语音信号和第二频段语音信号分别进行电声转换,以进一步提高音频质量本实施例中用扬声器0和扬声器1分别进行电声转换。图11中,每一个解码器都对应一个数模转换器、一个放大器和一个扬声器,需要说明的是,图11仅为示例性说明,多个解码器之间也可以共用一个或多个数模转换器、一 个或多个放大器或者一个或多个扬声器,本实施例对此不做限制。当第二频段语音信号包含m个子频段语音信号时,可以分别对应m条第二同步链路,本实施例中,对于编码后的m个子频段语音信号,对应的解码器的个数可以小于等于m,对应的数模转换器、放大器和电声转换器的个数可以小于等于m,本实施例对m的具体数值不做限制。Please refer to FIG. 11, which is a schematic diagram of the structure of the source-end circuit and the playback-end circuit of an embodiment of this application. In this embodiment, the decoded voice signal in the first frequency band and the voice signal in the second frequency band can pass through a digital-to-analog converter. And the digital-to-analog converter 1 perform digital-to-analog conversion separately. For voice signals in different frequency bands, different digital-to-analog converters can be selected according to the characteristics of the voice signals in different frequency bands to further improve the sound quality. In this embodiment, after the digital-to-analog conversion The first frequency band speech signal and the second frequency band speech signal can be amplified in an amplifier, and then an electroacoustic converter is used for electroacoustic conversion; the first frequency band speech signal and the second frequency band speech signal after digital-to-analog conversion can also be separately Through amplifier 0 and amplifier 1, respectively, for the voice signals of different frequency bands, different amplifiers can be selected according to the characteristics of the voice signals of different frequency bands to further improve the sound quality. The amplified voice signals of the first frequency band and the second frequency band can be An electroacoustic converter is used for electroacoustic conversion; the amplified voice signal of the first frequency band and the voice signal of the second frequency band can also be converted by multiple electroacoustic converters, which can be selected according to the characteristics of the voice signal of different frequency bands Electroacoustic converters with different frequency responses are used to perform electroacoustic conversion on the amplified first-band voice signal and the second-band voice signal to further improve the audio quality. In this embodiment, speaker 0 and speaker 1 are used for electroacoustic conversion. . In Figure 11, each decoder corresponds to a digital-to-analog converter, an amplifier, and a speaker. It should be noted that Figure 11 is only an exemplary illustration, and multiple decoders can also share one or more digital-analogs. The converter, one or more amplifiers or one or more speakers are not limited in this embodiment. When the voice signal in the second frequency band contains m sub-band voice signals, it can correspond to m second synchronization links respectively. In this embodiment, for the encoded m sub-band voice signals, the number of corresponding decoders may be less than or equal to m, the number of corresponding digital-to-analog converters, amplifiers, and electro-acoustic converters may be less than or equal to m, and the specific value of m is not limited in this embodiment.
本申请实施例提供了一种播放端电路,通过第一同步链路和第二同步链路分别接收带有帧同步信息的编码后的第一频段语音信号和第二频段语音信号,解决了由于传输带宽限制带来的音质下降的问题和改善音质时影响正在播放的音频的问题。The embodiment of the present application provides a playback terminal circuit, which receives the encoded first frequency band speech signal and the second frequency band speech signal with frame synchronization information through the first synchronization link and the second synchronization link respectively, which solves the problem of The problem of sound quality degradation caused by transmission bandwidth limitation and the problem of affecting the audio being played when the sound quality is improved.
应注意,本申请上述方法实施例可以应用于处理器中,或者由处理器实现。处理器可能是一种集成电路芯片,具有信号的处理能力。在实现过程中,上述方法实施例的各步骤可以通过处理器中的硬件的集成逻辑电路或者软件形式的指令完成。上述的处理器可以是通用处理器、数字信号处理器(digital signal processor,DSP)、专用集成电路(application specific integrated circuit,ASIC)、现成可编程门阵列(field programmable gate array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件。可以实现或者执行本申请实施例中的公开的各方法、步骤及逻辑框图。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。结合本申请实施例所公开的方法的步骤可以直接体现为硬件译码处理器执行完成,或者用译码处理器中的硬件及软件模块组合执行完成。软件模块可以位于随机存储器,闪存、只读存储器,可编程只读存储器或者电可擦写可编程存储器、寄存器等本领域成熟的存储介质中。该存储介质位于存储器,处理器读取存储器中的信息,结合其硬件完成上述方法的步骤。It should be noted that the foregoing method embodiments of the present application may be applied to or implemented by a processor. The processor may be an integrated circuit chip with signal processing capabilities. In the implementation process, the steps of the foregoing method embodiments can be completed by hardware integrated logic circuits in the processor or instructions in the form of software. The above-mentioned processor may be a general-purpose processor, a digital signal processor (DSP), an application specific integrated circuit (ASIC), a ready-made programmable gate array (field programmable gate array, FPGA) or other Programming logic devices, discrete gates or transistor logic devices, discrete hardware components. The methods, steps, and logical block diagrams disclosed in the embodiments of the present application can be implemented or executed. The general-purpose processor may be a microprocessor or the processor may also be any conventional processor or the like. The steps of the method disclosed in the embodiments of the present application may be directly embodied as being executed and completed by a hardware decoding processor, or executed and completed by a combination of hardware and software modules in the decoding processor. The software module can be located in a mature storage medium in the field such as random access memory, flash memory, read-only memory, programmable read-only memory, or electrically erasable programmable memory, registers. The storage medium is located in the memory, and the processor reads the information in the memory and completes the steps of the above method in combination with its hardware.
可以理解,本申请实施例中的存储器可以是易失性存储器或非易失性存储器,或可包括易失性和非易失性存储器两者。其中,非易失性存储器可以是只读存储器(read-only memory,ROM)、可编程只 读存储器(programmable rom,PROM)、可擦除可编程只读存储器(erasable PROM,EPROM)、电可擦除可编程只读存储器(electrically EPROM,EEPROM)或闪存。易失性存储器可以是随机存取存储器(random access memory,RAM),其用作外部高速缓存。通过示例性但不是限制性说明,许多形式的RAM可用,例如静态随机存取存储器(static RAM,SRAM)、动态随机存取存储器(dynamic RAM,DRAM)、同步动态随机存取存储器(synchronous DRAM,SDRAM)、双倍数据速率同步动态随机存取存储器(double data rate SDRAM,DDR SDRAM)、增强型同步动态随机存取存储器(enhanced SDRAM,ESDRAM)、同步连接动态随机存取存储器(synchlink DRAM,SLDRAM)和直接内存总线随机存取存储器(direct rambus RAM,DR RAM)。应注意,本文描述的系统和方法的存储器旨在包括但不限于这些和任意其它适合类型的存储器。It can be understood that the memory in the embodiment of the present application may be a volatile memory or a non-volatile memory, or may include both volatile and non-volatile memory. Among them, the non-volatile memory can be read-only memory (ROM), programmable read-only memory (programmable rom, PROM), erasable programmable read-only memory (erasable PROM, EPROM), and electrically available Erase programmable read-only memory (electrically EPROM, EEPROM) or flash memory. The volatile memory may be random access memory (RAM), which is used as an external cache. By way of exemplary but not restrictive description, many forms of RAM are available, such as static random access memory (static RAM, SRAM), dynamic random access memory (dynamic RAM, DRAM), synchronous dynamic random access memory (synchronous DRAM, SDRAM), double data rate synchronous dynamic random access memory (double data rate SDRAM, DDR SDRAM), enhanced synchronous dynamic random access memory (enhanced SDRAM, ESDRAM), synchronous connection dynamic random access memory (synchlink DRAM, SLDRAM) ) And direct memory bus random access memory (direct rambus RAM, DR RAM). It should be noted that the memories of the systems and methods described herein are intended to include, but are not limited to, these and any other suitable types of memories.
应理解,在本申请实施例中,“与A相应的B”表示B与A相关联,根据A可以确定B。但还应理解,根据A确定B并不意味着仅仅根据A确定B,还可以根据A和/或其它信息确定B。It should be understood that in the embodiments of the present application, "B corresponding to A" means that B is associated with A, and B can be determined according to A. However, it should also be understood that determining B according to A does not mean that B is determined only according to A, and B can also be determined according to A and/or other information.
另外,本文中术语“和/或”,仅仅是一种描述关联对象的关联关系,表示可以存在三种关系,例如,A和/或B,可以表示:单独存在A,同时存在A和B,单独存在B这三种情况。另外,本文中字符“/”,一般表示前后关联对象是一种“或”的关系。In addition, the term "and/or" in this article is only an association relationship describing associated objects, which means that there can be three types of relationships, for example, A and/or B, which can mean: A alone exists, and both A and B exist. There are three cases of B alone. In addition, the character "/" in this text generally indicates that the associated objects before and after are in an "or" relationship.
本领域普通技术人员可以意识到,结合本文中所公开的实施例描述的各示例的单元及算法步骤,能够以电子硬件、或者计算机软件和电子硬件的结合来实现。这些功能究竟以硬件还是软件方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本申请的范围。A person of ordinary skill in the art may be aware that the units and algorithm steps of the examples described in combination with the embodiments disclosed herein can be implemented by electronic hardware or a combination of computer software and electronic hardware. Whether these functions are executed by hardware or software depends on the specific application and design constraint conditions of the technical solution. Professionals and technicians can use different methods for each specific application to implement the described functions, but such implementation should not be considered beyond the scope of this application.
所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,上述描述的系统、装置和单元的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。Those skilled in the art can clearly understand that, for the convenience and conciseness of description, the specific working process of the above-described system, device, and unit can refer to the corresponding process in the foregoing method embodiment, which will not be repeated here.
在本申请所提供的几个实施例中,应该理解到,所揭露的系统、装置和方法,可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,所述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。In the several embodiments provided in this application, it should be understood that the disclosed system, device, and method may be implemented in other ways. For example, the device embodiments described above are only illustrative. For example, the division of the units is only a logical function division, and there may be other divisions in actual implementation, for example, multiple units or components can be combined or It can be integrated into another system, or some features can be ignored or not implemented. In addition, the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, devices or units, and may be in electrical, mechanical or other forms.
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or they may be distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions of the embodiments.
另外,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。In addition, the functional units in each embodiment of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit.
所述功能如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本申请各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(read-only memory,ROM)、随机存取存储器(random access memory,RAM)、磁碟或者光盘等各种可以存储程序代码的介质。If the function is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a computer readable storage medium. Based on this understanding, the technical solution of this application essentially or the part that contributes to the existing technology or the part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium, including Several instructions are used to make a computer device (which may be a personal computer, a server, or a network device, etc.) execute all or part of the steps of the method described in each embodiment of the present application. The aforementioned storage media include: U disk, mobile hard disk, read-only memory (read-only memory, ROM), random access memory (random access memory, RAM), magnetic disk or optical disk and other media that can store program code .
以上所述,仅为本申请的具体实施方式,但本申请的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本申请揭露的技术范围内,可轻易想到变化或替换,都应涵盖在本申请的保护范围之内。因此,本申请的保护范围应以所述权利要求的保护范围为准。The above are only specific implementations of this application, but the protection scope of this application is not limited to this. Any person skilled in the art can easily think of changes or substitutions within the technical scope disclosed in this application. Should be covered within the scope of protection of this application. Therefore, the protection scope of this application should be subject to the protection scope of the claims.

Claims (40)

  1. 一种语音分频传输方法,其特征在于,包括:A voice frequency division transmission method, characterized in that it comprises:
    源端对第一频段语音信号和第二频段语音信号进行编码;The source end encodes the voice signal in the first frequency band and the voice signal in the second frequency band;
    所述源端将帧同步信息标记到编码后的所述第一频段语音信号和编码后的所述第二频段语音信号中;The source end marks the frame synchronization information into the encoded first frequency band speech signal and the encoded second frequency band speech signal;
    所述源端通过第一同步链路和第二同步链路分别发送带有所述帧同步信息的所述编码后的所述第一频段语音信号和带有所述帧同步信息的所述编码后的所述第二频段语音信号给播放端。The source end sends the encoded first frequency band speech signal with the frame synchronization information and the encoded with the frame synchronization information through the first synchronization link and the second synchronization link, respectively The second frequency band voice signal is sent to the playback terminal.
  2. 根据权利要求1所述的语音分频传输方法,其特征在于,包括:The voice frequency division transmission method according to claim 1, characterized by comprising:
    所述第二频段语音信号的编码方式的压缩率高于所述第一频段语音信号的编码方式的压缩率;或者The compression rate of the encoding method of the second frequency band speech signal is higher than the compression rate of the encoding method of the first frequency band speech signal; or
    所述第二频段语音信号的编码方式的压缩率低于所述第一频段语音信号的编码方式的压缩率。The compression rate of the encoding method of the speech signal in the second frequency band is lower than the compression rate of the encoding method of the speech signal in the first frequency band.
  3. 根据权利要求1或2所述的语音分频传输方法,其特征在于,源端对第二频段语音信号进行编码包括:The voice frequency division transmission method according to claim 1 or 2, wherein the source end encoding the second frequency band voice signal comprises:
    所述源端对高频语音信号进行编码,所述源端对所述高频语音信号的所述编码方式包括CELT编码方式或SBR编码方式;或者The source end encodes a high-frequency voice signal, and the source end encodes the high-frequency voice signal in a CELT encoding method or an SBR encoding method; or
    所述源端对非高频语音信号进行编码,所述源端对所述非高频语音信号的所述编码方式包括SILK编码方式、SBC编码方式、AAC编码方式或MP3编码方式。The source end encodes a non-high frequency speech signal, and the encoding method of the source end to the non-high frequency speech signal includes a SILK encoding method, an SBC encoding method, an AAC encoding method, or an MP3 encoding method.
  4. 根据权利要求1至3中任一项所述的语音分频传输方法,其特征在于,所述源端将帧同步信息标记到编码后的所述第一频段语音信号和编码后的所述第二频段语音信号中包括:The voice frequency division transmission method according to any one of claims 1 to 3, wherein the source end marks the frame synchronization information on the encoded first frequency band speech signal and the encoded first frequency signal The two-band voice signal includes:
    所述源端标记预设延时内检测到的编码后的任一频段的语音信号为一帧数据,所述帧同步信息包括所述一帧数据的开始时刻和结束时刻。The source end marks the encoded voice signal in any frequency band detected within a preset delay as one frame of data, and the frame synchronization information includes the start time and the end time of the one frame of data.
  5. 根据权利要求1至4中任一项所述的语音分频传输方法,其 特征在于,所述源端通过第二同步链路发送带有所述帧同步信息的所述编码后的所述第二频段语音信号给播放端之前,包括:The voice frequency division transmission method according to any one of claims 1 to 4, wherein the source end sends the encoded first with the frame synchronization information through a second synchronization link Before the two-band voice signal is sent to the player, it includes:
    根据所述播放端发送的所述播放端支持的链路参数确定使用的链路参数;Determining the link parameter used according to the link parameter supported by the playback terminal sent by the playback terminal;
    所述源端根据所述使用的链路参数建立与所述播放端之间的所述第二同步链路。The source end establishes the second synchronization link with the playback end according to the used link parameter.
  6. 根据权利要求5所述的语音分频传输方法,其特征在于,所述根据所述播放端发送的所述播放端支持的链路参数确定使用的链路参数之前,包括:The voice frequency division transmission method according to claim 5, characterized in that, before determining the link parameters used according to the link parameters supported by the playback end sent by the playback end, the method comprises:
    所述源端发送第二同步链路请求给所述播放端;Sending, by the source end, a second synchronization link request to the playback end;
    所述源端接收所述第二同步链路请求的回复;Receiving, by the source end, a reply to the second synchronization link request;
    所述源端根据所述第二同步链路请求的回复确定是否建立所述第二同步链路。The source terminal determines whether to establish the second synchronization link according to the reply of the second synchronization link request.
  7. 根据权利要求1至6中任一项所述的语音分频传输方法,其特征在于,所述源端对第二频段语音信号进行编码之前,还包括:The voice frequency division transmission method according to any one of claims 1 to 6, wherein before the source end encodes the voice signal of the second frequency band, the method further comprises:
    所述源端根据控制数据流判断所述播放端是否支持所述语音分频传输,所述控制数据流通过异步链路传输。The source terminal judges whether the playback terminal supports the voice frequency division transmission according to the control data stream, and the control data stream is transmitted through an asynchronous link.
  8. 根据权利要求7所述的语音分频传输方法,其特征在于,若所述源端通过自定义通用唯一识别码(UUID)标识语音分频传输服务,所述源端根据控制数据流判断所述播放端是否支持所述语音分频传输包括:The voice frequency division transmission method according to claim 7, wherein if the source end identifies the voice frequency division transmission service by a custom universally unique identifier (UUID), the source end judges the voice frequency division transmission service according to the control data flow Whether the playback terminal supports the voice frequency division transmission includes:
    所述控制数据流包括所述自定义UUID的值,若所述源端接收到的所述播放端的所述自定义UUID的值等于预设UUID值,所述源端确定所述播放端支持所述语音分频传输。The control data stream includes the value of the custom UUID, and if the value of the custom UUID of the playback terminal received by the source is equal to the preset UUID value, the source determines that the playback terminal supports all The voice frequency division transmission.
  9. 根据权利要求7或8所述的语音分频传输方法,其特征在于,包括:The voice frequency division transmission method according to claim 7 or 8, characterized in that it comprises:
    若所述源端根据控制数据流确定所述播放端支持所述语音分频传输,所述源端发送音频配置参数请求给所述播放端;If the source terminal determines that the playback terminal supports the voice frequency division transmission according to the control data stream, the source terminal sends an audio configuration parameter request to the playback terminal;
    所述源端接收所述播放端支持的所述第二频段语音信号对应的 音频配置参数,所述音频配置参数包括编解码参数和码率,所述编解码参数包括所述编码方式和解码方式其中的一种或两种;The source terminal receives audio configuration parameters corresponding to the second frequency band voice signal supported by the playback terminal, where the audio configuration parameters include coding and decoding parameters and bit rates, and the coding and decoding parameters include the coding and decoding modes One or two of them;
    所述源端根据所述播放端支持的所述第二频段语音信号对应的音频配置参数确定使用的音频配置参数并将所述使用的音频配置参数发送给所述播放端。The source terminal determines the used audio configuration parameters according to the audio configuration parameters corresponding to the second frequency band voice signal supported by the playback terminal and sends the used audio configuration parameters to the playback terminal.
  10. 根据权利要求1至9中任一项所述的语音分频传输方法,其特征在于,包括:The voice frequency division transmission method according to any one of claims 1 to 9, characterized in that it comprises:
    所述第二同步链路的条数小于或等于所述第二频段语音信号的频段数;The number of the second synchronization links is less than or equal to the number of frequency bands of the second frequency band voice signal;
    所述源端根据电量情况或者所述第二同步链路的链路质量断开一条或多条所述第二同步链路;或者The source end disconnects one or more of the second synchronization links according to the power condition or the link quality of the second synchronization link; or
    所述源端根据所述电量情况或者所述第二同步链路的链路质量建立所述一条或多条所述第二同步链路。The source end establishes the one or more second synchronization links according to the power condition or the link quality of the second synchronization link.
  11. 一种语音分频传输方法,其特征在于,包括:A voice frequency division transmission method, characterized in that it comprises:
    播放端通过第一同步链路和第二同步链路分别接收带有帧同步信息的编码后的第一频段语音信号和带有所述帧同步信息的编码后的第二频段语音信号;The playback terminal receives the encoded first frequency band speech signal with frame synchronization information and the encoded second frequency band speech signal with frame synchronization information through the first synchronization link and the second synchronization link, respectively;
    所述播放端对接收到的所述第一频段语音信号和所述第二频段语音信号进行解码;The playback terminal decodes the received voice signal in the first frequency band and the voice signal in the second frequency band;
    所述播放端通过所述帧同步信息对解码后的所述第一频段语音信号和解码后的所述第二频段语音信号进行同步。The playback terminal synchronizes the decoded first frequency band speech signal and the decoded second frequency band speech signal through the frame synchronization information.
  12. 根据权利要求11所述的语音分频传输方法,其特征在于,所述播放端通过所述帧同步信息对解码后的所述第一频段语音信号和解码后的所述第二频段语音信号进行同步之后,包括:The voice frequency division transmission method according to claim 11, wherein the playback end uses the frame synchronization information to perform the decoded first frequency band speech signal and the decoded second frequency band speech signal After synchronization, include:
    所述播放端对同步后的所述第一频段语音信号和同步后的所述第二频段语音信号通过一个或多个不同的数模转换器分别进行数模转换;The playback terminal performs digital-to-analog conversion on the synchronized voice signal in the first frequency band and the synchronized voice signal in the second frequency band through one or more different digital-to-analog converters;
    所述播放端对数模转换后的所述第一频段语音信号和数模转换后的所述第二频段语音信号通过一个或多个不同的放大器分别进行 放大;The playback terminal amplifies the first frequency band speech signal after digital-to-analog conversion and the second frequency band speech signal after digital-to-analog conversion respectively through one or more different amplifiers;
    所述播放端对放大后的所述第一频段语音信号和放大后的所述第二频段语音信号通过一个或多个不同的电声转换器分别进行电声转换。The playback terminal performs electro-acoustic conversion on the amplified voice signal in the first frequency band and the amplified voice signal in the second frequency band through one or more different electro-acoustic converters.
  13. 根据权利要求11或12所述的语音分频传输方法,其特征在于,所述播放端通过第二同步链路接收带有所述帧同步信息的编码后的第二频段语音信号之前包括:The voice frequency division transmission method according to claim 11 or 12, wherein before the playback terminal receives the encoded second frequency band voice signal with the frame synchronization information through the second synchronization link, the method comprises:
    所述播放端发送控制数据流给所述源端,以使所述源端根据所述控制数据流判断所述播放端是否支持所述语音分频传输。The playback end sends a control data stream to the source end, so that the source end judges whether the playback end supports the voice frequency division transmission according to the control data stream.
  14. 根据权利要求13所述的语音分频传输方法,其特征在于,包括:The voice frequency division transmission method according to claim 13, characterized in that it comprises:
    若所述源端根据所述控制数据流确定所述播放端支持所述语音分频传输,所述播放端接收所述源端发送的第二同步链路请求并发送所述第二同步链路请求的回复;If the source terminal determines that the playback terminal supports the voice frequency division transmission according to the control data stream, the playback terminal receives the second synchronization link request sent by the source terminal and sends the second synchronization link Requested reply
    若所述播放端支持所述语音分频传输,所述播放端发送所述播放端支持的链路参数给所述源端。If the playback end supports the voice frequency division transmission, the playback end sends the link parameters supported by the playback end to the source end.
  15. 根据权利要求14所述的语音分频传输方法,其特征在于,所述播放端接收所述源端发送的第二同步链路请求之前,包括:The voice frequency division transmission method according to claim 14, wherein before the playback end receives the second synchronization link request sent by the source end, the method comprises:
    所述播放端接收所述源端发送的音频配置参数请求;The playback terminal receives the audio configuration parameter request sent by the source terminal;
    所述播放端发送所述播放端支持的所述第二频段语音信号对应的音频配置参数给所述源端;Sending, by the playback terminal, the audio configuration parameters corresponding to the second frequency band voice signal supported by the playback terminal to the source terminal;
    所述播放端接收所述源端发送的使用的音频配置参数并配置所述使用的音频配置参数。The playback terminal receives the used audio configuration parameters sent by the source terminal and configures the used audio configuration parameters.
  16. 根据权利要求11至15中任一项所述的语音分频传输方法,其特征在于,包括:The voice frequency division transmission method according to any one of claims 11 to 15, characterized in that it comprises:
    所述播放端根据电量情况或者所述第二同步链路的链路质量断开一条或多条所述第二同步链路;或者The playback terminal disconnects one or more of the second synchronization links according to the power condition or the link quality of the second synchronization link; or
    所述播放端根据所述电量情况或者所述第二同步链路的链路质量请求所述源端建立所述一条或多条所述第二同步链路。The playback end requests the source end to establish the one or more second synchronization links according to the power condition or the link quality of the second synchronization link.
  17. 一种源端,用于语音分频传输,所述源端包括:A source terminal is used for voice frequency division transmission. The source terminal includes:
    编码模块,用于对第一频段语音信号和第二频段语音信号进行编码;Encoding module, used to encode the first frequency band speech signal and the second frequency band speech signal;
    预同步模块,用于将帧同步信息标记到编码后的所述第一频段语音信号和编码后的所述第二频段语音信号中;以及A pre-synchronization module for marking frame synchronization information into the encoded first frequency band speech signal and the encoded second frequency band speech signal; and
    第一发送模块,用于通过第一同步链路和第二同步链路分别发送带有所述帧同步信息的所述编码后的所述第一频段语音信号和带有所述帧同步信息的所述编码后的所述第二频段语音信号给播放端。The first sending module is configured to send the encoded first frequency band voice signal with the frame synchronization information and the voice signal with the frame synchronization information through the first synchronization link and the second synchronization link, respectively The encoded voice signal of the second frequency band is sent to the playback terminal.
  18. 根据权利要求17所述的源端,其特征在于,包括:The source according to claim 17, characterized in that it comprises:
    所述第二频段语音信号的编码方式的压缩率高于所述第一频段语音信号的编码方式的压缩率;或者The compression rate of the encoding method of the second frequency band speech signal is higher than the compression rate of the encoding method of the first frequency band speech signal; or
    所述第二频段语音信号的编码方式的压缩率低于所述第一频段语音信号的编码方式的压缩率。The compression rate of the encoding method of the speech signal in the second frequency band is lower than the compression rate of the encoding method of the speech signal in the first frequency band.
  19. 根据权利要求17或18所述的源端,其特征在于,所述编码模块包括:The source according to claim 17 or 18, wherein the encoding module comprises:
    高频编码模块,用于对高频语音信号进行编码,对所述高频语音信号的编码方式包括CELT编码方式或SBR编码方式;或者The high-frequency encoding module is used to encode the high-frequency speech signal, and the encoding method of the high-frequency speech signal includes the CELT encoding method or the SBR encoding method; or
    非高频编码模块,用于对非高频语音信号进行编码,对所述非高频语音信号的编码方式包括SILK编码方式、SBC编码方式、AAC编码方式或MP3编码方式。The non-high frequency encoding module is used to encode the non-high frequency voice signal, and the encoding method of the non-high frequency voice signal includes the SILK encoding method, the SBC encoding method, the AAC encoding method or the MP3 encoding method.
  20. 根据权利要求17至19中任一项所述的源端,其特征在于,所述预同步模块包括:The source terminal according to any one of claims 17 to 19, wherein the pre-synchronization module comprises:
    数据帧标记模块,用于标记预设延时内检测到的编码后的任一频段的语音信号为一帧数据,所述帧同步信息包括所述一帧数据的开始时刻和结束时刻。The data frame marking module is used to mark the coded voice signal of any frequency band detected within a preset delay as a frame of data, and the frame synchronization information includes the start time and end time of the frame of data.
  21. 根据权利要求17至20中任一项所述的源端,其特征在于,所述源端还包括:The source terminal according to any one of claims 17 to 20, wherein the source terminal further comprises:
    第一参数确定模块,第一发送模块通过第二同步链路发送带有所述帧同步信息的所述编码后的所述第二频段语音信号给播放端之前, 用于根据所述播放端发送的所述播放端支持的链路参数确定使用的链路参数;The first parameter determination module, the first sending module sends the encoded second frequency band voice signal with the frame synchronization information to the playback terminal through the second synchronization link, and is used for sending according to the playback terminal The link parameters supported by the playback terminal determine the link parameters used;
    链路建立模块,用于根据所述使用的链路参数建立与所述播放端之间的所述第二同步链路。The link establishment module is configured to establish the second synchronization link with the playback terminal according to the used link parameters.
  22. 根据权利要求21所述的源端,其特征在于,所述源端还包括:The source terminal according to claim 21, wherein the source terminal further comprises:
    第二发送模块,所述第一参数确定模块根据所述播放端发送的所述播放端支持的链路参数确定使用的链路参数之前,用于发送第二同步链路请求给所述播放端;以及The second sending module is configured to send a second synchronization link request to the playback terminal before the first parameter determination module determines the link parameters used according to the link parameters supported by the playback terminal sent by the playback terminal. ;as well as
    第一接收模块,用于接收所述第二同步链路请求的回复;The first receiving module is configured to receive a reply to the second synchronization link request;
    所述第一参数确定模块还用于根据所述第二同步链路请求的回复确定是否建立所述第二同步链路。The first parameter determination module is further configured to determine whether to establish the second synchronization link according to a reply to the second synchronization link request.
  23. 根据权利要求17至22中任一项所述的源端,其特征在于,所述源端还包括:The source terminal according to any one of claims 17 to 22, wherein the source terminal further comprises:
    第一判断模块,所述编码模块对所述第二频段语音信号进行编码之前,用于根据控制数据流判断所述播放端是否支持所述语音分频传输,所述控制数据流通过异步链路传输。The first determining module, before the encoding module encodes the voice signal in the second frequency band, is used to determine whether the playback terminal supports the voice frequency division transmission according to the control data stream, and the control data stream passes through an asynchronous link transmission.
  24. 根据权利要求23所述的源端,其特征在于,所述源端还包括UUID模块,用于通过自定义通用唯一识别码(UUID)标识语音分频传输服务;The source terminal according to claim 23, wherein the source terminal further comprises a UUID module for identifying the voice frequency division transmission service through a custom universally unique identifier (UUID);
    所述第一判断模块包括:The first judgment module includes:
    第二判断模块,所述控制数据流包括所述自定义UUID的值,若接收到的所述播放端的所述自定义UUID的值等于预设UUID值,用于确定所述播放端支持所述语音分频传输。The second judgment module, the control data stream includes the value of the custom UUID, and if the received value of the custom UUID of the player is equal to the preset UUID value, it is used to determine that the player supports the Voice frequency division transmission.
  25. 根据权利要求23或24所述的源端,其特征在于,所述源端还包括:The source terminal according to claim 23 or 24, wherein the source terminal further comprises:
    第三发送模块:若第一判断模块根据控制数据流确定所述播放端支持所述语音分频传输,用于发送音频配置参数请求给所述播放端;The third sending module: if the first judging module determines that the playback end supports the voice frequency division transmission according to the control data stream, it is used to send an audio configuration parameter request to the playback end;
    第二接收模块:用于接收所述播放端支持的所述第二频段语音信 号对应的音频配置参数,所述音频配置参数包括编解码参数和码率,所述编解码参数包括所述编码方式和解码方式中的一种或两种;以及The second receiving module: configured to receive audio configuration parameters corresponding to the second frequency band voice signal supported by the playback terminal, where the audio configuration parameters include codec parameters and bit rates, and the codec parameters include the coding mode And one or two of the decoding methods; and
    第二参数确定模块,用于根据所述播放端支持的所述第二频段语音信号对应的音频配置参数确定使用的音频配置参数;The second parameter determination module is configured to determine the audio configuration parameters used according to the audio configuration parameters corresponding to the second frequency band voice signal supported by the playback terminal;
    所述第三发送模块还用于将所述使用的音频配置参数发送给所述播放端。The third sending module is further configured to send the used audio configuration parameters to the playback terminal.
  26. 根据权利要求17至25中任一项所述的源端,其特征在于,所述源端还包括:The source terminal according to any one of claims 17 to 25, wherein the source terminal further comprises:
    第一传输控制模块,用于根据电量情况或者所述第二同步链路的链路质量断开一条或多条所述第二同步链路,所述第二同步链路的条数小于或等于所述第二频段语音信号的频段数;或者The first transmission control module is configured to disconnect one or more of the second synchronization links according to the power condition or the link quality of the second synchronization link, and the number of the second synchronization links is less than or equal to The number of frequency bands of the voice signal in the second frequency band; or
    所述第一传输控制模块还用于根据所述电量情况或者所述第二同步链路的链路质量建立所述一条或多条所述第二同步链路,所述第二同步链路的条数小于或等于所述第二频段语音信号的频段数。The first transmission control module is further configured to establish the one or more second synchronization links according to the power status or the link quality of the second synchronization link, and the second synchronization link The number of bars is less than or equal to the number of frequency bands of the second frequency band voice signal.
  27. 一种播放端,用于语音分频传输,所述播放端包括:A playback terminal is used for voice frequency division transmission. The playback terminal includes:
    第三接收模块,用于通过第一同步链路和第二同步链路分别接收带有帧同步信息的编码后的第一频段语音信号和带有所述帧同步信息的编码后的第二频段语音信号;The third receiving module is configured to receive the encoded first frequency band speech signal with frame synchronization information and the encoded second frequency band with frame synchronization information through the first synchronization link and the second synchronization link, respectively voice signal;
    解码模块,用于对接收到的所述编码后的第一频段语音信号和所述编码后的第二频段语音信号进行解码;A decoding module, configured to decode the received encoded first frequency band speech signal and the encoded second frequency band speech signal;
    同步模块,用于通过所述帧同步信息对解码后的所述第一频段语音信号和解码后的所述第二频段语音信号进行同步。The synchronization module is configured to synchronize the decoded voice signal in the first frequency band and the decoded voice signal in the second frequency band through the frame synchronization information.
  28. 根据权利要求27所述的播放端,其特征在于,所述播放端还包括:The playback terminal according to claim 27, wherein the playback terminal further comprises:
    一个或多个不同的数模转换模块,用于对同步后的所述第一频段语音信号和所述第二频段语音信号分别进行数模转换;One or more different digital-to-analog conversion modules for performing digital-to-analog conversion on the synchronized voice signal in the first frequency band and the voice signal in the second frequency band respectively;
    一个或多个不同的放大模块,用于对数模转换后的所述第一频段语音信号和所述第二频段语音信号分别进行放大;以及One or more different amplifying modules for respectively amplifying the first frequency band speech signal and the second frequency band speech signal after digital-to-analog conversion; and
    一个或多个不同的电声转换模块,用于对放大后的所述第一频段 语音信号和所述第二频段语音信号分别进行电声转换。One or more different electro-acoustic conversion modules are used to perform electro-acoustic conversion on the amplified speech signal in the first frequency band and the speech signal in the second frequency band, respectively.
  29. 根据权利要求27或28所述的播放端,其特征在于,所述播放端还包括:The playback terminal according to claim 27 or 28, wherein the playback terminal further comprises:
    第四发送模块,所述第三接收模块通过第二同步链路接收带有所述帧同步信息的编码后的第二频段语音信号之前,用于发送控制数据流给所述源端,以使所述源端根据所述控制数据流判断所述播放端是否支持所述语音分频传输。The fourth sending module, before the third receiving module receives the encoded second frequency band voice signal with the frame synchronization information through the second synchronization link, is used to send a control data stream to the source end, so that The source terminal determines whether the playback terminal supports the voice frequency division transmission according to the control data stream.
  30. 根据权利要求29所述的播放端,其特征在于,所述播放端还包括:The playback terminal according to claim 29, wherein the playback terminal further comprises:
    第四接收模块,若所述源端根据所述控制数据流确定所述播放端支持所述语音分频传输,用于接收所述源端发送的第二同步链路请求;以及A fourth receiving module, if the source terminal determines that the playback terminal supports the voice frequency division transmission according to the control data stream, it is used to receive the second synchronization link request sent by the source terminal; and
    第五发送模块,用于发送所述第二同步链路请求的回复;A fifth sending module, configured to send a reply to the second synchronization link request;
    若所述播放端支持所述语音分频传输,所述第五发送模块还用于发送所述播放端支持的链路参数给所述源端。If the playback end supports the voice frequency division transmission, the fifth sending module is further configured to send link parameters supported by the playback end to the source end.
  31. 根据权利要求30所述的播放端,其特征在于,所述播放端还包括:The playback terminal according to claim 30, wherein the playback terminal further comprises:
    第五接收模块,所述第四接收模块接收所述源端发送的第二同步链路请求之前,用于接收所述源端发送的音频配置参数请求;A fifth receiving module, configured to receive an audio configuration parameter request sent by the source end before the fourth receiving module receives the second synchronization link request sent by the source end;
    第六发送模块,用于发送所述播放端支持的所述第二频段语音信号对应的音频配置参数给所述源端;A sixth sending module, configured to send audio configuration parameters corresponding to the second frequency band voice signal supported by the playback terminal to the source terminal;
    所述第五接收模块还用于接收所述源端发送的使用的音频配置参数;以及The fifth receiving module is further configured to receive the audio configuration parameters used by the source end; and
    参数配置模块,用于配置所述使用的音频配置参数。The parameter configuration module is used to configure the audio configuration parameters used.
  32. 根据权利要求27至31中任一项所述的播放端,其特征在于,所述播放端还包括:The playback terminal according to any one of claims 27 to 31, wherein the playback terminal further comprises:
    第二传输控制模块,用于根据电量情况或者所述第二同步链路的链路质量断开一条或多条所述第二同步链路;或者The second transmission control module is configured to disconnect one or more of the second synchronization links according to the power condition or the link quality of the second synchronization link; or
    所述第二传输控制模块还用于根据所述电量情况或者所述第二 同步链路的链路质量请求所述源端建立所述一条或多条所述第二同步链路。The second transmission control module is further configured to request the source end to establish the one or more second synchronization links according to the power condition or the link quality of the second synchronization link.
  33. 一种源端,用于语音分频传输,其特征在于,包括:存储器和处理器;A source terminal for voice frequency division transmission, characterized in that it includes: a memory and a processor;
    所述存储器与所述处理器耦合;The memory is coupled with the processor;
    所述存储器,用于存储程序指令;The memory is used to store program instructions;
    所述处理器,用于调用所述存储器存储的程序指令,使得所述源端执行上述权利要求1至10中任一项所述的语音分频传输方法。The processor is configured to call program instructions stored in the memory to enable the source to execute the voice frequency division transmission method according to any one of claims 1 to 10.
  34. 一种播放端,用于语音分频传输,其特征在于,包括:存储器和处理器;A playback terminal, used for voice frequency division transmission, characterized in that it comprises: a memory and a processor;
    所述存储器与所述处理器耦合;The memory is coupled with the processor;
    所述存储器,用于存储程序指令;The memory is used to store program instructions;
    所述处理器,用于调用所述存储器存储的程序指令,使得所述播放端执行上述权利要求11至16中任一项所述的语音分频传输方法。The processor is configured to call the program instructions stored in the memory, so that the playback terminal executes the voice frequency division transmission method according to any one of claims 11 to 16.
  35. 一种计算机可读存储介质,其特征在于,包括:其上存储有计算机程序,其特征在于,所述计算机程序被处理器执行时实现上述权利要求1至10中任一项所述的语音分频传输方法。A computer-readable storage medium, comprising: a computer program stored thereon, wherein the computer program implements the voice analysis according to any one of claims 1 to 10 when the computer program is executed by a processor. Frequency transmission method.
  36. 一种计算机可读存储介质,其特征在于,包括:其上存储有计算机程序,其特征在于,所述计算机程序被处理器执行时实现上述权利要求11至16中任一项所述的语音分频传输方法。A computer-readable storage medium, comprising: a computer program stored thereon, wherein the computer program implements the voice analysis according to any one of claims 11 to 16 when the computer program is executed by a processor. Frequency transmission method.
  37. 一种源端电路,其特征在于,包括:A source circuit, characterized in that it comprises:
    编码器,用于对第一频段语音信号和第二频段语音信号进行编码;以及An encoder for encoding the first frequency band speech signal and the second frequency band speech signal; and
    源端控制器,与所述编码器连接,用于通过第一同步链路和第二同步链路分别发送带有帧同步信息的编码后的所述第一频段语音信号和带有所述帧同步信息的编码后的所述第二频段语音信号给播放端电路。The source controller, connected to the encoder, is configured to send the encoded first frequency band speech signal with frame synchronization information and the frame with the frame synchronization information through the first synchronization link and the second synchronization link, respectively. The second frequency band voice signal after encoding of the synchronization information is sent to the playback terminal circuit.
  38. 根据权利要求37所述的源端电路,其特征在于,所述源端电路还包括滤波器,所述滤波器与所述编码器连接,用于分离所述第 一频段语音信号和所述第二频段语音信号。The source circuit according to claim 37, wherein the source circuit further comprises a filter, and the filter is connected to the encoder for separating the speech signal of the first frequency band from the speech signal of the first frequency band. Two-band voice signal.
  39. 一种播放端电路,其特征在于,包括:A playback terminal circuit, characterized in that it comprises:
    播放端控制器,用于通过第一同步链路和第二同步链路分别接收带有帧同步信息的编码后的第一频段语音信号和带有所述帧同步信息的编码后的第二频段语音信号;以及The player controller is used to receive the encoded first frequency band speech signal with frame synchronization information and the encoded second frequency band with frame synchronization information through the first synchronization link and the second synchronization link, respectively Voice signal; and
    解码器,与所述播放端控制器连接,用于对接收到的所述第一频段语音信号和所述第二频段语音信号进行解码。The decoder is connected to the player controller, and is used to decode the received voice signal in the first frequency band and the voice signal in the second frequency band.
  40. 根据权利要求39所述的播放端电路,其特征在于,所述播放端电路还包括:The playback terminal circuit of claim 39, wherein the playback terminal circuit further comprises:
    一个或多个不同的数模转换器,分别与所述解码器连接,用于对解码后的所述第一频段语音信号和解码后的所述第二频段语音信号分别进行数模转换;One or more different digital-to-analog converters, respectively connected to the decoder, for performing digital-to-analog conversion on the decoded speech signal in the first frequency band and the speech signal in the second frequency band after decoding;
    一个或多个不同的放大器,分别与一个或多个所述数模转换器连接,用于对数模转换后的所述第一频段语音信号和数模转换后的所述第二频段语音信号分别进行放大;以及One or more different amplifiers are respectively connected to one or more of the digital-to-analog converters, and are used to perform the digital-to-analog conversion of the first frequency band speech signal and the digital-to-analog conversion of the second frequency band speech signal Zoom in separately; and
    一个或多个不同的电声转换器,分别与一个或多个所述放大器连接,用于对放大后的所述第一频段语音信号和放大后的所述第二频段语音信号分别进行电声转换。One or more different electroacoustic converters, respectively connected to one or more of the amplifiers, for performing electroacoustics on the amplified voice signal in the first frequency band and the amplified voice signal in the second frequency band, respectively Conversion.
PCT/CN2019/087811 2019-05-21 2019-05-21 Voice frequency division transmission method, source terminal, playback terminal, source terminal circuit and playback terminal circuit WO2020232631A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
PCT/CN2019/087811 WO2020232631A1 (en) 2019-05-21 2019-05-21 Voice frequency division transmission method, source terminal, playback terminal, source terminal circuit and playback terminal circuit
CN201980000976.XA CN110366752B (en) 2019-05-21 2019-05-21 Voice frequency division transmission method, source terminal, play terminal, source terminal circuit and play terminal circuit

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2019/087811 WO2020232631A1 (en) 2019-05-21 2019-05-21 Voice frequency division transmission method, source terminal, playback terminal, source terminal circuit and playback terminal circuit

Publications (1)

Publication Number Publication Date
WO2020232631A1 true WO2020232631A1 (en) 2020-11-26

Family

ID=68225504

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/087811 WO2020232631A1 (en) 2019-05-21 2019-05-21 Voice frequency division transmission method, source terminal, playback terminal, source terminal circuit and playback terminal circuit

Country Status (2)

Country Link
CN (1) CN110366752B (en)
WO (1) WO2020232631A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112738732B (en) * 2019-10-28 2022-07-29 成都鼎桥通信技术有限公司 Audio playing method and device
CN112992161A (en) * 2021-04-12 2021-06-18 北京世纪好未来教育科技有限公司 Audio encoding method, audio decoding method, audio encoding apparatus, audio decoding medium, and electronic device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1848241A (en) * 1995-12-01 2006-10-18 数字剧场系统股份有限公司 Multi-channel audio frequency coder
CN105163155A (en) * 2015-08-26 2015-12-16 小米科技有限责任公司 Method and device for synchronous playing
CN105653237A (en) * 2016-02-01 2016-06-08 宇龙计算机通信科技(深圳)有限公司 Audio playing control method and system and terminal
US20190050192A1 (en) * 2014-07-30 2019-02-14 Sonos, Inc. Contextual Indexing of Media Items

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4402073A (en) * 1980-03-11 1983-08-30 Vanderhoff Communications Ltd. Speech and data communication network
CA2149006C (en) * 1994-06-07 2003-07-15 Cecil Henry Bannister Synchronous voice/data messaging system
JP3484341B2 (en) * 1998-03-30 2004-01-06 三菱電機株式会社 Audio signal transmission device
JP2003023683A (en) * 2001-07-06 2003-01-24 Mitsubishi Electric Corp Voice relay transmission system
US7805313B2 (en) * 2004-03-04 2010-09-28 Agere Systems Inc. Frequency-based coding of channels in parametric multi-channel coding systems
US7272567B2 (en) * 2004-03-25 2007-09-18 Zoran Fejzo Scalable lossless audio codec and authoring tool
RU2396726C2 (en) * 2005-02-18 2010-08-10 Квэлкомм Инкорпорейтед Radio communication protocols for multichannel communication systems
US8755407B2 (en) * 2005-02-18 2014-06-17 Qualcomm Incorporated Radio link protocols for enhancing efficiency of multi-link communication systems
EP1861969B1 (en) * 2005-03-23 2019-10-16 QUALCOMM Incorporated Methods and apparatus for using multiple wireless links with a wireless terminal
US7464313B2 (en) * 2006-03-09 2008-12-09 Motorola, Inc. Hybrid approach for data transmission using a combination of single-user and multi-user packets
KR101379263B1 (en) * 2007-01-12 2014-03-28 삼성전자주식회사 Method and apparatus for decoding bandwidth extension
US20080300025A1 (en) * 2007-05-31 2008-12-04 Motorola, Inc. Method and system to configure audio processing paths for voice recognition
CN103812824A (en) * 2012-11-07 2014-05-21 中兴通讯股份有限公司 Audio frequency multi-code transmission method and corresponding device
WO2014108738A1 (en) * 2013-01-08 2014-07-17 Nokia Corporation Audio signal multi-channel parameter encoder

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1848241A (en) * 1995-12-01 2006-10-18 数字剧场系统股份有限公司 Multi-channel audio frequency coder
US20190050192A1 (en) * 2014-07-30 2019-02-14 Sonos, Inc. Contextual Indexing of Media Items
CN105163155A (en) * 2015-08-26 2015-12-16 小米科技有限责任公司 Method and device for synchronous playing
CN105653237A (en) * 2016-02-01 2016-06-08 宇龙计算机通信科技(深圳)有限公司 Audio playing control method and system and terminal

Also Published As

Publication number Publication date
CN110366752A (en) 2019-10-22
CN110366752B (en) 2023-10-10

Similar Documents

Publication Publication Date Title
US11109138B2 (en) Data transmission method and system, and bluetooth headphone
KR102569374B1 (en) How to operate a Bluetooth device
TWI287371B (en) Method and system for dynamically changing audio stream bit rate based on condition of a bluetooth connection
JP7426448B2 (en) Control of connected multimedia devices
CN109785841B (en) Bluetooth intelligent equipment voice interaction system and method
TW200805901A (en) Method and system for optimized architecture for bluetooth streaming audio applications
WO2008122212A1 (en) Method and system for auto-setting audio coding formats of bluetooth a2dp transmission
TW201118737A (en) Dynamically provisioning a device with audio processing capability
WO2021052293A1 (en) Audio coding method and apparatus
WO2021160040A1 (en) Audio transmission method and electronic device
WO2020232631A1 (en) Voice frequency division transmission method, source terminal, playback terminal, source terminal circuit and playback terminal circuit
KR20150121641A (en) Appratus and method for transmitting and receiving voice data in wireless communication system
EP3745813A1 (en) Method for operating a bluetooth device
US9924303B2 (en) Device and method for implementing synchronous connection-oriented (SCO) pass-through links
JP2000270024A (en) Method for exchanging capability of frame packet processing size in internet phone, terminal utilizing internet phone and medium recording program of internet phone
US20110235632A1 (en) Method And Apparatus For Performing High-Quality Speech Communication Across Voice Over Internet Protocol (VoIP) Communications Networks
CN111385780A (en) Bluetooth audio signal transmission method and device
US20140163971A1 (en) Method of using a mobile device as a microphone, method of audio playback, and related device and system
CN207718805U (en) A kind of high definition Bluetooth audio frequency transceiver terminal and communication system
US20210312934A1 (en) Communication method, apparatus, and system for digital enhanced cordless telecommunications (dect) base station
JP5006975B2 (en) Background noise information decoding method and background noise information decoding means
CN110662205B (en) Bluetooth-based audio transmission method, device, medium and equipment
US20220103948A1 (en) Method and system for performing audio ducking for headsets
TW202315430A (en) Bluetooth voice communication system and related computer program product for generating stereo voice effect
TW202405793A (en) Audio compression device, audio compression system and audio compression method

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19929354

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19929354

Country of ref document: EP

Kind code of ref document: A1