WO2023240653A1 - 音频信号格式确定方法、装置 - Google Patents

音频信号格式确定方法、装置 Download PDF

Info

Publication number
WO2023240653A1
WO2023240653A1 PCT/CN2022/099626 CN2022099626W WO2023240653A1 WO 2023240653 A1 WO2023240653 A1 WO 2023240653A1 CN 2022099626 W CN2022099626 W CN 2022099626W WO 2023240653 A1 WO2023240653 A1 WO 2023240653A1
Authority
WO
WIPO (PCT)
Prior art keywords
audio
terminal device
audio format
signal
format signal
Prior art date
Application number
PCT/CN2022/099626
Other languages
English (en)
French (fr)
Inventor
高硕�
Original Assignee
北京小米移动软件有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京小米移动软件有限公司 filed Critical 北京小米移动软件有限公司
Priority to PCT/CN2022/099626 priority Critical patent/WO2023240653A1/zh
Priority to CN202280001877.5A priority patent/CN117597936A/zh
Publication of WO2023240653A1 publication Critical patent/WO2023240653A1/zh

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/81Monomedia components thereof

Definitions

  • the present disclosure relates to the field of communication technology, and in particular, to an audio signal format determination method, device, equipment and storage medium.
  • voice and audio communication services can be extended from narrowband signals to ultra-wideband or even full-band services, which can meet users' increasing demand for high-quality audio.
  • the terminal device can only determine the audio format signal through preset settings, which makes the audio format signal determination less accurate and cannot provide optimal audio services that match the terminal device. Therefore, there is an urgent need for a method of "audio signal format determination" to determine the audio signal format based on the constraints of the terminal device, improve the accuracy of audio format signal determination, and provide optimal audio services based on the constraints.
  • the present disclosure proposes an audio signal format determination method, device, equipment and storage medium to determine the audio signal format based on the constraints of the terminal equipment, improve the accuracy of the audio format signal determination, and provide optimal audio signal format based on the constraints. Audio services.
  • An embodiment of the present disclosure provides a method for determining an audio signal format.
  • the method is executed by a terminal device.
  • the method includes:
  • the audio format signal to be selected for audio communication is determined from the set of audio format signals that the terminal device can provide, wherein the audio format signal to be selected is used to provide audio services for audio communication , the set of audio format signals includes at least one audio format signal.
  • the constraints of the terminal device include at least one of the following:
  • the hardware resource conditions of the terminal device are the hardware resource conditions of the terminal device.
  • the network environment conditions where the terminal device is located
  • the power of the terminal device is the power of the terminal device.
  • the method further includes:
  • a set of audio format signals that can be provided by the terminal device is determined.
  • determining the set of audio format signals that the terminal device can provide based on the number and/or microphone types set by the terminal device includes:
  • the audio format signal set that the terminal device can provide is the first audio format signal set
  • the audio format signals in the first audio format signal set include at least one of the following:
  • determining the set of audio format signals that the terminal device can provide based on the number and/or microphone types set by the terminal device includes:
  • the terminal device In response to the microphone provided in the terminal device including at least one external audio collection device, determining that the audio format signal set that the terminal device can provide is the second audio format signal set;
  • the audio format signals in the second audio format signal set include at least one of the following:
  • determining the audio format signal to be used for audio communication among the set of audio format signals that the terminal device can provide includes:
  • the score corresponding to the constraint condition is obtained, wherein the constraint condition includes the hardware resource condition of the terminal device, the terminal The software resource conditions of the device, the network environment conditions in which the terminal device is located, and the power of the terminal device;
  • the audio format signal to be selected for audio communication is determined among the set of audio format signals that can be provided by the terminal device.
  • determining the audio format signal to be used for audio communication among the set of audio format signals that can be provided by the terminal device includes:
  • the audio format to be selected for audio communication determined in the first audio format signal set is a spatial audio signal based on auxiliary metadata
  • the audio communication determined in the first audio format signal set is to be selected.
  • the audio format signal is a mono signal.
  • determining the audio format signal to be used for audio communication among the set of audio format signals that can be provided by the terminal device includes:
  • the audio format to be selected for audio communication determined in the second audio format signal set The signal is a scene-based HOA signal;
  • the audio communication requirements determined in the second audio format signal set are The audio format signal selected is a mono signal.
  • determining the audio format signal to be used for audio communication among the set of audio format signals that the terminal device can provide includes:
  • the audio format signal to be selected for audio communication is determined among the set of audio format signals that the terminal device can provide.
  • determining the audio format signal to be used for audio communication among the set of audio format signals that the terminal device can provide based on the power of the terminal device includes:
  • the audio communication determined in the first audio format signal set is to be selected.
  • the audio format signal is a spatial audio signal based on auxiliary metadata;
  • the audio format signal set determined in the first audio format signal set is a mono signal.
  • determining the audio format signal to be used for audio communication among the set of audio format signals that the terminal device can provide based on the power of the terminal device includes:
  • the audio communication determined in the second audio format signal set is to be selected.
  • the audio format signal is based on scene HOA signal;
  • the audio format signal set determined in the second audio format signal set is a mono signal.
  • Another aspect of the present disclosure provides an audio signal format determining device, which is characterized in that the device includes:
  • Determining module configured to determine an audio format signal to be selected for audio communication from a set of audio format signals that can be provided by the terminal device based on the constraints of the terminal device, wherein the audio format signal to be selected is used to provide Audio service for audio communication, the set of audio format signals includes at least one audio format signal.
  • the device includes a processor and a memory.
  • a computer program is stored in the memory.
  • the processor executes the computer program stored in the memory so that the The device performs the method proposed in the embodiment of the above aspect.
  • a communication device provided by another embodiment of the present disclosure includes: a processor and an interface circuit
  • the interface circuit is used to receive code instructions and transmit them to the processor
  • the processor is configured to run the code instructions to perform the method proposed in the embodiment of one aspect.
  • a computer-readable storage medium provided by an embodiment of another aspect of the present disclosure is used to store instructions. When the instructions are executed, the method proposed by the embodiment of the present disclosure is implemented.
  • the audio format signal to be selected for audio communication is determined from the set of audio format signals that the terminal device can provide, where the audio format signal to be selected is used
  • the audio format signal set includes at least one audio format signal.
  • the audio format signal to be selected for audio communication can be determined through the constraints of the terminal device, which can improve the matching between the audio format signal and the constraints and reduce the need to directly determine the audio format without considering the constraints of the terminal device.
  • the signal makes the accuracy of audio format signal determination lower, the accuracy of audio format signal determination can be improved, and the audio service of audio communication can be improved.
  • the present disclosure provides a processing method for the situation of "audio signal format determination" to determine the audio signal format based on the constraints of the terminal device, improve the accuracy of the audio format signal determination, and provide the best solution based on the constraints. Excellent audio service.
  • Figure 1 is a schematic flowchart of a method for determining an audio signal format provided by an embodiment of the present disclosure
  • Figure 2 is a schematic flowchart of a method for determining an audio signal format provided by another embodiment of the present disclosure
  • FIG. 3 is a schematic flowchart of an audio signal format determination method provided by yet another embodiment of the present disclosure.
  • Figure 4 is a schematic diagram of an example of a terminal device provided by yet another embodiment of the present disclosure.
  • FIG. 5 is a schematic flowchart of an audio signal format determination method provided by yet another embodiment of the present disclosure.
  • Figure 6 is a schematic diagram of an example of a terminal device provided by yet another embodiment of the present disclosure.
  • FIG. 7 is a schematic flowchart of an audio signal format determination method provided by yet another embodiment of the present disclosure.
  • Figure 8 is a schematic flowchart of an audio signal format determination method provided by yet another embodiment of the present disclosure.
  • Figure 9 is a schematic flowchart of an audio signal format determination method provided by yet another embodiment of the present disclosure.
  • Figure 10 is a schematic flowchart of an audio signal format determination method provided by yet another embodiment of the present disclosure.
  • Figure 11 is a schematic flow chart of an audio signal format determination method provided by yet another embodiment of the present disclosure.
  • Figure 12 is a schematic flowchart of an audio signal format determination method provided by yet another embodiment of the present disclosure.
  • Figure 13 is a schematic structural diagram of an audio signal format determining device provided by an embodiment of the present disclosure.
  • Figure 14 is a block diagram of a terminal device provided by an embodiment of the present disclosure.
  • first, second, third, etc. may be used to describe various information in the embodiments of the present disclosure, the information should not be limited to these terms. These terms are only used to distinguish information of the same type from each other.
  • first information may also be called second information, and similarly, the second information may also be called first information.
  • the words "if” and “if” as used herein may be interpreted as “when” or “when” or “in response to determining.”
  • the network elements or network functions involved in the embodiments of the present disclosure can be implemented by independent hardware devices or by software in the hardware devices. This is not limited in the embodiments of the present disclosure.
  • the first generation mobile communication technology (1G) began in the 1980s.
  • 1G is the first generation of wireless cellular technology and is an analog mobile communication network.
  • 2G the terminal equipment is transferred from analog communication to digital communication.
  • the GSM network standard can be used, and the speech coder can use the adaptive multi-rate speech coder (Adaptive Multi-Rate, AMR), enhanced full rate codec (Enhanced Full Rate Speed Encoding, EFR), full rate Codec (Full Rate, FR), half rate codec (Half Rate, HR) communication provides single-channel narrowband voice services.
  • AMR adaptive multi-rate speech coder
  • EFR enhanced Full Rate Speed Encoding
  • EFR Full Rate Speed Encoding
  • Full Rate, FR full rate Codec
  • HR half rate codec
  • the third generation mobile communication technology (3rd-Generation, 3G) mobile communication system was proposed by the International Telecommunication Union ITU for the 2000 International Mobile Communications.
  • different operators can use different encoding methods.
  • Time Division-Synchronous Code Division Multiple Access (TD-SCDMA) can be used
  • Code Division Multiple Access 2000, CDMA2000 can also be used
  • wideband code division multiple access CDMA2000
  • W-CDMA Wideband Code Division Multiple Access
  • W-CDMA Wideband Code Division Multiple Access
  • its voice coder uses the new variable rate multi-mode wideband voice codec AMR-WB to provide single-channel broadband voice services.
  • the 4th generation mobile communication technology (4G) is a better improvement on 3G technology.
  • Both data and voice use the Internet Protocol (IP) to provide voice and audio.
  • IP Internet Protocol
  • the real-time high-definition voice HD+Voice service adopts the audio encoder (Enhanced Voice Services, EVS) codec that can take into account high-quality compression of voice and audio.
  • EVS
  • the voice and audio communication services provided above are extended from narrowband signals to ultra-wideband or even full-band services, but they are still monophonic services, and users continue to demand high-quality audio.
  • Increased, compared to mono audio, stereo audio has a sense of orientation and distribution for each sound source, and can improve clarity and clarity.
  • three signal formats such as channel-based signals, object-based signals, and scene-based signals can be Provide three-dimensional audio services.
  • the 3rd Generation Partnership Project (3GPP) Encoder and Media Working Group SA4 is standardizing the Immersive Voice and Audio Services (IVAS) codec that can support the above three signal formats. Codec requirements.
  • the specific signal formats among the three signal formats are: Channel-based signals include: mono signal, stereo signal (Stereo), binaural signal (Binaural), 5.1, 7.1 surround signal (Surround), 5.1. 4, 7.1.4 surround sound signal (Surround), where .4 represents the height channel signal (Height).
  • the scene-based signals are: first-order high-fidelity stereo surround sound (FOA), second-order high-fidelity stereo surround sound ( 2nd -Order Ambisonics, HOA2), third-order high-fidelity surround sound ( 3rd -Order Ambisonics, HOA3), object-based signal contains audio data and metadata, in addition, IVAS also supports auxiliary element-based Data spatial audio signal (Metadata-Assisted Spatial Audio, MASA).
  • Terminal devices that can support 3D audio services include but are not limited to mobile phones, computers, tablets, conference system equipment, AR/VR equipment, cars, etc.
  • Figure 1 is a schematic flowchart of an audio signal format determination method provided by an embodiment of the present disclosure. The method is executed by a terminal device. As shown in Figure 1, the method may include the following steps:
  • Step 101 Based on the constraints of the terminal device, determine the audio format signal to be used for audio communication from the set of audio format signals that can be provided by the terminal device, where the audio format signal to be selected is used to provide audio services for audio communication.
  • the audio format The set of signals includes at least one audio format signal.
  • the terminal device may be a device that provides voice and/or data connectivity to the user.
  • Terminal devices can communicate with one or more core networks via RAN (Radio Access Network).
  • Terminal devices can be IoT terminals, such as sensor devices, mobile phones (or "cellular" phones) and devices with The computer of the Internet of Things terminal, for example, can be a fixed, portable, pocket-sized, handheld, computer-built-in or vehicle-mounted device.
  • station STA
  • subscriber unit subscriber unit
  • subscriber station subscriber station
  • mobile station mobile station
  • remote station remote station
  • access point remote terminal
  • remote terminal access terminal
  • user device user terminal
  • user agent user agent
  • the terminal device may also be a device of an unmanned aerial vehicle.
  • the terminal device may also be a vehicle-mounted device, for example, it may be a driving computer with wireless communication function, or a wireless terminal connected to an external driving computer.
  • the terminal device may also be a roadside device, for example, it may be a street light, a signal light or other roadside device with wireless communication function.
  • the constraints of the terminal device include at least one of the following:
  • the network environment conditions where the terminal equipment is located
  • the power of the terminal device is the power of the terminal device.
  • the method further includes:
  • a set of audio format signals that can be provided by the terminal device is determined.
  • the set of audio format signals that can be provided by the terminal device is determined, including:
  • the audio format signal set that the terminal device can provide is the first audio format signal set
  • the audio format signals in the first audio format signal set include at least one of the following:
  • the set of audio format signals that can be provided by the terminal device is determined, including:
  • the audio format signal set that the terminal device can provide is the second audio format signal set
  • the audio format signals in the second audio format signal set include at least one of the following:
  • the audio format signal to be selected for audio communication is determined from the set of audio format signals that the terminal device can provide, including:
  • the constraint condition includes the hardware resource condition of the terminal device, the software resource condition of the terminal device, and the network where the terminal device is located. At least two of the environmental conditions and the power of the terminal device;
  • the audio format signal to be selected for audio communication is determined among the audio format signal sets that the terminal device can provide.
  • the audio format signal to be selected for audio communication is determined from the set of audio format signals that the terminal device can provide, including:
  • the audio format signal to be selected for audio communication determined in the first audio format signal set is based on the auxiliary metadata spatial audio signal;
  • the audio format signal to be selected for audio communication determined in the first audio format signal set is a mono signal .
  • the audio format signal to be selected for audio communication is determined among the audio format signal sets that the terminal device can provide, including:
  • the audio format signal to be selected for audio communication determined in the second audio format signal set is the scene-based HOA signal ;
  • the audio format signal to be selected for audio communication determined in the second audio format signal set is a mono signal .
  • the audio format signal to be selected for audio communication is determined from the set of audio format signals that the terminal device can provide, including:
  • the audio format signal to be selected for audio communication is determined from the set of audio format signals that the terminal device can provide.
  • the audio format signal to be selected for audio communication is determined from the set of audio format signals that the terminal device can provide, including:
  • the audio format signal to be selected for audio communication determined in the first audio format signal set is based on Spatial audio signal with auxiliary metadata;
  • the audio format signal to be selected for audio communication determined in the first audio format signal set is a single channel signal.
  • the audio format signal to be selected for audio communication is determined from the set of audio format signals that the terminal device can provide, including:
  • the audio format signal to be selected for audio communication determined in the second audio format signal set is based on Scene HOA signal;
  • the audio format signal to be selected for audio communication determined in the second audio format signal set is a single channel signal.
  • the audio format signal to be selected for audio communication is determined from the set of audio format signals that the terminal device can provide, where the audio format signal to be selected is used
  • the audio format signal set includes at least one audio format signal.
  • the audio format signal to be selected for audio communication can be determined through the constraints of the terminal device, which can improve the matching between the audio format signal and the constraints and reduce the need to directly determine the audio format without considering the constraints of the terminal device.
  • the signal makes the accuracy of audio format signal determination lower, the accuracy of audio format signal determination can be improved, and the audio service of audio communication can be improved.
  • the present disclosure provides a processing method for the situation of "audio signal format determination" to determine the audio signal format based on the constraints of the terminal device, improve the accuracy of the audio format signal determination, and provide the best audio signal format based on the constraints. Excellent audio service.
  • FIG. 2 is a schematic flowchart of an audio signal format determination method provided by an embodiment of the present disclosure. The method is executed by a terminal device. As shown in Figure 2, the method may include the following steps:
  • Step 201 Determine the set of audio format signals that the terminal device can provide based on the number and/or microphone types set by the terminal device;
  • Step 202 Based on the constraints of the terminal device, determine the audio format signal to be selected for audio communication from the set of audio format signals that the terminal device can provide, where the audio format signal to be selected is used to provide audio services for audio communication, and the audio format The set of signals includes at least one audio format signal.
  • the number of microphones refers to the number of microphones provided on the terminal device.
  • the microphone may be a microphone provided in the terminal device, or may be a microphone external to the terminal device.
  • the number of microphones does not refer to a fixed number. For example, when the number of microphones set by the terminal changes, the number of microphones can also change accordingly.
  • the microphone type refers to the type corresponding to at least one microphone provided by the terminal.
  • Different microphones can correspond to different microphone types, and different microphones can also correspond to the same microphone type.
  • the audio format signal set refers to a collection of at least one audio format signal.
  • the set of audio format signals does not refer to a fixed set. For example, when the number of audio format signals included in the audio format signal set changes, the audio format signal set may also change accordingly. For example, when the audio format signals included in the audio format signal set change, the audio format signal set may also change accordingly.
  • the constraint conditions refer to conditions used to determine the audio format signal to be selected for audio communication. This constraint does not specify a fixed condition.
  • the constraints may include at least one of the following:
  • the network environment conditions where the terminal equipment is located
  • the power of the terminal device is the power of the terminal device.
  • the terminal device can use a dedicated extraction module to obtain the constraints of the terminal device.
  • the terminal device may determine a set of audio format signals that the terminal device can provide. Based on the constraints of the terminal device, the terminal device can determine the audio format signal to be selected for audio communication from the set of audio format signals that the terminal device can provide, where the audio format signal to be selected is used to provide audio services for audio communication, and the audio format
  • the set of signals includes at least one audio format signal. Among them, different audio format signals can correspond to different numbers of audio data channels.
  • the set of audio format signals that the terminal device can provide is determined based on the number of microphones and/or the type of microphones set by the terminal device. Based on the constraints of the terminal device, the set of audio format signals that can be provided by the terminal device is determined.
  • the audio format signal to be selected for audio communication is determined from a set of audio format signals, wherein the audio format signal to be selected is used to provide audio services for audio communication, and the audio format signal set includes at least one audio format signal.
  • the audio format signal to be selected for audio communication can be determined through the constraints of the terminal device, which can improve the matching between the audio format signal and the constraints and reduce the need to directly determine the audio format without considering the constraints of the terminal device.
  • the signal makes the accuracy of audio format signal determination lower, the accuracy of audio format signal determination can be improved, and the audio service of audio communication can be improved.
  • the audio format signal set determination scheme is specifically described, which can improve the accuracy of audio format signal determination.
  • the present disclosure provides a processing method for the situation of "audio signal format determination" to determine the audio signal format based on the constraints of the terminal device, improve the accuracy of the audio format signal determination, and provide the best solution based on the constraints. Excellent audio service.
  • FIG 3 is a schematic flowchart of an audio signal format determination method provided by an embodiment of the present disclosure. The method is executed by a terminal device. As shown in Figure 3, the method may include the following steps:
  • Step S301 in response to the number of microphones set by the terminal device being at least three, determining that the audio format signal set that the terminal device can provide is the first audio format signal set;
  • the audio format signals in the first audio format signal set include at least one of the following:
  • Step 302 Based on the constraints of the terminal device, determine the audio format signal to be selected for audio communication from the first set of audio format signals that the terminal device can provide, where the audio format signal to be selected is used to provide audio services for audio communication,
  • the set of audio format signals includes at least one audio format signal.
  • the first audio format signal set refers to the audio format signal set determined by the terminal device when the number of microphones set by the terminal device is at least three.
  • the first of the first audio format signal sets is only used to distinguish from other audio format signal sets, and the first audio format signal set does not specifically refer to a fixed set.
  • the first audio format signal set may include a mono signal, a stereo signal, and a spatial audio signal based on auxiliary metadata, and the first audio format signal set may further include a mono signal, a stereo signal, and a multi-channel signal.
  • the number of microphones provided on the terminal device may be three, for example.
  • the three microphones may be, for example, three microphones built into the terminal device.
  • an example schematic diagram of the terminal device can be shown in Figure 4. Among them, one microphone is arranged on the upper part of the terminal device, and two microphones are arranged on the lower part of the terminal device.
  • the terminal device determines that the first set of audio format signals that the terminal device can provide may include, for example, a mono signal, a stereo signal, and a spatial audio signal based on auxiliary metadata.
  • the number of microphones provided on the terminal device may be one.
  • the terminal device determines that the first set of audio format signals that the terminal device can provide may, for example, only include mono signals.
  • the high-order Ambisonics HOA signal includes, but is not limited to, a second-order Ambisonics signal, a third-order Ambisonics signal, and the like.
  • the audio format signal to be selected for audio communication is determined from the first audio format signal set that can be provided by the terminal device, wherein the audio format signal to be selected is used to provide audio services for audio communication, and the audio format signal set includes at least one Audio format signal.
  • the audio format signal to be selected for audio communication can be determined through the constraints of the terminal device, which can improve the matching between the audio format signal and the constraints and reduce the need to directly determine the audio format without considering the constraints of the terminal device.
  • the signal makes the accuracy of audio format signal determination lower, the accuracy of audio format signal determination can be improved, and the audio service of audio communication can be improved.
  • the determination scheme of the first audio format signal set is specifically described, which can improve the accuracy of determining the audio format signal in the first audio format signal set that can be provided by the terminal device.
  • the present disclosure provides a processing method for the situation of "audio signal format determination" to determine the audio signal format based on the constraints of the terminal device, improve the accuracy of the audio format signal determination, and provide the best solution based on the constraints. Excellent audio service.
  • Figure 5 is a schematic flowchart of an audio signal format determination method provided by an embodiment of the present disclosure. The method is executed by a terminal device. As shown in Figure 5, the method may include the following steps:
  • Step 501 In response to the microphone set in the terminal device including at least one external audio collection device, determine that the audio format signal set that the terminal device can provide is the second audio format signal set;
  • the audio format signals in the second audio format signal set include at least one of the following:
  • Step 502 Based on the constraints of the terminal device, determine the audio format signal to be selected for audio communication from the second audio format signal set that the terminal device can provide, where the audio format signal to be selected is used to provide audio services for audio communication,
  • the set of audio format signals includes at least one audio format signal.
  • the number of microphones provided on the terminal device may be one.
  • an example schematic diagram of the terminal device can be shown in Figure 6.
  • at least one external audio collection device may be a collection ball, for example.
  • the terminal device determines that the audio format signal set that the terminal device can provide may be, for example, a second audio format signal set.
  • the second audio format signal set may include, for example, a mono signal, a stereo signal, a scene-based FOA signal, a scene-based HOA signal, and Spatial audio signals based on auxiliary metadata.
  • the microphone provided in the terminal device may include, for example, an external audio collection device and a microphone built into the terminal device.
  • the at least one external audio collection device may be, for example, a collection ball.
  • the terminal device determines that the audio format signal set that the terminal device can provide may be, for example, a second audio format signal set.
  • the audio format signal set that the terminal device can provide is the second audio format signal set, based on the terminal device Constraints, determine the audio format signal to be used for audio communication in the second audio format signal set that can be provided by the terminal device, where the audio format signal to be selected is used to provide audio services for audio communication, and the audio format signal set includes at least An audio format signal.
  • the audio format signal to be selected for audio communication can be determined through the constraints of the terminal device, which can improve the matching between the audio format signal and the constraints and reduce the need to directly determine the audio format without considering the constraints of the terminal device.
  • the signal makes the accuracy of audio format signal determination lower, the accuracy of audio format signal determination can be improved, and the audio service of audio communication can be improved.
  • the determination scheme of the second audio format signal set is specifically described, which can improve the accuracy of determining the audio format signal in the second audio format signal set that can be provided by the terminal device.
  • the present disclosure provides a processing method for the situation of "audio signal format determination" to determine the audio signal format based on the constraints of the terminal device, improve the accuracy of the audio format signal determination, and provide the best solution based on the constraints. Excellent audio service.
  • Figure 7 is a schematic flowchart of an audio signal format determination method provided by an embodiment of the present disclosure. The method is executed by a terminal device. As shown in Figure 7, the method may include the following steps:
  • Step 701 Obtain the score corresponding to the constraint condition based on the condition parameter corresponding to the constraint condition of the terminal device and the weight coefficient corresponding to the constraint condition, where the constraint condition includes the hardware resource condition of the terminal device, the software resource condition of the terminal device, and the condition of the terminal device.
  • the network environment conditions and the power of the terminal equipment;
  • Step 702 Based on the score, determine the audio format signal to be used for audio communication from the set of audio format signals that can be provided by the terminal device.
  • different constraints may correspond to different weight coefficients, or may correspond to the same weight coefficient.
  • the weight coefficient corresponding to the hardware resource conditions of the terminal device may be 0.8
  • the weight coefficient corresponding to the software resource conditions of the terminal device may be 0.6
  • the weight coefficient corresponding to the network environment conditions where the terminal device is located may be 0.7.
  • the weight coefficient corresponding to the hardware resource conditions of the terminal device may be 0.8
  • the weight coefficient corresponding to the software resource conditions of the terminal device may be 0.8
  • the weight coefficient corresponding to the network environment conditions where the terminal device is located may be 0.7.
  • condition parameter does not specifically refer to a certain fixed parameter.
  • the constraint condition may be, for example, the network environment condition in which the terminal device is located. When the network environment changes, the condition parameter may also change accordingly.
  • the score corresponding to the constraint condition does not specifically refer to a fixed score.
  • the score corresponding to the constraint can also change accordingly.
  • the score corresponding to the constraint can also change accordingly.
  • the terminal device when the terminal device obtains the score corresponding to the constraint condition, the terminal device may obtain the score corresponding to the constraint condition through the decision module.
  • the condition parameter corresponding to the hardware resource condition of the terminal device may be, for example, 0.7.
  • the condition parameter corresponding to the software resource condition of the terminal device may be, for example, 1.0.
  • the condition parameter corresponding to the network environment condition where the terminal device is located may be, for example, 0.7.
  • the condition parameter corresponding to the power of the device may be 1.0, for example.
  • the weight coefficient corresponding to the hardware resource conditions of the terminal device may be, for example, 0.8.
  • the weight coefficient corresponding to the software resource conditions of the terminal device may be, for example, 0.6.
  • the weight coefficient corresponding to the network environment conditions of the terminal device may be, for example, 0.7.
  • the weight coefficient corresponding to the electric quantity may be, for example, 0.9.
  • the score corresponding to the constraint obtained by the terminal device can be, for example:
  • the terminal device can determine the audio format signal to be used for audio communication among the set of audio format signals that the terminal device can provide.
  • the score corresponding to the constraint condition is obtained based on the condition parameter corresponding to the constraint condition of the terminal device and the weight coefficient corresponding to the constraint condition, where the constraint condition includes the hardware resource condition of the terminal device, Based on the score, the software resource conditions of the terminal device, the network environment conditions of the terminal device, and the power of the terminal device, the audio format signal to be used for audio communication is determined from the set of audio format signals that the terminal device can provide.
  • the audio format signal to be selected for audio communication can be determined through the condition parameters corresponding to the constraint conditions of the terminal device and the weight coefficient corresponding to the constraint conditions, which can improve the matching of the audio format signal and the constraint conditions, and reduce Directly determining the audio format signal without considering the constraints of the terminal device makes the accuracy of the audio format signal determination low, which can improve the accuracy of the audio format signal determination and improve the audio service of audio communication.
  • the present disclosure provides a processing method for the situation of "audio signal format determination" to determine the audio signal format based on the constraints of the terminal device, improve the accuracy of the audio format signal determination, and provide the best solution based on the constraints. Excellent audio service.
  • FIG 8 is a schematic flowchart of an audio signal format determination method provided by an embodiment of the present disclosure. The method is executed by a terminal device. As shown in Figure 8, the method may include the following steps:
  • Step 801 Obtain the score corresponding to the constraint condition based on the condition parameter corresponding to the constraint condition of the terminal device and the weight coefficient corresponding to the constraint condition, where the constraint condition includes the hardware resource condition of the terminal device, the software resource condition of the terminal device, and the condition of the terminal device. At least two of the network environment conditions and the power of the terminal device;
  • Step 802 In response to the audio format signal set that can be provided by the terminal device being the first audio format signal set, and the score being greater than the first score threshold, the audio format signal to be selected for audio communication determined in the first audio format signal set is based on Spatial audio signal with auxiliary metadata;
  • Step 803 In response to the audio format signal set that can be provided by the terminal device being the first audio format signal set, and the score being less than the first score threshold, the audio format signal to be selected for audio communication determined in the first audio format signal set is single. channel signal.
  • the first score threshold refers to the score threshold for the audio format signal to be selected for audio communication determined in the first audio format signal set.
  • the first among the first score thresholds is only used to distinguish it from the other score thresholds, and does not specifically refer to a fixed score threshold.
  • the terminal device when the score is equal to the first score threshold, can determine the audio format signal to be used for audio communication according to the solution with the score greater than the first score threshold, and can also determine the audio format signal to be used for audio communication according to the solution with the score less than the first score threshold.
  • the fractional threshold scheme determines the audio format signal to be used for audio communication. For example, in response to the audio format signal set that the terminal device can provide being the first audio format signal set, and the score is equal to the first score threshold, the terminal device determines the audio format signal to be used for audio communication in the first audio format signal set, such as It can be a spatial audio signal based on auxiliary metadata, or a mono signal.
  • the terminal device obtains a score corresponding to the constraint condition based on the condition parameter corresponding to the constraint condition of the terminal device and the weight coefficient corresponding to the constraint condition, which may be, for example, 2.55.
  • the first score threshold may be 2, for example.
  • the first set of audio format signals may include, for example, a mono signal, a stereo signal, and a spatial audio signal based on auxiliary metadata.
  • the audio format signal to be selected for audio communication determined by the terminal device in the first audio format signal set may be, for example, a spatial audio signal based on auxiliary metadata. That is to say, when the terminal device has a hardware processor with better performance, richer software resources, and if the network environment is better, the audio information output by the terminal device tends to use a larger number of audio data channels. Audio format signal.
  • the terminal device obtains a score corresponding to the constraint condition based on the condition parameter corresponding to the constraint condition of the terminal device and the weight coefficient corresponding to the constraint condition, which may be, for example, 1.55.
  • the first score threshold may be 2, for example.
  • the first set of audio format signals may include, for example, a mono signal, a stereo signal, and a spatial audio signal based on auxiliary metadata.
  • the audio format signal to be selected for audio communication determined by the terminal device in the first audio format signal set may be, for example, a mono signal.
  • the terminal device has a hardware processor with poor performance and poor software resources. At the same time, if the network environment is poor, the audio information output by the terminal device tends to use an audio format signal with a smaller number of audio data channels. .
  • the terminal device obtains a score corresponding to the constraint condition based on the condition parameter corresponding to the constraint condition of the terminal device and the weight coefficient corresponding to the constraint condition, which may be, for example, 2.
  • the first score threshold may be 2, for example.
  • the first set of audio format signals may include, for example, a mono signal, a stereo signal, and a spatial audio signal based on auxiliary metadata.
  • the audio format signal to be selected for audio communication determined by the terminal device in the first audio format signal set may be, for example, a mono signal or a spatial audio signal based on auxiliary metadata.
  • the first score threshold may be multiple score thresholds. With different score thresholds, the audio format signals to be selected for audio communication determined in the first audio format signal set are also different.
  • the first score threshold may include 1.5 and 2, for example.
  • the first set of audio format signals may include, for example, a mono signal, a stereo signal, and a spatial audio signal based on auxiliary metadata.
  • the terminal device obtains a score corresponding to the constraint condition based on the condition parameter corresponding to the constraint condition of the terminal device and the weight coefficient corresponding to the constraint condition, for example, 1.85, the terminal device determines in the first audio format signal set to select
  • the audio format signal may be a stereo signal, for example.
  • the terminal device determines in the first audio format signal set the audio communication to be used.
  • the audio format signal may be a mono signal, for example.
  • the terminal device determines in the first audio format signal set the audio communication to be used.
  • the audio format signal may be, for example, a spatial audio signal based on auxiliary metadata.
  • the score corresponding to the constraint condition is obtained based on the condition parameter corresponding to the constraint condition of the terminal device and the weight coefficient corresponding to the constraint condition, where the constraint condition includes the hardware resource condition of the terminal device, Based on the score, the software resource conditions of the terminal device, the network environment conditions of the terminal device, and the power of the terminal device, the audio format signal to be used for audio communication is determined from the set of audio format signals that the terminal device can provide.
  • the audio format signal to be selected for audio communication can be determined through the condition parameters corresponding to the constraint conditions of the terminal device and the weight coefficient corresponding to the constraint conditions, which can improve the matching of the audio format signal and the constraint conditions, and reduce Directly determining the audio format signal without considering the constraints of the terminal device makes the accuracy of the audio format signal determination low, which can improve the accuracy of the audio format signal determination and improve the audio service of audio communication.
  • the embodiments of the present disclosure specifically illustrate the solution of determining the audio format signal to be used for audio communication based on scores in the first audio format signal set, which can improve the accuracy of audio format signal determination.
  • the present disclosure provides a processing method for the situation of "audio signal format determination" to determine the audio signal format based on the constraints of the terminal device, improve the accuracy of the audio format signal determination, and provide the best solution based on the constraints. Excellent audio service.
  • Figure 9 is a schematic flowchart of an audio signal format determination method provided by an embodiment of the present disclosure. The method is executed by a terminal device. As shown in Figure 9, the method may include the following steps:
  • Step 901 Obtain the score corresponding to the constraint condition based on the condition parameter corresponding to the constraint condition of the terminal device and the weight coefficient corresponding to the constraint condition, where the constraint condition includes the hardware resource condition of the terminal device, the software resource condition of the terminal device, and the condition of the terminal device. At least two of the network environment conditions and the power of the terminal device;
  • Step 902 In response to the audio format signal set that the terminal device can provide being the second audio format signal set, and the score is greater than the second score threshold, the audio format signal to be selected for audio communication determined in the second audio format signal set is based on Scene HOA signal;
  • Step 903 In response to the audio format signal set that can be provided by the terminal device being the second audio format signal set, and the score being less than the second score threshold, the audio format signal to be selected for audio communication determined in the second audio format signal set is single. channel signal.
  • the second score threshold refers to the score threshold for the audio format signal to be selected for the audio communication determined in the second audio format signal set.
  • the second of the second score thresholds is only used to distinguish it from the other score thresholds, and does not specifically refer to a fixed score threshold.
  • the second score threshold may be the same as the first score threshold, or may be different from the first score threshold.
  • the terminal device when the score is equal to the second score threshold, can determine the audio format signal to be used for audio communication according to the solution with the score greater than the second score threshold, and can also determine the audio format signal to be used for audio communication according to the solution with the score less than the second score threshold.
  • the binary threshold scheme determines the audio format signal to be used for audio communication. For example, in response to the audio format signal set that the terminal device can provide being the second audio format signal set, and the score is equal to the second score threshold, the terminal device determines the audio format signal to be used for audio communication in the second audio format signal set, such as It may be a mono signal, for example, it may also be a scene-based HOA signal.
  • the terminal device obtains a score corresponding to the constraint condition based on the condition parameter corresponding to the constraint condition of the terminal device and the weight coefficient corresponding to the constraint condition, which may be, for example, 2.55.
  • the second score threshold may be 2, for example.
  • the second set of audio format signals may include, for example, a mono signal, a stereo signal, a scene-based FOA signal, a scene-based HOA signal, and an auxiliary metadata-based spatial audio signal.
  • the audio format signal to be selected for audio communication determined by the terminal device in the second audio format signal set may be, for example, a scene-based HOA signal.
  • the terminal device obtains a score corresponding to the constraint condition based on the condition parameter corresponding to the constraint condition of the terminal device and the weight coefficient corresponding to the constraint condition, which may be, for example, 1.55.
  • the second score threshold may be 2, for example.
  • the second set of audio format signals may include, for example, a mono signal, a stereo signal, a scene-based FOA signal, a scene-based HOA signal, and an auxiliary metadata-based spatial audio signal.
  • the audio format signal to be selected for audio communication determined by the terminal device in the second audio format signal set may be, for example, a mono signal.
  • the terminal device obtains a score corresponding to the constraint condition based on the condition parameter corresponding to the constraint condition of the terminal device and the weight coefficient corresponding to the constraint condition, which may be, for example, 2.
  • the second score threshold may be 2, for example.
  • the second set of audio format signals may include, for example, a mono signal, a stereo signal, a scene-based FOA signal, a scene-based HOA signal, and an auxiliary metadata-based spatial audio signal.
  • the audio format signal to be selected for audio communication determined by the terminal device in the second audio format signal set may be, for example, a mono signal or a scene-based HOA signal.
  • the second score threshold may be multiple score thresholds. With different score thresholds, the audio format signals to be selected for audio communication determined in the second audio format signal set are also different.
  • the second score threshold may include 1.2, 1.5, 1.8, and 2, for example.
  • the second set of audio format signals may include, for example, a mono signal, a stereo signal, a scene-based FOA signal, a scene-based HOA signal, and an auxiliary metadata-based spatial audio signal.
  • the terminal device determines in the second audio format signal set to select
  • the audio format signal may be, for example, a scene-based FOA signal.
  • the terminal device determines in the second audio format signal set the audio communication to be used.
  • the audio format signal may be a stereo signal, for example.
  • the terminal device determines in the second audio format signal set the audio communication to be used.
  • the audio format signal may be a scene based HOA signal, for example.
  • the audio format signal may be a mono signal, for example.
  • the terminal device determines in the second audio format signal set the audio communication to be used.
  • the audio format signal may be, for example, a spatial audio signal based on auxiliary metadata.
  • the score corresponding to the constraint condition is obtained based on the condition parameter corresponding to the constraint condition of the terminal device and the weight coefficient corresponding to the constraint condition, where the constraint condition includes the hardware resource condition of the terminal device, Based on the score, the software resource conditions of the terminal device, the network environment conditions of the terminal device, and the power of the terminal device, the audio format signal to be used for audio communication is determined from the set of audio format signals that the terminal device can provide.
  • the audio format signal to be selected for audio communication can be determined through the condition parameters corresponding to the constraint conditions of the terminal device and the weight coefficient corresponding to the constraint conditions, which can improve the matching of the audio format signal and the constraint conditions, and reduce Directly determining the audio format signal without considering the constraints of the terminal device makes the accuracy of the audio format signal determination low, which can improve the accuracy of the audio format signal determination and improve the audio service of audio communication.
  • the embodiments of the present disclosure specifically illustrate the solution of determining the audio format signal to be used for audio communication based on scores in the second audio format signal set, which can improve the accuracy of audio format signal determination.
  • the present disclosure provides a processing method for the situation of "audio signal format determination" to determine the audio signal format based on the constraints of the terminal device, improve the accuracy of the audio format signal determination, and provide the best solution based on the constraints. Excellent audio service.
  • Figure 10 is a schematic flowchart of an audio signal format determination method provided by an embodiment of the present disclosure. The method is executed by a terminal device. As shown in Figure 10, the method may include the following steps:
  • Step 1001 Based on the power of the terminal device, determine the audio format signal to be used for audio communication from the set of audio format signals that the terminal device can provide.
  • the constraint condition of the terminal device may be, for example, the power of the terminal device. Based on the power of the terminal device, the terminal device may determine the audio format signal to be selected for audio communication among the set of audio format signals that the terminal device can provide. audio format signal.
  • the audio format signal to be selected for audio communication is determined from the set of audio format signals that the terminal device can provide.
  • the audio format signal to be selected for audio communication can be determined through the power of the terminal device, which can improve the matching of the audio format signal and the constraints and reduce the need to directly determine the audio format signal without considering the constraints of the terminal device.
  • the accuracy of determining the audio format signal is low, the accuracy of determining the audio format signal can be improved, and the audio service of audio communication can be improved.
  • the present disclosure provides a processing method for the situation of "audio signal format determination" to determine the audio signal format based on the constraints of the terminal device, improve the accuracy of the audio format signal determination, and provide the best solution based on the constraints. Excellent audio service.
  • FIG 11 is a schematic flowchart of an audio signal format determination method provided by an embodiment of the present disclosure. The method is executed by a terminal device. As shown in Figure 11, the method may include the following steps:
  • Step 1101. In response to the audio format signal set that the terminal device can provide being the first audio format signal set, and the power of the terminal device is greater than the first power threshold, the audio format to be selected for audio communication determined in the first audio format signal set
  • the signal is a spatial audio signal based on auxiliary metadata
  • Step 1102. In response to the audio format signal set that the terminal device can provide being the first audio format signal set, and the power of the terminal device is less than the first power threshold, the audio format to be selected for audio communication determined in the first audio format signal set
  • the signal is a mono signal
  • Step 1103 Use the audio format signal to be selected for audio communication determined in the first audio format signal set to provide an audio service for audio communication.
  • the first power threshold refers to the power threshold used to determine the audio format signal to be selected for audio communication in the first audio format signal set.
  • the first among the first power thresholds is only used to distinguish it from other power thresholds, and does not specifically refer to a fixed power threshold.
  • the terminal device when the power of the terminal device is equal to the first fractional threshold, the terminal device can determine the audio format signal to be used for audio communication according to a solution in which the power of the terminal device is greater than the first power threshold, The audio format signal to be used for audio communication may also be determined based on the solution that the power of the terminal device is less than the first power threshold. For example, in response to the audio format signal set that the terminal device can provide being the first audio format signal set, and the power of the terminal device is equal to the first power threshold, the terminal device determines the audio to be used for audio communication in the first audio format signal set.
  • the format signal may be, for example, a mono signal, or may be a spatial audio signal based on auxiliary metadata.
  • the first set of audio format signals may include, for example, a mono signal, a stereo signal, and a spatial audio signal based on auxiliary metadata.
  • the first power threshold may be, for example, 20% of the total power of the terminal device. The total power is the power when the battery of the terminal device is fully charged.
  • the audio to be selected for the audio communication determined in the first audio format signal set is
  • the format signal can be a mono signal, which can extend the battery life of the terminal device.
  • the first set of audio format signals may include, for example, a mono signal, a stereo signal, and a spatial audio signal based on auxiliary metadata.
  • the first power threshold may be, for example, 20% of the total power of the terminal device. The total power is the power when the battery of the terminal device is fully charged.
  • the audio to be selected for audio communication determined in the first audio format signal set The format signal may be, for example, a spatial audio signal based on auxiliary metadata.
  • the first power threshold may be multiple power thresholds. With different power thresholds, the audio format signals to be selected for audio communication determined in the first audio format signal set are also different.
  • the first power threshold may include 20% and 40%, for example.
  • the first set of audio format signals may include, for example, a mono signal, a stereo signal, and a spatial audio signal based on auxiliary metadata.
  • the power of the terminal device may be, for example, 30%, and the audio format signal to be selected for audio communication determined by the terminal device in the first set of audio format signals may be, for example, a stereo signal.
  • the power of the terminal device may be, for example, 15%, and the audio format signal to be selected for audio communication determined by the terminal device in the first set of audio format signals may be, for example, a mono signal.
  • the power of the terminal device may be, for example, 50%, and the audio format signal to be selected for audio communication determined by the terminal device in the first set of audio format signals may be, for example, a spatial audio signal based on auxiliary metadata.
  • the audio format signal to be selected for audio communication is determined from the set of audio format signals that the terminal device can provide.
  • the audio format signal to be selected for audio communication can be determined through the power of the terminal device, which can improve the matching of the audio format signal and the constraints and reduce the need to directly determine the audio format signal without considering the constraints of the terminal device.
  • the accuracy of determining the audio format signal is low, the accuracy of determining the audio format signal can be improved, and the audio service of audio communication can be improved.
  • the embodiments of the present disclosure specifically illustrate the solution of determining the audio format signal to be used for audio communication based on the power of the terminal device in the first audio format signal set, which can improve the accuracy of audio format signal determination.
  • the present disclosure provides a processing method for the situation of "audio signal format determination" to determine the audio signal format based on the constraints of the terminal device, improve the accuracy of the audio format signal determination, and provide the best solution based on the constraints. Excellent audio service.
  • Figure 12 is a schematic flowchart of an audio signal format determination method provided by an embodiment of the present disclosure. The method is executed by a terminal device. As shown in Figure 12, the method may include the following steps:
  • Step 1201 In response to the audio format signal set that the terminal device can provide being the second audio format signal set, and the power of the terminal device is greater than the second power threshold, the audio format to be selected for audio communication determined in the second audio format signal set
  • the signal is a scene-based HOA signal
  • Step 1202 In response to the audio format signal set that the terminal device can provide being the second audio format signal set, and the power of the terminal device is less than the second power threshold, the audio format to be selected for audio communication determined in the second audio format signal set
  • the signal is a mono signal
  • Step 1203 Use the audio format signal to be selected for audio communication determined in the second audio format signal set to provide an audio service for audio communication.
  • the second power threshold refers to the power threshold for the audio format signal to be selected for audio communication determined in the first audio format signal set.
  • the second of the second power thresholds is only used to distinguish it from other power thresholds, and does not specifically refer to a fixed power threshold.
  • the second power threshold may be the same as the first power threshold, or may be different from the first power threshold.
  • the terminal device when the power of the terminal device is equal to the second power threshold, the terminal device can determine the audio format signal to be used for audio communication according to a solution where the power of the terminal device is greater than the second power threshold, The audio format signal to be used for audio communication may also be determined according to a solution in which the power of the terminal device is less than the second power threshold. For example, in response to the audio format signal set that the terminal device can provide being the second audio format signal set, and the power of the terminal device is equal to the second power threshold, the terminal device determines the audio to be used for audio communication in the second audio format signal set.
  • the format signal may be, for example, a mono signal, or may also be a scene-based HOA signal.
  • the second audio format signal set may include, for example, a mono signal, a stereo signal, a scene-based FOA signal, a scene-based HOA signal, and an auxiliary metadata-based spatial audio signal.
  • the second power threshold may be, for example, 20% of the total power of the terminal device. The total power is the power when the battery of the terminal device is fully charged.
  • the audio to be selected for the audio communication determined in the second audio format signal set The format signal is a mono signal, which can extend the battery life of the terminal device.
  • the second audio format signal set may include, for example, a mono signal, a stereo signal, a scene-based FOA signal, a scene-based HOA signal, and an auxiliary metadata-based spatial audio signal.
  • the second power threshold may be, for example, 20% of the total power of the terminal device. The total power is the power when the battery of the terminal device is fully charged.
  • the audio to be selected for audio communication determined in the second audio format signal set The format signal is a scene-based HOA signal.
  • the second power threshold may be multiple power thresholds. With different power thresholds, the audio format signals to be selected for audio communication determined in the second audio format signal set are also different.
  • the second power threshold may include 20%, 40%, 60% and 75%, for example.
  • the second set of audio format signals may include, for example, a mono signal, a stereo signal, a scene-based FOA signal, a scene-based HOA signal, and an auxiliary metadata-based spatial audio signal.
  • the power of the terminal device may be, for example, 15%, and the audio format signal to be selected for audio communication determined by the terminal device in the second audio format signal set may be, for example, a mono signal.
  • the power of the terminal device may be, for example, 25%, and the audio format signal to be selected for audio communication determined by the terminal device in the second audio format signal set may be, for example, a stereo signal.
  • the power of the terminal device may be, for example, 55%, and the audio format signal to be selected for audio communication determined by the terminal device in the second audio format signal set may be, for example, a spatial audio signal based on auxiliary metadata.
  • the power of the terminal device may be, for example, 65%, and the audio format signal to be selected for audio communication determined by the terminal device in the second audio format signal set may be, for example, a scene-based FOA signal.
  • the power of the terminal device may be, for example, 85%, and the audio format signal to be selected for audio communication determined by the terminal device in the second audio format signal set may be, for example, a scene-based HOA signal.
  • the audio format signal to be selected for audio communication is determined from the set of audio format signals that the terminal device can provide.
  • the audio format signal to be selected for audio communication can be determined through the power of the terminal device, which can improve the matching of the audio format signal and the constraints and reduce the need to directly determine the audio format signal without considering the constraints of the terminal device.
  • the accuracy of determining the audio format signal is low, the accuracy of determining the audio format signal can be improved, and the audio service of audio communication can be improved.
  • the embodiments of the present disclosure specifically illustrate the solution of determining the audio format signal to be used for audio communication based on the power of the terminal device in the second audio format signal set, which can improve the accuracy of audio format signal determination.
  • the present disclosure provides a processing method for the situation of "audio signal format determination" to determine the audio signal format based on the constraints of the terminal device, improve the accuracy of the audio format signal determination, and provide the best solution based on the constraints. Excellent audio service.
  • Figure 13 is a schematic structural diagram of an audio signal format determination device provided by an embodiment of the present disclosure. As shown in Figure 13, the device 1300 may include:
  • the determination module 1301 is used to determine the audio format signal to be selected for audio communication from the set of audio format signals that can be provided by the terminal device based on the constraints of the terminal device, where the audio format signal to be selected is used to provide audio services for audio communication.
  • the audio format signal set includes at least one audio format signal.
  • the audio format to be selected for audio communication can be determined from the set of audio format signals that the terminal device can provide based on the constraints of the terminal device.
  • signals wherein the audio format signals to be selected are used to provide audio services for audio communication
  • the audio format signal set includes at least one audio format signal.
  • the audio format signal to be selected for audio communication can be determined through the constraints of the terminal device, which can improve the matching between the audio format signal and the constraints and reduce the need to directly determine the audio format without considering the constraints of the terminal device.
  • the signal makes the accuracy of audio format signal determination lower, the accuracy of audio format signal determination can be improved, and the audio service of audio communication can be improved.
  • the present disclosure provides a processing device for the situation of "audio signal format determination" to determine the audio signal format based on the constraints of the terminal device, improve the accuracy of the audio format signal determination, and provide the most optimal audio signal format based on the constraints. Excellent audio service.
  • the constraints of the terminal device include at least one of the following:
  • the network environment conditions where the terminal equipment is located
  • the power of the terminal device is the power of the terminal device.
  • the determining module 1701 is also used to:
  • a set of audio format signals that can be provided by the terminal device is determined.
  • the determination module 1701 is used to determine the set of audio format signals that the terminal device can provide based on the number and/or microphone types set by the terminal device, specifically for:
  • the audio format signal set that the terminal device can provide is the first audio format signal set
  • the audio format signals in the first audio format signal set include at least one of the following:
  • the determination module 1701 is used to determine the set of audio format signals that the terminal device can provide based on the number and/or microphone types set by the terminal device, specifically for:
  • the audio format signal set that the terminal device can provide is the second audio format signal set
  • the audio format signals in the second audio format signal set include at least one of the following:
  • the determination module 1701 is configured to determine the audio format signal to be used for audio communication among the set of audio format signals that can be provided by the terminal device based on the constraints of the terminal device. Specifically, :
  • the constraint condition includes the hardware resource condition of the terminal device, the software resource condition of the terminal device, and the network where the terminal device is located. Environmental conditions, power supply of terminal equipment;
  • the audio format signal to be selected for audio communication is determined among the audio format signal sets that the terminal device can provide.
  • the determination module 1701 is used to determine, based on the score, the audio format signal to be used for audio communication among the set of audio format signals that can be provided by the terminal device, and is specifically used to:
  • the audio format signal to be selected for audio communication determined in the first audio format signal set is based on auxiliary metadata spatial audio signal
  • the audio format signal to be selected for audio communication determined in the first audio format signal set is a mono signal .
  • the determination module 1701 is used to determine, based on the score, the audio format signal to be used for audio communication among the set of audio format signals that can be provided by the terminal device, and is specifically used to:
  • the audio format signal to be selected for audio communication determined in the second audio format signal set is the scene-based HOA signal ;
  • the audio format signal to be selected for audio communication determined in the second audio format signal set is a mono signal.
  • the determination module 1701 is configured to determine the audio format signal to be used for audio communication among the set of audio format signals that can be provided by the terminal device based on the constraints of the terminal device. Specifically, :
  • the audio format signal to be selected for audio communication is determined from the set of audio format signals that the terminal device can provide.
  • the determination module 1701 is used to determine the audio format signal to be used for audio communication from the set of audio format signals that the terminal device can provide based on the power of the terminal device, and is specifically used to:
  • the audio format signal to be selected for audio communication determined in the first audio format signal set is based on Spatial audio signal with auxiliary metadata;
  • the audio format signal to be selected for audio communication determined in the first audio format signal set is a single channel signal.
  • the determination module 1701 is used to determine the audio format signal to be used for audio communication from the set of audio format signals that the terminal device can provide based on the power of the terminal device, and is specifically used to:
  • the audio format signal to be selected for audio communication determined in the second audio format signal set is based on Scene HOA signal;
  • the audio format signal to be selected for audio communication determined in the second audio format signal set is a single channel signal.
  • Figure 14 is a block diagram of a terminal device UE1400 provided by an embodiment of the present disclosure.
  • the UE1400 may be a mobile phone, a computer, a digital broadcast terminal device, a messaging device, a game console, a tablet device, a medical device, a fitness device, a personal digital assistant, etc.
  • the UE 1400 may include at least one of the following components: a processing component 1402 , a memory 1404 , a power supply component 1406 , a multimedia component 1408 , an audio component 1410 , an input/output (I/O) interface 1412 , a sensor component 1414 , and a communication component. 1416.
  • Processing component 1402 generally controls the overall operations of UE 1400, such as operations associated with display, phone calls, data communications, camera operations, and recording operations.
  • the processing component 1402 may include at least one processor 1420 to execute instructions to complete all or part of the steps of the above method. Additionally, processing component 1402 may include at least one module that facilitates interaction between processing component 1402 and other components. For example, processing component 1402 may include a multimedia module to facilitate interaction between multimedia component 1408 and processing component 1402.
  • Memory 1404 is configured to store various types of data to support operations at UE 1400 . Examples of this data include instructions for any application or method operating on the UE1400, contact data, phonebook data, messages, pictures, videos, etc.
  • Memory 1404 may be implemented by any type of volatile or non-volatile storage device, or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EEPROM), Programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, magnetic or optical disk.
  • SRAM static random access memory
  • EEPROM electrically erasable programmable read-only memory
  • EEPROM erasable programmable read-only memory
  • EPROM Programmable read-only memory
  • PROM programmable read-only memory
  • ROM read-only memory
  • magnetic memory flash memory, magnetic or optical disk.
  • Power supply component 1406 provides power to various components of UE 1400.
  • Power component 1406 may include a power management system, at least one power supply, and other components associated with generating, managing, and distributing power to UE 1400 .
  • Multimedia component 1408 includes a screen that provides an output interface between the UE 1400 and the user.
  • the screen may include a liquid crystal display (LCD) and a touch panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive input signals from the user.
  • the touch panel includes at least one touch sensor to sense touches, slides, and gestures on the touch panel. The touch sensor may not only sense the boundary of the touch or sliding operation, but also detect the wake-up time and pressure related to the touch or sliding operation.
  • multimedia component 1408 includes a front-facing camera and/or a rear-facing camera. When the UE1400 is in an operating mode, such as shooting mode or video mode, the front camera and/or rear camera can receive external multimedia data.
  • Each front-facing camera and rear-facing camera can be a fixed optical lens system or have a focal length and optical zoom capabilities.
  • Audio component 1410 is configured to output and/or input audio signals.
  • audio component 1410 includes a microphone (MIC) configured to receive external audio signals when UE 1400 is in operating modes, such as call mode, recording mode, and voice recognition mode. The received audio signals may be further stored in memory 1404 or sent via communications component 1416 .
  • audio component 1410 also includes a speaker for outputting audio signals.
  • the I/O interface 1412 provides an interface between the processing component 1402 and a peripheral interface module.
  • the peripheral interface module may be a keyboard, a click wheel, a button, etc. These buttons may include, but are not limited to: Home button, Volume buttons, Start button, and Lock button.
  • Sensor component 1414 includes at least one sensor for providing various aspects of status assessment for UE 1400 .
  • the sensor component 1414 can detect the on/off state of the device 1400, the relative positioning of components, such as the display and keypad of the UE 1400, the sensor component 1414 can also detect the position change of the UE 1400 or a component of the UE 1400, the user The presence or absence of contact with the UE1400, the orientation or acceleration/deceleration of the UE1400 and the temperature change of the UE1400.
  • Sensor assembly 1414 may include a proximity sensor configured to detect the presence of nearby objects without any physical contact.
  • Sensor assembly 1414 may also include a light sensor, such as a CMOS or CCD image sensor, for use in imaging applications.
  • the sensor component 1414 may also include an acceleration sensor, a gyroscope sensor, a magnetic sensor, a pressure sensor, or a temperature sensor.
  • Communication component 1416 is configured to facilitate wired or wireless communication between UE 1400 and other devices.
  • UE1400 can access wireless networks based on communication standards, such as WiFi, 2G or 3G, or a combination thereof.
  • the communication component 1416 receives broadcast signals or broadcast related information from an external broadcast management system via a broadcast channel.
  • the communications component 1416 also includes a near field communications (NFC) module to facilitate short-range communications.
  • NFC near field communications
  • the NFC module can be implemented based on radio frequency identification (RFID) technology, infrared data association (IrDA) technology, ultra-wideband (UWB) technology, Bluetooth (BT) technology and other technologies.
  • RFID radio frequency identification
  • IrDA infrared data association
  • UWB ultra-wideband
  • Bluetooth Bluetooth
  • UE 1400 may be configured by at least one application specific integrated circuit (ASIC), digital signal processor (DSP), digital signal processing device (DSPD), programmable logic device (PLD), field programmable gate array ( FPGA), controller, microcontroller, microprocessor or other electronic component implementation for executing the above method.
  • ASIC application specific integrated circuit
  • DSP digital signal processor
  • DSPD digital signal processing device
  • PLD programmable logic device
  • FPGA field programmable gate array
  • controller microcontroller, microprocessor or other electronic component implementation for executing the above method.
  • the methods provided by the embodiments of the present disclosure are introduced from the perspectives of network side equipment and UE respectively.
  • the network side device and the UE may include a hardware structure and a software module to implement the above functions in the form of a hardware structure, a software module, or a hardware structure plus a software module.
  • a certain function among the above functions can be executed by a hardware structure, a software module, or a hardware structure plus a software module.
  • the methods provided by the embodiments of the present disclosure are introduced from the perspectives of network side equipment and UE respectively.
  • the network side device and the UE may include a hardware structure and a software module to implement the above functions in the form of a hardware structure, a software module, or a hardware structure plus a software module.
  • a certain function among the above functions can be executed by a hardware structure, a software module, or a hardware structure plus a software module.
  • the communication device may include a transceiver module and a processing module.
  • the transceiver module may include a sending module and/or a receiving module.
  • the sending module is used to implement the sending function
  • the receiving module is used to implement the receiving function.
  • the transceiving module may implement the sending function and/or the receiving function.
  • the communication device may be a terminal device (such as the terminal device in the foregoing method embodiment), a device in the terminal device, or a device that can be used in conjunction with the terminal device.
  • the communication device may be a network device, a device in a network device, or a device that can be used in conjunction with the network device.
  • the communication device may be a network device, or may be a terminal device (such as the terminal device in the foregoing method embodiment), or may be a chip, chip system, or processor that supports the network device to implement the above method, or may be a terminal device that supports A chip, chip system, or processor that implements the above method.
  • the device can be used to implement the method described in the above method embodiment. For details, please refer to the description in the above method embodiment.
  • a communications device may include one or more processors.
  • the processor may be a general-purpose processor or a special-purpose processor, etc.
  • it can be a baseband processor or a central processing unit.
  • the baseband processor can be used to process communication protocols and communication data
  • the central processor can be used to control and execute communication devices (such as network side equipment, baseband chips, terminal equipment, terminal equipment chips, DU or CU, etc.)
  • a computer program processes data for a computer program.
  • the communication device may also include one or more memories, on which a computer program may be stored, and the processor executes the computer program, so that the communication device performs the method described in the above method embodiment.
  • data may also be stored in the memory. Communication devices and memory can be provided separately or integrated together.
  • the communication device may also include a transceiver and an antenna.
  • the transceiver can be called a transceiver unit, a transceiver, or a transceiver circuit, etc., and is used to implement transceiver functions.
  • the transceiver can include a receiver and a transmitter.
  • the receiver can be called a receiver or a receiving circuit, etc., and is used to implement the receiving function;
  • the transmitter can be called a transmitter or a transmitting circuit, etc., and is used to implement the transmitting function.
  • one or more interface circuits may also be included in the communication device.
  • Interface circuitry is used to receive code instructions and transmit them to the processor.
  • the processor executes the code instructions to cause the communication device to perform the method described in the above method embodiment.
  • the communication device is a terminal device (such as the terminal device in the foregoing method embodiment): the processor is configured to execute the method shown in any one of Figures 1-12.
  • a transceiver for implementing receiving and transmitting functions may be included in the processor.
  • the transceiver may be a transceiver circuit, an interface, or an interface circuit.
  • the transceiver circuits, interfaces or interface circuits used to implement the receiving and transmitting functions can be separate or integrated together.
  • the above-mentioned transceiver circuit, interface or interface circuit can be used for reading and writing codes/data, or the above-mentioned transceiver circuit, interface or interface circuit can be used for signal transmission or transfer.
  • the processor may store a computer program, and the computer program runs on the processor, which can cause the communication device to perform the method described in the above method embodiment.
  • the computer program may be embedded in the processor, in which case the processor may be implemented in hardware.
  • the communication device may include a circuit, and the circuit may implement the functions of sending or receiving or communicating in the foregoing method embodiments.
  • the processors and transceivers described in this disclosure may be implemented on integrated circuits (ICs), analog ICs, radio frequency integrated circuits (RFICs), mixed signal ICs, application specific integrated circuits (ASICs), printed circuit boards ( printed circuit board (PCB), electronic equipment, etc.
  • the processor and transceiver can also be manufactured using various IC process technologies, such as complementary metal oxide semiconductor (CMOS), n-type metal oxide-semiconductor (NMOS), P-type Metal oxide semiconductor (positive channel metal oxide semiconductor, PMOS), bipolar junction transistor (BJT), bipolar CMOS (BiCMOS), silicon germanium (SiGe), gallium arsenide (GaAs), etc.
  • CMOS complementary metal oxide semiconductor
  • NMOS n-type metal oxide-semiconductor
  • PMOS P-type Metal oxide semiconductor
  • BJT bipolar junction transistor
  • BiCMOS bipolar CMOS
  • SiGe silicon germanium
  • GaAs gallium arsenide
  • the communication device described in the above embodiments may be a network device or a terminal device (such as the terminal device in the foregoing method embodiment), but the scope of the communication device described in the present disclosure is not limited thereto, and the structure of the communication device may not be limited to limits.
  • the communication device may be a stand-alone device or may be part of a larger device.
  • the communication device may be:
  • the IC collection may also include storage components for storing data and computer programs;
  • the communication device may be a chip or a system on a chip
  • the chip includes a processor and an interface.
  • the number of processors may be one or more, and the number of interfaces may be multiple.
  • the chip also includes a memory for storing necessary computer programs and data.
  • the present disclosure also provides a readable storage medium on which instructions are stored, and when the instructions are executed by a computer, the functions of any of the above method embodiments are implemented.
  • the present disclosure also provides a computer program product, which, when executed by a computer, implements the functions of any of the above method embodiments.
  • the above embodiments it may be implemented in whole or in part by software, hardware, firmware, or any combination thereof.
  • software it may be implemented in whole or in part in the form of a computer program product.
  • the computer program product includes one or more computer programs.
  • the computer program When the computer program is loaded and executed on a computer, the processes or functions described in accordance with the embodiments of the present disclosure are generated in whole or in part.
  • the computer may be a general-purpose computer, a special-purpose computer, a computer network, or other programmable device.
  • the computer program may be stored in or transferred from one computer-readable storage medium to another, for example, the computer program may be transferred from a website, computer, server, or data center Transmission to another website, computer, server or data center through wired (such as coaxial cable, optical fiber, digital subscriber line (DSL)) or wireless (such as infrared, wireless, microwave, etc.) means.
  • the computer-readable storage medium may be any available medium that can be accessed by a computer or a data storage device such as a server, data center, etc. that contains one or more available media integrated.
  • the usable media may be magnetic media (e.g., floppy disks, hard disks, magnetic tapes), optical media (e.g., high-density digital video discs (DVD)), or semiconductor media (e.g., solid state disks, SSD)) etc.
  • magnetic media e.g., floppy disks, hard disks, magnetic tapes
  • optical media e.g., high-density digital video discs (DVD)
  • DVD digital video discs
  • semiconductor media e.g., solid state disks, SSD
  • At least one in the present disclosure can also be described as one or more, and the plurality can be two, three, four or more, and the present disclosure is not limited.
  • the technical feature is distinguished by “first”, “second”, “third”, “A”, “B”, “C” and “D” etc.
  • the technical features described in “first”, “second”, “third”, “A”, “B”, “C” and “D” are in no particular order or order.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Telephone Function (AREA)
  • Stereophonic System (AREA)

Abstract

本公开提出一种音频信号格式确定方法、装置、设备及存储介质,属于通信技术领域。该方法包括基于终端设备的约束条件,在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号,其中,所要选用的音频格式信号用于提供音频通信的音频服务,音频格式信号集合包括至少一种音频格式信号。本公开针对一种"音频信号格式确定"这一情形提供了一种处理方法,以根据终端设备的约束条件确定音频信号格式,提高音频格式信号确定的准确性,提供基于所述约束条件下最优的音频服务。

Description

音频信号格式确定方法、装置 技术领域
本公开涉及通信技术领域,尤其涉及一种音频信号格式确定方法、装置、设备及存储介质。
背景技术
在通信系统中,语音和音频通信服务可以从窄带信号扩展到超宽带甚至是全带服务,可以满足用户对高质量音频不断增加的需求。但是终端设备仅可以通过预先设置确定音频格式信号,使得音频格式信号确定的准确性较低,无法提供与终端设备匹配的最优音频服务。因此,亟需一种“音频信号格式确定”的方法,以基于终端设备的约束条件确定音频信号格式,提高音频格式信号确定的准确性,提供基于所述约束条件下最优的音频服务。
发明内容
本公开提出的一种音频信号格式确定方法、装置、设备及存储介质,以基于终端设备的约束条件确定音频信号格式,提高音频格式信号确定的准确性,提供基于所述约束条件下最优的音频服务。
本公开一方面实施例提出的一种音频信号格式确定方法,所述方法由终端设备执行,所述方法包括:
基于所述终端设备的约束条件,在所述终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号,其中,所述所要选用的音频格式信号用于提供音频通信的音频服务,所述音频格式信号集合包括至少一种音频格式信号。
可选地,在本公开的一个实施例之中,所述终端设备的约束条件,包括以下至少一种:
所述终端设备的硬件资源条件;
所述终端设备的软件资源条件;
所述终端设备所处的网络环境条件;
所述终端设备的电量。
可选地,在本公开的一个实施例之中,所述方法还包括:
基于所述终端设备所设置的麦克风数量和/或麦克风类型,确定所述终端设备可提供的音频格式信号集合。
可选地,在本公开的一个实施例之中,所述基于所述终端设备所设置的麦克风数量和/或麦克风类型,确定所述终端设备可提供的音频格式信号集合,包括:
响应于所述终端设备所设置的麦克风数量为至少三个,确定所述终端设备可提供的音频格式信号集合为第一音频格式信号集合;
其中,所述第一音频格式信号集合中的音频格式信号包括以下至少一种:
单声道信号;
立体声信号;
基于辅助元数据的空间音频信号(Metadata-Assisted Spatial Audio,MASA);
基于场景一阶高保真度立体环绕声(First-Order Ambisonics,FOA)信号;
基于场景高阶高保真度立体环绕声(Higher-Order Ambisonics,HOA)信号;
多声道信号。
可选地,在本公开的一个实施例之中,所述基于所述终端设备所设置的麦克风数量和/或麦克风类型,确定所述终端设备可提供的音频格式信号集合,包括:
响应于所述终端设备中设置的麦克风包括至少一个外接音频采集设备,确定所述终端设备可提供的音频格式信号集合为第二音频格式信号集合;
其中,所述第二音频格式信号集合中的音频格式信号包括以下至少一种:
单声道信号;
立体声信号;
双耳信号(Binaural);
对象信号;
基于辅助元数据的空间音频信号;
基于场景FOA信号;
基于场景HOA信号;
多声道信号。
可选地,在本公开的一个实施例之中,所述基于所述终端设备的约束条件,在所述终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号,包括:
基于所述终端设备的约束条件对应的条件参数和所述约束条件对应的权重系数,获取所述约束条件对应的分数,其中,所述约束条件包括所述终端设备的硬件资源条件、所述终端设备的软件资源条件、所述终端设备所处的网络环境条件、所述终端设备的电量;
基于所述分数,在所述终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号。
可选地,在本公开的一个实施例之中,所述基于所述分数,在所述终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号,包括:
响应于所述终端设备可提供的音频格式信号集合为第一音频格式信号集合,且所述分数大于第一分数阈值,在所述第一音频格式信号集合中确定的音频通信所要选用的音频格式信号为基于辅助元数据的空间音频信号;
响应于所述终端设备可提供的音频格式信号集合为第一音频格式信号集合,且所述分数小于所述第一分数阈值,在所述第一音频格式信号集合中确定的音频通信所要选用的音频格式信号为单声道信号。
可选地,在本公开的一个实施例之中,所述基于所述分数,在所述终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号,包括:
响应于所述终端设备可提供的音频格式信号集合为第二音频格式信号集合,且所述分数大于第二分数阈值,在所述第二音频格式信号集合中确定的音频通信所要选用的音频格式信号为基于场景HOA信号;
响应于所述终端设备可提供的音频格式信号集合为所述第二音频格式信号集合,且所述分数小于所述第二分数阈值,在所述第二音频格式信号集合中确定的音频通信所要选用的音频格式信号为单声道信号。
可选地,在本公开的一个实施例之中,所述基于所述终端设备的约束条件,在所述终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号,包括:
基于所述终端设备的电量,在所述终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号。
可选地,在本公开的一个实施例之中,所述基于所述终端设备的电量,在所述终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号,包括:
响应于所述终端设备可提供的音频格式信号集合为第一音频格式信号集合,且所述终端设备的电量大于第一电量阈值,在所述第一音频格式信号集合中确定的音频通信所要选用的音频格式信号为基于辅助元数据的空间音频信号;
响应于所述终端设备可提供的音频格式信号集合为所述第一音频格式信号集合,且所述终端设备的电量小于所述第一电量阈值,在所述第一音频格式信号集合中确定的音频通信所要选用的音频格式信号为单声道信号。
可选地,在本公开的一个实施例之中,所述基于所述终端设备的电量,在所述终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号,包括:
响应于所述终端设备可提供的音频格式信号集合为第二音频格式信号集合,且所述终端设备的电量大于第二电量阈值,在所述第二音频格式信号集合中确定的音频通信所要选用的音频格式信号为基于场景HOA信号;
响应于所述终端设备可提供的音频格式信号集合为所述第二音频格式信号集合,且所述终端设备的电量小于所述第二电量阈值,在所述第二音频格式信号集合中确定的音频通信所要选用的音频格式信号为单声道信号。
本公开另一方面实施例提出的一种音频信号格式确定装置,其特征在于,所述装置包括:
确定模块,用于基于所述终端设备的约束条件,在所述终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号,其中,所述所要选用的音频格式信号用于提供音频通信的音频服务,所述音频格式信号集合包括至少一种音频格式信号。
本公开又一方面实施例提出的一种终端设备,所述装置包括处理器和存储器,所述存储器中存储有计算机程序,所述处理器执行所述存储器中存储的计算机程序,以使所述装置执行如上一方面实施例提出的方法。
本公开又一方面实施例提出的通信装置,包括:处理器和接口电路;
所述接口电路,用于接收代码指令并传输至所述处理器;
所述处理器,用于运行所述代码指令以执行如一方面实施例提出的方法。
本公开又一方面实施例提出的计算机可读存储介质,用于存储有指令,当所述指令被执行时,使如一方面实施例提出的方法被实现。
综上所述,在本公开实施例之中,基于终端设备的约束条件,在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号,其中,所要选用的音频格式信号用于提供音频通信的音频服务,音频格式信号集合包括至少一种音频格式信号。在本公开实施例之中,通过终端设备的约束条件,可以确定音频通信所要选用的音 频格式信号,可以提高音频格式信号与约束条件的匹配性,减少不考虑终端设备的约束条件直接确定音频格式信号使得音频格式信号确定的准确性较低的情况,可以提高音频格式信号确定的准确性,可以改善音频通信的音频服务。本公开针对一种“音频信号格式确定”这一情形提供了一种处理方法,以基于终端设备的约束条件确定音频信号格式,提高音频格式信号确定的准确性,提供基于所述约束条件下最优的音频服务。
附图说明
本公开上述的和/或附加的方面和优点从下面结合附图对实施例的描述中将变得明显和容易理解,其中:
图1为本公开一个实施例所提供的一种音频信号格式确定方法的流程示意图;
图2为本公开另一个实施例所提供的一种音频信号格式确定方法的流程示意图;
图3为本公开再一个实施例所提供的一种音频信号格式确定方法的流程示意图;
图4为本公开又一个实施例所提供的一种终端设备的举例示意图;
图5为本公开又一个实施例所提供的一种音频信号格式确定方法的流程示意图;
图6为本公开又一个实施例所提供的一种终端设备的举例示意图;
图7为本公开又一个实施例所提供的一种音频信号格式确定方法的流程示意图;
图8为本公开又一个实施例所提供的一种音频信号格式确定方法的流程示意图;
图9为本公开又一个实施例所提供的一种音频信号格式确定方法的流程示意图;
图10为本公开又一个实施例所提供的一种音频信号格式确定方法的流程示意图;
图11为本公开又一个实施例所提供的一种音频信号格式确定方法的流程示意图;
图12为本公开又一个实施例所提供的一种音频信号格式确定方法的流程示意图;
图13为本公开一个实施例所提供的一种音频信号格式确定装置的结构示意图;
图14是本公开一个实施例所提供的一种终端设备的框图。
具体实施方式
这里将详细地对示例性实施例进行说明,其示例表示在附图中。下面的描述涉及附图时,除非另有表示,不同附图中的相同数字表示相同或相似的要素。以下示例性实施例中所描述的实施方式并不代表与本公开实施例相一致的所有实施方式。相反,它们仅是与如所附权利要求书中所详述的、本公开实施例的一些方面相一致的装置和方法的例子。
在本公开实施例使用的术语是仅仅出于描述特定实施例的目的,而非旨在限制本公开实施例。在本公开实施例和所附权利要求书中所使用的单数形式的“一种”和“该”也旨在包括多数形式,除非上下文清楚地表示其他含义。还应当理解,本文中使用的术语“和/或”是指并包含一个或多个相关联的列出项目的任何或所有可能组合。
应当理解,尽管在本公开实施例可能采用术语第一、第二、第三等来描述各种信息,但这些信息不应限于这些术语。这些术语仅用来将同一类型的信息彼此区分开。例如,在不脱离本公开实施例范围的情况下,第一信息也可以被称为第二信息,类似地,第二信息也可以被称为第一信息。取决于语境,如在此所使用的词语“如果”及“若”可以被解释成为“在……时”或“当……时”或“响应于确定”。
在本公开实施例中涉及的网元或是网络功能,其既可以是独立的硬件设备实现,也可以通过硬件设备中的软件实现,本公开实施例中并不对此做出限定。
在通信系统中,第一代移动通信技术(first generation mobile communication technology,1G)开始于20世纪80年代,1G是第一代无线蜂窝技术,是属于模拟移动通信网。1G升级到第二代手机通信技术(2-Generation wireless telephone technology,2G)时将终端设备从模拟通信转移到数字通信。例如在2G中可以采用GSM网络制式,语音编码器可以采用自适应多速率语音编码器(Adaptive Multi-Rate,AMR),增强型全速率编解码器(Enhanced Full Rate Speed Encoding,EFR),全速率编解码器(Full Rate,FR),半速率编解码器(Half Rate,HR)通信提供单通道窄带语音服务。第三代移动通信技术(3rd-Generation,3G)移动通信系统是国际电信联盟ITU为2000年国际移动通信而提出的。例如不同的运营商可以采用不同的编码方式。例如可以采用时分同步码分多址(Time Division-Synchronous Code Division Multiple Access,TD-SCDMA),还可以采用码分多址(Code Division Multiple Access 2000,CDMA2000),还可以采用宽带码分多址(Wideband Code Division Multiple Access,W-CDMA),其语音编码器采用新型可变速率多模式宽带语音编解码器AMR-WB提供单通道宽带语音服务。第四代移动通信技术(the 4th generation mobile communication technology,4G)是在3G技术上的一次更好的改良,数据和话音都采用全网际互连协议(Internet Protocol,IP)的方式,提供语音音频的实时高清语音HD+Voice服务,采用的音频编码 器(Enhanced Voice Services,EVS)编解码器可以兼顾语音和音频的高质量压缩。
其中,在本公开的一个实施例之中,以上提供的语音和音频通信服务从窄带信号扩展到超宽带甚至是全带服务,但还都是单声道服务,用户对高质量音频的需求不断增加,与单声道音频相比,立体声音频对于每个声源具有取向感和分布感,并且可以提高清晰度,清晰度。随着传输带宽的增加以及终端设备信号采集设备的升级,信号处理器性能的提升,以及终端回放设备的升级,基于声道的信号,基于对象的信号,基于场景的信号等三种信号格式可以提供三维音频服务。第三代合作伙伴计划(3rd Generation Partnership Project,3GPP)编码器和媒体工作组SA4正在标准化的浸入式语音音频服务(Immersive Voice and Audio Services,IVAS)编解码器即能支持上述三种信号格式的编解码需求。其中三种信号格式中具体的信号格式有:其中基于声道的信号有:单声道信号,立体声信号(Stereo),双耳信号(Binaural),5.1,7.1环绕声信号(Surround),5.1.4,7.1.4环绕声信号(Surround),其中.4代表高度声道信号(Height),基于场景的信号有:一阶高保真度立体环绕声(FOA),二阶高保真度立体环绕声(2 nd-Order Ambisonics,HOA2),三阶高保真度立体环绕声(3 rd-Order Ambisonics,HOA3),基于对象的信号包含音频数据和元数据,除此之外,IVAS还支持基于辅助元数据的空间音频信号(Metadata-Assisted Spatial Audio,MASA)。能够支持三维音频服务的终端设备包括但不限于手机,电脑,平板,会议系统设备,AR/VR设备,汽车等。
下面参考附图对本公开实施例所提供的一种音频信号格式确定方法、装置、设备及存储介质进行详细描述。
图1为本公开实施例所提供的一种音频信号格式确定方法的流程示意图,该方法由终端设备执行,如图1所示,该方法可以包括以下步骤:
步骤101、基于终端设备的约束条件,在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号,其中,所要选用的音频格式信号用于提供音频通信的音频服务,音频格式信号集合包括至少一种音频格式信号。
需要说明的是,在本公开的一个实施例之中,终端设备可以是指向用户提供语音和/或数据连通性的设备。终端设备可以经RAN(Radio Access Network,无线接入网)与一个或多个核心网进行通信,终端设备可以是物联网终端,如传感器设备、移动电话(或称为“蜂窝”电话)和具有物联网终端的计算机,例如,可以是固定式、便携式、袖珍式、手持式、计算机内置的或者车载的装置。例如,站(Station,STA)、订户单元(subscriber unit)、订户站(subscriber station),移动站(mobile station)、移动台(mobile)、远程站(remote station)、接入点、远程终端(remote terminal)、接入终端(access terminal)、用户装置(user terminal)或用户代理(user agent)。或者,终端设备也可以是无人飞行器的设备。或者,终端设备也可以是车载设备,比如,可以是具有无线通信功能的行车电脑,或者是外接行车电脑的无线终端。或者,终端设备也可以是路边设备,比如,可以是具有无线通信功能的路灯、信号灯或者其它路边设备等。
其中,在本公开的一个实施例之中,终端设备的约束条件,包括以下至少一种:
终端设备的硬件资源条件;
终端设备的软件资源条件;
终端设备所处的网络环境条件;
终端设备的电量。
以及,在本公开的一个实施例中,该方法还包括:
基于终端设备所设置的麦克风数量和/或麦克风类型,确定终端设备可提供的音频格式信号集合。
以及,在本公开的一个实施例中,基于终端设备所设置的麦克风数量和/或麦克风类型,确定终端设备可提供的音频格式信号集合,包括:
响应于终端设备所设置的麦克风数量为至少三个,确定终端设备可提供的音频格式信号集合为第一音频格式信号集合;
其中,第一音频格式信号集合中的音频格式信号包括以下至少一种:
单声道信号(mono);
立体声信号(Stereo);
基于辅助元数据的空间音频信号;
基于场景一阶高保真度立体环绕声FOA信号;
基于场景高阶高保真度立体环绕声HOA信号;
多声道信号。
以及,在本公开的一个实施例中,基于终端设备所设置的麦克风数量和/或麦克风类型,确定终端设备可提供的音频格式信号集合,包括:
响应于终端设备中设置的麦克风包括至少一个外接音频采集设备,确定终端设备可提供的音频格式信号集合为 第二音频格式信号集合;
其中,第二音频格式信号集合中的音频格式信号包括以下至少一种:
单声道信号;
立体声信号;
双耳信号;
对象信号;
基于辅助元数据的空间音频信号;
基于场景FOA信号;
基于场景HOA信号;
多声道信号。
以及,在本公开的一个实施例中,基于终端设备的约束条件,在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号,包括:
基于终端设备的约束条件对应的条件参数和约束条件对应的权重系数,获取约束条件对应的分数,其中,约束条件包括终端设备的硬件资源条件、终端设备的软件资源条件、终端设备所处的网络环境条件、终端设备的电量中至少两种;
基于分数,在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号。
示例地,在本公开的一个实施例中,基于分数,在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号,包括:
响应于终端设备可提供的音频格式信号集合为第一音频格式信号集合,且分数大于第一分数阈值,在第一音频格式信号集合中确定的音频通信所要选用的音频格式信号为基于辅助元数据的空间音频信号;
响应于终端设备可提供的音频格式信号集合为第一音频格式信号集合,且分数小于第一分数阈值,在第一音频格式信号集合中确定的音频通信所要选用的音频格式信号为单声道信号。
进一步地,在本公开的一个实施例中,基于分数,在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号,包括:
响应于终端设备可提供的音频格式信号集合为第二音频格式信号集合,且分数大于第二分数阈值,在第二音频格式信号集合中确定的音频通信所要选用的音频格式信号为基于场景HOA信号;
响应于终端设备可提供的音频格式信号集合为第二音频格式信号集合,且分数小于第二分数阈值,在第二音频格式信号集合中确定的音频通信所要选用的音频格式信号为单声道信号。
以及,在本公开的一个实施例中,基于终端设备的约束条件,在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号,包括:
基于终端设备的电量,在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号。
以及,在本公开的一个实施例中,基于终端设备的电量,在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号,包括:
响应于终端设备可提供的音频格式信号集合为第一音频格式信号集合,且终端设备的电量大于第一电量阈值,在第一音频格式信号集合中确定的音频通信所要选用的音频格式信号为基于辅助元数据的空间音频信号;
响应于终端设备可提供的音频格式信号集合为第一音频格式信号集合,且终端设备的电量小于第一电量阈值,在第一音频格式信号集合中确定的音频通信所要选用的音频格式信号为单声道信号。
以及,在本公开的一个实施例中,基于终端设备的电量,在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号,包括:
响应于终端设备可提供的音频格式信号集合为第二音频格式信号集合,且终端设备的电量大于第二电量阈值,在第二音频格式信号集合中确定的音频通信所要选用的音频格式信号为基于场景HOA信号;
响应于终端设备可提供的音频格式信号集合为第二音频格式信号集合,且终端设备的电量小于第二电量阈值,在第二音频格式信号集合中确定的音频通信所要选用的音频格式信号为单声道信号。
综上所述,在本公开实施例之中,基于终端设备的约束条件,在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号,其中,所要选用的音频格式信号用于提供音频通信的音频服务,音频格式信号集合包括至少一种音频格式信号。在本公开实施例之中,通过终端设备的约束条件,可以确定音频通信所要选用的音频格式信号,可以提高音频格式信号与约束条件的匹配性,减少不考虑终端设备的约束条件直接确定音频格式信号使得音频格式信号确定的准确性较低的情况,可以提高音频格式信号确定的准确性,可以改善音频通信的音频服务。本公开针对一种“音频信号格式确定”这一情形提供了一种处理方法,以基于终端设备的约束条件确定音频信号格式, 提高音频格式信号确定的准确性,提供基于所述约束条件下最优的音频服务。
图2为本公开实施例所提供的一种音频信号格式确定方法的流程示意图,该方法由终端设备执行,如图2所示,该方法可以包括以下步骤:
步骤201、基于终端设备所设置的麦克风数量和/或麦克风类型,确定终端设备可提供的音频格式信号集合;
步骤202、基于终端设备的约束条件,在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号,其中,所要选用的音频格式信号用于提供音频通信的音频服务,音频格式信号集合包括至少一种音频格式信号。
其中,在本公开的一个实施例之中,麦克风数量是指终端设备所设置的麦克风的数量。该麦克风可以是设置于终端设备中所设置的麦克风,还可以是终端设备外接的麦克风。该麦克风数量并不特指某一固定数量。例如,当终端设置的麦克风的数量发生变化时,该麦克风数量也可以相应变化。
示例地,在本公开的一个实施例之中,麦克风类型是指终端设置的至少一个麦克风对应的类型。不同的麦克风可以对应不同的麦克风类型,不同的麦克风也可以对应相同的麦克风类型。
以及,在本公开的一个实施例之中,音频格式信号集合是指由至少一种音频格式信号汇聚而成的集体。该音频格式信号集合并不特指某一固定集合。例如,当音频格式信号集合中包括的音频格式信号的数量发生变化时,该音频格式信号集合也可以相应变化。例如,当音频格式信号集合中包括的音频格式信号发生变化时,该音频格式信号集合也可以相应变化。
其中,在本公开的一个实施例之中,约束条件是指用于确定音频通信所要选用的音频格式信号的条件。该约束条件并不特指某一固定条件。该约束条件可以包括以下至少一种:
终端设备的硬件资源条件;
终端设备的软件资源条件;
终端设备所处的网络环境条件;
终端设备的电量。
以及,在本公开的一个实施例之中,终端设备可以采用专用提取模块获取终端设备的约束条件。
进一步地,在本公开的一个实施例之中,基于终端设备所设置的麦克风数量和/或麦克风类型,终端设备可以确定终端设备可提供的音频格式信号集合。终端设备基于终端设备的约束条件,可以在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号,其中,所要选用的音频格式信号用于提供音频通信的音频服务,音频格式信号集合包括至少一种音频格式信号。其中,不同的音频格式信号可以对应的不同的音频数据声道数目。
综上所述,在本公开实施例之中,基于终端设备所设置的麦克风数量和/或麦克风类型,确定终端设备可提供的音频格式信号集合,基于终端设备的约束条件,在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号,其中,所要选用的音频格式信号用于提供音频通信的音频服务,音频格式信号集合包括至少一种音频格式信号。在本公开实施例之中,通过终端设备的约束条件,可以确定音频通信所要选用的音频格式信号,可以提高音频格式信号与约束条件的匹配性,减少不考虑终端设备的约束条件直接确定音频格式信号使得音频格式信号确定的准确性较低的情况,可以提高音频格式信号确定的准确性,可以改善音频通信的音频服务。本公开实施例之中,具体说明了音频格式信号集合确定方案,可以提高音频格式信号确定的准确性。本公开针对一种“音频信号格式确定”这一情形提供了一种处理方法,以基于终端设备的约束条件确定音频信号格式,提高音频格式信号确定的准确性,提供基于所述约束条件下最优的音频服务。
图3为本公开实施例所提供的一种音频信号格式确定方法的流程示意图,该方法由终端设备执行,如图3所示,该方法可以包括以下步骤:
步骤S301,响应于终端设备所设置的麦克风数量为至少三个,确定终端设备可提供的音频格式信号集合为第一音频格式信号集合;
其中,第一音频格式信号集合中的音频格式信号包括以下至少一种:
单声道信号;
立体声信号;
基于辅助元数据的空间音频信号;
基于场景一阶高保真度立体环绕声FOA信号;
基于场景高阶高保真度立体环绕声HOA信号;
多声道信号;
步骤302、基于终端设备的约束条件,在终端设备可提供的第一音频格式信号集合中确定音频通信所要选用的音频格式信号,其中,所要选用的音频格式信号用于提供音频通信的音频服务,音频格式信号集合包括至少一种音 频格式信号。
其中,在本公开的一个实施例之中,第一音频格式信号集合是指终端设备所设置的麦克风数量为至少三个时,终端设备所确定的音频格式信号集合。第一音频格式信号集合中的第一仅用于与其他音频格式信号集合进行区分,第一音频格式信号集合并不特指某一固定集合。例如,第一音频格式信号集合可以包括单声道信号、立体声信号和基于辅助元数据的空间音频信号,第一音频格式信号集合还可以包括单声道信号、立体声信号和多声道信号。
示例地,在本公开的一个实施例之中,终端设备所设置的麦克风数量例如可以为三个。该三个麦克风例如可以是内置于终端设备的三个麦克风。此时终端设备的举例示意图可以如图4所示。其中,一个麦克风设置于终端设备的上部,两个麦克风设置于终端设备的下部。终端设备确定终端设备可提供的第一音频格式信号集合例如可以包括单声道信号、立体声信号和基于辅助元数据的空间音频信号。
示例地,在本公开的一个实施例之中,终端设备所设置的麦克风数量例如可以为一个。终端设备确定终端设备可提供的第一音频格式信号集合例如可以仅包括单声道信号。
示例地,在本公开的一个实施例之中,高阶高保真度立体环绕声HOA信号包括但不限于二阶高保真度立体环绕声信号和三阶高保真度立体环绕声信号等。
综上所述,在本公开实施例之中,响应于终端设备所设置的麦克风数量为至少三个,确定终端设备可提供的音频格式信号集合为第一音频格式信号集合,基于终端设备的约束条件,在终端设备可提供的第一音频格式信号集合中确定音频通信所要选用的音频格式信号,其中,所要选用的音频格式信号用于提供音频通信的音频服务,音频格式信号集合包括至少一种音频格式信号。在本公开实施例之中,通过终端设备的约束条件,可以确定音频通信所要选用的音频格式信号,可以提高音频格式信号与约束条件的匹配性,减少不考虑终端设备的约束条件直接确定音频格式信号使得音频格式信号确定的准确性较低的情况,可以提高音频格式信号确定的准确性,可以改善音频通信的音频服务。本公开实施例之中,具体说明了第一音频格式信号集合的确定方案,可以提高在终端设备可提供的第一音频格式信号集合中确定音频格式信号的准确性。本公开针对一种“音频信号格式确定”这一情形提供了一种处理方法,以基于终端设备的约束条件确定音频信号格式,提高音频格式信号确定的准确性,提供基于所述约束条件下最优的音频服务。
图5为本公开实施例所提供的一种音频信号格式确定方法的流程示意图,该方法由终端设备执行,如图5所示,该方法可以包括以下步骤:
步骤501、响应于终端设备中设置的麦克风包括至少一个外接音频采集设备,确定终端设备可提供的音频格式信号集合为第二音频格式信号集合;
其中,第二音频格式信号集合中的音频格式信号包括以下至少一种:
单声道信号;
立体声信号;
双耳信号;
对象信号;
基于辅助元数据的空间音频信号;
基于场景FOA信号;
基于场景HOA信号;
多声道信号;
步骤502、基于终端设备的约束条件,在终端设备可提供的第二音频格式信号集合中确定音频通信所要选用的音频格式信号,其中,所要选用的音频格式信号用于提供音频通信的音频服务,音频格式信号集合包括至少一种音频格式信号。
示例地,在本公开的一个实施例之中,终端设备所设置的麦克风数量例如可以为一个。此时终端设备的举例示意图可以如图6所示。其中,至少一个外接音频采集设备例如可以是采集球。终端设备确定终端设备可提供的音频格式信号集合例如可以为第二音频格式信号集合,该第二音频格式信号集合例如可以包括单声道信号、立体声信号、基于场景FOA信号、基于场景HOA信号和基于辅助元数据的空间音频信号。
示例地,在本公开的一个实施例之中,终端设备中设置的麦克风例如可以包括一个外接音频采集设备和一个内置于终端设备的麦克风,至少一个外接音频采集设备例如可以是采集球。终端设备确定终端设备可提供的音频格式信号集合例如可以是第二音频格式信号集合。
综上所述,在本公开实施例之中,响应于终端设备中设置的麦克风包括至少一个外接音频采集设备,确定终端设备可提供的音频格式信号集合为第二音频格式信号集合,基于终端设备的约束条件,在终端设备可提供的第二音频格式信号集合中确定音频通信所要选用的音频格式信号,其中,所要选用的音频格式信号用于提供音频通信的音 频服务,音频格式信号集合包括至少一种音频格式信号。在本公开实施例之中,通过终端设备的约束条件,可以确定音频通信所要选用的音频格式信号,可以提高音频格式信号与约束条件的匹配性,减少不考虑终端设备的约束条件直接确定音频格式信号使得音频格式信号确定的准确性较低的情况,可以提高音频格式信号确定的准确性,可以改善音频通信的音频服务。本公开实施例之中,具体说明了第二音频格式信号集合的确定方案,可以提高在终端设备可提供的第二音频格式信号集合中确定音频格式信号的准确性。本公开针对一种“音频信号格式确定”这一情形提供了一种处理方法,以基于终端设备的约束条件确定音频信号格式,提高音频格式信号确定的准确性,提供基于所述约束条件下最优的音频服务。
图7为本公开实施例所提供的一种音频信号格式确定方法的流程示意图,该方法由终端设备执行,如图7所示,该方法可以包括以下步骤:
步骤701、基于终端设备的约束条件对应的条件参数和约束条件对应的权重系数,获取约束条件对应的分数,其中,约束条件包括终端设备的硬件资源条件、终端设备的软件资源条件、终端设备所处的网络环境条件、终端设备的电量;
步骤702、基于分数,在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号。
其中,在本公开的一个实施例之中,不同的约束条件可以对应不同的权重系数,也可以对应相同的权重系数。例如,终端设备的硬件资源条件对应的权重系数可以是0.8,终端设备的软件资源条件对应的权重系数可以是0.6,终端设备所处的网络环境条件对应的权重系数可以是0.7。例如,终端设备的硬件资源条件对应的权重系数可以是0.8,终端设备的软件资源条件对应的权重系数可以是0.8,终端设备所处的网络环境条件对应的权重系数可以是0.7。
以及,在本公开的一个实施例之中,条件参数并不特指某一固定参数。该约束条件例如可以是终端设备所处的网络环境条件,当网络环境发生变化时,该条件参数也可以相应变化。
以及,在本公开的一个实施例之中,约束条件对应的分数并不特指某一固定分数。例如,当约束条件包括的条件数量发生变化时,该约束条件对应的分数也可以相应变化。例如,当约束条件包括的条件类型发生变化时,该约束条件对应的分数也可以相应变化。
示例地,在本公开的一个实施例之中,终端设备获取约束条件对应的分数时,终端设备可以通过判决模块获取约束条件对应的分数。例如,终端设备的硬件资源条件对应的条件参数例如可以是0.7、终端设备的软件资源条件对应的条件参数例如可以是1.0、终端设备所处的网络环境条件对应的条件参数例如可以是0.7、终端设备的电量对应的条件参数例如可以是1.0。终端设备的硬件资源条件对应的权重系数例如可以是0.8,终端设备的软件资源条件对应的权重系数例如可以是0.6,终端设备所处的网络环境条件对应的权重系数例如可以是0.7,终端设备的电量对应的权重系数例如可以是0.9。终端设备获取约束条件对应的分数例如可以是:
f=0.7*0.8+1.0*0.6+0.7*0.7+1.0*0.9=0.56+0.6+0.49+0.9=2.55。
终端设备可以基于该分数2.55,在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号。
综上所述,在本公开实施例之中,基于终端设备的约束条件对应的条件参数和约束条件对应的权重系数,获取约束条件对应的分数,其中,约束条件包括终端设备的硬件资源条件、终端设备的软件资源条件、终端设备所处的网络环境条件、终端设备的电量,基于分数,在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号。在本公开实施例之中,通过终端设备的约束条件对应的条件参数和约束条件对应的权重系数,可以确定音频通信所要选用的音频格式信号,可以提高音频格式信号与约束条件的匹配性,减少不考虑终端设备的约束条件直接确定音频格式信号使得音频格式信号确定的准确性较低的情况,可以提高音频格式信号确定的准确性,可以改善音频通信的音频服务。本公开针对一种“音频信号格式确定”这一情形提供了一种处理方法,以基于终端设备的约束条件确定音频信号格式,提高音频格式信号确定的准确性,提供基于所述约束条件下最优的音频服务。
图8为本公开实施例所提供的一种音频信号格式确定方法的流程示意图,该方法由终端设备执行,如图8所示,该方法可以包括以下步骤:
步骤801、基于终端设备的约束条件对应的条件参数和约束条件对应的权重系数,获取约束条件对应的分数,其中,约束条件包括终端设备的硬件资源条件、终端设备的软件资源条件、终端设备所处的网络环境条件、终端设备的电量中至少两种;
以下步骤择一执行:
步骤802、响应于终端设备可提供的音频格式信号集合为第一音频格式信号集合,且分数大于第一分数阈值,在第一音频格式信号集合中确定的音频通信所要选用的音频格式信号为基于辅助元数据的空间音频信号;
步骤803、响应于终端设备可提供的音频格式信号集合为第一音频格式信号集合,且分数小于第一分数阈值,在第一音频格式信号集合中确定的音频通信所要选用的音频格式信号为单声道信号。
其中,在本公开的一个实施例之中,第一分数阈值是指用于在第一音频格式信号集合中确定的音频通信所要选用的音频格式信号的分数阈值。第一分数阈值中的第一仅用于与其余分数阈值进行区分,并不特指某一固定分数阈值。
其中,在本公开的一个实施例之中,当分数等于第一分数阈值时,终端设备可以按照分数大于第一分数阈值的方案确定音频通信所要选用的音频格式信号,还可以按照分数小于第一分数阈值的方案确定音频通信所要选用的音频格式信号。例如,响应于终端设备可提供的音频格式信号集合为第一音频格式信号集合,且分数等于第一分数阈值,终端设备在第一音频格式信号集合中确定的音频通信所要选用的音频格式信号例如可以为基于辅助元数据的空间音频信号,还可以为单声道信号。
示例地,在本公开的一个实施例之中,终端设备基于终端设备的约束条件对应的条件参数和约束条件对应的权重系数,获取约束条件对应的分数例如可以是2.55。第一分数阈值例如可以是2。第一音频格式信号集合例如可以包括单声道信号、立体声信号和基于辅助元数据的空间音频信号。当分数2.55大于第一分数阈值2时,终端设备在第一音频格式信号集合中确定的音频通信所要选用的音频格式信号例如可以为基于辅助元数据的空间音频信号。也就是说,当终端设备具有性能比较好的硬件处理器,比较丰富的软件资源,同时如果网络环境较好的情况下,终端设备输出的音频信息倾向于选择使用音频数据声道数目更多的音频格式信号。
示例地,在本公开的一个实施例之中,终端设备基于终端设备的约束条件对应的条件参数和约束条件对应的权重系数,获取约束条件对应的分数例如可以是1.55。第一分数阈值例如可以是2。第一音频格式信号集合例如可以包括单声道信号、立体声信号和基于辅助元数据的空间音频信号。当分数1.55小于第一分数阈值2时,终端设备在第一音频格式信号集合中确定的音频通信所要选用的音频格式信号例如可以为单声道信号。例如,终端设备具有性能比较差的硬件处理器,比较贫瘠的软件资源,同时如果网络环境较差的情况下,终端设备输出的音频信息倾向于选择使用音频数据声道数目更少的音频格式信号。
示例地,在本公开的一个实施例之中,终端设备基于终端设备的约束条件对应的条件参数和约束条件对应的权重系数,获取约束条件对应的分数例如可以是2。第一分数阈值例如可以是2。第一音频格式信号集合例如可以包括单声道信号、立体声信号和基于辅助元数据的空间音频信号。当分数2等于第一分数阈值2时,终端设备在第一音频格式信号集合中确定的音频通信所要选用的音频格式信号例如可以为单声道信号,也可以为基于辅助元数据的空间音频信号。
进一步地,在本公开的一个实施例之中,第一分数阈值可以是多个分数阈值。不同的分数阈值,在第一音频格式信号集合中确定的音频通信所要选用的音频格式信号也不相同。
示例地,在本公开的一个实施例之中,第一分数阈值例如可以包括1.5和2。第一音频格式信号集合例如可以包括单声道信号、立体声信号和基于辅助元数据的空间音频信号。当终端设备基于终端设备的约束条件对应的条件参数和约束条件对应的权重系数,获取约束条件对应的分数例如可以是1.85时,终端设备在第一音频格式信号集合中确定的音频通信所要选用的音频格式信号例如可以为立体声信号。当终端设备基于终端设备的约束条件对应的条件参数和约束条件对应的权重系数,获取约束条件对应的分数例如可以是1.35时,终端设备在第一音频格式信号集合中确定的音频通信所要选用的音频格式信号例如可以为单声道信号。当终端设备基于终端设备的约束条件对应的条件参数和约束条件对应的权重系数,获取约束条件对应的分数例如可以是2.35时,终端设备在第一音频格式信号集合中确定的音频通信所要选用的音频格式信号例如可以为基于辅助元数据的空间音频信号。
综上所述,在本公开实施例之中,基于终端设备的约束条件对应的条件参数和约束条件对应的权重系数,获取约束条件对应的分数,其中,约束条件包括终端设备的硬件资源条件、终端设备的软件资源条件、终端设备所处的网络环境条件、终端设备的电量,基于分数,在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号。在本公开实施例之中,通过终端设备的约束条件对应的条件参数和约束条件对应的权重系数,可以确定音频通信所要选用的音频格式信号,可以提高音频格式信号与约束条件的匹配性,减少不考虑终端设备的约束条件直接确定音频格式信号使得音频格式信号确定的准确性较低的情况,可以提高音频格式信号确定的准确性,可以改善音频通信的音频服务。本公开实施例具体说明了在第一音频格式信号集合中基于分数确定音频通信所要选用的音频格式信号的方案,可以提高音频格式信号确定的准确性。本公开针对一种“音频信号格式确定”这一情形提供了一种处理方法,以基于终端设备的约束条件确定音频信号格式,提高音频格式信号确定的准确性,提供基于所述约束条件下最优的音频服务。
图9为本公开实施例所提供的一种音频信号格式确定方法的流程示意图,该方法由终端设备执行,如图9所示,该方法可以包括以下步骤:
步骤901、基于终端设备的约束条件对应的条件参数和约束条件对应的权重系数,获取约束条件对应的分数,其中,约束条件包括终端设备的硬件资源条件、终端设备的软件资源条件、终端设备所处的网络环境条件、终端设 备的电量中至少两种;
以下步骤择一执行:
步骤902、响应于终端设备可提供的音频格式信号集合为第二音频格式信号集合,且分数大于第二分数阈值,在第二音频格式信号集合中确定的音频通信所要选用的音频格式信号为基于场景HOA信号;
步骤903、响应于终端设备可提供的音频格式信号集合为第二音频格式信号集合,且分数小于第二分数阈值,在第二音频格式信号集合中确定的音频通信所要选用的音频格式信号为单声道信号。
其中,在本公开的一个实施例之中,第二分数阈值是指用于在第二音频格式信号集合中确定的音频通信所要选用的音频格式信号的分数阈值。第二分数阈值中的第二仅用于与其余分数阈值进行区分,并不特指某一固定分数阈值。其中,第二分数阈值可以与第一分数阈值相同,也可以与第一分数阈值不同。
示例地,在本公开的一个实施例之中,当分数等于第二分数阈值时,终端设备可以按照分数大于第二分数阈值的方案确定音频通信所要选用的音频格式信号,还可以按照分数小于第二分数阈值的方案确定音频通信所要选用的音频格式信号。例如,响应于终端设备可提供的音频格式信号集合为第二音频格式信号集合,且分数等于第二分数阈值,终端设备在第二音频格式信号集合中确定的音频通信所要选用的音频格式信号例如可以为单声道信号,例如还可以为基于场景HOA信号。
示例地,在本公开的一个实施例之中,终端设备基于终端设备的约束条件对应的条件参数和约束条件对应的权重系数,获取约束条件对应的分数例如可以是2.55。第二分数阈值例如可以是2。第二音频格式信号集合例如可以包括单声道信号、立体声信号、基于场景FOA信号、基于场景HOA信号和基于辅助元数据的空间音频信号。当分数2.55大于第一分数阈值2时,终端设备在第二音频格式信号集合中确定的音频通信所要选用的音频格式信号例如可以为基于场景HOA信号。
示例地,在本公开的一个实施例之中,终端设备基于终端设备的约束条件对应的条件参数和约束条件对应的权重系数,获取约束条件对应的分数例如可以是1.55。第二分数阈值例如可以是2。第二音频格式信号集合例如可以包括单声道信号、立体声信号、基于场景FOA信号、基于场景HOA信号和基于辅助元数据的空间音频信号。当分数1.55小于第二分数阈值2时,终端设备在第二音频格式信号集合中确定的音频通信所要选用的音频格式信号例如可以为单声道信号。
示例地,在本公开的一个实施例之中,终端设备基于终端设备的约束条件对应的条件参数和约束条件对应的权重系数,获取约束条件对应的分数例如可以是2。第二分数阈值例如可以是2。第二音频格式信号集合例如可以包括单声道信号、立体声信号、基于场景FOA信号、基于场景HOA信号和基于辅助元数据的空间音频信号。当分数2等于第二分数阈值2时,终端设备在第二音频格式信号集合中确定的音频通信所要选用的音频格式信号例如可以为单声道信号,也可以为基于场景HOA信号。
进一步地,在本公开的一个实施例之中,第二分数阈值可以是多个分数阈值。不同的分数阈值,在第二音频格式信号集合中确定的音频通信所要选用的音频格式信号也不相同。
示例地,在本公开的一个实施例之中,第二分数阈值例如可以包括1.2、1.5、1.8和2。第二音频格式信号集合例如可以包括单声道信号、立体声信号、基于场景FOA信号、基于场景HOA信号和基于辅助元数据的空间音频信号。当终端设备基于终端设备的约束条件对应的条件参数和约束条件对应的权重系数,获取约束条件对应的分数例如可以是1.85时,终端设备在第二音频格式信号集合中确定的音频通信所要选用的音频格式信号例如可以为基于场景FOA信号。当终端设备基于终端设备的约束条件对应的条件参数和约束条件对应的权重系数,获取约束条件对应的分数例如可以是1.35时,终端设备在第二音频格式信号集合中确定的音频通信所要选用的音频格式信号例如可以为立体声信号。当终端设备基于终端设备的约束条件对应的条件参数和约束条件对应的权重系数,获取约束条件对应的分数例如可以是2.35时,终端设备在第二音频格式信号集合中确定的音频通信所要选用的音频格式信号例如可以为基于场景HOA信号。当终端设备基于终端设备的约束条件对应的条件参数和约束条件对应的权重系数,获取约束条件对应的分数例如可以是1.15时,终端设备在第二音频格式信号集合中确定的音频通信所要选用的音频格式信号例如可以为单声道信号。当终端设备基于终端设备的约束条件对应的条件参数和约束条件对应的权重系数,获取约束条件对应的分数例如可以是1.6时,终端设备在第二音频格式信号集合中确定的音频通信所要选用的音频格式信号例如可以为基于辅助元数据的空间音频信号。
综上所述,在本公开实施例之中,基于终端设备的约束条件对应的条件参数和约束条件对应的权重系数,获取约束条件对应的分数,其中,约束条件包括终端设备的硬件资源条件、终端设备的软件资源条件、终端设备所处的网络环境条件、终端设备的电量,基于分数,在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号。在本公开实施例之中,通过终端设备的约束条件对应的条件参数和约束条件对应的权重系数,可以确定音频通信所要选用的音频格式信号,可以提高音频格式信号与约束条件的匹配性,减少不考虑终端设备的约束条 件直接确定音频格式信号使得音频格式信号确定的准确性较低的情况,可以提高音频格式信号确定的准确性,可以改善音频通信的音频服务。本公开实施例具体说明了在第二音频格式信号集合中基于分数确定音频通信所要选用的音频格式信号的方案,可以提高音频格式信号确定的准确性。本公开针对一种“音频信号格式确定”这一情形提供了一种处理方法,以基于终端设备的约束条件确定音频信号格式,提高音频格式信号确定的准确性,提供基于所述约束条件下最优的音频服务。
图10为本公开实施例所提供的一种音频信号格式确定方法的流程示意图,该方法由终端设备执行,如图10所示,该方法可以包括以下步骤:
步骤1001、基于终端设备的电量,在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号。
其中,在本公开的一个实施例之中,终端设备的约束条件例如可以是终端设备的电量,基于终端设备的电量,终端设备可以在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号。
综上所述,在本公开实施例之中,基于终端设备的电量,在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号。在本公开实施例之中,通过终端设备的电量,可以确定音频通信所要选用的音频格式信号,可以提高音频格式信号与约束条件的匹配性,减少不考虑终端设备的约束条件直接确定音频格式信号使得音频格式信号确定的准确性较低的情况,可以提高音频格式信号确定的准确性,可以改善音频通信的音频服务。本公开针对一种“音频信号格式确定”这一情形提供了一种处理方法,以基于终端设备的约束条件确定音频信号格式,提高音频格式信号确定的准确性,提供基于所述约束条件下最优的音频服务。
图11为本公开实施例所提供的一种音频信号格式确定方法的流程示意图,该方法由终端设备执行,如图11所示,该方法可以包括以下步骤:
以下步骤1101和步骤1102可以择一执行:
步骤1101、响应于终端设备可提供的音频格式信号集合为第一音频格式信号集合,且终端设备的电量大于第一电量阈值,在第一音频格式信号集合中确定的音频通信所要选用的音频格式信号为基于辅助元数据的空间音频信号;
步骤1102、响应于终端设备可提供的音频格式信号集合为第一音频格式信号集合,且终端设备的电量小于第一电量阈值,在第一音频格式信号集合中确定的音频通信所要选用的音频格式信号为单声道信号;
步骤1103、采用在第一音频格式信号集合中确定的音频通信所要选用的音频格式信号提供音频通信的音频服务。
其中,在本公开的一个实施例之中,第一电量阈值是指用于在第一音频格式信号集合中确定音频通信所要选用的音频格式信号的电量阈值。第一电量阈值中的第一仅用于与其余电量阈值进行区分,并不特指某一固定电量阈值。
示例地,在本公开的一个实施例之中,当终端设备的电量等于第一分数阈值时,终端设备可以按照终端设备的电量大于第一电量阈值的方案确定音频通信所要选用的音频格式信号,还可以按照终端设备的电量小于第一电量阈值的方案确定音频通信所要选用的音频格式信号。例如,响应于终端设备可提供的音频格式信号集合为第一音频格式信号集合,且终端设备的电量等于第一电量阈值,终端设备在第一音频格式信号集合中确定的音频通信所要选用的音频格式信号例如可以为单声道信号,例如还可以为基于辅助元数据的空间音频信号。
示例地,在本公开的一个实施例之中,第一音频格式信号集合例如可以包括单声道信号、立体声信号和基于辅助元数据的空间音频信号。第一电量阈值例如可以是终端设备的总电量的20%。该总电量为终端设备的电池充满电时的电量。响应于终端设备可提供的音频格式信号集合为第一音频格式信号集合,且终端设备的电量15%小于第一电量阈值20%,在第一音频格式信号集合中确定的音频通信所要选用的音频格式信号例如可以为单声道信号,可以延长终端设备的续航时长。
示例地,在本公开的一个实施例之中,第一音频格式信号集合例如可以包括单声道信号、立体声信号和基于辅助元数据的空间音频信号。第一电量阈值例如可以是终端设备的总电量的20%。该总电量为终端设备的电池充满电时的电量。响应于终端设备可提供的音频格式信号集合为第一音频格式信号集合,且终端设备的电量50%大于第一电量阈值20%,在第一音频格式信号集合中确定的音频通信所要选用的音频格式信号例如可以为基于辅助元数据的空间音频信号。
其中,在本公开的一个实施例之中,第一电量阈值可以是多个电量阈值。不同的电量阈值,在第一音频格式信号集合中确定的音频通信所要选用的音频格式信号也不相同。
示例地,在本公开的一个实施例之中,第一电量阈值例如可以包括20%和40%。第一音频格式信号集合例如可以包括单声道信号、立体声信号和基于辅助元数据的空间音频信号。终端设备的电量例如可以是30%,终端设备在第一音频格式信号集合中确定的音频通信所要选用的音频格式信号例如可以为立体声信号。终端设备的电量例如可以是15%,终端设备在第一音频格式信号集合中确定的音频通信所要选用的音频格式信号例如可以为单声道信号。终端设备的电量例如可以是50%,终端设备在第一音频格式信号集合中确定的音频通信所要选用的音频格式信号例 如可以为基于辅助元数据的空间音频信号。
综上所述,在本公开实施例之中,基于终端设备的电量,在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号。在本公开实施例之中,通过终端设备的电量,可以确定音频通信所要选用的音频格式信号,可以提高音频格式信号与约束条件的匹配性,减少不考虑终端设备的约束条件直接确定音频格式信号使得音频格式信号确定的准确性较低的情况,可以提高音频格式信号确定的准确性,可以改善音频通信的音频服务。本公开实施例具体说明了在第一音频格式信号集合中基于终端设备的电量确定音频通信所要选用的音频格式信号的方案,可以提高音频格式信号确定的准确性。本公开针对一种“音频信号格式确定”这一情形提供了一种处理方法,以基于终端设备的约束条件确定音频信号格式,提高音频格式信号确定的准确性,提供基于所述约束条件下最优的音频服务。
图12为本公开实施例所提供的一种音频信号格式确定方法的流程示意图,该方法由终端设备执行,如图12所示,该方法可以包括以下步骤:
以下步骤1201和步骤1202择一执行:
步骤1201、响应于终端设备可提供的音频格式信号集合为第二音频格式信号集合,且终端设备的电量大于第二电量阈值,在第二音频格式信号集合中确定的音频通信所要选用的音频格式信号为基于场景HOA信号;
步骤1202、响应于终端设备可提供的音频格式信号集合为第二音频格式信号集合,且终端设备的电量小于第二电量阈值,在第二音频格式信号集合中确定的音频通信所要选用的音频格式信号为单声道信号;
步骤1203、采用在第二音频格式信号集合中确定的音频通信所要选用的音频格式信号提供音频通信的音频服务。
其中,在本公开的一个实施例之中,第二电量阈值是指用于在第一音频格式信号集合中确定的音频通信所要选用的音频格式信号的电量阈值。第二电量阈值中的第二仅用于与其余电量阈值进行区分,并不特指某一固定电量阈值。其中,第二电量阈值可以与第一电量阈值相同,也可以与第一电量阈值不同。
示例地,在本公开的一个实施例之中,当终端设备的电量等于第二电量阈值时,终端设备可以按照终端设备的电量大于第二电量阈值的方案确定音频通信所要选用的音频格式信号,还可以按照终端设备的电量小于第二电量阈值的方案确定音频通信所要选用的音频格式信号。例如,响应于终端设备可提供的音频格式信号集合为第二音频格式信号集合,且终端设备的电量等于第二电量阈值,终端设备在第二音频格式信号集合中确定的音频通信所要选用的音频格式信号例如可以为单声道信号,例如还可以为基于场景HOA信号。
示例地,在本公开的一个实施例之中,第二音频格式信号集合例如可以包括单声道信号、立体声信号、基于场景FOA信号、基于场景HOA信号和基于辅助元数据的空间音频信号。第二电量阈值例如可以是终端设备的总电量的20%。该总电量为终端设备的电池充满电时的电量。响应于终端设备可提供的音频格式信号集合为第二音频格式信号集合,且终端设备的电量15%小于第二电量阈值20%,在第二音频格式信号集合中确定的音频通信所要选用的音频格式信号为单声道信号,可以延长终端设备的续航时长。
示例地,在本公开的一个实施例之中,第二音频格式信号集合例如可以包括单声道信号、立体声信号、基于场景FOA信号、基于场景HOA信号和基于辅助元数据的空间音频信号。第二电量阈值例如可以是终端设备的总电量的20%。该总电量为终端设备的电池充满电时的电量。响应于终端设备可提供的音频格式信号集合为第一音频格式信号集合,且终端设备的电量50%大于第二电量阈值20%,在第二音频格式信号集合中确定的音频通信所要选用的音频格式信号为基于场景HOA信号。
其中,在本公开的一个实施例之中,第二电量阈值可以是多个电量阈值。不同的电量阈值,在第二音频格式信号集合中确定的音频通信所要选用的音频格式信号也不相同。
示例地,在本公开的一个实施例之中,第二电量阈值例如可以包括20%、40%、60%和75%。第二音频格式信号集合例如可以包括单声道信号、立体声信号、基于场景FOA信号、基于场景HOA信号和基于辅助元数据的空间音频信号。终端设备的电量例如可以是15%,终端设备在第二音频格式信号集合中确定的音频通信所要选用的音频格式信号例如可以为单声道信号。终端设备的电量例如可以是25%,终端设备在第二音频格式信号集合中确定的音频通信所要选用的音频格式信号例如可以为立体声信号。终端设备的电量例如可以是55%,终端设备在第二音频格式信号集合中确定的音频通信所要选用的音频格式信号例如可以为基于辅助元数据的空间音频信号。终端设备的电量例如可以是65%,终端设备在第二音频格式信号集合中确定的音频通信所要选用的音频格式信号例如可以为基于场景FOA信号。终端设备的电量例如可以是85%,终端设备在第二音频格式信号集合中确定的音频通信所要选用的音频格式信号例如可以为基于场景HOA信号。
综上所述,在本公开实施例之中,基于终端设备的电量,在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号。在本公开实施例之中,通过终端设备的电量,可以确定音频通信所要选用的音频格式信号,可以提高音频格式信号与约束条件的匹配性,减少不考虑终端设备的约束条件直接确定音频格式信号使得音 频格式信号确定的准确性较低的情况,可以提高音频格式信号确定的准确性,可以改善音频通信的音频服务。本公开实施例具体说明了在第二音频格式信号集合中基于终端设备的电量确定音频通信所要选用的音频格式信号的方案,可以提高音频格式信号确定的准确性。本公开针对一种“音频信号格式确定”这一情形提供了一种处理方法,以基于终端设备的约束条件确定音频信号格式,提高音频格式信号确定的准确性,提供基于所述约束条件下最优的音频服务。
图13为本公开实施例所提供的一种音频信号格式确定装置的结构示意图,如图13所示,该装置1300可以包括:
确定模块1301,用于基于终端设备的约束条件,在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号,其中,所要选用的音频格式信号用于提供音频通信的音频服务,音频格式信号集合包括至少一种音频格式信号。
综上所述,在本公开实施例的音频信号格式确定装置之中,通过确定模块,可以基于终端设备的约束条件,在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号,其中,所要选用的音频格式信号用于提供音频通信的音频服务,音频格式信号集合包括至少一种音频格式信号。在本公开实施例之中,通过终端设备的约束条件,可以确定音频通信所要选用的音频格式信号,可以提高音频格式信号与约束条件的匹配性,减少不考虑终端设备的约束条件直接确定音频格式信号使得音频格式信号确定的准确性较低的情况,可以提高音频格式信号确定的准确性,可以改善音频通信的音频服务。本公开针对一种“音频信号格式确定”这一情形提供了一种处理装置,以基于终端设备的约束条件确定音频信号格式,提高音频格式信号确定的准确性,提供基于所述约束条件下最优的音频服务。
其中,在本公开的一个实施例之中,终端设备的约束条件,包括以下至少一种:
终端设备的硬件资源条件;
终端设备的软件资源条件;
终端设备所处的网络环境条件;
终端设备的电量。
以及,在本公开的一个实施例之中,确定模块1701,还用于:
基于终端设备所设置的麦克风数量和/或麦克风类型,确定终端设备可提供的音频格式信号集合。
以及,在本公开的一个实施例之中,确定模块1701,用于基于终端设备所设置的麦克风数量和/或麦克风类型,确定终端设备可提供的音频格式信号集合时,具体用于:
响应于终端设备所设置的麦克风数量为至少三个,确定终端设备可提供的音频格式信号集合为第一音频格式信号集合;
其中,第一音频格式信号集合中的音频格式信号包括以下至少一种:
单声道信号;
立体声信号;
基于辅助元数据的空间音频信号;
基于场景一阶高保真度立体环绕声FOA信号;
基于场景高阶高保真度立体环绕声HOA信号;
多声道信号。
以及,在本公开的一个实施例之中,确定模块1701,用于基于终端设备所设置的麦克风数量和/或麦克风类型,确定终端设备可提供的音频格式信号集合时,具体用于:
响应于终端设备中设置的麦克风包括至少一个外接音频采集设备,确定终端设备可提供的音频格式信号集合为第二音频格式信号集合;
其中,第二音频格式信号集合中的音频格式信号包括以下至少一种:
单声道信号;
立体声信号;
双耳信号;
对象信号;
基于辅助元数据的空间音频信号;
基于场景FOA信号;
基于场景HOA信号;
多声道信号。
以及,在本公开的一个实施例之中,确定模块1701,用于基于终端设备的约束条件,在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号时,具体用于:
基于终端设备的约束条件对应的条件参数和约束条件对应的权重系数,获取约束条件对应的分数,其中,约束条件包括终端设备的硬件资源条件、终端设备的软件资源条件、终端设备所处的网络环境条件、终端设备的电量;
基于分数,在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号。
以及,在本公开的一个实施例之中,确定模块1701,用于基于分数,在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号时,具体用于:
响应于终端设备可提供的音频格式信号集合为第一音频格式信号集合,且分数大于第一分数阈值,在第一音频格式信号集合中确定的音频通信所要选用的音频格式信号为基于辅助元数据的空间音频信号;
响应于终端设备可提供的音频格式信号集合为第一音频格式信号集合,且分数小于第一分数阈值,在第一音频格式信号集合中确定的音频通信所要选用的音频格式信号为单声道信号。
以及,在本公开的一个实施例之中,确定模块1701,用于基于分数,在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号时,具体用于:
响应于终端设备可提供的音频格式信号集合为第二音频格式信号集合,且分数大于第二分数阈值,在第二音频格式信号集合中确定的音频通信所要选用的音频格式信号为基于场景HOA信号;
响应于终端设备可提供的音频格式信号集合为第二音频格式信号集合,且分数小于第二分数阈值,在第二音频格式信号集合中确定的音频通信所要选用的音频格式信号为单声道信号。
以及,在本公开的一个实施例之中,确定模块1701,用于基于终端设备的约束条件,在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号时,具体用于:
基于终端设备的电量,在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号。
以及,在本公开的一个实施例之中,确定模块1701,用于基于终端设备的电量,在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号时,具体用于:
响应于终端设备可提供的音频格式信号集合为第一音频格式信号集合,且终端设备的电量大于第一电量阈值,在第一音频格式信号集合中确定的音频通信所要选用的音频格式信号为基于辅助元数据的空间音频信号;
响应于终端设备可提供的音频格式信号集合为第一音频格式信号集合,且终端设备的电量小于第一电量阈值,在第一音频格式信号集合中确定的音频通信所要选用的音频格式信号为单声道信号。
以及,在本公开的一个实施例之中,确定模块1701,用于基于终端设备的电量,在终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号时,具体用于:
响应于终端设备可提供的音频格式信号集合为第二音频格式信号集合,且终端设备的电量大于第二电量阈值,在第二音频格式信号集合中确定的音频通信所要选用的音频格式信号为基于场景HOA信号;
响应于终端设备可提供的音频格式信号集合为第二音频格式信号集合,且终端设备的电量小于第二电量阈值,在第二音频格式信号集合中确定的音频通信所要选用的音频格式信号为单声道信号。
图14是本公开一个实施例所提供的一种终端设备UE1400的框图。例如,UE1400可以是移动电话,计算机,数字广播终端设备,消息收发设备,游戏控制台,平板设备,医疗设备,健身设备,个人数字助理等。
参照图14,UE1400可以包括以下至少一个组件:处理组件1402,存储器1404,电源组件1406,多媒体组件1408,音频组件1410,输入/输出(I/O)的接口1412,传感器组件1414,以及通信组件1416。
处理组件1402通常控制UE1400的整体操作,诸如与显示,电话呼叫,数据通信,相机操作和记录操作相关联的操作。处理组件1402可以包括至少一个处理器1420来执行指令,以完成上述的方法的全部或部分步骤。此外,处理组件1402可以包括至少一个模块,便于处理组件1402和其他组件之间的交互。例如,处理组件1402可以包括多媒体模块,以方便多媒体组件1408和处理组件1402之间的交互。
存储器1404被配置为存储各种类型的数据以支持在UE1400的操作。这些数据的示例包括用于在UE1400上操作的任何应用程序或方法的指令,联系人数据,电话簿数据,消息,图片,视频等。存储器1404可以由任何类型的易失性或非易失性存储设备或者它们的组合实现,如静态随机存取存储器(SRAM),电可擦除可编程只读存储器(EEPROM),可擦除可编程只读存储器(EPROM),可编程只读存储器(PROM),只读存储器(ROM),磁存储器,快闪存储器,磁盘或光盘。
电源组件1406为UE1400的各种组件提供电力。电源组件1406可以包括电源管理系统,至少一个电源,及其他与为UE1400生成、管理和分配电力相关联的组件。
多媒体组件1408包括在所述UE1400和用户之间的提供一个输出接口的屏幕。在一些实施例中,屏幕可以包括液晶显示器(LCD)和触摸面板(TP)。如果屏幕包括触摸面板,屏幕可以被实现为触摸屏,以接收来自用户的输入信 号。触摸面板包括至少一个触摸传感器以感测触摸、滑动和触摸面板上的手势。所述触摸传感器可以不仅感测触摸或滑动动作的边界,而且还检测与所述触摸或滑动操作相关的唤醒时间和压力。在一些实施例中,多媒体组件1408包括一个前置摄像头和/或后置摄像头。当UE1400处于操作模式,如拍摄模式或视频模式时,前置摄像头和/或后置摄像头可以接收外部的多媒体数据。每个前置摄像头和后置摄像头可以是一个固定的光学透镜系统或具有焦距和光学变焦能力。
音频组件1410被配置为输出和/或输入音频信号。例如,音频组件1410包括一个麦克风(MIC),当UE1400处于操作模式,如呼叫模式、记录模式和语音识别模式时,麦克风被配置为接收外部音频信号。所接收的音频信号可以被进一步存储在存储器1404或经由通信组件1416发送。在一些实施例中,音频组件1410还包括一个扬声器,用于输出音频信号。
I/O接口1412为处理组件1402和外围接口模块之间提供接口,上述外围接口模块可以是键盘,点击轮,按钮等。这些按钮可包括但不限于:主页按钮、音量按钮、启动按钮和锁定按钮。
传感器组件1414包括至少一个传感器,用于为UE1400提供各个方面的状态评估。例如,传感器组件1414可以检测到设备1400的打开/关闭状态,组件的相对定位,例如所述组件为UE1400的显示器和小键盘,传感器组件1414还可以检测UE1400或UE1400的一个组件的位置改变,用户与UE1400接触的存在或不存在,UE1400方位或加速/减速和UE1400的温度变化。传感器组件1414可以包括接近传感器,被配置用来在没有任何的物理接触时检测附近物体的存在。传感器组件1414还可以包括光传感器,如CMOS或CCD图像传感器,用于在成像应用中使用。在一些实施例中,该传感器组件1414还可以包括加速度传感器,陀螺仪传感器,磁传感器,压力传感器或温度传感器。
通信组件1416被配置为便于UE1400和其他设备之间有线或无线方式的通信。UE1400可以接入基于通信标准的无线网络,如WiFi,2G或3G,或它们的组合。在一个示例性实施例中,通信组件1416经由广播信道接收来自外部广播管理系统的广播信号或广播相关信息。在一个示例性实施例中,所述通信组件1416还包括近场通信(NFC)模块,以促进短程通信。例如,在NFC模块可基于射频识别(RFID)技术,红外数据协会(IrDA)技术,超宽带(UWB)技术,蓝牙(BT)技术和其他技术来实现。
在示例性实施例中,UE1400可以被至少一个应用专用集成电路(ASIC)、数字信号处理器(DSP)、数字信号处理设备(DSPD)、可编程逻辑器件(PLD)、现场可编程门阵列(FPGA)、控制器、微控制器、微处理器或其他电子元件实现,用于执行上述方法。
上述本公开提供的实施例中,分别从网络侧设备、UE的角度对本公开实施例提供的方法进行了介绍。为了实现上述本公开实施例提供的方法中的各功能,网络侧设备和UE可以包括硬件结构、软件模块,以硬件结构、软件模块、或硬件结构加软件模块的形式来实现上述各功能。上述各功能中的某个功能可以以硬件结构、软件模块、或者硬件结构加软件模块的方式来执行。
上述本公开提供的实施例中,分别从网络侧设备、UE的角度对本公开实施例提供的方法进行了介绍。为了实现上述本公开实施例提供的方法中的各功能,网络侧设备和UE可以包括硬件结构、软件模块,以硬件结构、软件模块、或硬件结构加软件模块的形式来实现上述各功能。上述各功能中的某个功能可以以硬件结构、软件模块、或者硬件结构加软件模块的方式来执行。
本公开实施例提供的一种通信装置。通信装置可包括收发模块和处理模块。收发模块可包括发送模块和/或接收模块,发送模块用于实现发送功能,接收模块用于实现接收功能,收发模块可以实现发送功能和/或接收功能。
通信装置可以是终端设备(如前述方法实施例中的终端设备),也可以是终端设备中的装置,还可以是能够与终端设备匹配使用的装置。或者,通信装置可以是网络设备,也可以是网络设备中的装置,还可以是能够与网络设备匹配使用的装置。
本公开实施例提供的另一种通信装置。通信装置可以是网络设备,也可以是终端设备(如前述方法实施例中的终端设备),也可以是支持网络设备实现上述方法的芯片、芯片系统、或处理器等,还可以是支持终端设备实现上述方法的芯片、芯片系统、或处理器等。该装置可用于实现上述方法实施例中描述的方法,具体可以参见上述方法实施例中的说明。
通信装置可以包括一个或多个处理器。处理器可以是通用处理器或者专用处理器等。例如可以是基带处理器或中央处理器。基带处理器可以用于对通信协议以及通信数据进行处理,中央处理器可以用于对通信装置(如,网络侧设备、基带芯片,终端设备、终端设备芯片,DU或CU等)进行控制,执行计算机程序,处理计算机程序的数据。
可选地,通信装置中还可以包括一个或多个存储器,其上可以存有计算机程序,处理器执行所述计算机程序,以使得通信装置执行上述方法实施例中描述的方法。可选地,所述存储器中还可以存储有数据。通信装置和存储器 可以单独设置,也可以集成在一起。
可选地,通信装置还可以包括收发器、天线。收发器可以称为收发单元、收发机、或收发电路等,用于实现收发功能。收发器可以包括接收器和发送器,接收器可以称为接收机或接收电路等,用于实现接收功能;发送器可以称为发送机或发送电路等,用于实现发送功能。
可选地,通信装置中还可以包括一个或多个接口电路。接口电路用于接收代码指令并传输至处理器。处理器运行所述代码指令以使通信装置执行上述方法实施例中描述的方法。
通信装置为终端设备(如前述方法实施例中的终端设备):处理器用于执行图1-图12任一所示的方法。
在一种实现方式中,处理器中可以包括用于实现接收和发送功能的收发器。例如该收发器可以是收发电路,或者是接口,或者是接口电路。用于实现接收和发送功能的收发电路、接口或接口电路可以是分开的,也可以集成在一起。上述收发电路、接口或接口电路可以用于代码/数据的读写,或者,上述收发电路、接口或接口电路可以用于信号的传输或传递。
在一种实现方式中,处理器可以存有计算机程序,计算机程序在处理器上运行,可使得通信装置执行上述方法实施例中描述的方法。计算机程序可能固化在处理器中,该种情况下,处理器可能由硬件实现。
在一种实现方式中,通信装置可以包括电路,所述电路可以实现前述方法实施例中发送或接收或者通信的功能。本公开中描述的处理器和收发器可实现在集成电路(integrated circuit,IC)、模拟IC、射频集成电路RFIC、混合信号IC、专用集成电路(application specific integrated circuit,ASIC)、印刷电路板(printed circuit board,PCB)、电子设备等上。该处理器和收发器也可以用各种IC工艺技术来制造,例如互补金属氧化物半导体(complementary metal oxide semiconductor,CMOS)、N型金属氧化物半导体(nMetal-oxide-semiconductor,NMOS)、P型金属氧化物半导体(positive channel metal oxide semiconductor,PMOS)、双极结型晶体管(bipolar junction transistor,BJT)、双极CMOS(BiCMOS)、硅锗(SiGe)、砷化镓(GaAs)等。
以上实施例描述中的通信装置可以是网络设备或者终端设备(如前述方法实施例中的终端设备),但本公开中描述的通信装置的范围并不限于此,而且通信装置的结构可以不受的限制。通信装置可以是独立的设备或者可以是较大设备的一部分。例如所述通信装置可以是:
(1)独立的集成电路IC,或芯片,或,芯片系统或子系统;
(2)具有一个或多个IC的集合,可选地,该IC集合也可以包括用于存储数据,计算机程序的存储部件;
(3)ASIC,例如调制解调器(Modem);
(4)可嵌入在其他设备内的模块;
(5)接收机、终端设备、智能终端设备、蜂窝电话、无线设备、手持机、移动单元、车载设备、网络设备、云设备、人工智能设备等等;
(6)其他等等。
对于通信装置可以是芯片或芯片系统的情况,芯片包括处理器和接口。其中,处理器的数量可以是一个或多个,接口的数量可以是多个。
可选地,芯片还包括存储器,存储器用于存储必要的计算机程序和数据。
本领域技术人员还可以了解到本公开实施例列出的各种说明性逻辑块(illustrative logical block)和步骤(step)可以通过电子硬件、电脑软件,或两者的结合进行实现。这样的功能是通过硬件还是软件来实现取决于特定的应用和整个系统的设计要求。本领域技术人员可以对于每种特定的应用,可以使用各种方法实现所述的功能,但这种实现不应被理解为超出本公开实施例保护的范围。
本公开还提供一种可读存储介质,其上存储有指令,该指令被计算机执行时实现上述任一方法实施例的功能。
本公开还提供一种计算机程序产品,该计算机程序产品被计算机执行时实现上述任一方法实施例的功能。
在上述实施例中,可以全部或部分地通过软件、硬件、固件或者其任意组合来实现。当使用软件实现时,可以全部或部分地以计算机程序产品的形式实现。所述计算机程序产品包括一个或多个计算机程序。在计算机上加载和执行所述计算机程序时,全部或部分地产生按照本公开实施例所述的流程或功能。所述计算机可以是通用计算机、专用计算机、计算机网络、或者其他可编程装置。所述计算机程序可以存储在计算机可读存储介质中,或者从一个计算机可读存储介质向另一个计算机可读存储介质传输,例如,所述计算机程序可以从一个网站站点、计算机、服务器或数据中心通过有线(例如同轴电缆、光纤、数字用户线(digital subscriber line,DSL))或无线(例如红外、无线、微波等)方式向另一个网站站点、计算机、服务器或数据中心进行传输。所述计算机可读存储介质可以是计算机能够存取的任何可用介质或者是包含一个或多个可用介质集成的服务器、数据中心等数据存储设备。所述可用介质可以是磁性介质(例如,软盘、硬盘、磁带)、光介质(例如,高密度数字视频光盘(digital video disc,DVD))、或者半导体介质(例如,固态硬盘(solid state disk,SSD))等。
本领域普通技术人员可以理解:本公开中涉及的第一、第二等各种数字编号仅为描述方便进行的区分,并不用来限制本公开实施例的范围,也表示先后顺序。
本公开中的至少一个还可以描述为一个或多个,多个可以是两个、三个、四个或者更多个,本公开不做限制。在本公开实施例中,对于一种技术特征,通过“第一”、“第二”、“第三”、“A”、“B”、“C”和“D”等区分该种技术特征中的技术特征,该“第一”、“第二”、“第三”、“A”、“B”、“C”和“D”描述的技术特征间无先后顺序或者大小顺序。
本领域技术人员在考虑说明书及实践这里公开的发明后,将容易想到本发明的其它实施方案。本公开旨在涵盖本发明的任何变型、用途或者适应性变化,这些变型、用途或者适应性变化遵循本发明的一般性原理并包括本公开未公开的本技术领域中的公知常识或惯用技术手段。说明书和实施例仅被视为示例性的,本公开的真正范围和精神由下面的权利要求指出。
应当理解的是,本公开并不局限于上面已经描述并在附图中示出的精确结构,并且可以在不脱离其范围进行各种修改和改变。本公开的范围仅由所附的权利要求来限制。

Claims (15)

  1. 一种音频信号格式确定方法,其特征在于,所述方法由终端设备执行,所述方法包括:
    基于所述终端设备的约束条件,在所述终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号,其中,所述所要选用的音频格式信号用于提供音频通信的音频服务,所述音频格式信号集合包括至少一种音频格式信号。
  2. 根据权利要求1所述的方法,其特征在于,所述终端设备的约束条件,包括以下至少一种:
    所述终端设备的硬件资源条件;
    所述终端设备的软件资源条件;
    所述终端设备所处的网络环境条件;
    所述终端设备的电量。
  3. 根据权利要求1所述的方法,其特征在于,所述方法还包括:
    基于所述终端设备所设置的麦克风数量和/或麦克风类型,确定所述终端设备可提供的音频格式信号集合。
  4. 根据权利要求3所述的方法,其特征在于,所述基于所述终端设备所设置的麦克风数量和/或麦克风类型,确定所述终端设备可提供的音频格式信号集合,包括:
    响应于所述终端设备所设置的麦克风数量为至少三个,确定所述终端设备可提供的音频格式信号集合为第一音频格式信号集合;
    其中,所述第一音频格式信号集合中的音频格式信号包括以下至少一种:
    单声道信号;
    立体声信号;
    基于辅助元数据的空间音频信号;
    基于场景一阶高保真度立体环绕声FOA信号;
    基于场景高阶高保真度立体环绕声HOA信号;
    多声道信号。
  5. 根据权利要求3所述的方法,其特征在于,所述基于所述终端设备所设置的麦克风数量和/或麦克风类型,确定所述终端设备可提供的音频格式信号集合,包括:
    响应于所述终端设备中设置的麦克风包括至少一个外接音频采集设备,确定所述终端设备可提供的音频格式信号集合为第二音频格式信号集合;
    其中,所述第二音频格式信号集合中的音频格式信号包括以下至少一种:
    单声道信号;
    立体声信号;
    双耳信号;
    对象信号;
    基于辅助元数据的空间音频信号;
    基于场景FOA信号;
    基于场景HOA信号;
    多声道信号。
  6. 根据权利要求1所述的方法,其特征在于,所述基于所述终端设备的约束条件,在所述终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号,包括:
    基于所述终端设备的约束条件对应的条件参数和所述约束条件对应的权重系数,获取所述约束条件对应的分数,其中,所述约束条件包括所述终端设备的硬件资源条件、所述终端设备的软件资源条件、所述终端设备所处的网络环境条件、所述终端设备的电量;
    基于所述分数,在所述终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号。
  7. 根据权利要求6所述的方法,其特征在于,所述基于所述分数,在所述终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号,包括:
    响应于所述终端设备可提供的音频格式信号集合为第一音频格式信号集合,且所述分数大于第一分数阈值,在所述第一音频格式信号集合中确定的音频通信所要选用的音频格式信号为基于辅助元数据的空间音频信号;
    响应于所述终端设备可提供的音频格式信号集合为第一音频格式信号集合,且所述分数小于所述第一分数阈值,在所述第一音频格式信号集合中确定的音频通信所要选用的音频格式信号为单声道信号。
  8. 根据权利要求6所述的方法,其特征在于,所述基于所述分数,在所述终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号,包括:
    响应于所述终端设备可提供的音频格式信号集合为第二音频格式信号集合,且所述分数大于第二分数阈值,在所述第二音频格式信号集合中确定的音频通信所要选用的音频格式信号为基于场景HOA信号;
    响应于所述终端设备可提供的音频格式信号集合为所述第二音频格式信号集合,且所述分数小于所述第二分数阈值,在所述第二音频格式信号集合中确定的音频通信所要选用的音频格式信号为单声道信号。
  9. 根据权利要求1所述的方法,其特征在于,所述基于所述终端设备的约束条件,在所述终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号,包括:
    基于所述终端设备的电量,在所述终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号。
  10. 根据权利要求9所述的方法,其特征在于,所述基于所述终端设备的电量,在所述终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号,包括:
    响应于所述终端设备可提供的音频格式信号集合为第一音频格式信号集合,且所述终端设备的电量大于第一电量阈值,在所述第一音频格式信号集合中确定的音频通信所要选用的音频格式信号为基于辅助元数据的空间音频信号;
    响应于所述终端设备可提供的音频格式信号集合为所述第一音频格式信号集合,且所述终端设备的电量小于所述第一电量阈值,在所述第一音频格式信号集合中确定的音频通信所要选用的音频格式信号为单声道信号。
  11. 根据权利要求9所述的方法,其特征在于,所述基于所述终端设备的电量,在所述终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号,包括:
    响应于所述终端设备可提供的音频格式信号集合为第二音频格式信号集合,且所述终端设备的电量大于第二电量阈值,在所述第二音频格式信号集合中确定的音频通信所要选用的音频格式信号为基于场景HOA信号;
    响应于所述终端设备可提供的音频格式信号集合为所述第二音频格式信号集合,且所述终端设备的电量小于所述第二电量阈值,在所述第二音频格式信号集合中确定的音频通信所要选用的音频格式信号为单声道信号。
  12. 一种音频信号格式确定装置,其特征在于,所述装置包括:
    确定模块,用于基于所述终端设备的约束条件,在所述终端设备可提供的音频格式信号集合中确定音频通信所要选用的音频格式信号,其中,所述所要选用的音频格式信号用于提供音频通信的音频服务,所述音频格式信号集合包括至少一种音频格式信号。
  13. 一种终端设备,其特征在于,所述装置包括处理器和存储器,其中,所述存储器中存储有计算机程序,所述处理器执行所述存储器中存储的计算机程序,以使所述装置执行如权利要求1至11中任一项所述的方法。
  14. 一种通信装置,其特征在于,包括:处理器和接口电路,其中
    所述接口电路,用于接收代码指令并传输至所述处理器;
    所述处理器,用于运行所述代码指令以执行如权利要求1至11中任一项所述的方法。
  15. 一种计算机可读存储介质,其特征在于,用于存储有指令,当所述指令被执行时,使如权利要求1至11中任一项所述的方法被实现。
PCT/CN2022/099626 2022-06-17 2022-06-17 音频信号格式确定方法、装置 WO2023240653A1 (zh)

Priority Applications (2)

Application Number Priority Date Filing Date Title
PCT/CN2022/099626 WO2023240653A1 (zh) 2022-06-17 2022-06-17 音频信号格式确定方法、装置
CN202280001877.5A CN117597936A (zh) 2022-06-17 2022-06-17 音频信号格式确定方法、装置

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2022/099626 WO2023240653A1 (zh) 2022-06-17 2022-06-17 音频信号格式确定方法、装置

Publications (1)

Publication Number Publication Date
WO2023240653A1 true WO2023240653A1 (zh) 2023-12-21

Family

ID=89192932

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/099626 WO2023240653A1 (zh) 2022-06-17 2022-06-17 音频信号格式确定方法、装置

Country Status (2)

Country Link
CN (1) CN117597936A (zh)
WO (1) WO2023240653A1 (zh)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103544956A (zh) * 2013-06-20 2014-01-29 Tcl集团股份有限公司 Wifi音响与智能终端协商解码的方法及系统
US20210219028A1 (en) * 2018-05-29 2021-07-15 Huawei Technologies Co., Ltd. Method and apparatus for selecting audio track from audio and video file
CN113347530A (zh) * 2021-04-15 2021-09-03 苏州鸿鹄骐骥电子科技有限公司 一种用于全景相机的全景音频处理方法
CN114285910A (zh) * 2021-12-23 2022-04-05 号百信息服务有限公司 通信终端与互联网音频格式重塑系统和方法

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103544956A (zh) * 2013-06-20 2014-01-29 Tcl集团股份有限公司 Wifi音响与智能终端协商解码的方法及系统
US20210219028A1 (en) * 2018-05-29 2021-07-15 Huawei Technologies Co., Ltd. Method and apparatus for selecting audio track from audio and video file
CN113347530A (zh) * 2021-04-15 2021-09-03 苏州鸿鹄骐骥电子科技有限公司 一种用于全景相机的全景音频处理方法
CN114285910A (zh) * 2021-12-23 2022-04-05 号百信息服务有限公司 通信终端与互联网音频格式重塑系统和方法

Also Published As

Publication number Publication date
CN117597936A (zh) 2024-02-23

Similar Documents

Publication Publication Date Title
WO2018086063A1 (zh) 配置工作带宽的方法及装置
EP3890332A1 (en) Video splitting method and electronic device
EP4354972A1 (en) Method and apparatus for determining time-domain window, and user equipment, base station and storage medium
CN117909082A (zh) 占用设备的方法以及电子设备
WO2023206182A1 (zh) 成功PScell添加或更换报告的位置信息记录方法、装置
US20200359300A1 (en) Information broadcast method and device, service implementation method and device, and access point
CN115552518A (zh) 一种信号编解码方法、装置、用户设备、网络侧设备及存储介质
CN116368460A (zh) 音频处理方法、装置
CN115226185A (zh) 发射功率控制方法及相关设备
WO2023240653A1 (zh) 音频信号格式确定方法、装置
WO2023216119A1 (zh) 音频信号编码方法、装置、电子设备和存储介质
WO2023201759A1 (zh) 成功PScell添加或更换报告的上报方法、装置
WO2021109085A1 (zh) 通信资源单元分配方法、装置、通信设备及存储介质
WO2021109141A1 (zh) 通信连接配置方法、装置、通信设备及存储介质
WO2023173434A1 (zh) 一种信道估计方法、装置、设备及存储介质
WO2024016240A1 (zh) Pdsch传输方法、装置
WO2023193148A1 (zh) 音频回放方法/装置/设备及存储介质
WO2023178567A1 (zh) 一种上报方法/装置/设备及存储介质
WO2023184261A1 (zh) 一种上报方法/装置/设备及存储介质
WO2023173433A1 (zh) 一种信道估计方法/装置/设备及存储介质
EP4293944A1 (en) Ue capability processing method and apparatus, communications device and storage medium
WO2023206570A1 (zh) 一种配置管理方法、装置、设备及存储介质
WO2023173254A1 (zh) 一种定时调整方法/装置/设备及存储介质
CN113950037B (zh) 一种音频播放方法及终端设备
WO2023245589A1 (zh) 定位模型确定方法、装置

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 202280001877.5

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22946334

Country of ref document: EP

Kind code of ref document: A1