WO2014101169A1 - 提供增强音频数据流的方法及装置 - Google Patents

提供增强音频数据流的方法及装置 Download PDF

Info

Publication number
WO2014101169A1
WO2014101169A1 PCT/CN2012/088005 CN2012088005W WO2014101169A1 WO 2014101169 A1 WO2014101169 A1 WO 2014101169A1 CN 2012088005 W CN2012088005 W CN 2012088005W WO 2014101169 A1 WO2014101169 A1 WO 2014101169A1
Authority
WO
WIPO (PCT)
Prior art keywords
audio data
data stream
high frequency
digital signal
original audio
Prior art date
Application number
PCT/CN2012/088005
Other languages
English (en)
French (fr)
Inventor
孟剑强
刘明刚
张江红
Original Assignee
北京印声科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京印声科技有限公司 filed Critical 北京印声科技有限公司
Priority to CN201280077909.6A priority Critical patent/CN104871243A/zh
Priority to PCT/CN2012/088005 priority patent/WO2014101169A1/zh
Publication of WO2014101169A1 publication Critical patent/WO2014101169A1/zh
Priority to HK15112092.5A priority patent/HK1214025A1/zh

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/018Audio watermarking, i.e. embedding inaudible data in the audio signal

Definitions

  • the present invention relates to audio technology and, more particularly, to a method and apparatus for providing enhanced audio data streams. Background technique
  • the frequency of the sound that can be heard by the human ear ranges from 20 Hz to 20,000 Hz, and the frequency range from 2500 Hz to 20 kHz is the high-frequency section. As the age increases, the ability of the human ear to hear the sound of the high-frequency section is gradually weakened or even lost. The upper limit of the sound that adults can generally hear is about 16 kHz.
  • Ultrasonic waves Sounds with frequencies above 20 kHz are called ultrasonic waves.
  • Ultrasonic transmission in the air is characterized by large attenuation and significant multipath effects in the enclosed space.
  • Ultrasonic waves in the low frequency range have stronger reflection ability and weaker transmission ability.
  • audio data for playing such as advertisements, movies, televisions, programs, audio data in music, etc., does not exceed 16 kHz for user convenience.
  • audio data typically only contains audio data in the frequency range audible to the human ear. If the user receiving the audio data wants to know more about the audio data being played, it can only be obtained by other means, and cannot be acquired in real time. For providers that provide audio data, there is no way to know the user's needs. Summary of the invention
  • the present invention has been made in view of the above technical problems, and an object thereof is to provide a provision A method and apparatus for enhancing an audio data stream that enables a user to obtain information related to an original audio data stream in real time.
  • a method of providing an enhanced audio data stream comprising: acquiring an original audio data stream; generating a high frequency digital signal associated with the original audio data stream; and synthesizing the high frequency digital Signaling the original audio data stream to obtain the enhanced audio data stream.
  • generating the high frequency digital signal associated with the original audio data stream can include: determining supplemental content related to the content of the original audio data stream; generating and supplementing the information of the intrinsic; encoding the information into a digital signal ; and modulating digital signals in the high frequency band to obtain high frequency digital signals.
  • synthesizing the high frequency digital signal and the original audio data stream can include: selecting at least one portion of the original audio data stream; and combining the one or more high frequency digital signals with the length of the at least one portion Said at least one part is synthesized.
  • an apparatus for providing an enhanced audio data stream comprising: an acquisition module configured to acquire an original audio data stream; a generation module configured to generate the original audio data a stream-related high frequency digital signal; and a synthesis module configured to synthesize the high frequency digital signal and the original audio data stream to obtain the enhanced audio data stream.
  • FIG. 1 is a flow diagram of a method of providing an enhanced audio data stream, in accordance with one embodiment of the present invention
  • Figure 2 is a schematic 3 ⁇ 4 ⁇ 2 diagram of the generation of high frequency digital signals in the embodiment of Figure 1;
  • Figure 3 is a schematic diagram illustrating the format of a high frequency digital signal
  • FIG. 4 is a schematic block diagram of an apparatus for providing enhanced audio data streams in accordance with one embodiment of the present invention. detailed description The above and other objects, features and advantages of the present invention will become apparent from
  • FIG. 1 shows a flow chart of a method of providing enhanced audio data streams in accordance with one embodiment of the present invention.
  • the present embodiment will be described in detail below with reference to the accompanying drawings.
  • an original audio data stream is acquired.
  • the original audio data stream is audio data containing content to be played, the frequency of which is usually within the frequency range of the sound audible to the human ear.
  • the original audio data stream can be provided by an internal vendor of the audio content.
  • the original audio data stream can be audio data for the advertisement, audio data for the music, and the like.
  • step S120 a high frequency digital signal that is related to the acquired original audio data is generated.
  • the high frequency digital signal refers to a digital signal whose frequency is in a high frequency band.
  • Figure 2 shows a schematic flow chart for generating a high frequency digital signal.
  • supplementary content related to the content of the original audio data stream is determined.
  • the supplemental content may be information related to the advertisement, such as offer information for the product for which the advertisement is targeted, purchase information, and the like.
  • the supplementary content may be information related to the song or the music piece, such as a singer or performer of the song or music piece, album name, lyrics, and the like.
  • the related information may include a network address in which the supplemental content is stored and/or text information related to the supplemental content.
  • the network address is, for example, a Uniform Resource Locator (URL) indicating the location of the supplemental content.
  • the text information is, for example, a text that briefly describes the supplementary content.
  • the relevant information may also include other information, such as the brand, play area, play type, etc., to which the supplemental content relates.
  • the information generated in step S220 is encoded into a digital signal.
  • the digital signal may include a "total number of frames” field, a “frame serial number” field, a “play area” field, a “brand” field, a “category” field, a “play type” field, a "network address” Field, "Validity” field, "Encoding method” field, "Brief information” field and "school face” field.
  • the "total number of frames” field indicates the total number of frames constituting the digital signal
  • the "frame sequence number” field indicates the position of the frame in the digital signal
  • the "play area” field defines the supplementary content.
  • the geographic area being played; the "Brand” field defines the brand of the product to which the supplemental content relates; the "Classification” field defines the classification of the supplemental content; the "Playback Type” field defines the type of supplemental content being played, such as theater playback, TV broadcast, Internet play, etc.; the "Network Address” field defines the network address of the supplemental content; the “Validity Period” field defines the playback validity period of the supplemental content; the "Encoding Method” field defines the encoding format and content of the digital signal; The ft field defines a brief description of the supplement; the "check” field indicates the type of face, such as the Cyclic Redundancy Check (CRC).
  • CRC Cyclic Redundancy Check
  • the digital signal can include other fields in addition to the fields described above.
  • Fig. 3 shows an example of the format of a digital signal.
  • the digital signal uses two 128-bit frames.
  • the format of the first frame is shown in Fig. 3 (a), where the "total number of frames” field is a 3-bit field, the "frame sequence number” field is a 3-bit field, and the "play area” field is an 18-bit field.
  • the "Brand” field is a 16-bit field
  • the "Classification” field is a 12-bit field
  • the "Playback Type” field is an 8-bit field
  • the "Network Address” field is a 22-bit field
  • the "Validity Period” field is 16-bit.
  • the field, the "Encoding Method” field is a 4-bit field
  • the “Check” field is a 26-bit field.
  • the format of the second frame is shown in Figure 3 (b), where the "total number of frames” field and the “frame sequence number” field are respectively 3-bit fields, and the "brief information” field is a 96-bit field, "checking The field is a 26-bit field.
  • the digital signal may only include the first frame.
  • the value of the "Total Frames" field in the first frame is 1 and the value of the "Frame Sequence Number” field is 0.
  • the digital signal can include a first frame and at least one second frame.
  • the value of the "total number of frames" field in the first frame and each second frame is the sum of the number of the first frame and the second frame.
  • step S240 the number obtained in step S230 is modulated in the high frequency band.
  • Word signal to obtain high frequency digital signals may be a high audio segment or an ultrasonic frequency band.
  • the high frequency band is in the frequency range of 18 kHz to 22 kHz.
  • the modulation of the signal may include amplitude modulation (ASK), frequency modulation (FSK), and phase modulation (PSK) of the signal.
  • ASK amplitude modulation
  • FSK frequency modulation
  • PSK phase modulation
  • frequency modulation or phase modulation can be used.
  • two frequency points can be selected in the high frequency band to represent 0 and 1, respectively, to perform frequency modulation of the digital signal.
  • one frequency point can be selected in the high frequency band, and the two carrier phases of the reverse phase represent 0 and 1, thereby performing phase modulation of the digital signal.
  • step S130 the generated high frequency digital signal and the original audio data stream are synthesized in step S130 to obtain an enhanced audio data stream.
  • the high frequency digital signal and the original audio data stream are synthesized by linear superposition.
  • the original audio data stream is encoded using 16-bit Pulse Code Modulation (PCM).
  • PCM Pulse Code Modulation
  • the amplitude of the synthesized enhanced audio data stream does not exceed the range of 16-bit PCM encoding.
  • the original audio data stream is selected.
  • the selected portion may be part or multiple parts or all of the original audio data stream.
  • one or more high frequency digital signals are combined with the audio data stream of the portion according to the length of each of the selected at least one portion. Specifically, determining the number of high frequency digital signals that can be synthesized with the audio data stream of the portion according to the length of the selected audio data stream and the length of the high frequency digital signal, and then the number of these high numbers The frequency digital signal is synthesized with the audio data stream of the portion.
  • the synthesized enhanced audio data stream can be played and received by the microphone of the terminal device.
  • the terminal device recovers the high frequency digital signal by performing signal sampling, detection and estimation on the received enhanced audio data stream, and then decoding the network address and/or text information of the supplementary content to obtain supplementary content.
  • the network address is used to obtain supplemental content, or text information is displayed on the terminal device.
  • the method for providing enhanced audio data stream of this embodiment passes The high frequency digital signal enables the user receiving the original audio data stream to obtain relevant information in real time.
  • the method of the present embodiment can be used for the provision of audio data streams in, for example, advertisements, movies, television, and the like.
  • FIG. 4 shows a schematic block diagram of an apparatus 400 for providing enhanced audio data streams in accordance with one embodiment of the present invention.
  • the present embodiment will be described in detail below with reference to the drawings, and the description of the same portions as those of the previous embodiment will be appropriately omitted.
  • the apparatus 400 of this embodiment includes: a block 401 that acquires an original audio data stream; a signal generating module 402 that generates a high frequency digital signal that is related to the acquired original audio data; A synthesis module 403 that synthesizes the generated high frequency digital signal from the original audio data stream to obtain an enhanced audio data stream.
  • the determination unit 4021 determines the supplemental content related to the content in the original audio data stream.
  • the supplemental content may be offer information or the like of the product to which the advertisement relates.
  • the information generating unit 4022 generates information related to the supplement, for example, a network address indicating the storage location of the supplementary content, text information briefly describing the supplementary content, etc.
  • the encoding unit 4023 encodes the generated information into a digital signal.
  • the digital signal may comprise a field as previously described, and may use a format as shown in Figure 3.
  • modulation unit 4024 modulates the digital signal in a high frequency band, A high frequency digital signal is obtained.
  • the high frequency band may be in the frequency range of 18 kHz to 22 kHz.
  • Modulation unit 4024 may modulate the digital signal using frequency modulation or phase modulation.
  • the high frequency digital signal is then provided to synthesis module 403.
  • the selection unit 4031 selects at least a portion of the original audio data stream. Specifically, the selection unit 4031 may select a part or portions of the original audio data stream as a portion to be synthesized.
  • the synthesizing unit 4032 synthesizes one or more high frequency digital signals with the portion according to the length of each of the selected at least one portion.
  • the synthesizing unit 4032 for each of the selected audio data streams, determining a high frequency digital signal synthesizable with the audio data stream of the portion according to the length of the audio data stream of the portion and the length of the high frequency digital signal The number is then combined with the high frequency digital signal of the number and the audio data stream of the portion. Combined The unit 4032 can synthesize the high frequency digital signal and the original audio data stream by linear superposition.
  • the apparatus 400 for providing enhanced audio data streams of the present embodiment is operationally capable of implementing the method of providing enhanced audio data streams of the embodiment illustrated in FIG.
  • the apparatus 400 of the present embodiment can be used for the provision of audio data streams, such as in advertisements, movies, and the like.
  • the methods of the above disclosed embodiments can be implemented in software, hardware, or a combination of software and hardware.
  • the hardware part can be implemented using dedicated logic.
  • the apparatus for providing enhanced audio data streams and its various components in the above embodiments may be comprised of semiconductors such as very large scale integrated circuits or gate arrays, such as logic chips, transistors, etc., or such as field programmable gate arrays, programmable logic
  • the hardware circuit implementation of the programmable hardware device such as a device can also be implemented by software executed by various types of processors, or by a combination of the above hardware circuits and software.
  • the software portion can be stored in memory and executed by a suitable instruction execution system, such as a microprocessor, personal computer (PC) or mainframe.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

一种提供增强音频数据流的方法及其装置。该方法包括:获取原始音频数据流(S110),生成与所述原始音频数据流相关的高频数字信号(S120),并合成所述高频数字信号与所述原始音频数据流(S130),以获得所述增强音频数据流。

Description

提供增强音频数据流的方法及装置 技术领域
本发明涉及音频技术, 更具体地, 涉及提供增强音频数据流的方法及 装置。 背景技术
人耳可以听到的声音的频率范围是 20赫兹(Hz )到 20000赫兹, 其 中, 频率范围 2500Hz到 20kHz是高音频段。 随着年龄的增加, 人耳可以 听到高音频段的声音的能力逐渐减弱甚至丧失。 成年人一般能听到的声音 的上限大约为 16kHz。
频率超过 20kHz的声音被称为超声波。超声波在空气中传输的特点是 衰减大、 在封闭空间中的多途效应明显。 处于低频段的超声波的反射能力 较强, 而透射能力较弱。
通常, 用于播放的音频数据, 例如广告、 电影、 电视、 节目、 音乐中 的音频数据等, 其频率并不超过 16kHz, 以方便用户收听。
随着技术的发展, 大多数可用于播放音频数据的设备都能够播放频率 接近或超过 20kHz的高音频段的声音或超声波。 此外, 也出现了能够接收 高音频段的声音或超声波的接收装置, 例如基于声学微机电技术的硅晶麦 克风。 目前, 大多数终端设备都采用这种接收装置, 从而大大提高终端设 备的声音感知能力。
如前所述, 音频数据通常仅包含在人耳可听见的频率范围中的音频数 据。 如果接收音频数据的用户想要进一步知道与所播放的音频数据有关的 信息, 则只能通过其它途径获取, 而不能实时地获取。 对于提供音频数据 的提供商来说, 也无法知道用户的需求。 发明内容
本发明正是鉴于上述的技术问题而提出的, 其目的在于提供一种提供 增强音频数据流的方法及装置, 其能够使用户实时地获取与原始音频数据 流相关的信息。
根据本发明的一个方面, 提供了一种提供增强音频数据流的方法, 其 包括: 获取原始音频数据流; 生成与所述原始音频数据流相关的高频数字 信号; 以及合成所述高频数字信号与所述原始音频数据流, 以获得所述增 强音频数据流。
在一个实施例中,生成与原始音频数据流相关的高频数字信号可包括: 确定与原始音频数据流的内容相关的补充内容; 生成与补充内 关的信 息; 将所述信息编码成数字信号; 以及在高频段中调制数字信号, 以获得 高频数字信号。
在一个实施例中, 合成高频数字信号与原始音频数据流可包括: 选择 原始音频数据流的至少一个部分; 以及按照所述至少一个部分的长度, 将 一个或多个高频数字信号与所述至少一个部分进行合成。
根据本发明的另一个方面, 提供了一种提供增强音频数据流的装置, 其包括: 获取模块, 其被配置为获取原始音频数据流; 生成模块, 其被配 置为生成与所述原始音频数据流相关的高频数字信号; 以及合成模块, 其 被配置为合成所述高频数字信号与所述原始音频数据流, 以获得所述增强 音频数据流。 附图说明
图 1是根据本发明的一个实施例的提供增强音频数据流的方法的流程 图;
图 2是图 1的实施例中生成高频数字信号的示意性¾½图;
图 3是示例性的说明高频数字信号的格式的示意图;
图 4是根据本发明的一个实施例的提供增强音频数据流的装置的示意 性方框图。 具体实施方式 相信通过以下结合附图对本发明的具体实施方式的详细描述, 本发明 的上述和其它目的、 特征和优点将更加清楚。
图 1示出了根据本发明的一个实施例的提供增强音频数据流的方法的 流程图。 下面结合附图, 对本实施例进行详细描述。
如图 1所示, 在步骤 S110, 获取原始音频数据流。 在本实施例中, 原 始音频数据流是包含将要播放的内容的音频数据, 其频率通常在人耳可听 见的声音的频率范围内。原始音频数据流可由音频内容的内 供商提供。 在某些实施例中, 原始音频数据流可以是广告的音频数据、 音乐的音频数 据等。
接着, 在步骤 S120, 生成与所获取的原始音频数据 目关的高频数字 信号。 在本实施例中, 高频数字信号是指频率在高频段内的数字信号。 图 2示出了生成高频数字信号的示意性流程图。
如图 2所示, 首先, 在步骤 S210, 确定与原始音频数据流的内容相关 的补充内容。 例如, 如果原始音频数据流是关于某个广告的音频数据, 则 补充内容可以是与该广告相关的信息, 例如该广告所针对的产品的优惠信 息、购买信息等。如果原始音频数据流是关于某个歌曲或乐曲的音频数据, 则补充内容可以是与歌曲或乐曲相关的信息, 例如歌曲或乐曲的演唱者或 演奏者、 专辑名称、 歌词等。
然后, 在步骤 S220, 基于所确定的补充内容, 生成相关的信息。 在本 实施例中,相关的信息可以包括存储有补充内容的网络地址和 /或与补充内 容有关的文本信息。 网络地址例如是指示补充内容的位置的统一资源定位 符(URL ) 。 文本信息例如是简要描述补充内容的文本。 当然, 本领域的 普通技术人员能够理解, 相关的信息还可以包括其它信息, 例如补充内容 涉及的品牌、 播放地区、 播放类型等。
在步骤 S230, 将在步骤 S220中生成的信息编码成数字信号。 在一个 实施例中, 数字信号可包括 "总帧数" 字段、 "帧序列号" 字段、 "播放 地区" 字段、 "品牌" 字段、 "分类" 字段、 "播放类型" 字段、 "网络 地址" 字段、 "有效期" 字段、 "编码方式" 字段、 "简要信息" 字段和 "校臉" 字段。 在该数字信号中, "总帧数" 字段可表明构成数字信号的 帧的总个数; "帧序列号" 字段可表明帧在数字信号中的位置; "播放地 区" 字段可定义补充内容可被播放的地理区域; "品牌" 字段可定义补充 内容所涉及的产品的品牌; "分类" 字段可定义补充内容的分类; "播放 类型" 字段可定义补充内容被播放的类型, 例如影院播放、 电视播放、 因 特网播放等; "网络地址" 字段可定义补充内容的网络地址; "有效期" 字段可定义补充内容的播放有效期; "编码方式" 字段可定义数字信号的 编码格式和内容; "简^ ft息" 字段可定义补充内容的简要描述; "校验" 字段可指示校臉的类型, 例如循环冗余检验 ( CRC ) 。
对于本领域的普通技术人员来说, 容易知道数字信号除了包括上述的 字段外, 还可以包括其它字段。
图 3示出了数字信号的格式的一个实例。 在该例子中, 数字信号使用 了两个 128位的帧。 第一帧的格式如图 3 ( a )所示, 其中, "总帧数" 字 段是 3位的字段, "帧序列号" 字段是 3位的字段, "播放地区" 字段是 18位的字段, "品牌"字段是 16位的字段, "分类"字段是 12位的字段, "播放类型" 字段是 8位的字段, "网络地址" 字段是 22位的字段, "有 效期" 字段是 16位的字段, "编码方式" 字段是 4位的字段, "校验" 字 段是 26位的字段。 第二帧的格式如图 3 ( b )所示, 其中, "总帧数" 字 段和 "帧序列号"字段分别是 3位的字段, "简要信息"字段是 96位的字 段, "校验" 字段是 26位的字段。
在一个实施方式中, 数字信号可以仅包括第一帧。 在这种情况下, 第 一帧中的 "总帧数" 字段的值为 1 , "帧序列号" 字段的值为 0。
在另一个实施方式中, 数字信号可以包括第一帧和至少一个第二帧。 在这种情况下, 第一帧和每个第二帧中的 "总帧数" 字段的值为第一帧和 第二帧的个数之和。
虽然以上给出了数字信号的格式的一个例子, 但本领域的普通技术人 员能够知道, 数字信号也可以使用其它格式。
返回到图 2, 在步骤 S240, 在高频段中调制在步驟 S230中获得的数 字信号, 以获得高频数字信号。 在本实施例中, 高频段可以是高音频段或 超声波频段。 优选地, 高频段是 18kHz到 22kHz的频率范围。
一般地,信号的调制可包括信号的幅度调制( ASK )、频率调制( FSK ) 和相位调制 (PSK )等。 在本实施例中, 可使用频率调制或相位调制。
在使用频率调制的情况下, 可在高频段中选择两个频率点以分别代表 0和 1, 从而进行数字信号的频率调制。
在使用相位调制的情况下, 可在高频段中选择一个频率点, 通 it^反 的两个载波相位代表 0和 1, 从而进行数字信号的相位调制。
返回到图 1, 在生成了高频数字信号后, 在步骤 S130, 合成所生成的 高频数字信号与原始音频数据流, 以获得增强音频数据流。
在本实施例中,通过线性叠加来合成高频数字信号和原始音频数据流。 通常, 原始音频数据流采用 16位脉冲编码调制 (PCM )进行编码。 在进 行合成时, 为了不引入新的噪声, 合成后的增强音频数据流的幅度不超过 16位 PCM编码的范围。
在该步骤中, 首先, 选择原始音频数据流的至少一个部分。 所选择的 部分可以是原始音频数据流的一部分或者多个部分或者全部。 然后, 按照 所选择的至少一个部分的每一个的长度, 将一个或多个高频数字信号与该 部分的音频数据流进行合成。 具体地, 根据所选择的每个部分的音频数据 流的长度以及高频数字信号的长度, 确定可与该部分的音频数据流合成的 高频数字信号的个数, 然后将这些个数的高频数字信号与该部分的音频数 据流进行合成。
所合成的增强音频数据流可被播放, 并可被终端设备的麦克风接收。 终端设备通过对所接收的增强音频数据流进行信号采样、 检测和估计, 恢 复高频数字信号, 然后对其进行解码以获得补充内容的网络地址和 /或文本 信息等, 从而能够访问补充内容的网络地址以获得补充内容, 或者在终端 设备上显示文本信息。
通过以上描述可以看出, 本实施例的提供增强音频数据流的方法通过 高频数字信号,能够使接收原始音频数据流的用户实时地获取相关的信息。 本实施例的方法可用于例如广告、 电影、 电视等中的音频数据流的提供。
在同一个发明构思下, 图 4示出了根据本发明的一个实施例的提供增 强音频数据流的装置 400的示意性方框图。 以下结合附图, 对本实施例进 行详细说明, 其中对于与前面实施例相同的部分, 适当省略其说明。
如图 4所示, 本实施例的装置 400包括: 获^ ^块 401 , 其获取原始 音频数据流; 信号生成模块 402, 其生成与所获取的原始音频数据 目关 的高频数字信号; 以及合成模块 403, 其合成所生成的高频数字信号与原 始音频数据流, 以获得增强音频数据流。
在本实施例的装置 400中, 在获取模块 401获取了原始音频数据流之 后,在信号生成模块 402中,确定单元 4021确定与原始音频数据流中的内 容相关的补充内容。 如前所述, 当原始音频数据流中的内容是广告时, 则 补充内容可以是该广告所涉及的产品的优惠信息等。 接着, 信息生成单元 4022生成与补充内^"关的信息, 例如, 指示补充内容的存储位置的网络 地址、简要描述补充内容的文本信息等。编码单元 4023将所生成的信息编 码成数字信号。 在一个实施例中, 数字信号可以包含如前所述的字段, 并 可使用如图 3所示的格式。在编码单元 4023编码生成数字信号后,调制单 元 4024在高频段中调制该数字信号, 以获得高频数字信号。优选地, 高频 段可以是 18kHz到 22kHZ的频率范围。调制单元 4024可以使用频率调制 或相位调制来对数字信号进行调制。
然后, 高频数字信号被提供给合成模块 403。 在合成模块 403中, 选 择单元 4031选择原始音频数据流的至少一个部分。具体地,选择单元 4031 可以选择原始音频数据流的一部分或多个部分或全部, 作为将被合成的部 分。接着, 合成单元 4032按照所选择的至少一个部分的每一个的长度, 将 一个或多个高频数字信号与该部分进行合成。在合成单元 4032中,对于所 选择的每一个部分的音频数据流, 根据该部分的音频数据流的长度和高频 数字信号的长度, 确定可与该部分的音频数据流合成的高频数字信号的个 数, 然后将这些个数的高频数字信号与该部分的音频数据流进行合成。 合 成单元 4032可通过线性叠加来合成高频数字信号和原始音频数据流。
应当指出, 本实施例的提供增强音频数据流的装置 400在操作上能够 实现图 1所示的实施例的提供增强音频数据流的方法。本实施例的装置 400 可用于例如广告、 电影等中的音频数据流的提供。
以上所公开的实施例的方法可以在软件、 硬件、 或软件和硬件的结合 中实现。 硬件部分可以利用专用逻辑来实现。 例如, 上述实施例中的提供 增强音频数据流的装置及其各个组成部分可以由诸如超大规模集成电路或 门阵列、诸如逻辑芯片、 晶体管等的半导体、或者诸如现场可编程门阵列、 可编程逻辑设备等的可编程硬件设备的硬件电路实现, 也可以用由各种类 型的处理器执行的软件实现, 也可以由上述硬件电路和软件的结合实现。 软件部分可以存储在存储器中, 由适当的指令执行系统, 例如微处理器、 个人计算机 ( PC )或大型机来执行。
以上虽然通过示例性的实施例详细描述了本发明的提供增强音频数据 流的方法及装置, 但是以上这些实施例并不是穷举的, 本领域技术人员可 以在本发明的精神和范围内实现各种变化和修改。 因此, 本发明并不限于 这些实施例, 本发明的范围仅由所附的权利要求限定。

Claims

权利要求
1. 一种提供增强音频数据流的方法, 包括:
获取原始音频数据流;
生成与所述原始音频数据流相关的高频数字信号; 以及
合成所述高频数字信号与所述原始音频数据流, 以获得所述增强音频 数据流。
2.根据权利要求 1所述的方法, 其中, 生成与所述原始音频数据流相 关的高频数字信号包括:
确定与所述原始音频数据流的内^目关的补充内容;
生成与所述补充内^ ^关的信息;
将所述信息编码成数字信号; 以及
在高频段中调制所述数字信号, 以获得所述高频数字信号。
3.根据权利要求 2所述的方法, 其中, 所述信息包括存储有所述补充 内容的网络地址以及与所述补充内 关的文本信息中的至少一个。
4.根据权利要求 2所述的方法,其中,所述高频段是从 18kHz到 22kHz 的频率范围。
5.根据权利要求 1所述的方法, 其中, 合成所述高频数字信号与所述 原始音频数据流包括:
选择所述原始音频数据流的至少一个部分; 以及
按照所述至少一个部分的长度, 将一个或多个所述高频数字信号与所 述至少一个部分进行合成。
6. 一种提供增强音频数据流的装置, 包括:
获取模块, 其被配置为获取原始音频数据流;
信号生成模块, 其被配置为生成与所述原始音频数据 目关的高频数 字信号; 以及
合成模块,其被配置为合成所述高频数字信号与所述原始音频数据流, 以获得所述增强音频数据流。
7.根据权利要求 6所述的装置, 其中, 所述信号生成模块包括: 确定单元, 其被配置为确定与所述原始音频数据流的内容相关的补充 内容;
信息生成单元, 其被配置为生成与所述补充内容有关的信息; 编码单元, 其被配置为将所述信息编码成数字信号; 以及
调制单元, 其被配置为在高频段中调制所述数字信号, 以获得所述高 频数字信号。
8.根据权利要求 7所述的装置, 其中, 所述信息包括存储所述补充内 容的网络地址以及与所述补充内^ "关的文本信息中的至少一个。
9.根据权利要求 7所述的装置,其中,所述高频段是从 18kHz到 22kHz 的频率范围。
10.根据权利要求 6所述的装置, 其中, 所述合成模块包括:
选择单元, 其被配置为选择所述原始音频数据流的至少一个部分; 以 及
合成单元, 其被配置为按照所述至少一个部分的长度, 将一个或多个 所述高频数字信号与所述至少一个部分进行合成。
PCT/CN2012/088005 2012-12-31 2012-12-31 提供增强音频数据流的方法及装置 WO2014101169A1 (zh)

Priority Applications (3)

Application Number Priority Date Filing Date Title
CN201280077909.6A CN104871243A (zh) 2012-12-31 2012-12-31 提供增强音频数据流的方法及装置
PCT/CN2012/088005 WO2014101169A1 (zh) 2012-12-31 2012-12-31 提供增强音频数据流的方法及装置
HK15112092.5A HK1214025A1 (zh) 2012-12-31 2015-12-08 提供增强音頻數據流的方法及裝置

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2012/088005 WO2014101169A1 (zh) 2012-12-31 2012-12-31 提供增强音频数据流的方法及装置

Publications (1)

Publication Number Publication Date
WO2014101169A1 true WO2014101169A1 (zh) 2014-07-03

Family

ID=51019776

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2012/088005 WO2014101169A1 (zh) 2012-12-31 2012-12-31 提供增强音频数据流的方法及装置

Country Status (3)

Country Link
CN (1) CN104871243A (zh)
HK (1) HK1214025A1 (zh)
WO (1) WO2014101169A1 (zh)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1249517A (zh) * 1998-09-29 2000-04-05 国际商业机器公司 用于在音频数据中插入附加信息的系统
JP2001296863A (ja) * 1996-07-02 2001-10-26 Yamaha Corp 電子情報処理方法及び装置並びに記録媒体
JP2006098717A (ja) * 2004-09-29 2006-04-13 Denon Ltd デジタル信号処理装置
JP2006243398A (ja) * 2005-03-03 2006-09-14 Dainippon Printing Co Ltd 音響信号の合成装置および検索装置
JP2008225232A (ja) * 2007-03-14 2008-09-25 Crimson Technology Inc 信号処理方法および音声コンテンツ配信方法
CN101652810A (zh) * 2006-09-29 2010-02-17 Lg电子株式会社 用于处理混合信号的装置及其方法
CN101682756A (zh) * 2007-06-18 2010-03-24 高通股份有限公司 用于增强无线电节目的装置和方法
CN103137134A (zh) * 2011-11-28 2013-06-05 鸿富锦精密工业(深圳)有限公司 音频设备及音频信号的水印信息加载方法

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2261896B1 (en) * 2008-07-29 2017-12-06 Yamaha Corporation Performance-related information output device, system provided with performance-related information output device, and electronic musical instrument
JP5782677B2 (ja) * 2010-03-31 2015-09-24 ヤマハ株式会社 コンテンツ再生装置および音声処理システム

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001296863A (ja) * 1996-07-02 2001-10-26 Yamaha Corp 電子情報処理方法及び装置並びに記録媒体
CN1249517A (zh) * 1998-09-29 2000-04-05 国际商业机器公司 用于在音频数据中插入附加信息的系统
JP2006098717A (ja) * 2004-09-29 2006-04-13 Denon Ltd デジタル信号処理装置
JP2006243398A (ja) * 2005-03-03 2006-09-14 Dainippon Printing Co Ltd 音響信号の合成装置および検索装置
CN101652810A (zh) * 2006-09-29 2010-02-17 Lg电子株式会社 用于处理混合信号的装置及其方法
JP2008225232A (ja) * 2007-03-14 2008-09-25 Crimson Technology Inc 信号処理方法および音声コンテンツ配信方法
CN101682756A (zh) * 2007-06-18 2010-03-24 高通股份有限公司 用于增强无线电节目的装置和方法
CN103137134A (zh) * 2011-11-28 2013-06-05 鸿富锦精密工业(深圳)有限公司 音频设备及音频信号的水印信息加载方法

Also Published As

Publication number Publication date
CN104871243A (zh) 2015-08-26
HK1214025A1 (zh) 2016-07-15

Similar Documents

Publication Publication Date Title
JP3822224B1 (ja) 情報提供システム
US9344802B2 (en) Information providing system
JP4528365B1 (ja) 発信装置
JP5782677B2 (ja) コンテンツ再生装置および音声処理システム
CN102169705B (zh) 音调再现装置和方法
US6856990B2 (en) Network dedication system
US20130301392A1 (en) Methods and apparatuses for communication of audio tokens
JP3834579B1 (ja) 情報提供システム
JP2007164659A (ja) 音楽情報を利用した情報配信システム及び情報配信方法
JP4295781B2 (ja) 情報提供システム
JP5953687B2 (ja) 情報処理装置及びプログラム
JP2013228755A (ja) コンテンツ再生装置およびコンテンツ再生プログラム
CN104038772B (zh) 生成铃声文件的方法及装置
JP2007195105A (ja) 音情報を利用した携帯情報端末による情報取得支援システム及び情報取得方法
US20030033385A1 (en) System and method for utilizing broadcast synchronized data triggers
US20100089223A1 (en) Microphone set providing audio and text data
WO2014101169A1 (zh) 提供增强音频数据流的方法及装置
WO2002058053A1 (en) Encoding method and decoding method for digital voice data
JP2006195061A (ja) 音響信号に対する情報の埋め込み装置、音響信号からの情報の抽出装置および音響信号再生装置
WO2022143530A1 (zh) 音频处理方法、装置、计算机设备及存储介质
EP3391372B1 (en) Improved method, apparatus and system for embedding data within a data stream
WO2019052121A1 (zh) 一种音乐识别系统、装置及音乐管理服务器和方法
KR20160010843A (ko) 진동 기능을 제공하는 오디오북 재생 방법, 장치 및 컴퓨터 판독 가능 매체
JP6353402B2 (ja) 音響電子透かしシステム、電子透かし埋め込み装置、電子透かし読み取り装置、その方法及びプログラム
KR100709756B1 (ko) 통신망을 통한 멀티미디어 콘텐츠 제공 시스템, 제공 방법및 멀티미디어 콘텐츠 구매 장치

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 12891225

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 12891225

Country of ref document: EP

Kind code of ref document: A1