CN107112025A - Systems and methods for restoration of speech components - Google Patents

Systems and methods for restoration of speech components

Info

Publication number
CN107112025A
Authority
CN
China
Prior art keywords
audio signal
iteration
distortion
frequency area
audio
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201580060446.6A
Other languages
English (en)
Inventor
C. Avendano
J. Woodruff
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Knowles Electronics LLC
Original Assignee
Knowles Electronics LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Knowles Electronics LLC
Publication of CN107112025A

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/038 Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
    • G10L25/30 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks

Abstract

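A method for restoring distorted speech components of an audio signal distorted by noise reduction or noise cancellation includes determining distorted frequency regions and undistorted frequency regions in the audio signal. The distorted frequency regions include regions of the audio signal in which speech distortion is present. Iterations are performed using a model to refine predictions of the audio signal at the distorted frequency regions. The model is configured to modify the audio signal and may include a deep neural network trained using spectral envelopes of clean or undamaged audio signals. Before each iteration, the audio signal at the undistorted frequency regions is restored to the values of the audio signal prior to the first iteration, while the audio signal at the distorted frequency regions is refined starting from zero at the first iteration. The iterations end when discrepancies of the audio signal at the undistorted frequency regions meet pre-defined criteria.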

Description

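Systems and methods for restoration of speech components
Cross-Reference to Related Applications
This application claims the benefit of U.S. Provisional Application No. 62/049,988, filed on September 12, 2014. The subject matter of the aforementioned application is incorporated herein by reference for all purposes.
Technical Field
The present application relates generally to audio processing and, more specifically, to systems and methods for restoring distorted speech components of a noise-suppressed audio signal.
Background
Noise reduction is widely used in audio processing systems to suppress or cancel unwanted noise in audio signals used to transmit speech. However, after noise cancellation and/or suppression, speech that is intertwined with noise tends to be overly attenuated or eliminated altogether by the noise reduction system.
There are models of the brain that explain how sounds are restored using an internal representation that perceptually replaces the input via a feedback mechanism. One exemplary model, called the convergence-divergence zone (CDZ) model of the brain, has been described in neuroscience and, among other things, attempts to explain the spectral completion and phonemic restoration phenomena found in human speech perception.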
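Summary
This summary is provided to introduce a selection of concepts in a simplified form that are further described below in the Detailed Description. This summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used as an aid in determining the scope of the claimed subject matter.
Systems and methods for restoring distorted speech components of an audio signal are provided. An example method includes determining distorted frequency regions and undistorted frequency regions in the audio signal. The distorted frequency regions include regions of the audio signal in which speech distortion is present. The method includes performing one or more iterations using a model for refining predictions of the audio signal at the distorted frequency regions. The model can be configured to modify the audio signal.
In some embodiments, the audio signal includes a noise-suppressed audio signal obtained by at least one of noise reduction or noise cancellation of an acoustic signal including speech. The acoustic signal is attenuated or eliminated at the distorted frequency regions.
In some embodiments, the model for refining predictions of the audio signal at the distorted frequency regions includes a deep neural network trained using spectral envelopes of clean audio signals or undamaged audio signals. The refined predictions can be used for restoring speech components in the distorted frequency regions.
In some embodiments, the audio signal at the distorted frequency regions is set to zero before the first iteration. Prior to performing each of the iterations, the audio signal at the undistorted frequency regions is restored to the values it had before the first iteration.
In some embodiments, the method further includes comparing the audio signal at the undistorted frequency regions before and after each of the iterations to determine discrepancies. In certain embodiments, the method allows ending the one or more iterations if the discrepancies meet pre-determined criteria. The pre-determined criteria can be defined by lower and upper bounds of energies of the audio signal.
According to another example embodiment of the present disclosure, the steps of the method for restoring distorted speech components of an audio signal are stored on a non-transitory machine-readable medium comprising instructions which, when implemented by one or more processors, perform the recited steps.
Other example embodiments and aspects of the disclosure will become apparent from the following description taken in conjunction with the accompanying drawings.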
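Brief Description of the Drawings
Embodiments are illustrated by way of example and not limitation in the figures of the accompanying drawings, in which like references indicate similar elements.
FIG. 1 is a block diagram illustrating an environment in which the present technology may be practiced.
FIG. 2 is a block diagram illustrating an audio device, according to an example embodiment.
FIG. 3 is a block diagram illustrating modules of an audio processing system, according to an example embodiment.
FIG. 4 is a flow chart illustrating a method for restoration of speech components of an audio signal, according to an example embodiment.
FIG. 5 is a computer system which can be used to implement methods of the present technology, according to an example embodiment.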
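Detailed Description
The technology disclosed herein relates to systems and methods for restoring distorted speech components of an audio signal. Embodiments of the present technology may be practiced with any audio device configured to receive and/or provide audio such as, but not limited to, cellular phones, wearables, phone handsets, headsets, and conferencing systems. It should be understood that while some embodiments of the present technology will be described in reference to operations of a cellular phone, the present technology may be practiced with any audio device.
Audio devices can include radio frequency (RF) receivers, transmitters, and transceivers, wired and/or wireless telecommunications and/or networking devices, amplifiers, audio and/or video players, encoders, decoders, speakers, inputs, outputs, storage devices, and user input devices. Audio devices may include input devices such as buttons, switches, keys, keyboards, trackballs, sliders, touchscreens, one or more microphones, gyroscopes, accelerometers, global positioning system (GPS) receivers, and the like. Audio devices may include output devices such as LED indicators, video displays, touchscreens, speakers, and the like. In some embodiments, mobile devices include wearables and hand-held devices, such as wired and/or wireless remote controls, notebook computers, tablet computers, phablets, smart phones, personal digital assistants, media players, mobile telephones, and the like.
In various embodiments, audio devices can be operated in stationary and portable environments. Stationary environments can include residential and commercial buildings or structures, and the like. For example, stationary embodiments can include living rooms, bedrooms, home theaters, conference rooms, auditoriums, business premises, and the like. Portable environments can include moving vehicles, moving persons, other transportation means, and the like.
According to an example embodiment, a method for restoring distorted speech components of an audio signal includes determining distorted frequency regions and undistorted frequency regions in the audio signal. The distorted frequency regions include regions of the audio signal in which speech distortion is present. The method includes performing one or more iterations using a model for refining predictions of the audio signal at the distorted frequency regions. The model can be configured to modify the audio signal.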
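Referring now to FIG. 1, an environment 100 is shown in which a method for restoring distorted speech components of an audio signal can be practiced. The example environment 100 can include an audio device 104 operable at least to receive an audio signal. The audio device 104 is further operable to process and/or record/store the received audio signal.
In some embodiments, the audio device 104 includes one or more acoustic sensors, for example, microphones. In the example of FIG. 1, the audio device 104 includes a primary microphone (M1) 106 and a secondary microphone (M2) 108. In various embodiments, the microphones 106 and 108 are used to detect both acoustic audio signals (for example, verbal communication from a user 102) and noise 110. The verbal communication can include keywords, speech, singing, and the like.
Noise 110 is unwanted sound present in the environment 100 which can be detected by, for example, sensors such as microphones 106 and 108. In stationary environments, noise sources can include street noise, ambient noise, sounds from a mobile device (such as audio), speech from entities other than the intended speaker, and the like. Noise 110 may include reverberations and echoes. Mobile environments can encounter certain kinds of noise that arise from their operation and the environments in which they operate, for example, road, track, tire/wheel, fan, wiper blade, engine, exhaust, entertainment system, communications system, competing speakers, wind, rain, waves, other vehicles, and exterior noise. Acoustic signals detected by the microphones 106 and 108 can be used to separate desired speech from the noise 110.
In some embodiments, the audio device 104 is connected to a cloud-based computing resource 160 (also referred to as a computing cloud). In some embodiments, the computing cloud 160 includes one or more server farms/clusters comprising a collection of computer servers and is co-located with network switches and/or routers. The computing cloud 160 is operable to deliver one or more services over a network (e.g., the Internet, a mobile phone (cell phone) network, and the like). In certain embodiments, at least part of the processing of the audio signal is performed remotely in the computing cloud 160. The audio device 104 is operable to send data (e.g., a recorded acoustic signal) to the computing cloud 160, request computing services, and receive the results of the computation.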
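FIG. 2 is a block diagram of an example audio device 104. As shown, the audio device 104 includes a receiver 200, a processor 202, the primary microphone 106, the secondary microphone 108, an audio processing system 210, and an output device 206. The audio device 104 may include further or different components as needed for its operation. Similarly, the audio device 104 may include fewer components that perform functions similar or equivalent to those depicted in FIG. 2. For example, the audio device 104 includes a single microphone in some embodiments and two or more microphones in other embodiments.
In various embodiments, the receiver 200 can be configured to communicate with a network such as the Internet, a Wide Area Network (WAN), a Local Area Network (LAN), a cellular network, and so forth, to receive an audio signal. The received audio signal is then forwarded to the audio processing system 210.
In various embodiments, the processor 202 includes hardware and/or software and is operable to execute instructions stored in a memory (not illustrated in FIG. 2). The exemplary processor 202 uses floating point operations, complex operations, and other operations, including noise suppression and restoration of distorted speech components in an audio signal.
The audio processing system 210 can be configured to receive acoustic signals from an acoustic source via at least one microphone (e.g., the primary microphone 106 and the secondary microphone 108 in the examples of FIG. 1 and FIG. 2) and to process the acoustic signal components. The microphones 106 and 108 in the example system are spaced a distance apart such that acoustic waves impinging on the device from certain directions exhibit different energy levels at the two or more microphones. After reception by the microphones 106 and 108, the acoustic signals can be converted into electric signals. These electric signals can, in turn, be converted by an analog-to-digital converter (not shown) into digital signals for processing in accordance with some embodiments.
In various embodiments, where the microphones 106 and 108 are closely spaced omni-directional microphones (e.g., 1 to 2 cm apart), a beamforming technique can be used to simulate forward-facing and backward-facing directional microphone responses. A level difference can be obtained using the simulated forward-facing and backward-facing directional microphones. The level difference can be used to discriminate between speech and noise in, for example, the time-frequency domain, which can be used in noise and/or echo reduction. In some embodiments, some microphones are used mainly to detect speech and other microphones are used mainly to detect noise. In various embodiments, some microphones are used to detect both noise and speech.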
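The level-difference idea can be illustrated with a minimal Python sketch. It assumes a simple delay-and-subtract scheme (one common way to simulate directional responses from two omni-directional signals) and computes a single broadband level difference rather than one per time-frequency cell. The function names, the one-sample delay, and the `eps` guard are hypothetical choices for illustration and do not come from the patent text.

```python
import math

def delay(x, d):
    """Delay a list of samples by d samples, zero-padding the start."""
    return [0.0] * d + x[:len(x) - d]

def level_difference_db(m1, m2, d=1, eps=1e-12):
    """Simulate forward/backward directional signals and compare their energies.

    m1, m2: sample lists from the primary and secondary microphone.
    d: assumed acoustic travel time between the microphones, in samples.
    """
    m1_d, m2_d = delay(m1, d), delay(m2, d)
    forward = [a - b for a, b in zip(m1, m2_d)]   # null toward the back
    backward = [b - a for a, b in zip(m1_d, m2)]  # null toward the front
    e_f = sum(v * v for v in forward)
    e_b = sum(v * v for v in backward)
    # eps keeps the ratio finite when one of the energies is exactly zero.
    return 10.0 * math.log10((e_f + eps) / (e_b + eps))
```

Under these assumptions, a source in front of the device (reaching M1 first) yields a large positive value and a source behind it yields a large negative value, which is the kind of cue a noise reduction stage could use to separate speech from noise.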
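Noise reduction can be carried out by the audio processing system 210 based on inter-microphone level differences, level salience, pitch salience, signal type classification, and speaker identification. In various embodiments, noise reduction includes noise cancellation and/or noise suppression.
In some embodiments, the output device 206 is any device that provides an audio output to a listener (e.g., the acoustic source). For example, the output device 206 may comprise a speaker, a class-D output, an earpiece of a headset, or a handset on the audio device 104.
FIG. 3 is a block diagram showing modules of the audio processing system 210, according to an example embodiment. The audio processing system 210 of FIG. 3 may provide more details for the audio processing system 210 of FIG. 2. The audio processing system 210 includes a frequency analysis module 310, a noise reduction module 320, a speech restoration module 330, and a reconstruction module 340. The input signals may be received from the receiver 200 or from the microphones 106 and 108.
In some embodiments, the audio processing system 210 is operable to receive an audio signal including one or more time-domain input audio signals, depicted in the example of FIG. 3 as coming from the primary microphone (M1) and the secondary microphone (M2) of FIG. 1. The input audio signals are provided to the frequency analysis module 310.
In some embodiments, the frequency analysis module 310 is operable to receive the input audio signals. The frequency analysis module 310 generates frequency sub-bands from the time-domain input audio signals and outputs the frequency sub-band signals. In some embodiments, the frequency analysis module 310 is operable to calculate or determine speech components of the received audio signal, for example, a spectral envelope and excitations.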
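As a rough illustration of what a frequency analysis stage produces, the Python sketch below turns one time-domain frame into sub-band energies, which can serve as a coarse spectral envelope. The patent does not state which analysis is used, so the direct DFT, the equal-width bands, and the names `dft_power` and `subband_energies` are assumptions made only for this example.

```python
import cmath
import math

def dft_power(frame):
    """Power spectrum of one frame via a direct DFT (bins 0 .. N/2)."""
    n = len(frame)
    power = []
    for k in range(n // 2 + 1):
        acc = sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n)
                  for t in range(n))
        power.append(abs(acc) ** 2)
    return power

def subband_energies(power, num_bands):
    """Sum adjacent bins into equal-width sub-bands (at most num_bands of them)."""
    width = math.ceil(len(power) / num_bands)
    return [sum(power[i:i + width]) for i in range(0, len(power), width)]
```

A practical system would more likely use a fast transform or a perceptually spaced filter bank; the direct DFT is chosen here only to keep the sketch self-contained.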
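In various embodiments, the noise reduction module 320 includes multiple modules and receives the audio signal from the frequency analysis module 310. The noise reduction module 320 is operable to perform noise reduction in the audio signal to produce a noise-suppressed signal. In some embodiments, the noise reduction includes subtractive noise cancellation or multiplicative noise suppression. By way of example and not limitation, noise reduction methods are described in U.S. patent application No. 12/215,980, entitled “System and Method for Providing Noise Suppression Utilizing Null Processing Noise Subtraction,” filed June 30, 2008, and in U.S. patent application No. 11/699,732 (U.S. Patent No. 8,194,880), entitled “System and Method for Utilizing Omni-Directional Microphones for Speech Enhancement,” filed January 29, 2007, which are incorporated herein by reference in their entirety for the above purposes. The noise reduction module 320 provides the transformed, noise-suppressed signal to the speech restoration module 330. Because the noise reduction alters the frequency content of the audio signal, one or more speech components in the noise-suppressed signal can be eliminated or excessively attenuated.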
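The difference between the two families of noise reduction can be sketched in a few lines of Python operating on per-sub-band power values. This is a generic textbook-style illustration (power subtraction and a Wiener-style gain with a floor), not the methods of the applications cited above; the function names and the `floor` parameter are hypothetical.

```python
def subtractive(noisy_power, noise_power):
    """Subtract the noise estimate in each sub-band, clamping at zero."""
    return [max(x - n, 0.0) for x, n in zip(noisy_power, noise_power)]

def suppression_gains(noisy_power, noise_power, floor=0.1):
    """Wiener-style gain per sub-band, limited to the range [floor, 1]."""
    gains = []
    for x, n in zip(noisy_power, noise_power):
        gain = 1.0 - n / x if x > 0 else 0.0
        gains.append(max(gain, floor))
    return gains

def multiplicative(noisy_power, gains):
    """Apply the gain mask to the noisy sub-band powers."""
    return [g * x for g, x in zip(gains, noisy_power)]
```

In both variants, sub-bands where the noise estimate is close to or above the observed power end up near zero, even if they also contained speech. Those are the over-attenuated regions that a restoration stage would then try to fill in.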
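In some embodiments, the speech restoration module 330 receives the noise-suppressed signal from the noise reduction module 320. The speech restoration module 330 is configured to restore damaged speech components in the noise-suppressed signal. In some embodiments, the speech restoration module 330 includes a deep neural network (DNN) 315 trained for restoration of speech components in damaged frequency regions. In certain embodiments, the DNN 315 is configured as an autoencoder.
In various embodiments, the DNN 315 is trained using machine learning. The DNN 315 is a feed-forward artificial neural network having more than one layer of hidden units between its inputs and outputs. The DNN 315 may be trained by receiving input features of one or more frames of spectral envelopes of clean audio signals or undamaged audio signals. In the training process, the DNN 315 may extract learned higher-order spectro-temporal features of the clean or undamaged spectral envelopes. In various embodiments, the DNN 315, trained using the spectral envelopes of clean or undamaged audio signals, is used in the speech restoration module 330 to refine predictions of the clean speech components, which makes it particularly suitable for restoring speech components in the distorted frequency regions. By way of example and not limitation, exemplary methods concerning deep neural networks are also described in commonly assigned U.S. patent application No. 14/614,348, entitled “Noise-Robust Multi-Lingual Keyword Spotting with a Deep Neural Network Based Architecture,” filed February 4, 2015, and U.S. patent application No. 14/745,176, entitled “Key Click Suppression,” filed June 9, 2015, which are incorporated herein by reference in their entirety.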
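To make the shape of such a network concrete, the following Python sketch builds a small feed-forward autoencoder whose input and output are both a spectral-envelope frame. The layer sizes, the tanh activation, and the random initial weights are assumptions for illustration; the network here is untrained and only shows the forward pass, not the training procedure or the actual architecture of the DNN 315.

```python
import math
import random

def dense(x, weights, biases, activation):
    """One fully connected layer: activation(W x + b)."""
    return [activation(sum(w * v for w, v in zip(row, x)) + b)
            for row, b in zip(weights, biases)]

def init_layer(n_in, n_out, rng):
    """Small random weights and zero biases (placeholders for trained values)."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)]
               for _ in range(n_out)]
    return weights, [0.0] * n_out

def build_autoencoder(n_bands, hidden_sizes, seed=0):
    """Stack layers n_bands -> hidden sizes -> n_bands."""
    rng = random.Random(seed)
    sizes = [n_bands] + list(hidden_sizes) + [n_bands]
    return [init_layer(a, b, rng) for a, b in zip(sizes, sizes[1:])]

def forward(layers, envelope):
    """Map an envelope frame to a prediction of the same size."""
    x = envelope
    for i, (weights, biases) in enumerate(layers):
        is_last = i == len(layers) - 1
        # Hidden layers use tanh; the output layer is linear.
        x = dense(x, weights, biases, (lambda v: v) if is_last else math.tanh)
    return x
```

For example, `forward(build_autoencoder(8, [16, 4, 16]), frame)` maps an 8-band frame through three hidden layers, with the narrow middle layer acting as the autoencoder bottleneck.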
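During operation, the speech restoration module 330 may assign a zero value to the frequency regions of the noise-suppressed signal where speech distortion is present (the distorted regions). In the example of FIG. 3, the noise-suppressed signal is then provided to the input of the DNN 315 to obtain an output signal. The output signal includes initial predictions for the distorted regions, which might not be very accurate.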
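The zero-assignment step can be sketched as follows in Python. The text does not say how the distorted regions are detected, so this example assumes, purely for illustration, that the per-sub-band suppression gains are available and that a gain below a threshold marks a sub-band as distorted. The names `find_distorted` and `zero_distorted` and the threshold value are hypothetical.

```python
def find_distorted(gains, threshold=0.2):
    """Flag sub-bands whose suppression gain fell below the threshold."""
    return [g < threshold for g in gains]

def zero_distorted(envelope, distorted):
    """Return a copy of the envelope with the flagged sub-bands set to zero."""
    return [0.0 if d else v for v, d in zip(envelope, distorted)]
```

The resulting vector keeps the noise-suppressed values wherever the signal is trusted and carries zeros elsewhere, which is the form of input described for the first pass through the model.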
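In some embodiments, to improve the initial predictions, an iterative feedback mechanism is further applied. The output signal 350 is optionally fed back to the input of the DNN 315 to obtain the next iteration of the output signal, while the initial noise-suppressed signal is kept at the undistorted regions of the output signal. To prevent the system from diverging, the output at the undistorted regions may be compared to the input after each iteration, and upper and lower bounds, based on the energies in the input audio signal, may be applied to the estimated energy at the undistorted frequency regions. In various embodiments, several iterations are applied to improve the accuracy of the predictions until the level of accuracy desired for a particular application is met; for example, no further iterations are performed once the discrepancies of the audio signal at the undistorted regions meet pre-defined criteria for the particular application.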
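The bounding step that guards against divergence might look like the Python sketch below. It assumes non-negative per-sub-band energies and expresses the bounds as multiples of the input energy; the factors `lower` and `upper` and the function name are hypothetical, since the text only states that bounds are derived from the energies of the input audio signal.

```python
def clamp_undistorted(output, reference, distorted, lower=0.5, upper=2.0):
    """Limit the estimated energy at undistorted sub-bands.

    output: energies produced by the model in this iteration.
    reference: energies of the input (noise-suppressed) signal.
    distorted: True where the sub-band is marked as distorted.
    """
    bounded = []
    for out, ref, is_distorted in zip(output, reference, distorted):
        if is_distorted:
            bounded.append(out)  # predictions for distorted bands are left alone
        else:
            bounded.append(min(max(out, lower * ref), upper * ref))
    return bounded
```

With the default factors, an undistorted sub-band whose estimate drifts above twice or below half of its input energy is pulled back to the nearest bound before the next iteration.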
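In some embodiments, the reconstruction module 340 is operable to receive the noise-suppressed signal with restored speech components from the speech restoration module 330 and to reconstruct the restored speech components into a single audio signal.
FIG. 4 is a flow chart showing a method 400 for restoring distorted speech components of an audio signal, according to an example embodiment. The method 400 can be performed using the speech restoration module 330.
The method can commence, in block 402, with determining distorted frequency regions and undistorted frequency regions in the audio signal. The distorted frequency regions are regions in which speech distortion is present due to, for example, noise reduction.
In block 404, the method 400 includes performing one or more iterations using a model to refine predictions of the audio signal at the distorted frequency regions. The model can be configured to modify the audio signal. In some embodiments, the model includes a deep neural network trained with spectral envelopes of clean or undamaged signals. In certain embodiments, the predictions of the audio signal at the distorted frequency regions are set to zero before the first iteration. Prior to performing each of the iterations, the audio signal at the undistorted frequency regions is restored to the values of the audio signal before the first iteration.
In block 406, the method includes comparing the audio signal at the undistorted regions before and after each of the iterations to determine discrepancies.
In block 408, the iterations are stopped if the discrepancies meet pre-defined criteria.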
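Blocks 404 through 408 can be put together in a short Python sketch. A three-point moving average stands in for the trained model so that the example runs on its own; it is not the deep neural network described in the text. The stopping rule used here (the discrepancy changes by less than `tol` between iterations) is one assumed reading of "pre-defined criteria", and all names and default values are hypothetical.

```python
def smooth_model(x):
    """Stand-in for the trained model: 3-point moving average with edge replication."""
    padded = [x[0]] + list(x) + [x[-1]]
    return [(padded[i] + padded[i + 1] + padded[i + 2]) / 3.0
            for i in range(len(x))]

def restore(envelope, distorted, model=smooth_model, tol=1e-3, max_iters=50):
    """Iteratively refine predictions at the distorted sub-bands."""
    # Before the first iteration, the distorted sub-bands are set to zero.
    current = [0.0 if d else v for v, d in zip(envelope, distorted)]
    prev_diff = None
    iteration = 0
    for iteration in range(1, max_iters + 1):
        predicted = model(current)
        # Block 406: discrepancy at undistorted sub-bands, model input vs. output.
        diff = sum(abs(p - c)
                   for p, c, d in zip(predicted, current, distorted) if not d)
        # Keep predictions at distorted sub-bands, restore the original elsewhere.
        current = [p if d else v
                   for p, v, d in zip(predicted, envelope, distorted)]
        # Block 408: stop once the discrepancy has settled.
        if prev_diff is not None and abs(prev_diff - diff) < tol:
            break
        prev_diff = diff
    return current, iteration
```

For a flat envelope with one zeroed sub-band, the prediction for that sub-band moves closer to its neighbors on every pass while the undistorted sub-bands always retain their original values, which mirrors the restore-then-iterate behavior described for method 400.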
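Some example embodiments include speech dynamics. For the speech dynamics, the audio processing system 210 can be provided with multiple consecutive audio signal frames and trained to output the same number of frames. In some embodiments, the inclusion of speech dynamics serves to enforce temporal smoothness and allows restoration of longer distorted regions.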
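One simple way to present several consecutive frames to a model at once is to concatenate them into a single vector and split the model output back into frames. The Python sketch below shows that bookkeeping; the helper names and the sliding-window layout are assumptions for illustration, as the text does not specify how the frames are arranged.

```python
def stack_frames(frames, context):
    """Concatenate each run of `context` consecutive frames into one vector."""
    return [[v for frame in frames[i:i + context] for v in frame]
            for i in range(len(frames) - context + 1)]

def unstack(vector, n_bands):
    """Split a stacked vector back into frames of n_bands values each."""
    return [vector[i:i + n_bands] for i in range(0, len(vector), n_bands)]
```

A model whose input and output both have `context * n_bands` values then sees how the envelope evolves over time, which is what would let it smooth predictions across frames.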
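Various embodiments are used to provide improvements for a number of applications, such as noise suppression, bandwidth extension, speech coding, and speech synthesis. Additionally, the methods and systems are amenable to sensor fusion such that, in some embodiments, they can be extended to include other non-acoustic sensor information. Exemplary methods concerning sensor fusion are also described in commonly assigned U.S. patent application No. 14/548,207, entitled “Method for Modeling User Possession of Mobile Device for User Authentication Framework,” filed November 19, 2014, and U.S. patent application No. 14/331,205, entitled “Selection of System Parameters Based on Non-Acoustic Sensor Information,” filed July 14, 2014, which are incorporated herein by reference in their entirety.
Various methods for restoration of noise-reduced speech are also described in commonly assigned U.S. patent application No. 13/751,907 (U.S. Patent No. 8,615,394), entitled “Restoration of Noise Reduced Speech,” filed January 28, 2013, which is incorporated herein by reference in its entirety.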
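FIG. 5 illustrates an exemplary computer system 500 that may be used to implement some embodiments of the present invention. The computer system 500 of FIG. 5 may be implemented in the context of computing systems, networks, servers, or combinations thereof. The computer system 500 of FIG. 5 includes one or more processor units 510 and main memory 520. Main memory 520 stores, in part, instructions and data for execution by the processor units 510. In this example, main memory 520 stores the executable code when in operation. The computer system 500 of FIG. 5 further includes a mass data storage 530, a portable storage device 540, output devices 550, user input devices 560, a graphics display system 570, and peripheral devices 580.
The components shown in FIG. 5 are depicted as being connected via a single bus 590. The components may be connected through one or more data transport means. The processor unit 510 and main memory 520 are connected via a local microprocessor bus, and the mass data storage 530, peripheral devices 580, portable storage device 540, and graphics display system 570 are connected via one or more input/output (I/O) buses.
Mass data storage 530, which can be implemented with a magnetic disk drive, a solid state drive, or an optical disk drive, is a non-volatile storage device for storing data and instructions for use by the processor unit 510. Mass data storage 530 stores the system software for implementing embodiments of the present disclosure, for purposes of loading that software into main memory 520.
Portable storage device 540 operates in conjunction with a portable non-volatile storage medium, such as a flash drive, floppy disk, compact disk, digital video disc, or Universal Serial Bus (USB) storage device, to input and output data and code to and from the computer system 500 of FIG. 5. The system software for implementing embodiments of the present disclosure is stored on such a portable medium and input to the computer system 500 via the portable storage device 540.
User input devices 560 can provide a portion of a user interface. User input devices 560 may include one or more microphones, an alphanumeric keypad, such as a keyboard, for inputting alphanumeric and other information, or a pointing device, such as a mouse, a trackball, a stylus, or cursor direction keys. User input devices 560 can also include a touchscreen. Additionally, the computer system 500 as shown in FIG. 5 includes output devices 550. Suitable output devices 550 include speakers, printers, network interfaces, and monitors.
Graphics display system 570 includes a liquid crystal display (LCD) or other suitable display device. Graphics display system 570 is configurable to receive textual and graphical information and to process the information for output to the display device.
Peripheral devices 580 may include any type of computer support device that adds functionality to the computer system 500.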
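The components provided in the computer system 500 of FIG. 5 are those typically found in computer systems that may be suitable for use with embodiments of the present disclosure and are intended to represent a broad category of such computer components that are well known in the art. Thus, the computer system 500 of FIG. 5 can be a personal computer (PC), hand-held computer system, telephone, mobile computer system, workstation, tablet, phablet, mobile phone, server, minicomputer, mainframe computer, wearable, or any other computer system. The computer may also include different bus configurations, networked platforms, multi-processor platforms, and the like. Various operating systems may be used, including UNIX, LINUX, WINDOWS, MAC OS, PALM OS, QNX, ANDROID, IOS, CHROME, TIZEN, and other suitable operating systems.
The processing for various embodiments may be implemented in cloud-based software. In some embodiments, the computer system 500 is implemented as a cloud-based computing environment, such as a virtual machine operating within a computing cloud. In other embodiments, the computer system 500 may itself include a cloud-based computing environment, where the functionalities of the computer system 500 are executed in a distributed fashion. Thus, the computer system 500, when configured as a computing cloud, may include pluralities of computing devices in various forms, as described in greater detail below.
In general, a cloud-based computing environment is a resource that typically combines the computational power of a large grouping of processors (such as within web servers) and/or combines the storage capacity of a large grouping of computer memories or storage devices. Systems that provide cloud-based resources may be used exclusively by their owners, or such systems may be accessible to outside users who deploy applications within the computing infrastructure to obtain the benefit of large computational or storage resources.
The cloud may be formed, for example, by a network of web servers that comprise a plurality of computing devices, such as the computer system 500, with each server (or at least a plurality thereof) providing processor and/or storage resources. These servers may manage workloads provided by multiple users (e.g., cloud resource customers or other users). Typically, each user places workload demands upon the cloud that vary in real time, sometimes dramatically. The nature and extent of these variations typically depend on the type of business associated with the user.
The present technology is described above with reference to example embodiments. Therefore, other variations upon the example embodiments are intended to be covered by the present disclosure.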

Claims (20)

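1. A method for restoring distorted speech components of an audio signal, the method comprising:
determining distorted frequency regions and undistorted frequency regions in the audio signal, the distorted frequency regions including regions of the audio signal in which speech distortion is present; and
performing one or more iterations using a model to refine predictions of the audio signal at the distorted frequency regions, the model being configured to modify the audio signal.
2. The method of claim 1, wherein the audio signal includes a noise-suppressed audio signal obtained by at least one of noise reduction or noise cancellation of an acoustic signal including speech.
3. The method of claim 2, wherein the acoustic signal is attenuated or eliminated at the distorted frequency regions.
4. The method of claim 1, wherein the model includes a deep neural network trained using spectral envelopes of clean audio signals or undamaged audio signals.
5. The method of claim 1, wherein the refined predictions are used for restoring speech components in the distorted frequency regions.
6. The method of claim 1, wherein the audio signal at the distorted frequency regions is set to zero before the first of the one or more iterations.
7. The method of claim 1, wherein, prior to performing each of the one or more iterations, the audio signal at the undistorted frequency regions is restored to the values of the audio signal before the first of the one or more iterations.
8. The method of claim 1, further comprising, after performing each of the one or more iterations, comparing the audio signal at the undistorted frequency regions before and after the iteration to determine discrepancies.
9. The method of claim 8, further comprising ending the one or more iterations if the discrepancies meet pre-determined criteria.
10. The method of claim 9, wherein the pre-determined criteria are defined by lower and upper bounds of energies of the audio signal.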
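11. A system for restoring distorted speech components of an audio signal, the system comprising:
at least one processor; and
a memory communicatively coupled with the at least one processor, the memory storing instructions which, when executed by the at least one processor, perform a method comprising:
determining distorted frequency regions and undistorted frequency regions in the audio signal, the distorted frequency regions including regions of the audio signal in which speech distortion is present; and
performing one or more iterations using a model to refine predictions of the audio signal at the distorted frequency regions, the model being configured to modify the audio signal.
12. The system of claim 11, wherein the audio signal includes a noise-suppressed audio signal obtained by at least one of noise reduction or noise cancellation of an acoustic signal including speech.
13. The system of claim 12, wherein the acoustic signal is attenuated or eliminated at the distorted frequency regions.
14. The system of claim 11, wherein the model includes a deep neural network.
15. The system of claim 14, wherein the deep neural network is trained using spectral envelopes of clean audio signals or undamaged audio signals.
16. The system of claim 15, wherein the audio signal at the distorted frequency regions is set to zero before the first of the one or more iterations.
17. The system of claim 11, wherein, prior to performing each of the one or more iterations, the audio signal at the undistorted frequency regions is restored to its values before the first of the one or more iterations.
18. The system of claim 11, further comprising, after performing each of the one or more iterations, comparing the audio signal at the undistorted regions before and after the iteration to determine discrepancies.
19. The system of claim 18, further comprising ending the one or more iterations if the discrepancies meet pre-determined criteria, the pre-determined criteria being defined by lower and upper bounds of energies of the audio signal.
20. A non-transitory computer-readable storage medium having embodied thereon instructions which, when executed by at least one processor, perform steps of a method, the method comprising:
determining distorted frequency regions and undistorted frequency regions in the audio signal, the distorted frequency regions including regions of the audio signal in which speech distortion is present; and
performing one or more iterations using a model to refine predictions of the audio signal at the distorted frequency regions, the model being configured to modify the audio signal.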
CN201580060446.6A 2014-09-12 2015-09-11 Systems and methods for restoration of speech components Pending CN107112025A (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201462049988P 2014-09-12 2014-09-12
US62/049,988 2014-09-12
PCT/US2015/049816 WO2016040885A1 (en) 2014-09-12 2015-09-11 Systems and methods for restoration of speech components

Publications (1)

Publication Number Publication Date
CN107112025A true CN107112025A (zh) 2017-08-29

Family

ID=55455344

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201580060446.6A Pending CN107112025A (zh) 2014-09-12 2015-09-11 Systems and methods for restoration of speech components

Country Status (4)

Country Link
US (1) US9978388B2 (zh)
CN (1) CN107112025A (zh)
DE (1) DE112015004185T5 (zh)
WO (1) WO2016040885A1 (zh)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
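CN109147804A (zh) * 2018-06-05 2019-01-04 Anker Innovations Technology Co., Ltd. Deep-learning-based method and system for processing sound quality characteristics
CN109147805A (zh) * 2018-06-05 2019-01-04 Anker Innovations Technology Co., Ltd. Audio sound quality enhancement based on deep learning
CN112088385A (zh) * 2018-04-23 2020-12-15 Cerence Operating Company Systems and methods for discriminative training of regression deep neural networks
CN112289343A (zh) * 2020-10-28 2021-01-29 Tencent Music Entertainment Technology (Shenzhen) Co., Ltd. Audio repair method and apparatus, electronic device, and computer-readable storage medium
CN112820315A (zh) * 2020-07-13 2021-05-18 Tencent Technology (Shenzhen) Co., Ltd. Audio signal processing method and apparatus, computer device, and storage medium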

Families Citing this family (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9838784B2 (en) 2009-12-02 2017-12-05 Knowles Electronics, Llc Directional audio capture
US9536540B2 (en) 2013-07-19 2017-01-03 Knowles Electronics, Llc Speech signal separation and synthesis based on auditory scene analysis and speech modeling
WO2016040885A1 (en) 2014-09-12 2016-03-17 Audience, Inc. Systems and methods for restoration of speech components
US9820042B1 (en) 2016-05-02 2017-11-14 Knowles Electronics, Llc Stereo separation and directional suppression with omni-directional microphones
US10311219B2 (en) * 2016-06-07 2019-06-04 Vocalzoom Systems Ltd. Device, system, and method of user authentication utilizing an optical microphone
US9984701B2 (en) 2016-06-10 2018-05-29 Apple Inc. Noise detection and removal systems, and related methods
US11205103B2 (en) 2016-12-09 2021-12-21 The Research Foundation for the State University Semisupervised autoencoder for sentiment analysis
KR20180111271A (ko) 2017-03-31 2018-10-11 삼성전자주식회사 신경망 모델을 이용하여 노이즈를 제거하는 방법 및 장치
KR20190037844A (ko) * 2017-09-29 2019-04-08 엘지전자 주식회사 이동 단말기
EP3474280B1 (en) * 2017-10-19 2021-07-07 Goodix Technology (HK) Company Limited Signal processor for speech signal enhancement
CN111164682A (zh) 2017-10-24 2020-05-15 三星电子株式会社 使用机器学习的音频重建方法和设备
US11416742B2 (en) 2017-11-24 2022-08-16 Electronics And Telecommunications Research Institute Audio signal encoding method and apparatus and audio signal decoding method and apparatus using psychoacoustic-based weighted error function
WO2019133765A1 (en) 2017-12-28 2019-07-04 Knowles Electronics, Llc Direction of arrival estimation for multiple audio content streams
US10522167B1 (en) * 2018-02-13 2019-12-31 Amazon Techonlogies, Inc. Multichannel noise cancellation using deep neural network masking
US10672414B2 (en) 2018-04-13 2020-06-02 Microsoft Technology Licensing, Llc Systems, methods, and computer-readable media for improved real-time audio processing
CN109545227B (zh) * 2018-04-28 2023-05-09 Central China Normal University Method and system for automatic speaker gender recognition based on a deep autoencoder network
EP3807878B1 (en) 2018-06-14 2023-12-13 Pindrop Security, Inc. Deep neural network based speech enhancement
US11341983B2 (en) 2018-09-17 2022-05-24 Honeywell International Inc. System and method for audio noise reduction
US11682411B2 (en) * 2021-08-31 2023-06-20 Spotify Ab Wind noise suppresor

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
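CN1653520A (zh) * 2002-05-20 2005-08-10 Microsoft Corporation Method of determining uncertainty associated with noise reduction
CN1910656A (zh) * 2004-01-20 2007-02-07 Dolby Laboratories Licensing Corporation Audio coding based on block grouping
CN101512938A (zh) * 2006-08-01 2009-08-19 DTS (BVI) Limited Neural network filtering techniques for compensating linear and non-linear distortion of an audio transducer
CN101976567A (zh) * 2010-10-28 2011-02-16 Jilin University Speech signal error concealment method
CN102280108A (zh) * 2010-06-10 2011-12-14 Canon Inc. Audio signal processing apparatus and method of controlling the same
CN103430233A (zh) * 2010-12-14 2013-12-04 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Encoder and method for predictively encoding, decoder and method for decoding, system and method for predictively encoding and decoding, and predictively encoded information signal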

Family Cites Families (355)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4025724A (en) 1975-08-12 1977-05-24 Westinghouse Electric Corporation Noise cancellation apparatus
US4137510A (en) 1976-01-22 1979-01-30 Victor Company Of Japan, Ltd. Frequency band dividing filter
ATE41544T1 (de) 1982-08-04 1989-04-15 Trans Data Associates Einrichtung und verfahren zur spracherkennung unter verwendung von vokaltraktmodell.
US4802227A (en) 1987-04-03 1989-01-31 American Telephone And Telegraph Company Noise reduction processing arrangement for microphone arrays
US5115404A (en) 1987-12-23 1992-05-19 Tektronix, Inc. Digital storage oscilloscope with indication of aliased display
US4969203A (en) 1988-01-25 1990-11-06 North American Philips Corporation Multiplicative sieve signal processing
US5182557A (en) 1989-09-20 1993-01-26 Semborg Recrob, Corp. Motorized joystick
US5204906A (en) 1990-02-13 1993-04-20 Matsushita Electric Industrial Co., Ltd. Voice signal processing device
JPH0454100A (ja) 1990-06-22 1992-02-21 Clarion Co Ltd 音声信号補償回路
WO1992005538A1 (en) 1990-09-14 1992-04-02 Chris Todter Noise cancelling systems
GB9107011D0 (en) 1991-04-04 1991-05-22 Gerzon Michael A Illusory sound distance control method
US5224170A (en) 1991-04-15 1993-06-29 Hewlett-Packard Company Time domain compensation for transducer mismatch
US5440751A (en) 1991-06-21 1995-08-08 Compaq Computer Corp. Burst data transfer to single cycle data transfer conversion and strobe signal conversion
CA2080608A1 (en) 1992-01-02 1993-07-03 Nader Amini Bus control logic for computer system having dual bus architecture
EP0559348A3 (en) 1992-03-02 1993-11-03 AT&T Corp. Rate control loop processor for perceptual encoder/decoder
JPH05300419A (ja) 1992-04-16 1993-11-12 Sanyo Electric Co Ltd ビデオカメラ
US5400409A (en) 1992-12-23 1995-03-21 Daimler-Benz Ag Noise-reduction method for noise-affected voice channels
US5524056A (en) 1993-04-13 1996-06-04 Etymotic Research, Inc. Hearing aid having plural microphones and a microphone switching system
DE4316297C1 (de) 1993-05-14 1994-04-07 Fraunhofer Ges Forschung Frequenzanalyseverfahren
JPH07336793A (ja) 1994-06-09 1995-12-22 Matsushita Electric Ind Co Ltd ビデオカメラ用マイクロホン
US5978567A (en) 1994-07-27 1999-11-02 Instant Video Technologies Inc. System for distribution of interactive multimedia and linear programs by enabling program webs which include control scripts to define presentation by client transceiver
US5598505A (en) 1994-09-30 1997-01-28 Apple Computer, Inc. Cepstral correction vector quantizer for speech recognition
GB9501734D0 (en) 1995-01-30 1995-03-22 Neopost Ltd franking apparatus and printing means therefor
US5682463A (en) 1995-02-06 1997-10-28 Lucent Technologies Inc. Perceptual audio compression based on loudness uncertainty
JP3307138B2 (ja) 1995-02-27 2002-07-24 ソニー株式会社 信号符号化方法及び装置、並びに信号復号化方法及び装置
DE69619284T3 (de) * 1995-03-13 2006-04-27 Matsushita Electric Industrial Co., Ltd., Kadoma Vorrichtung zur Erweiterung der Sprachbandbreite
US6263307B1 (en) 1995-04-19 2001-07-17 Texas Instruments Incorporated Adaptive weiner filtering using line spectral frequencies
US5625697A (en) 1995-05-08 1997-04-29 Lucent Technologies Inc. Microphone selection process for use in a multiple microphone voice actuated switching system
US5774837A (en) 1995-09-13 1998-06-30 Voxware, Inc. Speech coding system and method using voicing probability determination
FI99062C (fi) 1995-10-05 1997-09-25 Nokia Mobile Phones Ltd Puhesignaalin taajuuskorjaus matkapuhelimessa
US5819215A (en) 1995-10-13 1998-10-06 Dobson; Kurt Method and apparatus for wavelet based data compression having adaptive bit rate control for compression of digital audio or other sensory data
US5956674A (en) 1995-12-01 1999-09-21 Digital Theater Systems, Inc. Multi-channel predictive subband audio coder using psychoacoustic adaptive bit allocation in frequency, time and over the multiple channels
US5734713A (en) 1996-01-30 1998-03-31 Jabra Corporation Method and system for remote telephone calibration
US6035177A (en) 1996-02-26 2000-03-07 Donald W. Moses Simultaneous transmission of ancillary and audio signals by means of perceptual coding
JP3325770B2 (ja) 1996-04-26 2002-09-17 三菱電機株式会社 ノイズ低減回路及びノイズ低減装置及びノイズ低減方法
US5715319A (en) 1996-05-30 1998-02-03 Picturetel Corporation Method and apparatus for steerable and endfire superdirective microphone arrays with reduced analog-to-digital converter and computational requirements
US5806025A (en) 1996-08-07 1998-09-08 U S West, Inc. Method and system for adaptive filtering of speech signals using signal-to-noise ratio to choose subband filter bank
US5757933A (en) 1996-12-11 1998-05-26 Micro Ear Technology, Inc. In-the-ear hearing aid with directional microphone system
JP2930101B2 (ja) 1997-01-29 1999-08-03 日本電気株式会社 雑音消去装置
US6104993A (en) 1997-02-26 2000-08-15 Motorola, Inc. Apparatus and method for rate determination in a communication system
FI114247B (fi) 1997-04-11 2004-09-15 Nokia Corp Menetelmä ja laite puheen tunnistamiseksi
US6281749B1 (en) 1997-06-17 2001-08-28 Srs Labs, Inc. Sound enhancement system
US6084916A (en) 1997-07-14 2000-07-04 Vlsi Technology, Inc. Receiver sample rate frequency adjustment for sample rate conversion between asynchronous digital systems
US5991385A (en) 1997-07-16 1999-11-23 International Business Machines Corporation Enhanced audio teleconferencing with sound field effect
US6144937A (en) 1997-07-23 2000-11-07 Texas Instruments Incorporated Noise suppression of speech by signal processing including applying a transform to time domain input sequences of digital signals representing audio information
KR19990015748A (ko) 1997-08-09 1999-03-05 구자홍 전자우편
FR2768547B1 (fr) 1997-09-18 1999-11-19 Matra Communication Procede de debruitage d'un signal de parole numerique
US6202047B1 (en) 1998-03-30 2001-03-13 At&T Corp. Method and apparatus for speech recognition using second order statistics and linear estimation of cepstral coefficients
WO1999053673A1 (en) 1998-04-08 1999-10-21 British Telecommunications Public Limited Company Teleconferencing system
US6684199B1 (en) 1998-05-20 2004-01-27 Recording Industry Association Of America Method for minimizing pirating and/or unauthorized copying and/or unauthorized access of/to data on/from data media including compact discs and digital versatile discs, and system and data media for same
US6421388B1 (en) 1998-05-27 2002-07-16 3Com Corporation Method and apparatus for determining PCM code translations
US6717991B1 (en) 1998-05-27 2004-04-06 Telefonaktiebolaget Lm Ericsson (Publ) System and method for dual microphone signal noise reduction using spectral subtraction
US6041130A (en) 1998-06-23 2000-03-21 Mci Communications Corporation Headset with multiple connections
US20040066940A1 (en) 2002-10-03 2004-04-08 Silentium Ltd. Method and system for inhibiting noise produced by one or more sources of undesired sound from pickup by a speech recognition unit
US6240386B1 (en) 1998-08-24 2001-05-29 Conexant Systems, Inc. Speech codec employing noise classification for noise compensation
US6381469B1 (en) 1998-10-02 2002-04-30 Nokia Corporation Frequency equalizer, and associated method, for a radio telephone
US6768979B1 (en) 1998-10-22 2004-07-27 Sony Corporation Apparatus and method for noise attenuation in a speech recognition system
US6188769B1 (en) 1998-11-13 2001-02-13 Creative Technology Ltd. Environmental reverberation processor
US6504926B1 (en) 1998-12-15 2003-01-07 Mediaring.Com Ltd. User control system for internet phone quality
US6873837B1 (en) 1999-02-03 2005-03-29 Matsushita Electric Industrial Co., Ltd. Emergency reporting system and terminal apparatus therein
US6496795B1 (en) 1999-05-05 2002-12-17 Microsoft Corporation Modulated complex lapped transform for integrated signal enhancement and coding
US7423983B1 (en) 1999-09-20 2008-09-09 Broadcom Corporation Voice and data exchange over a packet based network
US6219408B1 (en) 1999-05-28 2001-04-17 Paul Kurth Apparatus and method for simultaneously transmitting biomedical data and human voice over conventional telephone lines
US6490556B2 (en) 1999-05-28 2002-12-03 Intel Corporation Audio classifier for half duplex communication
US7035666B2 (en) 1999-06-09 2006-04-25 Shimon Silberfening Combination cellular telephone, sound storage device, and email communication device
US6381284B1 (en) 1999-06-14 2002-04-30 T. Bogomolny Method of and devices for telecommunications
US6226616B1 (en) 1999-06-21 2001-05-01 Digital Theater Systems, Inc. Sound quality of established low bit-rate audio coding systems without loss of decoder compatibility
EP1081685A3 (en) 1999-09-01 2002-04-24 TRW Inc. System and method for noise reduction using a single microphone
US6480610B1 (en) 1999-09-21 2002-11-12 Sonic Innovations, Inc. Subband acoustic feedback cancellation in hearing aids
US7054809B1 (en) 1999-09-22 2006-05-30 Mindspeed Technologies, Inc. Rate selection method for selectable mode vocoder
US6636829B1 (en) 1999-09-22 2003-10-21 Mindspeed Technologies, Inc. Speech communication system and method for handling lost frames
FI116643B (fi) 1999-11-15 2006-01-13 Nokia Corp Kohinan vaimennus
US7058572B1 (en) 2000-01-28 2006-06-06 Nortel Networks Limited Reducing acoustic noise in wireless and landline based telephony
US6584438B1 (en) 2000-04-24 2003-06-24 Qualcomm Incorporated Frame erasure compensation method in a variable rate speech coder
JP2001318694A (ja) 2000-05-10 2001-11-16 Toshiba Corp 信号処理装置、信号処理方法および記録媒体
US6377637B1 (en) 2000-07-12 2002-04-23 Andrea Electronics Corporation Sub-band exponential smoothing noise canceling system
US8019091B2 (en) 2000-07-19 2011-09-13 Aliphcom, Inc. Voice activity detector (VAD) -based multiple-microphone acoustic noise suppression
US20030179888A1 (en) 2002-03-05 2003-09-25 Burnett Gregory C. Voice activity detection (VAD) devices and methods for use with noise suppression systems
US20020041678A1 (en) 2000-08-18 2002-04-11 Filiz Basburg-Ertem Method and apparatus for integrated echo cancellation and noise reduction for fixed subscriber terminals
US6862567B1 (en) 2000-08-30 2005-03-01 Mindspeed Technologies, Inc. Noise suppression in the frequency domain by adjusting gain according to voicing parameters
JP2002149200A (ja) 2000-08-31 2002-05-24 Matsushita Electric Ind Co Ltd 音声処理装置及び音声処理方法
DE10045197C1 (de) 2000-09-13 2002-03-07 Siemens Audiologische Technik Verfahren zum Betrieb eines Hörhilfegerätes oder Hörgerätessystems sowie Hörhilfegerät oder Hörgerätesystem
US6520673B2 (en) 2000-12-08 2003-02-18 Msp Corporation Mixing devices for sample recovery from a USP induction port or a pre-separator
US6907045B1 (en) 2000-11-17 2005-06-14 Nortel Networks Limited Method and apparatus for data-path conversion comprising PCM bit robbing signalling
EP1340320B1 (en) 2000-11-30 2008-10-15 Intrasonics Limited Apparatus and system for using data signal embedded into an acoustic signal
US7472059B2 (en) 2000-12-08 2008-12-30 Qualcomm Incorporated Method and apparatus for robust speech classification
US20020097884A1 (en) 2001-01-25 2002-07-25 Cairns Douglas A. Variable noise reduction algorithm based on vehicle conditions
US6754623B2 (en) 2001-01-31 2004-06-22 International Business Machines Corporation Methods and apparatus for ambient noise removal in speech recognition
US7617099B2 (en) 2001-02-12 2009-11-10 FortMedia Inc. Noise suppression by two-channel tandem spectrum modification for speech signal in an automobile
EP1239455A3 (de) 2001-03-09 2004-01-21 Alcatel Verfahren und Anordnung zur Durchführung einer an die Übertragungsfunktion menschilcher Sinnesorgane angepassten Fourier Transformation sowie darauf basierende Vorrichtungen zur Geräuschreduktion und Spracherkennung
US7349841B2 (en) 2001-03-28 2008-03-25 Mitsubishi Denki Kabushiki Kaisha Noise suppression device including subband-based signal-to-noise ratio
SE0101175D0 (sv) 2001-04-02 2001-04-02 Coding Technologies Sweden Ab Aliasing reduction using complex-exponential-modulated filterbanks
ATE410901T1 (de) 2001-04-18 2008-10-15 Widex As Richtungssteuerung und verfahren zur steuerung eines hörgeräts
US20020160751A1 (en) 2001-04-26 2002-10-31 Yingju Sun Mobile devices with integrated voice recording mechanism
US8934382B2 (en) 2001-05-10 2015-01-13 Polycom, Inc. Conference endpoint controlling functions of a remote device
US8452023B2 (en) 2007-05-25 2013-05-28 Aliphcom Wind suppression/replacement component for use with electronic systems
US6493668B1 (en) 2001-06-15 2002-12-10 Yigal Brandman Speech feature extraction system
AUPR647501A0 (en) 2001-07-19 2001-08-09 Vast Audio Pty Ltd Recording a three dimensional auditory scene and reproducing it for the individual listener
GB0121206D0 (en) 2001-08-31 2001-10-24 Mitel Knowledge Corp System and method of indicating and controlling sound pickup direction and location in a teleconferencing system
GB0121308D0 (en) 2001-09-03 2001-10-24 Thomas Swan & Company Ltd Optical processing
US7574474B2 (en) 2001-09-14 2009-08-11 Xerox Corporation System and method for sharing and controlling multiple audio and video streams
US6895375B2 (en) * 2001-10-04 2005-05-17 At&T Corp. System for bandwidth extension of Narrow-band speech
US6707921B2 (en) 2001-11-26 2004-03-16 Hewlett-Packard Development Company, Lp. Use of mouth position and mouth movement to filter noise from speech in a hearing aid
EP1449305B1 (en) 2001-11-30 2006-04-05 Telefonaktiebolaget LM Ericsson (publ) Method for replacing corrupted audio data
US7096037B2 (en) 2002-01-29 2006-08-22 Palm, Inc. Videoconferencing bandwidth management for a handheld computer system and method
US8098844B2 (en) 2002-02-05 2012-01-17 Mh Acoustics, Llc Dual-microphone spatial noise suppression
US7171008B2 (en) 2002-02-05 2007-01-30 Mh Acoustics, Llc Reducing noise in audio systems
US20050228518A1 (en) 2002-02-13 2005-10-13 Applied Neurosystems Corporation Filter set for frequency analysis
US7158572B2 (en) 2002-02-14 2007-01-02 Tellabs Operations, Inc. Audio enhancement communication techniques
JP4195267B2 (ja) 2002-03-14 2008-12-10 International Business Machines Corporation Speech recognition apparatus, speech recognition method thereof, and program
US6978010B1 (en) 2002-03-21 2005-12-20 Bellsouth Intellectual Property Corp. Ambient noise cancellation for voice communication device
WO2003084103A1 (en) 2002-03-22 2003-10-09 Georgia Tech Research Corporation Analog audio enhancement system using a noise suppression algorithm
US7447631B2 (en) 2002-06-17 2008-11-04 Dolby Laboratories Licensing Corporation Audio coding system using spectral hole filling
US20030228019A1 (en) 2002-06-11 2003-12-11 Elbit Systems Ltd. Method and system for reducing noise
JP2004023481A (ja) 2002-06-17 2004-01-22 Alpine Electronics Inc Acoustic signal processing apparatus and method, and audio apparatus
US7516066B2 (en) 2002-07-16 2009-04-07 Koninklijke Philips Electronics N.V. Audio coding
JP3579047B2 (ja) 2002-07-19 2004-10-20 NEC Corporation Audio decoding apparatus, decoding method, and program
JP4227772B2 (ja) 2002-07-19 2009-02-18 NEC Corporation Audio decoding apparatus, decoding method, and program
US7783061B2 (en) 2003-08-27 2010-08-24 Sony Computer Entertainment Inc. Methods and apparatus for the targeted sound detection
US8019121B2 (en) 2002-07-27 2011-09-13 Sony Computer Entertainment Inc. Method and system for processing intensity from input devices for interfacing with a computer program
US7760248B2 (en) 2002-07-27 2010-07-20 Sony Computer Entertainment Inc. Selective sound source listening in conjunction with computer interactive processing
US7283956B2 (en) 2002-09-18 2007-10-16 Motorola, Inc. Noise suppression
US7657427B2 (en) 2002-10-11 2010-02-02 Nokia Corporation Methods and devices for source controlled variable bit-rate wideband speech coding
US7630409B2 (en) 2002-10-21 2009-12-08 Lsi Corporation Method and apparatus for improved play-out packet control algorithm
US20040083110A1 (en) 2002-10-23 2004-04-29 Nokia Corporation Packet loss recovery based on music signal classification and mixing
US7970606B2 (en) 2002-11-13 2011-06-28 Digital Voice Systems, Inc. Interoperable vocoder
WO2004064041A1 (en) 2003-01-09 2004-07-29 Dilithium Networks Pty Limited Method and apparatus for improved quality voice transcoding
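JP4247002B2 (ja) 2003-01-22 2009-04-02 Fujitsu Limited Speaker distance detection apparatus and method using a microphone array, and voice input/output apparatus using the apparatus
KR100503479B1 (ko) 2003-01-24 2005-07-28 Samsung Electronics Co., Ltd. Cradle for a portable terminal and method of fixing a portable terminal using the same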
EP1443498B1 (en) 2003-01-24 2008-03-19 Sony Ericsson Mobile Communications AB Noise reduction and audio-visual speech activity detection
DE10305820B4 (de) 2003-02-12 2006-06-01 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for determining a reproduction position
US7885420B2 (en) 2003-02-21 2011-02-08 Qnx Software Systems Co. Wind noise suppression system
US7725315B2 (en) 2003-02-21 2010-05-25 Qnx Software Systems (Wavemakers), Inc. Minimization of transient noises in a voice signal
GB2398913B (en) 2003-02-27 2005-08-17 Motorola Inc Noise estimation in speech recognition
FR2851879A1 (fr) 2003-02-27 2004-09-03 France Telecom Method for processing compressed sound data for spatialization
US7090431B2 (en) 2003-03-19 2006-08-15 Cosgrove Patrick J Marine vessel lifting system with variable level detection
US8412526B2 (en) 2003-04-01 2013-04-02 Nuance Communications, Inc. Restoration of high-order Mel frequency cepstral coefficients
NO318096B1 (no) 2003-05-08 2005-01-31 Tandberg Telecom As Arrangement and method for sound source localization
US7353169B1 (en) 2003-06-24 2008-04-01 Creative Technology Ltd. Transient detection and modification in audio signals
US7376553B2 (en) 2003-07-08 2008-05-20 Robert Patel Quinn Fractal harmonic overtone mapping of speech and musical sounds
EP1513137A1 (en) 2003-08-22 2005-03-09 MicronasNIT LCC, Novi Sad Institute of Information Technologies Speech processing system and method with multi-pulse excitation
WO2005027094A1 (fr) 2003-09-17 2005-03-24 Beijing E-World Technology Co.,Ltd. Method and device for multi-resolution vector quantization for audio encoding and decoding
US7190775B2 (en) 2003-10-29 2007-03-13 Broadcom Corporation High quality audio conferencing with adaptive beamforming
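DE602004021716D1 (de) 2003-11-12 2009-08-06 Honda Motor Co Ltd Speech recognition system
JP4396233B2 (ja) 2003-11-13 2010-01-13 Panasonic Corporation Signal analysis method and signal synthesis method for a complex-exponential-modulated filter bank, program therefor, and recording medium therefor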
GB2408655B (en) 2003-11-27 2007-02-28 Motorola Inc Communication system, communication units and method of ambience listening thereto
CA2454296A1 (en) 2003-12-29 2005-06-29 Nokia Corporation Method and device for speech enhancement in the presence of background noise
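JP2005249816A (ja) 2004-03-01 2005-09-15 Internatl Business Mach Corp <Ibm> Signal enhancement apparatus, method, and program, and speech recognition apparatus, method, and program
WO2005086138A1 (ja) 2004-03-05 2005-09-15 Matsushita Electric Industrial Co., Ltd. Error concealment apparatus and error concealment method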
GB0408856D0 (en) 2004-04-21 2004-05-26 Nokia Corp Signal encoding
JP4437052B2 (ja) 2004-04-21 2010-03-24 Panasonic Corporation Speech decoding apparatus and speech decoding method
US20050249292A1 (en) 2004-05-07 2005-11-10 Ping Zhu System and method for enhancing the performance of variable length coding
US7103176B2 (en) 2004-05-13 2006-09-05 International Business Machines Corporation Direct coupling of telephone volume control with remote microphone gain and noise cancellation
GB2414369B (en) 2004-05-21 2007-08-01 Hewlett Packard Development Co Processing audio data
US7695438B2 (en) 2004-05-26 2010-04-13 Siemens Medical Solutions Usa, Inc. Acoustic disruption minimizing systems and methods
EP1600947A3 (en) 2004-05-26 2005-12-21 Honda Research Institute Europe GmbH Subtractive cancellation of harmonic noise
US7254665B2 (en) 2004-06-16 2007-08-07 Microsoft Corporation Method and system for reducing latency in transferring captured image data by utilizing burst transfer after threshold is reached
US20060063560A1 (en) 2004-09-21 2006-03-23 Samsung Electronics Co., Ltd. Dual-mode phone using GPS power-saving assist for operating in cellular and WiFi networks
US7383179B2 (en) 2004-09-28 2008-06-03 Clarity Technologies, Inc. Method of cascading noise reduction algorithms to avoid speech distortion
US20060092918A1 (en) 2004-11-04 2006-05-04 Alexander Talalai Audio receiver having adaptive buffer delay
EP1815462A1 (en) 2004-11-09 2007-08-08 Koninklijke Philips Electronics N.V. Audio coding and decoding
JP4283212B2 (ja) 2004-12-10 2009-06-24 International Business Machines Corporation Noise removal apparatus, noise removal program, and noise removal method
US20060206320A1 (en) 2005-03-14 2006-09-14 Li Qi P Apparatus and method for noise reduction and speech enhancement with microphones and loudspeakers
RU2376657C2 (ru) 2005-04-01 2009-12-20 Qualcomm Incorporated Systems, methods, and apparatus for highband time warping
US7664495B1 (en) 2005-04-21 2010-02-16 At&T Mobility Ii Llc Voice call redirection for enterprise hosted dual mode service
CA2574468C (en) 2005-04-28 2014-01-14 Siemens Aktiengesellschaft Noise suppression process and device
EP1878013B1 (en) 2005-05-05 2010-12-15 Sony Computer Entertainment Inc. Video game control with joystick
EP1914727B1 (en) 2005-05-17 2009-08-12 Yamaha Corporation Noise suppression methods and apparatuses
US7531973B2 (en) 2005-05-31 2009-05-12 Rockwell Automation Technologies, Inc. Wizard for configuring a motor drive system
US7647077B2 (en) 2005-05-31 2010-01-12 Bitwave Pte Ltd Method for echo control of a wireless headset
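JP2006339991A (ja) 2005-06-01 2006-12-14 Matsushita Electric Ind Co Ltd Multichannel sound pickup apparatus, multichannel sound reproduction apparatus, and multichannel sound pickup and reproduction apparatus
JP4910312B2 (ja) 2005-06-03 2012-04-04 Sony Corporation Imaging apparatus and imaging method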
US8311840B2 (en) * 2005-06-28 2012-11-13 Qnx Software Systems Limited Frequency extension of harmonic signals
US8566086B2 (en) 2005-06-28 2013-10-22 Qnx Software Systems Limited System for adaptive enhancement of speech signals
US20070003097A1 (en) 2005-06-30 2007-01-04 Altec Lansing Technologies, Inc. Angularly adjustable speaker system
US20070005351A1 (en) 2005-06-30 2007-01-04 Sathyendra Harsha M Method and system for bandwidth expansion for voice communications
ATE497327T1 (de) 2005-07-06 2011-02-15 Koninkl Philips Electronics Nv Apparatus and method for acoustic beamforming
US7617436B2 (en) 2005-08-02 2009-11-10 Nokia Corporation Method, device, and system for forward channel error recovery in video sequence transmission over packet-based network
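KR101116363B1 (ko) 2005-08-11 2012-03-09 Samsung Electronics Co., Ltd. Method and apparatus for classifying speech signals, and method and apparatus for encoding speech signals using the same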
US20070041589A1 (en) 2005-08-17 2007-02-22 Gennum Corporation System and method for providing environmental specific noise reduction algorithms
US8326614B2 (en) 2005-09-02 2012-12-04 Qnx Software Systems Limited Speech enhancement system
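JP4356670B2 (ja) 2005-09-12 2009-11-04 Sony Corporation Noise reduction apparatus, noise reduction method, noise reduction program, and sound pickup apparatus for electronic equipment therefor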
US7917561B2 (en) 2005-09-16 2011-03-29 Coding Technologies Ab Partially complex modulated filter bank
US20100130198A1 (en) 2005-09-29 2010-05-27 Plantronics, Inc. Remote processing of multiple acoustic signals
EA011601B1 (ru) 2005-09-30 2009-04-28 Squarehead Technology AS Method and system for directional capture of an audio signal
US7813923B2 (en) 2005-10-14 2010-10-12 Microsoft Corporation Calibration based beamforming, non-linear adaptive filtering, and multi-sensor headset
US7970123B2 (en) 2005-10-20 2011-06-28 Mitel Networks Corporation Adaptive coupling equalization in beamforming-based communication systems
US7562140B2 (en) 2005-11-15 2009-07-14 Cisco Technology, Inc. Method and apparatus for providing trend information from network devices
US20070127668A1 (en) 2005-12-02 2007-06-07 Ahya Deepak P Method and system for performing a conference call
US7366658B2 (en) 2005-12-09 2008-04-29 Texas Instruments Incorporated Noise pre-processor for enhanced variable rate speech codec
ATE449403T1 (de) 2005-12-12 2009-12-15 Gregory John Gadbois Multi-voice speech recognition
US7565288B2 (en) 2005-12-22 2009-07-21 Microsoft Corporation Spatial noise suppression for a microphone array
JP4876574B2 (ja) 2005-12-26 2012-02-15 Sony Corporation Signal encoding apparatus and method, signal decoding apparatus and method, and program and recording medium
US8345890B2 (en) 2006-01-05 2013-01-01 Audience, Inc. System and method for utilizing inter-microphone level differences for speech enhancement
US8346544B2 (en) 2006-01-20 2013-01-01 Qualcomm Incorporated Selection of encoding modes and/or encoding rates for speech compression with closed loop re-decision
US8032369B2 (en) 2006-01-20 2011-10-04 Qualcomm Incorporated Arbitrary average data rates for variable rate coders
JP4940671B2 (ja) 2006-01-26 2012-05-30 Sony Corporation Audio signal processing apparatus, audio signal processing method, and audio signal processing program
US8744844B2 (en) 2007-07-06 2014-06-03 Audience, Inc. System and method for adaptive intelligent noise suppression
US8194880B2 (en) 2006-01-30 2012-06-05 Audience, Inc. System and method for utilizing omni-directional microphones for speech enhancement
US9185487B2 (en) 2006-01-30 2015-11-10 Audience, Inc. System and method for providing noise suppression utilizing null processing noise subtraction
US7685132B2 (en) 2006-03-15 2010-03-23 Mog, Inc Automatic meta-data sharing of existing media through social networking
US7676374B2 (en) 2006-03-28 2010-03-09 Nokia Corporation Low complexity subband-domain filtering in the case of cascaded filter banks
US7555075B2 (en) 2006-04-07 2009-06-30 Freescale Semiconductor, Inc. Adjustable noise suppression system
US8180067B2 (en) 2006-04-28 2012-05-15 Harman International Industries, Incorporated System for selectively extracting components of an audio input signal
US8068619B2 (en) 2006-05-09 2011-11-29 Fortemedia, Inc. Method and apparatus for noise suppression in a small array microphone system
US7548791B1 (en) 2006-05-18 2009-06-16 Adobe Systems Incorporated Graphically displaying audio pan or phase information
US8044291B2 (en) 2006-05-18 2011-10-25 Adobe Systems Incorporated Selection of visually displayed audio data for editing
US8204253B1 (en) 2008-06-30 2012-06-19 Audience, Inc. Self calibration of audio device
US8934641B2 (en) 2006-05-25 2015-01-13 Audience, Inc. Systems and methods for reconstructing decomposed audio signals
US8150065B2 (en) 2006-05-25 2012-04-03 Audience, Inc. System and method for processing an audio signal
US8229137B2 (en) 2006-08-31 2012-07-24 Sony Ericsson Mobile Communications Ab Volume control circuits for use in electronic devices and related methods and electronic devices
US8036767B2 (en) 2006-09-20 2011-10-11 Harman International Industries, Incorporated System for extracting and changing the reverberant content of an audio input signal
EP1918910B1 (en) 2006-10-31 2009-03-11 Harman Becker Automotive Systems GmbH Model-based enhancement of speech signals
US7492312B2 (en) 2006-11-14 2009-02-17 Fam Adly T Multiplicative mismatched filters for optimum range sidelobe suppression in barker code reception
US8019089B2 (en) 2006-11-20 2011-09-13 Microsoft Corporation Removal of noise, corresponding to user input devices from an audio signal
US7626942B2 (en) 2006-11-22 2009-12-01 Spectra Link Corp. Method of conducting an audio communications session using incorrect timestamps
US7983685B2 (en) 2006-12-07 2011-07-19 Innovative Wireless Technologies, Inc. Method and apparatus for management of a global wireless sensor network
US20080159507A1 (en) 2006-12-27 2008-07-03 Nokia Corporation Distributed teleconference multichannel architecture, system, method, and computer program product
US7973857B2 (en) 2006-12-27 2011-07-05 Nokia Corporation Teleconference group formation using context information
WO2008082441A1 (en) 2006-12-29 2008-07-10 Prodea Systems, Inc. Display inserts, overlays, and graphical user interfaces for multimedia systems
GB2445984B (en) 2007-01-25 2011-12-07 Sonaptic Ltd Ambient noise reduction
US20080187143A1 (en) 2007-02-01 2008-08-07 Research In Motion Limited System and method for providing simulated spatial sound in group voice communication sessions on a wireless communication device
US8060363B2 (en) 2007-02-13 2011-11-15 Nokia Corporation Audio signal encoding
JP4449987B2 (ja) 2007-02-15 2010-04-14 Sony Corporation Audio processing apparatus, audio processing method, and program
BRPI0807703B1 (pt) 2007-02-26 2020-09-24 Dolby Laboratories Licensing Corporation Method for enhancing speech in entertainment audio and non-transitory computer-readable storage medium
US20080208575A1 (en) 2007-02-27 2008-08-28 Nokia Corporation Split-band encoding and decoding of an audio signal
US7848738B2 (en) 2007-03-19 2010-12-07 Avaya Inc. Teleconferencing system with multiple channels at each location
US20080259731A1 (en) 2007-04-17 2008-10-23 Happonen Aki P Methods and apparatuses for user controlled beamforming
EP2162881B1 (en) 2007-05-22 2013-01-23 Telefonaktiebolaget LM Ericsson (publ) Voice activity detection with improved music detection
TWI421858B (zh) 2007-05-24 2014-01-01 Audience Inc System and method for processing an audio signal
US8488803B2 (en) 2007-05-25 2013-07-16 Aliphcom Wind suppression/replacement component for use with electronic systems
US8253770B2 (en) 2007-05-31 2012-08-28 Eastman Kodak Company Residential video communication system
US20080304677A1 (en) 2007-06-08 2008-12-11 Sonitus Medical Inc. System and method for noise cancellation with motion tracking capability
JP4455614B2 (ja) 2007-06-13 2010-04-21 Toshiba Corporation Acoustic signal processing method and apparatus
US8428275B2 (en) 2007-06-22 2013-04-23 Sanyo Electric Co., Ltd. Wind noise reduction device
US7873513B2 (en) 2007-07-06 2011-01-18 Mindspeed Technologies, Inc. Speech transcoding in GSM networks
JP5009082B2 (ja) 2007-08-02 2012-08-22 Sharp Corporation Display device
WO2009020001A1 (ja) 2007-08-07 2009-02-12 Nec Corporation Audio mixing apparatus, noise suppression method therefor, and program
US20090043577A1 (en) 2007-08-10 2009-02-12 Ditech Networks, Inc. Signal presence detection using bi-directional communication data
JP4469882B2 (ja) 2007-08-16 2010-06-02 Toshiba Corporation Acoustic signal processing method and apparatus
DE602007004217D1 (de) 2007-08-31 2010-02-25 Harman Becker Automotive Sys Fast estimation of the noise power spectral density for speech signal enhancement
US7986228B2 (en) 2007-09-05 2011-07-26 Stanley Convergent Security Solutions, Inc. System and method for monitoring security at a premises using line card
KR101409169B1 (ko) 2007-09-05 2014-06-19 Samsung Electronics Co., Ltd. Sound zoom method and apparatus through suppression width adjustment
US8694310B2 (en) 2007-09-17 2014-04-08 Qnx Software Systems Limited Remote control server protocol system
US7522074B2 (en) 2007-09-17 2009-04-21 Samplify Systems, Inc. Enhanced control for compression and decompression of sampled signals
US8175871B2 (en) 2007-09-28 2012-05-08 Qualcomm Incorporated Apparatus and method of noise and echo reduction in multiple microphone audio systems
DE602007008429D1 (de) 2007-10-01 2010-09-23 Harman Becker Automotive Sys Efficient audio signal processing in the sub-band domain, method, apparatus, and associated computer program
US8046219B2 (en) 2007-10-18 2011-10-25 Motorola Mobility, Inc. Robust two microphone noise suppression system
US8326617B2 (en) 2007-10-24 2012-12-04 Qnx Software Systems Limited Speech enhancement with minimum gating
US8606566B2 (en) 2007-10-24 2013-12-10 Qnx Software Systems Limited Speech enhancement through partial speech reconstruction
DE602007004504D1 (de) 2007-10-29 2010-03-11 Harman Becker Automotive Sys Partial speech reconstruction
TW200922272A (en) 2007-11-06 2009-05-16 High Tech Comp Corp Automobile noise suppression system and method thereof
US8358787B2 (en) 2007-11-07 2013-01-22 Apple Inc. Method and apparatus for acoustics testing of a personal mobile device
DE602007014382D1 (de) 2007-11-12 2011-06-16 Harman Becker Automotive Sys Discrimination between foreground speech and background noise
KR101238362B1 (ko) 2007-12-03 2013-02-28 Samsung Electronics Co., Ltd. Method and apparatus for filtering a sound source signal according to sound source distance
JP5159279B2 (ja) 2007-12-03 2013-03-06 Toshiba Corporation Speech processing apparatus and speech synthesis apparatus using the same
US8219387B2 (en) 2007-12-10 2012-07-10 Microsoft Corporation Identifying far-end sound
US8433061B2 (en) 2007-12-10 2013-04-30 Microsoft Corporation Reducing echo
US8175291B2 (en) 2007-12-19 2012-05-08 Qualcomm Incorporated Systems, methods, and apparatus for multi-microphone based speech enhancement
EP2232704A4 (en) 2007-12-20 2010-12-01 Ericsson Telefon Ab L M APPARATUS AND METHOD FOR NOISE SUPPRESSION
KR101456570B1 (ko) 2007-12-21 2014-10-31 LG Electronics Inc. Mobile terminal having a digital equalizer and control method thereof
US8326635B2 (en) 2007-12-25 2012-12-04 Personics Holdings Inc. Method and system for message alert and delivery using an earpiece
DE102008031150B3 (de) 2008-07-01 2009-11-19 Siemens Medical Instruments Pte. Ltd. Method for noise suppression and associated hearing aid
US8554551B2 (en) 2008-01-28 2013-10-08 Qualcomm Incorporated Systems, methods, and apparatus for context replacement by audio level
US8200479B2 (en) 2008-02-08 2012-06-12 Texas Instruments Incorporated Method and system for asymmetric independent audio rendering
US8194882B2 (en) 2008-02-29 2012-06-05 Audience, Inc. System and method for providing single microphone noise suppression fallback
WO2009109373A2 (en) 2008-03-04 2009-09-11 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus for mixing a plurality of input data streams
US20090323655A1 (en) 2008-03-31 2009-12-31 Cozybit, Inc. System and method for inviting and sharing conversations between cellphones
US8457328B2 (en) 2008-04-22 2013-06-04 Nokia Corporation Method, apparatus and computer program product for utilizing spatial information for audio signal enhancement in a distributed network environment
US8611554B2 (en) 2008-04-22 2013-12-17 Bose Corporation Hearing assistance apparatus
US8369973B2 (en) 2008-06-19 2013-02-05 Texas Instruments Incorporated Efficient asynchronous sample rate conversion
US8300801B2 (en) 2008-06-26 2012-10-30 Centurylink Intellectual Property Llc System and method for telephone based noise cancellation
US8189807B2 (en) 2008-06-27 2012-05-29 Microsoft Corporation Satellite microphone array for video conferencing
CN101304391A (zh) 2008-06-30 2008-11-12 Tencent Technology (Shenzhen) Co., Ltd. Voice call method and system based on an instant messaging system
US8774423B1 (en) 2008-06-30 2014-07-08 Audience, Inc. System and method for controlling adaptivity of signal modification using a phantom coefficient
KR20100003530A (ko) 2008-07-01 2010-01-11 Samsung Electronics Co., Ltd. Apparatus and method for removing noise from a speech signal in an electronic device
BRPI0910792B1 (pt) 2008-07-11 2020-03-24 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio signal synthesizer and audio signal encoder
US8538749B2 (en) 2008-07-18 2013-09-17 Qualcomm Incorporated Systems, methods, apparatus, and computer program products for enhanced intelligibility
EP2151822B8 (en) 2008-08-05 2018-10-24 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for processing an audio signal for speech enhancement using a feature extraction
EP2151821B1 (en) 2008-08-07 2011-12-14 Nuance Communications, Inc. Noise-reduction processing of speech signals
US8189429B2 (en) 2008-09-30 2012-05-29 Apple Inc. Microphone proximity detection
JP5608660B2 (ja) 2008-10-10 2014-10-15 Telefonaktiebolaget LM Ericsson (publ) Energy-conserving multi-channel audio coding
US8130978B2 (en) 2008-10-15 2012-03-06 Microsoft Corporation Dynamic switching of microphone inputs for identification of a direction of a source of speech sounds
US9779598B2 (en) 2008-11-21 2017-10-03 Robert Bosch Gmbh Security system including less than lethal deterrent
US8467891B2 (en) 2009-01-21 2013-06-18 Utc Fire & Security Americas Corporation, Inc. Method and system for efficient optimization of audio sampling rate conversion
EP2394270A1 (en) 2009-02-03 2011-12-14 University Of Ottawa Method and system for a multi-microphone noise reduction
EP2222091B1 (en) 2009-02-23 2013-04-24 Nuance Communications, Inc. Method for determining a set of filter coefficients for an acoustic echo compensation means
US8184180B2 (en) 2009-03-25 2012-05-22 Broadcom Corporation Spatially synchronized audio and video capture
EP2237271B1 (en) 2009-03-31 2021-01-20 Cerence Operating Company Method for determining a signal component for reducing noise in an input signal
JP5535198B2 (ja) 2009-04-02 2014-07-02 Mitsubishi Electric Corporation Noise suppression apparatus
US9202456B2 (en) 2009-04-23 2015-12-01 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for automatic control of active noise cancellation
US8416715B2 (en) 2009-06-15 2013-04-09 Microsoft Corporation Interest determination for auditory enhancement
US8908882B2 (en) 2009-06-29 2014-12-09 Audience, Inc. Reparation of corrupted audio signals
US8626344B2 (en) 2009-08-21 2014-01-07 Allure Energy, Inc. Energy management system and method
EP2285112A1 (en) 2009-08-07 2011-02-16 Canon Kabushiki Kaisha Method for sending compressed data representing a digital image and corresponding device
US8644517B2 (en) 2009-08-17 2014-02-04 Broadcom Corporation System and method for automatic disabling and enabling of an acoustic beamformer
US8233352B2 (en) 2009-08-17 2012-07-31 Broadcom Corporation Audio source localization system and method
JP5397131B2 (ja) 2009-09-29 2014-01-22 Oki Electric Industry Co., Ltd. Sound source direction estimation apparatus and program
US8571231B2 (en) 2009-10-01 2013-10-29 Qualcomm Incorporated Suppressing noise in an audio signal
US9372251B2 (en) 2009-10-05 2016-06-21 Harman International Industries, Incorporated System for spatial extraction of audio signals
CN102044243B (zh) 2009-10-15 2012-08-29 Huawei Technologies Co., Ltd. Voice activity detection method and apparatus, and encoder
WO2011049516A1 (en) 2009-10-19 2011-04-28 Telefonaktiebolaget Lm Ericsson (Publ) Detector and method for voice activity detection
US20110107367A1 (en) 2009-10-30 2011-05-05 Sony Corporation System and method for broadcasting personal content to client devices in an electronic network
CN102630385B (zh) 2009-11-30 2015-05-27 Nokia Corporation Method, apparatus, and system for audio zooming processing within an audio scene
US8615392B1 (en) 2009-12-02 2013-12-24 Audience, Inc. Systems and methods for producing an acoustic field having a target spatial pattern
US9210503B2 (en) 2009-12-02 2015-12-08 Audience, Inc. Audio zoom
US9838784B2 (en) 2009-12-02 2017-12-05 Knowles Electronics, Llc Directional audio capture
DE112010005020B4 (de) 2009-12-28 2018-12-13 Mitsubishi Electric Corporation Speech signal restoration device and speech signal restoration method
US8718290B2 (en) 2010-01-26 2014-05-06 Audience, Inc. Adaptive noise reduction using level cues
US8626498B2 (en) 2010-02-24 2014-01-07 Qualcomm Incorporated Voice activity detection based on plural voice activity detectors
WO2011129725A1 (en) 2010-04-12 2011-10-20 Telefonaktiebolaget L M Ericsson (Publ) Method and arrangement for noise cancellation in a speech encoder
US8473287B2 (en) 2010-04-19 2013-06-25 Audience, Inc. Method for jointly optimizing noise reduction and voice quality in a mono or multi-microphone system
US8798290B1 (en) 2010-04-21 2014-08-05 Audience, Inc. Systems and methods for adaptive signal equalization
US8880396B1 (en) 2010-04-28 2014-11-04 Audience, Inc. Spectrum reconstruction for automatic speech recognition
US9558755B1 (en) * 2010-05-20 2017-01-31 Knowles Electronics, Llc Noise suppression assisted automatic speech recognition
US8639516B2 (en) 2010-06-04 2014-01-28 Apple Inc. User-specific noise suppression for voice quality improvements
US9094496B2 (en) 2010-06-18 2015-07-28 Avaya Inc. System and method for stereophonic acoustic echo cancellation
KR101285391B1 (ko) 2010-07-28 2013-07-10 Pantech Co., Ltd. Apparatus and method for fusing acoustic object information
US9071831B2 (en) 2010-08-27 2015-06-30 Broadcom Corporation Method and system for noise cancellation and audio enhancement based on captured depth information
US9274744B2 (en) 2010-09-10 2016-03-01 Amazon Technologies, Inc. Relative position-inclusive device interfaces
US8311817B2 (en) 2010-11-04 2012-11-13 Audience, Inc. Systems and methods for enhancing voice quality in mobile device
US8831937B2 (en) 2010-11-12 2014-09-09 Audience, Inc. Post-noise suppression processing to improve voice quality
US8451315B2 (en) 2010-11-30 2013-05-28 Hewlett-Packard Development Company, L.P. System and method for distributed meeting capture
WO2012094422A2 (en) 2011-01-05 2012-07-12 Health Fidelity, Inc. A voice based system and method for data input
US8525868B2 (en) 2011-01-13 2013-09-03 Qualcomm Incorporated Variable beamforming with a mobile platform
US20120202485A1 (en) 2011-02-04 2012-08-09 Takwak GmBh Systems and methods for audio roaming for mobile devices
US8606249B1 (en) 2011-03-07 2013-12-10 Audience, Inc. Methods and systems for enhancing audio quality during teleconferencing
US9007416B1 (en) 2011-03-08 2015-04-14 Audience, Inc. Local social conference calling
JP5060631B1 (ja) 2011-03-31 2012-10-31 Toshiba Corporation Signal processing apparatus and signal processing method
US8811601B2 (en) 2011-04-04 2014-08-19 Qualcomm Incorporated Integrated echo cancellation and noise suppression
US8989411B2 (en) 2011-04-08 2015-03-24 Board Of Regents, The University Of Texas System Differential microphone with sealed backside cavities and diaphragms coupled to a rocking structure thereby providing resistance to deflection under atmospheric pressure and providing a directional response to sound pressure
US8363823B1 (en) 2011-08-08 2013-01-29 Audience, Inc. Two microphone uplink communication and stereo audio playback on three wire headset assembly
US9386147B2 (en) 2011-08-25 2016-07-05 Verizon Patent And Licensing Inc. Muting and un-muting user devices
US8750526B1 (en) 2012-01-04 2014-06-10 Audience, Inc. Dynamic bandwidth change detection for configuring audio processor
US9197974B1 (en) 2012-01-06 2015-11-24 Audience, Inc. Directional audio capture adaptation based on alternative sensory input
US8615394B1 (en) 2012-01-27 2013-12-24 Audience, Inc. Restoration of noise-reduced speech
US9093076B2 (en) 2012-04-30 2015-07-28 2236008 Ontario Inc. Multipass ASR controlling multiple applications
US9431012B2 (en) 2012-04-30 2016-08-30 2236008 Ontario Inc. Post processing of natural language automatic speech recognition
US9479275B2 (en) 2012-06-01 2016-10-25 Blackberry Limited Multiformat digital audio interface
US20130332156A1 (en) 2012-06-11 2013-12-12 Apple Inc. Sensor Fusion to Improve Speech/Audio Processing in a Mobile Device
WO2013188562A2 (en) 2012-06-12 2013-12-19 Audience, Inc. Bandwidth extension via constrained synthesis
US20130343549A1 (en) 2012-06-22 2013-12-26 Verisilicon Holdings Co., Ltd. Microphone arrays for generating stereo and surround channels, method of operation thereof and module incorporating the same
EP2680615B1 (en) 2012-06-25 2018-08-08 LG Electronics Inc. Mobile terminal and audio zooming method thereof
US9119012B2 (en) 2012-06-28 2015-08-25 Broadcom Corporation Loudspeaker beamforming for personal audio focal points
WO2014012582A1 (en) 2012-07-18 2014-01-23 Huawei Technologies Co., Ltd. Portable electronic device with directional microphones for stereo recording
EP2823631B1 (en) 2012-07-18 2017-09-06 Huawei Technologies Co., Ltd. Portable electronic device with directional microphones for stereo recording
US9984675B2 (en) 2013-05-24 2018-05-29 Google Technology Holdings LLC Voice controlled audio recording system with adjustable beamforming
KR101475894B1 (ko) * 2013-06-21 2014-12-23 Seoul National University R&DB Foundation Method and apparatus for improving disordered speech
US9536540B2 (en) 2013-07-19 2017-01-03 Knowles Electronics, Llc Speech signal separation and synthesis based on auditory scene analysis and speech modeling
CN106105259A (zh) 2014-01-21 2016-11-09 Knowles Electronics, LLC Microphone apparatus and method for providing extremely high acoustic overload point
US9500739B2 (en) 2014-03-28 2016-11-22 Knowles Electronics, Llc Estimating and tracking multiple attributes of multiple objects from multi-sensor data
US20160037245A1 (en) 2014-07-29 2016-02-04 Knowles Electronics, Llc Discrete MEMS Including Sensor Device
WO2016040885A1 (en) 2014-09-12 2016-03-17 Audience, Inc. Systems and methods for restoration of speech components
US20160093307A1 (en) 2014-09-25 2016-03-31 Audience, Inc. Latency Reduction
US9368110B1 (en) * 2015-07-07 2016-06-14 Mitsubishi Electric Research Laboratories, Inc. Method for distinguishing components of an acoustic signal

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
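CN1653520A (zh) * 2002-05-20 2005-08-10 Microsoft Corporation Method of determining uncertainty associated with noise reduction
CN1910656A (zh) * 2004-01-20 2007-02-07 Dolby Laboratories Licensing Corporation Audio coding based on block grouping
CN101512938A (zh) * 2006-08-01 2009-08-19 DTS (BVI) Limited Neural network filtering techniques for compensating linear and non-linear distortion of an audio transducer
CN102280108A (zh) * 2010-06-10 2011-12-14 Canon Kabushiki Kaisha Audio signal processing apparatus and control method thereof
CN101976567A (zh) * 2010-10-28 2011-02-16 Jilin University Speech signal error concealment method
CN103430233A (zh) * 2010-12-14 2013-12-04 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Encoder and method for predictive encoding, decoder and method for decoding, system and method for predictive encoding and decoding, and predictively encoded information signal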

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
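CN112088385A (zh) * 2018-04-23 2020-12-15 Cerence Operating Company System and method for discriminative training of regression deep neural networks
CN109147804A (zh) * 2018-06-05 2019-01-04 Anker Innovations Technology Co., Ltd. Deep learning based method and system for processing sound quality characteristics
CN109147805A (zh) * 2018-06-05 2019-01-04 Anker Innovations Technology Co., Ltd. Deep learning based audio sound quality enhancement
WO2019233364A1 (zh) * 2018-06-05 2019-12-12 Anker Innovations Technology Co., Ltd. Deep learning based audio sound quality enhancement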
US11790934B2 (en) 2018-06-05 2023-10-17 Anker Innovations Technology Co., Ltd. Deep learning based method and system for processing sound quality characteristics
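CN112820315A (zh) * 2020-07-13 2021-05-18 Tencent Technology (Shenzhen) Co., Ltd. Audio signal processing method and apparatus, computer device, and storage medium
WO2022012195A1 (zh) * 2020-07-13 2022-01-20 Tencent Technology (Shenzhen) Co., Ltd. Audio signal processing method and related apparatus
CN112820315B (zh) * 2020-07-13 2023-01-06 Tencent Technology (Shenzhen) Co., Ltd. Audio signal processing method and apparatus, computer device, and storage medium
CN112289343A (zh) * 2020-10-28 2021-01-29 Tencent Music Entertainment Technology (Shenzhen) Co., Ltd. Audio restoration method and apparatus, electronic device, and computer-readable storage medium
CN112289343B (zh) * 2020-10-28 2024-03-19 Tencent Music Entertainment Technology (Shenzhen) Co., Ltd. Audio restoration method and apparatus, electronic device, and computer-readable storage medium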

Also Published As

Publication number Publication date
US20160078880A1 (en) 2016-03-17
US9978388B2 (en) 2018-05-22
DE112015004185T5 (de) 2017-06-01
WO2016040885A1 (en) 2016-03-17

Similar Documents

Publication Publication Date Title
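CN107112025A (zh) Systems and methods for restoration of speech components
CN108520741B (zh) Whispered speech restoration method, apparatus, device, and readable storage medium
CN110503970B (zh) Audio data processing method, apparatus, and storage medium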
US20200082829A1 (en) Training a dialog system using user feedback
US9953634B1 (en) Passive training for automatic speech recognition
Latif et al. Adversarial machine learning and speech emotion recognition: Utilizing generative adversarial networks for robustness
EP3992965A1 (en) Voice signal processing method and speech separation method
CN112435684B (zh) Speech separation method and apparatus, computer device, and storage medium
US20240087587A1 (en) Wearable system speech processing
US20160034811A1 (en) Efficient generation of complementary acoustic models for performing automatic speech recognition system combination
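WO2021093380A1 (zh) Noise processing method, apparatus, and system
CN107507615A (zh) Interface intelligent interaction control method, apparatus, system, and storage medium
CN102903362A (zh) Integrated local and cloud-based speech recognition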
US20200251124A1 (en) Method and terminal for reconstructing speech signal, and computer storage medium
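WO2021114847A1 (zh) Network call method and apparatus, computer device, and storage medium
CN107240396B (zh) Speaker adaptation method, apparatus, device, and storage medium
CN108877787A (zh) Speech recognition method, apparatus, server, and storage medium
CN105794226A (zh) Room impulse response estimation for acoustic echo cancellation
CN112492207B (zh) Method and apparatus for controlling camera rotation based on sound source localization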
US10079028B2 (en) Sound enhancement through reverberation matching
US11776563B2 (en) Textual echo cancellation
US20180277134A1 (en) Key Click Suppression
Liu et al. Wavoice: A mmWave-assisted Noise-resistant Speech Recognition System
Chowdhury et al. AdVerb: Visually Guided Audio Dereverberation
US11869478B2 (en) Audio processing using sound source representations

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20170829