CN107112025A - Systems and methods for restoration of speech components - Google Patents
Systems and methods for restoration of speech components
- Publication number
- CN107112025A (application CN201580060446.6A)
- Authority
- CN
- China
- Prior art keywords
- audio signal
- iteration
- distortion
- frequency area
- audio
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/038—Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/27—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
- G10L25/30—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks
Abstract
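A method for restoring distorted speech components of an audio signal distorted by noise reduction or noise cancellation includes determining distorted frequency regions and undistorted frequency regions in the audio signal. The distorted frequency regions include regions of the audio signal in which speech distortion is present. Iterations are performed using a model to refine predictions of the audio signal at the distorted frequency regions. The model is configured to modify the audio signal and may include a deep neural network trained using spectral envelopes of clean or undamaged audio signals. Before each iteration, the audio signal at the undistorted frequency regions is restored to the values it had before the first iteration, while the audio signal at the distorted frequency regions is refined starting from zero at the first iteration. The iterations end when discrepancies of the audio signal at the undistorted frequency regions meet pre-defined criteria.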
Description
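Cross-Reference to Related Applications
This application claims the benefit of U.S. Provisional Application No. 62/049,988, filed on September 12, 2014. The subject matter of the aforementioned application is incorporated herein by reference for all purposes.
Technical Field
The present application relates generally to audio processing and, more specifically, to systems and methods for restoring distorted speech components of a noise-suppressed audio signal.
Background
Noise reduction is widely used in audio processing systems to suppress or cancel unwanted noise in audio signals used to transmit speech. However, after noise cancellation and/or suppression, speech that is intertwined with noise tends to be overly attenuated or eliminated altogether by the noise reduction system.
There are models of the brain that explain how sounds are restored using an internal representation that perceptually replaces the input via a feedback mechanism. One exemplary model, the convergence-divergence zone (CDZ) model of the brain, has been described in neuroscience and, among other things, attempts to explain the spectral completion and phonemic restoration phenomena found in human speech perception.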
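Summary
This summary is provided to introduce a selection of concepts in a simplified form that are further described below in the Detailed Description. This summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used as an aid in determining the scope of the claimed subject matter.
Systems and methods for restoring distorted speech components of an audio signal are provided. An example method includes determining distorted frequency regions and undistorted frequency regions in the audio signal. The distorted frequency regions include regions of the audio signal in which speech distortion is present. The method includes performing one or more iterations using a model for refining predictions of the audio signal at the distorted frequency regions. The model can be configured to modify the audio signal.
In some embodiments, the audio signal includes a noise-suppressed audio signal obtained by at least one of noise reduction or noise cancellation of an acoustic signal including speech. The acoustic signal is attenuated or eliminated at the distorted frequency regions.
In some embodiments, the model for refining predictions of the audio signal at the distorted frequency regions includes a deep neural network trained using spectral envelopes of clean audio signals or undamaged audio signals. The refined predictions can be used for restoring speech components in the distorted frequency regions.
In some embodiments, the audio signal at the distorted frequency regions is set to zero before the first iteration. Prior to performing each of the iterations, the audio signal at the undistorted frequency regions is restored to the values it had before the first iteration.
In some embodiments, the method further includes comparing the audio signal at the undistorted frequency regions before and after each of the iterations to determine discrepancies. In certain embodiments, the method allows ending the one or more iterations if the discrepancies meet pre-determined criteria. The pre-determined criteria can be defined by lower and upper bounds of energies of the audio signal.
According to another example embodiment of the present disclosure, the steps of the method for restoring distorted speech components of an audio signal are stored on a non-transitory machine-readable medium comprising instructions which, when implemented by one or more processors, perform the recited steps.
Other example embodiments of the disclosure and aspects will become apparent from the following description taken in conjunction with the following drawings.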
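Brief Description of the Drawings
Embodiments are illustrated by way of example and not limitation in the figures of the accompanying drawings, in which like references indicate similar elements.
FIG. 1 is a block diagram illustrating an environment in which the present technology may be practiced.
FIG. 2 is a block diagram illustrating an audio device, according to an example embodiment.
FIG. 3 is a block diagram illustrating modules of an audio processing system, according to an example embodiment.
FIG. 4 is a flow chart illustrating a method for restoring speech components of an audio signal, according to an example embodiment.
FIG. 5 is a computer system which can be used to implement methods of the present technology, according to an example embodiment.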
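Detailed Description
The technology disclosed herein relates to systems and methods for restoring distorted speech components of an audio signal. Embodiments of the present technology may be practiced with any audio device configured to receive and/or provide audio such as, but not limited to, cellular phones, wearables, phone handsets, headsets, and conferencing systems. It should be understood that while some embodiments of the present technology will be described in reference to operations of a cellular phone, the present technology may be practiced with any audio device.
Audio devices can include radio frequency (RF) receivers, transmitters, and transceivers, wired and/or wireless telecommunications and/or networking devices, amplifiers, audio and/or video players, encoders, decoders, speakers, inputs, outputs, storage devices, and user input devices. Audio devices may include input devices such as buttons, switches, keys, keyboards, trackballs, sliders, touch screens, one or more microphones, gyroscopes, accelerometers, global positioning system (GPS) receivers, and the like. Audio devices may include output devices such as LED indicators, video displays, touchscreens, speakers, and the like. In some embodiments, mobile devices include wearables and hand-held devices, such as wired and/or wireless remote controls, notebook computers, tablet computers, phablets, smart phones, personal digital assistants, media players, mobile telephones, and the like.
In various embodiments, the audio devices can be operated in stationary and portable environments. Stationary environments can include residential and commercial buildings or structures, and the like. For example, stationary embodiments can include living rooms, bedrooms, home theaters, conference rooms, auditoriums, business premises, and the like. Portable environments can include moving vehicles, moving persons, other transportation means, and the like.
According to an example embodiment, a method for restoring distorted speech components of an audio signal includes determining distorted frequency regions and undistorted frequency regions in the audio signal. The distorted frequency regions include regions of the audio signal in which speech distortion is present. The method includes performing one or more iterations using a model for refining predictions of the audio signal at the distorted frequency regions. The model can be configured to modify the audio signal.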
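Referring now to FIG. 1, an environment 100 is shown in which a method for restoring distorted speech components of an audio signal can be practiced. The example environment 100 can include an audio device 104 operable at least to receive an audio signal. The audio device 104 is further operable to process and/or record/store the received audio signal.
In some embodiments, the audio device 104 includes one or more acoustic sensors, for example, microphones. In the example of FIG. 1, the audio device 104 includes a primary microphone (M1) 106 and a secondary microphone (M2) 108. In various embodiments, the microphones 106 and 108 are used to detect both acoustic audio signals (for example, a verbal communication from a user 102) and noise 110. The verbal communication can include keywords, speech, singing, and the like.
Noise 110 is unwanted sound present in the environment 100 which can be detected by, for example, sensors such as the microphones 106 and 108. In stationary environments, noise sources can include street noise, ambient noise, sounds from a mobile device such as audio, speech from entities other than an intended speaker, and the like. Noise 110 may include reverberations and echoes. Mobile environments can encounter certain kinds of noise which arise from their operation and the environments in which they operate, for example, road, track, tire/wheel, fan, wiper blade, engine, exhaust, entertainment system, communications system, competing speakers, wind, rain, waves, other vehicles, exterior, and the like noise. Acoustic signals detected by the microphones 106 and 108 can be used to separate desired speech from the noise 110.
In some embodiments, the audio device 104 is connected to a cloud-based computing resource 160 (also referred to as a computing cloud). In some embodiments, the computing cloud 160 includes one or more server farms/clusters comprising a collection of computer servers and is co-located with network switches and/or routers. The computing cloud 160 is operable to deliver one or more services over a network (e.g., the Internet, a mobile phone (cell phone) network, and the like). In certain embodiments, at least part of the processing of the audio signal is performed remotely in the computing cloud 160. The audio device 104 is operable to send data (for example, a recorded acoustic signal) to the computing cloud 160, request computing services, and receive the results of the computation.
FIG. 2 is a block diagram of an example audio device 104. As shown, the audio device 104 includes a receiver 200, a processor 202, the primary microphone 106, the secondary microphone 108, an audio processing system 210, and an output device 206. The audio device 104 may include further or different components as needed for its operation. Similarly, the audio device 104 may include fewer components that perform functions similar or equivalent to those depicted in FIG. 2. For example, the audio device 104 includes a single microphone in some embodiments, and two or more microphones in other embodiments.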
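In various embodiments, the receiver 200 can be configured to communicate with a network such as the Internet, a Wide Area Network (WAN), a Local Area Network (LAN), a cellular network, and so forth, to receive an audio signal. The received audio signal is then forwarded to the audio processing system 210.
In various embodiments, the processor 202 includes hardware and/or software and is operable to execute instructions stored in a memory (not illustrated in FIG. 2). The example processor 202 uses floating point operations, complex operations, and other operations, including noise suppression and restoration of distorted speech components in an audio signal.
The audio processing system 210 can be configured to receive acoustic signals from an acoustic source via at least one microphone (e.g., the primary microphone 106 and the secondary microphone 108 in the examples of FIG. 1 and FIG. 2) and to process the acoustic signal components. The microphones 106 and 108 in the example system are spaced a distance apart such that acoustic waves impinging on the device from certain directions exhibit different energy levels at the two or more microphones. After reception by the microphones 106 and 108, the acoustic signals can be converted into electric signals. These electric signals can, in turn, be converted by an analog-to-digital converter (not shown) into digital signals for processing in accordance with some embodiments.
In various embodiments, where the microphones 106 and 108 are omni-directional microphones that are closely spaced (e.g., 1-2 cm apart), a beamforming technique can be used to simulate forward-facing and backward-facing directional microphone responses. A level difference can be obtained using the simulated forward-facing and backward-facing directional microphones. The level difference can be used to discriminate between speech and noise in, for example, the time-frequency domain, which can be used in noise and/or echo reduction. In some embodiments, some microphones are used mainly to detect speech and other microphones are used mainly to detect noise. In various embodiments, some microphones are used to detect both noise and speech.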
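As a minimal sketch of how a level difference can separate speech from noise in the time-frequency domain, the snippet below computes a per-bin level difference in decibels between two channels and thresholds it. The function names, the 6 dB threshold, and the use of per-bin energies as input are illustrative assumptions, not details taken from this document.

```python
import math


def level_difference_db(primary_energy, secondary_energy, floor=1e-12):
    """Per-bin level difference (dB) between two microphone channels.

    `floor` (hypothetical) avoids taking the logarithm of zero.
    """
    return [
        10.0 * math.log10(max(p, floor) / max(s, floor))
        for p, s in zip(primary_energy, secondary_energy)
    ]


def speech_mask(primary_energy, secondary_energy, threshold_db=6.0):
    """Mark a bin as speech-dominated when the primary channel is
    louder than the secondary channel by at least `threshold_db`.

    The threshold value is an assumption chosen for illustration.
    """
    differences = level_difference_db(primary_energy, secondary_energy)
    return [d >= threshold_db for d in differences]


# Three bins: strong, equal, and very strong primary-channel energy.
mask = speech_mask([4.0, 1.0, 10.0], [1.0, 1.0, 1.0])
```

Bins marked `False` would be candidates for attenuation by a noise reduction stage; a practical system would likely smooth the mask over time and frequency.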
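Noise reduction can be carried out by the audio processing system 210 based on inter-microphone level differences, level salience, pitch salience, signal type classification, and speaker identification. In various embodiments, noise reduction includes noise cancellation and/or noise suppression.
In some embodiments, the output device 206 is any device which provides an audio output to a listener (e.g., the acoustic source). For example, the output device 206 may comprise a speaker, a class-D output, an earpiece of a headset, or a handset on the audio device 104.
FIG. 3 is a block diagram showing modules of the audio processing system 210, according to an example embodiment. The audio processing system 210 of FIG. 3 may provide more details for the audio processing system 210 of FIG. 2. The audio processing system 210 includes a frequency analysis module 310, a noise reduction module 320, a speech restoration module 330, and a reconstruction module 340. The input signals may be received from the receiver 200 or the microphones 106 and 108.
In some embodiments, the audio processing system 210 is operable to receive an audio signal including one or more time-domain input audio signals, depicted in the example of FIG. 3 as being received from the primary microphone (M1) and the secondary microphone (M2) of FIG. 1. The input audio signals are provided to the frequency analysis module 310.
In some embodiments, the frequency analysis module 310 is operable to receive the input audio signals. The frequency analysis module 310 generates frequency sub-bands from the time-domain input audio signals and outputs the frequency sub-band signals. In some embodiments, the frequency analysis module 310 is operable to calculate or determine speech components of the received audio signal, for example, a spectral envelope and an excitation.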
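The text does not specify how the frequency sub-bands are generated. As a rough sketch under that caveat, the following uses a plain discrete Fourier transform of one time-domain frame and groups the resulting power bins into contiguous bands; the function names and the equal-width grouping are assumptions made for illustration only.

```python
import cmath
import math


def dft_power(frame):
    """Power of the non-negative-frequency DFT bins of a real frame."""
    n = len(frame)
    power = []
    for k in range(n // 2 + 1):
        acc = sum(
            x * cmath.exp(-2j * math.pi * k * t / n)
            for t, x in enumerate(frame)
        )
        power.append(abs(acc) ** 2)
    return power


def subband_energies(frame, num_bands):
    """Sum DFT power into at most `num_bands` contiguous sub-bands."""
    power = dft_power(frame)
    size = math.ceil(len(power) / num_bands)
    return [sum(power[i:i + size]) for i in range(0, len(power), size)]


# A 16-sample cosine with two cycles per frame: energy lands in the lowest band.
frame = [math.cos(2 * math.pi * 2 * t / 16) for t in range(16)]
bands = subband_energies(frame, 3)
```

A direct DFT is quadratic in the frame length and is used here only to keep the sketch dependency-free; a real front end would use an FFT or a filter bank.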
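In various embodiments, the noise reduction module 320 includes multiple modules and receives the audio signal from the frequency analysis module 310. The noise reduction module 320 is operable to perform noise reduction in the audio signal to produce a noise-suppressed signal. In some embodiments, the noise reduction includes subtractive noise cancellation or multiplicative noise suppression. By way of example and not limitation, noise reduction methods are described in U.S. Patent Application No. 12/215,980, entitled "System and Method for Providing Noise Suppression Utilizing Null Processing Noise Subtraction," filed June 30, 2008, and in U.S. Patent Application No. 11/699,732 (U.S. Patent No. 8,194,880), entitled "System and Method for Utilizing Omni-Directional Microphones for Speech Enhancement," filed January 29, 2007, which are incorporated herein by reference in their entireties for the above purposes. The noise reduction module 320 provides a transformed, noise-suppressed signal to the speech restoration module 330. In the noise-suppressed signal, one or more speech components can be eliminated or excessively attenuated because the noise reduction transforms the frequency of the audio signal.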
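To make the distinction between subtractive and multiplicative noise reduction concrete, here is a simplified power-domain sketch. It is not the method of the referenced applications: the per-bin noise estimate, the function names, and the `min_gain` floor are all assumptions chosen for illustration.

```python
def subtractive_cancellation(signal_power, noise_power):
    """Subtract a noise estimate from each bin, clamping at zero."""
    return [max(s - n, 0.0) for s, n in zip(signal_power, noise_power)]


def multiplicative_suppression(signal_power, noise_power, min_gain=0.1):
    """Scale each bin by a gain in [min_gain, 1] derived from the
    estimated signal-to-noise relationship (a Wiener-like gain)."""
    suppressed = []
    for s, n in zip(signal_power, noise_power):
        gain = max((s - n) / s, min_gain) if s > 0 else min_gain
        suppressed.append(gain * s)
    return suppressed


signal = [4.0, 1.0]
noise = [1.0, 2.0]
cancelled = subtractive_cancellation(signal, noise)
suppressed = multiplicative_suppression(signal, noise)
```

In the second bin the noise estimate exceeds the signal, so subtraction removes the bin entirely while the gain-based variant leaves a small residual. Bins like this one, where speech may have been removed along with the noise, are the kind of region the restoration stage is meant to repair.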
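In some embodiments, the speech restoration module 330 receives the noise-suppressed signal from the noise reduction module 320. The speech restoration module 330 is configured to restore damaged speech components in the noise-suppressed signal. In some embodiments, the speech restoration module 330 includes a deep neural network (DNN) 315 trained for restoration of speech components in damaged frequency regions. In certain embodiments, the DNN 315 is configured as an autoencoder.
In various embodiments, the DNN 315 is trained using machine learning. The DNN 315 is a feed-forward artificial neural network having more than one layer of hidden units between its inputs and outputs. The DNN 315 may be trained by receiving input features of one or more frames of spectral envelopes of clean audio signals or undamaged audio signals. In the training process, the DNN 315 may extract learned higher-order spectro-temporal features of the clean or undamaged spectral envelopes. In various embodiments, the DNN 315, trained using the spectral envelopes of clean or undamaged signals, is used in the speech restoration module 330 to refine predictions of the clean speech components that are particularly suitable for restoring speech components in the distorted frequency regions. By way of example and not limitation, exemplary methods concerning deep neural networks are also described in commonly assigned U.S. Patent Application No. 14/614,348, entitled "Noise-Robust Multi-Lingual Keyword Spotting with a Deep Neural Network Based Architecture," filed February 4, 2015, and U.S. Patent Application No. 14/745,176, entitled "Key Click Suppression," filed June 9, 2015, which are incorporated herein by reference in their entireties.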
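The following sketch shows the shape of a feed-forward network with more than one hidden layer whose output has the same dimension as its input, as an autoencoder-style model would. Training is omitted, and the layer sizes, sigmoid activation, and weight values are placeholders assumed for illustration; none of them come from this document.

```python
import math


def dense(inputs, weights, biases):
    """One fully connected layer: weights is a list of rows, one per output unit."""
    return [
        sum(w * x for w, x in zip(row, inputs)) + b
        for row, b in zip(weights, biases)
    ]


def sigmoid(values):
    return [1.0 / (1.0 + math.exp(-v)) for v in values]


def forward(envelope, layers):
    """Run a spectral-envelope frame through the network.

    `layers` is a list of (weights, biases) pairs. Hidden layers use a
    sigmoid activation; the final layer is linear.
    """
    activation = list(envelope)
    for index, (weights, biases) in enumerate(layers):
        activation = dense(activation, weights, biases)
        if index < len(layers) - 1:
            activation = sigmoid(activation)
    return activation


# Placeholder weights: 3 inputs -> 2 hidden -> 2 hidden -> 3 outputs.
layers = [
    ([[0.0, 0.0, 0.0], [0.0, 0.0, 0.0]], [0.0, 0.0]),
    ([[0.0, 0.0], [0.0, 0.0]], [0.0, 0.0]),
    ([[1.0, 1.0], [1.0, -1.0], [0.0, 0.0]], [0.0, 0.0, 0.25]),
]
prediction = forward([1.0, 2.0, 3.0], layers)
```

With trained weights in place of the placeholders, `forward` would map a damaged envelope to a prediction of the clean envelope, which is the role the text assigns to DNN 315.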
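During operation, the speech restoration module 330 may assign a zero value to the frequency regions of the noise-suppressed signal in which speech distortion is present (distorted regions). In the example of FIG. 3, the noise-suppressed signal is further provided to the input of the DNN 315 to receive an output signal. The output signal includes initial predictions for the distorted regions, which might not be very accurate.
In some embodiments, to improve the initial predictions, an iterative feedback mechanism is further applied. The output signal 350 is optionally fed back to the input of the DNN 315 to receive a next iteration of the output signal, while the initial noise-suppressed signal is kept at the undistorted regions of the output signal. To prevent the system from diverging, the output at the undistorted regions may be compared to the input after each iteration, and upper and lower bounds may be applied to the estimated energy at the undistorted frequency regions based on the energy in the input audio signal. In various embodiments, several iterations are applied to improve the accuracy of the predictions until a level of accuracy desired for a particular application is met, e.g., no further iterations are performed in response to discrepancies of the audio signal at the undistorted regions meeting pre-defined criteria for the particular application.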
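The divergence guard described above can be sketched as a clamp on the estimated energy in the undistorted bins. The bound factors (0.5 and 2.0) and the function name are assumptions; the text only states that upper and lower bounds are derived from the energy of the input signal.

```python
def clamp_to_input_energy(estimate, reference, undistorted,
                          lower=0.5, upper=2.0):
    """Bound the estimated energy in undistorted bins.

    Each undistorted bin k is limited to the interval
    [lower * reference[k], upper * reference[k]]; distorted bins are
    left untouched. The factors are hypothetical.
    """
    bounded = list(estimate)
    for k in undistorted:
        low, high = lower * reference[k], upper * reference[k]
        bounded[k] = min(max(estimate[k], low), high)
    return bounded


# Bins 0 and 1 are undistorted; bin 2 is distorted and is not clamped.
bounded = clamp_to_input_energy(
    estimate=[10.0, 0.1, 5.0],
    reference=[2.0, 2.0, 0.0],
    undistorted=[0, 1],
)
```

Applying such a clamp after each pass keeps the model output near the trusted part of the input, so an error in one iteration cannot grow without limit in the next.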
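In some embodiments, the reconstruction module 340 is operable to receive the noise-suppressed signal with restored speech components from the speech restoration module 330 and to reconstruct the restored speech components into a single audio signal.
FIG. 4 is a flow chart showing a method 400 for restoring distorted speech components of an audio signal, according to an example embodiment. The method 400 can be performed using the speech restoration module 330.
The method can commence, in block 402, with determining distorted frequency regions and undistorted frequency regions in the audio signal. The distorted frequency regions are regions in which speech distortion is present due to, for example, noise reduction.
In block 404, the method 400 includes performing one or more iterations using a model to refine predictions of the audio signal at the distorted frequency regions. The model can be configured to modify the audio signal. In some embodiments, the model includes a deep neural network trained with spectral envelopes of clean or undamaged signals. In certain embodiments, the predictions of the audio signal at the distorted frequency regions are set to zero before the first iteration. Prior to performing each of the iterations, the audio signal at the undistorted frequency regions is restored to the values of the audio signal before the first iteration.
In block 406, the method includes comparing the audio signal at the undistorted regions before and after each of the iterations to determine discrepancies.
In block 408, the iterations are stopped if the discrepancies meet pre-defined criteria.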
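Blocks 402 through 408 can be sketched as a single loop. In the sketch below the set of distorted bins is supplied by the caller, `mean_model` is a toy stand-in for the trained deep neural network, and the stopping rule (largest change in any undistorted bin at or below `tolerance`) is one assumed form of the pre-defined criteria; none of these specifics are given in the text.

```python
def mean_model(envelope):
    """Toy stand-in for the trained model: predicts a flat envelope."""
    average = sum(envelope) / len(envelope)
    return [average] * len(envelope)


def restore(envelope, distorted, model, tolerance=1e-3, max_iterations=50):
    """Iteratively refine predictions in the distorted bins.

    `distorted` is a set of bin indices (block 402 is assumed done).
    Returns the restored envelope and the number of iterations run.
    """
    bins = range(len(envelope))
    undistorted = [k for k in bins if k not in distorted]
    # Predictions in distorted bins start at zero before the first iteration.
    current = [0.0 if k in distorted else envelope[k] for k in bins]
    iterations = 0
    for iterations in range(1, max_iterations + 1):
        output = model(current)  # block 404: one model pass
        # Block 406: compare undistorted bins before and after the pass.
        difference = max(
            (abs(output[k] - current[k]) for k in undistorted), default=0.0
        )
        # Keep predictions in distorted bins; restore the original
        # values in undistorted bins before the next pass.
        current = [output[k] if k in distorted else envelope[k] for k in bins]
        if difference <= tolerance:  # block 408: stop when criterion is met
            break
    return current, iterations


restored, passes = restore([1.0, 1.0, 0.3, 1.0], {2}, mean_model)
```

With this toy model the gap between the predicted bin and the surrounding bins shrinks by a factor of four on every pass, so the loop stops after a handful of iterations with the distorted bin close to its neighbors.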
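Some example embodiments include speech dynamics. For the speech dynamics, the audio processing system 210 can be provided with multiple consecutive audio signal frames and trained to output the same number of frames. Speech dynamics are included in some embodiments to enforce temporal smoothness and to allow restoration of longer distorted regions.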
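One way to present multiple consecutive frames to a model, sketched here as an assumption rather than as the method of this document, is to concatenate a sliding window of frames into one input vector and split the model output back into the same number of frames. The function names and the window length are hypothetical.

```python
def stack_frames(frames, context):
    """Concatenate each run of `context` consecutive frames into one vector."""
    windows = []
    for start in range(len(frames) - context + 1):
        window = frames[start:start + context]
        windows.append([value for frame in window for value in frame])
    return windows


def unstack_frames(vector, frame_size):
    """Split a stacked vector back into frames of `frame_size` values."""
    return [vector[i:i + frame_size] for i in range(0, len(vector), frame_size)]


frames = [[1, 2], [3, 4], [5, 6]]
stacked = stack_frames(frames, context=2)
first_window = unstack_frames(stacked[0], frame_size=2)
```

Because neighboring windows overlap, each frame is predicted in several contexts, which is one way a system could encourage temporal smoothness across a long distorted region.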
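Various embodiments are used to provide improvements for a number of applications such as noise suppression, bandwidth extension, speech coding, and speech synthesis. Additionally, the methods and systems are amenable to sensor fusion such that, in some embodiments, the methods and systems can be extended to include other non-acoustic sensor information. Exemplary methods concerning sensor fusion are also described in commonly assigned U.S. Patent Application No. 14/548,207, entitled "Method for Modeling User Possession of Mobile Device for User Authentication Framework," filed November 19, 2014, and U.S. Patent Application No. 14/331,205, entitled "Selection of System Parameters Based on Non-Acoustic Sensor Information," filed July 14, 2014, which are incorporated herein by reference in their entireties.
Various methods for restoration of noise-reduced speech are also described in commonly assigned U.S. Patent Application No. 13/751,907 (U.S. Patent No. 8,615,394), entitled "Restoration of Noise Reduced Speech," filed January 28, 2013, which is incorporated herein by reference in its entirety.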
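FIG. 5 illustrates an exemplary computer system 500 that may be used to implement some embodiments of the present invention. The computer system 500 of FIG. 5 may be implemented in the context of computing systems, networks, servers, or combinations thereof. The computer system 500 of FIG. 5 includes one or more processor units 510 and main memory 520. Main memory 520 stores, in part, instructions and data for execution by the processor units 510. In this example, main memory 520 stores the executable code when in operation. The computer system 500 of FIG. 5 further includes a mass data storage 530, a portable storage device 540, output devices 550, user input devices 560, a graphics display system 570, and peripheral devices 580.
The components shown in FIG. 5 are depicted as being connected via a single bus 590. The components may be connected through one or more data transport means. The processor unit 510 and main memory 520 are connected via a local microprocessor bus, and the mass data storage 530, peripheral devices 580, portable storage device 540, and graphics display system 570 are connected via one or more input/output (I/O) buses.
Mass data storage 530, which can be implemented with a magnetic disk drive, a solid state drive, or an optical disk drive, is a non-volatile storage device for storing data and instructions for use by the processor unit 510. Mass data storage 530 stores the system software for implementing embodiments of the present disclosure for purposes of loading that software into main memory 520.
The portable storage device 540 operates in conjunction with a portable non-volatile storage medium, such as a flash drive, floppy disk, compact disk, digital video disc, or Universal Serial Bus (USB) storage device, to input and output data and code to and from the computer system 500 of FIG. 5. The system software for implementing embodiments of the present disclosure is stored on such a portable medium and input to the computer system 500 via the portable storage device 540.
User input devices 560 can provide a portion of a user interface. User input devices 560 may include one or more microphones; an alphanumeric keypad, such as a keyboard, for inputting alphanumeric and other information; or a pointing device, such as a mouse, a trackball, a stylus, or cursor direction keys. User input devices 560 can also include a touchscreen. Additionally, the computer system 500 as shown in FIG. 5 includes output devices 550. Suitable output devices 550 include speakers, printers, network interfaces, and monitors.
The graphics display system 570 includes a liquid crystal display (LCD) or other suitable display device. The graphics display system 570 is configurable to receive textual and graphical information and to process the information for output to the display device.
Peripheral devices 580 may include any type of computer support device that adds additional functionality to the computer system 500.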
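The components provided in the computer system 500 of FIG. 5 are those typically found in computer systems that may be suitable for use with embodiments of the present disclosure and are intended to represent a broad category of such computer components that are well known in the art. Thus, the computer system 500 of FIG. 5 can be a personal computer (PC), hand-held computer system, telephone, mobile computer system, workstation, tablet, phablet, mobile phone, server, minicomputer, mainframe computer, wearable, or any other computer system. The computer may also include different bus configurations, networked platforms, multi-processor platforms, and the like. Various operating systems may be used, including UNIX, LINUX, WINDOWS, MAC OS, PALM OS, QNX, ANDROID, IOS, CHROME, TIZEN, and other suitable operating systems.
The processing for various embodiments may be implemented in cloud-based software. In some embodiments, the computer system 500 is implemented as a cloud-based computing environment, such as a virtual machine operating within a computing cloud. In other embodiments, the computer system 500 may itself include a cloud-based computing environment, where the functionalities of the computer system 500 are executed in a distributed fashion. Thus, the computer system 500, when configured as a computing cloud, may include pluralities of computing devices in various forms, as described in greater detail below.
In general, a cloud-based computing environment is a resource that typically combines the computational power of a large grouping of processors (such as within web servers) and/or combines the storage capacity of a large grouping of computer memories or storage devices. Systems that provide cloud-based resources may be utilized exclusively by their owners, or such systems may be accessible to outside users who deploy applications within the computing infrastructure to obtain the benefit of large computational or storage resources.
The cloud may be formed, for example, by a network of web servers that comprise a plurality of computing devices, such as the computer system 500, with each server (or at least a plurality thereof) providing processor and/or storage resources. These servers may manage workloads provided by multiple users (e.g., cloud resource customers or other users). Typically, each user places workload demands upon the cloud that vary in real time, sometimes dramatically. The nature and extent of these variations typically depend on the type of business associated with the user.
The present technology is described above with reference to example embodiments. Other variations upon the example embodiments are therefore intended to be covered by the present disclosure.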
Claims (20)
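1. A method for restoring distorted speech components of an audio signal, the method comprising:
determining distorted frequency regions and undistorted frequency regions in the audio signal, the distorted frequency regions including regions of the audio signal in which speech distortion is present; and
performing one or more iterations using a model to refine predictions of the audio signal at the distorted frequency regions, the model being configured to modify the audio signal.
2. The method of claim 1, wherein the audio signal includes a noise-suppressed audio signal obtained by at least one of noise reduction or noise cancellation of an acoustic signal including speech.
3. The method of claim 2, wherein the acoustic signal is attenuated or eliminated at the distorted frequency regions.
4. The method of claim 1, wherein the model includes a deep neural network trained using spectral envelopes of clean audio signals or undamaged audio signals.
5. The method of claim 1, wherein the refined predictions are used for restoring speech components in the distorted frequency regions.
6. The method of claim 1, wherein the audio signal at the distorted frequency regions is set to zero before a first of the one or more iterations.
7. The method of claim 1, wherein, prior to performing each of the one or more iterations, the audio signal at the undistorted frequency regions is restored to the values of the audio signal before a first of the one or more iterations.
8. The method of claim 1, further comprising: after performing each of the one or more iterations, comparing the audio signal at the undistorted frequency regions before and after the iteration to determine discrepancies.
9. The method of claim 8, further comprising: ending the one or more iterations if the discrepancies meet pre-determined criteria.
10. The method of claim 9, wherein the pre-determined criteria are defined by lower and upper bounds of energies of the audio signal.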
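11. A system for restoring distorted speech components of an audio signal, the system comprising:
at least one processor; and
a memory communicatively coupled with the at least one processor, the memory storing instructions which, when executed by the at least one processor, perform a method comprising:
determining distorted frequency regions and undistorted frequency regions in the audio signal, the distorted frequency regions including regions of the audio signal in which speech distortion is present; and
performing one or more iterations using a model to refine predictions of the audio signal at the distorted frequency regions, the model being configured to modify the audio signal.
12. The system of claim 11, wherein the audio signal includes a noise-suppressed audio signal obtained by at least one of noise reduction or noise cancellation of an acoustic signal including speech.
13. The system of claim 12, wherein the acoustic signal is attenuated or eliminated at the distorted frequency regions.
14. The system of claim 11, wherein the model includes a deep neural network.
15. The system of claim 14, wherein the deep neural network is trained using spectral envelopes of clean audio signals or undamaged audio signals.
16. The system of claim 15, wherein the audio signal at the distorted frequency regions is set to zero before a first of the one or more iterations.
17. The system of claim 11, wherein, prior to performing each of the one or more iterations, the audio signal at the undistorted frequency regions is restored to its values before a first of the one or more iterations.
18. The system of claim 11, further comprising: after performing each of the one or more iterations, comparing the audio signal at the undistorted frequency regions before and after the iteration to determine discrepancies.
19. The system of claim 18, further comprising: ending the one or more iterations if the discrepancies meet pre-determined criteria, the pre-determined criteria being defined by lower and upper bounds of energies of the audio signal.
20. A non-transitory computer-readable storage medium having embodied thereon instructions which, when executed by at least one processor, perform steps of a method, the method comprising:
determining distorted frequency regions and undistorted frequency regions in the audio signal, the distorted frequency regions including regions of the audio signal in which speech distortion is present; and
performing one or more iterations using a model to refine predictions of the audio signal at the distorted frequency regions, the model being configured to modify the audio signal.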
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201462049988P | 2014-09-12 | 2014-09-12 | |
US62/049,988 | 2014-09-12 | ||
PCT/US2015/049816 WO2016040885A1 (en) | 2014-09-12 | 2015-09-11 | Systems and methods for restoration of speech components |
Publications (1)
Publication Number | Publication Date |
---|---|
CN107112025A | 2017-08-29 |
Family
ID=55455344
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201580060446.6A (Pending, published as CN107112025A) | Systems and methods for restoration of speech components | 2014-09-12 | 2015-09-11 |
Country Status (4)
Country | Link |
---|---|
US (1) | US9978388B2 (zh) |
CN (1) | CN107112025A (zh) |
DE (1) | DE112015004185T5 (zh) |
WO (1) | WO2016040885A1 (zh) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109147804A (zh) * | 2018-06-05 | 2019-01-04 | 安克创新科技股份有限公司 | 一种基于深度学习的音质特性处理方法及系统 |
CN109147805A (zh) * | 2018-06-05 | 2019-01-04 | 安克创新科技股份有限公司 | 基于深度学习的音频音质增强 |
CN112088385A (zh) * | 2018-04-23 | 2020-12-15 | 塞伦妮经营公司 | 用于回归深度神经网络的判别训练的系统和方法 |
CN112289343A (zh) * | 2020-10-28 | 2021-01-29 | 腾讯音乐娱乐科技(深圳)有限公司 | 音频修复方法、装置及电子设备和计算机可读存储介质 |
CN112820315A (zh) * | 2020-07-13 | 2021-05-18 | 腾讯科技(深圳)有限公司 | 音频信号处理方法、装置、计算机设备及存储介质 |
Families Citing this family (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9838784B2 (en) | 2009-12-02 | 2017-12-05 | Knowles Electronics, Llc | Directional audio capture |
US9536540B2 (en) | 2013-07-19 | 2017-01-03 | Knowles Electronics, Llc | Speech signal separation and synthesis based on auditory scene analysis and speech modeling |
WO2016040885A1 (en) | 2014-09-12 | 2016-03-17 | Audience, Inc. | Systems and methods for restoration of speech components |
US9820042B1 (en) | 2016-05-02 | 2017-11-14 | Knowles Electronics, Llc | Stereo separation and directional suppression with omni-directional microphones |
US10311219B2 (en) * | 2016-06-07 | 2019-06-04 | Vocalzoom Systems Ltd. | Device, system, and method of user authentication utilizing an optical microphone |
US9984701B2 (en) | 2016-06-10 | 2018-05-29 | Apple Inc. | Noise detection and removal systems, and related methods |
US11205103B2 (en) | 2016-12-09 | 2021-12-21 | The Research Foundation for the State University | Semisupervised autoencoder for sentiment analysis |
KR20180111271A (ko) | 2017-03-31 | 2018-10-11 | 삼성전자주식회사 | 신경망 모델을 이용하여 노이즈를 제거하는 방법 및 장치 |
KR20190037844A (ko) * | 2017-09-29 | 2019-04-08 | 엘지전자 주식회사 | 이동 단말기 |
EP3474280B1 (en) * | 2017-10-19 | 2021-07-07 | Goodix Technology (HK) Company Limited | Signal processor for speech signal enhancement |
CN111164682A (zh) | 2017-10-24 | 2020-05-15 | 三星电子株式会社 | 使用机器学习的音频重建方法和设备 |
US11416742B2 (en) | 2017-11-24 | 2022-08-16 | Electronics And Telecommunications Research Institute | Audio signal encoding method and apparatus and audio signal decoding method and apparatus using psychoacoustic-based weighted error function |
WO2019133765A1 (en) | 2017-12-28 | 2019-07-04 | Knowles Electronics, Llc | Direction of arrival estimation for multiple audio content streams |
US10522167B1 (en) * | 2018-02-13 | 2019-12-31 | Amazon Techonlogies, Inc. | Multichannel noise cancellation using deep neural network masking |
US10672414B2 (en) | 2018-04-13 | 2020-06-02 | Microsoft Technology Licensing, Llc | Systems, methods, and computer-readable media for improved real-time audio processing |
CN109545227B (zh) * | 2018-04-28 | 2023-05-09 | 华中师范大学 | 基于深度自编码网络的说话人性别自动识别方法及系统 |
EP3807878B1 (en) | 2018-06-14 | 2023-12-13 | Pindrop Security, Inc. | Deep neural network based speech enhancement |
US11341983B2 (en) | 2018-09-17 | 2022-05-24 | Honeywell International Inc. | System and method for audio noise reduction |
US11682411B2 (en) * | 2021-08-31 | 2023-06-20 | Spotify Ab | Wind noise suppresor |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1653520A (zh) * | 2002-05-20 | 2005-08-10 | 微软公司 | 确定和降噪相关联的不确定性的方法 |
CN1910656A (zh) * | 2004-01-20 | 2007-02-07 | 杜比实验室特许公司 | 基于块分组的音频编码 |
CN101512938A (zh) * | 2006-08-01 | 2009-08-19 | Dts(英属维尔京群岛)有限公司 | 用于补偿音频变换器的线性和非-线性失真的神经网络滤波技术 |
CN101976567A (zh) * | 2010-10-28 | 2011-02-16 | 吉林大学 | 一种语音信号差错掩盖方法 |
CN102280108A (zh) * | 2010-06-10 | 2011-12-14 | 佳能株式会社 | 音频信号处理设备及其控制方法 |
CN103430233A (zh) * | 2010-12-14 | 2013-12-04 | 弗兰霍菲尔运输应用研究公司 | 用于预测性编码的编码器及方法、用于译码的译码器及方法、用于预测性编码及译码的系统及方法和预测性编码信息信号 |
Family Cites Families (355)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4025724A (en) | 1975-08-12 | 1977-05-24 | Westinghouse Electric Corporation | Noise cancellation apparatus |
US4137510A (en) | 1976-01-22 | 1979-01-30 | Victor Company Of Japan, Ltd. | Frequency band dividing filter |
ATE41544T1 (de) | 1982-08-04 | 1989-04-15 | Trans Data Associates | Einrichtung und verfahren zur spracherkennung unter verwendung von vokaltraktmodell. |
US4802227A (en) | 1987-04-03 | 1989-01-31 | American Telephone And Telegraph Company | Noise reduction processing arrangement for microphone arrays |
US5115404A (en) | 1987-12-23 | 1992-05-19 | Tektronix, Inc. | Digital storage oscilloscope with indication of aliased display |
US4969203A (en) | 1988-01-25 | 1990-11-06 | North American Philips Corporation | Multiplicative sieve signal processing |
US5182557A (en) | 1989-09-20 | 1993-01-26 | Semborg Recrob, Corp. | Motorized joystick |
US5204906A (en) | 1990-02-13 | 1993-04-20 | Matsushita Electric Industrial Co., Ltd. | Voice signal processing device |
JPH0454100A (ja) | 1990-06-22 | 1992-02-21 | Clarion Co Ltd | 音声信号補償回路 |
WO1992005538A1 (en) | 1990-09-14 | 1992-04-02 | Chris Todter | Noise cancelling systems |
GB9107011D0 (en) | 1991-04-04 | 1991-05-22 | Gerzon Michael A | Illusory sound distance control method |
US5224170A (en) | 1991-04-15 | 1993-06-29 | Hewlett-Packard Company | Time domain compensation for transducer mismatch |
US5440751A (en) | 1991-06-21 | 1995-08-08 | Compaq Computer Corp. | Burst data transfer to single cycle data transfer conversion and strobe signal conversion |
CA2080608A1 (en) | 1992-01-02 | 1993-07-03 | Nader Amini | Bus control logic for computer system having dual bus architecture |
EP0559348A3 (en) | 1992-03-02 | 1993-11-03 | AT&T Corp. | Rate control loop processor for perceptual encoder/decoder |
JPH05300419A (ja) | 1992-04-16 | 1993-11-12 | Sanyo Electric Co Ltd | ビデオカメラ |
US5400409A (en) | 1992-12-23 | 1995-03-21 | Daimler-Benz Ag | Noise-reduction method for noise-affected voice channels |
US5524056A (en) | 1993-04-13 | 1996-06-04 | Etymotic Research, Inc. | Hearing aid having plural microphones and a microphone switching system |
DE4316297C1 (de) | 1993-05-14 | 1994-04-07 | Fraunhofer Ges Forschung | Frequenzanalyseverfahren |
JPH07336793A (ja) | 1994-06-09 | 1995-12-22 | Matsushita Electric Ind Co Ltd | ビデオカメラ用マイクロホン |
US5978567A (en) | 1994-07-27 | 1999-11-02 | Instant Video Technologies Inc. | System for distribution of interactive multimedia and linear programs by enabling program webs which include control scripts to define presentation by client transceiver |
US5598505A (en) | 1994-09-30 | 1997-01-28 | Apple Computer, Inc. | Cepstral correction vector quantizer for speech recognition |
GB9501734D0 (en) | 1995-01-30 | 1995-03-22 | Neopost Ltd | franking apparatus and printing means therefor |
US5682463A (en) | 1995-02-06 | 1997-10-28 | Lucent Technologies Inc. | Perceptual audio compression based on loudness uncertainty |
JP3307138B2 (ja) | 1995-02-27 | 2002-07-24 | ソニー株式会社 | 信号符号化方法及び装置、並びに信号復号化方法及び装置 |
DE69619284T3 (de) * | 1995-03-13 | 2006-04-27 | Matsushita Electric Industrial Co., Ltd., Kadoma | Vorrichtung zur Erweiterung der Sprachbandbreite |
US6263307B1 (en) | 1995-04-19 | 2001-07-17 | Texas Instruments Incorporated | Adaptive weiner filtering using line spectral frequencies |
US5625697A (en) | 1995-05-08 | 1997-04-29 | Lucent Technologies Inc. | Microphone selection process for use in a multiple microphone voice actuated switching system |
US5774837A (en) | 1995-09-13 | 1998-06-30 | Voxware, Inc. | Speech coding system and method using voicing probability determination |
FI99062C (fi) | 1995-10-05 | 1997-09-25 | Nokia Mobile Phones Ltd | Puhesignaalin taajuuskorjaus matkapuhelimessa |
US5819215A (en) | 1995-10-13 | 1998-10-06 | Dobson; Kurt | Method and apparatus for wavelet based data compression having adaptive bit rate control for compression of digital audio or other sensory data |
US5956674A (en) | 1995-12-01 | 1999-09-21 | Digital Theater Systems, Inc. | Multi-channel predictive subband audio coder using psychoacoustic adaptive bit allocation in frequency, time and over the multiple channels |
US5734713A (en) | 1996-01-30 | 1998-03-31 | Jabra Corporation | Method and system for remote telephone calibration |
US6035177A (en) | 1996-02-26 | 2000-03-07 | Donald W. Moses | Simultaneous transmission of ancillary and audio signals by means of perceptual coding |
JP3325770B2 (ja) | 1996-04-26 | 2002-09-17 | 三菱電機株式会社 | ノイズ低減回路及びノイズ低減装置及びノイズ低減方法 |
US5715319A (en) | 1996-05-30 | 1998-02-03 | Picturetel Corporation | Method and apparatus for steerable and endfire superdirective microphone arrays with reduced analog-to-digital converter and computational requirements |
US5806025A (en) | 1996-08-07 | 1998-09-08 | U S West, Inc. | Method and system for adaptive filtering of speech signals using signal-to-noise ratio to choose subband filter bank |
US5757933A (en) | 1996-12-11 | 1998-05-26 | Micro Ear Technology, Inc. | In-the-ear hearing aid with directional microphone system |
JP2930101B2 (ja) | 1997-01-29 | 1999-08-03 | 日本電気株式会社 | 雑音消去装置 |
US6104993A (en) | 1997-02-26 | 2000-08-15 | Motorola, Inc. | Apparatus and method for rate determination in a communication system |
FI114247B (fi) | 1997-04-11 | 2004-09-15 | Nokia Corp | Menetelmä ja laite puheen tunnistamiseksi |
US6281749B1 (en) | 1997-06-17 | 2001-08-28 | Srs Labs, Inc. | Sound enhancement system |
US6084916A (en) | 1997-07-14 | 2000-07-04 | Vlsi Technology, Inc. | Receiver sample rate frequency adjustment for sample rate conversion between asynchronous digital systems |
US5991385A (en) | 1997-07-16 | 1999-11-23 | International Business Machines Corporation | Enhanced audio teleconferencing with sound field effect |
US6144937A (en) | 1997-07-23 | 2000-11-07 | Texas Instruments Incorporated | Noise suppression of speech by signal processing including applying a transform to time domain input sequences of digital signals representing audio information |
KR19990015748A (ko) | 1997-08-09 | 1999-03-05 | 구자홍 | 전자우편 |
FR2768547B1 (fr) | 1997-09-18 | 1999-11-19 | Matra Communication | Procede de debruitage d'un signal de parole numerique |
US6202047B1 (en) | 1998-03-30 | 2001-03-13 | At&T Corp. | Method and apparatus for speech recognition using second order statistics and linear estimation of cepstral coefficients |
WO1999053673A1 (en) | 1998-04-08 | 1999-10-21 | British Telecommunications Public Limited Company | Teleconferencing system |
US6684199B1 (en) | 1998-05-20 | 2004-01-27 | Recording Industry Association Of America | Method for minimizing pirating and/or unauthorized copying and/or unauthorized access of/to data on/from data media including compact discs and digital versatile discs, and system and data media for same |
US6421388B1 (en) | 1998-05-27 | 2002-07-16 | 3Com Corporation | Method and apparatus for determining PCM code translations |
US6717991B1 (en) | 1998-05-27 | 2004-04-06 | Telefonaktiebolaget Lm Ericsson (Publ) | System and method for dual microphone signal noise reduction using spectral subtraction |
US6041130A (en) | 1998-06-23 | 2000-03-21 | Mci Communications Corporation | Headset with multiple connections |
US20040066940A1 (en) | 2002-10-03 | 2004-04-08 | Silentium Ltd. | Method and system for inhibiting noise produced by one or more sources of undesired sound from pickup by a speech recognition unit |
US6240386B1 (en) | 1998-08-24 | 2001-05-29 | Conexant Systems, Inc. | Speech codec employing noise classification for noise compensation |
US6381469B1 (en) | 1998-10-02 | 2002-04-30 | Nokia Corporation | Frequency equalizer, and associated method, for a radio telephone |
US6768979B1 (en) | 1998-10-22 | 2004-07-27 | Sony Corporation | Apparatus and method for noise attenuation in a speech recognition system |
US6188769B1 (en) | 1998-11-13 | 2001-02-13 | Creative Technology Ltd. | Environmental reverberation processor |
US6504926B1 (en) | 1998-12-15 | 2003-01-07 | Mediaring.Com Ltd. | User control system for internet phone quality |
US6873837B1 (en) | 1999-02-03 | 2005-03-29 | Matsushita Electric Industrial Co., Ltd. | Emergency reporting system and terminal apparatus therein |
US6496795B1 (en) | 1999-05-05 | 2002-12-17 | Microsoft Corporation | Modulated complex lapped transform for integrated signal enhancement and coding |
US7423983B1 (en) | 1999-09-20 | 2008-09-09 | Broadcom Corporation | Voice and data exchange over a packet based network |
US6219408B1 (en) | 1999-05-28 | 2001-04-17 | Paul Kurth | Apparatus and method for simultaneously transmitting biomedical data and human voice over conventional telephone lines |
US6490556B2 (en) | 1999-05-28 | 2002-12-03 | Intel Corporation | Audio classifier for half duplex communication |
US7035666B2 (en) | 1999-06-09 | 2006-04-25 | Shimon Silberfening | Combination cellular telephone, sound storage device, and email communication device |
US6381284B1 (en) | 1999-06-14 | 2002-04-30 | T. Bogomolny | Method of and devices for telecommunications |
US6226616B1 (en) | 1999-06-21 | 2001-05-01 | Digital Theater Systems, Inc. | Sound quality of established low bit-rate audio coding systems without loss of decoder compatibility |
EP1081685A3 (en) | 1999-09-01 | 2002-04-24 | TRW Inc. | System and method for noise reduction using a single microphone |
US6480610B1 (en) | 1999-09-21 | 2002-11-12 | Sonic Innovations, Inc. | Subband acoustic feedback cancellation in hearing aids |
US7054809B1 (en) | 1999-09-22 | 2006-05-30 | Mindspeed Technologies, Inc. | Rate selection method for selectable mode vocoder |
US6636829B1 (en) | 1999-09-22 | 2003-10-21 | Mindspeed Technologies, Inc. | Speech communication system and method for handling lost frames |
FI116643B (fi) | 1999-11-15 | 2006-01-13 | Nokia Corp | Kohinan vaimennus |
US7058572B1 (en) | 2000-01-28 | 2006-06-06 | Nortel Networks Limited | Reducing acoustic noise in wireless and landline based telephony |
US6584438B1 (en) | 2000-04-24 | 2003-06-24 | Qualcomm Incorporated | Frame erasure compensation method in a variable rate speech coder |
JP2001318694A (ja) | 2000-05-10 | 2001-11-16 | Toshiba Corp | 信号処理装置、信号処理方法および記録媒体 |
US6377637B1 (en) | 2000-07-12 | 2002-04-23 | Andrea Electronics Corporation | Sub-band exponential smoothing noise canceling system |
US8019091B2 (en) | 2000-07-19 | 2011-09-13 | Aliphcom, Inc. | Voice activity detector (VAD) -based multiple-microphone acoustic noise suppression |
US20030179888A1 (en) | 2002-03-05 | 2003-09-25 | Burnett Gregory C. | Voice activity detection (VAD) devices and methods for use with noise suppression systems |
US20020041678A1 (en) | 2000-08-18 | 2002-04-11 | Filiz Basburg-Ertem | Method and apparatus for integrated echo cancellation and noise reduction for fixed subscriber terminals |
US6862567B1 (en) | 2000-08-30 | 2005-03-01 | Mindspeed Technologies, Inc. | Noise suppression in the frequency domain by adjusting gain according to voicing parameters |
JP2002149200A (ja) | 2000-08-31 | 2002-05-24 | Matsushita Electric Ind Co Ltd | 音声処理装置及び音声処理方法 |
DE10045197C1 (de) | 2000-09-13 | 2002-03-07 | Siemens Audiologische Technik | Verfahren zum Betrieb eines Hörhilfegerätes oder Hörgerätessystems sowie Hörhilfegerät oder Hörgerätesystem |
US6520673B2 (en) | 2000-12-08 | 2003-02-18 | Msp Corporation | Mixing devices for sample recovery from a USP induction port or a pre-separator |
US6907045B1 (en) | 2000-11-17 | 2005-06-14 | Nortel Networks Limited | Method and apparatus for data-path conversion comprising PCM bit robbing signalling |
EP1340320B1 (en) | 2000-11-30 | 2008-10-15 | Intrasonics Limited | Apparatus and system for using data signal embedded into an acoustic signal |
US7472059B2 (en) | 2000-12-08 | 2008-12-30 | Qualcomm Incorporated | Method and apparatus for robust speech classification |
US20020097884A1 (en) | 2001-01-25 | 2002-07-25 | Cairns Douglas A. | Variable noise reduction algorithm based on vehicle conditions |
US6754623B2 (en) | 2001-01-31 | 2004-06-22 | International Business Machines Corporation | Methods and apparatus for ambient noise removal in speech recognition |
US7617099B2 (en) | 2001-02-12 | 2009-11-10 | FortMedia Inc. | Noise suppression by two-channel tandem spectrum modification for speech signal in an automobile |
EP1239455A3 (de) | 2001-03-09 | 2004-01-21 | Alcatel | Verfahren und Anordnung zur Durchführung einer an die Übertragungsfunktion menschilcher Sinnesorgane angepassten Fourier Transformation sowie darauf basierende Vorrichtungen zur Geräuschreduktion und Spracherkennung |
US7349841B2 (en) | 2001-03-28 | 2008-03-25 | Mitsubishi Denki Kabushiki Kaisha | Noise suppression device including subband-based signal-to-noise ratio |
SE0101175D0 (sv) | 2001-04-02 | 2001-04-02 | Coding Technologies Sweden Ab | Aliasing reduction using complex-exponential-modulated filterbanks |
ATE410901T1 (de) | 2001-04-18 | 2008-10-15 | Widex As | Richtungssteuerung und verfahren zur steuerung eines hörgeräts |
US20020160751A1 (en) | 2001-04-26 | 2002-10-31 | Yingju Sun | Mobile devices with integrated voice recording mechanism |
US8934382B2 (en) | 2001-05-10 | 2015-01-13 | Polycom, Inc. | Conference endpoint controlling functions of a remote device |
US8452023B2 (en) | 2007-05-25 | 2013-05-28 | Aliphcom | Wind suppression/replacement component for use with electronic systems |
US6493668B1 (en) | 2001-06-15 | 2002-12-10 | Yigal Brandman | Speech feature extraction system |
AUPR647501A0 (en) | 2001-07-19 | 2001-08-09 | Vast Audio Pty Ltd | Recording a three dimensional auditory scene and reproducing it for the individual listener |
GB0121206D0 (en) | 2001-08-31 | 2001-10-24 | Mitel Knowledge Corp | System and method of indicating and controlling sound pickup direction and location in a teleconferencing system |
GB0121308D0 (en) | 2001-09-03 | 2001-10-24 | Thomas Swan & Company Ltd | Optical processing |
US7574474B2 (en) | 2001-09-14 | 2009-08-11 | Xerox Corporation | System and method for sharing and controlling multiple audio and video streams |
US6895375B2 (en) * | 2001-10-04 | 2005-05-17 | At&T Corp. | System for bandwidth extension of Narrow-band speech |
US6707921B2 (en) | 2001-11-26 | 2004-03-16 | Hewlett-Packard Development Company, Lp. | Use of mouth position and mouth movement to filter noise from speech in a hearing aid |
EP1449305B1 (en) | 2001-11-30 | 2006-04-05 | Telefonaktiebolaget LM Ericsson (publ) | Method for replacing corrupted audio data |
US7096037B2 (en) | 2002-01-29 | 2006-08-22 | Palm, Inc. | Videoconferencing bandwidth management for a handheld computer system and method |
US8098844B2 (en) | 2002-02-05 | 2012-01-17 | Mh Acoustics, Llc | Dual-microphone spatial noise suppression |
US7171008B2 (en) | 2002-02-05 | 2007-01-30 | Mh Acoustics, Llc | Reducing noise in audio systems |
US20050228518A1 (en) | 2002-02-13 | 2005-10-13 | Applied Neurosystems Corporation | Filter set for frequency analysis |
US7158572B2 (en) | 2002-02-14 | 2007-01-02 | Tellabs Operations, Inc. | Audio enhancement communication techniques |
JP4195267B2 (ja) | 2002-03-14 | 2008-12-10 | インターナショナル・ビジネス・マシーンズ・コーポレーション | Speech recognition apparatus, speech recognition method thereof, and program |
US6978010B1 (en) | 2002-03-21 | 2005-12-20 | Bellsouth Intellectual Property Corp. | Ambient noise cancellation for voice communication device |
WO2003084103A1 (en) | 2002-03-22 | 2003-10-09 | Georgia Tech Research Corporation | Analog audio enhancement system using a noise suppression algorithm |
US7447631B2 (en) | 2002-06-17 | 2008-11-04 | Dolby Laboratories Licensing Corporation | Audio coding system using spectral hole filling |
US20030228019A1 (en) | 2002-06-11 | 2003-12-11 | Elbit Systems Ltd. | Method and system for reducing noise |
JP2004023481A (ja) | 2002-06-17 | 2004-01-22 | Alpine Electronics Inc | Acoustic signal processing apparatus and method, and audio apparatus |
US7516066B2 (en) | 2002-07-16 | 2009-04-07 | Koninklijke Philips Electronics N.V. | Audio coding |
JP3579047B2 (ja) | 2002-07-19 | 2004-10-20 | 日本電気株式会社 | Audio decoding apparatus, decoding method, and program |
JP4227772B2 (ja) | 2002-07-19 | 2009-02-18 | 日本電気株式会社 | Audio decoding apparatus, decoding method, and program |
US7783061B2 (en) | 2003-08-27 | 2010-08-24 | Sony Computer Entertainment Inc. | Methods and apparatus for the targeted sound detection |
US8019121B2 (en) | 2002-07-27 | 2011-09-13 | Sony Computer Entertainment Inc. | Method and system for processing intensity from input devices for interfacing with a computer program |
US7760248B2 (en) | 2002-07-27 | 2010-07-20 | Sony Computer Entertainment Inc. | Selective sound source listening in conjunction with computer interactive processing |
US7283956B2 (en) | 2002-09-18 | 2007-10-16 | Motorola, Inc. | Noise suppression |
US7657427B2 (en) | 2002-10-11 | 2010-02-02 | Nokia Corporation | Methods and devices for source controlled variable bit-rate wideband speech coding |
US7630409B2 (en) | 2002-10-21 | 2009-12-08 | Lsi Corporation | Method and apparatus for improved play-out packet control algorithm |
US20040083110A1 (en) | 2002-10-23 | 2004-04-29 | Nokia Corporation | Packet loss recovery based on music signal classification and mixing |
US7970606B2 (en) | 2002-11-13 | 2011-06-28 | Digital Voice Systems, Inc. | Interoperable vocoder |
WO2004064041A1 (en) | 2003-01-09 | 2004-07-29 | Dilithium Networks Pty Limited | Method and apparatus for improved quality voice transcoding |
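JP4247002B2 (ja) | 2003-01-22 | 2009-04-02 | 富士通株式会社 | Speaker distance detection apparatus and method using a microphone array, and voice input/output apparatus using the same |
KR100503479B1 (ko) | 2003-01-24 | 2005-07-28 | 삼성전자주식회사 | Cradle for a portable terminal and method for fixing a portable terminal using the same |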
EP1443498B1 (en) | 2003-01-24 | 2008-03-19 | Sony Ericsson Mobile Communications AB | Noise reduction and audio-visual speech activity detection |
DE10305820B4 (de) | 2003-02-12 | 2006-06-01 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for determining a playback position |
US7885420B2 (en) | 2003-02-21 | 2011-02-08 | Qnx Software Systems Co. | Wind noise suppression system |
US7725315B2 (en) | 2003-02-21 | 2010-05-25 | Qnx Software Systems (Wavemakers), Inc. | Minimization of transient noises in a voice signal |
GB2398913B (en) | 2003-02-27 | 2005-08-17 | Motorola Inc | Noise estimation in speech recognition |
FR2851879A1 (fr) | 2003-02-27 | 2004-09-03 | France Telecom | Method for processing compressed sound data for spatialization |
US7090431B2 (en) | 2003-03-19 | 2006-08-15 | Cosgrove Patrick J | Marine vessel lifting system with variable level detection |
US8412526B2 (en) | 2003-04-01 | 2013-04-02 | Nuance Communications, Inc. | Restoration of high-order Mel frequency cepstral coefficients |
NO318096B1 (no) | 2003-05-08 | 2005-01-31 | Tandberg Telecom As | Arrangement and method for localizing a sound source |
US7353169B1 (en) | 2003-06-24 | 2008-04-01 | Creative Technology Ltd. | Transient detection and modification in audio signals |
US7376553B2 (en) | 2003-07-08 | 2008-05-20 | Robert Patel Quinn | Fractal harmonic overtone mapping of speech and musical sounds |
EP1513137A1 (en) | 2003-08-22 | 2005-03-09 | MicronasNIT LCC, Novi Sad Institute of Information Technologies | Speech processing system and method with multi-pulse excitation |
WO2005027094A1 (fr) | 2003-09-17 | 2005-03-24 | Beijing E-World Technology Co.,Ltd. | Method and device for multi-resolution vector quantization for audio encoding and decoding |
US7190775B2 (en) | 2003-10-29 | 2007-03-13 | Broadcom Corporation | High quality audio conferencing with adaptive beamforming |
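DE602004021716D1 (de) | 2003-11-12 | 2009-08-06 | Honda Motor Co Ltd | Speech recognition system |
JP4396233B2 (ja) | 2003-11-13 | 2010-01-13 | パナソニック株式会社 | Signal analysis method and signal synthesis method for a complex exponential modulated filter bank, program therefor, and recording medium therefor |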
GB2408655B (en) | 2003-11-27 | 2007-02-28 | Motorola Inc | Communication system, communication units and method of ambience listening thereto |
CA2454296A1 (en) | 2003-12-29 | 2005-06-29 | Nokia Corporation | Method and device for speech enhancement in the presence of background noise |
JP2005249816A (ja) | 2004-03-01 | 2005-09-15 | Internatl Business Mach Corp <Ibm> | Signal enhancement apparatus, method, and program, and speech recognition apparatus, method, and program |
WO2005086138A1 (ja) | 2004-03-05 | 2005-09-15 | Matsushita Electric Industrial Co., Ltd. | Error concealment apparatus and error concealment method |
GB0408856D0 (en) | 2004-04-21 | 2004-05-26 | Nokia Corp | Signal encoding |
JP4437052B2 (ja) | 2004-04-21 | 2010-03-24 | パナソニック株式会社 | Speech decoding apparatus and speech decoding method |
US20050249292A1 (en) | 2004-05-07 | 2005-11-10 | Ping Zhu | System and method for enhancing the performance of variable length coding |
US7103176B2 (en) | 2004-05-13 | 2006-09-05 | International Business Machines Corporation | Direct coupling of telephone volume control with remote microphone gain and noise cancellation |
GB2414369B (en) | 2004-05-21 | 2007-08-01 | Hewlett Packard Development Co | Processing audio data |
US7695438B2 (en) | 2004-05-26 | 2010-04-13 | Siemens Medical Solutions Usa, Inc. | Acoustic disruption minimizing systems and methods |
EP1600947A3 (en) | 2004-05-26 | 2005-12-21 | Honda Research Institute Europe GmbH | Subtractive cancellation of harmonic noise |
US7254665B2 (en) | 2004-06-16 | 2007-08-07 | Microsoft Corporation | Method and system for reducing latency in transferring captured image data by utilizing burst transfer after threshold is reached |
US20060063560A1 (en) | 2004-09-21 | 2006-03-23 | Samsung Electronics Co., Ltd. | Dual-mode phone using GPS power-saving assist for operating in cellular and WiFi networks |
US7383179B2 (en) | 2004-09-28 | 2008-06-03 | Clarity Technologies, Inc. | Method of cascading noise reduction algorithms to avoid speech distortion |
US20060092918A1 (en) | 2004-11-04 | 2006-05-04 | Alexander Talalai | Audio receiver having adaptive buffer delay |
EP1815462A1 (en) | 2004-11-09 | 2007-08-08 | Koninklijke Philips Electronics N.V. | Audio coding and decoding |
JP4283212B2 (ja) | 2004-12-10 | 2009-06-24 | インターナショナル・ビジネス・マシーンズ・コーポレーション | Noise removal apparatus, noise removal program, and noise removal method |
US20060206320A1 (en) | 2005-03-14 | 2006-09-14 | Li Qi P | Apparatus and method for noise reduction and speech enhancement with microphones and loudspeakers |
RU2376657C2 (ru) | 2005-04-01 | 2009-12-20 | Квэлкомм Инкорпорейтед | Systems, methods, and apparatus for highband time warping |
US7664495B1 (en) | 2005-04-21 | 2010-02-16 | At&T Mobility Ii Llc | Voice call redirection for enterprise hosted dual mode service |
CA2574468C (en) | 2005-04-28 | 2014-01-14 | Siemens Aktiengesellschaft | Noise suppression process and device |
EP1878013B1 (en) | 2005-05-05 | 2010-12-15 | Sony Computer Entertainment Inc. | Video game control with joystick |
EP1914727B1 (en) | 2005-05-17 | 2009-08-12 | Yamaha Corporation | Noise suppression methods and apparatuses |
US7531973B2 (en) | 2005-05-31 | 2009-05-12 | Rockwell Automation Technologies, Inc. | Wizard for configuring a motor drive system |
US7647077B2 (en) | 2005-05-31 | 2010-01-12 | Bitwave Pte Ltd | Method for echo control of a wireless headset |
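JP2006339991A (ja) | 2005-06-01 | 2006-12-14 | Matsushita Electric Ind Co Ltd | Multichannel sound pickup apparatus, multichannel sound reproduction apparatus, and multichannel sound pickup and reproduction apparatus |
JP4910312B2 (ja) | 2005-06-03 | 2012-04-04 | ソニー株式会社 | Imaging apparatus and imaging method |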
US8311840B2 (en) * | 2005-06-28 | 2012-11-13 | Qnx Software Systems Limited | Frequency extension of harmonic signals |
US8566086B2 (en) | 2005-06-28 | 2013-10-22 | Qnx Software Systems Limited | System for adaptive enhancement of speech signals |
US20070003097A1 (en) | 2005-06-30 | 2007-01-04 | Altec Lansing Technologies, Inc. | Angularly adjustable speaker system |
US20070005351A1 (en) | 2005-06-30 | 2007-01-04 | Sathyendra Harsha M | Method and system for bandwidth expansion for voice communications |
ATE497327T1 (de) | 2005-07-06 | 2011-02-15 | Koninkl Philips Electronics Nv | Apparatus and method for acoustic beamforming |
US7617436B2 (en) | 2005-08-02 | 2009-11-10 | Nokia Corporation | Method, device, and system for forward channel error recovery in video sequence transmission over packet-based network |
KR101116363B1 (ko) | 2005-08-11 | 2012-03-09 | 삼성전자주식회사 | Method and apparatus for classifying speech signals, and method and apparatus for encoding speech signals using the same |
US20070041589A1 (en) | 2005-08-17 | 2007-02-22 | Gennum Corporation | System and method for providing environmental specific noise reduction algorithms |
US8326614B2 (en) | 2005-09-02 | 2012-12-04 | Qnx Software Systems Limited | Speech enhancement system |
JP4356670B2 (ja) | 2005-09-12 | 2009-11-04 | ソニー株式会社 | Noise reduction apparatus, noise reduction method, noise reduction program, and sound pickup apparatus for electronic equipment therefor |
US7917561B2 (en) | 2005-09-16 | 2011-03-29 | Coding Technologies Ab | Partially complex modulated filter bank |
US20100130198A1 (en) | 2005-09-29 | 2010-05-27 | Plantronics, Inc. | Remote processing of multiple acoustic signals |
EA011601B1 (ru) | 2005-09-30 | 2009-04-28 | Скуэрхэд Текнолоджи Ас | Method and system for directional capture of an audio signal |
US7813923B2 (en) | 2005-10-14 | 2010-10-12 | Microsoft Corporation | Calibration based beamforming, non-linear adaptive filtering, and multi-sensor headset |
US7970123B2 (en) | 2005-10-20 | 2011-06-28 | Mitel Networks Corporation | Adaptive coupling equalization in beamforming-based communication systems |
US7562140B2 (en) | 2005-11-15 | 2009-07-14 | Cisco Technology, Inc. | Method and apparatus for providing trend information from network devices |
US20070127668A1 (en) | 2005-12-02 | 2007-06-07 | Ahya Deepak P | Method and system for performing a conference call |
US7366658B2 (en) | 2005-12-09 | 2008-04-29 | Texas Instruments Incorporated | Noise pre-processor for enhanced variable rate speech codec |
ATE449403T1 (de) | 2005-12-12 | 2009-12-15 | Gregory John Gadbois | Multi-voice speech recognition |
US7565288B2 (en) | 2005-12-22 | 2009-07-21 | Microsoft Corporation | Spatial noise suppression for a microphone array |
JP4876574B2 (ja) | 2005-12-26 | 2012-02-15 | ソニー株式会社 | Signal encoding apparatus and method, signal decoding apparatus and method, and program and recording medium |
US8345890B2 (en) | 2006-01-05 | 2013-01-01 | Audience, Inc. | System and method for utilizing inter-microphone level differences for speech enhancement |
US8346544B2 (en) | 2006-01-20 | 2013-01-01 | Qualcomm Incorporated | Selection of encoding modes and/or encoding rates for speech compression with closed loop re-decision |
US8032369B2 (en) | 2006-01-20 | 2011-10-04 | Qualcomm Incorporated | Arbitrary average data rates for variable rate coders |
JP4940671B2 (ja) | 2006-01-26 | 2012-05-30 | ソニー株式会社 | Audio signal processing apparatus, audio signal processing method, and audio signal processing program |
US8744844B2 (en) | 2007-07-06 | 2014-06-03 | Audience, Inc. | System and method for adaptive intelligent noise suppression |
US8194880B2 (en) | 2006-01-30 | 2012-06-05 | Audience, Inc. | System and method for utilizing omni-directional microphones for speech enhancement |
US9185487B2 (en) | 2006-01-30 | 2015-11-10 | Audience, Inc. | System and method for providing noise suppression utilizing null processing noise subtraction |
US7685132B2 (en) | 2006-03-15 | 2010-03-23 | Mog, Inc | Automatic meta-data sharing of existing media through social networking |
US7676374B2 (en) | 2006-03-28 | 2010-03-09 | Nokia Corporation | Low complexity subband-domain filtering in the case of cascaded filter banks |
US7555075B2 (en) | 2006-04-07 | 2009-06-30 | Freescale Semiconductor, Inc. | Adjustable noise suppression system |
US8180067B2 (en) | 2006-04-28 | 2012-05-15 | Harman International Industries, Incorporated | System for selectively extracting components of an audio input signal |
US8068619B2 (en) | 2006-05-09 | 2011-11-29 | Fortemedia, Inc. | Method and apparatus for noise suppression in a small array microphone system |
US7548791B1 (en) | 2006-05-18 | 2009-06-16 | Adobe Systems Incorporated | Graphically displaying audio pan or phase information |
US8044291B2 (en) | 2006-05-18 | 2011-10-25 | Adobe Systems Incorporated | Selection of visually displayed audio data for editing |
US8204253B1 (en) | 2008-06-30 | 2012-06-19 | Audience, Inc. | Self calibration of audio device |
US8934641B2 (en) | 2006-05-25 | 2015-01-13 | Audience, Inc. | Systems and methods for reconstructing decomposed audio signals |
US8150065B2 (en) | 2006-05-25 | 2012-04-03 | Audience, Inc. | System and method for processing an audio signal |
US8229137B2 (en) | 2006-08-31 | 2012-07-24 | Sony Ericsson Mobile Communications Ab | Volume control circuits for use in electronic devices and related methods and electronic devices |
US8036767B2 (en) | 2006-09-20 | 2011-10-11 | Harman International Industries, Incorporated | System for extracting and changing the reverberant content of an audio input signal |
EP1918910B1 (en) | 2006-10-31 | 2009-03-11 | Harman Becker Automotive Systems GmbH | Model-based enhancement of speech signals |
US7492312B2 (en) | 2006-11-14 | 2009-02-17 | Fam Adly T | Multiplicative mismatched filters for optimum range sidelobe suppression in barker code reception |
US8019089B2 (en) | 2006-11-20 | 2011-09-13 | Microsoft Corporation | Removal of noise, corresponding to user input devices from an audio signal |
US7626942B2 (en) | 2006-11-22 | 2009-12-01 | Spectra Link Corp. | Method of conducting an audio communications session using incorrect timestamps |
US7983685B2 (en) | 2006-12-07 | 2011-07-19 | Innovative Wireless Technologies, Inc. | Method and apparatus for management of a global wireless sensor network |
US20080159507A1 (en) | 2006-12-27 | 2008-07-03 | Nokia Corporation | Distributed teleconference multichannel architecture, system, method, and computer program product |
US7973857B2 (en) | 2006-12-27 | 2011-07-05 | Nokia Corporation | Teleconference group formation using context information |
WO2008082441A1 (en) | 2006-12-29 | 2008-07-10 | Prodea Systems, Inc. | Display inserts, overlays, and graphical user interfaces for multimedia systems |
GB2445984B (en) | 2007-01-25 | 2011-12-07 | Sonaptic Ltd | Ambient noise reduction |
US20080187143A1 (en) | 2007-02-01 | 2008-08-07 | Research In Motion Limited | System and method for providing simulated spatial sound in group voice communication sessions on a wireless communication device |
US8060363B2 (en) | 2007-02-13 | 2011-11-15 | Nokia Corporation | Audio signal encoding |
JP4449987B2 (ja) | 2007-02-15 | 2010-04-14 | ソニー株式会社 | Audio processing apparatus, audio processing method, and program |
BRPI0807703B1 (pt) | 2007-02-26 | 2020-09-24 | Dolby Laboratories Licensing Corporation | Method for enhancing speech in entertainment audio, and non-transitory computer-readable storage medium |
US20080208575A1 (en) | 2007-02-27 | 2008-08-28 | Nokia Corporation | Split-band encoding and decoding of an audio signal |
US7848738B2 (en) | 2007-03-19 | 2010-12-07 | Avaya Inc. | Teleconferencing system with multiple channels at each location |
US20080259731A1 (en) | 2007-04-17 | 2008-10-23 | Happonen Aki P | Methods and apparatuses for user controlled beamforming |
EP2162881B1 (en) | 2007-05-22 | 2013-01-23 | Telefonaktiebolaget LM Ericsson (publ) | Voice activity detection with improved music detection |
TWI421858B (zh) | 2007-05-24 | 2014-01-01 | Audience Inc | System and method for processing an audio signal |
US8488803B2 (en) | 2007-05-25 | 2013-07-16 | Aliphcom | Wind suppression/replacement component for use with electronic systems |
US8253770B2 (en) | 2007-05-31 | 2012-08-28 | Eastman Kodak Company | Residential video communication system |
US20080304677A1 (en) | 2007-06-08 | 2008-12-11 | Sonitus Medical Inc. | System and method for noise cancellation with motion tracking capability |
JP4455614B2 (ja) | 2007-06-13 | 2010-04-21 | 株式会社東芝 | Acoustic signal processing method and apparatus |
US8428275B2 (en) | 2007-06-22 | 2013-04-23 | Sanyo Electric Co., Ltd. | Wind noise reduction device |
US7873513B2 (en) | 2007-07-06 | 2011-01-18 | Mindspeed Technologies, Inc. | Speech transcoding in GSM networks |
JP5009082B2 (ja) | 2007-08-02 | 2012-08-22 | シャープ株式会社 | Display device |
WO2009020001A1 (ja) | 2007-08-07 | 2009-02-12 | Nec Corporation | Voice mixing device, noise suppression method therefor, and program |
US20090043577A1 (en) | 2007-08-10 | 2009-02-12 | Ditech Networks, Inc. | Signal presence detection using bi-directional communication data |
JP4469882B2 (ja) | 2007-08-16 | 2010-06-02 | 株式会社東芝 | Acoustic signal processing method and apparatus |
DE602007004217D1 (de) | 2007-08-31 | 2010-02-25 | Harman Becker Automotive Sys | Fast estimation of the spectral density of noise power for speech signal enhancement |
US7986228B2 (en) | 2007-09-05 | 2011-07-26 | Stanley Convergent Security Solutions, Inc. | System and method for monitoring security at a premises using line card |
KR101409169B1 (ko) | 2007-09-05 | 2014-06-19 | 삼성전자주식회사 | Sound zoom method and apparatus by controlling suppression width |
US8694310B2 (en) | 2007-09-17 | 2014-04-08 | Qnx Software Systems Limited | Remote control server protocol system |
US7522074B2 (en) | 2007-09-17 | 2009-04-21 | Samplify Systems, Inc. | Enhanced control for compression and decompression of sampled signals |
US8175871B2 (en) | 2007-09-28 | 2012-05-08 | Qualcomm Incorporated | Apparatus and method of noise and echo reduction in multiple microphone audio systems |
DE602007008429D1 (de) | 2007-10-01 | 2010-09-23 | Harman Becker Automotive Sys | Efficient audio signal processing in the sub-band domain, method, apparatus, and associated computer program |
US8046219B2 (en) | 2007-10-18 | 2011-10-25 | Motorola Mobility, Inc. | Robust two microphone noise suppression system |
US8326617B2 (en) | 2007-10-24 | 2012-12-04 | Qnx Software Systems Limited | Speech enhancement with minimum gating |
US8606566B2 (en) | 2007-10-24 | 2013-12-10 | Qnx Software Systems Limited | Speech enhancement through partial speech reconstruction |
DE602007004504D1 (de) | 2007-10-29 | 2010-03-11 | Harman Becker Automotive Sys | Partial speech reconstruction |
TW200922272A (en) | 2007-11-06 | 2009-05-16 | High Tech Comp Corp | Automobile noise suppression system and method thereof |
US8358787B2 (en) | 2007-11-07 | 2013-01-22 | Apple Inc. | Method and apparatus for acoustics testing of a personal mobile device |
DE602007014382D1 (de) | 2007-11-12 | 2011-06-16 | Harman Becker Automotive Sys | Discrimination between foreground speech and background noise |
KR101238362B1 (ko) | 2007-12-03 | 2013-02-28 | 삼성전자주식회사 | Method and apparatus for filtering a sound source signal according to sound source distance |
JP5159279B2 (ja) | 2007-12-03 | 2013-03-06 | 株式会社東芝 | Speech processing apparatus and speech synthesis apparatus using the same |
US8219387B2 (en) | 2007-12-10 | 2012-07-10 | Microsoft Corporation | Identifying far-end sound |
US8433061B2 (en) | 2007-12-10 | 2013-04-30 | Microsoft Corporation | Reducing echo |
US8175291B2 (en) | 2007-12-19 | 2012-05-08 | Qualcomm Incorporated | Systems, methods, and apparatus for multi-microphone based speech enhancement |
EP2232704A4 (en) | 2007-12-20 | 2010-12-01 | Ericsson Telefon Ab L M | Apparatus and method for noise suppression |
KR101456570B1 (ko) | 2007-12-21 | 2014-10-31 | 엘지전자 주식회사 | Mobile terminal having a digital equalizer and control method thereof |
US8326635B2 (en) | 2007-12-25 | 2012-12-04 | Personics Holdings Inc. | Method and system for message alert and delivery using an earpiece |
DE102008031150B3 (de) | 2008-07-01 | 2009-11-19 | Siemens Medical Instruments Pte. Ltd. | Method for noise suppression and associated hearing aid |
US8554551B2 (en) | 2008-01-28 | 2013-10-08 | Qualcomm Incorporated | Systems, methods, and apparatus for context replacement by audio level |
US8200479B2 (en) | 2008-02-08 | 2012-06-12 | Texas Instruments Incorporated | Method and system for asymmetric independent audio rendering |
US8194882B2 (en) | 2008-02-29 | 2012-06-05 | Audience, Inc. | System and method for providing single microphone noise suppression fallback |
WO2009109373A2 (en) | 2008-03-04 | 2009-09-11 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus for mixing a plurality of input data streams |
US20090323655A1 (en) | 2008-03-31 | 2009-12-31 | Cozybit, Inc. | System and method for inviting and sharing conversations between cellphones |
US8457328B2 (en) | 2008-04-22 | 2013-06-04 | Nokia Corporation | Method, apparatus and computer program product for utilizing spatial information for audio signal enhancement in a distributed network environment |
US8611554B2 (en) | 2008-04-22 | 2013-12-17 | Bose Corporation | Hearing assistance apparatus |
US8369973B2 (en) | 2008-06-19 | 2013-02-05 | Texas Instruments Incorporated | Efficient asynchronous sample rate conversion |
US8300801B2 (en) | 2008-06-26 | 2012-10-30 | Centurylink Intellectual Property Llc | System and method for telephone based noise cancellation |
US8189807B2 (en) | 2008-06-27 | 2012-05-29 | Microsoft Corporation | Satellite microphone array for video conferencing |
CN101304391A (zh) | 2008-06-30 | 2008-11-12 | 腾讯科技(深圳)有限公司 | Voice call method and system based on an instant messaging system |
US8774423B1 (en) | 2008-06-30 | 2014-07-08 | Audience, Inc. | System and method for controlling adaptivity of signal modification using a phantom coefficient |
KR20100003530A (ko) | 2008-07-01 | 2010-01-11 | 삼성전자주식회사 | Apparatus and method for removing noise from a speech signal in an electronic device |
BRPI0910792B1 (pt) | 2008-07-11 | 2020-03-24 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio signal synthesizer and audio signal encoder |
US8538749B2 (en) | 2008-07-18 | 2013-09-17 | Qualcomm Incorporated | Systems, methods, apparatus, and computer program products for enhanced intelligibility |
EP2151822B8 (en) | 2008-08-05 | 2018-10-24 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for processing an audio signal for speech enhancement using a feature extraction |
EP2151821B1 (en) | 2008-08-07 | 2011-12-14 | Nuance Communications, Inc. | Noise-reduction processing of speech signals |
US8189429B2 (en) | 2008-09-30 | 2012-05-29 | Apple Inc. | Microphone proximity detection |
JP5608660B2 (ja) | 2008-10-10 | 2014-10-15 | テレフオンアクチーボラゲット エル エム エリクソン(パブル) | Energy-conserving multi-channel audio coding |
US8130978B2 (en) | 2008-10-15 | 2012-03-06 | Microsoft Corporation | Dynamic switching of microphone inputs for identification of a direction of a source of speech sounds |
US9779598B2 (en) | 2008-11-21 | 2017-10-03 | Robert Bosch Gmbh | Security system including less than lethal deterrent |
US8467891B2 (en) | 2009-01-21 | 2013-06-18 | Utc Fire & Security Americas Corporation, Inc. | Method and system for efficient optimization of audio sampling rate conversion |
EP2394270A1 (en) | 2009-02-03 | 2011-12-14 | University Of Ottawa | Method and system for a multi-microphone noise reduction |
EP2222091B1 (en) | 2009-02-23 | 2013-04-24 | Nuance Communications, Inc. | Method for determining a set of filter coefficients for an acoustic echo compensation means |
US8184180B2 (en) | 2009-03-25 | 2012-05-22 | Broadcom Corporation | Spatially synchronized audio and video capture |
EP2237271B1 (en) | 2009-03-31 | 2021-01-20 | Cerence Operating Company | Method for determining a signal component for reducing noise in an input signal |
JP5535198B2 (ja) | 2009-04-02 | 2014-07-02 | 三菱電機株式会社 | Noise suppression device |
US9202456B2 (en) | 2009-04-23 | 2015-12-01 | Qualcomm Incorporated | Systems, methods, apparatus, and computer-readable media for automatic control of active noise cancellation |
US8416715B2 (en) | 2009-06-15 | 2013-04-09 | Microsoft Corporation | Interest determination for auditory enhancement |
US8908882B2 (en) | 2009-06-29 | 2014-12-09 | Audience, Inc. | Reparation of corrupted audio signals |
US8626344B2 (en) | 2009-08-21 | 2014-01-07 | Allure Energy, Inc. | Energy management system and method |
EP2285112A1 (en) | 2009-08-07 | 2011-02-16 | Canon Kabushiki Kaisha | Method for sending compressed data representing a digital image and corresponding device |
US8644517B2 (en) | 2009-08-17 | 2014-02-04 | Broadcom Corporation | System and method for automatic disabling and enabling of an acoustic beamformer |
US8233352B2 (en) | 2009-08-17 | 2012-07-31 | Broadcom Corporation | Audio source localization system and method |
JP5397131B2 (ja) | 2009-09-29 | 2014-01-22 | 沖電気工業株式会社 | Sound source direction estimation apparatus and program |
US8571231B2 (en) | 2009-10-01 | 2013-10-29 | Qualcomm Incorporated | Suppressing noise in an audio signal |
US9372251B2 (en) | 2009-10-05 | 2016-06-21 | Harman International Industries, Incorporated | System for spatial extraction of audio signals |
CN102044243B (zh) | 2009-10-15 | 2012-08-29 | 华为技术有限公司 | Voice activity detection method and apparatus, and encoder |
WO2011049516A1 (en) | 2009-10-19 | 2011-04-28 | Telefonaktiebolaget Lm Ericsson (Publ) | Detector and method for voice activity detection |
US20110107367A1 (en) | 2009-10-30 | 2011-05-05 | Sony Corporation | System and method for broadcasting personal content to client devices in an electronic network |
CN102630385B (zh) | 2009-11-30 | 2015-05-27 | 诺基亚公司 | Method, apparatus, and system for audio zooming processing within an audio scene |
US8615392B1 (en) | 2009-12-02 | 2013-12-24 | Audience, Inc. | Systems and methods for producing an acoustic field having a target spatial pattern |
US9210503B2 (en) | 2009-12-02 | 2015-12-08 | Audience, Inc. | Audio zoom |
US9838784B2 (en) | 2009-12-02 | 2017-12-05 | Knowles Electronics, Llc | Directional audio capture |
DE112010005020B4 (de) | 2009-12-28 | 2018-12-13 | Mitsubishi Electric Corporation | Speech signal restoration device and speech signal restoration method |
US8718290B2 (en) | 2010-01-26 | 2014-05-06 | Audience, Inc. | Adaptive noise reduction using level cues |
US8626498B2 (en) | 2010-02-24 | 2014-01-07 | Qualcomm Incorporated | Voice activity detection based on plural voice activity detectors |
WO2011129725A1 (en) | 2010-04-12 | 2011-10-20 | Telefonaktiebolaget L M Ericsson (Publ) | Method and arrangement for noise cancellation in a speech encoder |
US8473287B2 (en) | 2010-04-19 | 2013-06-25 | Audience, Inc. | Method for jointly optimizing noise reduction and voice quality in a mono or multi-microphone system |
US8798290B1 (en) | 2010-04-21 | 2014-08-05 | Audience, Inc. | Systems and methods for adaptive signal equalization |
US8880396B1 (en) | 2010-04-28 | 2014-11-04 | Audience, Inc. | Spectrum reconstruction for automatic speech recognition |
US9558755B1 (en) * | 2010-05-20 | 2017-01-31 | Knowles Electronics, Llc | Noise suppression assisted automatic speech recognition |
US8639516B2 (en) | 2010-06-04 | 2014-01-28 | Apple Inc. | User-specific noise suppression for voice quality improvements |
US9094496B2 (en) | 2010-06-18 | 2015-07-28 | Avaya Inc. | System and method for stereophonic acoustic echo cancellation |
KR101285391B1 (ko) | 2010-07-28 | 2013-07-10 | 주식회사 팬택 | Apparatus and method for merging acoustic object information |
US9071831B2 (en) | 2010-08-27 | 2015-06-30 | Broadcom Corporation | Method and system for noise cancellation and audio enhancement based on captured depth information |
US9274744B2 (en) | 2010-09-10 | 2016-03-01 | Amazon Technologies, Inc. | Relative position-inclusive device interfaces |
US8311817B2 (en) | 2010-11-04 | 2012-11-13 | Audience, Inc. | Systems and methods for enhancing voice quality in mobile device |
US8831937B2 (en) | 2010-11-12 | 2014-09-09 | Audience, Inc. | Post-noise suppression processing to improve voice quality |
US8451315B2 (en) | 2010-11-30 | 2013-05-28 | Hewlett-Packard Development Company, L.P. | System and method for distributed meeting capture |
WO2012094422A2 (en) | 2011-01-05 | 2012-07-12 | Health Fidelity, Inc. | A voice based system and method for data input |
US8525868B2 (en) | 2011-01-13 | 2013-09-03 | Qualcomm Incorporated | Variable beamforming with a mobile platform |
US20120202485A1 (en) | 2011-02-04 | 2012-08-09 | Takwak GmBh | Systems and methods for audio roaming for mobile devices |
US8606249B1 (en) | 2011-03-07 | 2013-12-10 | Audience, Inc. | Methods and systems for enhancing audio quality during teleconferencing |
US9007416B1 (en) | 2011-03-08 | 2015-04-14 | Audience, Inc. | Local social conference calling |
JP5060631B1 (ja) | 2011-03-31 | 2012-10-31 | 株式会社東芝 | Signal processing apparatus and signal processing method |
US8811601B2 (en) | 2011-04-04 | 2014-08-19 | Qualcomm Incorporated | Integrated echo cancellation and noise suppression |
US8989411B2 (en) | 2011-04-08 | 2015-03-24 | Board Of Regents, The University Of Texas System | Differential microphone with sealed backside cavities and diaphragms coupled to a rocking structure thereby providing resistance to deflection under atmospheric pressure and providing a directional response to sound pressure |
US8363823B1 (en) | 2011-08-08 | 2013-01-29 | Audience, Inc. | Two microphone uplink communication and stereo audio playback on three wire headset assembly |
US9386147B2 (en) | 2011-08-25 | 2016-07-05 | Verizon Patent And Licensing Inc. | Muting and un-muting user devices |
US8750526B1 (en) | 2012-01-04 | 2014-06-10 | Audience, Inc. | Dynamic bandwidth change detection for configuring audio processor |
US9197974B1 (en) | 2012-01-06 | 2015-11-24 | Audience, Inc. | Directional audio capture adaptation based on alternative sensory input |
US8615394B1 (en) | 2012-01-27 | 2013-12-24 | Audience, Inc. | Restoration of noise-reduced speech |
US9093076B2 (en) | 2012-04-30 | 2015-07-28 | 2236008 Ontario Inc. | Multipass ASR controlling multiple applications |
US9431012B2 (en) | 2012-04-30 | 2016-08-30 | 2236008 Ontario Inc. | Post processing of natural language automatic speech recognition |
US9479275B2 (en) | 2012-06-01 | 2016-10-25 | Blackberry Limited | Multiformat digital audio interface |
US20130332156A1 (en) | 2012-06-11 | 2013-12-12 | Apple Inc. | Sensor Fusion to Improve Speech/Audio Processing in a Mobile Device |
WO2013188562A2 (en) | 2012-06-12 | 2013-12-19 | Audience, Inc. | Bandwidth extension via constrained synthesis |
US20130343549A1 (en) | 2012-06-22 | 2013-12-26 | Verisilicon Holdings Co., Ltd. | Microphone arrays for generating stereo and surround channels, method of operation thereof and module incorporating the same |
EP2680615B1 (en) | 2012-06-25 | 2018-08-08 | LG Electronics Inc. | Mobile terminal and audio zooming method thereof |
US9119012B2 (en) | 2012-06-28 | 2015-08-25 | Broadcom Corporation | Loudspeaker beamforming for personal audio focal points |
WO2014012582A1 (en) | 2012-07-18 | 2014-01-23 | Huawei Technologies Co., Ltd. | Portable electronic device with directional microphones for stereo recording |
EP2823631B1 (en) | 2012-07-18 | 2017-09-06 | Huawei Technologies Co., Ltd. | Portable electronic device with directional microphones for stereo recording |
US9984675B2 (en) | 2013-05-24 | 2018-05-29 | Google Technology Holdings LLC | Voice controlled audio recording system with adjustable beamforming |
KR101475894B1 (ko) * | 2013-06-21 | 2014-12-23 | 서울대학교산학협력단 | Method and apparatus for improving disordered speech |
US9536540B2 (en) | 2013-07-19 | 2017-01-03 | Knowles Electronics, Llc | Speech signal separation and synthesis based on auditory scene analysis and speech modeling |
CN106105259A (zh) | 2014-01-21 | 2016-11-09 | 美商楼氏电子有限公司 | Microphone apparatus and method for providing an extremely high acoustic overload point |
US9500739B2 (en) | 2014-03-28 | 2016-11-22 | Knowles Electronics, Llc | Estimating and tracking multiple attributes of multiple objects from multi-sensor data |
US20160037245A1 (en) | 2014-07-29 | 2016-02-04 | Knowles Electronics, Llc | Discrete MEMS Including Sensor Device |
WO2016040885A1 (en) | 2014-09-12 | 2016-03-17 | Audience, Inc. | Systems and methods for restoration of speech components |
US20160093307A1 (en) | 2014-09-25 | 2016-03-31 | Audience, Inc. | Latency Reduction |
US9368110B1 (en) * | 2015-07-07 | 2016-06-14 | Mitsubishi Electric Research Laboratories, Inc. | Method for distinguishing components of an acoustic signal |
2015
- 2015-09-11 WO PCT/US2015/049816 (WO2016040885A1, en), status: Application Filing
- 2015-09-11 US US14/852,446 (US9978388B2, en), status: Active
- 2015-09-11 DE DE112015004185.0T (DE112015004185T5, de), status: Withdrawn
- 2015-09-11 CN CN201580060446.6A (CN107112025A, zh), status: Pending
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
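CN1653520A (zh) * | 2002-05-20 | 2005-08-10 | 微软公司 | Method of determining uncertainty associated with noise reduction |
CN1910656A (zh) * | 2004-01-20 | 2007-02-07 | 杜比实验室特许公司 | Audio coding based on block grouping |
CN101512938A (zh) * | 2006-08-01 | 2009-08-19 | Dts(英属维尔京群岛)有限公司 | Neural network filtering techniques for compensating linear and non-linear distortion of an audio transducer |
CN102280108A (zh) * | 2010-06-10 | 2011-12-14 | 佳能株式会社 | Audio signal processing apparatus and control method thereof |
CN101976567A (zh) * | 2010-10-28 | 2011-02-16 | 吉林大学 | Speech signal error concealment method |
CN103430233A (zh) * | 2010-12-14 | 2013-12-04 | 弗兰霍菲尔运输应用研究公司 | Encoder and method for predictive encoding, decoder and method for decoding, system and method for predictive encoding and decoding, and predictively encoded information signal |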
Cited By (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
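CN112088385A (zh) * | 2018-04-23 | 2020-12-15 | 塞伦妮经营公司 | System and method for discriminative training of regression deep neural networks |
CN109147804A (zh) * | 2018-06-05 | 2019-01-04 | 安克创新科技股份有限公司 | Deep learning based method and system for processing sound quality characteristics |
CN109147805A (zh) * | 2018-06-05 | 2019-01-04 | 安克创新科技股份有限公司 | Deep learning based audio sound quality enhancement |
WO2019233364A1 (zh) * | 2018-06-05 | 2019-12-12 | 安克创新科技股份有限公司 | Deep learning based audio sound quality enhancement |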
US11790934B2 (en) | 2018-06-05 | 2023-10-17 | Anker Innovations Technology Co., Ltd. | Deep learning based method and system for processing sound quality characteristics |
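CN112820315A (zh) * | 2020-07-13 | 2021-05-18 | 腾讯科技(深圳)有限公司 | Audio signal processing method and apparatus, computer device, and storage medium |
WO2022012195A1 (zh) * | 2020-07-13 | 2022-01-20 | 腾讯科技(深圳)有限公司 | Audio signal processing method and related apparatus |
CN112820315B (zh) * | 2020-07-13 | 2023-01-06 | 腾讯科技(深圳)有限公司 | Audio signal processing method and apparatus, computer device, and storage medium |
CN112289343A (zh) * | 2020-10-28 | 2021-01-29 | 腾讯音乐娱乐科技(深圳)有限公司 | Audio restoration method and apparatus, electronic device, and computer-readable storage medium |
CN112289343B (zh) * | 2020-10-28 | 2024-03-19 | 腾讯音乐娱乐科技(深圳)有限公司 | Audio restoration method and apparatus, electronic device, and computer-readable storage medium |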
Also Published As
Publication number | Publication date |
---|---|
US20160078880A1 (en) | 2016-03-17 |
US9978388B2 (en) | 2018-05-22 |
DE112015004185T5 (de) | 2017-06-01 |
WO2016040885A1 (en) | 2016-03-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
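CN107112025A (zh) | Systems and methods for restoration of speech components | |
CN108520741B (zh) | Whispered speech restoration method, apparatus, device, and readable storage medium | |
CN110503970B (zh) | Audio data processing method, apparatus, and storage medium | |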
US20200082829A1 (en) | Training a dialog system using user feedback | |
US9953634B1 (en) | Passive training for automatic speech recognition | |
Latif et al. | Adversarial machine learning and speech emotion recognition: Utilizing generative adversarial networks for robustness | |
EP3992965A1 (en) | Voice signal processing method and speech separation method | |
CN112435684B (zh) | Speech separation method and apparatus, computer device, and storage medium | |
US20240087587A1 (en) | Wearable system speech processing | |
US20160034811A1 (en) | Efficient generation of complementary acoustic models for performing automatic speech recognition system combination | |
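WO2021093380A1 (zh) | Noise processing method, apparatus, and system | |
CN107507615A (zh) | Intelligent interface interaction control method, apparatus, system, and storage medium | |
CN102903362A (zh) | Integrated local and cloud-based speech recognition | |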
US20200251124A1 (en) | Method and terminal for reconstructing speech signal, and computer storage medium | |
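WO2021114847A1 (zh) | Network call method and apparatus, computer device, and storage medium | |
CN107240396B (zh) | Speaker adaptation method, apparatus, device, and storage medium | |
CN108877787A (zh) | Speech recognition method, apparatus, server, and storage medium | |
CN105794226A (zh) | Room impulse response estimation for acoustic echo cancellation | |
CN112492207B (zh) | Method and apparatus for controlling camera rotation based on sound source localization | |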
US10079028B2 (en) | Sound enhancement through reverberation matching | |
US11776563B2 (en) | Textual echo cancellation | |
US20180277134A1 (en) | Key Click Suppression | |
Liu et al. | Wavoice: A mmWave-assisted Noise-resistant Speech Recognition System | |
Chowdhury et al. | AdVerb: Visually Guided Audio Dereverberation | |
US11869478B2 (en) | Audio processing using sound source representations | |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication | Application publication date: 20170829 |