WO2013170802A1 - 一种提高移动终端通话音质的方法及装置 - Google Patents

一种提高移动终端通话音质的方法及装置 Download PDF

Info

Publication number
WO2013170802A1
WO2013170802A1 PCT/CN2013/077711 CN2013077711W WO2013170802A1 WO 2013170802 A1 WO2013170802 A1 WO 2013170802A1 CN 2013077711 W CN2013077711 W CN 2013077711W WO 2013170802 A1 WO2013170802 A1 WO 2013170802A1
Authority
WO
WIPO (PCT)
Prior art keywords
mobile terminal
position information
call
area
sound
Prior art date
Application number
PCT/CN2013/077711
Other languages
English (en)
French (fr)
Inventor
胡楠
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2013170802A1 publication Critical patent/WO2013170802A1/zh

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones
    • H04R3/005Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2499/00Aspects covered by H04R or H04S not otherwise provided for in their subgroups
    • H04R2499/10General applications
    • H04R2499/11Transducers incorporated or for use in hand-held devices, e.g. mobile phones, PDA's, camera's

Definitions

  • the present invention relates to the field of mobile communication technologies, and in particular, to a method and apparatus for improving the voice quality of a mobile terminal. Background technique
  • the single-microphone mobile terminal equipment uses the steady-state noise estimation method in terms of noise cancellation, and can only suppress the smooth noise, such as wind noise, but the dual-mike mobile terminal uses the spatial filtering method. You can focus your sound on one area to minimize noise and echo.
  • the technical problem to be solved by the embodiment of the present invention is a method and a device for improving the voice quality of a mobile terminal, which are used to solve the problem of the drop of the call quality caused by the change of the position of the human mouth in the prior art.
  • an embodiment of the present invention provides a method for improving a voice quality of a mobile terminal, where the method includes:
  • the acquiring the mouth position information of the user is: collecting the face position information by using a camera of the mobile terminal, and acquiring the mouth position information according to the face position information.
  • the method for determining the sound receiving area of the call is: determining whether the single face is a face that is preset to be tracked in the mobile terminal, if the camera collects a single face location information If yes, the sound collection area of the double microphone or the multi-microphone is obtained according to the position information of the human mouth corresponding to the single face; if not, all the sound collection areas collected by the single microphone are used as the sound collection area of the call.
  • the method for determining the sound receiving area of the call is: adjusting the overall sound receiving area to the sound collecting area of the call, or using all the sound collecting areas collected by the single microphone as a sound receiving area of the call; wherein the overall sound receiving area includes a sound receiving area corresponding to each of the plurality of human faces.
  • the human mouth position information includes a direction and a distance of the human mouth relative to the mobile terminal.
  • the embodiment of the present invention further provides an apparatus for improving the voice quality of a mobile terminal, where the apparatus includes: a human mouth location information acquiring unit configured to acquire a human mouth location information of a user who uses the mobile terminal to make a call;
  • a sound receiving area acquiring unit configured to acquire a sound receiving area of the double microphone or the multi-mike according to the position information of the human mouth;
  • a processing unit configured to determine whether the sound receiving area exceeds a default sound receiving area preset by the mobile terminal, and if yes, adjust the sound receiving area to a calling sound receiving area; if not, the default sound receiving area As the radio area of the call.
  • the human mouth position information acquiring unit is configured to collect the face position information by using a camera of the mobile terminal, and acquire the human mouth position information according to the face position information.
  • the processing unit is configured to determine whether the single face is a face that is preset to be tracked in the mobile terminal, and if yes, according to the method, The position information of the mouth corresponding to the single face acquires the sound receiving area of the double microphone or the multi-mike; if not, all the sound collecting areas collected by the single microphone are used as the sound collecting area of the call.
  • the processing unit is configured to adjust the overall sound receiving area as a sound receiving area of the call, or use all the collected sound areas collected by the single microphone as the sound collecting area of the call.
  • the overall sound receiving area includes a sound receiving area corresponding to each of the plurality of human faces.
  • the human mouth position information includes a direction and a distance of the human mouth relative to the mobile terminal.
  • the microphone receiving area is adjusted, the call quality of the mobile terminal is improved, and the downlink sound area of the external terminal can be avoided, and the effect of echo cancellation of the hands-free call can be improved.
  • FIG. 2 is a flowchart of a method for improving the voice quality of a mobile terminal according to an embodiment of the present invention
  • FIG. 3 is a schematic structural diagram of an apparatus for improving the voice quality of a mobile terminal according to an embodiment of the present invention. detailed description
  • the present invention uses the principle to track the position of the person's mouth of the caller and adjust the sound receiving area in real time, thereby achieving the purpose of improving the voice quality of the call.
  • an embodiment of the present invention relates to a method for improving a voice quality of a mobile terminal, including:
  • Step S201 acquiring human mouth position information of the user who uses the mobile terminal to make a call; in this step, the camera position of the caller may be collected by the camera of the mobile terminal, and the face recognition technology is used to identify the position of the mouth of the person,
  • the human mouth performs real-time tracking.
  • Face recognition technology has been widely used in smart mobile terminals, such as Android (android) system has integrated this feature in version 4.0. In practice, it may be, but is not limited to, using only the front camera of the mobile terminal. If the mobile terminal has a back end camera, it can be turned on, so that the angle of the detection area can be expanded from 180 degrees to 360 degrees. However, the back-end camera avatar is usually higher, and it is more power-consuming to use the back-end camera for ⁇ ⁇ .
  • the caller's mouth position information can also be obtained by other technologies, for example: using the voice recognition technology to identify the caller, and using the audio ranging technology to determine the position and distance of the caller's mouth relative to the mobile terminal.
  • the invention mainly needs to obtain the mouth of the caller relative to the mouth
  • the orientation and distance of the mobile terminal, as for the means and methods of acquisition, need not be limited.
  • Step S202 acquiring a sound receiving area of the double microphone or the multi-mike according to the position information of the human mouth; in this step, using various different noise reduction algorithms according to the orientation and distance of the mouth of the person acquiring the caller relative to the mobile terminal , calculate the radio area of the mobile terminal.
  • various noise reduction algorithms are also different.
  • the present invention does not limit a specific noise reduction algorithm. For various noise reduction algorithms, as long as the caller's mouth position information is utilized, The present invention can be applied to the determination of a new radio zone.
  • some mobile terminals have used multiple microphones, such as three microphones, to determine the radio area, and the radio effect is better.
  • the mobile terminal uses the three microphones to determine a more accurate radio area from the three-dimensional angle.
  • the present invention adjusts the sound receiving area by tracking the position of the caller's mouth based on the noise reduction by the three microphones, thereby improving the voice quality of the call.
  • the sound receiving area in this embodiment refers to an area of the voice information collected by the mobile terminal through the noise reduction calculation, which may be an area determined by a dual microphone noise reduction algorithm, or may be an area determined by a single microphone or multiple microphones. .
  • Step S203 determining whether the sound receiving area exceeds a default sound receiving area preset by the mobile terminal, and if yes, adjusting the sound receiving area to a sound receiving area of a call; if not, using the default sound receiving area as a call The radio area.
  • the mobile terminal is provided with a default radio area at the factory.
  • the default radio area is set according to the position of most call faces under normal conditions.
  • the default radio area setting needs to be determined by a large number of tests or simulations by the pre-audio debugger. .
  • the adjusted radio area is the default radio area for the next radio area adjustment. This ensures continuous real-time adjustment of the radio area. Of course, it can also be used by the caller every time.
  • the sound zone determined by the mouth is compared with the default radio zone at the factory.
  • the mobile terminal judges, it can be determined that the radio area determined according to the caller's mouth exceeds the default. Whether the range of the radio area exceeds the preset area difference threshold, and if so, the radio area adjustment is performed, and if not, the existing radio area is maintained as the radio area of the call.
  • the camera may collect a single face location information.
  • the mobile terminal may track any face in the collection area, and directly follow the steps.
  • S202, S203 perform processing; face recognition may also be performed to distinguish whether to track a fixed face, such as a face of a mobile terminal owner; when pre-set is to track a fixed face, and preset
  • the tracked face information determines whether the single face of the camera is a face that is preset to be tracked in the mobile terminal, and if so, according to the position information of the mouth corresponding to the single face of the camera.
  • the camera may also collect a plurality of face position information.
  • the radio area should be adjusted to be wider, and all the radio areas corresponding to the mouth of the person should be included as the worst.
  • the effect is to use a single microphone to perform the arpeggio, that is, to adjust the overall radio area to the radio area of the call, or to use all the radio areas collected by the single microphone as the radio area of the call; wherein the overall radio area includes multiple faces The radio area corresponding to each person's mouth position information.
  • step S201 the face moving speed of the caller can also be detected.
  • the face moving speed of the caller exceeds a preset speed threshold, the caller's mouth position information cannot be collected, or the sound receiving area cannot be completed. Adjusting the operation, in this case, you can adjust the radio area more widely, for example: use the maximum radio area supported by the mobile terminal as the radio area of the call, or use a single microphone to reduce noise, and use all the radio areas collected by the single microphone as the call. The radio area.
  • the face moving speed of the caller does not exceed (including the threshold) the preset speed threshold, the processing is performed in accordance with steps S202 and S203.
  • the present invention further relates to an apparatus for improving the voice quality of a mobile terminal by implementing the foregoing method, including: a human mouth location information obtaining unit 301, configured to acquire human mouth location information of a user who uses the mobile terminal to make a call;
  • a sound receiving area obtaining unit 302 configured to acquire a sound receiving area of the double microphone or the multi-mike according to the position information of the human mouth;
  • the processing unit 303 is configured to determine whether the radio area exceeds the default radio area preset by the mobile terminal, and if yes, adjust the radio area to the radio area of the call; if not, the default radio area is used as the radio area of the call.
  • the human mouth position information acquiring unit 301 uses the camera of the mobile terminal to collect the face position information, and obtains the mouth position information according to the face position information;
  • the human mouth position information includes a direction and a distance of the human mouth relative to the mobile terminal.
  • the processing unit 303 determines whether the single face is a face that is preset to be tracked in the mobile terminal, and if so, according to the position information of the mouth corresponding to the single face , obtain the radio area of the dual microphone or multi-mike; if not, use all the radio areas collected by the single microphone as the radio area of the call.
  • the processing unit 303 adjusts the overall sound receiving area to the sound receiving area of the call, or uses all the collected sound areas collected by the single microphone as the sound collecting area of the call; wherein, the overall sound receiving area includes more The radio area corresponding to each person's mouth position information in the face.
  • the embodiment of the invention is a method for simulating a mobile terminal as a human speech, and the dual microphone simulates the binaural, the camera simulates the eye, combines "listening" with “seeing”, and determines the position of the effective sound source and the noise source by "seeing”. , adjust the "listen” parameters in real time, so that the audio quality can be effectively improved.
  • the specific implementation is to use the equipment of the current smart machine or tablet computer, namely: dual microphone and camera, combined with face recognition technology (currently most intelligent platforms support face recognition technology), according to face recognition
  • face recognition technology currently most intelligent platforms support face recognition technology
  • the invention has a great improvement on the sound effect of the hands-free and three-segment earphone conversation.
  • the three-segment earphone itself does not have a microphone device, and the call can only use the mobile terminal inherent microphone to receive the sound. It can be seen that the present invention improves the call quality of the mobile terminal by checking the position of the human mouth, thereby improving the call quality of the mobile terminal, and can avoid the downlink sound region of the external release, thereby improving the echo cancellation of the hands-free call.
  • the sound receiving area is adjusted to the sound receiving area of the call. Otherwise, the default radio area is used as the radio area for the call. Therefore, the quality of the call of the mobile terminal can be improved, and the sound region of the outgoing sound can be avoided, and the effect of echo cancellation of the hands-free call can be improved.

Landscapes

  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Otolaryngology (AREA)
  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Telephone Function (AREA)
  • Mobile Radio Communication Systems (AREA)

Abstract

本发明公开了一种提高移动终端通话音质的方法及装置,该方法包括:获取利用移动终端进行通话的用户的人嘴位置信息;根据所述人嘴位置信息,获取双麦克或多麦克的收音区域;判断所述收音区域是否超出了所述移动终端预先设置的默认收音区域,如果是,则将所述收音区域调整为通话的收音区域;否则,以所述默认收音区域作为通话的收音区域。本发明通过检查人嘴的位置,进而调整麦克的收音区域,提高了移动终端的通话质量,而且能避开外放的下行声音区域,且可改善免提通话的回声消除的效果。

Description

一种提高移动终端通话音质的方法及装置 技术领域
本发明涉及移动通讯技术领域, 特别是涉及一种提高移动终端通话音 质的方法及装置。 背景技术
随着移动终端产品消费量的增大, 尤其是智能机和平板电脑的迅猛发 展, 无论是使用移动网络还是 WiFi ( Wireless Fidelity, 无线保真) 网络, 用户可以随时随地的使用移动终端和他人进行通话。 同时, 用户对其通话 音质的要求也越来越高。
为了满足用户在嘈杂的环境下正常通话的需要, 双麦克已经变成了他、 移动终端设备的标准配置。 单麦克的移动终端设备在噪声消除方面都是釆 用稳态噪声估计方式, 只能对平稳的噪声, 例如风声, 有较好的抑制, 但 是双麦克的移动终端釆用了空间滤波的方式, 可以将声音集中在某一个区 域, 这样可以最大限度的减少噪声和回声。
正是这种区域性, 导致了双麦克降噪方案要求人嘴说话的位置非常严 格, 相应技术只能应用于移动终端的手持通话模式, 对于应用场景更加广 泛的免提和三段式耳机通话效果却非常差, 所述三段式耳机本身没有麦克 设备, 通话只能釆用移动终端固有麦克来接收声音。 尤其是当前 WiFi功能 在智能移动终端的应用, VOIP ( Voice on Internet Protocol )技术的广泛釆用, 这种免提和三段式耳机通话的应用越来越广泛, 但相关的技术却没有得到 改善, 成为了瓶颈。 发明内容
本发明实施例要解决的技术问题是一种提高移动终端通话音质的方法 及装置, 用以解决现有技术中人嘴位置改变带来的通话质量下降的问题。
为解决上述技术问题, 一方面, 本发明实施例提供一种提高移动终端 通话音质的方法, 该方法包括:
获取利用移动终端进行通话的用户的人嘴位置信息;
根据所述人嘴位置信息, 获取双麦克或多麦克的收音区域;
判断所述收音区域是否超出了所述移动终端预先设置的默认收音区 域, 如果是, 则将所述收音区域调整为通话的收音区域; 如果否, 则以所 述默认收音区域作为通话的收音区域。
其中, 所述获取用户的人嘴位置信息, 为: 利用所述移动终端的摄像 头釆集所述人脸位置信息, 根据所述人脸位置信息, 获取所述人嘴位置信 息。
其中, 当所述摄像头釆集的是单张人脸位置信息时, 所述通话的收音 区域的确定方法为: 判断所述单张人脸是否为预设在移动终端中追踪的人 脸, 如果是, 则根据所述单张人脸对应的人嘴位置信息, 获取双麦克或多 麦克的收音区域; 如果否, 则将单麦克收集的所有收音区域作为通话的收 音区域。
其中, 当所述摄像头釆集的是多张人脸位置信息时, 所述通话的收音 区域的确定方法为: 将整体收音区域调整为通话的收音区域, 或将单麦克 收集的所有收音区域作为通话的收音区域; 其中, 所述整体收音区域包括 多张人脸中的每个人嘴位置信息对应的收音区域。
其中, 所述人嘴位置信息包括人嘴相对于移动终端的方向和距离。 另一方面, 本发明实施例还提供一种提高移动终端通话音质的装置, 该装置包括: 人嘴位置信息获取单元, 配置为获取利用移动终端进行通话的用户的 人嘴位置信息;
收音区域获取单元, 配置为根据所述人嘴位置信息, 获取双麦克或多 麦克的收音区域;
处理单元, 配置为判断所述收音区域是否超出了所述移动终端预先设 置的默认收音区域, 如果是, 则将所述收音区域调整为通话的收音区域; 如果否, 则以所述默认收音区域作为通话的收音区域。
其中, 所述人嘴位置信息获取单元, 配置为利用所述移动终端的摄像 头釆集所述人脸位置信息, 根据所述人脸位置信息, 获取所述人嘴位置信 息。
其中, 当所述摄像头釆集的是单张人脸位置信息时, 所述处理单元, 配置为判断所述单张人脸是否为预设在移动终端中追踪的人脸, 如果是, 则根据所述单张人脸对应的人嘴位置信息, 获取双麦克或多麦克的收音区 域; 如果否, 则将单麦克收集的所有收音区域作为通话的收音区域。
其中, 当所述摄像头釆集的是多张人脸位置信息时, 所述处理单元, 配置为将整体收音区域调整为通话的收音区域, 或将单麦克收集的所有收 音区域作为通话的收音区域; 其中, 所述整体收音区域包括多张人脸中的 每个人嘴位置信息对应的收音区域。
其中, 所述人嘴位置信息包括人嘴相对于移动终端的方向和距离。 本发明实施例的有益效果如下:
本发明实施例通过检查人嘴的位置, 进而调整麦克的收音区域, 提高 了移动终端的通话质量, 而且能够避开外放的下行声音区域, 能够改善免 提通话的回声消除的效果。 附图说明
图 1 是本发明实施例中移动终端 (手机)双麦克调整收音区域的效果 示意图;
图 2 是本发明实施例中一种提高移动终端通话音质的方法的流程图; 图 3 是本发明实施例中一种提高移动终端通话音质的装置的结构示意 图。 具体实施方式
以下结合附图以及实施例, 对本发明进行进一步详细说明。 应当理解, 此处所描述的具体实施例仅仅用以解释本发明, 并不限定本发明。
如图 1 所示, 本领域中, 移动终端, 例如: 手机利用收音麦克和降噪 麦克进行通话时, 通过改变降噪参数, 利用不同的降噪算法就可以实现对 收音区域的调整, 例如: 从收音区域 A调整到收音区域 因此, 本发明 利用该原理, 通过追踪通话者的人嘴位置, 实时调整收音区域, 进而达到 提高通话音质的目的。
如图 2所示, 本发明实施例涉及一种提高移动终端通话音质的方法, 包括:
步骤 S201 , 获取利用移动终端进行通话的用户的人嘴位置信息; 本步骤中, 可以利用移动终端的摄像头釆集通话者的人脸位置信息, 利用人脸识别技术来识别人嘴的位置, 对人嘴进行实时的跟踪。 人脸识别 技术在智能移动终端上已经得到了广泛的应用, 比如安卓 (android ) 系统 已经在 4.0版本上集成了此项功能。 在实际操作中, 可以但并不限于仅仅使 用移动终端前端摄像头, 如果移动终端有后端摄像头的话, 也可以开启使 用, 这样检测区域的角度就可以从 180度扩展到 360度。 但后端摄像头像 素通常较高, 利用后端摄像头进行釆集会更耗电。
另外, 也可以通过其它技术获取通话者的人嘴位置信息, 例如: 利用 语音识别技术对通话者进行识别, 利用音频测距技术确定通话者的人嘴相 对于移动终端的方位和距离。 本发明主要是需要获取通话者的人嘴相对于 移动终端的方位和距离, 至于获取的手段和方式则不需限定。
步骤 S202 ,根据所述人嘴位置信息,获取双麦克或多麦克的收音区域; 本步骤中, 就是根据获取通话者的人嘴相对于移动终端的方位和距离, 利用各种不同的降噪算法, 计算移动终端的收音区域。 目前, 利用双麦克 进行降噪的移动终端很多, 各种降噪算法也各不相同, 本发明不限定具体 的降噪算法, 对于各种降噪算法, 只要利用通话者的人嘴位置信息, 可以 确定出新的收音区域, 就适用本发明。
另外, 目前一些移动终端出现了利用多麦克, 例如三麦克, 来确定收 音区域, 收音效果更好, 移动终端利用三麦克从三维的角度确定一个更加 精确的收音区域。 但是, 目前没有移动终端对收音区域进行调整, 因此, 本发明在其利用三麦克进行降噪的基础上, 通过追踪通话者人嘴的位置来 调整收音区域, 进而提高通话音质。
本实施例中的收音区域, 是指移动终端通过降噪计算确定的釆集通话 者语音信息的区域, 可以是通过双麦克降噪算法确定的区域, 也可以是单 麦克或多麦克确定的区域。
步骤 S203 , 判断所述收音区域是否超出了所述移动终端预先设置的默 认收音区域, 如果是, 则将所述收音区域调整为通话的收音区域; 如果否, 则以所述默认收音区域作为通话的收音区域。
移动终端在出厂时都设置有默认的收音区域, 默认收音区域是按照正 常情况下多数通话人脸所处位置进行设定的, 默认收音区域设定需要前期 音频调试人员通过大量测试或仿真进行确定。 另外, 当收音区域调整后, 调整后的收音区域就是作为下一次进行收音区域调整的默认收音区域, 这 样, 就可以保证连续实时的调整收音区域; 当然, 也可以每次都用根据通 话者人嘴确定的收音区域和出厂时默认的收音区域进行比较。
移动终端判断时, 可以判断根据通话者人嘴确定的收音区域超出默认 收音区域的范围是不是超过了预先设定的区域差别阔值, 如果是, 则进行 收音区域调整, 如果否, 则保持现有的收音区域为通话的收音区域。
另外, 步骤 S201中, 摄像头釆集的可能是单张人脸位置信息, 此种情 况时, 移动终端可以追踪任意一张在釆集区域内的人脸, 直接按照步骤
S202、 S203进行处理; 也可以进行人脸识别, 来区分是否追踪一张固定人 脸, 例如移动终端机主的人脸,; 当预先设置的是追踪一张固定人脸, 并且 预先设置了要追踪的人脸信息, 则判断摄像头釆集的单张人脸是否为预设 在移动终端中追踪的人脸, 如果是, 则根据摄像头釆集的单张人脸对应的 人嘴位置信息, 获取双麦克或多麦克的收音区域; 如果否, 则将单麦克收 集的所有收音区域作为通话的收音区域。
步骤 S201中,摄像头釆集的也可能是多张人脸位置信息,此种情况时, 应该将收音区域调整的更广阔一些, 尽量将所有的人嘴对应的收音区域都 包含在内, 最差的效果是釆用单麦克进行釆音, 即: 则将整体收音区域调 整为通话的收音区域, 或将单麦克收集的所有收音区域作为通话的收音区 域; 其中, 整体收音区域包括多张人脸中的每个人嘴位置信息对应的收音 区域。
步骤 S201中, 还可以检测通话者的人脸移动速度, 当通话者的人脸移 动速度超过预先设定的速度阈值时, 将无法釆集通话者的人嘴位置信息, 或无法完成收音区域的调整操作, 这时, 可以将收音区域调整的更加广泛, 例如: 将移动终端支持的最大收音区域作为通话的收音区域, 或者釆用单 麦克进行降噪, 将单麦克收集的所有收音区域作为通话的收音区域。 当通 话者的人脸移动速度没有超过(包括该阔值)预先设定的速度阔值时, 则 按照步骤 S202、 S203进行处理。
另外, 如图 3 所示, 本发明还涉及一种实现上述方法的提高移动终端 通话音质的装置, 包括: 人嘴位置信息获取单元 301 ,用于获取利用移动终端进行通话的用户的 人嘴位置信息;
收音区域获取单元 302 , 用于根据人嘴位置信息, 获取双麦克或多麦克 的收音区域;
处理单元 303 ,用于判断收音区域是否超出了移动终端预先设置的默认 收音区域, 如果是, 则将收音区域调整为通话的收音区域; 如果否, 则以 默认收音区域作为通话的收音区域。
人嘴位置信息获取单元 301利用移动终端的摄像头釆集人脸位置信息, 根据人脸位置信息, 获取人嘴位置信息;
其中, 人嘴位置信息包括人嘴相对于移动终端的方向和距离。
当摄像头釆集的是单张人脸位置信息时, 处理单元 303 判断单张人脸 是否为预设在移动终端中追踪的人脸, 如果是, 则根据单张人脸对应的人 嘴位置信息, 获取双麦克或多麦克的收音区域; 如果否, 则将单麦克收集 的所有收音区域作为通话的收音区域。
当摄像头釆集的是多张人脸位置信息时, 处理单元 303 将整体收音区 域调整为通话的收音区域, 或将单麦克收集的所有收音区域作为通话的收 音区域; 其中, 整体收音区域包括多张人脸中的每个人嘴位置信息对应的 收音区域。
本发明实施例是将移动终端模拟为人说话的处理方式, 将双麦克模拟 双耳, 摄像头模拟眼睛, 将 "听" 和 "看" 结合起来, 通过 "看" 判断出 有效音源和噪声音源的位置, 实时调整 "听" 的参数, 这样就可以有效的 提高音频音质。 具体实现就是利用目前智能机或者平板电脑本身带有的设 备, 即: 双麦克和摄像头, 结合人脸识别技术(目前绝大多数智能平台都 是支持人脸识别技术 ), 根据人脸鉴别出人嘴相对于移动终端的位置变化情 况, 设置新的双麦克捕获声音的区域, 从而保证人在通话过程中的音质。 本发明对免提和三段式耳机通话的音效都有较大的提升, 所述三段式耳机 本身没有麦克设备, 通话只能釆用移动终端固有麦克来接收声音。 由此可 见, 本发明通过检查人嘴的位置, 进而调整麦克的收音区域, 提高了移动 终端的通话质量, 而且能够避开外放的下行声音区域, 能够改善免提通话 的回声消除。
尽管为示例目的, 已经公开了本发明的优选实施例, 本领域的技术人 员将意识到各种改进、 增加和取代也是可能的, 因此, 本发明的范围应当 不限于上述实施例。 工业实用性
本发明实施例通过获取用户的人嘴位置信息, 并获取麦克的收音区域, 确定所述收音区域超出了所述移动终端预先设置的默认收音区域, 则将所 述收音区域调整为通话的收音区域; 否则, 以所述默认收音区域作为通话 的收音区域。 因此, 可提高移动终端的通话质量, 而且能够避开外放的下 行声音区域, 能够改善免提通话的回声消除的效果。

Claims

权利要求书
1、 一种提高移动终端通话音质的方法, 该方法包括:
获取利用移动终端进行通话的用户的人嘴位置信息;
根据所述人嘴位置信息, 获取双麦克或多麦克的收音区域;
判断所述收音区域是否超出了所述移动终端预先设置的默认收音区 域, 如果是, 则将所述收音区域调整为通话的收音区域; 否则, 以所述默 认收音区域作为通话的收音区域。
2、 如权利要求 1所述的提高移动终端通话音质的方法, 其中, 所述获 取用户的人嘴位置信息, 为:
利用所述移动终端的摄像头釆集所述人脸位置信息, 根据所述人脸位 置信息, 获取所述人嘴位置信息。
3、 如权利要求 2所述的提高移动终端通话音质的方法, 其中, 当所述 摄像头釆集的是单张人脸位置信息时, 所述通话的收音区域的确定方法为: 判断所述单张人脸是否为预设在移动终端中追踪的人脸, 如果是, 则 根据所述单张人脸对应的人嘴位置信息, 获取双麦克或多麦克的收音区域; 否则, 将单麦克收集的所有收音区域作为通话的收音区域。
4、 如权利要求 2所述的提高移动终端通话音质的方法, 其中, 当所述 摄像头釆集的是多张人脸位置信息时, 所述通话的收音区域的确定方法为: 将整体收音区域调整为通话的收音区域, 或将单麦克收集的所有收音 区域作为通话的收音区域; 其中, 所述整体收音区域包括多张人脸中的每 个人嘴位置信息对应的收音区域。
5、 如权利要求 1~4中任一项所述的提高移动终端通话音质的方法, 其 中,
所述人嘴位置信息, 包括人嘴相对于移动终端的方向和距离。
6、 一种提高移动终端通话音质的装置, 该装置包括: 人嘴位置信息获取单元, 配置为获取利用移动终端进行通话的用户的 人嘴位置信息;
收音区域获取单元, 配置为根据所述人嘴位置信息, 获取双麦克或多 麦克的收音区域;
处理单元, 配置为判断所述收音区域是否超出了所述移动终端预先设 置的默认收音区域, 如果是, 则将所述收音区域调整为通话的收音区域; 否则, 以所述默认收音区域作为通话的收音区域。
7、 如权利要求 6所述的提高移动终端通话音质的装置, 其中, 所述人嘴位置信息获取单元, 配置为利用所述移动终端的摄像头釆集 所述人脸位置信息, 根据所述人脸位置信息, 获取所述人嘴位置信息。
8、 如权利要求 7所述的提高移动终端通话音质的装置, 其中, 当所述 摄像头釆集的是单张人脸位置信息时,
所述处理单元, 配置为判断所述单张人脸是否为预设在移动终端中追 踪的人脸, 如果是, 则根据所述单张人脸对应的人嘴位置信息, 获取双麦 克或多麦克的收音区域; 如果否, 则将单麦克收集的所有收音区域作为通 话的收音区域。
9、 如权利要求 7所述的提高移动终端通话音质的装置, 其中, 当所述 摄像头釆集的是多张人脸位置信息时,
所述处理单元, 配置为将整体收音区域调整为通话的收音区域, 或将 单麦克收集的所有收音区域作为通话的收音区域; 其中, 所述整体收音区 域包括多张人脸中的每个人嘴位置信息对应的收音区域。
10、 如权利要求 6~9 中任一项所述的提高移动终端通话音质的装置, 其中, 所述人嘴位置信息, 包括人嘴相对于移动终端的方向和距离。
PCT/CN2013/077711 2012-10-09 2013-06-21 一种提高移动终端通话音质的方法及装置 WO2013170802A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201210379489.9A CN103716446B (zh) 2012-10-09 2012-10-09 一种提高移动终端通话音质的方法及装置
CN201210379489.9 2012-10-09

Publications (1)

Publication Number Publication Date
WO2013170802A1 true WO2013170802A1 (zh) 2013-11-21

Family

ID=49583166

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2013/077711 WO2013170802A1 (zh) 2012-10-09 2013-06-21 一种提高移动终端通话音质的方法及装置

Country Status (2)

Country Link
CN (1) CN103716446B (zh)
WO (1) WO2013170802A1 (zh)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104052886B (zh) * 2014-06-27 2018-02-27 联想(北京)有限公司 一种信息处理方法及电子设备
CN104320729A (zh) * 2014-10-09 2015-01-28 深圳市金立通信设备有限公司 一种拾音方法
CN104410778A (zh) * 2014-10-09 2015-03-11 深圳市金立通信设备有限公司 一种终端
CN106302974B (zh) * 2015-06-12 2020-01-31 联想(北京)有限公司 一种信息处理的方法及电子设备
CN114374903B (zh) * 2020-10-16 2023-04-07 华为技术有限公司 拾音方法和拾音装置

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2215092A (en) * 1988-01-30 1989-09-13 Toshiba Kk Control of microphone position to receive voice input
JPH0728488A (ja) * 1993-06-24 1995-01-31 Canon Inc 情報処理方法及び装置
CN1612577A (zh) * 2003-10-30 2005-05-04 日本电气株式会社 移动电话
CN102223594A (zh) * 2010-04-19 2011-10-19 鸿富锦精密工业(深圳)有限公司 麦克风控制装置及方法
CN102378097A (zh) * 2010-08-25 2012-03-14 鸿富锦精密工业(深圳)有限公司 麦克风控制系统及方法

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2215092A (en) * 1988-01-30 1989-09-13 Toshiba Kk Control of microphone position to receive voice input
JPH0728488A (ja) * 1993-06-24 1995-01-31 Canon Inc 情報処理方法及び装置
CN1612577A (zh) * 2003-10-30 2005-05-04 日本电气株式会社 移动电话
CN102223594A (zh) * 2010-04-19 2011-10-19 鸿富锦精密工业(深圳)有限公司 麦克风控制装置及方法
CN102378097A (zh) * 2010-08-25 2012-03-14 鸿富锦精密工业(深圳)有限公司 麦克风控制系统及方法

Also Published As

Publication number Publication date
CN103716446A (zh) 2014-04-09
CN103716446B (zh) 2016-12-21

Similar Documents

Publication Publication Date Title
US9756422B2 (en) Noise estimation in a mobile device using an external acoustic microphone signal
EP3122066B1 (en) Audio enhancement via opportunistic use of microphones
CN110493678B (zh) 耳机的控制方法、装置、耳机和存储介质
US9510112B2 (en) External microphone array and hearing aid using it
US8606249B1 (en) Methods and systems for enhancing audio quality during teleconferencing
CN104038625B (zh) 来电时电话音频的自动路由
WO2014161309A1 (zh) 一种移动终端实现声源定位的方法及装置
WO2014101429A1 (zh) 一种终端双麦克风降噪的方法及装置
US20150172830A1 (en) Method of Audio Signal Processing and Hearing Aid System for Implementing the Same
WO2015139642A1 (zh) 一种实现蓝牙耳机降噪的方法、装置和系统
WO2020057419A1 (zh) 一种音频控制方法和装置、及终端
EP2426950A2 (en) Noise suppression for sending voice with binaural microphones
WO2013170802A1 (zh) 一种提高移动终端通话音质的方法及装置
CN106375573B (zh) 一种切换通话模式的方法及装置
CN102098372A (zh) 一种手持设备听筒音量智能调节的方法及其设备
CN105827793B (zh) 一种语音定向输出方法及移动终端
TW201801069A (zh) 語音資訊的接收方法、系統及裝置
CN115482830B (zh) 语音增强方法及相关设备
CN104168369A (zh) 一种终端情景模式的调整方法及装置
WO2016015186A1 (zh) 通信设备的声音信号处理方法和设备
JP2007028134A (ja) 携帯電話機
CN114333886A (zh) 音频处理方法、装置、电子设备及存储介质
WO2017166495A1 (zh) 一种语音信号处理方法及装置
JP6290827B2 (ja) オーディオ信号を処理する方法及び補聴器システム
KR101848458B1 (ko) 레코딩 방법 및 그 장치

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 13790388

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 13790388

Country of ref document: EP

Kind code of ref document: A1