CN103390410A - System and method for long-distance telephone conference - Google Patents

System and method for long-distance telephone conference Download PDF

Info

Publication number
CN103390410A
CN103390410A CN2012101442289A CN201210144228A CN103390410A CN 103390410 A CN103390410 A CN 103390410A CN 2012101442289 A CN2012101442289 A CN 2012101442289A CN 201210144228 A CN201210144228 A CN 201210144228A CN 103390410 A CN103390410 A CN 103390410A
Authority
CN
China
Prior art keywords
sound
far
sources
remote phone
phone conference
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2012101442289A
Other languages
Chinese (zh)
Inventor
徐筱琦
杨朝光
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Acer Inc
Original Assignee
Acer Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Acer Inc filed Critical Acer Inc
Priority to CN2012101442289A priority Critical patent/CN103390410A/en
Publication of CN103390410A publication Critical patent/CN103390410A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Telephonic Communication Services (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

The invention provides a system and method for a long-distance telephone conference. The system comprises a far-end microphone array, a sound recognition module, a near-end display interface and a sound adjusting module. The far-end microphone array is used for receiving far-end sounds. The voice recognition module is used for recognizing a plurality of sound sources from the far-end sounds. The near-end display interface is used for displaying the sound sources. The sound adjusting module is used for performing adjustment according to sound characteristics of all of the sound sources respectively. Visualization can be performed on spatial positions of far-end participants in the conference, compared with the prior art, the system is better beneficial to understanding of seat relations of the far-end participants for near-end participants, therefore, a basis is provided for adjusting sound parameters, and the purpose of improving the quality of the long-distance telephone conference is achieved.

Description

Remote phone conference system and method
Technical field
The present invention relates to the teleconference technology.
Background technology
The remote phone conference system is the common means of communication of a kind of business office, and it can make both sides, tripartite, personnel are not subjected to linking up of regional limits even in many ways.
In the remote phone meeting, with regard to the both sides of conversation, the participant of far-end or near-end is a people all not only.Some conference system can be respectively the microphone of each participant's configure dedicated, though so can guarantee that each participant's speech can be received really, its authentication program and conference management mechanism are comparatively complicated; In addition, when the participant increased, to the i.e. increase thereupon of demand of number of microphone, and between adjacent microphone, the situation of sound interference also can become more serious.For setting up of Convenient telephone conference system, most teleconferences can not be the microphone of each participant's configure dedicated, but allow all participants of each side share identical microphone.Yet, be subject to the arrangement at seat, when participant's distance microphone far and near different, the radio reception effect of microphone also can be thereupon different, the quality of the both sides' conversations that so namely detracted.
Therefore need a kind of more convenient handy remote phone conference system and method.
Summary of the invention
In order to overcome the defect of prior art, the invention provides a kind of remote phone conference system, it comprises: a far-end microphone array is arranged at far-end, in order to receive far-end sound; One voice recognition module, be coupled to this far-end microphone array, in order to pick out a plurality of sources of sound from far-end sound; One near-end display interface, be arranged at near-end, is coupled to this voice recognition module, in order to the described a plurality of sources of sound that show that this voice recognition module picks out; One sound adjusting module, be coupled to this voice recognition module, in order to adjust for a sound characteristic of this source of sound respectively respectively.
The present invention separately provides a kind of remote phone conference method, and it comprises: with a far-end microphone array, receive far-end sound; Pick out a plurality of sources of sound from far-end sound; Show the described a plurality of sources of sound that picked out with a near-end display interface; Adjust at least one sound characteristic of this source of sound respectively respectively.
The present invention can give visualization with far-end participant's locus, in terms of existing technologies, more help the near-end participant to understand far-end participant's seat relation, and the basis of adjusting audio parameter is provided whereby, reach the purpose that promotes remote phone meeting quality.
Description of drawings
Fig. 1 is the remote phone conference system configuration diagram according to one embodiment of the invention.
Fig. 2 is the remote phone conference method process flow diagram according to one embodiment of the invention.
Wherein, description of reference numerals is as follows:
100 ~ remote phone conference system;
102 ~ far-end microphone array;
104 ~ voice recognition module;
106 ~ near-end display interface;
108 ~ near-end is controlled interface;
110 ~ sound adjusting module;
112 ~ sound broadcasting module;
S202 ~ S210 ~ step.
Embodiment
Hereinafter for introducing most preferred embodiment of the present invention.Each embodiment is in order to principle of the present invention to be described, but non-in order to limit the present invention.Scope of the present invention is when with appending claims, being as the criterion.
, in order to make the remote phone conference system be easier to use, the invention provides a kind of new-type remote phone conference system.The various embodiment that hereinafter will coordinate description of drawings remote phone conference system of the present invention.
The remote phone conference system
Fig. 1 is the remote phone conference system configuration diagram according to one embodiment of the invention.Remote phone conference system 100 of the present invention comprises at least: a far-end microphone array 102, a voice recognition module 104, a near-end display interface 106, a near-end are controlled interface 108, a sound adjusting module 110 and a sound broadcasting module 112.For convenience of description, embodiment hereinafter is all as an example of one-way communication example (be the far-end user speaks, near-end user listen to), yet the present invention certainly needn't be as limit, and those of ordinary skills can be applied in the present invention in two-way communication easily.In like manner, the present invention is not limited to the type of both sides' conversation, and the type of MPTY is also within scope of the present invention.
Far-end microphone array 102 of the present invention is arranged at far-end, can be in order to receive far-end sound.Generally speaking, microphone array 102 generally includes two or more microphones.Microphone of the present invention is not limited to moving-coil type, condenser type or other various types of microphones.Those of ordinary skills can be arranged at appropriate location with microphone array 102 according to directive property and the conference space of number of microphone, each microphone.For example, can adopt the microphone array with omnidirectional field sensitivity in roundtable conference, and it is arranged at the round table center.
Voice recognition module 104 of the present invention does not limit and is arranged at far-end or near-end, as long as can be connected to aforementioned distal microphone array 102 by the wired or wireless communication mode.It should be noted that key character of the present invention is that namely voice recognition module 104 of the present invention can be according to various existing acoustics calculation technology, identification and isolate a plurality of different sources of sound separately from the obtained far-end sound that mixes of microphone array 102.For example, these sources of sound namely comprise each participant's voice, and the noise of various non-voices.Generally, acoustics calculation technology mainly can be divided into audio direction identification technique and tonequality identification technique.The audio direction identification technique can be utilized position and the sensitivity of each microphone in microphone array 102, calculates direction and the distance (being the position of source of sound in space) of each source of sound; The tonequality identification technique can be analyzed sound press, frequency spectrum and the waveform of each source of sound, so as to obtaining the sound characteristics such as each source of sound such as volume, sharpness, audio frequency and tonequality (or claim tone color), even therefrom judge each source of sound be whether voice, whether be noise, speaker's summary sex and age are estimated.In more detail, because voice are not continual sound, and its volume and audio frequency all may change, therefore, in better embodiment, the sustainable intersection of the voice recognition module 104 of the present invention comparison position of one source of sound in space with and tonequality, reach the purpose of following the trail of this source of sound of locking.In addition, in certain embodiments, voice recognition module 104 also can be carried out the action of general noise filtering and echo elimination.Yet, the emphasis of wanting to emphasize due to the non-the present invention of aforementioned acoustic processing ins and outs, and it can be reached by various existing technology, and therefore, this paper is no longer repeated to save space.
Near-end display interface 106 of the present invention (being screen) is arranged at near-end, it is coupled to this voice recognition module 104, can be in order to show each source of sound that this voice recognition module 104 picks out to the near-end user, even, in certain embodiments, the every sound characteristic that shows described a plurality of sources of sound.For example, in the simplest embodiment, near-end display interface 106 only shows with word the far-end source of sound that voice recognition module 104 is picked out, and invests respectively each source of sound of both having deposited as titles such as " participant 1 ", " participant 2 ".To have the newcomer to add fashionable whenever voice recognition module 104 detects far-end, and near-end display interface 106 can be marked it with eye-catching word.In a preferred embodiment, near-end display interface 106 can two dimension or three-dimensional picture simulation far-end conference space, and according to the coordinate of the locus, place of voice recognition module 104 each source of sound that detects, it is labeled on the correspondence position of virtual screen.Wherein, each source of sound is except having the titles such as " participant 1 ", " participant 2 ", but various sound characteristics of note still, such as: whether volume, sharpness, audio frequency, tonequality, are voice, speaker's the associated estimated information such as sex age, those of ordinary skills can be according to shown information project and the display styles thereof of spiritual designed, designed near-end display interface 106 of the present invention.It should be noted that teleconference technology of the present invention also can further be applied in video conference, but and near-end display interface 106 also the real screen that transmits of simultaneous display far-end to replace aforementioned virtual screen.By near-end display interface 106 of the present invention, the near-end user can grasp the participant situation of far-end easily.
Near-end of the present invention is controlled interface 108 and is coupled to sound adjusting module 110 of the present invention, can be in order to receive the control of user to sound adjusting module 110, and each source of sound that sound adjusting module 110 of the present invention can pick out for voice recognition module 104 according to user's control is adjusted respectively its sound characteristic, and sound characteristic namely comprises: volume, sharpness, audio frequency and/or tonequality.For example, the near-end user can increase the important participant's of some far-end volume by controlling sound adjusting module 110, or promotes its sharpness; Same, can reduce, the voice that send of some noise of filtering or non-participant even, strengthen whereby the speech quality of meeting.In some special embodiment, sound adjusting module 110 even can carry out various audios to each source of sound to be processed, and comprises and changes its audio frequency or tonequality, reaches the purpose of concealment speaker identity.Sound adjusting module 110 of the present invention is not limited to be arranged on near-end or far-end, as long as can be connected to this voice recognition module 104 by wired or wireless mode.In preferred embodiment, sound adjusting module 110 can be integrated among a processor with voice recognition module 104, reaches the purpose of strengthening acoustic processing usefulness.
Finally, sound broadcasting module 112 of the present invention is coupled to the near-end loudspeaker, can be in order to play each source of sound after aforementioned adjustment sound characteristic.Sound broadcasting module 112 of the present invention is not limited to be arranged on near-end or far-end equally, as long as can be connected to this sound adjusting module 110 by wired or wireless mode.In preferred embodiment, sound broadcasting module 112 also can be integrated among a processor with sound adjusting module 110 and voice recognition module 104.Those of ordinary skills can recognize, the difference of voice recognition module 104, sound adjusting module 110 and sound broadcasting module 112 only for convenience of description, within the merit able one that any processor has an aforementioned modules all belongs to the scope that the present invention contains.
The remote phone conference method
Except aforesaid remote phone conference system, the present invention separately provides a kind of remote phone conference method.Fig. 2 is the remote phone conference method process flow diagram according to one embodiment of the invention.The method 200 comprises: in step S202, with a far-end microphone array, receive far-end sound; In step S204, pick out a plurality of sources of sound from far-end sound; In step S206, show described a plurality of sources of sound and the sound characteristic thereof that is picked out with a near-end display interface; In step S208, adjust at least one sound characteristic of this source of sound respectively respectively; And in step S210, the described a plurality of sources of sound after sound characteristic are adjusted in broadcast.Wherein, step S204 can be by audio direction identification technique and/or tonequality identification technique and pick out described a plurality of source of sound from far-end sound, and these sound characteristics are direction, distance, volume, sharpness, audio frequency and/or the tonequality of each source of sound.Because those of ordinary skills can be with reference to aforementioned about understanding remote phone conference method of the present invention in each embodiment of remote phone conference system, so locate and will repeat no more its correlative detail to save space.
Though the present invention discloses as above with preferred embodiment; so it is not in order to limit scope of the present invention; any those of ordinary skills; without departing from the spirit and scope of the present invention; when doing a little change and retouching, so protection scope of the present invention is as the criterion when looking appended the scope that claim defines.

Claims (11)

1. remote phone conference system comprises:
One voice recognition module, in order to receive a far-end sound that receives from a far-end microphone array, and pick out a plurality of sources of sound in this far-end sound certainly;
One near-end display interface, be coupled to this voice recognition module, in order to the described a plurality of sources of sound that show that this voice recognition module picks out; And
One sound adjusting module, be coupled to this voice recognition module, in order to adjust for a sound characteristic of this source of sound respectively respectively.
2. remote phone conference system as claimed in claim 1 also comprises:
One near-end is controlled interface, is coupled to this sound adjusting module, in order to receive the control of this user to this sound adjusting module.
3. remote phone conference system as claimed in claim 1 also comprises:
One sound broadcasting module, be coupled to this sound adjusting module, in order to the described a plurality of sources of sound after broadcast adjustment sound characteristic.
4. remote phone conference system as claimed in claim 1, wherein this voice recognition module is that one of them picks out described a plurality of source of sound from this far-end sound by audio direction identification technique and tonequality identification technique.
5. remote phone conference system as claimed in claim 1, wherein this near-end display interface is also in order to show the sound characteristic of described a plurality of sources of sound that this voice recognition module picks out.
6. remote phone conference system as claimed in claim 1, the sound characteristic of wherein said a plurality of sources of sound comprises direction and/or the distance of described a plurality of sources of sound.
7. remote phone conference system as claimed in claim 1, the sound characteristic of wherein said a plurality of sources of sound is volumes of described a plurality of sources of sound.
8. remote phone conference system as claimed in claim 1, the sound characteristic of wherein said a plurality of sources of sound comprises the sharpness of described a plurality of sources of sound, audio frequency and/or tonequality.
9. remote phone conference method comprises:
Receive a far-end sound of a far-end microphone array;
Pick out a plurality of sources of sound from this far-end sound; And
Show the described a plurality of sources of sound that picked out with a near-end display interface;
Adjust at least one sound characteristic of this source of sound respectively respectively.
10. remote phone conference method as claimed in claim 9 also comprises:
One of them picks out described a plurality of source of sound from this far-end sound by audio direction identification technique and tonequality identification technique.
11. remote phone conference method as claimed in claim 9, the sound characteristic of wherein said a plurality of sources of sound comprise the direction of described a plurality of sources of sound, distance, volume, sharpness, audio frequency and tonequality one of them.
CN2012101442289A 2012-05-10 2012-05-10 System and method for long-distance telephone conference Pending CN103390410A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2012101442289A CN103390410A (en) 2012-05-10 2012-05-10 System and method for long-distance telephone conference

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2012101442289A CN103390410A (en) 2012-05-10 2012-05-10 System and method for long-distance telephone conference

Publications (1)

Publication Number Publication Date
CN103390410A true CN103390410A (en) 2013-11-13

Family

ID=49534656

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2012101442289A Pending CN103390410A (en) 2012-05-10 2012-05-10 System and method for long-distance telephone conference

Country Status (1)

Country Link
CN (1) CN103390410A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105991854A (en) * 2014-09-29 2016-10-05 上海兆言网络科技有限公司 System and method for visualizing VOIP teleconference on intelligent terminal
WO2016176951A1 (en) * 2015-05-06 2016-11-10 小米科技有限责任公司 Method and device for optimizing sound signal
CN106210365A (en) * 2015-05-28 2016-12-07 仁宝电脑工业股份有限公司 Videoconference method for regulation of sound volume and system
CN108922538A (en) * 2018-05-29 2018-11-30 平安科技(深圳)有限公司 Conferencing information recording method, device, computer equipment and storage medium
CN112148182A (en) * 2019-06-28 2020-12-29 华为技术服务有限公司 Interaction control method, terminal and storage medium
CN115361474A (en) * 2022-08-18 2022-11-18 上海复旦通讯股份有限公司 Method for auxiliary recognition of sound source in telephone conference

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030185411A1 (en) * 2002-04-02 2003-10-02 University Of Washington Single channel sound separation
US20090015651A1 (en) * 2007-07-11 2009-01-15 Hitachi, Ltd. Voice Communication Device, Voice Communication Method, and Voice Communication Program
US20090220065A1 (en) * 2008-03-03 2009-09-03 Sudhir Raman Ahuja Method and apparatus for active speaker selection using microphone arrays and speaker recognition
CN101690149A (en) * 2007-05-22 2010-03-31 艾利森电话股份有限公司 Methods and arrangements for group sound telecommunication

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030185411A1 (en) * 2002-04-02 2003-10-02 University Of Washington Single channel sound separation
CN101690149A (en) * 2007-05-22 2010-03-31 艾利森电话股份有限公司 Methods and arrangements for group sound telecommunication
US20090015651A1 (en) * 2007-07-11 2009-01-15 Hitachi, Ltd. Voice Communication Device, Voice Communication Method, and Voice Communication Program
US20090220065A1 (en) * 2008-03-03 2009-09-03 Sudhir Raman Ahuja Method and apparatus for active speaker selection using microphone arrays and speaker recognition

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105991854A (en) * 2014-09-29 2016-10-05 上海兆言网络科技有限公司 System and method for visualizing VOIP teleconference on intelligent terminal
CN105991854B (en) * 2014-09-29 2020-03-13 上海兆言网络科技有限公司 System and method for visualizing VoIP (Voice over Internet protocol) teleconference on intelligent terminal
WO2016176951A1 (en) * 2015-05-06 2016-11-10 小米科技有限责任公司 Method and device for optimizing sound signal
CN106205628A (en) * 2015-05-06 2016-12-07 小米科技有限责任公司 Acoustical signal optimization method and device
CN106205628B (en) * 2015-05-06 2018-11-02 小米科技有限责任公司 Voice signal optimization method and device
US10499156B2 (en) 2015-05-06 2019-12-03 Xiaomi Inc. Method and device of optimizing sound signal
CN106210365B (en) * 2015-05-28 2019-06-21 仁宝电脑工业股份有限公司 Videoconference method for regulation of sound volume and system
CN106210365A (en) * 2015-05-28 2016-12-07 仁宝电脑工业股份有限公司 Videoconference method for regulation of sound volume and system
CN108922538A (en) * 2018-05-29 2018-11-30 平安科技(深圳)有限公司 Conferencing information recording method, device, computer equipment and storage medium
CN108922538B (en) * 2018-05-29 2023-04-07 平安科技(深圳)有限公司 Conference information recording method, conference information recording device, computer equipment and storage medium
CN112148182A (en) * 2019-06-28 2020-12-29 华为技术服务有限公司 Interaction control method, terminal and storage medium
CN112148182B (en) * 2019-06-28 2022-10-04 华为技术服务有限公司 Interaction control method, terminal and storage medium
CN115361474A (en) * 2022-08-18 2022-11-18 上海复旦通讯股份有限公司 Method for auxiliary recognition of sound source in telephone conference

Similar Documents

Publication Publication Date Title
US11240598B2 (en) Band-limited beamforming microphone array with acoustic echo cancellation
US8606249B1 (en) Methods and systems for enhancing audio quality during teleconferencing
US8503653B2 (en) Method and apparatus for active speaker selection using microphone arrays and speaker recognition
CA2560034C (en) System for selectively extracting components of an audio input signal
CN103220491B (en) For operating the method for conference system and for the device of conference system
CN103390410A (en) System and method for long-distance telephone conference
US20180359294A1 (en) Intelligent augmented audio conference calling using headphones
US20080273476A1 (en) Device Method and System For Teleconferencing
CN108520754B (en) Noise reduction conference machine
JP2006020314A (en) Stereo microphone processing for audio conferencing
CN103312906A (en) Method and device for realizing teleconference
CN106716526A (en) Method and apparatus for enhancing sound sources
WO2015035785A1 (en) Voice signal processing method and device
CN110708615A (en) Intercommunication system and intercommunication method realized based on TWS earphone
CA3228068A1 (en) Multi-source audio processing systems and methods
US8914007B2 (en) Method and apparatus for voice conferencing
US20100266112A1 (en) Method and device relating to conferencing
CN218217617U (en) Network conference management system for realizing multiple Bluetooth microphone sound boxes
TW201347507A (en) Remote conference system and method
US20120150542A1 (en) Telephone or other device with speaker-based or location-based sound field processing
CN204231481U (en) A kind of intelligent meeting telephone set with nozzle type identification
CN204231472U (en) A kind of intelligent meeting telephone set with feature identification
CN104301564A (en) Intelligent conference telephone with mouth shape identification
JP6392161B2 (en) Audio conference system, audio conference apparatus, method and program thereof
US10419851B2 (en) Retaining binaural cues when mixing microphone signals

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20131113