CN113450821A - Multi-party conference call system, method and computing device based on distributed computing - Google Patents

Multi-party conference call system, method and computing device based on distributed computing Download PDF

Info

Publication number
CN113450821A
CN113450821A CN202110656110.3A CN202110656110A CN113450821A CN 113450821 A CN113450821 A CN 113450821A CN 202110656110 A CN202110656110 A CN 202110656110A CN 113450821 A CN113450821 A CN 113450821A
Authority
CN
China
Prior art keywords
conference call
gsk
voice data
calculation
calculation power
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110656110.3A
Other languages
Chinese (zh)
Other versions
CN113450821B (en
Inventor
朱恩德
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Boluosi Technology Co ltd
Original Assignee
Shenzhen Boluosi Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Boluosi Technology Co ltd filed Critical Shenzhen Boluosi Technology Co ltd
Priority to CN202110656110.3A priority Critical patent/CN113450821B/en
Publication of CN113450821A publication Critical patent/CN113450821A/en
Application granted granted Critical
Publication of CN113450821B publication Critical patent/CN113450821B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5061Partitioning or combining of resources
    • G06F9/5072Grid computing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00Data switching networks
    • H04L12/02Details
    • H04L12/16Arrangements for providing special services to substations
    • H04L12/18Arrangements for providing special services to substations for broadcast or conference, e.g. multicast
    • H04L12/1813Arrangements for providing special services to substations for broadcast or conference, e.g. multicast for computer conferences, e.g. chat rooms
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L2021/02161Number of inputs available containing the signal or the noise to be suppressed
    • G10L2021/02166Microphone arrays; Beamforming

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Software Systems (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Acoustics & Sound (AREA)
  • Human Computer Interaction (AREA)
  • Mathematical Physics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Quality & Reliability (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The embodiment of the invention discloses a multi-party conference call system, a multi-party conference call method and computing equipment based on distributed computing. The system comprises: the microphone array is used for picking up voice data to be processed; the conference call equipment is used for carrying out distributed calculation on the voice data to be processed to obtain target voice data; and a plurality of conference call devices communicate with each other. In the embodiment of the invention, the microphone array is used for picking up the voice, a calculation force handshake mechanism is established at the communication end of the conference communication equipment, and the calculation force of partial algorithm of the microphone is distributed to the point-to-point equipment for distributed calculation, so that the cost of the chip module is reduced.

Description

Multi-party conference call system, method and computing device based on distributed computing
Technical Field
The invention relates to the technical field of sound processing, in particular to a multi-party conference call system based on distributed computing, a method and computing equipment.
Background
With the development of global economy, teleconferences are used more and more frequently. Conventional single-microphone teleconferencing systems have difficulty meeting requirements for pickup quality, echo, and ambient noise suppression. The multi-microphone array teleconference system greatly improves the voice call quality by adopting multi-channel dereverberation and echo cancellation technologies.
However, the multi-microphone array introduces a new technical problem while improving the voice call quality. Due to the high computational complexity of the multi-microphone array, the conference call equipment chip is required to have high computational power, thereby increasing the cost of the chip module.
Disclosure of Invention
In view of the foregoing technical defects, an embodiment of the present invention provides a distributed computing multi-party conference call system, method and computing device.
In order to achieve the above object, in a first aspect, an embodiment of the present invention provides a distributed computing multi-party conference call system, including:
the microphone array is used for picking up voice data to be processed;
the conference call equipment is used for carrying out distributed calculation on the voice data to be processed to obtain target voice data; and a plurality of conference call devices communicate with each other.
Wherein, the actual computational power that the kth conference call equipment possesses is:
Gsk+G*(Tk-Gsk)/(T1+T2+……+Tk+……+Tn-Gs1-……Gsk-……-Gsn)
gsk is the pre-calculation power of the kth device, Gsn is the pre-calculation power of the nth device, G is the calculation power requirement of the microphone array algorithm, Tk is the original calculation power of the kth conference call device, and Tn is the original calculation power of the nth conference call device.
In a second aspect, an embodiment of the present invention provides a multiparty conference call method based on distributed computing, including:
receiving voice data to be processed picked up by a microphone array;
carrying out distributed calculation on the voice data to be processed through a plurality of conference call devices to obtain target voice data; and a plurality of conference call devices communicate with each other.
As a specific embodiment of the present application, before receiving the voice data to be processed picked up by the microphone array, the method further includes:
a computational handshake mechanism between a plurality of conference call devices is established.
As a specific embodiment of the present application, the method further includes:
calculating the actual calculation force of each conference call device;
and performing distributed calculation on the voice data to be processed according to the actual calculation power of each conference call device.
In a third aspect, an embodiment of the present invention provides a computing device, including a processor, an input device, an output device, and a memory, where the processor, the input device, the output device, and the memory are connected to each other, where the memory is used to store a computer program, the computer program includes program instructions, and the processor is configured to call the program instructions to perform the following steps:
receiving voice data to be processed picked up by a microphone array;
calculating the voice data to be processed by adopting actual calculation power to obtain target voice data;
wherein the actual computational power is obtained by computing according to computational power requirements of a plurality of conference call devices and a microphone array; the actual computational power that the kth conference call device possesses is:
Gsk+G*(Tk-Gsk)/(T1+T2+……+Tk+……+Tn-Gs1-……Gsk-……-Gsn)
where Gsk is the pre-calculation power of the kth device, Gsn is the pre-calculation power of the nth device, G is the calculation power requirement of the microphone array algorithm, Tk is the original calculation power of the kth conference call device, and Tn is the original calculation power of the nth conference call device.
The method and the system thereof pick up the voice through the microphone array, establish the computing power handshake mechanism at the communication end of the conference communication equipment, distribute the computing power of partial algorithms of the local machine to the point-to-point equipment for distributed computation, thereby reducing the cost of the chip module.
Drawings
In order to more clearly illustrate the detailed description of the invention or the technical solutions in the prior art, the drawings that are needed in the detailed description of the invention or the prior art will be briefly described below.
FIG. 1 is a computational force calculation schematic according to an embodiment of the present invention;
FIG. 2 is a block diagram of a multi-party conference call system based on distributed computing according to an embodiment of the present invention;
FIG. 3 is a flowchart of a multi-party conference call method based on distributed computing according to an embodiment of the present invention;
FIG. 4 is a block diagram of a computing device provided by an embodiment of the invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Referring to fig. 1, the inventive concept of the present invention is:
and establishing a calculation handshake mechanism at a communication end of the conference call equipment, and distributing the calculation power of the local part of algorithm to point-to-point equipment for distributed calculation. As shown in the following figure, when N devices perform communication, the computation force of the kth device is Tk, the voice of the kth device needs to be broadcast to N-1 devices to complete a conference call, the microphone array computation force part that needs to be completed on the kth device is placed on the N-1 devices to perform distributed computation, the computation force requirement of the microphone array algorithm is G, and the pre-computation force (the computation force required by the algorithm part that must be placed on the local end) is Gsk, then the kth device only needs to have the computation force of Gsk + G (Tk-Gsk)/(T1+ T2+ … … + Tk + … … + TN-Gs1- … … k- … … -Gsn), and the average computation force required by the devices participating in the distributed computation is not lower than the algorithm pre-computation force Gsk.
Based on the above inventive concept, an embodiment of the present invention provides a multiparty conference call system based on distributed computing, as shown in fig. 2, including:
the microphone array is used for picking up voice data to be processed;
the conference call equipment is used for carrying out distributed calculation on the voice data to be processed to obtain target voice data; and a plurality of conference call devices communicate with each other.
Wherein, the actual computational power that the kth conference call equipment possesses is:
Gsk+G*(Tk-Gsk)/(T1+T2+……+Tk+……+Tn-Gs1-……Gsk-……-Gsn)
gsk is the pre-calculation power of the kth device, Gsn is the pre-calculation power of the nth device, G is the calculation power requirement of the microphone array algorithm, Tk is the original calculation power of the kth conference call device, and Tn is the original calculation power of the nth conference call device.
The system of the invention is implemented, the voice is picked up by the microphone array, a calculation force handshake mechanism is established at the communication end of the conference communication equipment, and the calculation force of partial algorithm of the system is distributed on the point-to-point equipment for distributed calculation, thereby reducing the cost of the chip module.
Based on the same inventive concept, an embodiment of the present invention further provides a multiparty conference call method based on distributed computing, as shown in fig. 3, which may include:
and S1, establishing a computational handshake mechanism among the conference call devices.
S2, calculating the actual calculation force of each conference call device.
For example, the actual computational effort of the kth conference call device is:
Gsk+G*(Tk-Gsk)/(T1+T2+……+Tk+……+Tn-Gs1-……Gsk-……-Gsn)
where Gsk is the pre-calculation power of the kth device, Gsn is the pre-calculation power of the nth device, G is the calculation power requirement of the microphone array algorithm, Tk is the original calculation power of the kth conference call device, and Tn is the original calculation power of the nth conference call device.
And S3, receiving the voice data to be processed picked up by the microphone array.
And S4, performing distributed calculation on the voice data to be processed through a plurality of conference call devices to obtain target voice data.
Specifically, distributed computation is performed on the voice data to be processed according to the computed actual computation power of each conference call device, so that the computation power of the local partial algorithm is distributed to other point-to-point equipment merchants, the voice quality is guaranteed, and the cost of the chip module is reduced.
Referring to fig. 4 again, an embodiment of the present invention provides a computing device, including: one or more processors 101, one or more input devices 102, one or more output devices 103, and memory 104, the processors 101, input devices 102, output devices 103, and memory 104 being interconnected via a bus 105. The memory 104 is used for storing a computer program comprising program instructions, the processor 101 being configured for invoking the program instructions for performing the methods of the above-described method embodiment parts.
It should be understood that, in the embodiment of the present invention, the Processor 101 may be a Central Processing Unit (CPU), a deep learning graphics card (e.g., NPU, england GPU, google TPU), other general purpose Processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), an FPGA (Field-Programmable Gate Array) or other Programmable logic device, a discrete Gate or transistor logic device, a discrete hardware component, etc. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like.
The input device 102 may include a keyboard or the like, and the output device 103 may include a display (LCD or the like), a speaker, or the like.
The memory 104 may include read-only memory and random access memory, and provides instructions and data to the processor 101. A portion of the memory 104 may also include non-volatile random access memory. For example, the memory 104 may also store device type information.
While the invention has been described with reference to specific embodiments, the invention is not limited thereto, and various equivalent modifications and substitutions can be easily made by those skilled in the art within the technical scope of the invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (7)

1. A multi-party conference call system based on distributed computing, comprising:
the microphone array is used for picking up voice data to be processed;
the conference call equipment is used for carrying out distributed calculation on the voice data to be processed to obtain target voice data; and a plurality of conference call devices communicate with each other.
2. The multi-party conference call system based on distributed computing of claim 1, wherein the k-th conference call device has an actual computational power of:
Gsk+G*(Tk-Gsk)/(T1+T2+……+Tk+……+Tn-Gs1-……Gsk-……-Gsn)
where Gsk is the pre-calculation power of the kth device, Gsn is the pre-calculation power of the nth device, G is the calculation power requirement of the microphone array algorithm, Tk is the original calculation power of the kth conference call device, and Tn is the original calculation power of the nth conference call device.
3. A multi-party conference call method based on distributed computing is characterized by comprising the following steps:
receiving voice data to be processed picked up by a microphone array;
carrying out distributed calculation on the voice data to be processed through a plurality of conference call devices to obtain target voice data; and a plurality of conference call devices communicate with each other.
4. The distributed computing-based multi-party conference call method of claim 3, wherein prior to receiving the pending voice data picked up by the microphone array, the method further comprises:
a computational handshake mechanism between a plurality of conference call devices is established.
5. The distributed computing-based multi-party conference call method of claim 4, wherein the method further comprises:
calculating the actual calculation force of each conference call device;
and performing distributed calculation on the voice data to be processed according to the actual calculation power of each conference call device.
6. The multi-party conference call method based on distributed computing of claim 5, wherein the k-th conference call device has an actual computational power of:
Gsk+G*(Tk-Gsk)/(T1+T2+……+Tk+……+Tn-Gs1-……Gsk-……-Gsn)
where Gsk is the pre-calculation power of the kth device, Gsn is the pre-calculation power of the nth device, G is the calculation power requirement of the microphone array algorithm, Tk is the original calculation power of the kth conference call device, and Tn is the original calculation power of the nth conference call device.
7. A computing device comprising a processor, an input device, an output device, and a memory, the processor, the input device, the output device, and the memory being interconnected, wherein the memory is configured to store a computer program comprising program instructions, the processor being configured to invoke the program instructions to perform the steps of:
receiving voice data to be processed picked up by a microphone array;
calculating the voice data to be processed by adopting actual calculation power to obtain target voice data;
wherein the actual computational power is obtained by computing according to computational power requirements of a plurality of conference call devices and a microphone array; the actual computational power that the kth conference call device possesses is:
Gsk+G*(Tk-Gsk)/(T1+T2+……+Tk+……+Tn-Gs1-……Gsk-……-Gsn)
where Gsk is the pre-calculation power of the kth device, Gsn is the pre-calculation power of the nth device, G is the calculation power requirement of the microphone array algorithm, Tk is the original calculation power of the kth conference call device, and Tn is the original calculation power of the nth conference call device.
CN202110656110.3A 2021-06-11 2021-06-11 Multi-party conference call system, method and computing device based on distributed computing Active CN113450821B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110656110.3A CN113450821B (en) 2021-06-11 2021-06-11 Multi-party conference call system, method and computing device based on distributed computing

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110656110.3A CN113450821B (en) 2021-06-11 2021-06-11 Multi-party conference call system, method and computing device based on distributed computing

Publications (2)

Publication Number Publication Date
CN113450821A true CN113450821A (en) 2021-09-28
CN113450821B CN113450821B (en) 2024-05-07

Family

ID=77811319

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110656110.3A Active CN113450821B (en) 2021-06-11 2021-06-11 Multi-party conference call system, method and computing device based on distributed computing

Country Status (1)

Country Link
CN (1) CN113450821B (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1543181A (en) * 2003-04-30 2004-11-03 华为技术有限公司 A distributed mix processing method
US20060062368A1 (en) * 2004-09-20 2006-03-23 International Business Machines Corporation N-ways conference system using only participants' telephony devices without external conference server
CN101252452A (en) * 2007-03-31 2008-08-27 红杉树(杭州)信息技术有限公司 Distributed type tone mixing system in multimedia conference
CN101573955A (en) * 2006-12-27 2009-11-04 诺基亚公司 Distributed teleconference multichannel architecture, system, method, and computer program product
CN105068048A (en) * 2015-08-14 2015-11-18 南京信息工程大学 Distributed microphone array sound source positioning method based on space sparsity
CN106027946A (en) * 2015-03-27 2016-10-12 阿尔卡特朗讯企业通信国际公司 Method for allocating video conferencing task to processing device
CN108712584A (en) * 2018-05-16 2018-10-26 中国电子科技集团公司第二十八研究所 A kind of distributed sound mixing method for videoconference
US20200099792A1 (en) * 2018-09-21 2020-03-26 Dolby Laboratories Licensing Corporation Audio conferencing using a distributed array of smartphones

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1543181A (en) * 2003-04-30 2004-11-03 华为技术有限公司 A distributed mix processing method
US20060062368A1 (en) * 2004-09-20 2006-03-23 International Business Machines Corporation N-ways conference system using only participants' telephony devices without external conference server
CN101573955A (en) * 2006-12-27 2009-11-04 诺基亚公司 Distributed teleconference multichannel architecture, system, method, and computer program product
CN101252452A (en) * 2007-03-31 2008-08-27 红杉树(杭州)信息技术有限公司 Distributed type tone mixing system in multimedia conference
CN106027946A (en) * 2015-03-27 2016-10-12 阿尔卡特朗讯企业通信国际公司 Method for allocating video conferencing task to processing device
CN105068048A (en) * 2015-08-14 2015-11-18 南京信息工程大学 Distributed microphone array sound source positioning method based on space sparsity
CN108712584A (en) * 2018-05-16 2018-10-26 中国电子科技集团公司第二十八研究所 A kind of distributed sound mixing method for videoconference
US20200099792A1 (en) * 2018-09-21 2020-03-26 Dolby Laboratories Licensing Corporation Audio conferencing using a distributed array of smartphones

Also Published As

Publication number Publication date
CN113450821B (en) 2024-05-07

Similar Documents

Publication Publication Date Title
US10546593B2 (en) Deep learning driven multi-channel filtering for speech enhancement
CN111951819A (en) Echo cancellation method, device and storage medium
CN108076226B (en) Method for adjusting call quality, mobile terminal and storage medium
US20200396329A1 (en) Acoustic echo cancellation based sub band domain active speaker detection for audio and video conferencing applications
US9143862B2 (en) Correlation based filter adaptation
WO2013009949A1 (en) Microphone array processing system
US20160133269A1 (en) System and method for improving noise suppression for automatic speech recognition
CN110769352B (en) Signal processing method and device and computer storage medium
CN112489670B (en) Time delay estimation method, device, terminal equipment and computer readable storage medium
CN110060696A (en) Sound mixing method and device, terminal and readable storage medium storing program for executing
US20220020386A1 (en) Intelligent noise cancellation system for video conference calls in telepresence rooms
CN110503973B (en) Audio signal transient noise suppression method, system and storage medium
CN113450821A (en) Multi-party conference call system, method and computing device based on distributed computing
CN110996208B (en) Wireless earphone and noise reduction method thereof
WO2024017110A1 (en) Voice noise reduction method, model training method, apparatus, device, medium, and product
CN113329372A (en) Method, apparatus, device, medium and product for vehicle-mounted call
CN112289336A (en) Audio signal processing method and device
CN110517682A (en) Audio recognition method, device, equipment and storage medium
CN113436636A (en) Acoustic echo cancellation method and system based on adaptive filter and neural network
CN114038452A (en) Voice separation method and device
CN107170461B (en) Voice signal processing method and device
CN111681666A (en) Backup of filter coefficient, device and computer storage medium
CN113409802B (en) Method, device, equipment and storage medium for enhancing voice signal
CN114023347A (en) Directional sound pickup method and device, electronic equipment and storage medium
US20230050621A1 (en) Information transmission device, information reception device, information transmission method, recording medium, and system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant