CN105827618A

CN105827618A - Method for improving speech communication quality of fragment asynchronous conference system

Info

Publication number: CN105827618A
Application number: CN201610258803.6A
Authority: CN
Inventors: 王学宗
Original assignee: Sichuan Lianyou Telecom Technology Co Ltd
Current assignee: Sichuan Lianyou Telecom Technology Co Ltd
Priority date: 2016-04-25
Filing date: 2016-04-25
Publication date: 2016-08-03

Abstract

The invention discloses a method for improving the speech communication quality of a fragment asynchronous conference system. The method includes the following steps: S1, a conference speaker records voice statement information through a conference terminal; S2, the conference terminal performs de-noising processing on the voice statement information; S3, the conference terminal sends de-noised voice statement information to a conference cloud server; S4, the conference cloud server performs de-noising processing on received voice statement information; and S5, the conference cloud server sends the voice statement information to participants in a pre-created statement content receiving member list. According to the method of the invention, first de-noising processing is performed on voice statement content on the conference terminal used by the conference speaker, and then, second de-noising processing is performed on the voice statement content in the conference cloud server, and therefore, noise signals in the voice statement content can be greatly inhibited, and the speech communication quality can be improved.

Description

Method for improving the speech communication quality of a fragment asynchronous conference system

Technical field

The present invention relates to the technical field of teleconference speech quality, and in particular to a method for improving the speech communication quality of a fragment asynchronous conference system.

Background art

In today's society, the offices of many enterprises and institutions are distributed all over the world. In day-to-day operations, staff often need to meet to resolve operational problems, but the traditional conference model requires all participants to gather in one place. For an enterprise with many branches, this model suffers from shortcomings such as high cost and poor timeliness, and teleconferencing has arisen as a result.

As the domestic teleconference service market gradually leaves its incubation stage, both teleconferencing and mobile phone conferencing have begun to grow rapidly. Yet despite this bustling market, nearly all teleconference products currently available are confined to traditional synchronous voice teleconferencing, using either VoIP technology or SS7 voice technology. Neither powerful network capabilities nor fragmented time are used effectively, and whichever product is chosen, participants must set aside a large amount of synchronized time to take part in a meeting. In addition, existing teleconferences frequently contain noise signals that make the spoken content hard to make out, which seriously affects meeting quality.

Summary of the invention

The object of the invention is to overcome the deficiencies of the prior art by providing a method for improving the speech communication quality of a fragment asynchronous conference system, in which the voice statement content undergoes two successive de-noising passes so that speech communication quality is improved.

The object of the invention is achieved through the following technical solution: a method for improving the speech communication quality of a fragment asynchronous conference system, comprising the following steps:

S1. A conference speaker records voice statement information through a conference terminal;

S2. The conference terminal performs de-noising processing on the voice statement information;

S3. The conference terminal sends the de-noised voice statement information to a conference cloud server;

S4. The conference cloud server performs de-noising processing on the received voice statement information;

S5. The conference cloud server sends the voice statement information to each participant in a pre-created statement content receiving member list.
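The order of steps S2 to S5 can be sketched as a small pipeline. This is only an illustration under stated assumptions: `run_pipeline`, `halve`, and the recipient names are hypothetical, the upload in S3 is replaced by a plain copy, and the placeholder "denoiser" merely scales samples so that the two passes are easy to trace.

```python
from typing import Callable, Dict, List

Samples = List[float]
Denoiser = Callable[[Samples], Samples]


def run_pipeline(recording: Samples,
                 terminal_denoise: Denoiser,
                 server_denoise: Denoiser,
                 recipients: List[str]) -> Dict[str, Samples]:
    """Apply S2-S5 to one recorded statement (S1 produced `recording`)."""
    after_terminal = terminal_denoise(recording)   # S2: first pass on the terminal
    uploaded = list(after_terminal)                # S3: stand-in for the upload
    after_server = server_denoise(uploaded)        # S4: second pass on the server
    # S5: every participant in the list receives the twice-processed audio
    return {member: list(after_server) for member in recipients}


def halve(samples: Samples) -> Samples:
    """Hypothetical placeholder; it only scales samples and removes no noise."""
    return [0.5 * s for s in samples]


delivered = run_pipeline([1.0, -2.0, 4.0], halve, halve, ["alice", "bob"])
print(delivered)
```

Passing the same callable for both passes mirrors the statement in the source that S2 and S4 use the same de-noising method.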

In step S1, the conference speaker records the voice statement information through a noise-reducing microphone on the conference terminal.

Step S2 includes the following sub-steps:

S21. The input voice statement information is divided into frames, and a Hamming window is applied to each frame;

S22. The time-domain signal is converted to a frequency-domain signal, and the spectral power distribution of the signal is calculated;

S23. Gain oscillation detection is performed on the received signal according to a state decision; after detection ends, the noise-floor spectral power distribution is updated according to the current state;

S24. The posterior signal-to-noise ratio of the spectrum is calculated from the spectral power distribution of the received signal and that of the noise floor, a spectral gain coefficient is calculated by the MMSE estimation method, and the gain coefficient is used to suppress noise;

S25. The frame signal-to-noise ratio is calculated from the de-noised spectral power distribution and the noise-floor spectral power distribution, and the frame signal-to-noise ratios for a recent period of time are stored and updated;

S26. A multi-state transition of the spectral envelope is performed according to the frame signal-to-noise ratio and the recorded spectral-envelope signal-to-noise ratio information, and the input signal is judged to be speech or noise according to the output state of the multi-state transition;

S27. The de-noised signal is converted from the frequency domain back to the time domain and overlap-added with windowing, speech onset protection is applied to the output signal, and de-noised speech or silence is output according to the result of silence detection.
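The gain calculation in S24 and the frame signal-to-noise ratio in S25 can be sketched as follows. The source names an MMSE estimation method but does not give its formula, so this sketch substitutes a simple Wiener-style gain driven by the posterior SNR; the gain floor of 0.1, the function names, and the sample power values are all assumptions made for illustration.

```python
import math
from typing import List


def posterior_snr(signal_power: List[float], noise_power: List[float],
                  eps: float = 1e-12) -> List[float]:
    """Per-bin posterior SNR: received power divided by noise-floor power."""
    return [p / max(n, eps) for p, n in zip(signal_power, noise_power)]


def wiener_gain(gamma: List[float], floor: float = 0.1) -> List[float]:
    """Wiener-style gain per bin (a stand-in, not the patent's MMSE estimator)."""
    gains = []
    for g in gamma:
        xi = max(g - 1.0, 0.0)                     # crude a priori SNR estimate
        gains.append(max(xi / (1.0 + xi), floor))  # floor limits over-suppression
    return gains


def frame_snr_db(denoised_power: List[float], noise_power: List[float],
                 eps: float = 1e-12) -> float:
    """Frame SNR in dB from de-noised power and noise-floor power (S25)."""
    return 10.0 * math.log10(max(sum(denoised_power), eps)
                             / max(sum(noise_power), eps))


signal_power = [4.0, 1.0, 0.5]   # hypothetical power in three frequency bins
noise_power = [1.0, 1.0, 1.0]    # hypothetical noise-floor estimate
gamma = posterior_snr(signal_power, noise_power)
gains = wiener_gain(gamma)
# Gains scale amplitude, so power is scaled by the squared gain.
denoised_power = [g * g * p for g, p in zip(gains, signal_power)]
snr_db = frame_snr_db(denoised_power, noise_power)
print(gamma, gains, round(snr_db, 2))
```

Bins whose power does not exceed the noise floor fall to the gain floor rather than to zero, a common way to limit audible artifacts; the source does not say whether its method does the same.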

Before step S5, the method further includes a step of creating the statement content receiving member list.

Step S5 includes the following sub-steps:

S51. The conference cloud server determines the type of each participant in the pre-created statement content receiving member list;

S52. If the participant is a conference terminal member, the conference cloud server sends the voice statement information to that participant's conference terminal;

S53. If the participant is a mobile phone member, the conference cloud server sends the voice statement information to that participant's mobile phone.
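Sub-steps S51 to S53 amount to a dispatch on participant type. The sketch below illustrates that branching only; the `Member` record, the `MemberType` enum, the `address` field, and the example identifiers are hypothetical, and the actual delivery is reduced to building a routing plan.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List, Tuple


class MemberType(Enum):
    CONFERENCE_TERMINAL = "conference_terminal"
    MOBILE_PHONE = "mobile_phone"


@dataclass
class Member:
    name: str
    kind: MemberType
    address: str  # hypothetical terminal ID or phone number


def route(members: List[Member]) -> List[Tuple[str, str]]:
    """Return (channel, address) pairs for each member of the receiving list."""
    plan = []
    for member in members:                                # S51: inspect each type
        if member.kind is MemberType.CONFERENCE_TERMINAL:
            plan.append(("terminal", member.address))     # S52
        elif member.kind is MemberType.MOBILE_PHONE:
            plan.append(("mobile", member.address))       # S53
        else:
            raise ValueError(f"unknown member type: {member.kind}")
    return plan


receiving_list = [
    Member("alice", MemberType.CONFERENCE_TERMINAL, "T-01"),
    Member("bob", MemberType.MOBILE_PHONE, "P-02"),
]
plan = route(receiving_list)
print(plan)
```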

The de-noising processing applied to the voice statement information in step S2 is the same as that applied in step S4.

The beneficial effects of the invention are as follows: the conference terminal used by the conference speaker performs a first de-noising pass on the voice statement content, and the conference cloud server then performs a second de-noising pass, so that noise signals in the voice statement content are greatly suppressed and speech communication quality is improved.

Brief description of the drawings

Fig. 1 is a flow chart of the method of the invention for improving the speech communication quality of a fragment asynchronous conference system.

Detailed description of the invention

The technical solution of the invention is described in further detail below with reference to the accompanying drawing, but the scope of protection of the invention is not limited to what follows.

As shown in Fig. 1, the method for improving the speech communication quality of a fragment asynchronous conference system comprises the following steps:

S1. A conference speaker records voice statement information through a conference terminal.

In step S1, the conference speaker records the voice statement information through a noise-reducing microphone on the conference terminal. Recording with a noise-reducing microphone reduces the noise in the voice statement information at its source and thereby improves the speech communication quality of the conference.

S2. The conference terminal performs de-noising processing on the voice statement information.

Step S2 includes the following sub-steps:

In step S21, the input voice statement information is divided into frames. Each frame consists of 128 to 512 samples, and each update advances by half a frame length of samples. Every frame is multiplied by a Hamming window whose length equals the frame length.
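The framing described for S21 can be sketched in a few lines. The frame length of 256 is one value inside the stated range of 128 to 512, chosen here as an assumption; the symmetric Hamming formula and the choice to drop trailing samples that do not fill a whole frame are also assumptions, since the source specifies neither.

```python
import math
from typing import List


def hamming(n: int) -> List[float]:
    """Symmetric Hamming window of length n (n >= 2)."""
    return [0.54 - 0.46 * math.cos(2.0 * math.pi * i / (n - 1)) for i in range(n)]


def frame_signal(samples: List[float], frame_len: int = 256) -> List[List[float]]:
    """Split samples into half-overlapping, Hamming-windowed frames."""
    if not 128 <= frame_len <= 512:
        raise ValueError("frame_len must lie in the 128-512 range given in S21")
    hop = frame_len // 2            # each update advances by half a frame
    window = hamming(frame_len)     # window length equals frame length
    frames = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        chunk = samples[start:start + frame_len]
        frames.append([s * w for s, w in zip(chunk, window)])
    return frames


frames = frame_signal([1.0] * 1024, frame_len=256)
print(len(frames), len(frames[0]))
```

With a constant input of ones, each frame is simply the window itself, which makes the tapering at the frame edges easy to inspect.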

In step S22, the received time-domain signal is converted to a frequency-domain signal by fast Fourier transform. In view of the characteristics of human speech production, spectral energy below 300 Hz and above 3400 Hz is set to zero.
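The band limiting in S22 can be illustrated as follows. To stay self-contained the sketch uses a naive discrete Fourier transform in place of a fast Fourier transform (the result is the same, only slower), and it assumes an 8000 Hz sample rate and a 128-sample frame, neither of which is stated in the source. Windowing is omitted so the test tones land exactly on frequency bins.

```python
import cmath
import math
from typing import List


def dft_half(frame: List[float]) -> List[complex]:
    """Naive DFT of a real frame, returning bins 0..N/2 (stand-in for an FFT)."""
    n = len(frame)
    return [sum(x * cmath.exp(-2j * cmath.pi * k * t / n)
                for t, x in enumerate(frame))
            for k in range(n // 2 + 1)]


def band_limit(spectrum: List[complex], sample_rate: float, frame_len: int,
               low_hz: float = 300.0, high_hz: float = 3400.0) -> List[complex]:
    """Zero every bin whose centre frequency lies outside [low_hz, high_hz]."""
    limited = []
    for k, value in enumerate(spectrum):
        freq = k * sample_rate / frame_len
        limited.append(value if low_hz <= freq <= high_hz else 0j)
    return limited


sample_rate = 8000.0   # assumed; the source gives no sample rate
frame_len = 128        # lower end of the 128-512 range from S21
# Test frame: a 1000 Hz tone (in band) plus a 125 Hz hum (below 300 Hz).
frame = [math.sin(2.0 * math.pi * 1000.0 * t / sample_rate)
         + math.sin(2.0 * math.pi * 125.0 * t / sample_rate)
         for t in range(frame_len)]
limited = band_limit(dft_half(frame), sample_rate, frame_len)
spectral_power = [abs(v) ** 2 for v in limited]
peak_bin = max(range(len(spectral_power)), key=spectral_power.__getitem__)
print(peak_bin, peak_bin * sample_rate / frame_len)
```

The hum at 125 Hz is removed entirely while the 1000 Hz tone passes unchanged, which is the behaviour the 300 Hz to 3400 Hz limit is meant to produce.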

S3. The conference terminal sends the de-noised voice statement information to the conference cloud server.

S4. The conference cloud server performs de-noising processing on the received voice statement information.

Before step S5, the method further includes a step of creating the statement content receiving member list. Creating the list comprises the following sub-steps: first, the conference speaker edits a roster of statement content receiving members on the conference terminal; second, the conference terminal encrypts the roster and sends the encrypted roster to the conference cloud server; third, the conference cloud server decrypts the roster and creates the statement content receiving member list from it.

Step S5 includes the following sub-steps:

The above is only a preferred embodiment of the invention. It should be understood that the invention is not limited to the form disclosed herein, that this disclosure is not to be regarded as excluding other embodiments, and that the invention can be used in various other combinations, modifications, and environments and can be modified within the scope contemplated herein through the above teachings or the skill or knowledge of the related art. Changes and variations made by those skilled in the art that do not depart from the spirit and scope of the invention shall all fall within the scope of protection of the appended claims.

Claims

1. A method for improving the speech communication quality of a fragment asynchronous conference system, characterized in that it comprises the following steps:

2. The method for improving the speech communication quality of a fragment asynchronous conference system according to claim 1, characterized in that: in step S1, the conference speaker records the voice statement information through a noise-reducing microphone on the conference terminal.

3. The method for improving the speech communication quality of a fragment asynchronous conference system according to claim 1, characterized in that: step S2 includes the following sub-steps:

4. The method for improving the speech communication quality of a fragment asynchronous conference system according to claim 1, characterized in that: the method further includes, before step S5, a step of creating the statement content receiving member list.

5. The method for improving the speech communication quality of a fragment asynchronous conference system according to claim 1, characterized in that: step S5 includes the following sub-steps:

6. The method for improving the speech communication quality of a fragment asynchronous conference system according to claim 1, characterized in that: the de-noising processing applied to the voice statement information in step S2 is the same as that applied in step S4.