CN110010130A

CN110010130A - A kind of intelligent method towards participant's simultaneous voice transcription text

Info

Publication number: CN110010130A
Application number: CN201910263845.2A
Authority: CN
Inventors: 汪丹
Original assignee: Anhui Broad Sound Technology Co Ltd
Current assignee: Anhui Broad Sound Technology Co Ltd
Priority date: 2019-04-03
Filing date: 2019-04-03
Publication date: 2019-07-12

Abstract

The invention discloses a kind of intelligent methods towards participant's simultaneous voice transcription text, the following steps are included: participant microphone of registering to control centre the position of typing oneself, name information in a manner of voice, control centre stores the voice messaging of input by the sequence of input, and control centre pre-processes the voice messaging of typing；Pretreated voice messaging is converted into spectrogram and text information, and spectrogram and text information are stored in the information bank of the participant；Control centre's spectrogram is divided into several groups framing frequency spectrum；Control centre carries out feature extraction to framing frequency spectrum；Store participant's spectral energy values difference DelN；Identify spokesman's identity；Spectral energy values difference DelNf is calculated, and is compared；Formation forms minutes document.The present invention can be realized meeting overall process record, not can recognize that the identity for the spokesman that attends a meeting.

Description

A kind of intelligent method towards participant's simultaneous voice transcription text

Technical field

The invention belongs to intelligent sound technical fields, and in particular to a kind of intelligence towards participant's simultaneous voice transcription text It can method.

Background technique

For some important meetings, needs to record the full content of meeting, be consumed by the way of manual record Take manpower, the existing technology recorded automatically to conference content, the voice signal for usually issuing participant are direct at present Text character is converted into be saved.

It realizes in process of the present invention, at least there are the following problems in the related technology for inventor's discovery: by the voice of participant It is lengthy that signal is directly changed into the minutes that text character is saved and formed, it is difficult to identify each spokesman Any content said.

Summary of the invention

It is an object of the invention to overcome above-mentioned the deficiencies in the prior art, one kind is provided towards participant's simultaneous voice transcription The intelligent method of text.

A kind of intelligent method towards participant's simultaneous voice transcription text, it is characterised in that: the following steps are included:

1) participant microphone of registering to control centre the position of typing oneself, name information in a manner of voice, in control The heart stores the voice messaging of input by the sequence of input, and control centre pre-processes the voice messaging of typing；

2) pretreated voice messaging is converted into spectrogram and text information, and spectrogram and text information are stored in this and attended a meeting The information bank of person；

3) spectrogram of each participant is carried out framing by control centre at a fixed time interval, spectrogram is divided into several Component frame frequency spectrum；

4) control centre to framing frequency spectrum carry out feature extraction, the project of feature extraction include: framing frequency spectrum mass center Ci(i=1, 2 ... n), spectral energy values Ni(i=1,2 ... n), spectral energy values difference DelNd；

5) participant's spectral energy values difference DelN is stored in the information bank of the participant；

6) when participant makes a speech, by the microphone on attending a banquet to control centre by input speech voice, voice messaging of making a speech is through controlling Center processed is pre-processed, and pretreated speech voice messaging is converted into speech spectrogram and text information of making a speech, speech text Word information is stored into minutes document；

7) control centre presses step 3), 4) calculates spectral energy values difference DelNf；

8) DelNf is compared with the DelNd being stored in participant's information bank for control centre, DelNf and participant's information When the threshold value between DelNd in library is less than setting value, spokesman's identity is determined, and increase in the front of minutes document Spokesman's name；

9) after completing meeting, minutes document is formed, and print, sign.

Preferably, spectral energy values Ni in the step 4)=, DelNd=Ni-N(i-1).

Preferably, the pre-treatment step is to remove noise, signal amplification.

Preferably, the fixed time interval is 10-20ms.

Compared with prior art, beneficial effects of the present invention:

In the use of the present invention, the present invention knows otherwise to confirm the identity of each participant by tone color, so as to Conference content is corresponded to each participant, minutes is avoided to be difficult to differentiate the defect of speaker；Pass through pretreatment Technology, to be removed dryness and be amplified to signal, it is ensured that the accuracy of signal；The sound of participant is identified by intelligent automated manner Color has the advantages that accuracy is good；Participant is when carrying out typing identity, it is only necessary to carry out once, in the later period in use, i.e. It does not need to carry out typing.

Specific embodiment

9) after completing meeting, minutes document is formed, and print, sign.

Preferably, spectral energy values Ni=* MERGEFORMAT, DelNd=Ni-N(i-1 in the step 4)).

Preferably, the pre-treatment step is to remove noise, signal amplification.

Preferably, the fixed time interval is 10-20ms.

The working principle of the invention is:

In the use of the present invention, by calculating spectral energy values difference, to identify the identity of spokesman.

It should be noted that present invention specific implementation is not subject to the restrictions described above, as long as using side of the invention The various unsubstantialities that method conception and technical scheme carry out improve, or the not improved conception and technical scheme by invention are directly answered It is within the scope of the present invention for other occasions.

Claims

1. a kind of intelligent method towards participant's simultaneous voice transcription text, it is characterised in that: the following steps are included:

9) after completing meeting, minutes document is formed, and print, sign.

2. a kind of intelligent method towards participant's simultaneous voice transcription text as described in claim 1, it is characterised in that: institute State spectral energy values Ni in step 4)=, DelNd=Ni-N(i-1).

3. a kind of intelligent method towards participant's simultaneous voice transcription text as described in claim 1, it is characterised in that: institute Stating pre-treatment step is to remove noise, signal amplification.

4. a kind of intelligent method towards participant's simultaneous voice transcription text as described in claim 1, it is characterised in that: institute Stating fixed time interval is 10-20ms.