CN106487532A

CN106487532A - A kind of voice automatic record method

Info

Publication number: CN106487532A
Application number: CN201510530240.7A
Authority: CN
Inventors: 龙水维
Original assignee: Chongqing West-Line Technology Co Ltd
Current assignee: Chongqing West-Line Technology Co Ltd
Priority date: 2015-08-26
Filing date: 2015-08-26
Publication date: 2017-03-08

Abstract

The invention discloses a kind of voice automatic record method, including：Pre-enter speech roster；Obtain voice signal；The conversion of described voice signal is identified as by corresponding Word message by voice conversion software, and stored, wherein, when identifying conversion voice signal first, choose and to the name corresponding to this voice signal from the speech roster pre-entering, and extract the tamber characteristic of this voice messaging, the name chosen is associated with described tamber characteristic simultaneously；When carrying out the conversion identification of next voice signal, judging that whether its tone color is identical with the tone color of formerly described extraction, if identical, the name being associated with this tone color is shown in beginning of the sentence, if differing, entering the operation repeating to identifying conversion voice signal first；Described Word message is ranked up in a document show.This relatively existing professional equipment and professional, the cost of implementation of the present invention is lower, is substantially all and can realize in various occasions.

Description

A kind of voice automatic record method

Technical field

The present invention relates to signal processing technology field, more particularly, it is a kind of voice automatic record method.

Background technology

In meeting scene, it is all typically now the voice data to record meeting scene by the way of recording, or by scene Scribe artificial input record is carried out by special recording equipment.Although both modes are not asked in realization Topic, but, if adopting the former method, cannot timely on-the-spot meeting record manuscript, need after the meeting by manually to receive The mode listening session recording is recorded；If using the method for the latter, needing to buy special recording equipment, and need specially The typist of industry just enables the synchronous recording at meeting scene, and high cost is it is impossible to popularize, only in some official's meeting occasions Just can use.

Therefore, how quickly to realize the speech of meeting scene spokesman is automatically recorded, and do not need the record of specialty Personnel and the equipment of specialty, have just become a great problem of the art.

Content of the invention

In view of the above problems, the present invention provides a kind of voice automatic record method, for realizing in the situation not needing professional The speech content at the lower meeting of record automatically scene.Its concrete technical scheme is：

A kind of voice automatic record method, including：Pre-enter speech roster；Obtain voice signal；Software is converted by voice The conversion of described voice signal is identified as corresponding Word message, and is stored, wherein, identify conversion voice signal first When, choose and to the name corresponding to this voice signal from the speech roster pre-entering, and extract the sound of this voice messaging The name chosen is associated with described tamber characteristic by color characteristic simultaneously；When carrying out the conversion identification of next voice signal, Judge that whether its tone color is identical with the tone color of formerly described extraction, if identical, the name being associated with this tone color is shown in beginning of the sentence, If differing, enter the operation repeating to identifying conversion voice signal first；Described Word message is ranked up in a document Display.

Preferably, the method obtaining described voice signal includes：Spoken sounds are converted into by described voice signal by mike.

Preferably, the method for sequencing display Word message includes in a document：Come to described literary composition according to every section of continuous voice signal Word information carries out segmentation sequencing display.

Preferably, described according to the method that every section of continuous voice signal to carry out segmentation sequencing display to described Word message it is： When described voice conversion software completes previous voice signal conversion identification, start timing；Arrive in described voice conversion software receipt During current speech signal, stop timing, and be calculated the time difference between current speech signal and previous voice signal；Judge Whether the described time be more than default time difference, if so, the Word message that current speech signal conversion identification obtains is carried out point Section sequencing display；If it is not, the Word message that then current speech signal conversion identification obtains carries out arranged in sequence showing.

Hinge structure, the present invention does not need the typewriting apparatuss turning it is not required that the typing personnel of specialty, can be achieved with to meeting The automatic conversion of view live speeches content, thus obtaining the Word message corresponding with described speech content, and records, with When can also know clearly that every words are who says in the Word message showing.This relatively existing professional equipment and specially Industry personnel, the cost of implementation of the present invention is lower, and function is more preferably, in hgher efficiency.

Brief description

For the scheme being illustrated more clearly that in the embodiment of the present invention, below will be attached to use required described in specific embodiment Figure be briefly described it should be apparent that, drawings in the following description are only some embodiments of the present invention, for this area For technical staff, on the premise of not paying creative work, other accompanying drawings can also be obtained according to these accompanying drawings.

A kind of flowchart of voice automatic record method that Fig. 1 provides for the present invention.

Specific embodiment

Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out clearly and completely Description is it is clear that described embodiment is only a part of embodiment of the present invention, rather than whole embodiments.Based on this Inventive embodiment, all other enforcement that those of ordinary skill in the art are obtained on the premise of not making creative work Example, broadly falls into the scope of protection of the invention.

The optimized integration of the present invention is existing speech software.It is all more ripe that existing speech recognition technology and dress change technology, Phonitic entry method or speech recognition technology are had on the mobile terminals such as mobile phone, as long as can be achieved with setting for terminal by voice For being controlled or operating, as long as or for terminal unit speech, the content that this just can be talked by the software on terminal unit It is automatically recognized as word, and shown.

Under the background of above prior art, in the present embodiment, give a kind of voice automatic record method, below will be to the method It is described in detail.

See Fig. 1, give a kind of flowchart of voice automatic record method, the method comprising the steps of：

Step S1, pre-enters speech roster.

Step S2, obtains voice signal.

In being embodied as, the speech content of spokesman in meeting scene can be passed through by Mike by equipment such as mike and audio amplifiers Wind is changing into the signal of telecommunication, and plays back in sound-box device.Or directly obtained in the speech of spokesman by mike Hold, that is, directly by required voice signal in spoken sounds conversion cost embodiment.In this and prior art, speak against mobile phone, Come to mobile phone transmission voice signal to be a reason with this.

Step S3, described voice signal is changed into corresponding Word message, and is stored, and wherein, identifies conversion first During voice signal, choose and to the name corresponding to this voice signal from the speech roster pre-entering, and extract this voice The name chosen is associated with described tamber characteristic by the tamber characteristic of information simultaneously.

In being embodied as, because in meeting, everyone is not continuous it is entirely possible to be a people one time of speech, deposit In this situation alternately, then shown by not can know that in such scheme, Word message out is to be said by whom, because Gone back in this present embodiment method being overcome.

First, pre-enter speech roster；

Then, when identifying conversion voice signal first, choose from the speech roster pre-entering and to this voice signal institute Corresponding name, and extract the tamber characteristic of this voice messaging, the name chosen is associated with described tamber characteristic simultaneously；

Then, when carrying out the identification of next voice signal, judge whether its tone color is identical with the tone color of formerly described extraction, if phase Same then the name being associated with this tone color is shown in beginning of the sentence；If differing, entering and repeating to convert voice signal to identifying first Operation.Rapidly and efficiently can know that by said method whom the spokesman of every words is in a document.

For example, there is " Zhang San ", " Li Si " and " king five " in the list of typing in advance, after meeting starts, voice is changed Identification software is when carrying out voice signal identification first, if the first voice signal is the speech content of speech " Zhang San ", that Choose Zhang San from list, afterwards, if " Zhang San " talks always, then from the beginning of second voice signal, then meeting Automatically labelling " Zhang San " before the Word message identifying.In addition, if from the beginning of second or the 3rd voice signal After be other people speeches, then by according to the same to the speech recognition labelling of Zhang San, repeat no more here.

In being embodied as, software can be converted by voice and convert voice signals into corresponding Word message.By this side Formula will identify that the Word message coming saves, and reaches the purpose of record.

Step S4, when carrying out the conversion identification of next voice signal, judges whether its tone color is identical with the tone color of formerly described extraction, If identical, the name being associated with this tone color is shown in beginning of the sentence, if differing, entering and repeating to convert voice letter to identifying first Number operation.

Step S5, described Word message is ranked up in a document show.

In being embodied as, because the purpose of meeting is to record the speech content in meeting in a document, accordingly, it would be desirable to The Word message identifying is ranked up show, that is to say that typesetting shows.

Pass through said method can automatically record the speech content at meeting scene well, without by professional Typewriting apparatuss and the typist of specialty, can be carried out in any place.Relatively existing method, cost of the present invention is lower, reliable Property is higher.

In being embodied as, absolutely can not correctly identify the speech content in voice signal due to reason switching software, Or can there is a certain proportion of wrong word, in order to overcome this problem further, present invention also offers implementation below.

When in a document Word message is ranked up with display, described Word message includes correct Word message and wrong word letter Breath.Therefore, the present invention is marked using to the dislocation Word message of sequencing display in a document, in being embodied as, permissible Carry out underlined in red labelling, or be changed font color being marked, be marked also or by way of annotation.

Meanwhile, the wrong Word message of this labelling is associated linking with the voice signal of corresponding described mistake Word message, when When clicking on wrong Word message, the voice signal of described for correspondence mistake Word message is recognized for, and right in a document The secondary Word message identifying carries out editable and shows.So, in being embodied as it is possible in being shown by editable Wrong Word message is carried out with corrigendum editor, to obtain the Word message corrected, and is replaced with the Word message of described corrigendum described Mistake Word message.

For example, have a voice signal A, the content said in A (hereinafter referred content B) be " weather of today is very good, I Go together to stroll in the park ", voice conversion software after voice signal A is identified with conversion, (the hereafter letter of the content that obtains Claim content C) be " just, we go to close public member's plate the weather of today together ", then can see, wherein " just " and " closing public member's plate " is the Word message (hereinafter referred wrong content D) of mistake, therefore, when being ranked up display in a document, Wrong content D " just " and " closing public member's plate " can be marked.Now, can be by manually coming to mistake at meeting scene Content D is corrected, and the method for corrigendum is exactly to click on the wrong content D being marked in document, because wrong content D closes Connection is linked to voice messaging A, then voice messaging A is recognized for change by starting voice conversion knowledge software, and in literary composition Carry out editable and show in shelves, such as, be shown as " very good, whole good, true, pin, earn, demonstrate,prove ... ", from editable content In have correct word, then " very good " can be clicked directly on and selected, then afterwards " very good " will replace mistake in Hold D " just ", if correctly not corresponding to word in editable content, be such as shown that " whole good, true, pin, Earn, demonstrate,prove ... ", then can first click on "true", then next automatically can show the word with "true" pairing again, As " good, bold and unconstrained, number ... ", at this point it is possible to reselection " good ", the corrigendum of wrong Word message is completed with this.

Further, in being embodied as, the speech content of on-the-spot meeting can also be carried out with live forwarding in real time, specifically real Applying method may be referred to implementation below.

Individual in being embodied as, after identification obtains correct Word message, can described Word message will be sent by network Live display in real time is carried out to website.Specifically can be achieved in that, the document showing described Word message will be used for even first It is connected in a website, whenever having Word message to be identified sequencing display in a document after dress changes, Word message is detected, The described Word message of institute does not comprise mistake Word message, then sends Word message and is shown to website.So meeting it Outer other people just can watch Word message by refreshing this website.

In addition, if detect identified conversion after Word message includes mistake Word message when, then can be artificial After it is corrected, by manually by corrigendum after Word message send to website, website after receiving this Word message, Before the same, will show to receiving Word message.

In being embodied as, the Word message after identification can also be sent in the social software to mobile terminal, come with this Forwarded in real time in certain circle or in scope to conference content with live.For example, it is possible to Word message is passed through wechat Software is sent in group automatically.At this point it is possible to be achieved in, first log into wechat account；Then needs will be chosen The group being transmitted or good friend, then now by document associations of described display Word message to the group that chosen or good In the transmission backstage of friend, when there being Word message to be identified sequencing display in a document after dress changes, first Word message is detected, If described Word message does not comprise mistake Word message, then sent Word message to group or good friend；If described literary composition Word information includes wrong Word message, then after artificial corrigendum, by manually sending it in group or good friend.Or Person periodically can also detected to shown Word message, once detect in shown Word message not including Dislocation Word message, then be just automatically sent in group or good friend.

In being embodied as, when automatically being sent to Word message, dependence to be the detection to Word message to identify whether Comprise mistake Word message, wherein detect that the standard of wrong Word message is just to detect whether that shown Word message includes labelling, If Word message has description of symbols, if it is wrong Word message it is to be understood that by manually coming wrong Word message is carried out After corrigendum, can cancel to original to labelling, and then be detected, thus realizing automatically sending.

Further, in a document sequencing display Word message when, the sequence to Word message can be according to every section of continuous language Being ranked up, concrete methods of realizing is message number：When described voice conversion software completes previous voice signal conversion identification, Start timing；When described voice conversion software receipt is to current speech signal, stops timing, and be calculated current speech letter Time difference number and previous voice signal between；Judge whether the described time is more than default time difference, if so, to current language The Word message that message conversion identification obtains carries out segmentation sequencing display；If it is not, then current speech signal conversion identification obtains Word message carries out arranged in sequence and shows.

The principle of preceding method is that the people of general speech can make a short pause after finishing one section of word and put off until some time later the second word, then Can also talk about according to every to carry out compartment for one section and show in the form that Word message is ranked up, pass through between every words Compartment shows.For example, when first dress changes just to be identified to voice signal, current speech signal and previous voice signal are judged Time of origin difference, if time difference is more than 0.5 second, then the Word message swapping out with regard to the identified dress of current speech signal Carry out compartment to show.

Certainly it is to be understood that specific interval time can formerly be arranged, not necessarily 0.5 second, due to everyone Word speed inconsistent, therefore it provides formerly interval time setting, more preferable word-information display effect can be reached.

From the point of view of to sum up, the present invention does not need the typewriting apparatuss turning it is not required that the typing personnel of specialty, can be achieved with existing to meeting The automatic conversion of field speech content, thus obtaining the Word message corresponding with described speech content, and records.This is relatively Existing professional equipment and professional, the cost of implementation of the present invention is lower, is substantially all and can realize in various occasions.

Above-mentioned the specific embodiment only principle of the illustrative present invention and its effect, not for the restriction present invention.Any person skilled in the art all may be used Without prejudice under the spirit and the scope of the present invention, modifications and changes are carried out to above-described embodiment.Therefore, such as have in art and generally know All equivalent modifications or change that the knowledgeable is completed under without departing from disclosed spirit and technological thought, must be by the claim of the present invention Covered.

Claims

1. a kind of voice automatic record method is it is characterised in that include：

Pre-enter speech roster；

Obtain voice signal；

The conversion of described voice signal is identified as by corresponding Word message by voice conversion software, and is stored, wherein, First during identification conversion voice signal, choose and to the name corresponding to this voice signal from the speech roster pre-entering Word, and extract the tamber characteristic of this voice messaging, the name chosen is associated with described tamber characteristic simultaneously；

When carrying out the conversion identification of next voice signal, judge whether its tone color is identical with the tone color of formerly described extraction, if phase Same then the name being associated with this tone color is shown in beginning of the sentence, if differing, entering and repeating to identifying conversion voice signal first Operation；

Described Word message is ranked up in a document show.

2. voice automatic record method according to claim 1 is it is characterised in that the method obtaining described voice signal includes： Spoken sounds are converted into by described voice signal by mike.

3. voice automatic record method according to claim 1 is it is characterised in that the side of sequencing display Word message in a document Method includes：To carry out segmentation sequencing display to described Word message according to every section of continuous voice signal.

4. voice automatic record method according to claim 3 is it is characterised in that described come according to every section of continuous voice signal The method carrying out segmentation sequencing display to described Word message is：

When described voice conversion software completes previous voice signal conversion identification, start timing；

Described voice conversion software receipt to current speech signal when, stop timing, and be calculated current speech signal with Time difference between previous voice signal；

Judge whether the described time is more than default time difference, the word if so, current speech signal conversion identification being obtained Information carries out segmentation sequencing display；If it is not, the Word message that then current speech signal conversion identification obtains carries out arranged in sequence showing Show.