CN106487532A - A kind of voice automatic record method - Google Patents

A kind of voice automatic record method Download PDF

Info

Publication number
CN106487532A
CN106487532A CN201510530240.7A CN201510530240A CN106487532A CN 106487532 A CN106487532 A CN 106487532A CN 201510530240 A CN201510530240 A CN 201510530240A CN 106487532 A CN106487532 A CN 106487532A
Authority
CN
China
Prior art keywords
voice signal
voice
word message
conversion
signal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510530240.7A
Other languages
Chinese (zh)
Inventor
龙水维
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chongqing West-Line Technology Co Ltd
Original Assignee
Chongqing West-Line Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chongqing West-Line Technology Co Ltd filed Critical Chongqing West-Line Technology Co Ltd
Priority to CN201510530240.7A priority Critical patent/CN106487532A/en
Publication of CN106487532A publication Critical patent/CN106487532A/en
Pending legal-status Critical Current

Links

Abstract

The invention discloses a kind of voice automatic record method, including:Pre-enter speech roster;Obtain voice signal;The conversion of described voice signal is identified as by corresponding Word message by voice conversion software, and stored, wherein, when identifying conversion voice signal first, choose and to the name corresponding to this voice signal from the speech roster pre-entering, and extract the tamber characteristic of this voice messaging, the name chosen is associated with described tamber characteristic simultaneously;When carrying out the conversion identification of next voice signal, judging that whether its tone color is identical with the tone color of formerly described extraction, if identical, the name being associated with this tone color is shown in beginning of the sentence, if differing, entering the operation repeating to identifying conversion voice signal first;Described Word message is ranked up in a document show.This relatively existing professional equipment and professional, the cost of implementation of the present invention is lower, is substantially all and can realize in various occasions.

Description

A kind of voice automatic record method
Technical field
The present invention relates to signal processing technology field, more particularly, it is a kind of voice automatic record method.
Background technology
In meeting scene, it is all typically now the voice data to record meeting scene by the way of recording, or by scene Scribe artificial input record is carried out by special recording equipment.Although both modes are not asked in realization Topic, but, if adopting the former method, cannot timely on-the-spot meeting record manuscript, need after the meeting by manually to receive The mode listening session recording is recorded;If using the method for the latter, needing to buy special recording equipment, and need specially The typist of industry just enables the synchronous recording at meeting scene, and high cost is it is impossible to popularize, only in some official's meeting occasions Just can use.
Therefore, how quickly to realize the speech of meeting scene spokesman is automatically recorded, and do not need the record of specialty Personnel and the equipment of specialty, have just become a great problem of the art.
Content of the invention
In view of the above problems, the present invention provides a kind of voice automatic record method, for realizing in the situation not needing professional The speech content at the lower meeting of record automatically scene.Its concrete technical scheme is:
A kind of voice automatic record method, including:Pre-enter speech roster;Obtain voice signal;Software is converted by voice The conversion of described voice signal is identified as corresponding Word message, and is stored, wherein, identify conversion voice signal first When, choose and to the name corresponding to this voice signal from the speech roster pre-entering, and extract the sound of this voice messaging The name chosen is associated with described tamber characteristic by color characteristic simultaneously;When carrying out the conversion identification of next voice signal, Judge that whether its tone color is identical with the tone color of formerly described extraction, if identical, the name being associated with this tone color is shown in beginning of the sentence, If differing, enter the operation repeating to identifying conversion voice signal first;Described Word message is ranked up in a document Display.
Preferably, the method obtaining described voice signal includes:Spoken sounds are converted into by described voice signal by mike.
Preferably, the method for sequencing display Word message includes in a document:Come to described literary composition according to every section of continuous voice signal Word information carries out segmentation sequencing display.
Preferably, described according to the method that every section of continuous voice signal to carry out segmentation sequencing display to described Word message it is: When described voice conversion software completes previous voice signal conversion identification, start timing;Arrive in described voice conversion software receipt During current speech signal, stop timing, and be calculated the time difference between current speech signal and previous voice signal;Judge Whether the described time be more than default time difference, if so, the Word message that current speech signal conversion identification obtains is carried out point Section sequencing display;If it is not, the Word message that then current speech signal conversion identification obtains carries out arranged in sequence showing.
Hinge structure, the present invention does not need the typewriting apparatuss turning it is not required that the typing personnel of specialty, can be achieved with to meeting The automatic conversion of view live speeches content, thus obtaining the Word message corresponding with described speech content, and records, with When can also know clearly that every words are who says in the Word message showing.This relatively existing professional equipment and specially Industry personnel, the cost of implementation of the present invention is lower, and function is more preferably, in hgher efficiency.
Brief description
For the scheme being illustrated more clearly that in the embodiment of the present invention, below will be attached to use required described in specific embodiment Figure be briefly described it should be apparent that, drawings in the following description are only some embodiments of the present invention, for this area For technical staff, on the premise of not paying creative work, other accompanying drawings can also be obtained according to these accompanying drawings.
A kind of flowchart of voice automatic record method that Fig. 1 provides for the present invention.
Specific embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out clearly and completely Description is it is clear that described embodiment is only a part of embodiment of the present invention, rather than whole embodiments.Based on this Inventive embodiment, all other enforcement that those of ordinary skill in the art are obtained on the premise of not making creative work Example, broadly falls into the scope of protection of the invention.
The optimized integration of the present invention is existing speech software.It is all more ripe that existing speech recognition technology and dress change technology, Phonitic entry method or speech recognition technology are had on the mobile terminals such as mobile phone, as long as can be achieved with setting for terminal by voice For being controlled or operating, as long as or for terminal unit speech, the content that this just can be talked by the software on terminal unit It is automatically recognized as word, and shown.
Under the background of above prior art, in the present embodiment, give a kind of voice automatic record method, below will be to the method It is described in detail.
See Fig. 1, give a kind of flowchart of voice automatic record method, the method comprising the steps of:
Step S1, pre-enters speech roster.
Step S2, obtains voice signal.
In being embodied as, the speech content of spokesman in meeting scene can be passed through by Mike by equipment such as mike and audio amplifiers Wind is changing into the signal of telecommunication, and plays back in sound-box device.Or directly obtained in the speech of spokesman by mike Hold, that is, directly by required voice signal in spoken sounds conversion cost embodiment.In this and prior art, speak against mobile phone, Come to mobile phone transmission voice signal to be a reason with this.
Step S3, described voice signal is changed into corresponding Word message, and is stored, and wherein, identifies conversion first During voice signal, choose and to the name corresponding to this voice signal from the speech roster pre-entering, and extract this voice The name chosen is associated with described tamber characteristic by the tamber characteristic of information simultaneously.
In being embodied as, because in meeting, everyone is not continuous it is entirely possible to be a people one time of speech, deposit In this situation alternately, then shown by not can know that in such scheme, Word message out is to be said by whom, because Gone back in this present embodiment method being overcome.
First, pre-enter speech roster;
Then, when identifying conversion voice signal first, choose from the speech roster pre-entering and to this voice signal institute Corresponding name, and extract the tamber characteristic of this voice messaging, the name chosen is associated with described tamber characteristic simultaneously;
Then, when carrying out the identification of next voice signal, judge whether its tone color is identical with the tone color of formerly described extraction, if phase Same then the name being associated with this tone color is shown in beginning of the sentence;If differing, entering and repeating to convert voice signal to identifying first Operation.Rapidly and efficiently can know that by said method whom the spokesman of every words is in a document.
For example, there is " Zhang San ", " Li Si " and " king five " in the list of typing in advance, after meeting starts, voice is changed Identification software is when carrying out voice signal identification first, if the first voice signal is the speech content of speech " Zhang San ", that Choose Zhang San from list, afterwards, if " Zhang San " talks always, then from the beginning of second voice signal, then meeting Automatically labelling " Zhang San " before the Word message identifying.In addition, if from the beginning of second or the 3rd voice signal After be other people speeches, then by according to the same to the speech recognition labelling of Zhang San, repeat no more here.
In being embodied as, software can be converted by voice and convert voice signals into corresponding Word message.By this side Formula will identify that the Word message coming saves, and reaches the purpose of record.
Step S4, when carrying out the conversion identification of next voice signal, judges whether its tone color is identical with the tone color of formerly described extraction, If identical, the name being associated with this tone color is shown in beginning of the sentence, if differing, entering and repeating to convert voice letter to identifying first Number operation.
Step S5, described Word message is ranked up in a document show.
In being embodied as, because the purpose of meeting is to record the speech content in meeting in a document, accordingly, it would be desirable to The Word message identifying is ranked up show, that is to say that typesetting shows.
Pass through said method can automatically record the speech content at meeting scene well, without by professional Typewriting apparatuss and the typist of specialty, can be carried out in any place.Relatively existing method, cost of the present invention is lower, reliable Property is higher.
In being embodied as, absolutely can not correctly identify the speech content in voice signal due to reason switching software, Or can there is a certain proportion of wrong word, in order to overcome this problem further, present invention also offers implementation below.
When in a document Word message is ranked up with display, described Word message includes correct Word message and wrong word letter Breath.Therefore, the present invention is marked using to the dislocation Word message of sequencing display in a document, in being embodied as, permissible Carry out underlined in red labelling, or be changed font color being marked, be marked also or by way of annotation.
Meanwhile, the wrong Word message of this labelling is associated linking with the voice signal of corresponding described mistake Word message, when When clicking on wrong Word message, the voice signal of described for correspondence mistake Word message is recognized for, and right in a document The secondary Word message identifying carries out editable and shows.So, in being embodied as it is possible in being shown by editable Wrong Word message is carried out with corrigendum editor, to obtain the Word message corrected, and is replaced with the Word message of described corrigendum described Mistake Word message.
For example, have a voice signal A, the content said in A (hereinafter referred content B) be " weather of today is very good, I Go together to stroll in the park ", voice conversion software after voice signal A is identified with conversion, (the hereafter letter of the content that obtains Claim content C) be " just, we go to close public member's plate the weather of today together ", then can see, wherein " just " and " closing public member's plate " is the Word message (hereinafter referred wrong content D) of mistake, therefore, when being ranked up display in a document, Wrong content D " just " and " closing public member's plate " can be marked.Now, can be by manually coming to mistake at meeting scene Content D is corrected, and the method for corrigendum is exactly to click on the wrong content D being marked in document, because wrong content D closes Connection is linked to voice messaging A, then voice messaging A is recognized for change by starting voice conversion knowledge software, and in literary composition Carry out editable and show in shelves, such as, be shown as " very good, whole good, true, pin, earn, demonstrate,prove ... ", from editable content In have correct word, then " very good " can be clicked directly on and selected, then afterwards " very good " will replace mistake in Hold D " just ", if correctly not corresponding to word in editable content, be such as shown that " whole good, true, pin, Earn, demonstrate,prove ... ", then can first click on "true", then next automatically can show the word with "true" pairing again, As " good, bold and unconstrained, number ... ", at this point it is possible to reselection " good ", the corrigendum of wrong Word message is completed with this.
Further, in being embodied as, the speech content of on-the-spot meeting can also be carried out with live forwarding in real time, specifically real Applying method may be referred to implementation below.
Individual in being embodied as, after identification obtains correct Word message, can described Word message will be sent by network Live display in real time is carried out to website.Specifically can be achieved in that, the document showing described Word message will be used for even first It is connected in a website, whenever having Word message to be identified sequencing display in a document after dress changes, Word message is detected, The described Word message of institute does not comprise mistake Word message, then sends Word message and is shown to website.So meeting it Outer other people just can watch Word message by refreshing this website.
In addition, if detect identified conversion after Word message includes mistake Word message when, then can be artificial After it is corrected, by manually by corrigendum after Word message send to website, website after receiving this Word message, Before the same, will show to receiving Word message.
In being embodied as, the Word message after identification can also be sent in the social software to mobile terminal, come with this Forwarded in real time in certain circle or in scope to conference content with live.For example, it is possible to Word message is passed through wechat Software is sent in group automatically.At this point it is possible to be achieved in, first log into wechat account;Then needs will be chosen The group being transmitted or good friend, then now by document associations of described display Word message to the group that chosen or good In the transmission backstage of friend, when there being Word message to be identified sequencing display in a document after dress changes, first Word message is detected, If described Word message does not comprise mistake Word message, then sent Word message to group or good friend;If described literary composition Word information includes wrong Word message, then after artificial corrigendum, by manually sending it in group or good friend.Or Person periodically can also detected to shown Word message, once detect in shown Word message not including Dislocation Word message, then be just automatically sent in group or good friend.
In being embodied as, when automatically being sent to Word message, dependence to be the detection to Word message to identify whether Comprise mistake Word message, wherein detect that the standard of wrong Word message is just to detect whether that shown Word message includes labelling, If Word message has description of symbols, if it is wrong Word message it is to be understood that by manually coming wrong Word message is carried out After corrigendum, can cancel to original to labelling, and then be detected, thus realizing automatically sending.
Further, in a document sequencing display Word message when, the sequence to Word message can be according to every section of continuous language Being ranked up, concrete methods of realizing is message number:When described voice conversion software completes previous voice signal conversion identification, Start timing;When described voice conversion software receipt is to current speech signal, stops timing, and be calculated current speech letter Time difference number and previous voice signal between;Judge whether the described time is more than default time difference, if so, to current language The Word message that message conversion identification obtains carries out segmentation sequencing display;If it is not, then current speech signal conversion identification obtains Word message carries out arranged in sequence and shows.
The principle of preceding method is that the people of general speech can make a short pause after finishing one section of word and put off until some time later the second word, then Can also talk about according to every to carry out compartment for one section and show in the form that Word message is ranked up, pass through between every words Compartment shows.For example, when first dress changes just to be identified to voice signal, current speech signal and previous voice signal are judged Time of origin difference, if time difference is more than 0.5 second, then the Word message swapping out with regard to the identified dress of current speech signal Carry out compartment to show.
Certainly it is to be understood that specific interval time can formerly be arranged, not necessarily 0.5 second, due to everyone Word speed inconsistent, therefore it provides formerly interval time setting, more preferable word-information display effect can be reached.
From the point of view of to sum up, the present invention does not need the typewriting apparatuss turning it is not required that the typing personnel of specialty, can be achieved with existing to meeting The automatic conversion of field speech content, thus obtaining the Word message corresponding with described speech content, and records.This is relatively Existing professional equipment and professional, the cost of implementation of the present invention is lower, is substantially all and can realize in various occasions.
Above-mentioned the specific embodiment only principle of the illustrative present invention and its effect, not for the restriction present invention.Any person skilled in the art all may be used Without prejudice under the spirit and the scope of the present invention, modifications and changes are carried out to above-described embodiment.Therefore, such as have in art and generally know All equivalent modifications or change that the knowledgeable is completed under without departing from disclosed spirit and technological thought, must be by the claim of the present invention Covered.

Claims (4)

1. a kind of voice automatic record method is it is characterised in that include:
Pre-enter speech roster;
Obtain voice signal;
The conversion of described voice signal is identified as by corresponding Word message by voice conversion software, and is stored, wherein, First during identification conversion voice signal, choose and to the name corresponding to this voice signal from the speech roster pre-entering Word, and extract the tamber characteristic of this voice messaging, the name chosen is associated with described tamber characteristic simultaneously;
When carrying out the conversion identification of next voice signal, judge whether its tone color is identical with the tone color of formerly described extraction, if phase Same then the name being associated with this tone color is shown in beginning of the sentence, if differing, entering and repeating to identifying conversion voice signal first Operation;
Described Word message is ranked up in a document show.
2. voice automatic record method according to claim 1 is it is characterised in that the method obtaining described voice signal includes: Spoken sounds are converted into by described voice signal by mike.
3. voice automatic record method according to claim 1 is it is characterised in that the side of sequencing display Word message in a document Method includes:To carry out segmentation sequencing display to described Word message according to every section of continuous voice signal.
4. voice automatic record method according to claim 3 is it is characterised in that described come according to every section of continuous voice signal The method carrying out segmentation sequencing display to described Word message is:
When described voice conversion software completes previous voice signal conversion identification, start timing;
Described voice conversion software receipt to current speech signal when, stop timing, and be calculated current speech signal with Time difference between previous voice signal;
Judge whether the described time is more than default time difference, the word if so, current speech signal conversion identification being obtained Information carries out segmentation sequencing display;If it is not, the Word message that then current speech signal conversion identification obtains carries out arranged in sequence showing Show.
CN201510530240.7A 2015-08-26 2015-08-26 A kind of voice automatic record method Pending CN106487532A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510530240.7A CN106487532A (en) 2015-08-26 2015-08-26 A kind of voice automatic record method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510530240.7A CN106487532A (en) 2015-08-26 2015-08-26 A kind of voice automatic record method

Publications (1)

Publication Number Publication Date
CN106487532A true CN106487532A (en) 2017-03-08

Family

ID=58233542

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510530240.7A Pending CN106487532A (en) 2015-08-26 2015-08-26 A kind of voice automatic record method

Country Status (1)

Country Link
CN (1) CN106487532A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107257448A (en) * 2017-08-09 2017-10-17 成都全云科技有限公司 A kind of video conferencing system exchanged with font
CN108399923A (en) * 2018-02-01 2018-08-14 深圳市鹰硕技术有限公司 More human hairs call the turn spokesman's recognition methods and device

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107257448A (en) * 2017-08-09 2017-10-17 成都全云科技有限公司 A kind of video conferencing system exchanged with font
CN108399923A (en) * 2018-02-01 2018-08-14 深圳市鹰硕技术有限公司 More human hairs call the turn spokesman's recognition methods and device
WO2019148586A1 (en) * 2018-02-01 2019-08-08 深圳市鹰硕技术有限公司 Method and device for speaker recognition during multi-person speech

Similar Documents

Publication Publication Date Title
CN106486116A (en) A kind of online generation method of on-the-spot meeting summary
CN106486113A (en) A kind of minutes method
US11264019B2 (en) Methods, systems, and media for voice-based call operations
US10552118B2 (en) Context based identification of non-relevant verbal communications
US20170359393A1 (en) System and Method for Building Contextual Highlights for Conferencing Systems
US20100158213A1 (en) Sysetms and Methods for Intelligent Call Transcription
US8484040B2 (en) Social analysis in multi-participant meetings
US10574827B1 (en) Method and apparatus of processing user data of a multi-speaker conference call
CN105120048B (en) The recording method of call voice and system
US8731919B2 (en) Methods and system for capturing voice files and rendering them searchable by keyword or phrase
CN105100360A (en) Communication auxiliary method and device for voice communication
US8391445B2 (en) Caller identification using voice recognition
US9549074B2 (en) Method and apparatus for providing ambient social telephony
US20090326939A1 (en) System and method for transcribing and displaying speech during a telephone call
US20110043597A1 (en) Conference annotation system
US20180293996A1 (en) Electronic Communication Platform
CN107527623A (en) Screen transmission method, device, electronic equipment and computer-readable recording medium
CN106487531A (en) A kind of voice automatic record method with automatic error correction function
CN109688276A (en) A kind of incoming call filter system and method based on artificial intelligence technology
CN104618615B (en) A kind of TeleConference Bridge meeting summary method for pushing based on short message
CN101277338A (en) Method for recording downstream voice signal of communication terminal as well as the communication terminal
CN110460798B (en) Video interview service processing method, device, terminal and storage medium
CN106487532A (en) A kind of voice automatic record method
US20120164986A1 (en) Method and apparatus for multipoint call service in mobile terminal
EP2913822B1 (en) Speaker recognition

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
WD01 Invention patent application deemed withdrawn after publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20170308