CN106487532A - A kind of voice automatic record method - Google Patents
A kind of voice automatic record method Download PDFInfo
- Publication number
- CN106487532A CN106487532A CN201510530240.7A CN201510530240A CN106487532A CN 106487532 A CN106487532 A CN 106487532A CN 201510530240 A CN201510530240 A CN 201510530240A CN 106487532 A CN106487532 A CN 106487532A
- Authority
- CN
- China
- Prior art keywords
- voice signal
- voice
- word message
- conversion
- signal
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Abstract
The invention discloses a kind of voice automatic record method, including:Pre-enter speech roster;Obtain voice signal;The conversion of described voice signal is identified as by corresponding Word message by voice conversion software, and stored, wherein, when identifying conversion voice signal first, choose and to the name corresponding to this voice signal from the speech roster pre-entering, and extract the tamber characteristic of this voice messaging, the name chosen is associated with described tamber characteristic simultaneously;When carrying out the conversion identification of next voice signal, judging that whether its tone color is identical with the tone color of formerly described extraction, if identical, the name being associated with this tone color is shown in beginning of the sentence, if differing, entering the operation repeating to identifying conversion voice signal first;Described Word message is ranked up in a document show.This relatively existing professional equipment and professional, the cost of implementation of the present invention is lower, is substantially all and can realize in various occasions.
Description
Technical field
The present invention relates to signal processing technology field, more particularly, it is a kind of voice automatic record method.
Background technology
In meeting scene, it is all typically now the voice data to record meeting scene by the way of recording, or by scene
Scribe artificial input record is carried out by special recording equipment.Although both modes are not asked in realization
Topic, but, if adopting the former method, cannot timely on-the-spot meeting record manuscript, need after the meeting by manually to receive
The mode listening session recording is recorded;If using the method for the latter, needing to buy special recording equipment, and need specially
The typist of industry just enables the synchronous recording at meeting scene, and high cost is it is impossible to popularize, only in some official's meeting occasions
Just can use.
Therefore, how quickly to realize the speech of meeting scene spokesman is automatically recorded, and do not need the record of specialty
Personnel and the equipment of specialty, have just become a great problem of the art.
Content of the invention
In view of the above problems, the present invention provides a kind of voice automatic record method, for realizing in the situation not needing professional
The speech content at the lower meeting of record automatically scene.Its concrete technical scheme is:
A kind of voice automatic record method, including:Pre-enter speech roster;Obtain voice signal;Software is converted by voice
The conversion of described voice signal is identified as corresponding Word message, and is stored, wherein, identify conversion voice signal first
When, choose and to the name corresponding to this voice signal from the speech roster pre-entering, and extract the sound of this voice messaging
The name chosen is associated with described tamber characteristic by color characteristic simultaneously;When carrying out the conversion identification of next voice signal,
Judge that whether its tone color is identical with the tone color of formerly described extraction, if identical, the name being associated with this tone color is shown in beginning of the sentence,
If differing, enter the operation repeating to identifying conversion voice signal first;Described Word message is ranked up in a document
Display.
Preferably, the method obtaining described voice signal includes:Spoken sounds are converted into by described voice signal by mike.
Preferably, the method for sequencing display Word message includes in a document:Come to described literary composition according to every section of continuous voice signal
Word information carries out segmentation sequencing display.
Preferably, described according to the method that every section of continuous voice signal to carry out segmentation sequencing display to described Word message it is:
When described voice conversion software completes previous voice signal conversion identification, start timing;Arrive in described voice conversion software receipt
During current speech signal, stop timing, and be calculated the time difference between current speech signal and previous voice signal;Judge
Whether the described time be more than default time difference, if so, the Word message that current speech signal conversion identification obtains is carried out point
Section sequencing display;If it is not, the Word message that then current speech signal conversion identification obtains carries out arranged in sequence showing.
Hinge structure, the present invention does not need the typewriting apparatuss turning it is not required that the typing personnel of specialty, can be achieved with to meeting
The automatic conversion of view live speeches content, thus obtaining the Word message corresponding with described speech content, and records, with
When can also know clearly that every words are who says in the Word message showing.This relatively existing professional equipment and specially
Industry personnel, the cost of implementation of the present invention is lower, and function is more preferably, in hgher efficiency.
Brief description
For the scheme being illustrated more clearly that in the embodiment of the present invention, below will be attached to use required described in specific embodiment
Figure be briefly described it should be apparent that, drawings in the following description are only some embodiments of the present invention, for this area
For technical staff, on the premise of not paying creative work, other accompanying drawings can also be obtained according to these accompanying drawings.
A kind of flowchart of voice automatic record method that Fig. 1 provides for the present invention.
Specific embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out clearly and completely
Description is it is clear that described embodiment is only a part of embodiment of the present invention, rather than whole embodiments.Based on this
Inventive embodiment, all other enforcement that those of ordinary skill in the art are obtained on the premise of not making creative work
Example, broadly falls into the scope of protection of the invention.
The optimized integration of the present invention is existing speech software.It is all more ripe that existing speech recognition technology and dress change technology,
Phonitic entry method or speech recognition technology are had on the mobile terminals such as mobile phone, as long as can be achieved with setting for terminal by voice
For being controlled or operating, as long as or for terminal unit speech, the content that this just can be talked by the software on terminal unit
It is automatically recognized as word, and shown.
Under the background of above prior art, in the present embodiment, give a kind of voice automatic record method, below will be to the method
It is described in detail.
See Fig. 1, give a kind of flowchart of voice automatic record method, the method comprising the steps of:
Step S1, pre-enters speech roster.
Step S2, obtains voice signal.
In being embodied as, the speech content of spokesman in meeting scene can be passed through by Mike by equipment such as mike and audio amplifiers
Wind is changing into the signal of telecommunication, and plays back in sound-box device.Or directly obtained in the speech of spokesman by mike
Hold, that is, directly by required voice signal in spoken sounds conversion cost embodiment.In this and prior art, speak against mobile phone,
Come to mobile phone transmission voice signal to be a reason with this.
Step S3, described voice signal is changed into corresponding Word message, and is stored, and wherein, identifies conversion first
During voice signal, choose and to the name corresponding to this voice signal from the speech roster pre-entering, and extract this voice
The name chosen is associated with described tamber characteristic by the tamber characteristic of information simultaneously.
In being embodied as, because in meeting, everyone is not continuous it is entirely possible to be a people one time of speech, deposit
In this situation alternately, then shown by not can know that in such scheme, Word message out is to be said by whom, because
Gone back in this present embodiment method being overcome.
First, pre-enter speech roster;
Then, when identifying conversion voice signal first, choose from the speech roster pre-entering and to this voice signal institute
Corresponding name, and extract the tamber characteristic of this voice messaging, the name chosen is associated with described tamber characteristic simultaneously;
Then, when carrying out the identification of next voice signal, judge whether its tone color is identical with the tone color of formerly described extraction, if phase
Same then the name being associated with this tone color is shown in beginning of the sentence;If differing, entering and repeating to convert voice signal to identifying first
Operation.Rapidly and efficiently can know that by said method whom the spokesman of every words is in a document.
For example, there is " Zhang San ", " Li Si " and " king five " in the list of typing in advance, after meeting starts, voice is changed
Identification software is when carrying out voice signal identification first, if the first voice signal is the speech content of speech " Zhang San ", that
Choose Zhang San from list, afterwards, if " Zhang San " talks always, then from the beginning of second voice signal, then meeting
Automatically labelling " Zhang San " before the Word message identifying.In addition, if from the beginning of second or the 3rd voice signal
After be other people speeches, then by according to the same to the speech recognition labelling of Zhang San, repeat no more here.
In being embodied as, software can be converted by voice and convert voice signals into corresponding Word message.By this side
Formula will identify that the Word message coming saves, and reaches the purpose of record.
Step S4, when carrying out the conversion identification of next voice signal, judges whether its tone color is identical with the tone color of formerly described extraction,
If identical, the name being associated with this tone color is shown in beginning of the sentence, if differing, entering and repeating to convert voice letter to identifying first
Number operation.
Step S5, described Word message is ranked up in a document show.
In being embodied as, because the purpose of meeting is to record the speech content in meeting in a document, accordingly, it would be desirable to
The Word message identifying is ranked up show, that is to say that typesetting shows.
Pass through said method can automatically record the speech content at meeting scene well, without by professional
Typewriting apparatuss and the typist of specialty, can be carried out in any place.Relatively existing method, cost of the present invention is lower, reliable
Property is higher.
In being embodied as, absolutely can not correctly identify the speech content in voice signal due to reason switching software,
Or can there is a certain proportion of wrong word, in order to overcome this problem further, present invention also offers implementation below.
When in a document Word message is ranked up with display, described Word message includes correct Word message and wrong word letter
Breath.Therefore, the present invention is marked using to the dislocation Word message of sequencing display in a document, in being embodied as, permissible
Carry out underlined in red labelling, or be changed font color being marked, be marked also or by way of annotation.
Meanwhile, the wrong Word message of this labelling is associated linking with the voice signal of corresponding described mistake Word message, when
When clicking on wrong Word message, the voice signal of described for correspondence mistake Word message is recognized for, and right in a document
The secondary Word message identifying carries out editable and shows.So, in being embodied as it is possible in being shown by editable
Wrong Word message is carried out with corrigendum editor, to obtain the Word message corrected, and is replaced with the Word message of described corrigendum described
Mistake Word message.
For example, have a voice signal A, the content said in A (hereinafter referred content B) be " weather of today is very good, I
Go together to stroll in the park ", voice conversion software after voice signal A is identified with conversion, (the hereafter letter of the content that obtains
Claim content C) be " just, we go to close public member's plate the weather of today together ", then can see, wherein " just " and
" closing public member's plate " is the Word message (hereinafter referred wrong content D) of mistake, therefore, when being ranked up display in a document,
Wrong content D " just " and " closing public member's plate " can be marked.Now, can be by manually coming to mistake at meeting scene
Content D is corrected, and the method for corrigendum is exactly to click on the wrong content D being marked in document, because wrong content D closes
Connection is linked to voice messaging A, then voice messaging A is recognized for change by starting voice conversion knowledge software, and in literary composition
Carry out editable and show in shelves, such as, be shown as " very good, whole good, true, pin, earn, demonstrate,prove ... ", from editable content
In have correct word, then " very good " can be clicked directly on and selected, then afterwards " very good " will replace mistake in
Hold D " just ", if correctly not corresponding to word in editable content, be such as shown that " whole good, true, pin,
Earn, demonstrate,prove ... ", then can first click on "true", then next automatically can show the word with "true" pairing again,
As " good, bold and unconstrained, number ... ", at this point it is possible to reselection " good ", the corrigendum of wrong Word message is completed with this.
Further, in being embodied as, the speech content of on-the-spot meeting can also be carried out with live forwarding in real time, specifically real
Applying method may be referred to implementation below.
Individual in being embodied as, after identification obtains correct Word message, can described Word message will be sent by network
Live display in real time is carried out to website.Specifically can be achieved in that, the document showing described Word message will be used for even first
It is connected in a website, whenever having Word message to be identified sequencing display in a document after dress changes, Word message is detected,
The described Word message of institute does not comprise mistake Word message, then sends Word message and is shown to website.So meeting it
Outer other people just can watch Word message by refreshing this website.
In addition, if detect identified conversion after Word message includes mistake Word message when, then can be artificial
After it is corrected, by manually by corrigendum after Word message send to website, website after receiving this Word message,
Before the same, will show to receiving Word message.
In being embodied as, the Word message after identification can also be sent in the social software to mobile terminal, come with this
Forwarded in real time in certain circle or in scope to conference content with live.For example, it is possible to Word message is passed through wechat
Software is sent in group automatically.At this point it is possible to be achieved in, first log into wechat account;Then needs will be chosen
The group being transmitted or good friend, then now by document associations of described display Word message to the group that chosen or good
In the transmission backstage of friend, when there being Word message to be identified sequencing display in a document after dress changes, first Word message is detected,
If described Word message does not comprise mistake Word message, then sent Word message to group or good friend;If described literary composition
Word information includes wrong Word message, then after artificial corrigendum, by manually sending it in group or good friend.Or
Person periodically can also detected to shown Word message, once detect in shown Word message not including
Dislocation Word message, then be just automatically sent in group or good friend.
In being embodied as, when automatically being sent to Word message, dependence to be the detection to Word message to identify whether
Comprise mistake Word message, wherein detect that the standard of wrong Word message is just to detect whether that shown Word message includes labelling,
If Word message has description of symbols, if it is wrong Word message it is to be understood that by manually coming wrong Word message is carried out
After corrigendum, can cancel to original to labelling, and then be detected, thus realizing automatically sending.
Further, in a document sequencing display Word message when, the sequence to Word message can be according to every section of continuous language
Being ranked up, concrete methods of realizing is message number:When described voice conversion software completes previous voice signal conversion identification,
Start timing;When described voice conversion software receipt is to current speech signal, stops timing, and be calculated current speech letter
Time difference number and previous voice signal between;Judge whether the described time is more than default time difference, if so, to current language
The Word message that message conversion identification obtains carries out segmentation sequencing display;If it is not, then current speech signal conversion identification obtains
Word message carries out arranged in sequence and shows.
The principle of preceding method is that the people of general speech can make a short pause after finishing one section of word and put off until some time later the second word, then
Can also talk about according to every to carry out compartment for one section and show in the form that Word message is ranked up, pass through between every words
Compartment shows.For example, when first dress changes just to be identified to voice signal, current speech signal and previous voice signal are judged
Time of origin difference, if time difference is more than 0.5 second, then the Word message swapping out with regard to the identified dress of current speech signal
Carry out compartment to show.
Certainly it is to be understood that specific interval time can formerly be arranged, not necessarily 0.5 second, due to everyone
Word speed inconsistent, therefore it provides formerly interval time setting, more preferable word-information display effect can be reached.
From the point of view of to sum up, the present invention does not need the typewriting apparatuss turning it is not required that the typing personnel of specialty, can be achieved with existing to meeting
The automatic conversion of field speech content, thus obtaining the Word message corresponding with described speech content, and records.This is relatively
Existing professional equipment and professional, the cost of implementation of the present invention is lower, is substantially all and can realize in various occasions.
Above-mentioned the specific embodiment only principle of the illustrative present invention and its effect, not for the restriction present invention.Any person skilled in the art all may be used
Without prejudice under the spirit and the scope of the present invention, modifications and changes are carried out to above-described embodiment.Therefore, such as have in art and generally know
All equivalent modifications or change that the knowledgeable is completed under without departing from disclosed spirit and technological thought, must be by the claim of the present invention
Covered.
Claims (4)
1. a kind of voice automatic record method is it is characterised in that include:
Pre-enter speech roster;
Obtain voice signal;
The conversion of described voice signal is identified as by corresponding Word message by voice conversion software, and is stored, wherein,
First during identification conversion voice signal, choose and to the name corresponding to this voice signal from the speech roster pre-entering
Word, and extract the tamber characteristic of this voice messaging, the name chosen is associated with described tamber characteristic simultaneously;
When carrying out the conversion identification of next voice signal, judge whether its tone color is identical with the tone color of formerly described extraction, if phase
Same then the name being associated with this tone color is shown in beginning of the sentence, if differing, entering and repeating to identifying conversion voice signal first
Operation;
Described Word message is ranked up in a document show.
2. voice automatic record method according to claim 1 is it is characterised in that the method obtaining described voice signal includes:
Spoken sounds are converted into by described voice signal by mike.
3. voice automatic record method according to claim 1 is it is characterised in that the side of sequencing display Word message in a document
Method includes:To carry out segmentation sequencing display to described Word message according to every section of continuous voice signal.
4. voice automatic record method according to claim 3 is it is characterised in that described come according to every section of continuous voice signal
The method carrying out segmentation sequencing display to described Word message is:
When described voice conversion software completes previous voice signal conversion identification, start timing;
Described voice conversion software receipt to current speech signal when, stop timing, and be calculated current speech signal with
Time difference between previous voice signal;
Judge whether the described time is more than default time difference, the word if so, current speech signal conversion identification being obtained
Information carries out segmentation sequencing display;If it is not, the Word message that then current speech signal conversion identification obtains carries out arranged in sequence showing
Show.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510530240.7A CN106487532A (en) | 2015-08-26 | 2015-08-26 | A kind of voice automatic record method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510530240.7A CN106487532A (en) | 2015-08-26 | 2015-08-26 | A kind of voice automatic record method |
Publications (1)
Publication Number | Publication Date |
---|---|
CN106487532A true CN106487532A (en) | 2017-03-08 |
Family
ID=58233542
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510530240.7A Pending CN106487532A (en) | 2015-08-26 | 2015-08-26 | A kind of voice automatic record method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106487532A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107257448A (en) * | 2017-08-09 | 2017-10-17 | 成都全云科技有限公司 | A kind of video conferencing system exchanged with font |
CN108399923A (en) * | 2018-02-01 | 2018-08-14 | 深圳市鹰硕技术有限公司 | More human hairs call the turn spokesman's recognition methods and device |
-
2015
- 2015-08-26 CN CN201510530240.7A patent/CN106487532A/en active Pending
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107257448A (en) * | 2017-08-09 | 2017-10-17 | 成都全云科技有限公司 | A kind of video conferencing system exchanged with font |
CN108399923A (en) * | 2018-02-01 | 2018-08-14 | 深圳市鹰硕技术有限公司 | More human hairs call the turn spokesman's recognition methods and device |
WO2019148586A1 (en) * | 2018-02-01 | 2019-08-08 | 深圳市鹰硕技术有限公司 | Method and device for speaker recognition during multi-person speech |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106486116A (en) | A kind of online generation method of on-the-spot meeting summary | |
CN106486113A (en) | A kind of minutes method | |
US11264019B2 (en) | Methods, systems, and media for voice-based call operations | |
US10552118B2 (en) | Context based identification of non-relevant verbal communications | |
US20170359393A1 (en) | System and Method for Building Contextual Highlights for Conferencing Systems | |
US20100158213A1 (en) | Sysetms and Methods for Intelligent Call Transcription | |
US8484040B2 (en) | Social analysis in multi-participant meetings | |
US10574827B1 (en) | Method and apparatus of processing user data of a multi-speaker conference call | |
CN105120048B (en) | The recording method of call voice and system | |
US8731919B2 (en) | Methods and system for capturing voice files and rendering them searchable by keyword or phrase | |
CN105100360A (en) | Communication auxiliary method and device for voice communication | |
US8391445B2 (en) | Caller identification using voice recognition | |
US9549074B2 (en) | Method and apparatus for providing ambient social telephony | |
US20090326939A1 (en) | System and method for transcribing and displaying speech during a telephone call | |
US20110043597A1 (en) | Conference annotation system | |
US20180293996A1 (en) | Electronic Communication Platform | |
CN107527623A (en) | Screen transmission method, device, electronic equipment and computer-readable recording medium | |
CN106487531A (en) | A kind of voice automatic record method with automatic error correction function | |
CN109688276A (en) | A kind of incoming call filter system and method based on artificial intelligence technology | |
CN104618615B (en) | A kind of TeleConference Bridge meeting summary method for pushing based on short message | |
CN101277338A (en) | Method for recording downstream voice signal of communication terminal as well as the communication terminal | |
CN110460798B (en) | Video interview service processing method, device, terminal and storage medium | |
CN106487532A (en) | A kind of voice automatic record method | |
US20120164986A1 (en) | Method and apparatus for multipoint call service in mobile terminal | |
EP2913822B1 (en) | Speaker recognition |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
WD01 | Invention patent application deemed withdrawn after publication | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20170308 |