CN107945802A - Voice recognition result processing method and processing device - Google Patents
Voice recognition result processing method and processing device
- Publication number
- CN107945802A (application CN201710995682.8A)
- Authority: CN (China)
- Prior art keywords: text, recognition result, voice, voice recognition, display area
- Legal status: Pending (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
- G10L21/10—Transforming into visible information
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Data Mining & Analysis (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- User Interface Of Digital Computer (AREA)
Abstract
The present invention relates to a fast and convenient method and device for proofreading voice recognition results. The method includes: obtaining the voice recognition result text corresponding to voice information; dividing the current display interface into a first display area and a second display area, and displaying the voice recognition result text in both areas; and, upon receiving a selection operation on a first target text in the second voice recognition result text, playing the target voice information corresponding to the first target text and positioning the editable cursor of the current display interface at the second target text, in the first voice recognition result text, that corresponds to the first target text, so that the user can correct the second target text according to the target voice information and obtain the corrected voice recognition result text. This scheme can improve the user's work efficiency and recording accuracy.
Description
Technical field
The present invention relates to the technical field of voice recognition, and in particular to a voice recognition result processing method and processing device.
Background
Traditionally, the proceedings of meetings and court trials are recorded by a recorder who types or takes notes. This approach is slow and inefficient, and the speech itself cannot be archived. With the rapid development of speech recognition technology, many companies have applied it to meetings and court trials. However, because speech recognition accuracy cannot reach 100%, the recognition results in some intelligent court-trial or conference systems need to be corrected.
Summary of the invention
Embodiments of the present invention provide a voice recognition result processing method and processing device that make it convenient for the user to correct voice recognition results, thereby improving the user experience.
According to a first aspect of the embodiments of the present invention, a voice recognition result processing method is provided, including:
obtaining the voice recognition result text corresponding to voice information;
dividing the current display interface into a first display area and a second display area, and displaying the voice recognition result text in the first display area and the second display area, wherein the second voice recognition result text in the second display area corresponds to the first voice recognition result text in the first display area, and the second voice recognition result text in the second display area corresponds to the voice information;
upon receiving a selection operation on a first target text in the second voice recognition result text, playing the target voice information corresponding to the first target text, and positioning the editable cursor of the current display interface at the second target text, in the first voice recognition result text, that corresponds to the first target text, so that the user can correct the second target text in the first voice recognition result text according to the target voice information and obtain the corrected voice recognition result text.
In this embodiment, the current display interface is divided into a first display area and a second display area, and the voice recognition result text is displayed in both. The second voice recognition result text in the second display area corresponds to the first voice recognition result text in the first display area and to the voice information. Thus, when the user clicks to select a first target text in the second display area, the target voice information corresponding to that first target text is played, and the editable cursor is positioned at the second target text in the first display area that corresponds to the first target text. The user can then correct the second target text in the first display area while playing back the voice information. Being able to replay the actual speech and correct the recognition result promptly greatly improves the user's work efficiency and recording accuracy.
In one embodiment, the method further includes:
displaying a play button corresponding to the voice information in the second display area of the current display interface;
upon receiving a play operation on the voice information, playing the voice information, highlighting the third target text, in the second voice recognition result text, that corresponds to the voice information currently being played, and positioning the editable cursor of the current display interface at the fourth target text, in the first voice recognition result text, that corresponds to the third target text.
In this embodiment, a play button corresponding to the voice information can also be displayed in the second display area of the current display interface, and the user can click the play button to play back the voice information. While the voice information is playing, the third target text corresponding to it is highlighted in the second display area, and the fourth target text in the first voice recognition result text that corresponds to the third target text can be edited in step with playback, which makes it easier for the user to correct the voice recognition result text according to the voice information.
In one embodiment, the method further includes:
receiving a modification instruction, input by the user, for the second target text at the position of the editable cursor;
modifying the second target text according to the modification instruction, and marking the modified second target text.
In this embodiment, the user can modify the target text at the position of the editable cursor. After the modification, the modified target text can be marked so that it contrasts with the voice recognition result text in the second display area, which makes it easy for the user to review the modified parts. In addition, the user can move the editable cursor within the first display area on their own in order to modify other target texts in the first display area.
In one embodiment, the method further includes:
before displaying the voice recognition result text, obtaining the voice information;
recognizing the voice information again to obtain a secondary voice recognition result text;
comparing the secondary voice recognition result text with the voice recognition result text to determine whether the two are consistent;
when the two are consistent, displaying the voice recognition result text in the first display area and the second display area;
when the two are inconsistent, determining the difference text in the secondary voice recognition result text that differs from the voice recognition result text;
displaying the voice recognition result text in the first display area, displaying the secondary voice recognition result text in the second display area, and highlighting the difference text.
In this embodiment, before the current display interface displays the voice recognition result text, the voice information can be recognized a second time to obtain a secondary voice recognition result text, and the two recognition results can be checked for consistency. If they are inconsistent, the secondary voice recognition result text is displayed in the second display area, the voice recognition result text is displayed in the first display area, and the difference text between the two is highlighted in the second display area. This makes it convenient for the user to play back the voice information for the highlighted difference text and correct the voice recognition result text.
In one embodiment, the method further includes:
receiving a find-and-replace instruction for a specified text field in the first voice recognition result text;
according to the find-and-replace instruction, finding the specified text field in the first voice recognition result text and replacing it with a new text field.
In this embodiment, the user can also find and replace fields in the first voice recognition result text in the first display area, which makes it easier to modify the voice recognition result text.
In one embodiment, the method further includes:
while the voice information is playing, receiving an input pause instruction;
pausing playback of the voice information according to the pause instruction.
In this embodiment, the user can also choose to pause playback while the voice information is playing, which further improves the user experience.
According to a second aspect of the embodiments of the present invention, a voice recognition result processing device is provided, including:
a processor; and
a memory for storing processor-executable instructions;
wherein the processor is configured to:
obtain the voice recognition result text corresponding to voice information;
divide the current display interface into a first display area and a second display area, and display the voice recognition result text in the first display area and the second display area, wherein the second voice recognition result text in the second display area corresponds to the first voice recognition result text in the first display area, and the second voice recognition result text in the second display area corresponds to the voice information;
upon receiving a selection operation on a first target text in the second voice recognition result text, play the target voice information corresponding to the first target text, and position the editable cursor of the current display interface at the second target text, in the first voice recognition result text, that corresponds to the first target text, so that the user can correct the second target text in the first voice recognition result text according to the target voice information and obtain the corrected voice recognition result text.
In one embodiment, the processor is further configured to:
display a play button corresponding to the voice information in the second display area of the current display interface;
upon receiving a play operation on the voice information, play the voice information, highlight the third target text, in the second voice recognition result text, that corresponds to the voice information currently being played, and position the editable cursor of the current display interface at the fourth target text, in the first voice recognition result text, that corresponds to the third target text.
In one embodiment, the processor is further configured to:
receive a modification instruction, input by the user, for the second target text at the position of the editable cursor;
modify the second target text according to the modification instruction, and mark the modified second target text.
In one embodiment, the processor is further configured to:
before displaying the voice recognition result text, obtain the voice information;
recognize the voice information again to obtain a secondary voice recognition result text;
compare the secondary voice recognition result text with the voice recognition result text to determine whether the two are consistent;
when the two are consistent, display the voice recognition result text in the first display area and the second display area;
when the two are inconsistent, determine the difference text in the secondary voice recognition result text that differs from the voice recognition result text;
display the voice recognition result text in the first display area, display the secondary voice recognition result text in the second display area, and highlight the difference text.
In one embodiment, the processor is further configured to:
receive a find-and-replace instruction for a specified text field in the first voice recognition result text;
according to the find-and-replace instruction, find the specified text field in the first voice recognition result text and replace it with a new text field.
In one embodiment, the processor is further configured to:
while the voice information is playing, receive an input pause instruction;
pause playback of the voice information according to the pause instruction.
It should be understood that the above general description and the following detailed description are only exemplary and explanatory and do not limit the present invention.
Other features and advantages of the present invention will be set forth in the following description, and in part will become apparent from the description or be understood by practicing the invention. The objectives and other advantages of the invention can be realized and obtained by the structure particularly pointed out in the written description, the claims, and the accompanying drawings.
The technical solution of the present invention is described in further detail below with reference to the drawings and embodiments.
Brief description of the drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the present invention and, together with the specification, serve to explain the principles of the invention.
Fig. 1 is a flowchart of a voice recognition result processing method according to an exemplary embodiment.
Fig. 2 is a flowchart of another voice recognition result processing method according to an exemplary embodiment.
Fig. 3 is a flowchart of another voice recognition result processing method according to an exemplary embodiment.
Fig. 4 is a flowchart of another voice recognition result processing method according to an exemplary embodiment.
Fig. 5 is a flowchart of another voice recognition result processing method according to an exemplary embodiment.
Fig. 6 is a screenshot of a current display interface according to an exemplary embodiment.
Fig. 7 is a flowchart of another voice recognition result processing method according to an exemplary embodiment.
Fig. 8 is a screenshot of another current display interface according to an exemplary embodiment.
Detailed description
Exemplary embodiments are described in detail here, and examples of them are shown in the accompanying drawings. When the following description refers to the drawings, the same numerals in different drawings denote the same or similar elements unless otherwise indicated. The implementations described in the following exemplary embodiments do not represent all implementations consistent with the present invention. Rather, they are merely examples of devices and methods consistent with some aspects of the invention as detailed in the appended claims.
Fig. 1 is a flowchart of a voice recognition result processing method according to an exemplary embodiment. The method is applied in a terminal device, which may be any device with a voice recognition function, such as a mobile phone, a computer, a digital broadcast terminal, a messaging device, a game console, a tablet device, a medical device, fitness equipment, or a personal digital assistant. As shown in Fig. 1, the method includes steps S101-S103:
In step S101, the voice recognition result text corresponding to voice information is obtained.
In step S102, the current display interface is divided into a first display area and a second display area, and the voice recognition result text is displayed in both areas. The second voice recognition result text in the second display area corresponds to the first voice recognition result text in the first display area, and the second voice recognition result text in the second display area corresponds to the voice information. The second voice recognition result text in the second display area is for viewing only and cannot be modified or edited, whereas the first voice recognition result text in the first display area can be modified and edited.
In step S103, upon receiving a selection operation on a first target text in the second voice recognition result text, the target voice information corresponding to the first target text is played, and the editable cursor of the current display interface is positioned at the second target text, in the first voice recognition result text, that corresponds to the first target text, so that the user can correct the second target text in the first voice recognition result text according to the target voice information and obtain the corrected voice recognition result text.
In this embodiment, the current display interface is divided into a first display area and a second display area, and the voice recognition result text is displayed in both. The second voice recognition result text in the second display area corresponds to the first voice recognition result text in the first display area and to the voice information. Thus, when the user clicks to select a first target text in the second display area, the target voice information corresponding to that first target text is played, and the editable cursor is positioned at the second target text in the first display area that corresponds to the first target text. The user can then correct the second target text in the first display area while playing back the voice information. Being able to replay the actual speech and correct the recognition result promptly greatly improves the user's work efficiency and recording accuracy.
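The patent does not say how a selected text is tied to its audio or to the cursor position. One plausible arrangement, sketched below purely as an illustration, is for the recognizer to return text spans with timestamps. The `Segment` structure, its field names, and the `select_target` function are hypothetical and do not come from the source; the sketch also assumes both areas still hold identical text, so that a character offset in the read-only area is valid in the editable area.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Segment:
    """One recognized span of text and the audio interval it came from (hypothetical)."""
    start_char: int   # offset of the span in the recognition result text
    end_char: int     # exclusive end offset
    start_sec: float  # start of the matching audio
    end_sec: float    # end of the matching audio


def select_target(
    segments: List[Segment], click_offset: int
) -> Optional[Tuple[Tuple[float, float], int]]:
    """Map a click in the read-only area to (audio range to play, cursor offset)."""
    for seg in segments:
        if seg.start_char <= click_offset < seg.end_char:
            # Assumption: both areas hold the same text, so the offsets coincide.
            return (seg.start_sec, seg.end_sec), seg.start_char
    return None  # the click did not land on any recognized span


segments = [Segment(0, 5, 0.0, 1.2), Segment(5, 11, 1.2, 2.5)]
audio_range, cursor = select_target(segments, 7)
```

Once the editable text has been changed, the offsets in the two areas diverge, so a real implementation would need to track each segment's position in the editable text separately.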
Fig. 2 is a flowchart of another voice recognition result processing method according to an exemplary embodiment.
As shown in Fig. 2, in one embodiment, the above method further includes steps S201-S202:
In step S201, a play button corresponding to the voice information is displayed in the second display area of the current display interface.
In step S202, upon receiving a play operation on the voice information, the voice information is played, the third target text, in the second voice recognition result text, that corresponds to the voice information currently being played is highlighted, and the editable cursor of the current display interface is positioned at the fourth target text, in the first voice recognition result text, that corresponds to the third target text.
In this embodiment, a play button corresponding to the voice information can also be displayed in the second display area of the current display interface, and the user can click the play button to play back the voice information. While the voice information is playing, the third target text corresponding to it is highlighted in the second display area, and the fourth target text in the first voice recognition result text that corresponds to the third target text can be edited in step with playback, which makes it easier for the user to correct the voice recognition result text according to the voice information.
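To keep the highlight and the cursor in step with playback, an implementation needs to find which text span corresponds to the current playback position. The sketch below is one way to do that with a binary search over span start times. The tuple layout and the function name `span_at` are assumptions for illustration; the source does not describe the lookup.

```python
from bisect import bisect_right
from typing import List, Optional, Tuple

# Hypothetical layout: (start_sec, end_sec, start_char, end_char), sorted by start_sec.
Span = Tuple[float, float, int, int]


def span_at(spans: List[Span], position_sec: float) -> Optional[Span]:
    """Return the span whose audio interval contains position_sec, if any."""
    starts = [s[0] for s in spans]
    i = bisect_right(starts, position_sec) - 1  # last span starting at or before the position
    if i >= 0 and position_sec < spans[i][1]:
        return spans[i]
    return None  # before the first span, in a gap, or past the end


spans = [(0.0, 1.0, 0, 4), (1.0, 2.5, 4, 9)]
current = span_at(spans, 1.7)
# The character range current[2]:current[3] would be highlighted in the second
# display area, and the editable cursor moved to the matching text in the first.
```

Calling `span_at` on each playback progress tick is enough to drive both the highlight and the cursor position.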
Fig. 3 is a flowchart of another voice recognition result processing method according to an exemplary embodiment.
As shown in Fig. 3, in one embodiment, the above method further includes steps S301-S302:
In step S301, a modification instruction, input by the user, for the second target text at the position of the editable cursor is received.
In step S302, the second target text is modified according to the modification instruction, and the modified second target text is marked. Marking the modified second target text may mean highlighting it, or the difference between the second target text and the other text may be shown in some other way.
In this embodiment, the user can modify the target text at the position of the editable cursor. After the modification, the modified target text can be marked so that it contrasts with the voice recognition result text in the second display area, which makes it easy for the user to review the modified parts. In addition, the user can move the editable cursor within the first display area on their own in order to modify other target texts in the first display area.
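Marking modified text requires remembering which character ranges have been changed and keeping those ranges valid as later edits shift the text. The following sketch shows one simple bookkeeping scheme. The function `apply_modification` and the representation of marks as `(start, end)` offset pairs are assumptions made for illustration and are not described in the source.

```python
from typing import List, Tuple

Mark = Tuple[int, int]  # (start, end) offsets of a modified range, end exclusive


def apply_modification(
    text: str, start: int, end: int, new_text: str, marks: List[Mark]
) -> Tuple[str, List[Mark]]:
    """Replace text[start:end] with new_text and return the text plus updated marks."""
    delta = len(new_text) - (end - start)
    updated: List[Mark] = []
    for m_start, m_end in marks:
        if m_end <= start:
            updated.append((m_start, m_end))                  # before the edit: unchanged
        elif m_start >= end:
            updated.append((m_start + delta, m_end + delta))  # after the edit: shifted
        # Marks overlapping the edited range are superseded by the new mark.
    updated.append((start, start + len(new_text)))
    return text[:start] + new_text + text[end:], sorted(updated)


text, marks = apply_modification("the cat sat", 4, 7, "dog", [])
text, marks = apply_modification(text, 0, 3, "that", marks)
```

The resulting mark list can then be handed to whatever rendering layer draws the highlight in the first display area.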
Fig. 4 is a flowchart of another voice recognition result processing method according to an exemplary embodiment.
As shown in Fig. 4, in one embodiment, the above method further includes steps S401-S402:
In step S401, a find-and-replace instruction for a specified text field in the first voice recognition result text is received.
In step S402, according to the find-and-replace instruction, the specified text field is found in the first voice recognition result text and replaced with a new text field.
In this embodiment, the user can also find and replace fields in the first voice recognition result text in the first display area, which makes it easier to modify the voice recognition result text.
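Steps S401-S402 amount to a plain-text find and replace applied only to the editable text. A minimal sketch follows, assuming literal (non-regex) matching; the function name `find_and_replace` and the returned replacement count are illustrative additions, not part of the source.

```python
from typing import Tuple


def find_and_replace(text: str, target: str, replacement: str) -> Tuple[str, int]:
    """Replace every literal occurrence of target and report how many were replaced."""
    if not target:
        return text, 0  # an empty search string would otherwise match everywhere
    count = text.count(target)
    return text.replace(target, replacement), count


editable = "the coart adjourned; the coart resumed"
editable, replaced = find_and_replace(editable, "coart", "court")
```

Returning the count lets the interface tell the user how many occurrences were changed, which is useful feedback when a single recognition error repeats throughout a long transcript.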
Fig. 5 is a flowchart of another voice recognition result processing method according to an exemplary embodiment.
As shown in Fig. 5, in one embodiment, the above method further includes steps S501-S502:
In step S501, while the voice information is playing, an input pause instruction is received.
In step S502, playback of the voice information is paused according to the pause instruction.
In this embodiment, the user can also choose to pause playback while the voice information is playing, which further improves the user experience.
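The pause behaviour can be modelled independently of any audio library as a small state holder whose position only advances while playing. The class below is a hypothetical illustration; `PlaybackState` and its methods are not named in the source, and a real player would delegate to the platform's audio API.

```python
class PlaybackState:
    """Tracks whether audio is playing and how far playback has progressed."""

    def __init__(self) -> None:
        self.playing = False
        self.position_sec = 0.0

    def play(self) -> None:
        self.playing = True

    def pause(self) -> None:
        # Position is retained, so a later play() resumes where playback stopped.
        self.playing = False

    def advance(self, elapsed_sec: float) -> float:
        """Advance the position by elapsed_sec only while playing."""
        if self.playing:
            self.position_sec += elapsed_sec
        return self.position_sec


state = PlaybackState()
state.play()
state.advance(1.5)
state.pause()
state.advance(2.0)  # ignored while paused
```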
The above technical solution is described in detail below with a specific embodiment.
As shown in Fig. 6, the current display interface 60 can be divided into two display areas, a first display area and a second display area, arranged side by side. The voice recognition result text is displayed in both areas: the display area on the left holds the editable voice recognition result text, and the display area on the right holds the non-editable voice recognition result text. When the user clicks to select a target text in the right area, the voice information corresponding to that target text is played, and the corresponding text in the left area is also selected and becomes the editable cursor position, so that the target text in the left area can be corrected according to the voice information being played.
As also shown in Fig. 6, the current display interface displays find and replace options. The user can search for a specified text or keyword and then use the replace option to replace that text or keyword in the voice recognition result text in the left area with a new text or keyword, which makes it easier to fix the result when some speech is recognized inaccurately. For example, suppose every occurrence of "school" in the voice information has been recognized as "study". When the user notices the mistake, the user can enter "study" after the find option and "school" after the replace option, and every "study" in the voice recognition result text in the left area is changed to "school". The user does not need to correct the occurrences one by one, which reduces the number of operations and improves the user experience.
In addition, the current display interface displays buttons such as a playback button, with which the user can play back the voice information, and a pause button. If the user needs to interrupt playback partway through, the user can press the pause button to pause the voice information and press the button again to resume playback when needed. The interface can of course also display buttons such as "Print" and "Export", so that the user can output the corrected voice recognition result text.
Fig. 7 is a flowchart of another voice recognition result processing method according to an exemplary embodiment.
As shown in Fig. 7, in one embodiment, the above method further includes steps S701-S706:
In step S701, before the voice recognition result text is displayed, the voice information is obtained.
In step S702, the voice information is recognized again to obtain a secondary voice recognition result text.
In step S703, the secondary voice recognition result text is compared with the voice recognition result text to determine whether the two are consistent.
In step S704, when the two are consistent, the voice recognition result text is displayed in the first display area and the second display area.
In step S705, when the two are inconsistent, the difference text in the secondary voice recognition result text that differs from the voice recognition result text is determined.
In step S706, the voice recognition result text is displayed in the first display area, the secondary voice recognition result text is displayed in the second display area, and the difference text is highlighted.
In this embodiment, before the current display interface displays the voice recognition result text, the voice information can be recognized a second time to obtain a secondary voice recognition result text, and the two recognition results can be checked for consistency. If they are inconsistent, the secondary voice recognition result text is displayed in the second display area, the voice recognition result text is displayed in the first display area, and the difference text between the two is highlighted in the second display area. This makes it convenient for the user to play back the voice information for the highlighted difference text and correct the voice recognition result text.
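The source does not specify how the difference text in steps S703-S705 is computed. As one possible approach, the sketch below uses Python's standard `difflib.SequenceMatcher` to locate the ranges of the secondary result that differ from the first result. The function name `difference_spans` is hypothetical, and character-level comparison is an assumption; a word-level or token-level comparison would work the same way.

```python
from difflib import SequenceMatcher
from typing import List, Tuple


def difference_spans(primary: str, secondary: str) -> List[Tuple[int, int]]:
    """Return (start, end) offsets in `secondary` whose text differs from `primary`."""
    matcher = SequenceMatcher(None, primary, secondary, autojunk=False)
    spans = []
    for tag, _i1, _i2, j1, j2 in matcher.get_opcodes():
        # "replace" and "insert" are the opcodes that contribute text to `secondary`;
        # "delete" leaves nothing in `secondary` to highlight.
        if tag in ("replace", "insert"):
            spans.append((j1, j2))
    return spans


primary = "abcd"
secondary = "abXd"
highlights = difference_spans(primary, secondary)
consistent = not highlights and primary == secondary
```

An empty list together with equal strings corresponds to the consistent case of step S704. Otherwise the returned ranges are the parts of the second display area to highlight in step S706.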
Fig. 8 shows a current display interface 80. To further improve the recognition rate and draw the user's attention, secondary recognition can be used: the same segment of voice information is sent to the recognition engine twice, and the two recognition results are compared. If the two results are consistent, the second recognition result is not displayed. If they are inconsistent, the secondary recognition result is displayed: the left side shows the first voice recognition result, the right side shows the secondary voice recognition result, and the secondary recognition result is marked, for example highlighted, to draw the user's attention.
Second aspect according to embodiments of the present invention, there is provided a kind of voice recognition result processing unit, including:
Processor;
For storing the memory of processor-executable instruction;
Wherein, the processor is configured as:
Obtain the corresponding voice recognition result text of voice messaging;
Current display interface is divided into the first display area and the second display area, in first display area and institute
State and institute's speech recognition result text is shown in the second display area, wherein, the second voice in second display area
The first voice recognition result text in recognition result text and first display area is corresponding, and second viewing area
The second voice recognition result text in domain is corresponding with the voice messaging;
When receiving the selected operation to the first object text in the second voice recognition result text, play with
The corresponding target voice information of the first object text, and the editable cursor of current display interface is positioned to described first
In voice recognition result text, and corresponding second target text of the first object text, so that user is according to the mesh
Mark voice messaging is corrected the second target text in the first voice recognition result text, the voice after being corrected
Recognition result text.
In one embodiment, the processor is additionally configured to:
The corresponding broadcast button of voice messaging is shown in the second display area of the current display interface;
When receiving the play operation to the voice messaging, the voice messaging is played, and highlights and currently broadcasts
The 3rd target text in the corresponding second voice recognition result text of voice messaging put, and by current display interface can
Editor's cursor is positioned into the first voice recognition result text, the 4th target text corresponding with the 3rd target text
This.
In one embodiment, the processor is additionally configured to:
Receive the modification instruction of the second target text input by user to editable cursor present position;
Instructed according to the modification, change second target text, and mark amended second target text.
In one embodiment, the processor is further configured to:
Before displaying the voice recognition result text, obtain the voice information;
Recognize the voice information again to obtain a secondary voice recognition result text;
Compare the secondary voice recognition result text with the voice recognition result text to determine whether the two are consistent;
When the two are consistent, display the voice recognition result text in the first display area and the second display area;
When the two are inconsistent, determine the difference text in the secondary voice recognition result text that differs from the voice recognition result text;
Display the voice recognition result text in the first display area, display the secondary voice recognition result text in the second display area, and highlight the difference text.
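The comparison step can be illustrated with Python's standard `difflib`. The patent does not say how the difference text is computed, so the character-level diff here is an assumption, and `difference_spans` and `plan_display` are hypothetical names.

```python
from difflib import SequenceMatcher


def difference_spans(first, second):
    """Character ranges (start, end) in `second` that differ from `first`.

    A pure deletion yields an empty range (start == end) at the place where
    text is missing from `second`.
    """
    matcher = SequenceMatcher(None, first, second, autojunk=False)
    return [
        (j1, j2)
        for tag, _i1, _i2, j1, j2 in matcher.get_opcodes()
        if tag != "equal"
    ]


def plan_display(first, second):
    """Text for each display area plus the ranges to highlight in the second area."""
    if first == second:
        return first, first, []
    return first, second, difference_spans(first, second)
```

For example, comparing `"the cat sat"` with `"the bat sat"` highlights the single character at range `(4, 5)` in the second display area.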
In one embodiment, the processor is further configured to:
Receive a find-and-replace instruction for a specified text field in the first voice recognition result text;
According to the find-and-replace instruction, search for the specified text field in the first voice recognition result text, and replace the specified text field with a new text field.
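A minimal sketch of the find-and-replace step, assuming the first voice recognition result text is held as a plain string and that every occurrence is replaced (the patent does not specify whether one or all occurrences are replaced). `find_and_replace` is a hypothetical name.

```python
def find_and_replace(text, target, replacement):
    """Replace every occurrence of target and report how many were found."""
    if not target:
        raise ValueError("target text field must not be empty")
    count = text.count(target)
    return text.replace(target, replacement), count
```

Returning the count lets the interface tell the user how many corrections were made, which is useful when a recognizer repeats the same misrecognition throughout a transcript.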
In one embodiment, the processor is further configured to:
While the voice information is being played, receive an input pause instruction;
Pause playback of the voice information according to the pause instruction.
Those skilled in the art should understand that embodiments of the present invention may be provided as a method, a system, or a computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Moreover, the present invention may take the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to magnetic disk storage and optical storage) that contain computer-usable program code.
The present invention is described with reference to flowcharts and/or block diagrams of the method, device (system), and computer program product according to embodiments of the present invention. It should be understood that each flow and/or block in the flowcharts and/or block diagrams, and combinations of flows and/or blocks in the flowcharts and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to the processor of a general-purpose computer, a special-purpose computer, an embedded processor, or another programmable data processing device to produce a machine, so that the instructions executed by the processor of the computer or other programmable data processing device produce means for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing device to operate in a particular manner, so that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means, the instruction means implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
These computer program instructions may also be loaded onto a computer or other programmable data processing device, so that a series of operation steps is performed on the computer or other programmable device to produce computer-implemented processing, whereby the instructions executed on the computer or other programmable device provide steps for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
Obviously, those skilled in the art can make various changes and modifications to the present invention without departing from its spirit and scope. Thus, if these modifications and variations of the present invention fall within the scope of the claims of the present invention and their technical equivalents, the present invention is also intended to include them.
Claims (10)
- 1. A fast and convenient voice recognition result proofreading method, characterized by comprising: obtaining a voice recognition result text corresponding to voice information; dividing a current display interface into a first display area and a second display area, and displaying the voice recognition result text in both the first display area and the second display area, wherein the second voice recognition result text in the second display area corresponds to the first voice recognition result text in the first display area, and the second voice recognition result text in the second display area corresponds to the voice information; and when a selection operation on a first target text in the second voice recognition result text is received, playing target voice information corresponding to the first target text, and positioning an editable cursor of the current display interface at a second target text, in the first voice recognition result text, that corresponds to the first target text, so that a user corrects the second target text in the first voice recognition result text according to the target voice information to obtain a corrected voice recognition result text.
- 2. The method according to claim 1, characterized in that the method further comprises: displaying a play button corresponding to the voice information in the second display area of the current display interface; and when a play operation on the voice information is received, playing the voice information, highlighting a third target text in the second voice recognition result text that corresponds to the voice information currently being played, and positioning the editable cursor of the current display interface at a fourth target text, in the first voice recognition result text, that corresponds to the third target text.
- 3. The method according to claim 1, characterized in that the method further comprises: receiving a modification instruction, input by the user, for the second target text at the position of the editable cursor; and modifying the second target text according to the modification instruction, and marking the modified second target text; and/or the method further comprises: receiving a find-and-replace instruction for a specified text field in the first voice recognition result text; and according to the find-and-replace instruction, searching for the specified text field in the first voice recognition result text, and replacing the specified text field with a new text field.
- 4. The method according to claim 1, characterized in that the method further comprises: before displaying the voice recognition result text, obtaining the voice information; recognizing the voice information again to obtain a secondary voice recognition result text; comparing the secondary voice recognition result text with the voice recognition result text to determine whether the two are consistent; when the two are consistent, displaying the voice recognition result text in the first display area and the second display area; when the two are inconsistent, determining a difference text in the secondary voice recognition result text that differs from the voice recognition result text; and displaying the voice recognition result text in the first display area, displaying the secondary voice recognition result text in the second display area, and highlighting the difference text.
- 5. The method according to claim 1, characterized in that the method further comprises: while the voice information is being played, receiving an input pause instruction; and pausing playback of the voice information according to the pause instruction.
- 6. A fast and convenient voice recognition result proofreading device, characterized by comprising: a processor; and a memory for storing processor-executable instructions; wherein the processor is configured to: obtain a voice recognition result text corresponding to voice information; divide a current display interface into a first display area and a second display area, and display the voice recognition result text in both the first display area and the second display area, wherein the second voice recognition result text in the second display area corresponds to the first voice recognition result text in the first display area, and the second voice recognition result text in the second display area corresponds to the voice information; and when a selection operation on a first target text in the second voice recognition result text is received, play target voice information corresponding to the first target text, and position an editable cursor of the current display interface at a second target text, in the first voice recognition result text, that corresponds to the first target text, so that a user corrects the second target text in the first voice recognition result text according to the target voice information to obtain a corrected voice recognition result text.
- 7. The device according to claim 6, characterized in that the processor is further configured to: display a play button corresponding to the voice information in the second display area of the current display interface; and when a play operation on the voice information is received, play the voice information, highlight a third target text in the second voice recognition result text that corresponds to the voice information currently being played, and position the editable cursor of the current display interface at a fourth target text, in the first voice recognition result text, that corresponds to the third target text.
- 8. The device according to claim 6, characterized in that the processor is further configured to: receive a modification instruction, input by the user, for the second target text at the position of the editable cursor; and modify the second target text according to the modification instruction, and mark the modified second target text; and/or the processor is further configured to: receive a find-and-replace instruction for a specified text field in the first voice recognition result text; and according to the find-and-replace instruction, search for the specified text field in the first voice recognition result text, and replace the specified text field with a new text field.
- 9. The device according to claim 6, characterized in that the processor is further configured to: before displaying the voice recognition result text, obtain the voice information; recognize the voice information again to obtain a secondary voice recognition result text; compare the secondary voice recognition result text with the voice recognition result text to determine whether the two are consistent; when the two are consistent, display the voice recognition result text in the first display area and the second display area; when the two are inconsistent, determine a difference text in the secondary voice recognition result text that differs from the voice recognition result text; and display the voice recognition result text in the first display area, display the secondary voice recognition result text in the second display area, and highlight the difference text.
- 10. The device according to claim 6, characterized in that the processor is further configured to: while the voice information is being played, receive an input pause instruction; and pause playback of the voice information according to the pause instruction.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710995682.8A CN107945802A (en) | 2017-10-23 | 2017-10-23 | Voice recognition result processing method and processing device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710995682.8A CN107945802A (en) | 2017-10-23 | 2017-10-23 | Voice recognition result processing method and processing device |
Publications (1)
Publication Number | Publication Date |
---|---|
CN107945802A true CN107945802A (en) | 2018-04-20 |
Family
ID=61935600
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710995682.8A Pending CN107945802A (en) | 2017-10-23 | 2017-10-23 | Voice recognition result processing method and processing device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107945802A (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109036423A (en) * | 2018-08-15 | 2018-12-18 | 信利半导体有限公司 | A kind of input system and method for the voice conversion text for computer |
CN109326292A (en) * | 2018-12-04 | 2019-02-12 | 北京九狐时代智能科技有限公司 | A kind of generation method and device of audio recognition result |
CN111953852A (en) * | 2020-07-30 | 2020-11-17 | 北京声智科技有限公司 | Call record generation method, device, terminal and storage medium |
CN113722423A (en) * | 2020-05-20 | 2021-11-30 | 夏普株式会社 | Information processing system, information processing method, and information processing program |
Citations (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1835077A (en) * | 2005-03-14 | 2006-09-20 | 台达电子工业股份有限公司 | Automatic speech recognizing input method and system for Chinese names |
CN101567189A (en) * | 2008-04-22 | 2009-10-28 | 株式会社Ntt都科摩 | Device, method and system for correcting voice recognition result |
CN101625619A (en) * | 2008-07-11 | 2010-01-13 | 索尼株式会社 | Information processing apparatus, information processing method, information processing system, and program |
CN101807399A (en) * | 2010-02-02 | 2010-08-18 | 华为终端有限公司 | Voice recognition method and device |
CN104679723A (en) * | 2013-11-29 | 2015-06-03 | 北京壹人壹本信息科技有限公司 | Text contrast display method, system and device |
CN105786204A (en) * | 2014-12-26 | 2016-07-20 | 联想(北京)有限公司 | Information processing method and electronic equipment |
CN106328145A (en) * | 2016-08-19 | 2017-01-11 | 北京云知声信息技术有限公司 | Voice correction method and voice correction device |
CN106448675A (en) * | 2016-10-21 | 2017-02-22 | 科大讯飞股份有限公司 | Recognition text correction method and system |
JP2017049969A (en) * | 2015-09-03 | 2017-03-09 | 凸版印刷株式会社 | Document calibration server, document calibration terminal and document calibration system |
JP2017117125A (en) * | 2015-12-22 | 2017-06-29 | 凸版印刷株式会社 | Document correction server, document correction terminal and document correction system |
-
2017
- 2017-10-23 CN CN201710995682.8A patent/CN107945802A/en active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP7069778B2 (en) | Methods, systems and programs for content curation in video-based communications | |
US20140250355A1 (en) | Time-synchronized, talking ebooks and readers | |
US8972265B1 (en) | Multiple voices in audio content | |
CN107945802A (en) | Voice recognition result processing method and processing device | |
Kafle et al. | Evaluating the usability of automatically generated captions for people who are deaf or hard of hearing | |
CN108549662A (en) | The supplement digestion procedure and device of semantic analysis result in more wheel sessions | |
CN103777774B (en) | The word error correction method of terminal installation and input method | |
CN106328145B (en) | Voice modification method and device | |
JP2015510602A (en) | Management of auxiliary information playback | |
WO2014160316A2 (en) | Device, method, and graphical user interface for a group reading environment | |
WO2014151884A2 (en) | Device, method, and graphical user interface for a group reading environment | |
US10089898B2 (en) | Information processing device, control method therefor, and computer program | |
CN109712446A (en) | Interactive learning methods based on new word detection | |
JP6627217B2 (en) | Text display device, learning method, and program | |
CN109741641A (en) | Langue leaning system based on new word detection | |
Norledge | Building The Ark: Text World Theory and the evolution of dystopian epistolary | |
CN114023301A (en) | Audio editing method, electronic device and storage medium | |
CN113378583A (en) | Dialogue reply method and device, dialogue model training method and device, and storage medium | |
Che et al. | Automatic online lecture highlighting based on multimedia analysis | |
KR20130110965A (en) | Sensibility evalution and contents recommendation method based on user feedback | |
CN110148413A (en) | Speech evaluating method and relevant apparatus | |
KR20190090636A (en) | Method for automatically editing pattern of document | |
CN111711865A (en) | Method, apparatus and storage medium for outputting data | |
US10747794B2 (en) | Smart search for annotations and inking | |
JP2018146961A (en) | Voice reproduction device and voice reproduction program |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | PB01 | Publication | |
 | SE01 | Entry into force of request for substantive examination | |
 | RJ01 | Rejection of invention patent application after publication | Application publication date: 20180420 |