CN107945802A

CN107945802A - Voice recognition result processing method and processing device

Info

Publication number: CN107945802A
Application number: CN201710995682.8A
Authority: CN
Inventors: 何世阳
Original assignee: Beijing Yunzhisheng Information Technology Co Ltd
Current assignee: Beijing Yunzhisheng Information Technology Co Ltd
Priority date: 2017-10-23
Filing date: 2017-10-23
Publication date: 2018-04-20

Abstract

The present invention be on a kind of fast and easily voice recognition result proofreading method and device, wherein, method includes：Obtain the corresponding voice recognition result text of voice messaging；Current display interface is divided into the first display area and the second display area, voice recognition result text is shown in the first display area and the second display area；When receiving the selected operation to the first object text in the second voice recognition result text, play target voice information corresponding with first object text, and the editable cursor of current display interface is positioned into the first voice recognition result text, with corresponding second target text of first object text, so that user is corrected the second target text in the first voice recognition result text according to target voice information, the voice recognition result text after being corrected.With this solution, can be with the work efficiency of user and record accuracy.

Description

Voice recognition result processing method and processing device

Technical field

The present invention relates to technical field of voice recognition, more particularly to a kind of voice recognition result processing method and processing device.

Background technology

Traditional meeting and court's trial are all that recorder carries out content record, such a mode by way of typewriting or notes Speed is slow, inefficiency, and voice cannot be put on record.With flourishing for speech recognition technology, many companies will Speech recognition technology is applied in meeting and court's trial, but since speech recognition can not possibly reach 100%, so in some intelligent front yards , it is necessary to be modified to recognition result in careful or conference system.

The content of the invention

The embodiment of the present invention provides a kind of voice recognition result processing method and processing device, facilitates user to voice to realize Recognition result is modified, so as to lift the usage experience of user.

First aspect according to embodiments of the present invention, there is provided a kind of voice recognition result processing method, including：

Obtain the corresponding voice recognition result text of voice messaging；

Current display interface is divided into the first display area and the second display area, in first display area and institute State and institute's speech recognition result text is shown in the second display area, wherein, the second voice in second display area The first voice recognition result text in recognition result text and first display area is corresponding, and second viewing area The second voice recognition result text in domain is corresponding with the voice messaging；

When receiving the selected operation to the first object text in the second voice recognition result text, play with The corresponding target voice information of the first object text, and the editable cursor of current display interface is positioned to described first In voice recognition result text, and corresponding second target text of the first object text, so that user is according to the mesh Mark voice messaging is corrected the second target text in the first voice recognition result text, the voice after being corrected Recognition result text.

In this embodiment, current display interface is divided into the first display area and the second display area, it is aobvious first Show in region and the second display area and show voice recognition result text, wherein, the second voice in the second display area is known Other resulting text and the first voice recognition result in the first display area are corresponding and corresponding with voice messaging, in this way, working as user When the first object text in the second display area is chosen in click, the corresponding target voice information of the first object text is played, And editable cursor is navigated into the second target text corresponding with first object text in the first display area, in this way, user It can greatly improve user's to be corrected when listening back to voice messaging to the second target text in the first display area Work efficiency and record accuracy so that user can listen back to real voice and correct the result of identification in time.

In one embodiment, the method further includes：

The corresponding broadcast button of voice messaging is shown in the second display area of the current display interface；

When receiving the play operation to the voice messaging, the voice messaging is played, and highlights and currently broadcasts The 3rd target text in the corresponding second voice recognition result text of voice messaging put, and by current display interface can Editor's cursor is positioned into the first voice recognition result text, the 4th target text corresponding with the 3rd target text This.

In this embodiment, the corresponding broadcasting of voice messaging can also be shown in the second display area of current display interface Button, user can click on broadcast button and listen back to voice messaging, when playing voice messaging, highlight in the second display area The 3rd target text corresponding with voice messaging, and can in synchro edit the first voice recognition result text with the 3rd text pair The 4th target text answered, consequently facilitating user carries out the amendment of voice recognition result text according to voice messaging.

In one embodiment, the method further includes：

Receive the modification instruction of the second target text input by user to editable cursor present position；

Instructed according to the modification, change second target text, and mark amended second target text.

In this embodiment, user can change the target text of editable cursor present position, after the modification, can mark Go out the target text of modification, so as to be contrasted with the voice recognition result text in the second display area, checked easy to user Modification part.In addition, user can also in the first display area automatic moving editable cursor position, to the first viewing area Other target texts in domain are modified.

In one embodiment, the method further includes：

Before display institute speech recognition result text, the voice messaging is obtained；

The voice messaging is identified again, obtains secondary voice recognition result text；

By the secondary voice recognition result text compared with institute speech recognition result text, whether both are determined Unanimously；

When both are consistent, the speech recognition is shown on first display area and second display area Resulting text；

When both are inconsistent, determine in the secondary voice recognition result text with institute speech recognition result text not Same difference text；

Institute's speech recognition result text is shown on first display area, is shown on second display area The secondary voice recognition result text, and highlight the difference text.

In this embodiment, can also be to voice messaging before current display interface shows voice recognition result text Identified again, obtain secondary voice recognition result text, so that it is determined that the voice recognition result text identified twice is It is no consistent, if both are inconsistent, secondary voice recognition result text is shown in the second display area, in the first display area Show voice recognition result text, and both difference texts are highlighted in the second display area, in this way, facilitating user's root Carry out listening back to voice messaging according to highlighted difference text and carry out the amendment of voice recognition result text.

In one embodiment, the method further includes：

Receive the lookup replacement instruction to specifying the text field in the first voice recognition result text；

According to the lookup replacement instruction, the specified text word is searched in the first voice recognition result text Section, and replace described specified the text field using new the text field.

In this embodiment, user can also be to the field in the first voice recognition result text in the first display area Lookup replacement is carried out, consequently facilitating user changes voice recognition result text.

In one embodiment, the method further includes：

When playing voice messaging, the pause play instruction of input is received；

According to the pause play instruction, pause plays the voice messaging.

In this embodiment, during voice messaging is played, user is also an option that pause plays voice messaging, from And further lift the usage experience of user.

Second aspect according to embodiments of the present invention, there is provided a kind of voice recognition result processing unit, including：

Processor；

For storing the memory of processor-executable instruction；

Wherein, the processor is configured as：

Obtain the corresponding voice recognition result text of voice messaging；

In one embodiment, the processor is additionally configured to：

According to the pause play instruction, pause plays the voice messaging.

It should be appreciated that the general description and following detailed description of the above are only exemplary and explanatory, not Can the limitation present invention.

Other features and advantages of the present invention will be illustrated in the following description, also, partly becomes from specification Obtain it is clear that or being understood by implementing the present invention.The purpose of the present invention and other advantages can be by the explanations write Specifically noted structure is realized and obtained in book, claims and attached drawing.

Below by drawings and examples, technical scheme is described in further detail.

Brief description of the drawings

Attached drawing herein is merged in specification and forms the part of this specification, shows the implementation for meeting the present invention Example, and for explaining the principle of the present invention together with specification.

Fig. 1 is a kind of flow chart of voice recognition result processing method according to an exemplary embodiment.

Fig. 2 is the flow chart of another voice recognition result processing method according to an exemplary embodiment.

Fig. 3 is the flow chart of another voice recognition result processing method according to an exemplary embodiment.

Fig. 4 is the flow chart of another voice recognition result processing method according to an exemplary embodiment.

Fig. 5 is the flow chart of another voice recognition result processing method according to an exemplary embodiment.

Fig. 6 is a kind of screenshot capture of current display interface according to an exemplary embodiment.

Fig. 7 is the flow chart of another voice recognition result processing method according to an exemplary embodiment.

Fig. 8 is the screenshot capture of another current display interface according to an exemplary embodiment.

Embodiment

Here exemplary embodiment will be illustrated in detail, its example is illustrated in the accompanying drawings.Following description is related to During attached drawing, unless otherwise indicated, the same numbers in different attached drawings represent the same or similar key element.Following exemplary embodiment Described in embodiment do not represent and the consistent all embodiments of the present invention.On the contrary, they be only with it is such as appended The example of the consistent apparatus and method of some aspects being described in detail in claims, of the invention.

Fig. 1 is a kind of flow chart of voice recognition result processing method according to an exemplary embodiment.The voice Recognition result processing method is applied in terminal device, which can be mobile phone, computer, and digital broadcasting is whole End, messaging devices, game console, tablet device, Medical Devices, body-building equipment, personal digital assistant etc. are any to be had The equipment of speech identifying function.As shown in Figure 1, the method comprising the steps of S101-S103：

In step S101, the corresponding voice recognition result text of voice messaging is obtained；

In step s 102, current display interface is divided into the first display area and the second display area, it is aobvious first Show in region and the second display area and show voice recognition result text, wherein, the second voice in the second display area is known The first voice recognition result text in other resulting text and the first display area is corresponding, and second in the second display area Voice recognition result text is corresponding with voice messaging；Wherein, the second voice recognition result text in the second display area For checking, it is impossible to modification and editor；And the first voice recognition result text in the first display area can modify and Editor.

In step s 103, when receiving the selected operation to the first object text in the second voice recognition result text When, play corresponding with first object text target voice information, and the editable cursor of current display interface is positioned to the In one voice recognition result text, and corresponding second target text of first object text, so that user is according to target voice Information is corrected the second target text in the first voice recognition result text, the voice recognition result text after being corrected This.

In this embodiment, current display interface is divided into the first display area and the second display area, it is aobvious first Show in region and the second display area and show voice recognition result text, wherein, the second voice in the second display area is known Other resulting text and the first voice recognition result in the first display area are corresponding and corresponding with voice messaging, in this way, working as user When the first object text in the second display area is chosen in click, the corresponding target voice information of the first object text is played, And editable cursor is navigated into the second target text corresponding with first object text in the first display area, in this way, user User person can be greatly improved to be corrected when listening back to voice messaging to the second target text in the first display area Work efficiency and record accuracy so that user can listen back to real voice and correct the result of identification in time.

As shown in Fig. 2, in one embodiment, the above method further includes step S201-S202：

In step s 201, the corresponding broadcast button of voice messaging is shown in the second display area of current display interface；

In step S202, when receiving the play operation to voice messaging, voice messaging is played, and highlight and work as The 3rd target text in the corresponding second voice recognition result text of voice messaging of preceding broadcasting, and by current display interface Editable cursor position into the first voice recognition result text, with corresponding 4th target text of the 3rd target text.

As shown in figure 3, in one embodiment, the above method further includes step S301-S302：

In step S301, the modification for receiving the second target text input by user to editable cursor present position refers to Order；

In step s 302, instructed according to modification, change the second target text, and mark amended second target text This.Wherein, it can highlight amended second target text to mark amended second target text, can also be with it His mode shows the difference between the second target text and other texts.

As shown in figure 4, in one embodiment, the above method further includes step S401-S402：

In step S401, the lookup replacement instruction to specifying the text field in the first voice recognition result text is received；

In step S402, according to replacement instruction is searched, searched in the first voice recognition result text and specify text word Section, and replaced using new the text field and specify the text field.

As shown in figure 5, in one embodiment, the above method further includes step S501-S502：

In step S501, when playing voice messaging, the pause play instruction of input is received；

In step S502, according to pause play instruction, pause plays voice messaging.

Above-mentioned technical proposal is described in detail with a specific embodiment below.

As shown in fig. 6, in current display interface 60, two display areas can be divided to, the first display area and second show Show region, two display areas or so distribution, voice recognition result text is shown two display areas, wherein, the left side Display area is editable voice recognition result text, and the display area on the right is not editable voice recognition result text This, user, which clicks on, chooses the target text of right area, then plays the corresponding voice messaging of the target text, in left area, Text corresponding with the target text of right area is also selected, and is editable cursor position, in this way, can basis The voice messaging of broadcasting is modified the target text in left area.As shown in fig. 6, in current display interface, it is also aobvious It is shown with lookup and replaces option, user can search the text specified, keyword specified etc., and then by replacing option by the left side Specified text or keyword in region in voice recognition result text replace with new text or keyword, so that in some language When sound identification is inaccurate, voice recognition result is changed easy to user.For example, when carrying out speech recognition, the institute in voice messaging There is " school " to be identified as " learning " in recognition result, in this way, when user has found the mistake, can be after option be searched Input " school ", " study " is inputted after replacing option, in this way, all " schools " in left area in voice recognition result text All it is modified to " learn ", user need not modify one by one, reduce the operation of user, and improve user uses body Test.In addition, in current display interface, listening back to button of Denging also is shown, user can listen back to voice messaging by the button, In current display interface, also show button, the users such as pause and listening back to voice messaging, can be with if midway needs to interrupt Press pause button by touching, pause plays voice messaging, it is necessary to when playing out, then listens tactile press the button to continue to broadcast Put.Certainly, it on interface, can also show buttons such as " printing " " export ", know consequently facilitating user exports revised voice Other resulting text.

As shown in fig. 7, in one embodiment, the above method further includes step S701-S706：

In step s 701, before voice recognition result text is shown, voice messaging is obtained；

In step S702, voice messaging is identified again, obtains secondary voice recognition result text；

In step S703, by secondary voice recognition result text compared with voice recognition result text, two are determined Whether person is consistent；

In step S704, when both are consistent, show that voice is known on the first display area and the second display area Other resulting text；

In step S705, when both are inconsistent, determine in secondary voice recognition result text with voice recognition result The different difference text of text；

In step S706, voice recognition result text is shown on the first display area, is shown on the second display area Show secondary voice recognition result text, and highlight difference text.

Current display interface 80 is illustrated in figure 8, in order to further improve discrimination and cause user's note that can be with Take secondary identification, i.e. same section of voice messaging can be sent to be identified to identification engine twice, compares recognition result twice, if Identification is consistent twice, then second of recognition result, if recognition result is inconsistent twice, secondary identification is tied just without displaying Fruit is shown, wherein, the left side shows a voice recognition result, and the right shows secondary voice recognition result, and by secondary knowledge Other result is marked, and such as highlights, to cause the attention of user.

Processor；

For storing the memory of processor-executable instruction；

Wherein, the processor is configured as：

Obtain the corresponding voice recognition result text of voice messaging；

In one embodiment, the processor is additionally configured to：

According to the pause play instruction, pause plays the voice messaging.

It should be understood by those skilled in the art that, the embodiment of the present invention can be provided as method, system or computer program Product.Therefore, the present invention can use the reality in terms of complete hardware embodiment, complete software embodiment or combination software and hardware Apply the form of example.Moreover, the present invention can use the computer for wherein including computer usable program code in one or more The shape for the computer program product that usable storage medium is implemented on (including but not limited to magnetic disk storage and optical memory etc.) Formula.

The present invention be with reference to according to the method for the embodiment of the present invention, the flow of equipment (system) and computer program product Figure and/or block diagram describe.It should be understood that it can be realized by computer program instructions every first-class in flowchart and/or the block diagram The combination of flow and/or square frame in journey and/or square frame and flowchart and/or the block diagram.These computer programs can be provided The processors of all-purpose computer, special purpose computer, Embedded Processor or other programmable data processing devices is instructed to produce A raw machine so that the instruction performed by computer or the processor of other programmable data processing devices, which produces, to be used in fact The device for the function of being specified in present one flow of flow chart or one square frame of multiple flows and/or block diagram or multiple square frames.

These computer program instructions, which may also be stored in, can guide computer or other programmable data processing devices with spy Determine in the computer-readable memory that mode works so that the instruction being stored in the computer-readable memory, which produces, to be included referring to Make the manufacture of device, the command device realize in one flow of flow chart or multiple flows and/or one square frame of block diagram or The function of being specified in multiple square frames.

These computer program instructions can be also loaded into computer or other programmable data processing devices so that counted Series of operation steps is performed on calculation machine or other programmable devices to produce computer implemented processing, thus in computer or The instruction performed on other programmable devices is provided and is used for realization in one flow of flow chart or multiple flows and/or block diagram one The step of function of being specified in a square frame or multiple square frames.

Obviously, various changes and modifications can be made to the invention without departing from essence of the invention by those skilled in the art God and scope.In this way, if these modifications and changes of the present invention belongs to the scope of the claims in the present invention and its equivalent technologies Within, then the present invention is also intended to comprising including these modification and variations.

Claims

A kind of 1. fast and easily voice recognition result proofreading method, it is characterised in that including：

Obtain the corresponding voice recognition result text of voice messaging；

Current display interface is divided into the first display area and the second display area, in first display area and described Institute's speech recognition result text is shown in two display areas, wherein, the second speech recognition in second display area The first voice recognition result text in resulting text and first display area is corresponding, and in second display area The second voice recognition result text it is corresponding with the voice messaging；

When receiving the selected operation to the first object text in the second voice recognition result text, play with it is described The corresponding target voice information of first object text, and the editable cursor of current display interface is positioned to first voice In recognition result text, and corresponding second target text of the first object text, so that user is according to the target language Message breath is corrected the second target text in the first voice recognition result text, the speech recognition after being corrected Resulting text.
2. according to the method described in claim 1, it is characterized in that, the method further includes：

The corresponding broadcast button of voice messaging is shown in the second display area of the current display interface；

When receiving the play operation to the voice messaging, the voice messaging is played, and is highlighted currently playing The 3rd target text in the corresponding second voice recognition result text of voice messaging, and the editable by current display interface Cursor is positioned into the first voice recognition result text, with corresponding 4th target text of the 3rd target text.
3. according to the method described in claim 1, it is characterized in that,

The method further includes：

Receive the modification instruction of the second target text input by user to editable cursor present position；Referred to according to the modification Order, changes second target text, and marks amended second target text, and marks amended second target text This；

And/or

The method further includes：

Receive the lookup replacement instruction to specifying the text field in the first voice recognition result text；Replaced according to the lookup Instruction is changed, described specified the text field is searched in the first voice recognition result text, and replace using new the text field Change described specified the text field.
4. according to the method described in claim 1, it is characterized in that, the method further includes：

Before display institute speech recognition result text, the voice messaging is obtained；

The voice messaging is identified again, obtains secondary voice recognition result text；

By the secondary voice recognition result text compared with institute speech recognition result text, determine both whether one Cause；

When both are consistent, institute's speech recognition result is shown on first display area and second display area Text；

When both are inconsistent, determine different from institute speech recognition result text in the secondary voice recognition result text Difference text；

Institute's speech recognition result text is shown on first display area, on second display area described in display Secondary voice recognition result text, and highlight the difference text.
5. according to the method described in claim 1, it is characterized in that, the method further includes：

When playing voice messaging, the pause play instruction of input is received；

According to the pause play instruction, pause plays the voice messaging.
A kind of 6. fast and easily voice recognition result verifying unit, it is characterised in that including：

Processor；

For storing the memory of processor-executable instruction；

Wherein, the processor is configured as：

Obtain the corresponding voice recognition result text of voice messaging；

Current display interface is divided into the first display area and the second display area, in first display area and described Institute's speech recognition result text is shown in two display areas, wherein, the second speech recognition in second display area The first voice recognition result text in resulting text and first display area is corresponding, and in second display area The second voice recognition result text it is corresponding with the voice messaging；

When receiving the selected operation to the first object text in the second voice recognition result text, play with it is described The corresponding target voice information of first object text, and the editable cursor of current display interface is positioned to first voice In recognition result text, and corresponding second target text of the first object text, so that user is according to the target language Message breath is corrected the second target text in the first voice recognition result text, the speech recognition after being corrected Resulting text.
7. device according to claim 6, it is characterised in that the processor is additionally configured to：

The corresponding broadcast button of voice messaging is shown in the second display area of the current display interface；

When receiving the play operation to the voice messaging, the voice messaging is played, and is highlighted currently playing The 3rd target text in the corresponding second voice recognition result text of voice messaging, and the editable by current display interface Cursor is positioned into the first voice recognition result text, with corresponding 4th target text of the 3rd target text.
8. device according to claim 6, it is characterised in that

The processor is additionally configured to：

Receive the modification instruction of the second target text input by user to editable cursor present position；Referred to according to the modification Order, changes second target text, and marks amended second target text；

And/or

The processor is additionally configured to：

Receive the lookup replacement instruction to specifying the text field in the first voice recognition result text；Replaced according to the lookup Instruction is changed, described specified the text field is searched in the first voice recognition result text, and replace using new the text field Change described specified the text field.
9. device according to claim 6, it is characterised in that the processor is additionally configured to：

Before display institute speech recognition result text, the voice messaging is obtained；

The voice messaging is identified again, obtains secondary voice recognition result text；

By the secondary voice recognition result text compared with institute speech recognition result text, determine both whether one Cause；

When both are consistent, institute's speech recognition result is shown on first display area and second display area Text；

When both are inconsistent, determine different from institute speech recognition result text in the secondary voice recognition result text Difference text；

Institute's speech recognition result text is shown on first display area, on second display area described in display Secondary voice recognition result text, and highlight the difference text.
10. device according to claim 6, it is characterised in that the processor is additionally configured to：

When playing voice messaging, the pause play instruction of input is received；

According to the pause play instruction, pause plays the voice messaging.