CN107945802A - Voice recognition result processing method and processing device - Google Patents

Voice recognition result processing method and processing device Download PDF

Info

Publication number
CN107945802A
CN107945802A CN201710995682.8A CN201710995682A CN107945802A CN 107945802 A CN107945802 A CN 107945802A CN 201710995682 A CN201710995682 A CN 201710995682A CN 107945802 A CN107945802 A CN 107945802A
Authority
CN
China
Prior art keywords
text
recognition result
voice
voice recognition
display area
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710995682.8A
Other languages
Chinese (zh)
Inventor
何世阳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Yunzhisheng Information Technology Co Ltd
Original Assignee
Beijing Yunzhisheng Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Yunzhisheng Information Technology Co Ltd filed Critical Beijing Yunzhisheng Information Technology Co Ltd
Priority to CN201710995682.8A priority Critical patent/CN107945802A/en
Publication of CN107945802A publication Critical patent/CN107945802A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/06Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
    • G10L21/10Transforming into visible information
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Data Mining & Analysis (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The present invention be on a kind of fast and easily voice recognition result proofreading method and device, wherein, method includes:Obtain the corresponding voice recognition result text of voice messaging;Current display interface is divided into the first display area and the second display area, voice recognition result text is shown in the first display area and the second display area;When receiving the selected operation to the first object text in the second voice recognition result text, play target voice information corresponding with first object text, and the editable cursor of current display interface is positioned into the first voice recognition result text, with corresponding second target text of first object text, so that user is corrected the second target text in the first voice recognition result text according to target voice information, the voice recognition result text after being corrected.With this solution, can be with the work efficiency of user and record accuracy.

Description

Voice recognition result processing method and processing device
Technical field
The present invention relates to technical field of voice recognition, more particularly to a kind of voice recognition result processing method and processing device.
Background technology
Traditional meeting and court's trial are all that recorder carries out content record, such a mode by way of typewriting or notes Speed is slow, inefficiency, and voice cannot be put on record.With flourishing for speech recognition technology, many companies will Speech recognition technology is applied in meeting and court's trial, but since speech recognition can not possibly reach 100%, so in some intelligent front yards , it is necessary to be modified to recognition result in careful or conference system.
The content of the invention
The embodiment of the present invention provides a kind of voice recognition result processing method and processing device, facilitates user to voice to realize Recognition result is modified, so as to lift the usage experience of user.
First aspect according to embodiments of the present invention, there is provided a kind of voice recognition result processing method, including:
Obtain the corresponding voice recognition result text of voice messaging;
Current display interface is divided into the first display area and the second display area, in first display area and institute State and institute's speech recognition result text is shown in the second display area, wherein, the second voice in second display area The first voice recognition result text in recognition result text and first display area is corresponding, and second viewing area The second voice recognition result text in domain is corresponding with the voice messaging;
When receiving the selected operation to the first object text in the second voice recognition result text, play with The corresponding target voice information of the first object text, and the editable cursor of current display interface is positioned to described first In voice recognition result text, and corresponding second target text of the first object text, so that user is according to the mesh Mark voice messaging is corrected the second target text in the first voice recognition result text, the voice after being corrected Recognition result text.
In this embodiment, current display interface is divided into the first display area and the second display area, it is aobvious first Show in region and the second display area and show voice recognition result text, wherein, the second voice in the second display area is known Other resulting text and the first voice recognition result in the first display area are corresponding and corresponding with voice messaging, in this way, working as user When the first object text in the second display area is chosen in click, the corresponding target voice information of the first object text is played, And editable cursor is navigated into the second target text corresponding with first object text in the first display area, in this way, user It can greatly improve user's to be corrected when listening back to voice messaging to the second target text in the first display area Work efficiency and record accuracy so that user can listen back to real voice and correct the result of identification in time.
In one embodiment, the method further includes:
The corresponding broadcast button of voice messaging is shown in the second display area of the current display interface;
When receiving the play operation to the voice messaging, the voice messaging is played, and highlights and currently broadcasts The 3rd target text in the corresponding second voice recognition result text of voice messaging put, and by current display interface can Editor's cursor is positioned into the first voice recognition result text, the 4th target text corresponding with the 3rd target text This.
In this embodiment, the corresponding broadcasting of voice messaging can also be shown in the second display area of current display interface Button, user can click on broadcast button and listen back to voice messaging, when playing voice messaging, highlight in the second display area The 3rd target text corresponding with voice messaging, and can in synchro edit the first voice recognition result text with the 3rd text pair The 4th target text answered, consequently facilitating user carries out the amendment of voice recognition result text according to voice messaging.
In one embodiment, the method further includes:
Receive the modification instruction of the second target text input by user to editable cursor present position;
Instructed according to the modification, change second target text, and mark amended second target text.
In this embodiment, user can change the target text of editable cursor present position, after the modification, can mark Go out the target text of modification, so as to be contrasted with the voice recognition result text in the second display area, checked easy to user Modification part.In addition, user can also in the first display area automatic moving editable cursor position, to the first viewing area Other target texts in domain are modified.
In one embodiment, the method further includes:
Before display institute speech recognition result text, the voice messaging is obtained;
The voice messaging is identified again, obtains secondary voice recognition result text;
By the secondary voice recognition result text compared with institute speech recognition result text, whether both are determined Unanimously;
When both are consistent, the speech recognition is shown on first display area and second display area Resulting text;
When both are inconsistent, determine in the secondary voice recognition result text with institute speech recognition result text not Same difference text;
Institute's speech recognition result text is shown on first display area, is shown on second display area The secondary voice recognition result text, and highlight the difference text.
In this embodiment, can also be to voice messaging before current display interface shows voice recognition result text Identified again, obtain secondary voice recognition result text, so that it is determined that the voice recognition result text identified twice is It is no consistent, if both are inconsistent, secondary voice recognition result text is shown in the second display area, in the first display area Show voice recognition result text, and both difference texts are highlighted in the second display area, in this way, facilitating user's root Carry out listening back to voice messaging according to highlighted difference text and carry out the amendment of voice recognition result text.
In one embodiment, the method further includes:
Receive the lookup replacement instruction to specifying the text field in the first voice recognition result text;
According to the lookup replacement instruction, the specified text word is searched in the first voice recognition result text Section, and replace described specified the text field using new the text field.
In this embodiment, user can also be to the field in the first voice recognition result text in the first display area Lookup replacement is carried out, consequently facilitating user changes voice recognition result text.
In one embodiment, the method further includes:
When playing voice messaging, the pause play instruction of input is received;
According to the pause play instruction, pause plays the voice messaging.
In this embodiment, during voice messaging is played, user is also an option that pause plays voice messaging, from And further lift the usage experience of user.
Second aspect according to embodiments of the present invention, there is provided a kind of voice recognition result processing unit, including:
Processor;
For storing the memory of processor-executable instruction;
Wherein, the processor is configured as:
Obtain the corresponding voice recognition result text of voice messaging;
Current display interface is divided into the first display area and the second display area, in first display area and institute State and institute's speech recognition result text is shown in the second display area, wherein, the second voice in second display area The first voice recognition result text in recognition result text and first display area is corresponding, and second viewing area The second voice recognition result text in domain is corresponding with the voice messaging;
When receiving the selected operation to the first object text in the second voice recognition result text, play with The corresponding target voice information of the first object text, and the editable cursor of current display interface is positioned to described first In voice recognition result text, and corresponding second target text of the first object text, so that user is according to the mesh Mark voice messaging is corrected the second target text in the first voice recognition result text, the voice after being corrected Recognition result text.
In one embodiment, the processor is additionally configured to:
The corresponding broadcast button of voice messaging is shown in the second display area of the current display interface;
When receiving the play operation to the voice messaging, the voice messaging is played, and highlights and currently broadcasts The 3rd target text in the corresponding second voice recognition result text of voice messaging put, and by current display interface can Editor's cursor is positioned into the first voice recognition result text, the 4th target text corresponding with the 3rd target text This.
In one embodiment, the processor is additionally configured to:
Receive the modification instruction of the second target text input by user to editable cursor present position;
Instructed according to the modification, change second target text, and mark amended second target text.
In one embodiment, the processor is additionally configured to:
Before display institute speech recognition result text, the voice messaging is obtained;
The voice messaging is identified again, obtains secondary voice recognition result text;
By the secondary voice recognition result text compared with institute speech recognition result text, whether both are determined Unanimously;
When both are consistent, the speech recognition is shown on first display area and second display area Resulting text;
When both are inconsistent, determine in the secondary voice recognition result text with institute speech recognition result text not Same difference text;
Institute's speech recognition result text is shown on first display area, is shown on second display area The secondary voice recognition result text, and highlight the difference text.
In one embodiment, the processor is additionally configured to:
Receive the lookup replacement instruction to specifying the text field in the first voice recognition result text;
According to the lookup replacement instruction, the specified text word is searched in the first voice recognition result text Section, and replace described specified the text field using new the text field.
In one embodiment, the processor is additionally configured to:
When playing voice messaging, the pause play instruction of input is received;
According to the pause play instruction, pause plays the voice messaging.
It should be appreciated that the general description and following detailed description of the above are only exemplary and explanatory, not Can the limitation present invention.
Other features and advantages of the present invention will be illustrated in the following description, also, partly becomes from specification Obtain it is clear that or being understood by implementing the present invention.The purpose of the present invention and other advantages can be by the explanations write Specifically noted structure is realized and obtained in book, claims and attached drawing.
Below by drawings and examples, technical scheme is described in further detail.
Brief description of the drawings
Attached drawing herein is merged in specification and forms the part of this specification, shows the implementation for meeting the present invention Example, and for explaining the principle of the present invention together with specification.
Fig. 1 is a kind of flow chart of voice recognition result processing method according to an exemplary embodiment.
Fig. 2 is the flow chart of another voice recognition result processing method according to an exemplary embodiment.
Fig. 3 is the flow chart of another voice recognition result processing method according to an exemplary embodiment.
Fig. 4 is the flow chart of another voice recognition result processing method according to an exemplary embodiment.
Fig. 5 is the flow chart of another voice recognition result processing method according to an exemplary embodiment.
Fig. 6 is a kind of screenshot capture of current display interface according to an exemplary embodiment.
Fig. 7 is the flow chart of another voice recognition result processing method according to an exemplary embodiment.
Fig. 8 is the screenshot capture of another current display interface according to an exemplary embodiment.
Embodiment
Here exemplary embodiment will be illustrated in detail, its example is illustrated in the accompanying drawings.Following description is related to During attached drawing, unless otherwise indicated, the same numbers in different attached drawings represent the same or similar key element.Following exemplary embodiment Described in embodiment do not represent and the consistent all embodiments of the present invention.On the contrary, they be only with it is such as appended The example of the consistent apparatus and method of some aspects being described in detail in claims, of the invention.
Fig. 1 is a kind of flow chart of voice recognition result processing method according to an exemplary embodiment.The voice Recognition result processing method is applied in terminal device, which can be mobile phone, computer, and digital broadcasting is whole End, messaging devices, game console, tablet device, Medical Devices, body-building equipment, personal digital assistant etc. are any to be had The equipment of speech identifying function.As shown in Figure 1, the method comprising the steps of S101-S103:
In step S101, the corresponding voice recognition result text of voice messaging is obtained;
In step s 102, current display interface is divided into the first display area and the second display area, it is aobvious first Show in region and the second display area and show voice recognition result text, wherein, the second voice in the second display area is known The first voice recognition result text in other resulting text and the first display area is corresponding, and second in the second display area Voice recognition result text is corresponding with voice messaging;Wherein, the second voice recognition result text in the second display area For checking, it is impossible to modification and editor;And the first voice recognition result text in the first display area can modify and Editor.
In step s 103, when receiving the selected operation to the first object text in the second voice recognition result text When, play corresponding with first object text target voice information, and the editable cursor of current display interface is positioned to the In one voice recognition result text, and corresponding second target text of first object text, so that user is according to target voice Information is corrected the second target text in the first voice recognition result text, the voice recognition result text after being corrected This.
In this embodiment, current display interface is divided into the first display area and the second display area, it is aobvious first Show in region and the second display area and show voice recognition result text, wherein, the second voice in the second display area is known Other resulting text and the first voice recognition result in the first display area are corresponding and corresponding with voice messaging, in this way, working as user When the first object text in the second display area is chosen in click, the corresponding target voice information of the first object text is played, And editable cursor is navigated into the second target text corresponding with first object text in the first display area, in this way, user User person can be greatly improved to be corrected when listening back to voice messaging to the second target text in the first display area Work efficiency and record accuracy so that user can listen back to real voice and correct the result of identification in time.
Fig. 2 is the flow chart of another voice recognition result processing method according to an exemplary embodiment.
As shown in Fig. 2, in one embodiment, the above method further includes step S201-S202:
In step s 201, the corresponding broadcast button of voice messaging is shown in the second display area of current display interface;
In step S202, when receiving the play operation to voice messaging, voice messaging is played, and highlight and work as The 3rd target text in the corresponding second voice recognition result text of voice messaging of preceding broadcasting, and by current display interface Editable cursor position into the first voice recognition result text, with corresponding 4th target text of the 3rd target text.
In this embodiment, the corresponding broadcasting of voice messaging can also be shown in the second display area of current display interface Button, user can click on broadcast button and listen back to voice messaging, when playing voice messaging, highlight in the second display area The 3rd target text corresponding with voice messaging, and can in synchro edit the first voice recognition result text with the 3rd text pair The 4th target text answered, consequently facilitating user carries out the amendment of voice recognition result text according to voice messaging.
Fig. 3 is the flow chart of another voice recognition result processing method according to an exemplary embodiment.
As shown in figure 3, in one embodiment, the above method further includes step S301-S302:
In step S301, the modification for receiving the second target text input by user to editable cursor present position refers to Order;
In step s 302, instructed according to modification, change the second target text, and mark amended second target text This.Wherein, it can highlight amended second target text to mark amended second target text, can also be with it His mode shows the difference between the second target text and other texts.
In this embodiment, user can change the target text of editable cursor present position, after the modification, can mark Go out the target text of modification, so as to be contrasted with the voice recognition result text in the second display area, checked easy to user Modification part.In addition, user can also in the first display area automatic moving editable cursor position, to the first viewing area Other target texts in domain are modified.
Fig. 4 is the flow chart of another voice recognition result processing method according to an exemplary embodiment.
As shown in figure 4, in one embodiment, the above method further includes step S401-S402:
In step S401, the lookup replacement instruction to specifying the text field in the first voice recognition result text is received;
In step S402, according to replacement instruction is searched, searched in the first voice recognition result text and specify text word Section, and replaced using new the text field and specify the text field.
In this embodiment, user can also be to the field in the first voice recognition result text in the first display area Lookup replacement is carried out, consequently facilitating user changes voice recognition result text.
Fig. 5 is the flow chart of another voice recognition result processing method according to an exemplary embodiment.
As shown in figure 5, in one embodiment, the above method further includes step S501-S502:
In step S501, when playing voice messaging, the pause play instruction of input is received;
In step S502, according to pause play instruction, pause plays voice messaging.
In this embodiment, during voice messaging is played, user is also an option that pause plays voice messaging, from And further lift the usage experience of user.
Above-mentioned technical proposal is described in detail with a specific embodiment below.
As shown in fig. 6, in current display interface 60, two display areas can be divided to, the first display area and second show Show region, two display areas or so distribution, voice recognition result text is shown two display areas, wherein, the left side Display area is editable voice recognition result text, and the display area on the right is not editable voice recognition result text This, user, which clicks on, chooses the target text of right area, then plays the corresponding voice messaging of the target text, in left area, Text corresponding with the target text of right area is also selected, and is editable cursor position, in this way, can basis The voice messaging of broadcasting is modified the target text in left area.As shown in fig. 6, in current display interface, it is also aobvious It is shown with lookup and replaces option, user can search the text specified, keyword specified etc., and then by replacing option by the left side Specified text or keyword in region in voice recognition result text replace with new text or keyword, so that in some language When sound identification is inaccurate, voice recognition result is changed easy to user.For example, when carrying out speech recognition, the institute in voice messaging There is " school " to be identified as " learning " in recognition result, in this way, when user has found the mistake, can be after option be searched Input " school ", " study " is inputted after replacing option, in this way, all " schools " in left area in voice recognition result text All it is modified to " learn ", user need not modify one by one, reduce the operation of user, and improve user uses body Test.In addition, in current display interface, listening back to button of Denging also is shown, user can listen back to voice messaging by the button, In current display interface, also show button, the users such as pause and listening back to voice messaging, can be with if midway needs to interrupt Press pause button by touching, pause plays voice messaging, it is necessary to when playing out, then listens tactile press the button to continue to broadcast Put.Certainly, it on interface, can also show buttons such as " printing " " export ", know consequently facilitating user exports revised voice Other resulting text.
Fig. 7 is the flow chart of another voice recognition result processing method according to an exemplary embodiment.
As shown in fig. 7, in one embodiment, the above method further includes step S701-S706:
In step s 701, before voice recognition result text is shown, voice messaging is obtained;
In step S702, voice messaging is identified again, obtains secondary voice recognition result text;
In step S703, by secondary voice recognition result text compared with voice recognition result text, two are determined Whether person is consistent;
In step S704, when both are consistent, show that voice is known on the first display area and the second display area Other resulting text;
In step S705, when both are inconsistent, determine in secondary voice recognition result text with voice recognition result The different difference text of text;
In step S706, voice recognition result text is shown on the first display area, is shown on the second display area Show secondary voice recognition result text, and highlight difference text.
In this embodiment, can also be to voice messaging before current display interface shows voice recognition result text Identified again, obtain secondary voice recognition result text, so that it is determined that the voice recognition result text identified twice is It is no consistent, if both are inconsistent, secondary voice recognition result text is shown in the second display area, in the first display area Show voice recognition result text, and both difference texts are highlighted in the second display area, in this way, facilitating user's root Carry out listening back to voice messaging according to highlighted difference text and carry out the amendment of voice recognition result text.
Current display interface 80 is illustrated in figure 8, in order to further improve discrimination and cause user's note that can be with Take secondary identification, i.e. same section of voice messaging can be sent to be identified to identification engine twice, compares recognition result twice, if Identification is consistent twice, then second of recognition result, if recognition result is inconsistent twice, secondary identification is tied just without displaying Fruit is shown, wherein, the left side shows a voice recognition result, and the right shows secondary voice recognition result, and by secondary knowledge Other result is marked, and such as highlights, to cause the attention of user.
Second aspect according to embodiments of the present invention, there is provided a kind of voice recognition result processing unit, including:
Processor;
For storing the memory of processor-executable instruction;
Wherein, the processor is configured as:
Obtain the corresponding voice recognition result text of voice messaging;
Current display interface is divided into the first display area and the second display area, in first display area and institute State and institute's speech recognition result text is shown in the second display area, wherein, the second voice in second display area The first voice recognition result text in recognition result text and first display area is corresponding, and second viewing area The second voice recognition result text in domain is corresponding with the voice messaging;
When receiving the selected operation to the first object text in the second voice recognition result text, play with The corresponding target voice information of the first object text, and the editable cursor of current display interface is positioned to described first In voice recognition result text, and corresponding second target text of the first object text, so that user is according to the mesh Mark voice messaging is corrected the second target text in the first voice recognition result text, the voice after being corrected Recognition result text.
In one embodiment, the processor is additionally configured to:
The corresponding broadcast button of voice messaging is shown in the second display area of the current display interface;
When receiving the play operation to the voice messaging, the voice messaging is played, and highlights and currently broadcasts The 3rd target text in the corresponding second voice recognition result text of voice messaging put, and by current display interface can Editor's cursor is positioned into the first voice recognition result text, the 4th target text corresponding with the 3rd target text This.
In one embodiment, the processor is additionally configured to:
Receive the modification instruction of the second target text input by user to editable cursor present position;
Instructed according to the modification, change second target text, and mark amended second target text.
In one embodiment, the processor is additionally configured to:
Before display institute speech recognition result text, the voice messaging is obtained;
The voice messaging is identified again, obtains secondary voice recognition result text;
By the secondary voice recognition result text compared with institute speech recognition result text, whether both are determined Unanimously;
When both are consistent, the speech recognition is shown on first display area and second display area Resulting text;
When both are inconsistent, determine in the secondary voice recognition result text with institute speech recognition result text not Same difference text;
Institute's speech recognition result text is shown on first display area, is shown on second display area The secondary voice recognition result text, and highlight the difference text.
In one embodiment, the processor is additionally configured to:
Receive the lookup replacement instruction to specifying the text field in the first voice recognition result text;
According to the lookup replacement instruction, the specified text word is searched in the first voice recognition result text Section, and replace described specified the text field using new the text field.
In one embodiment, the processor is additionally configured to:
When playing voice messaging, the pause play instruction of input is received;
According to the pause play instruction, pause plays the voice messaging.
It should be understood by those skilled in the art that, the embodiment of the present invention can be provided as method, system or computer program Product.Therefore, the present invention can use the reality in terms of complete hardware embodiment, complete software embodiment or combination software and hardware Apply the form of example.Moreover, the present invention can use the computer for wherein including computer usable program code in one or more The shape for the computer program product that usable storage medium is implemented on (including but not limited to magnetic disk storage and optical memory etc.) Formula.
The present invention be with reference to according to the method for the embodiment of the present invention, the flow of equipment (system) and computer program product Figure and/or block diagram describe.It should be understood that it can be realized by computer program instructions every first-class in flowchart and/or the block diagram The combination of flow and/or square frame in journey and/or square frame and flowchart and/or the block diagram.These computer programs can be provided The processors of all-purpose computer, special purpose computer, Embedded Processor or other programmable data processing devices is instructed to produce A raw machine so that the instruction performed by computer or the processor of other programmable data processing devices, which produces, to be used in fact The device for the function of being specified in present one flow of flow chart or one square frame of multiple flows and/or block diagram or multiple square frames.
These computer program instructions, which may also be stored in, can guide computer or other programmable data processing devices with spy Determine in the computer-readable memory that mode works so that the instruction being stored in the computer-readable memory, which produces, to be included referring to Make the manufacture of device, the command device realize in one flow of flow chart or multiple flows and/or one square frame of block diagram or The function of being specified in multiple square frames.
These computer program instructions can be also loaded into computer or other programmable data processing devices so that counted Series of operation steps is performed on calculation machine or other programmable devices to produce computer implemented processing, thus in computer or The instruction performed on other programmable devices is provided and is used for realization in one flow of flow chart or multiple flows and/or block diagram one The step of function of being specified in a square frame or multiple square frames.
Obviously, various changes and modifications can be made to the invention without departing from essence of the invention by those skilled in the art God and scope.In this way, if these modifications and changes of the present invention belongs to the scope of the claims in the present invention and its equivalent technologies Within, then the present invention is also intended to comprising including these modification and variations.

Claims (10)

  1. A kind of 1. fast and easily voice recognition result proofreading method, it is characterised in that including:
    Obtain the corresponding voice recognition result text of voice messaging;
    Current display interface is divided into the first display area and the second display area, in first display area and described Institute's speech recognition result text is shown in two display areas, wherein, the second speech recognition in second display area The first voice recognition result text in resulting text and first display area is corresponding, and in second display area The second voice recognition result text it is corresponding with the voice messaging;
    When receiving the selected operation to the first object text in the second voice recognition result text, play with it is described The corresponding target voice information of first object text, and the editable cursor of current display interface is positioned to first voice In recognition result text, and corresponding second target text of the first object text, so that user is according to the target language Message breath is corrected the second target text in the first voice recognition result text, the speech recognition after being corrected Resulting text.
  2. 2. according to the method described in claim 1, it is characterized in that, the method further includes:
    The corresponding broadcast button of voice messaging is shown in the second display area of the current display interface;
    When receiving the play operation to the voice messaging, the voice messaging is played, and is highlighted currently playing The 3rd target text in the corresponding second voice recognition result text of voice messaging, and the editable by current display interface Cursor is positioned into the first voice recognition result text, with corresponding 4th target text of the 3rd target text.
  3. 3. according to the method described in claim 1, it is characterized in that,
    The method further includes:
    Receive the modification instruction of the second target text input by user to editable cursor present position;Referred to according to the modification Order, changes second target text, and marks amended second target text, and marks amended second target text This;
    And/or
    The method further includes:
    Receive the lookup replacement instruction to specifying the text field in the first voice recognition result text;Replaced according to the lookup Instruction is changed, described specified the text field is searched in the first voice recognition result text, and replace using new the text field Change described specified the text field.
  4. 4. according to the method described in claim 1, it is characterized in that, the method further includes:
    Before display institute speech recognition result text, the voice messaging is obtained;
    The voice messaging is identified again, obtains secondary voice recognition result text;
    By the secondary voice recognition result text compared with institute speech recognition result text, determine both whether one Cause;
    When both are consistent, institute's speech recognition result is shown on first display area and second display area Text;
    When both are inconsistent, determine different from institute speech recognition result text in the secondary voice recognition result text Difference text;
    Institute's speech recognition result text is shown on first display area, on second display area described in display Secondary voice recognition result text, and highlight the difference text.
  5. 5. according to the method described in claim 1, it is characterized in that, the method further includes:
    When playing voice messaging, the pause play instruction of input is received;
    According to the pause play instruction, pause plays the voice messaging.
  6. A kind of 6. fast and easily voice recognition result verifying unit, it is characterised in that including:
    Processor;
    For storing the memory of processor-executable instruction;
    Wherein, the processor is configured as:
    Obtain the corresponding voice recognition result text of voice messaging;
    Current display interface is divided into the first display area and the second display area, in first display area and described Institute's speech recognition result text is shown in two display areas, wherein, the second speech recognition in second display area The first voice recognition result text in resulting text and first display area is corresponding, and in second display area The second voice recognition result text it is corresponding with the voice messaging;
    When receiving the selected operation to the first object text in the second voice recognition result text, play with it is described The corresponding target voice information of first object text, and the editable cursor of current display interface is positioned to first voice In recognition result text, and corresponding second target text of the first object text, so that user is according to the target language Message breath is corrected the second target text in the first voice recognition result text, the speech recognition after being corrected Resulting text.
  7. 7. device according to claim 6, it is characterised in that the processor is additionally configured to:
    The corresponding broadcast button of voice messaging is shown in the second display area of the current display interface;
    When receiving the play operation to the voice messaging, the voice messaging is played, and is highlighted currently playing The 3rd target text in the corresponding second voice recognition result text of voice messaging, and the editable by current display interface Cursor is positioned into the first voice recognition result text, with corresponding 4th target text of the 3rd target text.
  8. 8. device according to claim 6, it is characterised in that
    The processor is additionally configured to:
    Receive the modification instruction of the second target text input by user to editable cursor present position;Referred to according to the modification Order, changes second target text, and marks amended second target text;
    And/or
    The processor is additionally configured to:
    Receive the lookup replacement instruction to specifying the text field in the first voice recognition result text;Replaced according to the lookup Instruction is changed, described specified the text field is searched in the first voice recognition result text, and replace using new the text field Change described specified the text field.
  9. 9. device according to claim 6, it is characterised in that the processor is additionally configured to:
    Before display institute speech recognition result text, the voice messaging is obtained;
    The voice messaging is identified again, obtains secondary voice recognition result text;
    By the secondary voice recognition result text compared with institute speech recognition result text, determine both whether one Cause;
    When both are consistent, institute's speech recognition result is shown on first display area and second display area Text;
    When both are inconsistent, determine different from institute speech recognition result text in the secondary voice recognition result text Difference text;
    Institute's speech recognition result text is shown on first display area, on second display area described in display Secondary voice recognition result text, and highlight the difference text.
  10. 10. device according to claim 6, it is characterised in that the processor is additionally configured to:
    When playing voice messaging, the pause play instruction of input is received;
    According to the pause play instruction, pause plays the voice messaging.
CN201710995682.8A 2017-10-23 2017-10-23 Voice recognition result processing method and processing device Pending CN107945802A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710995682.8A CN107945802A (en) 2017-10-23 2017-10-23 Voice recognition result processing method and processing device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710995682.8A CN107945802A (en) 2017-10-23 2017-10-23 Voice recognition result processing method and processing device

Publications (1)

Publication Number Publication Date
CN107945802A true CN107945802A (en) 2018-04-20

Family

ID=61935600

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710995682.8A Pending CN107945802A (en) 2017-10-23 2017-10-23 Voice recognition result processing method and processing device

Country Status (1)

Country Link
CN (1) CN107945802A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109036423A (en) * 2018-08-15 2018-12-18 信利半导体有限公司 A kind of input system and method for the voice conversion text for computer
CN109326292A (en) * 2018-12-04 2019-02-12 北京九狐时代智能科技有限公司 A kind of generation method and device of audio recognition result
CN111953852A (en) * 2020-07-30 2020-11-17 北京声智科技有限公司 Call record generation method, device, terminal and storage medium
CN113722423A (en) * 2020-05-20 2021-11-30 夏普株式会社 Information processing system, information processing method, and information processing program

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1835077A (en) * 2005-03-14 2006-09-20 台达电子工业股份有限公司 Automatic speech recognizing input method and system for Chinese names
CN101567189A (en) * 2008-04-22 2009-10-28 株式会社Ntt都科摩 Device, method and system for correcting voice recognition result
CN101625619A (en) * 2008-07-11 2010-01-13 索尼株式会社 Information processing apparatus, information processing method, information processing system, and program
CN101807399A (en) * 2010-02-02 2010-08-18 华为终端有限公司 Voice recognition method and device
CN104679723A (en) * 2013-11-29 2015-06-03 北京壹人壹本信息科技有限公司 Text contrast display method, system and device
CN105786204A (en) * 2014-12-26 2016-07-20 联想(北京)有限公司 Information processing method and electronic equipment
CN106328145A (en) * 2016-08-19 2017-01-11 北京云知声信息技术有限公司 Voice correction method and voice correction device
CN106448675A (en) * 2016-10-21 2017-02-22 科大讯飞股份有限公司 Recognition text correction method and system
JP2017049969A (en) * 2015-09-03 2017-03-09 凸版印刷株式会社 Document calibration server, document calibration terminal and document calibration system
JP2017117125A (en) * 2015-12-22 2017-06-29 凸版印刷株式会社 Document correction server, document correction terminal and document correction system

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1835077A (en) * 2005-03-14 2006-09-20 台达电子工业股份有限公司 Automatic speech recognizing input method and system for Chinese names
CN101567189A (en) * 2008-04-22 2009-10-28 株式会社Ntt都科摩 Device, method and system for correcting voice recognition result
CN101625619A (en) * 2008-07-11 2010-01-13 索尼株式会社 Information processing apparatus, information processing method, information processing system, and program
CN101807399A (en) * 2010-02-02 2010-08-18 华为终端有限公司 Voice recognition method and device
CN104679723A (en) * 2013-11-29 2015-06-03 北京壹人壹本信息科技有限公司 Text contrast display method, system and device
CN105786204A (en) * 2014-12-26 2016-07-20 联想(北京)有限公司 Information processing method and electronic equipment
JP2017049969A (en) * 2015-09-03 2017-03-09 凸版印刷株式会社 Document calibration server, document calibration terminal and document calibration system
JP2017117125A (en) * 2015-12-22 2017-06-29 凸版印刷株式会社 Document correction server, document correction terminal and document correction system
CN106328145A (en) * 2016-08-19 2017-01-11 北京云知声信息技术有限公司 Voice correction method and voice correction device
CN106448675A (en) * 2016-10-21 2017-02-22 科大讯飞股份有限公司 Recognition text correction method and system

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109036423A (en) * 2018-08-15 2018-12-18 信利半导体有限公司 A kind of input system and method for the voice conversion text for computer
CN109326292A (en) * 2018-12-04 2019-02-12 北京九狐时代智能科技有限公司 A kind of generation method and device of audio recognition result
CN113722423A (en) * 2020-05-20 2021-11-30 夏普株式会社 Information processing system, information processing method, and information processing program
CN111953852A (en) * 2020-07-30 2020-11-17 北京声智科技有限公司 Call record generation method, device, terminal and storage medium

Similar Documents

Publication Publication Date Title
JP7069778B2 (en) Methods, systems and programs for content curation in video-based communications
US20140250355A1 (en) Time-synchronized, talking ebooks and readers
US8972265B1 (en) Multiple voices in audio content
CN107945802A (en) Voice recognition result processing method and processing device
Kafle et al. Evaluating the usability of automatically generated captions for people who are deaf or hard of hearing
CN108549662A (en) The supplement digestion procedure and device of semantic analysis result in more wheel sessions
CN103777774B (en) The word error correction method of terminal installation and input method
CN106328145B (en) Voice modification method and device
JP2015510602A (en) Management of auxiliary information playback
WO2014160316A2 (en) Device, method, and graphical user interface for a group reading environment
WO2014151884A2 (en) Device, method, and graphical user interface for a group reading environment
US10089898B2 (en) Information processing device, control method therefor, and computer program
CN109712446A (en) Interactive learning methods based on new word detection
JP6627217B2 (en) Text display device, learning method, and program
CN109741641A (en) Langue leaning system based on new word detection
Norledge Building The Ark: Text World Theory and the evolution of dystopian epistolary
CN114023301A (en) Audio editing method, electronic device and storage medium
CN113378583A (en) Dialogue reply method and device, dialogue model training method and device, and storage medium
Che et al. Automatic online lecture highlighting based on multimedia analysis
KR20130110965A (en) Sensibility evalution and contents recommendation method based on user feedback
CN110148413A (en) Speech evaluating method and relevant apparatus
KR20190090636A (en) Method for automatically editing pattern of document
CN111711865A (en) Method, apparatus and storage medium for outputting data
US10747794B2 (en) Smart search for annotations and inking
JP2018146961A (en) Voice reproduction device and voice reproduction program

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20180420

RJ01 Rejection of invention patent application after publication