CN110750229A - Voice quality inspection display method and device and electronic equipment - Google Patents
- Publication number
- CN110750229A (CN201910939378.0A)
- Authority
- CN
- China
- Prior art keywords
- text
- audio
- displaying
- display area
- display
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/165—Management of the audio stream, e.g. setting of volume, audio stream path
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0481—Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
- G06F3/04817—Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance using icons
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0484—Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0484—Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
- G06F3/0485—Scrolling or panning
- G06F3/04855—Interaction with scrollbars
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0484—Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
- G06F3/0486—Drag-and-drop
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Health & Medical Sciences (AREA)
- User Interface Of Digital Computer (AREA)
Abstract
The invention discloses a voice quality inspection display method, a voice quality inspection display device and electronic equipment. The method comprises the following steps: displaying an audio waveform in a first display area; displaying audio text in a second display area; determining a target audio waveform and a target audio text which hit a preset rule; and identifying and displaying the target audio waveform in the first display area and/or identifying and displaying the target audio text in the second display area. With this method, audio hitting a preset rule is identified and displayed simultaneously through the audio waveform in the first display area and the audio text in the second display area, so that voice quality inspection results are marked automatically. Workers only need to check the audio waveform or audio text that is identified and displayed, which reduces the voice quality inspection workload. In addition, the first display area and the second display area together provide a graphical voice quality inspection interface that is intuitive, concise and convenient to operate.
Description
Technical Field
The invention relates to the technical field of voice quality inspection, in particular to a voice quality inspection display method and device, electronic equipment and a computer readable medium.
Background
Voice quality inspection is used to promote enterprise marketing and market development, improve customer satisfaction and customer service, and evaluate the work of customer service staff. Every day, a call center generates a huge volume of telephone recordings, and quality inspection is carried out on these recordings to detect whether customer service personnel use standard phrases and whether service is in place, so that the requirements of customers are met.
At present, voice analysis work such as voice quality inspection and voice labeling is mainly performed manually by workers who listen to the recordings. In this way, for a passage that may hit a quality inspection rule or that cannot be heard clearly, the worker needs to listen to the recording again from the beginning. The worker therefore listens to the recording repeatedly to determine the voice hitting the quality inspection rule, which wastes a lot of time and causes low working efficiency. In addition, there is currently no graphical interface for voice quality inspection that is intuitive, concise and convenient to operate.
Disclosure of Invention
The invention aims to solve the technical problems that manual voice quality inspection in the prior art is time-consuming and labor-consuming, and the voice quality inspection cannot be graphically displayed.
In order to solve the above technical problem, a first aspect of the present invention provides a voice quality inspection display method, including:
displaying the audio waveform in the first display area;
displaying audio text in a second display area;
determining a target audio waveform and a target audio text which hit a preset rule;
and identifying and displaying the target audio waveform in the first display area and/or identifying and displaying the target audio text in the second display area.
According to a preferred embodiment of the present invention, the determining a target audio waveform and a target audio text which hit a preset rule comprises:
creating a rule set comprising a plurality of preset rules;
extracting a target audio text hitting the preset rule;
and associating the target audio text into a corresponding target audio waveform.
According to a preferred embodiment of the present invention, the identifying and displaying the target audio waveform in the first display area comprises:
identifying and displaying, with a first identifier in the first display area, the position of the audio waveform hitting the preset rule;
the identifying and displaying the target audio text in the second display area comprises:
identifying and displaying the keyword text hitting the preset rule in a prominent manner in the second display area.
According to a preferred embodiment of the invention, the method further comprises:
when receiving user operation on the first identifier, identifying and displaying detailed information of a hit rule by a second identifier;
and when user operation on the first identifier and the second identifier is received, playing audio of the hit rule.
According to a preferred embodiment of the present invention, the displaying the audio text in the second display area includes:
parsing the audio waveform into text;
splitting the text into text units according to the service data, and marking the call role;
analyzing the start time, the text unit call duration and the end time corresponding to the dialog text of each call role;
and displaying the text according to the call role, the start time corresponding to the conversation text, the call duration of the text unit and the end time.
According to a preferred embodiment of the invention, the method further comprises: and displaying an audio playing control unit in the first display area.
According to a preferred embodiment of the invention, the method further comprises:
displaying additional information within the second display area.
According to a preferred embodiment of the present invention, the additional information includes: user information, service information, and a hit rule list.
In order to solve the above technical problem, a second aspect of the present invention provides a voice quality inspection display device, including:
the first display module is used for displaying the audio waveform in the first display area;
the second display module is used for displaying the audio text in the second display area;
the determining module is used for determining a target audio waveform and a target audio text which hit a preset rule;
and the identification display module is used for identifying and displaying the target audio waveform in the first display area and/or identifying and displaying the target audio text in the second display area.
According to a preferred embodiment of the present invention, the determining module comprises:
the creating module is used for creating a rule set containing a plurality of preset rules;
the extraction module is used for extracting the target audio text hitting the preset rule;
and the sub-association module is used for associating the target audio text to the corresponding target audio waveform.
According to a preferred embodiment of the present invention, the identification display module includes:
the first identification display module is used for identifying and displaying, with a first identifier in the first display area, the position of the audio waveform hitting the preset rule;
and the second identification display module is used for identifying and displaying the keyword text hitting the preset rule in a prominent manner in the second display area.
According to a preferred embodiment of the invention, the device further comprises:
the interactive display module is used for identifying and displaying detailed information of the hit rule by a second identifier when receiving user operation on the first identifier; and when user operation on the first identifier and the second identifier is received, playing audio of the hit rule.
According to a preferred embodiment of the present invention, the second display module includes:
the first analysis module is used for analyzing the audio waveform into a text;
the splitting module is used for splitting the text into text units according to the service data and marking the call role;
the second analysis module is used for analyzing the start time, the text unit call duration and the end time corresponding to the dialog text of each call role;
and the sub-display module is used for displaying the text according to the call role, the start time corresponding to the conversation text, the call duration of the text unit and the end time.
According to a preferred embodiment of the present invention, the first display module is further configured to display the audio playing control unit in the first display area.
According to a preferred embodiment of the present invention, the second display module is further configured to: displaying additional information within the second display area.
According to a preferred embodiment of the present invention, the additional information includes: user information, service information, and a hit rule list.
To solve the above technical problem, a third aspect of the present invention provides an electronic device, comprising:
a processor; and
a memory storing computer executable instructions that, when executed, cause the processor to perform the method described above.
In order to solve the above technical problem, a fourth aspect of the present invention proposes a computer-readable storage medium, wherein the computer-readable storage medium stores one or more programs that, when executed by a processor, implement the above method.
The invention divides a display interface into a first display area and a second display area; displays an audio waveform and an audio playing control unit in the first display area; displays audio text in the second display area; associates the audio hitting the preset rule with a target audio waveform and a target audio text; and identifies and displays the target audio waveform in the first display area and/or the target audio text in the second display area. Audio hitting the preset rule is thus identified and displayed simultaneously through the audio waveform in the first display area and the audio text in the second display area, so that voice quality inspection results are marked automatically. Workers only need to check the audio waveform or audio text that is identified and displayed, which reduces the workload of voice quality inspection.
Drawings
In order to make the technical problems solved by the present invention, the technical means adopted and the technical effects obtained more clear, the following will describe in detail the embodiments of the present invention with reference to the accompanying drawings. It should be noted, however, that the drawings described below are only illustrations of exemplary embodiments of the invention, from which other embodiments can be derived by those skilled in the art without inventive step.
FIG. 1 is a flow chart of a voice quality inspection display method according to the present invention;
FIG. 2 is a schematic illustration of a first display interface and a second display interface of the present invention;
FIG. 3 is a diagram illustrating the location of an audio waveform hitting a predetermined rule according to the present invention;
FIG. 4 is a diagram illustrating details of a hit rule identified by a second identifier according to the present invention;
FIG. 5 is a diagram illustrating a switching operation of an audio playback control unit to a corresponding audio playback mode according to the present invention;
FIG. 6 is a schematic diagram of a structural framework of a voice quality inspection display device according to the present invention;
FIG. 7 is a block diagram of an exemplary embodiment of an electronic device in accordance with the present invention;
FIG. 8 is a diagrammatic representation of one embodiment of a computer-readable medium of the present invention.
Detailed Description
Exemplary embodiments of the present invention will now be described more fully with reference to the accompanying drawings. The exemplary embodiments may, however, be embodied in many specific forms and should not be construed as limited to the embodiments set forth herein. Rather, these exemplary embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the concept of the invention to those skilled in the art.
The structures, properties, effects or other characteristics described in a certain embodiment may be combined in any suitable manner in one or more other embodiments, while still complying with the technical idea of the invention.
In describing particular embodiments, specific details of structures, properties, effects, or other features are set forth in order to provide a thorough understanding of the embodiments by one skilled in the art. However, it is not excluded that a person skilled in the art may implement the invention in a specific case without the above-described structures, performances, effects or other features.
The flow chart in the drawings is only an exemplary flow demonstration, and does not represent that all the contents, operations and steps in the flow chart are necessarily included in the scheme of the invention, nor does it represent that the execution is necessarily performed in the order shown in the drawings. For example, some operations/steps in the flowcharts may be divided, some operations/steps may be combined or partially combined, and the like, and the execution order shown in the flowcharts may be changed according to actual situations without departing from the gist of the present invention.
The block diagrams in the figures generally represent functional entities and do not necessarily correspond to physically separate entities. That is, these functional entities may be implemented in the form of software, or in one or more hardware modules or integrated circuits, or in different networks and/or processor means and/or microcontroller means.
The same reference numerals denote the same or similar elements, components, or parts throughout the drawings, and thus a repetitive description thereof may be omitted hereinafter. It will be further understood that, although the terms first, second, third, etc. may be used herein to describe various elements, components, or sections, these elements, components, or sections should not be limited by these terms. That is, these terms are used only to distinguish one from another. For example, a first device may also be referred to as a second device without departing from the spirit of the present invention. Furthermore, the term "and/or" is intended to include all combinations of any one or more of the listed items.
Fig. 1 is a flowchart of a voice quality inspection display method provided by the present invention, and as shown in fig. 1, the method includes:
S1, displaying the audio waveform in the first display area;
As shown in fig. 2, the display interface may be divided into a first display area 10 and a second display area 11, wherein the first display area 10 may be located above the second display area 11. Further, in order to control audio playback, an audio playing control unit may also be displayed in the first display area 10.
The audio waveform and the audio playing control unit are displayed in the first display area 10. The left and right audio channels are split and displayed as separate audio waveform diagrams. For example, in fig. 2, a left channel audio waveform diagram 21, a right channel audio waveform diagram 22 and an audio playing progress bar 23 are arranged in the upper portion of the first display area 10, so that a user can intuitively see the audio length, the audio playing progress, the volume and the like. In this embodiment, playback can be positioned directly at a designated position of the audio by dragging or clicking the audio playing progress bar 23 on the audio waveform, so that the user can conveniently and quickly locate and focus on the content of interest. An audio length 31, a current playing progress 32 and audio playing control units 33 are displayed below the audio waveform diagrams, and each audio playing control unit 33 includes a plurality of playing modes 34. The audio playing control unit 33 includes a basic audio playing control unit, an associated text control unit, a variable speed playing control unit and a volume control unit. The playing modes corresponding to the basic playing control unit include: playing, pausing and stopping. The playing modes corresponding to the associated text control unit include: playing the previous audio of the current audio and playing the next audio of the current audio. The playing modes corresponding to the variable speed playing control unit include: accelerated playback, decelerated playback and playback at standard speed.
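As a minimal sketch of the first display area described above, the following Python fragment splits interleaved stereo samples into left and right channels, reduces each channel to a small number of peak values suitable for drawing a waveform diagram, and maps a click or drag on the progress bar to a playback time. The function names, the interleaved sample layout and the pixel-based progress bar are assumptions made for illustration; the patent does not specify them.

```python
from typing import List, Tuple


def split_stereo(interleaved: List[int]) -> Tuple[List[int], List[int]]:
    """Split interleaved stereo samples [L0, R0, L1, R1, ...] into two channels."""
    return interleaved[0::2], interleaved[1::2]


def peak_envelope(samples: List[int], buckets: int) -> List[int]:
    """Reduce a channel to at most `buckets` peak values, one per waveform column."""
    if buckets <= 0 or not samples:
        return []
    size = -(-len(samples) // buckets)  # ceiling division
    return [
        max(abs(s) for s in samples[i:i + size])
        for i in range(0, len(samples), size)
    ]


def seek_time(click_x: float, width: float, audio_length_s: float) -> float:
    """Map a click or drag position on the progress bar to a time in seconds."""
    ratio = min(max(click_x / width, 0.0), 1.0)  # clamp to the bar
    return ratio * audio_length_s


left, right = split_stereo([1, -2, 3, -4, 5, -6, 7, -8])
print(peak_envelope(left, 2), peak_envelope(right, 2))  # [3, 7] [4, 8]
print(seek_time(150, 600, 120.0))  # 30.0
```

Clamping the ratio keeps a drag that leaves the bar from seeking outside the recording.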
S2, displaying the audio text in the second display area;
In the invention, the audio playing in the first display area is associated with the audio text display in the second display area; that is, the audio playing in the first display area and the audio text display in the second display area can be synchronized.
Therefore, in this step, the audio waveform is first parsed into text through Automatic Speech Recognition (ASR). The text is then split into text units according to the service data, and the call role is marked. The service data may be the service content corresponding to the quality inspection dialog, the learning content in speech learning (such as a certain article), the lyric content of a song, and the like. Words or sentences may be used as text units. This step also analyzes the start time, the text unit call duration and the end time corresponding to the dialog text of each call role, and displays the text according to the call role, the start time corresponding to the dialog text, the call duration of the text unit and the end time. As shown in fig. 2, the second display area 11 displays the call role, the call duration and the target audio text. The position of the audio text within the audio waveform can be located through the call role, the start time corresponding to the dialog text, the call duration of the text unit and the end time, so that the audio waveform can be played and the audio text displayed synchronously.
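The synchronization between playback and text display can be sketched as follows, assuming the ASR result has already been split into text units. The `TextUnit` class, the role labels and the `unit_at` helper are hypothetical names introduced for illustration; only the fields (call role, start time, call duration, end time, text) come from the description.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class TextUnit:
    role: str        # call role; the labels "agent"/"customer" are assumptions
    start: float     # start time of the text unit, in seconds
    duration: float  # call duration of the text unit, in seconds
    text: str        # recognized text of the unit

    @property
    def end(self) -> float:
        """End time derived from start time and duration."""
        return self.start + self.duration


def unit_at(units: List[TextUnit], t: float) -> Optional[TextUnit]:
    """Return the text unit spoken at playback time t, or None during silence."""
    for unit in units:
        if unit.start <= t < unit.end:
            return unit
    return None


units = [
    TextUnit("agent", 0.0, 2.5, "Hello, how can I help you?"),
    TextUnit("customer", 3.0, 4.0, "I want to check my bill."),
]
current = unit_at(units, 3.2)
print(current.role, current.text)  # customer I want to check my bill.
```

A player could call `unit_at` on each progress update to decide which line of audio text to scroll to in the second display area.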
S3, determining a target audio waveform and a target audio text which hit a preset rule;
Illustratively, this step includes:
S31, creating a rule set containing a plurality of preset rules;
The preset rules may be set by a rule engine, and the specific preset rules may be set according to the content that the user wants to detect. For example, a preset rule may be a preset word, a preset keyword, or a preset type of statement.
S32, extracting a target audio text hitting the preset rule;
Specifically, each preset rule may be compared with the entire audio text to find the target audio text that hits the preset rule.
And S33, associating the target audio text with the corresponding target audio waveform.
Specifically, in step S2, the audio waveform and the audio text are synchronized by the call role, the start time corresponding to the dialog text, the call duration of the text unit, and the end time, so that the target audio text can be directly associated with the corresponding target audio waveform, that is, the target audio waveform corresponding to the target audio text can be directly located.
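Steps S31 to S33 can be sketched as below under the simplest assumption that a preset rule is a list of keywords. The `Hit` tuple, the `find_hits` function, the rule numbers and the sample keywords are hypothetical; a real rule engine may support richer rule types than plain keyword matching.

```python
import re
from typing import Dict, List, NamedTuple, Tuple


class Hit(NamedTuple):
    rule_id: str     # number of the preset rule that was hit
    unit_index: int  # index of the text unit containing the hit
    keyword: str     # keyword that matched
    start: float     # start time of the matching text unit, in seconds
    end: float       # end time of the matching text unit, in seconds


# A text unit as (call role, start time, end time, text).
Unit = Tuple[str, float, float, str]


def find_hits(units: List[Unit], rules: Dict[str, List[str]]) -> List[Hit]:
    """Compare every rule with every text unit (S32) and attach the
    unit's time span so the hit can be located in the waveform (S33)."""
    hits = []
    for i, (_role, start, end, text) in enumerate(units):
        for rule_id, keywords in rules.items():
            for kw in keywords:
                if re.search(re.escape(kw), text, flags=re.IGNORECASE):
                    hits.append(Hit(rule_id, i, kw, start, end))
    return hits


# S31: a rule set containing several preset rules (illustrative content).
rules = {"05": ["guarantee"], "07": ["no refund"]}
units = [
    ("agent", 0.0, 2.5, "Hello, how can I help you?"),
    ("agent", 3.0, 6.0, "We guarantee a full refund."),
]
for hit in find_hits(units, rules):
    print(hit.rule_id, hit.keyword, hit.start, hit.end)  # 05 guarantee 3.0 6.0
```

Because each text unit already carries its start and end time, associating a hit with the waveform needs no further search.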
S4, identifying and displaying the target audio waveform in the first display area and/or identifying and displaying the target audio text in the second display area.
For example, in this step, the target audio waveform may be displayed in only the first display area, the target audio text may be displayed in only the second display area, or the target audio text may be displayed in the second display area while the target audio waveform is displayed in the first display area.
In one example, the identifying and displaying the target audio waveform in the first display area comprises: identifying and displaying, with a first identifier in the first display area, the position of the audio waveform hitting the preset rule;
The first identifier may be a geometric icon such as an anchor icon, a circular icon or a triangular icon. It marks the positions that hit the preset rule so that hits in the left channel audio waveform and the right channel audio waveform can be distinguished from each other, and the overall summary of the audio quality inspection result can be displayed visually. As shown in fig. 3, in the first display area 10, the positions of audio waveforms hitting a preset rule are displayed as circular icons 50 on the left channel audio waveform diagram 21 and the right channel audio waveform diagram 22.
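One way to place the first identifiers is to convert each hit's start time into a horizontal pixel offset on the waveform diagram of the corresponding channel. The sketch below assumes that the agent is recorded on the left channel and the customer on the right, and that the waveform is drawn at a fixed pixel width; both are assumptions for illustration and are not stated in the description.

```python
from typing import Dict, List, Tuple

# Assumed mapping from call role to audio channel (not specified in the source).
CHANNEL_OF_ROLE = {"agent": "left", "customer": "right"}


def marker_positions(
    hits: List[Tuple[str, float]],
    audio_length_s: float,
    width_px: int,
) -> Dict[str, List[int]]:
    """Return, per channel, the x offsets in pixels where markers are drawn.

    Each hit is (call role, start time in seconds).
    """
    markers: Dict[str, List[int]] = {"left": [], "right": []}
    for role, start_s in hits:
        x = round(start_s / audio_length_s * width_px)
        markers[CHANNEL_OF_ROLE[role]].append(min(max(x, 0), width_px))
    return markers


print(marker_positions([("agent", 30.0), ("customer", 90.0)], 120.0, 600))
# {'left': [150], 'right': [450]}
```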
The identifying and displaying the target audio text in the second display area comprises: identifying and displaying the keyword text hitting the preset rule in a prominent manner in the second display area. The prominent display manner may be any of font highlighting, font bolding, a font color different from that of the other audio text, a special mark and the like, as long as it distinguishes the keyword text hitting the preset rule from the other audio text.
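The prominent display of keyword text can be sketched as wrapping each matched keyword in a pair of markers that the rendering layer later turns into highlighting, bold type or a different color. The `highlight` function and the `[[`/`]]` markers are hypothetical and stand in for whatever styling mechanism the interface uses.

```python
import re
from typing import List


def highlight(
    text: str,
    keywords: List[str],
    open_mark: str = "[[",
    close_mark: str = "]]",
) -> str:
    """Wrap every occurrence of a keyword in markers, ignoring case."""
    keywords = [k for k in keywords if k]  # ignore empty keywords
    if not keywords:
        return text
    # Try longer keywords first so overlapping keywords match greedily.
    ordered = sorted(keywords, key=len, reverse=True)
    pattern = re.compile("|".join(re.escape(k) for k in ordered), re.IGNORECASE)
    return pattern.sub(lambda m: f"{open_mark}{m.group(0)}{close_mark}", text)


print(highlight("We guarantee a full refund.", ["guarantee", "refund"]))
# We [[guarantee]] a full [[refund]].
```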
Through the above steps, the position of the target audio waveform and the position of the audio text hitting the preset rule can be displayed automatically in the first display area and the second display area. To further improve the accuracy of quality inspection, the user can operate on the audio waveform and the audio keywords hitting the preset rule to review the hit in more detail. Therefore, the voice quality inspection display method of the invention further comprises:
S51, when receiving a user operation on the first identifier, identifying and displaying the detailed information of the hit rule with a second identifier;
The user operation on the first identifier may be a click on the first identifier, and the second identifier may be a bubble, a checkerboard, or the like. As shown in fig. 4, after the user clicks the first identifier 50 in the left channel audio waveform diagram 21, the detailed information of the rule hit by the audio corresponding to the first identifier 50 is displayed in a bubble. The detailed information may be the number of the hit preset rule; for example, the bubble in fig. 4 displays preset rule number 05.
S52, when user operation on the first identifier and the second identifier is received, playing audio of the hit rule.
The user operation on the first identifier and the second identifier may be a click on the first identifier and the second identifier sequentially, or a selection of the first identifier and the second identifier, which is not limited in the present invention.
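The two-stage interaction of S51 and S52 can be sketched as a small state holder: operating the first identifier opens the detail bubble, and operating the second identifier afterwards plays the audio of the hit rule. The class names and the `log` list, which stands in for real display and playback calls, are assumptions for illustration.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class HitMarker:
    rule_id: str  # number of the hit preset rule
    start: float  # start of the hit audio, in seconds
    end: float    # end of the hit audio, in seconds


@dataclass
class Inspector:
    open_marker: Optional[HitMarker] = None       # marker whose bubble is shown
    log: List[str] = field(default_factory=list)  # stand-in for UI and player calls

    def click_first_identifier(self, marker: HitMarker) -> None:
        """S51: show the details of the hit rule in a second identifier."""
        self.open_marker = marker
        self.log.append(f"show rule {marker.rule_id}")

    def click_second_identifier(self) -> None:
        """S52: play the hit audio, but only after the first identifier was operated."""
        if self.open_marker is None:
            return  # no bubble is open, so there is nothing to play
        m = self.open_marker
        self.log.append(f"play {m.start:.1f}-{m.end:.1f}")


ui = Inspector()
ui.click_first_identifier(HitMarker("05", 3.0, 6.0))
ui.click_second_identifier()
print(ui.log)  # ['show rule 05', 'play 3.0-6.0']
```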
Further, for some specific services, additional information of these services may be displayed in the second display area.
Specifically, the second display area may include an audio text display area and an additional information display area, and the additional information is displayed in the additional information display area, where the additional information includes: user information, service information, and a hit rule list. The user information may be a user name added with additional information, the service information may be a name of a specific service, and the hit rule list includes a preset rule name hit by the corresponding audio text.
In addition, in the invention, because the audio playing in the first display area and the text display in the second display area are synchronized, the audio playing in the first display area and/or the audio text display in the second display area can be controlled through the first operation in the first display area, and obviously, the audio playing in the first display area and/or the audio text display in the second display area can also be controlled through the second operation in the second display area. And the playing of any specified audio waveform and/or the display of any specified audio text in the quality inspection process are facilitated.
It should be noted that, as shown in fig. 2, in the present invention playing control units may further be displayed in the first display area, and, as shown in fig. 5, each audio playing control unit 33 includes a plurality of playing modes 34. Specifically, the audio playing control unit 33 includes a basic audio playing control unit 331, an associated text control unit 332, a variable speed playing control unit 333, and a volume control unit (not shown in fig. 3). The playing modes corresponding to the basic playing control unit 331 include: playing 341, pausing 342 and stopping 343. The playing modes corresponding to the associated text control unit 332 include: playing the previous audio 344 of the current audio and playing the next audio 345 of the current audio. The playing modes corresponding to the variable speed playing control unit 333 include: accelerated playback 346, decelerated playback 347, and playback 348 at standard speed. Therefore, when a playing mode of an audio playing control unit is selected, playback is switched to the corresponding audio playing mode, the corresponding audio is played according to that mode, and the target audio text is displayed.
For example, in fig. 5, if the user selects the previous-sentence button 344 of the associated text control unit 332, playback switches to the mode for playing the previous sentence of the current audio. In that mode, the previous sentence of the current audio is played and the audio text of that sentence is displayed.
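The playing modes above can be sketched as a small state holder that tracks the current sentence and the playback rate. This is a hedged illustration that performs no audio output. The class name `PlaybackState`, the 0.25 speed step, the 0.5 to 2.0 rate limits, and the choice to reset to the first sentence on stop are all assumptions; the source gives no such values.

```python
class PlaybackState:
    """Tracks sentence index and playback rate; performs no real audio I/O."""

    SPEED_STEP = 0.25  # assumed increment, not taken from the source
    MIN_RATE = 0.5     # assumed lower bound
    MAX_RATE = 2.0     # assumed upper bound

    def __init__(self, sentence_count):
        self.sentence_count = sentence_count
        self.index = 0        # index of the current sentence
        self.rate = 1.0       # 1.0 means standard speed
        self.playing = False

    # Basic playing control: play, pause, stop.
    def play(self):
        self.playing = True

    def pause(self):
        self.playing = False

    def stop(self):
        self.playing = False
        self.index = 0  # assumption: stop returns to the first sentence

    # Associated text control: previous and next sentence, clamped to range.
    def previous_sentence(self):
        self.index = max(0, self.index - 1)
        return self.index

    def next_sentence(self):
        self.index = min(self.sentence_count - 1, self.index + 1)
        return self.index

    # Variable-speed control: faster, slower, standard speed.
    def faster(self):
        self.rate = min(self.MAX_RATE, self.rate + self.SPEED_STEP)

    def slower(self):
        self.rate = max(self.MIN_RATE, self.rate - self.SPEED_STEP)

    def standard_speed(self):
        self.rate = 1.0
```

The index returned by `previous_sentence` or `next_sentence` is what a display layer would use to pick both the audio segment to play and the audio text to show.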
Fig. 6 is a schematic diagram of a voice quality inspection display device according to the present invention, and as shown in fig. 6, the device includes:
a first display module 61 for displaying the audio waveform in the first display area;
a second display module 62 for displaying audio text in a second display area;
a determining module 63, configured to determine a target audio waveform and a target audio text that hit a preset rule;
an identifier display module 64, configured to identify and display the target audio waveform in the first display area and/or identify and display the target audio text in the second display area;
and an interactive display module 65, configured to: upon receiving a user operation on the first identifier, display detailed information of the hit rule with a second identifier; and upon receiving a user operation on the first identifier and the second identifier, play the audio of the hit rule.
Wherein the first display module 61 includes:
a first sub-display module 611, configured to split the audio into left and right channels and display them separately as audio waveform diagrams;
and a second sub-display module 612, configured to display the audio length, the current playing progress, the playing control key, the associated text control key, the variable-speed playing control key, and the volume control key below the audio waveform diagram.
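As a rough sketch of what the first sub-display module does, the following splits interleaved stereo samples into left and right channels and reduces each channel to a small number of peak values suitable for drawing a waveform. The interleaved sample layout and the names `split_channels` and `peaks` are assumptions for illustration; the source does not specify an audio format or a drawing method.

```python
def split_channels(interleaved):
    """Split interleaved stereo samples [L0, R0, L1, R1, ...] into (left, right)."""
    if len(interleaved) % 2 != 0:
        raise ValueError("interleaved stereo data must have an even sample count")
    return interleaved[0::2], interleaved[1::2]


def peaks(samples, buckets):
    """Reduce samples to at most `buckets` peak magnitudes for drawing a waveform."""
    if buckets <= 0:
        raise ValueError("buckets must be positive")
    # Ceiling division so that every sample falls into some bucket.
    size = max(1, -(-len(samples) // buckets))
    return [
        max(abs(s) for s in samples[i:i + size])
        for i in range(0, len(samples), size)
    ]


left, right = split_channels([1, -1, 2, -2, 3, -3, 4, -4])
left_peaks = peaks(left, 2)
right_peaks = peaks(right, 2)
```

Each list of peaks could then be drawn as one row of vertical bars, giving two waveform diagrams, one per channel.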
The second display module 62 includes:
a first parsing module 621, configured to parse the audio waveform into a text;
a splitting module 622, configured to split the text into text units according to the service data, and mark a call role;
a second parsing module 623, configured to determine the start time, the text unit call duration, and the end time corresponding to the dialog text of each call role;
and a sub-display module 624, configured to display the text according to the call role and the start time, text unit call duration, and end time corresponding to the dialog text.
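The text units handled by these sub-modules can be pictured as records carrying a call role, a start time, an end time, and the dialog text, with the call duration derived from the two times. The sketch below is an assumption about one possible representation; the names `TextUnit`, `fmt_time`, and `render` and the display layout are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class TextUnit:
    role: str     # call role, e.g. "agent" or "customer" (labels are assumed)
    start: float  # start time in seconds
    end: float    # end time in seconds
    text: str     # dialog text recognized for this unit

    @property
    def duration(self):
        """Call duration of this text unit in seconds."""
        return self.end - self.start


def fmt_time(seconds):
    """Format a time offset as MM:SS, dropping fractional seconds."""
    minutes, secs = divmod(int(seconds), 60)
    return f"{minutes:02d}:{secs:02d}"


def render(unit):
    """Build one display line from role, start time, duration, and end time."""
    return (
        f"[{fmt_time(unit.start)}-{fmt_time(unit.end)}] "
        f"({unit.duration:.1f}s) {unit.role}: {unit.text}"
    )


line = render(TextUnit("agent", 65.0, 68.5, "Hello"))
```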
The determining module 63 includes:
a creating module 631 for creating a rule set including a plurality of preset rules;
an extracting module 632, configured to extract a target audio text hitting the preset rule;
a sub-association module 633 for associating the target audio text to a corresponding target audio waveform.
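The three steps of the determining module (create a rule set, extract text that hits a rule, associate that text with the matching part of the waveform) can be sketched as follows. Treating a preset rule as a regular expression over the text is an assumption made for illustration; the source does not say how rules are expressed. The names `Rule`, `Hit`, and `find_hits` are likewise hypothetical.

```python
import re
from dataclasses import dataclass


@dataclass
class Rule:
    name: str
    pattern: str  # assumed form of a preset rule: a regular expression


@dataclass
class Hit:
    rule_name: str
    unit_index: int  # which text unit was hit
    keyword: str     # the matched text
    start: float     # time range of the unit, used to locate the waveform segment
    end: float


def find_hits(rules, units):
    """Match every rule against every (start, end, text) unit and record hits."""
    hits = []
    for index, (start, end, text) in enumerate(units):
        for rule in rules:
            for match in re.finditer(rule.pattern, text, flags=re.IGNORECASE):
                # The unit's time range ties the hit text to its waveform segment.
                hits.append(Hit(rule.name, index, match.group(0), start, end))
    return hits


rules = [
    Rule("forbidden_words", r"guarantee|promise"),
    Rule("greeting", r"\bhello\b"),
]
units = [
    (0.0, 2.0, "Hello, thanks for calling."),
    (2.5, 5.0, "I guarantee a refund."),
]
hits = find_hits(rules, units)
```

Because every text unit already carries its start and end time, associating a hit text with the waveform reduces to copying that time range onto the hit.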
The identifier display module 64 includes:
a first identifier display module 641 for identifying and displaying a position of the audio waveform hitting the preset rule with a first identifier in the first display area;
and a second identifier display module 642, configured to identify and display the keyword text hitting the preset rule in a conspicuous manner in the second display area.
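One way to picture the two identifier sub-modules is a pair of helpers: one maps the time of a hit to a horizontal position on the waveform, where a first identifier would be drawn, and one wraps the hit keyword in markers so the text view can render it conspicuously. This is a sketch under assumptions; the linear time-to-pixel mapping, the `[[`/`]]` markers, and the names `marker_x` and `highlight` are not from the source.

```python
import re


def marker_x(hit_start, audio_length, width_px):
    """Map a hit's start time to a pixel column on a waveform `width_px` wide."""
    if audio_length <= 0 or width_px <= 0:
        raise ValueError("audio_length and width_px must be positive")
    # Clamp so that out-of-range times still land inside the waveform.
    ratio = min(max(hit_start / audio_length, 0.0), 1.0)
    return round(ratio * (width_px - 1))


def highlight(text, keyword, open_mark="[[", close_mark="]]"):
    """Wrap each case-insensitive occurrence of `keyword` in marker strings."""
    if not keyword:
        return text
    return re.sub(
        re.escape(keyword),
        lambda m: f"{open_mark}{m.group(0)}{close_mark}",
        text,
        flags=re.IGNORECASE,
    )


x = marker_x(30.0, 120.0, 801)
marked = highlight("I guarantee a refund.", "guarantee")
```

A real display layer would replace the markers with whatever styling it uses for emphasis, such as a color or background.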
The second display module 62 is further configured to display additional information within the second display area, where the additional information includes user information, service information, and a hit rule list.
Those skilled in the art will appreciate that the modules in the above apparatus embodiments may be distributed in the apparatus as described, or may be modified accordingly and distributed in one or more apparatuses different from those of the above embodiments. The modules of the above embodiments may be combined into one module, or further split into multiple sub-modules.
In the following, embodiments of the electronic device of the present invention are described, which may be regarded as an implementation in physical form for the above-described embodiments of the method and apparatus of the present invention. Details described in the embodiments of the electronic device of the invention should be considered supplementary to the embodiments of the method or apparatus described above; for details which are not disclosed in embodiments of the electronic device of the invention, reference may be made to the above-described embodiments of the method or the apparatus.
Fig. 7 is a block diagram of an exemplary embodiment of an electronic device according to the present invention. The electronic device shown in fig. 7 is only an example, and should not impose any limitation on the functions and scope of use of the embodiments of the present invention.
As shown in fig. 7, the electronic device 700 of the exemplary embodiment is represented in the form of a general-purpose data processing device. The components of the electronic device 700 may include, but are not limited to: at least one processing unit 710, at least one memory unit 720, a bus 730 connecting different electronic device components (including the memory unit 720 and the processing unit 710), a display unit 740, and the like.
The memory unit 720 stores a computer readable program, which may be source program code or read-only program code. The program may be executed by the processing unit 710 such that the processing unit 710 performs the steps of various embodiments of the present invention. For example, the processing unit 710 may perform the steps shown in fig. 1.
The memory unit 720 may include readable media in the form of volatile memory units, such as a random access memory (RAM) unit 7201 and/or a cache memory unit 7202, and may further include a read-only memory (ROM) unit 7203. The memory unit 720 may also include a program/utility 7204 having a set (at least one) of program modules 7205. Such program modules 7205 include, but are not limited to, an operating system, one or more application programs, other program modules, and program data, each of which, or some combination of which, may include an implementation of a network environment.
The electronic device 700 may also communicate with one or more external devices 300 (e.g., keyboard, display, network device, Bluetooth device, etc.), enable a user to interact with the electronic device 700 via the external devices 300, and/or enable the electronic device 700 to communicate with one or more other data processing devices (e.g., router, modem, etc.). Such communication may occur via input/output (I/O) interfaces 750, and may also occur via a network adapter 760 with one or more networks, such as a Local Area Network (LAN), a Wide Area Network (WAN), and/or a public network such as the Internet. The network adapter 760 may communicate with other modules of the electronic device 700 via the bus 730. It should be appreciated that, although not shown in fig. 7, other hardware and/or software modules may be used in the electronic device 700, including but not limited to: microcode, device drivers, redundant processing units, external disk drive arrays, RAID systems, tape drives, and data backup storage systems.
Fig. 8 is a schematic diagram of one computer-readable medium embodiment of the present invention. As shown in fig. 8, the computer program may be stored on one or more computer readable media. The computer readable medium may be a readable signal medium or a readable storage medium. The readable storage medium may be, for example, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or a combination of any of the foregoing. More specific examples (a non-exhaustive list) of the readable storage medium include: an electrical connection having one or more wires, a portable disk, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. The computer program, when executed by one or more data processing devices, causes the above-described method of the invention to be carried out, namely: displaying the audio waveform in the first display area; displaying audio text in a second display area; associating the audio hitting the preset rule to a target audio waveform and an audio text; and identifying and displaying the target audio waveform in the first display area and/or identifying and displaying the target audio text in the second display area.
From the above description of the embodiments, those skilled in the art will readily understand that the exemplary embodiments of the present invention described herein may be implemented by software, or by software in combination with necessary hardware. Therefore, the technical solution according to the embodiments of the present invention can be embodied in the form of a software product, which can be stored in a computer-readable storage medium (which can be a CD-ROM, a USB flash drive, a removable hard disk, etc.) or on a network, and which includes several instructions that cause a data processing device (which can be a personal computer, a server, a network device, etc.) to execute the above-mentioned method according to the present invention.
A computer readable signal medium may include a propagated data signal with readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated data signal may take many forms, including, but not limited to, electromagnetic, optical, or any suitable combination thereof. A readable signal medium may also be any readable medium that is not a readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. Program code embodied on a readable medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing.
Program code for carrying out operations for aspects of the present invention may be written in any combination of one or more programming languages, including an object-oriented programming language such as Java, C++ or the like and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computing device, partly on the user's device, as a stand-alone software package, partly on the user's computing device and partly on a remote computing device, or entirely on the remote computing device or server. In the case of a remote computing device, the remote computing device may be connected to the user computing device through any kind of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or may be connected to an external computing device (e.g., through the internet using an internet service provider).
In summary, the present invention can be implemented as a method, an apparatus, an electronic device, or a computer-readable medium executing a computer program. Some or all of the functions of the present invention may be implemented in practice using a general purpose data processing device such as a microprocessor or a Digital Signal Processor (DSP).
While the foregoing embodiments have described the objects, aspects and advantages of the present invention in further detail, it should be understood that the present invention is not inherently tied to any particular computer, virtual machine or electronic device, and various general-purpose machines may be used to implement it. The invention is not limited to the specific embodiments described, but is to be understood as covering all modifications, changes and equivalents that come within its spirit and scope.
Claims (10)
1. A voice quality inspection display method is characterized by comprising the following steps:
displaying the audio waveform in the first display area;
displaying audio text in a second display area;
determining a target audio waveform and a target audio text which hit a preset rule;
and identifying and displaying the target audio waveform in the first display area and/or identifying and displaying the target audio text in the second display area.
2. The method of claim 1, wherein associating audio that hits a preset rule into a target audio waveform and audio text comprises:
creating a rule set comprising a plurality of preset rules;
extracting a target audio text hitting the preset rule;
and associating the target audio text into a corresponding target audio waveform.
3. The method of any of claims 1-2, wherein the identifying the target audio waveform for display in the first display region comprises:
identifying a position where an audio waveform hitting a preset rule is displayed with a first identifier in a first display area;
the identifying and displaying the target audio text within the second display region comprises:
and identifying and displaying the keyword text hitting the preset rule in a conspicuous manner in the second display area.
4. The method according to any one of claims 1-3, further comprising:
when receiving user operation on the first identifier, identifying and displaying detailed information of a hit rule by a second identifier;
and when user operation on the first identifier and the second identifier is received, playing audio of the hit rule.
5. The method of any of claims 1-4, wherein displaying audio text in the second display region comprises:
parsing the audio waveform into text;
splitting the text into text units according to the service data, and marking the call role;
analyzing the starting time, the text unit call duration and the ending time corresponding to the call role conversation text;
and displaying the text according to the call role, the start time corresponding to the conversation text, the call duration of the text unit and the end time.
6. The method according to any one of claims 1-5, further comprising: and displaying an audio playing control unit in the first display area.
7. The method according to any one of claims 1-6, further comprising:
displaying additional information within the second display area.
8. A voice quality inspection display device, characterized in that the device comprises:
the first display module is used for displaying the audio waveform in the first display area;
the second display module is used for displaying the audio text in the second display area;
the determining module is used for determining a target audio waveform and a target audio text which hit a preset rule;
and the identification display module is used for identifying and displaying the target audio waveform in the first display area and/or identifying and displaying the target audio text in the second display area.
9. An electronic device, comprising:
a processor; and
a memory storing computer-executable instructions that, when executed, cause the processor to perform the method of any of claims 1-7.
10. A computer readable storage medium, wherein the computer readable storage medium stores one or more programs which, when executed by a processor, implement the method of any of claims 1-7.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910939378.0A CN110750229A (en) | 2019-09-30 | 2019-09-30 | Voice quality inspection display method and device and electronic equipment |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110750229A true CN110750229A (en) | 2020-02-04 |
Family
ID=69277491
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910939378.0A Pending CN110750229A (en) | 2019-09-30 | 2019-09-30 | Voice quality inspection display method and device and electronic equipment |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110750229A (en) |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||