CN110750229A - Voice quality inspection display method and device and electronic equipment - Google Patents

Voice quality inspection display method and device and electronic equipment Download PDF

Info

Publication number
CN110750229A
CN110750229A CN201910939378.0A CN201910939378A CN110750229A CN 110750229 A CN110750229 A CN 110750229A CN 201910939378 A CN201910939378 A CN 201910939378A CN 110750229 A CN110750229 A CN 110750229A
Authority
CN
China
Prior art keywords
text
audio
displaying
display area
display
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910939378.0A
Other languages
Chinese (zh)
Inventor
石文超
戴会杰
常富洋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Qiyu Information Technology Co Ltd
Original Assignee
Beijing Qiyu Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Qiyu Information Technology Co Ltd filed Critical Beijing Qiyu Information Technology Co Ltd
Priority to CN201910939378.0A priority Critical patent/CN110750229A/en
Publication of CN110750229A publication Critical patent/CN110750229A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/165Management of the audio stream, e.g. setting of volume, audio stream path
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0481Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
    • G06F3/04817Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance using icons
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0484Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0484Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
    • G06F3/0485Scrolling or panning
    • G06F3/04855Interaction with scrollbars
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0484Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
    • G06F3/0486Drag-and-drop

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The invention discloses a voice quality inspection display method, a voice quality inspection display device and electronic equipment, wherein the method comprises the following steps: displaying the audio waveform in the first display area; displaying audio text in a second display area; determining a target audio waveform and a target audio text which hit a preset rule; and identifying and displaying the target audio waveform in the first display area and/or identifying and displaying the target audio text in the second display area. According to the voice quality inspection display method, the audio frequency hitting the preset rule is simultaneously identified and displayed through the audio frequency waveform of the first display interface and the audio frequency text of the second display interface, automatic identification display of voice quality inspection is achieved, workers only need to detect the audio frequency waveform or the audio frequency text displayed through identification, the voice quality inspection workload is reduced, and in addition, the graphical voice quality inspection interface which is visual, concise and convenient to operate is provided through the first display area and the second display area.

Description

Voice quality inspection display method and device and electronic equipment
Technical Field
The invention relates to the technical field of voice quality inspection, in particular to a voice quality inspection display method and device, electronic equipment and a computer readable medium.
Background
The voice quality inspection is used for promoting enterprise marketing and market development, improving customer satisfaction, improving customer service and simultaneously evaluating the work of customer service staff. Every day, a call center generates huge telephone recordings, and quality inspection is carried out on the recordings to detect whether customer service personnel use standard phrases or not and whether service is in place or not, so that the requirements of customers are met.
At present, voice analysis work such as voice quality inspection, voice labeling and the like is mainly manually detected by a worker in a mode of monitoring recording. In this way, for a place where the quality control rule may be hit or not clearly heard, the worker needs to listen to the recording from the beginning, which causes the worker to repeatedly listen to the recording to determine the voice hitting the quality control rule, wastes a lot of time, and causes low working efficiency. In addition, at present, no imaging which is visual, concise and convenient to operate is available in the aspect of voice quality inspection.
Disclosure of Invention
The invention aims to solve the technical problems that manual voice quality inspection in the prior art is time-consuming and labor-consuming, and the voice quality inspection cannot be graphically displayed.
In order to solve the above technical problem, a first aspect of the present invention provides a voice quality inspection display method, including:
displaying the audio waveform in the first display area;
displaying audio text in a second display area;
determining a target audio waveform and a target audio text which hit a preset rule;
and identifying and displaying the target audio waveform in the first display area and/or identifying and displaying the target audio text in the second display area.
According to a preferred embodiment of the present invention, the associating the audio hitting the preset rule to the target audio waveform and the audio text comprises:
creating a rule set comprising a plurality of preset rules;
extracting a target audio text hitting the preset rule;
and associating the target audio text into a corresponding target audio waveform.
According to a preferred embodiment of the present invention, the identifying and displaying the target audio waveform in the first display area comprises:
identifying a position where an audio waveform hitting a preset rule is displayed with a first identifier in a first display area;
the identifying and displaying the target audio text within the second display region comprises:
and identifying and displaying the keyword text hitting the preset rule in a remarkable mode in the second display area.
According to a preferred embodiment of the invention, the method further comprises:
when receiving user operation on the first identifier, identifying and displaying detailed information of a hit rule by a second identifier;
and when user operation on the first identifier and the second identifier is received, playing audio of the hit rule.
According to a preferred embodiment of the present invention, the displaying the audio text in the second display area includes:
parsing the audio waveform into text;
splitting the text into text units according to the service data, and marking the communication role;
analyzing the starting time, the text unit call duration and the ending time corresponding to the call role conversation text;
and displaying the text according to the call role, the start time corresponding to the conversation text, the call duration of the text unit and the end time.
According to a preferred embodiment of the invention, the method further comprises: and displaying an audio playing control unit in the first display area.
According to a preferred embodiment of the invention, the method further comprises:
displaying additional information within the second display area.
According to a preferred embodiment of the present invention, the additional information includes: user information, service information, and a hit rule list.
In order to solve the above technical problem, a second aspect of the present invention provides a voice quality inspection display device, including:
the first display module is used for displaying the audio waveform in the first display area;
the second display module is used for displaying the audio text in the second display area;
the determining module is used for determining a target audio waveform and a target audio text which hit a preset rule;
and the identification display module is used for identifying and displaying the target audio waveform in the first display area and/or identifying and displaying the target audio text in the second display area.
According to a preferred embodiment of the present invention, the determining module comprises:
the system comprises a creating module, a setting module and a processing module, wherein the creating module is used for creating a rule set containing a plurality of preset rules;
the extraction module is used for extracting the target audio text hitting the preset rule;
and the sub-association module is used for associating the target audio text to the corresponding target audio waveform.
According to a preferred embodiment of the present invention, the logo display module includes:
the first identification display module is used for identifying and displaying the position of the audio waveform hitting the preset rule in the first display area by using a first identifier;
and the second identification display module is used for identifying and displaying the keyword texts hitting the preset rules in a significant mode in the second display area.
According to a preferred embodiment of the invention, the device further comprises:
the interactive display module is used for identifying and displaying detailed information of the hit rule by a second identifier when receiving user operation on the first identifier; and when user operation on the first identifier and the second identifier is received, playing audio of the hit rule.
According to a preferred embodiment of the present invention, the second display module includes:
the first analysis module is used for analyzing the audio waveform into a text;
the splitting module is used for splitting the text into text units according to the service data and marking the conversation role;
the second analysis module is used for analyzing the starting time, the text unit conversation duration and the ending time corresponding to the conversation role conversation text;
and the sub-display module is used for displaying the text according to the call role, the start time corresponding to the conversation text, the call duration of the text unit and the end time.
According to a preferred embodiment of the present invention, the first display module is further configured to display the audio playing control unit in the first display area.
According to a preferred embodiment of the present invention, the second display module is further configured to: displaying additional information within the second display area.
According to a preferred embodiment of the present invention, the additional information includes: user information, service information, and a hit rule list.
To solve the above technical problem, a third aspect of the present invention provides an electronic device, comprising:
a processor; and
a memory storing computer executable instructions that, when executed, cause the processor to perform the method described above.
In order to solve the above technical problem, a fourth aspect of the present invention proposes a computer-readable storage medium, wherein the computer-readable storage medium stores one or more programs that, when executed by a processor, implement the above method.
The invention divides a display interface into a first display area and a second display area, and displays an audio waveform and an audio playing control unit in the first display area; displaying audio text in a second display area; associating the audio hitting the preset rule to a target audio waveform and an audio text; and identifying and displaying the target audio waveform in the first display area and/or identifying and displaying the target audio text in the second display area. The audio frequency hitting the preset rule is simultaneously identified and displayed through the audio frequency waveform of the first display interface and the audio frequency text of the second display interface, automatic identification display of voice quality inspection is achieved, workers only need to detect the audio frequency waveform or the audio frequency text displayed through identification, and the workload of the voice quality inspection is reduced.
Drawings
In order to make the technical problems solved by the present invention, the technical means adopted and the technical effects obtained more clear, the following will describe in detail the embodiments of the present invention with reference to the accompanying drawings. It should be noted, however, that the drawings described below are only illustrations of exemplary embodiments of the invention, from which other embodiments can be derived by those skilled in the art without inventive step.
FIG. 1 is a flow chart of a voice quality inspection display method according to the present invention;
FIG. 2 is a schematic illustration of a first display interface and a second display interface of the present invention;
FIG. 3 is a diagram illustrating the location of an audio waveform hitting a predetermined rule according to the present invention;
FIG. 4 is a diagram illustrating details of a hit rule identified by a second identifier according to the present invention;
FIG. 5 is a diagram illustrating a switching operation of an audio playback control unit to a corresponding audio playback mode according to the present invention;
FIG. 6 is a schematic diagram of a structural framework of a voice quality inspection display device according to the present invention;
FIG. 7 is a block diagram of an exemplary embodiment of an electronic device in accordance with the present invention;
FIG. 8 is a diagrammatic representation of one embodiment of a computer-readable medium of the present invention.
Detailed Description
Exemplary embodiments of the present invention will now be described more fully hereinafter with reference to the accompanying drawings, in which exemplary embodiments of the invention may be embodied in many specific forms, and should not be construed as limited to the embodiments set forth herein. Rather, these exemplary embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the concept of the invention to those skilled in the art.
The structures, properties, effects or other characteristics described in a certain embodiment may be combined in any suitable manner in one or more other embodiments, while still complying with the technical idea of the invention.
In describing particular embodiments, specific details of structures, properties, effects, or other features are set forth in order to provide a thorough understanding of the embodiments by one skilled in the art. However, it is not excluded that a person skilled in the art may implement the invention in a specific case without the above-described structures, performances, effects or other features.
The flow chart in the drawings is only an exemplary flow demonstration, and does not represent that all the contents, operations and steps in the flow chart are necessarily included in the scheme of the invention, nor does it represent that the execution is necessarily performed in the order shown in the drawings. For example, some operations/steps in the flowcharts may be divided, some operations/steps may be combined or partially combined, and the like, and the execution order shown in the flowcharts may be changed according to actual situations without departing from the gist of the present invention.
The block diagrams in the figures generally represent functional entities and do not necessarily correspond to physically separate entities. I.e. these functional entities may be implemented in the form of software, or in one or more hardware modules or integrated circuits, or in different networks and/or processor means and/or microcontroller means.
The same reference numerals denote the same or similar elements, components, or parts throughout the drawings, and thus, a repetitive description thereof may be omitted hereinafter. It will be further understood that, although the terms first, second, third, etc. may be used herein to describe various elements, components, or sections, these elements, components, or sections should not be limited by these terms. That is, these phrases are used only to distinguish one from another. For example, a first device may also be referred to as a second device without departing from the spirit of the present invention. Furthermore, the term "and/or", "and/or" is intended to include all combinations of any one or more of the listed items.
Fig. 1 is a flowchart of a voice quality inspection display method provided by the present invention, and as shown in fig. 1, the method includes:
s1, displaying the audio waveform in the first display area;
as shown in fig. 2, the display interface may be divided into a first display area 10 and a second display area 11, wherein the first display area 10 may be located above the second display area 11. Further, in order to realize control of audio playback, an audio playback control unit may also be displayed in the first display area 10.
The audio waveform and the audio playback control unit are displayed in the first display area 10. The left and right audio channels are split and are separately displayed by audio waveform diagrams, for example, in fig. 2, a left audio channel waveform diagram 21, a right audio channel waveform diagram 22 and an audio playing progress bar 23 are arranged on the upper portion of the first display area 10, so that a user can intuitively know the conditions of the audio length, the audio playing progress, the volume and the like. In this embodiment, the audio can be directly located at the designated position of the audio by dragging and clicking the audio playing progress bar 23 on the audio waveform, so that the user can conveniently and quickly locate and focus on the interested content. An audio length 31, a current playing progress 32, audio playing control units 33, each audio playing control unit 33 including a plurality of playing modes 34 are displayed below the audio waveform diagram. The audio playing control unit 33 includes an audio basic playing control unit, an associated text control unit, a variable speed playing control unit, and a volume control unit, and the playing mode corresponding to the basic playing control unit includes: playing, pausing and stopping, wherein the playing modes corresponding to the associated text control unit comprise: the playing mode corresponding to the variable speed playing control unit comprises the following steps: accelerated playback, decelerated playback, and playback at standard speed.
S2, displaying the audio text in the second display area;
in the invention, the audio playing in the first display area is associated with the audio text display in the second display area, namely, the audio playing in the first display area and the audio text display in the second display area can be synchronous.
Therefore, in the step, firstly, an audio waveform is analyzed into a text through an Automatic Speech Recognition technology (ASR); splitting the text into text units according to the service data, and marking the conversation role; the service data may be service content corresponding to the quality inspection dialogue, learning content in the speech learning (such as a certain article), lyric content in a song, and the like. Words or sentences may be used as text units. Meanwhile, the starting time, the text unit call duration and the ending time corresponding to the call role conversation text are analyzed in the step; and displaying the text according to the call role, the start time corresponding to the conversation text, the call duration of the text unit and the end time. As shown in fig. 2, the second display area 11 displays the call character, the call duration and the target audio text. The position of the audio text target audio waveform can be positioned through the call role, the start time corresponding to the dialog text, the call duration of the text unit and the end time, so that the audio waveform can be played and the audio text can be displayed synchronously according to the call role, the start time corresponding to the dialog text, the call duration of the text unit and the end time.
S3, determining a target audio waveform and a target audio text which hit a preset rule;
illustratively, this step includes:
s31, creating a rule set containing a plurality of preset rules;
the preset rule may be set by the rule engine, and the specific preset rule may be set according to the content that the user wants to detect, for example, the preset rule may be a preset word, a preset keyword, or a preset certain type of statement.
S32, extracting a target audio text hitting the preset rule;
specifically, a preset rule may be compared with the entire audio text, and a target audio text that hits the preset rule is found.
And S33, associating the target audio text with the corresponding target audio waveform.
Specifically, in step S2, the audio waveform and the audio text are synchronized by the call role, the start time corresponding to the dialog text, the call duration of the text unit, and the end time, so that the target audio text can be directly associated with the corresponding target audio waveform, that is, the target audio waveform corresponding to the target audio text can be directly located.
S4, identifying and displaying the target audio waveform in the first display area and/or identifying and displaying the target audio text in the second display area.
For example, in this step, the target audio waveform may be displayed in only the first display area, the target audio text may be displayed in only the second display area, or the target audio text may be displayed in the second display area while the target audio waveform is displayed in the first display area.
In one example, the identifying the target audio waveform for display in the first display region comprises: identifying a position where an audio waveform hitting a preset rule is displayed with a first identifier in a first display area;
the first identifier can be geometric icons such as anchor point icons, circular icons and triangular icons, and marks and displays positions where the left channel audio waveforms and the right channel audio waveforms can be distinguished from each other and hit the preset rules, so that the summary condition of the whole audio quality inspection result can be visually displayed. As shown in fig. 3, in the first display area 10, positions where audio waveforms of a preset rule are hit are displayed as circular icons 50 for the left channel audio waveform map 21 and the right channel audio waveform map 22.
The identifying and displaying the target audio text within the second display region comprises: and identifying and displaying the keyword text hitting the preset rule in a remarkable mode in the second display area. The prominent display mode can be any one of modes of font highlighting, font bolding, font color different from other audio text font colors, special mark display and the like, and can distinguish the display mode of the keyword text hitting the preset rule from other audio texts.
Through the steps, the audio target audio waveform position and the audio text position hitting the preset rule can be automatically displayed in the first display interface and the second display interface, and in order to further improve the accuracy of quality inspection, the audio waveform and the audio keywords hitting the preset rule can be operated to further review the hitting situation of the audio. Therefore, the voice quality inspection display method of the invention further comprises:
s51, when receiving the user operation of the first identifier, identifying and displaying the detailed information of the hit rule by a second identifier;
the user operation on the first identifier may be that the user clicks on the first identifier, and the second identifier may be a bubble, a checkerboard, or the like. As shown in fig. 4, after the user clicks the first identifier 50 in the waveform diagram 21 of the left channel audio, the detailed information of the hit rule of the audio corresponding to the first identifier 50 is displayed in a bubble manner, specifically, the number of the preset rule may be given, and the detailed information may be the number of the hit preset rule, for example, the number of the hit preset rule is displayed in the bubble of fig. 4, where the preset rule number 05 is displayed.
S52, when user operation on the first identifier and the second identifier is received, playing audio of the hit rule.
The user operation on the first identifier and the second identifier may be a click on the first identifier and the second identifier sequentially, or a selection of the first identifier and the second identifier, which is not limited in the present invention.
Further, for some specific services, additional information of these services may be displayed in the second display area.
Specifically, the second display area may include an audio text display area and an additional information display area, and the additional information is displayed in the additional information display area, where the additional information includes: user information, service information, and a hit rule list. The user information may be a user name added with additional information, the service information may be a name of a specific service, and the hit rule list includes a preset rule name hit by the corresponding audio text.
In addition, in the invention, because the audio playing in the first display area and the text display in the second display area are synchronized, the audio playing in the first display area and/or the audio text display in the second display area can be controlled through the first operation in the first display area, and obviously, the audio playing in the first display area and/or the audio text display in the second display area can also be controlled through the second operation in the second display area. And the playing of any specified audio waveform and/or the display of any specified audio text in the quality inspection process are facilitated.
It should be noted that, as shown in fig. 2, in the present invention, a playing control unit may be further displayed in the first display area, each playing control unit includes a plurality of playing modes, and as shown in fig. 5, each audio playing control unit 33 includes a plurality of playing modes 34. Specifically, the audio playing control unit 33 includes an audio basic playing control unit 331, an associated text control unit 332, a variable speed playing control unit 333, and a volume control unit (not shown in fig. 3), and the playing modes corresponding to the basic playing control unit 331 include: playing 341, pausing 342 and stopping 343, the playing modes corresponding to the associated text control unit 332 include: the previous audio 344 of the current audio is played, and the next audio 345 of the current audio is played, and the playing modes corresponding to the variable-speed playing control unit 333 include: accelerated playback 346, decelerated playback 347, and playback 348 at standard speed. Therefore, the playing mode of the selected audio playing control unit can be switched to a corresponding audio playing mode; and playing corresponding audio according to the audio playing mode and displaying a target audio text.
For example, in fig. 5, if the user selects the previous audio button 344 of the associated text control unit 332 for playing the current audio, the mode is switched to the play mode for playing the previous audio of the current audio. And corresponding to the playing mode of the last sentence of the current audio, playing the last sentence of the current audio and displaying the audio text of the last sentence of the current audio.
Fig. 6 is a schematic diagram of a voice quality inspection display device according to the present invention, and as shown in fig. 6, the device includes:
a first display module 61 for displaying the audio waveform in the first display area;
a second display module 62 for displaying audio text in a second display area;
a determining module 63, configured to determine a target audio waveform and a target audio text that hit a preset rule;
and an identifier display module 64, configured to identify and display the target audio waveform in the first display area and/or identify and display the target audio text in the second display area.
An interactive display module 65, configured to, when receiving a user operation on the first identifier, identify and display detailed information of a hit rule with a second identifier; and when user operation on the first identifier and the second identifier is received, playing audio of the hit rule.
Wherein the first display module 61 includes:
the first sub-display module 611, configured to split the audio left and right channels and display them separately as an audio waveform diagram;
and a second sub-display module 612, configured to display the audio length, the current playing progress, the playing control key, the associated text control key, the variable-speed playing control key, and the volume control key below the audio waveform diagram.
The second display module 62 includes:
a first parsing module 621, configured to parse the audio waveform into a text;
a splitting module 622, configured to split the text into text units according to the service data, and mark a call role;
a second parsing module 623, configured to parse the start time, the text unit call duration, and the end time corresponding to the call role dialog text;
and the sub-display module 624 is configured to display the text according to the call role, the start time corresponding to the dialog text, the call duration of the text unit, and the end time.
The determining module 63 includes:
a creating module 631 for creating a rule set including a plurality of preset rules;
the extracting module 632 is configured to extract a target audio text hitting the preset rule;
a sub-association module 633 for associating the target audio text to a corresponding target audio waveform.
The identification display module 64 includes:
a first identifier display module 641 for identifying and displaying a position of the audio waveform hitting the preset rule with a first identifier in the first display area;
and a second identifier display module 642, configured to identify and display the keyword text hitting the preset rule in a significant manner in the second display area.
Further, the second display module 62 is further configured to: displaying additional information within the second display area. Wherein the additional information includes: user information, service information, and a hit rule list.
Those skilled in the art will appreciate that the modules in the above-described embodiments of the apparatus may be distributed as described in the apparatus, and may be correspondingly modified and distributed in one or more apparatuses other than the above-described embodiments. The modules of the above embodiments may be combined into one module, or further split into multiple sub-modules.
In the following, embodiments of the electronic device of the present invention are described, which may be regarded as an implementation in physical form for the above-described embodiments of the method and apparatus of the present invention. Details described in the embodiments of the electronic device of the invention should be considered supplementary to the embodiments of the method or apparatus described above; for details which are not disclosed in embodiments of the electronic device of the invention, reference may be made to the above-described embodiments of the method or the apparatus.
Fig. 7 is a block diagram of an exemplary embodiment of an electronic device according to the present invention. The electronic device shown in fig. 7 is only an example, and should not bring any limitation to the functions and the scope of use of the embodiments of the present invention.
As shown in fig. 7, the electronic device 700 of the exemplary embodiment is represented in the form of a general-purpose data processing device. The components of the electronic device 700 may include, but are not limited to: at least one processing unit 710, at least one memory unit 720, a bus 730 connecting different electronic device components (including the memory unit 720 and the processing unit 710), a display unit 740, and the like.
The storage unit 720 stores a computer readable program, which may be a code of a source program or a read-only program. The program may be executed by the processing unit 710 such that the processing unit 710 performs the steps of various embodiments of the present invention. For example, the processing unit 710 may perform the steps as shown in fig. 1.
The memory unit 720 may include readable media in the form of volatile memory units, such as a random access memory unit (RAM)7201 and/or a cache memory unit 7202, and may further include a read only memory unit (ROM) 7203. The memory unit 720 may also include a program/utility 7204 having a set (at least one) of program modules 7205, such program modules 7205 including, but not limited to: operating the electronic device, one or more application programs, other program modules, and program data, each of which, or some combination thereof, may comprise an implementation of a network environment.
Bus 730 may be any representation of one or more of several types of bus structures, including a memory unit bus or memory unit controller, a peripheral bus, an accelerated graphics port, a processing unit, or a local bus using any of a variety of bus architectures.
The electronic device 700 may also communicate with one or more external devices 300 (e.g., keyboard, display, network device, bluetooth device, etc.), enable a user to interact with the electronic device 700 via the external devices 700, and/or enable the electronic device 700 to communicate with one or more other data processing devices (e.g., router, modem, etc.). Such communication may occur via input/output (I/O) interfaces 750, and may also occur via network adapter 760 with one or more networks, such as a Local Area Network (LAN), a Wide Area Network (WAN) and/or a public network, such as the Internet. The network adapter 760 may communicate with other modules of the electronic device 700 via the bus 730. It should be appreciated that although not shown in FIG. 7, other hardware and/or software modules may be used in electronic device 700, including but not limited to: microcode, device drivers, redundant processing units, external disk drive arrays, RAID electronics, tape drives, and data backup storage electronics, among others.
FIG. 8 is a schematic diagram of one computer-readable medium embodiment of the present invention. As shown in fig. 8, the computer program may be stored on one or more computer readable media. The computer readable medium may be a readable signal medium or a readable storage medium. The readable storage medium may be, for example, but not limited to, an electronic device, apparatus, or device that is electronic, magnetic, optical, electromagnetic, infrared, or semiconductor, or a combination of any of the foregoing. More specific examples (a non-exhaustive list) of the readable storage medium include: an electrical connection having one or more wires, a portable disk, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. The computer program, when executed by one or more data processing devices, enables the computer-readable medium to implement the above-described method of the invention, namely: displaying the audio waveform in the first display area; displaying audio text in a second display area; associating the audio hitting the preset rule to a target audio waveform and an audio text; and identifying and displaying the target audio waveform in the first display area and/or identifying and displaying the target audio text in the second display area.
Through the above description of the embodiments, those skilled in the art will readily understand that the exemplary embodiments of the present invention described herein may be implemented by software, or by software in combination with necessary hardware. Therefore, the technical solution according to the embodiment of the present invention can be embodied in the form of a software product, which can be stored in a computer-readable storage medium (which can be a CD-ROM, a usb disk, a removable hard disk, etc.) or on a network, and includes several instructions to make a data processing device (which can be a personal computer, a server, or a network device, etc.) execute the above-mentioned method according to the present invention.
The computer readable storage medium may include a propagated data signal with readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated data signal may take many forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A readable storage medium may also be any readable medium that is not a readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution electronic device, apparatus, or device. Program code embodied on a readable storage medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing.
Program code for carrying out operations for aspects of the present invention may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, C + + or the like and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computing device, partly on the user's device, as a stand-alone software package, partly on the user's computing device and partly on a remote computing device, or entirely on the remote computing device or server. In the case of a remote computing device, the remote computing device may be connected to the user computing device through any kind of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or may be connected to an external computing device (e.g., through the internet using an internet service provider).
In summary, the present invention can be implemented as a method, an apparatus, an electronic device, or a computer-readable medium executing a computer program. Some or all of the functions of the present invention may be implemented in practice using a general purpose data processing device such as a microprocessor or a Digital Signal Processor (DSP).
While the foregoing embodiments have described the objects, aspects and advantages of the present invention in further detail, it should be understood that the present invention is not inherently related to any particular computer, virtual machine or electronic device, and various general-purpose machines may be used to implement the present invention. The invention is not to be considered as limited to the specific embodiments thereof, but is to be understood as being modified in all respects, all changes and equivalents that come within the spirit and scope of the invention.

Claims (10)

1. A voice quality inspection display method is characterized by comprising the following steps:
displaying the audio waveform in the first display area;
displaying audio text in a second display area;
determining a target audio waveform and a target audio text which hit a preset rule;
and identifying and displaying the target audio waveform in the first display area and/or identifying and displaying the target audio text in the second display area.
2. The method of claim 1, wherein associating audio that hits a preset rule into a target audio waveform and audio text comprises:
creating a rule set comprising a plurality of preset rules;
extracting a target audio text hitting the preset rule;
and associating the target audio text into a corresponding target audio waveform.
3. The method of any of claims 1-2, wherein the identifying the target audio waveform for display in the first display region comprises:
identifying a position where an audio waveform hitting a preset rule is displayed with a first identifier in a first display area;
the identifying and displaying the target audio text within the second display region comprises:
and identifying and displaying the keyword text hitting the preset rule in a remarkable mode in the second display area.
4. The method according to any one of claims 1-3, further comprising:
when receiving user operation on the first identifier, identifying and displaying detailed information of a hit rule by a second identifier;
and when user operation on the first identifier and the second identifier is received, playing audio of the hit rule.
5. The method of any of claims 1-4, wherein displaying audio text in the second display region comprises:
parsing the audio waveform into text;
splitting the text into text units according to the service data, and marking the communication role;
analyzing the starting time, the text unit call duration and the ending time corresponding to the call role conversation text;
and displaying the text according to the call role, the start time corresponding to the conversation text, the call duration of the text unit and the end time.
6. The method according to any one of claims 1-5, further comprising: and displaying an audio playing control unit in the first display area.
7. The method according to any one of claims 1-6, further comprising:
displaying additional information within the second display area.
8. A voice quality inspection display device, characterized in that the device comprises:
the first display module is used for displaying the audio waveform in the first display area;
the second display module is used for displaying the audio text in the second display area;
the determining module is used for determining a target audio waveform and a target audio text which hit a preset rule;
and the identification display module is used for identifying and displaying the target audio waveform in the first display area and/or identifying and displaying the target audio text in the second display area.
9. An electronic device, comprising:
a processor; and
a memory storing computer-executable instructions that, when executed, cause the processor to perform the method of any of claims 1-7.
10. A computer readable storage medium, wherein the computer readable storage medium stores one or more programs which, when executed by a processor, implement the method of any of claims 1-7.
CN201910939378.0A 2019-09-30 2019-09-30 Voice quality inspection display method and device and electronic equipment Pending CN110750229A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910939378.0A CN110750229A (en) 2019-09-30 2019-09-30 Voice quality inspection display method and device and electronic equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910939378.0A CN110750229A (en) 2019-09-30 2019-09-30 Voice quality inspection display method and device and electronic equipment

Publications (1)

Publication Number Publication Date
CN110750229A true CN110750229A (en) 2020-02-04

Family

ID=69277491

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910939378.0A Pending CN110750229A (en) 2019-09-30 2019-09-30 Voice quality inspection display method and device and electronic equipment

Country Status (1)

Country Link
CN (1) CN110750229A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111405128A (en) * 2020-03-24 2020-07-10 中国—东盟信息港股份有限公司 Call quality inspection system based on voice-to-text conversion
CN111696527A (en) * 2020-06-15 2020-09-22 龙马智芯(珠海横琴)科技有限公司 Method and device for positioning voice quality inspection area, positioning equipment and storage medium
CN113689862A (en) * 2021-08-23 2021-11-23 南京优飞保科信息技术有限公司 Quality inspection method and system for customer service seat voice data
CN114579017A (en) * 2022-02-10 2022-06-03 优视科技(中国)有限公司 Method and device for displaying audio
CN115116432A (en) * 2021-03-17 2022-09-27 中国电信股份有限公司 Audio file labeling method and device and electronic equipment

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060217881A1 (en) * 2005-03-28 2006-09-28 Sap Aktiengesellschaft Incident command post
CN102937959A (en) * 2011-06-03 2013-02-20 苹果公司 Automatically creating a mapping between text data and audio data
CN104410920A (en) * 2014-12-31 2015-03-11 合一网络技术(北京)有限公司 Video segment playback amount-based method for labeling highlights
US20180268820A1 (en) * 2017-03-16 2018-09-20 Naver Corporation Method and system for generating content using speech comment
CN109634700A (en) * 2018-11-26 2019-04-16 维沃移动通信有限公司 A kind of the content of text display methods and terminal device of audio

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060217881A1 (en) * 2005-03-28 2006-09-28 Sap Aktiengesellschaft Incident command post
CN102937959A (en) * 2011-06-03 2013-02-20 苹果公司 Automatically creating a mapping between text data and audio data
CN104410920A (en) * 2014-12-31 2015-03-11 合一网络技术(北京)有限公司 Video segment playback amount-based method for labeling highlights
US20180268820A1 (en) * 2017-03-16 2018-09-20 Naver Corporation Method and system for generating content using speech comment
CN109634700A (en) * 2018-11-26 2019-04-16 维沃移动通信有限公司 A kind of the content of text display methods and terminal device of audio

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111405128A (en) * 2020-03-24 2020-07-10 中国—东盟信息港股份有限公司 Call quality inspection system based on voice-to-text conversion
CN111405128B (en) * 2020-03-24 2022-02-18 中国—东盟信息港股份有限公司 Call quality inspection system based on voice-to-text conversion
CN111696527A (en) * 2020-06-15 2020-09-22 龙马智芯(珠海横琴)科技有限公司 Method and device for positioning voice quality inspection area, positioning equipment and storage medium
CN115116432A (en) * 2021-03-17 2022-09-27 中国电信股份有限公司 Audio file labeling method and device and electronic equipment
CN113689862A (en) * 2021-08-23 2021-11-23 南京优飞保科信息技术有限公司 Quality inspection method and system for customer service seat voice data
CN113689862B (en) * 2021-08-23 2024-03-22 南京优飞保科信息技术有限公司 Quality inspection method and system for customer service agent voice data
CN114579017A (en) * 2022-02-10 2022-06-03 优视科技(中国)有限公司 Method and device for displaying audio

Similar Documents

Publication Publication Date Title
CN110750229A (en) Voice quality inspection display method and device and electronic equipment
US10929449B2 (en) Generating a structured document guiding view
CN102027473B (en) Method and system for media access by tag cloud
US20190362022A1 (en) Audio file labeling process for building datasets at scale
US20140019133A1 (en) Data processing method, presentation method, and corresponding apparatuses
US20140067367A1 (en) Method and system for reading fluency training
US11049525B2 (en) Transcript-based insertion of secondary video content into primary video content
CN108109636B (en) Text-based voice playing method and device, computer equipment and storage medium
AU2012359080A1 (en) Managing playback of supplemental information
CN112951275B (en) Voice quality inspection method and device, electronic equipment and medium
JP2011102862A (en) Speech recognition result control apparatus and speech recognition result display method
CN112487238B (en) Audio processing method, device, terminal and medium
CN105302906A (en) Information labeling method and apparatus
CN114023301A (en) Audio editing method, electronic device and storage medium
CN113901186A (en) Telephone recording marking method, device, equipment and storage medium
CN110674638A (en) Corpus labeling system and electronic equipment
CN114268829A (en) Video processing method and device, electronic equipment and computer readable storage medium
US8958532B2 (en) Navigating an interactive voice response system
CN111723235A (en) Music content identification method, device and equipment
CN114374885A (en) Video key segment determination method and device, electronic equipment and readable storage medium
CN110750230A (en) Voice interface display method and device and electronic equipment
CN106878773B (en) Electronic device, video processing method and apparatus, and storage medium
CN113259763A (en) Teaching video processing method and device and electronic equipment
CN111914115A (en) Sound information processing method and device and electronic equipment
Reidsma et al. Designing annotation tools based on properties of annotation problems

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination