CN114025050A - Speech recognition method and device based on intelligent outbound and text analysis - Google Patents
Speech recognition method and device based on intelligent outbound and text analysis Download PDFInfo
- Publication number
- CN114025050A CN114025050A CN202111312262.8A CN202111312262A CN114025050A CN 114025050 A CN114025050 A CN 114025050A CN 202111312262 A CN202111312262 A CN 202111312262A CN 114025050 A CN114025050 A CN 114025050A
- Authority
- CN
- China
- Prior art keywords
- text
- analysis
- robot
- outbound
- text analysis
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 40
- 238000003062 neural network model Methods 0.000 claims abstract description 7
- 238000012549 training Methods 0.000 claims abstract description 7
- 238000012545 processing Methods 0.000 claims abstract description 6
- 238000006243 chemical reaction Methods 0.000 claims description 3
- 238000004590 computer program Methods 0.000 claims description 2
- 238000010586 diagram Methods 0.000 description 4
- 238000004891 communication Methods 0.000 description 2
- 230000001360 synchronised effect Effects 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M3/00—Automatic or semi-automatic exchanges
- H04M3/42—Systems providing special services or facilities to subscribers
- H04M3/50—Centralised arrangements for answering calls; Centralised arrangements for recording messages for absent or busy subscribers ; Centralised arrangements for recording messages
- H04M3/527—Centralised call answering arrangements not requiring operator intervention
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/16—Speech classification or search using artificial neural networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Theoretical Computer Science (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- General Physics & Mathematics (AREA)
- Evolutionary Computation (AREA)
- General Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Biomedical Technology (AREA)
- Signal Processing (AREA)
- Life Sciences & Earth Sciences (AREA)
- Databases & Information Systems (AREA)
- Biophysics (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Telephonic Communication Services (AREA)
- Machine Translation (AREA)
Abstract
The invention discloses a voice recognition method based on intelligent outbound and text analysis, which comprises the following steps: acquiring a first voice stream of a call between an AI robot and a target client, wherein the first voice stream is obtained by the AI robot through calling out the target client based on a preset AI technique; converting the first voice stream into a first text, and sending the first text to a text analysis robot model for processing to obtain a classification label of the first text and a classification accuracy of the first text, wherein the analysis robot comprises a preconfigured analysis rule and a text analysis model, the analysis rule is used for outputting the classification accuracy, the text analysis model is obtained by training a neural network model through a preset single text corpus or multiple text corpora, and the classification label is used for describing an intention category of the target client.
Description
Technical Field
The application relates to the field of intelligent outbound, in particular to a voice recognition method and device based on intelligent outbound and text analysis.
Background
The existing intelligent outbound system mainly carries out outbound by matching an AI robot with a conversation, a circuit and the like, can realize repeated outbound work in large batch under limited resources, and greatly reduces the labor and time cost. However, the outbound is only a marketing means, and not a final purpose, and for the merchant, it is important to find a new business opportunity after how to implement a marketing purpose in the outbound process, and the existing outbound system cannot implement the identification of the business opportunity.
At present, there are many tools for imaging (labeling) customers through text analysis so as to realize accurate positioning of customers and further discover business opportunities. However, the above schemes mostly perform recognition after manually acquiring a text, and cannot be effectively combined with an intelligent outbound system to realize automatic business recognition after the outbound is finished. How to effectively combine intelligent callouts with text analysis is the next direction of research.
Disclosure of Invention
The technical problem to be solved by the embodiment of the application is to provide a voice recognition method and a voice recognition device based on intelligent outbound and text analysis, so as to solve the technical problem that the existing intelligent outbound and text analysis cannot be effectively combined to realize automatic business recognition after the outbound.
In order to achieve the above purpose, the embodiments of the present application adopt the following technical solutions:
in a first aspect, an embodiment of the present application provides a speech recognition method based on intelligent outbound and text analysis, where the method includes:
acquiring a first voice stream of a call between an AI robot and a target client, wherein the first voice stream is obtained by the AI robot through calling out the target client based on a preset AI technique;
converting the first voice stream into a first text, and sending the first text to a text analysis robot model for processing to obtain a classification label of the first text and a classification accuracy of the first text, wherein the analysis robot comprises a preconfigured analysis rule and a text analysis model, the analysis rule is used for outputting the classification accuracy, the text analysis model is obtained by training a neural network model through a preset single text corpus or multiple text corpora, and the classification label is used for describing an intention category of the target client.
In a second aspect, an embodiment of the present application provides a speech recognition apparatus based on intelligent outbound and text analysis, the apparatus includes:
the system comprises a first acquisition unit, a second acquisition unit and a third acquisition unit, wherein the first acquisition unit is used for acquiring a first voice stream of a call between an AI robot and a target client, and the first voice stream is obtained by calling the target client out through the AI robot based on a preset AI technique;
a first conversion unit, configured to convert the first voice stream into a first text;
the first sending unit is used for sending the first text to a text analysis robot model for processing to obtain a classification label of the first text and the classification accuracy of the first text, wherein the analysis robot comprises a preset analysis rule and a text analysis model, the analysis rule is used for outputting the classification accuracy, the text analysis model is obtained by training a neural network model through a preset single text corpus or a plurality of text corpora, and the classification label is used for describing the intention category of the user target.
In a third aspect, an embodiment of the present application provides an electronic device, which includes a processor and a memory, where the memory stores at least one instruction, at least one program, a set of codes to be executed, or a set of instructions, and the at least one instruction, the at least one program, the set of codes to be executed, or the set of instructions is executed by the processor to implement the method for speech recognition based on intelligent outbound and text analysis as described in the first aspect.
In a fourth aspect, embodiments of the present application provide a computer-readable storage medium, in which at least one instruction, at least one program, a set of codes, or a set of instructions is stored, and the at least one instruction, the at least one program, the set of codes, or the set of instructions is executed by a processor to implement the method for speech recognition based on intelligent callout and text analysis as described in the first aspect.
The beneficial effects of the embodiment of the application are that: the embodiment of the application provides a voice recognition method and a voice recognition device based on intelligent outbound and text analysis.
Drawings
FIG. 1 is a schematic flow chart illustrating a speech recognition method based on intelligent outbound and text analysis according to an embodiment of the present application;
fig. 2 is a schematic structural diagram of a speech recognition apparatus based on intelligent outbound and text analysis according to an embodiment of the present application;
fig. 3 is a schematic structural diagram of an electronic device according to an embodiment of the present application.
Detailed Description
The technical solutions of the present application are further described in detail with reference to the following specific embodiments, and it is obvious that the described embodiments are only a part of the embodiments of the present application, but not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.
The application provides a voice recognition method and device based on intelligent outbound and text analysis, and aims to solve the technical problem that the existing intelligent outbound and text analysis cannot be effectively combined to realize automatic business recognition after the outbound.
The technical solutions provided by the embodiments of the present application are described in detail below with reference to the accompanying drawings.
Referring to fig. 1, a flow chart of a speech recognition method based on intelligent outbound and text analysis according to an embodiment of the present application is shown, where the method includes:
step S101, a first voice stream of a call between an AI robot and a target client is obtained, wherein the first voice stream is obtained by calling the target client out through the AI robot based on a preset AI technique;
it is to be understood that the first voice stream is a voice stream generated during the communication between the AI robot and the target client, and includes a voice stream of the AI robot and a voice stream of the target client.
Step S102, converting the first voice stream into a first text, and sending the first text to a text analysis robot model for processing to obtain a classification label of the first text and a classification accuracy of the first text, wherein the analysis robot comprises a preconfigured analysis rule and a text analysis model, the analysis rule is used for outputting the classification accuracy, the text analysis model is obtained by training a neural network model through a preset single text corpus or multiple text corpora, and the classification label is used for describing an intention category of the target client.
With respect to step S102, the first voice stream is converted into a first text through voice to text.
In one embodiment, the category label includes any one of a consultation of a commodity, an intention to purchase, and a complaint suggestion.
In one embodiment, the text analysis model is parametrically adjusted by the classification label of the first text and the classification accuracy of the first text.
It can be understood that the parameters of the text analysis model are adjusted by the classification label and the classification accuracy of the first text, so that the text analysis model is optimized.
In one embodiment, prior to the AI robot making an outbound call to the target customer based on a preset AI conversation:
creating an outbound task, and configuring a preset AI speech and an outbound line for the AI robot;
and introducing the outbound information of the target client into the AI robot to execute an outbound task.
It is understood that the outbound information of the target client may include the name, outbound number, hobbies, etc. of the target client, which is not limited in this application.
Referring to fig. 2, a schematic structural diagram of a speech recognition apparatus based on intelligent outbound and text analysis according to an embodiment of the present application is shown, where the apparatus includes:
a first obtaining unit 201, configured to obtain a first voice stream for a call between the AI robot and a target client, where the first voice stream is obtained
The first voice flow is obtained by calling out the target client through the AI robot based on a preset AI technique;
a first conversion unit 202, configured to convert the first voice stream into a first text;
the first sending unit 203 is configured to send the first text to a text analysis robot model for processing to obtain a classification label of the first text and a classification accuracy of the first text, where the analysis robot includes a preconfigured analysis rule and a text analysis model, the analysis rule is used to output the classification accuracy, the text analysis model is obtained by training a neural network model through a preset single text corpus or multiple text corpora, and the classification label is used to describe an intention category of the target client.
Referring to fig. 3, a schematic structural diagram of an electronic device according to an embodiment of the present application is shown, where the electronic device may include: at least one network interface 302, memory 303, and at least one processor 301. The various components in the electronic device are coupled together by a bus system 304. It will be appreciated that the bus system 304 is used to enable communications among the components. The bus system 304 includes a power bus, a control bus, and a status signal bus in addition to a data bus, but for clarity of illustration, the various buses are labeled as bus system 304 in FIG. 3.
In some embodiments, memory 303 stores elements, executable modules or data structures, or a subset thereof, or an expanded set thereof as follows: an operating system 3031 and application programs 3032.
The operating system 3031 includes various system programs, such as a framework layer, a core library layer, a driver layer, and the like, and is configured to implement various outgoing services and process hardware-based tasks. The application 3032 includes various applications, such as a Media Player (Media Player), a Browser (Browser), and the like, and implements various application services. The program for implementing the method of the embodiment of the present application may be included in an application program.
In the above embodiment, the electronic device further includes: at least one instruction, at least one program, set of codes, or set of instructions stored on the memory 303 that is executable by the processor 301 to perform the steps of implementing any of the intelligent callout and text analysis based speech recognition methods described in the embodiments of the present application.
In one embodiment, the present application further provides a computer-readable storage medium having at least one instruction, at least one program, a set of codes, or a set of instructions stored therein, which when executed by a processor, implement the steps of any of the intelligent callout and text analysis based speech recognition methods described in the embodiments of the present application.
It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above may be implemented by hardware instructions of a computer program, and that the at least one instruction, the at least one program, the code set, or the instruction set may be stored in a non-volatile computer-readable storage medium, and when executed, the at least one instruction, the at least one program, the code set, or the instruction set may implement the steps of any of the mapping methods described in the embodiments of the present application. Any reference to memory, storage, database, or other medium used in the embodiments provided herein may include non-volatile and/or volatile memory, among others. Non-volatile memory can include read-only memory (ROM), Programmable ROM (PROM), Electrically Programmable ROM (EPROM), Electrically Erasable Programmable ROM (EEPROM), or flash memory. Volatile memory can include Random Access Memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in a variety of forms such as Static RAM (SRAM), Dynamic RAM (DRAM), Synchronous DRAM (SDRAM), Double Data Rate SDRAM (DDRSDRAM), Enhanced SDRAM (ESDRAM), Synchronous Link DRAM (SLDRAM), Rambus Direct RAM (RDRAM), direct bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM).
The embodiments of the present application have been described above with reference to the accompanying drawings, but the present application is not limited to the above-described embodiments, which are only illustrative and not restrictive; meanwhile, for a person skilled in the art, according to the idea of the present application, there may be variations in the specific embodiments and the application scope, which are within the protection scope of the present application.
Claims (8)
1. A speech recognition method based on intelligent outbound and text analysis is characterized by comprising the following steps:
acquiring a first voice stream of a call between the AI robot and a target client, wherein the first voice stream passes through
The AI robot carries out outbound acquisition on the target client based on a preset AI operation;
converting the first voice stream into a first text, and sending the first text to a text analysis robot
The model is processed to obtain a classification label of the first text and the classification accuracy of the first text, wherein the analysis robot comprises a preconfigured analysis rule and a text analysis model, the analysis rule is used for outputting the classification accuracy, the text analysis model is obtained by training a neural network model through a preset single text corpus or multiple text corpora, and the classification label is used for describing the intention category of the target client.
2. The method of claim 1, wherein the speech recognition method based on intelligent outbound and text analysis
Characteristically, the first voice stream is converted to first text by voice to text.
3. The method of claim 1, wherein the speech recognition method based on intelligent outbound and text analysis
Characteristically, the method further comprises: and performing parameter adjustment on the text analysis model through the classification label of the first text and the classification accuracy of the first text.
4. The method of claim 1, wherein the speech recognition method based on intelligent outbound and text analysis
Characterized in that, before the AI robot makes an outbound call to the target customer based on a preset AI conversation:
creating an outbound task, and configuring a preset AI speech and an outbound line for the AI robot;
and introducing the outbound information of the target client into the AI robot to execute an outbound task.
5. The method of claim 1, wherein the speech recognition method based on intelligent outbound and text analysis
The classification label comprises any one of commodity consultation, purchase intention and complaint suggestion.
6. A speech recognition device based on intelligent outbound and text analysis, the device comprising:
a first acquisition unit for acquiring a first voice stream of a call between the AI robot and a target client, wherein
The first voice flow is obtained by calling out the target client through the AI robot based on a preset AI technique;
a first conversion unit, configured to convert the first voice stream into a first text;
a first sending unit, configured to send the first text to a text analysis robot model for processing to obtain
And obtaining a classification label of the first text and the classification accuracy of the first text, wherein the analysis robot comprises a preconfigured analysis rule and a text analysis model, the analysis rule is used for outputting the classification accuracy, the text analysis model is obtained by training a neural network model through a preset single text corpus or multiple text corpora, and the classification label is used for describing the intention category of the target client.
7. An electronic device, characterized in that said electronic device comprises a processor and a memory, said memory
Stored with at least one instruction, at least one program, set of instructions to be executed by the processor to implement the method of speech recognition based on intelligent callout and text analysis according to any one of claims 1 to 5.
8. A computer-readable storage medium having stored therein a computer program product
At least one instruction, at least one program, set of instructions to be coded or a set of instructions to be executed by a processor to implement the intelligent callout and text analysis based speech recognition method according to any one of claims 1-5.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111312262.8A CN114025050A (en) | 2021-11-08 | 2021-11-08 | Speech recognition method and device based on intelligent outbound and text analysis |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111312262.8A CN114025050A (en) | 2021-11-08 | 2021-11-08 | Speech recognition method and device based on intelligent outbound and text analysis |
Publications (1)
Publication Number | Publication Date |
---|---|
CN114025050A true CN114025050A (en) | 2022-02-08 |
Family
ID=80062094
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111312262.8A Pending CN114025050A (en) | 2021-11-08 | 2021-11-08 | Speech recognition method and device based on intelligent outbound and text analysis |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114025050A (en) |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110351443A (en) * | 2019-06-17 | 2019-10-18 | 深圳壹账通智能科技有限公司 | Intelligent outgoing call processing method, device, computer equipment and storage medium |
CN111833871A (en) * | 2020-07-07 | 2020-10-27 | 信雅达系统工程股份有限公司 | Intelligent outbound system based on intention recognition and method thereof |
CN111916111A (en) * | 2020-07-20 | 2020-11-10 | 中国建设银行股份有限公司 | Intelligent voice outbound method and device with emotion, server and storage medium |
CN112185358A (en) * | 2020-08-24 | 2021-01-05 | 维知科技张家口有限责任公司 | Intention recognition method, model training method, device, equipment and medium |
CN112422749A (en) * | 2020-12-08 | 2021-02-26 | 浙江百应科技有限公司 | Method for preventing harassment outbound based on intelligent dialogue analysis |
KR102241532B1 (en) * | 2021-01-15 | 2021-04-16 | (주)두타위즈 | Intelligent callbot server and unmanned counsel systeim using thereof |
CN113094481A (en) * | 2021-03-03 | 2021-07-09 | 北京智齿博创科技有限公司 | Intention recognition method and device, electronic equipment and computer readable storage medium |
-
2021
- 2021-11-08 CN CN202111312262.8A patent/CN114025050A/en active Pending
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110351443A (en) * | 2019-06-17 | 2019-10-18 | 深圳壹账通智能科技有限公司 | Intelligent outgoing call processing method, device, computer equipment and storage medium |
CN111833871A (en) * | 2020-07-07 | 2020-10-27 | 信雅达系统工程股份有限公司 | Intelligent outbound system based on intention recognition and method thereof |
CN111916111A (en) * | 2020-07-20 | 2020-11-10 | 中国建设银行股份有限公司 | Intelligent voice outbound method and device with emotion, server and storage medium |
CN112185358A (en) * | 2020-08-24 | 2021-01-05 | 维知科技张家口有限责任公司 | Intention recognition method, model training method, device, equipment and medium |
CN112422749A (en) * | 2020-12-08 | 2021-02-26 | 浙江百应科技有限公司 | Method for preventing harassment outbound based on intelligent dialogue analysis |
KR102241532B1 (en) * | 2021-01-15 | 2021-04-16 | (주)두타위즈 | Intelligent callbot server and unmanned counsel systeim using thereof |
CN113094481A (en) * | 2021-03-03 | 2021-07-09 | 北京智齿博创科技有限公司 | Intention recognition method and device, electronic equipment and computer readable storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112492111B (en) | Intelligent voice outbound method, device, computer equipment and storage medium | |
CN110457679B (en) | User portrait construction method, device, computer equipment and storage medium | |
CN109492104B (en) | Training method, classification method, system, device and medium of intention classification model | |
CN110890088B (en) | Voice information feedback method and device, computer equipment and storage medium | |
CN113239147A (en) | Intelligent conversation method, system and medium based on graph neural network | |
CN111597818A (en) | Call quality inspection method, call quality inspection device, computer equipment and computer readable storage medium | |
CN111291157B (en) | Response method, device, terminal and storage medium | |
CN112235470B (en) | Incoming call client follow-up method, device and equipment based on voice recognition | |
CN110866115A (en) | Sequence labeling method, system, computer equipment and computer readable storage medium | |
CN112836521A (en) | Question-answer matching method and device, computer equipment and storage medium | |
CN116432665B (en) | Dialogue model construction method, text generation method, device, system and equipment | |
CN114025050A (en) | Speech recognition method and device based on intelligent outbound and text analysis | |
CN115964115A (en) | Numerical control machine tool interaction method based on pre-training reinforcement learning and related equipment | |
CN116563034A (en) | Purchase prediction method, device, equipment and storage medium based on artificial intelligence | |
CN115858757A (en) | Customer service mail response method and device, equipment, medium and product thereof | |
CN113992806A (en) | Intelligent voice RPA robot outbound method and device | |
US20210264439A1 (en) | System and method to generate digital responses to a customer query | |
CN112380031A (en) | Method and device for pushing messages in real time in cross-application mode and computing equipment | |
US20210264450A1 (en) | System and method to generate digital responses to a customer query | |
CN112035643B (en) | Method and device for multiplexing capacity of conversation robot | |
CN116684529A (en) | Outbound processing method, outbound processing device, computer equipment and storage medium | |
CN113467774B (en) | WEB terminal business software development framework and method | |
CN115567646A (en) | Intelligent outbound method, device, computer equipment and storage medium | |
CN117079640A (en) | Voice monitoring method, device, computer equipment and computer readable storage medium | |
CN116383389A (en) | Artificial intelligence-based intention recognition method, apparatus, computer device and medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20220208 |