CN114025050A - Speech recognition method and device based on intelligent outbound and text analysis - Google Patents

Speech recognition method and device based on intelligent outbound and text analysis

Info

Publication number
CN114025050A
Authority
CN
China
Prior art keywords
text
analysis
robot
outbound
text analysis
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111312262.8A
Other languages
Chinese (zh)
Inventor
吴永江
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang Baiying Technology Co Ltd
Original Assignee
Zhejiang Baiying Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2021-11-08
Filing date
2021-11-08
Publication date
2022-02-08
Application filed by Zhejiang Baiying Technology Co Ltd
Priority to CN202111312262.8A
Publication of CN114025050A

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M3/00 Automatic or semi-automatic exchanges
    • H04M3/42 Systems providing special services or facilities to subscribers
    • H04M3/50 Centralised arrangements for answering calls; Centralised arrangements for recording messages for absent or busy subscribers; Centralised arrangements for recording messages
    • H04M3/527 Centralised call answering arrangements not requiring operator intervention
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35 Clustering; Classification
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063 Training
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/16 Speech classification or search using artificial neural networks
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/26 Speech to text systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Theoretical Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • General Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Biomedical Technology (AREA)
  • Signal Processing (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Biophysics (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Telephonic Communication Services (AREA)
  • Machine Translation (AREA)

Abstract

The invention discloses a speech recognition method based on intelligent outbound and text analysis, which comprises the following steps: acquiring a first voice stream of a call between an AI robot and a target client, wherein the first voice stream is obtained when the AI robot places an outbound call to the target client based on a preset AI dialogue script; converting the first voice stream into a first text, and sending the first text to a text analysis robot for processing to obtain a classification label of the first text and a classification accuracy of the first text, wherein the text analysis robot comprises a preconfigured analysis rule and a text analysis model, the analysis rule is used to output the classification accuracy, the text analysis model is obtained by training a neural network model on a preset single text corpus or multiple text corpora, and the classification label describes an intention category of the target client.

Description

Speech recognition method and device based on intelligent outbound and text analysis
Technical Field
The application relates to the field of intelligent outbound calling, and in particular to a speech recognition method and device based on intelligent outbound and text analysis.
Background
Existing intelligent outbound systems mainly place calls by configuring an AI robot with a dialogue script, an outbound line, and the like. They can carry out large batches of repetitive outbound calls with limited resources, which greatly reduces labor and time costs. However, outbound calling is only a marketing means, not an end in itself. For a merchant, what matters is discovering new business opportunities once the marketing purpose of the call has been served, and existing outbound systems cannot identify business opportunities.
At present, many tools profile (label) customers through text analysis so that customers can be positioned precisely and business opportunities discovered. However, most of these schemes perform recognition only after the text has been collected manually, and they cannot be effectively combined with an intelligent outbound system to identify business opportunities automatically once a call ends. How to combine intelligent outbound calling with text analysis effectively is therefore the next direction of research.
Disclosure of Invention
The technical problem to be solved by the embodiments of the present application is to provide a speech recognition method and device based on intelligent outbound and text analysis, so as to address the fact that existing intelligent outbound calling and text analysis cannot be effectively combined to identify business opportunities automatically after an outbound call.
In order to achieve the above purpose, the embodiments of the present application adopt the following technical solutions:
In a first aspect, an embodiment of the present application provides a speech recognition method based on intelligent outbound and text analysis, where the method includes:
acquiring a first voice stream of a call between an AI robot and a target client, wherein the first voice stream is obtained when the AI robot places an outbound call to the target client based on a preset AI dialogue script;
converting the first voice stream into a first text, and sending the first text to a text analysis robot for processing to obtain a classification label of the first text and a classification accuracy of the first text, wherein the text analysis robot comprises a preconfigured analysis rule and a text analysis model, the analysis rule is used to output the classification accuracy, the text analysis model is obtained by training a neural network model on a preset single text corpus or multiple text corpora, and the classification label describes an intention category of the target client.
In a second aspect, an embodiment of the present application provides a speech recognition device based on intelligent outbound and text analysis, where the device includes:
a first acquisition unit, configured to acquire a first voice stream of a call between an AI robot and a target client, wherein the first voice stream is obtained when the AI robot places an outbound call to the target client based on a preset AI dialogue script;
a first conversion unit, configured to convert the first voice stream into a first text;
a first sending unit, configured to send the first text to a text analysis robot for processing to obtain a classification label of the first text and a classification accuracy of the first text, wherein the text analysis robot comprises a preconfigured analysis rule and a text analysis model, the analysis rule is used to output the classification accuracy, the text analysis model is obtained by training a neural network model on a preset single text corpus or multiple text corpora, and the classification label describes an intention category of the target client.
In a third aspect, an embodiment of the present application provides an electronic device, which includes a processor and a memory, where the memory stores at least one instruction, at least one program, a code set, or an instruction set, and the at least one instruction, the at least one program, the code set, or the instruction set is executed by the processor to implement the speech recognition method based on intelligent outbound and text analysis as described in the first aspect.
In a fourth aspect, an embodiment of the present application provides a computer-readable storage medium, in which at least one instruction, at least one program, a code set, or an instruction set is stored, and the at least one instruction, the at least one program, the code set, or the instruction set is executed by a processor to implement the speech recognition method based on intelligent outbound and text analysis as described in the first aspect.
The beneficial effects of the embodiment of the application are that: the embodiment of the application provides a voice recognition method and a voice recognition device based on intelligent outbound and text analysis.
Drawings
FIG. 1 is a schematic flow chart of a speech recognition method based on intelligent outbound and text analysis according to an embodiment of the present application;
FIG. 2 is a schematic structural diagram of a speech recognition device based on intelligent outbound and text analysis according to an embodiment of the present application;
FIG. 3 is a schematic structural diagram of an electronic device according to an embodiment of the present application.
Detailed Description
The technical solutions of the present application are further described in detail with reference to the following specific embodiments, and it is obvious that the described embodiments are only a part of the embodiments of the present application, but not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.
The application provides a speech recognition method and device based on intelligent outbound and text analysis, and aims to address the fact that existing intelligent outbound calling and text analysis cannot be effectively combined to identify business opportunities automatically after an outbound call.
The technical solutions provided by the embodiments of the present application are described in detail below with reference to the accompanying drawings.
Referring to FIG. 1, a flow chart of a speech recognition method based on intelligent outbound and text analysis according to an embodiment of the present application is shown, where the method includes:
Step S101, acquiring a first voice stream of a call between an AI robot and a target client, wherein the first voice stream is obtained when the AI robot places an outbound call to the target client based on a preset AI dialogue script.
It is to be understood that the first voice stream is the voice stream generated during the call between the AI robot and the target client, and includes both the voice stream of the AI robot and the voice stream of the target client.
Step S102, converting the first voice stream into a first text, and sending the first text to a text analysis robot for processing to obtain a classification label of the first text and a classification accuracy of the first text, wherein the text analysis robot comprises a preconfigured analysis rule and a text analysis model, the analysis rule is used to output the classification accuracy, the text analysis model is obtained by training a neural network model on a preset single text corpus or multiple text corpora, and the classification label describes an intention category of the target client.
With respect to step S102, the first voice stream is converted into the first text by speech-to-text conversion.
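By way of illustration only, steps S101 and S102 can be sketched in Python as follows. The application does not disclose any code, so every name here (`Utterance`, `AnalysisResult`, `transcribe_call`, `recognize`, `fake_stt`, `toy_robot`) is hypothetical, the speech-to-text engine is replaced by a stand-in that simply decodes bytes, and the analysis robot is a trivial placeholder rather than a trained model.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Utterance:
    speaker: str  # hypothetical tag: "robot" or "client"
    audio: bytes  # audio for one turn of the call


@dataclass
class AnalysisResult:
    label: str       # classification label (intention category)
    accuracy: float  # classification accuracy reported with the label


def transcribe_call(stream: List[Utterance], stt: Callable[[bytes], str]) -> str:
    """Step S102, first half: turn the first voice stream into the first text."""
    return "\n".join(f"{u.speaker}: {stt(u.audio)}" for u in stream)


def recognize(
    stream: List[Utterance],
    stt: Callable[[bytes], str],
    analysis_robot: Callable[[str], AnalysisResult],
) -> AnalysisResult:
    """Step S102, second half: send the first text to the analysis robot."""
    first_text = transcribe_call(stream, stt)
    return analysis_robot(first_text)


def fake_stt(audio: bytes) -> str:
    # Stand-in for a real speech-to-text engine (assumption for this sketch).
    return audio.decode("utf-8")


def toy_robot(first_text: str) -> AnalysisResult:
    # Placeholder logic only; a real robot would use rules plus a trained model.
    if "buy" in first_text.lower():
        return AnalysisResult("purchase intention", 0.9)
    return AnalysisResult("product inquiry", 0.5)


# Step S101: the first voice stream holds both sides of the call.
call = [
    Utterance("robot", b"Hello, are you interested in our plan?"),
    Utterance("client", b"Yes, I would like to buy it."),
]
result = recognize(call, fake_stt, toy_robot)
```

Passing the speech-to-text engine and the analysis robot in as callables keeps the two halves of step S102 separable, so either can be swapped without touching the other.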
In one embodiment, the classification label is any one of product inquiry, purchase intention, and complaint or suggestion.
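The application states that the analysis robot combines a text analysis model with an analysis rule that outputs the classification accuracy, but it does not say how either is computed. The following Python sketch shows one possible arrangement under stated assumptions: a keyword counter stands in for the trained neural network, and the rule is taken to be a softmax over the model scores. The keyword lists, the English label names, and the functions `model_scores`, `accuracy_rule`, and `analyze` are all hypothetical.

```python
import math
from typing import Dict, Tuple

# Hypothetical keyword lists standing in for a trained neural network.
KEYWORDS: Dict[str, Tuple[str, ...]] = {
    "product inquiry": ("price", "how much", "feature"),
    "purchase intention": ("buy", "order", "sign up"),
    "complaint or suggestion": ("complain", "refund", "suggest"),
}


def model_scores(text: str) -> Dict[str, float]:
    """Stand-in for the text analysis model: one raw score per label."""
    lowered = text.lower()
    return {
        label: float(sum(lowered.count(word) for word in words))
        for label, words in KEYWORDS.items()
    }


def accuracy_rule(scores: Dict[str, float]) -> Tuple[str, float]:
    """Assumed analysis rule: softmax over scores, report the top label."""
    peak = max(scores.values())
    exps = {label: math.exp(score - peak) for label, score in scores.items()}
    total = sum(exps.values())
    best = max(exps, key=exps.get)
    return best, exps[best] / total


def analyze(text: str) -> Tuple[str, float]:
    """Return (classification label, classification accuracy) for a text."""
    return accuracy_rule(model_scores(text))


label, accuracy = analyze("I want to buy it, how do I order?")
```

With this rule a text that matches no keyword yields a tie, so the first label is returned with an accuracy of one third, which signals low certainty to any downstream consumer.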
In one embodiment, the parameters of the text analysis model are adjusted using the classification label of the first text and the classification accuracy of the first text.
It can be understood that adjusting the parameters of the text analysis model with the classification label and the classification accuracy of the first text optimizes the text analysis model.
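The application does not specify how the parameters are adjusted. As a minimal sketch of one possibility, the Python below applies a single gradient step to a small bag-of-words softmax classifier, treating the classification label as the target and scaling the step by the classification accuracy. The class `TinyTextModel`, its methods, the learning rate, and the accuracy-weighted update are all assumptions made for illustration, not the method of the application.

```python
import math
from collections import defaultdict
from typing import Dict, Iterable, List


class TinyTextModel:
    """Hypothetical linear softmax classifier over word tokens."""

    def __init__(self, labels: Iterable[str]) -> None:
        self.labels: List[str] = list(labels)
        # One weight per (label, token) pair, defaulting to zero.
        self.weights: Dict[str, Dict[str, float]] = {
            label: defaultdict(float) for label in self.labels
        }

    def probabilities(self, tokens: List[str]) -> Dict[str, float]:
        scores = {
            label: sum(self.weights[label][t] for t in tokens)
            for label in self.labels
        }
        peak = max(scores.values())
        exps = {label: math.exp(s - peak) for label, s in scores.items()}
        total = sum(exps.values())
        return {label: e / total for label, e in exps.items()}

    def adjust(self, tokens: List[str], label: str, accuracy: float,
               lr: float = 0.5) -> None:
        """One cross-entropy gradient step, scaled by the reported accuracy."""
        probs = self.probabilities(tokens)
        for candidate in self.labels:
            target = 1.0 if candidate == label else 0.0
            step = lr * accuracy * (target - probs[candidate])
            for t in tokens:
                self.weights[candidate][t] += step


model = TinyTextModel(["purchase intention", "product inquiry"])
tokens = ["buy", "now"]
before = model.probabilities(tokens)["purchase intention"]
model.adjust(tokens, "purchase intention", accuracy=0.9)
after = model.probabilities(tokens)["purchase intention"]
```

Scaling the step by the accuracy means low-certainty labels move the weights less, which is one common way to limit the damage from feeding a model its own uncertain outputs.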
In one embodiment, before the AI robot places the outbound call to the target client based on the preset AI dialogue script, the method further includes:
creating an outbound task, and configuring the preset AI dialogue script and an outbound line for the AI robot;
importing the outbound information of the target client into the AI robot to execute the outbound task.
It is understood that the outbound information of the target client may include the name, outbound number, hobbies, and so on of the target client, which is not limited in this application.
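The task setup described above can be sketched, purely as an illustration, with two Python data classes: one for the outbound information of a target client and one for the outbound task that holds the script and line configured for the AI robot. The names `ClientRecord`, `OutboundTask`, `script_id`, `line_id`, and `import_clients` are hypothetical, and skipping records whose number was already imported is an assumption of this sketch, not something the application states.

```python
from dataclasses import dataclass, field
from typing import Iterable, List


@dataclass
class ClientRecord:
    """Outbound information of one target client."""
    name: str
    number: str
    hobbies: List[str] = field(default_factory=list)


@dataclass
class OutboundTask:
    """An outbound task with the script and line configured for the robot."""
    script_id: str  # hypothetical identifier of the preset script
    line_id: str    # hypothetical identifier of the outbound line
    clients: List[ClientRecord] = field(default_factory=list)

    def import_clients(self, records: Iterable[ClientRecord]) -> int:
        """Import client records, skipping numbers already present.

        Returns the total number of clients queued for the task.
        """
        seen = {client.number for client in self.clients}
        for record in records:
            if record.number not in seen:
                self.clients.append(record)
                seen.add(record.number)
        return len(self.clients)


task = OutboundTask(script_id="script-demo", line_id="line-demo")
queued = task.import_clients([
    ClientRecord("Client A", "10000000001", ["reading"]),
    ClientRecord("Client B", "10000000002"),
    ClientRecord("Client A again", "10000000001"),
])
```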
Referring to FIG. 2, a schematic structural diagram of a speech recognition device based on intelligent outbound and text analysis according to an embodiment of the present application is shown, where the device includes:
a first acquisition unit 201, configured to acquire a first voice stream of a call between an AI robot and a target client, wherein the first voice stream is obtained when the AI robot places an outbound call to the target client based on a preset AI dialogue script;
a first conversion unit 202, configured to convert the first voice stream into a first text;
a first sending unit 203, configured to send the first text to a text analysis robot for processing to obtain a classification label of the first text and a classification accuracy of the first text, wherein the text analysis robot comprises a preconfigured analysis rule and a text analysis model, the analysis rule is used to output the classification accuracy, the text analysis model is obtained by training a neural network model on a preset single text corpus or multiple text corpora, and the classification label describes an intention category of the target client.
Referring to fig. 3, a schematic structural diagram of an electronic device according to an embodiment of the present application is shown, where the electronic device may include: at least one network interface 302, memory 303, and at least one processor 301. The various components in the electronic device are coupled together by a bus system 304. It will be appreciated that the bus system 304 is used to enable communications among the components. The bus system 304 includes a power bus, a control bus, and a status signal bus in addition to a data bus, but for clarity of illustration, the various buses are labeled as bus system 304 in FIG. 3.
In some embodiments, the memory 303 stores the following elements, namely executable modules or data structures, or a subset or an extended set thereof: an operating system 3031 and application programs 3032.
The operating system 3031 includes various system programs, such as a framework layer, a core library layer, a driver layer, and the like, and is configured to implement various outgoing services and process hardware-based tasks. The application 3032 includes various applications, such as a Media Player (Media Player), a Browser (Browser), and the like, and implements various application services. The program for implementing the method of the embodiment of the present application may be included in an application program.
In the above embodiment, the electronic device further includes: at least one instruction, at least one program, a code set, or an instruction set stored in the memory 303 and executable by the processor 301 to implement the steps of any of the speech recognition methods based on intelligent outbound and text analysis described in the embodiments of the present application.
In one embodiment, the present application further provides a computer-readable storage medium having at least one instruction, at least one program, a code set, or an instruction set stored therein, which, when executed by a processor, implements the steps of any of the speech recognition methods based on intelligent outbound and text analysis described in the embodiments of the present application.
It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above may be implemented by a computer program instructing the relevant hardware. The at least one instruction, the at least one program, the code set, or the instruction set may be stored in a non-volatile computer-readable storage medium and, when executed, may implement the steps of any of the methods described in the embodiments of the present application. Any reference to memory, storage, database, or other medium used in the embodiments provided herein may include non-volatile and/or volatile memory. Non-volatile memory can include read-only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory. Volatile memory can include random access memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in a variety of forms such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDR SDRAM), enhanced SDRAM (ESDRAM), synchronous link DRAM (SLDRAM), Rambus direct RAM (RDRAM), direct Rambus dynamic RAM (DRDRAM), and Rambus dynamic RAM (RDRAM).
The embodiments of the present application have been described above with reference to the accompanying drawings, but the present application is not limited to the above-described embodiments, which are only illustrative and not restrictive; meanwhile, for a person skilled in the art, according to the idea of the present application, there may be variations in the specific embodiments and the application scope, which are within the protection scope of the present application.

Claims (8)

1. A speech recognition method based on intelligent outbound and text analysis, characterized by comprising the following steps:
acquiring a first voice stream of a call between an AI robot and a target client, wherein the first voice stream is obtained when the AI robot places an outbound call to the target client based on a preset AI dialogue script;
converting the first voice stream into a first text, and sending the first text to a text analysis robot for processing to obtain a classification label of the first text and a classification accuracy of the first text, wherein the text analysis robot comprises a preconfigured analysis rule and a text analysis model, the analysis rule is used to output the classification accuracy, the text analysis model is obtained by training a neural network model on a preset single text corpus or multiple text corpora, and the classification label describes an intention category of the target client.
2. The speech recognition method based on intelligent outbound and text analysis according to claim 1, characterized in that the first voice stream is converted into the first text by speech-to-text conversion.
3. The speech recognition method based on intelligent outbound and text analysis according to claim 1, characterized in that the method further comprises: adjusting the parameters of the text analysis model using the classification label of the first text and the classification accuracy of the first text.
4. The speech recognition method based on intelligent outbound and text analysis according to claim 1, characterized in that, before the AI robot places the outbound call to the target client based on the preset AI dialogue script, the method further comprises:
creating an outbound task, and configuring the preset AI dialogue script and an outbound line for the AI robot;
importing the outbound information of the target client into the AI robot to execute the outbound task.
5. The speech recognition method based on intelligent outbound and text analysis according to claim 1, characterized in that the classification label is any one of product inquiry, purchase intention, and complaint or suggestion.
6. A speech recognition device based on intelligent outbound and text analysis, the device comprising:
a first acquisition unit, configured to acquire a first voice stream of a call between an AI robot and a target client, wherein the first voice stream is obtained when the AI robot places an outbound call to the target client based on a preset AI dialogue script;
a first conversion unit, configured to convert the first voice stream into a first text;
a first sending unit, configured to send the first text to a text analysis robot for processing to obtain a classification label of the first text and a classification accuracy of the first text, wherein the text analysis robot comprises a preconfigured analysis rule and a text analysis model, the analysis rule is used to output the classification accuracy, the text analysis model is obtained by training a neural network model on a preset single text corpus or multiple text corpora, and the classification label describes an intention category of the target client.
7. An electronic device, characterized in that the electronic device comprises a processor and a memory, the memory storing at least one instruction, at least one program, a code set, or an instruction set, which is executed by the processor to implement the speech recognition method based on intelligent outbound and text analysis according to any one of claims 1 to 5.
8. A computer-readable storage medium, characterized in that the storage medium stores at least one instruction, at least one program, a code set, or an instruction set, which is executed by a processor to implement the speech recognition method based on intelligent outbound and text analysis according to any one of claims 1 to 5.

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111312262.8A CN114025050A (en) 2021-11-08 2021-11-08 Speech recognition method and device based on intelligent outbound and text analysis

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111312262.8A CN114025050A (en) 2021-11-08 2021-11-08 Speech recognition method and device based on intelligent outbound and text analysis

Publications (1)

Publication Number Publication Date
CN114025050A (en) 2022-02-08

Family

ID=80062094

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111312262.8A Pending CN114025050A (en) 2021-11-08 2021-11-08 Speech recognition method and device based on intelligent outbound and text analysis

Country Status (1)

Country Link
CN (1) CN114025050A (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110351443A (en) * 2019-06-17 2019-10-18 深圳壹账通智能科技有限公司 Intelligent outgoing call processing method, device, computer equipment and storage medium
CN111833871A (en) * 2020-07-07 2020-10-27 信雅达系统工程股份有限公司 Intelligent outbound system based on intention recognition and method thereof
CN111916111A (en) * 2020-07-20 2020-11-10 中国建设银行股份有限公司 Intelligent voice outbound method and device with emotion, server and storage medium
CN112185358A (en) * 2020-08-24 2021-01-05 维知科技张家口有限责任公司 Intention recognition method, model training method, device, equipment and medium
CN112422749A (en) * 2020-12-08 2021-02-26 浙江百应科技有限公司 Method for preventing harassment outbound based on intelligent dialogue analysis
KR102241532B1 (en) * 2021-01-15 2021-04-16 (주)두타위즈 Intelligent callbot server and unmanned counsel systeim using thereof
CN113094481A (en) * 2021-03-03 2021-07-09 北京智齿博创科技有限公司 Intention recognition method and device, electronic equipment and computer readable storage medium

Similar Documents

Publication Publication Date Title
CN112492111B (en) Intelligent voice outbound method, device, computer equipment and storage medium
CN110457679B (en) User portrait construction method, device, computer equipment and storage medium
CN109492104B (en) Training method, classification method, system, device and medium of intention classification model
CN110890088B (en) Voice information feedback method and device, computer equipment and storage medium
CN113239147A (en) Intelligent conversation method, system and medium based on graph neural network
CN111597818A (en) Call quality inspection method, call quality inspection device, computer equipment and computer readable storage medium
CN111291157B (en) Response method, device, terminal and storage medium
CN112235470B (en) Incoming call client follow-up method, device and equipment based on voice recognition
CN110866115A (en) Sequence labeling method, system, computer equipment and computer readable storage medium
CN112836521A (en) Question-answer matching method and device, computer equipment and storage medium
CN116432665B (en) Dialogue model construction method, text generation method, device, system and equipment
CN114025050A (en) Speech recognition method and device based on intelligent outbound and text analysis
CN115964115A (en) Numerical control machine tool interaction method based on pre-training reinforcement learning and related equipment
CN116563034A (en) Purchase prediction method, device, equipment and storage medium based on artificial intelligence
CN115858757A (en) Customer service mail response method and device, equipment, medium and product thereof
CN113992806A (en) Intelligent voice RPA robot outbound method and device
US20210264439A1 (en) System and method to generate digital responses to a customer query
CN112380031A (en) Method and device for pushing messages in real time in cross-application mode and computing equipment
US20210264450A1 (en) System and method to generate digital responses to a customer query
CN112035643B (en) Method and device for multiplexing capacity of conversation robot
CN116684529A (en) Outbound processing method, outbound processing device, computer equipment and storage medium
CN113467774B (en) WEB terminal business software development framework and method
CN115567646A (en) Intelligent outbound method, device, computer equipment and storage medium
CN117079640A (en) Voice monitoring method, device, computer equipment and computer readable storage medium
CN116383389A (en) Artificial intelligence-based intention recognition method, apparatus, computer device and medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20220208