CN110299140A

CN110299140A - A kind of key content extraction algorithm based on Intelligent dialogue

Info

Publication number: CN110299140A
Application number: CN201910524215.6A
Authority: CN
Inventors: 王磊
Original assignee: Zhejiang Baiying Technology Co Ltd
Current assignee: Zhejiang Baiying Technology Co Ltd
Priority date: 2019-06-18
Filing date: 2019-06-18
Publication date: 2019-10-01

Abstract

The invention discloses a kind of key content extraction algorithm based on Intelligent dialogue, more particularly to network communication field, including robot, the robot interior is integrated with ASR speech recognition system, the robot connects nlp server, the specific steps are as follows: S1, voice are converted into text and are sent to nlp server；S2, text word cutting are simultaneously indicated with vector；S3, two-way gru is run to each word, obtains the expression of another vector；S4, each word distribute a lable label and are converted to probability by softmax layers by score normalization；S5, it decodes to obtain optimal sequence using Viterbi, improve viterbi algorithm and is optimized using array is rolled；S6, the sequence that will have been marked identify harassing call.The present invention extracts crucial information by artificial intelligence from dialogue, allows robot to answer and answer, key message is recorded, while user can be facilitated to carry out decision and judge whether to call back to avoid important phone is lost.

Description

A kind of key content extraction algorithm based on Intelligent dialogue

Technical field

The present invention relates to network communication technology fields, it is more particularly related to a kind of pass based on Intelligent dialogue Key contents extraction algorithm.

Background technique

Network communication nowadays has become the major way of the daily communication of people, also expedites the emergence of while bringing convenient for people Various Dark Industry Links.Wherein harassing call is the most serious, or even influences the routine work of people.How effectively to avoid Harassing call also becomes the hot spot paid close attention in recent years.

At present in the industry for harassing call by following several: carrying out phone mark by user, electricity can be harassed to avoid part Words, but a large amount of harassing call is broadcasted automatically by machine, is labeled in mass data, is needed to expend a large amount of manpower object Power；Robot is allowed to answer, presets simple question-response form, while recording the content of dialogue, but need user again complete Whole listens a conversation content, and whether be harassing call, under efficiency is very low if screening.

To sum up, existing Barassment preventing telephone is still to be improved in efficiency.

Summary of the invention

In order to overcome the drawbacks described above of the prior art, the embodiment of the present invention is provided in a kind of key based on Intelligent dialogue Hold extraction algorithm, by artificial intelligence, crucial information is extracted from dialogue, allows robot to answer and answer, by key message Record, while user can be facilitated to carry out decision and judge whether to call back to avoid important phone is lost, robot by sound Sound is sent to ASR identification in real time, obtains the recognition result of textual form, is used for natural language processing, excavates key message, obtains Preset question and answer, plays back with audio form, accomplishes robot auto-pickup, while mentioning to key content Take record.

To achieve the above object, the invention provides the following technical scheme: a kind of key content based on Intelligent dialogue extracts Algorithm, including robot, the robot interior are integrated with ASR speech recognition system, and the robot connects nlp server, Specific step is as follows for key content extraction algorithm:

S1, when visitor or robot initiate speech exchange, visitor's sound is sent into ASR speech recognition system in real time by robot Speech recognition is carried out in system, converts text for voice, is sent on nlp server after saving in the form of text, is returned crucial Information；

The text of input is carried out word cutting by S2, nlp server, is indicated to each word with term vector, and term vector It is made of 3 parts: the term vector of word2vec training, character level other vector sum word position vector, to position, character Embedding runs a two-way lstm, then splices the hidden layer of these three, obtains the vector of a regular length；

S3, the term vector according to obtained in step S2 run two-way gru to each word in sentence, then obtain another Vector indicates；

S4, finally with a full articulamentum lable label is distributed to each word, and by softmax layers by score It is normalized, is converted to probability；

S5, decode to obtain optimal sequence using Viterbi, and viterbi algorithm improved, using roll array into Row optimization；

S6, the sequence that will finally mark extract desired information by state automata, identify harassing call.

In a preferred embodiment, in one layer of crf of softmax layers of connection in the step S4.

In a preferred embodiment, it in the step S5, can also be needed to carry out mark to text according to business, Identify harassing call.

In a preferred embodiment, the robot interior is provided with audio player, can be used in extract Key content played back with audio form, accomplish robot auto-pickup, while record is extracted to key content.

Technical effect and advantage of the invention:

1, the present invention extracts crucial information by artificial intelligence from dialogue, allows robot to answer and answer, will be crucial Information is recorded, while user can be facilitated to carry out decision and judge whether to call back, and closed to avoid important phone is lost During key contents extraction, ASR speech recognition system identifies the speech exchange between visitor or robot, is converted into text, so After be sent to nlp server and carry out word cutting, then each word is indicated with term vector, to position, character Embedding runs a two-way lstm, then splices the hidden layer of these three, obtains the vector of a regular length, so Two-way gru is run to each word in sentence afterwards, the expression of another vector is then obtained, finally with a full articulamentum to each word A lable label is distributed, and score is normalized by softmax layers, probability is converted to, is decoded using Viterbi Optimal sequence is obtained, and viterbi algorithm is improved, is optimized using array is rolled, greatly saves space, finally The sequence that will have been marked is extracted desired information by state automata, or needs to play text according to business Mark identifies harassing call；

2, it is connected to one layer of crf for softmax layers of the present invention, advantage is can during it is labeled for a position To use the information marked before this, after avoiding the occurrence of softmax layers of output, the case where verb is followed by verb, benefit are generated Optimal sequence is obtained with Viterbi decoding, to save space, being improved to viterbi algorithm, is carried out using array is rolled Optimization, keyword extraction accuracy are high；

3, robot by sound be sent in real time ASR identification, obtain the recognition result of textual form, be used for natural language Processing is excavated key message, obtains preset question and answer, played back with audio form, accomplish that robot connects automatically It listens, while record is extracted to key content.

Detailed description of the invention

Fig. 1 is the overview flow chart of the key content extraction algorithm of the invention based on Intelligent dialogue.

Fig. 2 is the key content extraction algorithm schematic diagram of the invention based on Intelligent dialogue.

Specific embodiment

Following will be combined with the drawings in the embodiments of the present invention, and technical solution in the embodiment of the present invention carries out clear, complete Site preparation description, it is clear that described embodiments are only a part of the embodiments of the present invention, instead of all the embodiments.It is based on Embodiment in the present invention, it is obtained by those of ordinary skill in the art without making creative efforts every other Embodiment shall fall within the protection scope of the present invention.

Embodiment 1:

A kind of key content extraction algorithm based on Intelligent dialogue according to shown in Fig. 1-2, including robot, the machine People has been internally integrated ASR speech recognition system, and the robot connects nlp server, key content extraction algorithm specific steps It is as follows:

S5, it decodes to obtain optimal sequence using Viterbi, to save space, and improving viterbi algorithm, use Array is rolled to optimize；

Further, do not have after softmax layers of output in the step S4 in one layer of crf of softmax layers of connection The relationship for considering front and back label, can generate the case where verb is followed by verb, for the generation for avoiding such case, Wo Men Softmax layers have met one layer of crf, and advantage is to can use before this during it is labeled for a position The information of mark.

Further, the robot interior is provided with audio player, and the key content that can be used in extract is with sound Frequency form plays back, and accomplishes robot auto-pickup, while extracting record to key content.

Embodiment 2:

A kind of key content extraction algorithm based on Intelligent dialogue, including robot, the robot interior are integrated with ASR Speech recognition system, the robot connect nlp server, and specific step is as follows for key content extraction algorithm:

S6, finally the sequence marked is needed to carry out mark to text according to business, identifies harassing call.

Embodiment 3:

As shown in Fig. 2, specific operating procedure is as follows:

S1, when visitor or robot initiate speech exchange, visitor's sound is sent into ASR speech recognition system in real time by robot Speech recognition is carried out in system, such as the text of identification is " you hundred answer ", at this point, ASR speech recognition system converts voice to Text is sent on nlp server after saving in the form of text, returns to key message；

The text of input is carried out word cutting by S2, nlp server, is indicated to each word with term vector, as X1, X2, X3, X4, and term vector is made of 3 parts: the term vector of word2vec training, the other vector sum word position vector of character level, it is right Position, character embedding run a two-way lstm, i.e. corresponding two h3 of X1 corresponding two h1, X2 corresponding two h2, X3, X4 corresponds to two h4, then splices the hidden layer of these three, obtains the vector of a regular length；

S3, the term vector according to obtained in step S2 run two-way gru to each word in sentence, then obtain another Vector indicates, obtains P1, P2, P3, P4；

S4, finally with a full articulamentum to each word distribute a lable label, i.e. lable1, lable2, Lable3, lable4, and score is normalized by softmax layers, it is converted to probability；

The several points that should finally illustrate are: firstly, in the description of the present application, it should be noted that unless otherwise prescribed and It limits, term " installation ", " connected ", " connection " shall be understood in a broad sense, can be mechanical connection or electrical connection, be also possible to two Connection inside element, can be directly connected, and "upper", "lower", "left", "right" etc. are only used for indicating relative positional relationship, when The absolute position for being described object changes, then relative positional relationship may change；

Secondly: the present invention discloses in embodiment attached drawing, relates only to the structure being related to the embodiment of the present disclosure, other knots Structure, which can refer to, to be commonly designed, and under not conflict situations, the same embodiment of the present invention and different embodiments be can be combined with each other；

Last: the foregoing is only a preferred embodiment of the present invention, is not intended to restrict the invention, all in the present invention Spirit and principle within, any modification, equivalent replacement, improvement and so on, should be included in protection scope of the present invention it It is interior.

Claims

1. a kind of key content extraction algorithm based on Intelligent dialogue, it is characterised in that: including robot, the robot interior It is integrated with ASR speech recognition system, the robot connects nlp server, and specific step is as follows for key content extraction algorithm:

S1, when visitor or robot initiate speech exchange, visitor's sound is sent into ASR speech recognition system by robot in real time Speech recognition is carried out, converts text for voice, is sent on nlp server after saving in the form of text, key message is returned；

The text of input is carried out word cutting by S2, nlp server, is indicated to each word with term vector, and term vector is by 3 It is grouped as: the term vector of word2vec training, the other vector sum word position vector of character level, to the embedding of position, character A two-way lstm is run, then the hidden layer of these three is spliced, obtains the vector of a regular length；

S3, the term vector according to obtained in step S2 run two-way gru to each word in sentence, then obtain another vector It indicates；

S4, finally with a full articulamentum lable label is distributed to each word, and is carried out score by softmax layers Normalization, is converted to probability；

S5, it decodes to obtain optimal sequence using Viterbi, and viterbi algorithm is improved, array progress is excellent using rolling Change；

2. a kind of key content extraction algorithm based on Intelligent dialogue according to claim 1, it is characterised in that: the step In one layer of crf of softmax layers of connection in rapid S4.

3. a kind of key content extraction algorithm based on Intelligent dialogue according to claim 1, it is characterised in that: the step In rapid S5, it can also be needed to carry out mark to text according to business, identify harassing call.

4. a kind of key content extraction algorithm based on Intelligent dialogue according to claim 1, it is characterised in that: the machine Device people is internally provided with audio player, can be used in playing back the key content of extraction with audio form, accomplishes machine People's auto-pickup, while record is extracted to key content.