CN106372055B - A kind of semanteme similar processing method and system in man-machine natural language interaction - Google Patents

A kind of semanteme similar processing method and system in man-machine natural language interaction Download PDF

Info

Publication number
CN106372055B
CN106372055B CN201610709517.7A CN201610709517A CN106372055B CN 106372055 B CN106372055 B CN 106372055B CN 201610709517 A CN201610709517 A CN 201610709517A CN 106372055 B CN106372055 B CN 106372055B
Authority
CN
China
Prior art keywords
user
read statement
sentence
search database
preliminary search
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201610709517.7A
Other languages
Chinese (zh)
Other versions
CN106372055A (en
Inventor
彭军辉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Listening Robot Technology Co Ltd
Original Assignee
Beijing Listening Robot Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Listening Robot Technology Co Ltd filed Critical Beijing Listening Robot Technology Co Ltd
Priority to CN201610709517.7A priority Critical patent/CN106372055B/en
Publication of CN106372055A publication Critical patent/CN106372055A/en
Application granted granted Critical
Publication of CN106372055B publication Critical patent/CN106372055B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3344Query execution using natural language analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Machine Translation (AREA)

Abstract

The present invention relates to the similar processing method and system of semanteme in a kind of man-machine natural language interaction, are related to natural language human-machine interactions field.Purpose is to solve the problem of that existing human-computer interaction technology causes human-computer interaction not realize normally there are accuracy rate is low on semantic understanding.This method realizes process are as follows: S1, establishes preliminary search database and receives user's read statement;S2, the sentence in preliminary search database is screened according to the format of user's read statement;S3, the sentence filtered out in preliminary search database and user's read statement are subjected to semantic comparison, and export final result.The format that the present invention passes through user's read statement first carries out preliminary screening to the sentence in database, then the similitude in user's read statement and database between problem sentence is compared by the comparison of Semantic Similarity, so that robot is improved 10% to 25% to the accuracy of semantic understanding, interactive process is made to become more natural, smooth.

Description

A kind of semanteme similar processing method and system in man-machine natural language interaction
Technical field
The present invention relates to natural language human-machine interactions fields.
Background technique
At present in field of human-computer interaction, when comparing the similitude of two words, clause is not handled, is not concerned in sentence Relationship between word and word, or even it is not concerned with function word.Such as input " you and Xiao Ming are more severe than whom " and " Xiao Ming in robot With you than more severe ", robot is the difference of this two word of can not distinguishing one from the other.For some function words, robot also can not be into Row is distinguished, such as the difference between " your What for " and " your What for ".
But in customer service field, in terms of robot question and answer, as long as robot cannot accurately distinguish the meaning of two words, machine Device people cannot accurate understanding user be intended to, customer satisfaction system answer cannot be given.With current technology, present semantic understanding is put down Equal accuracy only has 64%, is also much unable to reach the man-machine purpose normally interacted.
Summary of the invention
Technical problem to be solved by the invention is to provide the similar processing sides of semanteme in a kind of man-machine natural language interaction Method and system, it is therefore intended that solve existing human-computer interaction technology there are accuracys rate on semantic understanding it is low, cause human-computer interaction without The problem of method is normally realized.
The technical scheme to solve the above technical problems is that the semanteme in a kind of man-machine natural language interaction is similar Processing method, it is achieved in the following ways:
S1, it establishes preliminary search database and receives user's read statement;
S2, the sentence in preliminary search database is screened according to the format of user's read statement;
S3, the sentence filtered out in preliminary search database and user's read statement are subjected to semantic comparison, and exported most Terminate fruit.
Further, S2 specific implementation process includes:
Subject, predicate and object in S21, extraction user's input language;
The master of all sentences in S22, the subject by user's input language, predicate and object and preliminary search database Language, predicate and object compare;
S23, the language with subject identical as user's input language, predicate and object is filtered out in preliminary search database Sentence.
Further, the S3 specific implementation process includes:
S31, user's read statement is subjected to phrase fractionation;
S32, by phrases all in user's read statement respectively with wrapped in the sentence that is filtered out in preliminary search database The phrase contained compares;
S33, it is compared between acquisition every two sentence according to the phrase of user's read statement and preliminary search database Semantic similar value, and final result is exported according to the result of semantic similar value.
The acquisition process of the semanteme similar value are as follows: by each sentence pair in user's read statement and preliminary search database Than identical phrase number later divided by phrase number all in user's read statement as semanteme similar value.
The beneficial effects of the present invention are: the present invention pass through first the format of user's read statement to the sentence in database into Then row preliminary screening compares the phase in user's read statement and database between problem sentence by the comparison of Semantic Similarity Like property, optimum is exported makes robot improve 10% to 25% to the accuracy of semantic understanding to user, makes human-computer dialogue Process become more natural, smooth.
A kind of similar processing system of semanteme in man-machine natural language interaction, the system include:
Database module, for establishing preliminary search database and receiving user's read statement;
Sentence screening module, for being sieved according to the format of user's read statement to the sentence in preliminary search database Choosing;
Semantic contrast module, sentence and user's read statement for will filter out in preliminary search database carry out semantic Comparison, and export final result.
Further, the sentence screening module includes:
Sentence extraction module, for extracting subject, predicate and object in user's input language;
Format contrast module, for will be in subject, predicate and the object and preliminary search database in user's input language Subject, predicate and the object of all sentences compare;
The selection result obtains module, has master identical as user's input language for filtering out in preliminary search database The sentence of language, predicate and object.
Further, the semantic contrast module includes:
Phrase splits module, for user's read statement to be carried out phrase fractionation;
Phrase contrast module, for filtering out phrases all in user's read statement with preliminary search database respectively Sentence included in phrase compare;
As a result output module, for obtaining every two according to the comparison of the phrase of user's read statement and preliminary search database Semantic similar value between a sentence, and final result is exported according to the result of semantic similar value.
Further, the acquisition process of the semantic similar value are as follows: will be every in user's read statement and preliminary search database Identical phrase number is semantic similar value divided by phrase number all in user's read statement after a sentence comparison.
Detailed description of the invention
Fig. 1 is the flow chart of the similar processing method of semanteme in man-machine natural language interaction described in the embodiment of the present invention;
Fig. 2 is according to the format of user's read statement described in the embodiment of the present invention to the sentence in preliminary search database The flow chart screened;
Fig. 3 be described in the embodiment of the present invention by the sentence filtered out in preliminary search database and user's read statement into The flow chart of the semantic comparison of row;
Fig. 4 is the principle signal of the similar processing system of semanteme in man-machine natural language interaction described in the embodiment of the present invention Figure;
Fig. 5 is the schematic illustration of sentence screening module 2 described in the embodiment of the present invention;
Fig. 6 is the schematic illustration of semantic contrast module 3 described in the embodiment of the present invention.
In attached drawing, parts list represented by the reference numerals are as follows:
1, Database module, 2, sentence screening module, 3, semantic contrast module, 4, sentence extraction module, 5, format Contrast module, 6, the selection result acquisition module, 7, phrase fractionation module, 8, phrase contrast module, 9, result output module.
Specific embodiment
The principle and features of the present invention will be described below with reference to the accompanying drawings, and the given examples are served only to explain the present invention, and It is non-to be used to limit the scope of the invention.
Embodiment 1
As shown in Figure 1, the present embodiment proposes the similar processing method of semanteme in a kind of man-machine natural language interaction, it is It is accomplished by the following way:
S1, it establishes preliminary search database and receives user's read statement;
S2, the sentence in preliminary search database is screened according to the format of user's read statement;
S3, the sentence filtered out in preliminary search database and user's read statement are subjected to semantic comparison, and exported most Terminate fruit.
In the present embodiment, in the beginning handled user's read statement, first to the format for read statement into Row extracts, and the trunk portion by extracting problem compares so that carrying out the first step deletes choosing;Detailed process is as shown in Figure 2:
Subject, predicate and object in S21, extraction user's input language;
The master of all sentences in S22, the subject by user's input language, predicate and object and preliminary search database Language, predicate and object compare;
S23, the language with subject identical as user's input language, predicate and object is filtered out in preliminary search database Sentence.
After carrying out preliminary screening, can may also there are the Subject, Predicate and Object and user's read statement of many problems in database Subject, Predicate and Object is identical, but in comparison, and sentence corresponding with user's read statement is very in preliminary search database It is few, it is then then non-by the workload for comparing the phrase of the phrase of each sentence filtered out and user's read statement Often small, versus speed is also very fast, and detailed process is as shown in Figure 3:
S31, user's read statement is subjected to phrase fractionation;
S32, by phrases all in user's read statement respectively with wrapped in the sentence that is filtered out in preliminary search database The phrase contained compares;
S33, it is compared between acquisition every two sentence according to the phrase of user's read statement and preliminary search database Semantic similar value, and final result is exported according to the result of semantic similar value.
Wherein, the acquisition process of semantic similar value are as follows: by each sentence in user's read statement and preliminary search database Identical phrase number is semantic similar value divided by phrase number all in user's read statement after comparison.
There are ten words: A1+A2+A3+A4+A5+A6+A7+A8+A9+ A0, and there are five word and user's read statement are complete with an identical sentence with its Subject, Predicate and Object in preliminary search database It is exactly the same: A2+A3+A4+A5+A6, since the clause of the two sentences is identical, then it is assumed that the similitude of the two sentences is 50%, if there are four word is identical, similitude 40%, and so on.According to user's read statement and preliminary search data Sentence carries out semantic comparison in library, filters out the highest sentence of semantic similar value, then the sentence is final output sentence.
Embodiment 2
As shown in figure 4, the present embodiment proposes the similar processing system of semanteme in a kind of man-machine natural language interaction, this is System includes:
Database module 1, for establishing preliminary search database and receiving user's read statement;
Sentence screening module 2, for being carried out according to the format of user's read statement to the sentence in preliminary search database Screening;
Semantic contrast module 3, sentence and user's read statement for will filter out in preliminary search database carry out language Justice comparison, and export final result.
Preferably, as shown in figure 5, the sentence screening module 2 includes:
Sentence extraction module 4, for extracting subject, predicate and object in user's input language;
Format contrast module 5, for will be in subject, predicate and the object and preliminary search database in user's input language Subject, predicate and the object of all sentences compare;
The selection result obtains module 6, for being filtered out in preliminary search database with identical as user's input language The sentence of subject, predicate and object.
Preferably, as shown in fig. 6, the semanteme contrast module 3 includes:
Phrase splits module 7, for user's read statement to be carried out phrase fractionation;
Phrase contrast module 8, for screening phrases all in user's read statement with preliminary search database respectively Phrase included in sentence out compares;
As a result output module 9, it is every for being obtained according to the comparison of the phrase of user's read statement and preliminary search database Semantic similar value between two sentences, and final result is exported according to the result of semantic similar value.
Preferably, the acquisition process of the semantic similar value are as follows: will be every in user's read statement and preliminary search database Identical phrase number is semantic similar value divided by phrase number all in user's read statement after a sentence comparison.
The format that the present embodiment passes through user's read statement first carries out preliminary screening to the sentence in database, then leads to The similitude in the comparison comparison user's read statement and database of Semantic Similarity between problem sentence is crossed, optimum is defeated Out to user, so that robot is improved 10% to 25% to the accuracy of semantic understanding, become interactive process more certainly So, smooth.
The foregoing is merely presently preferred embodiments of the present invention, is not intended to limit the invention, it is all in spirit of the invention and Within principle, any modification, equivalent replacement, improvement and so on be should all be included in the protection scope of the present invention.

Claims (2)

1. the similar processing method of semanteme in a kind of man-machine natural language interaction, which is characterized in that it is real in the following manner Existing:
S1, it establishes preliminary search database and receives user's read statement;
S2, the sentence in preliminary search database is screened according to the format of user's read statement;
S3, the sentence filtered out in preliminary search database and user's read statement are subjected to semantic comparison, and export and most terminates Fruit;
Wherein, the S2 specific implementation process includes:
Subject, predicate and object in S21, extraction user's read statement;
The subject of all sentences, meaning in S22, the subject by user's read statement, predicate and object and preliminary search database Language and object compare;
S23, the sentence with subject identical as user's read statement, predicate and object is filtered out in preliminary search database;
Wherein, the S3 specific implementation process includes:
S31, user's read statement is subjected to phrase fractionation;
S32, by phrases all in user's read statement respectively and included in the sentence that is filtered out in preliminary search database Phrase compares;
S33, the semanteme obtained between every two sentence is compared according to the phrase of user's read statement and preliminary search database Similar value, and final result is exported according to the result of semantic similar value;
The acquisition process of the semanteme similar value are as follows: each sentence in user's read statement and preliminary search database is compared it Identical phrase number is semantic similar value divided by phrase number all in user's read statement afterwards.
2. the similar processing system of semanteme in a kind of man-machine natural language interaction, which is characterized in that it includes:
Database module (1), for establishing preliminary search database and receiving user's read statement;
Sentence screening module (2), for being sieved according to the format of user's read statement to the sentence in preliminary search database Choosing;
Semantic contrast module (3), sentence and user's read statement for will filter out in preliminary search database carry out semantic Comparison, and export final result;
Wherein, the sentence screening module (2) includes:
Sentence extraction module (4), for extracting subject, predicate and object in user's read statement;
Format contrast module (5), for by institute in subject, predicate and the object and preliminary search database in user's read statement There are the subject, predicate and object of sentence to compare;
The selection result obtains module (6), has master identical as user's read statement for filtering out in preliminary search database The sentence of language, predicate and object;
Wherein, the semantic contrast module (3) includes:
Phrase splits module (7), for user's read statement to be carried out phrase fractionation;
Phrase contrast module (8), for filtering out phrases all in user's read statement with preliminary search database respectively Sentence included in phrase compare;
As a result output module (9), for obtaining every two according to the comparison of the phrase of user's read statement and preliminary search database Semantic similar value between a sentence, and final result is exported according to the result of semantic similar value;
The acquisition process of the semanteme similar value are as follows: each sentence in user's read statement and preliminary search database is compared it Identical phrase number is semantic similar value divided by phrase number all in user's read statement afterwards.
CN201610709517.7A 2016-08-23 2016-08-23 A kind of semanteme similar processing method and system in man-machine natural language interaction Active CN106372055B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610709517.7A CN106372055B (en) 2016-08-23 2016-08-23 A kind of semanteme similar processing method and system in man-machine natural language interaction

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610709517.7A CN106372055B (en) 2016-08-23 2016-08-23 A kind of semanteme similar processing method and system in man-machine natural language interaction

Publications (2)

Publication Number Publication Date
CN106372055A CN106372055A (en) 2017-02-01
CN106372055B true CN106372055B (en) 2019-10-29

Family

ID=57879031

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610709517.7A Active CN106372055B (en) 2016-08-23 2016-08-23 A kind of semanteme similar processing method and system in man-machine natural language interaction

Country Status (1)

Country Link
CN (1) CN106372055B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109815484B (en) * 2018-12-21 2022-03-15 平安科技(深圳)有限公司 Semantic similarity matching method and matching device based on cross attention mechanism
CN110019688A (en) * 2019-01-23 2019-07-16 艾肯特公司 The method that robot is trained

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1928864A (en) * 2006-09-22 2007-03-14 浙江大学 FAQ based Chinese natural language ask and answer method
CN101286161A (en) * 2008-05-28 2008-10-15 华中科技大学 Intelligent Chinese request-answering system based on concept

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2008253551A (en) * 2007-04-05 2008-10-23 Toshiba Corp Image reading report search apparatus

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1928864A (en) * 2006-09-22 2007-03-14 浙江大学 FAQ based Chinese natural language ask and answer method
CN101286161A (en) * 2008-05-28 2008-10-15 华中科技大学 Intelligent Chinese request-answering system based on concept

Also Published As

Publication number Publication date
CN106372055A (en) 2017-02-01

Similar Documents

Publication Publication Date Title
CN106528532B (en) Text error correction method, device and terminal
CN107393554B (en) Feature extraction method for fusion inter-class standard deviation in sound scene classification
CN109241266B (en) Method and device for creating extended question based on standard question in man-machine interaction
CN104504150A (en) News public opinion monitoring system
CN106557508A (en) A kind of text key word extracting method and device
CN105893524B (en) A kind of intelligent answer method and device
CN105653620B (en) Log analysis method and device of intelligent question-answering system
CN106777261A (en) Data query method and device based on multi-source heterogeneous data set
KR20210106372A (en) New category tag mining method and device, electronic device and computer-readable medium
CN105469789A (en) Voice information processing method and voice information processing terminal
CN105931637A (en) User-defined instruction recognition speech photographing system
CN104142831B (en) Application program searching method and device
CN106372055B (en) A kind of semanteme similar processing method and system in man-machine natural language interaction
CN109817206A (en) A kind of voice interaction device and method for automatic terminal equipment
CN106550268B (en) Video processing method and video processing device
JP2011257817A (en) Patent specification analyzer and text analyzer
CN104751856A (en) Voice sentence recognizing method and device
US20190213486A1 (en) Virtual Adaptive Learning of Financial Articles Utilizing Artificial Intelligence
CN110532551A (en) Method, equipment and the storage medium that text key word automatically extracts
CN110321557A (en) A kind of file classification method, device, electronic equipment and storage medium
CN109241438A (en) Across channel focus incident discovery method, apparatus and storage medium based on element
JP2011123565A (en) Faq candidate extracting system and faq candidate extracting program
CN106446046B (en) A method of quickly analysis records in time in relational database
Kumala et al. Indonesian speech emotion recognition using cross-corpus method with the combination of MFCC and teager energy features
Ganoun et al. Performance analysis of spoken arabic digits recognition techniques

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant