WO2023035529A1 - Intent recognition-based information intelligent query method and apparatus, device and medium - Google Patents

Intent recognition-based information intelligent query method and apparatus, device and medium Download PDF

Info

Publication number
WO2023035529A1
WO2023035529A1 PCT/CN2022/071789 CN2022071789W WO2023035529A1 WO 2023035529 A1 WO2023035529 A1 WO 2023035529A1 CN 2022071789 W CN2022071789 W CN 2022071789W WO 2023035529 A1 WO2023035529 A1 WO 2023035529A1
Authority
WO
WIPO (PCT)
Prior art keywords
information
query request
target text
query
intent
Prior art date
Application number
PCT/CN2022/071789
Other languages
French (fr)
Chinese (zh)
Inventor
舒畅
陈又新
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2023035529A1 publication Critical patent/WO2023035529A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F16/3329Natural language query formulation or dialogue systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3343Query execution using phonetics
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2413Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on distances to training or reference patterns
    • G06F18/24133Distances to prototypes
    • G06F18/24137Distances to cluster centroïds
    • G06F18/2414Smoothing the distance, e.g. radial basis function networks [RBFN]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2415Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking

Definitions

  • This application relates to the field of data query technology, and belongs to the application scenario of intelligent information query based on intention recognition in smart education, and in particular to an information intelligent query method, device, equipment and medium based on intention recognition.
  • Embodiments of the present application provide an intelligent information query method, device, device, and medium based on intent recognition, aiming to solve the problem of low accuracy of information query in existing methods.
  • the embodiment of the present application provides an intelligent information query method based on intent recognition, which includes:
  • the corresponding target text information is extracted from the query request information
  • the association relationship between the feature words in the target text information is identified, and the feature word association information corresponding to the target text information is obtained;
  • a pre-stored information database is queried according to the intent type and the associated information of the characteristic words, so as to obtain an information query result corresponding to the query request information.
  • the embodiment of the present application provides an intelligent information query device based on intention recognition, which includes:
  • a target text information acquisition unit configured to extract corresponding target text information from the query request information if the input query request information is received;
  • An intent type identification unit configured to analyze the target text information according to a preset text intent analysis model to obtain a corresponding intent type
  • a feature word association information acquisition unit configured to identify the association relationship between the feature words in the target text information according to the preset relationship recognition network and the intent type, and obtain the feature word association information corresponding to the target text information ;
  • An information query result acquisition unit configured to query a prestored information database according to the intent type and the feature word association information, so as to obtain an information query result corresponding to the query request information.
  • the embodiment of the present application further provides a computer device, which includes a memory, a processor, and a computer program stored on the memory and operable on the processor, and the processor executes the computer program.
  • the program implements the intelligent information query method based on intent recognition described in the first aspect above.
  • the embodiment of the present application further provides a computer-readable storage medium, wherein the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, the processor executes the above-mentioned first step.
  • the intelligent query method for information based on intent recognition is provided.
  • Embodiments of the present application provide a method, device, device, and medium for intelligent information query based on intent recognition. Extract the corresponding target text information from the query request information input by the user, analyze the target text information according to the text intent analysis model to obtain the intent type, and identify the association between feature words in the target text information according to the intent type and relationship recognition network According to the relational information of the characteristic words, the information database is queried according to the type of intent and the association information of the characteristic words to obtain the information query results.
  • the accurate intention type can be obtained by analyzing the target text information, and the association relationship between the characteristic words in the target text information can be identified, and the information query result can be obtained based on the association relationship between the intention type and the characteristic words, which can greatly The accuracy of information inquiry is increased, and the efficiency of information inquiry is improved at the same time.
  • FIG. 1 is a schematic flow diagram of an information intelligent query method based on intent recognition provided by an embodiment of the present application
  • Fig. 2 is a schematic sub-flow diagram of an information intelligent query method based on intent recognition provided by an embodiment of the present application
  • FIG. 3 is a schematic diagram of another sub-flow of the intent recognition-based intelligent information query method provided by the embodiment of the present application.
  • FIG. 4 is a schematic diagram of another sub-flow of the intelligent information query method based on intent recognition provided by the embodiment of the present application;
  • FIG. 5 is a schematic diagram of another sub-flow of the intelligent information query method based on intent recognition provided by the embodiment of the present application;
  • FIG. 6 is a schematic diagram of another sub-flow of the intelligent information query method based on intent recognition provided by the embodiment of the present application.
  • FIG. 7 is another schematic flowchart of an intelligent information query method based on intent recognition provided by an embodiment of the present application.
  • FIG. 8 is a schematic block diagram of an information intelligent query device based on intent recognition provided by an embodiment of the present application.
  • Fig. 9 is a schematic block diagram of a computer device provided by an embodiment of the present application.
  • Fig. 1 is a schematic flow chart of an intelligent information query method based on intent recognition provided by an embodiment of the present application; the intelligent query method for information based on intent recognition is applied to a user terminal or a management server, the The intelligent query method is executed through the application software installed in the user terminal or the management server.
  • the user terminal is a terminal device that can receive the query request information input by the user and perform information query to obtain the query results, such as desktop computers, notebook computers, and tablets.
  • the management server is a server that can be used to receive query request information from user terminals, perform information query, obtain information query results, and provide feedback, such as servers built by enterprises or government departments.
  • the method includes steps S110-S140.
  • the query request information can be sent to the management server by the user of the user terminal through the user terminal, or can be input by the user into the user terminal.
  • the query request information can be text, voice or short video, and the corresponding target needs to be obtained from the query request information. Text information, and obtain the user's real information query intention based on the target text information.
  • step S110 includes sub-steps S111 , S112 and S113 .
  • the information type includes a text information type and a voice information type.
  • the query request information includes corresponding format identification information.
  • the format identification information is information used to identify the format of the query request information.
  • the information type of the query request information can be determined by the format identification information of the query request information.
  • the corresponding query request information is text information type; if the format identification information is wav, mp3, wma, avi, flv, rmvb, then the corresponding query request information is voice Information type (including audio information type and video information type).
  • the user of the user terminal enters text in the question box on the terminal page and clicks the confirmation button, then the user terminal receives the text information to obtain query request information, or sends the text to the management server as query request information;
  • the voice input button on the page say your own question and click the confirmation button, then the user terminal receives the voice to get the query request information, or sends the recorded voice as the query request information to the management server;
  • the information type of the query request information is a voice information type
  • the query request information may be of audio information type or video information type, both of which include voice information.
  • the speech recognition model is a model that recognizes and converts speech information included in audio information or video information, wherein the speech recognition model includes a noise judgment rule and a text information acquisition model.
  • the noise judgment rule is the rule for judging whether the speech information contains noise.
  • the text information acquisition model is the model for obtaining the corresponding text information from the speech information. If the speech information contains noise, it will affect the information obtained from the speech information. Therefore, before obtaining the corresponding target text information from the voice information, the noise judgment rule can be used to judge whether the voice information contains noise, so as to ensure that more accurate information can be obtained from the noise-free voice information.
  • Target text information is a model that recognizes and converts speech information included in audio information or video information, wherein the speech recognition model includes a noise judgment rule and a text information acquisition model.
  • the noise judgment rule is the rule for judging whether the speech information contains noise.
  • the text information acquisition model is the model for
  • step S112 includes sub-steps S1121 , S1122 and S1123 .
  • the voice information in the query request information contains noise.
  • the voice information can be sampled based on the frequency of the voiceprint signal in the voice information.
  • the sampling frequency can be 5-50Hz, and the sampling frequency is The number of times the loudness value of the voiceprint signal is obtained from the voice information per second, and the unit of the loudness value is decibel (dB). For example, if the sampling frequency is 10 Hz, the loudness value in the frequency range corresponding to the voiceprint signal is obtained from the voice information 10 times per second.
  • the average value of multiple loudness values in the above-mentioned fixed frequency interval can be obtained from the speech information as the target sound signal strength, and the above-mentioned sampling method can be used to obtain the multiplicity of other sound signals not in the above-mentioned fixed frequency interval from the speech information.
  • the average value of the loudness values is used as the background noise signal strength, and it is judged whether the ratio between the background noise signal strength and the target sound signal strength is greater than the ratio threshold preset in the noise judgment rule, and if the ratio is greater than the ratio threshold value, the query request information is judged
  • the voice information in contains noise; if the ratio is not greater than the ratio threshold, it is judged that the voice information in the query request information does not contain noise.
  • the voice information in the query request information does not contain noise, identify the voice information in the query request information according to the text information acquisition model, so as to obtain target text information corresponding to the query request information.
  • the voice information in the query request information does not contain noise, identify the voice information in the query request information according to the text information acquisition model, so as to obtain target text information corresponding to the query request information. If the speech information in the query request information does not contain noise, the speech information can be recognized according to the text information acquisition model to obtain the corresponding target text information.
  • the text information acquisition model includes an acoustic model, a speech feature dictionary and a semantic analysis dictionary .
  • step S1122 includes a sub-step: segmenting the voice information in the query request information according to the acoustic model in the text information acquisition model to obtain a plurality of phonemes contained in the voice information; Match the phonemes according to the phonetic feature dictionary in the text information acquisition model to convert the phonemes into pinyin information; perform semantic analysis on the pinyin information according to the semantic analysis dictionary in the text information acquisition model to obtain Target text information corresponding to the query request information.
  • the voice information in the query request information is segmented according to the acoustic model in the text information acquisition model to obtain a plurality of phonemes contained in the voice information.
  • the voice information included in the audio information or video information is composed of phonemes that are pronounced by multiple characters, and the phonemes of a character include the frequency and timbre of the character's pronunciation.
  • the acoustic model contains the phonemes of all character pronunciations.
  • the phonemes are matched according to the phonetic feature dictionary in the text information acquisition model to convert the phonemes into pinyin information.
  • the phonetic feature dictionary contains the phoneme information corresponding to all characters’ pinyin. By matching the obtained phoneme with the phoneme information corresponding to the character’s pinyin, the phoneme of a single character can be converted into the character’s pinyin matching the phoneme in the phonetic feature dictionary , so as to convert all the phonemes contained in the speech information into pinyin information.
  • the semantic analysis dictionary contains the mapping relationship between a single pinyin information or multiple pinyin phrases and text information. Through the mapping relationship contained in the semantic analysis dictionary, the obtained pinyin information can be semantically analyzed to convert the pinyin information into corresponding target text message for .
  • the voice information in the query request information contains noise, it will affect the accuracy of the acquired target text information.
  • the re-input prompt information can be fed back to prompt the user to re-enter the query request information in a low-noise environment.
  • the query request information is used as the target text information. If the query request information is of text information type, there is no need to process the query request information, and the query request information can be directly used as target text information for subsequent processing.
  • the target text information is parsed according to a preset text intent parsing model to obtain a corresponding intent type.
  • the text intent parsing model includes word segmentation processing rules and a classification neural network.
  • the target text information can be analyzed through the text intent analysis model to obtain the intent type of the target text information.
  • step S120 includes sub-steps S121 and S122.
  • the word segmentation processing can be performed on the target text information according to the word segmentation processing rules to obtain the corresponding word segmentation results, which include feature words composed of one or more characters and attribute information of each feature word.
  • the word segmentation processing rules include a vector conversion database and a part-of-speech tagging network.
  • step S121 includes sub-steps S1211, S1212 and S1213.
  • the vector conversion database is a data table that stores the relationship between characters and unit feature vectors.
  • the vector conversion database contains a unit feature vector corresponding to each character, and the unit feature vector can be used to quantify the characteristics of characters.
  • a unit feature vector corresponding to each character in the target text information can be obtained from the vector conversion database, and the unit feature vectors corresponding to multiple characters contained in the target text information can be combined to obtain A corresponding text feature vector, that is, converting the target text information into a corresponding text feature vector.
  • the corresponding text feature vectors are [101, 9251, 3151, 7821, 6215, 4152, 3324, 3620, 8512, 4831, 8514, 0635, 102] .
  • “101” is the unit feature vector at the beginning of the sentence
  • "102” is the unit feature vector at the end of the sentence.
  • the part-of-speech tagging information corresponding to each character in the target text information can be obtained by performing part-of-speech tagging on the text feature vector through the part-of-speech tagging network.
  • the part-of-speech tagging network can be obtained based on the combination of BERT (Bidirectional Encoder Representations from Transformers) neural network and CRF (Intro2 Conditional Random Field) neural network.
  • the BERT neural network can perform feature calculation on the text feature vector to obtain the feature array corresponding to the text feature vector. If the text feature vector is a 1 ⁇ H-dimensional vector, the obtained feature array contains L ⁇ H (L rows and H columns) features An array of values, each feature value belongs to the value range [0, 1].
  • the CRF neural network is a serialized labeling algorithm network. Input the feature array into the CRF neural network to get the label value corresponding to the feature sequence of each row in the feature array, except for the label value corresponding to the unit feature vector at the beginning of the sentence and the unit feature vector at the end of the sentence. , the remaining multiple tag values correspond to the characters contained in the target text information, that is, the part-of-speech tag information corresponding to the target text information is obtained.
  • word segmentation can be performed on the characters contained in the target text information, and it is judged whether the label values of two adjacent characters are the same. If they are the same, the adjacent characters are combined; characters for word segmentation.
  • word segmentation processing can be performed on all characters contained in the target text information, and a word segmentation result can be obtained.
  • the label value of "Ming” is “203: noun-time”
  • the label value of "day” is “203: noun-time”
  • the two characters contain "noun-time” at the same time, then the two characters can be combined , and use the common label value as the label value of the phrase obtained after combination.
  • the classification neural network can be composed of an input layer, multiple intermediate layers and an output layer, between the input layer and the first intermediate layer, between the intermediate layer and other adjacent intermediate layers, between the last intermediate layer and the output layer
  • the association is performed through an association formula.
  • the number of input nodes contained in the input layer is not less than the feature value contained in the combined feature, and the output layer contains multiple output nodes, each output node corresponds to a classification type, and the combined feature is input into the classification neural network for calculation.
  • the label value of "Ming” is "203”
  • its corresponding unit feature vector in the text feature vector is "9251”
  • the label value and the unit feature vector are combined as the combined feature corresponding to "Ming”. , obtain the combination feature corresponding to each character in the target text information and input it into the classification neural network to obtain the corresponding intent classification.
  • Word segmentation is performed on the target text information to obtain a word segmentation result, which contains a single character or phrase obtained from the word segmentation, and the single character or phrase is used as a feature word contained in the target text information, and based on the relationship recognition network and the obtained intent type.
  • the feature words are identified for the association relationship, and the feature word association information of whether there is an association relationship between the feature words is obtained.
  • the semantic matrix is constructed based on the feature words contained in the target text information, and the association relationship between the feature words is identified through a fully connected network.
  • the intent type is converted into an intent feature vector, and each of the feature words is The unit feature vector of the character in the text feature vector is combined with the label value of the character to obtain the feature word vector of each feature word, and the feature word vector and intention feature vector corresponding to the two feature words are input into the fully connected network at the same time, and based on The normalization function Softmax(f(a,b)) normalizes the output results of the fully connected network to obtain the normalized value, and judges whether the normalized value is greater than the preset threshold, then the two feature words Whether there is an association relationship between them is identified. If the normalized value is greater than the preset threshold, there is an association between the two feature words; if the normalized value is not greater than the preset threshold, there is no relationship between the two feature words. connection relation.
  • Data query can be performed in the pre-stored information database according to the type of intent and the association relationship of feature words, so as to obtain the information query result corresponding to the query request from the information database.
  • the information database is a structured database for recording various information pre-stored in the management server or the user terminal.
  • the information database may contain information related to course schedules and course fees, as well as information related to course exercises, articles, or knowledge points.
  • step S140 includes sub-steps S141, S142 and S143.
  • a corresponding associated query statement can be generated based on the associated multiple feature words and intent types, and the associated query statement can be an SQL query statement.
  • the intent type is "scheduling courses", and there is an association relationship between the feature words “tomorrow afternoon/N-Time” and "English class/OBJ-Course", then the corresponding associated query statement can be generated based on the above information, and the associated query statement It can be used to reflect: Inquire "English Class/OBJ-Course” of "Tomorrow afternoon/N-Time” in "Class Arrangement Information (Course Schedule Information)”.
  • a separate query statement can be generated based on the feature words and the intent type, and the generated separate query statement can also be an SQL query statement.
  • a corresponding separate query statement can be generated based on the above information, and the separate query statement can be used to reflect: In “Payment information (course fee )" to query "Writing Course/OBJ-Course”.
  • the query statement generated based on the intent type and the associated information of the characteristic words may only include the associated query statement, or only the individual query statement, or both the associated query statement and the individual query statement.
  • the information database can be queried according to the obtained query statements.
  • Each query statement can realize a query of the information database, and each query statement can obtain a query result.
  • the information The database is queried, and the obtained query results are synthesized to obtain the information query results corresponding to the query request information
  • step S150 is further included after step S140 .
  • the corresponding abstract information is obtained based on the information query result. Specifically, the abstract information is obtained by hashing the information query result, for example, by using the sha256s algorithm. Uploading summary information to the blockchain guarantees its security and fairness and transparency to users. The user device can download the summary information from the blockchain to verify whether the information query result has been tampered with.
  • the blockchain referred to in this example is a new application mode of computer technologies such as distributed data storage, point-to-point transmission, consensus mechanism, and encryption algorithm.
  • Blockchain (Blockchain) essentially a decentralized database, is a series of data blocks associated with each other using cryptographic methods. Each data block contains a batch of network transaction information, which is used to verify its Validity of information (anti-counterfeiting) and generation of the next block.
  • the blockchain can include the underlying platform of the blockchain, the platform product service layer, and the application service layer.
  • This application can be applied to smart education and other scenarios where information is intelligently searched based on intent recognition, thereby promoting the construction of smart education.
  • the corresponding target text information is extracted from the query request information input by the user, and the target text information is analyzed according to the text intent analysis model to obtain the intent type, and According to the type of intent and the relationship recognition network, the association relationship between the feature words in the target text information is identified to obtain the associated information of the feature words, and the information database is queried according to the type of intent and the associated information of the feature words to obtain the information query result.
  • the accurate intention type can be obtained by analyzing the target text information, and the association relationship between the characteristic words in the target text information can be identified, and the information query result can be obtained based on the association relationship between the intention type and the characteristic words, which can greatly The accuracy of information inquiry is increased, and the efficiency of information inquiry is improved at the same time.
  • the embodiment of the present application also provides an intelligent information query device based on intention recognition, which is used to implement any embodiment of the aforementioned intelligent information query method based on intention recognition.
  • Fig. 8 is a schematic block diagram of an information intelligent query device based on intention recognition provided by an embodiment of the present application.
  • the intelligent information query device 100 based on intention recognition includes a target text information acquisition unit 110 , an intention type identification unit 120 , a characteristic word related information acquisition unit 130 and an information query result acquisition unit 140 .
  • the target text information acquisition unit 110 is configured to extract corresponding target text information from the query request information if the input query request information is received.
  • the target text information acquisition unit 110 includes subunits: a type judgment unit, a speech information recognition unit, and a target text information determination unit.
  • a type judging unit configured to judge the information type of the query request information.
  • the voice information recognition unit is configured to, if the query request information is not text information, recognize the voice information contained in the query request information according to a pre-stored speech recognition model to obtain target text information corresponding to the query request information.
  • the index data set acquisition unit includes a subunit: a noise judgment unit, configured to judge whether the speech information in the query request information contains noise according to the noise judgment rule; a speech recognition unit, using If the voice information in the query request information does not contain noise, identify the voice information in the query request information according to the text information acquisition model to obtain the target text information corresponding to the query request information; prompt An information feedback unit, configured to feed back re-input prompt information to prompt re-input of the query request information if the voice information in the query request information contains noise.
  • a noise judgment unit configured to judge whether the speech information in the query request information contains noise according to the noise judgment rule
  • a speech recognition unit using If the voice information in the query request information does not contain noise, identify the voice information in the query request information according to the text information acquisition model to obtain the target text information corresponding to the query request information
  • prompt An information feedback unit configured to feed back re-input prompt information to prompt re-input of the query request information if the voice information in the query request information contains noise.
  • the target text information determining unit is configured to determine the query request information as the target text information if the information type of the query request information is a text information type.
  • the intent type identification unit 120 is configured to analyze the target text information according to a preset text intent analysis model to obtain the corresponding intent type.
  • the intent type identification unit 120 includes subunits: a word segmentation processing unit, configured to perform word segmentation processing on the target text information according to the word segmentation processing rules to obtain a word segmentation result; a classification unit, configured to classify the The word segmentation result and the character feature vector are input into the classification neural network to obtain the intent type corresponding to the target text information.
  • the word segmentation processing unit includes subunits: a character feature vector acquisition unit, configured to acquire a corresponding character feature vector from the target text information according to the vector conversion database; a part-of-speech tagging information acquisition unit, used Perform part-of-speech tagging on the text feature vector according to the part-of-speech tagging network to obtain the part-of-speech tagging information corresponding to each character in the target text information; the word segmentation result acquisition unit is used to pair The target text information is subjected to word segmentation processing to obtain corresponding word segmentation results.
  • the feature word association information acquisition unit 130 is used to identify the association relationship between the feature words in the target text information according to the preset relationship recognition network and the intent type, and obtain the feature word association corresponding to the target text information information.
  • the information query result acquisition unit 140 is configured to query a pre-stored information database according to the intent type and the feature word association information, so as to obtain an information query result corresponding to the query request information.
  • the information query result acquisition unit 140 includes a subunit: an associated query statement generation unit, configured to generate a corresponding query statement according to a plurality of associated characteristic words in the characteristic word association information and the intent type. Associated query statement; a separate query statement generation unit, used to generate a separate query statement according to the feature words and the intent type that do not have an associated relationship in the feature word association relationship; an information query unit, used to generate a separate query statement based on the associated query statement and /or the separate query statement is used to query the information database to obtain a corresponding information query result.
  • the intelligent information query device 100 based on intention identification further includes a subunit: an information query result storage unit, configured to upload the information query result to the block chain.
  • the intelligent information query device based on intent recognition applies the above-mentioned intelligent query method based on intent recognition, extracts the corresponding target text information from the query request information input by the user, and analyzes the target text information according to the text intent analysis model.
  • the text information is analyzed to obtain the intent type, and the relationship between the feature words in the target text information is identified according to the intent type and the relationship recognition network to obtain the feature word association information, and the information database is queried according to the intent type and feature word association information to obtain the information query result.
  • the accurate intention type can be obtained by analyzing the target text information, and the association relationship between the characteristic words in the target text information can be identified, and the information query result can be obtained based on the association relationship between the intention type and the characteristic words, which can greatly The accuracy of information inquiry is increased, and the efficiency of information inquiry is improved at the same time.
  • the above-mentioned intelligent information query device based on intention recognition can be implemented in the form of a computer program, and the computer program can be run on the computer device as shown in FIG. 9 .
  • FIG. 9 is a schematic block diagram of a computer device provided by an embodiment of the present application.
  • the computer device may be a user terminal or a management server for executing the intent recognition-based intelligent information query method to perform intelligent information query based on intent recognition.
  • the computer device 500 includes a processor 502 connected through a system bus 501 , a memory and a network interface 505 , wherein the memory may include a storage medium 503 and an internal memory 504 .
  • the storage medium 503 can store an operating system 5031 and a computer program 5032 .
  • the computer program 5032 When executed, it can cause the processor 502 to execute an intelligent information query method based on intent recognition, wherein the storage medium 503 can be a volatile storage medium or a non-volatile storage medium.
  • the processor 502 is used to provide calculation and control capabilities and support the operation of the entire computer device 500 .
  • the internal memory 504 provides an environment for running the computer program 5032 in the storage medium 503.
  • the processor 502 can execute an intelligent information query method based on intent recognition.
  • the network interface 505 is used for network communication, such as providing data transmission and the like.
  • the network interface 505 is used for network communication, such as providing data transmission and the like.
  • FIG. 9 is only a block diagram of a partial structure related to the solution of this application, and does not constitute a limitation on the computer device 500 on which the solution of this application is applied.
  • the specific computer device 500 may include more or fewer components than shown, or combine certain components, or have a different arrangement of components.
  • the processor 502 is configured to run the computer program 5032 stored in the memory, so as to realize the corresponding functions in the above-mentioned intelligent information query method based on intent recognition.
  • the embodiment of the computer device shown in FIG. 9 does not constitute a limitation on the specific composition of the computer device.
  • the computer device may include more or less components than those shown in the illustration. Or combine certain components, or different component arrangements.
  • the computer device may only include a memory and a processor. In such an embodiment, the structures and functions of the memory and the processor are consistent with those of the embodiment shown in FIG. 9 , and will not be repeated here.
  • the processor 502 may be a central processing unit (Central Processing Unit, CPU), and the processor 502 may also be other general-purpose processors, digital signal processors (Digital Signal Processor, DSP), Application Specific Integrated Circuit (ASIC), off-the-shelf programmable gate array (Field-Programmable Gate Array, FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc.
  • the general-purpose processor may be a microprocessor or the processor may be any conventional processor or the like.
  • a computer readable storage medium may be a volatile or non-volatile computer-readable storage medium.
  • the computer-readable storage medium stores a computer program, wherein when the computer program is executed by the processor, the above-mentioned method for intelligent information query based on intention recognition is implemented.
  • the disclosed devices, devices and methods can be implemented in other ways.
  • the device embodiments described above are only illustrative.
  • the division of the units is only logical function division.
  • there may be other division methods, and units with the same function may also be combined into one Units such as multiple units or components may be combined or integrated into another system, or some features may be omitted, or not implemented.
  • the mutual coupling or direct coupling or communication connection shown or discussed may be indirect coupling or communication connection through some interfaces, devices or units, and may also be electrical, mechanical or other forms of connection.
  • the units described as separate components may or may not be physically separated, and the components shown as units may or may not be physical units, that is, they may be located in one place, or may be distributed to multiple network units. Part or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment of the present application.
  • each functional unit in each embodiment of the present application may be integrated into one processing unit, each unit may exist separately physically, or two or more units may be integrated into one unit.
  • the above-mentioned integrated units can be implemented in the form of hardware or in the form of software functional units.
  • the integrated unit is realized in the form of a software function unit and sold or used as an independent product, it can be stored in a computer-readable storage medium.
  • the read storage medium includes several instructions to make a computer device (which may be a personal computer, a management server, or a network device, etc.) execute all or part of the steps of the methods described in the various embodiments of the present application.
  • the aforementioned computer-readable storage medium includes: U disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), magnetic disk or optical disk, and other media that can store program codes.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Databases & Information Systems (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Health & Medical Sciences (AREA)
  • Mathematical Physics (AREA)
  • Evolutionary Computation (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Acoustics & Sound (AREA)
  • Probability & Statistics with Applications (AREA)
  • Human Computer Interaction (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present application discloses an information intelligent query method and apparatus based on intent recognition, a device and a medium. The method comprises: extracting and obtaining corresponding target text information from query request information inputted by a user; parsing the target text information according to a text intent parsing model to obtain an intent type; according to the intent type and a relationship recognition network, recognizing the association between feature words in the target text information to obtain feature word association information; and querying an information database according to the intent type and the feature word association information to obtain an information query result.

Description

基于意图识别的信息智能查询方法、装置、设备及介质Information intelligent query method, device, equipment and medium based on intent recognition
本申请要求于2021年09月10日提交中国专利局、申请号为202111060715.2,发明名称为“基于意图识别的信息智能查询方法、装置、设备及介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of the Chinese patent application with the application number 202111060715.2 submitted to the China Patent Office on September 10, 2021, and the title of the invention is "Intent Recognition-Based Information Intelligent Query Method, Device, Equipment, and Medium", the entire content of which Incorporated in this application by reference.
技术领域technical field
本申请涉及数据查询技术领域,属于智慧教育中基于意图识别进行信息智能化查询的应用场景,尤其涉及一种基于意图识别的信息智能查询方法、装置、设备及介质。This application relates to the field of data query technology, and belongs to the application scenario of intelligent information query based on intention recognition in smart education, and in particular to an information intelligent query method, device, equipment and medium based on intention recognition.
背景技术Background technique
随着互联网技术的快速发展,信息处理的智能化程度也越来越高。企业在答复客户的信息时,通常会基于用户的提问进行语义分析,并根据语义分析确定对应的关键词,基于关键词对数据库中所包含的数据信息进行查询以获取对应的查询结果。然而发明人发现,同一关键词可在多个意图下获取到各不相同的多个查询结果,当查询意图不明确时则会导致基于关键词查询得到的结果不准确,导致难以从多个查询结果中获取与查询意图相对应的准确查询结果进行反馈,极大影响了信息查询的准确性。With the rapid development of Internet technology, the intelligence of information processing is also getting higher and higher. When enterprises reply to customers' information, they usually perform semantic analysis based on the user's questions, and determine the corresponding keywords based on the semantic analysis, and query the data information contained in the database based on the keywords to obtain the corresponding query results. However, the inventors found that the same keyword can obtain multiple different query results under multiple intentions. Accurate query results corresponding to the query intent are obtained from the results for feedback, which greatly affects the accuracy of information query.
发明内容Contents of the invention
本申请实施例提供了一种基于意图识别的信息智能查询方法、装置、设备及介质,旨在解决现有技术方法中所存在的对信息进行查询的准确性较低的问题。Embodiments of the present application provide an intelligent information query method, device, device, and medium based on intent recognition, aiming to solve the problem of low accuracy of information query in existing methods.
第一方面,本申请实施例提供了一种基于意图识别的信息智能查询方法,其包括:In the first aspect, the embodiment of the present application provides an intelligent information query method based on intent recognition, which includes:
若接收到所输入的查询请求信息,从所述查询请求信息中提取对应的目标文字信息;If the input query request information is received, the corresponding target text information is extracted from the query request information;
根据预置的文本意图解析模型对所述目标文本信息进行解析得到对应的意图类型;Analyzing the target text information according to a preset text intent analysis model to obtain a corresponding intent type;
根据预置的关系识别网络及所述意图类型对所述目标文本信息中特征词之间进行关联关系识别,得到与所述目标文本信息对应的特征词关联信息;According to the preset relationship recognition network and the intent type, the association relationship between the feature words in the target text information is identified, and the feature word association information corresponding to the target text information is obtained;
根据所述意图类型及所述特征词关联信息对预存的信息数据库进行查询,以获取与所述查询请求信息对应的信息查询结果。A pre-stored information database is queried according to the intent type and the associated information of the characteristic words, so as to obtain an information query result corresponding to the query request information.
第二方面,本申请实施例提供了一种基于意图识别的信息智能查询装置,其包括:In the second aspect, the embodiment of the present application provides an intelligent information query device based on intention recognition, which includes:
目标文字信息获取单元,用于若接收到所输入的查询请求信息,从所述查询请求信息中提取对应的目标文字信息;A target text information acquisition unit, configured to extract corresponding target text information from the query request information if the input query request information is received;
意图类型识别单元,用于根据预置的文本意图解析模型对所述目标文本信息进行解析得到对应的意图类型;An intent type identification unit, configured to analyze the target text information according to a preset text intent analysis model to obtain a corresponding intent type;
特征词关联信息获取单元,用于根据预置的关系识别网络及所述意图类型对所述目标文本信息中特征词之间进行关联关系识别,得到与所述目标文本信息对应的特征词关联信息;A feature word association information acquisition unit, configured to identify the association relationship between the feature words in the target text information according to the preset relationship recognition network and the intent type, and obtain the feature word association information corresponding to the target text information ;
信息查询结果获取单元,用于根据所述意图类型及所述特征词关联信息对预存的信息数据库进行查询,以获取与所述查询请求信息对应的信息查询结果。An information query result acquisition unit, configured to query a prestored information database according to the intent type and the feature word association information, so as to obtain an information query result corresponding to the query request information.
第三方面,本申请实施例又提供了一种计算机设备,其包括存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机程序,所述处理器执行所述计算机程序时实现上述第一方面所述的基于意图识别的信息智能查询方法。In the third aspect, the embodiment of the present application further provides a computer device, which includes a memory, a processor, and a computer program stored on the memory and operable on the processor, and the processor executes the computer program. The program implements the intelligent information query method based on intent recognition described in the first aspect above.
第四方面,本申请实施例还提供了一种计算机可读存储介质,其中所述计算机可读存储介质存储有计算机程序,所述计算机程序当被处理器执行时使所述处理器执行上述第一方面所述的基于意图识别的信息智能查询方法。In a fourth aspect, the embodiment of the present application further provides a computer-readable storage medium, wherein the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, the processor executes the above-mentioned first step. In one aspect, the intelligent query method for information based on intent recognition.
本申请实施例提供了一种基于意图识别的信息智能查询方法、装置、设备及介质。从用户输入的查询请求信息中提取得到对应的目标文字信息,根据文本意图解析模型对目标文本信息进行解析得到意图类型,并根据意图类型及关系识别网络识别目标文本信息中特征词之间的关联关系得到特征词关联信息,根据意图类型及特征词关联信息查询信息数据库得到信 息查询结果。通过上述方法,可通过对目标文本信息进行解析得到准确的意图类型,并识别目标文本信息中特征词之间进行关联关系,基于意图类型及特征词之间的关联关系得到信息查询结果,可大幅增加进行信息查询的准确性,同时提高进行信息查询的效率。Embodiments of the present application provide a method, device, device, and medium for intelligent information query based on intent recognition. Extract the corresponding target text information from the query request information input by the user, analyze the target text information according to the text intent analysis model to obtain the intent type, and identify the association between feature words in the target text information according to the intent type and relationship recognition network According to the relational information of the characteristic words, the information database is queried according to the type of intent and the association information of the characteristic words to obtain the information query results. Through the above method, the accurate intention type can be obtained by analyzing the target text information, and the association relationship between the characteristic words in the target text information can be identified, and the information query result can be obtained based on the association relationship between the intention type and the characteristic words, which can greatly The accuracy of information inquiry is increased, and the efficiency of information inquiry is improved at the same time.
附图说明Description of drawings
为了更清楚地说明本申请实施例技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to illustrate the technical solutions of the embodiments of the present application more clearly, the drawings that need to be used in the description of the embodiments will be briefly introduced below. Obviously, the drawings in the following description are some embodiments of the present application. Ordinary technicians can also obtain other drawings based on these drawings on the premise of not paying creative work.
图1为本申请实施例提供的基于意图识别的信息智能查询方法的流程示意图;FIG. 1 is a schematic flow diagram of an information intelligent query method based on intent recognition provided by an embodiment of the present application;
图2为本申请实施例提供的基于意图识别的信息智能查询方法的子流程示意图;Fig. 2 is a schematic sub-flow diagram of an information intelligent query method based on intent recognition provided by an embodiment of the present application;
图3为本申请实施例提供的基于意图识别的信息智能查询方法的另一子流程示意图;FIG. 3 is a schematic diagram of another sub-flow of the intent recognition-based intelligent information query method provided by the embodiment of the present application;
图4为本申请实施例提供的基于意图识别的信息智能查询方法的另一子流程示意图;FIG. 4 is a schematic diagram of another sub-flow of the intelligent information query method based on intent recognition provided by the embodiment of the present application;
图5为本申请实施例提供的基于意图识别的信息智能查询方法的另一子流程示意图;FIG. 5 is a schematic diagram of another sub-flow of the intelligent information query method based on intent recognition provided by the embodiment of the present application;
图6为本申请实施例提供的基于意图识别的信息智能查询方法的另一子流程示意图;FIG. 6 is a schematic diagram of another sub-flow of the intelligent information query method based on intent recognition provided by the embodiment of the present application;
图7为本申请实施例提供的基于意图识别的信息智能查询方法的另一流程示意图;FIG. 7 is another schematic flowchart of an intelligent information query method based on intent recognition provided by an embodiment of the present application;
图8为本申请实施例提供的基于意图识别的信息智能查询装置的示意性框图;FIG. 8 is a schematic block diagram of an information intelligent query device based on intent recognition provided by an embodiment of the present application;
图9为本申请实施例提供的计算机设备的示意性框图。Fig. 9 is a schematic block diagram of a computer device provided by an embodiment of the present application.
具体实施方式Detailed ways
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。The following will clearly and completely describe the technical solutions in the embodiments of the present application with reference to the drawings in the embodiments of the present application. Obviously, the described embodiments are part of the embodiments of the present application, not all of them. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the scope of protection of this application.
应当理解,当在本说明书和所附权利要求书中使用时,术语“包括”和“包含”指示所描述特征、整体、步骤、操作、元素和/或组件的存在,但并不排除一个或多个其它特征、整体、步骤、操作、元素、组件和/或其集合的存在或添加。It should be understood that when used in this specification and the appended claims, the terms "comprising" and "comprises" indicate the presence of described features, integers, steps, operations, elements and/or components, but do not exclude one or Presence or addition of multiple other features, integers, steps, operations, elements, components and/or collections thereof.
还应当理解,在此本申请说明书中所使用的术语仅仅是出于描述特定实施例的目的而并不意在限制本申请。如在本申请说明书和所附权利要求书中所使用的那样,除非上下文清楚地指明其它情况,否则单数形式的“一”、“一个”及“该”意在包括复数形式。It should also be understood that the terminology used in the specification of this application is for the purpose of describing particular embodiments only and is not intended to limit the application. As used in this specification and the appended claims, the singular forms "a", "an" and "the" are intended to include plural referents unless the context clearly dictates otherwise.
还应当进一步理解,在本申请说明书和所附权利要求书中使用的术语“和/或”是指相关联列出的项中的一个或多个的任何组合以及所有可能组合,并且包括这些组合。It should also be further understood that the term "and/or" used in the description of the present application and the appended claims refers to any combination and all possible combinations of one or more of the associated listed items, and includes these combinations .
请参阅图1,图1是本申请实施例提供的基于意图识别的信息智能查询方法的流程示意图;该基于意图识别的信息智能查询方法应用于用户终端或管理服务器中,该基于意图识别的信息智能查询方法通过安装于用户终端或管理服务器中的应用软件进行执行,用户终端即是可用于接收用户输入的查询请求信息并进行信息查询获取查询结果的终端设备,如台式电脑、笔记本电脑、平板电脑或手机等,管理服务器即是可用于接收来自用户终端的查询请求信息进行信息查询获取信息查询结果并进行反馈的服务器端,如企业或政府部门所构建的服务器。如图1所示,该方法包括步骤S110~S140。Please refer to Fig. 1, Fig. 1 is a schematic flow chart of an intelligent information query method based on intent recognition provided by an embodiment of the present application; the intelligent query method for information based on intent recognition is applied to a user terminal or a management server, the The intelligent query method is executed through the application software installed in the user terminal or the management server. The user terminal is a terminal device that can receive the query request information input by the user and perform information query to obtain the query results, such as desktop computers, notebook computers, and tablets. For computers or mobile phones, etc., the management server is a server that can be used to receive query request information from user terminals, perform information query, obtain information query results, and provide feedback, such as servers built by enterprises or government departments. As shown in FIG. 1, the method includes steps S110-S140.
S110、若接收到所输入的查询请求信息,从所述查询请求信息中提取对应的目标文字信息。S110. If the input query request information is received, extract corresponding target text information from the query request information.
查询请求信息可以由用户终端的使用者通过用户终端发送至管理服务器,也可以是由使用者输入用户终端,查询请求信息可以是文字、语音或短视频,需从查询请求信息中获取对应的目标文字信息,并基于目标文字信息获取使用者真实的信息查询意图。The query request information can be sent to the management server by the user of the user terminal through the user terminal, or can be input by the user into the user terminal. The query request information can be text, voice or short video, and the corresponding target needs to be obtained from the query request information. Text information, and obtain the user's real information query intention based on the target text information.
在一实施例中,如图2所示,步骤S110包括子步骤S111、S112和S113。In one embodiment, as shown in FIG. 2 , step S110 includes sub-steps S111 , S112 and S113 .
S111、判断所述查询请求信息的信息类型。S111. Determine the information type of the query request information.
其中,所述信息类型包括文字信息类型和语音信息类型。具体的,查询请求信息中包含 对应的格式标识信息,格式标识信息即是用于对查询请求信息的格式进行标识的信息,通过查询请求信息的格式标识信息即可判断查询请求信息的信息类型。Wherein, the information type includes a text information type and a voice information type. Specifically, the query request information includes corresponding format identification information. The format identification information is information used to identify the format of the query request information. The information type of the query request information can be determined by the format identification information of the query request information.
例如,若格式标识信息为txt、xml或string,则对应的查询请求信息为文字信息类型;若格式标识信息为wav、mp3、wma、avi、flv、rmvb,则该对应的查询请求信息为语音信息类型(包括音频信息类型及视频信息类型)。For example, if the format identification information is txt, xml or string, the corresponding query request information is text information type; if the format identification information is wav, mp3, wma, avi, flv, rmvb, then the corresponding query request information is voice Information type (including audio information type and video information type).
例如,用户终端的使用者在终端页面的问题框中输入文字并点击确认按钮,则用户终端接收该文字信息得到查询请求信息,或将该文字作为查询请求信息发送至管理服务器;使用者点击终端页面的语音录入按钮,说出自己的问题并点击确认按钮,则用户终端接收语音得到查询请求信息,或将所录得的语音作为查询请求信息发送至管理服务器;使用者点击终端页面的视频录入按钮,正对用户终端的视频采集设备说出自己的问题并点击确认按钮,则用户终端接收所录入的段视频得到查询请求信息,或将所录得的短视频作为查询请求信息发送至管理服务器。For example, the user of the user terminal enters text in the question box on the terminal page and clicks the confirmation button, then the user terminal receives the text information to obtain query request information, or sends the text to the management server as query request information; The voice input button on the page, say your own question and click the confirmation button, then the user terminal receives the voice to get the query request information, or sends the recorded voice as the query request information to the management server; the user clicks the video input button on the terminal page button, speak your question to the video capture device of the user terminal and click the confirm button, then the user terminal receives the recorded segment of video to get the query request information, or sends the recorded short video as the query request information to the management server .
S112、若所述查询请求信息的信息类型为语音信息类型,则根据预存的语音识别模型对所述查询请求信息包含的语音信息进行识别以得到与所述查询请求信息对应的目标文字信息。S112. If the information type of the query request information is a voice information type, recognize the voice information included in the query request information according to a pre-stored voice recognition model to obtain target text information corresponding to the query request information.
若查询请求信息为语音信息类型,则该查询请求信息可以是音频信息类型或视频信息类型,音频信息或视频信息中均包含语音信息。语音识别模型即是对音频信息或视频信息中所包含的语音信息进行识别及转换的模型,其中,所述语音识别模型包括噪音判断规则及文字信息获取模型。噪音判断规则即为对语音信息中是否包含噪音进行判断的规则,文字信息获取模型即为从语音信息中获取对应文字信息的模型,若语音信息中包含噪音,则会影响从语音信息中获取到的目标文字信息的精确度,因此在从语音信息中获取对应的目标文字信息之前,可通过噪音判断规则对语音信息中是否包含噪音进行判断,以确保从无噪音的语音信息中获取更加准确的目标文字信息。If the query request information is of voice information type, the query request information may be of audio information type or video information type, both of which include voice information. The speech recognition model is a model that recognizes and converts speech information included in audio information or video information, wherein the speech recognition model includes a noise judgment rule and a text information acquisition model. The noise judgment rule is the rule for judging whether the speech information contains noise. The text information acquisition model is the model for obtaining the corresponding text information from the speech information. If the speech information contains noise, it will affect the information obtained from the speech information. Therefore, before obtaining the corresponding target text information from the voice information, the noise judgment rule can be used to judge whether the voice information contains noise, so as to ensure that more accurate information can be obtained from the noise-free voice information. Target text information.
在一实施例中,如图3所示,步骤S112包括子步骤S1121、S1122和S1123。In one embodiment, as shown in FIG. 3 , step S112 includes sub-steps S1121 , S1122 and S1123 .
S1121、根据所述噪音判断规则对所述查询请求信息中的语音信息是否包含噪音进行判断。S1121. Determine whether the voice information in the query request information contains noise according to the noise determination rule.
根据所述噪音判断规则对所述查询请求信息中的语音信息是否包含噪音进行判断。具体的,由于人类说话时发出声音的频率处于一个固定频率区间(85Hz~1100Hz),可基于语音信息中声纹信号的频率对语音信息进行采样,采样频率可以是5-50Hz,采样频率即为每秒钟从语音信息获取声纹信号响度值的次数,响度值的单位为分贝(decibel,dB)。例如,若采样频率为10Hz,则每秒钟从语音信息中获取10次处于声纹信号对应频率区间的响度值。可基于上述采样方法从语音信息中获取处于上述固定频率区间的多个响度值的平均值作为目标声音信号强度,采用上述采样方法从语音信息中获取未处于上述固定频率区间的其他声音信号的多个响度值的平均值作为背景噪声信号强度,判断背景噪声信号强度与目标声音信号强度之间的比值是否大于噪音判断规则中所预设的比例阈值,若比值大于比例阈值则判断该查询请求信息中的语音信息包含噪音;若比值不大于比例阈值则判断该查询请求信息中的语音信息不包含噪音。It is judged according to the noise judgment rule whether the voice information in the query request information contains noise. Specifically, since the frequency of human speech is in a fixed frequency range (85Hz-1100Hz), the voice information can be sampled based on the frequency of the voiceprint signal in the voice information. The sampling frequency can be 5-50Hz, and the sampling frequency is The number of times the loudness value of the voiceprint signal is obtained from the voice information per second, and the unit of the loudness value is decibel (dB). For example, if the sampling frequency is 10 Hz, the loudness value in the frequency range corresponding to the voiceprint signal is obtained from the voice information 10 times per second. Based on the above-mentioned sampling method, the average value of multiple loudness values in the above-mentioned fixed frequency interval can be obtained from the speech information as the target sound signal strength, and the above-mentioned sampling method can be used to obtain the multiplicity of other sound signals not in the above-mentioned fixed frequency interval from the speech information. The average value of the loudness values is used as the background noise signal strength, and it is judged whether the ratio between the background noise signal strength and the target sound signal strength is greater than the ratio threshold preset in the noise judgment rule, and if the ratio is greater than the ratio threshold value, the query request information is judged The voice information in contains noise; if the ratio is not greater than the ratio threshold, it is judged that the voice information in the query request information does not contain noise.
S1122、若所述查询请求信息中的语音信息不包含噪音,根据所述文字信息获取模型对所述查询请求信息中的语音信息进行识别,以得到与所述查询请求信息对应的目标文字信息。S1122. If the voice information in the query request information does not contain noise, identify the voice information in the query request information according to the text information acquisition model, so as to obtain target text information corresponding to the query request information.
若所述查询请求信息中的语音信息不包含噪音,根据所述文字信息获取模型对所述查询请求信息中的语音信息进行识别,以得到与所述查询请求信息对应的目标文字信息。若查询请求信息中的语音信息不包含噪音,则可根据文字信息获取模型对语音信息进行识别以获取对应的目标文字信息,具体的,文字信息获取模型包括声学模型、语音特征词典及语义解析词典。If the voice information in the query request information does not contain noise, identify the voice information in the query request information according to the text information acquisition model, so as to obtain target text information corresponding to the query request information. If the speech information in the query request information does not contain noise, the speech information can be recognized according to the text information acquisition model to obtain the corresponding target text information. Specifically, the text information acquisition model includes an acoustic model, a speech feature dictionary and a semantic analysis dictionary .
在一实施例中,步骤S1122包括子步骤:根据所述文字信息获取模型中的声学模型对所述查询请求信息中的语音信息进行切分以得到所述语音信息中所包含的多个音素;根据所述 文字信息获取模型中的语音特征词典对所述音素进行匹配以将所述音素转换为拼音信息;根据所述文字信息获取模型中的语义解析词典对所述拼音信息进行语义解析以得到与所述查询请求信息对应的目标文字信息。In one embodiment, step S1122 includes a sub-step: segmenting the voice information in the query request information according to the acoustic model in the text information acquisition model to obtain a plurality of phonemes contained in the voice information; Match the phonemes according to the phonetic feature dictionary in the text information acquisition model to convert the phonemes into pinyin information; perform semantic analysis on the pinyin information according to the semantic analysis dictionary in the text information acquisition model to obtain Target text information corresponding to the query request information.
根据所述文字信息获取模型中的声学模型对所述查询请求信息中的语音信息进行切分以得到所述语音信息中所包含的多个音素。具体的,音频信息或视频信息中所包含的语音信息由多个字符发音的音素而组成,一个字符的音素包括该字符发音的频率和音色。声学模型中包含所有字符发音的音素,通过将语音信息与声学模型中所有的音素进行匹配,即可对语音信息中单个字符的音素进行切分,通过切分最终得到查询请求信息的语音信息中所包含的多个音素。The voice information in the query request information is segmented according to the acoustic model in the text information acquisition model to obtain a plurality of phonemes contained in the voice information. Specifically, the voice information included in the audio information or video information is composed of phonemes that are pronounced by multiple characters, and the phonemes of a character include the frequency and timbre of the character's pronunciation. The acoustic model contains the phonemes of all character pronunciations. By matching the voice information with all the phonemes in the acoustic model, the phonemes of a single character in the voice information can be segmented, and finally the voice information of the query request information can be obtained through segmentation. Multiple phonemes included.
根据所述文字信息获取模型中的语音特征词典对所述音素进行匹配以将所述音素转换为拼音信息。语音特征词典中包含所有字符拼音对应的音素信息,通过将所得到的音素与字符拼音对应的音素信息进行匹配,即可将单个字符的音素转换为语音特征词典中与该音素相匹配的字符拼音,以实现将语音信息中所包含的所有音素转换为拼音信息。The phonemes are matched according to the phonetic feature dictionary in the text information acquisition model to convert the phonemes into pinyin information. The phonetic feature dictionary contains the phoneme information corresponding to all characters’ pinyin. By matching the obtained phoneme with the phoneme information corresponding to the character’s pinyin, the phoneme of a single character can be converted into the character’s pinyin matching the phoneme in the phonetic feature dictionary , so as to convert all the phonemes contained in the speech information into pinyin information.
根据所述文字信息获取模型中的语义解析词典对所述拼音信息进行语义解析以得到与所述查询请求信息对应的目标文字信息。语义解析词典中包含单个拼音信息或多个拼音词组与文字信息之间的映射关系,通过语义解析词典中所包含的映射关系即可对所得到的拼音信息进行语义解析以将拼音信息转换为对应的目标文字信息。Perform semantic analysis on the pinyin information according to the semantic analysis dictionary in the text information acquisition model to obtain target text information corresponding to the query request information. The semantic analysis dictionary contains the mapping relationship between a single pinyin information or multiple pinyin phrases and text information. Through the mapping relationship contained in the semantic analysis dictionary, the obtained pinyin information can be semantically analyzed to convert the pinyin information into corresponding target text message for .
S1123、若所述查询请求信息中的语音信息包含噪音,反馈重新输入的提示信息以提示再次输入所述查询请求信息。S1123. If the voice information in the query request information contains noise, feed back re-input prompt information to prompt re-input of the query request information.
若查询请求信息中的语音信息包含噪音,则会影响所获取到的目标文字信息的精确度,此时可反馈重新输入的提示信息,以提示用户在低噪声环境下重新输入查询请求信息。If the voice information in the query request information contains noise, it will affect the accuracy of the acquired target text information. At this time, the re-input prompt information can be fed back to prompt the user to re-enter the query request information in a low-noise environment.
S113、若所述查询请求信息的信息类型为文字信息类型,将所述查询请求信息确定为目标文字信息。S113. If the information type of the query request information is text information, determine the query request information as target text information.
若所述查询请求信息为文字信息类型,将所述查询请求信息作为目标文字信息。若查询请求信息为文字信息类型,则无需对该查询请求信息进行处理,可将查询请求信息直接作为目标文字信息进行后续处理。If the query request information is of text information type, the query request information is used as the target text information. If the query request information is of text information type, there is no need to process the query request information, and the query request information can be directly used as target text information for subsequent processing.
S120、根据预置的文本意图解析模型对所述目标文本信息进行解析得到对应的意图类型。S120. Analyze the target text information according to a preset text intent analysis model to obtain a corresponding intent type.
根据预置的文本意图解析模型对所述目标文本信息进行解析得到对应的意图类型。所述文本意图解析模型包括分词处理规则及分类神经网络。可通过文本意图解析模型对目标文本信息进行解析,得到目标文本信息的意图类型。The target text information is parsed according to a preset text intent parsing model to obtain a corresponding intent type. The text intent parsing model includes word segmentation processing rules and a classification neural network. The target text information can be analyzed through the text intent analysis model to obtain the intent type of the target text information.
在更具体的实施例中,如图4所示,步骤S120包括子步骤S121和S122。In a more specific embodiment, as shown in FIG. 4, step S120 includes sub-steps S121 and S122.
S121、根据所述分词处理规则对所述目标文本信息进行分词处理得到分词结果。S121. Perform word segmentation processing on the target text information according to the word segmentation processing rule to obtain a word segmentation result.
根据所述分词处理规则对所述目标文本信息进行分词处理得到分词结果。可根据分词处理规则对目标文本信息进行分词处理,得到对应的分词结果,分词结果中包含由一个或多个字符组成的特征词,以及每一特征词的属性信息。所述分词处理规则包括向量转换数据库及词性标注网络。Perform word segmentation processing on the target text information according to the word segmentation processing rule to obtain a word segmentation result. The word segmentation processing can be performed on the target text information according to the word segmentation processing rules to obtain the corresponding word segmentation results, which include feature words composed of one or more characters and attribute information of each feature word. The word segmentation processing rules include a vector conversion database and a part-of-speech tagging network.
在更具体的实施例中,如图5所示,步骤S121包括子步骤S1211、S1212和S1213。In a more specific embodiment, as shown in FIG. 5, step S121 includes sub-steps S1211, S1212 and S1213.
S1211、根据所述向量转换数据库从所述目标文本信息中获取对应的文字特征向量。S1211. Obtain a corresponding character feature vector from the target text information according to the vector conversion database.
向量转换数据库即为对字符与单位特征向量之间关联关系进行存储的数据表,向量转换数据库中包含每一字符对应的一个单位特征向量,单位特征向量可用于对字符的特征进行量化表示。根据目标文本信息即可从向量转换数据库中获取该目标文本信息中每一字符对应的一个单位特征向量,将该目标文本信息中包含的多个字符所对应的单位特征向量进行组合,即可得到对应的一个文字特征向量,也即是将目标文本信息转换为对应的文字特征向量。The vector conversion database is a data table that stores the relationship between characters and unit feature vectors. The vector conversion database contains a unit feature vector corresponding to each character, and the unit feature vector can be used to quantify the characteristics of characters. According to the target text information, a unit feature vector corresponding to each character in the target text information can be obtained from the vector conversion database, and the unit feature vectors corresponding to multiple characters contained in the target text information can be combined to obtain A corresponding text feature vector, that is, converting the target text information into a corresponding text feature vector.
例如,目标文本信息为“明天下午是否有英语课?”,对应得到的文字特征向量为[101, 9251,3151,7821,6215,4152,3324,3620,8512,4831,8514,0635,102]。其中“101”为句首单位特征向量,“102”为句尾单位特征向量。For example, if the target text information is "Is there an English class tomorrow afternoon?", the corresponding text feature vectors are [101, 9251, 3151, 7821, 6215, 4152, 3324, 3620, 8512, 4831, 8514, 0635, 102] . Among them, "101" is the unit feature vector at the beginning of the sentence, and "102" is the unit feature vector at the end of the sentence.
S1212、根据所述词性标注网络对所述文字特征向量进行词性标注,得到所述目标文本信息中每一字符对应的词性标注信息。S1212. Perform part-of-speech tagging on the text feature vector according to the part-of-speech tagging network to obtain part-of-speech tagging information corresponding to each character in the target text information.
可通过词性标注网络对文字特征向量进行词性标注,得到目标文本信息中每一字符对应的词性标注信息。具体的,词性标注网络可基于BERT(Bidirectional Encoder Representations from Transformers)神经网络与CRF(Intro2 Conditional Random Field)神经网络进行组合得到。BERT神经网络可将文字特征向量进行特征计算得到文字特征向量对应的特征数组,若文字特征向量为1×H维的向量,则所得到的特征数组包含L×H(L行H列)个特征值的数组,每一特征值均属于[0,1]这一取值范围。CRF神经网络为一个序列化标注算法网络,将特征数组输入CRF神经网络,即可得到特征数组中每一行特征序列对应的标注值,除去句首单位特征向量及句尾单位特征向量对应的标注值,剩余的多个标注值即与目标文本信息中所包含的字符相对应,也即是得到与目标文本信息对应的词性标注信息。The part-of-speech tagging information corresponding to each character in the target text information can be obtained by performing part-of-speech tagging on the text feature vector through the part-of-speech tagging network. Specifically, the part-of-speech tagging network can be obtained based on the combination of BERT (Bidirectional Encoder Representations from Transformers) neural network and CRF (Intro2 Conditional Random Field) neural network. The BERT neural network can perform feature calculation on the text feature vector to obtain the feature array corresponding to the text feature vector. If the text feature vector is a 1×H-dimensional vector, the obtained feature array contains L×H (L rows and H columns) features An array of values, each feature value belongs to the value range [0, 1]. The CRF neural network is a serialized labeling algorithm network. Input the feature array into the CRF neural network to get the label value corresponding to the feature sequence of each row in the feature array, except for the label value corresponding to the unit feature vector at the beginning of the sentence and the unit feature vector at the end of the sentence. , the remaining multiple tag values correspond to the characters contained in the target text information, that is, the part-of-speech tag information corresponding to the target text information is obtained.
S1213、根据每一字符对应的词性标注信息对所述目标文本信息进行分词处理,得到对应的分词结果。S1213. Perform word segmentation processing on the target text information according to the part-of-speech tagging information corresponding to each character, to obtain a corresponding word segmentation result.
根据字符标注信息即可对目标文本信息中包含的字符进行分词处理,判断相邻两个字符的标注值是否相同,若相同则将相邻连个字符进行组合,若不相同则将相邻两个字符进行分词处理。通过上述方法,即可对目标文本信息包含的所有字符进行分词处理,得到分词结果。According to the character label information, word segmentation can be performed on the characters contained in the target text information, and it is judged whether the label values of two adjacent characters are the same. If they are the same, the adjacent characters are combined; characters for word segmentation. Through the above method, word segmentation processing can be performed on all characters contained in the target text information, and a word segmentation result can be obtained.
例如,“明”的标注值“203:名词-时间”,“天”的标注值为“203:名词-时间”,两个字符同时包含“名词-时间”,则可将两个字符进行组合,并将共同的标注值作为组合后所得到的词组的标注值。For example, the label value of "Ming" is "203: noun-time", and the label value of "day" is "203: noun-time", and the two characters contain "noun-time" at the same time, then the two characters can be combined , and use the common label value as the label value of the phrase obtained after combination.
S122、将所述分词结果及所述文字特征向量输入所述分类神经网络,以获取与所述目标文本信息对应的意图类型。S122. Input the word segmentation result and the text feature vector into the classification neural network, so as to obtain the intent type corresponding to the target text information.
将分词结果中每一字符或词组的标注值与文字特征向量进行组合得到组合特征,并输入分类神经网络以获取对应的意图类型。分类神经网络可由一个输入层、多个中间层及一个输出层组成,输入层与首个中间层之间、中间层与前后相邻的其他中间层之间、末尾中间层与输出层之间均通过关联公式进行关联,例如某一关联公式可表示为y=r×x+t,r和t即为该关联公式中的参数值。输入层中包含的输入节点的数量不小于组合特征中包含的特征值相等,输出层中包含多个输出节点,每一输出节点对应一个分类类型,将组合特征输入分类神经网络进行计算,即可从输出层中获取对应的输出结果,输出结果中包含与每一输出节点对应的匹配概率,可获取匹配概率最大的一个输出节点对应的分类类型作为与目标文本信息对应的意图类型。Combine the label value of each character or phrase in the word segmentation result with the text feature vector to obtain the combined feature, and input it into the classification neural network to obtain the corresponding intent type. The classification neural network can be composed of an input layer, multiple intermediate layers and an output layer, between the input layer and the first intermediate layer, between the intermediate layer and other adjacent intermediate layers, between the last intermediate layer and the output layer The association is performed through an association formula. For example, a certain association formula can be expressed as y=r×x+t, and r and t are the parameter values in the association formula. The number of input nodes contained in the input layer is not less than the feature value contained in the combined feature, and the output layer contains multiple output nodes, each output node corresponds to a classification type, and the combined feature is input into the classification neural network for calculation. Obtain the corresponding output result from the output layer, the output result contains the matching probability corresponding to each output node, and the classification type corresponding to an output node with the highest matching probability can be obtained as the intent type corresponding to the target text information.
如,“明”的标注值为“203”,其在文字特征向量中对应的单位特征向量为“9251”,则将该标注值与该单位特征向量进行组合作为与“明”对应的组合特征,获取目标文本信息中每一字符对应的组合特征并输入分类神经网络,即可得到对应的意图分类。For example, the label value of "Ming" is "203", and its corresponding unit feature vector in the text feature vector is "9251", then the label value and the unit feature vector are combined as the combined feature corresponding to "Ming". , obtain the combination feature corresponding to each character in the target text information and input it into the classification neural network to obtain the corresponding intent classification.
S130、根据预置的关系识别网络及所述意图类型对所述目标文本信息中特征词之间进行关联关系识别,得到与所述目标文本信息对应的特征词关联信息。S130. Perform association relationship identification between feature words in the target text information according to the preset relationship identification network and the intent type, and obtain feature word association information corresponding to the target text information.
对目标文本信息进行分词处理后得到分词结果,分词结果中包含分词得到的单个字符或词组,将单个字符或词组作为目标文本信息包含的特征词,并基于关系识别网络及所得到的意图类型对特征词进行关联关系识别,得到特征词之间是否存在关联关系的特征词关联信息。Word segmentation is performed on the target text information to obtain a word segmentation result, which contains a single character or phrase obtained from the word segmentation, and the single character or phrase is used as a feature word contained in the target text information, and based on the relationship recognition network and the obtained intent type. The feature words are identified for the association relationship, and the feature word association information of whether there is an association relationship between the feature words is obtained.
具体的,基于目标文本信息中包含的特征词构建语义矩阵,并通过一个全连接网络对特征词之间的关联关系进行识别,首先将意图类型转换为意图特征向量,并将特征词中每一字符在文字特征向量中的单位特征向量与字符的标注值进行组合,得到每一特征词的特征词向量,将两个特征词对应的特征词向量及意图特征向量同时输入全连接网络,并基于归一化函 数Softmax(f(a,b))对全连接网络的输出结果进行归一化处理,得到归一化数值,判断归一化数值是否大于预置阈值,即可对两个特征词之间是否存在关联关系进行识别,若归一化数值大于预置阈值,则两个特征词之间存在关联关系;若归一化数值不大于预置阈值,则两个特征词之间不存在关联关系。Specifically, the semantic matrix is constructed based on the feature words contained in the target text information, and the association relationship between the feature words is identified through a fully connected network. First, the intent type is converted into an intent feature vector, and each of the feature words is The unit feature vector of the character in the text feature vector is combined with the label value of the character to obtain the feature word vector of each feature word, and the feature word vector and intention feature vector corresponding to the two feature words are input into the fully connected network at the same time, and based on The normalization function Softmax(f(a,b)) normalizes the output results of the fully connected network to obtain the normalized value, and judges whether the normalized value is greater than the preset threshold, then the two feature words Whether there is an association relationship between them is identified. If the normalized value is greater than the preset threshold, there is an association between the two feature words; if the normalized value is not greater than the preset threshold, there is no relationship between the two feature words. connection relation.
例如,基于“明天下午是否有英语课?”中包含的特征词构建得到的语义矩阵如表1所示。For example, the semantic matrix constructed based on the feature words contained in "Is there an English class tomorrow afternoon?" is shown in Table 1.
Figure PCTCN2022071789-appb-000001
Figure PCTCN2022071789-appb-000001
表1Table 1
若“明天下午/N-Time”这一特征词与“英语课/OBJ-Course”这一特征词存在关联关系,则可在语义矩阵中对相应关联关系进行标识。If the characteristic word "Tomorrow afternoon/N-Time" has an association relationship with the characteristic word "English class/OBJ-Course", the corresponding association relationship can be identified in the semantic matrix.
S140、根据所述意图类型及所述特征词关联信息对预存的信息数据库进行查询,以获取与所述查询请求信息对应的信息查询结果。S140. Query a pre-stored information database according to the intent type and the associated information of the characteristic words, so as to obtain an information query result corresponding to the query request information.
可根据意图类型及特征词关联关系,在预存的信息数据库进行数据查询,以从信息数据库中获取与查询请求对应的信息查询结果。其中,信息数据库即为管理服务器或用户终端中预先存储的用于记载各种信息的结构化数据库。例如对于在线教育这一场景,信息数据库中可包含与课程时间安排、课程费用相关的信息,还可包含课程习题、文章或知识点相关的信息。Data query can be performed in the pre-stored information database according to the type of intent and the association relationship of feature words, so as to obtain the information query result corresponding to the query request from the information database. Wherein, the information database is a structured database for recording various information pre-stored in the management server or the user terminal. For example, in the scenario of online education, the information database may contain information related to course schedules and course fees, as well as information related to course exercises, articles, or knowledge points.
在更具体的实施例中,如图6所示,步骤S140包括子步骤S141、S142和S143。In a more specific embodiment, as shown in FIG. 6, step S140 includes sub-steps S141, S142 and S143.
S141、根据所述特征词关联信息中存在关联关系的多个特征词及所述意图类型生成对应的关联查询语句。S141. Generate a corresponding associated query sentence according to the plurality of associated characteristic words in the characteristic word association information and the intent type.
若特征词关联信息中两个特征词或多个特征词之间存在关联关系,可基于存在关联关系的多个特征词及意图类型生成对应的关联查询语句,关联查询语句可以是SQL查询语句。If there is an association relationship between two or more feature words in the feature word association information, a corresponding associated query statement can be generated based on the associated multiple feature words and intent types, and the associated query statement can be an SQL query statement.
例如,意图类型为“排课”、特征词“明天下午/N-Time”与“英语课/OBJ-Course”之间存在关联关系,则可基于上述信息生成对应的关联查询语句,关联查询语句即可用于体现:在“排课信息(课程时间安排信息)”中查询“明天下午/N-Time”的“英语课/OBJ-Course”。For example, if the intent type is "scheduling courses", and there is an association relationship between the feature words "tomorrow afternoon/N-Time" and "English class/OBJ-Course", then the corresponding associated query statement can be generated based on the above information, and the associated query statement It can be used to reflect: Inquire "English Class/OBJ-Course" of "Tomorrow afternoon/N-Time" in "Class Arrangement Information (Course Schedule Information)".
S142、根据所述特征词关联关系中不存在关联关系的特征词及所述意图类型生成单独查询语句。S142. Generate a separate query statement according to the feature words that do not have an association relationship in the feature word association relationship and the intent type.
若关联特征信息中包含不存在关联关系的特征词,则可基于该特征词与意图类型生成单独查询语句,所生成的单独查询语句也可以是SQL查询语句。If the associated feature information contains feature words that do not have an association relationship, a separate query statement can be generated based on the feature words and the intent type, and the generated separate query statement can also be an SQL query statement.
例如,意图类型为“购买-付费”,特征词为“写作课/OBJ-Course”,则可根据上述信息生成对应的单独查询语句,单独查询语句即可用于体现:在“付费信息(课程费用)”中查询“写作课/OBJ-Course”。For example, if the intent type is "purchase-pay" and the feature word is "writing class/OBJ-Course", then a corresponding separate query statement can be generated based on the above information, and the separate query statement can be used to reflect: In "Payment information (course fee )" to query "Writing Course/OBJ-Course".
基于意图类型及特征词关联信息对应生成的查询语句中,可仅包含关联查询语句,也可仅包含单独查询语句,还可同时包含关联查询语句与单独查询语句。The query statement generated based on the intent type and the associated information of the characteristic words may only include the associated query statement, or only the individual query statement, or both the associated query statement and the individual query statement.
S143、根据所述关联查询语句和/或所述单独查询语句,对所述信息数据库进行查询,以获取对应的信息查询结果。S143. Perform a query on the information database according to the associated query statement and/or the individual query statement, so as to obtain a corresponding information query result.
可根据所得到的查询语句对信息数据库进行查询,每一条查询语句可实现对信息数据库 进行一次查询,则每一条查询语句均可得到一个查询结果,基于所生成的一条或多条查询语句对信息数据库进行查询,并将所得到的查询结果进行综合,即可获取得到与查询请求信息对应的信息查询结果The information database can be queried according to the obtained query statements. Each query statement can realize a query of the information database, and each query statement can obtain a query result. Based on the generated one or more query statements, the information The database is queried, and the obtained query results are synthesized to obtain the information query results corresponding to the query request information
在一实施例中,如图7所示,步骤S140之后还包括步骤S150。In one embodiment, as shown in FIG. 7 , step S150 is further included after step S140 .
S150、将所述信息查询结果上传至区块链中。S150. Upload the information query result to the blockchain.
将所述信息查询结果上传至区块链中。基于信息查询结果得到对应的摘要信息,具体来说,摘要信息由信息查询结果进行散列处理得到,比如利用sha256s算法处理得到。将摘要信息上传至区块链可保证其安全性和对用户的公正透明性。用户设备可以从区块链中下载得该摘要信息,以便查证信息查询结果是否被篡改。本示例所指区块链是分布式数据存储、点对点传输、共识机制、加密算法等计算机技术的新型应用模式。区块链(Blockchain),本质上是一个去中心化的数据库,是一串使用密码学方法相关联产生的数据块,每一个数据块中包含了一批次网络交易的信息,用于验证其信息的有效性(防伪)和生成下一个区块。区块链可以包括区块链底层平台、平台产品服务层以及应用服务层等。Upload the information query result to the block chain. The corresponding abstract information is obtained based on the information query result. Specifically, the abstract information is obtained by hashing the information query result, for example, by using the sha256s algorithm. Uploading summary information to the blockchain guarantees its security and fairness and transparency to users. The user device can download the summary information from the blockchain to verify whether the information query result has been tampered with. The blockchain referred to in this example is a new application mode of computer technologies such as distributed data storage, point-to-point transmission, consensus mechanism, and encryption algorithm. Blockchain (Blockchain), essentially a decentralized database, is a series of data blocks associated with each other using cryptographic methods. Each data block contains a batch of network transaction information, which is used to verify its Validity of information (anti-counterfeiting) and generation of the next block. The blockchain can include the underlying platform of the blockchain, the platform product service layer, and the application service layer.
本申请可应用于智慧教育等基于意图识别进行信息智能化查询的场景中,从而推动智慧教育的建设。This application can be applied to smart education and other scenarios where information is intelligently searched based on intent recognition, thereby promoting the construction of smart education.
在本申请实施例所提供的基于意图识别的信息智能查询方法中,从用户输入的查询请求信息中提取得到对应的目标文字信息,根据文本意图解析模型对目标文本信息进行解析得到意图类型,并根据意图类型及关系识别网络识别目标文本信息中特征词之间的关联关系得到特征词关联信息,根据意图类型及特征词关联信息查询信息数据库得到信息查询结果。通过上述方法,可通过对目标文本信息进行解析得到准确的意图类型,并识别目标文本信息中特征词之间进行关联关系,基于意图类型及特征词之间的关联关系得到信息查询结果,可大幅增加进行信息查询的准确性,同时提高进行信息查询的效率。In the intelligent information query method based on intent recognition provided in the embodiment of the present application, the corresponding target text information is extracted from the query request information input by the user, and the target text information is analyzed according to the text intent analysis model to obtain the intent type, and According to the type of intent and the relationship recognition network, the association relationship between the feature words in the target text information is identified to obtain the associated information of the feature words, and the information database is queried according to the type of intent and the associated information of the feature words to obtain the information query result. Through the above method, the accurate intention type can be obtained by analyzing the target text information, and the association relationship between the characteristic words in the target text information can be identified, and the information query result can be obtained based on the association relationship between the intention type and the characteristic words, which can greatly The accuracy of information inquiry is increased, and the efficiency of information inquiry is improved at the same time.
本申请实施例还提供一种基于意图识别的信息智能查询装置,该基于意图识别的信息智能查询装置用于执行前述的基于意图识别的信息智能查询方法的任一实施例,具体地,请参阅图8,图8为本申请实施例提供的基于意图识别的信息智能查询装置的示意性框图。The embodiment of the present application also provides an intelligent information query device based on intention recognition, which is used to implement any embodiment of the aforementioned intelligent information query method based on intention recognition. Specifically, please refer to Fig. 8, Fig. 8 is a schematic block diagram of an information intelligent query device based on intention recognition provided by an embodiment of the present application.
如图8所示,基于意图识别的信息智能查询装置100包括目标文字信息获取单元110、意图类型识别单元120、特征词关联信息获取单元130和信息查询结果获取单元140。As shown in FIG. 8 , the intelligent information query device 100 based on intention recognition includes a target text information acquisition unit 110 , an intention type identification unit 120 , a characteristic word related information acquisition unit 130 and an information query result acquisition unit 140 .
目标文字信息获取单元110,用于若接收到所输入的查询请求信息,从所述查询请求信息中提取对应的目标文字信息。The target text information acquisition unit 110 is configured to extract corresponding target text information from the query request information if the input query request information is received.
在一实施例中,所述目标文字信息获取单元110包括子单元:类型判断单元、语音信息识别单元和目标文字信息确定单元。In an embodiment, the target text information acquisition unit 110 includes subunits: a type judgment unit, a speech information recognition unit, and a target text information determination unit.
类型判断单元,用于判断所述查询请求信息的信息类型。A type judging unit, configured to judge the information type of the query request information.
语音信息识别单元,用于若所述查询请求信息不为文字信息,根据预存的语音识别模型对所述查询请求信息包含的语音信息进行识别以得到与所述查询请求信息对应的目标文字信息。The voice information recognition unit is configured to, if the query request information is not text information, recognize the voice information contained in the query request information according to a pre-stored speech recognition model to obtain target text information corresponding to the query request information.
在一实施例中,所述指标数据集获取单元包括子单元:噪声判断单元,用于根据所述噪音判断规则对所述查询请求信息中的语音信息是否包含噪音进行判断;语音识别单元,用于若所述查询请求信息中的语音信息不包含噪音,根据所述文字信息获取模型对所述查询请求信息中的语音信息进行识别,以得到与所述查询请求信息对应的目标文字信息;提示信息反馈单元,用于若所述查询请求信息中的语音信息包含噪音,反馈重新输入的提示信息以提示再次输入所述查询请求信息。In one embodiment, the index data set acquisition unit includes a subunit: a noise judgment unit, configured to judge whether the speech information in the query request information contains noise according to the noise judgment rule; a speech recognition unit, using If the voice information in the query request information does not contain noise, identify the voice information in the query request information according to the text information acquisition model to obtain the target text information corresponding to the query request information; prompt An information feedback unit, configured to feed back re-input prompt information to prompt re-input of the query request information if the voice information in the query request information contains noise.
目标文字信息确定单元,用于若所述查询请求信息的信息类型为文字信息类型,将所述查询请求信息确定为目标文字信息。The target text information determining unit is configured to determine the query request information as the target text information if the information type of the query request information is a text information type.
意图类型识别单元120,用于根据预置的文本意图解析模型对所述目标文本信息进行解 析得到对应的意图类型。The intent type identification unit 120 is configured to analyze the target text information according to a preset text intent analysis model to obtain the corresponding intent type.
在一实施例中,所述意图类型识别单元120包括子单元:分词处理单元,用于根据所述分词处理规则对所述目标文本信息进行分词处理得到分词结果;分类单元,用于将所述分词结果及所述文字特征向量输入所述分类神经网络,以获取与所述目标文本信息对应的意图类型。In one embodiment, the intent type identification unit 120 includes subunits: a word segmentation processing unit, configured to perform word segmentation processing on the target text information according to the word segmentation processing rules to obtain a word segmentation result; a classification unit, configured to classify the The word segmentation result and the character feature vector are input into the classification neural network to obtain the intent type corresponding to the target text information.
在一实施例中,所述分词处理单元包括子单元:文字特征向量获取单元,用于根据所述向量转换数据库从所述目标文本信息中获取对应的文字特征向量;词性标注信息获取单元,用于根据所述词性标注网络对所述文字特征向量进行词性标注,得到所述目标文本信息中每一字符对应的词性标注信息;分词结果获取单元,用于根据每一字符对应的词性标注信息对所述目标文本信息进行分词处理,得到对应的分词结果。In one embodiment, the word segmentation processing unit includes subunits: a character feature vector acquisition unit, configured to acquire a corresponding character feature vector from the target text information according to the vector conversion database; a part-of-speech tagging information acquisition unit, used Perform part-of-speech tagging on the text feature vector according to the part-of-speech tagging network to obtain the part-of-speech tagging information corresponding to each character in the target text information; the word segmentation result acquisition unit is used to pair The target text information is subjected to word segmentation processing to obtain corresponding word segmentation results.
特征词关联信息获取单元130,用于根据预置的关系识别网络及所述意图类型对所述目标文本信息中特征词之间进行关联关系识别,得到与所述目标文本信息对应的特征词关联信息。The feature word association information acquisition unit 130 is used to identify the association relationship between the feature words in the target text information according to the preset relationship recognition network and the intent type, and obtain the feature word association corresponding to the target text information information.
信息查询结果获取单元140,用于根据所述意图类型及所述特征词关联信息对预存的信息数据库进行查询,以获取与所述查询请求信息对应的信息查询结果。The information query result acquisition unit 140 is configured to query a pre-stored information database according to the intent type and the feature word association information, so as to obtain an information query result corresponding to the query request information.
在一实施例中,所述信息查询结果获取单元140包括子单元:关联查询语句生成单元,用于根据所述特征词关联信息中存在关联关系的多个特征词及所述意图类型生成对应的关联查询语句;单独查询语句生成单元,用于根据所述特征词关联关系中不存在关联关系的特征词及所述意图类型生成单独查询语句;信息查询单元,用于根据所述关联查询语句和/或所述单独查询语句,对所述信息数据库进行查询,以获取对应的信息查询结果。In one embodiment, the information query result acquisition unit 140 includes a subunit: an associated query statement generation unit, configured to generate a corresponding query statement according to a plurality of associated characteristic words in the characteristic word association information and the intent type. Associated query statement; a separate query statement generation unit, used to generate a separate query statement according to the feature words and the intent type that do not have an associated relationship in the feature word association relationship; an information query unit, used to generate a separate query statement based on the associated query statement and /or the separate query statement is used to query the information database to obtain a corresponding information query result.
在一实施例中,所述基于意图识别的信息智能查询装置100还包括子单元:信息查询结果存储单元,用于将所述信息查询结果上传至区块链中。In an embodiment, the intelligent information query device 100 based on intention identification further includes a subunit: an information query result storage unit, configured to upload the information query result to the block chain.
在本申请实施例所提供的基于意图识别的信息智能查询装置应用上述基于意图识别的信息智能查询方法,从用户输入的查询请求信息中提取得到对应的目标文字信息,根据文本意图解析模型对目标文本信息进行解析得到意图类型,并根据意图类型及关系识别网络识别目标文本信息中特征词之间的关联关系得到特征词关联信息,根据意图类型及特征词关联信息查询信息数据库得到信息查询结果。通过上述方法,可通过对目标文本信息进行解析得到准确的意图类型,并识别目标文本信息中特征词之间进行关联关系,基于意图类型及特征词之间的关联关系得到信息查询结果,可大幅增加进行信息查询的准确性,同时提高进行信息查询的效率。The intelligent information query device based on intent recognition provided in the embodiment of the present application applies the above-mentioned intelligent query method based on intent recognition, extracts the corresponding target text information from the query request information input by the user, and analyzes the target text information according to the text intent analysis model. The text information is analyzed to obtain the intent type, and the relationship between the feature words in the target text information is identified according to the intent type and the relationship recognition network to obtain the feature word association information, and the information database is queried according to the intent type and feature word association information to obtain the information query result. Through the above method, the accurate intention type can be obtained by analyzing the target text information, and the association relationship between the characteristic words in the target text information can be identified, and the information query result can be obtained based on the association relationship between the intention type and the characteristic words, which can greatly The accuracy of information inquiry is increased, and the efficiency of information inquiry is improved at the same time.
上述基于意图识别的信息智能查询装置可以实现为计算机程序的形式,该计算机程序可以在如图9所示的计算机设备上运行。The above-mentioned intelligent information query device based on intention recognition can be implemented in the form of a computer program, and the computer program can be run on the computer device as shown in FIG. 9 .
请参阅图9,图9是本申请实施例提供的计算机设备的示意性框图。该计算机设备可以是用于执行基于意图识别的信息智能查询方法以基于意图识别进行信息智能化查询的的用户终端或管理服务器。Please refer to FIG. 9 , which is a schematic block diagram of a computer device provided by an embodiment of the present application. The computer device may be a user terminal or a management server for executing the intent recognition-based intelligent information query method to perform intelligent information query based on intent recognition.
参阅图9,该计算机设备500包括通过系统总线501连接的处理器502、存储器和网络接口505,其中,存储器可以包括存储介质503和内存储器504。Referring to FIG. 9 , the computer device 500 includes a processor 502 connected through a system bus 501 , a memory and a network interface 505 , wherein the memory may include a storage medium 503 and an internal memory 504 .
该存储介质503可存储操作系统5031和计算机程序5032。该计算机程序5032被执行时,可使得处理器502执行基于意图识别的信息智能查询方法,其中,存储介质503可以为易失性的存储介质或非易失性的存储介质。The storage medium 503 can store an operating system 5031 and a computer program 5032 . When the computer program 5032 is executed, it can cause the processor 502 to execute an intelligent information query method based on intent recognition, wherein the storage medium 503 can be a volatile storage medium or a non-volatile storage medium.
该处理器502用于提供计算和控制能力,支撑整个计算机设备500的运行。The processor 502 is used to provide calculation and control capabilities and support the operation of the entire computer device 500 .
该内存储器504为存储介质503中的计算机程序5032的运行提供环境,该计算机程序5032被处理器502执行时,可使得处理器502执行基于意图识别的信息智能查询方法。The internal memory 504 provides an environment for running the computer program 5032 in the storage medium 503. When the computer program 5032 is executed by the processor 502, the processor 502 can execute an intelligent information query method based on intent recognition.
该网络接口505用于进行网络通信,如提供数据信息的传输等。本领域技术人员可以理 解,图9中示出的结构,仅仅是与本申请方案相关的部分结构的框图,并不构成对本申请方案所应用于其上的计算机设备500的限定,具体的计算机设备500可以包括比图中所示更多或更少的部件,或者组合某些部件,或者具有不同的部件布置。The network interface 505 is used for network communication, such as providing data transmission and the like. Those skilled in the art can understand that the structure shown in FIG. 9 is only a block diagram of a partial structure related to the solution of this application, and does not constitute a limitation on the computer device 500 on which the solution of this application is applied. The specific computer device 500 may include more or fewer components than shown, or combine certain components, or have a different arrangement of components.
其中,所述处理器502用于运行存储在存储器中的计算机程序5032,以实现上述的基于意图识别的信息智能查询方法中对应的功能。Wherein, the processor 502 is configured to run the computer program 5032 stored in the memory, so as to realize the corresponding functions in the above-mentioned intelligent information query method based on intent recognition.
本领域技术人员可以理解,图9中示出的计算机设备的实施例并不构成对计算机设备具体构成的限定,在其他实施例中,计算机设备可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置。例如,在一些实施例中,计算机设备可以仅包括存储器及处理器,在这样的实施例中,存储器及处理器的结构及功能与图9所示实施例一致,在此不再赘述。Those skilled in the art can understand that the embodiment of the computer device shown in FIG. 9 does not constitute a limitation on the specific composition of the computer device. In other embodiments, the computer device may include more or less components than those shown in the illustration. Or combine certain components, or different component arrangements. For example, in some embodiments, the computer device may only include a memory and a processor. In such an embodiment, the structures and functions of the memory and the processor are consistent with those of the embodiment shown in FIG. 9 , and will not be repeated here.
应当理解,在本申请实施例中,处理器502可以是中央处理单元(Central Processing Unit,CPU),该处理器502还可以是其他通用处理器、数字信号处理器(Digital Signal Processor,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现成可编程门阵列(Field-Programmable Gate Array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。其中,通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。It should be understood that in the embodiment of the present application, the processor 502 may be a central processing unit (Central Processing Unit, CPU), and the processor 502 may also be other general-purpose processors, digital signal processors (Digital Signal Processor, DSP), Application Specific Integrated Circuit (ASIC), off-the-shelf programmable gate array (Field-Programmable Gate Array, FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc. Wherein, the general-purpose processor may be a microprocessor or the processor may be any conventional processor or the like.
在本申请的另一实施例中提供计算机可读存储介质。该计算机可读存储介质可以为易失性或非易失性的计算机可读存储介质。该计算机可读存储介质存储有计算机程序,其中计算机程序被处理器执行时实现上述的基于意图识别的信息智能查询方法。In another embodiment of the present application a computer readable storage medium is provided. The computer-readable storage medium may be a volatile or non-volatile computer-readable storage medium. The computer-readable storage medium stores a computer program, wherein when the computer program is executed by the processor, the above-mentioned method for intelligent information query based on intention recognition is implemented.
所属领域的技术人员可以清楚地了解到,为了描述的方便和简洁,上述描述的设备、装置和单元的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。本领域普通技术人员可以意识到,结合本文中所公开的实施例描述的各示例的单元及算法步骤,能够以电子硬件、计算机软件或者二者的结合来实现,为了清楚地说明硬件和软件的可互换性,在上述说明中已经按照功能一般性地描述了各示例的组成及步骤。这些功能究竟以硬件还是软件方式来执行取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本申请的范围。Those skilled in the art can clearly understand that, for the convenience and brevity of description, the specific working process of the above-described equipment, devices and units can refer to the corresponding process in the foregoing method embodiments, and details are not repeated here. Those of ordinary skill in the art can realize that the units and algorithm steps of the examples described in conjunction with the embodiments disclosed herein can be implemented by electronic hardware, computer software, or a combination of the two. In order to clearly illustrate the relationship between hardware and software Interchangeability. In the above description, the composition and steps of each example have been generally described according to their functions. Whether these functions are implemented by hardware or software depends on the specific application and design constraints of the technical solution. Those skilled in the art may use different methods to implement the described functions for each specific application, but such implementation should not be regarded as exceeding the scope of the present application.
在本申请所提供的几个实施例中,应该理解到,所揭露的设备、装置和方法,可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,所述单元的划分,仅仅为逻辑功能划分,实际实现时可以有另外的划分方式,也可以将具有相同功能的单元集合成一个单元,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另外,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口、装置或单元的间接耦合或通信连接,也可以是电的,机械的或其它的形式连接。In the several embodiments provided in this application, it should be understood that the disclosed devices, devices and methods can be implemented in other ways. For example, the device embodiments described above are only illustrative. For example, the division of the units is only logical function division. In actual implementation, there may be other division methods, and units with the same function may also be combined into one Units such as multiple units or components may be combined or integrated into another system, or some features may be omitted, or not implemented. In addition, the mutual coupling or direct coupling or communication connection shown or discussed may be indirect coupling or communication connection through some interfaces, devices or units, and may also be electrical, mechanical or other forms of connection.
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本申请实施例方案的目的。The units described as separate components may or may not be physically separated, and the components shown as units may or may not be physical units, that is, they may be located in one place, or may be distributed to multiple network units. Part or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment of the present application.
另外,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以是两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。In addition, each functional unit in each embodiment of the present application may be integrated into one processing unit, each unit may exist separately physically, or two or more units may be integrated into one unit. The above-mentioned integrated units can be implemented in the form of hardware or in the form of software functional units.
所述集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读存储介质中。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分,或者该技术方案的全部或部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个计算机可读存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,管理服务器,或者网络设备等)执行本申请各个实施例所述方法的全部或部分步骤。而前述的计算机可读存储介质包括:U盘、移动硬盘、只读存储器(ROM, Read-Only Memory)、磁碟或者光盘等各种可以存储程序代码的介质。If the integrated unit is realized in the form of a software function unit and sold or used as an independent product, it can be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present application is essentially or the part that contributes to the prior art, or all or part of the technical solution can be embodied in the form of software products, and the computer software products are stored in a computer. The read storage medium includes several instructions to make a computer device (which may be a personal computer, a management server, or a network device, etc.) execute all or part of the steps of the methods described in the various embodiments of the present application. The aforementioned computer-readable storage medium includes: U disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), magnetic disk or optical disk, and other media that can store program codes.
以上所述,仅为本申请的具体实施方式,但本申请的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本申请揭露的技术范围内,可轻易想到各种等效的修改或替换,这些修改或替换都应涵盖在本申请的保护范围之内。因此,本申请的保护范围应以权利要求的保护范围为准。The above is only a specific embodiment of the application, but the scope of protection of the application is not limited thereto. Any person familiar with the technical field can easily think of various equivalents within the scope of the technology disclosed in the application. Modifications or replacements, these modifications or replacements shall be covered within the scope of protection of this application. Therefore, the protection scope of the present application should be based on the protection scope of the claims.

Claims (20)

  1. 一种基于意图识别的信息智能查询方法,包括:A method for intelligent information query based on intent recognition, comprising:
    若接收到所输入的查询请求信息,从所述查询请求信息中提取对应的目标文字信息;If the input query request information is received, the corresponding target text information is extracted from the query request information;
    根据预置的文本意图解析模型对所述目标文本信息进行解析得到对应的意图类型;Analyzing the target text information according to a preset text intent analysis model to obtain a corresponding intent type;
    根据预置的关系识别网络及所述意图类型对所述目标文本信息中特征词之间进行关联关系识别,得到与所述目标文本信息对应的特征词关联信息;According to the preset relationship recognition network and the intent type, the association relationship between the feature words in the target text information is identified, and the feature word association information corresponding to the target text information is obtained;
    根据所述意图类型及所述特征词关联信息对预存的信息数据库进行查询,以获取与所述查询请求信息对应的信息查询结果。A pre-stored information database is queried according to the intent type and the associated information of the characteristic words, so as to obtain an information query result corresponding to the query request information.
  2. 根据权利要求1所述的基于意图识别的信息智能查询方法,其中,所述从所述查询请求信息中提取对应的目标文字信息,包括:The intelligent information query method based on intent recognition according to claim 1, wherein said extracting corresponding target text information from said query request information comprises:
    判断所述查询请求信息的信息类型;judging the information type of the query request information;
    若所述查询请求信息的信息类型为语音信息类型,则根据预存的语音识别模型对所述查询请求信息包含的语音信息进行识别以得到与所述查询请求信息对应的目标文字信息;If the information type of the query request information is a voice information type, then recognize the voice information contained in the query request information according to a pre-stored voice recognition model to obtain target text information corresponding to the query request information;
    若所述查询请求信息的信息类型为文字信息类型,将所述查询请求信息确定为目标文字信息。If the information type of the query request information is a text information type, the query request information is determined as target text information.
  3. 根据权利要求2所述的基于意图识别的信息智能查询方法,其中,所述语音识别模型包括噪音判断规则及文字信息获取模型,所述根据预存的语音识别模型对所述查询请求信息包含的语音信息进行识别以得到与所述查询请求信息对应的目标文字信息,包括:The intelligent information query method based on intention recognition according to claim 2, wherein the voice recognition model includes noise judgment rules and text information acquisition models, and the voice information contained in the query request information is analyzed according to the pre-stored voice recognition model. Information is identified to obtain the target text information corresponding to the query request information, including:
    根据所述噪音判断规则对所述查询请求信息中的语音信息是否包含噪音进行判断;judging whether the voice information in the query request information contains noise according to the noise judging rule;
    若所述查询请求信息中的语音信息不包含噪音,根据所述文字信息获取模型对所述查询请求信息中的语音信息进行识别,以得到与所述查询请求信息对应的目标文字信息;If the voice information in the query request information does not contain noise, identify the voice information in the query request information according to the text information acquisition model, so as to obtain target text information corresponding to the query request information;
    若所述查询请求信息中的语音信息包含噪音,反馈重新输入的提示信息以提示再次输入所述查询请求信息。If the voice information in the query request information contains noise, feedback re-input prompt information to prompt re-input of the query request information.
  4. 根据权利要求1所述的基于意图识别的信息智能查询方法,其中,所述文本意图解析模型包括分词处理规则及分类神经网络,所述根据预置的文本意图解析模型对所述目标文本信息进行解析得到对应的意图类型,包括:The information intelligent query method based on intent recognition according to claim 1, wherein the text intent analysis model includes word segmentation processing rules and a classification neural network, and the target text information is processed according to the preset text intent analysis model Parse to get the corresponding intent type, including:
    根据所述分词处理规则对所述目标文本信息进行分词处理得到分词结果;performing word segmentation processing on the target text information according to the word segmentation processing rules to obtain word segmentation results;
    将所述分词结果及所述文字特征向量输入所述分类神经网络,以获取与所述目标文本信息对应的意图类型。Input the word segmentation result and the character feature vector into the classification neural network to obtain the intent type corresponding to the target text information.
  5. 根据权利要求4所述的基于意图识别的信息智能查询方法,其中,所述分词处理规则包括向量转换数据库及词性标注网络,所述根据所述分词处理规则对所述目标文本信息进行分词处理得到分词结果,包括:The information intelligent query method based on intent recognition according to claim 4, wherein the word segmentation processing rules include a vector conversion database and a part-of-speech tagging network, and the target text information is subjected to word segmentation processing according to the word segmentation processing rules to obtain Word segmentation results, including:
    根据所述向量转换数据库从所述目标文本信息中获取对应的文字特征向量;Obtaining a corresponding character feature vector from the target text information according to the vector conversion database;
    根据所述词性标注网络对所述文字特征向量进行词性标注,得到所述目标文本信息中每一字符对应的词性标注信息;Carry out part-of-speech tagging to the character feature vector according to the part-of-speech tagging network, and obtain the part-of-speech tagging information corresponding to each character in the target text information;
    根据每一字符对应的词性标注信息对所述目标文本信息进行分词处理,得到对应的分词结果。The target text information is subjected to word segmentation processing according to the part-of-speech tagging information corresponding to each character, and a corresponding word segmentation result is obtained.
  6. 根据权利要求1所述的基于意图识别的信息智能查询方法,其中,所述根据所述意图类型及所述特征词关联信息对预存的信息数据库进行查询,以获取与所述查询请求信息对应的信息查询结果,包括:The intelligent information query method based on intent recognition according to claim 1, wherein the pre-stored information database is queried according to the type of intent and the associated information of the feature words to obtain information corresponding to the query request information Information query results, including:
    根据所述特征词关联信息中存在关联关系的多个特征词及所述意图类型生成对应的关联查询语句;Generate a corresponding associated query statement according to a plurality of associated characteristic words in the characteristic word association information and the intent type;
    根据所述特征词关联关系中不存在关联关系的特征词及所述意图类型生成单独查询语句;Generate a separate query statement according to the feature words that do not have an association relationship in the feature word association relationship and the intent type;
    根据所述关联查询语句和/或所述单独查询语句,对所述信息数据库进行查询,以获取对应的信息查询结果。According to the associated query statement and/or the individual query statement, the information database is queried to obtain a corresponding information query result.
  7. 根据权利要求1所述的基于意图识别的信息智能查询方法,其中,还包括:The intelligent information query method based on intention recognition according to claim 1, further comprising:
    将所述信息查询结果上传至区块链中。Upload the information query result to the block chain.
  8. 一种基于意图识别的信息智能查询装置,包括:An information intelligent query device based on intention recognition, comprising:
    目标文字信息获取单元,用于若接收到所输入的查询请求信息,从所述查询请求信息中提取对应的目标文字信息;A target text information acquisition unit, configured to extract corresponding target text information from the query request information if the input query request information is received;
    意图类型识别单元,用于根据预置的文本意图解析模型对所述目标文本信息进行解析得到对应的意图类型;An intent type identification unit, configured to analyze the target text information according to a preset text intent analysis model to obtain a corresponding intent type;
    特征词关联信息获取单元,用于根据预置的关系识别网络及所述意图类型对所述目标文本信息中特征词之间进行关联关系识别,得到与所述目标文本信息对应的特征词关联信息;A feature word association information acquisition unit, configured to identify the association relationship between the feature words in the target text information according to the preset relationship recognition network and the intent type, and obtain the feature word association information corresponding to the target text information ;
    信息查询结果获取单元,用于根据所述意图类型及所述特征词关联信息对预存的信息数据库进行查询,以获取与所述查询请求信息对应的信息查询结果。An information query result acquisition unit, configured to query a prestored information database according to the intent type and the feature word association information, so as to obtain an information query result corresponding to the query request information.
  9. 一种计算机设备,包括存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机程序,其中,所述处理器执行所述计算机程序时实现以下步骤:A computer device, comprising a memory, a processor, and a computer program stored on the memory and operable on the processor, wherein the processor implements the following steps when executing the computer program:
    若接收到所输入的查询请求信息,从所述查询请求信息中提取对应的目标文字信息;If the input query request information is received, the corresponding target text information is extracted from the query request information;
    根据预置的文本意图解析模型对所述目标文本信息进行解析得到对应的意图类型;Analyzing the target text information according to a preset text intent analysis model to obtain a corresponding intent type;
    根据预置的关系识别网络及所述意图类型对所述目标文本信息中特征词之间进行关联关系识别,得到与所述目标文本信息对应的特征词关联信息;According to the preset relationship recognition network and the intent type, the association relationship between the feature words in the target text information is identified, and the feature word association information corresponding to the target text information is obtained;
    根据所述意图类型及所述特征词关联信息对预存的信息数据库进行查询,以获取与所述查询请求信息对应的信息查询结果。A pre-stored information database is queried according to the intent type and the associated information of the characteristic words, so as to obtain an information query result corresponding to the query request information.
  10. 根据权利要求9所述的计算机设备,其中,所述从所述查询请求信息中提取对应的目标文字信息,包括:The computer device according to claim 9, wherein said extracting corresponding target text information from said query request information comprises:
    判断所述查询请求信息的信息类型;judging the information type of the query request information;
    若所述查询请求信息的信息类型为语音信息类型,则根据预存的语音识别模型对所述查询请求信息包含的语音信息进行识别以得到与所述查询请求信息对应的目标文字信息;If the information type of the query request information is a voice information type, then recognize the voice information contained in the query request information according to a pre-stored voice recognition model to obtain target text information corresponding to the query request information;
    若所述查询请求信息的信息类型为文字信息类型,将所述查询请求信息确定为目标文字信息。If the information type of the query request information is a text information type, the query request information is determined as target text information.
  11. 根据权利要求10所述的计算机设备,其中,所述语音识别模型包括噪音判断规则及文字信息获取模型,所述根据预存的语音识别模型对所述查询请求信息包含的语音信息进行识别以得到与所述查询请求信息对应的目标文字信息,包括:The computer device according to claim 10, wherein the speech recognition model includes a noise judgment rule and a text information acquisition model, and the speech information contained in the query request information is recognized according to the pre-stored speech recognition model to obtain a The target text information corresponding to the query request information includes:
    根据所述噪音判断规则对所述查询请求信息中的语音信息是否包含噪音进行判断;judging whether the voice information in the query request information contains noise according to the noise judging rule;
    若所述查询请求信息中的语音信息不包含噪音,根据所述文字信息获取模型对所述查询请求信息中的语音信息进行识别,以得到与所述查询请求信息对应的目标文字信息;If the voice information in the query request information does not contain noise, identify the voice information in the query request information according to the text information acquisition model, so as to obtain target text information corresponding to the query request information;
    若所述查询请求信息中的语音信息包含噪音,反馈重新输入的提示信息以提示再次输入所述查询请求信息。If the voice information in the query request information contains noise, feedback re-input prompt information to prompt re-input of the query request information.
  12. 根据权利要求9所述的计算机设备,其中,所述文本意图解析模型包括分词处理规则及分类神经网络,所述根据预置的文本意图解析模型对所述目标文本信息进行解析得到对应的意图类型,包括:The computer device according to claim 9, wherein the text intent analysis model includes word segmentation processing rules and a classification neural network, and the target text information is analyzed according to the preset text intent analysis model to obtain the corresponding intent type ,include:
    根据所述分词处理规则对所述目标文本信息进行分词处理得到分词结果;performing word segmentation processing on the target text information according to the word segmentation processing rules to obtain word segmentation results;
    将所述分词结果及所述文字特征向量输入所述分类神经网络,以获取与所述目标文本信息对应的意图类型。Input the word segmentation result and the character feature vector into the classification neural network to obtain the intent type corresponding to the target text information.
  13. 根据权利要求12所述的计算机设备,其中,所述分词处理规则包括向量转换数据库及词性标注网络,所述根据所述分词处理规则对所述目标文本信息进行分词处理得到分词结果,包括:The computer device according to claim 12, wherein the word segmentation processing rules include a vector conversion database and a part-of-speech tagging network, and performing word segmentation processing on the target text information according to the word segmentation processing rules to obtain word segmentation results includes:
    根据所述向量转换数据库从所述目标文本信息中获取对应的文字特征向量;Obtaining a corresponding character feature vector from the target text information according to the vector conversion database;
    根据所述词性标注网络对所述文字特征向量进行词性标注,得到所述目标文本信息中每一字符对应的词性标注信息;Carry out part-of-speech tagging to the character feature vector according to the part-of-speech tagging network, and obtain the part-of-speech tagging information corresponding to each character in the target text information;
    根据每一字符对应的词性标注信息对所述目标文本信息进行分词处理,得到对应的分词结果。The target text information is subjected to word segmentation processing according to the part-of-speech tagging information corresponding to each character, and a corresponding word segmentation result is obtained.
  14. 根据权利要求9所述的计算机设备,其中,所述根据所述意图类型及所述特征词关联信息对预存的信息数据库进行查询,以获取与所述查询请求信息对应的信息查询结果,包括:The computer device according to claim 9, wherein the querying of the pre-stored information database according to the type of intent and the associated information of the characteristic words to obtain an information query result corresponding to the query request information includes:
    根据所述特征词关联信息中存在关联关系的多个特征词及所述意图类型生成对应的关联查询语句;Generate a corresponding associated query statement according to a plurality of associated characteristic words in the characteristic word association information and the intent type;
    根据所述特征词关联关系中不存在关联关系的特征词及所述意图类型生成单独查询语句;Generate a separate query statement according to the feature words that do not have an association relationship in the feature word association relationship and the intent type;
    根据所述关联查询语句和/或所述单独查询语句,对所述信息数据库进行查询,以获取对应的信息查询结果。According to the associated query statement and/or the individual query statement, the information database is queried to obtain a corresponding information query result.
  15. 根据权利要求9所述的计算机设备,其中,还包括:The computer device according to claim 9, further comprising:
    将所述信息查询结果上传至区块链中。Upload the information query result to the block chain.
  16. 一种计算机可读存储介质,其中,所述计算机可读存储介质存储有计算机程序,当所述计算机程序被处理器执行时执行以下操作:A computer-readable storage medium, wherein the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, the following operations are performed:
    若接收到所输入的查询请求信息,从所述查询请求信息中提取对应的目标文字信息;If the input query request information is received, the corresponding target text information is extracted from the query request information;
    根据预置的文本意图解析模型对所述目标文本信息进行解析得到对应的意图类型;Analyzing the target text information according to a preset text intent analysis model to obtain a corresponding intent type;
    根据预置的关系识别网络及所述意图类型对所述目标文本信息中特征词之间进行关联关系识别,得到与所述目标文本信息对应的特征词关联信息;According to the preset relationship recognition network and the intent type, the association relationship between the feature words in the target text information is identified, and the feature word association information corresponding to the target text information is obtained;
    根据所述意图类型及所述特征词关联信息对预存的信息数据库进行查询,以获取与所述查询请求信息对应的信息查询结果。A pre-stored information database is queried according to the intent type and the associated information of the characteristic words, so as to obtain an information query result corresponding to the query request information.
  17. 根据权利要求16所述的计算机可读存储介质,其中,所述从所述查询请求信息中提取对应的目标文字信息,包括:The computer-readable storage medium according to claim 16, wherein said extracting corresponding target text information from said query request information comprises:
    判断所述查询请求信息的信息类型;judging the information type of the query request information;
    若所述查询请求信息的信息类型为语音信息类型,则根据预存的语音识别模型对所述查询请求信息包含的语音信息进行识别以得到与所述查询请求信息对应的目标文字信息;If the information type of the query request information is a voice information type, then recognize the voice information contained in the query request information according to a pre-stored voice recognition model to obtain target text information corresponding to the query request information;
    若所述查询请求信息的信息类型为文字信息类型,将所述查询请求信息确定为目标文字信息。If the information type of the query request information is a text information type, the query request information is determined as target text information.
  18. 根据权利要求17所述的计算机可读存储介质,其中,所述语音识别模型包括噪音判断规则及文字信息获取模型,所述根据预存的语音识别模型对所述查询请求信息包含的语音信息进行识别以得到与所述查询请求信息对应的目标文字信息,包括:The computer-readable storage medium according to claim 17, wherein the speech recognition model includes a noise judgment rule and a text information acquisition model, and the speech information contained in the query request information is recognized according to the pre-stored speech recognition model To obtain the target text information corresponding to the query request information, including:
    根据所述噪音判断规则对所述查询请求信息中的语音信息是否包含噪音进行判断;judging whether the voice information in the query request information contains noise according to the noise judging rule;
    若所述查询请求信息中的语音信息不包含噪音,根据所述文字信息获取模型对所述查询请求信息中的语音信息进行识别,以得到与所述查询请求信息对应的目标文字信息;If the voice information in the query request information does not contain noise, identify the voice information in the query request information according to the text information acquisition model, so as to obtain target text information corresponding to the query request information;
    若所述查询请求信息中的语音信息包含噪音,反馈重新输入的提示信息以提示再次输入所述查询请求信息。If the voice information in the query request information contains noise, feedback re-input prompt information to prompt re-input of the query request information.
  19. 根据权利要求16所述的计算机可读存储介质,其中,所述文本意图解析模型包括分词处理规则及分类神经网络,所述根据预置的文本意图解析模型对所述目标文本信息进行解析得到对应的意图类型,包括:The computer-readable storage medium according to claim 16, wherein the text intent analysis model includes word segmentation processing rules and a classification neural network, and the target text information is analyzed according to the preset text intent analysis model to obtain corresponding intent types, including:
    根据所述分词处理规则对所述目标文本信息进行分词处理得到分词结果;performing word segmentation processing on the target text information according to the word segmentation processing rules to obtain word segmentation results;
    将所述分词结果及所述文字特征向量输入所述分类神经网络,以获取与所述目标文本信息对应的意图类型。Input the word segmentation result and the character feature vector into the classification neural network to obtain the intent type corresponding to the target text information.
  20. 根据权利要求19所述的计算机可读存储介质,其中,所述分词处理规则包括向量转 换数据库及词性标注网络,所述根据所述分词处理规则对所述目标文本信息进行分词处理得到分词结果,包括:The computer-readable storage medium according to claim 19, wherein the word segmentation processing rules include a vector conversion database and a part-of-speech tagging network, and performing word segmentation processing on the target text information according to the word segmentation processing rules to obtain word segmentation results, include:
    根据所述向量转换数据库从所述目标文本信息中获取对应的文字特征向量;Obtaining a corresponding character feature vector from the target text information according to the vector conversion database;
    根据所述词性标注网络对所述文字特征向量进行词性标注,得到所述目标文本信息中每一字符对应的词性标注信息;Carry out part-of-speech tagging to the character feature vector according to the part-of-speech tagging network, and obtain the part-of-speech tagging information corresponding to each character in the target text information;
    根据每一字符对应的词性标注信息对所述目标文本信息进行分词处理,得到对应的分词结果。The target text information is subjected to word segmentation processing according to the part-of-speech tagging information corresponding to each character, and a corresponding word segmentation result is obtained.
PCT/CN2022/071789 2021-09-10 2022-01-13 Intent recognition-based information intelligent query method and apparatus, device and medium WO2023035529A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111060715.2 2021-09-10
CN202111060715.2A CN113761206A (en) 2021-09-10 2021-09-10 Intelligent information query method, device, equipment and medium based on intention recognition

Publications (1)

Publication Number Publication Date
WO2023035529A1 true WO2023035529A1 (en) 2023-03-16

Family

ID=78794688

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/071789 WO2023035529A1 (en) 2021-09-10 2022-01-13 Intent recognition-based information intelligent query method and apparatus, device and medium

Country Status (2)

Country Link
CN (1) CN113761206A (en)
WO (1) WO2023035529A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113761206A (en) * 2021-09-10 2021-12-07 平安科技(深圳)有限公司 Intelligent information query method, device, equipment and medium based on intention recognition

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110046231A (en) * 2018-12-21 2019-07-23 阿里巴巴集团控股有限公司 A kind of customer service information processing method, server and system
US20190266182A1 (en) * 2018-02-27 2019-08-29 Micware Co., Ltd. Information retrieval apparatus, information retrieval system, and information retrieval method
CN111026921A (en) * 2019-12-26 2020-04-17 深圳前海环融联易信息科技服务有限公司 Graph-based incidence relation obtaining method and device and computer equipment
CN113076431A (en) * 2021-04-28 2021-07-06 平安科技(深圳)有限公司 Question and answer method and device for machine reading understanding, computer equipment and storage medium
CN113761206A (en) * 2021-09-10 2021-12-07 平安科技(深圳)有限公司 Intelligent information query method, device, equipment and medium based on intention recognition

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109522393A (en) * 2018-10-11 2019-03-26 平安科技(深圳)有限公司 Intelligent answer method, apparatus, computer equipment and storage medium
CN111694938B (en) * 2020-04-27 2024-05-14 平安科技(深圳)有限公司 Emotion recognition-based reply method and device, computer equipment and storage medium
CN112115252B (en) * 2020-08-26 2023-06-02 罗彤 Intelligent auxiliary writing processing method and device, electronic equipment and storage medium
CN112380421A (en) * 2020-11-11 2021-02-19 北京希瑞亚斯科技有限公司 Resume searching method and device, electronic equipment and computer storage medium

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190266182A1 (en) * 2018-02-27 2019-08-29 Micware Co., Ltd. Information retrieval apparatus, information retrieval system, and information retrieval method
CN110046231A (en) * 2018-12-21 2019-07-23 阿里巴巴集团控股有限公司 A kind of customer service information processing method, server and system
CN111026921A (en) * 2019-12-26 2020-04-17 深圳前海环融联易信息科技服务有限公司 Graph-based incidence relation obtaining method and device and computer equipment
CN113076431A (en) * 2021-04-28 2021-07-06 平安科技(深圳)有限公司 Question and answer method and device for machine reading understanding, computer equipment and storage medium
CN113761206A (en) * 2021-09-10 2021-12-07 平安科技(深圳)有限公司 Intelligent information query method, device, equipment and medium based on intention recognition

Also Published As

Publication number Publication date
CN113761206A (en) 2021-12-07

Similar Documents

Publication Publication Date Title
US11615799B2 (en) Automated meeting minutes generator
WO2020253350A1 (en) Network content publication auditing method and apparatus, computer device and storage medium
WO2020244073A1 (en) Speech-based user classification method and device, computer apparatus, and storage medium
WO2020147428A1 (en) Interactive content generation method and apparatus, computer device, and storage medium
Li et al. Spatiotemporal and frequential cascaded attention networks for speech emotion recognition
US8452772B1 (en) Methods, systems, and articles of manufacture for addressing popular topics in a socials sphere
WO2021114841A1 (en) User report generating method and terminal device
US9483582B2 (en) Identification and verification of factual assertions in natural language
CN113836277A (en) Machine learning system for digital assistant
US20130262106A1 (en) Method and system for automatic domain adaptation in speech recognition applications
WO2022252636A1 (en) Artificial intelligence-based answer generation method and apparatus, device, and storage medium
US10224030B1 (en) Dynamic gazetteers for personalized entity recognition
WO2022166218A1 (en) Method for adding punctuation during voice recognition and voice recognition device
Koumpis et al. Automatic summarization of voicemail messages using lexical and prosodic features
WO2023124648A1 (en) Text summary generation method and apparatus, device and storage medium
CN112669842A (en) Man-machine conversation control method, device, computer equipment and storage medium
WO2021063089A1 (en) Rule matching method, rule matching apparatus, storage medium and electronic device
KR101677859B1 (en) Method for generating system response using knowledgy base and apparatus for performing the method
WO2021051564A1 (en) Speech recognition method, apparatus, computing device and storage medium
WO2023035529A1 (en) Intent recognition-based information intelligent query method and apparatus, device and medium
US8401855B2 (en) System and method for generating data for complex statistical modeling for use in dialog systems
US11062700B1 (en) Query answering with controlled access knowledge graph
WO2023272616A1 (en) Text understanding method and system, terminal device, and storage medium
JP5897718B2 (en) Voice search device, computer-readable storage medium, and voice search method
WO2021159756A1 (en) Method for response obligation detection based on multiple modes, and system and apparatus

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22866009

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE