CN110956962A

CN110956962A - Reply information determination method, device and equipment for vehicle-mounted robot

Info

Publication number: CN110956962A
Application number: CN201910990232.9A
Authority: CN
Inventors: 裴丽珊
Original assignee: FAW Group Corp
Current assignee: FAW Group Corp
Priority date: 2019-10-17
Filing date: 2019-10-17
Publication date: 2020-04-03

Abstract

The invention discloses a method, a device and equipment for determining reply information of a vehicle-mounted robot. The method comprises the following steps: performing word segmentation on the received user voice to obtain a voice word segmentation set of the user voice; forming a voice input file of the user voice according to the voice word set and a pre-formed phrase list dictionary; determining target semantic parameters corresponding to each row in the voice input file, and searching the target semantic parameters in a predetermined voice reply template, wherein the voice reply template is determined based on a given sample corpus and comprises at least one standard semantic parameter and a standard reply participle sequence arranged according to a corresponding statement expression rule; and determining reply information corresponding to the user voice according to the search result. According to the technical scheme of the embodiment of the invention, the effect of accurately matching the user voice with the reply information in the voice reply template and improving the accuracy of the determination of the reply information of the vehicle-mounted robot is achieved.

Description

Reply information determination method, device and equipment for vehicle-mounted robot

Technical Field

The embodiment of the invention relates to the technical field of intelligent robots, in particular to a method, a device and equipment for determining reply information of a vehicle-mounted robot.

Background

With the rapid development of intelligent networking in the vehicle manufacturing industry and the internet industry, the artificial intelligence technology has become one of the indispensable subjects in the field of intelligent networking, and research and development of intelligent chat robots in a vehicle-mounted scene has also become a development hotspot in which the industry is imperative.

At present, the word vector construction of the intelligent chat robot based on the seq2seq model mainly adopts a word bag model vector representation method. The vector representation method is a simplified expression model under natural language processing and information retrieval, and is characterized in that a text, a paragraph or a document is regarded as a disordered vocabulary set, grammar and word sequence are omitted, each word in the text is counted, and word vector index content is represented by the occurrence frequency of the words in a list.

However, the vector representation method does not keep the order of words in the original sentence, only takes each word group as individual statistics, and in the practical question answering application, the word vectors constructed by the method cannot be directly converted into the sequence recognizable by the computer, and the reply information of the reducible sentence structure cannot be formed, so that the accuracy of determining the reply information is influenced.

Disclosure of Invention

The invention provides a method, a device and equipment for determining reply information of a vehicle-mounted robot, which improve the accuracy of determining the reply information of the vehicle-mounted robot.

In a first aspect, an embodiment of the present invention provides a reply information determination method for a vehicle-mounted robot, including:

performing word segmentation on the received user voice to obtain a voice word segmentation set of the user voice;

forming a voice input file of the user voice according to the voice word set and a pre-formed phrase list dictionary;

determining target semantic parameters corresponding to each row in the voice input file, and searching the target semantic parameters in a predetermined voice reply template, wherein the voice reply template is determined based on a given sample corpus and comprises at least one standard semantic parameter and a standard reply participle sequence arranged according to a corresponding statement expression rule;

and determining reply information corresponding to the user voice according to the search result.

In a second aspect, an embodiment of the present invention further provides a reply information determination apparatus for a vehicle-mounted robot, including:

the voice acquisition module is used for performing word segmentation processing on the received user voice to obtain a voice word segmentation set of the user voice;

the file generation module is used for forming a voice input file of the user voice according to the voice word set and a pre-formed phrase list dictionary;

a template determination module for determining a voice response template based on a given sample corpus;

the parameter searching module is used for determining target semantic parameters corresponding to each row in the voice input file and searching the target semantic parameters in a predetermined voice reply template;

and the information determining module is used for determining the reply information corresponding to the user voice according to the search result.

In a third aspect, an embodiment of the present invention further provides an apparatus, where the apparatus includes:

one or more processors;

storage means for storing one or more programs;

when the one or more programs are executed by the one or more processors, the one or more processors implement the reply information determination method of the in-vehicle robot as provided in any embodiment of the present invention.

In a fourth aspect, embodiments of the present invention provide a storage medium containing computer-executable instructions that, when executed by a computer processor, are used to perform a reply information determination method for a vehicle-mounted robot as provided in any of the embodiments of the present invention.

The embodiment of the invention obtains the voice word segmentation set of the user voice by word segmentation processing of the received user voice; forming a voice input file of the user voice according to the voice word set and a pre-formed phrase list dictionary; determining target semantic parameters corresponding to each row in the voice input file, and searching the target semantic parameters in a predetermined voice reply template, wherein the voice reply template is determined based on a given sample corpus and comprises at least one standard semantic parameter and a standard reply participle sequence arranged according to a corresponding statement expression rule; and determining reply information corresponding to the user voice according to the search result. And forming a voice input file of the user voice according to the voice word segmentation set and a pre-formed phrase list dictionary, so that the word sequence in the original voice can be reserved, and the inaccuracy of sentence structure reduction is avoided. The target semantic parameters in the voice input file are matched with the standard semantic parameters in the voice reply template, the reply information corresponding to the voice of the user is determined according to the matching result, the problem that the question and the answer are easy to be confused when the voice of the user is matched with the sentence of the reply template is solved, and the accuracy of determining the reply information of the vehicle-mounted robot is improved.

Drawings

Fig. 1 is a flowchart of a reply information determination method of a vehicle-mounted robot according to a first embodiment of the present invention;

fig. 2 is a flowchart of a reply information determination method of a vehicle-mounted robot according to a second embodiment of the present invention;

FIG. 3 is a flowchart of determining a voice response template in a second embodiment of the present invention;

fig. 4 is a schematic structural diagram of a reply information determination device of a vehicle-mounted robot in a third embodiment of the present invention;

fig. 5 is a schematic structural diagram of an apparatus according to a fourth embodiment of the present invention.

Detailed Description

The present invention will be described in further detail with reference to the accompanying drawings and examples. It is to be understood that the specific embodiments described herein are merely illustrative of the invention and are not limiting of the invention. It should be further noted that, for the convenience of description, only some of the structures related to the present invention are shown in the drawings, not all of the structures.

Example one

Fig. 1 is a flowchart of a reply information determination method for a vehicle-mounted robot according to an embodiment of the present invention, where the present embodiment is applicable to a case where the vehicle-mounted robot determines reply information, the method may be executed by a reply information determination device of the vehicle-mounted robot, the reply information determination device of the vehicle-mounted robot may be implemented by software and/or hardware, and the reply information determination device of the vehicle-mounted robot may be configured on a computing device, and specifically includes the following steps:

and 110, performing word segmentation on the received user voice to obtain a voice word segmentation set of the user voice.

The user voice can be understood as voice information collected by a sound collection device, and the voice information can contain one or more sentences, and can be specifically represented by a group of character strings.

The word segmentation process can be understood as Chinese word segmentation, that is, a process of recombining continuous character sequences into word sequences according to a certain specification, and specifically, the word segmentation process can adopt a word segmentation method based on character string matching, a word segmentation method based on understanding, a word segmentation method based on statistics, and the like.

Specifically, the received user voice signal can be subjected to pre-emphasis processing, the high-frequency part of the voice signal is emphasized, the high-frequency resolution is increased, meanwhile, the received user voice signal can be subjected to windowing processing, regular background noise is eliminated, the background noise is converted into a character string mode, the converted user voice is subjected to word segmentation processing, and after the processing, pause word removal and low-frequency word removal processing can be carried out to form a voice word segmentation set of the user voice.

The received user voice is processed by word segmentation, the user voice can be converted into standard voice sequences, the computer can conveniently count and recognize the word sequences, the problem that the received user voice is difficult to recognize is avoided, and the reading efficiency of the subsequent computer on the received user voice is improved.

And step 120, forming a voice input file of the user voice according to the voice word set and a pre-formed phrase list dictionary.

The phrase list dictionary may be understood as that the phrases in the sample corpus are sequentially arranged and numbered according to a preset ordering method according to a preset sample corpus to generate a phrase set with numbers corresponding to the phrases one by one, specifically, the preset ordering method may adopt ordering according to the frequency of occurrence of the phrases in the sample corpus, ordering according to the weight of the phrases in a sample lexicon, ordering according to the sequence of occurrence of the phrases in the sample corpus, and the like.

Specifically, the phrases in the voice segmentation set are assigned according to the numbers corresponding to the phrases in the phrase list dictionary, that is, the voice segmentation set can be represented as a matrix consisting of the numbers, and the matrix is used as the voice input file of the voice of the user.

And forming a voice input file of the user voice according to the voice word segmentation set and a pre-formed phrase list dictionary, so that the word sequence in the original voice can be reserved, and the inaccuracy of sentence structure reduction is avoided.

Step 130, determining target semantic parameters corresponding to each line in the voice input file, and searching the target semantic parameters in a predetermined voice reply template, wherein the voice reply template is determined based on a given sample corpus and comprises at least one standard semantic parameter and a standard reply participle sequence arranged according to a corresponding statement expression rule.

The semantic parameters can be understood as mathematical expressions which comprehensively embody grammatical information, semantic information and pragmatic information in a sentence, and can be specifically represented by a group of ordered numbers.

The voice response template may be understood as a matrix file containing a plurality of semantic parameters and response sentences corresponding to the semantic parameters one by one, wherein the response sentences may be represented by a standard response complex word sequence.

Specifically, firstly, sample question sentences and sample answer sentences in a sample corpus set are extracted, a sample question sentence file, a sample answer sentence file and a sample collation file are constructed, semantic parameters and answer sentences in the sample answer sentence file are matched according to the semantic parameters corresponding to the question sentences in the sample question sentence file, the matching conditions of the semantic parameters and the answer sentences are collated according to the sample collation file, and a voice answer template containing standard semantic parameters and corresponding standard answer participle sequences is generated. And extracting target semantic parameters corresponding to each sentence in the voice input file, and performing matching search on the extracted target semantic parameters and the standard semantic parameters in the voice reply template to obtain a search result.

And step 140, determining reply information corresponding to the user voice according to the search result.

The reply information of the user voice is a standard reply word segmentation sequence or a preset reply sentence which is obtained by matching the semantic parameters in the voice reply template.

Specifically, if the search result shows that the target semantic parameter is matched with the standard semantic parameter in the voice reply template, the question provided by the user is in the range of the voice reply template, and at the moment, the standard reply participle sequence corresponding to the standard semantic parameter is output as the reply information of the voice of the user; otherwise, the question posed by the user is indicated to be beyond the range of the voice response template, and at the moment, a preset response sentence is output as the response information of the voice of the user.

According to the technical scheme of the embodiment, the received user voice is subjected to word segmentation processing to obtain a voice word segmentation set of the user voice; forming a voice input file of the user voice according to the voice word set and a pre-formed phrase list dictionary; determining target semantic parameters corresponding to each row in the voice input file, and searching the target semantic parameters in a predetermined voice reply template, wherein the voice reply template is determined based on a given sample corpus and comprises at least one standard semantic parameter and a standard reply participle sequence arranged according to a corresponding statement expression rule; and determining reply information corresponding to the user voice according to the search result. And forming a voice input file of the user voice according to the voice word segmentation set and a pre-formed phrase list dictionary, so that the word sequence in the original voice can be reserved, and the inaccuracy of sentence structure reduction is avoided. The target semantic parameters in the voice input file are matched with the standard semantic parameters in the voice reply template, the reply information corresponding to the voice of the user is determined according to the matching result, the problem that the question and the answer are easy to be confused when the voice of the user is matched with the sentence of the reply template is solved, and the accuracy of determining the reply information of the vehicle-mounted robot is improved.

Example two

Fig. 2 is a flowchart of a reply information determination method for a vehicle-mounted robot according to a second embodiment of the present invention. The technical scheme of the embodiment is further refined on the basis of the technical scheme, and specifically comprises the following steps:

step 201, performing word segmentation processing on the received user voice to obtain a voice word segmentation set of the user voice.

Step 202, performing word segmentation on a predetermined sample corpus to obtain a corpus word segmentation set.

The sample corpus can be a sentence set conforming to an application environment determined according to an actual application situation, such as a control corpus, a music corpus, a radio corpus, a telephone corpus, a navigation corpus, a video corpus, a charging corpus, a weather corpus, a stock corpus, a hotel corpus, and the like. The sample corpus can be in a question-answer form, i.e. a question sentence and an answer sentence can be distinguished according to the number of lines.

And 203, counting the occurrence frequency of each participle in the corpus participle set.

Specifically, each participle in the corpus participle set can be regarded as a phrase form, and the occurrence frequency of each phrase in the whole corpus participle set can be calculated through a traversal method and is used as the occurrence frequency of each participle.

And 204, arranging corresponding corpus participles according to the occurrence frequency from large to small, extracting the corpus participles in the preset number and corresponding arrangement serial numbers to form a phrase list dictionary.

Specifically, the words in the corpus word set are arranged in a descending order according to the occurrence frequency of the words, if the occurrence frequency of the two words is the same, the words are arranged in a first-to-last order, and after the arrangement is completed, a preset number of corpus words and the corresponding arrangement sequence number are selected to generate a word group list dictionary.

Optionally, the set number may be any positive integer value not less than 95% of the number of the corpus participles in the corpus participle set.

And step 205, initializing and constructing a voice input matrix.

The voice word segmentation set comprises at least one line of word segmentation processing first word segmentation sequences formed after the user voice is processed.

The first word segmentation sequence can be understood as user voice after word segmentation processing of one sentence in the voice word segmentation set, and the user voice after word segmentation processing is sequentially stored in each line of the voice word segmentation set by taking the sentence as a unit.

The number of rows of the voice input matrix is equal to the number of rows of a first word segmentation sequence contained in the voice word segmentation set, and the number of columns of the voice input matrix is equal to the number of word segmentation of a first word segmentation contained in the longest first word segmentation sequence in the voice word segmentation set.

Step 206, aiming at each first participle in the voice participle set, if the first participle exists in the phrase list dictionary, taking the arrangement serial number corresponding to the first participle as a target coding value; otherwise, the preset coding value is used as the target coding value of the first word segmentation.

The preset code value can adopt zero value, pad character or other fixed character set according to user habit.

Specifically, each first word in the voice word set is assigned, a phrase list dictionary is traversed, and if the first word exists in the phrase list dictionary, the corresponding arrangement serial number of the first word in the phrase list dictionary is used as a target coding value; if the first segmentation word does not exist in the phrase list dictionary, the preset coding value is used as the target coding value of the first segmentation word.

Step 207, taking the word segmentation sequence as a row unit, correspondingly writing the target coding value of the first segmentation contained in each word segmentation sequence into the voice input matrix, and filling the empty elements in the voice input matrix with the preset coding values.

And step 208, taking the assigned voice input matrix as a voice input file of the user voice.

Step 209, determining target semantic parameters corresponding to each line in the voice input file, and searching the target semantic parameters in a predetermined voice reply template, wherein the voice reply template is determined based on a given sample corpus and comprises at least one standard semantic parameter and a standard reply participle sequence arranged according to a corresponding sentence expression rule.

The target semantic parameters can be understood as mathematical expressions comprehensively embodied by grammar information, semantic information and pragmatic information in sentences corresponding to each line in the voice input file.

Further, FIG. 3 provides a flow chart for determining a voice response template. As shown in fig. 3, the sample corpus includes at least one group of sample question-and-answer sentences, each group of sample question-and-answer sentences includes a row of sample question-and-answer sentences and a row of associated sample answer-and-answer sentences; accordingly, the process of determining the voice response template includes the steps of:

step 2091, extracting sample question sentences and sample answer sentences in the sample corpus to form a sample question sentence set and a sample answer sentence set.

Step 2092, forming a sample question file according to the sample question set, and forming a sample answer file and a sample proofreading file according to the sample answer set integrated statement expression rule.

Specifically, forming a sample question file according to the sample question set includes:

and performing word segmentation on the sample question set to obtain a question word set comprising at least one line of second word segmentation sequence.

And initializing to construct a sample question matrix, wherein the row number of the sample question matrix is equal to the row number of a second participle sequence contained in the question participle set, and the column number of the sample question matrix is equal to the participle number of a second participle contained in the longest second participle sequence in the question participle set.

And aiming at each second participle in the sample question sentence set, taking the corresponding arrangement serial number of the second participle in the phrase list dictionary as a target coding value of the second participle.

And correspondingly writing target coding values of second participles contained in each participle sequence into the sample question matrix by taking the participle sequence as a row unit, and filling empty elements in the sample question matrix with preset coding values.

And taking the sample question matrix after assignment as the sample question file.

Specifically, forming a sample sentence answering file and a sample proofreading file according to the sample sentence answering set and combination sentence expression rule includes:

and performing word segmentation on the sample answer sentence set to obtain an answer sentence word set containing at least one line of third word sequences.

And aiming at each third participle in the sample answer sentence set, taking the corresponding arrangement serial number of the third participle in the phrase list dictionary as a target coding value of the third participle.

And initializing to construct a sample answer sentence matrix, wherein the row number of the sample answer sentence matrix is equal to the row number of a third participle sequence contained in the answer sentence participle set, and the column number of the sample answer sentence matrix is equal to the participle number of a third participle contained in the longest third participle sequence in the answer sentence participle set.

And correspondingly writing target coding values of third participles contained in each participle sequence into the sample answer sentence matrix by taking the participle sequence as a row unit, and filling null elements in the sample answer sentence matrix by adopting preset coding values.

And initializing to construct a sample proofreading matrix, wherein the row number of the sample proofreading matrix is equal to the row number of a third participle sequence contained in the answer sentence participle set, and the column number of the sample proofreading matrix is equal to the participle number of a third participle contained in the longest third participle sequence in the answer sentence participle set.

And correspondingly writing target coding values of third participles contained in each participle sequence into the sample proofreading matrix by taking the participle sequence as a row unit, and filling empty elements in the sample proofreading matrix with preset coding values.

And taking the sample sentence answering matrix after assignment as the sample sentence answering file, and taking the sample proofreading matrix after assignment as the sample proofreading file.

The written sample sentence-answering matrix takes a stop symbol as an end identifier, the written sample proofreading matrix takes a start symbol as an initial identifier, and the stop symbol as an end identifier. Alternatively, the initiator may be go and the rest may be eos.

Step 2093, determining standard semantic parameters corresponding to sample participles of each line of the sample question document.

And 2094, combining the standard semantic parameters with the sample answer sentence file and the sample proofreading file to form a voice answer template containing the standard semantic parameters and corresponding standard answer segmentation sequences.

Specifically, the standard semantic parameters are matched with the sample answer file, and an initial voice answer template corresponding to each participle sequence in the answer file is generated according to each standard semantic parameter.

Matching each participle sequence in the initial voice reply template with the sample proofreading file one by one, replacing participle sequences with different matching results with corresponding participle sequences in the sample proofreading file, taking the replaced participle sequences as the standard reply participle sequences, and generating the voice reply template with the standard semantic parameters corresponding to the standard reply participle sequences.

And step 210, determining reply information corresponding to the user voice according to the search result.

Specifically, if the target semantic parameter is the standard semantic parameter in the voice reply template, the standard reply participle sequence corresponding to the target semantic parameter in the voice reply template is used as the reply information of the user voice; otherwise, the preset reply sentence is used as the reply information of the user voice.

According to the technical scheme of the embodiment, the answer information corresponding to the voice of the user is determined according to the matching result by utilizing the matching of the semantic parameters and the target semantic parameters in the voice answer template containing the semantic parameters and the answer participle sequence, so that the confusion probability of similar semantic question-answer sentences is reduced, and the accuracy of answer information determination is improved.

EXAMPLE III

Fig. 4 is a schematic structural diagram of a reply information determination apparatus for a vehicle-mounted robot according to a third embodiment of the present invention, where the reply information determination apparatus for a vehicle-mounted robot includes: a voice acquisition module 310, a file generation module 320, a template determination module 330, a parameter lookup module 340, and an information determination module 350.

The voice acquiring module 310 is configured to perform word segmentation on the received user voice to obtain a voice word segmentation set of the user voice; a file generating module 320, configured to form a voice input file of the user voice according to the voice word set and a pre-formed phrase list dictionary; a template determination module 330 for determining a voice response template based on a given sample corpus; a parameter searching module 340, configured to determine target semantic parameters corresponding to each line in the voice input file, and search the target semantic parameters in a predetermined voice reply template; and an information determining module 350, configured to determine reply information corresponding to the user voice according to the search result.

According to the technical scheme, the problem that the similar question sentences and answer sentences are easy to confuse when the user voice is matched with the answer template sentences is solved, and the accuracy of determining the answer information of the vehicle-mounted robot is improved.

Optionally, the file generating module 320 includes:

the phrase list dictionary forming unit is used for carrying out word segmentation on a predetermined sample corpus to obtain a corpus word segmentation set; counting the occurrence frequency of each participle in the corpus participle set; and arranging corresponding corpus participles according to the occurrence frequency from large to small, and extracting the corpus participles in the preset number and corresponding arrangement serial numbers to form a phrase list dictionary.

The input file forming unit is used for initializing and constructing a voice input matrix, wherein the row number of the voice input matrix is equal to the row number of a first participle sequence contained in the voice participle set, and the column number of the voice input matrix is equal to the participle number of a first participle contained in the longest first participle sequence in the voice participle set; aiming at each first participle in the voice participle set, if the first participle exists in the phrase list dictionary, taking a sequence number corresponding to the first participle as a target coding value; otherwise, using a preset coding value as a target coding value of the first word segmentation; correspondingly writing target coding values of first participles contained in each participle sequence into the voice input matrix by taking the participle sequence as a row unit, and filling empty elements in the voice input matrix with the preset coding values; and taking the assigned voice input matrix as a voice input file of the voice of the user.

Optionally, the template determining module 330 includes:

and the sentence set forming unit is used for extracting the sample question sentences and the sample answer sentences in the sample corpus set to form a sample question sentence set and a sample answer sentence set.

And the file forming unit is used for forming a sample question file according to the sample question set and forming a sample answer file and a sample proofreading file according to the sample answer set integrated statement expression rule.

And the semantic parameter determining unit is used for determining standard semantic parameters corresponding to each line of sample participles of the sample question sentence file.

And the reply template forming unit is used for combining each standard semantic parameter according to the sample reply sentence file and the sample proofreading file to form a voice reply template containing each standard semantic parameter and a corresponding standard reply participle sequence.

Optionally, the file forming unit is further configured to perform word segmentation on the sample question set to obtain a question word set including at least one line of the second word segmentation sequence; initializing and constructing a sample question matrix, wherein the row number of the sample question matrix is equal to the row number of a second participle sequence contained in the question participle set, and the column number of the sample question matrix is equal to the participle number of a second participle contained in the longest second participle sequence in the question participle set; aiming at each second participle in the sample question sentence set, taking the corresponding arrangement serial number of the second participle in the phrase list dictionary as a target coding value of the second participle; correspondingly writing target coding values of second participles contained in each participle sequence into the sample question matrix by taking the participle sequence as a row unit, and filling empty elements in the sample question matrix with preset coding values; and taking the sample question matrix after assignment as the sample question file.

Optionally, the file forming unit is further configured to perform word segmentation on the sample answer set to obtain an answer word segmentation set including at least one row of the third entry sequence; aiming at each third participle in the sample answer sentence set, taking the corresponding arrangement serial number of the third participle in the phrase list dictionary as a target coding value of the third participle; initializing and constructing a sample answer sentence matrix, wherein the row number of the sample answer sentence matrix is equal to the row number of a third participle sequence contained in the answer sentence participle set, and the column number of the sample answer sentence matrix is equal to the participle number of a third participle contained in the longest third participle sequence in the answer sentence participle set; correspondingly writing target coding values of third participles contained in each participle sequence into the sample answer sentence matrix by taking the participle sequence as a row unit, and filling null elements in the sample answer sentence matrix with preset coding values; initializing and constructing a sample proofreading matrix, wherein the row number of the sample proofreading matrix is equal to the row number of a third participle sequence contained in the answer sentence participle set, and the column number of the sample proofreading matrix is equal to the participle number of a third participle contained in the longest third participle sequence in the answer sentence participle set; correspondingly writing target coding values of third participles contained in each participle sequence into the sample proofreading matrix by taking the participle sequence as a row unit, and filling empty elements in the sample proofreading matrix with preset coding values; and taking the sample sentence answering matrix after assignment as the sample sentence answering file, and taking the sample proofreading matrix after assignment as the sample proofreading file.

Optionally, the reply template forming unit is further configured to match the standard semantic parameters with the sample reply sentence file, and generate an initial voice reply template in which each standard semantic parameter corresponds to each participle sequence in the reply file; matching each participle sequence in the initial voice reply template with the sample proofreading file one by one, replacing participle sequences with different matching results with corresponding participle sequences in the sample proofreading file, taking the replaced participle sequences as the standard reply participle sequences, and generating the voice reply template with the standard semantic parameters corresponding to the standard reply participle sequences.

Optionally, the information determining module 350 is specifically configured to: if the target semantic parameter is the standard semantic parameter in the voice reply template, taking the standard reply participle sequence corresponding to the target semantic parameter in the voice reply template as the reply information of the user voice; otherwise, the preset reply sentence is used as the reply information of the user voice.

The reply information determining device of the vehicle-mounted robot provided by the embodiment of the invention can execute the reply information determining method of the vehicle-mounted robot provided by any embodiment of the invention, and has corresponding functional modules and beneficial effects of the executing method.

Example four

Fig. 5 is a schematic structural diagram of an apparatus according to a fourth embodiment of the present invention, as shown in fig. 5, the apparatus includes a processor 410, a memory 420, an input device 430, and an output device 440; the number of processors 410 in the device may be one or more, and one processor 4100 is exemplified in fig. 5; the processor 410, the memory 420, the input device 430 and the output device 440 in the apparatus may be connected by a bus or other means, for example, in fig. 5.

The memory 420 serves as a computer-readable storage medium, and may be used for storing software programs, computer-executable programs, and modules, such as program instructions/modules corresponding to the missed fee detection method in the embodiment of the present invention (for example, the voice obtaining module 310, the file generating module 320, the template determining module 330, the parameter searching module 340, and the information determining module 350). The processor 410 executes various functional applications of the device and data processing by running software programs, instructions, and modules stored in the memory 420, that is, implements the reply information determination method of the in-vehicle robot described above.

The memory 420 may mainly include a program storage area and a data storage area, wherein the program storage area may store an operating system, an application program required for at least one function; the storage data area may store data created according to the use of the terminal, and the like. Further, the memory 420 may include high speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other non-volatile solid state storage device. In some examples, memory 420 may further include memory located remotely from processor 410, which may be connected to devices through a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.

The input device 430 may be used to receive input numeric or character information and generate key signal inputs related to user settings and function control of the computing device, and may include a keyboard and a mouse, etc. The output device 440 may include a display device such as a display screen.

EXAMPLE five

The fifth embodiment of the present invention further provides a storage medium containing computer-executable instructions, which are used for executing a reply information determination method of a vehicle-mounted robot when executed by a computer processor.

Of course, the storage medium containing the computer-executable instructions provided by the embodiments of the present invention is not limited to the method operations described above, and may also perform related operations in a reply information determination method for a vehicle-mounted robot provided by any embodiments of the present invention.

From the above description of the embodiments, it is obvious for those skilled in the art that the present invention can be implemented by software and necessary general hardware, and certainly, can also be implemented by hardware, but the former is a better embodiment in many cases. Based on such understanding, the technical solutions of the present invention may be embodied in the form of a software product, which may be stored in a computer-readable storage medium, such as a floppy disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a FLASH Memory (FLASH), a hard disk or an optical disk of a computer, and includes several instructions for enabling a computer device (which may be a personal computer, a server, or a network device) to execute the methods according to the embodiments of the present invention.

It should be noted that, in the embodiment of the reply information determination apparatus for a vehicle-mounted robot, the units and modules included in the apparatus are merely divided according to the functional logic, but are not limited to the above division as long as the corresponding functions can be realized; in addition, specific names of the functional units are only for convenience of distinguishing from each other, and are not used for limiting the protection scope of the present invention.

It is to be noted that the foregoing is only illustrative of the preferred embodiments of the present invention and the technical principles employed. It will be understood by those skilled in the art that the present invention is not limited to the particular embodiments described herein, but is capable of various obvious changes, rearrangements and substitutions as will now become apparent to those skilled in the art without departing from the scope of the invention. Therefore, although the present invention has been described in greater detail by the above embodiments, the present invention is not limited to the above embodiments, and may include other equivalent embodiments without departing from the spirit of the present invention, and the scope of the present invention is determined by the scope of the appended claims.

Claims

1. A reply information determination method for a vehicle-mounted robot, comprising:

2. The method of claim 1, wherein forming the phrase list dictionary comprises:

performing word segmentation on a predetermined sample corpus to obtain a corpus word segmentation set;

counting the occurrence frequency of each participle in the corpus participle set;

and arranging corresponding corpus participles according to the occurrence frequency from large to small, and extracting the corpus participles in the preset number and corresponding arrangement serial numbers to form a phrase list dictionary.

3. The method of claim 2, wherein the speech segmentation set comprises at least one row of first segmentation sequences formed after segmentation processing the speech of the user;

correspondingly, the forming of the voice input file of the user voice according to the voice word set and the pre-formed phrase list dictionary includes:

initializing and constructing a voice input matrix, wherein the number of rows of the voice input matrix is equal to the number of rows of a first participle sequence contained in the voice participle set, and the number of columns of the voice input matrix is equal to the number of participles of a first participle contained in the longest first participle sequence in the voice participle set;

aiming at each first participle in the voice participle set, if the first participle exists in the phrase list dictionary, taking a sequence number corresponding to the first participle as a target coding value; otherwise, using a preset coding value as a target coding value of the first word segmentation;

correspondingly writing target coding values of first participles contained in each participle sequence into the voice input matrix by taking the participle sequence as a row unit, and filling empty elements in the voice input matrix with the preset coding values;

and taking the assigned voice input matrix as a voice input file of the voice of the user.

4. The method of claim 2, wherein the sample corpus comprises at least one set of sample question-and-answer sentences, each set of sample question-and-answer sentences comprising a row of sample question-and-answer sentences and an associated row of sample answer-and-answer sentences;

accordingly, the process of determining the voice response template includes the steps of:

extracting sample question sentences and sample answer sentences in the sample corpus set to form a sample question sentence set and a sample answer sentence set;

forming a sample question file according to the sample question set, and forming a sample answer file and a sample proofreading file according to the sample answer set and the integrated statement expression rule;

determining standard semantic parameters corresponding to sample participles of each line of the sample question sentence file;

and combining each standard semantic parameter according to the sample answer file and the sample proofreading file to form a voice answer template comprising each standard semantic parameter and a corresponding standard answer participle sequence.

5. The method of claim 4, wherein forming a sample question file from the set of sample questions comprises:

performing word segmentation on the sample question set to obtain a question word set comprising at least one line of second word segmentation sequence;

initializing and constructing a sample question matrix, wherein the row number of the sample question matrix is equal to the row number of a second participle sequence contained in the question participle set, and the column number of the sample question matrix is equal to the participle number of a second participle contained in the longest second participle sequence in the question participle set;

aiming at each second participle in the sample question sentence set, taking the corresponding arrangement serial number of the second participle in the phrase list dictionary as a target coding value of the second participle;

correspondingly writing target coding values of second participles contained in each participle sequence into the sample question matrix by taking the participle sequence as a row unit, and filling empty elements in the sample question matrix with preset coding values;

6. The method of claim 4, wherein said forming a sample sentence file and a sample collation file according to said sample sentence set binding sentence expression rules comprises:

word segmentation processing is carried out on the sample answer sentence set, and an answer sentence word segmentation set comprising at least one line of third word segmentation sequence is obtained;

aiming at each third participle in the sample answer sentence set, taking the corresponding arrangement serial number of the third participle in the phrase list dictionary as a target coding value of the third participle;

initializing and constructing a sample answer sentence matrix, wherein the row number of the sample answer sentence matrix is equal to the row number of a third participle sequence contained in the answer sentence participle set, and the column number of the sample answer sentence matrix is equal to the participle number of a third participle contained in the longest third participle sequence in the answer sentence participle set;

correspondingly writing target coding values of third participles contained in each participle sequence into the sample answer sentence matrix by taking the participle sequence as a row unit, and filling null elements in the sample answer sentence matrix with preset coding values;

initializing and constructing a sample proofreading matrix, wherein the row number of the sample proofreading matrix is equal to the row number of a third participle sequence contained in the answer sentence participle set, and the column number of the sample proofreading matrix is equal to the participle number of a third participle contained in the longest third participle sequence in the answer sentence participle set;

correspondingly writing target coding values of third participles contained in each participle sequence into the sample proofreading matrix by taking the participle sequence as a row unit, and filling empty elements in the sample proofreading matrix with preset coding values;

7. The method of claim 4, wherein combining each of the standard semantic parameters with the sample sentence file and the sample collation file to form a voice response template including each of the standard semantic parameters and a corresponding standard response participle sequence comprises:

matching the standard semantic parameters with the sample answer file to generate an initial voice answer template corresponding to each participle sequence in the answer file;

8. The method according to any one of claims 1-7, wherein said determining reply information corresponding to said user speech according to the search result comprises:

if the target semantic parameter is the standard semantic parameter in the voice reply template, taking the standard reply participle sequence corresponding to the target semantic parameter in the voice reply template as the reply information of the user voice; otherwise, the preset reply sentence is used as the reply information of the user voice.

9. A reply information determination device for a vehicle-mounted robot, comprising:

10. An apparatus, characterized in that the apparatus comprises:

one or more processors;

storage means for storing one or more programs;

when the one or more programs are executed by the one or more processors, the one or more processors implement the reply information determination method of the in-vehicle robot according to any of claims 1 to 8.