CN112259102A

CN112259102A - Retail scene voice interaction optimization method based on knowledge graph

Info

Publication number: CN112259102A
Application number: CN202011179681.4A
Authority: CN
Inventors: 李抒雁; 沙涛
Original assignee: Shixiang Intelligent Technology Suzhou Co ltd
Current assignee: Shixiang Intelligent Technology Suzhou Co ltd
Priority date: 2020-10-29
Filing date: 2020-10-29
Publication date: 2021-01-22

Abstract

The invention discloses a retail scene voice interaction optimization method based on a knowledge graph, which comprises the following steps: s1, inputting voice, and performing language recognition by the system through the processor; s2, extracting labels and association relations from brand commodity information, and constructing a brand knowledge graph; s3, voice interactive shopping searching based on the online knowledge map; s4, analyzing the recorded voice data, and adding homophones and synonyms; s5, applying the new data training model to the business, and greatly improving the use experience in the voice interaction shopping scene. The brand knowledge map is established by receiving brand professional words and learning different regional pronunciations and synonyms in the whole country, and the voice recognition effect can be greatly improved by combining with voice recognition and NLP. According to the test, the recognition of the special brand words reaches 92%, the text intention extraction accuracy reaches 96%, and the relation understanding accuracy in the scene reaches 90%. Meanwhile, scene word search is optimized, and commodity display and transaction are greatly promoted in voice interactive shopping.

Description

Retail scene voice interaction optimization method based on knowledge graph

Technical Field

The invention relates to the technical field of voice interaction, in particular to a retail scene voice interaction optimization method based on a knowledge graph.

Background

The voice is the first attribute among three basic attributes of the shape, the sound and the meaning of the language, the human language is formed in the form of voice firstly, and the voice has a language without characters in the world but does not have the language without voice, and plays a decisive supporting role in the language;

however, the retail scene voice interaction optimization method based on the knowledge graph in the market at present only is suitable for individual customization and use by directly carrying out personalized training on the model, the difficulty in use in the vertical industry is high, the model is difficult to train, each brand has own exclusive knowledge content and is difficult to train, and retail commodities are updated quickly.

Disclosure of Invention

The invention provides a retail scene voice interaction optimization method based on a knowledge graph, which can effectively solve the problems that the retail scene voice interaction optimization method based on the knowledge graph in the current market is provided in the background technology, the model is directly trained in an individualized way and is only suitable for individual customization, the use difficulty in the vertical industry is high, the model is difficult to train, each brand has own exclusive knowledge content and is not easy to train, and retail commodities are updated quickly.

In order to achieve the purpose, the invention provides the following technical scheme: a retail scene voice interaction optimization method based on a knowledge graph comprises the following steps:

s1, inputting voice, and performing language recognition by the system through the processor;

s2, extracting labels and association relations from brand commodity information, and constructing a brand knowledge graph;

s3, voice interactive shopping searching based on the online knowledge map;

s4, analyzing the recorded voice data, and adding homophones and synonyms;

and S5, applying the new data training model to the business.

According to the technical scheme, learning is carried out through reading linguistic data in the S1, a word stock of the user is established, and the dialect identification problem is solved;

according to the agreement of users, the special calling methods and different accents of commodities in different regions in a wide area are collected anonymously, in a voice shopping guide scene, default voices are related to services, and corpora are collected through machine learning and supervised learning to build a word stock in a vertical industry.

According to the technical scheme, the brand knowledge map is established in the S2 by receiving brand professional words and learning pronunciations and synonyms of different regions in the country, and the voice recognition effect can be greatly improved by combining with voice recognition and NLP.

According to the technical scheme, the accuracy of brand special word recognition, special word meaning analysis and scene relation analysis in the step S2 are matched with each other.

According to the technical scheme, data in the voice interactive shopping search in the S3 are updated every week, the data are stored in the server and are received and transmitted through the system;

the system performs voice recognition and input through network and personal input.

According to the technical scheme, the homophones and the synonyms in the S4 are listed according to original words, are divided into homophones, harmonious characters and harmonious characters, and comprise numbers, Arabic letters and a few voices.

According to the technical scheme, the step S5 is applied to the service, the voice data is transmitted to the voice system, the data of the voice data is stored and applied, and a loop is formed with the step S1.

According to the above technical solution, the complete specific person isolated word speech recognition system in S1 generally includes speech input, speech signal preprocessing, feature extraction, training and recognition;

the process of speech recognition can be regarded as the process of pattern matching, the process of pattern matching refers to the process of making unknown pattern and a certain model in the model base obtain the best matching according to certain criteria, the reference template that needs to be used in the pattern matching is obtained through template training, in the training stage, after certain processing is carried out on the characteristic parameters, a model is established for each entry and is stored as the template base, in the recognition stage, speech signals pass through the same channel to obtain speech characteristic parameters, a test template is generated and is matched with the reference template, the reference template with the highest matching score is used as the recognition result, and meanwhile, the accuracy of recognition can be improved with the help of a priori knowledge.

Compared with the prior art, the invention has the beneficial effects that: the voice interactive shopping system is scientific and reasonable in structure, safe and convenient to use, and greatly improves the use experience in the voice interactive shopping scene. The brand knowledge map is established by receiving brand professional words and learning different regional pronunciations and synonyms in the whole country, and the voice recognition effect can be greatly improved by combining with voice recognition and NLP. According to the test, the recognition of the special brand words reaches 92%, the text intention extraction accuracy reaches 96%, and the relation understanding accuracy in the scene reaches 90%. Meanwhile, scene word search is optimized, and commodity display and transaction are greatly promoted in voice interactive shopping.

Drawings

The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the principles of the invention and not to limit the invention.

In the drawings:

FIG. 1 is a schematic flow chart diagram illustrating the steps of the optimization method of the present invention;

FIG. 2 is a data comparison diagram of the present invention.

Detailed Description

The preferred embodiments of the present invention will be described in conjunction with the accompanying drawings, and it will be understood that they are described herein for the purpose of illustration and explanation and not limitation.

Example (b): as shown in fig. 1-2, the invention provides a technical solution, a retail scene voice interaction optimization method based on knowledge graph, comprising the following steps:

s3, voice interactive shopping searching based on the online knowledge map;

s4, analyzing the recorded voice data, and adding homophones and synonyms;

and S5, applying the new data training model to the business.

According to the technical scheme, learning is carried out through reading linguistic data in S1, a word stock of the user is established, and the dialect identification problem is solved;

According to the technical scheme, the brand knowledge map is established in the S2 by receiving and recording brand professional words and learning pronunciations and synonyms of different regions in the country, and the voice recognition effect can be greatly improved by combining with voice recognition and NLP.

According to the technical scheme, the accuracy of brand special word recognition, special word meaning analysis and scene relation analysis in the S2 are matched with each other.

According to the technical scheme, data in voice interactive shopping search in the S3 are updated every week, the data are stored in the server and are received and transmitted through the system;

According to the technical scheme, homophones and synonyms in the S4 are listed according to original words, are divided into homophones, harmonious characters and harmonious characters, and comprise numbers, Arabic letters and a few voices.

According to the technical scheme, the S5 is applied to the service, the voice data is transmitted to the voice system, the data of the voice data is stored and applied, and a loop is formed with the step S1.

According to the technical scheme, the complete specific person isolated word voice recognition system in the S1 generally comprises voice input, voice signal preprocessing, feature extraction, training and recognition;

Finally, it should be noted that: although the present invention has been described in detail with reference to the foregoing embodiments, it will be apparent to those skilled in the art that changes may be made in the embodiments and/or equivalents thereof without departing from the spirit and scope of the invention. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims

1. A retail scene voice interaction optimization method based on a knowledge graph is characterized by comprising the following steps: the method comprises the following steps:

s3, voice interactive shopping searching based on the online knowledge map;

s4, analyzing the recorded voice data, and adding homophones and synonyms;

and S5, applying the new data training model to the business.

2. The retail scene voice interaction optimization method based on knowledge graph according to claim 1, characterized in that in S1, learning is performed by reading corpora, a word stock of itself is established, and dialect recognition problem is solved;

3. The method for retail scene voice interaction optimization based on knowledge graph according to claim 1, wherein in the step S2, a brand knowledge graph is established by listing brand professional words and learning different regional pronunciations and synonyms across the country, and the voice recognition effect can be greatly improved by combining with voice recognition and NLP.

4. The method of claim 3, wherein the accuracies of brand specific word recognition, specific word meaning analysis and scene relationship analysis in S2 are matched with each other.

5. The method of claim 1, wherein the data in the voice-interactive shopping search in S3 is updated weekly, stored in a server, and received and transmitted through a system;

6. The method of claim 1, wherein the homophones and synonyms in the S4 are listed according to original words, and are divided into homophones, harmonious multiwords and harmonious homowords, and include numbers, arabic letters and a few voices.

7. The method of claim 1, wherein the step of S5 is applied to business, the voice data is transmitted to a voice system, and the data is stored for application, and the steps of S1 are looped.

8. The method of claim 1, wherein the system for recognizing the complete isolated word from the specific person in S1 generally comprises inputting speech, preprocessing speech signals, extracting features, training and recognizing;