CN108255956A - The method and system of dictionary are adaptively obtained based on historical data and machine learning - Google Patents

The method and system of dictionary are adaptively obtained based on historical data and machine learning Download PDF

Info

Publication number
CN108255956A
CN108255956A CN201711391038.6A CN201711391038A CN108255956A CN 108255956 A CN108255956 A CN 108255956A CN 201711391038 A CN201711391038 A CN 201711391038A CN 108255956 A CN108255956 A CN 108255956A
Authority
CN
China
Prior art keywords
dictionary
user
result
plane
sentence
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201711391038.6A
Other languages
Chinese (zh)
Other versions
CN108255956B (en
Inventor
蔡劲松
苏少炜
陈孝良
冯大航
常乐
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
BEIJING WISDOM TECHNOLOGY Co Ltd
Beijing SoundAI Technology Co Ltd
Original Assignee
BEIJING WISDOM TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BEIJING WISDOM TECHNOLOGY Co Ltd filed Critical BEIJING WISDOM TECHNOLOGY Co Ltd
Priority to CN201711391038.6A priority Critical patent/CN108255956B/en
Publication of CN108255956A publication Critical patent/CN108255956A/en
Application granted granted Critical
Publication of CN108255956B publication Critical patent/CN108255956B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3344Query execution using natural language analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing
    • G06F40/211Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)

Abstract

Present disclose provides a kind of method for adaptively obtaining voice lexicon based on historical data and machine learning, including:Step S1, the sentence mould that semantic plane is carried out to voice recognition result are classified, and find the kinetonucleus in phonetic order and relative dynamic member;Step S2 wins out the dynamic member in phonetic order, with reference to machine learning and user's history data, selects several dictionaries;Step S3, the participle of syntax plane is carried out with the method in natural language processing in the dictionary of selection, the result in comprehensive multiple dictionary fields is assessed, and asks for the highest field of point value of evaluation as optimal result, the optimal result is exported, while updates user's history data;Step S4 by the sentence category analysis (sca) of optimal result combination pragmatic plane, determines final dictionary field.By the service condition combination machine learning of user's history dictionary, corresponding field is adaptively obtained from the historical data of user, so as to considerably increase flexibility and accuracy.

Description

The method and system of dictionary are adaptively obtained based on historical data and machine learning
Technical field
This disclosure relates to artificial intelligent voice interaction field more particularly to one kind are adaptive based on historical data and machine learning The method and system in dictionary field should be obtained.
Background technology
One of the probing direction of intelligent sound box as man-machine interaction mode, under continuous development in recent years, various manufacturers All in exploitation ASR and carry out lexical comprehension using Chinese word segmentation.The characteristics of due to Chinese, the analysis of complex sentence it is sufficiently complex and Elapsed time.So ASR manufacturers can generally allow user that corresponding dictionary is selected to match corresponding field, such as music field is chatted Its field etc., to reduce the complexity of algorithm.
But the phonetic order that intelligent sound box receives is mostly simple sentence, i.e., only there are one kinetonucleus structure, and is mostly to pray making Sentence and interrogative sentence, pattern are also than relatively limited, this allows our the phonetic order features received according to intelligent sound box, The optimization in result and corresponding field is identified.
The selection in existing intelligent sound box voice lexicon field lacks flexibility, usually needs specified manually or passes through Call parameters are manually filling when ASR services are applied for.And when specify dictionary field after, do not do Method is adjusted correspondingly according to the usage scenario and historical data of user.
Disclosure
(1) technical problems to be solved
Present disclose provides a kind of method and system that voice lexicon is adaptively obtained based on historical data and machine learning, At least partly to solve the technical issues of set forth above.
(2) technical solution
According to one aspect of the disclosure, it provides one kind and voice word is adaptively obtained based on historical data and machine learning The method in library, including:Step S1, the sentence mould that semantic plane is carried out to voice recognition result are classified, and are found dynamic in phonetic order Core and relative dynamic member;Step S2 wins out the dynamic member in the phonetic order, with reference to machine learning and user's history Data select several dictionaries;Step S3 carries out point of syntax plane in the dictionary of selection with natural language processing method Word, the result in comprehensive multiple dictionary fields are assessed, and ask for the highest field of point value of evaluation as optimal result, described in output Optimal result, while update user's history data;Step S4 by the sentence category analysis (sca) of optimal result combination pragmatic plane, is determined most Whole dictionary field.
In the disclosure some embodiments, the sentence mould classification that semantic plane is carried out in the step S1 is calculated using pattern match Method obtains the kinetonucleus in phonetic order and relative dynamic member.
In the disclosure some embodiments, the step S2 includes:By the kinetonucleus in phonetic order and relative Dynamic member separation, and win and set out member, the several dictionaries gone out with reference to selected by machine deep learning according to dynamic member;And it is gone through according to user History data choose the most used several dictionary fields of user.
In the disclosure some embodiments, in the step S3, with the N- in natural language processing in the dictionary of selection Shortest-path method carries out the participle of syntax plane;It is using greedy algorithm or Dijkstra shortest paths in selection shortest path Algorithm.
In the disclosure some embodiments, in the step S3, the result in comprehensive multiple dictionary fields carries out assessment and includes: Correlation degree between word and word is assessed and shortest path algorithm result is assessed;Update user's history data Including:Update user's history dictionary field service condition and the power by historical data and machine learning to noun in dictionary Value optimizes.
In the disclosure some embodiments, further included before the step S1:Step S0, ASR identification engine receives user Phonetic order is sent out, speech recognition is carried out, obtains voice recognition result.
It is a kind of another aspect of the present disclosure provides that voice is adaptively obtained based on historical data and machine learning The system in dictionary field, including:Semantic plane analysis module carries out sentence mould classification, and by classification results to voice recognition result It is sent to a variety of dictionaries of selection;Syntax two dimensional analysis module, to carrying out syntax participle, and comprehensive multiple words in the dictionary of selection The result in library field is assessed, and exports the optimal result;Pragmatic two dimensional analysis module, by the optimal result combination pragmatic The sentence category analysis (sca) of plane determines final dictionary field.
In the disclosure some embodiments, the semantic plane analysis module includes:Sentence mould classification submodule, to the knowledge Other result carries out the sentence mould classification of semantic plane, finds the kinetonucleus in phonetic order and relative dynamic member;Machine choice Submodule after winning and setting out member, is sent to the several dictionaries gone out with reference to selected by machine deep learning;History selects submodule Block according to user's history data, is sent to the most used several dictionary fields of the user.
In the disclosure some embodiments, the syntax two dimensional analysis module includes:Submodule is segmented, in the dictionary of selection The middle N- shortest-path methods in natural language processing carry out the participle of syntax plane;Assessment and update submodule, by right Correlation degree between word and word is assessed, and shortest path algorithm result is assessed, and asks for the highest field of point value of evaluation As optimal result;And update user's history dictionary field service condition and by historical data and machine learning to word The weights of noun optimize in library.
In the disclosure some embodiments, ASR identification engines send out phonetic order for receiving user, carry out voice knowledge Not, voice recognition result is obtained.
(3) advantageous effect
It can be seen from the above technical proposal that the disclosure is based on historical data and machine learning adaptively obtains voice lexicon Method and system at least have the advantages that one of them:
(1) by the service condition of user's history dictionary, the high dictionary of frequency of use is preferentially found out, in combination with engineering Exercises are supplement, and corresponding field is adaptively obtained from the historical data of user, is avoided through parameter or its other party Formula forces designated user using specific field, so as to considerably increase flexibility and accuracy;
(2) by the way that the analysis of sentence is divided into three different aspects, comprehensive three aspects syntax, sentence mould and pragmatic side The analysis result in face reduces the complexity of analysis, improves identification accuracy.
Description of the drawings
Fig. 1 is the method stream that the embodiment of the present disclosure adaptively obtains voice lexicon field based on historical data and machine learning Cheng Tu.
Fig. 2 is the system knot that the embodiment of the present disclosure adaptively obtains voice lexicon field based on historical data and machine learning Structure schematic diagram.
Specific embodiment
Present disclose provides it is a kind of based on historical data and machine learning adaptively obtain voice lexicon field method and System.The disclosure is used the Type division of sentence into the analysis method of three planes:It is syntax, semantic and pragmatic.Its In, according to the sentence type that the syntax plane of sentence branches away, sentence pattern is can be described as, for example sentence is divided into subject-predicate sentence and non-subject-predicate Sentence.According to the sentence type that sentence semantics plane branches away, a mould is can be described as, for example sentence is divided into " kinetonucleus+take charge ", " is moved Core+take charge+visitor's thing ".According to the sentence type branched away in sentence pragmatic plane, a class is can be described as, for example sentence is divided into old State sentence, interrogative sentence, imperative sentence etc..
Because the sentence type come out from three two dimensional analysis is different, and the combination of different level can cause sentence The selection in sub- analysis result and field is more reasonable.Disclosure user speech instructs the adaptively selected of dictionary field so that uses Family or developer need not specify corresponding field, and can be according to the historical data of user instruction and with reference to machine learning Obtained field is supplement, is rapidly selected corresponding field.Using three aspects of the analysis of sentence, the analysis of sentence is carried out, Reduce the complexity of analysis.
Before the solution of description problem, the definition for first defining some specific vocabulary is helpful.
ASR Automatic Speech Recognition automatic speech recognition technologies;
Kinetonucleus is generally by the predicate of sentence or the verb of predicate head and Adjective ingredient;
Dynamic member kinetonucleus in connection with enforceable semantic component.
Purpose, technical scheme and advantage to make the disclosure are more clearly understood, below in conjunction with specific embodiment, and reference The disclosure is further described in attached drawing.
Disclosure some embodiments will be done with reference to appended attached drawing in rear and more comprehensively describe to property, some of but not complete The embodiment in portion will be shown.In fact, the various embodiments of the disclosure can be realized in many different forms, and should not be construed To be limited to this several illustrated embodiment;Relatively, these embodiments are provided so that the disclosure meets applicable legal requirement.
In first exemplary embodiment of the disclosure, provide a kind of adaptive based on historical data and machine learning The method for obtaining voice lexicon field.Fig. 1 is the flow chart of the adaptively selected dictionary field flow chart of the first embodiment of the present disclosure. As shown in Figure 1, the disclosure is included based on the method that historical data and machine learning adaptively obtain voice lexicon:
Step S0, ASR identification engine receives user and sends out phonetic order, carries out speech recognition, obtains recognition result;
Step S1, the sentence mould that semantic plane is carried out to the recognition result are classified, find kinetonucleus in phonetic order and Relative dynamic member, dynamic member is objective thing etc. of taking charge, most of to be represented by noun ingredient;
Step S2 wins out the dynamic member in phonetic order, and several dictionaries, while basis are selected with reference to machine deep learning User's history data select several dictionary fields that the user is the most used;
Step S3 carries out syntax plane in the dictionary of selection with the N- shortest-path methods in natural language processing Participle, the result in comprehensive multiple dictionary fields are assessed, and are asked for the highest field of point value of evaluation as optimal result, are exported institute Optimal result is stated, while updates user's history data;
Step S4 carries out the sentence category analysis (sca) of pragmatic plane, determines final dictionary field.
The each step for adaptively obtaining the method for voice lexicon to the present embodiment individually below is described in detail.
The sentence mould classification of semantic plane is carried out in the step S1, using pattern matching algorithm, so as to obtain phonetic order In kinetonucleus and relative dynamic member.
The kinetonucleus in phonetic order and relative dynamic member are detached, and win and set out member in the step S2, root According to several dictionaries that dynamic member goes out with reference to selected by machine deep learning, such as:Music, navigation etc.;According to historical data, choose and use The most used several dictionary fields in family, such as:Chat, star etc..
In the step S3, syntax is carried out with the N- shortest-path methods in natural language processing in the dictionary of selection The participle of plane;The principle of the N- shortest-path methods is:Each sentence will generate a directed acyclic graph, each word As a vertex of figure, while representing possible participle.Each side has that there are one weights (initial value 1), represents that the word goes out Existing probability;Preferably, the weights are using the value of TF-IDF obtained in dictionary;In above-mentioned directed acyclic graph, N items are found Weights and maximum path.In general, shortest path more than one, is to solve suboptimal solution using greedy algorithm in selection shortest path Or Dijkstra shortest path firsts.Since the optimal solution and suboptimal solution of shortest path are not much different in participle effect, It is preferred that optimal path is solved using greedy algorithm;
The result in comprehensive multiple dictionary fields carries out assessment and includes:(for example voice refers to correlation degree between word and word Enable as " I wants to listen turning one's head again for Huan Jiang Yu ", the correlation degree of " Jiang Yuhuan " and " turning one's head again " here just than " educating " and The correlation degree of " turning one's head again " is high) it is assessed, to shortest path algorithm result, (for example " Jiang Yuhuan " this word is just than " educating " higher is evaluated in shortest path algorithm) assessed;
Update user's history data include:Update user's history dictionary field service condition and by historical data, with And machine learning optimizes the weights of noun in dictionary.
The step S4, the sentence category analysis (sca) for carrying out pragmatic plane include:Sentence put forward query or issue an order etc. into Row analysis finally determines dictionary field according to analysis result with reference to the optimal result.
The disclosure integrates machine learning according to the historical data of user, adaptively finds matched user thesaurus field, And dynamically update the service condition in user thesaurus field.In terms of the analysis of sentence, the kinetonucleus of sentence is found out in terms of sentence mould With dynamic member, the analysis in terms of syntax is carried out in dictionary to dynamic member.Analysis in terms of last synthetic sentence mould and in terms of syntax, carries out The analysis of pragmatic side by disclosed method, can improve dictionary field selection accuracy, and improve the identification of phonetic order Accuracy.
So far, the first embodiment of the present disclosure adaptively obtains the side in voice lexicon field based on historical data and machine learning Method introduction finishes.
In second exemplary embodiment of the disclosure, provide a kind of adaptive based on historical data and machine learning The system for obtaining voice lexicon field.Fig. 2 is based on historical data for the embodiment of the present disclosure and machine learning adaptively obtains voice The system structure diagram in dictionary field.As shown in Fig. 2, system includes:ASR identifications engine, semantic plane analysis module, syntax Two dimensional analysis module and pragmatic two dimensional analysis module.
The system that voice lexicon field is adaptively obtained based on historical data and machine learning to the present embodiment individually below Various pieces be described in detail.
ASR identification engines send out phonetic order for receiving user, carry out speech recognition, obtain recognition result;
Semantic plane analysis module, the semantic plane analysis module include:
Sentence mould classification submodule, the sentence mould that semantic plane is carried out to the recognition result are classified, are found in phonetic order Kinetonucleus and relative dynamic member (most of to be represented by noun ingredient, that is, objective thing etc. of taking charge);
Machine choice submodule after winning and setting out member, is sent to the several words gone out with reference to selected by machine deep learning Library;
History selects submodule, after winning and setting out member, according to user's history data, while is sent to the user using most Frequent several dictionary fields.
Syntax two dimensional analysis module, the syntax two dimensional analysis module include:
Submodule is segmented, carrying out syntax with the N- shortest-path methods in natural language processing in the dictionary of selection puts down The participle in face, the principle of the N- shortest-path methods are:Each sentence will generate a directed acyclic graph, and each word is made For a vertex of figure, while representing possible participle.Each side has there are one weights (initial value 1), represents that the word occurs Probability;Preferably, the weights are using the value of TF-IDF obtained in dictionary;In above-mentioned directed acyclic graph, N items power is found Value and maximum path.In general, shortest path more than one, selection shortest path be using greedy algorithm solve suboptimal solution or Dijkstra shortest path firsts.It is excellent since the optimal solution and suboptimal solution of shortest path are not much different in participle effect Choosing solves optimal path using greedy algorithm;
Assessment and update submodule, the result in comprehensive multiple dictionary fields are assessed, and ask for the highest neck of point value of evaluation Domain exports the optimal result, while update user's history data as optimal result;
It is described that the highest field of point value of evaluation is taken to include:Between word and word correlation degree (such as phonetic order be " I Want to listen turning one's head again for Huan Jiang Yu ", the correlation degree of " Jiang Yuhuan " and " turning one's head again " here just " is returned than " educating " and again It is first " correlation degree it is high) assessed, to shortest path algorithm result, (for example " Jiang Yuhuan " this word just exists than " educating " Higher is evaluated in shortest path algorithm) it is assessed,
Update user's history data include:Update user's history dictionary field service condition and by historical data and Machine learning optimizes the weights of noun in dictionary.
Pragmatic two dimensional analysis module carries out the sentence category analysis (sca) of pragmatic plane, determines final dictionary field.
In order to achieve the purpose that brief description, in above-described embodiment 1, any technical characteristic narration for making same application is all And in this, without repeating identical narration.
So far, the second embodiment of the present disclosure is based on what historical data and machine learning adaptively obtained voice lexicon field The various pieces introduction of system finishes.
So far, attached drawing is had been combined the embodiment of the present disclosure is described in detail.It should be noted that it in attached drawing or says In bright book text, the realization method that is not painted or describes is form known to a person of ordinary skill in the art in technical field, and It is not described in detail.In addition, the above-mentioned definition to each element and method be not limited in mentioning in embodiment it is various specific Structure, shape or mode, those of ordinary skill in the art simply can be changed or replaced to it.
Furthermore word "comprising" does not exclude the presence of element or step not listed in the claims.Before element Word "a" or "an" does not exclude the presence of multiple such elements.
In addition, unless specifically described or the step of must sequentially occur, there is no restriction in more than institute for the sequence of above-mentioned steps Row, and can change or rearrange according to required design.And above-described embodiment can be based on the considerations of design and reliability, that This mix and match is used using or with other embodiment mix and match, i.e., the technical characteristic in different embodiments can be freely combined Form more embodiments.
Algorithm and display be not inherently related to any certain computer, virtual system or miscellaneous equipment provided herein. Various general-purpose systems can also be used together with teaching based on this.As described above, required by constructing this kind of system Structure be obvious.In addition, the disclosure is not also directed to any certain programmed language.It should be understood that it can utilize various Programming language realizes content of this disclosure described here, and the description done above to language-specific is to disclose this public affairs The preferred forms opened.
The disclosure can be by means of including the hardware of several different elements and by means of properly programmed computer It realizes.The all parts embodiment of the disclosure can be with hardware realization or to be run on one or more processor Software module is realized or is realized with combination thereof.It it will be understood by those of skill in the art that can be in practice using micro- Processor or digital signal processor (DSP) are some or all in the relevant device according to the embodiment of the present disclosure to realize The some or all functions of component.The disclosure be also implemented as a part for performing method as described herein or Whole equipment or program of device (for example, computer program and computer program product).Such journey for realizing the disclosure Sequence can may be stored on the computer-readable medium or can have the form of one or more signal.Such signal can It obtains either providing on carrier signal or providing in the form of any other to download from internet website.
Those skilled in the art, which are appreciated that, to carry out adaptively the module in the equipment in embodiment Change and they are arranged in one or more equipment different from the embodiment.It can be the module or list in embodiment Member or component be combined into a module or unit or component and can be divided into addition multiple submodule or subelement or Sub-component.Other than such feature and/or at least some of process or unit exclude each other, it may be used any Combination is disclosed to all features disclosed in this specification (including adjoint claim, abstract and attached drawing) and so to appoint Where all processes or unit of method or equipment are combined.Unless expressly stated otherwise, this specification is (including adjoint power Profit requirement, abstract and attached drawing) disclosed in each feature can be by providing the alternative features of identical, equivalent or similar purpose come generation It replaces.If also, in the unit claim for listing equipment for drying, several in these devices can be by same hard Part item embodies.
Similarly, it should be understood that in order to simplify the disclosure and help to understand one or more of each open aspect, Above in the description of the exemplary embodiment of the disclosure, each feature of the disclosure is grouped together into single implementation sometimes In example, figure or descriptions thereof.However, the method for the disclosure should be construed to reflect following intention:I.e. required guarantor The disclosure of shield requires features more more than the feature being expressly recited in each claim.More precisely, as following Claims reflect as, open aspect is all features less than single embodiment disclosed above.Therefore, Thus the claims for following specific embodiment are expressly incorporated in the specific embodiment, wherein each claim is in itself All as the separate embodiments of the disclosure.
Particular embodiments described above has carried out the purpose, technical solution and advantageous effect of the disclosure further in detail It describes in detail bright, it should be understood that the foregoing is merely the specific embodiment of the disclosure, is not limited to the disclosure, it is all Within the spirit and principle of the disclosure, any modification, equivalent substitution, improvement and etc. done should be included in the guarantor of the disclosure Within the scope of shield.

Claims (10)

1. a kind of method that voice lexicon is adaptively obtained based on historical data and machine learning, including:
Step S1, the sentence mould that semantic plane is carried out to voice recognition result are classified, find kinetonucleus in phonetic order and and its Relevant dynamic member;
Step S2 wins out the dynamic member in the phonetic order, with reference to machine learning and user's history data, selects several words Library;
Step S3 carries out the participle of syntax plane, comprehensive multiple dictionary necks in the dictionary of selection with natural language processing method The result in domain is assessed, and is asked for the highest field of point value of evaluation as optimal result, is exported the optimal result, update simultaneously User's history data;
Step S4 by the sentence category analysis (sca) of optimal result combination pragmatic plane, determines final dictionary field.
2. according to the method described in claim 1, wherein, the sentence mould classification of semantic plane is carried out in the step S1 using pattern Matching algorithm obtains the kinetonucleus in phonetic order and relative dynamic member.
3. according to the method described in claim 1, wherein, the step S2 includes:
It by the kinetonucleus in phonetic order and relative dynamic member separation, and wins and sets out member, it is deep to combine machine according to dynamic member The several dictionaries gone out selected by degree study;And according to user's history data, choose the most used several dictionary fields of user.
4. according to the method described in claim 1, wherein,
In the step S3, syntax plane is carried out with the N- shortest-path methods in natural language processing in the dictionary of selection Participle;It is using greedy algorithm or Dijkstra shortest path firsts in selection shortest path.
5. according to the method described in claim 1, wherein, in the step S3,
The result in comprehensive multiple dictionary fields carries out assessment and includes:Correlation degree between word and word is assessed and right Shortest path algorithm result is assessed;
Update user's history data include:It updates user's history dictionary field service condition and passes through historical data and machine Study optimizes the weights of noun in dictionary.
6. it according to the method described in claim 1, is further included before the step S1:
Step S0, ASR identification engine receives user and sends out phonetic order, carries out speech recognition, obtains voice recognition result.
7. a kind of system that voice lexicon field is adaptively obtained based on historical data and machine learning, including:
Semantic plane analysis module carries out sentence mould classification to voice recognition result, and classification results is sent to a variety of words of selection Library;
Syntax two dimensional analysis module, to carrying out syntax participle in the dictionary of selection, and the result in comprehensive multiple dictionary fields into Row assessment, exports the optimal result;
Pragmatic two dimensional analysis module by the sentence category analysis (sca) of the optimal result combination pragmatic plane, determines final dictionary field.
8. system according to claim 7, wherein, the semantic plane analysis module includes:
Sentence mould classification submodule, the sentence mould that semantic plane is carried out to the recognition result are classified, and find the kinetonucleus in phonetic order And relative dynamic member;
Machine choice submodule after winning and setting out member, is sent to the several dictionaries gone out with reference to selected by machine deep learning;
History selects submodule, according to user's history data, is sent to the most used several dictionary fields of the user.
9. system according to claim 7, wherein, the syntax two dimensional analysis module includes:
Submodule is segmented, syntax plane is carried out with the N- shortest-path methods in natural language processing in the dictionary of selection Participle;
Assessment and update submodule, are assessed by the correlation degree between word and word, shortest path algorithm result are carried out Assessment, asks for the highest field of point value of evaluation as optimal result;And update user's history dictionary field service condition and The weights of noun in dictionary are optimized by historical data and machine learning.
10. system according to claim 7, further includes,
ASR identifies engine, sends out phonetic order for receiving user, carries out speech recognition, obtain voice recognition result.
CN201711391038.6A 2017-12-21 2017-12-21 Method and system for adaptively acquiring word bank field based on historical data and machine learning Active CN108255956B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711391038.6A CN108255956B (en) 2017-12-21 2017-12-21 Method and system for adaptively acquiring word bank field based on historical data and machine learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711391038.6A CN108255956B (en) 2017-12-21 2017-12-21 Method and system for adaptively acquiring word bank field based on historical data and machine learning

Publications (2)

Publication Number Publication Date
CN108255956A true CN108255956A (en) 2018-07-06
CN108255956B CN108255956B (en) 2020-04-03

Family

ID=62723438

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711391038.6A Active CN108255956B (en) 2017-12-21 2017-12-21 Method and system for adaptively acquiring word bank field based on historical data and machine learning

Country Status (1)

Country Link
CN (1) CN108255956B (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109543182A (en) * 2018-11-15 2019-03-29 广东电网有限责任公司信息中心 A kind of electric power enterprise based on solr engine takes turns interactive semantic analysis method more
CN110188174A (en) * 2019-04-19 2019-08-30 浙江工业大学 A kind of professional domain FAQ intelligent answer method excavated based on specialized vocabulary
CN110544480A (en) * 2019-09-05 2019-12-06 苏州思必驰信息科技有限公司 Voice recognition resource switching method and device
CN111581946A (en) * 2020-04-21 2020-08-25 上海爱数信息技术股份有限公司 Language sequence model decoding method

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101655837A (en) * 2009-09-08 2010-02-24 北京邮电大学 Method for detecting and correcting error on text after voice recognition
CN102867511A (en) * 2011-07-04 2013-01-09 余喆 Method and device for recognizing natural speech
CN103389979A (en) * 2012-05-08 2013-11-13 腾讯科技(深圳)有限公司 System, device and method for recommending classification lexicon in input method
CN105138663A (en) * 2015-09-01 2015-12-09 百度在线网络技术(北京)有限公司 Word bank query method and device
CN105654954A (en) * 2016-04-06 2016-06-08 普强信息技术(北京)有限公司 Cloud voice recognition system and method
CN105868176A (en) * 2016-03-02 2016-08-17 北京同尘世纪科技有限公司 Text based video synthesis method and system
CN106156002A (en) * 2016-06-30 2016-11-23 乐视控股(北京)有限公司 The system of selection of participle dictionary and system
CN106202301A (en) * 2016-07-01 2016-12-07 武汉泰迪智慧科技有限公司 A kind of intelligent response system based on degree of depth study
CN106446022A (en) * 2016-08-29 2017-02-22 华东师范大学 Formal semantic reasoning and deep learning-based natural language knowledge mining method
CN106875941A (en) * 2017-04-01 2017-06-20 彭楚奥 A kind of voice method for recognizing semantics of service robot
CN107015969A (en) * 2017-05-19 2017-08-04 四川长虹电器股份有限公司 Can self-renewing semantic understanding System and method for

Patent Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101655837A (en) * 2009-09-08 2010-02-24 北京邮电大学 Method for detecting and correcting error on text after voice recognition
CN102867511A (en) * 2011-07-04 2013-01-09 余喆 Method and device for recognizing natural speech
CN103389979A (en) * 2012-05-08 2013-11-13 腾讯科技(深圳)有限公司 System, device and method for recommending classification lexicon in input method
CN105138663A (en) * 2015-09-01 2015-12-09 百度在线网络技术(北京)有限公司 Word bank query method and device
CN105868176A (en) * 2016-03-02 2016-08-17 北京同尘世纪科技有限公司 Text based video synthesis method and system
CN105654954A (en) * 2016-04-06 2016-06-08 普强信息技术(北京)有限公司 Cloud voice recognition system and method
CN106156002A (en) * 2016-06-30 2016-11-23 乐视控股(北京)有限公司 The system of selection of participle dictionary and system
CN106202301A (en) * 2016-07-01 2016-12-07 武汉泰迪智慧科技有限公司 A kind of intelligent response system based on degree of depth study
CN106446022A (en) * 2016-08-29 2017-02-22 华东师范大学 Formal semantic reasoning and deep learning-based natural language knowledge mining method
CN106875941A (en) * 2017-04-01 2017-06-20 彭楚奥 A kind of voice method for recognizing semantics of service robot
CN107015969A (en) * 2017-05-19 2017-08-04 四川长虹电器股份有限公司 Can self-renewing semantic understanding System and method for

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
刘灿群: ""试论谓词性"就是"开头句的句模"", 《广东农工商职业技术学院学报》 *
张普: "《汉语信息处理研究》", 31 August 1992, 北京:北京语言学院出版社 *
范晓: "《语法研究和探索(七)》", 31 December 1995 *
邵敬敏: "《现代汉语通论(第三版)(上册)》", 31 August 2016, 上海:上海教育出版社 *

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109543182A (en) * 2018-11-15 2019-03-29 广东电网有限责任公司信息中心 A kind of electric power enterprise based on solr engine takes turns interactive semantic analysis method more
CN110188174A (en) * 2019-04-19 2019-08-30 浙江工业大学 A kind of professional domain FAQ intelligent answer method excavated based on specialized vocabulary
CN110188174B (en) * 2019-04-19 2021-10-29 浙江工业大学 Professional field FAQ intelligent question and answer method based on professional vocabulary mining
CN110544480A (en) * 2019-09-05 2019-12-06 苏州思必驰信息科技有限公司 Voice recognition resource switching method and device
CN110544480B (en) * 2019-09-05 2022-03-11 思必驰科技股份有限公司 Voice recognition resource switching method and device
CN111581946A (en) * 2020-04-21 2020-08-25 上海爱数信息技术股份有限公司 Language sequence model decoding method
CN111581946B (en) * 2020-04-21 2023-10-13 上海爱数信息技术股份有限公司 Language sequence model decoding method

Also Published As

Publication number Publication date
CN108255956B (en) 2020-04-03

Similar Documents

Publication Publication Date Title
JP7346609B2 (en) Systems and methods for performing semantic exploration using natural language understanding (NLU) frameworks
US10152971B2 (en) System and method for advanced turn-taking for interactive spoken dialog systems
US9190054B1 (en) Natural language refinement of voice and text entry
JP5149737B2 (en) Automatic conversation system and conversation scenario editing device
CN104538024B (en) Phoneme synthesizing method, device and equipment
US20150279366A1 (en) Voice driven operating system for interfacing with electronic devices: system, method, and architecture
EP2317508B1 (en) Grammar rule generation for speech recognition
Orosz et al. PurePos 2.0: a hybrid tool for morphological disambiguation
CN109196495A (en) Fine granularity natural language understanding
CN108255956A (en) The method and system of dictionary are adaptively obtained based on historical data and machine learning
Watts Unsupervised learning for text-to-speech synthesis
CN109408622A (en) Sentence processing method and its device, equipment and storage medium
JP2021528761A (en) Limited natural language processing
Rieser et al. Natural language generation as incremental planning under uncertainty: Adaptive information presentation for statistical dialogue systems
KR20130128717A (en) Conversation managemnt system and method thereof
Dethlefs et al. Conditional random fields for responsive surface realisation using global features
CN109637537A (en) A kind of method that automatic acquisition labeled data optimizes customized wake-up model
US20060229863A1 (en) System for generating and selecting names
JP5625827B2 (en) Morphological analyzer, speech synthesizer, morphological analysis method, and morphological analysis program
Granroth-Wilding et al. Harmonic analysis of music using combinatory categorial grammar
Alías et al. Towards high-quality next-generation text-to-speech synthesis: A multidomain approach by automatic domain classification
Keh et al. Pancetta: Phoneme aware neural completion to elicit tongue twisters automatically
CN114970733A (en) Corpus generation method, apparatus, system, storage medium and electronic device
Kominek Tts from zero: Building synthetic voices for new languages
US11900918B2 (en) Method for training a linguistic model and electronic device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
PE01 Entry into force of the registration of the contract for pledge of patent right

Denomination of invention: Method and system of adaptive acquisition of thesaurus domain based on historical data and machine learning

Effective date of registration: 20200925

Granted publication date: 20200403

Pledgee: Haidian Beijing science and technology enterprise financing Company limited by guarantee

Pledgor: BEIJING SOUNDAL TECHNOLOGY Co.,Ltd.

Registration number: Y2020990001168

PE01 Entry into force of the registration of the contract for pledge of patent right
PC01 Cancellation of the registration of the contract for pledge of patent right

Date of cancellation: 20211202

Granted publication date: 20200403

Pledgee: Haidian Beijing science and technology enterprise financing Company limited by guarantee

Pledgor: SOUNDAI TECHNOLOGY Co.,Ltd.

Registration number: Y2020990001168

PC01 Cancellation of the registration of the contract for pledge of patent right