CN109101499A

CN109101499A - Artificial intelligent voice learning method neural network based

Info

Publication number: CN109101499A
Application number: CN201810874085.4A
Authority: CN
Inventors: 王大江
Original assignee: Individual
Current assignee: Beijing Zhongke Huilian Technology Co ltd
Priority date: 2018-08-02
Filing date: 2018-08-02
Publication date: 2018-12-28
Anticipated expiration: 2038-08-02
Also published as: CN109101499B

Abstract

In order to further increase the efficiency and accuracy when translation on line, the present invention provides a kind of artificial intelligent voice learning methods neural network based, including (1) based on using the progress foreign language-Chinese text artificial intelligence study of self-adaptive growth type neural network；(2) voiced translation of foreign language-Chinese is carried out.The present invention can be by big data foreign language-Chinese bilingual dictionary that machine learning obtains, semantic and context matching is carried out based on 6 rank depth probability analysis methods, to which way compared with prior art reduces the operand more than 40% or more, translation efficiency is improved while ensuring translation accuracy.

Description

Artificial intelligent voice learning method neural network based

Technical field

The present invention relates to voice control technology fields, more particularly, to a kind of artificial intelligence language neural network based Sound learning method.

Background technique

With the development of science and technology and economic globalization, either in daily life or sphere of learning is linked up, Translation on line exchange has existed more and more demands.Although having existed simultaneous interpretation, portable machine interpreting equipment etc. It comes into being, but in the usage scenario of the meeting or classroom for being related to professional domain etc., the accuracy of conventional machines interpreting equipment And the efficiency of simultaneous interpretation personnel makes people worried.Especially when certain side's word speed is very fast, machine translation will be difficult to be competent at, And simultaneous interpretation personnel then need to repeat the language being not kept pace with using the mode reaffirmed, to give some usage scenario bands Come the experience having some setbacks.

In order to meet the efficiency of translation on line and the raising demand of accuracy simultaneously, application No. is CN201710203439.8 Chinese invention patent application disclose a kind of multilingual intelligence pretreatment real-time statistics machine translation system, comprising: receive mould Block, preprocessing module, machine translation module and post-processing module.The receiving module includes text language receiving module and voice Recognition result receiving module；The preprocessing module includes Text Pretreatment module and speech recognition result preprocessing module；Machine Device translation module, the machine translation module are used to learn the translation of phrase-for-phrase, and to by preprocessing module processing Phrase finds out corresponding translation phrase, and phrase is connected into complete sentence；Post-processing module, the post-processing module are used It is handled in doing word lattice gauge, capital and small letter standardization and format specificationization to translation result, it is made to be more nearly target language The communicative habits of speech, and exported as final result.However, this system, which solves dynamics for the above-mentioned drawback of the prior art, to be had Limit.

Summary of the invention

In order to further increase the efficiency and accuracy when translation on line, the present invention provides a kind of neural network based Artificial intelligent voice learning method, comprising:

(1) based on using the progress foreign language-Chinese text artificial intelligence study of self-adaptive growth type neural network, the step Include:

(10) word library is established；

(20) voice prediction model is established；

(2) voiced translation of foreign language-Chinese is carried out.

Further, the step (2) includes: that (10) establish word library；

(20) voice prediction model is established；

(30) voice of input is converted into text；

(40) according to the word library and voice prediction model, translation text is determined.

Further, the step (10) include: according to dictionary establish foreign language word and it is corresponding with the foreign language word in The first association between the word of literary meaning, wherein with the first ordinal position mark in dictionary when the translation of Chinese word is multiple The Chinese translation word of knowledge is main Chinese translation word and the Chinese translation word of ordinal position is translated as secondary Chinese later Cliction language.

Further, the step (20) includes:

(201) word cutting is carried out according to foreign language article and obtains foreign language word and according to the Chinese translation word of the foreign language article, Establish the second level word connected after foreign language word and Chinese translation word and the Chinese translation word second is associated with；

(202) the first association is associated with second and is indexed；

Further, the step (201) includes: that machine learning is carried out in a manner of unsupervised learning according to foreign language article.

Further, the step (201) includes: to carry out machine to foreign language article and its translation using stochastic gradient descent method Device study.

Further, the step (202) includes:

It is associated as major key with first, the information relevant to the first association occurred from the second association is indexed.

Further, the information relevant to the first association that is described to be associated as major key with first, occurring from the second association It is indexed and includes:

(2021) major key information determines: assuming that English word Ei corresponds to main Chinese translation word Cj in the first association；And According to the second association, the second level word connected after word Cj constitutes set { Sm, Pm }, then using word Cj as major key, wherein Pm It is that word Sm appears in probability of the Cj later as the second level word connected, i, j and m are the natural number since 1；

(2022) probability of word Cj appearance is defined:

p(S_m|C_j)=χ_gh(p_j),

Wherein

M=1,2,3,4,5,6；AndFor with forMean value, ξ_mFor The m rank diagonal matrix of variance,

(2023) according to Probability p (S_m|C_j) determine when word Cj takes current meaning and the matching degree of context:

It calculatesWherein p ' indicates to carry out difference to p；

It calculatesWhether less than the first preset threshold: when small Yu Shi, the position for determining that j is indicated in Cj meet the corresponding context of Ei, otherwise enable j=j+1, step (2022) are jumped to, if j Its maximum value is reached by traversal, then enable j=1 and is continued step (2024), u and v are natural number；

(2024) when correction second level word of the Sm as the connecting of Cj and the matching degree of context:

It calculatesWhether less than second Preset threshold: it when being less than, determines that the second level word of connecting of the Sm as Cj meets context, otherwise enables m=m+1, jump to step Suddenly (2022) enable m=1 if m reaches its maximum value by traversal.

Further, the step (30) includes:

(301) linear analysis is made to primary speech signal, obtains weighting cepstral coefficients as speech characteristic parameter；

(302) speech model is obtained according to speech characteristic parameter；

(303) voice to be identified is matched with speech model, is searched for using frame synchronous network, to each frame voice An output probability value is determined for different models, is retained mulitpath in the matching process, is finally recalled matching result out；

(304) matched result is distributed with state duration and optimal path probability distribution carries out differentiating that rejection is fallen to know Voice except other range, obtains correct recognition result.

Further, the step (40) includes:

Voice is generated based on STT technology, using Chinese translation word.

The beneficial effect comprise that big data foreign language-Chinese bilingual dictionary that machine learning obtains, base can be passed through Semantic and context matching is carried out in 6 rank depth probability analysis methods, so that way compared with prior art reduces and is more than 40% or more operand, improves translation efficiency while ensuring translation accuracy.

Detailed description of the invention

Fig. 1 shows the flow chart of the method for the present invention.

Specific embodiment

As shown in Figure 1, preferred embodiment in accordance with the present invention, the present invention provides a kind of artificial intelligence neural network based Energy phonetic study method, comprising:

(10) word library is established；

(20) voice prediction model is established；

(2) voiced translation of foreign language-Chinese is carried out.

Preferably, the step (2) includes:

(30) voice of input is converted into text；

Preferably, the step (10) includes: to establish foreign language word and Chinese corresponding with the foreign language word according to dictionary The first association between the word of meaning, wherein with the first ordinal position mark in dictionary when the translation of Chinese word is multiple Chinese translation word be main Chinese translation word and the Chinese translation word of ordinal position is as secondary Chinese translation later Word.

Preferably, the step (20) includes:

(202) the first association is associated with second and is indexed；

Preferably, the step (201) includes: that machine learning is carried out in a manner of unsupervised learning according to foreign language article.

Preferably, the step (201) includes: to carry out machine to foreign language article and its translation using stochastic gradient descent method Study.

Preferably, the step (202) includes:

It is associated as major key with first, the information relevant to the first association occurred from the second association is indexed.The master Key is the major key for indicating the database of text corresponding relationship of foreign language and Chinese.

Preferably, described to be associated as major key with first, from the second association the information relevant to the first association that occurs into Line index includes:

(2022) probability of word Cj appearance is defined:

p(S_m|C_j)=χ_gh(p_j),

Wherein

It calculatesWherein p ' indicates to carry out difference to p；

Preferably, the step (30) includes:

(302) speech model is obtained according to speech characteristic parameter；

Preferably, the step (40) includes:

Based on STT technology, i.e., Speech to Text technology, utilize Chinese translation word to generate voice.

The above-described embodiments merely illustrate the principles and effects of the present invention, and is not intended to limit the present invention.It is any ripe The personage for knowing this technology all without departing from the spirit and scope of the present invention, carries out modifications and changes to above-described embodiment.Cause This, institute is complete without departing from the spirit and technical ideas disclosed in the present invention by those of ordinary skill in the art such as At all equivalent modifications or change, should be covered by the claims of the present invention.

Claims

1. a kind of artificial intelligent voice learning method neural network based, comprising:

(1) based on using the progress foreign language-Chinese text artificial intelligence study of self-adaptive growth type neural network, the step packet It includes:

(10) word library is established；

(20) voice prediction model is established；

(2) voiced translation of foreign language-Chinese is carried out.

2. the method according to claim 1, wherein the step (2) includes:

(30) voice of input is converted into text；

3. the method according to claim 1, wherein the step (10) includes: to establish outer cliction according to dictionary The first association between language and the word of Chinese meaning corresponding with the foreign language word, wherein when the translation of Chinese word is multiple With the Chinese translation word of the first ordinal position mark in dictionary for main Chinese translation word and later in ordinal position Literary translation word is as secondary Chinese translation word.

4. according to the method described in claim 3, it is characterized in that, the step (20) includes:

(201) word cutting is carried out according to foreign language article and obtains foreign language word and according to the Chinese translation word of the foreign language article, foundation The second of the second level word connected after foreign language word and Chinese translation word and the Chinese translation word is associated with；

(202) the first association is associated with second and is indexed.

5. according to the method described in claim 4, it is characterized in that, the step (201) includes: according to foreign language article with no prison It superintends and directs mode of learning and carries out machine learning.

6. according to the method described in claim 4, it is characterized in that, the step (201) includes: using stochastic gradient descent method Machine learning is carried out to foreign language article and its translation.

7. according to the method described in claim 4, it is characterized in that, the step (202) includes:

8. going out from the second association the method according to the description of claim 7 is characterized in that described be associated as major key with first Existing information relevant to the first association, which is indexed, includes:

(2021) major key information determines: assuming that English word Ei corresponds to main Chinese translation word Cj in the first association；And according to Second association, the second level word that word Cj is connected later constitute set { Sm, Pm }, then using word Cj as major key, wherein Pm is word Language Sm appears in probability of the Cj later as the second level word connected, and i, j and m are the natural number since 1；

(2022) probability of word Cj appearance is defined:

p(S_m|C_j)=χ_gh(p_j),

Wherein

For withFor mean value, ξ_mFor the m rank diagonal matrix of variance,

It calculatesWherein p ' indicates to carry out difference to p；

It calculatesWhether less than the first preset threshold: when being less than, The position for determining that j in Cj is indicated meets the corresponding context of Ei, otherwise enables j=j+1, jumps to step (2022), if j by time It goes through and reaches its maximum value, then enable j=1 and continue step (2024), u and v are natural number；

It calculatesIt is whether default less than second Threshold value: it when being less than, determines that the second level word of connecting of the Sm as Cj meets context, otherwise enables m=m+1, jump to step (2022), if m reaches its maximum value by traversal, m=1 is enabled.

9. the method according to claim 1, wherein the step (30) includes:

(302) speech model is obtained according to speech characteristic parameter；

(303) voice to be identified is matched with speech model, is searched for using frame synchronous network, each frame voice is directed to Different models determines an output probability value, retains mulitpath in the matching process, finally recalls matching result out；

(304) matched result is distributed with state duration and optimal path probability distribution carries out differentiating that rejection is fallen to identify model Voice except enclosing obtains correct recognition result.

10. the method according to claim 1, wherein the step (40) includes:

Voice is generated based on STT technology, using Chinese translation word.