CN103474061A - Automatic distinguishing method based on integration of classifier for Chinese dialects - Google Patents

Automatic distinguishing method based on integration of classifier for Chinese dialects Download PDF

Info

Publication number
CN103474061A
CN103474061A CN2013104161737A CN201310416173A CN103474061A CN 103474061 A CN103474061 A CN 103474061A CN 2013104161737 A CN2013104161737 A CN 2013104161737A CN 201310416173 A CN201310416173 A CN 201310416173A CN 103474061 A CN103474061 A CN 103474061A
Authority
CN
China
Prior art keywords
classification
dialect
gmm
chinese
vector
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2013104161737A
Other languages
Chinese (zh)
Inventor
朱贺
高红民
王慧斌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hohai University HHU
Original Assignee
Hohai University HHU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hohai University HHU filed Critical Hohai University HHU
Priority to CN2013104161737A priority Critical patent/CN103474061A/en
Publication of CN103474061A publication Critical patent/CN103474061A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Machine Translation (AREA)

Abstract

The invention discloses an automatic distinguishing method based on integration of a classifier for Chinese dialects. The method includes the four steps of Chinese dialect phonetic feature extraction, dialect model matching marking, classification vector extraction and rear end classification. A mode of two-stage feature extraction is adopted, and a GMM is used as a high-class feature extractor to be used. In the calculation process, phonetic features are sent to the GMM containing dialect phonetic prior knowledge to be marked, normalization and difference calculation is carried out the obtained marks, and classification vectors with the difference degree among high classes and the intra-class polymerization degree are formed. The classification vectors are sent to a rear end SVM classifier to be classified. The technical advantages of data distribution fitting of the GMM and classification surface modeling of an SVM are integrated, and distinguishing for Chinese dialects belonging to which the variety of dialect areas is finally achieved. The method can be used for distinguishing Chinese telephone dialect voice and the like stably and reliably, and is high in accuracy.

Description

Chinese dialect identification method based on Multiple Classifier Fusion
Technical field
The present invention relates to the speech identifying method of multiple Classifiers Combination technology, relate in particular to a kind of Chinese dialects discrimination method, belong to field of voice signal.
Background technology
Chinese dialect identification is the voice that utilize one section input of Computer Analysis, differentiates the voice processing technology in the affiliated dialect of speaker zone.In the such country multi-national, that in many ways say of China, the research that knowledge debated automatically in Chinese dialects is that basis has been set up in the accessible communication between China various nationalities, along with the fast development of China's science and technology, wherein fermenting especially huge using value and wide application prospect.The branch as the Research of Speech Recognition, in research in early days, the Chinese dialects identification system often adopts the layout strategy of single classifier list feature, ignored the application of information fusion in system, make system place one's entire reliance upon a certain sorter and a certain feature, restricted the raising of system performance.
Many information fusion are focuses of current information processing research field, it not only can be more comprehensively, describe objective phenomenon in detail, can also realize the excavation of the deep information.In the speech processes field, the information fusion mode is mainly adopted in two ways: one, many Fusion Features; Two, multiple Classifiers Combination.The former adopts the layout strategy of many feature-single classifiers, by the weighted sum of different characteristic score, makes in a system and uses a plurality of features simultaneously, thereby the decision-making of higher accuracy is provided; The latter adopts the layout strategy of multi-categorizer, will have in complementary Multiple Classifier Fusion to a system, and the difference by different sorters on classification policy realizes multiple classifition integrated classification result.In the research of corresponding Multiple Classifier Fusion, be mostly for the speech recognition with text-dependent at present, and can adapt to the syncretizing mechanism of text-independent speech recognition actually rare.
Summary of the invention
Goal of the invention: for problems of the prior art, the present invention be take two-level classifier as framework, has proposed a set of new Multiple Classifier Fusion mechanism, specifically a kind of Chinese dialect identification method based on Multiple Classifier Fusion.The present invention can better extract different information between class Chinese dialects phonetic feature, and more adapts to and the recognition systems such as the dialect of text-independent, languages identification, significantly improves classification capacity and robustness.
In Multiple Classifier Fusion, the performance of emerging system depends primarily on following 2 points: one, sorter chooses; Two, the design of syncretizing mechanism.Choosing of sorter, usually require multi-categorizer to there is complementarity on classification policy, thereby realize the decision-making that degree of confidence is higher after fusion.Given this, this paper selects production sorter gauss hybrid models (GMM) and deterministic sorter supporting vector machine (SVM) for merging object.As the production sorter, GMM has data fitting ability preferably, can describe preferably the distribution of overall data.But, due to need to be from complete data learning parameter, for the data volume of training set, require too high and cycle of training is longer.Compare, SVM does not possess capability of fitting that data preferably distribute but the state of interpretive classification face comparatively clearly.Therefore, GMM and SVM have the complementary advantage that its fusion can be brought into play to two kinds of sorters on principle.For the design of syncretizing mechanism, can take the rear end mark to merge and two kinds of modes of multistage fusion.The former carries out degree of confidence marking to the decision-making of SVM, and the marking of itself and GMM is weighted and asks, and with this, carries out the classification decision-making; The maker of the latter using GMM as classification vector, generate the classification vector that contains global information and send into SVM and classified.In dialect identification due to the distribution of data too complexity and data volume too huge, should not use SVM classified and give a mark the raw tone feature, during this external mark merges, the selection of weight also has certain difficulty, therefore, the multistage classifier emerging system is adapted to the Chinese dialects Research on Identification more.The two-level classifier of tradition based on GMM, SVM merges the common Fisher of employing kernel function as syncretizing mechanism, and the acoustic information that not only contains dialect phonetic in extracted feature also contains the global information of this dialect, is a kind of senior classification vector.But, wherein also exist many limitations.At first, the mapping space of Fisher kernel function exists the hidden danger of dimension disaster, is difficult to meet big data quantity and speech recognition text-independent.Secondly, for same speech primitive, certain correlativity is arranged between the marking of different dialect models, as shown in table 1, and plant the class representativeness that correlativity has affected classification vector.Finally, for the dialect identification, we expect that characteristic of division embodies difference between the class of dialect, and different dialect models are to the otherness between one section speech score.
The marking of the different dialect models of table .1 to speech primitive
Figure BDA0000381213120000021
Technical scheme: a kind of Chinese dialect identification method based on Multiple Classifier Fusion, select production sorter gauss hybrid models (GMM) and deterministic sorter supporting vector machine (SVM) for merging object, production sorter gauss hybrid models is the production probability statistics model, and its probability density computing formula is:
P ( x | W n ) = Σ i = 1 k w ni 1 ( 2 π ) N | Σ ni | 1 / 2 exp ( - 1 2 ( x - μ ni ) T Σ ni - 1 ( x - μ ni ) ) - - - ( 1 )
Wherein, the acoustic feature that X is a speech primitive, w ni, μ ni, Σ nirepresent respectively weight, average and the covariance matrix of each Gaussian Mixture unit in dialect GMM, k is for mixing first dimension.Input Chinese dialects signal carries out speech feature extraction, in the leaching process of new characteristic of division, at first utilizes the GMM of known training sample set training dialect; Then speech data is input in the various dialect GMM that design, speech primitive is carried out to likelihood marking, form mark vector [P (x i| μ 1Σ 1) P (x i| μ 2Σ 2) ... P (x i| μ nΣ n)], realize the mapping from the raw tone feature space to minute number space.Finally this vector is carried out to normalized and calculus of differences.Its calculation procedure is as follows:
One, the score of voice is carried out to normalized:
SV i=(1/C i)·[P(x i1Σ 1)P(x i2Σ 2)…P(x iNΣ N)] (2)
C wherein ibe normalized factor, get in literary composition: C i = max n ( P ( x i | μ n Σ n ) ) , n = 1 . . . N
Two, calculate the mark difference:
φ′(x i)=[(SV i1-SV i2)(SV i1-SV i3)…(SV i1-SV iN),(SV i2-SV i3)(SV i2-SV i4)
…(SV i2-SV iN),…,(SV iN-1-SV iN)] (3)
Subsequently, based on training classification vector training svm classifier device.
The present invention adopts technique scheme, has following beneficial effect: it can well solve Fisher kernel function existing problem in the classification vector design, embodies different information between class simultaneously, more adapts to the speech recognition work such as dialect, Language Identification.
The accompanying drawing explanation
The method flow diagram that Fig. 1 is the embodiment of the present invention;
The distribution plan that Fig. 2 is the primary features vector classification vector in the embodiment of the present invention.
Embodiment
Below in conjunction with specific embodiment, further illustrate the present invention, should understand these embodiment only is not used in and limits the scope of the invention for the present invention is described, after having read the present invention, those skilled in the art all fall within the application's claims limited range to the modification of the various equivalent form of values of the present invention.
As shown in Figure 1, Chinese dialect identification method based on Multiple Classifier Fusion, select production sorter gauss hybrid models (GMM) and deterministic sorter supporting vector machine (SVM) for merging object, production sorter gauss hybrid models is the production probability statistics model, it can roughly describe out the global information in data space, and its probability density computing formula is:
P ( x | W n ) = Σ i = 1 k w ni 1 ( 2 π ) N | Σ ni | 1 / 2 exp ( - 1 2 ( x - μ ni ) T Σ ni - 1 ( x - μ ni ) ) - - - ( 1 )
Wherein, the acoustic feature that X is a speech primitive, w ni, μ ni, Σ nirepresent respectively weight, average and the covariance matrix of each Gaussian Mixture unit in dialect GMM, k is for mixing first dimension.In the leaching process of new characteristic of division, at first utilize the GMM of known training sample set training dialect.Then speech data is input in the various dialect GMM that design, speech primitive is carried out to likelihood marking, form mark vector [P (x i| μ 1Σ 1) P (x i| μ 2Σ 2) ... P (x i| μ nΣ n)], realize the mapping from the raw tone feature space to minute number space.Finally this vector is carried out to normalized and calculus of differences.Its calculation procedure is as follows:
One, the score of voice is carried out to normalized:
SV i=(1/C i)·[P(x i1Σ 1)P(x i2Σ 2)…P(x iNΣ N)] (2)
C wherein ibe normalized factor, get in literary composition: C i = max n ( P ( x i | μ n Σ n ) ) , n = 1 . . . N
Two, calculate the mark difference:
φ′(x i)=[(SV i1-SV i2)(SV i1-SV i3)…(SV i1-SV iN),(SV i2-SV i3)(SV i2-SV i4)
…(SV i2-SV iN),…,(SV iN-1-SV iN)] (3)
In this fusion process, the impact of correlativity between the marking of having mentioned in having reduced above by normalized on the one hand on discrimination, different information between the class of the proposition dialect of the mark calculus of differences success after by different dialect GMM, a certain primitive being given a mark on the other hand, make
Figure BDA0000381213120000042
(x i) in not only include acoustic information, global information and also contain different information between the dialect class.Distribution (as shown in Figure 2) by Wu Fangyan, Guangdong dialect, three kinds of dialect classification vector of Fujian dialect, can see that the dialect phonetic classification vector of the same race of comparing with primitive character under new syncretizing mechanism has embodied difference effect between stronger cluster and class, is more applicable for dialect identification work.
Due to svm classifier device classification capacity and outstanding generalization preferably, in merging, the device of multiclass classification usually select SVM as the rear end sorter.The Chinese dialects identification is a multi-class classification problem in essence, and the solution of this problem at present is mainly to adopt the classification strategies such as decision Tree algorithms, " one to one " classification policy, " one-to-many ".But, due to the complicacy that multi-class sample data distributes, experimental results demonstrate that the identification system based on above strategy is unsatisfactory when processing multi-class classification problem.The present invention adopts the ECOC algorithm, and this algorithm is treated the sub-category binary-coding that carries out, and usings this label as classification.In the process of coding, in algorithm requirement code matrix, between the code word of each row and column, to keep independence and separability.Accordingly, the ECOC algorithm requires when 3≤k≤7, and the maximum length of code book should be 2 k-1-1 dimension, wherein k is the classification number.Its coding rule is: the first row contains unit vector, the second row code book be by 2 k-2individual 0 and 2 k-2individual 1 alternately forms, and by that analogy, the capable code book of i is by 2 k-iindividual 0 and 2 k-iindividual 1 alternately forms.Suppose take that 4 class problems are object of classification, just need the code book of 7 dimensions to carry out code Design, as shown in table 1, the vector of wherein going is the coding of ECOC algorithm for each classification.Class label design category device according to the column vector in code word matrix, obtain f 1, f 2... f nn≤2 k-1-1.In test process, this algorithm is at first to inputting voice according to f 1, f 2... f nclassifying rules is classified, and then according to classification results, unknown voice is encoded, and designs the code book of these voice, finally itself and known classification code book is complementary, and realizes decision-making.The ECOC algorithm is used and carries out matching measurement based on Hamming distance from nearest neighbor algorithm, has certain fault-tolerance, and these characteristics are particularly important in multi-class classification.This paper is used the ECOC algorithm to realize the identification of multiclass dialect, as shown in table 2.
Table .2 classification coding
Figure BDA0000381213120000051
During training, adopt the training data voice, train respectively 128 dimension GMM models of different dialects, and output is to the likelihood marking of every section training utterance (15s).By normalization and Difference Calculation, generate the training classification vector subsequently.Subsequently, based on training classification vector training svm classifier device.
During test, the Chinese dialects speech data of input, according to above-mentioned flow process, is sent in the GMM model and is given a mark, and extract classification vector, classified.
So far complete the identification of Chinese dialects.

Claims (4)

1. the Chinese dialect identification method based on Multiple Classifier Fusion, it is characterized in that: select GMM and SVM for merging object, input Chinese dialects signal carries out speech feature extraction, in the leaching process of new characteristic of division, at first utilizes the GMM of known training sample set training dialect; Then speech data is input in the GMM of the various dialects that design, speech primitive is carried out to likelihood marking, form mark vector [P (x i| μ 1Σ 1) P (x i| μ 2Σ 2) ... P (x i| μ nΣ n)], realize the mapping from the raw tone feature space to minute number space; Secondly this mark vector is carried out to normalized and calculus of differences; Subsequently, based on training classification vector training svm classifier device.
2. the Chinese dialect identification method based on Multiple Classifier Fusion as claimed in claim 1, it is characterized in that: GMM is the production probability statistics model, its probability density computing formula is:
P ( x | W n ) = Σ i = 1 k w ni 1 ( 2 π ) N | Σ ni | 1 / 2 exp ( - 1 2 ( x - μ ni ) T Σ ni - 1 ( x - μ ni ) ) - - - ( 1 )
Wherein, the acoustic feature that X is a speech primitive, w ni, μ ni, Σ nirepresent respectively weight, average and the covariance matrix of each Gaussian Mixture unit in dialect GMM, k is for mixing first dimension.
3. the Chinese dialect identification method based on Multiple Classifier Fusion as claimed in claim 1 is characterized in that: described mark vector is carried out to normalized and calculus of differences is calculated as follows:
One, the score of voice is carried out to normalized:
SV i=(1/C i)·[P(x i1Σ 1)P(x i2Σ 2)…P(x iNΣ N)] (2)
C wherein ibe normalized factor, get: C i = max n ( P ( x i | μ n Σ n ) ) , n = 1 . . . N ;
Two, calculate the mark difference:
φ′(x i)=[(SV i1-SV i2)(SV i1-SV i3)…(SV i1-SV iN),(SV i2-SV i3)(SV i2-SV i4)
…(SV i2-SV iN),…,(SV iN-1-SV iN)] (3)。
4. the Chinese dialect identification method based on Multiple Classifier Fusion as claimed in claim 1 is characterized in that: in training classification vector training svm classifier device, adopt the ECOC algorithm to treat the sub-category binary-coding that carries out, using this label as classification; In the process of coding, in requirement code matrix, between the code word of each row and column, to keep independence and separability; When 3≤k≤7, the maximum length of code book should be 2 k-1-1 dimension, wherein k is the classification number; Coding rule is: the first row contains unit vector, the second row code book be by 2 k-2individual 0 and 2 k-2individual 1 alternately forms, and by that analogy, the capable code book of i is by 2 k-iindividual 0 and 2 k-iindividual 1 alternately forms; Suppose take that 4 class problems are object of classification, just need the code book of 7 dimensions to carry out code Design, the row vector is the coding of ECOC algorithm for each classification; Class label design category device according to the column vector in code word matrix, obtain f 1, f 2... f nn≤2 k-1-1; In test process, this algorithm is at first to inputting voice according to f 1, f 2... f nclassifying rules is classified, and then according to classification results, unknown voice is encoded, and designs the code book of these voice, finally itself and known classification code book is complementary.
CN2013104161737A 2013-09-12 2013-09-12 Automatic distinguishing method based on integration of classifier for Chinese dialects Pending CN103474061A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2013104161737A CN103474061A (en) 2013-09-12 2013-09-12 Automatic distinguishing method based on integration of classifier for Chinese dialects

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2013104161737A CN103474061A (en) 2013-09-12 2013-09-12 Automatic distinguishing method based on integration of classifier for Chinese dialects

Publications (1)

Publication Number Publication Date
CN103474061A true CN103474061A (en) 2013-12-25

Family

ID=49798882

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2013104161737A Pending CN103474061A (en) 2013-09-12 2013-09-12 Automatic distinguishing method based on integration of classifier for Chinese dialects

Country Status (1)

Country Link
CN (1) CN103474061A (en)

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104036774A (en) * 2014-06-20 2014-09-10 国家计算机网络与信息安全管理中心 Method and system for recognizing Tibetan dialects
CN104766607A (en) * 2015-03-05 2015-07-08 广州视源电子科技股份有限公司 Television program recommendation method and system
CN105654102A (en) * 2014-11-10 2016-06-08 富士通株式会社 Data processing device and data processing method
CN105810191A (en) * 2016-03-08 2016-07-27 江苏信息职业技术学院 Prosodic information-combined Chinese dialect identification method
CN107452379A (en) * 2017-08-17 2017-12-08 广州腾猴科技有限公司 The identification technology and virtual reality teaching method and system of a kind of dialect language
CN108231063A (en) * 2016-12-13 2018-06-29 中国移动通信有限公司研究院 A kind of recognition methods of phonetic control command and device
CN108461091A (en) * 2018-03-14 2018-08-28 南京邮电大学 Intelligent crying detection method towards domestic environment
CN108877784A (en) * 2018-09-05 2018-11-23 河海大学 A kind of robust speech recognition methods based on accents recognition
CN109461457A (en) * 2018-12-24 2019-03-12 安徽师范大学 A kind of audio recognition method based on SVM-GMM model
CN111179916A (en) * 2019-12-31 2020-05-19 广州市百果园信息技术有限公司 Re-scoring model training method, voice recognition method and related device
CN113192491A (en) * 2021-04-28 2021-07-30 平安科技(深圳)有限公司 Acoustic model generation method and device, computer equipment and storage medium
CN113673643A (en) * 2021-08-19 2021-11-19 江苏农牧人电子商务股份有限公司 Method and system for supervising agricultural product supply
CN114360500A (en) * 2021-09-14 2022-04-15 腾讯科技(深圳)有限公司 Speech recognition method and device, electronic equipment and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1787075A (en) * 2005-12-13 2006-06-14 浙江大学 Method for distinguishing speek speek person by supporting vector machine model basedon inserted GMM core
CN101894548A (en) * 2010-06-23 2010-11-24 清华大学 Modeling method and modeling device for language identification
US20110196678A1 (en) * 2007-08-22 2011-08-11 Nec Corporation Speech recognition apparatus and speech recognition method
CN103077709A (en) * 2012-12-28 2013-05-01 中国科学院声学研究所 Method and device for identifying languages based on common identification subspace mapping

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1787075A (en) * 2005-12-13 2006-06-14 浙江大学 Method for distinguishing speek speek person by supporting vector machine model basedon inserted GMM core
US20110196678A1 (en) * 2007-08-22 2011-08-11 Nec Corporation Speech recognition apparatus and speech recognition method
CN101894548A (en) * 2010-06-23 2010-11-24 清华大学 Modeling method and modeling device for language identification
CN103077709A (en) * 2012-12-28 2013-05-01 中国科学院声学研究所 Method and device for identifying languages based on common identification subspace mapping

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
徐颖: "语种识别声学建模方法研究", 《中国科学技术大学 硕士论文》 *
顾明亮 夏玉果 张长水;: "基于支撑矢量机的汉语方言辨识", 《计算机工程与应用》 *

Cited By (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104036774A (en) * 2014-06-20 2014-09-10 国家计算机网络与信息安全管理中心 Method and system for recognizing Tibetan dialects
CN104036774B (en) * 2014-06-20 2018-03-06 国家计算机网络与信息安全管理中心 Tibetan dialect recognition methods and system
CN105654102A (en) * 2014-11-10 2016-06-08 富士通株式会社 Data processing device and data processing method
CN104766607A (en) * 2015-03-05 2015-07-08 广州视源电子科技股份有限公司 Television program recommendation method and system
CN105810191B (en) * 2016-03-08 2019-11-29 江苏信息职业技术学院 Merge the Chinese dialects identification method of prosodic information
CN105810191A (en) * 2016-03-08 2016-07-27 江苏信息职业技术学院 Prosodic information-combined Chinese dialect identification method
CN108231063A (en) * 2016-12-13 2018-06-29 中国移动通信有限公司研究院 A kind of recognition methods of phonetic control command and device
CN107452379B (en) * 2017-08-17 2021-01-05 广州腾猴科技有限公司 Dialect language identification method and virtual reality teaching method and system
CN107452379A (en) * 2017-08-17 2017-12-08 广州腾猴科技有限公司 The identification technology and virtual reality teaching method and system of a kind of dialect language
CN108461091A (en) * 2018-03-14 2018-08-28 南京邮电大学 Intelligent crying detection method towards domestic environment
CN108877784A (en) * 2018-09-05 2018-11-23 河海大学 A kind of robust speech recognition methods based on accents recognition
CN109461457A (en) * 2018-12-24 2019-03-12 安徽师范大学 A kind of audio recognition method based on SVM-GMM model
CN111179916A (en) * 2019-12-31 2020-05-19 广州市百果园信息技术有限公司 Re-scoring model training method, voice recognition method and related device
CN111179916B (en) * 2019-12-31 2023-10-13 广州市百果园信息技术有限公司 Training method for re-scoring model, voice recognition method and related device
CN113192491A (en) * 2021-04-28 2021-07-30 平安科技(深圳)有限公司 Acoustic model generation method and device, computer equipment and storage medium
CN113192491B (en) * 2021-04-28 2024-05-03 平安科技(深圳)有限公司 Acoustic model generation method, acoustic model generation device, computer equipment and storage medium
CN113673643A (en) * 2021-08-19 2021-11-19 江苏农牧人电子商务股份有限公司 Method and system for supervising agricultural product supply
CN114360500A (en) * 2021-09-14 2022-04-15 腾讯科技(深圳)有限公司 Speech recognition method and device, electronic equipment and storage medium

Similar Documents

Publication Publication Date Title
CN103474061A (en) Automatic distinguishing method based on integration of classifier for Chinese dialects
CN103400577B (en) The acoustic model method for building up of multilingual speech recognition and device
CN105244029B (en) Voice recognition post-processing method and system
CN108763510A (en) Intension recognizing method, device, equipment and storage medium
CN103177733B (en) Standard Chinese suffixation of a nonsyllabic "r" sound voice quality evaluating method and system
CN106776538A (en) The information extracting method of enterprise's noncanonical format document
Shum et al. On the use of spectral and iterative methods for speaker diarization
CN105261367B (en) A kind of method for distinguishing speek person
CN106503805A (en) A kind of bimodal based on machine learning everybody talk with sentiment analysis system and method
CN102664011B (en) Method for quickly recognizing speaker
CN104200814B (en) Speech-emotion recognition method based on semantic cell
CN106202032A (en) A kind of sentiment analysis method towards microblogging short text and system thereof
CN103578481B (en) A kind of speech-emotion recognition method across language
CN110992988B (en) Speech emotion recognition method and device based on domain confrontation
CN105205124A (en) Semi-supervised text sentiment classification method based on random feature subspace
CN105912525A (en) Sentiment classification method for semi-supervised learning based on theme characteristics
Garg et al. Speech based Emotion Recognition based on hierarchical decision tree with SVM, BLG and SVR classifiers
DeMarco et al. Native accent classification via i-vectors and speaker compensation fusion.
CN109036454A (en) The isolated method and system of the unrelated single channel recording of speaker based on DNN
Jarvis Data mining with learner corpora
CN107463703A (en) English social media account number classification method based on information gain
CN105609116A (en) Speech emotional dimensions region automatic recognition method
CN107885723A (en) Conversational character differentiating method and system
CN105095196A (en) Method and device for finding new word in text
CN111159332A (en) Text multi-intention identification method based on bert

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20131225