CN106569998A - Text named entity recognition method based on Bi-LSTM, CNN and CRF - Google Patents

Text named entity recognition method based on Bi-LSTM, CNN and CRF Download PDF

Info

Publication number
CN106569998A
CN106569998A CN201610959519.1A CN201610959519A CN106569998A CN 106569998 A CN106569998 A CN 106569998A CN 201610959519 A CN201610959519 A CN 201610959519A CN 106569998 A CN106569998 A CN 106569998A
Authority
CN
China
Prior art keywords
vector
character
word
lstm
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610959519.1A
Other languages
Chinese (zh)
Inventor
汤斯亮
吴飞
张宁
戴洪良
庄越挺
张寅�
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang University ZJU
Original Assignee
Zhejiang University ZJU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang University ZJU filed Critical Zhejiang University ZJU
Priority to CN201610959519.1A priority Critical patent/CN106569998A/en
Publication of CN106569998A publication Critical patent/CN106569998A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • G06F40/295Named entity recognition
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Abstract

The invention discloses a text named entity recognition method based on Bi-LSTM, CNN and CRF. The method includes the following steps: (1) using a convolutional nerve network to encode and convert information on text word character level to a character vector; (2) combining the character vector and word vector into a combination which, as an input, is transmitted to a bidirectional LSTM neural network to build a model for contextual information of every word; and (3) in the output end of the LSTM neural network, utilizing continuous conditional random fields to carry out label decoding to a whole sentence, and mark the entities in the sentence. The invention is an end-to-end model without the need of data pre-processing in the un-marked corpus with the exception of the pre-trained word vector, therefore the invention can be widely applied for statement marking of different languages and fields.

Description

A kind of text based on Bi-LSTM, CNN and CRF names entity recognition method
Technical field
The present invention relates to natural language processing, more particularly to it is a kind of based on two-way LSTM neutral nets, convolutional neural networks Entity recognition method is named with the text of condition random field (CRF).
Background technology
Natural language processing (Nature Language Processing, abbreviation NLP) is a collection linguistics and calculating The cross discipline that machine science is integrated.Name Entity recognition (Named Entity Recognition, abbreviation NER) is nature language A basic task in speech process, it is intended to identify the proper noun and significant numeral classifier phrase in natural language text, And classified.With the rise of information extraction and big data concept, name Entity recognition task is increasingly subject to people's attention, Become the important component part of the natural language processings such as the analysis of public opinion, information retrieval, automatic question answering, machine translation.How from sea Automatically identify name entity in the internet text message of amount, accurately and rapidly, be increasingly becoming academia and industrial quarters is closed The hot issue of note.
High performance name entity recognition method is normally applied condition random field (CRF), SVM and perceptron to be come to manual place The feature of reason is modeled.Some researchers propose a kind of efficient neutral net mould for only needing to a small amount of Feature Engineering Type, the model need not from a large amount of unlabeled data learnings to primary word vector characteristics, but by unsupervised learning Method extracted vector feature from mass data.
But this model has many restrictions.First, the model make use of a simple feedforward neural network, the network The contextual information of each word is limited among the window of regular length, this method is abandoned between long range word Information;Secondly because only rely on term vector information, the method can not probe into word character level characteristics, such as prefix Information and suffix information etc., these information are often very useful, especially in the new word of the very poor term vector expression of training effect In.We seek a kind of significantly more efficient neural network model that can solve the problem that the problems referred to above.
Recurrent neural network (Recurrent neural network, abbreviation RNN) can process the input of variable-length Vector, and can have memory over a long time in processing procedure.In recent years, RNNs various NLP tasks (such as speech recognition, Machine translation and Language Modeling etc.) there is huge success in process.Shot and long term with forget gate memory artificial neuron The important dependence of long-distance dependence can easily be learned for network (Long-short term memory, abbreviation LSTM) Practise.
It is upper for the uncertain length of word for the sequence mark task of name Entity recognition and speech recognition etc. Hereafter problem, the restricted problem of context in other words, two-way LSTM (Bi-LSTM) neutral net is efficient:When LSTMs is in life In name Entity recognition task by past information to learn when, the restriction of computing capability and the quality of term vector can limit them Efficiency.
Convolutional neural networks (Convolutional neural network, abbreviation CNN) are equally used in NLP tasks The information of character aspect is modeled, and existing Successful utilization CNN extracts character feature and applies features to name in fact The example gone in body identification mission.
The content of the invention
The invention aims to the entity in natural text is detected and is labeled, to obtain effective text This information, proposes that a kind of text name based on two-way LSTM neutral nets, convolutional neural networks and condition random field (CRF) is real Body recognition methods.
Text based on Bi-LSTM, CNN and CRF names entity recognition method, comprises the steps:
(1) data prediction is carried out according to text data, subordinate sentence, participle is carried out to text, obtain character feature vector;
(2) character feature vector is carried out to each word using convolutional neural networks to extract;
(3) character feature vector is combined with term vector, LSTM neutral nets is passed to, using two-way LSTM neutral nets Read statement information characteristics are obtained to train;
(4) for semantic feature resulting in (3), entity mark is carried out to each word using condition random field, Mark the entity information in statement sequence.
Above steps can specifically using mode is implemented as described below:
Data prediction is carried out to text data, subordinate sentence, participle are carried out to text, the step for obtaining character feature vector can It is implemented as follows:
(1) subordinate sentence and participle are carried out to document using natural language processing instrument so that document is with sentence and one per Token is presented;
(2) sentence, word and the label to obtaining in previous step (1) is counted, and forms sentence table, vocabulary and label Table;
(3) character in the word list in previous step (1) is counted, forms character list;
(4) character feature vector is formed using the good term vector of existing pre-training and character list.
Described utilization convolutional neural networks are carried out to each word can be specifically real the step of character feature vector is extracted It is now as follows:
(1) C is made to be character list, d is the dimension of each character vector, and character vector matrix is:Q∈Rd×|C|
(2) word k ∈ V are preset by continuous character [c1,c2,...,cl] composition, wherein l for word k length, then k Character vector matrix is by Ck∈Rd×lBe given, wherein i-th is classified as character ciVector;
(3) in CkWith the kernel H ∈ R that a width is wd×wBetween realize convolutional layer, it is inclined to convolutional layer addition after this Value bias is put, and whole convolution results are carried out nonlinear regression to realize Feature Mapping map fk∈Rl-w+1, wherein, map letter Number fkI-th element fk[i] is given by (1) formula;
fk[i]=tanh (<Ck[*,i:i+w-1],H>+b) (1)
Wherein * be all line numbers, Ck[*,i:I+w-1] it is CkIn i-th row to the i-th+w-1 row,<,>For in Frobenius Product, b is bias vector;
(4) it is last, withExpress as the character pair of kernel H.
By character feature vector combine with term vector, be passed to LSTM neutral nets, using two-way LSTM neutral nets come The step of training obtains read statement information characteristics is as follows:
(1) x is definedtFor the input character feature vector of t, htIt is the hidden layer that all useful informations are stored in t State vector, σ is that sigmoid returns layer, and * is inner product, Ui,Uf,Uc,UoTo be directed to input x under different conditionstWeight matrix, Wi,Wf,Wc,WoTo hide layer state htWeight matrix, bi,bf,bc,boFor bias vector;
(2) t forget gate calculating as shown in (2) formula:
ft=σ (Wfht-1+Ufxt+bf) (2)
(3) h is updated in tt-1All information of middle storage, computing formula is as shown in (3), (4) formula:
it=σ (Wiht-1+Uixt+bi) (3)
WhereinFor the vector that t be introduced into cell state;
(4) information updating for storing the t-1 moment in t is the storage information of t, and computing formula is formula (5):
WhereinFor the vector of t cell state;
(5) it is shown in the output such as formula (6) of t, and update ht, computing formula such as formula (7):
ot=σ (Woht-1+Uoxt+bo) (6)
ht=ot*tanh(Ct) (7)
Wherein otFor the output of t;htFor the vector of t hidden layer;
(6) h in above-mentioned stepstAll information of storage last time, with same method a g is arranged againtFor depositing Following information of storage, most latter two hidden layer information forms last output vector by cascading.
Entity mark is carried out to each word using condition random field, marking the entity information in statement sequence can have Body realizes that step is as follows:
(1) with z={ z1,z2,...,znRepresent list entries, wherein n for list entries length, ziFor i-th word Input vector, y={ y1,y2,...,ynFor z reality output sequence label, Y (z)={ y'1,y'2,...,y'nFor z can The output label sequence of energy;
(2) bar of the condition random field probabilistic model definition for given list entries z on be possible to sequence label y' Part Probability p (y | z;W,b):
Wherein,For potential gain function, W is weight vectors, and b is bias vector;And by',yRespectively with label to (y', y) corresponding weight vectors and bias vector;
(3) condition random field (CRF) training stage, use maximal condition Likelihood estimation select cause likelihood ratio L (W, B) maximum parameter, for training set { (zi,yi), the logarithm of likelihood ratio is:
(4) find the y of highest conditional probability in sequence label to carry out label for labelling to sequence:
Present invention beneficial effect compared with prior art:
A kind of combined with two-way LSTM neutral nets using convolutional neural networks method of the present invention come to character feature and Word feature is extracted, contrast and traditional method such as hidden Markov model, the present invention using vocabulary and context according to Bad relation, and carry out Probability statistic abstraction on this basis, so as to obtain text message in entity information;Using nerve net The mode that network is combined with condition random field, application scenarios extensively, only need to be trained one time for the corresponding different pieces of information of different problems, With regard to can solve the problem that multi-field name entity mark problem, possess advantage end to end.
Description of the drawings
Fig. 1 is the committed step workflow diagram that the text based on Bi-LSTM, CNN and CRF names Entity recognition;
Fig. 2 is convolutional neural networks structural representation;
Fig. 3 is two-way LSTM neutral nets schematic diagram.
Fig. 4 is based on the Named Entity Extraction Model training process algorithm of two-way LSTM and CRF.
Specific embodiment
The present invention is further elaborated with reference to the accompanying drawings and detailed description.
Present invention is generally directed to name Entity recognition task by above neural network ensemble, present a kind of based on two-way The mixed model of LSTM, CNN, the model can be while learning character characteristic vector and term vector, on this basis arrive study Characteristic vector label for labelling is carried out by condition random field.Thus obtaining the name Entity recognition result of efficiently and accurately. As shown in figure 1, be the committed step workflow diagram of the text name Entity recognition based on Bi-LSTM, CNN and CRF, wherein often The character list of individual word reaches and is completed by the convolutional neural networks structure of Fig. 2;Character feature vector is with term vector after cascade Result be input in two-way LSTM neutral nets, wherein LSTM neutral nets as shown in Figure 3, in Fig. 1 dotted line represent input There is one layer of dropout layer (preventing neutral net from producing over-fitting) in vector sum output vector.
The text based on Bi-LSTM, CNN and CRF of the present invention names entity recognition method, its concrete steps (1)~(4) It is as follows:
(1) data prediction is carried out according to text data, subordinate sentence, participle is carried out to text, obtain character feature vector.Tool Body step is as follows:
(1.1) subordinate sentence and participle are carried out to document using natural language processing instrument so that document is with sentence and per one Individual word (token) is presented;
(1.2) sentence, word and label are counted, forms sentence table, vocabulary and label list;
(1.3) character in word list is counted, forms character list;
(1.4) character feature vector is formed using the good term vector of existing pre-training and character list.
(2) character feature vector is carried out to each word using convolutional neural networks to extract.Comprise the following steps that:
(2.1) C is made to be character list, d is the dimension of each character vector, and character vector matrix is:Q∈Rd×C
(2.2) word k ∈ V are preset by continuous character [c1,c2,...,cl] composition, wherein l is the length of word k, then k Character vector matrix by Ck∈Rd×lBe given, wherein i-th is classified as character ciVector;
(2.3) in CkWith the kernel H ∈ R that a width is wd×wBetween realize convolutional layer, after this give convolutional layer addition Bias bias, and whole convolution results are carried out nonlinear regression to realize Feature Mapping map fk∈Rl-w+1, wherein, mapping Function fkI-th element fk[i] is given by (1) formula;
fk[i]=tanh (<Ck[*,i:i+w-1],H>+b) (1)
Wherein * be all line numbers, Ck[*,i:I+w-1] it is CkIn i-th row to the i-th+w-1 row,<,>For in Frobenius Product, b is bias vector;
(2.4) it is last, withExpress as the character pair of kernel H.
(3) character feature vector is combined with term vector, is passed to LSTM neutral nets.As shown in Figure 4, using two-way LSTM neutral nets obtain read statement information characteristics to train.Comprise the following steps that:
(3.1) x is definedtFor the input character feature vector of t, htIt is to store hiding for all useful informations in t Layer state vector, σ is that sigmoid returns layer, and * is inner product, Ui,Uf,Uc,UoTo be directed to input x under different conditionstWeight square Battle array, Wi,Wf,Wc,WoTo hide layer state htWeight matrix, bi,bf,bc,boFor bias vector;
(3.2) t forget gate calculating as shown in (2) formula:
ft=σ (Wfht-1+Ufxt+bf) (2)
(3.3) h is updated in tt-1All information of middle storage, computing formula is as shown in (3), (4) formula:
it=σ (Wiht-1+Uixt+bi) (3)
WhereinFor the vector that t be introduced into cell state;
(3.4) information updating for storing the t-1 moment in t is the storage information of t, and computing formula is formula (5):
WhereinFor the vector of t cell state;
(3.5) it is shown in the output such as formula (6) of t, and update ht, computing formula such as formula (7):
ot=σ (Woht-1+Uoxt+bo) (6)
ht=ot*tanh(Ct) (7)
Wherein otFor the output of t;htFor the vector of t hidden layer;
(3.6) h in above-mentioned stepstAll information of storage last time, with same method a g is arranged againtFor Following information of storage, most latter two hidden layer information forms last output vector by cascading.
(4) for semantic feature resulting in (3), entity mark is carried out to each word using condition random field, Mark the entity information in statement sequence.Comprise the following steps that:
(4.1) with z={ z1,z2,...,znRepresent list entries, wherein n for list entries length, ziFor i-th word Input vector, y={ y1,y2,...,ynFor z reality output sequence label, Y (z)={ y'1,y'2,...,y'nFor z's Possible output label sequence;
(4.2) condition random field probabilistic model is defined for given list entries z is on be possible to sequence label y' Conditional probability p (y | z;W,b):
Wherein,For potential gain function, W is weight vectors, and b is bias vector;And by',yRespectively with label to (y', y) corresponding weight vectors and bias vector;
(4.3) condition random field (CRF) training stage, maximal condition Likelihood estimation is used to select to cause likelihood ratio L (W, b) maximum parameter, for training set { (zi,yi), the logarithm of likelihood ratio is:
(4.4) find the y of highest conditional probability in sequence label to carry out label for labelling to sequence:
Embodiment
By taking the document of New York Times English news as an example, text name entity is carried out during said method is applied in text and is known Not, design parameter and way are as follows in each step:
1. subordinate sentence and participle are carried out to document using natural language processing instrument so that document each word is a line, sentence It is spaced apart with space between sentence;
2. the sentence in 1, word and label counted respectively, sentence table, vocabulary and label list are formed, in Training document Label has " PER (name) " " LOC (place name) " " ORG (tissue) " " FAC (mechanism) " and " GPE (geopolitical name) " five classes, surveys Label is " * " in examination document, Jing statistics, the total sentence 17 of document, word 466 in 1;
3. word list enters line character statistics in pair 1, forms character list C;
4. using the dimensional vectors of GloVe 100 composition 100 in combination with vocabulary in 2 disclosed in 600,000,000 Stamfords for having trained Dimension term vector;
Character feature vector is carried out to each word with convolutional neural networks to extract:
5. make C be character list, d is the dimension of each character vector, and character vector matrix is:Q∈Rd×|C|
6. assume word k ∈ V by continuous character [c1,c2,...,cl] composition, wherein l for word k length, then k Character vector matrix is by Ck∈Rd×lBe given, wherein i-th is classified as character ciVector;
7. in CkWith the kernel H ∈ R that a width is wd×wBetween realize convolutional layer, it is inclined to convolutional layer addition after this Value bias is put, and whole convolution results are carried out nonlinear regression to realize Feature Mapping map fk∈Rl-w+1, especially, fk's I-th element is given by (1) formula:
fk[i]=tanh (<Ck[*,i:i+w-1],H>+b) (1)
Wherein * be all line numbers, Ck[*,i:I+w-1] it is CkIn i-th row to the i-th+w-1 row,<A,B>=Tr (ABT) be Frobenius inner products.
8. withExpress as the character pair of kernel H;
Using term vector resulting in character vector feature and 4 resulting in 8 as being input to two-way LSTM nerve nets In network:
9. the mode that two-way LSTM neural network parameters update is that, with 30 as batchsize, 0.9 is the boarding steps of momentum term Degree descent algorithm updates, and initialization learning rate is 0.015, and after each iteration, learning rate more new formula is
ηt0(1+ρt) (1)
Wherein retardation rate ρ=0.05, t is the number of times for completing iteration;
10. x is definedtFor the input character feature vector of t, htIt is the hidden layer that all useful informations are stored in t State vector, σ is that sigmoid returns layer, and * is inner product, Ui,Uf,Uc,UoTo be directed to input x under different conditionstWeight matrix, Wi,Wf,Wc,WoTo hide layer state htWeight matrix, bi,bf,bc,boFor bias vector;
11. t forget gate calculating as shown in (2) formula:
ft=σ (Wfht-1+Ufxt+bf) (2)
12. update h in tt-1All information of middle storage, computing formula is as shown in (3), (4) formula:
it=σ (Wiht-1+Uixt+bi) (3)
13. information updatings for storing the t-1 moment in t are the storage information of t, and computing formula is formula (5):
14. is shown in the output such as formula (6) of t, and updates ht, computing formula such as formula (7):
ot=σ (Woht-1+Uoxt+bo) (6)
ht=ot*tanh(Ct) (7)
H in 15. above-mentioned stepstAll information of storage last time, with same method a g is arranged againtFor depositing Following information of storage, most latter two hidden layer information forms last output vector by cascading.
Entity mark is carried out to each word using condition random field (CRF):
16. use z={ z1,z2,...,znRepresent list entries, wherein ziFor the input vector of i-th word, y={ y1, y2,...,ynFor z reality output sequence label, Y (z)={ y'1,y'2,...,y'nFor z possible output label sequence;
17. condition random fields (CRF) probabilistic model is defined for given list entries z is on be possible to sequence label y Conditional probability p (y | z;W,b):
Wherein,For potential gain function, W is weight vectors, and b is bias vector;And by',yRespectively with label to (y', y) corresponding weight vectors and bias vector;
18. condition random fields (CRF) training stage, we use maximal condition possibility predication, for training set { (zi, yi), the logarithm of likelihood ratio is:
Maximum likelihood training process selects to cause likelihood ratio L (W, b) maximum parameter.
19. find the y of highest conditional probability to carry out label for labelling to sequence in sequence label:
20. station location markers by the word for having marked in original go out, and annotation results are put in order feed back to user.
Following table is the final annotation results in part of selected news documents.

Claims (5)

1. a kind of text based on Bi-LSTM, CNN and CRF names entity recognition method, it is characterised in that comprise the steps:
(1) data prediction is carried out according to text data, subordinate sentence, participle is carried out to text, obtain character feature vector;
(2) character feature vector is carried out to each word using convolutional neural networks to extract;
(3) character feature vector is combined with term vector, is passed to LSTM neutral nets, instructed using two-way LSTM neutral nets Get read statement information characteristics;
(4) for semantic feature resulting in (3), entity mark is carried out to each word using condition random field, is marked The entity information gone out in statement sequence.
2. a kind of text based on Bi-LSTM, CNN and CRF according to claim 1 names entity recognition method, and it is special It is to carry out data prediction to text data to levy, and to text subordinate sentence, participle are carried out, and obtains the step of character feature vector such as Under:
(1) subordinate sentence and participle are carried out to document using natural language processing instrument so that document is with sentence and per token Present;
(2) sentence, word and label are counted, forms sentence table, vocabulary and label list;
(3) character in word list is counted, forms character list;
(4) character feature vector is formed using the good term vector of existing pre-training and character list.
3. a kind of text based on Bi-LSTM, CNN and CRF according to claim 1 names entity recognition method, and it is special Levy is that the step of described utilization convolutional neural networks carry out character feature vector extraction to each word is as follows:
(1) C is made to be character list, d is the dimension of each character vector, and character vector matrix is:Q∈Rd×|C|
(2) word k ∈ V are preset by continuous character [c1,c2,...,cl] composition, wherein l is the length of word k, then the character of k Vector matrix is by Ck∈Rd×lBe given, wherein i-th is classified as character ciVector;
(3) in CkWith the kernel H ∈ R that a width is wd×wBetween realize convolutional layer, after this give convolutional layer addition bias Bias, and whole convolution results are carried out nonlinear regression to realize Feature Mapping map fk∈Rl-w+1, wherein, mapping function fk I-th element fk[i] is given by (1) formula;
fk[i]=tanh (<Ck[*,i:i+w-1],H>+b) (1)
Wherein * be all line numbers, Ck[*,i:I+w-1] it is CkIn i-th row to the i-th+w-1 row,<,>For Frobenius inner products, b is Bias vector;
(4) it is last, withExpress as the character pair of kernel H.
4. a kind of text based on Bi-LSTM, CNN and CRF according to claim 1 names entity recognition method, and it is special It is to combine character feature vector with term vector to levy, and is passed to LSTM neutral nets, is instructed using two-way LSTM neutral nets The step of getting read statement information characteristics is as follows:
(1) x is definedtFor the input character feature vector of t, htIt is the hiding layer state that all useful informations are stored in t Vector, σ is that sigmoid returns layer, and * is inner product, Ui,Uf,Uc,UoTo be directed to input x under different conditionstWeight matrix, Wi, Wf,Wc,WoTo hide layer state htWeight matrix, bi,bf,bc,boFor bias vector;
(2) t forget gate calculating as shown in (2) formula:
ft=σ (Wfht-1+Ufxt+bf) (2)
(3) h is updated in tt-1All information of middle storage, computing formula is as shown in (3), (4) formula:
it=σ (Wiht-1+Uixt+bi) (3)
C ~ t = tanh ( W c h t - 1 + U c x t + b c ) - - - ( 4 )
WhereinFor the vector that t be introduced into cell state;
(4) information updating for storing the t-1 moment in t is the storage information of t, and computing formula is formula (5):
C t = f t * C t - 1 + i t * C ~ t - - - ( 5 )
WhereinFor the vector of t cell state;
(5) it is shown in the output such as formula (6) of t, and update ht, computing formula such as formula (7):
ot=σ (Woht-1+Uoxt+bo) (6)
ht=ot*tanh(Ct) (7)
Wherein otFor the output of t;htFor the vector of t hidden layer;
(6) h in above-mentioned stepstAll information of storage last time, with same method a g is arranged againtFor storage not The information come, most latter two hidden layer information forms last output vector by cascading.
5. a kind of text based on Bi-LSTM, CNN and CRF according to claim 1 names entity recognition method, and it is special It is to carry out entity mark to each word using condition random field to levy, the step of mark the entity information in statement sequence It is as follows:
(1) with z={ z1,z2,...,znRepresent list entries, wherein n for list entries length, ziFor the input of i-th word Vector, y={ y1,y2,...,ynFor z reality output sequence label, Y (z)={ y'1,y'2,...,y'nIt is the possible of z Output label sequence;
(2) condition of the condition random field probabilistic model definition for given list entries z on be possible to sequence label y' is general Rate p (y | z;W,b):
p ( y | z ; W , b ) = &Pi; i = 1 n &psi; i ( y i - 1 , y i , z ) &Sigma; y &prime; &Element; Y ( z ) &Pi; i = 1 n &psi; i ( y &prime; i - 1 , y &prime; i , z ) - - - ( 8 )
Wherein,For potential gain function, W is weight vectors, and b is bias vector; And by',yRespectively with label to (y', y) corresponding weight vectors and bias vector;
(3) condition random field (CRF) training stage, use maximal condition Likelihood estimation select cause likelihood ratio L (W, b) most Big parameter, for training set { (zi,yi), the logarithm of likelihood ratio is:
L ( W , b ) = &Sigma; i log p ( y | z ; W , b ) - - - ( 9 )
(4) find the y of highest conditional probability in sequence label to carry out label for labelling to sequence:
y = argmax y &Element; Y ( z ) p ( y | z ; W , b ) - - - ( 10 ) .
CN201610959519.1A 2016-10-27 2016-10-27 Text named entity recognition method based on Bi-LSTM, CNN and CRF Pending CN106569998A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610959519.1A CN106569998A (en) 2016-10-27 2016-10-27 Text named entity recognition method based on Bi-LSTM, CNN and CRF

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610959519.1A CN106569998A (en) 2016-10-27 2016-10-27 Text named entity recognition method based on Bi-LSTM, CNN and CRF

Publications (1)

Publication Number Publication Date
CN106569998A true CN106569998A (en) 2017-04-19

Family

ID=58535610

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610959519.1A Pending CN106569998A (en) 2016-10-27 2016-10-27 Text named entity recognition method based on Bi-LSTM, CNN and CRF

Country Status (1)

Country Link
CN (1) CN106569998A (en)

Cited By (198)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107168957A (en) * 2017-06-12 2017-09-15 云南大学 A kind of Chinese word cutting method
CN107220506A (en) * 2017-06-05 2017-09-29 东华大学 Breast cancer risk assessment analysis system based on depth convolutional neural networks
CN107247704A (en) * 2017-06-09 2017-10-13 阿里巴巴集团控股有限公司 Term vector processing method, device and electronic equipment
CN107451433A (en) * 2017-06-27 2017-12-08 中国科学院信息工程研究所 A kind of information source identification method and apparatus based on content of text
CN107526799A (en) * 2017-08-18 2017-12-29 武汉红茶数据技术有限公司 A kind of knowledge mapping construction method based on deep learning
CN107526798A (en) * 2017-08-18 2017-12-29 武汉红茶数据技术有限公司 A kind of Entity recognition based on neutral net and standardization integrated processes and model
CN107545571A (en) * 2017-09-22 2018-01-05 深圳天琴医疗科技有限公司 A kind of image detecting method and device
CN107562716A (en) * 2017-07-18 2018-01-09 阿里巴巴集团控股有限公司 Term vector processing method, device and electronic equipment
CN107562784A (en) * 2017-07-25 2018-01-09 同济大学 Short text classification method based on ResLCNN models
CN107609009A (en) * 2017-07-26 2018-01-19 北京大学深圳研究院 Text emotion analysis method, device, storage medium and computer equipment
CN107608970A (en) * 2017-09-29 2018-01-19 百度在线网络技术(北京)有限公司 part-of-speech tagging model generating method and device
CN107622050A (en) * 2017-09-14 2018-01-23 武汉烽火普天信息技术有限公司 Text sequence labeling system and method based on Bi LSTM and CRF
CN107644014A (en) * 2017-09-25 2018-01-30 南京安链数据科技有限公司 A kind of name entity recognition method based on two-way LSTM and CRF
CN107679224A (en) * 2017-10-20 2018-02-09 竹间智能科技(上海)有限公司 It is a kind of towards the method and system without structure text intelligent answer
CN107679234A (en) * 2017-10-24 2018-02-09 上海携程国际旅行社有限公司 Customer service information providing method, device, electronic equipment, storage medium
CN107729326A (en) * 2017-09-25 2018-02-23 沈阳航空航天大学 Neural machine translation method based on Multi BiRNN codings
CN107797989A (en) * 2017-10-16 2018-03-13 平安科技(深圳)有限公司 Enterprise name recognition methods, electronic equipment and computer-readable recording medium
CN107797988A (en) * 2017-10-12 2018-03-13 北京知道未来信息技术有限公司 A kind of mixing language material name entity recognition method based on Bi LSTM
CN107797987A (en) * 2017-10-12 2018-03-13 北京知道未来信息技术有限公司 A kind of mixing language material name entity recognition method based on Bi LSTM CNN
CN107797992A (en) * 2017-11-10 2018-03-13 北京百分点信息科技有限公司 Name entity recognition method and device
CN107832289A (en) * 2017-10-12 2018-03-23 北京知道未来信息技术有限公司 A kind of name entity recognition method based on LSTM CNN
CN107832400A (en) * 2017-11-01 2018-03-23 山东大学 A kind of method that location-based LSTM and CNN conjunctive models carry out relation classification
CN107844472A (en) * 2017-07-18 2018-03-27 阿里巴巴集团控股有限公司 Term vector processing method, device and electronic equipment
CN107885721A (en) * 2017-10-12 2018-04-06 北京知道未来信息技术有限公司 A kind of name entity recognition method based on LSTM
CN107908614A (en) * 2017-10-12 2018-04-13 北京知道未来信息技术有限公司 A kind of name entity recognition method based on Bi LSTM
CN107943847A (en) * 2017-11-02 2018-04-20 平安科技(深圳)有限公司 Business connection extracting method, device and storage medium
CN107967251A (en) * 2017-10-12 2018-04-27 北京知道未来信息技术有限公司 A kind of name entity recognition method based on Bi-LSTM-CNN
CN107977353A (en) * 2017-10-12 2018-05-01 北京知道未来信息技术有限公司 A kind of mixing language material name entity recognition method based on LSTM-CNN
CN107992468A (en) * 2017-10-12 2018-05-04 北京知道未来信息技术有限公司 A kind of mixing language material name entity recognition method based on LSTM
CN108038104A (en) * 2017-12-22 2018-05-15 北京奇艺世纪科技有限公司 A kind of method and device of Entity recognition
CN108170675A (en) * 2017-12-27 2018-06-15 哈尔滨福满科技有限责任公司 A kind of name entity recognition method based on deep learning towards medical field
CN108182976A (en) * 2017-12-28 2018-06-19 西安交通大学 A kind of clinical medicine information extracting method based on neural network
CN108229582A (en) * 2018-02-01 2018-06-29 浙江大学 Entity recognition dual training method is named in a kind of multitask towards medical domain
CN108228568A (en) * 2018-01-24 2018-06-29 上海互教教育科技有限公司 A kind of mathematical problem semantic understanding method
CN108228564A (en) * 2018-01-04 2018-06-29 苏州大学 The name entity recognition method of confrontation study is carried out in crowdsourcing data
CN108268643A (en) * 2018-01-22 2018-07-10 北京邮电大学 A kind of Deep Semantics matching entities link method based on more granularity LSTM networks
CN108268444A (en) * 2018-01-10 2018-07-10 南京邮电大学 A kind of Chinese word cutting method based on two-way LSTM, CNN and CRF
CN108334567A (en) * 2018-01-16 2018-07-27 北京奇艺世纪科技有限公司 Rubbish text method of discrimination, device and server
CN108363695A (en) * 2018-02-23 2018-08-03 西南交通大学 A kind of user comment attribute extraction method based on bidirectional dependency syntax tree characterization
CN108388560A (en) * 2018-03-17 2018-08-10 北京工业大学 GRU-CRF meeting title recognition methods based on language model
CN108460013A (en) * 2018-01-30 2018-08-28 大连理工大学 A kind of sequence labelling model based on fine granularity vocabulary representation model
CN108491382A (en) * 2018-03-14 2018-09-04 四川大学 A kind of semi-supervised biomedical text semantic disambiguation method
CN108536679A (en) * 2018-04-13 2018-09-14 腾讯科技(成都)有限公司 Name entity recognition method, device, equipment and computer readable storage medium
CN108536754A (en) * 2018-03-14 2018-09-14 四川大学 Electronic health record entity relation extraction method based on BLSTM and attention mechanism
CN108563725A (en) * 2018-04-04 2018-09-21 华东理工大学 A kind of Chinese symptom and sign composition recognition methods
CN108563626A (en) * 2018-01-22 2018-09-21 北京颐圣智能科技有限公司 Medical text name entity recognition method and device
CN108628823A (en) * 2018-03-14 2018-10-09 中山大学 In conjunction with the name entity recognition method of attention mechanism and multitask coordinated training
CN108647225A (en) * 2018-03-23 2018-10-12 浙江大学 A kind of electric business grey black production public sentiment automatic mining method and system
CN108717410A (en) * 2018-05-17 2018-10-30 达而观信息科技(上海)有限公司 Name entity recognition method and system
CN108717406A (en) * 2018-05-10 2018-10-30 平安科技(深圳)有限公司 Text mood analysis method, device and storage medium
CN108762523A (en) * 2018-06-04 2018-11-06 重庆大学 Output characters through input method prediction technique based on capsule networks
CN108804654A (en) * 2018-06-07 2018-11-13 重庆邮电大学 A kind of collaborative virtual learning environment construction method based on intelligent answer
CN108805224A (en) * 2018-05-28 2018-11-13 中国人民解放军国防科技大学 Multi-symbol hand-drawn sketch recognition method and device with sustainable learning ability
CN108829719A (en) * 2018-05-07 2018-11-16 中国科学院合肥物质科学研究院 The non-true class quiz answers selection method of one kind and system
CN108829681A (en) * 2018-06-28 2018-11-16 北京神州泰岳软件股份有限公司 A kind of name entity extraction method and device
CN108874997A (en) * 2018-06-13 2018-11-23 广东外语外贸大学 A kind of name name entity recognition method towards film comment
CN108875034A (en) * 2018-06-25 2018-11-23 湖南丹尼尔智能科技有限公司 A kind of Chinese Text Categorization based on stratification shot and long term memory network
CN108874765A (en) * 2017-05-15 2018-11-23 阿里巴巴集团控股有限公司 Term vector processing method and processing device
CN108920448A (en) * 2018-05-17 2018-11-30 南京大学 A method of the comparison based on shot and long term memory network extracts
CN108920468A (en) * 2018-05-07 2018-11-30 内蒙古工业大学 A kind of bilingual kind of inter-translation method of illiteracy Chinese based on intensified learning
CN108920445A (en) * 2018-04-23 2018-11-30 华中科技大学鄂州工业技术研究院 A kind of name entity recognition method and device based on Bi-LSTM-CRF model
CN108920446A (en) * 2018-04-25 2018-11-30 华中科技大学鄂州工业技术研究院 A kind of processing method of Engineering document
CN108932226A (en) * 2018-05-29 2018-12-04 华东师范大学 A kind of pair of method without punctuate text addition punctuation mark
WO2018218706A1 (en) * 2017-05-27 2018-12-06 中国矿业大学 Method and system for extracting news event based on neural network
CN108959252A (en) * 2018-06-28 2018-12-07 中国人民解放军国防科技大学 Semi-supervised Chinese named entity recognition method based on deep learning
CN108984520A (en) * 2018-06-19 2018-12-11 中国科学院自动化研究所 Stratification text subject dividing method
CN109002436A (en) * 2018-07-12 2018-12-14 上海金仕达卫宁软件科技有限公司 Medical text terms automatic identifying method and system based on shot and long term memory network
CN109117472A (en) * 2018-11-12 2019-01-01 新疆大学 A kind of Uighur name entity recognition method based on deep learning
CN109145286A (en) * 2018-07-02 2019-01-04 昆明理工大学 Based on BiLSTM-CRF neural network model and merge the Noun Phrase Recognition Methods of Vietnamese language feature
CN109165384A (en) * 2018-08-23 2019-01-08 成都四方伟业软件股份有限公司 A kind of name entity recognition method and device
CN109165279A (en) * 2018-09-06 2019-01-08 深圳和而泰数据资源与云技术有限公司 information extraction method and device
CN109213997A (en) * 2018-08-16 2019-01-15 昆明理工大学 A kind of Chinese word cutting method based on two-way length memory network model in short-term
CN109241520A (en) * 2018-07-18 2019-01-18 五邑大学 A kind of sentence trunk analysis method and system based on the multilayer error Feedback Neural Network for segmenting and naming Entity recognition
CN109255119A (en) * 2018-07-18 2019-01-22 五邑大学 A kind of sentence trunk analysis method and system based on the multitask deep neural network for segmenting and naming Entity recognition
CN109255020A (en) * 2018-09-11 2019-01-22 浙江大学 A method of talked with using convolution and generates model solution dialogue generation task
CN109271631A (en) * 2018-09-12 2019-01-25 广州多益网络股份有限公司 Segmenting method, device, equipment and storage medium
CN109271494A (en) * 2018-08-10 2019-01-25 西安交通大学 A kind of system automatically extracting Chinese question and answer sentence focus
CN109284400A (en) * 2018-11-28 2019-01-29 电子科技大学 A kind of name entity recognition method based on Lattice LSTM and language model
CN109284361A (en) * 2018-09-29 2019-01-29 深圳追科技有限公司 A kind of entity abstracting method and system based on deep learning
CN109299458A (en) * 2018-09-12 2019-02-01 广州多益网络股份有限公司 Entity recognition method, device, equipment and storage medium
CN109308304A (en) * 2018-09-18 2019-02-05 深圳和而泰数据资源与云技术有限公司 Information extraction method and device
CN109359291A (en) * 2018-08-28 2019-02-19 昆明理工大学 A kind of name entity recognition method
CN109388807A (en) * 2018-10-30 2019-02-26 中山大学 The method, apparatus and storage medium of electronic health record name Entity recognition
CN109388795A (en) * 2017-08-07 2019-02-26 芋头科技(杭州)有限公司 A kind of name entity recognition method, language identification method and system
CN109426660A (en) * 2017-08-17 2019-03-05 中国海洋大学 English email composition assistant based on long memory network in short-term
CN109446521A (en) * 2018-10-18 2019-03-08 京东方科技集团股份有限公司 Name entity recognition method, device, electronic equipment, machine readable storage medium
CN109471895A (en) * 2018-10-29 2019-03-15 清华大学 The extraction of electronic health record phenotype, phenotype name authority method and system
CN109492215A (en) * 2018-09-18 2019-03-19 平安科技(深圳)有限公司 News property recognition methods, device, computer equipment and storage medium
CN109493956A (en) * 2018-10-15 2019-03-19 海口市人民医院(中南大学湘雅医学院附属海口医院) Diagnosis guiding method
CN109493166A (en) * 2018-10-23 2019-03-19 深圳智能思创科技有限公司 A kind of construction method for e-commerce shopping guide's scene Task conversational system
CN109543764A (en) * 2018-11-28 2019-03-29 安徽省公共气象服务中心 A kind of warning information legitimacy detection method and detection system based on intelligent semantic perception
CN109558569A (en) * 2018-12-14 2019-04-02 昆明理工大学 A kind of Laotian part-of-speech tagging method based on BiLSTM+CRF model
CN109657239A (en) * 2018-12-12 2019-04-19 电子科技大学 The Chinese name entity recognition method learnt based on attention mechanism and language model
CN109657135A (en) * 2018-11-13 2019-04-19 华南理工大学 A kind of scholar user neural network based draws a portrait information extraction method and model
CN109670172A (en) * 2018-12-06 2019-04-23 桂林电子科技大学 A kind of scenic spot anomalous event abstracting method based on complex neural network
CN109670164A (en) * 2018-04-11 2019-04-23 东莞迪赛软件技术有限公司 Healthy the analysis of public opinion method based on the more word insertion Bi-LSTM residual error networks of deep layer
CN109710922A (en) * 2018-12-06 2019-05-03 深港产学研基地产业发展中心 Text recognition method, device, computer equipment and storage medium
CN109753660A (en) * 2019-01-07 2019-05-14 福州大学 A kind of acceptance of the bid webpage name entity abstracting method based on LSTM
CN109815253A (en) * 2018-12-26 2019-05-28 出门问问信息科技有限公司 A kind of the subject entity recognition method and device of query statement
CN109815952A (en) * 2019-01-24 2019-05-28 珠海市筑巢科技有限公司 Brand name recognition methods, computer installation and computer readable storage medium
CN109871545A (en) * 2019-04-22 2019-06-11 京东方科技集团股份有限公司 Name entity recognition method and device
CN109871535A (en) * 2019-01-16 2019-06-11 四川大学 A kind of French name entity recognition method based on deep neural network
CN109871537A (en) * 2019-01-31 2019-06-11 沈阳雅译网络技术有限公司 A kind of high-precision Thai subordinate sentence method
CN109871843A (en) * 2017-12-01 2019-06-11 北京搜狗科技发展有限公司 Character identifying method and device, the device for character recognition
CN109918647A (en) * 2019-01-30 2019-06-21 中国科学院信息工程研究所 A kind of security fields name entity recognition method and neural network model
CN109933801A (en) * 2019-03-25 2019-06-25 北京理工大学 Two-way LSTM based on predicted position attention names entity recognition method
CN109960782A (en) * 2018-12-27 2019-07-02 同济大学 A kind of Tibetan language segmenting method and device based on deep neural network
CN109992770A (en) * 2019-03-04 2019-07-09 昆明理工大学 A kind of Laotian name entity recognition method based on combination neural net
CN109992782A (en) * 2019-04-02 2019-07-09 深圳市华云中盛科技有限公司 Legal documents name entity recognition method, device and computer equipment
CN110008469A (en) * 2019-03-19 2019-07-12 桂林电子科技大学 A kind of multi-level name entity recognition method
CN110019795A (en) * 2017-11-09 2019-07-16 普天信息技术有限公司 The training method and system of sensitive word detection model
CN110019676A (en) * 2017-12-01 2019-07-16 北京搜狗科技发展有限公司 A kind of method, apparatus and equipment identifying core word in query information
CN110019711A (en) * 2017-11-27 2019-07-16 吴谨准 A kind of control method and device of pair of medicine text data structureization processing
CN110046353A (en) * 2019-04-22 2019-07-23 重庆理工大学 Aspect level emotion analysis method based on multi-language level mechanism
WO2019149135A1 (en) * 2018-02-05 2019-08-08 阿里巴巴集团控股有限公司 Word vector generation method, apparatus and device
CN110110335A (en) * 2019-05-09 2019-08-09 南京大学 A kind of name entity recognition method based on Overlay model
CN110162749A (en) * 2018-10-22 2019-08-23 哈尔滨工业大学(深圳) Information extracting method, device, computer equipment and computer readable storage medium
CN110196963A (en) * 2018-02-27 2019-09-03 北京京东尚科信息技术有限公司 Model generation, the method for semantics recognition, system, equipment and storage medium
CN110223737A (en) * 2019-06-13 2019-09-10 电子科技大学 A kind of chemical composition of Chinese materia medica name entity recognition method and device
CN110222337A (en) * 2019-05-28 2019-09-10 浙江邦盛科技有限公司 A kind of Chinese address segmenting method based on transformer and CRF
CN110263323A (en) * 2019-05-08 2019-09-20 清华大学 Keyword abstraction method and system based on the long Memory Neural Networks in short-term of fence type
CN110263325A (en) * 2019-05-17 2019-09-20 交通银行股份有限公司太平洋信用卡中心 Chinese automatic word-cut
CN110298016A (en) * 2018-03-21 2019-10-01 普天信息技术有限公司 A kind of part-of-speech tagging method and device
CN110298036A (en) * 2019-06-06 2019-10-01 昆明理工大学 A kind of online medical text symptom identification method based on part of speech increment iterative
CN110298035A (en) * 2019-06-04 2019-10-01 平安科技(深圳)有限公司 Word vector based on artificial intelligence defines method, apparatus, equipment and storage medium
CN110308240A (en) * 2019-05-24 2019-10-08 深圳大学 A kind of electronic nose method for quickly identifying
CN110321566A (en) * 2019-07-10 2019-10-11 北京邮电大学 Chinese name entity recognition method, device, computer equipment and storage medium
CN110321547A (en) * 2018-03-30 2019-10-11 北京四维图新科技股份有限公司 A kind of name entity determines method and device
WO2019205319A1 (en) * 2018-04-25 2019-10-31 平安科技(深圳)有限公司 Commodity information format processing method and apparatus, and computer device and storage medium
CN110444261A (en) * 2019-07-11 2019-11-12 新华三大数据技术有限公司 Sequence labelling network training method, electronic health record processing method and relevant apparatus
CN110490756A (en) * 2019-07-12 2019-11-22 北京邮电大学 The national safe emergency event portrait method of social networks based on attention mechanism
CN110502742A (en) * 2019-07-11 2019-11-26 中国科学院计算技术研究所 A kind of complexity entity abstracting method, device, medium and system
CN110555207A (en) * 2018-06-01 2019-12-10 海信集团有限公司 Sentence recognition method, sentence recognition device, machine equipment and computer-readable storage medium
CN110619124A (en) * 2019-09-19 2019-12-27 成都数之联科技有限公司 Named entity identification method and system combining attention mechanism and bidirectional LSTM
CN110678881A (en) * 2017-05-19 2020-01-10 易享信息技术有限公司 Natural language processing using context-specific word vectors
CN110688854A (en) * 2019-09-02 2020-01-14 平安科技(深圳)有限公司 Named entity recognition method, device and computer readable storage medium
CN110738319A (en) * 2019-11-11 2020-01-31 四川隧唐科技股份有限公司 LSTM model unit training method and device for recognizing bid-winning units based on CRF
CN110738182A (en) * 2019-10-21 2020-01-31 四川隧唐科技股份有限公司 LSTM model unit training method and device for high-precision identification of bid amount
CN110750965A (en) * 2019-09-16 2020-02-04 平安科技(深圳)有限公司 English text sequence labeling method and system and computer equipment
CN110750992A (en) * 2019-10-09 2020-02-04 吉林大学 Named entity recognition method, device, electronic equipment and medium
CN110826298A (en) * 2019-11-13 2020-02-21 北京万里红科技股份有限公司 Statement coding method used in intelligent auxiliary password-fixing system
CN110852103A (en) * 2019-10-28 2020-02-28 青岛聚好联科技有限公司 Named entity identification method and device
CN110851597A (en) * 2019-10-28 2020-02-28 青岛聚好联科技有限公司 Method and device for sentence annotation based on similar entity replacement
CN110867225A (en) * 2019-11-04 2020-03-06 山东师范大学 Character-level clinical concept extraction named entity recognition method and system
CN110956041A (en) * 2019-11-27 2020-04-03 重庆邮电大学 Depth learning-based co-purchase recombination bulletin summarization method
CN111008526A (en) * 2019-12-06 2020-04-14 安徽理工大学 Named entity identification method based on dual-channel neural network
CN111027325A (en) * 2019-12-09 2020-04-17 北京知道创宇信息技术股份有限公司 Model generation method, entity identification device and electronic equipment
CN111104437A (en) * 2018-10-09 2020-05-05 哈尔滨工业大学 Test data unified retrieval method and system based on object model
CN111143574A (en) * 2019-12-05 2020-05-12 大连民族大学 Query and visualization system construction method based on minority culture knowledge graph
CN111160031A (en) * 2019-12-13 2020-05-15 华南理工大学 Social media named entity identification method based on affix perception
CN111177414A (en) * 2019-12-31 2020-05-19 厦门快商通科技股份有限公司 Entity pre-labeling method, device and equipment
CN111191452A (en) * 2019-12-24 2020-05-22 中国铁道科学研究院集团有限公司电子计算技术研究所 Railway text named entity recognition method and device
CN111191668A (en) * 2018-11-15 2020-05-22 零氪科技(北京)有限公司 Method for identifying disease content in medical record text
CN111209738A (en) * 2019-12-31 2020-05-29 浙江大学 Multi-task named entity recognition method combining text classification
CN111259672A (en) * 2020-02-12 2020-06-09 新疆大学 Chinese tourism field named entity identification method based on graph convolution neural network
CN111274395A (en) * 2020-01-19 2020-06-12 河海大学 Power grid monitoring alarm event identification method based on convolution and long-short term memory network
CN111274820A (en) * 2020-02-20 2020-06-12 齐鲁工业大学 Intelligent medical named entity identification method and device based on neural network
CN111368542A (en) * 2018-12-26 2020-07-03 北京大学 Text language association extraction method and system based on recurrent neural network
RU2726739C1 (en) * 2019-07-29 2020-07-15 Бейджин Сяоми Интеллиджент Текнолоджи Ко., Лтд. Method, apparatus and device for natural language processing
CN111444715A (en) * 2020-03-24 2020-07-24 腾讯科技(深圳)有限公司 Entity relationship identification method and device, computer equipment and storage medium
CN111476022A (en) * 2020-05-15 2020-07-31 湖南工商大学 Method, system and medium for recognizing STM entity by embedding and mixing L characters of entity characteristics
CN111523325A (en) * 2020-04-20 2020-08-11 电子科技大学 Chinese named entity recognition method based on strokes
CN111563380A (en) * 2019-01-25 2020-08-21 浙江大学 Named entity identification method and device
CN111581474A (en) * 2020-04-02 2020-08-25 昆明理工大学 Evaluation object extraction method of case-related microblog comments based on multi-head attention system
CN111597792A (en) * 2020-03-05 2020-08-28 苏州浪潮智能科技有限公司 Sentence-level convolution LSTM training method, equipment and readable medium
CN111694936A (en) * 2020-04-26 2020-09-22 平安科技(深圳)有限公司 Method and device for identifying AI intelligent interview, computer equipment and storage medium
CN111739520A (en) * 2020-08-10 2020-10-02 腾讯科技(深圳)有限公司 Speech recognition model training method, speech recognition method and device
CN111737949A (en) * 2020-07-22 2020-10-02 江西风向标教育科技有限公司 Topic content extraction method and device, readable storage medium and computer equipment
CN111950277A (en) * 2019-04-30 2020-11-17 中移(苏州)软件技术有限公司 Business situation entity determining method, device and storage medium
CN111967265A (en) * 2020-08-31 2020-11-20 广东工业大学 Chinese word segmentation and entity identification combined learning method capable of automatically generating data set
WO2020232861A1 (en) * 2019-05-20 2020-11-26 平安科技(深圳)有限公司 Named entity recognition method, electronic device and storage medium
CN112084783A (en) * 2020-09-24 2020-12-15 中国民航大学 Entity identification method and system based on civil aviation non-civilized passengers
CN112115714A (en) * 2020-09-25 2020-12-22 平安国际智慧城市科技股份有限公司 Deep learning sequence labeling method and device and computer readable storage medium
CN112183086A (en) * 2020-09-23 2021-01-05 北京先声智能科技有限公司 English pronunciation continuous reading mark model based on sense group labeling
CN112307764A (en) * 2019-07-30 2021-02-02 百度(美国)有限责任公司 Coreference-aware representation learning for neural named entity recognition
CN112395882A (en) * 2020-12-07 2021-02-23 震坤行网络技术(南京)有限公司 Method, electronic device and storage medium for named entity recognition
CN112599124A (en) * 2020-11-20 2021-04-02 内蒙古电力(集团)有限责任公司电力调度控制分公司 Voice scheduling method and system for power grid scheduling
CN112651245A (en) * 2020-12-28 2021-04-13 南京邮电大学 Sequence annotation model and sequence annotation method
WO2021073179A1 (en) * 2019-10-15 2021-04-22 华为技术有限公司 Named entity identification method and device, and computer-readable storage medium
CN112989796A (en) * 2021-03-10 2021-06-18 北京大学 Text named entity information identification method based on syntactic guidance
CN113035303A (en) * 2021-02-09 2021-06-25 北京工业大学 Method and system for labeling named entity category of Chinese electronic medical record
US11055557B2 (en) 2018-04-05 2021-07-06 Walmart Apollo, Llc Automated extraction of product attributes from images
CN113076127A (en) * 2021-04-25 2021-07-06 南京大学 Method, system, electronic device and medium for extracting question and answer content in programming environment
CN113190602A (en) * 2021-04-09 2021-07-30 桂林电子科技大学 Event joint extraction method integrating word features and deep learning
CN113255320A (en) * 2021-05-13 2021-08-13 北京熙紫智数科技有限公司 Entity relation extraction method and device based on syntax tree and graph attention machine mechanism
CN113326380A (en) * 2021-08-03 2021-08-31 国能大渡河大数据服务有限公司 Equipment measurement data processing method, system and terminal based on deep neural network
CN113377953A (en) * 2021-05-31 2021-09-10 电子科技大学 Entity fusion and classification method based on PALC-DCA model
CN113488196A (en) * 2021-07-26 2021-10-08 西南交通大学 Drug specification text named entity recognition modeling method
CN113515946A (en) * 2021-06-22 2021-10-19 湖北亿咖通科技有限公司 Information processing method and device
CN113536799A (en) * 2021-08-10 2021-10-22 西南交通大学 Medical named entity recognition modeling method based on fusion attention
CN114021658A (en) * 2021-11-10 2022-02-08 北京交通大学 Training method, application method and system of named entity recognition model
CN114386425A (en) * 2022-03-24 2022-04-22 天津思睿信息技术有限公司 Big data system establishing method for processing natural language text content
CN114444485A (en) * 2022-01-24 2022-05-06 四川大学 Cloud environment network equipment entity identification method
US11487944B1 (en) * 2019-12-09 2022-11-01 Asapp, Inc. System, method, and computer program for obtaining a unified named entity recognition model with the collective predictive capabilities of teacher models with different tag sets using marginal distillation
US11521639B1 (en) 2021-04-02 2022-12-06 Asapp, Inc. Speech sentiment analysis using a speech sentiment classifier pretrained with pseudo sentiment labels
CN115688777A (en) * 2022-09-28 2023-02-03 北京邮电大学 Named entity recognition system for nested and discontinuous entities of Chinese financial text
US11593558B2 (en) * 2017-08-31 2023-02-28 Ebay Inc. Deep hybrid neural network for named entity recognition
US11763803B1 (en) 2021-07-28 2023-09-19 Asapp, Inc. System, method, and computer program for extracting utterances corresponding to a user problem statement in a conversation between a human agent and a user

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104933152A (en) * 2015-06-24 2015-09-23 北京京东尚科信息技术有限公司 Named entity recognition method and device
CN105260360A (en) * 2015-10-27 2016-01-20 小米科技有限责任公司 Named entity identification method and device
CN105630768A (en) * 2015-12-23 2016-06-01 北京理工大学 Cascaded conditional random field-based product name recognition method and device

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104933152A (en) * 2015-06-24 2015-09-23 北京京东尚科信息技术有限公司 Named entity recognition method and device
CN105260360A (en) * 2015-10-27 2016-01-20 小米科技有限责任公司 Named entity identification method and device
CN105630768A (en) * 2015-12-23 2016-06-01 北京理工大学 Cascaded conditional random field-based product name recognition method and device

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
C´ICERO NOGUEIRA DOS SANTOS 等: "Learning Character-level Representations for Part-of-Speech Tagging", 《PROCEEDINGS OF THE 31ST INTERNATIONAL CONFERENCE ON MACHINE LEARNING》 *
GUILLAUME LAMPLE 等: "Neural Architectures for Named Entity Recognition", 《PROCEEDINGS OF NAACL-HLT 2016》 *
JASON P.C. CHIU 等: "Named Entity Recognition with Bidirectional LSTM-CNNs", 《TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS》 *
XUEZHE MA 等: "End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF", 《HTTP://ARXIV.ORG/PDF/1603.01354.PDF》 *

Cited By (279)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108874765A (en) * 2017-05-15 2018-11-23 阿里巴巴集团控股有限公司 Term vector processing method and processing device
CN108874765B (en) * 2017-05-15 2021-12-24 创新先进技术有限公司 Word vector processing method and device
CN110678881B (en) * 2017-05-19 2023-10-03 硕动力公司 Natural language processing using context-specific word vectors
CN110678881A (en) * 2017-05-19 2020-01-10 易享信息技术有限公司 Natural language processing using context-specific word vectors
WO2018218706A1 (en) * 2017-05-27 2018-12-06 中国矿业大学 Method and system for extracting news event based on neural network
CN107220506A (en) * 2017-06-05 2017-09-29 东华大学 Breast cancer risk assessment analysis system based on depth convolutional neural networks
CN107247704A (en) * 2017-06-09 2017-10-13 阿里巴巴集团控股有限公司 Term vector processing method, device and electronic equipment
CN107247704B (en) * 2017-06-09 2020-09-08 阿里巴巴集团控股有限公司 Word vector processing method and device and electronic equipment
CN107168957A (en) * 2017-06-12 2017-09-15 云南大学 A kind of Chinese word cutting method
CN107451433A (en) * 2017-06-27 2017-12-08 中国科学院信息工程研究所 A kind of information source identification method and apparatus based on content of text
CN107451433B (en) * 2017-06-27 2020-05-22 中国科学院信息工程研究所 Information source identification method and device based on text content
CN107562716A (en) * 2017-07-18 2018-01-09 阿里巴巴集团控股有限公司 Term vector processing method, device and electronic equipment
CN107844472B (en) * 2017-07-18 2021-08-24 创新先进技术有限公司 Word vector processing method and device and electronic equipment
CN107844472A (en) * 2017-07-18 2018-03-27 阿里巴巴集团控股有限公司 Term vector processing method, device and electronic equipment
CN107562784A (en) * 2017-07-25 2018-01-09 同济大学 Short text classification method based on ResLCNN models
CN107609009A (en) * 2017-07-26 2018-01-19 北京大学深圳研究院 Text emotion analysis method, device, storage medium and computer equipment
CN109388795A (en) * 2017-08-07 2019-02-26 芋头科技(杭州)有限公司 A kind of name entity recognition method, language identification method and system
CN109388795B (en) * 2017-08-07 2022-11-08 芋头科技(杭州)有限公司 Named entity recognition method, language recognition method and system
CN109426660A (en) * 2017-08-17 2019-03-05 中国海洋大学 English email composition assistant based on long memory network in short-term
CN107526798A (en) * 2017-08-18 2017-12-29 武汉红茶数据技术有限公司 A kind of Entity recognition based on neutral net and standardization integrated processes and model
CN107526798B (en) * 2017-08-18 2020-09-01 武汉红茶数据技术有限公司 Entity identification and normalization combined method and model based on neural network
CN107526799A (en) * 2017-08-18 2017-12-29 武汉红茶数据技术有限公司 A kind of knowledge mapping construction method based on deep learning
US11593558B2 (en) * 2017-08-31 2023-02-28 Ebay Inc. Deep hybrid neural network for named entity recognition
CN107622050B (en) * 2017-09-14 2021-02-26 武汉烽火普天信息技术有限公司 Bi-LSTM and CRF-based text sequence labeling system and method
CN107622050A (en) * 2017-09-14 2018-01-23 武汉烽火普天信息技术有限公司 Text sequence labeling system and method based on Bi LSTM and CRF
CN107545571A (en) * 2017-09-22 2018-01-05 深圳天琴医疗科技有限公司 A kind of image detecting method and device
CN107729326A (en) * 2017-09-25 2018-02-23 沈阳航空航天大学 Neural machine translation method based on Multi BiRNN codings
CN107729326B (en) * 2017-09-25 2020-12-25 沈阳航空航天大学 Multi-BiRNN coding-based neural machine translation method
CN107644014A (en) * 2017-09-25 2018-01-30 南京安链数据科技有限公司 A kind of name entity recognition method based on two-way LSTM and CRF
CN107608970A (en) * 2017-09-29 2018-01-19 百度在线网络技术(北京)有限公司 part-of-speech tagging model generating method and device
CN107885721A (en) * 2017-10-12 2018-04-06 北京知道未来信息技术有限公司 A kind of name entity recognition method based on LSTM
CN107797987A (en) * 2017-10-12 2018-03-13 北京知道未来信息技术有限公司 A kind of mixing language material name entity recognition method based on Bi LSTM CNN
CN107832289A (en) * 2017-10-12 2018-03-23 北京知道未来信息技术有限公司 A kind of name entity recognition method based on LSTM CNN
CN107908614A (en) * 2017-10-12 2018-04-13 北京知道未来信息技术有限公司 A kind of name entity recognition method based on Bi LSTM
CN107967251A (en) * 2017-10-12 2018-04-27 北京知道未来信息技术有限公司 A kind of name entity recognition method based on Bi-LSTM-CNN
CN107797987B (en) * 2017-10-12 2021-02-09 北京知道未来信息技术有限公司 Bi-LSTM-CNN-based mixed corpus named entity identification method
CN107977353A (en) * 2017-10-12 2018-05-01 北京知道未来信息技术有限公司 A kind of mixing language material name entity recognition method based on LSTM-CNN
CN107992468A (en) * 2017-10-12 2018-05-04 北京知道未来信息技术有限公司 A kind of mixing language material name entity recognition method based on LSTM
CN107797988A (en) * 2017-10-12 2018-03-13 北京知道未来信息技术有限公司 A kind of mixing language material name entity recognition method based on Bi LSTM
CN107797989A (en) * 2017-10-16 2018-03-13 平安科技(深圳)有限公司 Enterprise name recognition methods, electronic equipment and computer-readable recording medium
WO2019075967A1 (en) * 2017-10-16 2019-04-25 平安科技(深圳)有限公司 Enterprise name recognition method, electronic device, and computer-readable storage medium
CN107679224A (en) * 2017-10-20 2018-02-09 竹间智能科技(上海)有限公司 It is a kind of towards the method and system without structure text intelligent answer
CN107679224B (en) * 2017-10-20 2020-09-08 竹间智能科技(上海)有限公司 Intelligent question and answer method and system for unstructured text
CN107679234A (en) * 2017-10-24 2018-02-09 上海携程国际旅行社有限公司 Customer service information providing method, device, electronic equipment, storage medium
CN107679234B (en) * 2017-10-24 2020-02-11 上海携程国际旅行社有限公司 Customer service information providing method, customer service information providing device, electronic equipment and storage medium
CN107832400A (en) * 2017-11-01 2018-03-23 山东大学 A kind of method that location-based LSTM and CNN conjunctive models carry out relation classification
CN107832400B (en) * 2017-11-01 2019-04-16 山东大学 A kind of method that location-based LSTM and CNN conjunctive model carries out relationship classification
CN107943847B (en) * 2017-11-02 2019-05-17 平安科技(深圳)有限公司 Business connection extracting method, device and storage medium
CN107943847A (en) * 2017-11-02 2018-04-20 平安科技(深圳)有限公司 Business connection extracting method, device and storage medium
CN110019795A (en) * 2017-11-09 2019-07-16 普天信息技术有限公司 The training method and system of sensitive word detection model
CN107797992A (en) * 2017-11-10 2018-03-13 北京百分点信息科技有限公司 Name entity recognition method and device
CN110019711A (en) * 2017-11-27 2019-07-16 吴谨准 A kind of control method and device of pair of medicine text data structureization processing
CN110019676A (en) * 2017-12-01 2019-07-16 北京搜狗科技发展有限公司 A kind of method, apparatus and equipment identifying core word in query information
CN109871843B (en) * 2017-12-01 2022-04-08 北京搜狗科技发展有限公司 Character recognition method and device for character recognition
CN109871843A (en) * 2017-12-01 2019-06-11 北京搜狗科技发展有限公司 Character identifying method and device, the device for character recognition
CN108038104A (en) * 2017-12-22 2018-05-15 北京奇艺世纪科技有限公司 A kind of method and device of Entity recognition
CN108170675A (en) * 2017-12-27 2018-06-15 哈尔滨福满科技有限责任公司 A kind of name entity recognition method based on deep learning towards medical field
CN108182976A (en) * 2017-12-28 2018-06-19 西安交通大学 A kind of clinical medicine information extracting method based on neural network
CN108228564B (en) * 2018-01-04 2021-02-19 苏州大学 Named entity recognition method, device and readable storage medium for counterlearning on crowdsourced data
CN108228564A (en) * 2018-01-04 2018-06-29 苏州大学 The name entity recognition method of confrontation study is carried out in crowdsourcing data
CN108268444A (en) * 2018-01-10 2018-07-10 南京邮电大学 A kind of Chinese word cutting method based on two-way LSTM, CNN and CRF
CN108268444B (en) * 2018-01-10 2021-11-02 南京邮电大学 Chinese word segmentation method based on bidirectional LSTM, CNN and CRF
CN108334567A (en) * 2018-01-16 2018-07-27 北京奇艺世纪科技有限公司 Rubbish text method of discrimination, device and server
CN108268643A (en) * 2018-01-22 2018-07-10 北京邮电大学 A kind of Deep Semantics matching entities link method based on more granularity LSTM networks
CN108563626B (en) * 2018-01-22 2022-01-25 北京颐圣智能科技有限公司 Medical text named entity recognition method and device
CN108563626A (en) * 2018-01-22 2018-09-21 北京颐圣智能科技有限公司 Medical text name entity recognition method and device
CN108228568B (en) * 2018-01-24 2021-06-04 上海互教教育科技有限公司 Mathematical problem semantic understanding method
CN108228568A (en) * 2018-01-24 2018-06-29 上海互教教育科技有限公司 A kind of mathematical problem semantic understanding method
CN108460013B (en) * 2018-01-30 2021-08-20 大连理工大学 Sequence labeling model and method based on fine-grained word representation model
CN108460013A (en) * 2018-01-30 2018-08-28 大连理工大学 A kind of sequence labelling model based on fine granularity vocabulary representation model
CN108229582A (en) * 2018-02-01 2018-06-29 浙江大学 Entity recognition dual training method is named in a kind of multitask towards medical domain
US11030411B2 (en) 2018-02-05 2021-06-08 Alibaba Group Holding Limited Methods, apparatuses, and devices for generating word vectors
WO2019149135A1 (en) * 2018-02-05 2019-08-08 阿里巴巴集团控股有限公司 Word vector generation method, apparatus and device
CN108363695B (en) * 2018-02-23 2020-04-24 西南交通大学 User comment attribute extraction method based on bidirectional dependency syntax tree representation
CN108363695A (en) * 2018-02-23 2018-08-03 西南交通大学 A kind of user comment attribute extraction method based on bidirectional dependency syntax tree characterization
CN110196963A (en) * 2018-02-27 2019-09-03 北京京东尚科信息技术有限公司 Model generation, the method for semantics recognition, system, equipment and storage medium
CN108536754A (en) * 2018-03-14 2018-09-14 四川大学 Electronic health record entity relation extraction method based on BLSTM and attention mechanism
CN108628823B (en) * 2018-03-14 2022-07-01 中山大学 Named entity recognition method combining attention mechanism and multi-task collaborative training
CN108491382A (en) * 2018-03-14 2018-09-04 四川大学 A kind of semi-supervised biomedical text semantic disambiguation method
CN108628823A (en) * 2018-03-14 2018-10-09 中山大学 In conjunction with the name entity recognition method of attention mechanism and multitask coordinated training
CN108388560A (en) * 2018-03-17 2018-08-10 北京工业大学 GRU-CRF meeting title recognition methods based on language model
CN110298016A (en) * 2018-03-21 2019-10-01 普天信息技术有限公司 A kind of part-of-speech tagging method and device
CN108647225A (en) * 2018-03-23 2018-10-12 浙江大学 A kind of electric business grey black production public sentiment automatic mining method and system
CN110321547A (en) * 2018-03-30 2019-10-11 北京四维图新科技股份有限公司 A kind of name entity determines method and device
CN108563725A (en) * 2018-04-04 2018-09-21 华东理工大学 A kind of Chinese symptom and sign composition recognition methods
US11055557B2 (en) 2018-04-05 2021-07-06 Walmart Apollo, Llc Automated extraction of product attributes from images
CN109670164A (en) * 2018-04-11 2019-04-23 东莞迪赛软件技术有限公司 Healthy the analysis of public opinion method based on the more word insertion Bi-LSTM residual error networks of deep layer
CN108536679A (en) * 2018-04-13 2018-09-14 腾讯科技(成都)有限公司 Name entity recognition method, device, equipment and computer readable storage medium
CN108536679B (en) * 2018-04-13 2022-05-20 腾讯科技(成都)有限公司 Named entity recognition method, device, equipment and computer readable storage medium
CN108920445B (en) * 2018-04-23 2022-06-17 华中科技大学鄂州工业技术研究院 Named entity identification method and device based on Bi-LSTM-CRF model
CN108920445A (en) * 2018-04-23 2018-11-30 华中科技大学鄂州工业技术研究院 A kind of name entity recognition method and device based on Bi-LSTM-CRF model
WO2019205319A1 (en) * 2018-04-25 2019-10-31 平安科技(深圳)有限公司 Commodity information format processing method and apparatus, and computer device and storage medium
CN108920446A (en) * 2018-04-25 2018-11-30 华中科技大学鄂州工业技术研究院 A kind of processing method of Engineering document
CN108829719A (en) * 2018-05-07 2018-11-16 中国科学院合肥物质科学研究院 The non-true class quiz answers selection method of one kind and system
CN108920468A (en) * 2018-05-07 2018-11-30 内蒙古工业大学 A kind of bilingual kind of inter-translation method of illiteracy Chinese based on intensified learning
CN108717406A (en) * 2018-05-10 2018-10-30 平安科技(深圳)有限公司 Text mood analysis method, device and storage medium
CN108717406B (en) * 2018-05-10 2021-08-24 平安科技(深圳)有限公司 Text emotion analysis method and device and storage medium
CN108717410A (en) * 2018-05-17 2018-10-30 达而观信息科技(上海)有限公司 Name entity recognition method and system
CN108920448B (en) * 2018-05-17 2021-09-14 南京大学 Comparison relation extraction method based on long-term and short-term memory network
CN108920448A (en) * 2018-05-17 2018-11-30 南京大学 A method of the comparison based on shot and long term memory network extracts
CN108805224A (en) * 2018-05-28 2018-11-13 中国人民解放军国防科技大学 Multi-symbol hand-drawn sketch recognition method and device with sustainable learning ability
CN108805224B (en) * 2018-05-28 2021-10-01 中国人民解放军国防科技大学 Multi-symbol hand-drawn sketch recognition method and device with sustainable learning ability
CN108932226A (en) * 2018-05-29 2018-12-04 华东师范大学 A kind of pair of method without punctuate text addition punctuation mark
CN110555207A (en) * 2018-06-01 2019-12-10 海信集团有限公司 Sentence recognition method, sentence recognition device, machine equipment and computer-readable storage medium
CN108762523A (en) * 2018-06-04 2018-11-06 重庆大学 Output characters through input method prediction technique based on capsule networks
CN108804654A (en) * 2018-06-07 2018-11-13 重庆邮电大学 A kind of collaborative virtual learning environment construction method based on intelligent answer
CN108874997A (en) * 2018-06-13 2018-11-23 广东外语外贸大学 A kind of name name entity recognition method towards film comment
CN108984520A (en) * 2018-06-19 2018-12-11 中国科学院自动化研究所 Stratification text subject dividing method
CN108875034A (en) * 2018-06-25 2018-11-23 湖南丹尼尔智能科技有限公司 A kind of Chinese Text Categorization based on stratification shot and long term memory network
CN108959252A (en) * 2018-06-28 2018-12-07 中国人民解放军国防科技大学 Semi-supervised Chinese named entity recognition method based on deep learning
CN108829681A (en) * 2018-06-28 2018-11-16 北京神州泰岳软件股份有限公司 A kind of name entity extraction method and device
CN108829681B (en) * 2018-06-28 2022-11-11 鼎富智能科技有限公司 Named entity extraction method and device
CN108959252B (en) * 2018-06-28 2022-02-08 中国人民解放军国防科技大学 Semi-supervised Chinese named entity recognition method based on deep learning
CN109145286A (en) * 2018-07-02 2019-01-04 昆明理工大学 Based on BiLSTM-CRF neural network model and merge the Noun Phrase Recognition Methods of Vietnamese language feature
CN109002436A (en) * 2018-07-12 2018-12-14 上海金仕达卫宁软件科技有限公司 Medical text terms automatic identifying method and system based on shot and long term memory network
CN109241520A (en) * 2018-07-18 2019-01-18 五邑大学 A kind of sentence trunk analysis method and system based on the multilayer error Feedback Neural Network for segmenting and naming Entity recognition
CN109241520B (en) * 2018-07-18 2023-05-23 五邑大学 Sentence trunk analysis method and system based on multi-layer error feedback neural network for word segmentation and named entity recognition
CN109255119A (en) * 2018-07-18 2019-01-22 五邑大学 A kind of sentence trunk analysis method and system based on the multitask deep neural network for segmenting and naming Entity recognition
CN109271494A (en) * 2018-08-10 2019-01-25 西安交通大学 A kind of system automatically extracting Chinese question and answer sentence focus
CN109271494B (en) * 2018-08-10 2021-04-27 西安交通大学 System for automatically extracting focus of Chinese question and answer sentences
CN109213997B (en) * 2018-08-16 2021-11-19 昆明理工大学 Chinese word segmentation method based on bidirectional long-time and short-time memory network model
CN109213997A (en) * 2018-08-16 2019-01-15 昆明理工大学 A kind of Chinese word cutting method based on two-way length memory network model in short-term
CN109165384A (en) * 2018-08-23 2019-01-08 成都四方伟业软件股份有限公司 A kind of name entity recognition method and device
CN109359291A (en) * 2018-08-28 2019-02-19 昆明理工大学 A kind of name entity recognition method
CN109165279A (en) * 2018-09-06 2019-01-08 深圳和而泰数据资源与云技术有限公司 information extraction method and device
CN109255020A (en) * 2018-09-11 2019-01-22 浙江大学 A method of talked with using convolution and generates model solution dialogue generation task
CN109299458B (en) * 2018-09-12 2023-03-28 广州多益网络股份有限公司 Entity identification method, device, equipment and storage medium
CN109299458A (en) * 2018-09-12 2019-02-01 广州多益网络股份有限公司 Entity recognition method, device, equipment and storage medium
CN109271631B (en) * 2018-09-12 2023-01-24 广州多益网络股份有限公司 Word segmentation method, device, equipment and storage medium
CN109271631A (en) * 2018-09-12 2019-01-25 广州多益网络股份有限公司 Segmenting method, device, equipment and storage medium
CN109308304A (en) * 2018-09-18 2019-02-05 深圳和而泰数据资源与云技术有限公司 Information extraction method and device
CN109492215A (en) * 2018-09-18 2019-03-19 平安科技(深圳)有限公司 News property recognition methods, device, computer equipment and storage medium
CN109284361A (en) * 2018-09-29 2019-01-29 深圳追科技有限公司 A kind of entity abstracting method and system based on deep learning
CN111104437A (en) * 2018-10-09 2020-05-05 哈尔滨工业大学 Test data unified retrieval method and system based on object model
CN109493956A (en) * 2018-10-15 2019-03-19 海口市人民医院(中南大学湘雅医学院附属海口医院) Diagnosis guiding method
CN109446521A (en) * 2018-10-18 2019-03-08 京东方科技集团股份有限公司 Name entity recognition method, device, electronic equipment, machine readable storage medium
CN110162749A (en) * 2018-10-22 2019-08-23 哈尔滨工业大学(深圳) Information extracting method, device, computer equipment and computer readable storage medium
CN109493166A (en) * 2018-10-23 2019-03-19 深圳智能思创科技有限公司 A kind of construction method for e-commerce shopping guide's scene Task conversational system
CN109493166B (en) * 2018-10-23 2021-12-28 深圳智能思创科技有限公司 Construction method for task type dialogue system aiming at e-commerce shopping guide scene
CN109471895B (en) * 2018-10-29 2021-02-26 清华大学 Electronic medical record phenotype extraction and phenotype name normalization method and system
CN109471895A (en) * 2018-10-29 2019-03-15 清华大学 The extraction of electronic health record phenotype, phenotype name authority method and system
CN109388807A (en) * 2018-10-30 2019-02-26 中山大学 The method, apparatus and storage medium of electronic health record name Entity recognition
CN109388807B (en) * 2018-10-30 2021-09-21 中山大学 Method, device and storage medium for identifying named entities of electronic medical records
CN109117472A (en) * 2018-11-12 2019-01-01 新疆大学 A kind of Uighur name entity recognition method based on deep learning
CN109657135B (en) * 2018-11-13 2023-06-23 华南理工大学 Scholars user portrait information extraction method and model based on neural network
CN109657135A (en) * 2018-11-13 2019-04-19 华南理工大学 A kind of scholar user neural network based draws a portrait information extraction method and model
CN111191668B (en) * 2018-11-15 2023-04-28 零氪科技(北京)有限公司 Method for identifying disease content in medical record text
CN111191668A (en) * 2018-11-15 2020-05-22 零氪科技(北京)有限公司 Method for identifying disease content in medical record text
CN109284400A (en) * 2018-11-28 2019-01-29 电子科技大学 A kind of name entity recognition method based on Lattice LSTM and language model
CN109543764A (en) * 2018-11-28 2019-03-29 安徽省公共气象服务中心 A kind of warning information legitimacy detection method and detection system based on intelligent semantic perception
CN109284400B (en) * 2018-11-28 2020-10-23 电子科技大学 Named entity identification method based on Lattice LSTM and language model
CN109670172A (en) * 2018-12-06 2019-04-23 桂林电子科技大学 A kind of scenic spot anomalous event abstracting method based on complex neural network
CN109710922A (en) * 2018-12-06 2019-05-03 深港产学研基地产业发展中心 Text recognition method, device, computer equipment and storage medium
CN109657239B (en) * 2018-12-12 2020-04-21 电子科技大学 Chinese named entity recognition method based on attention mechanism and language model learning
CN109657239A (en) * 2018-12-12 2019-04-19 电子科技大学 The Chinese name entity recognition method learnt based on attention mechanism and language model
CN109558569A (en) * 2018-12-14 2019-04-02 昆明理工大学 A kind of Laotian part-of-speech tagging method based on BiLSTM+CRF model
CN109815253A (en) * 2018-12-26 2019-05-28 出门问问信息科技有限公司 A kind of the subject entity recognition method and device of query statement
CN111368542A (en) * 2018-12-26 2020-07-03 北京大学 Text language association extraction method and system based on recurrent neural network
CN109960782A (en) * 2018-12-27 2019-07-02 同济大学 A kind of Tibetan language segmenting method and device based on deep neural network
CN109753660B (en) * 2019-01-07 2023-06-13 福州大学 LSTM-based winning bid web page named entity extraction method
CN109753660A (en) * 2019-01-07 2019-05-14 福州大学 A kind of acceptance of the bid webpage name entity abstracting method based on LSTM
CN109871535B (en) * 2019-01-16 2020-01-10 四川大学 French named entity recognition method based on deep neural network
CN109871535A (en) * 2019-01-16 2019-06-11 四川大学 A kind of French name entity recognition method based on deep neural network
CN109815952A (en) * 2019-01-24 2019-05-28 珠海市筑巢科技有限公司 Brand name recognition methods, computer installation and computer readable storage medium
CN111563380A (en) * 2019-01-25 2020-08-21 浙江大学 Named entity identification method and device
CN109918647A (en) * 2019-01-30 2019-06-21 中国科学院信息工程研究所 A kind of security fields name entity recognition method and neural network model
CN109871537B (en) * 2019-01-31 2022-12-27 沈阳雅译网络技术有限公司 High-precision Thai sentence segmentation method
CN109871537A (en) * 2019-01-31 2019-06-11 沈阳雅译网络技术有限公司 A kind of high-precision Thai subordinate sentence method
CN109992770A (en) * 2019-03-04 2019-07-09 昆明理工大学 A kind of Laotian name entity recognition method based on combination neural net
CN110008469A (en) * 2019-03-19 2019-07-12 桂林电子科技大学 A kind of multi-level name entity recognition method
CN110008469B (en) * 2019-03-19 2022-06-07 桂林电子科技大学 Multilevel named entity recognition method
CN109933801A (en) * 2019-03-25 2019-06-25 北京理工大学 Two-way LSTM based on predicted position attention names entity recognition method
CN109992782A (en) * 2019-04-02 2019-07-09 深圳市华云中盛科技有限公司 Legal documents name entity recognition method, device and computer equipment
CN109871545A (en) * 2019-04-22 2019-06-11 京东方科技集团股份有限公司 Name entity recognition method and device
CN110046353B (en) * 2019-04-22 2022-05-13 重庆理工大学 Aspect level emotion analysis method based on multi-language level mechanism
CN110046353A (en) * 2019-04-22 2019-07-23 重庆理工大学 Aspect level emotion analysis method based on multi-language level mechanism
CN111950277A (en) * 2019-04-30 2020-11-17 中移(苏州)软件技术有限公司 Business situation entity determining method, device and storage medium
CN110263323A (en) * 2019-05-08 2019-09-20 清华大学 Keyword abstraction method and system based on the long Memory Neural Networks in short-term of fence type
CN110263323B (en) * 2019-05-08 2020-08-28 清华大学 Keyword extraction method and system based on barrier type long-time memory neural network
CN110110335A (en) * 2019-05-09 2019-08-09 南京大学 A kind of name entity recognition method based on Overlay model
CN110110335B (en) * 2019-05-09 2023-01-06 南京大学 Named entity identification method based on stack model
CN110263325B (en) * 2019-05-17 2023-05-12 交通银行股份有限公司太平洋信用卡中心 Chinese word segmentation system
CN110263325A (en) * 2019-05-17 2019-09-20 交通银行股份有限公司太平洋信用卡中心 Chinese automatic word-cut
WO2020232861A1 (en) * 2019-05-20 2020-11-26 平安科技(深圳)有限公司 Named entity recognition method, electronic device and storage medium
CN110308240A (en) * 2019-05-24 2019-10-08 深圳大学 A kind of electronic nose method for quickly identifying
CN110222337B (en) * 2019-05-28 2022-12-02 浙江邦盛科技股份有限公司 Chinese address word segmentation method based on transform and CRF
CN110222337A (en) * 2019-05-28 2019-09-10 浙江邦盛科技有限公司 A kind of Chinese address segmenting method based on transformer and CRF
CN110298035A (en) * 2019-06-04 2019-10-01 平安科技(深圳)有限公司 Word vector based on artificial intelligence defines method, apparatus, equipment and storage medium
CN110298035B (en) * 2019-06-04 2023-12-01 平安科技(深圳)有限公司 Word vector definition method, device, equipment and storage medium based on artificial intelligence
CN110298036A (en) * 2019-06-06 2019-10-01 昆明理工大学 A kind of online medical text symptom identification method based on part of speech increment iterative
CN110298036B (en) * 2019-06-06 2022-07-22 昆明理工大学 Online medical text symptom identification method based on part-of-speech incremental iteration
CN110223737A (en) * 2019-06-13 2019-09-10 电子科技大学 A kind of chemical composition of Chinese materia medica name entity recognition method and device
CN110321566A (en) * 2019-07-10 2019-10-11 北京邮电大学 Chinese name entity recognition method, device, computer equipment and storage medium
CN110321566B (en) * 2019-07-10 2020-11-13 北京邮电大学 Chinese named entity recognition method and device, computer equipment and storage medium
CN110502742A (en) * 2019-07-11 2019-11-26 中国科学院计算技术研究所 A kind of complexity entity abstracting method, device, medium and system
CN110444261B (en) * 2019-07-11 2023-02-03 新华三大数据技术有限公司 Sequence labeling network training method, electronic medical record processing method and related device
CN110444261A (en) * 2019-07-11 2019-11-12 新华三大数据技术有限公司 Sequence labelling network training method, electronic health record processing method and relevant apparatus
CN110490756A (en) * 2019-07-12 2019-11-22 北京邮电大学 The national safe emergency event portrait method of social networks based on attention mechanism
RU2726739C1 (en) * 2019-07-29 2020-07-15 Бейджин Сяоми Интеллиджент Текнолоджи Ко., Лтд. Method, apparatus and device for natural language processing
US11501078B2 (en) 2019-07-29 2022-11-15 Beijing Xiaomi Intelligent Technology Co., Ltd. Method and device for performing reinforcement learning on natural language processing model and storage medium
CN112307764B (en) * 2019-07-30 2024-01-19 百度(美国)有限责任公司 Co-fingered aware representation learning for neural named entity recognition
CN112307764A (en) * 2019-07-30 2021-02-02 百度(美国)有限责任公司 Coreference-aware representation learning for neural named entity recognition
CN110688854A (en) * 2019-09-02 2020-01-14 平安科技(深圳)有限公司 Named entity recognition method, device and computer readable storage medium
WO2021051574A1 (en) * 2019-09-16 2021-03-25 平安科技(深圳)有限公司 English text sequence labelling method and system, and computer device
CN110750965B (en) * 2019-09-16 2023-06-30 平安科技(深圳)有限公司 English text sequence labeling method, english text sequence labeling system and computer equipment
CN110750965A (en) * 2019-09-16 2020-02-04 平安科技(深圳)有限公司 English text sequence labeling method and system and computer equipment
CN110619124B (en) * 2019-09-19 2023-06-16 成都数之联科技股份有限公司 Named entity identification method and system combining attention mechanism and bidirectional LSTM
CN110619124A (en) * 2019-09-19 2019-12-27 成都数之联科技有限公司 Named entity identification method and system combining attention mechanism and bidirectional LSTM
CN110750992A (en) * 2019-10-09 2020-02-04 吉林大学 Named entity recognition method, device, electronic equipment and medium
WO2021073179A1 (en) * 2019-10-15 2021-04-22 华为技术有限公司 Named entity identification method and device, and computer-readable storage medium
CN110738182A (en) * 2019-10-21 2020-01-31 四川隧唐科技股份有限公司 LSTM model unit training method and device for high-precision identification of bid amount
CN110851597A (en) * 2019-10-28 2020-02-28 青岛聚好联科技有限公司 Method and device for sentence annotation based on similar entity replacement
CN110852103A (en) * 2019-10-28 2020-02-28 青岛聚好联科技有限公司 Named entity identification method and device
CN110867225A (en) * 2019-11-04 2020-03-06 山东师范大学 Character-level clinical concept extraction named entity recognition method and system
CN110738319A (en) * 2019-11-11 2020-01-31 四川隧唐科技股份有限公司 LSTM model unit training method and device for recognizing bid-winning units based on CRF
CN110826298A (en) * 2019-11-13 2020-02-21 北京万里红科技股份有限公司 Statement coding method used in intelligent auxiliary password-fixing system
CN110956041A (en) * 2019-11-27 2020-04-03 重庆邮电大学 Depth learning-based co-purchase recombination bulletin summarization method
CN111143574A (en) * 2019-12-05 2020-05-12 大连民族大学 Query and visualization system construction method based on minority culture knowledge graph
CN111008526A (en) * 2019-12-06 2020-04-14 安徽理工大学 Named entity identification method based on dual-channel neural network
US11487944B1 (en) * 2019-12-09 2022-11-01 Asapp, Inc. System, method, and computer program for obtaining a unified named entity recognition model with the collective predictive capabilities of teacher models with different tag sets using marginal distillation
CN111027325B (en) * 2019-12-09 2023-11-28 北京知道创宇信息技术股份有限公司 Model generation method, entity identification device and electronic equipment
CN111027325A (en) * 2019-12-09 2020-04-17 北京知道创宇信息技术股份有限公司 Model generation method, entity identification device and electronic equipment
CN111160031A (en) * 2019-12-13 2020-05-15 华南理工大学 Social media named entity identification method based on affix perception
CN111191452A (en) * 2019-12-24 2020-05-22 中国铁道科学研究院集团有限公司电子计算技术研究所 Railway text named entity recognition method and device
CN111209738B (en) * 2019-12-31 2021-03-26 浙江大学 Multi-task named entity recognition method combining text classification
CN111209738A (en) * 2019-12-31 2020-05-29 浙江大学 Multi-task named entity recognition method combining text classification
CN111177414A (en) * 2019-12-31 2020-05-19 厦门快商通科技股份有限公司 Entity pre-labeling method, device and equipment
CN111274395A (en) * 2020-01-19 2020-06-12 河海大学 Power grid monitoring alarm event identification method based on convolution and long-short term memory network
CN111259672A (en) * 2020-02-12 2020-06-09 新疆大学 Chinese tourism field named entity identification method based on graph convolution neural network
CN111274820B (en) * 2020-02-20 2023-04-07 齐鲁工业大学 Intelligent medical named entity identification method and device based on neural network
CN111274820A (en) * 2020-02-20 2020-06-12 齐鲁工业大学 Intelligent medical named entity identification method and device based on neural network
CN111597792A (en) * 2020-03-05 2020-08-28 苏州浪潮智能科技有限公司 Sentence-level convolution LSTM training method, equipment and readable medium
CN111597792B (en) * 2020-03-05 2023-01-06 苏州浪潮智能科技有限公司 Sentence-level convolution LSTM training method, equipment and readable medium
CN111444715A (en) * 2020-03-24 2020-07-24 腾讯科技(深圳)有限公司 Entity relationship identification method and device, computer equipment and storage medium
CN111581474A (en) * 2020-04-02 2020-08-25 昆明理工大学 Evaluation object extraction method of case-related microblog comments based on multi-head attention system
CN111581474B (en) * 2020-04-02 2022-07-29 昆明理工大学 Evaluation object extraction method of case-related microblog comments based on multi-head attention system
CN111523325A (en) * 2020-04-20 2020-08-11 电子科技大学 Chinese named entity recognition method based on strokes
CN111694936B (en) * 2020-04-26 2023-06-06 平安科技(深圳)有限公司 Method, device, computer equipment and storage medium for identification of AI intelligent interview
CN111694936A (en) * 2020-04-26 2020-09-22 平安科技(深圳)有限公司 Method and device for identifying AI intelligent interview, computer equipment and storage medium
WO2021217866A1 (en) * 2020-04-26 2021-11-04 平安科技(深圳)有限公司 Method and apparatus for ai interview recognition, computer device and storage medium
CN111476022A (en) * 2020-05-15 2020-07-31 湖南工商大学 Method, system and medium for recognizing STM entity by embedding and mixing L characters of entity characteristics
CN111737949A (en) * 2020-07-22 2020-10-02 江西风向标教育科技有限公司 Topic content extraction method and device, readable storage medium and computer equipment
CN111739520A (en) * 2020-08-10 2020-10-02 腾讯科技(深圳)有限公司 Speech recognition model training method, speech recognition method and device
CN111967265A (en) * 2020-08-31 2020-11-20 广东工业大学 Chinese word segmentation and entity identification combined learning method capable of automatically generating data set
CN111967265B (en) * 2020-08-31 2023-09-15 广东工业大学 Chinese word segmentation and entity recognition combined learning method for automatic generation of data set
CN112183086A (en) * 2020-09-23 2021-01-05 北京先声智能科技有限公司 English pronunciation continuous reading mark model based on sense group labeling
CN112084783A (en) * 2020-09-24 2020-12-15 中国民航大学 Entity identification method and system based on civil aviation non-civilized passengers
CN112084783B (en) * 2020-09-24 2022-04-12 中国民航大学 Entity identification method and system based on civil aviation non-civilized passengers
CN112115714B (en) * 2020-09-25 2023-08-18 深圳平安智慧医健科技有限公司 Deep learning sequence labeling method, device and computer readable storage medium
CN112115714A (en) * 2020-09-25 2020-12-22 平安国际智慧城市科技股份有限公司 Deep learning sequence labeling method and device and computer readable storage medium
CN112599124A (en) * 2020-11-20 2021-04-02 内蒙古电力(集团)有限责任公司电力调度控制分公司 Voice scheduling method and system for power grid scheduling
CN112395882A (en) * 2020-12-07 2021-02-23 震坤行网络技术(南京)有限公司 Method, electronic device and storage medium for named entity recognition
CN112395882B (en) * 2020-12-07 2021-04-06 震坤行网络技术(南京)有限公司 Method, electronic device and storage medium for named entity recognition
CN112651245A (en) * 2020-12-28 2021-04-13 南京邮电大学 Sequence annotation model and sequence annotation method
CN113035303A (en) * 2021-02-09 2021-06-25 北京工业大学 Method and system for labeling named entity category of Chinese electronic medical record
CN112989796A (en) * 2021-03-10 2021-06-18 北京大学 Text named entity information identification method based on syntactic guidance
CN112989796B (en) * 2021-03-10 2023-09-22 北京大学 Text naming entity information identification method based on syntactic guidance
US11521639B1 (en) 2021-04-02 2022-12-06 Asapp, Inc. Speech sentiment analysis using a speech sentiment classifier pretrained with pseudo sentiment labels
CN113190602A (en) * 2021-04-09 2021-07-30 桂林电子科技大学 Event joint extraction method integrating word features and deep learning
CN113190602B (en) * 2021-04-09 2022-03-25 桂林电子科技大学 Event joint extraction method integrating word features and deep learning
CN113076127B (en) * 2021-04-25 2023-08-29 南京大学 Method, system, electronic device and medium for extracting question and answer content in programming environment
CN113076127A (en) * 2021-04-25 2021-07-06 南京大学 Method, system, electronic device and medium for extracting question and answer content in programming environment
CN113255320A (en) * 2021-05-13 2021-08-13 北京熙紫智数科技有限公司 Entity relation extraction method and device based on syntax tree and graph attention machine mechanism
CN113377953A (en) * 2021-05-31 2021-09-10 电子科技大学 Entity fusion and classification method based on PALC-DCA model
CN113515946B (en) * 2021-06-22 2024-01-05 亿咖通(湖北)技术有限公司 Information processing method and device
CN113515946A (en) * 2021-06-22 2021-10-19 湖北亿咖通科技有限公司 Information processing method and device
CN113488196A (en) * 2021-07-26 2021-10-08 西南交通大学 Drug specification text named entity recognition modeling method
CN113488196B (en) * 2021-07-26 2023-04-07 西南交通大学 Drug specification text named entity recognition modeling method
US11763803B1 (en) 2021-07-28 2023-09-19 Asapp, Inc. System, method, and computer program for extracting utterances corresponding to a user problem statement in a conversation between a human agent and a user
CN113326380B (en) * 2021-08-03 2021-11-02 国能大渡河大数据服务有限公司 Equipment measurement data processing method, system and terminal based on deep neural network
CN113326380A (en) * 2021-08-03 2021-08-31 国能大渡河大数据服务有限公司 Equipment measurement data processing method, system and terminal based on deep neural network
CN113536799A (en) * 2021-08-10 2021-10-22 西南交通大学 Medical named entity recognition modeling method based on fusion attention
CN113536799B (en) * 2021-08-10 2023-04-07 西南交通大学 Medical named entity recognition modeling method based on fusion attention
CN114021658A (en) * 2021-11-10 2022-02-08 北京交通大学 Training method, application method and system of named entity recognition model
CN114444485A (en) * 2022-01-24 2022-05-06 四川大学 Cloud environment network equipment entity identification method
CN114386425A (en) * 2022-03-24 2022-04-22 天津思睿信息技术有限公司 Big data system establishing method for processing natural language text content
CN114386425B (en) * 2022-03-24 2022-06-10 天津思睿信息技术有限公司 Big data system establishing method for processing natural language text content
CN115688777B (en) * 2022-09-28 2023-05-05 北京邮电大学 Named entity recognition system for nested and discontinuous entities of Chinese financial text
CN115688777A (en) * 2022-09-28 2023-02-03 北京邮电大学 Named entity recognition system for nested and discontinuous entities of Chinese financial text

Similar Documents

Publication Publication Date Title
CN106569998A (en) Text named entity recognition method based on Bi-LSTM, CNN and CRF
CN106980683B (en) Blog text abstract generating method based on deep learning
CN107291693B (en) Semantic calculation method for improved word vector model
CN108984526B (en) Document theme vector extraction method based on deep learning
CN108595632B (en) Hybrid neural network text classification method fusing abstract and main body characteristics
CN108363743B (en) Intelligent problem generation method and device and computer readable storage medium
CN109902298B (en) Domain knowledge modeling and knowledge level estimation method in self-adaptive learning system
CN107967257B (en) Cascading composition generating method
CN110765775B (en) Self-adaptive method for named entity recognition field fusing semantics and label differences
CN111209738B (en) Multi-task named entity recognition method combining text classification
CN110222163A (en) A kind of intelligent answer method and system merging CNN and two-way LSTM
CN110019685B (en) Deep text matching method and device based on sequencing learning
CN107967318A (en) A kind of Chinese short text subjective item automatic scoring method and system using LSTM neutral nets
CN109635124A (en) A kind of remote supervisory Relation extraction method of combination background knowledge
CN107025284A (en) The recognition methods of network comment text emotion tendency and convolutional neural networks model
CN110287323B (en) Target-oriented emotion classification method
CN109977199B (en) Reading understanding method based on attention pooling mechanism
CN110263174B (en) Topic category analysis method based on focus attention
CN111274790B (en) Chapter-level event embedding method and device based on syntactic dependency graph
CN113626589B (en) Multi-label text classification method based on mixed attention mechanism
CN114756681B (en) Evaluation and education text fine granularity suggestion mining method based on multi-attention fusion
CN111078833A (en) Text classification method based on neural network
CN110826338A (en) Fine-grained semantic similarity recognition method for single-choice gate and inter-class measurement
CN113704416B (en) Word sense disambiguation method and device, electronic equipment and computer-readable storage medium
CN112749562A (en) Named entity identification method, device, storage medium and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20170419

WD01 Invention patent application deemed withdrawn after publication