CN107944014A - A kind of Chinese text sentiment analysis method based on deep learning - Google Patents

A kind of Chinese text sentiment analysis method based on deep learning Download PDF

Info

Publication number
CN107944014A
CN107944014A CN201711307041.5A CN201711307041A CN107944014A CN 107944014 A CN107944014 A CN 107944014A CN 201711307041 A CN201711307041 A CN 201711307041A CN 107944014 A CN107944014 A CN 107944014A
Authority
CN
China
Prior art keywords
chinese
lstm
sentence
text
sentiment analysis
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201711307041.5A
Other languages
Chinese (zh)
Inventor
严勤
丁聪
陈葛恒
肖丽莎
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hohai University HHU
Original Assignee
Hohai University HHU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hohai University HHU filed Critical Hohai University HHU
Priority to CN201711307041.5A priority Critical patent/CN107944014A/en
Publication of CN107944014A publication Critical patent/CN107944014A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/374Thesaurus
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/284Lexical analysis, e.g. tokenisation or collocates

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Machine Translation (AREA)

Abstract

The invention discloses a kind of Chinese text sentiment analysis method based on deep learning, network text is obtained, Chinese sentence is rationally designed to the conversion logic of mathematic vector, first term vector dictionary is constructed with Chinese words segmentation combination term vector learning tool, then sentence vector conversion is carried out with LSTM MP models, representative sentences vector is subjected to just negative emotional semantic classification eventually through Softmax graders, achieve the purpose that sentiment analysis, the algorithm classification accuracy rate is high, it is efficient, flexibility ratio is high and avoids a large amount of manual works of supervised learning method, it is effectively improved the efficiency and accuracy of text emotion tendency classification, automated integration degree is high to save a large amount of manpowers.

Description

A kind of Chinese text sentiment analysis method based on deep learning
Technical field
The present invention relates to a kind of Chinese text sentiment analysis method based on deep learning, belong to natural language processing and depth Spend learning art field.
Background technology
Developing rapidly for internet makes microblogging and social networks become popular communication exchange form.Hundreds of millions of reflection people The information of viewpoint and attitude issued by platforms such as Twitter, Facebook and shared with owner daily, this is just to monitoring Chance is provided with analysis individual enterprise or social public sphere viewpoint, mood.
Text emotion analysis is the viewpoint to people, mood, attitude and real to product, service, tissue, event etc. Body Sentiment orientation makes effectively analysis and then further does a kind of technology of information induction.The sea produced for network media Data are measured, extract valuable mood and viewpoint, and accurate text emotion analysis is made to it, having in numerous areas should With value, such as:Enterprise can carry out after-sale service adjustment and market strategy formulation according to the relevant feedback mood of its product; Government can analyze according to a large amount of text moods of social platform and make the policy-system for being more in line with masses' demand;Finance side Face can also extract according to the mood viewpoint of various financal messagings and excavate trend prediction for carrying out a certain financial market etc..
The comparison that research in terms of text (English) sentiment analysis carries out abroad is early, achievement also comparative maturity, such as: The polarity that Turney and Pang realizes product and film comment with different methods respectively is classified, and subsequent Pang and Snyder again will A variety of methods are combined and attempted again.In addition, Pang and Lee also expands the polarity basis of classification task of film comment Star rating prediction is opened up, while restaurant is commented on after doing depth analysis and is used for predicting that the grade of restaurant's each side is commented by Snyder Usual practice such as food, environment etc. (totally 5 grades).Gruhl et al. passes through the sentiment analysis prognostic chart to online Internet chat information Book sales trend, Mishne et al. capture the information with mood to predict box office receipts from blog.Yet with English with The otherness of Chinese text structure, the sentiment analysis research difficulty of Chinese text is much greater, along with starting evening, mark text Expect that limited, technical method such as falls behind at the factor, the research of Chinese text sentiment analysis still has greatly improved space.
Chinese text sentiment analysis method is mostly based on rule and has supervision to be based on machine learning, limitation at present at present There is the following aspects:(1) due to varying with each individual to linguistry rule, Judgment by emotion Rulemaking is by formulation people's research level Limitation;(2) certain methods artificially carry out Feature Selection, therefore sentiment analysis effect is by people by rule of thumb when sentence characteristics extract Influenced for factor bigger etc..
The content of the invention
In order to solve the above technical problem, the present invention provides a kind of Chinese text sentiment analysis side based on deep learning Method.
In order to achieve the above object, the technical solution adopted in the present invention is:
A kind of Chinese text sentiment analysis method based on deep learning, comprises the following steps,
Step 1, training LSTM-MP models and Softmax graders;
Detailed process is as follows:
Obtain network text;
The network text of acquisition is pre-processed, obtains the Chinese sentence in network text;
Centering sentence carries out Chinese word segmentation and builds term vector dictionary;
Some Chinese sentences are manually marked, as LSTM-MP model experiment data, remaining Chinese sentence conduct LSTM-MP model training data;
LSTM-MP models are trained with LSTM-MP model trainings data;
LSTM-MP model experiments data are all converted into sentence vector with trained LSTM-MP models;
Tested some vectors as Softmax classifier training data, remaining vector as Softmax graders Data;
Softmax graders are trained with Softmax classifier trainings data, are tested with Softmax graders test data Trained Softmax graders;
Step 2, sentiment analysis is carried out with trained LSTM-MP models and Softmax graders.
The process that design multithreading reptile carries out network text acquisition is,
Choose the url list of appropriate website homepage URL initialization reptiles;
The html document of each website homepage is obtained, parses message in html document corresponding URL, it is corresponding to message Url list is added to after URL duplicate removals;
If there is the message newly issued, the corresponding URL of new information is added to url list;
Corresponding html document is obtained according to URL;
The html document that will be got, carries out information extraction using information extraction technique, extracts the message text of page-out Behind part, local data base is stored according to form is formulated.
The process that network text is pre-processed is the escape character in network text to be replaced, in network text Punctuation mark lack of standardization be replaced.
Centering sentence carries out Chinese word segmentation and simultaneously builds term vector dictionary, and detailed process is,
Centering sentence carries out Chinese word segmentation;
Term vector learning tool is debugged;
The Chinese word input word vector learning tool that Chinese word segmentation is obtained, carries out term vector dictionary creation.
Best Match Method is selected to carry out Chinese word segmentation.
LSTM-MP model training data are converted into term vector sequence, then train LSTM-MP models;
LSTM-MP model experiment data are converted into term vector sequence, then with trained LSTM-MP models by its turn It is changed to sentence vector.
The process that sentiment analysis is carried out with trained LSTM-MP models and Softmax graders is,
Obtain the network text that need to be analyzed;
The network text that need to be analyzed is pre-processed, obtains the Chinese sentence that need to be analyzed;
Chinese word segmentation is carried out to the Chinese sentence that need to be analyzed and builds term vector dictionary;
The Chinese sentence that need to be analyzed is converted into term vector sequence;
Term vector sequence is converted into sentence vector with trained LSTM-MP models;
Sentence vector is subjected to mood classification with trained Softmax graders.
The beneficial effect that the present invention is reached:The present invention obtains network text, rationally designs Chinese sentence to number The conversion logic of vector is learned, constructs term vector dictionary, Ran Houyong with Chinese words segmentation combination term vector learning tool first LSTM-MP models carry out sentence vector conversion, and representative sentences vector is carried out positive negative emotion point eventually through Softmax graders Class, achievees the purpose that sentiment analysis, and the algorithm classification accuracy rate is high, efficient, flexibility ratio is high and avoids supervised learning side A large amount of manual works of method, are effectively improved the efficiency and accuracy of text emotion tendency classification, automated integration degree Gao Jie Save a large amount of manpowers.
Brief description of the drawings
Fig. 1 is the flow chart of the present invention;
Fig. 2 is multithreading reptile structure chart;
Fig. 3 is Recursive Neural Network Structure figure;
Fig. 4 is LSTM structure charts;
Fig. 5 is LSTM-MP structure charts.
Embodiment
The invention will be further described below in conjunction with the accompanying drawings.Following embodiments are only used for clearly illustrating the present invention Technical solution, and be not intended to limit the protection scope of the present invention and limit the scope of the invention.
As shown in Figure 1, a kind of 1, Chinese text sentiment analysis method based on deep learning, comprises the following steps:
Step 1, training LSTM-MP models and Softmax graders;
Detailed process is as follows:
A the progress network text acquisition (as shown in Figure 2) of multithreading reptile) is designed, is comprised the following steps that:
A1 the url list of appropriate website homepage URL initialization reptiles) is chosen;
Investigated by collecting, choose content of text source --- Baidu's news, Sina's finance and economics, Jingdone district with emotion viewpoint Commented on Deng store, with the url list of website homepage URL initialization reptiles.
A2 the html document of each website homepage) is obtained, the corresponding URL of message in html document is parsed, to prevent data Impact analysis is repeated as a result, being added to url list after URL corresponding to message progress duplicate removals, will if there is the message newly issued The corresponding URL of new information is added to url list.
A3 corresponding html document) is obtained according to URL, queue management scheduling gives download queue according to the idle condition of thread URL distributes thread.
A4) the html document that will be got, carries out information extraction using information extraction technique, is extracting the information of page-out just Behind literary part, local data base is stored according to form is formulated.
B) network text (html text) of acquisition is pre-processed, obtains the Chinese sentence in network text.
Some additional characters are understood by escape to be distinguished with the key symbol in html text, therefore get Html text needs to be replaced escape symbol, specific as shown in table 1.
1 escape symbol of table is replaced
Due to not having separator between word, along with the inadequate specification of network text, Chinese and English punctuation mark is used with, full-shape Half-angle is mashed up etc., and participle is difficult, and in order to improve the accuracy of participle, the punctuation mark lack of standardization of network text is carried out Replace, it is specific as shown in table 2.
2 punctuation mark of table is replaced
Punctuate before replacement Punctuate after replacement
【】 []
, ,
.
C) centering sentence carries out Chinese word segmentation and builds term vector dictionary, comprises the following steps that:
C1) Best Match Method is selected to carry out Chinese word segmentation.So-called Best Match Method is exactly on the basis of Forward Maximum Method A kind of method of enterprising line efficiency optimization, the basic step of Forward Maximum Method are:It is 4 to define word maximum length first, so The matching of 4 words is from left to right proceeded by afterwards, is continued if dictionary matching success, it is unsuccessful to cut a word continuation from back Matching remains an individual character until successful match or only, and best match is exactly on this basis by the word order of dictionary according to probability of occurrence Size is ranked up, probability of occurrence it is big come before, it is so more efficient in word match.
C2) term vector learning tool is debugged.
Need to debug learning tool word2vec before term vector study, choose different parameters (sample rate, line Number of passes, window size etc.) to test repeatedly, the optimized parameter determined in debugging process is:Window size 5, sampling threshold 500, line Number of passes 12, term vector dimension 50.
C3) the Chinese word input word vector learning tool word2vec for obtaining Chinese word segmentation, carries out term vector dictionary Structure, specific instructions are:./word2vec–train/home/exer/gold.txt–output/home/exer/ golddic.txt–cbow 0–size 50–window 5–negative 0–hs 1–sample 500–thread 12– binary 0。
D) some Chinese sentences are manually marked, as LSTM-MP model experiment data, remaining Chinese sentence conduct LSTM-MP model training data.
Assuming that there are 48786 Chinese sentences, then 34150 are used as LSTM-MP model training data, 14636 conducts LSTM-MP model experiment data, LSTM-MP model experiment data probably account for 30%.
E) with LSTM-MP model trainings data training LSTM-MP models, comprise the following steps that:
E1) LSTM-MP modellings.
LSTM model refinement Self-Recursive Neural Network.Each sentence is that the word for having varying number is formed, it is assumed that word Quantity is n, and the dimension of each term vector is to be fixed as m, and the vector dimension of that each sentence is n × m, because n is change, So n × m cannot be fixed, it is impossible to it is trained with common feedforward neural network, and the appearance of recurrent neural network is fine Solve the problems, such as this, it allows sentence word recursion cycle to input.
But recurrent neural network, there is also problem, when sentence is long, the recursive number of plies can excessively cause whole network Gradient disappears or explosion, and LSTM is just into the solver of this problem, it inherits the tactic pattern of Recursive Networks model more Its shortcoming has been mended, mnemon has been introduced as calculate node, avoids gradient disappearance problem, a mnemon is by 4 parts Composition:One input gate, the neural unit that a band connects certainly, an out gate and a forgetting door.Input gate can determine Whether input signal can influence and change the current state of mnemon, and out gate can then determine current mnemon Whether location mode that will be coupled to other has an impact state, and forgeing door can currently be determined by adjusting from connection Whether the previous state of (removing) this mnemon is forgotten.
Recurrence son coding is concrete structure as shown in figure 3, being a vector lists A (x by a sentence expression1,x2, x3,...,xn), wherein x1,x2,x3,...,xnTo form the term vector of sentence, first xnWith xn-1It is spliced into 2m dimensional vectors (xn-1, xn), as the input layer of 2m node, it is m (identical with term vector dimension) then to define hidden layer number of nodes, and output layer is certain And 2m nodes, hidden layer output h is obtained after calculating1, then the h obtained with first layer1With xn-2It is spliced into new 2m dimensional vectors (xn-2,h1) input as second layer hidden layer, repeat step n-1 times, finishes to obtain one until n term vector all calculates The vectorial s of a whole sentence identical with term vector dimension, its dimension are identical with term vector.
During this, a sentence is decomposed for a binary tree structure, and recurrence is a triple each time, by Two child nodes and father node composition (P, a C1C2), wherein C1,C2It is two term vector x1,x2Corresponding word, P are by x1,x2 The hidden layer being calculated exports corresponding word.Corresponding triple child nodes are probably term vector xi, i ∈ [1, n], also may be used It can be the node of non-terminal location.First node h first1Vector is by (C1.C2)=(x4.x5) (() represents two term vectors Combination) be calculated:
P=f (W(1)[C1;C2]+b(1))
Wherein W(1)It is the coefficient matrix of m × 2m, b(1)For amount of bias, function f is activation primitive tanh tanh functions, P is that hidden layer exports, [C '1;C′2] represent the corresponding phrase of term vector group after reconstruct, if by [C '1;C′2] reconstruct can To be expressed as:
[C′1;C′2]=W(2)p+b(2)
Wherein, C '1,C′2Be reconstruct after the corresponding word of term vector, W(2)It is the coefficient matrix of 2m × m, b(2)It is inclined to reconstruct The amount of putting.
Reconstructed error is minimized in the training process, with the Euclidean distance between input and reconstruct weigh them it Between error be:
Wherein, Erec() is the standard for weighing error between input and reconstruct.
Here it is once recursive calculating process now we obtained h1, then (C1.C2)=(x3.h1) it is exactly next Input, repeats above-mentioned calculation procedure until obtaining the vector representation s of sentence.
LSTM is to introduce mnemon on the basis of recurrence own coding to be improved, to tackle the defects of its gradient disappears, Work as last word w according to the input logic identical with the sub- coding of recurrencenAfter the completion of input, hnFor output such as Fig. 4 institutes of LSMT Show, it is considered that can be as the vector table of whole sentence using last term vector as the final output obtained after output Show carry out classification based training, but test proves that the result that so obtains and be not so good as people's will, added on the basis of LSTM average Pond layer is improved as shown in Figure 5.LSTM-MP is not with final output hnFor sentence vector representation, but by all notes of LSTM The average pond layer of output input of unit is recalled, so as to obtain final sentence vector
E2 Chinese sentence) is converted into term vector sequence.Keep former order constant after Chinese sentence participle, form Chinese word Word order arranges, and the term vector for finding each word is then corresponded according to the term vector dictionary built, Chinese sequence of terms is turned It is changed to term vector sequence.
E3 the term vector sequence training LSTM-MP models after LSTM-MP model training data conversions) are used.
F LSTM-MP model experiment data) are converted into term vector sequence, then with trained LSTM-MP models by its Be converted to sentence vector.
G) surveyed some vectors as Softmax classifier training data, remaining vector as Softmax graders Try data.Here 70% Softmax classifier training data are used as, 30% is used as Softmax grader test datas.
H) with Softmax classifier trainings data training Softmax graders;Surveyed with Softmax graders test data Trained Softmax graders are tried, i.e., Softmax graders test data are inputted into Softmax graders, by classification results It is compared with manually mark, if accuracy rate is not less than the threshold value of setting, then it is assumed that Softmax classifier trainings are completed.
Step 2, sentiment analysis is carried out with trained LSTM-MP models and Softmax graders, concretely comprised the following steps:
21) network text that need to be analyzed is obtained;
22) network text that need to be analyzed is pre-processed, obtains the Chinese sentence that need to be analyzed;
23) Chinese word segmentation is carried out to the Chinese sentence that need to be analyzed and builds term vector dictionary;
24) the Chinese sentence that need to be analyzed is converted into term vector sequence,
25) term vector sequence is converted into sentence vector with trained LSTM-MP models;
26) sentence vector is subjected to mood classification with trained Softmax graders.
The above method obtains network text, the conversion logic of the Chinese sentence of rational design to mathematic vector, first Term vector dictionary is constructed with Chinese words segmentation combination term vector learning tool, then carrying out sentence vector with LSTM-MP models turns Change, representative sentences vector is subjected to just negative emotional semantic classification eventually through Softmax graders, achievees the purpose that sentiment analysis, should Algorithm classification accuracy rate is high, efficient, flexibility ratio and a large amount of manual works for avoiding supervised learning method, effective to improve The efficiency and accuracy of text emotion tendency classification, automated integration degree is high to save a large amount of manpowers;This method entirety accuracy rate is 78.02%, the classification accuracy of wherein active mood reaches 77.58%, recall rate 81.19%, and F-measure is 79.10%;The classification accuracy of mood expected to fall reaches 78.55%, recall rate 74.59%, F-measure 74.55%.
The above is only the preferred embodiment of the present invention, it is noted that for the ordinary skill people of the art For member, without departing from the technical principles of the invention, some improvement and deformation can also be made, these are improved and deformation Also it should be regarded as protection scope of the present invention.

Claims (7)

  1. A kind of 1. Chinese text sentiment analysis method based on deep learning, it is characterised in that:Comprise the following steps,
    Step 1, training LSTM-MP models and Softmax graders;
    Detailed process is as follows:
    Obtain network text;
    The network text of acquisition is pre-processed, obtains the Chinese sentence in network text;
    Centering sentence carries out Chinese word segmentation and builds term vector dictionary;
    Some Chinese sentences are manually marked, as LSTM-MP model experiment data, remaining Chinese sentence is as LSTM- MP model training data;
    LSTM-MP models are trained with LSTM-MP model trainings data;
    LSTM-MP model experiments data are all converted into sentence vector with trained LSTM-MP models;
    Using some vectors as Softmax classifier training data, remaining vector number is tested as Softmax graders According to;
    Softmax graders are trained with Softmax classifier trainings data, is tested and trained with Softmax graders test data Good Softmax graders;
    Step 2, sentiment analysis is carried out with trained LSTM-MP models and Softmax graders.
  2. A kind of 2. Chinese text sentiment analysis method based on deep learning according to claim 1, it is characterised in that:If The process that meter multithreading reptile carries out network text acquisition is,
    Choose the url list of appropriate website homepage URL initialization reptiles;
    The html document of each website homepage is obtained, parses message in html document corresponding URL, URL corresponding to message is gone Url list is added to after weight;
    If there is the message newly issued, the corresponding URL of new information is added to url list;
    Corresponding html document is obtained according to URL;
    The html document that will be got, carries out information extraction using information extraction technique, extracts the message text part of page-out Afterwards, according to formulation form deposit local data base.
  3. A kind of 3. Chinese text sentiment analysis method based on deep learning according to claim 1, it is characterised in that:Net The process that network text is pre-processed is that the escape character in network text is replaced, to the mark lack of standardization in network text Point symbol is replaced.
  4. A kind of 4. Chinese text sentiment analysis method based on deep learning according to claim 1, it is characterised in that:It is right Chinese sentence carries out Chinese word segmentation and simultaneously builds term vector dictionary, and detailed process is,
    Centering sentence carries out Chinese word segmentation;
    Term vector learning tool is debugged;
    The Chinese word input word vector learning tool that Chinese word segmentation is obtained, carries out term vector dictionary creation.
  5. A kind of 5. Chinese text sentiment analysis method based on deep learning according to claim 4, it is characterised in that:Choosing Chinese word segmentation is carried out with Best Match Method.
  6. A kind of 6. Chinese text sentiment analysis method based on deep learning according to claim 1, it is characterised in that:Will LSTM-MP model training data are converted to term vector sequence, then train LSTM-MP models;
    LSTM-MP model experiment data are converted into term vector sequence, are then converted into trained LSTM-MP models Sentence vector.
  7. A kind of 7. Chinese text sentiment analysis method based on deep learning according to claim 1, it is characterised in that:With The process that trained LSTM-MP models and Softmax graders carry out sentiment analysis is,
    Obtain the network text that need to be analyzed;
    The network text that need to be analyzed is pre-processed, obtains the Chinese sentence that need to be analyzed;
    Chinese word segmentation is carried out to the Chinese sentence that need to be analyzed and builds term vector dictionary;
    The Chinese sentence that need to be analyzed is converted into term vector sequence;
    Term vector sequence is converted into sentence vector with trained LSTM-MP models;
    Sentence vector is subjected to mood classification with trained Softmax graders.
CN201711307041.5A 2017-12-11 2017-12-11 A kind of Chinese text sentiment analysis method based on deep learning Pending CN107944014A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711307041.5A CN107944014A (en) 2017-12-11 2017-12-11 A kind of Chinese text sentiment analysis method based on deep learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711307041.5A CN107944014A (en) 2017-12-11 2017-12-11 A kind of Chinese text sentiment analysis method based on deep learning

Publications (1)

Publication Number Publication Date
CN107944014A true CN107944014A (en) 2018-04-20

Family

ID=61946440

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711307041.5A Pending CN107944014A (en) 2017-12-11 2017-12-11 A kind of Chinese text sentiment analysis method based on deep learning

Country Status (1)

Country Link
CN (1) CN107944014A (en)

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108764753A (en) * 2018-06-06 2018-11-06 平安科技(深圳)有限公司 Test method, apparatus, computer equipment and the storage medium of business personnel's ability
CN108875024A (en) * 2018-06-20 2018-11-23 清华大学深圳研究生院 File classification method, system, readable storage medium storing program for executing and electronic equipment
CN108984775A (en) * 2018-07-24 2018-12-11 南京新贝金服科技有限公司 A kind of public sentiment monitoring method and system based on comment on commodity
CN109255027A (en) * 2018-08-27 2019-01-22 上海宝尊电子商务有限公司 A kind of method and apparatus of electric business comment sentiment analysis noise reduction
CN109388806A (en) * 2018-10-26 2019-02-26 北京布本智能科技有限公司 A kind of Chinese word cutting method based on deep learning and forgetting algorithm
CN109446331A (en) * 2018-12-07 2019-03-08 华中科技大学 A kind of text mood disaggregated model method for building up and text mood classification method
CN109710946A (en) * 2019-01-15 2019-05-03 福州大学 A kind of joint debate digging system and method based on dependence analytic tree
CN109739960A (en) * 2018-12-11 2019-05-10 中科恒运股份有限公司 Sentiment analysis method, sentiment analysis device and the terminal of text
CN109815456A (en) * 2019-02-13 2019-05-28 北京航空航天大学 A method of it is compressed based on term vector memory space of the character to coding
CN110059927A (en) * 2019-03-18 2019-07-26 平安科技(深圳)有限公司 Assessment method, device, equipment and storage medium on performance line
WO2019214145A1 (en) * 2018-05-10 2019-11-14 平安科技(深圳)有限公司 Text sentiment analyzing method, apparatus and storage medium
CN110502633A (en) * 2019-07-19 2019-11-26 中山大学 Network comment management method based on machine learning
CN110516242A (en) * 2019-08-26 2019-11-29 北京亚联之星信息技术有限公司 The method and apparatus for identifying negative financial Information based on machine learning algorithm
CN110909545A (en) * 2019-11-26 2020-03-24 电子科技大学 Black guide detection method based on gradient lifting algorithm
CN111221962A (en) * 2019-11-18 2020-06-02 重庆邮电大学 Text emotion analysis method based on new word expansion and complex sentence pattern expansion
CN111241842A (en) * 2018-11-27 2020-06-05 阿里巴巴集团控股有限公司 Text analysis method, device and system
CN109635274B (en) * 2018-10-25 2023-10-27 平安科技(深圳)有限公司 Text input prediction method, device, computer equipment and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070203875A1 (en) * 2006-02-24 2007-08-30 Intervoice Limited Partnership System and method for retrieving files from a file server using file attributes
CN105022725A (en) * 2015-07-10 2015-11-04 河海大学 Text emotional tendency analysis method applied to field of financial Web
CN105389331A (en) * 2015-09-21 2016-03-09 中国人民解放军国防科学技术大学 Open source software analyzing and comparing method based on market requirements
CN106599933A (en) * 2016-12-26 2017-04-26 哈尔滨工业大学 Text emotion classification method based on the joint deep learning model
CN106776581A (en) * 2017-02-21 2017-05-31 浙江工商大学 Subjective texts sentiment analysis method based on deep learning

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070203875A1 (en) * 2006-02-24 2007-08-30 Intervoice Limited Partnership System and method for retrieving files from a file server using file attributes
CN105022725A (en) * 2015-07-10 2015-11-04 河海大学 Text emotional tendency analysis method applied to field of financial Web
CN105389331A (en) * 2015-09-21 2016-03-09 中国人民解放军国防科学技术大学 Open source software analyzing and comparing method based on market requirements
CN106599933A (en) * 2016-12-26 2017-04-26 哈尔滨工业大学 Text emotion classification method based on the joint deep learning model
CN106776581A (en) * 2017-02-21 2017-05-31 浙江工商大学 Subjective texts sentiment analysis method based on deep learning

Cited By (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2019214145A1 (en) * 2018-05-10 2019-11-14 平安科技(深圳)有限公司 Text sentiment analyzing method, apparatus and storage medium
CN108764753A (en) * 2018-06-06 2018-11-06 平安科技(深圳)有限公司 Test method, apparatus, computer equipment and the storage medium of business personnel's ability
CN108875024A (en) * 2018-06-20 2018-11-23 清华大学深圳研究生院 File classification method, system, readable storage medium storing program for executing and electronic equipment
CN108875024B (en) * 2018-06-20 2020-10-20 清华大学深圳研究生院 Text classification method and system, readable storage medium and electronic equipment
CN108984775A (en) * 2018-07-24 2018-12-11 南京新贝金服科技有限公司 A kind of public sentiment monitoring method and system based on comment on commodity
CN108984775B (en) * 2018-07-24 2020-05-22 南京新贝金服科技有限公司 Public opinion monitoring method and system based on commodity comments
CN109255027A (en) * 2018-08-27 2019-01-22 上海宝尊电子商务有限公司 A kind of method and apparatus of electric business comment sentiment analysis noise reduction
CN109255027B (en) * 2018-08-27 2022-06-24 上海宝尊电子商务有限公司 E-commerce comment sentiment analysis noise reduction method and device
CN109635274B (en) * 2018-10-25 2023-10-27 平安科技(深圳)有限公司 Text input prediction method, device, computer equipment and storage medium
CN109388806B (en) * 2018-10-26 2023-06-27 北京布本智能科技有限公司 Chinese word segmentation method based on deep learning and forgetting algorithm
CN109388806A (en) * 2018-10-26 2019-02-26 北京布本智能科技有限公司 A kind of Chinese word cutting method based on deep learning and forgetting algorithm
CN111241842B (en) * 2018-11-27 2023-05-30 阿里巴巴集团控股有限公司 Text analysis method, device and system
CN111241842A (en) * 2018-11-27 2020-06-05 阿里巴巴集团控股有限公司 Text analysis method, device and system
CN109446331A (en) * 2018-12-07 2019-03-08 华中科技大学 A kind of text mood disaggregated model method for building up and text mood classification method
CN109739960A (en) * 2018-12-11 2019-05-10 中科恒运股份有限公司 Sentiment analysis method, sentiment analysis device and the terminal of text
CN109710946A (en) * 2019-01-15 2019-05-03 福州大学 A kind of joint debate digging system and method based on dependence analytic tree
CN109815456A (en) * 2019-02-13 2019-05-28 北京航空航天大学 A method of it is compressed based on term vector memory space of the character to coding
CN110059927A (en) * 2019-03-18 2019-07-26 平安科技(深圳)有限公司 Assessment method, device, equipment and storage medium on performance line
CN110059927B (en) * 2019-03-18 2024-04-16 平安科技(深圳)有限公司 Performance online assessment method, device, equipment and storage medium
CN110502633A (en) * 2019-07-19 2019-11-26 中山大学 Network comment management method based on machine learning
CN110516242A (en) * 2019-08-26 2019-11-29 北京亚联之星信息技术有限公司 The method and apparatus for identifying negative financial Information based on machine learning algorithm
CN111221962A (en) * 2019-11-18 2020-06-02 重庆邮电大学 Text emotion analysis method based on new word expansion and complex sentence pattern expansion
CN111221962B (en) * 2019-11-18 2023-05-26 重庆邮电大学 Text emotion analysis method based on new word expansion and complex sentence pattern expansion
CN110909545A (en) * 2019-11-26 2020-03-24 电子科技大学 Black guide detection method based on gradient lifting algorithm

Similar Documents

Publication Publication Date Title
CN107944014A (en) A kind of Chinese text sentiment analysis method based on deep learning
CN111144131B (en) Network rumor detection method based on pre-training language model
CN110059188B (en) Chinese emotion analysis method based on bidirectional time convolution network
CN109635108B (en) Man-machine interaction based remote supervision entity relationship extraction method
CN112199608B (en) Social media rumor detection method based on network information propagation graph modeling
CN106776538A (en) The information extracting method of enterprise's noncanonical format document
CN108363790A (en) For the method, apparatus, equipment and storage medium to being assessed
CN109558487A (en) Document Classification Method based on the more attention networks of hierarchy
CN107092596A (en) Text emotion analysis method based on attention CNNs and CCR
CN106021410A (en) Source code annotation quality evaluation method based on machine learning
CN106202032A (en) A kind of sentiment analysis method towards microblogging short text and system thereof
CN109740655B (en) Article scoring prediction method based on matrix decomposition and neural collaborative filtering
CN112231562A (en) Network rumor identification method and system
CN108536801A (en) A kind of civil aviaton's microblogging security public sentiment sentiment analysis method based on deep learning
CN107688576B (en) Construction and tendency classification method of CNN-SVM model
CN113590764B (en) Training sample construction method and device, electronic equipment and storage medium
CN106682089A (en) RNNs-based method for automatic safety checking of short message
CN110223675A (en) The screening technique and system of training text data for speech recognition
CN109325125B (en) Social network rumor detection method based on CNN optimization
CN105975497A (en) Automatic microblog topic recommendation method and device
CN107357895A (en) A kind of processing method of the text representation based on bag of words
CN110910175A (en) Tourist ticket product portrait generation method
CN111177010B (en) Software defect severity identification method
CN113094502A (en) Multi-granularity takeaway user comment sentiment analysis method
CN109299251A (en) A kind of abnormal refuse messages recognition methods and system based on deep learning algorithm

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20180420