CN103345525B - File classification method, device and processor - Google Patents

File classification method, device and processor Download PDF

Info

Publication number
CN103345525B
CN103345525B CN201310308226.3A CN201310308226A CN103345525B CN 103345525 B CN103345525 B CN 103345525B CN 201310308226 A CN201310308226 A CN 201310308226A CN 103345525 B CN103345525 B CN 103345525B
Authority
CN
China
Prior art keywords
emotion
probability
sample
classification
joint
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201310308226.3A
Other languages
Chinese (zh)
Other versions
CN103345525A (en
Inventor
李寿山
高伟
周国栋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou University
Original Assignee
Suzhou University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou University filed Critical Suzhou University
Priority to CN201310308226.3A priority Critical patent/CN103345525B/en
Publication of CN103345525A publication Critical patent/CN103345525A/en
Application granted granted Critical
Publication of CN103345525B publication Critical patent/CN103345525B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Abstract

The present invention relates to natural language processing and mode identification technology, disclose a kind of file classification method, device and processor.In the method, first the probability of each emotional category and each emotion classification it is belonging respectively to according to grader acquisition emotion test sample and mood test sample, then emotion test sample and mood test sample are chosen as common sample, and obtain the joint probability of described common sample, described joint probability and probability are weighted summation, and obtain the emotional category belonging to text to be sorted, and/or emotion classification according to the result of summation.During this, emotion and the emotion of text to be sorted are combined, to realize the emotional semantic classification to text, and/or emotion classification.Owing to there being close contact between emotion and the emotion of one text, by the emotion of text to be sorted and emotion associating, with treat classifying text classify time, the precision of text classification can be effectively improved.

Description

File classification method, device and processor
Technical field
The present invention relates to natural language processing and mode identification technology, particularly relate to a kind of text and divide Class method, device and processor.
Background technology
In the activity of the mankind, people is to the attitude of things or viewpoint, often by emotion and the shadow of emotion Ring.Wherein, emotion is whether people meets oneself needs to objective things and a kind of attitude of producing is experienced, Including positive, intermediate or passive etc., and emotion refer in people psychoreaction and impression, such as happiness, Anger, sorrow and pleasure etc..Emotional semantic classification contributes to the pluses and minuses of consumption habit and the product understanding user, in order to Product review is analyzed and decision-making;Contribute to understanding satisfaction and the demand of the common people, find society in time Can hot issue;Contribute to analyzing the focus public feelings information of current social, to user, enterprise or government's machine Structures etc. provide the foundation of decision references.And emotion has potential field, theme and the spy of independence in period By the emotion of people, point, is able to observe that the viewpoint of things is inclined to by people.
In view of this, it is often necessary to text is carried out emotional semantic classification or emotion classification.Existing emotional semantic classification Method and emotion sorting technique, mainly after obtaining text to be sorted, carry out feelings to emotion test sample Sense mark, carries out emotion mark, and treats respectively according to the test sample after mark mood test sample The text of classification carries out emotional semantic classification or emotion classification respectively.
But, inventor finds in the research process of the application, uses aforesaid way to treat classifying text When carrying out emotional semantic classification or emotion classification, the classification results precision got is relatively low.
Summary of the invention
In view of this, it is an object of the invention to provide a kind of file classification method, device and processor, Specific embodiments is as follows:
A kind of file classification method, including:
Obtaining grader, described grader includes: emotion classifiers and emotion grader;
Obtain emotion test sample and the mood test sample of text to be sorted, according to described grader, right Described emotion test sample and mood test sample are classified, and obtain described emotion test sample and feelings Thread test sample is belonging respectively to the probability of each emotional category and each emotion classification;
Choose emotion test sample and mood test sample as common sample, and obtain described common sample Joint probability, described joint probability includes: emotion joint probability, and/or emotion joint probability, wherein, Described emotion joint probability p (si|ej) representing: the emotion classification of certain sample is ejIn the case of, its emotion Classification is siProbability, described emotion joint probability p (ei|sj) representing: the emotional category of certain sample is sj's In the case of, its emotion classification is eiProbability;
Described probability and joint probability are weighted summation, and obtain according to the result of weighted sum Emotional category belonging to text to be sorted, and/or emotion classification.
Preferably, described probability and joint probability are weighted summation, and according to weighted sum Result obtains the emotional category belonging to text to be sorted, including:
Probability according to described common sample and emotion joint probability, obtain the feelings of each common sample Sense transition probability, wherein, the algorithm obtaining described transference probability is:
p reansfer ( s i | X ) = Σ j = 1 N e p ( s i | e i ) * p ( e i | X ) ;
Wherein, p (ej| X) it is the common sample X probability that belongs to all kinds of emotion classification, p (si|ej) it is Emotion joint probability, the emotion classification being certain common sample is ejTime, emotional category is siProbability, NeFor the emotion classification number of emotion classification, ptransfer(si| X) it is the transference probability of sample X;
According to described transference probability, obtain the final emotion probability of each common sample, wherein, obtain The algorithm taking described final emotion probability is:
pjoint(si|X)=(1-λ)p(si|X)+λptransfer(si|X);
Wherein, p (si| X) it is the sample X probability that belongs to all kinds of emotional category, ptransfer(si| X) it is sample The transference probability of X, λ is the weight parameter set, pjoint(si| X) it is final emotion probability;
Obtaining in described common sample, the common sample that the value of final emotion probability is maximum, as feelings Sample is demarcated in sense, and determines that described emotion demarcation emotional category belonging to sample is described text institute to be sorted The emotional category belonged to.
Preferably, described probability and joint probability are weighted summation, and according to weighted sum Result obtains the emotion classification belonging to text to be sorted, including:
Probability according to described common sample and joint probability, the emotion obtaining each common sample turns Moving probability, wherein, the algorithm obtaining described emotion transition probability is:
p reansfer ( e i | X ) = Σ j = 1 N s p ( e i | s j ) * p ( s j | X )
Wherein, p (sj| X) it is the common sample X probability that belongs to all kinds of emotional category, p (ei|sj) it is feelings Thread joint probability, the emotional category being certain common sample is sjTime, emotion classification is eiProbability, Ns For the emotional category number of emotional semantic classification, ptransfer(ei| X) it is the emotion transition probability of sample X;
According to described emotion transition probability, obtain the final emotion probability of each common sample, wherein, obtain The algorithm taking described final emotion probability is:
pjoint(ei|X)=(1-λ)p(ei|X)+λptransfer(ei|X);
Wherein, p (ei| X) it is the sample X probability that belongs to all kinds of emotion classification, ptransfer(ei| X) be The emotion transition probability of sample X, λ is the weight parameter set, pjoint(ei| X) it is final emotion probability;
Obtaining in described common sample, the common sample that the value of final emotion probability is maximum, as feelings Thread demarcates sample, and determines that described emotion demarcation emotion classification belonging to sample is described text institute to be sorted The emotion classification belonged to.
Preferably,
The interval of described weight parameter λ is: (0.4,0.8).
Preferably, described acquisition grader includes:
Obtain respectively and there is Emotion tagging and the corpus of emotion mark;
Described emotion classifiers is obtained according to the described corpus with Emotion tagging, and according to described tool The corpus marked of being in a bad mood obtains described emotion grader.
Preferably,
Described emotional category includes: positive, neutral and passive;
Described emotion classification includes: glad, sad, worried, angry, surprised, angry, loss of emotion.
Accordingly, the invention also discloses a kind of document sorting apparatus, including:
Grader acquisition module, is used for obtaining grader, and described grader includes: emotion classifiers and feelings Thread grader;
Probability acquisition module, for obtaining emotion test sample and the mood test sample of text to be sorted This, according to described grader, classify to emotion test sample and mood test sample, and obtain institute State emotion test sample and mood test sample is belonging respectively at the beginning of each emotional category and each emotion classification Beginning probability;
Joint probability acquisition module, is used for choosing emotion test sample and mood test sample as common sample Basis, and obtain the joint probability of described common sample, described joint probability includes: emotion joint probability, And/or emotion joint probability, wherein, described emotion joint probability p (si|ej) represent: the emotion of certain sample Classification is ejIn the case of, its emotional category is siProbability, described emotion joint probability p (ei|sj) represent: The emotional category of certain sample is sjIn the case of, its emotion classification is eiProbability;
Text categories acquisition module, for described probability and joint probability being weighted summation, and Result according to weighted sum obtains the emotional category belonging to text to be sorted, and/or emotion classification.
Preferably, described text categories acquisition module includes: emotional category obtains submodule, described emotion Classification obtains submodule and includes:
Transference probability acquiring unit, for the probability according to described common sample and emotion associating Probability, obtains the transference probability of each common sample, wherein, obtains described transference probability Algorithm is:
p reansfer ( s i | X ) = Σ j = 1 N e p ( s i | e i ) * p ( e j | X )
Wherein, p (ej| X) it is the common sample X probability that belongs to all kinds of emotion classification, p (si|ej) it is Emotion joint probability, the emotion classification being certain common sample is ejTime, emotional category is siProbability, NeFor the emotion classification number of emotion classification, ptransfer(si| X) it is the transference probability of sample X;
Final emotion probability acquiring unit, for according to described transference probability, obtains each common sample This final emotion probability, wherein, the algorithm obtaining described final emotion probability is:
pjoint(si|X)=(1-λ)p(si|X)+λptransfer(si|X);
Wherein, p (si| X) it is the sample X probability that belongs to all kinds of emotional category, ptransfer(si| X) it is sample The transference probability of X, λ is the weight parameter set, pjoint(si| X) it is final emotion probability;
Emotional category determines unit, is used for obtaining in described common sample, and the value of final emotion probability is maximum Common sample, demarcate sample as emotion, and determine that described emotion demarcates emotion belonging to sample Classification is the emotional category belonging to described text to be sorted.
Preferably, described text categories acquisition module includes: emotion classification obtains submodule, described emotion Classification obtains submodule and includes:
Emotion transition probability acquiring unit, for the probability according to described common sample and joint probability, Obtaining the emotion transition probability of each common sample, wherein, the algorithm obtaining described emotion transition probability is:
p transfer ( e i | X ) = Σ j = 1 N s p ( e i | s j ) * p ( s j | X )
Wherein, p (sj| X) it is the common sample X probability that belongs to all kinds of emotional category, p (ei|sj) it is feelings Thread joint probability, the emotional category being certain common sample is sjTime, emotion classification is eiProbability, Ns For the emotional category number of emotional semantic classification, ptransfer(ei| X) it is the emotion transition probability of sample X;
Final emotion probability acquiring unit, for according to described emotion transition probability, obtains each common sample This final emotion probability, wherein, the algorithm obtaining described final emotion probability is:
pjoint(ei|X)=(1-λ)p(ei|X)+λptransfer(ei|X);
Wherein, p (ei| X) it is the sample X probability that belongs to all kinds of emotion classification, ptransfer(ei| X) be The emotion transition probability of sample X, λ is the weight parameter set, pjoint(ei| X) it is final emotion probability;
Emotion classification determination unit, is used for obtaining in described common sample, and the value of final emotion probability is maximum Common sample, demarcate sample as emotion, and determine that described emotion demarcates emotion belonging to sample Classification is the emotion classification belonging to described text to be sorted.
Preferably, described grader acquisition module includes:
Corpus acquiring unit, has Emotion tagging and the corpus of emotion mark for obtaining respectively;
Grader acquiring unit, obtains described emotion for having the corpus of Emotion tagging described in basis Grader, and obtain described emotion grader according to the described corpus with emotion mark.
Accordingly, the invention also discloses a kind of processor, the chip of described processor is integrated with as above Described document sorting apparatus.
File classification method disclosed in this invention, first obtains emotion test sample and feelings according to grader Thread test sample is belonging respectively to the probability of each emotional category and each emotion classification, then chooses feelings Sense test sample and mood test sample are as common sample, and obtain the joint probability of described common sample, Described joint probability and described probability are weighted summation, and obtain according to the result of weighted sum Emotional category belonging to text to be sorted, and/or emotion classification.
During this, emotion and the emotion of text to be sorted are combined, to realize the feelings to text Sense classification, and/or emotion classification.Owing to there being close contact between emotion and the emotion of one text, Emotion and the emotion of text to be sorted are being combined, when text is classified, is taking full advantage of test specimens In Ben between emotion and emotion substantial connection provide additional amount of information, with treat classifying text classify time, The precision of text classification can be effectively improved.
It addition, file classification method disclosed in the present application, carry out for emotion and two kinds of classification task of emotion Combination learning, simply carries out literary composition according to emotion test sample and mood test sample relative in prior art For the method for this classification, add the quantity of test sample, be also correspondingly improved the essence of text classification Degree.
Accompanying drawing explanation
In order to be illustrated more clearly that the embodiment of the present invention or technical scheme of the prior art, below will be to reality Execute the required accompanying drawing used in example or description of the prior art to be briefly described, it should be apparent that below, Accompanying drawing in description is only some embodiments of the present invention, for those of ordinary skill in the art, On the premise of not paying creative work, it is also possible to obtain other accompanying drawing according to these accompanying drawings.
Fig. 1 is the workflow schematic diagram of a kind of file classification method disclosed in the embodiment of the present invention;
Fig. 2 is the workflow schematic diagram of the embodiment of the present invention another file classification method disclosed;
Fig. 3 is the workflow schematic diagram of the embodiment of the present invention another file classification method disclosed;
Fig. 4 is the accuracy rate experiment knot that file classification method disclosed in the embodiment of the present invention carries out emotional semantic classification Really comparison diagram;
Fig. 5 is the accuracy rate experiment knot that file classification method disclosed in the embodiment of the present invention carries out emotion classification Really comparison diagram;
Fig. 6 is the structural representation of a kind of document sorting apparatus disclosed in the embodiment of the present invention.
Detailed description of the invention
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out Clearly and completely describe, it is clear that described embodiment is only a part of embodiment of the present invention, and It is not all, of embodiment.Based on the embodiment in the present invention, those of ordinary skill in the art are not making Go out the every other embodiment obtained under creative work premise, broadly fall into the scope of protection of the invention.
In order to solve prior art when carrying out text classification, the problem that the nicety of grading that had is low, this Application provides a kind of file classification method, to improve the precision of text classification.
Workflow schematic diagram shown in Figure 1, file classification method disclosed in the present application includes:
Step S11, acquisition grader, described grader includes: emotion classifiers and emotion grader.
Pass through emotion classifiers, it is possible to obtaining the emotional category of text, wherein, described emotional category is general Including: positive, neutral and passive;By emotion grader, it is possible to obtain the emotion classification of text, its In, described emotion classification generally comprises: glad, sad, worried, angry, surprised, angry, merciless Thread.It is, of course, also possible to include other emotional category and emotion classification, this is not limited by the application.
Step S12, the emotion test sample obtaining text to be sorted and mood test sample, according to described point Class device, classifies to described emotion test sample and mood test sample, and obtains described emotion test Sample and mood test sample are belonging respectively to the probability of each emotional category and each emotion classification.
By described emotion classifiers, it is possible to obtain each emotion test sample and adhere to separately and each emotional category Probability;By described emotion grader, it is possible to obtain each mood test sample and adhere to separately and each The probability of emotion classification.
Step S13, choose emotion test sample and mood test sample as common sample, and obtain described The joint probability of common sample, described joint probability includes: emotion joint probability, and/or emotion associating is general Rate, wherein, described emotion joint probability p (si|ej) representing: the emotion classification of certain sample is ejSituation Under, its emotional category is siProbability, described emotion joint probability p (ei|sj) represent: the feelings of certain sample Sense classification is sjIn the case of, its emotion classification is eiProbability.
Described emotion test sample and mood test sample is being got by emotion classifiers and emotion grader After this probability, select a number of emotion test sample and mood test sample as common sample This.Wherein, emotion test sample and mood test sample as common sample typically have equal number Amount, such as, selects 400 emotion test samples and 400 mood test samples as jointly simultaneously Sample.It is of course also possible to select the whole emotion test samples in step S12 and mood test sample to make For common sample.
When needing to carry out emotional semantic classification for text to be sorted, after choosing common sample, need to obtain The emotion joint probability of described common sample;When needing to carry out emotion classification for text to be sorted, then Need to obtain the emotion joint probability of described common sample;When text to be sorted simultaneously need to carry out emotion and divide When class and emotion classification, then need to obtain emotion joint probability and the emotion joint probability of described common sample. Wherein, described emotion joint probability p (si|ej) representing: the emotion classification of certain sample is ejIn the case of, Its emotional category is siProbability;Described emotion joint probability p (ei|sj) represent: the emotion class of certain sample Wei sjIn the case of, its emotion classification is eiProbability.
Utilize common sample acquisition emotion joint probability p (si|ej) and emotion joint probability p (ei|sj) algorithm For:
p ( s i | e j ) = 1 + Σ X ∈ L Sharing I ( Y s ( X ) = s i . Y E ( X ) = e j ) N s + Σ X ∈ L Sharing I ( Y E ( X ) = e j )
p ( e i | s j ) = 1 + Σ X ∈ L Sharing I ( Y E ( X ) = e i , Y s ( X ) = s j ) N e + Σ X ∈ L Sharing I ( Y s ( X ) = s j )
In above-mentioned algorithmic formula, LSharingRepresent is common sample, say, that sample X belongs to common Sample;YS(X) emotional category belonging to sample X, Y are representedE(X) the emotion classification belonging to sample X is represented, NsFor the emotional category number of emotional semantic classification, NeIt is the number of emotion classification in emotion classification, and, I ( x ) = 1 , x = True 0 , x = False . Wherein, I(x) it is an indicator function, when x is True, its value is 1, X be false duration be 0.
According to above-mentioned algorithm, when calculating the emotion joint probability of a certain common sample, need to know Described common sample is belonging respectively to the probability of each emotional category and each emotion classification.And in step In S12, according to emotion classifiers and emotion grader, obtain described emotion test sample and emotion has been surveyed Sample is originally belonging respectively to the probability of each emotional category and each emotion classification, then may utilize emotion Grader, obtains emotion test sample and belongs to the probability of each emotion classification, and utilize emotional semantic classification Device, obtains mood test sample and belongs to the probability of each emotional category, and then public according to above-mentioned algorithm Formula obtains the emotion joint probability of common sample, and/or emotion joint probability.Or, it would however also be possible to employ people The mode of work mark, utilizes manual type to carry out emotion in the common sample chosen from emotion test sample Mark, and utilize manual type to carry out Emotion tagging in the common sample chosen from mood test sample, Thus improve the described emotion joint probability of acquisition, and/or the precision of emotion joint probability.
Step S14, described probability and joint probability are weighted summation, and according to weighted sum Result obtains the emotional category belonging to text to be sorted, and/or emotion classification.
When classifying text only need to be treated carries out emotional semantic classification, need to obtain common sample probability and Emotion joint probability;When classifying text only need to be treated carries out emotion classification, need to obtain common sample Probability and emotion joint probability;Treat classifying text when needs simultaneously and carry out emotional semantic classification and emotion is divided During class, then need to obtain the probability of described common sample, emotion joint probability and emotion joint probability.
File classification method disclosed in this invention, first obtains emotion test sample and feelings according to grader Thread test sample is belonging respectively to the probability of each emotional category and each emotion classification, then chooses feelings Sense test sample and mood test sample are as common sample, and obtain the joint probability of described common sample, Described joint probability and described probability are weighted summation, and obtain according to the result of weighted sum Emotional category belonging to text to be sorted, and/or emotion classification.
During this, emotion and the emotion of text to be sorted are combined, to realize the feelings to text Sense classification, and/or emotion classification.Owing to there being close contact between emotion and the emotion of one text, Emotion and the emotion of text to be sorted are being combined, when text is classified, is taking full advantage of test specimens In Ben between emotion and emotion substantial connection provide additional amount of information, with treat classifying text classify time, The precision of text classification can be effectively improved.
It addition, file classification method disclosed in the present application, carry out for emotion and two kinds of classification task of emotion Combination learning, simply carries out literary composition according to emotion test sample and mood test sample relative in prior art For the method for this classification, add the quantity of test sample, be also correspondingly improved the essence of text classification Degree.
Further, workflow schematic diagram shown in Figure 2, in step S14, to described initially Probability and joint probability are weighted summation, and obtain belonging to text to be sorted according to the result of weighted sum Emotional category, including:
Step S141, according to the probability of described common sample and emotion joint probability, obtain each altogether With the transference probability of sample, wherein, the algorithm obtaining described transference probability is:
P transfer ( s i | X ) = Σ j = 1 N e p ( s i | e j ) * p ( e j | X ) ;
Wherein, p (ej| X) it is the common sample X probability that belongs to all kinds of emotion classification, p (si|ej) it is Emotion joint probability, the emotion classification being certain common sample is ejTime, emotional category is siProbability, NeFor the emotion classification number of emotion classification, ptransfer(si| X) it is the transference probability of sample X.
In step s 12, according to emotion grader, common sample X can be obtained and belong to all kinds of emotion class Other probability p (ej| X), and, the scheme provided according to step S13, emotion associating can be obtained Probability p (si|ej), substituted in above-mentioned algorithmic formula, transference Probability p can be obtainedtransfer(si|X)。
Step S142, according to described transference probability, obtain the final emotion probability of each common sample, Wherein, the algorithm obtaining described final emotion probability is:
pjoint(si|X)=(1-λ)p(si|X)+λptransfer(si|X);
Wherein, p (si| X) it is the sample X probability that belongs to all kinds of emotional category, ptransfer(si| X) it is sample The transference probability of X, λ is the weight parameter set, pjoint(si| X) it is final emotion probability.
In above-mentioned algorithmic formula, λ is weighting parameters, is any number between 0 to 1, is used for regulating Emotion probability and the importance of emotion probability, be embodied as the interval of middle λ usually: (0.4,0.8). When emotion probability is identical with the importance of emotion probability, the value of λ is 0.5.
Step S143, obtain in described common sample, the common sample that the value of final emotion probability is maximum, Demarcate sample as emotion, and determine that described emotion demarcation emotional category belonging to sample is treated described in being Emotional category belonging to classifying text.
The emotional category that text to be sorted finally determines is:Wherein, Ysentiment(X) represent is the emotional category that finally determines of text to be sorted, pjoint(si| X) represent is sample The final emotion probability of X,Represent is the common of the value maximum of final emotion probability Emotional category belonging to sample.
According to the operation of step S141 to step S143, by the probability got in advance and emotion Transition probability, can obtain the emotional category of text to be sorted.Further, text is being carried out emotional semantic classification Time, take full advantage of the extraneous information that the substantial connection between emotional training sample and emotion and emotion provides Amount, improves the accuracy rate of emotional semantic classification.
Further, workflow schematic diagram shown in Figure 3, in step S14, to described initially Probability and joint probability are weighted summation, and obtain belonging to text to be sorted according to the result of weighted sum Emotion classification, including:
Step S144, according to the probability of described common sample and joint probability, obtain each common sample This emotion transition probability, wherein, the algorithm obtaining described emotion transition probability is:
p transfer ( e i | X ) = Σ j = 1 N s p ( e i | s j ) * p ( s j | X ) .
Wherein, p (sj| X) it is the common sample X probability that belongs to all kinds of emotional category, p (ei|sj) it is emotion Joint probability, the emotional category being certain common sample is sjTime, emotion classification is eiProbability, NsFor The emotional category number of emotional semantic classification, ptransfer(ei| X) it is the emotion transition probability of sample X;
In step s 12, according to emotion classifiers, common sample X can be obtained and belong to all kinds of emotion class Other probability p (sj| X), and, the scheme provided according to step S13, emotion associating can be obtained Probability p (ei|sj), substituted in above-mentioned algorithmic formula, emotion transition probability p can be obtainedtransfer(ei|X)。
Step S145, according to described emotion transition probability, obtain the final emotion probability of each common sample, Wherein, the algorithm obtaining described final emotion probability is:
pjoint(ei|X)=(1-λ)p(ei|X)+λptransfer(ei|X);
Wherein, p (ei| X) it is the sample X probability that belongs to all kinds of emotion classification, ptransfer(ei| X) be The emotion transition probability of sample X, λ is the weight parameter set, pjoint(ei| X) it is final emotion probability.
In above-mentioned algorithmic formula, λ is weighting parameters, can choose any number between 0 to 1, be used for Regulation emotion probability and the importance of emotion probability, be embodied as the interval of middle λ usually: (0.4, 0.8).When emotion probability is identical with the importance of emotion probability, the value of λ is 0.5.
Step S146, obtain in described common sample, the common sample that the value of final emotion probability is maximum, Demarcate sample as emotion, and determine that described emotion demarcation emotion classification belonging to sample is treated described in being Emotion classification belonging to classifying text.
The emotion classification that text to be sorted finally determines is:Wherein, Yemotion(X) represent is the emotion classification that finally determines of text to be sorted, pjoint(ei| X) represent is sample The final emotion probability of X,Represent is the common sample of the value maximum of final emotion probability Emotion classification belonging to Ben.
According to scheme disclosed in step S144 and step S146, it is possible to shifted by the emotion of common sample Probability and probability, determine the emotion classification belonging to text to be sorted, and, text is being carried out feelings During thread classification, make use of the extraneous information that the substantial connection between emotion training sample and emotion and emotion provides Amount, improves the accuracy rate of emotion classification.
Further, disclosed described acquisition grader in step s 11, including:
First, acquisition has Emotion tagging and the corpus of emotion mark respectively;
Then, according to the described corpus described emotion classifiers of acquisition with Emotion tagging, and according to The described corpus with emotion mark obtains described emotion grader.
Wherein, there is Emotion tagging and the corpus of emotion mark, by obtaining training text in advance, Then described training text is carried out artificial Emotion tagging and emotion mark and obtains.Manually marking During note, it is mainly based upon the unitary characteristic of word.
Described emotion classifiers and emotion grader utilize the algorithm of machine learning, such as maximum entropy algorithm, SVM (Support Vector Machine, support vector machine) algorithm, bayesian algorithm etc., and there is emotion The disaggregated model that the corpus of mark and emotion mark generates, for this disaggregated model, works as input During sample to be sorted, emotion classifiers then can export the probability of the text each emotional category corresponding, Emotion grader then can export the probability of the text each emotion classification corresponding.It is to say, pass through Described state emotion classifiers and emotion grader, emotion test sample and mood test sample can be obtained respectively Originally the probability of each emotional category and each emotion classification it is belonging respectively to.
File classification method disclosed in the present application, it is achieved that emotion test sample and mood test sample Combination learning, thus improve the precision of text classification.In order to prove the effectiveness of the method for the present invention, We utilize Ren-CECps language material to carry out emotion and the experiment of emotion classification.Described Ren-CECps language Material comprises 34,603 sentences.In specific experiment, have selected 3000,5000,8000 sentences respectively As emotion test sample, additionally select 3000,5000,8000 sentences as mood test sample, And according to scheme disclosed in the present application, text carried out emotional semantic classification and emotion classification.Wherein, weight λ is set It is set to 0.5, and respectively selects 400 sentences as altogether from described emotion test sample and mood test sample Same sample.
File classification method disclosed in the present application shown in Figure 4 emotion on Ren-CECps language material Classification accuracy experimental result, wherein, what axis of abscissas represented is the quantity of the emotion test sample chosen, What axis of ordinates represented is the precision of classification, and what " single task emotional semantic classification " represented is only to utilize to have feelings The method that the sample of sense mark carries out emotional semantic classification, what " combination learning emotional semantic classification " represented is according to this The disclosed mode of application carries out emotional semantic classification.
From the correction data shown in Fig. 4, it can be seen that the method for the present invention is in emotional semantic classification task Classifying quality is compared traditional emotional semantic classification and is significantly improved, and average percentage improvement is 2.4%.
File classification method disclosed in the present application shown in Figure 5 emotion on Ren-CECps language material Classification accuracy experimental result, wherein, what axis of abscissas represented is the quantity of the mood test sample chosen, What axis of ordinates represented is the precision of classification, and what " classification of single task emotion " represented is only to utilize to have feelings The method that the sample of thread mark carries out emotion classification, what " classification of combination learning emotion " represented is according to this The disclosed mode of application carries out emotion classification.
From the correction data shown in Fig. 5, it can be seen that the method for the present invention is in emotion classification task Classifying quality is compared traditional emotion classification and is significantly improved, and average percentage improvement is 2.9%.
According to above-mentioned experimental result, the file classification method that the application provides take full advantage of emotion and Being closely connected between emotion, thus improve the precision that text is classified.
Accordingly, disclosed herein as well is a kind of document sorting apparatus, structural representation shown in Figure 6 Figure, described document sorting apparatus includes: grader acquisition module 11, probability acquisition module 12, connection Conjunction probability acquisition module 13 and text categories acquisition module 14, wherein,
Described grader acquisition module 11, is used for obtaining grader, and described grader includes: emotional semantic classification Device and emotion grader;
Described probability acquisition module 12, for obtaining emotion test sample and the emotion of text to be sorted Test sample, according to described grader, classifies to emotion test sample and mood test sample, and Obtain described emotion test sample and mood test sample is belonging respectively to each emotional category and each emotion class Other probability;
Described joint probability acquisition module 13, is used for choosing emotion test sample and mood test sample conduct Common sample, and obtain the joint probability of described common sample, described joint probability includes: emotion is combined Probability, and/or emotion joint probability, wherein, described emotion joint probability p (si|ej) represent: certain sample Emotion classification be ejIn the case of, its emotional category is siProbability, described emotion joint probability p (ei|sj) Represent: the emotional category of certain sample is sjIn the case of, its emotion classification is eiProbability;
Described text categories acquisition module 14, for being weighted asking to described probability and joint probability With, and obtain the emotional category belonging to text to be sorted, and/or emotion classification according to the result of weighted sum.
Document sorting apparatus disclosed in the present application, first passes through grader acquisition module and obtains emotion classifiers With emotion grader;Then by probability acquisition module, by the described emotion classifiers that gets and Emotion grader, obtains emotion test sample and mood test sample is belonging respectively to each emotional category with each The probability of individual emotion classification;Joint probability acquisition module chooses common sample, and obtain described jointly The joint probability of sample;Finally, by text categories acquisition module according to described probability and joint probability It is weighted summation, and according to the emotional category belonging to the result acquisition text to be sorted of weighted sum, and/ Or emotion classification.
Document sorting apparatus disclosed in the present application, is emotion and the emotion of text to be sorted to be combined, To realize the emotional semantic classification to text, and/or emotion classification.Due between emotion and the emotion of one text There is close contact, by the emotion of text to be sorted and emotion associating, carry out treating classifying text During classification, the precision of text classification can be effectively improved.
It addition, document sorting apparatus disclosed in the present application, carry out for emotion and two kinds of classification task of emotion Combination learning, simply carries out literary composition according to emotion test sample and mood test sample relative in prior art For the method for this classification, add the quantity of sample, be also correspondingly improved the precision of text classification.
Further, text categories acquisition module 14 disclosed in the present application includes: emotional category obtains Submodule, described emotional category obtains submodule and includes: transference probability acquiring unit, final emotion Probability acquiring unit and emotional category determine unit, wherein,
Described transference probability acquiring unit, for the probability according to described common sample and emotion Joint probability, obtains the transference probability of each common sample, wherein, obtains described transference general The algorithm of rate is:
p reansfer ( s i | X ) = Σ j = 1 N e p ( s i | e j ) * p ( e j | X ) ;
Wherein, p (ej| X) it is the common sample X probability that belongs to all kinds of emotion classification, p (si|ej) it is Emotion joint probability, the emotion classification being certain common sample is ejTime, emotional category is siProbability, NeFor the emotion classification number of emotion classification, ptransfer(si| X) it is the transference probability of sample X;
Described final emotion probability acquiring unit, for according to described transference probability, obtains each altogether With the final emotion probability of sample, wherein, the algorithm obtaining described final emotion probability is:
pjoint(si|X)=(1-λ)p(si|X)+λptransfer(si|X);
Wherein, p (si| X) it is the sample X probability that belongs to all kinds of emotional category, ptransfer(si| X) it is sample The transference probability of X, λ is the weight parameter set, pjoint(si| X) it is final emotion probability;
Described emotional category determines unit, is used for obtaining in described common sample, the value of final emotion probability Maximum common sample, demarcates sample as emotion, and determines that described emotion is demarcated belonging to sample Emotional category is the emotional category belonging to described text to be sorted.
True according to described transference probability acquiring unit, final emotion probability acquiring unit and emotional category Cell, it is possible to realize the emotional semantic classification to text.
Further, described text categories acquisition module includes: emotion classification obtains submodule, described feelings Thread classification obtain submodule include: emotion transition probability acquiring unit, final emotion probability acquiring unit and Emotion classification determination unit, wherein,
Described emotion transition probability acquiring unit, for the probability according to described common sample and associating Probability, obtains each common sample emotion transition probability, wherein, obtains the calculation of described emotion transition probability Method is:
p transfer ( e i | X ) = Σ j = 1 N s p ( e i | s j ) * p ( s j | X )
Wherein, p (sj| X) it is the common sample X probability that belongs to all kinds of emotional category, p (ei|sj) it is feelings Thread joint probability, the emotional category being certain common sample is sjTime, emotion classification is eiProbability, Ns For the emotional category number of emotional semantic classification, ptransfer(ei| X) it is the emotion transition probability of sample X;
Described final emotion probability acquiring unit, for according to described emotion transition probability, obtains each altogether With the final emotion probability of sample, wherein, the algorithm obtaining described final emotion probability is:
pjoint(ei|X)=(1-λ)p(ei|X)+λptransfer(ei|X);
Wherein, p (ei| X) it is the sample X probability that belongs to all kinds of emotion classification, ptransfer(ei| X) be The emotion transition probability of sample X, λ is the weight parameter set, pjoint(ei| X) it is final emotion probability;
Described emotion classification determination unit, is used for obtaining in described common sample, the value of final emotion probability Maximum common sample, demarcates sample as emotion, and determines that described emotion is demarcated belonging to sample Emotion classification is the emotion classification belonging to described text to be sorted.
True according to described emotion transition probability acquiring unit, final emotion probability acquiring unit and emotion classification Cell, it is possible to realize the emotion to text and classify.
Further, described grader acquisition module includes: corpus acquiring unit and grader obtain Unit, wherein,
Described corpus acquiring unit, has Emotion tagging and the training of emotion mark for obtaining respectively Language material;
Described grader acquiring unit, obtains described for having the corpus of Emotion tagging described in basis Emotion classifiers, and obtain described emotion grader according to the described corpus with emotion mark.
Accordingly, the invention also discloses a kind of processor, the chip of described processor is integrated with above-mentioned Document sorting apparatus.
Wherein, described document sorting apparatus includes: grader acquisition module 11, probability acquisition module 12, joint probability acquisition module 13 and text categories acquisition module 14.Further, described text categories Acquisition module 14 includes: emotional category obtains submodule, and/or emotion classification obtains submodule, described Emotional category obtains submodule and includes: transference probability acquiring unit, final emotion probability acquiring unit Determining unit with emotional category, described emotion classification obtains submodule and includes: emotion transition probability obtains single Emotion probability acquiring unit first, final and emotion classification determination unit.
It addition, described grader acquisition module 11 includes: corpus acquiring unit and grader obtain single Unit.
Utilize described processor that text is classified, it is possible to emotion and the emotion of text to be sorted to be carried out Associating, to realize the emotional semantic classification to text, and/or emotion classification.Emotion and feelings due to one text There is close contact between thread, emotion and the emotion of text to be sorted are being combined, with to literary composition to be sorted When originally classifying, the precision of text classification can be effectively improved.
File classification method disclosed in the present application and document sorting apparatus, it is possible to be applied to the emotion to text, And/or emotion classification carries out the technical field classified.Such as, when getting newsletter archive and microblogging text, By file classification method disclosed in the present application and document sorting apparatus, it is possible to obtain the emotion class of the text Not, contribute to understanding satisfaction and the demand of the common people, find the hot spot of society in time, and, pass through Get the emotion classification of the text, contribute to getting the common people and the viewpoint of things is inclined to.
Those of ordinary skill in the art are it is to be appreciated that combine each of the embodiments described herein description The unit of example and algorithm steps, it is possible to electronic hardware or computer software and the knot of electronic hardware Incompatible realization.These functions perform with hardware or software mode actually, depend on the spy of technical scheme Fixed application and design constraint.Professional and technical personnel can use not Tongfang to each specifically should being used for Method realizes described function, but this realization is it is not considered that beyond the scope of this invention.
Those skilled in the art is it can be understood that arrive, and for convenience and simplicity of description, above-mentioned retouches The specific works process of system, device and the unit stated, is referred to the correspondence in preceding method embodiment Process, does not repeats them here.
In several embodiments provided herein, it should be understood that disclosed system, device and Method, can realize by another way.Such as, device embodiment described above is only shown Meaning property, such as, the division of described unit, be only a kind of logic function and divide, actual can when realizing There to be other dividing mode, the most multiple unit or assembly can in conjunction with or be desirably integrated into another System, or some features can ignore, or do not perform.Another point, shown or discussed each other Coupling direct-coupling or communication connection can be the INDIRECT COUPLING by some interfaces, device or unit Or communication connection, can be electrical, machinery or other form.
The described unit illustrated as separating component can be or may not be physically separate, makees The parts shown for unit can be or may not be physical location, i.e. may be located at a place, Or can also be distributed on multiple NE.Can select according to the actual needs part therein or The whole unit of person realizes the purpose of the present embodiment scheme.
It addition, each functional unit in each embodiment of the present invention can be integrated in a processing unit, Can also be that unit is individually physically present, it is also possible to two or more unit are integrated in a list In unit.
If described function realizes and as independent production marketing or use using the form of SFU software functional unit Time, can be stored in a computer read/write memory medium.Based on such understanding, the present invention's Part or the part of this technical scheme that prior art is contributed by technical scheme the most in other words can Embodying with the form with software product, this computer software product is stored in a storage medium, Including some instructions with so that computer equipment (can be personal computer, server, or The network equipment etc.) perform all or part of step of method described in each embodiment of the present invention.And it is aforesaid Storage medium includes: USB flash disk, portable hard drive, read only memory (ROM, Read-Only Memory), Random access memory (RAM, Random Access Memory), magnetic disc or CD etc. are various can To store the medium of program code.
Described above to the disclosed embodiments, makes professional and technical personnel in the field be capable of or uses The present invention.Multiple amendment to these embodiments will be aobvious and easy for those skilled in the art See, generic principles defined herein can without departing from the spirit or scope of the present invention, Realize in other embodiments.Therefore, the present invention is not intended to be limited to the embodiments shown herein, And it is to fit to the widest scope consistent with principles disclosed herein and features of novelty.

Claims (11)

1. a file classification method, it is characterised in that including:
Obtaining grader, described grader includes: emotion classifiers and emotion grader;
Obtain emotion test sample and the mood test sample of text to be sorted, according to described grader, right Described emotion test sample and mood test sample are classified, and obtain described emotion test sample and feelings Thread test sample is belonging respectively to the probability of each emotional category and each emotion classification;
Choose emotion test sample and mood test sample as common sample, and obtain described common sample Joint probability, described joint probability includes: emotion joint probability, and/or emotion joint probability, wherein, Described emotion joint probability p (si|ej) representing: the emotion classification of certain sample is ejIn the case of, its emotion Classification is siProbability, described emotion joint probability p (ei|sj) representing: the emotional category of certain sample is sj's In the case of, its emotion classification is eiProbability;
Described probability and joint probability are weighted summation, and obtain according to the result of weighted sum Emotional category belonging to text to be sorted, and/or emotion classification.
Method the most according to claim 1, it is characterised in that general to described probability and associating Rate is weighted summation, and obtains the emotional category belonging to text to be sorted according to the result of weighted sum, Including:
Probability according to described common sample and emotion joint probability, obtain the feelings of each common sample Sense transition probability, wherein, the algorithm obtaining described transference probability is:
p t r a n s f e r ( s i | X ) = Σ j = 1 N e p ( s i | e j ) * p ( e j | X ) ;
Wherein, p (ej| X) it is the common sample X probability that belongs to all kinds of emotion classification, p (si|ej) it is Emotion joint probability, the emotion classification being certain common sample is ejTime, emotional category is siProbability, NeFor the emotion classification number of emotion classification, ptransfer(si| X) it is the transference probability of sample X;
According to described transference probability, obtain the final emotion probability of each common sample, wherein, obtain The algorithm taking described final emotion probability is:
pjoint(si| X)=(1-λ) p (si|X)+λptransfer(si|X);
Wherein, p (si| X) it is the sample X probability that belongs to all kinds of emotional category, ptransfer(si| X) it is sample The transference probability of X, λ is the weight parameter set, pjoint(si| X) it is final emotion probability;
Obtaining in described common sample, the common sample that the value of final emotion probability is maximum, as feelings Sample is demarcated in sense, and determines that described emotion demarcation emotional category belonging to sample is described text institute to be sorted The emotional category belonged to.
Method the most according to claim 1, it is characterised in that general to described probability and associating Rate is weighted summation, and obtains the emotion classification belonging to text to be sorted according to the result of weighted sum, Including:
Probability according to described common sample and joint probability, the emotion obtaining each common sample turns Moving probability, wherein, the algorithm obtaining described emotion transition probability is:
p t r a n s f e r ( e i | X ) = Σ j = 1 N s p ( e i | s j ) * p ( s j | X )
Wherein, p (sj| X) it is the common sample X probability that belongs to all kinds of emotional category, p (ei|sj) it is feelings Thread joint probability, the emotional category being certain common sample is sjTime, emotion classification is eiProbability, Ns For the emotional category number of emotional semantic classification, ptransfer(ei| X) it is the emotion transition probability of sample X;
According to described emotion transition probability, obtain the final emotion probability of each common sample, wherein, obtain The algorithm taking described final emotion probability is:
pjoint(ei| X)=(1-λ) p (ei|X)+λptransfer(ei|X);
Wherein, p (ei| X) it is the sample X probability that belongs to all kinds of emotion classification, ptransfer(ei| X) be The emotion transition probability of sample X, λ is the weight parameter set, pjoint(ei| X) it is final emotion probability;
Obtaining in described common sample, the common sample that the value of final emotion probability is maximum, as feelings Thread demarcates sample, and determines that described emotion demarcation emotion classification belonging to sample is described text institute to be sorted The emotion classification belonged to.
The most according to the method in claim 2 or 3, it is characterised in that
The interval of described weight parameter λ is: (0.4,0.8).
Method the most according to claim 1, it is characterised in that described acquisition grader includes:
Obtain respectively and there is Emotion tagging and the corpus of emotion mark;
Described emotion classifiers is obtained according to the described corpus with Emotion tagging, and according to described tool The corpus marked of being in a bad mood obtains described emotion grader.
Method the most according to claim 1, it is characterised in that
Described emotional category includes: positive, neutral and passive;
Described emotion classification includes: glad, sad, worried, angry, surprised, angry, loss of emotion.
7. a document sorting apparatus, it is characterised in that including:
Grader acquisition module, is used for obtaining grader, and described grader includes: emotion classifiers and feelings Thread grader;
Probability acquisition module, for obtaining emotion test sample and the mood test sample of text to be sorted This, according to described grader, classify to emotion test sample and mood test sample, and obtain institute State emotion test sample and mood test sample is belonging respectively at the beginning of each emotional category and each emotion classification Beginning probability;
Joint probability acquisition module, is used for choosing emotion test sample and mood test sample as common sample Basis, and obtain the joint probability of described common sample, described joint probability includes: emotion joint probability, And/or emotion joint probability, wherein, described emotion joint probability p (si|ej) represent: the emotion of certain sample Classification is ejIn the case of, its emotional category is siProbability, described emotion joint probability p (ei|sj) represent: The emotional category of certain sample is sjIn the case of, its emotion classification is eiProbability;
Text categories acquisition module, for described probability and joint probability being weighted summation, and Result according to weighted sum obtains the emotional category belonging to text to be sorted, and/or emotion classification.
Device the most according to claim 7, it is characterised in that described text categories acquisition module bag Including: emotional category obtains submodule, described emotional category obtains submodule and includes:
Transference probability acquiring unit, for the probability according to described common sample and emotion associating Probability, obtains the transference probability of each common sample, wherein, obtains described transference probability Algorithm is:
p t r a n s f e r ( s i | X ) = Σ j = 1 N e p ( s i | e j ) * p ( e j | X ) ;
Wherein, p (ej| X) it is the common sample X probability that belongs to all kinds of emotion classification, p (si|ej) it is Emotion joint probability, the emotion classification being certain common sample is ejTime, emotional category is siProbability, NeFor the emotion classification number of emotion classification, ptransfer(si| X) it is the transference probability of sample X;
Final emotion probability acquiring unit, for according to described transference probability, obtains each common sample This final emotion probability, wherein, the algorithm obtaining described final emotion probability is:
pjoint(si| X)=(1-λ) p (si|X)+λptransfer(si|X);
Wherein, p (si| X) it is the sample X probability that belongs to all kinds of emotional category, ptransfer(si| X) it is sample The transference probability of X, λ is the weight parameter set, pjoint(si| X) it is final emotion probability;
Emotional category determines unit, is used for obtaining in described common sample, and the value of final emotion probability is maximum Common sample, demarcate sample as emotion, and determine that described emotion demarcates emotion belonging to sample Classification is the emotional category belonging to described text to be sorted.
Device the most according to claim 7, it is characterised in that described text categories acquisition module bag Including: emotion classification obtains submodule, described emotion classification obtains submodule and includes:
Emotion transition probability acquiring unit, for the probability according to described common sample and joint probability, Obtaining the emotion transition probability of each common sample, wherein, the algorithm obtaining described emotion transition probability is:
p t r a n s f e r ( e i | X ) = Σ j = 1 N s p ( e i | s j ) * p ( s j | X )
Wherein, p (sj| X) it is the common sample X probability that belongs to all kinds of emotional category, p (ei|sj) it is feelings Thread joint probability, the emotional category being certain common sample is sjTime, emotion classification is eiProbability, Ns For the emotional category number of emotional semantic classification, ptransfer(ei| X) it is the emotion transition probability of sample X;
Final emotion probability acquiring unit, for according to described emotion transition probability, obtains each common sample This final emotion probability, wherein, the algorithm obtaining described final emotion probability is:
pjoint(ei| X)=(1-λ) p (ei|X)+λptransfer(ei|X);
Wherein, p (ei| X) it is the sample X probability that belongs to all kinds of emotion classification, ptransfer(ei| X) be The emotion transition probability of sample X, λ is the weight parameter set, pjoint(ei| X) it is final emotion probability;
Emotion classification determination unit, is used for obtaining in described common sample, and the value of final emotion probability is maximum Common sample, demarcate sample as emotion, and determine that described emotion demarcates emotion belonging to sample Classification is the emotion classification belonging to described text to be sorted.
Device the most according to claim 7, it is characterised in that described grader acquisition module bag Include:
Corpus acquiring unit, has Emotion tagging and the corpus of emotion mark for obtaining respectively;
Grader acquiring unit, obtains described emotion for having the corpus of Emotion tagging described in basis Grader, and obtain described emotion grader according to the described corpus with emotion mark.
11. 1 kinds of processors, it is characterised in that be integrated with claim 7 on the chip of described processor To the document sorting apparatus described in any one of claim 10.
CN201310308226.3A 2013-07-22 2013-07-22 File classification method, device and processor Active CN103345525B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310308226.3A CN103345525B (en) 2013-07-22 2013-07-22 File classification method, device and processor

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310308226.3A CN103345525B (en) 2013-07-22 2013-07-22 File classification method, device and processor

Publications (2)

Publication Number Publication Date
CN103345525A CN103345525A (en) 2013-10-09
CN103345525B true CN103345525B (en) 2016-08-24

Family

ID=49280320

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310308226.3A Active CN103345525B (en) 2013-07-22 2013-07-22 File classification method, device and processor

Country Status (1)

Country Link
CN (1) CN103345525B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104809232B (en) * 2015-05-11 2018-06-01 苏州大学 A kind of Sentence-level mood sorting technique based on mood transition probability between sentence
CN107368611B (en) * 2017-08-11 2018-06-26 同济大学 A kind of short text classification method
CN108256552A (en) * 2017-12-18 2018-07-06 广东广业开元科技有限公司 Common people close friend's index assessment method and system based on big data sorting algorithm
CN108197667A (en) * 2018-01-30 2018-06-22 安徽斛兵信息科技有限公司 Personal abnormal emotion detection method and device based on dialogue

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2693923C (en) * 2007-07-19 2013-01-29 Research In Motion Limited Method and system for reducing contexts for context based compression systems
JP5156428B2 (en) * 2008-02-13 2013-03-06 Kddi株式会社 Content classification device, program, and recording medium
CN101876987A (en) * 2009-12-04 2010-11-03 中国人民解放军信息工程大学 Overlapped-between-clusters-oriented method for classifying two types of texts
CN102945268A (en) * 2012-10-25 2013-02-27 北京腾逸科技发展有限公司 Method and system for excavating comments on characteristics of product

Also Published As

Publication number Publication date
CN103345525A (en) 2013-10-09

Similar Documents

Publication Publication Date Title
CN102682124B (en) Emotion classifying method and device for text
Wan et al. Long-length legal document classification
CN106445919A (en) Sentiment classifying method and device
CN105930411A (en) Classifier training method, classifier and sentiment classification system
CN102541838B (en) Method and equipment for optimizing emotional classifier
WO2022062193A1 (en) Individual credit assessment and explanation method and apparatus based on time sequence attribution analysis, and device and storage medium
CN110619044A (en) Emotion analysis method, system, storage medium and equipment
CN103345525B (en) File classification method, device and processor
CN110210027A (en) Fine granularity sentiment analysis method, apparatus, equipment and medium based on integrated study
CN109800309A (en) Classroom Discourse genre classification methods and device
CN103942214B (en) Natural image classification method and device on basis of multi-modal matrix filling
Orosoo et al. Performance analysis of a novel hybrid deep learning approach in classification of quality-related English text
De et al. A novel model of supervised clustering using sentiment and contextual analysis for fake news detection
Syn et al. Using latent semantic analysis to identify quality in use (qu) indicators from user reviews
Lenin et al. Learning from Imbalanced Educational Data Using Ensemble Machine Learning Algorithms.
CN103279549A (en) Method and device for acquiring target data of target objects
Tiwana et al. Comparative analysis of context based classification of twitter
Corringham et al. Bert classification of paris agreement climate action plans
Cates et al. A machine learning approach to research curation for investment process
CN112084376A (en) Map knowledge based recommendation method and system and electronic device
Chaib et al. Improved Multi-label Medical Text Classification Using Features Cooperation
Thripuranthakam et al. Stock Market Prediction Using Machine Learning and Twitter Sentiment Analysis: A Survey
Jalaeian Zaferani et al. Automatic personality recognition and perception using deep learning and supervised evaluation method
Daroczy et al. Text Classification Kernels for Quality Prediction over the C3 Data Set
AU2021107101A4 (en) A machine learning based system for classification using deviation parameters

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CP02 Change in the address of a patent holder
CP02 Change in the address of a patent holder

Address after: Suzhou City, Jiangsu province 215123 Xiangcheng District Ji Road No. 8

Patentee after: Soochow University

Address before: 215123 Suzhou Industrial Park, Jiangsu Road, No. 199

Patentee before: Soochow University