CN107066442A - Detection method, device and the electronic equipment of mood value - Google Patents

Detection method, device and the electronic equipment of mood value Download PDF

Info

Publication number
CN107066442A
CN107066442A CN201710081287.9A CN201710081287A CN107066442A CN 107066442 A CN107066442 A CN 107066442A CN 201710081287 A CN201710081287 A CN 201710081287A CN 107066442 A CN107066442 A CN 107066442A
Authority
CN
China
Prior art keywords
mood
classification results
tested
value
text set
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710081287.9A
Other languages
Chinese (zh)
Inventor
王胜
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba Group Holding Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd filed Critical Alibaba Group Holding Ltd
Priority to CN201710081287.9A priority Critical patent/CN107066442A/en
Publication of CN107066442A publication Critical patent/CN107066442A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • G06F16/353Clustering; Classification into predefined classes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/10Office automation; Time management
    • G06Q10/105Human resources

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Human Resources & Organizations (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Strategic Management (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Artificial Intelligence (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Tourism & Hospitality (AREA)
  • General Business, Economics & Management (AREA)
  • Databases & Information Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The embodiments of the invention provide a kind of detection method, device and the electronic equipment of mood value, wherein, the detection method of mood value includes:Feature extraction is carried out to the multiple texts being tested in text set, multiple characteristic vectors are generated;The multiple characteristic vector is inputted into mood grader respectively, generate based on positive mood, negative emotions, neutral mood multiple first classification results;According to the multiple first classification results, the corresponding mood value of tested text set is calculated.Detection method, device and the electronic equipment of the mood value of the embodiment of the present invention, mood value is calculated by multiple first classification results based on positive mood, negative emotions, neutral mood, the calculation of the classification results for being based only on positive mood and negative emotions compared with prior art, results in more accurate mood value.

Description

Detection method, device and the electronic equipment of mood value
Technical field
The present invention relates to field of computer technology, more particularly to a kind of detection method, device and the electronic equipment of mood value.
Background technology
The core resource of enterprise be people in itself, to being the management to people in the nature of management of enterprise, good manager needs The affective state of each employee and the emotional state to working and living on corresponding post are grasped in time, by understanding employee Last state, by employee and the effective and reasonable distribution of other ERMs, so as to reach Enterprise Efficiency optimization collocation.
Enterprise can produce substantial amounts of text information during office, the daily work report of such as employee, Weekly Work Report, Work exchange content (work mail, work chat record) etc., can be by the extraction and analysis to these text informations, to detect The mood value of employee, so as to preferably grasp the mood dynamic of employee.
In the prior art, the analysis of text based mood is main calculates detected personnel's using two methods classified Mood value, is generally only divided into two classes for the emotional semantic classification for being detected personnel, i.e., positive mood and negative emotions, based on such The mood value that model algorithm is obtained not is very accurate.Because often neutral mood also influences whether employee in real work life The weights in attitude and temperature and then influence model algorithm on a certain event, finally will also have influence on company manager to member The judgement and decision-making of work affective state.
The content of the invention
The embodiment of the present invention provides a kind of detection method, device and the electronic equipment of mood value, to improve the feelings detected The accuracy of thread value.
An aspect of of the present present invention there is provided a kind of detection method of mood value, including:
Feature extraction is carried out to the multiple texts being tested in text set, multiple characteristic vectors are generated;
The multiple characteristic vector is inputted into mood grader respectively, generation is based on positive mood, negative emotions, neutral feelings Multiple first classification results of thread;
According to the multiple first classification results, the corresponding mood value of tested text set is calculated.
The second aspect of the present invention there is provided a kind of detection means of mood value, including:
Feature vector generation module, for carrying out feature extraction to the multiple texts being tested in text set, generates multiple spies Levy vector;
Mood grader, for obtaining the multiple characteristic vector as input, generation is based on positive mood, negative feelings Multiple first classification results of thread, neutral mood;
Mood value computing module, for according to the multiple first classification results, calculating the corresponding feelings of tested text set Thread value.
The third aspect of the present invention there is provided a kind of electronic equipment, including:
Memory, for storage program;
Processor, coupled to the memory, for performing described program, for:
Feature extraction is carried out to the multiple texts being tested in text set, multiple characteristic vectors are generated;
The multiple characteristic vector is inputted into mood grader respectively, generation is based on positive mood, negative emotions, neutral feelings Multiple first classification results of thread;
According to the multiple first classification results, the corresponding mood value of tested text set is calculated.
Detection method, device and the electronic equipment of the mood value of the embodiment of the present invention, by based on positive mood, negative feelings Thread, multiple first classification results of neutral mood calculate mood value, compared with prior art be based only on positive mood and negative The calculation of the classification results of face mood, results in more accurate mood value.
Described above is only the general introduction of technical solution of the present invention, in order to better understand the technological means of the present invention, And can be practiced according to the content of specification, and in order to allow above and other objects of the present invention, feature and advantage can Become apparent, below especially exemplified by the embodiment of the present invention.
Brief description of the drawings
Fig. 1 is the technical principle schematic diagram of inventive embodiments;
The schematic flow sheet that Fig. 2 applies for the technical scheme of the embodiment of the present invention in enterprise;
Fig. 3 is the schematic flow sheet of the detection method of the mood value of the embodiment of the present invention one;
Fig. 4 is the structural representation of the mathematical modeling of the online awareness device grader of the embodiment of the present invention one;
Fig. 5 is the structural representation of the detection means of the mood value of the embodiment of the present invention two;
Fig. 6 is the structural representation of the electronic equipment of the embodiment of the present invention three.
Embodiment
The exemplary embodiment of the disclosure is more fully described below with reference to accompanying drawings.Although showing the disclosure in accompanying drawing Exemplary embodiment, it being understood, however, that may be realized in various forms the disclosure without should be by embodiments set forth here Limited.On the contrary, these embodiments are provided to facilitate a more thoroughly understanding of the present invention, and can be by the scope of the present disclosure Complete conveys to those skilled in the art.
Relational language explanation:
Mood mining analysis:Based on related data index, excavated using relevant way method and analyze tested personnel's Mood, possible mood includes positive active mood, negative negative feeling, neutral mood.
Perceptron (Perception):Perceptron is by american computer scientist's Rosenblat (Frank Roseblatt) proposed in nineteen fifty-seven, be earliest artificial neural network, a kind of feedforward of simplest form can be regarded as Formula artificial neural network and binary linearity grader.Perceptron principle comes from simplest neuron models adaptive characteristic, It includes input layer and output layer, and input layer and output layer are joined directly together.Perceptron can be used in the embodiment of the present invention It is used as mood grader.
Corpus:Refer to the extensive e-text storehouse through scientific sampling and processing.By computer analysis tool, researcher Language theory and the application study of correlation can be carried out.Corpus is deposited in the basic resource of corpus linguistics research, corpus What is put is the true linguistic data occurred in the actual use of language, is the main money of empiricism speech research method Source.Applied to lexicography, language teaching, conventional language research, the research based on statistics or example in natural language processing etc. Aspect.
TF-IDF(term frequency–inverse document frequency):It is that one kind is used for information retrieval With the conventional weighting technique of data mining.More directly for, TF-IDF is a kind of statistical method, for assess a word to N The importance of wherein one article in article or a corpus.The number of times that one of word occurs in an article can not Show the importance of the word, such as similar " everybody ", " ", " ", " obtaining " are although time that this common word occurs in article Number is a lot, but can not represent that these words are just critically important, and now we are accomplished by TF-IDF statistical methods to calculate the weight of some word The property wanted.The number of times that the importance of word occurs with it in an article is directly proportional increase, but simultaneously can be as it is in N articles The frequency occurred in (article set) is inversely proportional decline.
Stop words:Stop words refers to during text-processing that if running into them then stopping immediately being handled, and is thrown away not It is included in the word of statistical disposition.These words, which are thrown away, can increase recall precision and the degree of accuracy.Stop words mainly includes English character, number Word, mathematical character, punctuation mark and the extra-high Chinese word character of frequency of use etc..
The technical principle to the present invention is illustrated below:
The basic fundamental thought of the embodiment of the present invention is that positive mood and negative emotions will be based only in the prior art The methods of two classification calculate the mood value of detected personnel (employee of such as company), change into based on positive mood, negative Mood calculates final mood value with the methods of three classification of neutral mood.In view of in real work life, often largely Neutral mood also influence whether attitude and temperature of the employee to a certain event, therefore, during the determination of mood value, embody The word or sentence of neutral mood are also particularly important.The present invention passes through based on positive mood, negative emotions and neutral mood Three classification results calculate final mood value, the accuracy of mood value is substantially increased, so as to being preferably that enterprise manages Judgement and decision-making of the reason person to employee's affective state provide reliable foundation.
Specifically, in embodiments of the present invention, by the sub-classifier of three two classification (in actual applications using perception Device as the embodiment of the present invention mood grader) realize three classification to positive mood, negative emotions and neutral mood.Will The value of the classification results of positive mood, negative emotions and neutral mood is respectively set as 1, -1 and 0 these three mood values.Such as Shown in Fig. 1, it is the technical principle schematic diagram of the embodiment of the present invention, in embodiments of the present invention, constructs three perceptrons:Its Middle perceptron 1 is only that 1 and -1 are classified to mood value;Perceptron 2 is only classified to mood value for 1 and 0;Perceptron 3 be only that 0 and -1 are classified to mood value, and final classification results are drawn final by three perceptrons with the mode voted The classification results of mood value, i.e., finally belong to positive mood, negative emotions and neutral mood.
The technical scheme of the embodiment of the present invention can apply in enterprise, and enterprise can produce a large amount of during office Text information, such as the daily work report of employee, Weekly Work Report, work exchange content (work mail, work chat record), Analyzed by the mood to these texts, it can be deduced that the mood value of employee, so as to carry out the important of internal decision making as enterprise Foundation.
As shown in Fig. 2 the schematic flow sheet that the technical scheme that it is the embodiment of the present invention is applied in enterprise, the technical side Case can be realized on the server or business platform of OA (office automation) system of enterprise.First, server can be right The textual resources such as weekly, daily paper, chat record and the contact mail of employee, which carry out data acquisition, (can gather certain time period Interior textual resources, for example, can carry out the collection of textual resources with unit around), can be to all kinds of during collection Textual resources are screened and integrated and simple data cleansing, then form (the i.e. pending mood analysis of tested text set Text intersection), the tested text set can be formed by some employee, so that the mood value finally given is should The mood value of employee, naturally it is also possible to form tested text set for some department or whole company, so as to obtain bigger In the range of mood value.Then, each text in the tested text set based on formation carries out feature extraction, formed feature to Amount, and progress mood classification in above-mentioned mood grader is input to, obtain based on positive mood, negative emotions and neutral mood Mood classification results.Finally, the classification results based on each text, it is determined that the entirely mood value of tested text set, at this During, can set different weights to each text according to actual needs, for example can according to the type of text it is different come Weight (weight of such as weekly is higher than the weight of daily paper) is set, can also be set according to the difference of the employee of the generation of text Weight (weight of the weight of the high employee of such as rank employee low higher than rank).
By such as Fig. 2 processing procedure, the mood value corresponding to tested text set can be obtained, so as to be used as business administration The foundation of decision-making.Because tested text set corresponds generally to certain time period (with unit around), enterprise can also be based on the past The Employees'Emotions value of period predicts the mood value of the employee of current or next period, so as to preferably to member The mood tendency of work is tackled.
Technical scheme will be further illustrated by several specific embodiments below.
Embodiment one
As shown in figure 3, its schematic flow sheet for the detection method of the mood value of the embodiment of the present invention one, the present embodiment Method includes:
S101:Feature extraction is carried out to the multiple texts being tested in text set, multiple characteristic vectors are generated.The step can be with Performed by the OA system servers of enterprise or the business platform of enterprises, as previously mentioned, tested text set can come From in textual resources such as the weekly of employee, daily paper, chat record and contact mails.
Specifically, in this step, characteristic vector can be produced in the following way:
S1011:It is each in text set for being tested according to the keyword dictionary that the keyword quantity pre-established is N Text, calculates the corresponding TF-IDF values of each keyword in keyword dictionary.Keyword dictionary mentioned here is for above-mentioned What tested text set was extracted, the dimension of characteristic vector can be greatly reduced by extracting keyword, so as to improve mood The efficiency of classification, sets up process about keyword dictionary, will be described in detail later.
S1012:The corresponding characteristic vector of each text is generated according to the corresponding TF-IDF values of each keyword, wherein, it is special The dimension for levying vector is that the component in N, each dimension of characteristic vector is the corresponding TF- of each keyword in keyword dictionary IDF values.That is, it is the characteristic vector being made up of N number of TF-IDF values to finally enter the characteristic vector in mood grader.
S102:Multiple characteristic vectors are inputted into mood grader respectively, generation is based on positive mood, negative emotions, neutrality Multiple first classification results of mood.The step can be held by the OA system servers of enterprise or the business platform of enterprises OK, mood grader can be arranged on OA system servers or business platform.Specifically, as described previously, mood Grader can be realized with the sub-classifier of three two classification, can specifically be included:For entering to positive mood and negative emotions First sub-classifier of row classification, for positive mood and neutral mood are classified the second sub-classifier, for centering The 3rd sub-classifier that vertical mood and negative emotions are classified.Based on the framework of such mood grader, this step can be with Specifically include:
S1021:Each characteristic vector is inputted into each sub-classifier respectively, the second classification knot of each sub-classifier is generated Really.In actual applications, each sub-classifier can use online awareness device (Perception) grader, the grader mathematics The structural representation of model is as shown in figure 4, its specific formula form of expression is as follows:
Wherein, w=[w1,w2,w3,...,wn]T, be the weight vectors (or weight vector) of perceptron, be set-point or Person's preset value, x=[x1,x2,x3,...,xn]TFor the n dimensional feature vectors of input, the x in the n dimensional feature vectors1To xnCorrespond to P in Fig. 41To pn, b is bias, is set-point or preset value.
Function f (L) it is defined as:
Wherein, the independent variable of the X representative functions f (L) in formula (2), equivalent to the w in formula (1)TX, f (L) are excitation function (or being activation primitive), t is scalar output (i.e. the mood value as classification results of perceptron final output).Perceptron Output behavior is tried to achieve after the characteristic vector of input and the inner product of weight vector, through a scalar result obtained by an activation primitive.
In actual applications, the value of the first classification results is typically set as 1, -1 and 0, corresponds respectively to positive feelings Thread, negative emotions and neutral mood, and in the mathematical modeling of said sensed device, the original output result of each perceptron For -1 and+1, the difference of implication classified according to each perceptron mood is so also needed to, output result is converted to and first The second unified classification results of the values of classification results, for example ,+1 and -1 can directly corresponding to of being exported of the first sub-classifier In positive mood and negative emotions, and+the 1 of the second mood sub-classifier output corresponds to positive mood and neutral feelings with -1 Thread, that is, need to be converted to+1 and 0 unified with the first classification results, and similarly the 3rd mood sub-classifier is also required to carry out output knot The conversion of fruit.In a word, it with the values of the first classification results is unified that the value of the second classification results of final output, which is,.
S1022:Ballot computing is carried out to the second classification results of each sub-classifier, each characteristic vector is obtained corresponding First classification results.
As shown in fig. 1, according to the difference of each sub-classifier, same characteristic vector may go out second classification results Existing the second different classification results.For example, some characteristic vector is positive mood in the classification results of the first sub-classifier, the The classification results of two sub-classifiers are positive mood, and are neutral mood in the classification results of the 3rd sub-classifier.So need By Voting Algorithm, to obtain the first final classification results.Specific ballot mode can be tied with the second classification occupied the majority Fruit exports as the first classification results.Such as the example above, the second classification results that there is two sub-classifiers are positive mood, Second classification results of one sub-classifier are neutral mood, then the first classification results exported are that positive mood, i.e. output are made It is 1 for the mood value of classification results.
S103:According to multiple first classification results, the corresponding mood value of tested text set is calculated.The step can be by looking forward to The OA system servers of industry or the business platform of enterprises are performed.In step S102 above, for tested text The full text of this concentration has calculated the first classification results, in this step, and the first classification results of full text are carried out After conformity calculation, it is possible to obtain the corresponding mood value of tested text set.In view of each text is for final Employees'Emotions For analysis, significance level is probably differentiated.For example, tested text set from the daily paper of the same employee, weekly and toward Carry out the documents such as mail to be constituted, in view of weekly is more regular, therefore, weekly is for the analysis of mood value, its significance level Daily paper can be higher than.Considered based on such, can be in the corresponding mood value of the final tested text set of calculating determination, by each text Originally processing is weighted, weights can be decided according to the actual requirements.For another example several employees of the tested text set by a department Daily paper, weekly and the contact document such as mail constituted, in this case, can also be set according to the rank of different employees The weights of text are determined, for example, the weights of the text of department manager are higher than the weights of the text of general employee.
Based on above-mentioned consideration, step S103 can be specially:According to the weighted value of each text set in advance, to each First classification results of text are weighted summation operation, to obtain the corresponding mood value of tested text set.
In addition, as the linear two classification device of standard, can also further use PA (passive-aggressive) When algorithm improves accuracy, i.e. repairing positive weights vector, a corrected parameter is added, when predicting correct, it is not necessary to adjustment power Weight vector, during prediction error, active accommodation weight vectors, its advantage is to reduce the classification number of mistake, so as to improve correlation Accuracy.
The tested corresponding mood value of text set embodies the mood value for the personnel to form tested text set, for example, the tested text This collection comes from the daily paper within past one week, weekly and contact mail of same employee, then final mood value just embodies The synthesis mood value that the employee was embodied past one week.Value setting based on the first classification results above, finally The corresponding mood value of tested text set can value between+1 and-1, by final mood value, can understand and obtain exactly The emotional status of employee is known, so as to provide important evidence for the internal control and decision-making of enterprise.
Above-mentioned text can be an envelope mail, a daily paper or weekly or division statement therein etc.. In practical application, the calculating of Employees'Emotions value is general in units of week, according to the mood value of employee's last week, prediction this week employee's Mood is with respect to being last week more preferable or more bad.The calculation formula of mood value is:Assuming that i-th week mood tendency is+1, -1 and 0 The quantity of text be respectively xi、yi、zi, then the mood value of the prediction in this week is SiFor:
si=(zi*w1+w2)*(w3*xi+w4*yi+w5) ... ... ... ... ... ... formula (3)
Wherein, w1、w2、w3、w4The weighted value being inclined to for each different mood.For neutral text, it is not pair The mood value of employee is not contributed, and it can reflect temperature of some employee to something, is inclined to by adjusting different moods The weighted value of text cause the final mood value to be reasonable and accurate.
Elaborate the extraction process of keyword again below, specifically, before step S101, can also include:
S1001:Remove the stop words in tested text set.Stop words mainly includes auxiliary words of mood, adverbial word, preposition, connection Word etc., these usual words have no clear and definite meaning, and only putting it into a complete sentence just has certain effect, such as common " ", " " etc.The dimension of characteristic vector can be greatly reduced by removing stop words.
S1002:According to default corpus, it is tested in text set, extracts the word conduct that TF-IDF values are higher than predetermined threshold value The keyword of the tested text set, and according to the whole keywords extracted, generation is tested the corresponding keyword of text set with this Dictionary, keyword is included in corpus.Keyword can be specifically selected based on FudanNLP corpus, predetermined threshold value can To set according to actual needs.
By two steps above, can remove a large amount of be inclined on text semantic influences and unconspicuous word, greatly Reduce the dimension of characteristic vector.Counted according to actual experiment, when not extracting keyword, for carrying out Employees'Emotions detection In text, containing more than 16000 word in 100000 training texts, and extract after keyword, 100000 training texts contain There are 2000 keywords, so that the dimension of characteristic vector greatly reduces.
Embodiment two
As shown in figure 5, its for the embodiment of the present invention two mood value detection means structural representation, including feature to Measure generation module 11, mood grader 12, mood value computing module 13:
Feature vector generation module 11, for carrying out feature extraction to the multiple texts being tested in text set, is generated multiple Characteristic vector.In this feature vector generation module 11, feature extraction is carried out to the multiple texts being tested in text set, generated many The processing of individual characteristic vector can be specifically included:
According to the keyword dictionary that the keyword quantity pre-established is N, for each text being tested in text set, meter Calculate the corresponding TF-IDF values of each keyword in keyword dictionary;
The corresponding characteristic vector of each text is generated according to the corresponding TF-IDF values of each keyword, wherein, characteristic vector Dimension be N, the component in each dimension of characteristic vector is the corresponding TF-IDF values of each keyword in keyword dictionary.
Mood grader 12, for obtaining multiple characteristic vectors as input, generation based on positive mood, negative emotions, Multiple first classification results of neutral mood.Specifically, mood grader 12 can be specifically included:For to positive mood and negative The first sub-classifier 121 that face mood is classified, the second subclassification for being classified to positive mood and neutral mood Device 122, the 3rd sub-classifier 123 for being classified to neutral mood and negative emotions.
Based on the framework of above three sub-classifier, the multiple characteristic vectors of above-mentioned acquisition are as input, and generation is based on just Face mood, negative emotions, the processing of multiple first classification results of neutral mood can be specifically included:
Each characteristic vector is inputted into each sub-classifier respectively, the second classification results of each sub-classifier are generated;
Ballot computing is carried out to the second classification results of each sub-classifier, each characteristic vector is obtained corresponding first point Class result.
Mood value computing module 13, for according to multiple first classification results, calculating the corresponding mood of tested text set Value.In mood value computing module 13, the first classification results using value as 1, -1 and 0 can correspond respectively to positive mood, it is negative Mood and neutral mood, correspondingly, according to multiple first classification results, calculate the place of the corresponding mood value of tested text set Reason can be specifically included:According to the weighted value of each text set in advance, the first classification results of each text are added Summation operation is weighed, to obtain the corresponding mood value of tested text set.
In addition, the detection means of the present embodiment, can also include:
Stop words removes module 14, for removing the stop words in tested text set;
Keyword extracting module 15, for according to default corpus, extracting in multiple texts in tested text set, TF-IDF values are tested the keyword of text set higher than the word of predetermined threshold value as this, and according to the whole keywords extracted, it is raw Into keyword dictionary corresponding with the tested text set, keyword is included in corpus.
The detection means of the mood value of the present embodiment, passes through three points based on positive mood, negative emotions and neutral mood Class result calculates final mood value, substantially increases the accuracy of mood value, so as to being preferably company manager couple The judgement and decision-making of employee's affective state provide reliable foundation.
Embodiment three
The built-in function and structure of the detection means of mood value are the foregoing described, Fig. 6 is electricity provided in an embodiment of the present invention The structural representation of sub- equipment, as shown in fig. 6, in practice, the detection means of the mood value in above-described embodiment can be realized as one Electronic equipment is planted, can be included:Memory 111 and processor 112.
Memory 111, for storage program.
In addition to said procedure, memory 111 is also configured to the other various data of storage to support in electronic equipment On operation.The example of these data includes the instruction of any application program or method for operating on an electronic device, example Such as electric business platform, telephone book data, message, picture, video etc. on shopping platform on line, line.
Memory 111 can realize by any kind of volatibility or non-volatile memory device or combinations thereof, Such as static RAM (SRAM), Electrically Erasable Read Only Memory (EEPROM), erasable programmable is read-only Memory (EPROM), programmable read only memory (PROM), read-only storage (ROM), magnetic memory, flash memory, disk Or CD.
Processor 112, coupled to memory 111, for performing the program in memory 111, for:
Feature extraction is carried out to the multiple texts being tested in text set, multiple characteristic vectors are generated;
Multiple characteristic vectors are inputted into mood grader respectively, generated based on positive mood, negative emotions, neutral mood Multiple first classification results;
According to multiple first classification results, the corresponding mood value of tested text set is calculated.
Wherein, mood grader can include:For the first subclassification classified to positive mood and negative emotions Device, for positive mood and neutral mood are classified the second sub-classifier, for entering to neutral mood and negative emotions 3rd sub-classifier of row classification,
Multiple characteristic vectors are then inputted into mood grader respectively, generation is based on positive mood, negative emotions, neutral mood Multiple first classification results can include:
Each characteristic vector is inputted into each sub-classifier respectively, the second classification results of each sub-classifier are generated;
Ballot computing is carried out to the second classification results of each sub-classifier, each characteristic vector is obtained corresponding first point Class result.
Above-mentioned specific processing operation is described in detail in preceding embodiment, will not be repeated here.
Further, as shown in fig. 6, electronic equipment can also include:Communication component 113, power supply module 114, audio-frequency assembly 115th, other components such as display 116.Members are only schematically provided in Fig. 6, are not meant to that electronic equipment only includes Fig. 6 Shown component.
Communication component 113 is configured to facilitate the communication of wired or wireless way between electronic equipment and other equipment.Electricity Sub- equipment can access the wireless network based on communication standard, such as WiFi, 2G or 3G, or combinations thereof.It is exemplary at one In embodiment, communication component 113 receives broadcast singal or broadcast correlation from external broadcasting management system via broadcast channel Information.In one exemplary embodiment, communication component 113 also includes near-field communication (NFC) module, to promote junction service. For example, radio frequency identification (RFID) technology, Infrared Data Association (IrDA) technology, ultra wide band (UWB) skill can be based in NFC module Art, bluetooth (BT) technology and other technologies are realized.
Power supply module 114, electric power is provided for the various assemblies of electronic equipment.Power supply module 114 can include power management System, one or more power supplys, and other generate, manage and distribute the component that electric power is associated with for electronic equipment.
Audio-frequency assembly 115 is configured as output and/or input audio signal.For example, audio-frequency assembly 115 includes a Mike Wind (MIC), when electronic equipment be in operator scheme, when such as call model, logging mode and speech recognition mode, microphone by with It is set to reception external audio signal.The audio signal received can be further stored in memory 111 or via communication set Part 113 is sent.In certain embodiments, audio-frequency assembly 115 also includes a loudspeaker, for exports audio signal.
Display 116 includes screen, and its screen can include liquid crystal display (LCD) and touch panel (TP).If screen Curtain includes touch panel, and screen may be implemented as touch-screen, to receive the input signal from user.Touch panel includes one Individual or multiple touch sensors are with the gesture on sensing touch, slip and touch panel.Touch sensor can not only sense tactile Touch or sliding action border, but also detection and touch or slide related duration and pressure.
One of ordinary skill in the art will appreciate that:Realizing all or part of step of above-mentioned each method embodiment can lead to The related hardware of programmed instruction is crossed to complete.Foregoing program can be stored in a computer read/write memory medium.The journey Sequence upon execution, performs the step of including above-mentioned each method embodiment;And foregoing storage medium includes:ROM, RAM, magnetic disc or Person's CD etc. is various can be with the medium of store program codes.
Finally it should be noted that:Various embodiments above is merely illustrative of the technical solution of the present invention, rather than its limitations;To the greatest extent The present invention is described in detail with reference to foregoing embodiments for pipe, it will be understood by those within the art that:Its according to The technical scheme described in foregoing embodiments can so be modified, or which part or all technical characteristic are entered Row equivalent substitution;And these modifications or replacement, the essence of appropriate technical solution is departed from various embodiments of the present invention technology The scope of scheme.

Claims (12)

1. a kind of detection method of mood value, it is characterised in that including:
Feature extraction is carried out to the multiple texts being tested in text set, multiple characteristic vectors are generated;
The multiple characteristic vector is inputted into mood grader respectively, generated based on positive mood, negative emotions, neutral mood Multiple first classification results;
According to the multiple first classification results, the corresponding mood value of tested text set is calculated.
2. detection method according to claim 1, it is characterised in that
The mood grader includes:For positive mood and negative emotions are classified the first sub-classifier, for pair The second sub-classifier that positive mood and neutral mood are classified, for neutral mood and negative emotions are classified the Three sub-classifiers,
The multiple characteristic vector is inputted into mood grader respectively, generated based on positive mood, negative emotions, neutral mood Multiple first classification results include:
Each characteristic vector is inputted into each sub-classifier respectively, the second classification results of each sub-classifier are generated;
Ballot computing is carried out to the second classification results of each sub-classifier, the corresponding institute of each characteristic vector is obtained State the first classification results.
3. detection method according to claim 1, it is characterised in that
Feature extraction is carried out to the multiple texts being tested in text set, generating multiple characteristic vectors includes:
According to the keyword dictionary that the keyword quantity pre-established is N, for each text being tested in text set, institute is calculated State the corresponding TF-IDF values of each keyword in keyword dictionary;
The corresponding characteristic vector of each text is generated according to the corresponding TF-IDF values of each described keyword, wherein, the feature The dimension of vector is that the component in N, each dimension of the characteristic vector is the corresponding TF- of each keyword in keyword dictionary IDF values.
4. detection method according to claim 1, it is characterised in that the value of first classification results is 1, -1 and 0, Positive mood, negative emotions and neutral mood are corresponded respectively to,
According to the multiple first classification results, calculating the tested corresponding mood value of text set includes:
According to the weighted value of each text set in advance, summation operation is weighted to the first classification results of each text, To obtain the corresponding mood value of tested text set.
5. detection method according to claim 1, it is characterised in that carried out in multiple texts to being tested in text set special Levy extraction, before generating multiple characteristic vectors, in addition to:
Remove the stop words in the tested text set;
According to default corpus, in tested text set, word of the TF-IDF values higher than predetermined threshold value is extracted as the tested text The keyword of this collection, and according to the whole keywords extracted, generation is tested the corresponding keyword dictionary of text set with this, described Keyword is included in the corpus.
6. a kind of detection means of mood value, it is characterised in that including:
Feature vector generation module, for carrying out feature extraction to the multiple texts being tested in text set, generate multiple features to Amount;
Mood grader, for obtaining the multiple characteristic vector as input, generation based on positive mood, negative emotions, in Multiple first classification results of vertical mood;
Mood value computing module, for according to the multiple first classification results, calculating the corresponding mood value of tested text set.
7. detection means according to claim 6, its characteristic value is,
The mood grader includes:For positive mood and negative emotions are classified the first sub-classifier, for pair The second sub-classifier that positive mood and neutral mood are classified, for neutral mood and negative emotions are classified the Three sub-classifiers,
The multiple characteristic vector is obtained as input, generate based on positive mood, negative emotions, neutral mood multiple first Classification results include:
Each characteristic vector is inputted into each sub-classifier respectively, the second classification results of each sub-classifier are generated;
Ballot computing is carried out to the second classification results of each sub-classifier, the corresponding institute of each characteristic vector is obtained State the first classification results.
8. detection means according to claim 6, it is characterised in that in the feature vector generation module, to tested Multiple texts in text set carry out feature extraction, and generating multiple characteristic vectors includes:
According to the keyword dictionary that the keyword quantity pre-established is N, for each text being tested in text set, institute is calculated State the corresponding TF-IDF values of each keyword in keyword dictionary;
The corresponding characteristic vector of each text is generated according to the corresponding TF-IDF values of each described keyword, wherein, the feature The dimension of vector is that the component in N, each dimension of the characteristic vector is the corresponding TF- of each keyword in keyword dictionary IDF values.
9. detection means according to claim 6, it is characterised in that the value of first classification results is 1, -1 and 0, Positive mood, negative emotions and neutral mood are corresponded respectively to,
In the mood value computing module, according to the multiple first classification results, the corresponding feelings of tested text set are calculated Thread value includes:
According to the weighted value of each text set in advance, summation operation is weighted to the first classification results of each text, To obtain the corresponding mood value of tested text set.
10. detection means according to claim 6, it is characterised in that also include:
Stop words removes module, for removing the stop words in the tested text set;
Keyword extracting module, for according to default corpus, in tested text set, extracts TF-IDF values and is higher than default threshold The word of value is tested the keyword of text set as this, and according to the whole keywords extracted, generation and the tested text set pair The keyword dictionary answered, the keyword is included in the corpus.
11. a kind of electronic equipment, it is characterised in that including:
Memory, for storage program;
Processor, coupled to the memory, for performing described program, for:
Feature extraction is carried out to the multiple texts being tested in text set, multiple characteristic vectors are generated;
The multiple characteristic vector is inputted into mood grader respectively, generated based on positive mood, negative emotions, neutral mood Multiple first classification results;
According to the multiple first classification results, the corresponding mood value of tested text set is calculated.
12. electronic equipment according to claim 11, it is characterised in that
The mood grader includes:For positive mood and negative emotions are classified the first sub-classifier, for pair The second sub-classifier that positive mood and neutral mood are classified, for neutral mood and negative emotions are classified the Three sub-classifiers,
The multiple characteristic vector is inputted into mood grader respectively, generated based on positive mood, negative emotions, neutral mood Multiple first classification results include:
Each characteristic vector is inputted into each sub-classifier respectively, the second classification results of each sub-classifier are generated;
Ballot computing is carried out to the second classification results of each sub-classifier, the corresponding institute of each characteristic vector is obtained State the first classification results.
CN201710081287.9A 2017-02-15 2017-02-15 Detection method, device and the electronic equipment of mood value Pending CN107066442A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710081287.9A CN107066442A (en) 2017-02-15 2017-02-15 Detection method, device and the electronic equipment of mood value

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710081287.9A CN107066442A (en) 2017-02-15 2017-02-15 Detection method, device and the electronic equipment of mood value

Publications (1)

Publication Number Publication Date
CN107066442A true CN107066442A (en) 2017-08-18

Family

ID=59621362

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710081287.9A Pending CN107066442A (en) 2017-02-15 2017-02-15 Detection method, device and the electronic equipment of mood value

Country Status (1)

Country Link
CN (1) CN107066442A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2019096047A1 (en) * 2017-11-16 2019-05-23 阿里巴巴集团控股有限公司 Interface display method, interface data processing method, client and server
CN110245236A (en) * 2019-06-25 2019-09-17 北京向上一心科技有限公司 Information demonstrating method, device and electronic equipment
CN110609996A (en) * 2018-06-15 2019-12-24 阿里巴巴集团控股有限公司 Text emotion recognition method and device and electronic equipment
WO2020119533A1 (en) * 2018-12-14 2020-06-18 深圳壹账通智能科技有限公司 Public sentiment warning method and apparatus based on recurrent neural network algorithm, terminal and medium
TWI712948B (en) * 2018-09-20 2020-12-11 大數據股份有限公司 Method for document sentimental analysis, apparatus and computer program product thereof

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100217595A1 (en) * 2009-02-24 2010-08-26 Korea Institute Of Science And Technology Method For Emotion Recognition Based On Minimum Classification Error
CN102929937A (en) * 2012-09-28 2013-02-13 福州博远无线网络科技有限公司 Text-subject-model-based data processing method for commodity classification
CN103365997A (en) * 2013-07-12 2013-10-23 华东师范大学 Opinion mining method based on ensemble learning
CN103761239A (en) * 2013-12-09 2014-04-30 国家计算机网络与信息安全管理中心 Method for performing emotional tendency classification to microblog by using emoticons
CN104331451A (en) * 2014-10-30 2015-02-04 南京大学 Recommendation level scoring method for theme-based network user comments
CN106202481A (en) * 2016-07-18 2016-12-07 量子云未来(北京)信息科技有限公司 The evaluation methodology of a kind of perception data and system

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100217595A1 (en) * 2009-02-24 2010-08-26 Korea Institute Of Science And Technology Method For Emotion Recognition Based On Minimum Classification Error
CN102929937A (en) * 2012-09-28 2013-02-13 福州博远无线网络科技有限公司 Text-subject-model-based data processing method for commodity classification
CN103365997A (en) * 2013-07-12 2013-10-23 华东师范大学 Opinion mining method based on ensemble learning
CN103761239A (en) * 2013-12-09 2014-04-30 国家计算机网络与信息安全管理中心 Method for performing emotional tendency classification to microblog by using emoticons
CN104331451A (en) * 2014-10-30 2015-02-04 南京大学 Recommendation level scoring method for theme-based network user comments
CN106202481A (en) * 2016-07-18 2016-12-07 量子云未来(北京)信息科技有限公司 The evaluation methodology of a kind of perception data and system

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
杜亚楠 刘业政: "基于修正G2特征筛选的中文微博情感组合分类", 《情报学报》 *

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2019096047A1 (en) * 2017-11-16 2019-05-23 阿里巴巴集团控股有限公司 Interface display method, interface data processing method, client and server
CN110609996A (en) * 2018-06-15 2019-12-24 阿里巴巴集团控股有限公司 Text emotion recognition method and device and electronic equipment
CN110609996B (en) * 2018-06-15 2023-05-02 阿里巴巴集团控股有限公司 Text emotion recognition method and device and electronic equipment
TWI712948B (en) * 2018-09-20 2020-12-11 大數據股份有限公司 Method for document sentimental analysis, apparatus and computer program product thereof
WO2020119533A1 (en) * 2018-12-14 2020-06-18 深圳壹账通智能科技有限公司 Public sentiment warning method and apparatus based on recurrent neural network algorithm, terminal and medium
CN110245236A (en) * 2019-06-25 2019-09-17 北京向上一心科技有限公司 Information demonstrating method, device and electronic equipment
CN110245236B (en) * 2019-06-25 2021-07-20 北京向上一心科技有限公司 Information presentation method and device and electronic equipment

Similar Documents

Publication Publication Date Title
CN109783632B (en) Customer service information pushing method and device, computer equipment and storage medium
CN107066442A (en) Detection method, device and the electronic equipment of mood value
US10387784B2 (en) Technical and semantic signal processing in large, unstructured data fields
Wesslen Computer-assisted text analysis for social science: Topic models and beyond
WO2018184518A1 (en) Microblog data processing method and device, computer device and storage medium
CN112396108A (en) Service data evaluation method, device, equipment and computer readable storage medium
CN112084334B (en) Label classification method and device for corpus, computer equipment and storage medium
CN107807968A (en) Question and answer system, method and storage medium based on Bayesian network
CN108228808A (en) Determine the method, apparatus of focus incident and storage medium and electronic equipment
CN110941953A (en) Automatic identification method and system for network false comments considering interpretability
Kathuria et al. Real time sentiment analysis on twitter data using deep learning (Keras)
US20220156862A1 (en) System and method for analyzing grantability of a legal filing
US11099107B2 (en) Component testing plan considering distinguishable and undistinguishable components
Khemani et al. A review on reddit news headlines with nltk tool
CN113392920A (en) Method, apparatus, device, medium, and program product for generating cheating prediction model
CN113761193A (en) Log classification method and device, computer equipment and storage medium
Richetti et al. Analysis of knowledge-intensive processes focused on the communication perspective
CN111859955A (en) Public opinion data analysis model based on deep learning
Pohl et al. Online processing of social media data for emergency management
KR101592756B1 (en) Method for dialogue management using memory intensity of user knowledge and apparatus for performing the method
CN114462742A (en) Risk prompting method, device, equipment and computer storage medium
CN112069807A (en) Text data theme extraction method and device, computer equipment and storage medium
CN111242519A (en) User characteristic data generation method and device and electronic equipment
Ulyanovska et al. STUDY OF THE PROCESS OF IDENTIFYING THE AUTHORSHIP OF TEXTS WRITTEN IN NATURAL LANGUAGE.
Gupta A Directionality & Novel Function for Stock Price Modelling Prediction Dynamics based on Multiple Learning

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20170818

RJ01 Rejection of invention patent application after publication