CN108563625A - Text analyzing method, apparatus, electronic equipment and computer storage media - Google Patents

Text analyzing method, apparatus, electronic equipment and computer storage media Download PDF

Info

Publication number
CN108563625A
CN108563625A CN201810007810.8A CN201810007810A CN108563625A CN 108563625 A CN108563625 A CN 108563625A CN 201810007810 A CN201810007810 A CN 201810007810A CN 108563625 A CN108563625 A CN 108563625A
Authority
CN
China
Prior art keywords
text
analyzed
emotional expression
attribute
expression
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810007810.8A
Other languages
Chinese (zh)
Inventor
沈文策
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujian Cnfol Information Technology Co Ltd
Original Assignee
Fujian Cnfol Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fujian Cnfol Information Technology Co Ltd filed Critical Fujian Cnfol Information Technology Co Ltd
Priority to CN201810007810.8A priority Critical patent/CN108563625A/en
Publication of CN108563625A publication Critical patent/CN108563625A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Machine Translation (AREA)

Abstract

An embodiment of the present invention provides a kind of text analyzing method, apparatus, electronic equipment and computer storage medias.Wherein, text analysis method includes:Obtain text to be analyzed;Wherein, text to be analyzed includes evaluation object;The attribute of evaluation object is identified from text to be analyzed;Emotional expression corresponding with attribute is extracted from text to be analyzed;According to emotional expression, the Sentiment orientation expressed by text to be analyzed is determined.Compared with the prior art is by way of manual analysis text, the embodiment of the present invention is by taking above-mentioned technical proposal, it solves the technical issues of how improving text analyzing efficiency, risk control capability, the ability of market prediction ability and customer relation management can be obviously improved.Especially to the comment data of the fields such as news, video display, product sentence level, analyze more accurate and effective.

Description

Text analyzing method, apparatus, electronic equipment and computer storage media
Technical field
The present invention relates to technical field of character recognition, more particularly to a kind of text analyzing method, apparatus, electronic equipment and Computer storage media.
Background technology
Currently, in the fields such as consumer goods industry, service trade, health medical treatment industry, financial services, enterprise, tissue Equal hope find the opinion of client or the public to its products & services;Meanwhile consumer also is intended to know before it buys commodity Use the user of the commodity for the evaluation of the commodity.That is, when user is before making decisions, it is desirable to first understand him The viewpoint (also referred to as emotion) of people, so that understand other people by viewpoint is to product and/or the expressed Sentiment orientation out of service Positive or negative, and then can be that decision makes anticipation by Sentiment orientation.
However, under the environment of times of information explosion instantly, the prior art is by manually analyzing text, to obtain Related Sentiment orientation, and then for instructing the decision in actual production, life.
Therefore, there is the defect for causing efficiency low because taking manual type to analyze text in the prior art.
Invention content
The embodiment of the present invention is designed to provide a kind of text analyzing method, apparatus, electronic equipment and computer storage Medium, to improve the efficiency of text analyzing.
To achieve the goals above, in a first aspect, providing following technical scheme:
A kind of text analyzing method, the method includes:
Obtain text to be analyzed;Wherein, the text to be analyzed includes evaluation object;
The attribute of the evaluation object is identified from the text to be analyzed;
Emotional expression corresponding with the attribute is extracted from the text to be analyzed;
According to the emotional expression, the corresponding Sentiment orientation of the text to be analyzed is determined.
Optionally, described the step of identifying the attribute of the evaluation object from the text to be analyzed, including:
The attribute of the evaluation object is identified from the text to be analyzed using support vector machine classifier.
Optionally, described the step of extracting emotional expression corresponding with the attribute from the text to be analyzed, packet It includes:
Use condition random field models extract emotional expression corresponding with the attribute from the text to be analyzed.
Optionally, described according to the emotional expression, the step of determining the text to be analyzed corresponding Sentiment orientation, packet It includes:
Judge that the emotional expression is positive emotion expression or negative emotion expression;
If the emotional expression is positive emotion expression, it is determined that the corresponding Sentiment orientation of the text to be analyzed is just To Sentiment orientation;
If the emotional expression is negative emotion expression, it is determined that the corresponding Sentiment orientation of the text to be analyzed is negative To Sentiment orientation.
To achieve the goals above, second aspect additionally provides following technical scheme:
A kind of text analyzing device, described device include:
Acquisition module, for obtaining text to be analyzed;Wherein, the text to be analyzed includes evaluation object;
Identification module, the attribute for identifying the evaluation object from the text to be analyzed;
Extraction module, for extracting emotional expression corresponding with the attribute from the text to be analyzed;
Determining module, for according to the emotional expression, determining the corresponding Sentiment orientation of the text to be analyzed.
Optionally, the identification module is specifically used for:
The attribute of the evaluation object is identified from the text to be analyzed using support vector machine classifier.
Optionally, the extraction module is specifically used for:
Use condition random field models extract emotional expression corresponding with the attribute from the text to be analyzed.
Optionally, the determining module specifically includes:
Judging submodule, for judging that the emotional expression is positive emotion expression or negative emotion expression;
First determination sub-module, if expressed for positive emotion for the emotional expression, it is determined that the text to be analyzed This corresponding Sentiment orientation is positive Sentiment orientation;
Second determination sub-module, if expressed for negative emotion for the emotional expression, it is determined that the text to be analyzed This corresponding Sentiment orientation is negative sense Sentiment orientation.
To achieve the goals above, the third aspect additionally provides following technical scheme:
A kind of electronic equipment, including processor, communication interface, memory and communication bus, wherein the processor, institute It states communication interface and the memory completes mutual communication by the communication bus;
The memory, for storing computer program;
The processor when for executing the program stored on memory, realizes the method and step described in first aspect.
To achieve the goals above, fourth aspect additionally provides following technical scheme:
A kind of computer readable storage medium is stored with computer program in the computer readable storage medium, described The method and step described in first aspect is realized when computer program is executed by processor.
A kind of text analyzing method, apparatus of offer of the embodiment of the present invention, electronic equipment and computer storage media.Wherein, Text analysis method includes:Obtain text to be analyzed;Wherein, text to be analyzed includes evaluation object;From text to be analyzed Identify the attribute of evaluation object;Emotional expression corresponding with attribute is extracted from text to be analyzed;According to emotional expression, really Sentiment orientation expressed by fixed text to be analyzed.Compared with the prior art is by way of manual analysis text, the present invention is implemented Example passes through the corresponding emotion table of the attribute for extracting with being identified from text to be analyzed by taking above-mentioned technical proposal It reaches;Then, judge that the emotional expression gone out expressed by text to be analyzed is inclined to by the emotional expression, to improve text point The efficiency of analysis.
Other features and advantages of the present invention will be illustrated in the following description, also, partly becomes from specification It obtains it is clear that being emerged from by implementing the present invention.The purpose of the present invention and other advantages can by specification, Specifically noted structure is realized and is obtained in claims and attached drawing.
Certainly, implement any of the products of the present invention or method it is not absolutely required at the same reach all the above excellent Point.
Description of the drawings
In order to more clearly explain the embodiment of the invention or the technical proposal in the existing technology, to embodiment or will show below There is attached drawing needed in technology description to be briefly described, it should be apparent that, the accompanying drawings in the following description is only this Some embodiments of invention for those of ordinary skill in the art without creative efforts, can be with Obtain other attached drawings according to these attached drawings.
Fig. 1 is the flow diagram of the text analyzing method of the embodiment of the present invention;
Fig. 2 be the embodiment of the present invention according to emotional expression, determine the tool of the corresponding Sentiment orientation step of text to be analyzed Body flow diagram;
Fig. 3 is the structural schematic diagram of the text analyzing device of the embodiment of the present invention;
Fig. 4 is the structural schematic diagram of the determining module of the embodiment of the present invention;
Fig. 5 is the structural schematic diagram of the electronic equipment of the embodiment of the present invention.
Specific implementation mode
Following will be combined with the drawings in the embodiments of the present invention, and technical solution in the embodiment of the present invention carries out clear, complete Site preparation describes, it is clear that described embodiments are only a part of the embodiments of the present invention, instead of all the embodiments.It is based on Embodiment in the present invention, it is obtained by those of ordinary skill in the art without making creative efforts every other Embodiment shall fall within the protection scope of the present invention.
Emotion (also referred to as viewpoint) analysis is a subdivision research field of text mining.Specifically, sentiment analysis belongs to letter The crossing research field of breath retrieval, natural language processing and artificial intelligence.Sentiment analysis is with natural language processing, text analyzing Extracted automatically with related computer technology or classifying text in emotion.The main study subject of sentiment analysis is internet Zhong Hai The texts such as the existing comment on commodity of amount, microblogging, blog and forum postings.By analyzing these texts, enterprise can be by a large amount of Product review obtains the data of consumer feedback in real time and rapidly, to promote product competitiveness, to more fully understand the network user Behavior provides powerful decision support for enterprise and tissue.From the point of view of the granularity level of analysis, sentiment analysis can be divided into coarseness With fine granularity step analysis, it is entire text that coarseness step analysis, which has chapter grade and Sentence-level sentiment analysis, hypotheses, It include only a kind of emotion with sentence.Such hypotheses make the analysis and identification of the two levels not go out in text and sentence not Same emotional expression object and its corresponding emotion.Therefore, there are the low defects of efficiency for existing coarseness analytical technology.
Requirement with organizations and individuals to sentiment analysis is higher and higher, and the granularity level of sentiment analysis is just gradually to particulate Spend analysis level development.Fine granularity level sentiment analysis is then the analysis based on evaluation object and its attribute.Based on this, for existing There are defect existing for technology, the embodiment of the present invention to provide a kind of text analyzing method, as shown in Figure 1, this method includes following step Rapid S100 to step S130.Wherein:
S100:Obtain text to be analyzed;Wherein, text to be analyzed includes evaluation object.
Wherein, text to be analyzed is the text for expressing viewpoint.Such as:Some or certain objects are seen in the expression such as comment on commodity The text of method, the text etc. of expression own self emotion (for example, happiness, anger, grief and joy).
Wherein, evaluation object refers to the evaluation object of viewpoint, that is to say a part for the entity, the entity that viewpoint evaluated Or some or certain attributes of the entity.Wherein, entity can be a product, service, theme, individual, tissue, proposition or Event etc..In practical applications, e (T, W) can be used to describe an entity.Wherein, T indicates a hierarchical relationship, the level Relationship includes component, sub-component etc.;Each component or sub-component have the attribute of their own.W indicates an attribute set of e.
For example, by taking " my disagreeable cloudy day " this text as an example, evaluation object is " cloudy day ".For another example, with 6 hand of millet For machine, which is an entity.6 mobile phone of millet includes attribute set, including:Price, size, weight, screen The attributes such as curtain, memory, battery.Wherein, battery includes the attributes such as cruising ability, weight, service life.
S110:The attribute of evaluation object is identified from text to be analyzed.
This step can utilize the method based on supervised learning to identify the attribute of evaluation object from text to be analyzed.
In some embodiments, this step specifically includes:It is identified from text to be analyzed using support vector machine classifier Go out the attribute of evaluation object.
Wherein, sample space is mapped to a higher-dimension by support vector machines by a Nonlinear Mapping (i.e. kernel function), Even infinite dimensional feature space, by the problem of Nonlinear separability in sample space be converted into higher dimensional space it is linear can The problem of dividing.Support vector machine classifier can be trained to obtain by training data.In the training process, training corpus is first built Library;Then, attribute word and the non-attribute word in training corpus are marked manually;Then, word position, semanteme, word is interdependent Relationship, morphology etc. are used as feature;A sentiment dictionary is built further according to training data;Then, it is based on the sentiment dictionary, is trained Predefined evaluation object attribute.Finally, accuracy rate, recall rate and F1 (estimating) are calculated by training data, and obtained accordingly To best support vector machine classifier.
The present embodiment can utilize the word segmentation result and emotion that support vector machine classifier will go out from text dividing to be analyzed Dictionary is matched, if successful match, it is determined that the word segmentation result is the attribute of evaluation object.
Certainly, there are many noises in text to be analyzed sometimes, or the considerations of for convenient for handling;To evaluation object Attribute be identified before, text to be analyzed can also be pre-processed.Pretreatment includes formalization processing, at participle Reason, denoising, best match.
Wherein, formalization processing refers to indicating text with vector space model.The embodiment of the present invention is analysed to text Originally it is expressed as space vector, in order to carry out sentiment analysis.
For example, for text D=D { t to be analyzed1,w1;t2,w2;......wi, wherein tiIndicate ith feature , i=1,2 ... n, characteristic item are, for example, word, word, phrase etc.;wiIndicate characteristic item tiWeights, n indicate characteristic item it is total Number.Wherein, weights can pass through the acquisitions such as word frequency method, information gain method, expectation Cross-Entropy Method.Thus, it is possible to be analysed to text It is expressed as vector:(w1,w2,...wn)。
Wherein, word segmentation processing refers to by continuous Chinese character sequence, and according to scheduled rule, cutting is word or phrase.In reality In the application of border, it can consider to carry out using Uni-Gram, two-dimensional grammar model or ternary syntactic model according to actual conditions Word segmentation processing.
It illustrates below and word segmentation processing is illustrated:
With " I likes playing wechat!" for this text;It is after participle:I, like, play, wechat,!.
Wherein, denoising refers to removing irrelevant information (for example,@lllu111, html) in text to be analyzed, not having point Class meaning function word (for example, eh, I etc.), to reduce processing complexity.
Wherein, best match is in order to ensure the attribute identified can correctly reflect positive negative sense Sentiment orientation.With " free and unfettered Outside distant method " for a word, if what is extracted is " carefree " word, positive emotion expression can be erroneously interpreted as, and " carefree method It is in itself a negative emotion expression outside ".So the matched method of longest may be used in the embodiment of the present invention.
In addition, for commenting on class text, due to it with timeliness n, short text, the spies such as irregularly express, contain much information Point needs to consider that these factors are pre-processed in practical applications.
S120:Emotional expression corresponding with attribute is extracted from text to be analyzed.
Wherein, emotional expression is embodied by emotion word.For example, for " the screen size reasonable design of iPhone, color Also gorgeous ", it is therein " reasonable ", " gorgeous " for emotional expression.In order to which the word being analysed in text all remains, this Full vocabulary method may be used in embodiment, all regard all words as emotional expression.
Wherein, the progress of supervised learning method may be used in extracting method.
In some embodiments, step S120 is specifically included:Use condition random field models are extracted from text to be analyzed Go out emotional expression corresponding with attribute.
Wherein, conditional random field models are a kind of undirected sequence moulds, pass through different label judgement evaluation objects Sentiment orientation, i.e., the corresponding Sentiment orientation of text to be analyzed.Wherein, it when conditional random field models carry out text mark, uses Emotional expression label is:Positive emotional expression starts label (PB), the positive subsequent label of emotional expression (PI), negative sense emotional expression Start label (NB), the subsequent label of negative sense emotional expression (NI), other (O).
For example, in the case of for given observation sequence x, the condition distribution p (y | x) of hidden sequences y is modeled. That is being based on training data, observation sequence x is identified by selecting the maximized hidden sequences ies of p (y | x).Therefore, this mistake Journey can be considered sequence label problem to solve.Table one schematically illustrates emotional expression annotation results.
Table one:
In Table 1, serial number indicates position of each word in sentence;Observation sequence indicates the defeated of conditional random field models Enter;Annotation results indicate the output of conditional random field models.
Specifically, the present embodiment may comprise steps of:Obtain training corpus and extraction emotional expression.
Wherein, language material is divided into the language material marked and the language material not marked.For example, the language material marked is:To businessman's Comment, the comment etc. to product, these language materials can determine the Sentiment orientation of client by star or scoring.The language material not marked is such as The comment etc. of news, these language materials are then needing disaggregated model before or are manually being labeled, and manually to the positive and negative of language material Tendency.
When extracting emotional expression, by the conditional random field models after training to the attribute of evaluation object in text to be analyzed Corresponding emotional expression is labeled, then, according to annotation results, you can obtain emotional expression.Wherein, training condition is random Field model can be based on training corpus, training obtains by building sentiment dictionary.
The extraction performance of conditional random field models can be evaluated by accuracy rate, recall rate and F values (estimating).Its In:
Wherein, P indicates that accuracy rate, R indicate recall rate;F representing measures.
Certainly, extracted from text to be analyzed in use condition random field models emotional expression corresponding with attribute it Before, it can also be pre-processed, which can refer to aforementioned preprocessing process, and details are not described herein.
It, can be with as it can be seen that extract emotional expression corresponding with attribute from text to be analyzed by conditional random field models Excavated from a large amount of text consumer to each attributive character of certain product hold positive attitude still hold reverse side attitude and To the Sentiment orientation of entire product.
S130:According to emotional expression, the corresponding Sentiment orientation of text to be analyzed is determined.
Specifically, this step determines the corresponding Sentiment orientation of text to be analyzed according to emotion word.
As an example, the embodiment of the present invention can determine Sentiment orientation by way of scoring.With comment on commodity data " product Matter is pretty good.It is exactly that delivery is too slow, is received with more than ten talentes.Quality is pretty good, has game card to send " for, commentator thinks this The advantages of product is " quality is pretty good ", the disadvantage is that " delivery is too slow ", overall evaluation is " quality is pretty good, has game card to send ", and And play the satisfaction of " 5.0 scores " to indicate commentator to the product.The present embodiment can indicate negative sense Sentiment orientation to comment 1~2 point, 3~5 points indicate positive Sentiment orientation.
In some embodiments, as shown in Fig. 2, step S130 specifically comprises the following steps S131 to step S133.
S131:Judge that emotional expression is positive emotion expression or negative emotion expression;
For example, " this chair quality is fine " expresses for positive emotion;" my disagreeable cloudy day " is that negative emotion is expressed.
S132:If emotional expression is positive emotion expression, it is determined that the corresponding Sentiment orientation of text to be analyzed is forward direction Sentiment orientation.
With "@lllu111:Hello bicycle is really praised very much!Can freely ride 135 days, recommend you use Oh~" for, such as Fruit emotional expression is " praising ", " recommendations ", it is determined that the corresponding Sentiment orientation of text to be analyzed is forward direction Sentiment orientation.
S133:If emotional expression is negative emotion expression, it is determined that the corresponding Sentiment orientation of text to be analyzed is negative sense Sentiment orientation.
For example, by taking " I disagreeable cloudy day " as an example, emotional expression is " disagreeable ", it is determined that the corresponding emotion of text to be analyzed is inclined To for negative sense Sentiment orientation.
Based on the above-mentioned technical proposal, the embodiment of the present invention can be by excavating the text rich in emotional semantic information The knowledge that magnanimity can be extracted improves text analyzing efficiency, can accurately grasp user's public opinion trend, can be obviously improved risk The ability of management and control ability, market prediction ability and customer relation management.Especially to the fields such as news, video display, product Sentence-level Other comment data is analyzed more accurate and effective.
It should be noted that the step in above method embodiment can execute parallel, can also sequentially it execute, the present invention This is not construed as limiting.And above method embodiment can either hardware be also or the combination of software and hardware gives by software It realizes.
The embodiment of the present invention also provides a kind of text analyzing device, and text analytical equipment can be used for executing above-mentioned text point Analyse embodiment of the method.As shown in figure 3, the device includes:
Acquisition module 31, for obtaining text to be analyzed;Wherein, text to be analyzed includes evaluation object;
Identification module 32, the attribute for identifying evaluation object from text to be analyzed;
Extraction module 33, for extracting emotional expression corresponding with attribute from text to be analyzed;
Determining module 34, for according to emotional expression, determining the corresponding Sentiment orientation of text to be analyzed.
The embodiment of the present invention is extracted by taking above-mentioned technical proposal, by extraction module 33 from text to be analyzed: Emotional expression corresponding with the attribute identified by identification module 32;Then, text to be analyzed is judged by determining module 34 The emotional expression tendency gone out expressed by this, to improve the efficiency of text analyzing.
In some embodiments, above-mentioned identification module 32 is specifically used for:
The attribute of evaluation object is identified from text to be analyzed using support vector machine classifier.
In some embodiments, said extracted module 33 is specifically used for:
Use condition random field models extract emotional expression corresponding with attribute from text to be analyzed.
In some embodiments, as shown in figure 4, determining module 34 specifically includes:
Judging submodule 41, for judging that emotional expression is positive emotion expression or negative emotion expression;
First determination sub-module 42, if expressed for positive emotion for emotional expression, it is determined that text to be analyzed corresponds to Sentiment orientation be positive Sentiment orientation;
Second determination sub-module 43, if expressed for negative emotion for emotional expression, it is determined that text to be analyzed corresponds to Sentiment orientation be negative sense Sentiment orientation.
Explanation in relation to above-mentioned text analyzing device can refer to the related description in aforementioned texts analysis method embodiment, Details are not described herein.
Based on technical concept identical with embodiment of the method, the embodiment of the present invention additionally provides a kind of electronic equipment, such as Fig. 5 It is shown, including processor 51, communication interface 52, memory 53 and communication bus 54, wherein processor 51, communication interface 52 are deposited Reservoir 53 completes mutual communication by communication bus 54,
Memory 53, for storing computer program;
Processor 51 when for executing the program stored on memory 53, realizes above-mentioned text analyzing embodiment of the method Described in method and step.
The communication bus 54 that above-mentioned electronic equipment is mentioned can be Peripheral Component Interconnect standard (Peripheral Component Interconnect, PCI) bus or expanding the industrial standard structure (Extended Industry Standard Architecture, EISA) bus etc..The communication bus can be divided into address bus, data/address bus, controlling bus etc..For just It is only indicated with a thick line in expression, Fig. 5, it is not intended that an only bus or a type of bus.
Above-mentioned communication interface 52 is for the communication between above-mentioned electronic equipment and other equipment.
Above-mentioned memory 53 may include random access memory (Random Access Memory, RAM), can also wrap Include nonvolatile memory (non-volatile memory, NVM), for example, at least a magnetic disk storage.Optionally, it stores Device can also be at least one storage device for being located remotely from aforementioned processor.
Above-mentioned processor 51 can be general processor, including central processing unit (Central Processing Unit, abbreviation CPU), network processing unit (Ne twork Processor, NP) etc.;It can also be digital signal processor (Digital Signal Processing, DSP), application-specific integrated circuit (Application Specific Integrated Circuit, ASIC), field programmable gate array (Field-Programmable Gate Array, FPGA) or other can Programmed logic device, discrete gate or transistor logic, discrete hardware components.
Above-mentioned electronic equipment includes but not limited to smart mobile phone, computer, personal digital assistant, wearable device etc..
In the present embodiment, the program stored on memory 53 is executed by processor 51, by from text to be analyzed Extract emotional expression corresponding with the attribute identified;Then, text institute to be analyzed table is judged by the emotional expression The emotional expression tendency is reached, to improve the efficiency of text analyzing.
Based on technical concept identical with embodiment of the method, the embodiment of the present invention additionally provides a kind of computer-readable storage Medium.It is stored with computer program in the computer readable storage medium, is realized when computer program is executed by processor above-mentioned Method and step described in text analyzing embodiment of the method.
It will be understood by those skilled in the art that above computer program may include some instructions, so that computing device (for example, personal computer, server etc.) executes the method and step described in any of the above-described text analyzing embodiment of the method.
Above computer readable storage medium storing program for executing can include but is not limited to random access memory (RAM), dynamic random is deposited Access to memory (DRAM), static RAM (SRAM), read-only memory (ROM), programmable read only memory (PROM), Erarable Programmable Read only Memory (EPROM), electrically erasable programmable read-only memory (EEPROM), flash memory (example Such as, NOR type flash memory or NAND-type flash memory), Content Addressable Memory (CAM), polymer memory is (for example, ferroelectric polymers Memory), phase transition storage, ovonic memory, silicon-oxide-nitride silicon-silica-silicon (Silicon- Oxide-Nitride-Oxide-Silicon, SONOS) memory, magnetic card or light-card, also or any other appropriate type Computer readable storage medium.
In the present embodiment, when computer program is executed by processor, by extracting and being identified from text to be analyzed The corresponding emotional expression of attribute gone out;Then, the emotional expression gone out expressed by text to be analyzed is judged by the emotional expression Tendency, to improve the efficiency of text analyzing.
It should be noted that herein, relational terms such as first and second and the like are used merely to a reality Body or operation are distinguished with another entity or operation, are deposited without necessarily requiring or implying between these entities or operation In any actual relationship or order or sequence.Moreover, the terms "include", "comprise" or its any other variant are intended to Non-exclusive inclusion, so that the process, method, article or equipment including a series of elements is not only wanted including those Element, but also include other elements that are not explicitly listed, or further include for this process, method, article or equipment Intrinsic element.In the absence of more restrictions, the element limited by sentence "including a ...", it is not excluded that There is also other identical elements in process, method, article or equipment including the element.
Each embodiment in this specification is all made of relevant mode and describes, the highlights of each of the examples are with The difference of other embodiment, the same or similar parts between the embodiments can be referred to each other.Especially for text point For analysis apparatus embodiment, since it is substantially similar to the method embodiment, so description is fairly simple, related place is referring to side The part of method embodiment illustrates.
The foregoing is merely illustrative of the preferred embodiments of the present invention, is not intended to limit the scope of the present invention.It is all Any modification, equivalent replacement, improvement and so within the spirit and principles in the present invention, are all contained in protection scope of the present invention It is interior.

Claims (10)

1. a kind of text analyzing method, which is characterized in that the method includes:
Obtain text to be analyzed;Wherein, the text to be analyzed includes evaluation object;
The attribute of the evaluation object is identified from the text to be analyzed;
Emotional expression corresponding with the attribute is extracted from the text to be analyzed;
According to the emotional expression, the corresponding Sentiment orientation of the text to be analyzed is determined.
2. according to the method described in claim 1, it is characterized in that, described identify the evaluation from the text to be analyzed The step of attribute of object, including:
The attribute of the evaluation object is identified from the text to be analyzed using support vector machine classifier.
3. according to the method described in claim 1, it is characterized in that, described extract and the category from the text to be analyzed The step of property corresponding emotional expression, including:
Use condition random field models extract emotional expression corresponding with the attribute from the text to be analyzed.
4. according to the method described in claim 1, it is characterized in that, described according to the emotional expression, determine described to be analyzed The step of text corresponding Sentiment orientation, including:
Judge that the emotional expression is positive emotion expression or negative emotion expression;
If the emotional expression is positive emotion expression, it is determined that the corresponding Sentiment orientation of the text to be analyzed is positive feelings Sense tendency;
If the emotional expression is negative emotion expression, it is determined that the corresponding Sentiment orientation of the text to be analyzed is negative sense feelings Sense tendency.
5. a kind of text analyzing device, which is characterized in that described device includes:
Acquisition module, for obtaining text to be analyzed;Wherein, the text to be analyzed includes evaluation object;
Identification module, the attribute for identifying the evaluation object from the text to be analyzed;
Extraction module, for extracting emotional expression corresponding with the attribute from the text to be analyzed;
Determining module, for according to the emotional expression, determining the corresponding Sentiment orientation of the text to be analyzed.
6. device according to claim 5, which is characterized in that the identification module is specifically used for:
The attribute of the evaluation object is identified from the text to be analyzed using support vector machine classifier.
7. device according to claim 5, which is characterized in that the extraction module is specifically used for:
Use condition random field models extract emotional expression corresponding with the attribute from the text to be analyzed.
8. device according to claim 5, which is characterized in that the determining module specifically includes:
Judging submodule, for judging that the emotional expression is positive emotion expression or negative emotion expression;
First determination sub-module, if expressed for positive emotion for the emotional expression, it is determined that the text pair to be analyzed The Sentiment orientation answered is positive Sentiment orientation;
Second determination sub-module, if expressed for negative emotion for the emotional expression, it is determined that the text pair to be analyzed The Sentiment orientation answered is negative sense Sentiment orientation.
9. a kind of electronic equipment, which is characterized in that including processor, communication interface, memory and communication bus, wherein described Processor, the communication interface and the memory complete mutual communication by the communication bus;
The memory, for storing computer program;
The processor when for executing the program stored on memory, realizes any method steps of claim 1-4 Suddenly.
10. a kind of computer readable storage medium, which is characterized in that be stored with computer in the computer readable storage medium Program realizes claim 1-4 any method and steps when the computer program is executed by processor.
CN201810007810.8A 2018-01-04 2018-01-04 Text analyzing method, apparatus, electronic equipment and computer storage media Pending CN108563625A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810007810.8A CN108563625A (en) 2018-01-04 2018-01-04 Text analyzing method, apparatus, electronic equipment and computer storage media

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810007810.8A CN108563625A (en) 2018-01-04 2018-01-04 Text analyzing method, apparatus, electronic equipment and computer storage media

Publications (1)

Publication Number Publication Date
CN108563625A true CN108563625A (en) 2018-09-21

Family

ID=63529715

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810007810.8A Pending CN108563625A (en) 2018-01-04 2018-01-04 Text analyzing method, apparatus, electronic equipment and computer storage media

Country Status (1)

Country Link
CN (1) CN108563625A (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110175238A (en) * 2019-05-31 2019-08-27 杭州网易再顾科技有限公司 Public sentiment classification method, medium, device and calculating equipment
CN110187760A (en) * 2019-05-14 2019-08-30 北京百度网讯科技有限公司 Intelligent interactive method and device
CN110309506A (en) * 2019-05-28 2019-10-08 北京三快在线科技有限公司 Statement analytical method, device, electronic equipment and readable storage medium storing program for executing
CN112069311A (en) * 2020-08-04 2020-12-11 北京声智科技有限公司 Text extraction method, device, equipment and medium
CN112256826A (en) * 2020-10-19 2021-01-22 网易(杭州)网络有限公司 Emotion analysis method, evaluation method and emotion analysis model training method and device
CN112308387A (en) * 2020-10-20 2021-02-02 深圳思为科技有限公司 Client intention degree evaluation method and device and cloud server
CN113420122A (en) * 2021-06-24 2021-09-21 平安科技(深圳)有限公司 Method, device and equipment for analyzing text and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103049435A (en) * 2013-01-04 2013-04-17 浙江工商大学 Text fine granularity sentiment analysis method and text fine granularity sentiment analysis device
CN103646088A (en) * 2013-12-13 2014-03-19 合肥工业大学 Product comment fine-grained emotional element extraction method based on CRFs and SVM
CN104915443A (en) * 2015-06-29 2015-09-16 北京信息科技大学 Extraction method of Chinese Microblog evaluation object
CN106844632A (en) * 2017-01-20 2017-06-13 清华大学 Based on the product review sensibility classification method and device that improve SVMs

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103049435A (en) * 2013-01-04 2013-04-17 浙江工商大学 Text fine granularity sentiment analysis method and text fine granularity sentiment analysis device
CN103646088A (en) * 2013-12-13 2014-03-19 合肥工业大学 Product comment fine-grained emotional element extraction method based on CRFs and SVM
CN104915443A (en) * 2015-06-29 2015-09-16 北京信息科技大学 Extraction method of Chinese Microblog evaluation object
CN106844632A (en) * 2017-01-20 2017-06-13 清华大学 Based on the product review sensibility classification method and device that improve SVMs

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
施寒潇: "细粒度情感分析研究", 《中国博士学位论文全文数据库信息科技辑》 *

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110187760A (en) * 2019-05-14 2019-08-30 北京百度网讯科技有限公司 Intelligent interactive method and device
CN110309506A (en) * 2019-05-28 2019-10-08 北京三快在线科技有限公司 Statement analytical method, device, electronic equipment and readable storage medium storing program for executing
CN110309506B (en) * 2019-05-28 2021-04-27 北京三快在线科技有限公司 Statement analysis method and device, electronic equipment and readable storage medium
CN110175238A (en) * 2019-05-31 2019-08-27 杭州网易再顾科技有限公司 Public sentiment classification method, medium, device and calculating equipment
CN112069311A (en) * 2020-08-04 2020-12-11 北京声智科技有限公司 Text extraction method, device, equipment and medium
CN112256826A (en) * 2020-10-19 2021-01-22 网易(杭州)网络有限公司 Emotion analysis method, evaluation method and emotion analysis model training method and device
CN112308387A (en) * 2020-10-20 2021-02-02 深圳思为科技有限公司 Client intention degree evaluation method and device and cloud server
CN112308387B (en) * 2020-10-20 2024-05-14 深圳思为科技有限公司 Customer intention evaluation method and device and cloud server
CN113420122A (en) * 2021-06-24 2021-09-21 平安科技(深圳)有限公司 Method, device and equipment for analyzing text and storage medium

Similar Documents

Publication Publication Date Title
US11620455B2 (en) Intelligently summarizing and presenting textual responses with machine learning
CN108563625A (en) Text analyzing method, apparatus, electronic equipment and computer storage media
CN108804512B (en) Text classification model generation device and method and computer readable storage medium
CN112632385B (en) Course recommendation method, course recommendation device, computer equipment and medium
CN111260437B (en) Product recommendation method based on commodity-aspect-level emotion mining and fuzzy decision
CN107992531A (en) News personalization intelligent recommendation method and system based on deep learning
CN107391760A (en) User interest recognition methods, device and computer-readable recording medium
CN110825956A (en) Information flow recommendation method and device, computer equipment and storage medium
CN109933782B (en) User emotion prediction method and device
CN112347367A (en) Information service providing method, information service providing device, electronic equipment and storage medium
Aralikatte et al. Fault in your stars: an analysis of android app reviews
Alamsyah et al. Measuring Public Sentiment Towards Services Level in Online Forum Using Naive Bayes Classifier and Word Cloud
CN117216275A (en) Text processing method, device, equipment and storage medium
CN111950265A (en) Domain lexicon construction method and device
CN110647504B (en) Method and device for searching judicial documents
CN112487132A (en) Keyword determination method and related equipment
CN114780724A (en) Case classification method and device, computer equipment and storage medium
CN110717008B (en) Search result ordering method and related device based on semantic recognition
CN113837836A (en) Model recommendation method, device, equipment and storage medium
CN113392218A (en) Training method of text quality evaluation model and method for determining text quality
CN109241993B (en) Evaluation object emotion classification method and device integrating user and overall evaluation information
Arora et al. Evaluation Of Product Reviews Using Deep Learning Classifier Models
CN112732908A (en) Test question novelty evaluation method and device, electronic equipment and storage medium
CN112287215A (en) Intelligent employment recommendation method and device
CN113536802A (en) Method, device, equipment and storage medium for judging emotion of text data in languages

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20180921