CN106126502A - A kind of emotional semantic classification system and method based on support vector machine - Google Patents

A kind of emotional semantic classification system and method based on support vector machine Download PDF

Info

Publication number
CN106126502A
CN106126502A CN201610529672.0A CN201610529672A CN106126502A CN 106126502 A CN106126502 A CN 106126502A CN 201610529672 A CN201610529672 A CN 201610529672A CN 106126502 A CN106126502 A CN 106126502A
Authority
CN
China
Prior art keywords
text
word
feature
support vector
vector machine
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201610529672.0A
Other languages
Chinese (zh)
Other versions
CN106126502B (en
Inventor
王欣
钟吉英
赵亮
谭斌
于成业
郝妙
赵海臣
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sichuan Changhong Electric Co Ltd
Original Assignee
Sichuan Changhong Electric Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sichuan Changhong Electric Co Ltd filed Critical Sichuan Changhong Electric Co Ltd
Priority to CN201610529672.0A priority Critical patent/CN106126502B/en
Publication of CN106126502A publication Critical patent/CN106126502A/en
Application granted granted Critical
Publication of CN106126502B publication Critical patent/CN106126502B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/237Lexical tools
    • G06F40/247Thesauruses; Synonyms
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/34Browsing; Visualisation therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/374Thesaurus
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/284Lexical analysis, e.g. tokenisation or collocates

Abstract

The present invention relates to the analysis of public opinion technology, it discloses a kind of emotional semantic classification system and method based on support vector machine, from user comment information, find public sentiment for quick, accurate.The present invention utilizes reptile module to obtain user and is published in the review information of forum, by data being carried out the pretreatment such as participle, obtain the feature phrase of comment text and there is the training data of typicality, subsequently training data is carried out Emotion tagging, and utilize support vector machine that training data is calculated, obtain disaggregated model, according to disaggregated model, evaluation text to be sorted is analyzed, obtain the affective state estimated, finally utilize visualization model, show classification results, user is helped quickly to understand user feeling based on different entities object (keyword), and and then understand internet public feelings, it is applicable to website, the analysis of public opinion of forum.

Description

A kind of emotional semantic classification system and method based on support vector machine
Technical field
The present invention relates to the analysis of public opinion technology, be specifically related to a kind of emotional semantic classification system based on support vector machine and side Method.
Background technology
Along with the fast development of the Internet, the data on the Internet present explosive growth.According to incompletely statistics, 1 minute In, the upper newly-increased microblogging of Twitter reaches 100,000.And at home, Sina's microblog users number 6.5 hundred million, day any active ues reach 4600 Ten thousand, Tengxun's microblog users number 6.2 hundred million, day any active ues about 100,000,000;Moreover, valuable information in traditional forum website About 1 year about 100,000,000.The hugest any active ues and abundant in content, the comment that emotion is distinct issued thereof behind, The most numerous valuable information.Analysis to these information, can help to find commentator's emotion to special body, example As: microblogging/forum user, for enterprise " front " or the evaluation of " negatively ", for the viewpoint etc. of social group's event, thus is helped Help others grasp spin, problem analysis cause etc..
But, comment text is classified, and finds that the emotion preference of user is a challenging job, example As: certain user A has delivered the model of " conwoman that telecommunications worker is pretended to be in attention ", and user B replys and says that " money of old man is good Deceive." discounting for the scene of text, only sentence itself is carried out emotion differentiation, often obtain inconsistent judged result. To this end, we have developed a kind of sensibility classification method based on support vector machine, for user is published in microblogging, forum Text message is classified, and then analyzes the public sentiment situation for special body.
Summary of the invention
The technical problem to be solved is: propose a kind of emotional semantic classification system based on support vector machine and side Method, finds public sentiment for quick, accurate from user comment information.
The technical solution adopted for the present invention to solve the technical problems is:
A kind of emotional semantic classification system based on support vector machine, comprising:
Data acquisition and pretreatment module, be responsible for utilizing web crawlers to carry out data and crawl, and what acquisition user was delivered comments Opinion information, and review information is carried out pretreatment;
Feature Words and training sample generation module, be responsible for using the comment text through pretreatment as input, choose with The high frequency words of specific part of speech is as Feature Words, and adds feature dictionary;Choose the evaluation text comprising Feature Words as training sample This, and the emotion of training sample is manually marked;
Svm classifier module, is responsible for based on feature dictionary, and training sample extracts characteristic vector, and vector is supported in input Machine generates disaggregated model;Utilize disaggregated model that the to be sorted emotion value evaluating text is calculated, analyze the emotion of text Orientation;
Visualization model, is responsible for representing analysis result in web terminal.
Additionally, present invention also offers a kind of sensibility classification method based on support vector machine, it comprises the following steps:
A, utilize web crawlers to carry out data to crawl, obtain the review information that user is delivered, and review information is carried out Pretreatment;
B, using the comment text through pretreatment as input, choose the high frequency words with specific part of speech as Feature Words, And add feature dictionary;Choose the evaluation text comprising Feature Words as training sample, and the emotion of training sample is carried out people Work marks;
C, based on feature dictionary, to training sample extract characteristic vector, input support vector machine generate disaggregated model; Utilize disaggregated model that the to be sorted emotion value evaluating text is calculated, analyze the orientation of emotion of text;
D, analysis result is represented in web terminal.
As optimizing further, in step A, described utilize web crawlers to carry out data to crawl, obtain what user was delivered Review information, specifically includes:
From the beginning of the website specified, crawling webpage with the pattern of breadth-first, the webpage got for each, to it Page source code resolves, and obtains user comment information in webpage, the review information write into Databasce that will obtain.
As optimizing further, in step A, described review information is carried out pretreatment, specifically includes:
Use Chinese word segmentation tool kit that the evaluation information of user is carried out participle, and mark part of speech.
As optimizing further, in step B, described in choose the high frequency words with specific part of speech as Feature Words, specifically wrap Include: be that noun, verb and adjectival frequent words are as Feature Words based on FindCover algorithm picks part of speech.
As optimizing further, described is noun, verb and adjectival high frequency based on FindCover algorithm picks part of speech Word as Feature Words, method particularly includes:
Determine the input of FindCover algorithm: participle also marks the evaluation text collection U of part of speech, Feature Words number n, spy Levy word length L, part of speech set P;
Determine the output of FindCover algorithm: feature phrase S;
The process of choosing includes:
Step 1, initialization set S, A;
Step 2, calculating mapping relations Map M, be mapped to one group of text id:M comprising this word by each word word (word);
Step 3, when gathering S and not comprising n word, then find word word so that it is satisfied three conditions:
I () part of speech meets the requirement of P;
(ii) length meets the requirement of L;
(iii) current coverage rate coverage=| M (word)-A | is maximum;
If coverage rate coverage=0 of the word that step 4 searches out, then terminate circulation, otherwise, word is added S, adds A by M (word), returns step 3 and continues cycling through, until set S comprises n word or the coverage rate of word searched out Coverage=0;
Step 5, return set S are as feature phrase.
As optimizing further, the value of described n, P, L can be adjusted according to practical situation.
As optimizing further, in step B, described in choose the evaluation text comprising Feature Words as training sample, specifically Including:
The feature phrase S returned according to FindCover algorithm, uses following strategy to choose training sample: first, exports institute There is the evaluation text collection U comprising Feature WordsfIf, | Uf| > 1% | U |, then from UfIn randomly choose 1% | U | individual evaluation text make For training sample;Otherwise export UfAs training sample.
As optimizing further, in step C, described characteristic vector that training sample is extracted, input support vector machine generation Disaggregated model, specifically includes:
First according to Feature Words, the text in sample data is converted to shape such as "<labelling>feature 1: number feature 2: individual Number ... feature n: number " form, according to three way classification, then<labelling>value be positive, negative or neutral;According to two way classification, then<labelling>value is positive and negative;The training data will changed subsequently It is input in LIBSVM storehouse carry out classification based training.
As optimizing further, in step D, analysis result is represented in web terminal, described in the content that represents include: " front ", " negatively ", the ratio of " neutral " of text based on particular keywords, the urtext that emotion is relevant, temporally tie up Degree represents the emotion change of text.
The invention has the beneficial effects as follows: utilize reptile module to obtain user and be published in the review information of forum, pass through logarithm According to carrying out the pretreatment such as participle, obtain the feature phrase of comment text and there is the training data of typicality, subsequently to training Data carry out Emotion tagging, and utilize support vector machine to calculate training data, obtain disaggregated model, according to classification mould Type, is analyzed evaluation text to be sorted, obtains the affective state estimated, finally utilizes visualization model, shows classification As a result, help user quickly to understand user feeling based on different entities object (keyword), and and then understand internet public feelings.
Accompanying drawing explanation
Fig. 1 is present invention emotional semantic classification based on support vector machine system architecture diagram.
Detailed description of the invention
As it is shown in figure 1, as one embodiment of the present of invention, emotional semantic classification system based on support vector machine includes:
Data acquisition and pretreatment module, be responsible for utilizing web crawlers to carry out data and crawl, and what acquisition user was delivered comments Opinion information, and review information is carried out pretreatment;
Feature Words and training sample generation module, be responsible for using the comment text through pretreatment as input, choose with The high frequency words of specific part of speech is as Feature Words, and adds feature dictionary;Choose the evaluation text comprising Feature Words as training sample This, and the emotion of training sample is manually marked;
Svm classifier module, is responsible for based on feature dictionary, and training sample extracts characteristic vector, and vector is supported in input Machine generates disaggregated model;Utilize disaggregated model that the to be sorted emotion value evaluating text is calculated, analyze the emotion of text Orientation;
Visualization model, is responsible for representing analysis result in web terminal.
Below each functional module is implemented and illustrates:
(1) data acquisition and pretreatment module (Data Collection and Preprocessing Module, letter Claim CPM)
The main flow of data acquisition is as follows:
(1) from the beginning of the website (initial website) specified, webpage is crawled with the pattern of breadth-first;
(2) webpage got for each, resolves its page source code, letter relevant in obtaining webpage Breath, such as: user comment information etc.;
(3) data base is write data into.
The main flow of data prediction is the Chinese word segmentation tool kit utilizing the Chinese Academy of Sciences the to research and develop evaluation text to user Carry out participle, and mark part of speech.
(2) Feature Words and training sample generation module (Training Data Generation Module is called for short TGM)
In view of the present invention will use support vector machine (Support Vector Machine, hereinafter referred to as SVM) to comment Text is classified, and therefore extracts one group of representative Feature Words, and chooses high-quality training sample on this basis It is to ensure that the key of classification quality.To this end, we adopt carries out selecting of Feature Words and training sample with the following method.Main step Rapid as follows:
(A) the choosing of Feature Words
TGM uses algorithm FindCover to choose typical Feature Words.Additionally, according to actual observation, TGM chooses part of speech For the word of noun (n), verb (v) and adjective (a) as Feature Words, i.e. the input P of FindCover algorithm be array n, v,a};In this external Practical Calculation, TGM chooses the word of length L > 1 as Feature Words.It is noted that for n, P and L Value, can be adjusted according to actual needs.
Algorithm FindCover
Input: participle mark the evaluation text collection U of part of speech, Feature Words number n, Feature Words length L, part of speech set P
Output: feature phrase
1. initialize set S, A;Here set S is the set for storage feature phrase;Here set A is for evaluating The subset of text collection U, is specifically designed to the text id corresponding to Feature Words word deposited in S.
2. calculate mapping relations Map M, each word word is mapped to one group of text id:M comprising this word (word);
3. when S does not comprises n word, then find word word so that it is satisfied three conditions:
I () part of speech meets the requirement of P;
(ii) length meets the requirement of L;
(iii) current coverage rate coverage=| M (word)-A | is maximum;
4. if coverage rate coverage=0 of the word searched out, then terminate circulation, otherwise, word is added S, by M (word) add A, return step 3 and continue cycling through, until set S comprises n word or the coverage rate of word searched out Coverage=0;
5. return set S as feature phrase.
(B) the choosing of training sample
The feature phrase S, TGM returned according to FindCover uses following strategy to choose training sample: first, exports institute There is the evaluation text collection U comprising Feature Wordsf.If | Uf| > 1% | U |, then from UfIn randomly choose 1% | U | individual evaluation text make For training sample;Otherwise export UfAs training sample.Selected training sample will carry out artificial emotion mark.Actually used mistake Cheng Zhong, can be divided into 2 classes by text according to emotion, it may be assumed that front, negatively;Also three classes, i.e. front it are divided into, neutral, negatively.
(3) svm classifier module (SVM Training Module is called for short STM)
First text in sample data is converted to shape such as according to Feature Words by STM: "<labelling>feature 1: number feature 2: Number ... feature n: number " form, wherein according to three way classification, then<labelling>can with value as positive, Negative or neutral;According to two way classification, then<labelling>can be with value as positive with negative.STM subsequently will The training data changed is input in LIBSVM storehouse carry out classification based training.After obtaining training result, STM applies these to classify Text to be sorted is calculated by rule, analyzes the orientation of emotion of text.
(4) visualization model (Visualization Module is called for short VM)
Analysis result is represented by VM at Web end, and main content viewable includes: (1) text based on particular keywords " front ", " negatively ", the ratio of " neutral ";(2) urtext that emotion is relevant;(3) temporally dimension represents the feelings of text Sense change.

Claims (10)

1. an emotional semantic classification system based on support vector machine, it is characterised in that including:
Data acquisition and pretreatment module, be responsible for utilizing web crawlers to carry out data and crawl, and obtains the comment letter that user is delivered Breath, and review information is carried out pretreatment;
Feature Words and training sample generation module, be responsible for using the comment text through pretreatment as input, choose with specific The high frequency words of part of speech is as Feature Words, and adds feature dictionary;Choose the evaluation text comprising Feature Words as training sample, and The emotion of training sample is manually marked;
Svm classifier module, is responsible for based on feature dictionary, and training sample extracts characteristic vector, and input support vector machine is raw Constituent class model;Utilize disaggregated model that the to be sorted emotion value evaluating text is calculated, analyze the orientation of emotion of text;
Visualization model, is responsible for representing analysis result in web terminal.
2. a sensibility classification method based on support vector machine, it is characterised in that comprise the following steps:
A, utilize web crawlers to carry out data to crawl, obtain the review information that user is delivered, and review information is carried out pre-place Reason;
B, using the comment text through pretreatment as input, choose the high frequency words with specific part of speech as Feature Words, and add Enter feature dictionary;Choose the evaluation text comprising Feature Words as training sample, and the emotion of training sample is manually marked Note;
C, based on feature dictionary, to training sample extract characteristic vector, input support vector machine generate disaggregated model;Utilize The to be sorted emotion value evaluating text is calculated by disaggregated model, analyzes the orientation of emotion of text;
D, analysis result is represented in web terminal.
A kind of sensibility classification method based on support vector machine, it is characterised in that in step A, institute State and utilize web crawlers to carry out data to crawl, obtain the review information that user is delivered, specifically include:
From the beginning of the website specified, crawling webpage with the pattern of breadth-first, the webpage got for each, to its page Source code resolves, and obtains user comment information in webpage, the review information write into Databasce that will obtain.
A kind of sensibility classification method based on support vector machine, it is characterised in that in step A, institute State and review information carried out pretreatment, specifically include:
Use Chinese word segmentation tool kit that the evaluation information of user is carried out participle, and mark part of speech.
A kind of sensibility classification method based on support vector machine, it is characterised in that in step B, institute State and choose the high frequency words with specific part of speech as Feature Words, specifically include:
It is that noun, verb and adjectival frequent words are as Feature Words based on FindCover algorithm picks part of speech.
A kind of sensibility classification method based on support vector machine, it is characterised in that described based on FindCover algorithm picks part of speech be noun, verb and adjectival frequent words as Feature Words, method particularly includes:
Determine the input of FindCover algorithm: participle also marks the evaluation text collection U of part of speech, Feature Words number n, Feature Words Length L, part of speech set P;
Determine the output of FindCover algorithm: feature phrase S;
The process of choosing includes:
Step 1, initialization set S, A;
Step 2, calculating mapping relations Map M, be mapped to one group of text id:M comprising this word by each word word (word);
Step 3, when gathering S and not comprising n word, then find word word so that it is satisfied three conditions:
I () part of speech meets the requirement of P;
(ii) length meets the requirement of L;
(iii) current coverage rate coverage=| M (word)-A | is maximum;
If coverage rate coverage=0 of the word that step 4 searches out, then terminate circulation, otherwise, word is added S, by M (word) add A, return step 3 and continue cycling through, until set S comprises n word or the coverage rate of word searched out Coverage=0;
Step 5, return set S are as feature phrase.
A kind of sensibility classification method based on support vector machine, it is characterised in that described n, P, L Value can be adjusted according to practical situation.
A kind of sensibility classification method based on support vector machine, it is characterised in that in step B, institute State and choose the evaluation text comprising Feature Words as training sample, specifically include:
The feature phrase S returned according to FindCover algorithm, uses following strategy to choose training sample: first, exports all bags Evaluation text collection U containing Feature WordsfIf, | Uf| > 1% | U |, then from UfIn randomly choose 1% | U | individual evaluation text as instruction Practice sample;Otherwise export UfAs training sample.
A kind of sensibility classification method based on support vector machine, it is characterised in that in step C, institute Stating and training sample extracts characteristic vector, input support vector machine generates disaggregated model, specifically includes:
First according to Feature Words the text in sample data is converted to shape as "<labelling>feature 1: number feature 2: number ... Feature n: number " form, according to three way classification, then<labelling>value is positive, negative or neutral;If adopting With two way classification, then<labelling>value is positive and negative;Subsequently the training data changed is input to LIBSVM Storehouse carries out classification based training.
A kind of sensibility classification method based on support vector machine, it is characterised in that in step D, Analysis result is represented in web terminal, described in the content that represents include: " front " of text based on particular keywords, " negative Face ", the ratio of " neutral ", urtext that emotion is relevant, temporally dimension represent the emotion change of text.
CN201610529672.0A 2016-07-07 2016-07-07 A kind of emotional semantic classification system and method based on support vector machines Active CN106126502B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610529672.0A CN106126502B (en) 2016-07-07 2016-07-07 A kind of emotional semantic classification system and method based on support vector machines

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610529672.0A CN106126502B (en) 2016-07-07 2016-07-07 A kind of emotional semantic classification system and method based on support vector machines

Publications (2)

Publication Number Publication Date
CN106126502A true CN106126502A (en) 2016-11-16
CN106126502B CN106126502B (en) 2018-10-30

Family

ID=57283438

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610529672.0A Active CN106126502B (en) 2016-07-07 2016-07-07 A kind of emotional semantic classification system and method based on support vector machines

Country Status (1)

Country Link
CN (1) CN106126502B (en)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106407449A (en) * 2016-09-30 2017-02-15 四川长虹电器股份有限公司 Emotion classification method based on support vector machine
CN106649890A (en) * 2017-02-07 2017-05-10 税云网络科技服务有限公司 Data storage method and device
CN106682192A (en) * 2016-12-29 2017-05-17 北京奇虎科技有限公司 Method and device for training answer intention classification model based on search keywords
CN106776557A (en) * 2016-12-13 2017-05-31 竹间智能科技(上海)有限公司 Affective state memory recognition methods and the device of emotional robot
CN107229684A (en) * 2017-05-11 2017-10-03 合肥美的智能科技有限公司 Statement classification method, system, electronic equipment, refrigerator and storage medium
CN107291902A (en) * 2017-06-23 2017-10-24 中国人民解放军国防科学技术大学 Automatic marking method is checked in a kind of popular contribution based on hybrid classification technology
CN110377727A (en) * 2019-06-06 2019-10-25 深思考人工智能机器人科技(北京)有限公司 A kind of multi-tag file classification method and device based on multi-task learning
CN110689033A (en) * 2018-07-05 2020-01-14 第四范式(北京)技术有限公司 Data acquisition method, device and equipment for model training and storage medium
CN112487266A (en) * 2019-09-12 2021-03-12 北京国双科技有限公司 Emotion labeling method and device, computer equipment and storage medium
WO2021093349A1 (en) * 2019-11-15 2021-05-20 Midea Group Co., Ltd. System, method, and user interface for facilitating product research and development
CN113553422A (en) * 2021-07-16 2021-10-26 山东建筑大学 User preference prediction method and system based on language value convolution rule inference network

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1256931A1 (en) * 2001-05-11 2002-11-13 Sony France S.A. Method and apparatus for voice synthesis and robot apparatus
CN103034626A (en) * 2012-12-26 2013-04-10 上海交通大学 Emotion analyzing system and method
CN103116644A (en) * 2013-02-26 2013-05-22 华南理工大学 Method for mining orientation of Web themes and supporting decisions
CN104731770A (en) * 2015-03-23 2015-06-24 中国科学技术大学苏州研究院 Chinese microblog emotion analysis method based on rules and statistical model
CN104965822A (en) * 2015-07-29 2015-10-07 中南大学 Emotion analysis method for Chinese texts based on computer information processing technology

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1256931A1 (en) * 2001-05-11 2002-11-13 Sony France S.A. Method and apparatus for voice synthesis and robot apparatus
CN103034626A (en) * 2012-12-26 2013-04-10 上海交通大学 Emotion analyzing system and method
CN103116644A (en) * 2013-02-26 2013-05-22 华南理工大学 Method for mining orientation of Web themes and supporting decisions
CN104731770A (en) * 2015-03-23 2015-06-24 中国科学技术大学苏州研究院 Chinese microblog emotion analysis method based on rules and statistical model
CN104965822A (en) * 2015-07-29 2015-10-07 中南大学 Emotion analysis method for Chinese texts based on computer information processing technology

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106407449A (en) * 2016-09-30 2017-02-15 四川长虹电器股份有限公司 Emotion classification method based on support vector machine
CN106776557A (en) * 2016-12-13 2017-05-31 竹间智能科技(上海)有限公司 Affective state memory recognition methods and the device of emotional robot
CN106776557B (en) * 2016-12-13 2020-09-08 竹间智能科技(上海)有限公司 Emotional state memory identification method and device of emotional robot
CN106682192B (en) * 2016-12-29 2020-07-03 北京奇虎科技有限公司 Method and device for training answer intention classification model based on search keywords
CN106682192A (en) * 2016-12-29 2017-05-17 北京奇虎科技有限公司 Method and device for training answer intention classification model based on search keywords
CN106649890A (en) * 2017-02-07 2017-05-10 税云网络科技服务有限公司 Data storage method and device
CN106649890B (en) * 2017-02-07 2020-07-14 税云网络科技服务有限公司 Data storage method and device
CN107229684A (en) * 2017-05-11 2017-10-03 合肥美的智能科技有限公司 Statement classification method, system, electronic equipment, refrigerator and storage medium
CN107229684B (en) * 2017-05-11 2021-05-18 合肥美的智能科技有限公司 Sentence classification method and system, electronic equipment, refrigerator and storage medium
CN107291902B (en) * 2017-06-23 2020-05-08 中国人民解放军国防科学技术大学 Automatic marking method for public contribution review based on mixed classification technology
CN107291902A (en) * 2017-06-23 2017-10-24 中国人民解放军国防科学技术大学 Automatic marking method is checked in a kind of popular contribution based on hybrid classification technology
CN110689033A (en) * 2018-07-05 2020-01-14 第四范式(北京)技术有限公司 Data acquisition method, device and equipment for model training and storage medium
CN110377727A (en) * 2019-06-06 2019-10-25 深思考人工智能机器人科技(北京)有限公司 A kind of multi-tag file classification method and device based on multi-task learning
CN110377727B (en) * 2019-06-06 2022-06-17 深思考人工智能机器人科技(北京)有限公司 Multi-label text classification method and device based on multi-task learning
CN112487266A (en) * 2019-09-12 2021-03-12 北京国双科技有限公司 Emotion labeling method and device, computer equipment and storage medium
WO2021093349A1 (en) * 2019-11-15 2021-05-20 Midea Group Co., Ltd. System, method, and user interface for facilitating product research and development
CN113553422A (en) * 2021-07-16 2021-10-26 山东建筑大学 User preference prediction method and system based on language value convolution rule inference network

Also Published As

Publication number Publication date
CN106126502B (en) 2018-10-30

Similar Documents

Publication Publication Date Title
CN106126502B (en) A kind of emotional semantic classification system and method based on support vector machines
Li et al. Twiner: named entity recognition in targeted twitter stream
CN106294593B (en) In conjunction with the Relation extraction method of subordinate clause grade remote supervisory and semi-supervised integrated study
CN103678564B (en) Internet product research system based on data mining
CN106407236B (en) A kind of emotion tendency detection method towards comment data
CN105528437B (en) A kind of question answering system construction method extracted based on structured text knowledge
CN104881458B (en) A kind of mask method and device of Web page subject
CN101127042A (en) Sensibility classification method based on language model
CN102200975B (en) Vertical search engine system using semantic analysis
CN104268160A (en) Evaluation object extraction method based on domain dictionary and semantic roles
CN106096664A (en) A kind of sentiment analysis method based on social network data
CN103246644B (en) Method and device for processing Internet public opinion information
CN105843796A (en) Microblog emotional tendency analysis method and device
CN110413787B (en) Text clustering method, device, terminal and storage medium
CN104008091A (en) Sentiment value based web text sentiment analysis method
CN108376133A (en) The short text sensibility classification method expanded based on emotion word
CN106202584A (en) A kind of microblog emotional based on standard dictionary and semantic rule analyzes method
CN105183715B (en) A kind of word-based distribution and the comment spam automatic classification method of file characteristics
CN107451118A (en) Sentence-level sensibility classification method based on Weakly supervised deep learning
CN102929861A (en) Method and system for calculating text emotion index
CN103761239A (en) Method for performing emotional tendency classification to microblog by using emoticons
CN108763348A (en) A kind of classification improved method of extension short text word feature vector
CN105512333A (en) Product comment theme searching method based on emotional tendency
CN104899335A (en) Method for performing sentiment classification on network public sentiment of information
CN106407449A (en) Emotion classification method based on support vector machine

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant