CN101710333A - Network text segmenting method based on genetic algorithm - Google Patents
Network text segmenting method based on genetic algorithm Download PDFInfo
- Publication number
- CN101710333A CN101710333A CN200910219163A CN200910219163A CN101710333A CN 101710333 A CN101710333 A CN 101710333A CN 200910219163 A CN200910219163 A CN 200910219163A CN 200910219163 A CN200910219163 A CN 200910219163A CN 101710333 A CN101710333 A CN 101710333A
- Authority
- CN
- China
- Prior art keywords
- text
- population
- vocabulary
- individuality
- corpus
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Machine Translation (AREA)
Abstract
Description
Claims (1)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2009102191638A CN101710333B (en) | 2009-11-26 | 2009-11-26 | Network text segmenting method based on genetic algorithm |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2009102191638A CN101710333B (en) | 2009-11-26 | 2009-11-26 | Network text segmenting method based on genetic algorithm |
Publications (2)
Publication Number | Publication Date |
---|---|
CN101710333A true CN101710333A (en) | 2010-05-19 |
CN101710333B CN101710333B (en) | 2012-07-04 |
Family
ID=42403123
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2009102191638A Active CN101710333B (en) | 2009-11-26 | 2009-11-26 | Network text segmenting method based on genetic algorithm |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN101710333B (en) |
Cited By (37)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101968798A (en) * | 2010-09-10 | 2011-02-09 | 中国科学技术大学 | Community recommendation method based on on-line soft constraint LDA algorithm |
CN102024065A (en) * | 2011-01-18 | 2011-04-20 | 中南大学 | SIMD optimization-based webpage duplication elimination and concurrency method |
CN102439597A (en) * | 2011-07-13 | 2012-05-02 | 华为技术有限公司 | Parameter deducing method, computing device and system based on potential dirichlet model |
CN102609407A (en) * | 2012-02-16 | 2012-07-25 | 复旦大学 | Fine-grained semantic detection method of harmful text contents in network |
CN102855312A (en) * | 2012-08-24 | 2013-01-02 | 武汉大学 | Domain-and-theme-oriented Web service clustering method |
CN102929937A (en) * | 2012-09-28 | 2013-02-13 | 福州博远无线网络科技有限公司 | Text-subject-model-based data processing method for commodity classification |
CN103365978A (en) * | 2013-07-01 | 2013-10-23 | 浙江大学 | Traditional Chinese medicine data mining method based on LDA (Latent Dirichlet Allocation) topic model |
CN103914445A (en) * | 2014-03-05 | 2014-07-09 | 中国人民解放军装甲兵工程学院 | Data semantic processing method |
CN104281692A (en) * | 2014-10-13 | 2015-01-14 | 安徽华贞信息科技有限公司 | Method and system for realizing paragraph dimensionalized description |
CN104281567A (en) * | 2014-10-13 | 2015-01-14 | 安徽华贞信息科技有限公司 | Latent semantic analysis method and system |
CN104317579A (en) * | 2014-10-13 | 2015-01-28 | 安徽华贞信息科技有限公司 | Method and system for business performance of text document |
CN104317785A (en) * | 2014-10-13 | 2015-01-28 | 安徽华贞信息科技有限公司 | Internet paragraph level topic identifying system |
WO2015165230A1 (en) * | 2014-04-28 | 2015-11-05 | 华为技术有限公司 | Social contact message monitoring method and device |
CN105136714A (en) * | 2015-09-06 | 2015-12-09 | 河南工业大学 | Terahertz spectral wavelength selection method based on genetic algorithm |
CN105389306A (en) * | 2015-11-02 | 2016-03-09 | 国网福建省电力有限公司 | Latent semantic analysis based intelligent parsing method for application form |
CN105787088A (en) * | 2016-03-14 | 2016-07-20 | 南京理工大学 | Text information classifying method based on segmented encoding genetic algorithm |
CN106355628A (en) * | 2015-07-16 | 2017-01-25 | 中国石油化工股份有限公司 | Image-text knowledge point marking method and device and image-text mark correcting method and system |
WO2017035922A1 (en) * | 2015-09-02 | 2017-03-09 | 杨鹏 | Online internet topic mining method based on improved lda model |
CN106502983A (en) * | 2016-10-17 | 2017-03-15 | 清华大学 | The event driven collapse Gibbs sampling method of implicit expression Di Li Cray model |
CN106709011A (en) * | 2016-12-26 | 2017-05-24 | 武汉大学 | Positional concept hierarchy disambiguation calculation method based on spatial locating cluster |
CN106815310A (en) * | 2016-12-20 | 2017-06-09 | 华南师范大学 | A kind of hierarchy clustering method and system to magnanimity document sets |
CN107239438A (en) * | 2016-03-28 | 2017-10-10 | 阿里巴巴集团控股有限公司 | A kind of document analysis method and device |
CN108009151A (en) * | 2017-11-29 | 2018-05-08 | 深圳中泓在线股份有限公司 | Newsletter archive automatic segmentation method and apparatus, server and readable storage medium storing program for executing |
CN108038173A (en) * | 2017-12-07 | 2018-05-15 | 广东工业大学 | A kind of Web page classification method, system and a kind of Web page classifying equipment |
CN109299239A (en) * | 2018-09-29 | 2019-02-01 | 福建弘扬软件股份有限公司 | ES-based electronic medical record retrieval method |
CN109325092A (en) * | 2018-11-27 | 2019-02-12 | 中山大学 | Merge the nonparametric parallelization level Di Li Cray process topic model system of phrase information |
CN109829151A (en) * | 2018-11-27 | 2019-05-31 | 国网浙江省电力有限公司 | A kind of text segmenting method based on layering Di Li Cray model |
CN109918659A (en) * | 2019-02-28 | 2019-06-21 | 华南理工大学 | A method of based on not retaining optimum individual genetic algorithm optimization term vector |
CN109977227A (en) * | 2019-03-19 | 2019-07-05 | 中国科学院自动化研究所 | Text feature, system, device based on feature coding |
CN110110326A (en) * | 2019-04-25 | 2019-08-09 | 西安交通大学 | A kind of text cutting method based on subject information |
CN110222654A (en) * | 2019-06-10 | 2019-09-10 | 北京百度网讯科技有限公司 | Text segmenting method, device, equipment and storage medium |
CN111797634A (en) * | 2020-06-04 | 2020-10-20 | 语联网(武汉)信息技术有限公司 | Document segmentation method and device |
CN112667817A (en) * | 2020-12-31 | 2021-04-16 | 杭州电子科技大学 | Text emotion classification integration system based on roulette attribute selection |
CN112988981A (en) * | 2021-05-14 | 2021-06-18 | 哈尔滨工业大学(深圳)(哈尔滨工业大学深圳科技创新研究院) | Automatic labeling method based on genetic algorithm |
CN113191133A (en) * | 2021-04-21 | 2021-07-30 | 北京邮电大学 | Audio text alignment method and system based on Doc2Vec |
CN113366511A (en) * | 2020-01-07 | 2021-09-07 | 支付宝(杭州)信息技术有限公司 | Named entity identification and extraction using genetic programming |
CN113673255A (en) * | 2021-08-25 | 2021-11-19 | 北京市律典通科技有限公司 | Text function region splitting method and device, computer equipment and storage medium |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101287229A (en) * | 2008-05-26 | 2008-10-15 | 北京捷讯畅达科技发展有限公司 | Natural language processing technique and device applying to query by short message service of mobile phone |
-
2009
- 2009-11-26 CN CN2009102191638A patent/CN101710333B/en active Active
Cited By (59)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101968798A (en) * | 2010-09-10 | 2011-02-09 | 中国科学技术大学 | Community recommendation method based on on-line soft constraint LDA algorithm |
CN102024065B (en) * | 2011-01-18 | 2013-01-02 | 中南大学 | SIMD optimization-based webpage duplication elimination and concurrency method |
CN102024065A (en) * | 2011-01-18 | 2011-04-20 | 中南大学 | SIMD optimization-based webpage duplication elimination and concurrency method |
CN102439597B (en) * | 2011-07-13 | 2014-12-24 | 华为技术有限公司 | Parameter deducing method, computing device and system based on potential dirichlet model |
WO2012106885A1 (en) * | 2011-07-13 | 2012-08-16 | 华为技术有限公司 | Latent dirichlet allocation-based parameter inference method, calculation device and system |
US9213943B2 (en) | 2011-07-13 | 2015-12-15 | Huawei Technologies Co., Ltd. | Parameter inference method, calculation apparatus, and system based on latent dirichlet allocation model |
CN102439597A (en) * | 2011-07-13 | 2012-05-02 | 华为技术有限公司 | Parameter deducing method, computing device and system based on potential dirichlet model |
CN102609407A (en) * | 2012-02-16 | 2012-07-25 | 复旦大学 | Fine-grained semantic detection method of harmful text contents in network |
CN102609407B (en) * | 2012-02-16 | 2014-10-29 | 复旦大学 | Fine-grained semantic detection method of harmful text contents in network |
CN102855312A (en) * | 2012-08-24 | 2013-01-02 | 武汉大学 | Domain-and-theme-oriented Web service clustering method |
CN102855312B (en) * | 2012-08-24 | 2013-08-14 | 武汉大学 | Domain-and-theme-oriented Web service clustering method |
CN102929937A (en) * | 2012-09-28 | 2013-02-13 | 福州博远无线网络科技有限公司 | Text-subject-model-based data processing method for commodity classification |
CN102929937B (en) * | 2012-09-28 | 2015-09-16 | 福州博远无线网络科技有限公司 | Based on the data processing method of the commodity classification of text subject model |
CN103365978B (en) * | 2013-07-01 | 2017-03-29 | 浙江大学 | TCM data method for digging based on LDA topic models |
CN103365978A (en) * | 2013-07-01 | 2013-10-23 | 浙江大学 | Traditional Chinese medicine data mining method based on LDA (Latent Dirichlet Allocation) topic model |
CN103914445A (en) * | 2014-03-05 | 2014-07-09 | 中国人民解放军装甲兵工程学院 | Data semantic processing method |
US10250550B2 (en) | 2014-04-28 | 2019-04-02 | Huawei Technologies Co., Ltd. | Social message monitoring method and apparatus |
WO2015165230A1 (en) * | 2014-04-28 | 2015-11-05 | 华为技术有限公司 | Social contact message monitoring method and device |
CN104317785A (en) * | 2014-10-13 | 2015-01-28 | 安徽华贞信息科技有限公司 | Internet paragraph level topic identifying system |
CN104281567A (en) * | 2014-10-13 | 2015-01-14 | 安徽华贞信息科技有限公司 | Latent semantic analysis method and system |
CN104281692A (en) * | 2014-10-13 | 2015-01-14 | 安徽华贞信息科技有限公司 | Method and system for realizing paragraph dimensionalized description |
CN104317579A (en) * | 2014-10-13 | 2015-01-28 | 安徽华贞信息科技有限公司 | Method and system for business performance of text document |
CN106355628A (en) * | 2015-07-16 | 2017-01-25 | 中国石油化工股份有限公司 | Image-text knowledge point marking method and device and image-text mark correcting method and system |
CN106355628B (en) * | 2015-07-16 | 2019-07-05 | 中国石油化工股份有限公司 | The modification method and system of picture and text knowledge point mask method and device, picture and text mark |
WO2017035922A1 (en) * | 2015-09-02 | 2017-03-09 | 杨鹏 | Online internet topic mining method based on improved lda model |
CN105136714A (en) * | 2015-09-06 | 2015-12-09 | 河南工业大学 | Terahertz spectral wavelength selection method based on genetic algorithm |
CN105136714B (en) * | 2015-09-06 | 2017-10-10 | 河南工业大学 | A kind of tera-hertz spectra Wavelength selecting method based on genetic algorithm |
CN105389306A (en) * | 2015-11-02 | 2016-03-09 | 国网福建省电力有限公司 | Latent semantic analysis based intelligent parsing method for application form |
CN105787088A (en) * | 2016-03-14 | 2016-07-20 | 南京理工大学 | Text information classifying method based on segmented encoding genetic algorithm |
CN105787088B (en) * | 2016-03-14 | 2018-12-07 | 南京理工大学 | A kind of text information classification method based on segment encoding genetic algorithm |
CN107239438A (en) * | 2016-03-28 | 2017-10-10 | 阿里巴巴集团控股有限公司 | A kind of document analysis method and device |
CN106502983A (en) * | 2016-10-17 | 2017-03-15 | 清华大学 | The event driven collapse Gibbs sampling method of implicit expression Di Li Cray model |
CN106502983B (en) * | 2016-10-17 | 2019-05-10 | 清华大学 | The event driven collapse Gibbs sampling method of implicit Di Li Cray model |
CN106815310A (en) * | 2016-12-20 | 2017-06-09 | 华南师范大学 | A kind of hierarchy clustering method and system to magnanimity document sets |
CN106815310B (en) * | 2016-12-20 | 2020-04-21 | 华南师范大学 | Hierarchical clustering method and system for massive document sets |
CN106709011A (en) * | 2016-12-26 | 2017-05-24 | 武汉大学 | Positional concept hierarchy disambiguation calculation method based on spatial locating cluster |
CN106709011B (en) * | 2016-12-26 | 2019-07-23 | 武汉大学 | A kind of position concept level resolution calculation method based on space orientation cluster |
CN108009151A (en) * | 2017-11-29 | 2018-05-08 | 深圳中泓在线股份有限公司 | Newsletter archive automatic segmentation method and apparatus, server and readable storage medium storing program for executing |
CN108038173A (en) * | 2017-12-07 | 2018-05-15 | 广东工业大学 | A kind of Web page classification method, system and a kind of Web page classifying equipment |
CN109299239A (en) * | 2018-09-29 | 2019-02-01 | 福建弘扬软件股份有限公司 | ES-based electronic medical record retrieval method |
CN109299239B (en) * | 2018-09-29 | 2021-11-23 | 福建弘扬软件股份有限公司 | ES-based electronic medical record retrieval method |
CN109829151A (en) * | 2018-11-27 | 2019-05-31 | 国网浙江省电力有限公司 | A kind of text segmenting method based on layering Di Li Cray model |
CN109325092A (en) * | 2018-11-27 | 2019-02-12 | 中山大学 | Merge the nonparametric parallelization level Di Li Cray process topic model system of phrase information |
CN109918659A (en) * | 2019-02-28 | 2019-06-21 | 华南理工大学 | A method of based on not retaining optimum individual genetic algorithm optimization term vector |
CN109918659B (en) * | 2019-02-28 | 2023-06-20 | 华南理工大学 | Method for optimizing word vector based on unreserved optimal individual genetic algorithm |
CN109977227A (en) * | 2019-03-19 | 2019-07-05 | 中国科学院自动化研究所 | Text feature, system, device based on feature coding |
CN110110326A (en) * | 2019-04-25 | 2019-08-09 | 西安交通大学 | A kind of text cutting method based on subject information |
CN110110326B (en) * | 2019-04-25 | 2020-10-27 | 西安交通大学 | Text cutting method based on subject information |
CN110222654A (en) * | 2019-06-10 | 2019-09-10 | 北京百度网讯科技有限公司 | Text segmenting method, device, equipment and storage medium |
CN113366511A (en) * | 2020-01-07 | 2021-09-07 | 支付宝(杭州)信息技术有限公司 | Named entity identification and extraction using genetic programming |
CN113366511B (en) * | 2020-01-07 | 2022-03-25 | 支付宝(杭州)信息技术有限公司 | Named entity identification and extraction using genetic programming |
CN111797634A (en) * | 2020-06-04 | 2020-10-20 | 语联网(武汉)信息技术有限公司 | Document segmentation method and device |
CN111797634B (en) * | 2020-06-04 | 2023-09-08 | 语联网(武汉)信息技术有限公司 | Document segmentation method and device |
CN112667817B (en) * | 2020-12-31 | 2022-05-31 | 杭州电子科技大学 | Text emotion classification integration system based on roulette attribute selection |
CN112667817A (en) * | 2020-12-31 | 2021-04-16 | 杭州电子科技大学 | Text emotion classification integration system based on roulette attribute selection |
CN113191133A (en) * | 2021-04-21 | 2021-07-30 | 北京邮电大学 | Audio text alignment method and system based on Doc2Vec |
CN112988981A (en) * | 2021-05-14 | 2021-06-18 | 哈尔滨工业大学(深圳)(哈尔滨工业大学深圳科技创新研究院) | Automatic labeling method based on genetic algorithm |
CN113673255A (en) * | 2021-08-25 | 2021-11-19 | 北京市律典通科技有限公司 | Text function region splitting method and device, computer equipment and storage medium |
CN113673255B (en) * | 2021-08-25 | 2023-06-30 | 北京市律典通科技有限公司 | Text function area splitting method and device, computer equipment and storage medium |
Also Published As
Publication number | Publication date |
---|---|
CN101710333B (en) | 2012-07-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101710333B (en) | Network text segmenting method based on genetic algorithm | |
CN106844424B (en) | LDA-based text classification method | |
CN100353361C (en) | New method of characteristic vector weighting for text classification and its device | |
Zamani et al. | Neural query performance prediction using weak supervision from multiple signals | |
CN103984681B (en) | News event evolution analysis method based on time sequence distribution information and topic model | |
CN104268197B (en) | A kind of industry comment data fine granularity sentiment analysis method | |
CN102073730B (en) | Method for constructing topic web crawler system | |
Takanobu et al. | A Weakly Supervised Method for Topic Segmentation and Labeling in Goal-oriented Dialogues via Reinforcement Learning. | |
CN105045812A (en) | Text topic classification method and system | |
CN105760493A (en) | Automatic work order classification method for electricity marketing service hot spot 95598 | |
CN109670039A (en) | Sentiment analysis method is commented on based on the semi-supervised electric business of tripartite graph and clustering | |
CN105608200A (en) | Network public opinion tendency prediction analysis method | |
CN102591862A (en) | Control method and device of Chinese entity relationship extraction based on word co-occurrence | |
CN103514183A (en) | Information search method and system based on interactive document clustering | |
CN101980199A (en) | Method and system for discovering network hot topic based on situation assessment | |
CN103995876A (en) | Text classification method based on chi square statistics and SMO algorithm | |
CN101714135B (en) | Emotional orientation analytical method of cross-domain texts | |
CN105095183A (en) | Text emotional tendency determination method and system | |
Fitriyani et al. | The K-means with mini batch algorithm for topics detection on online news | |
CN109446423A (en) | A kind of Judgment by emotion system and method for news and text | |
CN106202530A (en) | Data processing method and device | |
CN102436512A (en) | Preference-based web page text content control method | |
Foong et al. | Text summarization using latent semantic analysis model in mobile android platform | |
CN117474126A (en) | LLaMa2 big data model design method for initial examination and evaluation of manuscript | |
Tizhoosh et al. | Poetic features for poem recognition: A comparative study |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
ASS | Succession or assignment of patent right |
Owner name: NANTONG LONGXIANG ELECTRICAL EQUIPMENT CO., LTD. Free format text: FORMER OWNER: NORTHWESTERN POLYTECHNICAL UNIVERSITY Effective date: 20140814 Owner name: NORTHWESTERN POLYTECHNICAL UNIVERSITY Effective date: 20140814 |
|
C41 | Transfer of patent application or patent right or utility model | ||
COR | Change of bibliographic data |
Free format text: CORRECT: ADDRESS; FROM: 710072 XI AN, SHAANXI PROVINCE TO: 226600 NANTONG, JIANGSU PROVINCE |
|
TR01 | Transfer of patent right |
Effective date of registration: 20140814 Address after: 226600 No. 69 Donghai Road, Haian Development Zone, Nantong, Jiangsu Patentee after: NANTONG LONGXIANG ELECTRIC EQUIPMENT CO., LTD. Patentee after: Northwestern Polytechnical University Address before: 710072 Xi'an friendship West Road, Shaanxi, No. 127 Patentee before: Northwestern Polytechnical University |