CN102693273A - Unsupervised message clustering - Google Patents
Unsupervised message clustering
- Publication number
- CN102693273A
- Authority
- CN
- China
- Prior art keywords
- message
- cluster
- value
- vector
- label vector
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
- G06F16/355—Class or cluster creation or modification
Abstract
Description
Claims (15)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US13/051299 | 2011-03-18 | ||
US13/051,299 US8666984B2 (en) | 2011-03-18 | 2011-03-18 | Unsupervised message clustering |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102693273A (zh) | 2012-09-26 |
CN102693273B (zh) | 2016-12-21 |
Family
ID=46829300
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210071795.6A Active CN102693273B (zh) | 2011-03-18 | 2012-03-19 | Unsupervised message clustering |
Country Status (2)
Country | Link |
---|---|
US (1) | US8666984B2 (zh) |
CN (1) | CN102693273B (zh) |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
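CN107679052A (zh) * | 2016-06-09 | 2018-02-09 | Shimadzu Corporation | Big data analysis method and mass spectrometry system using the analysis method |
CN109147446A (zh) * | 2018-08-20 | 2019-01-04 | Guozhengtong Technology Co., Ltd. | Electronic examination system |
CN109933610A (zh) * | 2019-02-18 | 2019-06-25 | Alibaba Group Holding Ltd. | Data processing method, apparatus, computer device, and storage medium |
CN110268428A (zh) * | 2017-02-20 | 2019-09-20 | Google LLC | Topic-based message grouping and summarization |
CN111242040A (zh) * | 2020-01-15 | 2020-06-05 | PCI-Suntek Technology Co., Ltd. | Dynamic face clustering method, apparatus, device, and storage medium |
CN111865760A (zh) * | 2020-06-29 | 2020-10-30 | Vivo Mobile Communication Co., Ltd. | Message display method and apparatus |
CN112732914A (zh) * | 2020-12-30 | 2021-04-30 | Shenzhen Wanglian Anrui Network Technology Co., Ltd. | Text clustering method, system, storage medium, and terminal based on keyword matching |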
Families Citing this family (27)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102831116A (zh) * | 2011-06-14 | 2012-12-19 | International Business Machines Corporation | Method and system for document clustering |
CA2832918C (en) * | 2011-06-22 | 2016-05-10 | Rogers Communications Inc. | Systems and methods for ranking document clusters |
US8954458B2 (en) | 2011-07-11 | 2015-02-10 | Aol Inc. | Systems and methods for providing a content item database and identifying content items |
US9407463B2 (en) * | 2011-07-11 | 2016-08-02 | Aol Inc. | Systems and methods for providing a spam database and identifying spam communications |
US20130086072A1 (en) * | 2011-10-03 | 2013-04-04 | Xerox Corporation | Method and system for extracting and classifying geolocation information utilizing electronic social media |
US10733669B2 (en) | 2012-08-02 | 2020-08-04 | Chicago Mercantile Exchange Inc. | Message processing |
US9535996B1 (en) | 2012-08-30 | 2017-01-03 | deviantArt, Inc. | Selecting content objects for recommendation based on content object collections |
WO2014110583A1 (en) * | 2013-01-14 | 2014-07-17 | Zoosk, Inc. | System and method for improving messages |
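CN103136359B (zh) * | 2013-03-07 | 2016-01-20 | Ningbo Chengdian Taike Electronic Information Technology Development Co., Ltd. | Single-document summary generation method |
CN104111921B (zh) * | 2013-04-16 | 2018-11-09 | Beijing Samsung Telecommunications Technology Research Co., Ltd. | Method and device for obtaining network feedback |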
US11086905B1 (en) * | 2013-07-15 | 2021-08-10 | Twitter, Inc. | Method and system for presenting stories |
US10296616B2 (en) * | 2014-07-31 | 2019-05-21 | Splunk Inc. | Generation of a search query to approximate replication of a cluster of events |
US10621181B2 (en) * | 2014-12-30 | 2020-04-14 | Jpmorgan Chase Bank Usa, Na | System and method for screening social media content |
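CN105843798A (zh) * | 2016-04-05 | 2016-08-10 | Jiangsu Dingzhong Intelligent Technology Co., Ltd. | Internet information collection and fusion method based on a divide-and-conquer strategy for long and short messages |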
US10489466B1 (en) * | 2016-09-29 | 2019-11-26 | EMC IP Holding Company LLC | Method and system for document similarity analysis based on weak transitive relation of similarity |
US10769159B2 (en) * | 2016-12-22 | 2020-09-08 | Aon Global Operations Plc, Singapore Branch | Systems and methods for data mining of historic electronic communication exchanges to identify relationships, patterns, and correlations to deal outcomes |
US10606853B2 (en) | 2016-12-22 | 2020-03-31 | Aon Global Operations Ltd (Singapore Branch) | Systems and methods for intelligent prospect identification using online resources and neural network processing to classify organizations based on published materials |
US9946789B1 (en) * | 2017-04-28 | 2018-04-17 | Shenzhen Cestbon Technology Co. Limited | Classifying electronic messages using individualized artificial intelligence techniques |
US10447635B2 (en) | 2017-05-17 | 2019-10-15 | Slice Technologies, Inc. | Filtering electronic messages |
US11734096B2 (en) * | 2017-10-23 | 2023-08-22 | Vmware, Inc. | Disaster prediction recovery: statistical content based filter for software as a service |
FR3077148A1 (fr) * | 2018-01-22 | 2019-07-26 | Davidson Si | Method and electronic device for selecting at least one message from a set of several messages, and associated computer program |
US11803883B2 (en) | 2018-01-29 | 2023-10-31 | Nielsen Consumer Llc | Quality assurance for labeled training data |
US11010376B2 (en) * | 2018-10-20 | 2021-05-18 | Verizon Patent And Licensing Inc. | Methods and systems for determining search parameters from a search query |
US10956672B1 (en) * | 2018-12-19 | 2021-03-23 | Imperva, Inc. | High volume message classification and distribution |
US10951695B2 (en) | 2019-02-14 | 2021-03-16 | Aon Global Operations Se Singapore Branch | System and methods for identification of peer entities |
US11330009B2 (en) * | 2020-03-04 | 2022-05-10 | Sift Science, Inc. | Systems and methods for machine learning-based digital content clustering, digital content threat detection, and digital content threat remediation in machine learning task-oriented digital threat mitigation platform |
US20210304121A1 (en) * | 2020-03-30 | 2021-09-30 | Coupang, Corp. | Computerized systems and methods for product integration and deduplication using artificial intelligence |
Family Cites Families (27)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6961954B1 (en) * | 1997-10-27 | 2005-11-01 | The Mitre Corporation | Automated segmentation, information extraction, summarization, and presentation of broadcast news |
AU2001264676A1 (en) * | 2000-05-19 | 2001-12-03 | Intellibridge Corporation | Method and apparatus for providing customized information |
US20020051077A1 (en) * | 2000-07-19 | 2002-05-02 | Shih-Ping Liou | Videoabstracts: a system for generating video summaries |
US7225233B1 (en) * | 2000-10-03 | 2007-05-29 | Fenton James R | System and method for interactive, multimedia entertainment, education or other experience, and revenue generation therefrom |
JP2002175005A (ja) * | 2000-12-06 | 2002-06-21 | Rettsu International:Kk | Teaching material for language learning and method for producing the same |
US7685265B1 (en) * | 2003-11-20 | 2010-03-23 | Microsoft Corporation | Topic-based notification service |
US7158966B2 (en) * | 2004-03-09 | 2007-01-02 | Microsoft Corporation | User intent discovery |
US7693817B2 (en) * | 2005-06-29 | 2010-04-06 | Microsoft Corporation | Sensing, storing, indexing, and retrieving data leveraging measures of user activity, attention, and interest |
EP2035967A2 (en) * | 2006-05-02 | 2009-03-18 | Koninklijke Philips Electronics N.V. | System and method for associating a category label of one user with a category label defined by another user |
US8375039B2 (en) * | 2006-08-11 | 2013-02-12 | Microsoft Corporation | Topic centric media sharing |
US9817902B2 (en) * | 2006-10-27 | 2017-11-14 | Netseer Acquisition, Inc. | Methods and apparatus for matching relevant content to user intention |
US7693902B2 (en) * | 2007-05-02 | 2010-04-06 | Yahoo! Inc. | Enabling clustered search processing via text messaging |
US7783597B2 (en) * | 2007-08-02 | 2010-08-24 | Abaca Technology Corporation | Email filtering using recipient reputation |
US7996390B2 (en) | 2008-02-15 | 2011-08-09 | The University Of Utah Research Foundation | Method and system for clustering identified forms |
US8578274B2 (en) * | 2008-09-26 | 2013-11-05 | Radius Intelligence. Inc. | System and method for aggregating web feeds relevant to a geographical locale from multiple sources |
US8539359B2 (en) | 2009-02-11 | 2013-09-17 | Jeffrey A. Rapaport | Social network driven indexing system for instantly clustering people with concurrent focus on same topic into on-topic chat rooms and/or for generating on-topic search results tailored to user preferences regarding topic |
US20100257028A1 (en) * | 2009-04-02 | 2010-10-07 | Talk3, Inc. | Methods and systems for extracting and managing latent social networks for use in commercial activities |
US20110125844A1 (en) * | 2009-05-18 | 2011-05-26 | Telcordia Technologies, Inc. | mobile enabled social networking application to support closed, moderated group interactions for purpose of facilitating therapeutic care |
TW201118589A (en) * | 2009-06-09 | 2011-06-01 | Ebh Entpr Inc | Methods, apparatus and software for analyzing the content of micro-blog messages |
US8230350B2 (en) * | 2009-07-03 | 2012-07-24 | Tweetdeck, Inc. | System and method for managing and displaying data messages |
US20110055723A1 (en) * | 2009-08-25 | 2011-03-03 | Simon Samuel Lightstone | Collaboratively interactive micro-blog posts |
US20110231478A1 (en) * | 2009-09-10 | 2011-09-22 | Motorola, Inc. | System, Server, and Mobile Device for Content Provider Website Interaction and Method Therefore |
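CN101794303A (zh) * | 2010-02-11 | 2010-08-04 | Chongqing University of Posts and Telecommunications | Method and device for classifying text using feature expansion and for constructing a text classifier |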
US8396874B2 (en) * | 2010-02-17 | 2013-03-12 | Yahoo! Inc. | System and method for using topic messages to understand media relating to an event |
US20110218931A1 (en) * | 2010-03-03 | 2011-09-08 | Microsoft Corporation | Notifications in a Social Network Service |
US8510348B2 (en) * | 2010-03-03 | 2013-08-13 | Wgrs Licensing Company, Llc | Systems and methods for creating and using imbedded shortcodes and shortened physical and internet addresses |
US8990065B2 (en) * | 2011-01-11 | 2015-03-24 | Microsoft Technology Licensing, Llc | Automatic story summarization from clustered messages |
- 2011-03-18: US application US13/051,299, patent US8666984B2 (en), status Active
- 2012-03-19: CN application CN201210071795.6A, patent CN102693273B (zh), status Active
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
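CN107679052A (zh) * | 2016-06-09 | 2018-02-09 | Shimadzu Corporation | Big data analysis method and mass spectrometry system using the analysis method |
CN110268428A (zh) * | 2017-02-20 | 2019-09-20 | Google LLC | Topic-based message grouping and summarization |
CN109147446A (zh) * | 2018-08-20 | 2019-01-04 | Guozhengtong Technology Co., Ltd. | Electronic examination system |
CN109933610A (zh) * | 2019-02-18 | 2019-06-25 | Alibaba Group Holding Ltd. | Data processing method, apparatus, computer device, and storage medium |
CN109933610B (zh) * | 2019-02-18 | 2023-08-01 | Advanced New Technologies Co., Ltd. | Data processing method, apparatus, computer device, and storage medium |
CN111242040A (zh) * | 2020-01-15 | 2020-06-05 | PCI-Suntek Technology Co., Ltd. | Dynamic face clustering method, apparatus, device, and storage medium |
CN111242040B (zh) * | 2020-01-15 | 2022-08-02 | PCI Technology Group Co., Ltd. | Dynamic face clustering method, apparatus, device, and storage medium |
CN111865760A (zh) * | 2020-06-29 | 2020-10-30 | Vivo Mobile Communication Co., Ltd. | Message display method and apparatus |
CN112732914A (zh) * | 2020-12-30 | 2021-04-30 | Shenzhen Wanglian Anrui Network Technology Co., Ltd. | Text clustering method, system, storage medium, and terminal based on keyword matching |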
Also Published As
Publication number | Publication date |
---|---|
US20120239650A1 (en) | 2012-09-20 |
CN102693273B (zh) | 2016-12-21 |
US8666984B2 (en) | 2014-03-04 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
CN102693273A (zh) | Unsupervised message clustering | |
CN102402605B (zh) | Hybrid distribution model for search engine indexes | |
US8630972B2 (en) | Providing context for web articles | |
Wetzker et al. | A hybrid approach to item recommendation in folksonomies | |
US9262528B2 (en) | Intent management tool for identifying concepts associated with a plurality of users' queries | |
CN101641697B (zh) | Related search queries for a web page and their applications | |
US8095547B2 (en) | Method and apparatus for detecting spam user created content | |
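CN101796795B (zh) | Distributed system | |
CN101401062A (zh) | Method and system for determining relevant sources, querying, and merging results from multiple content sources | |
CN104885081A (zh) | Search system and corresponding method | |
CN101305371A (zh) | Ranking blog documents | |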
EP3047403A1 (en) | Improvements in website traffic optimization | |
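CN103150374A (zh) | Method and system for identifying abnormal microblog users | |
CN102368252A (zh) | Applying search queries to content sets | |
CN102646108A (zh) | Information retrieval using a topic-aware document ranker | |
CN104969254A (zh) | Personalized summary of content | |
CN102955844A (zh) | Presenting search results based on topic versions | |
CN115760258A (zh) | Intelligent bid document generation method, system, computer device, and storage medium | |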
TWI584136B (zh) | Graphic code library updates, query methods and related devices | |
US20170147652A1 (en) | Search servers, end devices, and search methods for use in a distributed network | |
CN108764770A (zh) | Method, apparatus, and terminal device for automatically finding logistics information | |
CN109960719A (zh) | File processing method and related apparatus | |
US9552415B2 (en) | Category classification processing device and method | |
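CN110737432A (zh) | Script-assisted design method and apparatus based on a root-word table | |
CN110545233B (zh) | Information push method, apparatus, electronic device, and storage medium |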
Legal Events
Code | Title | Description |
---|---|---|
C06 | Publication | |
PB01 | Publication | |
REG | Reference to a national code | Ref country code: HK; Ref legal event code: DE; Ref document number: 1173532; Country of ref document: HK |
C10 | Entry into substantive examination | |
SE01 | Entry into force of request for substantive examination | |
ASS | Succession or assignment of patent right | Owner name: MICROSOFT TECHNOLOGY LICENSING LLC; Free format text: FORMER OWNER: MICROSOFT CORP.; Effective date: 20150625 |
C41 | Transfer of patent application or patent right or utility model | |
TA01 | Transfer of patent application right | Effective date of registration: 20150625; Address after: Washington State; Applicant after: Microsoft Technology Licensing LLC; Address before: Washington State; Applicant before: Microsoft Corp. |
C14 | Grant of patent or utility model | |
GR01 | Patent grant | |
REG | Reference to a national code | Ref country code: HK; Ref legal event code: GR; Ref document number: 1173532; Country of ref document: HK |