HK1159815A1 - Method and apparatus for data categorizing - Google Patents

Method and apparatus for data categorizing

Info

Publication number
HK1159815A1
HK1159815A1 HK12100209.3A HK12100209A HK1159815A1 HK 1159815 A1 HK1159815 A1 HK 1159815A1 HK 12100209 A HK12100209 A HK 12100209A HK 1159815 A1 HK1159815 A1 HK 1159815A1
Authority
HK
Hong Kong
Prior art keywords
data categorizing
categorizing
data
Prior art date
Application number
HK12100209.3A
Other languages
English (en)
Inventor
Ling Zhong
Hualei Liu
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd filed Critical Alibaba Group Holding Ltd
Publication of HK1159815A1 publication Critical patent/HK1159815A1/xx

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • G06F16/355Class or cluster creation or modification
HK12100209.3A 2010-03-09 2012-01-09 Method and apparatus for data categorizing HK1159815A1 (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2010101221412A CN102193936B (zh) 2010-03-09 2010-03-09 一种数据分类的方法及装置

Publications (1)

Publication Number Publication Date
HK1159815A1 true HK1159815A1 (en) 2012-08-03

Family

ID=44560907

Family Applications (1)

Application Number Title Priority Date Filing Date
HK12100209.3A HK1159815A1 (en) 2010-03-09 2012-01-09 Method and apparatus for data categorizing

Country Status (5)

Country Link
US (1) US20110225161A1 (zh)
EP (1) EP2545511A4 (zh)
CN (1) CN102193936B (zh)
HK (1) HK1159815A1 (zh)
WO (1) WO2011112236A1 (zh)

Families Citing this family (38)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102332137A (zh) * 2011-09-23 2012-01-25 纽海信息技术(上海)有限公司 商品匹配方法及系统
US20130268328A1 (en) * 2012-04-09 2013-10-10 Yahoo! Inc. Generating a deal score to indicate a relative value of an offer
CN103377216A (zh) * 2012-04-24 2013-10-30 苏州引角信息科技有限公司 产品信息库的构建方法及系统
CN103577989B (zh) * 2012-07-30 2017-11-14 阿里巴巴集团控股有限公司 一种基于产品识别的信息分类方法及信息分类系统
US9110983B2 (en) * 2012-08-17 2015-08-18 Intel Corporation Traversing data utilizing data relationships
CN103678335B (zh) * 2012-09-05 2017-12-08 阿里巴巴集团控股有限公司 商品标识标签的方法、装置及商品导航的方法
CN103729365A (zh) * 2012-10-12 2014-04-16 阿里巴巴集团控股有限公司 一种搜索方法和系统
CN104008101B (zh) * 2013-02-21 2019-02-12 北京京东尚科信息技术有限公司 货物分类检验方法及检验装置
US9483741B2 (en) 2013-03-28 2016-11-01 Wal-Mart Stores, Inc. Rule-based item classification
US9436919B2 (en) 2013-03-28 2016-09-06 Wal-Mart Stores, Inc. System and method of tuning item classification
CN103235822B (zh) * 2013-05-03 2016-05-25 富景天策(北京)气象科技有限公司 数据库的生成及查询方法
US10678878B2 (en) 2013-05-20 2020-06-09 Tencent Technology (Shenzhen) Company Limited Method, device and storing medium for searching
CN104077337B (zh) * 2013-05-20 2015-11-25 腾讯科技(深圳)有限公司 搜索方法及装置
CN103294798B (zh) * 2013-05-27 2016-08-31 北京尚友通达信息技术有限公司 基于二元切词和支持向量机的商品自动分类方法
US10489842B2 (en) * 2013-09-30 2019-11-26 Ebay Inc. Large-scale recommendations for a dynamic inventory
CN103544264A (zh) * 2013-10-17 2014-01-29 常熟市华安电子工程有限公司 一种商品标题优化工具
CN103605815B (zh) * 2013-12-11 2016-08-31 焦点科技股份有限公司 一种适用于b2b电子商务平台的商品信息自动分类推荐方法
US20150331936A1 (en) * 2014-05-14 2015-11-19 Faris ALQADAH Method and system for extracting a product and classifying text-based electronic documents
US9607098B2 (en) 2014-06-02 2017-03-28 Wal-Mart Stores, Inc. Determination of product attributes and values using a product entity graph
CN104408635A (zh) * 2014-12-01 2015-03-11 银联智惠信息服务(上海)有限公司 商户类别信息识别方法和装置
CN106570573B (zh) * 2015-10-13 2022-05-27 菜鸟智能物流控股有限公司 预测包裹属性信息的方法及装置
CN105589847B (zh) * 2015-12-22 2019-02-15 北京奇虎科技有限公司 带权重的文章标识方法和装置
CN106919543A (zh) * 2015-12-24 2017-07-04 阿里巴巴集团控股有限公司 确定商品对象标题文本的方法及装置
CN107203542A (zh) * 2016-03-17 2017-09-26 阿里巴巴集团控股有限公司 词组提取方法及装置
CN107203507B (zh) * 2016-03-17 2019-08-13 阿里巴巴集团控股有限公司 特征词汇提取方法及装置
CN107766394B (zh) * 2016-08-23 2021-12-21 阿里巴巴集团控股有限公司 业务数据处理方法及其系统
US10200759B1 (en) * 2017-07-28 2019-02-05 Rovi Guides, Inc. Systems and methods for identifying and correlating an advertised object from a media asset with a demanded object from a group of interconnected computing devices embedded in a living environment of a user
CN110147483B (zh) * 2017-09-12 2023-09-29 阿里巴巴集团控股有限公司 一种标题重建方法及装置
CN108171586A (zh) * 2018-01-23 2018-06-15 北京值得买科技股份有限公司 一种商品聚类方法及装置
CN108388555A (zh) * 2018-02-01 2018-08-10 口碑(上海)信息技术有限公司 基于行业类别的商品去重方法及装置
CN108491873B (zh) * 2018-03-19 2019-05-14 广州蓝深科技有限公司 一种基于数据分析的商品分类方法
CN109543940B (zh) * 2018-10-12 2024-04-09 中国平安人寿保险股份有限公司 活动评估方法、装置、电子设备及存储介质
CN111625620A (zh) * 2019-02-28 2020-09-04 北京京东尚科信息技术有限公司 信息处理方法和装置
CN111723566B (zh) * 2019-03-21 2024-01-23 阿里巴巴集团控股有限公司 产品信息的重构方法和装置
CN110647630A (zh) * 2019-09-30 2020-01-03 浙江执御信息技术有限公司 检测同款商品的方法及装置
US20210304121A1 (en) * 2020-03-30 2021-09-30 Coupang, Corp. Computerized systems and methods for product integration and deduplication using artificial intelligence
CN112181968A (zh) * 2020-09-29 2021-01-05 京东数字科技控股股份有限公司 统一商品信息的方法、装置、系统及存储介质
US11829396B1 (en) * 2022-01-25 2023-11-28 Wizsoft Ltd. Method and system for retrieval based on an inexact full-text search

Family Cites Families (47)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2943447B2 (ja) * 1991-01-30 1999-08-30 三菱電機株式会社 テキスト情報抽出装置とテキスト類似照合装置とテキスト検索システムとテキスト情報抽出方法とテキスト類似照合方法、及び、質問解析装置
US5371807A (en) * 1992-03-20 1994-12-06 Digital Equipment Corporation Method and apparatus for text classification
US5331554A (en) * 1992-12-10 1994-07-19 Ricoh Corporation Method and apparatus for semantic pattern matching for text retrieval
US5438628A (en) * 1993-04-19 1995-08-01 Xerox Corporation Method for matching text images and documents using character shape codes
US7082426B2 (en) * 1993-06-18 2006-07-25 Cnet Networks, Inc. Content aggregation method and apparatus for an on-line product catalog
US6714933B2 (en) * 2000-05-09 2004-03-30 Cnet Networks, Inc. Content aggregation method and apparatus for on-line purchasing system
CN1158460A (zh) * 1996-12-31 1997-09-03 复旦大学 一种跨语种语料自动分类与检索方法
US6742003B2 (en) * 2001-04-30 2004-05-25 Microsoft Corporation Apparatus and accompanying methods for visualizing clusters of data and hierarchical cluster classifications
US6751600B1 (en) * 2000-05-30 2004-06-15 Commerce One Operations, Inc. Method for automatic categorization of items
US7076485B2 (en) * 2001-03-07 2006-07-11 The Mitre Corporation Method and system for finding similar records in mixed free-text and structured data
US7716161B2 (en) * 2002-09-24 2010-05-11 Google, Inc, Methods and apparatus for serving relevant advertisements
US20040093200A1 (en) * 2002-11-07 2004-05-13 Island Data Corporation Method of and system for recognizing concepts
US20040102957A1 (en) * 2002-11-22 2004-05-27 Levin Robert E. System and method for speech translation using remote devices
US7516070B2 (en) * 2003-02-19 2009-04-07 Custom Speech Usa, Inc. Method for simultaneously creating audio-aligned final and verbatim text with the assistance of a speech recognition program as may be useful in form completion using a verbal entry method
WO2005027092A1 (ja) * 2003-09-08 2005-03-24 Nec Corporation 文書作成閲覧方法、文書作成閲覧装置、文書作成閲覧ロボットおよび文書作成閲覧プログラム
US20080235018A1 (en) * 2004-01-20 2008-09-25 Koninklikke Philips Electronic,N.V. Method and System for Determing the Topic of a Conversation and Locating and Presenting Related Content
JP4366249B2 (ja) * 2004-06-02 2009-11-18 パイオニア株式会社 情報処理装置、その方法、そのプログラム、そのプログラムを記録した記録媒体、および、情報取得装置
WO2006046390A1 (ja) * 2004-10-29 2006-05-04 Matsushita Electric Industrial Co., Ltd. 情報検索装置
US8903827B2 (en) * 2004-10-29 2014-12-02 Ebay Inc. Method and system for categorizing items automatically
EP1848192A4 (en) * 2005-02-08 2012-10-03 Nippon Telegraph & Telephone END DEVICE, SYSTEM, METHOD AND PROGRAM FOR INFORMATION COMMUNICATION AND RECORDING MEDIUM WHICH RECORDED THE PROGRAM
US20070055526A1 (en) * 2005-08-25 2007-03-08 International Business Machines Corporation Method, apparatus and computer program product providing prosodic-categorical enhancement to phrase-spliced text-to-speech synthesis
US7574449B2 (en) * 2005-12-02 2009-08-11 Microsoft Corporation Content matching
JP4961755B2 (ja) * 2006-01-23 2012-06-27 富士ゼロックス株式会社 単語アライメント装置、単語アライメント方法、単語アライメントプログラム
US7698140B2 (en) * 2006-03-06 2010-04-13 Foneweb, Inc. Message transcription, voice query and query delivery system
US20100138451A1 (en) * 2006-04-03 2010-06-03 Assaf Henkin Techniques for facilitating on-line contextual analysis and advertising
US20070294610A1 (en) * 2006-06-02 2007-12-20 Ching Phillip W System and method for identifying similar portions in documents
JP5223673B2 (ja) * 2006-06-29 2013-06-26 日本電気株式会社 音声処理装置およびプログラム、並びに、音声処理方法
JP4125780B2 (ja) * 2006-11-09 2008-07-30 松下電器産業株式会社 コンテンツ検索装置
CN101004737A (zh) * 2007-01-24 2007-07-25 贵阳易特软件有限公司 基于关键词的个性化文档处理系统
WO2008090609A1 (ja) * 2007-01-25 2008-07-31 Fujitsu Limited 嗜好番組抽出装置
US8122032B2 (en) * 2007-07-20 2012-02-21 Google Inc. Identifying and linking similar passages in a digital text corpus
US7945525B2 (en) * 2007-11-09 2011-05-17 International Business Machines Corporation Methods for obtaining improved text similarity measures which replace similar characters with a string pattern representation by using a semantic data tree
US20090132385A1 (en) * 2007-11-21 2009-05-21 Techtain Inc. Method and system for matching user-generated text content
US8077984B2 (en) * 2008-01-04 2011-12-13 Xerox Corporation Method for computing similarity between text spans using factored word sequence kernels
US20090292677A1 (en) * 2008-02-15 2009-11-26 Wordstream, Inc. Integrated web analytics and actionable workbench tools for search engine optimization and marketing
US7958136B1 (en) * 2008-03-18 2011-06-07 Google Inc. Systems and methods for identifying similar documents
JP5224868B2 (ja) * 2008-03-28 2013-07-03 株式会社東芝 情報推薦装置および情報推薦方法
US8145482B2 (en) * 2008-05-25 2012-03-27 Ezra Daya Enhancing analysis of test key phrases from acoustic sources with key phrase training models
US8214346B2 (en) * 2008-06-27 2012-07-03 Cbs Interactive Inc. Personalization engine for classifying unstructured documents
US8060513B2 (en) * 2008-07-01 2011-11-15 Dossierview Inc. Information processing with integrated semantic contexts
US8577930B2 (en) * 2008-08-20 2013-11-05 Yahoo! Inc. Measuring topical coherence of keyword sets
US20100250526A1 (en) * 2009-03-27 2010-09-30 Prochazka Filip Search System that Uses Semantic Constructs Defined by Your Social Network
US8306807B2 (en) * 2009-08-17 2012-11-06 N T repid Corporation Structured data translation apparatus, system and method
US20110258054A1 (en) * 2010-04-19 2011-10-20 Sandeep Pandey Automatic Generation of Bid Phrases for Online Advertising
US9560206B2 (en) * 2010-04-30 2017-01-31 American Teleconferencing Services, Ltd. Real-time speech-to-text conversion in an audio conference session
KR101196935B1 (ko) * 2010-07-05 2012-11-05 엔에이치엔(주) 실시간 인기 키워드에 대한 대표 문구를 제공하는 방법 및 시스템
US8407215B2 (en) * 2010-12-10 2013-03-26 Sap Ag Text analysis to identify relevant entities

Also Published As

Publication number Publication date
EP2545511A1 (en) 2013-01-16
CN102193936B (zh) 2013-09-18
CN102193936A (zh) 2011-09-21
EP2545511A4 (en) 2016-03-16
WO2011112236A1 (en) 2011-09-15
US20110225161A1 (en) 2011-09-15

Similar Documents

Publication Publication Date Title
HK1159815A1 (en) Method and apparatus for data categorizing
IL242091B (en) Device, system and method
ZA201105101B (en) Data processing apparatus and method
EP2609720A4 (en) METHOD AND APPARATUS FOR FILTERING CONTINUOUS DIFFUSION DATA
GB2479922B (en) Data transmission apparatus and method
EP2715550A4 (en) APPARATUSES AND METHODS FOR ENSURING DATA INTEGRITY
EP2715549A4 (en) APPARATUSES AND METHODS FOR ENSURING DATA INTEGRITY
GB201019798D0 (en) Data processing apparatus and method
GB2470611B (en) Apparatus and method for processing data
EP2613443A4 (en) DATA PROCESSING DEVICE AND DATA PROCESSING METHOD
GB201103737D0 (en) Method and apparatus for transferring data
EP2618491A4 (en) DATA PROCESSING DEVICE AND DATA PROCESSING METHOD
EP2790434A4 (en) DATA TRANSMISSION PROCESS AND DEVICE
EP2506522A4 (en) METHOD AND DEVICE FOR PUSHING DATA
EP2549390A4 (en) DATA PROCESSING DEVICE AND DATA PROCESSING METHOD
HK1183394A1 (zh) 信息處理裝置和信息處理方法
HUE044124T2 (hu) Eljárás és berendezés adatok veszteséges tömörítõ kódolására
EP2761447A4 (en) DEVICE AND METHOD FOR SYNCHRONIZING APPLICATION DATA
EP2645579A4 (en) DATA PROCESSING DEVICE AND DATA PROCESSING METHOD
PT2793227T (pt) Método, dispositivo e sistema para processamento de dados áudio
EP2512064A4 (en) METHOD AND APPARATUS FOR CONFIGURING DATA
GB2490773B (en) Method and apparatus for the classification of data
EP2616948A4 (en) METHOD AND APPARATUS FOR MANAGING DATA
EP2622544A4 (en) METHOD AND DEVICE FOR DATA PROCESSING
HK1181153A1 (zh) 種數據容災處理的方法和裝置