WO2014108004A1 - 一种微博用户身份识别方法及系统 - Google Patents
一种微博用户身份识别方法及系统 Download PDFInfo
- Publication number
- WO2014108004A1 WO2014108004A1 PCT/CN2013/088616 CN2013088616W WO2014108004A1 WO 2014108004 A1 WO2014108004 A1 WO 2014108004A1 CN 2013088616 W CN2013088616 W CN 2013088616W WO 2014108004 A1 WO2014108004 A1 WO 2014108004A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- user
- feature
- user behavior
- information
- identified
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2457—Query processing with adaptation to user needs
- G06F16/24578—Query processing with adaptation to user needs using ranking
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F21/00—Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
- G06F21/30—Authentication, i.e. establishing the identity or authorisation of security principals
- G06F21/31—User authentication
- G06F21/316—User authentication by observing the pattern of computer usage, e.g. typical user behaviour
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/28—Databases characterised by their database models, e.g. relational or object models
- G06F16/284—Relational databases
- G06F16/285—Clustering or classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/958—Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F21/00—Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
- G06F21/30—Authentication, i.e. establishing the identity or authorisation of security principals
- G06F21/31—User authentication
Definitions
- the present invention relates to the field of computer information processing technologies, and in particular, to a microblog user identification method and system.
- the identification process mainly involves the user information registered and stored in the network by Weibo users. For example, the log, temporary information and registration information of the website that the user visits the website are obtained from the website to realize the user identification; or the Chinese text classification method is used to identify the microblog user.
- the process of identifying the user, the temporary information, and the registration information of the website to be identified by the user to be authenticated is implemented by the website, and the data of the user identification process is based on the user registration information and the user registration information.
- the user's log and temporary information make data acquisition more difficult and less accurate.
- the object of the present invention is to provide a microblog user identification method and system with high accuracy and real-time performance.
- the invention provides a microblog user identification method, which comprises:
- the identity of the user to be identified is determined.
- the invention also provides a microblog user body system, including:
- An information obtaining unit configured to obtain feature database information of only user behavior data and user behavior
- a preprocessing unit configured to preprocess the acquired user behavior data to be identified
- a semantic unit reconstruction unit configured to perform semantic unit reconstruction on the pre-processed user behavior data
- An attribute and weight information obtaining unit configured to acquire attribute information of the semantic unit and a corresponding weight thereof;
- a behavior feature extraction unit configured to acquire the user behavior feature to be identified according to attribute information of the semantic unit and a corresponding weight thereof;
- a comparing unit configured to compare each of the feature types in the feature library information of the user behavior feature to be identified
- the identity determining unit is configured to determine the identity of the user to be identified when the similarity between the feature type of the user behavior feature to be identified and the feature database of the user behavior exceeds a preset threshold.
- the microblog user identification method and system provided by the present invention obtains only the user behavior data and the feature database information of the user behavior; preprocesses the acquired user behavior data to be recognized; and the preprocessed user behavior data And performing the semantic unit reconstruction; acquiring the attribute information of the semantic unit and the corresponding weight; acquiring the to-be-identified user behavior feature according to the attribute information of the semantic unit and the corresponding weight; The behavior feature is compared with each feature type in the feature database information of the user behavior; when the user behavior feature to be identified is similar to a feature type in the feature database information of the user behavior > 3 ⁇ 4 exceeds a preset threshold, then The identity of the user to be identified is determined.
- the microblog user identification method and system provided by the invention can effectively improve the accuracy and real-time performance of the microblog user identification.
- FIG. 1 is a flowchart of a microblog user identity identification method according to an embodiment of the present invention
- FIG. 2 is a flowchart of constructing a feature database of user behavior in a microblog user identity recognition method according to the present invention
- 3 is a 3 ⁇ 4 ⁇ 2 diagram of a feature library for updating user behavior in a microblog user identification method provided by the present invention
- FIG. 4 is a schematic structural diagram of a microblog user identity recognition system according to an embodiment of the present invention
- FIG. 5 is a schematic structural diagram of another microblog user identity recognition system according to an embodiment of the present invention
- FIG. 1 is a schematic diagram of a microblog user identification method according to an embodiment of the present invention, where the method includes:
- Step 101 Obtain feature database information of only user behavior data and user behavior;
- Step 102 Pre-process the acquired user behavior data to be identified; the pre-processing mainly includes behavior data screening, spelling correction, word segmentation, and part-of-speech tagging.
- Step 103 Perform semantic unit reconstruction on the pre-processed user behavior data.
- the semantic unit reconstruction is a method for applying word-of-speech information to perform word adhesion on the basis of preprocessing, and constructing by combining specific words.
- a semantic unit (word string) that contains richer semantics.
- Step 104 Obtain attribute information of the semantic unit and its corresponding weight; wherein, the attribute information of the semantic unit refers to counting a word frequency and a document frequency of each semantic unit; and the weight of the semantic unit is a TFDID function. Realize the weight calculation of user behavior characteristics and realize the numericalization of user behavior characteristics.
- Step 105 Acquire the user behavior feature to be identified according to the attribute information of the semantic unit and its corresponding weight; the user behavior feature to be identified refers to the feature that is extracted to represent the user behavior, and the feature item ( That is, the semantic unit) has a good degree of discrimination, for a single user to be identified
- sort the keywords according to the word weight and word frequency filter out stop words or non-stop words according to the stop word table (satisfying the word length is greater than the maximum length or less than the minimum length) ; select the part of speech as "a,,,” cw,,, "v,,, “j,,,”ns,,,”nr,,,”nf,,”nz, or a word containing "no,” .
- Step 106 Comparing the user behavior feature to be identified with each feature type in the feature database information of the user behavior; the comparing process includes performing user classification, mainly adopting a KNN algorithm, and the K value selection method adopts a probability distribution.
- the method that is, the ratio of similar feature vectors to feature vector spaces.
- the classification idea is: Compare the similarity sim(u, C) of each user category in the information of the user to be identified and the user behavior characteristic database, compare the similarity sim(u, Cui) of the user and each category, if sim If (u, C) is greater than the experience threshold, or if most sim(u, Cui) is greater than the experience threshold, then the user is considered to have a correlation with the category, and the user category with the highest similarity is selected to determine the user identity.
- the similarity between the feature vectors is calculated by measuring the cosine similarity.
- the specific steps are as follows:
- Step 107 When the similarity of a feature type of the feature behavior information to be identified and the feature database information of the user behavior exceeds a preset threshold, the identity of the user to be identified is determined.
- the method may further include constructing The process of characterizing the user's behavior.
- FIG. 2 is a flowchart of a feature library for constructing a user behavior in a microblog user identification method according to an embodiment of the present invention, where the method includes:
- Step 201 Acquire known user behavior data; specifically, obtain known user behavior data, that is, training data; the training data is used to construct a feature library of user behavior.
- each word contains word string information and part of speech after processing, and the tools for word segmentation and part-of-speech tagging are all from known technologies, and will not be described here.
- Step 203 Perform semantic unit reconstruction on the pre-processed user behavior data.
- the semantic unit is reconstructed as: because the long word string contains more semantic information than the short word string, and has stronger expression capability, Therefore, the semantic unit reconstruction is based on the processing result in step 201, and the words are glued to adjacent specific words by a specific rule, thereby generating a longer semantic string.
- the adjacent words to be processed in this step include "ns,, place name, "nr,” person name, "nf, institution name, “nz,” proper noun and "j, abbreviation, etc., and the rule of processing is the combination for the first time. All words between the type of word and the last occurrence of the type of word appear.
- the word string of the tagged post is "cw", which is more important in feature selection and weight calculation.
- Step 204 Obtain attribute information of the semantic unit and its corresponding weight
- the obtaining the attribute information of the semantic unit in steps 201 and 202, uniformly numbering the semantic unit, establishing a microblog-semantic unit index vector, and attribute information of the semantic unit according to the user, including word frequency sum.
- Document frequency preparing for individual user behavior feature extraction, performing word frequency and document frequency statistics according to the same identity user, preparing for class behavior feature extraction of the same identity category, and processing the result information into the data structure as shown in FIG. 6.
- the weights of the semantic units taken are:
- the stop words are filtered according to the stop word list commonly used in the field of natural language processing, and the semantic units whose word frequency is less than the empirical threshold and whose part of speech is not including "n” or "cw" are filtered out.
- the TF-IDF weight calculation method calculates the weight of each semantic unit, and assigns a higher weight to a specific type of semantic unit.
- the specific method is that the part of the word is "nr", as shown in the following formula (2),
- the confound word as shown in the following formula (3),
- Step 205 Acquire the known according to the attribute information of the semantic unit and its corresponding weight. User behavior characteristics; get i1 ⁇ 2 as:
- the training data of the obtained known user identity mainly adopts a method of combining chi-square statistics, part of speech and word frequency; firstly calculating the chi-square value corresponding to the user category of each semantic unit, and sorting the semantic units according to the chi-square value Filter out words whose length is equal to 1, and whose part of speech is non-nr; filter out stop words or non-stop words according to the stop word table (satisfying the word length is greater than the maximum length or less than the minimum length); select the part of speech as "a,, , "CW,,,, "V,,,"j,,, “118,,,,"111",,,"1 ⁇ ,,,"112, or a word containing "No,”; none of the above information When distinguishing, choose a semantic unit with a large word frequency.
- Step 206 Store the acquired known user behavior characteristics in a feature database of the user behavior according to a category.
- FIG. 3 is a flowchart of a feature database for updating a user behavior in a microblog user identification method according to an embodiment of the present invention, where the process includes:
- Step 301 Acquire at least one semantic unit of the user to be identified that determines the identity of the user, and user type information corresponding to the identity of the user;
- Step 302 Compare user type information of the semantic unit and the user identity, and give similarity between the semantic unit and the user type information of the user identity. This step may use a chi-square statistical method to calculate a semantic unit. Correlation is evaluated by the obtained chi-square value with the chi-square value of the user category.
- Step 303 Sort the semantic units according to the order of the degree of appearance;
- Step 304 Obtain the top-n semantic units before the similarity as the behavior characteristics of the user of the type;
- Step 305 The user's behavioral characteristics are added to the corresponding categories of the feature library of the user behavior.
- the behavior feature includes at least one semantic unit; as shown in FIG. 6, the semantic unit attribute information includes at least: an index value, a character information, a part of speech, a word frequency, and a document frequency; The semantic unit includes at least one word; the attribute information of the word includes: an index of the word, a word frequency, a document frequency, an IDF value, and a weight.
- FIG. 4 is a schematic diagram of a microblog user identification system according to an embodiment of the present invention, the system includes:
- the information obtaining unit 401 is configured to obtain feature database information of only user behavior data and user behavior;
- the pre-processing unit 402 is configured to pre-process the user behavior data to be identified by the spring;
- a semantic unit reconstruction unit 403 configured to perform semantic unit reconstruction on the pre-processed user behavior data
- An attribute and weight information obtaining unit 404 configured to acquire attribute information of the semantic unit and a corresponding weight thereof;
- the behavior feature extraction unit 405 is configured to acquire the user behavior feature to be identified according to the attribute information of the semantic unit and the corresponding weight thereof;
- the comparing unit 406 is configured to compare the to-be-identified user behavior feature with each feature type in the feature library information of the user behavior;
- the identity determining unit 407 is configured to determine the identity of the user to be identified when the similarity between the feature type of the feature to be identified and the feature database of the user behavior exceeds a preset threshold.
- the system further includes: a feature library construction unit 501 and/or an information feedback unit 502 of user behavior.
- the feature library construction unit 501 of the user behavior is configured to acquire the known user behavior data; pre-process the acquired known user behavior data; and perform the semantic unit reconstruction by using the pre-processed known user behavior data; Obtaining the attribute information of the semantic unit and its corresponding weight; acquiring the known user behavior feature according to the attribute information of the semantic unit and its corresponding weight; and the obtained known user behavior characteristic, Stored in the feature library of the user behavior by category.
- the information feedback unit 502 is configured to acquire at least one semantic unit of the user to be identified that determines the identity of the user, and user type information corresponding to the identity of the user; compare user type information of the semantic unit with the user identity, And the similarity between the semantic unit and the user type information of the user identity; sorting the semantic units according to the order of appearance degree; obtaining the top-n semantic units before the similarity A behavioral characteristic of the user of the type; adding the behavioral characteristics of the user to a corresponding category of the feature library of the user behavior.
- the behavior feature described above includes at least one semantic unit; the semantic unit attribute information includes at least The index value, the character information, the part of speech, the word frequency and the document frequency; the semantic unit includes at least one word; the attribute information of the word includes: an index of the word, a word frequency, a document frequency, an IDF value, and a weight.
- the above pre-processing operations mainly include: behavior data screening, spelling correction, word segmentation and part-of-speech tagging.
- the microblog user identification method and system provided by the present invention obtains only the user behavior data and the feature database information of the user behavior; preprocesses the acquired user behavior data to be recognized; and the preprocessed user behavior data And performing the semantic unit reconstruction; acquiring the attribute information of the semantic unit and the corresponding weight; acquiring the to-be-identified user behavior feature according to the attribute information of the semantic unit and the corresponding weight; The behavior feature is compared with each feature type in the feature database information of the user behavior; when the user behavior feature to be identified is similar to a feature type in the feature database information of the user behavior > 3 ⁇ 4 exceeds a preset threshold, then The identity of the user to be identified is determined.
- the microblog user identification method and system provided by the invention can effectively improve the accuracy and real-time performance of the microblog user identification.
- a computer readable medium having computer executable instructions that, when executed by a computer, perform a microblog user identification method, the method comprising: obtaining user behavior data to be identified and a user Character library information of the behavior; pre-processing the acquired user behavior data to be identified; performing the semantic unit reconstruction on the pre-processed user behavior data; acquiring attribute information of the semantic unit and its corresponding weight; And comparing the attribute information of the semantic unit and the corresponding weight thereof to obtain the user behavior feature to be identified; comparing the user behavior feature to be identified with each feature type in the feature database information of the user behavior; The identity of the user to be identified is determined by determining that the similarity between the user behavior feature and one of the feature database information of the user behavior exceeds a preset threshold.
- a computer is also provided that includes one or more computer readable media with computer executable instructions that, when executed by a computer, perform the above described microblog user identification method.
- Exemplary operating environment includes one or more computer readable media with computer executable instructions that, when executed by a computer, perform the above described microblog user identification method.
- a computer or computing device such as described herein, has hardware, including one or more processors or processing units, system memory, and some form of computer-readable media.
- computer-readable media includes computer storage media and communication media.
- Computer storage includes any method for storing information such as computer readable instructions, data structures, program modules or other data or The volatility and non-volatility of technology implementations are both mobile and non-removable.
- Communication media typically embody computer readable instructions, data structures, program modules or other data in a modulated data signal, such as a carrier wave or other transmission mechanism, and includes any information delivery medium. Combinations of any of the above are also included within the scope of computer readable.
- the computer can be used to one or more remote computers, such as logical connections of remote computers operating in a networked environment.
- remote computers such as logical connections of remote computers operating in a networked environment.
- the computing system environment is not intended to suggest any limitation as to the scope of use or functionality of any aspect of the invention.
- the computer environment should not be interpreted as having any dependency or requirement relating to any one or combination of components illustrated in the exemplary operating environment.
- Examples of well-known computing systems, environments, and/or configurations suitable for use in aspects of the present invention include, but are not limited to: personal computers, server computers, handheld or laptop devices, multiprocessor systems, microprocessor based System, set-top box, programmable consumer electronics, mobile phone, network PC, small computer, mainframe computer, distributed computing environment including any of the above systems or devices, and the like.
- Computer executable instructions can be organized as software into one or more computer executable components or modules.
- program modules include, but are not limited to, routines, programs, objects, components, and data structures that perform particular tasks or implement particular abstract data types. Any number of such components or modules and their organization may be utilized to implement aspects of the present invention.
- aspects of the invention are not limited to the specific computer-executable instructions or specific components or modules illustrated in the figures and described herein.
- Other embodiments of the invention may be in packages or components. Aspects of the invention may also be implemented in a distributed computing environment where tasks are set up by remote processing linked through a communications network.
- program modules can be located in a memory storage, including memory storage.
- the methods and systems of the present invention may be implemented in a number of ways.
- the methods and systems of the present invention can be implemented in software, hardware, firmware, or any combination of software, hardware, or firmware.
- the above sequence of steps for the method is for illustrative purposes only, and the steps of the method of the present invention are not limited to the above The order of the description, unless otherwise specified.
- the invention may also be embodied as a program recorded in a recording medium, the program comprising machine readable instructions for implementing the method according to the invention.
- the present invention also stores a recording medium for executing a program according to the method of the present invention.
Abstract
Description
Claims
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US14/760,048 US20150356091A1 (en) | 2013-01-09 | 2013-12-05 | Method and system for identifying microblog user identity |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310008156.X | 2013-01-09 | ||
CN201310008156.XA CN103914494B (zh) | 2013-01-09 | 2013-01-09 | 一种微博用户身份识别方法及系统 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2014108004A1 true WO2014108004A1 (zh) | 2014-07-17 |
Family
ID=51040184
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2013/088616 WO2014108004A1 (zh) | 2013-01-09 | 2013-12-05 | 一种微博用户身份识别方法及系统 |
Country Status (3)
Country | Link |
---|---|
US (1) | US20150356091A1 (zh) |
CN (1) | CN103914494B (zh) |
WO (1) | WO2014108004A1 (zh) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106878275A (zh) * | 2017-01-03 | 2017-06-20 | 阿里巴巴集团控股有限公司 | 身份验证方法及装置和服务器 |
CN113297397A (zh) * | 2021-05-12 | 2021-08-24 | 山东大学 | 一种基于层次化多模态信息融合的信息匹配方法及系统 |
Families Citing this family (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105447038A (zh) * | 2014-08-29 | 2016-03-30 | 国际商业机器公司 | 用于获取用户特征的方法和系统 |
CN105591747B (zh) * | 2014-12-30 | 2019-11-22 | 中国银联股份有限公司 | 基于用户网络行为特征的辅助身份验证方法 |
CN105989149A (zh) * | 2015-03-02 | 2016-10-05 | 苏宁云商集团股份有限公司 | 一种用户设备指纹的提取和识别方法及系统 |
CN105989268A (zh) * | 2015-03-02 | 2016-10-05 | 苏宁云商集团股份有限公司 | 一种人机识别的安全访问方法和系统 |
CN104778388A (zh) * | 2015-05-04 | 2015-07-15 | 苏州大学 | 一种两个不同平台下同一用户识别方法及系统 |
CN107025567A (zh) * | 2016-02-01 | 2017-08-08 | 秒针信息技术有限公司 | 一种数据处理方法和装置 |
CN105808529B (zh) * | 2016-03-10 | 2018-06-08 | 语联网(武汉)信息技术有限公司 | 一种语料划分领域的方法和装置 |
CN106295701A (zh) * | 2016-08-11 | 2017-01-04 | 五八同城信息技术有限公司 | 用户识别方法及装置 |
CN106327555A (zh) * | 2016-08-24 | 2017-01-11 | 网易(杭州)网络有限公司 | 一种获得唇形动画的方法及装置 |
WO2018226948A1 (en) * | 2017-06-09 | 2018-12-13 | Humada Holdings Inc. | Providing user specific information for services |
CN110019722B (zh) * | 2017-12-21 | 2023-11-24 | 株式会社理光 | 对话模型的回复排序方法、装置及计算机可读存储介质 |
CN108573134A (zh) * | 2018-04-04 | 2018-09-25 | 阿里巴巴集团控股有限公司 | 一种识别身份的方法、装置及电子设备 |
CN111309774A (zh) * | 2018-12-11 | 2020-06-19 | 北京嘀嘀无限科技发展有限公司 | 数据处理方法、装置、电子设备及存储介质 |
CN110009056B (zh) * | 2019-04-15 | 2021-07-30 | 秒针信息技术有限公司 | 一种社交账号的分类方法及分类装置 |
CN110110084A (zh) * | 2019-04-23 | 2019-08-09 | 北京科技大学 | 高质量用户生成内容的识别方法 |
CN110245687B (zh) * | 2019-05-17 | 2021-06-04 | 腾讯科技(上海)有限公司 | 用户分类方法以及装置 |
CN112413832B (zh) * | 2019-08-23 | 2021-11-30 | 珠海格力电器股份有限公司 | 一种基于用户行为的用户身份识别方法及其电器设备 |
CN110795570B (zh) * | 2019-10-11 | 2022-06-17 | 上海上湖信息技术有限公司 | 一种用户时序行为特征提取方法及装置 |
CN110866114B (zh) * | 2019-10-16 | 2023-05-26 | 平安科技(深圳)有限公司 | 对象行为的识别方法、装置及终端设备 |
CN111368552B (zh) * | 2020-02-26 | 2023-09-26 | 北京市公安局 | 一种面向特定领域的网络用户群组划分方法及装置 |
CN111370086A (zh) * | 2020-02-27 | 2020-07-03 | 平安国际智慧城市科技股份有限公司 | 电子病例检测方法、装置、计算机设备和存储介质 |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101295381A (zh) * | 2008-06-25 | 2008-10-29 | 北京大学 | 一种垃圾邮件检测方法 |
CN102654859A (zh) * | 2011-03-01 | 2012-09-05 | 北京彩云在线技术开发有限公司 | 一种歌曲推荐方法及系统 |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7716225B1 (en) * | 2004-06-17 | 2010-05-11 | Google Inc. | Ranking documents based on user behavior and/or feature data |
CN101187920A (zh) * | 2006-11-17 | 2008-05-28 | 财团法人资讯工业策进会 | 行为特征评估系统与方法 |
US20080312985A1 (en) * | 2007-06-18 | 2008-12-18 | Microsoft Corporation | Computerized evaluation of user impressions of product artifacts |
CN102012900B (zh) * | 2009-09-04 | 2013-01-30 | 阿里巴巴集团控股有限公司 | 信息检索方法和系统 |
CN102355664A (zh) * | 2011-08-09 | 2012-02-15 | 郑毅 | 一种基于用户的社交网络对用户身份进行识别与匹配的方法 |
CN102289522B (zh) * | 2011-09-19 | 2014-08-13 | 北京金和软件股份有限公司 | 一种对于文本智能分类的方法 |
US9003025B2 (en) * | 2012-07-05 | 2015-04-07 | International Business Machines Corporation | User identification using multifaceted footprints |
-
2013
- 2013-01-09 CN CN201310008156.XA patent/CN103914494B/zh not_active Expired - Fee Related
- 2013-12-05 WO PCT/CN2013/088616 patent/WO2014108004A1/zh active Application Filing
- 2013-12-05 US US14/760,048 patent/US20150356091A1/en not_active Abandoned
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101295381A (zh) * | 2008-06-25 | 2008-10-29 | 北京大学 | 一种垃圾邮件检测方法 |
CN102654859A (zh) * | 2011-03-01 | 2012-09-05 | 北京彩云在线技术开发有限公司 | 一种歌曲推荐方法及系统 |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106878275A (zh) * | 2017-01-03 | 2017-06-20 | 阿里巴巴集团控股有限公司 | 身份验证方法及装置和服务器 |
CN106878275B (zh) * | 2017-01-03 | 2020-05-19 | 阿里巴巴集团控股有限公司 | 身份验证方法及装置和服务器 |
CN113297397A (zh) * | 2021-05-12 | 2021-08-24 | 山东大学 | 一种基于层次化多模态信息融合的信息匹配方法及系统 |
CN113297397B (zh) * | 2021-05-12 | 2022-08-09 | 山东大学 | 一种基于层次化多模态信息融合的信息匹配方法及系统 |
Also Published As
Publication number | Publication date |
---|---|
US20150356091A1 (en) | 2015-12-10 |
CN103914494A (zh) | 2014-07-09 |
CN103914494B (zh) | 2017-05-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2014108004A1 (zh) | 一种微博用户身份识别方法及系统 | |
US10764353B2 (en) | Automatic genre classification determination of web content to which the web content belongs together with a corresponding genre probability | |
US11544459B2 (en) | Method and apparatus for determining feature words and server | |
KR102345455B1 (ko) | 질의 응답 시스템에서 컴퓨터 발생형 자연 언어 출력 | |
US9898554B2 (en) | Implicit question query identification | |
US9594826B2 (en) | Co-selected image classification | |
US11727053B2 (en) | Entity recognition from an image | |
CN107204960B (zh) | 网页识别方法及装置、服务器 | |
US9672251B1 (en) | Extracting facts from documents | |
AU2017355420B2 (en) | Systems and methods for event detection and clustering | |
WO2020087774A1 (zh) | 基于概念树的意图识别方法、装置及计算机设备 | |
CN104838413A (zh) | 基于用户提交来调整内容递送 | |
US10565253B2 (en) | Model generation method, word weighting method, device, apparatus, and computer storage medium | |
WO2019218476A1 (zh) | 一种数据的导出方法及设备 | |
CN106844640A (zh) | 一种网页数据分析处理方法 | |
CN105574200A (zh) | 基于历史记录的用户兴趣提取方法 | |
US20210141822A1 (en) | Systems and methods for identifying latent themes in textual data | |
US20150104065A1 (en) | Apparatus and method for recognizing object in image | |
US20160034589A1 (en) | Method and system for search term whitelist expansion | |
KR102151858B1 (ko) | 링크드 데이터와 문자열 데이터를 이용한 온톨로지 인스턴스 확장 방법 및 시스템 | |
WO2021103594A1 (zh) | 一种默契度检测方法、设备、服务器及可读存储介质 | |
US9323721B1 (en) | Quotation identification | |
US10963743B2 (en) | Machine learning with small data sets | |
KR102315350B1 (ko) | 질의 처리 자동화 장치 및 방법 | |
CN117751368A (zh) | 隐私敏感神经网络训练 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 13871143 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 14760048 Country of ref document: US |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 13871143 Country of ref document: EP Kind code of ref document: A1 |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC ( EPO FORM 1205A DATED 26/01/2016 ) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 13871143 Country of ref document: EP Kind code of ref document: A1 |