US11238310B2 - Training data acquisition method and device, server and storage medium - Google Patents
Training data acquisition method and device, server and storage medium Download PDFInfo
- Publication number
- US11238310B2 US11238310B2 US17/164,112 US202117164112A US11238310B2 US 11238310 B2 US11238310 B2 US 11238310B2 US 202117164112 A US202117164112 A US 202117164112A US 11238310 B2 US11238310 B2 US 11238310B2
- Authority
- US
- United States
- Prior art keywords
- classification
- image
- target
- determining
- images
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G06K9/6256—
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
- G06F16/58—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
- G06F16/58—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/583—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/217—Validation; Performance evaluation; Active pattern learning techniques
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
-
- G06K9/6262—
-
- G06K9/6267—
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/774—Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20081—Training; Learning
Definitions
- an image related to an untrusted click is filtered, in order to filter out an image if a click on the image by the user is determined to be untrusted.
- the image is filtered by analyzing a click rate, and specifically by distinguishing a trusted click from an untrusted click based on a total number of clicks and/or a click ratio between different objects clicked. See the following documents:
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Life Sciences & Earth Sciences (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Library & Information Science (AREA)
- Computing Systems (AREA)
- Multimedia (AREA)
- Software Systems (AREA)
- Medical Informatics (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
classifying the at least one image into at least one classification based on a visual entity, and determining a click on an image as the untrusted click, in a case that the number of images in the visual-entity-based classification of the clicked image does not satisfy a predetermined condition, or in a case that a difference and/or ratio between the number of images in the visual-entity-based classification of the clicked image and that in a visual-entity-based classification with the largest number of images does not satisfy a predetermined condition.
-
- a first filtering subunit configured to filter out the image related to the untrusted click from the at least one image, and determine the target-classification pair according to a filtering result; and a second determining subunit configured to evaluate accuracy of the target-classification pair or of the target-classification group, and determine the training data according to the accuracy.
Claims (15)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US17/164,112 US11238310B2 (en) | 2017-09-29 | 2021-02-01 | Training data acquisition method and device, server and storage medium |
Applications Claiming Priority (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201710912302.XA CN107679183B (en) | 2017-09-29 | 2017-09-29 | Training data acquisition method and device for classifier, server and storage medium |
| CN201710912302.X | 2017-09-29 | ||
| US16/050,288 US10936906B2 (en) | 2017-09-29 | 2018-07-31 | Training data acquisition method and device, server and storage medium |
| US17/164,112 US11238310B2 (en) | 2017-09-29 | 2021-02-01 | Training data acquisition method and device, server and storage medium |
Related Parent Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US16/050,288 Continuation US10936906B2 (en) | 2017-09-29 | 2018-07-31 | Training data acquisition method and device, server and storage medium |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| US20210182611A1 US20210182611A1 (en) | 2021-06-17 |
| US11238310B2 true US11238310B2 (en) | 2022-02-01 |
Family
ID=61138634
Family Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US16/050,288 Active 2038-10-09 US10936906B2 (en) | 2017-09-29 | 2018-07-31 | Training data acquisition method and device, server and storage medium |
| US17/164,112 Active US11238310B2 (en) | 2017-09-29 | 2021-02-01 | Training data acquisition method and device, server and storage medium |
Family Applications Before (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US16/050,288 Active 2038-10-09 US10936906B2 (en) | 2017-09-29 | 2018-07-31 | Training data acquisition method and device, server and storage medium |
Country Status (2)
| Country | Link |
|---|---|
| US (2) | US10936906B2 (en) |
| CN (1) | CN107679183B (en) |
Families Citing this family (17)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN109033169B (en) * | 2018-06-21 | 2021-08-10 | 东南大学 | Mobile traffic classification method based on multistage weight conversion and convolutional neural network |
| CN109241135B (en) * | 2018-08-23 | 2021-03-05 | 吾达软件(武汉)股份有限公司 | Mining system for intelligently extracting data |
| CN111259697A (en) * | 2018-11-30 | 2020-06-09 | 百度在线网络技术(北京)有限公司 | Method and apparatus for transmitting information |
| CN110166560B (en) * | 2019-05-24 | 2021-08-20 | 北京百度网讯科技有限公司 | A service configuration method, apparatus, device and storage medium |
| CN110147851B (en) * | 2019-05-29 | 2022-04-01 | 北京达佳互联信息技术有限公司 | Image screening method and device, computer equipment and storage medium |
| CN110175256B (en) * | 2019-05-30 | 2024-06-07 | 上海联影医疗科技股份有限公司 | Image data retrieval method, device, equipment and storage medium |
| JP7601549B2 (en) * | 2019-08-29 | 2024-12-17 | 日本光電工業株式会社 | Subject discrimination device, subject discrimination method, computer program, and non-transitory computer-readable medium |
| CN110704711A (en) * | 2019-09-11 | 2020-01-17 | 中国海洋大学 | Object automatic recognition system for lifelong learning |
| US11487963B2 (en) | 2019-09-16 | 2022-11-01 | International Business Machines Corporation | Automatically determining whether an activation cluster contains poisonous data |
| US11645515B2 (en) * | 2019-09-16 | 2023-05-09 | International Business Machines Corporation | Automatically determining poisonous attacks on neural networks |
| US11538236B2 (en) | 2019-09-16 | 2022-12-27 | International Business Machines Corporation | Detecting backdoor attacks using exclusionary reclassification |
| CN111400534B (en) * | 2020-03-05 | 2023-09-19 | 杭州海康威视系统技术有限公司 | Cover determination method and device for image data and computer storage medium |
| CN111626874B (en) * | 2020-05-25 | 2023-04-25 | 泰康保险集团股份有限公司 | Method, device, equipment and storage medium for processing claim data |
| CN112131743B (en) * | 2020-09-23 | 2024-03-29 | 中广核工程有限公司 | Three-dimensional training methods and systems for nuclear power plant equipment |
| CN112348110B (en) * | 2020-11-18 | 2022-10-04 | 北京市商汤科技开发有限公司 | Model training and image processing method and device, electronic equipment and storage medium |
| CN115457326A (en) * | 2022-09-15 | 2022-12-09 | 同盾科技有限公司 | Cartoon image classification method, device, electronic equipment and storage medium |
| CN117809192B (en) * | 2024-03-01 | 2024-04-26 | 南京信息工程大学 | A thunderstorm identification method based on DENCLUE clustering algorithm |
Citations (12)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101116072A (en) | 2005-02-03 | 2008-01-30 | 英国电讯有限公司 | Method and system for sorting and presenting search results |
| CN103279504A (en) | 2013-05-10 | 2013-09-04 | 百度在线网络技术(北京)有限公司 | Searching method and device based on ambiguity resolution |
| CN103412881A (en) | 2013-07-17 | 2013-11-27 | 北京奇虎科技有限公司 | Method and system for providing search result |
| CN103810241A (en) | 2013-11-22 | 2014-05-21 | 北京奇虎科技有限公司 | Filtering method and device for low-frequency clicks |
| US8923655B1 (en) * | 2011-10-14 | 2014-12-30 | Google Inc. | Using senses of a query to rank images associated with the query |
| US8995716B1 (en) * | 2012-07-12 | 2015-03-31 | Google Inc. | Image search results by seasonal time period |
| CN105612514A (en) | 2013-08-05 | 2016-05-25 | 脸谱公司 | Systems and methods for image classification by correlating contextual cues with images |
| US20160224593A1 (en) * | 2013-10-11 | 2016-08-04 | Huawei Technologies Co., Ltd. | Image re-ranking method and apparatus |
| CN106021364A (en) | 2016-05-10 | 2016-10-12 | 百度在线网络技术(北京)有限公司 | Method and device for establishing picture search correlation prediction model, and picture search method and device |
| US20160342859A1 (en) * | 2015-05-18 | 2016-11-24 | Facebook, Inc. | Logo detection |
| CN106339756A (en) | 2016-08-25 | 2017-01-18 | 北京百度网讯科技有限公司 | Training data generation method and device and searching method and device |
| US20170329804A1 (en) * | 2016-05-10 | 2017-11-16 | Libo Fu | Method And Apparatus Of Generating Image Characteristic Representation Of Query, And Image Search Method And Apparatus |
-
2017
- 2017-09-29 CN CN201710912302.XA patent/CN107679183B/en active Active
-
2018
- 2018-07-31 US US16/050,288 patent/US10936906B2/en active Active
-
2021
- 2021-02-01 US US17/164,112 patent/US11238310B2/en active Active
Patent Citations (12)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101116072A (en) | 2005-02-03 | 2008-01-30 | 英国电讯有限公司 | Method and system for sorting and presenting search results |
| US8923655B1 (en) * | 2011-10-14 | 2014-12-30 | Google Inc. | Using senses of a query to rank images associated with the query |
| US8995716B1 (en) * | 2012-07-12 | 2015-03-31 | Google Inc. | Image search results by seasonal time period |
| CN103279504A (en) | 2013-05-10 | 2013-09-04 | 百度在线网络技术(北京)有限公司 | Searching method and device based on ambiguity resolution |
| CN103412881A (en) | 2013-07-17 | 2013-11-27 | 北京奇虎科技有限公司 | Method and system for providing search result |
| CN105612514A (en) | 2013-08-05 | 2016-05-25 | 脸谱公司 | Systems and methods for image classification by correlating contextual cues with images |
| US20160224593A1 (en) * | 2013-10-11 | 2016-08-04 | Huawei Technologies Co., Ltd. | Image re-ranking method and apparatus |
| CN103810241A (en) | 2013-11-22 | 2014-05-21 | 北京奇虎科技有限公司 | Filtering method and device for low-frequency clicks |
| US20160342859A1 (en) * | 2015-05-18 | 2016-11-24 | Facebook, Inc. | Logo detection |
| CN106021364A (en) | 2016-05-10 | 2016-10-12 | 百度在线网络技术(北京)有限公司 | Method and device for establishing picture search correlation prediction model, and picture search method and device |
| US20170329804A1 (en) * | 2016-05-10 | 2017-11-16 | Libo Fu | Method And Apparatus Of Generating Image Characteristic Representation Of Query, And Image Search Method And Apparatus |
| CN106339756A (en) | 2016-08-25 | 2017-01-18 | 北京百度网讯科技有限公司 | Training data generation method and device and searching method and device |
Non-Patent Citations (4)
| Title |
|---|
| First Office Action dated Sep. 23, 2019 for Chinese Application No. 201710912302.X. |
| Search Report dated Jun. 10, 2020 issued in connection with corresponding Chinese Application No. 201710912302.X. |
| Search Report dated Sep. 11, 2019 for Chinese Application No. 201710912302.X. |
| Second Office Action dated Jun. 17, 2020 issued in connection with corresponding Chinese Application No. 201710912302.X. |
Also Published As
| Publication number | Publication date |
|---|---|
| CN107679183B (en) | 2020-11-06 |
| US20190102655A1 (en) | 2019-04-04 |
| CN107679183A (en) | 2018-02-09 |
| US10936906B2 (en) | 2021-03-02 |
| US20210182611A1 (en) | 2021-06-17 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US11238310B2 (en) | Training data acquisition method and device, server and storage medium | |
| CN107491432B (en) | Low-quality article identification method and device based on artificial intelligence, equipment and medium | |
| WO2022134794A1 (en) | Method and apparatus for processing public opinions about news event, storage medium, and computer device | |
| CN105279495B (en) | A video description method based on deep learning and text summarization | |
| CN110717534B (en) | A target classification and localization method based on network supervision | |
| US9767386B2 (en) | Training a classifier algorithm used for automatically generating tags to be applied to images | |
| Xu et al. | Remote sensing image scene classification based on generative adversarial networks | |
| CN114372532B (en) | Method, device, equipment, medium and product for determining label labeling quality | |
| US10685236B2 (en) | Multi-model techniques to generate video metadata | |
| CN111460153A (en) | Hot topic extraction method and device, terminal device and storage medium | |
| CN109684476B (en) | Text classification method, text classification device and terminal equipment | |
| CN106372132A (en) | Artificial intelligence-based query intention prediction method and apparatus | |
| CN113569888B (en) | Image annotation method, device, equipment and medium | |
| JP2018509664A (en) | Model generation method, word weighting method, apparatus, device, and computer storage medium | |
| CN104142995A (en) | Social Event Recognition Method Based on Visual Attributes | |
| CN110516259B (en) | Method and device for identifying technical keywords, computer equipment and storage medium | |
| CN114416998B (en) | Text label identification method and device, electronic equipment and storage medium | |
| CN112069335A (en) | Image classification method and device, electronic equipment and storage medium | |
| Berg et al. | Do you see what I see? Measuring the semantic differences in image‐recognition services' outputs | |
| CN114780712B (en) | News thematic generation method and device based on quality evaluation | |
| CN110019763B (en) | Text filtering method, system, equipment and computer readable storage medium | |
| CN117009595A (en) | Text paragraph acquisition method and device, storage medium, and program product | |
| CN118733704B (en) | Public opinion data analysis method, device, electronic device and readable storage medium | |
| CN109840468A (en) | A kind of generation method and equipment of customer analysis report | |
| CN115879002B (en) | Training sample generation method, model training method and device |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| FEPP | Fee payment procedure |
Free format text: ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: BIG.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
| AS | Assignment |
Owner name: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD., CHINA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:LI, SU;FU, LIBO;SIGNING DATES FROM 20180507 TO 20180620;REEL/FRAME:055396/0227 |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: NON FINAL ACTION MAILED |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: NOTICE OF ALLOWANCE MAILED -- APPLICATION RECEIVED IN OFFICE OF PUBLICATIONS |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: AWAITING TC RESP., ISSUE FEE NOT PAID |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: NOTICE OF ALLOWANCE MAILED -- APPLICATION RECEIVED IN OFFICE OF PUBLICATIONS |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT RECEIVED |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT VERIFIED |
|
| STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
| MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1551); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Year of fee payment: 4 |