GB2558721B - Using machine learning to detect a constituent image within a composite image - Google Patents

Using machine learning to detect a constituent image within a composite image

Info

Publication number
GB2558721B
Authority
GB
United Kingdom
Prior art keywords
image
detect
machine learning
constituent
composite image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
GB1717849.2A
Other versions
GB2558721A (en)
GB201717849D0 (en)
Inventor
Pavetic Filip
Hong Thomas Leung King
Tochilkin Dmitrii
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Google LLC
Original Assignee
Google LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2017-01-13
Filing date
2017-10-30
Publication date
2020-04-29
Application filed by Google LLC
Publication of GB201717849D0
Publication of GB2558721A
Application granted
Publication of GB2558721B
Legal status: Active (current)
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T11/00 2D [Two Dimensional] image generation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00 Machine learning
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/21 Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214 Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/24 Classification techniques
    • G06F18/241 Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/20 Image preprocessing
    • G06V10/25 Determination of region of interest [ROI] or a volume of interest [VOI]
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/764 Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/40 Scenes; Scene-specific elements in video content
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/40 Scenes; Scene-specific elements in video content
    • G06V20/41 Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00 Details of television systems
    • H04N5/222 Studio circuitry; Studio devices; Studio equipment
    • H04N5/262 Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects; Cameras specially adapted for the electronic generation of special effects
    • H04N5/272 Means for inserting a foreground image in a background image, i.e. inlay, outlay
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/20 Special algorithmic details
    • G06T2207/20021 Dividing image into blocks, subimages or windows
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/20 Special algorithmic details
    • G06T2207/20076 Probabilistic image processing
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/20 Special algorithmic details
    • G06T2207/20081 Training; Learning
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/20 Special algorithmic details
    • G06T2207/20084 Artificial neural networks [ANN]

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Multimedia (AREA)
  • General Engineering & Computer Science (AREA)
  • Computing Systems (AREA)
  • Medical Informatics (AREA)
  • Mathematical Physics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Databases & Information Systems (AREA)
  • Signal Processing (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Image Analysis (AREA)
GB1717849.2A 2017-01-13 2017-10-30 Using machine learning to detect a constituent image within a composite image Active GB2558721B (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201762446057P 2017-01-13 2017-01-13
US15/444,054 US10586111B2 (en) 2017-01-13 2017-02-27 Using machine learning to detect which part of the screen includes embedded frames of an uploaded video

Publications (3)

Publication Number Publication Date
GB201717849D0 (en) 2017-12-13
GB2558721A (en) 2018-07-18
GB2558721B (en) 2020-04-29

Family

ID=60327376

Family Applications (1)

Application Number Title Priority Date Filing Date
GB1717849.2A Active GB2558721B (en) 2017-01-13 2017-10-30 Using machine learning to detect a constituent image within a composite image

Country Status (5)

Country Link
US (4) US10586111B2 (en)
CN (1) CN108305299A (en)
DE (1) DE102017125463A1 (en)
GB (1) GB2558721B (en)
WO (1) WO2018132145A1 (en)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10586111B2 (en) 2017-01-13 2020-03-10 Google Llc Using machine learning to detect which part of the screen includes embedded frames of an uploaded video
US10474926B1 (en) * 2017-11-16 2019-11-12 Amazon Technologies, Inc. Generating artificial intelligence image processing services
CN109492522B (en) * 2018-09-17 2022-04-01 中国科学院自动化研究所 Specific object detection model training program, apparatus, and computer-readable storage medium
WO2020077117A1 (en) 2018-10-11 2020-04-16 Tesla, Inc. Systems and methods for training machine models with augmented data
US10799183B2 (en) * 2018-11-07 2020-10-13 General Electric Company Methods and systems for whole body imaging
US11042611B2 (en) * 2018-12-10 2021-06-22 XNOR.ai, Inc. Digital watermarking of machine-learning models
DE102018009990A1 (en) 2018-12-24 2020-06-25 Mario Tykve Process for object-related storage and reproduction of digital images
US11893824B2 (en) 2019-02-14 2024-02-06 Nec Corporation Image processing device, fingerprint collation system, image processing method, and recording medium
CN110366002B (en) * 2019-06-14 2022-03-11 北京字节跳动网络技术有限公司 Video file synthesis method, system, medium and electronic device
US11120311B2 (en) * 2019-10-18 2021-09-14 Midea Group Co., Ltd. Adjusting machine settings through multi-pass training of object detection models
US12002257B2 (en) * 2021-11-29 2024-06-04 Google Llc Video screening using a machine learning video screening model trained using self-supervised training

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100067865A1 (en) * 2008-07-11 2010-03-18 Ashutosh Saxena Systems, Methods and Devices for Augmenting Video Content

Family Cites Families (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US1020427A (en) 1911-07-08 1912-03-19 Maurice E Kellogg Store-fixture.
US7697024B2 (en) * 2005-11-03 2010-04-13 Broadcom Corp. Method and system of tracking and stabilizing an image transmitted using video telephony
US8009921B2 (en) * 2008-02-19 2011-08-30 Xerox Corporation Context dependent intelligent thumbnail images
JP5633734B2 (en) * 2009-11-11 2014-12-03 ソニー株式会社 Information processing apparatus, information processing method, and program
US8874090B2 (en) * 2010-04-07 2014-10-28 Apple Inc. Remote control operations in a video conference
US9711182B2 (en) * 2011-06-07 2017-07-18 In Situ Media Corporation System and method for identifying and altering images in a digital video
US8761498B1 (en) * 2012-01-26 2014-06-24 Google Inc. Face and license plate detection in street level images with 3-D road width features estimated from laser data
US9148699B2 (en) * 2012-06-01 2015-09-29 Texas Instruments Incorporated Optimized algorithm for construction of composite video from a set of discrete video sources
US9465813B1 (en) * 2012-11-09 2016-10-11 Amazon Technologies, Inc. System and method for automatically generating albums
KR102000536B1 (en) * 2012-12-28 2019-07-16 삼성전자주식회사 Photographing device for making a composion image and method thereof
KR102090624B1 (en) * 2013-02-26 2020-03-18 삼성전자 주식회사 Apparatus and method for processing a image in device
US9973722B2 (en) * 2013-08-27 2018-05-15 Qualcomm Incorporated Systems, devices and methods for displaying pictures in a picture
CN103679142B (en) * 2013-12-02 2016-09-07 宁波大学 A kind of recognition method for target human body based on space constraint
US10521671B2 (en) * 2014-02-28 2019-12-31 Second Spectrum, Inc. Methods and systems of spatiotemporal pattern recognition for video content development
US10230866B1 (en) * 2015-09-30 2019-03-12 Amazon Technologies, Inc. Video ingestion and clip creation
CN105678338B (en) * 2016-01-13 2020-04-14 华南农业大学 Target tracking method based on local feature learning
US10157332B1 (en) * 2016-06-06 2018-12-18 A9.Com, Inc. Neural network-based image manipulation
US10204274B2 (en) * 2016-06-29 2019-02-12 Cellular South, Inc. Video to data
WO2018033156A1 (en) * 2016-08-19 2018-02-22 北京市商汤科技开发有限公司 Video image processing method, device, and electronic apparatus
US10303743B2 (en) * 2016-10-28 2019-05-28 Facebook, Inc. Automatic placement of electronic media content items within an online document
US10346723B2 (en) * 2016-11-01 2019-07-09 Snap Inc. Neural network for object detection in images
WO2018084577A1 (en) * 2016-11-03 2018-05-11 Samsung Electronics Co., Ltd. Data recognition model construction apparatus and method for constructing data recognition model thereof, and data recognition apparatus and method for recognizing data thereof
US10586111B2 (en) * 2017-01-13 2020-03-10 Google Llc Using machine learning to detect which part of the screen includes embedded frames of an uploaded video
US10867416B2 (en) * 2017-03-10 2020-12-15 Adobe Inc. Harmonizing composite images using deep learning
CN107330439B (en) * 2017-07-14 2022-11-04 腾讯科技(深圳)有限公司 Method for determining posture of object in image, client and server
US10984572B1 (en) * 2020-08-06 2021-04-20 Triple Lift, Inc. System and method for integrating realistic effects onto digital composites of digital visual media

Also Published As

Publication number Publication date
US20210374418A1 (en) 2021-12-02
US20240104435A1 (en) 2024-03-28
GB2558721A (en) 2018-07-18
CN108305299A (en) 2018-07-20
DE102017125463A1 (en) 2018-07-19
US11829854B2 (en) 2023-11-28
US10586111B2 (en) 2020-03-10
US20180204065A1 (en) 2018-07-19
WO2018132145A1 (en) 2018-07-19
US20200210709A1 (en) 2020-07-02
GB201717849D0 (en) 2017-12-13
US11093751B2 (en) 2021-08-17

Similar Documents

Publication Publication Date Title
GB2558721B (en) Using machine learning to detect a constituent image within a composite image
EP3270771A4 (en) Wearable apparatus with a stretch sensor
HUE063327T2 (en) Method to produce a thermoplastic wear resistant foil
GB201505278D0 (en) A composite material
GB201706512D0 (en) A composite component
GB2533494B (en) A composite ionically-conductive material
HK1216788A1 (en) Enhanced search results associated with a modular search object framework
EP3145383A4 (en) 3d laparoscopic image capture apparatus with a single image sensor
GB201621706D0 (en) A composite component
TWM532958U (en) Composite printing screen
EP3112154A4 (en) Screen printing machine
GB2523583C (en) Forming a composite component
EP3132382A4 (en) Image acquisition using a level-indication icon
GB201608707D0 (en) A bicycle folding system
GB201613026D0 (en) A composite component
SG11201705166WA (en) Thermoplastic vulcanizate including a block composite
SG11201704518UA (en) A bathing apparatus with recycling system
AU356322S (en) A bicycle frame
GB2554948B (en) Video monitoring using machine learning
GB2548113B (en) A composite component
GB2522489B (en) A collapsible airer for clothes
GB2527959B (en) A composite structural element
GB2546242B (en) A tricycle
ZA201604478B (en) A composite structure
GB2541908B (en) A folding cycle