JP2014232533A5 - System and method for OCR output verification - Google Patents

Publication number
JP2014232533A5
Authority
JP
Japan
Prior art keywords
representation
generating
string
embedding
character
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
JP2014103364A
Other languages
Japanese (ja)
Other versions
JP2014232533A (en)
Filing date
Publication date
Priority claimed from US13/903,218 (US9384423B2)
Application filed
Publication of JP2014232533A
Publication of JP2014232533A5
Current legal status: Withdrawn

Claims (5)

A method for computing a confidence, comprising:
with a text recognition system that performs character recognition on an input text image to generate a candidate character string,
generating a first representation based on the candidate character string;
generating a second representation based on the input text image; and
computing a confidence in the candidate character string based on a computed similarity between the first representation and the second representation in a common embedding space,
wherein generating the first representation comprises:
partitioning the candidate character string into a plurality of regions; and
extracting a representation of each of the plurality of regions,
wherein the first representation is obtained as a string representation by aggregating the respective representations of the plurality of regions,
wherein at least one of the first representation and the second representation is projected into the common embedding space, and
wherein at least one of performing the character recognition, generating the first representation, generating the second representation, and computing the confidence is performed by a computer processor.
The method of claim 1, wherein the computed similarity is computed as a dot product of the projected first representation and second representation.
The method of claim 1, wherein the first representation and the second representation comprise multidimensional feature vectors.
The method of claim 1, wherein generating the first representation comprises embedding the character string in a vector space, the embedding comprising:
extracting a set of features from the character string; and
generating the string representation based on the set of features extracted from the character string.
The method of claim 4, wherein the embedding comprises generating a spatial pyramid bag-of-characters.
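
The claimed steps can be illustrated with a minimal sketch. It is not the patented implementation: the alphabet, the three pyramid levels, the overlap weighting, the L2 normalisation, and the function names are all assumptions made for illustration. In a real system the projection matrices would be learned and the image representation would be extracted from the input text image; here identity matrices and a synthetic image vector stand in for both.

```python
import math
from typing import List, Sequence

# Assumed character set for illustration only.
ALPHABET = "ABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789"


def spatial_pyramid_boc(text: str, levels: int = 3,
                        alphabet: str = ALPHABET) -> List[float]:
    """Split the string into 1, 2, 4, ... regions, build a character
    histogram per region, and concatenate the histograms into one vector."""
    n = len(text)
    vector: List[float] = []
    for level in range(levels):
        regions = 2 ** level
        for r in range(regions):
            lo, hi = r / regions, (r + 1) / regions  # region span within [0, 1]
            hist = [0.0] * len(alphabet)
            for i, ch in enumerate(text):
                # Character i occupies the span [i/n, (i+1)/n].
                overlap = min(hi, (i + 1) / n) - max(lo, i / n)
                idx = alphabet.find(ch)
                if overlap > 0.0 and idx >= 0:
                    hist[idx] += overlap * n  # fraction of the character in the region
            vector.extend(hist)
    norm = math.sqrt(sum(v * v for v in vector))
    return [v / norm for v in vector] if norm > 0.0 else vector


def project(matrix: Sequence[Sequence[float]], vec: Sequence[float]) -> List[float]:
    """Linear projection of a representation into the common embedding space."""
    return [sum(w * x for w, x in zip(row, vec)) for row in matrix]


def identity(n: int) -> List[List[float]]:
    """Placeholder projection; a learned matrix would be used in practice."""
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]


def confidence(candidate: str, image_features: Sequence[float],
               text_proj: Sequence[Sequence[float]],
               image_proj: Sequence[Sequence[float]]) -> float:
    """Dot product of the projected string and image representations."""
    a = project(text_proj, spatial_pyramid_boc(candidate))
    b = project(image_proj, image_features)
    return sum(x * y for x, y in zip(a, b))


dim = 7 * len(ALPHABET)  # 1 + 2 + 4 regions, one histogram each
eye = identity(dim)
# Synthetic stand-in for the representation of an image showing "ABC123".
image_features = spatial_pyramid_boc("ABC123")
score_match = confidence("ABC123", image_features, eye, eye)
score_mismatch = confidence("ABC128", image_features, eye, eye)
print(score_match, score_mismatch)
```

With these placeholders, a candidate string that agrees with the image scores higher than one that differs in a single character, which is the behaviour a verification threshold on the confidence would rely on.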

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US13/903,218 US9384423B2 (en) 2013-05-28 2013-05-28 System and method for OCR output verification
US13/903,218 2013-05-28

Publications (2)

Publication Number Publication Date
JP2014232533A JP2014232533A (en) 2014-12-11
JP2014232533A5 JP2014232533A5 (en) 2017-06-29

Family

ID=50732041

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2014103364A Withdrawn JP2014232533A (en) 2013-05-28 2014-05-19 System and method for ocr output verification

Country Status (3)

Country Link
US (1) US9384423B2 (en)
EP (1) EP2808827B1 (en)
JP (1) JP2014232533A (en)

Families Citing this family (83)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2513431B (en) * 2013-04-25 2018-12-05 Testplant Europe Ltd Method for creating a label
US9934526B1 (en) * 2013-06-27 2018-04-03 A9.Com, Inc. Text recognition for search results
US20150006361A1 (en) 2013-06-28 2015-01-01 Google Inc. Extracting Card Data Using Three-Dimensional Models
US9245191B2 (en) * 2013-09-05 2016-01-26 Ebay, Inc. System and method for scene text recognition
US9792301B2 (en) * 2014-09-26 2017-10-17 Conduent Business Services, Llc Multi-query privacy-preserving parking management system and method
KR20150081838A (en) * 2014-01-07 2015-07-15 한국전자통신연구원 Apparatus and method for searching wanted vehicle
JP5664813B1 (en) * 2014-06-10 2015-02-04 富士ゼロックス株式会社 Design management apparatus and program
US9892337B1 (en) 2014-06-27 2018-02-13 Blinker, Inc. Method and apparatus for receiving a refinancing offer from an image
US10579892B1 (en) * 2014-06-27 2020-03-03 Blinker, Inc. Method and apparatus for recovering license plate information from an image
US10540564B2 (en) 2014-06-27 2020-01-21 Blinker, Inc. Method and apparatus for identifying vehicle information from an image
US9594971B1 (en) * 2014-06-27 2017-03-14 Blinker, Inc. Method and apparatus for receiving listings of similar vehicles from an image
US9563814B1 (en) * 2014-06-27 2017-02-07 Blinker, Inc. Method and apparatus for recovering a vehicle identification number from an image
US10733471B1 (en) 2014-06-27 2020-08-04 Blinker, Inc. Method and apparatus for receiving recall information from an image
US10572758B1 (en) 2014-06-27 2020-02-25 Blinker, Inc. Method and apparatus for receiving a financing offer from an image
US9600733B1 (en) * 2014-06-27 2017-03-21 Blinker, Inc. Method and apparatus for receiving car parts data from an image
US9607236B1 (en) * 2014-06-27 2017-03-28 Blinker, Inc. Method and apparatus for providing loan verification from an image
US9818154B1 (en) 2014-06-27 2017-11-14 Blinker, Inc. System and method for electronic processing of vehicle transactions based on image detection of vehicle license plate
US9589201B1 (en) * 2014-06-27 2017-03-07 Blinker, Inc. Method and apparatus for recovering a vehicle value from an image
US9754171B1 (en) 2014-06-27 2017-09-05 Blinker, Inc. Method and apparatus for receiving vehicle information from an image and posting the vehicle information to a website
US9589202B1 (en) * 2014-06-27 2017-03-07 Blinker, Inc. Method and apparatus for receiving an insurance quote from an image
US9779318B1 (en) 2014-06-27 2017-10-03 Blinker, Inc. Method and apparatus for verifying vehicle ownership from an image
US10515285B2 (en) 2014-06-27 2019-12-24 Blinker, Inc. Method and apparatus for blocking information from an image
US9760776B1 (en) 2014-06-27 2017-09-12 Blinker, Inc. Method and apparatus for obtaining a vehicle history report from an image
US10867327B1 (en) 2014-06-27 2020-12-15 Blinker, Inc. System and method for electronic processing of vehicle transactions based on image detection of vehicle license plate
US9773184B1 (en) 2014-06-27 2017-09-26 Blinker, Inc. Method and apparatus for receiving a broadcast radio service offer from an image
US9558419B1 (en) * 2014-06-27 2017-01-31 Blinker, Inc. Method and apparatus for receiving a location of a vehicle service center from an image
US9396404B2 (en) 2014-08-04 2016-07-19 Datalogic ADC, Inc. Robust industrial optical character recognition
US9430766B1 (en) 2014-12-09 2016-08-30 A9.Com, Inc. Gift card recognition using a camera
US9721186B2 (en) * 2015-03-05 2017-08-01 Nant Holdings Ip, Llc Global signatures for large-scale image recognition
US10769200B1 (en) * 2015-07-01 2020-09-08 A9.Com, Inc. Result re-ranking for object recognition
CN106326821B (en) * 2015-07-07 2019-08-30 北京易车互联信息技术有限公司 The method and device of License Plate
US9798948B2 (en) 2015-07-31 2017-10-24 Datalogic IP Tech, S.r.l. Optical character recognition localization tool
CN107924470A (en) * 2015-08-21 2018-04-17 3M创新有限公司 Increase is arranged on the diversity of the character on optical activity product
US10078889B2 (en) * 2015-08-25 2018-09-18 Shanghai United Imaging Healthcare Co., Ltd. System and method for image calibration
US11238362B2 (en) * 2016-01-15 2022-02-01 Adobe Inc. Modeling semantic concepts in an embedding space as distributions
US10019640B2 (en) * 2016-06-24 2018-07-10 Accenture Global Solutions Limited Intelligent automatic license plate recognition for electronic tolling environments
US10474923B2 (en) * 2016-06-27 2019-11-12 Facebook, Inc. Systems and methods for incremental character recognition to recognize characters in images
US10255516B1 (en) * 2016-08-29 2019-04-09 State Farm Mutual Automobile Insurance Company Systems and methods for using image analysis to automatically determine vehicle information
US10102453B1 (en) * 2017-08-03 2018-10-16 Gyrfalcon Technology Inc. Natural language processing via a two-dimensional symbol having multiple ideograms contained therein
US10366302B2 (en) 2016-10-10 2019-07-30 Gyrfalcon Technology Inc. Hierarchical category classification scheme using multiple sets of fully-connected networks with a CNN based integrated circuit as feature extractor
US10339445B2 (en) 2016-10-10 2019-07-02 Gyrfalcon Technology Inc. Implementation of ResNet in a CNN based digital integrated circuit
US10083171B1 (en) * 2017-08-03 2018-09-25 Gyrfalcon Technology Inc. Natural language processing using a CNN based integrated circuit
US10360470B2 (en) 2016-10-10 2019-07-23 Gyrfalcon Technology Inc. Implementation of MobileNet in a CNN based digital integrated circuit
US10366328B2 (en) 2017-09-19 2019-07-30 Gyrfalcon Technology Inc. Approximating fully-connected layers with multiple arrays of 3x3 convolutional filter kernels in a CNN based integrated circuit
KR101873576B1 (en) * 2016-10-31 2018-07-03 한국전자통신연구원 System and method for recognizing information from vehicle license plate
US10216766B2 (en) * 2017-03-20 2019-02-26 Adobe Inc. Large-scale image tagging using image-to-topic embedding
US10275646B2 (en) 2017-08-03 2019-04-30 Gyrfalcon Technology Inc. Motion recognition via a two-dimensional symbol having multiple ideograms contained therein
US10950124B2 (en) 2017-08-22 2021-03-16 Q-Free Netherlands B.V. License plate recognition
US10192148B1 (en) * 2017-08-22 2019-01-29 Gyrfalcon Technology Inc. Machine learning of written Latin-alphabet based languages via super-character
CN107679074B (en) * 2017-08-25 2021-05-04 百度在线网络技术(北京)有限公司 Picture generation method and equipment
CN107992872B (en) * 2017-12-25 2020-04-28 广东小天才科技有限公司 Method for carrying out text recognition on picture and mobile terminal
US10521654B2 (en) 2018-03-29 2019-12-31 Fmr Llc Recognition of handwritten characters in digital images using context-based machine learning
US11763188B2 (en) * 2018-05-03 2023-09-19 International Business Machines Corporation Layered stochastic anonymization of data
US10417342B1 (en) 2018-07-03 2019-09-17 Gyrfalcon Technology Inc. Deep learning device for local processing classical chinese poetry and verse
US10311149B1 (en) * 2018-08-08 2019-06-04 Gyrfalcon Technology Inc. Natural language translation device
US10387772B1 (en) 2018-10-22 2019-08-20 Gyrfalcon Technology Inc. Ensemble learning based image classification systems
RU2743898C1 (en) 2018-11-16 2021-03-01 Общество С Ограниченной Ответственностью "Яндекс" Method for performing tasks
US10963717B1 (en) * 2018-12-21 2021-03-30 Automation Anywhere, Inc. Auto-correction of pattern defined strings
JP7404625B2 (en) 2019-01-23 2023-12-26 富士フイルムビジネスイノベーション株式会社 Information processing device and program
US11386636B2 (en) 2019-04-04 2022-07-12 Datalogic Usa, Inc. Image preprocessing for optical character recognition
RU2744032C2 (en) * 2019-04-15 2021-03-02 Общество С Ограниченной Ответственностью "Яндекс" Method and system for determining result of task execution in crowdsourced environment
US11281911B2 (en) 2019-04-27 2022-03-22 Gyrfalcon Technology Inc. 2-D graphical symbols for representing semantic meaning of a video clip
US10713830B1 (en) 2019-05-13 2020-07-14 Gyrfalcon Technology Inc. Artificial intelligence based image caption creation systems and methods thereof
SG10201904554TA (en) * 2019-05-21 2019-09-27 Alibaba Group Holding Ltd Methods and devices for quantifying text similarity
RU2744038C2 (en) 2019-05-27 2021-03-02 Общество С Ограниченной Ответственностью «Яндекс» Method and a system for determining the result of a task in the crowdsourcing environment
US11227174B1 (en) 2019-06-10 2022-01-18 James Alves License plate recognition
KR102264988B1 (en) * 2019-06-27 2021-06-16 경북대학교 산학협력단 Traditional Korean character Hanja Recognition System and method using thereof
US11526723B2 (en) 2019-07-09 2022-12-13 Gyrfalcon Technology Inc. Apparatus and methods of obtaining multi-scale feature vector using CNN based integrated circuits
CN110796134B (en) * 2019-08-06 2023-03-28 汕头大学 Method for combining words of Chinese characters in strong-noise complex background image
RU2019128272A (en) 2019-09-09 2021-03-09 Общество С Ограниченной Ответственностью «Яндекс» Method and System for Determining User Performance in a Computer Crowdsourced Environment
US11302108B2 (en) * 2019-09-10 2022-04-12 Sap Se Rotation and scaling for optical character recognition using end-to-end deep learning
CN110717493B (en) * 2019-09-16 2022-04-01 浙江大学 License plate recognition method containing stacked characters based on deep learning
CN110660051B (en) * 2019-09-20 2022-03-15 西南石油大学 Tensor voting processing method based on navigation pyramid
RU2019135532A (en) 2019-11-05 2021-05-05 Общество С Ограниченной Ответственностью «Яндекс» Method and system for selecting a label from a plurality of labels for a task in a crowdsourced environment
US11562203B2 (en) 2019-12-30 2023-01-24 Servicenow Canada Inc. Method of and server for training a machine learning algorithm for estimating uncertainty of a sequence of models
US11481691B2 (en) 2020-01-16 2022-10-25 Hyper Labs, Inc. Machine learning-based text recognition system with fine-tuning model
RU2020107002A (en) 2020-02-14 2021-08-16 Общество С Ограниченной Ответственностью «Яндекс» METHOD AND SYSTEM FOR RECEIVING A LABEL FOR A DIGITAL PROBLEM PERFORMED IN A CROWDSORING ENVIRONMENT
US20220058415A1 (en) * 2020-08-24 2022-02-24 Electronic Transaction Consultants, Llc Gamified Alphanumeric Character Identification
US20220414328A1 (en) * 2021-06-23 2022-12-29 Servicenow Canada Inc. Method and system for predicting field value using information extracted from a document
KR102509943B1 (en) * 2021-07-20 2023-03-14 강상훈 Writing support apparatus for electronic document
US20230084845A1 (en) * 2021-09-13 2023-03-16 Microsoft Technology Licensing, Llc Entry detection and recognition for custom forms
US20230090269A1 (en) * 2021-09-22 2023-03-23 International Business Machines Corporation Historical image search
US20230133690A1 (en) * 2021-11-01 2023-05-04 Salesforce.Com, Inc. Processing forms using artificial intelligence models

Family Cites Families (37)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5805747A (en) * 1994-10-04 1998-09-08 Science Applications International Corporation Apparatus and method for OCR character and confidence determination using multiple OCR devices
US5774588A (en) * 1995-06-07 1998-06-30 United Parcel Service Of America, Inc. Method and system for comparing strings with entries of a lexicon
WO2000057350A1 (en) * 1999-03-19 2000-09-28 Raf Technology, Inc. Rollup functions for efficient storage, presentation, and analysis of data
JP4527322B2 (en) 2001-07-25 2010-08-18 日本電気株式会社 Image search device, image search method, and image search program
US7236632B2 (en) * 2003-04-11 2007-06-26 Ricoh Company, Ltd. Automated techniques for comparing contents of images
US7680330B2 (en) * 2003-11-14 2010-03-16 Fujifilm Corporation Methods and apparatus for object recognition using textons
US7756341B2 (en) 2005-06-30 2010-07-13 Xerox Corporation Generic visual categorization method and system
US7680341B2 (en) 2006-05-05 2010-03-16 Xerox Corporation Generic visual classification with gradient components-based dimensionality enhancement
US7885466B2 (en) 2006-09-19 2011-02-08 Xerox Corporation Bags of visual context-dependent words for generic visual categorization
US20080240572A1 (en) 2007-03-26 2008-10-02 Seiko Epson Corporation Image Search Apparatus and Image Search Method
US7933454B2 (en) 2007-06-25 2011-04-26 Xerox Corporation Class-based image enhancement system
US7885794B2 (en) 2007-11-30 2011-02-08 Xerox Corporation Object comparison, retrieval, and categorization methods and apparatuses
US8009921B2 (en) 2008-02-19 2011-08-30 Xerox Corporation Context dependent intelligent thumbnail images
US8111923B2 (en) 2008-08-14 2012-02-07 Xerox Corporation System and method for object class localization and semantic class based image segmentation
WO2010021326A1 (en) 2008-08-19 2010-02-25 リンテック株式会社 Moulded article, method for producing the same, electronic device member, and electronic device
US9183227B2 (en) 2008-09-19 2015-11-10 Xerox Corporation Cross-media similarity measures through trans-media pseudo-relevance feedback and document reranking
US8463051B2 (en) 2008-10-16 2013-06-11 Xerox Corporation Modeling images as mixtures of image models
US8249343B2 (en) 2008-10-15 2012-08-21 Xerox Corporation Representing documents with runlength histograms
US8774498B2 (en) 2009-01-28 2014-07-08 Xerox Corporation Modeling images as sets of weighted features
US8150858B2 (en) 2009-01-28 2012-04-03 Xerox Corporation Contextual similarity measures for objects and retrieval, classification, and clustering using same
US8175376B2 (en) 2009-03-09 2012-05-08 Xerox Corporation Framework for image thumbnailing based on visual similarity
US8280828B2 (en) 2009-06-12 2012-10-02 Xerox Corporation Fast and efficient nonlinear classifier generated from a trained linear classifier
US8644622B2 (en) 2009-07-30 2014-02-04 Xerox Corporation Compact signature for unordered vector sets with application to image retrieval
US8380647B2 (en) 2009-08-14 2013-02-19 Xerox Corporation Training a classifier by dimension-wise embedding of training data
US9355337B2 (en) 2009-08-25 2016-05-31 Xerox Corporation Consistent hierarchical labeling of image and image regions
US8171049B2 (en) 2009-09-18 2012-05-01 Xerox Corporation System and method for information seeking in a multimedia collection
US20110137898A1 (en) 2009-12-07 2011-06-09 Xerox Corporation Unstructured document classification
US20110194733A1 (en) * 2010-02-11 2011-08-11 Tc License Ltd. System and method for optical license plate matching
US8532399B2 (en) 2010-08-20 2013-09-10 Xerox Corporation Large scale image classification
US8340429B2 (en) 2010-09-18 2012-12-25 Hewlett-Packard Development Company, Lp Searching document images
US8731317B2 (en) 2010-09-27 2014-05-20 Xerox Corporation Image classification employing image vectors compressed using vector quantization
US8370338B2 (en) 2010-12-03 2013-02-05 Xerox Corporation Large-scale asymmetric comparison computation for binary embeddings
US8447767B2 (en) 2010-12-15 2013-05-21 Xerox Corporation System and method for multimedia information retrieval
US8483440B2 (en) * 2011-04-13 2013-07-09 Xerox Corporation Methods and systems for verifying automatic license plate recognition results
US8533204B2 (en) 2011-09-02 2013-09-10 Xerox Corporation Text-based searching of image data
US8582819B2 (en) 2011-11-18 2013-11-12 Xerox Corporation Methods and systems for improving yield in wanted vehicle searches
US8588470B2 (en) 2011-11-18 2013-11-19 Xerox Corporation Methods and systems for improved license plate signature matching by similarity learning on synthetic images
