CN101464903A - OCR picture and text recognition and retrieval method and system through web mode - Google Patents

OCR picture and text recognition and retrieval method and system through web mode Download PDF

Info

Publication number
CN101464903A
CN101464903A CNA2009100761552A CN200910076155A CN101464903A CN 101464903 A CN101464903 A CN 101464903A CN A2009100761552 A CNA2009100761552 A CN A2009100761552A CN 200910076155 A CN200910076155 A CN 200910076155A CN 101464903 A CN101464903 A CN 101464903A
Authority
CN
China
Prior art keywords
text
picture
ocr
literal
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CNA2009100761552A
Other languages
Chinese (zh)
Inventor
凌辉
黄惠良
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
JIANGYIN MINGLUN TECHNOLOGY Co Ltd
Original Assignee
JIANGYIN MINGLUN TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by JIANGYIN MINGLUN TECHNOLOGY Co Ltd filed Critical JIANGYIN MINGLUN TECHNOLOGY Co Ltd
Priority to CNA2009100761552A priority Critical patent/CN101464903A/en
Publication of CN101464903A publication Critical patent/CN101464903A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Character Discrimination (AREA)

Abstract

The invention discloses a method for retrieving and recognizing OCR picture and text in a web manner. The method comprises the following steps: acquiring text information and picture information from a picture and text to be recognized; storing the text information and the picture information into an OCR database; and retrieving the full text in the OCR database. The invention further discloses a system for retrieving and recognizing OCR picture and text, which comprises a picture and text information acquisition unit, an OCR database and a retrieval unit. By utilizing the OCR picture and text recognizing technique, the invention ensures that the recognition is more efficient, editable text formatting can be exported, and the required information resource can be retrieved conveniently and effectively by utilizing the full text retrieval technique and inputting characters embedded in the picture information.

Description

A kind of web of utilization mode is carried out OCR picture and text recognition and retrieval method and system
Technical field
The present invention relates to picture and text recognition technology field, particularly relate to a kind of OCR (OpticalCharacter Recognition, optical character identification) picture and text recognition and retrieval method and system.
Background technology
Retrieval is meant that information organizes by certain mode, and finds out the process and the technology of relevant information according to information user's needs, promptly finds out the process of needed information from ensemble of communication.
Owing to can not discern well to the literal in the image file, so there is very big difficulty in the retrieval to the picture and text form that can not arbitrarily edit, this makes management organization face the picture format of different content, seem so at a loss as to what to do, have to spend a large amount of human and material resources, with manual type rearrangement, typing, classification, just can be unified into certain text formatting then and retrieve again.
Summary of the invention
The problem that the embodiment of the invention will solve provides a kind of web of utilization mode and carries out OCR picture and text recognition and retrieval method and system, to overcome the very difficult defective that the picture and text form that can not arbitrarily edit is retrieved in the prior art.
For achieving the above object, the technical scheme of the embodiment of the invention provides a kind of web of utilization mode to carry out OCR picture and text recognition and retrieval method, said method comprising the steps of: A. obtains literal and the pictorial information in the picture and text to be identified; B. described literal and pictorial information are stored in the OCR database; C. according to keyword, in described OCR database, carry out full-text search.
Wherein, steps A specifically comprises: A1. obtains picture and text to be identified; A2. described picture and text are carried out printed page analysis; A3. described picture and text are carried out OCR identification, obtain literal and pictorial information in the described picture and text.
Wherein, after steps A, also comprise: D. proofreads described Word message.
Wherein, step D specifically comprises: D1. laterally proofreads described Word message; D2. described Word message is vertically proofreaded.
Wherein, before step B, also comprise: E. exports to the text formatting file that can edit, duplicate or quote with described literal and pictorial information.
The technical scheme of the embodiment of the invention also provides a kind of OCR picture and text identification searching system, and described system comprises: the graph text information acquiring unit is used for obtaining the literal and the pictorial information of picture and text to be identified; The OCR database is used to store described literal and pictorial information; Retrieval unit is used for according to keyword, carries out full-text search in described OCR database.
Wherein, described Word message acquiring unit comprises: picture and text obtain subelement, are used to obtain picture and text to be identified; The printed page analysis subelement is used for described picture and text are carried out printed page analysis; Picture and text recognin unit is used for described picture and text are carried out OCR identification, obtains the Word message in the described picture and text.
Wherein, to obtain subelement be to possess to take or the equipment of scan function to described picture and text.
Wherein, to obtain subelement be scanner, digital camera, integral machine or shooting mobile phone to described picture and text.
Wherein, described system also comprises the check and correction unit, is used for described Word message is laterally proofreaded and vertically check and correction.
Compared with prior art, technical scheme of the present invention has following advantage:
The embodiment of the invention is utilized OCR picture and text recognition technology, with its efficient identification, derives editable text formatting, utilizes global search technology again, is embedded in literal in the picture information by input, can retrieve needed information resources easily and efficiently.
Description of drawings
Fig. 1 is the process flow diagram that a kind of web of utilization mode of the embodiment of the invention is carried out OCR picture and text recognition and retrieval method;
Fig. 2 is that the another kind of the embodiment of the invention utilizes the web mode to carry out the process flow diagram of OCR picture and text recognition and retrieval method;
Fig. 3 is that the another kind of the embodiment of the invention utilizes the web mode to carry out the process flow diagram of OCR picture and text recognition and retrieval method
Fig. 4 is that the another kind of the embodiment of the invention utilizes the web mode to carry out the process flow diagram of OCR picture and text recognition and retrieval method
Fig. 5 is the structural drawing that a kind of web of utilization mode of the embodiment of the invention is carried out OCR picture and text identification searching system.
Embodiment
Below in conjunction with drawings and Examples, the specific embodiment of the present invention is described in further detail.Following examples are used to illustrate the present invention, but are not used for limiting the scope of the invention.
Embodiment one
A kind of web of utilization mode of the embodiment of the invention is carried out OCR picture and text recognition and retrieval method as shown in Figure 1, may further comprise the steps:
Step s101 obtains picture and text to be identified.Present embodiment obtains picture and text to be identified by any equipment that possesses shooting, scan function such as scanner, digital camera, integral machine, shooting mobile phones.
Step s102 carries out printed page analysis to described picture and text.
Step s103 carries out OCR identification to described picture and text, obtains literal and pictorial information in the described picture and text.
Step s104 is stored in described literal and pictorial information in the OCR database.
Step s105 according to keyword, carries out full-text search in described OCR database.Present embodiment utilizes global search technology, is embedded in literal in the picture information by input, can make things convenient for to retrieve needed information resources efficiently.
Embodiment two
A kind of web of utilization mode of the embodiment of the invention is carried out OCR picture and text recognition and retrieval method as shown in Figure 2, may further comprise the steps:
Step s201 obtains picture and text to be identified.Present embodiment obtains picture and text to be identified by any equipment that possesses shooting, scan function such as scanner, digital camera, integral machine, shooting mobile phones.
Step s202 carries out printed page analysis to described picture and text.
Step s203 carries out OCR identification to described picture and text, obtains literal and pictorial information in the described picture and text.
Step s204 proofreads described Word message.Present embodiment is analyzed automatically to the complicated space of a whole page, and the text of the various mixing forms of intellectual analysis is carried out horizontal and vertical comprehensive school team at the identification file, need not too much manual intervention.
Step s205 is stored in described literal and pictorial information in the OCR database.
Step s206 according to keyword, carries out full-text search in described OCR database.Present embodiment utilizes global search technology, is embedded in literal in the picture information by input, can make things convenient for to retrieve needed information resources efficiently.
Embodiment three
A kind of web of utilization mode of the embodiment of the invention is carried out OCR picture and text recognition and retrieval method as shown in Figure 3, may further comprise the steps:
Step s301 obtains picture and text to be identified.Present embodiment obtains picture and text to be identified by any equipment that possesses shooting, scan function such as scanner, digital camera, integral machine, shooting mobile phones.
Step s302 carries out printed page analysis to described picture and text.
Step s303 carries out OCR identification to described picture and text, obtains literal and pictorial information in the described picture and text.
Step s304 exports to the text formatting file that can edit, duplicate or quote with described literal and pictorial information.In the present embodiment, described text formatting file comprises the multiple text formatting files of editing, duplicating and quote such as word, rtf.
Step s305 is stored in described Word message in the OCR database.
Step s306 according to keyword, carries out full-text search in described OCR database.Present embodiment utilizes global search technology, is embedded in literal in the picture information by input, can make things convenient for to retrieve needed information resources efficiently.
Embodiment four
A kind of web of utilization mode of the embodiment of the invention is carried out OCR picture and text recognition and retrieval method as shown in Figure 4, may further comprise the steps:
Step s401 obtains picture and text to be identified.Present embodiment obtains picture and text to be identified by any equipment that possesses shooting, scan function such as scanner, digital camera, integral machine, shooting mobile phones.
Step s402 carries out printed page analysis to described picture and text.
Step s403 carries out OCR identification to described picture and text, obtains literal and pictorial information in the described picture and text.
Step s404 proofreads described Word message.Present embodiment is analyzed automatically to the complicated space of a whole page, and the text of the various mixing forms of intellectual analysis is carried out horizontal and vertical comprehensive school team at the identification file, need not too much manual intervention.
Step s405 exports to the text formatting file that can edit, duplicate or quote with described literal and pictorial information.In the present embodiment, described text formatting file comprises the multiple text formatting files of editing, duplicating and quote such as word, rtf.
Step s406 is stored in described literal and pictorial information in the OCR database.
Step s407 according to keyword, carries out full-text search in described OCR database.Present embodiment utilizes global search technology, is embedded in literal in the picture information by input, can make things convenient for to retrieve needed information resources efficiently.
A kind of web of utilization mode of the embodiment of the invention is carried out OCR picture and text identification searching system as shown in Figure 5, comprises graph text information acquiring unit, check and correction unit, OCR database and retrieval unit.Wherein, the check and correction unit is connected with the OCR database with the graph text information acquiring unit respectively, and retrieval unit is connected with the OCR database.
The graph text information acquiring unit is used for obtaining the literal and the pictorial information of picture and text to be identified; The check and correction unit is used for described Word message is laterally proofreaded and vertically check and correction; The OCR database is used to store described literal and pictorial information; Retrieval unit is used for according to keyword, carries out full-text search in described OCR database.
The graph text information acquiring unit comprises that picture and text obtain subelement, printed page analysis subelement and picture and text recognin unit, and wherein the printed page analysis subelement obtains subelement with picture and text respectively and is connected with picture and text recognin unit.
It is the equipment that possesses shooting or scan function that picture and text obtain subelement, is used to obtain picture and text to be identified, can be scanner, digital camera, integral machine or shooting mobile phone etc.; The printed page analysis subelement is used for described picture and text are carried out printed page analysis; Picture and text recognin unit is used for described picture and text are carried out OCR identification, obtains literal and pictorial information in the described picture and text.
The picture and text form data that the present invention can not arbitrarily edit, rely on the advantage of OCR research and development technology, it is arbitrarily exported to the multiple text formatting files of editing, duplicating and quote such as word, rtf of appointment, pictograph can be stored in the database after treatment, be convenient to large volume document storage, management, share, transmission and retrieval.Recognition accuracy height of the present invention, strong robustness, seamless integration the overall process of printed page analysis, image recognition, Intelligent Recognition and full-text search.The present invention can be by any equipment that possesses shooting, scan function such as scanner, digital camera, integral machine, shooting mobile phones, anywhere or anytime the picture and text in the image file are carried out OCR identification, existing OCR product all be software and hardware combining together, and the present invention has broken away from the constraint of hardware, realized the random combination of single software and multiple hardwares, make full use of existing equipment, finish the document sharing and the retrieval work in loaded down with trivial details typing, arrangement and later stage.
The present invention utilizes OCR picture and text recognition technology, with its efficient identification, derive text formattings such as editable word, rtf, utilize global search technology again, be embedded in literal in the picture information, can make things convenient for to retrieve needed information resources efficiently by input, thereby can finish Intelligent Recognition fast, efficiently, accurately to picture format, fully satisfied the typing work of different demands such as managerial personnel, clerical workforce,, improved efficient for it has saved a large amount of time.
The present invention can analyze automatically for the complicated space of a whole page, and the text of the various mixing forms of intellectual analysis is carried out laterally and vertical school team comprehensively at the identification file, need not too much manual intervention.And the present invention can carry out layout reversion, has accurately kept former space of a whole page form, accurately recovers the text original appearance.The present invention has powerful official document processing power, can accurately reproduce the official document original appearance.The present invention has realized the random combination of single software and multiple hardwares, makes full use of existing equipment, finishes loaded down with trivial details typing, housekeeping.The present invention utilizes global search technology, and input is embedded in the Word message in the picture information, can find needed photo information fast, efficiently
The above only is a preferred implementation of the present invention; should be pointed out that for those skilled in the art, under the prerequisite that does not break away from the technology of the present invention principle; can also make some improvements and modifications, these improvements and modifications also should be considered as protection scope of the present invention.

Claims (10)

1, a kind of web of utilization mode is carried out OCR picture and text recognition and retrieval method, it is characterized in that, said method comprising the steps of:
A. obtain the Word message in the picture and text to be identified;
B. described literal and pictorial information are stored in the OCR database;
C. according to keyword, in described OCR database, carry out full-text search.
2, OCR picture and text recognition and retrieval method as claimed in claim 1 is characterized in that steps A specifically comprises:
A1. obtain picture and text to be identified;
A2. described picture and text are carried out printed page analysis;
A3. described picture and text are carried out OCR identification, obtain literal and pictorial information in the described picture and text.
3, the web of utilization mode as claimed in claim 1 is carried out OCR picture and text recognition and retrieval method, it is characterized in that, after steps A, also comprises:
D. described Word message is proofreaded.
4, OCR picture and text recognition and retrieval method as claimed in claim 3 is characterized in that step D specifically comprises:
D1. described Word message is laterally proofreaded;
D2. described Word message is vertically proofreaded.
5, OCR picture and text recognition and retrieval method as claimed in claim 1 is characterized in that, before step B, also comprises:
E. described literal and pictorial information are exported to the text formatting file that can edit, duplicate or quote.
6, a kind of OCR picture and text identification searching system is characterized in that described system comprises:
The graph text information acquiring unit is used for obtaining the literal and the pictorial information of picture and text to be identified;
The OCR database is used to store described Word message;
Retrieval unit is used for according to keyword, carries out full-text search in described OCR database.
7, OCR picture and text identification searching system as claimed in claim 6 is characterized in that described Word message acquiring unit comprises:
Picture and text obtain subelement, are used to obtain picture and text to be identified;
The printed page analysis subelement is used for described picture and text are carried out printed page analysis;
Picture and text recognin unit is used for described picture and text are carried out OCR identification, obtains literal and pictorial information in the described picture and text.
8, OCR picture and text identification searching system as claimed in claim 7 is characterized in that it is the equipment that possesses shooting or scan function that described picture and text obtain subelement.
9, OCR picture and text identification searching system as claimed in claim 8 is characterized in that it is scanner, digital camera, integral machine or shooting mobile phone that described picture and text obtain subelement.
10, OCR picture and text identification searching system as claimed in claim 6 is characterized in that described system also comprises the check and correction unit, is used for described Word message is laterally proofreaded and vertically check and correction.
CNA2009100761552A 2009-01-09 2009-01-09 OCR picture and text recognition and retrieval method and system through web mode Pending CN101464903A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNA2009100761552A CN101464903A (en) 2009-01-09 2009-01-09 OCR picture and text recognition and retrieval method and system through web mode

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNA2009100761552A CN101464903A (en) 2009-01-09 2009-01-09 OCR picture and text recognition and retrieval method and system through web mode

Publications (1)

Publication Number Publication Date
CN101464903A true CN101464903A (en) 2009-06-24

Family

ID=40805478

Family Applications (1)

Application Number Title Priority Date Filing Date
CNA2009100761552A Pending CN101464903A (en) 2009-01-09 2009-01-09 OCR picture and text recognition and retrieval method and system through web mode

Country Status (1)

Country Link
CN (1) CN101464903A (en)

Cited By (30)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102262614A (en) * 2010-05-31 2011-11-30 汉王科技股份有限公司 Longitudinal proofreading method and device
CN102467653A (en) * 2010-10-29 2012-05-23 方正国际软件(北京)有限公司 Image-text recognition method and system thereof
CN102663905A (en) * 2012-04-28 2012-09-12 苏州世华天翼电子科技有限公司 Preparation method of electronic teaching materials
CN103020119A (en) * 2012-11-16 2013-04-03 北京北森测评技术有限公司 Conversion method, device and system for converting paper edition resume into electronic edition resume
CN103218351A (en) * 2013-03-15 2013-07-24 杭州中元数据科技有限公司 Modern local literature electronic book manufacture method
CN103279754A (en) * 2013-06-25 2013-09-04 觅林网络科技(上海)有限公司 Business card cloud identification method and system
CN103336759A (en) * 2013-07-04 2013-10-02 力嘉包装(深圳)有限公司 Device and method for automatically proofreading pre-printing image and text
CN103544186A (en) * 2012-07-16 2014-01-29 富士通株式会社 Method and equipment for discovering theme key words in picture
CN103577414A (en) * 2012-07-20 2014-02-12 富士通株式会社 Data processing method and device
CN103678407A (en) * 2012-09-24 2014-03-26 富士通株式会社 Data processing method and data processing device
CN103714047A (en) * 2013-11-12 2014-04-09 知识产权出版社 Lateral proofreading and double-layer PDF file outputting method and device
CN103914539A (en) * 2014-04-01 2014-07-09 百度在线网络技术(北京)有限公司 Information search method and device
CN104618991A (en) * 2014-12-31 2015-05-13 上海连尚网络科技有限公司 Wifi (wireless fidelity) connecting method and wifi connecting system for mobile terminal
CN104715233A (en) * 2014-12-30 2015-06-17 上海孩子国科教设备有限公司 Character conversion method and system
CN104731798A (en) * 2013-12-19 2015-06-24 鸿合科技有限公司 Text retrieval method and text retrieval device
CN105608131A (en) * 2015-12-17 2016-05-25 山东尚德软件股份有限公司 Method for realizing electronization of retrieval and utilization of file information
CN105653733A (en) * 2016-02-26 2016-06-08 百度在线网络技术(北京)有限公司 Searching method and device
CN106708963A (en) * 2016-12-01 2017-05-24 武汉大思想信息股份有限公司 Website editor article input method and system under artificial intelligence mode
CN106909270A (en) * 2016-07-20 2017-06-30 阿里巴巴集团控股有限公司 Chat data input method, device and communicating terminal
CN107292302A (en) * 2016-03-31 2017-10-24 高德信息技术有限公司 Detect the method and system of point of interest in picture
CN108491839A (en) * 2018-03-27 2018-09-04 北京小米移动软件有限公司 Information acquisition method and device
CN108492841A (en) * 2018-03-20 2018-09-04 深圳市通天科技有限公司 A kind of language and characters logger
CN108897862A (en) * 2018-07-02 2018-11-27 广东飞企互联科技股份有限公司 One kind being based on government document picture retrieval method and system
CN109858014A (en) * 2018-12-10 2019-06-07 西南石油大学 Language message active critique system and its active proofreading method
CN110807121A (en) * 2019-09-29 2020-02-18 广东墨痕教育科技有限公司 Electronic education resource matching method based on image-text intelligent identification and computer readable storage medium
CN112445920A (en) * 2019-09-05 2021-03-05 柯尼卡美能达株式会社 Idea proposal support system, idea proposal support device and method, and storage medium
CN113806472A (en) * 2020-06-17 2021-12-17 中国人寿资产管理有限公司 Method and equipment for realizing full-text retrieval of character, picture and image type scanning piece
CN114328804A (en) * 2020-09-27 2022-04-12 广州市久邦数码科技有限公司 Method and system for searching key words containing character pictures
WO2022100338A1 (en) * 2020-11-10 2022-05-19 腾讯科技(深圳)有限公司 Picture search method and apparatus, electronic device, computer-readable storage medium, and computer program product
CN117688162A (en) * 2024-01-16 2024-03-12 广东铭太信息科技有限公司 Full text retrieval method and system based on OCR (optical character recognition)

Cited By (37)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102262614A (en) * 2010-05-31 2011-11-30 汉王科技股份有限公司 Longitudinal proofreading method and device
CN102467653A (en) * 2010-10-29 2012-05-23 方正国际软件(北京)有限公司 Image-text recognition method and system thereof
CN102663905A (en) * 2012-04-28 2012-09-12 苏州世华天翼电子科技有限公司 Preparation method of electronic teaching materials
CN103544186A (en) * 2012-07-16 2014-01-29 富士通株式会社 Method and equipment for discovering theme key words in picture
CN103544186B (en) * 2012-07-16 2017-03-01 富士通株式会社 The method and apparatus excavating the subject key words in picture
CN103577414B (en) * 2012-07-20 2017-04-12 富士通株式会社 Data processing method and device
CN103577414A (en) * 2012-07-20 2014-02-12 富士通株式会社 Data processing method and device
CN103678407A (en) * 2012-09-24 2014-03-26 富士通株式会社 Data processing method and data processing device
CN103020119A (en) * 2012-11-16 2013-04-03 北京北森测评技术有限公司 Conversion method, device and system for converting paper edition resume into electronic edition resume
CN103218351B (en) * 2013-03-15 2016-06-22 杭州中元数据科技有限公司 Modern local literature electronic book manufacture method
CN103218351A (en) * 2013-03-15 2013-07-24 杭州中元数据科技有限公司 Modern local literature electronic book manufacture method
CN103279754A (en) * 2013-06-25 2013-09-04 觅林网络科技(上海)有限公司 Business card cloud identification method and system
CN103336759A (en) * 2013-07-04 2013-10-02 力嘉包装(深圳)有限公司 Device and method for automatically proofreading pre-printing image and text
CN103714047B (en) * 2013-11-12 2017-10-10 北京中献电子技术开发中心 The method and apparatus laterally proofreaded and export bilayer PDF
CN103714047A (en) * 2013-11-12 2014-04-09 知识产权出版社 Lateral proofreading and double-layer PDF file outputting method and device
CN104731798A (en) * 2013-12-19 2015-06-24 鸿合科技有限公司 Text retrieval method and text retrieval device
CN104731798B (en) * 2013-12-19 2018-08-31 鸿合科技股份有限公司 A kind of character search method and device
CN103914539A (en) * 2014-04-01 2014-07-09 百度在线网络技术(北京)有限公司 Information search method and device
CN104715233A (en) * 2014-12-30 2015-06-17 上海孩子国科教设备有限公司 Character conversion method and system
CN104618991A (en) * 2014-12-31 2015-05-13 上海连尚网络科技有限公司 Wifi (wireless fidelity) connecting method and wifi connecting system for mobile terminal
CN105608131A (en) * 2015-12-17 2016-05-25 山东尚德软件股份有限公司 Method for realizing electronization of retrieval and utilization of file information
CN105653733A (en) * 2016-02-26 2016-06-08 百度在线网络技术(北京)有限公司 Searching method and device
CN107292302A (en) * 2016-03-31 2017-10-24 高德信息技术有限公司 Detect the method and system of point of interest in picture
CN106909270A (en) * 2016-07-20 2017-06-30 阿里巴巴集团控股有限公司 Chat data input method, device and communicating terminal
CN106708963B (en) * 2016-12-01 2020-02-18 武汉大思想信息股份有限公司 Website editor article entry method and system in artificial intelligence mode
CN106708963A (en) * 2016-12-01 2017-05-24 武汉大思想信息股份有限公司 Website editor article input method and system under artificial intelligence mode
CN108492841A (en) * 2018-03-20 2018-09-04 深圳市通天科技有限公司 A kind of language and characters logger
CN108491839A (en) * 2018-03-27 2018-09-04 北京小米移动软件有限公司 Information acquisition method and device
CN108897862A (en) * 2018-07-02 2018-11-27 广东飞企互联科技股份有限公司 One kind being based on government document picture retrieval method and system
CN109858014A (en) * 2018-12-10 2019-06-07 西南石油大学 Language message active critique system and its active proofreading method
CN112445920A (en) * 2019-09-05 2021-03-05 柯尼卡美能达株式会社 Idea proposal support system, idea proposal support device and method, and storage medium
CN110807121A (en) * 2019-09-29 2020-02-18 广东墨痕教育科技有限公司 Electronic education resource matching method based on image-text intelligent identification and computer readable storage medium
CN113806472A (en) * 2020-06-17 2021-12-17 中国人寿资产管理有限公司 Method and equipment for realizing full-text retrieval of character, picture and image type scanning piece
CN113806472B (en) * 2020-06-17 2023-12-26 中国人寿资产管理有限公司 Method and equipment for realizing full-text retrieval of text picture and image type scanning piece
CN114328804A (en) * 2020-09-27 2022-04-12 广州市久邦数码科技有限公司 Method and system for searching key words containing character pictures
WO2022100338A1 (en) * 2020-11-10 2022-05-19 腾讯科技(深圳)有限公司 Picture search method and apparatus, electronic device, computer-readable storage medium, and computer program product
CN117688162A (en) * 2024-01-16 2024-03-12 广东铭太信息科技有限公司 Full text retrieval method and system based on OCR (optical character recognition)

Similar Documents

Publication Publication Date Title
CN101464903A (en) OCR picture and text recognition and retrieval method and system through web mode
US8892990B2 (en) Automatic creation of a table and query tools
CN101571861B (en) Method and device for converting data table
CN101446971B (en) Method for building content management system and device thereof
CN106777027B (en) Large-scale parallel processing row-column mixed data storage device and storage and query method
CN103309998A (en) Message query method, message query device and terminal equipment
US8467613B2 (en) Automatic retrieval of object interaction relationships
CN107291778B (en) Data collection method and device
CN105589936A (en) Data query method and system
CN104158945A (en) Conversation information obtaining method, device and system
CN112650529B (en) System and method for configurable generation of mobile terminal APP codes
CN101859303A (en) Metadata management method and management system
CN103020119A (en) Conversion method, device and system for converting paper edition resume into electronic edition resume
CN108319608A (en) The method, apparatus and system of access log storage inquiry
CN111563366A (en) Document processing method and device and electronic equipment
CN102486775A (en) Method and device for querying business data
CN113051303A (en) Business data processing method and device, electronic equipment and storage medium
CN102955779A (en) Method and device for searching software
CN109271448A (en) It is the data synchronous system and method for platform based on database
CN103136264A (en) Accessory inquiring method and user terminal
CN105512270B (en) Method and device for determining related objects
CN103020189A (en) Data processing device and method
CN101872344A (en) Control method for image scanning
CN111241142A (en) Scientific and technological achievement conversion pushing system and method
CN103488440A (en) Bill printing device and bill printing method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Open date: 20090624