CN112199545A - Keyword display method and device based on picture character positioning and storage medium - Google Patents

Keyword display method and device based on picture character positioning and storage medium Download PDF

Info

Publication number
CN112199545A
CN112199545A CN202011316753.5A CN202011316753A CN112199545A CN 112199545 A CN112199545 A CN 112199545A CN 202011316753 A CN202011316753 A CN 202011316753A CN 112199545 A CN112199545 A CN 112199545A
Authority
CN
China
Prior art keywords
picture
target
keyword
detected
characters
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202011316753.5A
Other languages
Chinese (zh)
Other versions
CN112199545B (en
Inventor
吴俊洋
王晓斌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hunan Eefung Software Co ltd
Original Assignee
Hunan Eefung Software Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hunan Eefung Software Co ltd filed Critical Hunan Eefung Software Co ltd
Priority to CN202011316753.5A priority Critical patent/CN112199545B/en
Publication of CN112199545A publication Critical patent/CN112199545A/en
Application granted granted Critical
Publication of CN112199545B publication Critical patent/CN112199545B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/583Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/5846Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using extracted text
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/14Image acquisition
    • G06V30/148Segmentation of character regions
    • G06V30/153Segmentation of character regions using recognition of characters or words
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Computing Systems (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Molecular Biology (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Library & Information Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Biology (AREA)
  • Multimedia (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Databases & Information Systems (AREA)
  • Character Input (AREA)
  • Character Discrimination (AREA)

Abstract

The invention provides a keyword display method based on picture character positioning, which comprises the following steps: acquiring a picture to be detected; identifying characters in the picture to be detected to obtain an identification result, wherein the identification result comprises the identified characters and coordinates corresponding to each character area; matching in the recognition result based on the target keyword to obtain a matching result; if the characters corresponding to the target keywords are matched in the matching result, acquiring a target character area containing the matching result from the picture to be detected; and displaying the target keyword area by calculation based on a preset display rule and the coordinate corresponding to the target character area. The user can judge whether the picture to be detected contains the target keyword or not and can quickly find the position of the target keyword.

Description

Keyword display method and device based on picture character positioning and storage medium
Technical Field
The invention relates to the technical field of picture display, in particular to a keyword display method and device based on picture character positioning and a storage medium.
Background
In recent years, with the rapid increase of the number of users of social platforms such as microblogs, instagrams and the like, people are willing to publish and forward own life interests and smells on the platforms in the form of pictures or other pictures, the pictures spread on the platforms reach a mass level, the pictures with text information become a novel blog carrier, the influence of the novel blog carrier is the same as that of traditional blossoms, and the picture form even has better operability and greater attraction than the traditional blog. The quality of characters in the pictures is uneven, the information concealment is high, although the transmission efficiency of the characters cannot be achieved through the transmission of the character information in the picture form, the examination and verification of relevant departments are easily avoided, the transmission of bad information is increased, and the public opinion guidance of some hot events is controlled. In the era of diversified public opinion transmission ways, how to quickly screen out key pictures from massive pictures and quickly find key information is a direction worth paying attention.
Although the OCR application at present can extract the character information in the picture, it can only perform positioning detection and recognition on all characters in the target picture. On one hand, the workload of departments such as platform managers and network security is increasing, and pictures containing sensitive characters need to be effectively monitored; on the other hand, most social platforms cannot search the blog article only containing the picture through the keyword, and cannot quickly find the key information contained in the target picture. This situation leaves the user missing a lot of valuable information when retrieving content. The user inputs interested contents and vocabularies in a foreground search box, and the system can feed back all matched pictures and mark key information in the pictures, which is a requirement that the current OCR application cannot be deeply realized and is a difficult problem to be solved by the invention.
Disclosure of Invention
The invention aims to overcome the defects of the prior art and provides a keyword display method, a keyword display device and a storage medium based on picture character positioning, aiming at enabling a user to judge whether a picture to be detected contains a target keyword and quickly find the position of the target keyword.
The invention is realized by the following steps:
the invention provides a keyword display method based on picture character positioning, which comprises the following steps:
acquiring a picture to be detected;
identifying characters in the picture to be detected to obtain an identification result, wherein the identification result comprises the identified characters and coordinates corresponding to each character area;
matching in the recognition result based on the target keyword to obtain a matching result;
if the characters corresponding to the target keywords are matched in the matching result, acquiring a target character area containing the matching result from the picture to be detected;
and displaying the target keyword area by calculation based on a preset display rule and the coordinate corresponding to the target character area.
In one implementation manner, the step of identifying the characters in the picture to be detected and obtaining the identification result includes:
segmenting the picture to be detected to obtain at least one detection frame, and outputting coordinates of a character area of each detection frame to form a coordinate result set;
cutting the character area of each detection frame along the detection frame, and rotating to a detection direction, wherein the detection direction is horizontal or vertical;
performing character recognition on the character area of each detection frame, and outputting recognition characters line by line; and the recognized characters and coordinates are combined into a recognition result.
In one implementation, after the step of matching in the recognition result based on the target keyword to obtain a matching result, the method further includes:
if the characters corresponding to the target keywords are matched in the matching result, determining that the picture to be detected is a key picture;
otherwise, filtering the picture to be detected and acquiring a new picture to be detected again.
In one implementation manner, the step of displaying the target keyword region by calculation based on a preset display rule and a coordinate corresponding to the target text region includes:
acquiring a coordinate point of a detection frame of the target character area;
and determining a target area corresponding to the target keyword based on the frame of the detection box.
In one implementation, the method further comprises:
drawing the coordinate points corresponding to the target keywords in the to-be-detected picture in a highlight mode;
and displaying the picture to be detected with the highlighted keyword.
In one implementation, the text area of each detection box is subjected to text recognition, and recognition characters are output line by line; and the step of forming a recognition result by the recognized characters and coordinates includes:
performing character recognition on the character area of each detection box by adopting an OCR (optical character recognition), and outputting recognized characters line by line; and the recognized characters and coordinates are combined into a recognition result.
In addition, the invention also discloses a keyword display device based on picture character positioning, which comprises a processor and a memory connected with the processor through a communication bus; wherein,
the memory is used for storing a keyword display program based on picture character positioning;
the processor is configured to execute the keyword display program based on the picture character positioning to implement any one of the keyword display steps based on the picture character positioning.
And a computer storage medium storing one or more programs, the one or more programs being executable by one or more processors to cause the one or more processors to perform any of the picture text positioning based keyword display steps is disclosed.
The keyword display method based on the picture character positioning has the following beneficial effects: the user can find the key information wanted by the user from the massive pictures. On the basis of obtaining the position of the character area and the recognition result, the invention screens out pictures containing key information by matching the user-defined key words, calculates the position coordinates of the key words in each key picture, and highlights the key words on the pictures.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.
Fig. 1 is a schematic flow chart of a keyword display method based on image character positioning according to an embodiment of the present invention;
FIG. 2 is a schematic diagram of OCR text detection and coordinate output of a detection box according to an embodiment of the present invention;
FIG. 3 is a schematic diagram of an OCR detection box cropping and rotation according to an embodiment of the present invention;
FIG. 4 is a schematic diagram of OCR text recognition according to an embodiment of the present invention;
fig. 5 is a schematic diagram illustrating screening of key pictures based on customized keywords according to an embodiment of the present invention, where fig. 5 shows a result that matching is successful, marked as a key picture, and a result that matching is unsuccessful;
FIG. 6 is a schematic diagram of keyword coordinate calculation and highlight visualization based on coordinates of a detection box according to an embodiment of the present invention;
fig. 7 is a schematic diagram illustrating a principle and a flow of calculating coordinates of keywords according to an embodiment of the present invention, and fig. 6 and 7 are schematic diagrams illustrating a method for locating city a and extracting text of a line where city a is located.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Referring to fig. 1, a keyword display method based on picture character positioning, the method comprising:
and S101, acquiring a picture to be detected.
It should be noted that there may be a plurality of pictures to be detected, so as to meet the matching requirements of the user, for example, a huge number of pictures on the social media.
S102, identifying the characters in the picture to be detected, and acquiring an identification result, wherein the identification result comprises the identified characters and the coordinates corresponding to each character area.
It should be noted that, firstly, the characters on the picture to be detected need to be recognized, and specifically, the step of character recognition includes, but is not limited to:
1) character detection: and performing character detection on the picture to be detected, outputting the coordinates of each character area, and forming a coordinate result set. And cutting the character area in each detection frame divided by the picture to be detected along the detection frame, and rotating to be horizontal (rotating to be vertical when vertical expression is performed). And then, performing character recognition on each cut picture, and outputting recognition results line by line to form a recognition result set.
In one implementation, algorithms such as DBNet are used to detect all text regions in a picture to be detected, and the text regions are framed by region coordinate points. The whole algorithm implementation process can be roughly divided into the following steps:
the first step is as follows: inputting a picture to be detected, and obtaining a plurality of characteristic graphs through CNN learning characteristics;
the second step is that: fusing the characteristics, wherein the fused characteristic graph is 1/4 of the original graph;
the third step: performing characteristic regression, regressing a segmentation graph and a threshold graph, and weighting and fusing to obtain a final characteristic graph;
the fourth step: and (5) obtaining a character detection result and outputting coordinates by adopting final characteristics on the original image, and referring to fig. 2.
2) Cutting and correcting: the text area is cut out along the detection frame in the picture to be detected, a plurality of text line pictures are obtained, and the pictures are rotated to be horizontal (vertical if the area is vertically expressed, such as couplet), referring to fig. 3.
3) Character recognition: and (5) identifying characters in each text line picture by using algorithms such as CRNN and the like, and outputting the characters line by line. The whole algorithm implementation process can be roughly divided into the following steps:
the first step is as follows: a convolutional layer, which uses CNN to extract a characteristic sequence from an image to be detected;
the second step is that: a loop layer, using bidirectional LSTM, to correlate context and predict the label distribution of the characteristic sequence;
the third step: and the translation layer is used for calculating the output probability of the label by combining CTC to obtain a character result, and the figure 4 is referred.
S103, matching is carried out in the recognition result based on the target keyword so as to obtain a matching result.
And S104, if the characters corresponding to the target keywords are matched in the matching result, acquiring a target character area containing the matching result in the picture to be detected.
It should be noted that the target keyword may be a user-defined keyword input by the user, and the target keyword is matched with the recognition result set in step S102, and if the result set includes the keyword, the picture to be detected is marked as a key picture, and is added to the key picture set.
And S105, displaying the target keyword area through calculation based on a preset display rule and the corresponding coordinate of the target character area.
And extracting the coordinate points of the target character area contained in the coordinate result set, using the coordinate points as known conditions, calculating the coordinates of the target keyword through a slope formula, and forming a keyword coordinate set corresponding to the target keyword.
And drawing each coordinate point in the keyword coordinate set on the original drawing, and connecting the corresponding coordinate points through straight lines to enable the target keyword to be highlighted on the original drawing.
The invention accurately positions the process to the user-defined keywords, thereby screening out the pictures which are valuable to the user and enabling the user to quickly find the key information contained in the pictures.
A user inputs a keyword which the user wants to query in a foreground search box, and after the background takes the keyword, the user can judge whether the identification result of the target picture contains the keyword, if so, the picture to be detected (with a corresponding matching result) is marked as a key picture and added into a key picture set, and the set is used as the input of the subsequent steps, as shown in fig. 5.
The whole coordinate value calculation process can be roughly divided into the following steps (taking the calculation of the coordinates of a keyword as an example here):
1) according to the input custom keyword, extracting detection box coordinates containing the keyword, namely box = [ [ a1, b1], [ a2, b2], [ a3, b3], [ a4, b4] ] from the key picture, wherein the coordinates are regarded as known conditions;
2) the value of each coordinate point is obtained by an element extraction rule of the list, such as a1= box [0] [0], b1= box [0] [1], and the like, and 8 values are extracted as a known condition for coordinate calculation;
3) calculating the width p _ w and the height p _ h (distance formula between two points) of the detection frame according to the coordinate values obtained in the step 2);
4) respectively obtaining the character length of the recognition result and the character length of the target keyword in the detection box through len (recognition result) and len (keyword), and obtaining the initial position pid of the keyword in the recognition result through an index function (recognition result, index (keyword, index initial position)) (if a plurality of pids exist, all positions are traversed by adding a plurality of cycles);
5) if the distance between the head of the keyword detection box and the head of the original detection box is w1, and the distance between the tail of the keyword detection box and the head of the original detection box is w2, then:
w1= p _ w (pid)/len (recognition result),
w2= p _ w (pid + len (keyword))/len (recognition result);
6) since the keywords (the characters corresponding to the target keywords are partial characters in the sentence corresponding to the picture to be detected) are part of the recognition result of the whole sentence, they form an angle with the virtual coordinate axis. Let the coordinates of the highlight box of the keyword be [ [ x1, y1], [ x2, y2], [ x3, y3], [ x4, y4] ], and calculated according to the trigonometric function "sin α = opposite side/oblique side, cos α = adjacent side/oblique side", similarly to the slope formula, as follows:
(x1-a1)/w1 = (a2-a1)/p_w —> x1 = w1*(a2-a1)/p_w + a1,
(x2-a1)/w2 = (a2-a1)/p_w —> x2 = w2*(a2-a1)/p_w + a1,
by analogy, all values in the coordinates of the key word highlight box are calculated, and the first two steps are referred to in fig. 6;
7) if a plurality of keywords exist in one picture, putting all the obtained coordinates in the step 6) into a list.
The calculation route of the entire keyword coordinates refers to fig. 7.
The method comprises the steps of establishing a virtual coordinate axis by taking the center of an original drawing as an origin, drawing coordinate points of keywords (each keyword comprises 4 coordinate points) in the coordinate axis, and then connecting adjacent points through straight lines with custom colors to form a rectangle, so that a keyword frame is highlighted in the rectangle, and referring to the last two steps of FIG. 6.
In addition, the invention also discloses a keyword display device based on picture character positioning, which comprises a processor and a memory connected with the processor through a communication bus; wherein,
the memory is used for storing a keyword display program based on picture character positioning;
the processor is configured to execute the keyword display program based on the picture character positioning to implement any one of the keyword display steps based on the picture character positioning.
And a computer storage medium storing one or more programs, the one or more programs being executable by one or more processors to cause the one or more processors to perform any of the picture text positioning based keyword display steps is disclosed.
The keyword display method based on the picture character positioning has the following beneficial effects: the user can find the key information wanted by the user from the massive pictures. On the basis of obtaining the position of the character area and the recognition result, the invention screens out pictures containing key information by matching the user-defined key words, calculates the position coordinates of the key words in each key picture, and highlights the key words on the pictures.
The foregoing embodiments are merely illustrative of the principles of the invention and its efficacy, and are not to be construed as limiting the invention. Any person skilled in the art can modify or change the above-mentioned embodiments without departing from the spirit and scope of the present invention. Accordingly, it is intended that all equivalent modifications or changes which can be made by those skilled in the art without departing from the spirit and technical spirit of the present invention be covered by the claims of the present invention.

Claims (7)

1. A keyword display method based on picture character positioning is characterized by comprising the following steps:
acquiring a picture to be detected;
identifying characters in the picture to be detected to obtain an identification result, wherein the identification result comprises the identified characters and coordinates corresponding to each character area;
matching in the recognition result based on the target keyword to obtain a matching result;
if the characters corresponding to the target keywords are matched in the matching result, acquiring a target character area containing the matching result from the picture to be detected;
displaying a target keyword area by calculation based on a preset display rule and a coordinate corresponding to the target character area;
wherein, the displaying the target key area by calculating based on the preset display rule and the coordinate corresponding to the target text area comprises: acquiring a coordinate point of a detection frame of the target character area; and determining a target area corresponding to the target key side based on the frame of the detection frame.
2. The method for displaying keywords based on image text positioning according to claim 1, wherein the step of identifying the characters in the image to be detected and obtaining the identification result comprises:
segmenting the picture to be detected to obtain at least one detection frame, and outputting coordinates of a character area of each detection frame to form a coordinate result set;
cutting the character area of each detection frame along the detection frame, and rotating to a detection direction, wherein the detection direction is horizontal or vertical;
performing character recognition on the character area of each detection frame, and outputting recognition characters line by line; and the recognized characters and coordinates are combined into a recognition result.
3. The method for displaying keywords based on graphic text orientation as claimed in claim 1, wherein after the step of matching in the recognition result based on the target keyword to obtain a matching result, the method further comprises:
if the characters corresponding to the target keywords are matched in the matching result, determining that the picture to be detected is a key picture;
otherwise, filtering the picture to be detected and acquiring a new picture to be detected again.
4. The method for displaying keywords based on graphic text positioning as claimed in claim 1, further comprising:
drawing the coordinate points corresponding to the target keywords in the to-be-detected picture in a highlight mode;
and displaying the picture to be detected with the highlighted keyword.
5. The method for displaying keywords based on image text positioning according to claim 2, characterized in that the text area of each detection box is subjected to text recognition, and recognition characters are output line by line; and the step of forming a recognition result by the recognized characters and coordinates includes:
performing character recognition on the character area of each detection box by adopting an OCR (optical character recognition), and outputting recognized characters line by line; and the recognized characters and coordinates are combined into a recognition result.
6. A keyword display device based on picture character positioning is characterized by comprising a processor and a memory connected with the processor through a communication bus; wherein,
the memory is used for storing a keyword display program based on picture character positioning;
the processor is configured to execute the keyword display program based on photo text positioning to implement the keyword display step based on photo text positioning according to any one of claims 1 to 5.
7. A computer storage medium, characterized in that the computer storage medium stores one or more programs executable by one or more processors to cause the one or more processors to perform the keyword display step based on picture text positioning according to any one of claims 1 to 5.
CN202011316753.5A 2020-11-23 2020-11-23 Keyword display method and device based on picture character positioning and storage medium Active CN112199545B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011316753.5A CN112199545B (en) 2020-11-23 2020-11-23 Keyword display method and device based on picture character positioning and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011316753.5A CN112199545B (en) 2020-11-23 2020-11-23 Keyword display method and device based on picture character positioning and storage medium

Publications (2)

Publication Number Publication Date
CN112199545A true CN112199545A (en) 2021-01-08
CN112199545B CN112199545B (en) 2021-09-07

Family

ID=74033708

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011316753.5A Active CN112199545B (en) 2020-11-23 2020-11-23 Keyword display method and device based on picture character positioning and storage medium

Country Status (1)

Country Link
CN (1) CN112199545B (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112784192A (en) * 2021-01-22 2021-05-11 南京万得资讯科技有限公司 Method for cleaning embedded advertisements in page text content
CN112882678A (en) * 2021-03-15 2021-06-01 百度在线网络技术(北京)有限公司 Image-text processing method, display method, device, equipment and storage medium
CN113221894A (en) * 2021-05-31 2021-08-06 中邮信息科技(北京)有限公司 License plate number identification method and device of vehicle, electronic equipment and storage medium
CN113398602A (en) * 2021-07-15 2021-09-17 网易(杭州)网络有限公司 Information processing method, information processing device, storage medium and computer equipment
CN115952278A (en) * 2023-03-14 2023-04-11 北京有生博大软件股份有限公司 Layout file highlighting method and system based on keyword positioning

Citations (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6470336B1 (en) * 1999-08-25 2002-10-22 Matsushita Electric Industrial Co., Ltd. Document image search device and recording medium having document search program stored thereon
US20030142359A1 (en) * 2002-01-29 2003-07-31 Bean Heather N. Method and apparatus for the automatic generation of image capture device control marks
US20040175036A1 (en) * 1997-12-22 2004-09-09 Ricoh Company, Ltd. Multimedia visualization and integration environment
US20060062453A1 (en) * 2004-09-23 2006-03-23 Sharp Laboratories Of America, Inc. Color highlighting document image processing
CN101105893A (en) * 2006-07-14 2008-01-16 沈阳江龙软件开发科技有限公司 Automobile video frequency discrimination speed-testing method
CN101211341A (en) * 2006-12-29 2008-07-02 上海芯盛电子科技有限公司 Image intelligent mode recognition and searching method
CN101226595A (en) * 2007-01-15 2008-07-23 夏普株式会社 Document image processing apparatus and document image processing process
CN101354705A (en) * 2007-07-23 2009-01-28 夏普株式会社 Apparatus and method for processing document image
CN101354624A (en) * 2008-05-15 2009-01-28 中国人民解放军国防科学技术大学 Surface computing platform of four-way CCD camera collaborative work and multi-contact detection method
CN101520783A (en) * 2008-02-29 2009-09-02 富士通株式会社 Method and device for searching keywords based on image content
CN101566897A (en) * 2009-06-03 2009-10-28 广东威创视讯科技股份有限公司 Positioning device of touch screen and positioning method of touch screen
CN101571875A (en) * 2009-05-05 2009-11-04 程治永 Realization method of image searching system based on image recognition
CN101751785A (en) * 2010-01-12 2010-06-23 杭州电子科技大学 Automatic license plate recognition method based on image processing
CN101820489A (en) * 2009-02-27 2010-09-01 佳能株式会社 Image processing equipment and image processing method
CN103544186A (en) * 2012-07-16 2014-01-29 富士通株式会社 Method and equipment for discovering theme key words in picture
CN103617422A (en) * 2013-10-29 2014-03-05 浙江工业大学 A social relation management method based on business card recognition
CN103679218A (en) * 2013-11-19 2014-03-26 华东师范大学 Handwritten form keyword detection method
CN105023178A (en) * 2015-08-12 2015-11-04 电子科技大学 Main body-based electronic commercere commendation method
CN105468732A (en) * 2015-11-23 2016-04-06 中国科学院信息工程研究所 Image keyword inspecting method and device
CN105518712A (en) * 2015-05-28 2016-04-20 北京旷视科技有限公司 Keyword notification method, equipment and computer program product based on character recognition
CN105701516A (en) * 2016-01-20 2016-06-22 福州大学 Method for automatically marking image on the basis of attribute discrimination
CN106384112A (en) * 2016-09-08 2017-02-08 西安电子科技大学 Rapid image text detection method based on multi-channel and multi-dimensional cascade filter
CN109447078A (en) * 2018-10-23 2019-03-08 四川大学 A kind of detection recognition method of natural scene image sensitivity text
CN109766881A (en) * 2018-11-28 2019-05-17 北京捷通华声科技股份有限公司 A kind of character identifying method and device of vertical text image
CN109858475A (en) * 2019-01-08 2019-06-07 平安科技(深圳)有限公司 Picture character localization method, device, medium and computer equipment
CN109919108A (en) * 2019-03-11 2019-06-21 西安电子科技大学 Remote sensing images fast target detection method based on depth Hash auxiliary network
US10445569B1 (en) * 2016-08-30 2019-10-15 A9.Com, Inc. Combination of heterogeneous recognizer for image-based character recognition
CN110880000A (en) * 2019-11-27 2020-03-13 上海智臻智能网络科技股份有限公司 Picture character positioning method and device, computer equipment and storage medium
CN111125408A (en) * 2019-10-11 2020-05-08 平安科技(深圳)有限公司 Search method and device based on feature extraction, computer equipment and storage medium

Patent Citations (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040175036A1 (en) * 1997-12-22 2004-09-09 Ricoh Company, Ltd. Multimedia visualization and integration environment
US6470336B1 (en) * 1999-08-25 2002-10-22 Matsushita Electric Industrial Co., Ltd. Document image search device and recording medium having document search program stored thereon
US20030142359A1 (en) * 2002-01-29 2003-07-31 Bean Heather N. Method and apparatus for the automatic generation of image capture device control marks
US20060062453A1 (en) * 2004-09-23 2006-03-23 Sharp Laboratories Of America, Inc. Color highlighting document image processing
CN101105893A (en) * 2006-07-14 2008-01-16 沈阳江龙软件开发科技有限公司 Automobile video frequency discrimination speed-testing method
CN101211341A (en) * 2006-12-29 2008-07-02 上海芯盛电子科技有限公司 Image intelligent mode recognition and searching method
CN101226595A (en) * 2007-01-15 2008-07-23 夏普株式会社 Document image processing apparatus and document image processing process
CN101354705A (en) * 2007-07-23 2009-01-28 夏普株式会社 Apparatus and method for processing document image
CN101520783A (en) * 2008-02-29 2009-09-02 富士通株式会社 Method and device for searching keywords based on image content
CN101354624A (en) * 2008-05-15 2009-01-28 中国人民解放军国防科学技术大学 Surface computing platform of four-way CCD camera collaborative work and multi-contact detection method
CN101820489A (en) * 2009-02-27 2010-09-01 佳能株式会社 Image processing equipment and image processing method
CN101571875A (en) * 2009-05-05 2009-11-04 程治永 Realization method of image searching system based on image recognition
CN101566897A (en) * 2009-06-03 2009-10-28 广东威创视讯科技股份有限公司 Positioning device of touch screen and positioning method of touch screen
CN101751785A (en) * 2010-01-12 2010-06-23 杭州电子科技大学 Automatic license plate recognition method based on image processing
CN103544186A (en) * 2012-07-16 2014-01-29 富士通株式会社 Method and equipment for discovering theme key words in picture
CN103617422A (en) * 2013-10-29 2014-03-05 浙江工业大学 A social relation management method based on business card recognition
CN103679218A (en) * 2013-11-19 2014-03-26 华东师范大学 Handwritten form keyword detection method
CN105518712A (en) * 2015-05-28 2016-04-20 北京旷视科技有限公司 Keyword notification method, equipment and computer program product based on character recognition
CN105023178A (en) * 2015-08-12 2015-11-04 电子科技大学 Main body-based electronic commercere commendation method
CN105468732A (en) * 2015-11-23 2016-04-06 中国科学院信息工程研究所 Image keyword inspecting method and device
CN105701516A (en) * 2016-01-20 2016-06-22 福州大学 Method for automatically marking image on the basis of attribute discrimination
US10445569B1 (en) * 2016-08-30 2019-10-15 A9.Com, Inc. Combination of heterogeneous recognizer for image-based character recognition
CN106384112A (en) * 2016-09-08 2017-02-08 西安电子科技大学 Rapid image text detection method based on multi-channel and multi-dimensional cascade filter
CN109447078A (en) * 2018-10-23 2019-03-08 四川大学 A kind of detection recognition method of natural scene image sensitivity text
CN109766881A (en) * 2018-11-28 2019-05-17 北京捷通华声科技股份有限公司 A kind of character identifying method and device of vertical text image
CN109858475A (en) * 2019-01-08 2019-06-07 平安科技(深圳)有限公司 Picture character localization method, device, medium and computer equipment
CN109919108A (en) * 2019-03-11 2019-06-21 西安电子科技大学 Remote sensing images fast target detection method based on depth Hash auxiliary network
CN111125408A (en) * 2019-10-11 2020-05-08 平安科技(深圳)有限公司 Search method and device based on feature extraction, computer equipment and storage medium
CN110880000A (en) * 2019-11-27 2020-03-13 上海智臻智能网络科技股份有限公司 Picture character positioning method and device, computer equipment and storage medium

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112784192A (en) * 2021-01-22 2021-05-11 南京万得资讯科技有限公司 Method for cleaning embedded advertisements in page text content
CN112882678A (en) * 2021-03-15 2021-06-01 百度在线网络技术(北京)有限公司 Image-text processing method, display method, device, equipment and storage medium
CN112882678B (en) * 2021-03-15 2024-04-09 百度在线网络技术(北京)有限公司 Image-text processing method, image-text processing display method, image-text processing device, image-text processing equipment and storage medium
CN113221894A (en) * 2021-05-31 2021-08-06 中邮信息科技(北京)有限公司 License plate number identification method and device of vehicle, electronic equipment and storage medium
CN113398602A (en) * 2021-07-15 2021-09-17 网易(杭州)网络有限公司 Information processing method, information processing device, storage medium and computer equipment
CN113398602B (en) * 2021-07-15 2024-04-30 网易(杭州)网络有限公司 Information processing method, information processing device, storage medium and computer equipment
CN115952278A (en) * 2023-03-14 2023-04-11 北京有生博大软件股份有限公司 Layout file highlighting method and system based on keyword positioning

Also Published As

Publication number Publication date
CN112199545B (en) 2021-09-07

Similar Documents

Publication Publication Date Title
CN112199545B (en) Keyword display method and device based on picture character positioning and storage medium
US10438077B2 (en) Face liveness detection method, terminal, server and storage medium
US8867779B2 (en) Image tagging user interface
CN106575195B (en) Improved drag and drop operations on mobile devices
US8396246B2 (en) Tagging images with labels
JP6951905B2 (en) How to cut out lines and words for handwritten text images
WO2021088422A1 (en) Application message notification method and device
WO2019020061A1 (en) Video dialogue processing method, video client, video server, and computer readable storage medium
CN109492168B (en) Visual tourism interest recommendation information generation method based on tourism photos
CN115620014B (en) Pipeline instrument flow chart information extraction method and equipment based on deep learning
WO2019148923A1 (en) Method and apparatus for searching for images with image, electronic device, and storage medium
CN114494751A (en) License information identification method, device, equipment and medium
CN111191591A (en) Watermark detection method, video processing method and related equipment
Meena et al. Image splicing forgery detection using noise level estimation
JP5480008B2 (en) Summary manga image generation apparatus, program and method for generating manga content summary
US20180336243A1 (en) Image Search Method, Apparatus and Storage Medium
US10963690B2 (en) Method for identifying main picture in web page
CN112532884A (en) Identification method and device and electronic equipment
CN113591657B (en) OCR layout recognition method and device, electronic equipment and medium
WO2022105120A1 (en) Text detection method and apparatus from image, computer device and storage medium
US11995144B2 (en) Webpage illustration processing method, system, device and storage medium
US11048713B2 (en) System and method for visual exploration of search results in two-mode networks
JP6770227B2 (en) Image processing device, image area detection method and image area detection program
CN115004261A (en) Text line detection
CN113449713B (en) Method and device for cleaning training data of face detection model

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant