CN204046697U - Image-text acquisition and recognition device - Google Patents

Image-text acquisition and recognition device

Info

Publication number
CN204046697U
Authority
CN
China
Prior art keywords
image information
information
camera
recognition unit
unit
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201420038757.5U
Other languages
Chinese (zh)
Inventor
陈旭
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual
Priority to CN201420038757.5U
Application granted
Publication of CN204046697U
Anticipated expiration
Expired - Fee Related

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00 Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/04 Scanning arrangements, i.e. arrangements for the displacement of active reading or reproducing elements relative to the original or reproducing medium, or vice versa
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06K GRAPHICAL DATA READING; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K7/00 Methods or arrangements for sensing record carriers, e.g. for reading patterns
    • G06K7/10 Methods or arrangements for sensing record carriers, e.g. for reading patterns by electromagnetic radiation, e.g. optical sensing; by corpuscular radiation
    • G06K7/10544 Methods or arrangements for sensing record carriers, e.g. for reading patterns by electromagnetic radiation, e.g. optical sensing; by corpuscular radiation by scanning of the records by radiation in the optical part of the electromagnetic spectrum
    • G06K7/10821 Methods or arrangements for sensing record carriers, e.g. for reading patterns by electromagnetic radiation, e.g. optical sensing; by corpuscular radiation by scanning of the records by radiation in the optical part of the electromagnetic spectrum further details of bar or optical code scanning devices
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/60 Type of objects
    • G06V20/62 Text, e.g. of license plates, overlay texts or captions on TV images
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00 Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/50 Constructional details
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Electromagnetism (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Toxicology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • User Interface Of Digital Computer (AREA)
  • Studio Devices (AREA)
  • Image Analysis (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

An embodiment of the utility model provides an image-text acquisition and recognition device comprising a multi-point image information acquisition unit and an image information recognition unit. The multi-point image information acquisition unit is connected to the image information recognition unit and sends the image information it captures of the subject, taken from multiple angles and/or multiple positions, to the image information recognition unit, which uses it as the basis for recognizing the corresponding image-text content information. The multi-point image information acquisition unit comprises at least one movable camera; or it comprises multiple cameras; or it comprises one fixed camera that has multiple lenses. With this arrangement, image information can be captured flexibly from different angles or different positions within the region to be captured, so that the corresponding image-text content information can be recognized accurately in subsequent image-text recognition processing.

Description

Image-text acquisition and recognition device
Technical field
The utility model relates to the technical field of image acquisition and recognition processing, and in particular to an image-text acquisition and recognition device.
Background
With the development of image processing technology, image acquisition and recognition technology is being applied ever more widely. However, current image acquisition and recognition technology generally fixes a single ordinary camera in place to capture images of the region to be captured. Image information captured this way often fails to reflect the actual conditions in the captured region truly and accurately; that is, a true and accurate image of the captured region cannot be obtained, and consequently the corresponding image-text content information cannot be recognized accurately in subsequent recognition processing.
Specifically, if an ordinary camera is used for image acquisition, the camera must be placed relatively far away in order to cover the target object. If a wide-angle or fisheye lens is used to cover the target at short range, considerable distortion results. Moreover, because a single fixed camera has a single focal distance, only the small part of the subject at a suitable focusing distance is sharpest; the other parts are not at the best focus position and are therefore less sharp. It is also difficult to photograph curved surfaces without distortion (for example, the strong curvature in the middle of an open book). In addition, an ordinary camera can photograph the whole subject effectively only from directly above. When shooting from the side or at an oblique angle, parts of the subject that are not at the best focus easily become blurred, and, owing to the projection principle, objects far from the camera are imaged smaller, so their resolution drops considerably. Therefore, covering the target object completely with an ordinary single fixed camera (i.e. an ordinary camera) places very high demands on camera resolution and still cannot achieve the desired result.
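The loss of resolution at the far end of an obliquely viewed page can be illustrated with a small calculation. The following is a minimal sketch, not part of the patent: it assumes an idealized pinhole camera mounted at a height above a flat page, with the optical axis aimed at the point of interest. The function name and the numeric values are hypothetical.

```python
def pixels_per_mm(focal_px: float, height_mm: float, offset_mm: float) -> float:
    """Approximate sampling density on a flat page, measured along the
    direction pointing away from the camera.

    Assumptions (illustrative only): pinhole camera, focal length given in
    pixels, camera at height_mm above the page, target point offset_mm away
    horizontally, optical axis aimed at the target point.
    """
    distance_sq = height_mm ** 2 + offset_mm ** 2
    # A short segment on the page subtends an angle proportional to
    # cos(theta) / d, where cos(theta) = h / d, so the density is f * h / d^2.
    return focal_px * height_mm / distance_sq


if __name__ == "__main__":
    near = pixels_per_mm(1000.0, 300.0, 0.0)
    far = pixels_per_mm(1000.0, 300.0, 400.0)
    print(f"directly below: {near:.2f} px/mm, 400 mm away: {far:.2f} px/mm")
```

Under these assumed numbers the density falls from roughly 3.3 px/mm directly below the camera to 1.2 px/mm at a 400 mm offset, which is the effect the paragraph above attributes to the projection principle.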
Moreover current Book Publishing amount is very large.But also there is a part of crowd at present, as children, blind person, the elderly etc., its inconvenience is directly read books, needs aid reading audible device to read for this part crowd, and at present not for the aid reading audible device of general book.
Summary of the utility model
The object of the utility model is to provide an image-text acquisition and recognition device that can capture the desired image information accurately and conveniently, thereby improving the accuracy of recognition processing.
The object of the utility model is achieved through the following technical solution:
An image-text acquisition and recognition device comprises a multi-point image information acquisition unit and an image information recognition unit. The multi-point image information acquisition unit is connected to the image information recognition unit and sends the image information it captures of the subject, taken from multiple angles and/or multiple positions, to the image information recognition unit, which uses it as the basis for recognizing the corresponding image-text content information; wherein:
The multi-point image information acquisition unit comprises at least one movable camera whose shooting angle and/or position is adjustable; the movable camera is connected to a drive motor, and the drive motor controls the rotation and/or movement of the movable camera. Or, the multi-point image information acquisition unit comprises multiple cameras, i.e. two, three, four or more cameras, each of which is a fixed camera or a movable camera; the shooting angle and/or position of a movable camera is adjustable, and the movable camera is manually controlled or connected to a drive motor that controls its rotation and/or movement. Or, the multi-point image information acquisition unit comprises one fixed camera that has multiple lenses.
The movable camera comprises a rotatable camera and/or a translatable camera, i.e. the movable camera can rotate and/or move; or the movable camera comprises one or more movable lenses. The positions at which the movable camera is arranged relative to the region whose image information is to be captured include: above the edge of the region, and/or obliquely above it, and/or directly above it.
The fixed camera comprises one or more lenses. The positions at which the camera is arranged relative to the region whose image information is to be captured include: above the edge of the region, and/or obliquely above it, and/or directly above it.
The image information recognition unit comprises an image-text recognition module and an action recognition module.
The device further comprises:
An audio unit connected to the image information recognition unit. The multi-point image information acquisition unit passes to the image information recognition unit the captured image information of the page currently being read in a book, or of the current reading position in printed matter, or of the position indicated by a reading operation, or image information containing bibliographic information or page number information. After the image information recognition unit has identified, from that image information, the audio information corresponding to the text content of the current page, the current reading position, the reading operation indication, the bibliographic information or the page number information, the audio unit obtains that audio information and outputs it in audio form through a loudspeaker;
And/or,
An audio input unit connected to the image information recognition unit and used for obtaining audio information;
And/or,
A storage unit connected to the image information recognition unit and/or the audio input unit and used for storing content information;
And/or,
A display unit connected to the image information recognition unit and used for displaying content information;
And/or,
A communication unit connected to the image information recognition unit and used for communicating with a computer.
As can be seen from the technical solution provided above, the image-text acquisition and recognition device provided by the embodiment of the utility model adopts a distinctive camera arrangement, so that multi-point image information of the region to be captured (i.e. the subject), that is, image information from different angles and/or different positions, can be obtained flexibly during image acquisition. The captured image information can therefore reflect the actual conditions in the captured region truly and accurately, and the corresponding image-text content information can be recognized accurately in subsequent image-text recognition processing. The device can thus be used as, but is not limited to, an image-text input device; the input may be, for example, full-text input, or input of the portions of image and text selected by reading-operation indications on the printed matter. Furthermore, because the device can recognize image-text content information accurately, it can perform image-text recognition on ordinary books and, combined with a corresponding voice function, read ordinary books aloud, thereby providing a reading-aid voice device that works with ordinary books.
Brief description of the drawings
To illustrate the technical solution of the embodiments of the utility model more clearly, the drawings used in describing the embodiments are briefly introduced below. The drawings described below show only some embodiments of the utility model; a person of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is a first structural schematic diagram of the image-text acquisition and recognition device provided by an embodiment of the utility model;
Fig. 2 is a second structural schematic diagram of the image-text acquisition and recognition device provided by an embodiment of the utility model;
Fig. 3 is a first structural schematic diagram of the image-text acquisition and recognition device comprising multiple cameras provided by an embodiment of the utility model;
Fig. 4 is a second structural schematic diagram of the image-text acquisition and recognition device comprising multiple cameras provided by an embodiment of the utility model;
Fig. 5 is a third structural schematic diagram of the image-text acquisition and recognition device comprising multiple cameras provided by an embodiment of the utility model;
Fig. 6 is a structural schematic diagram of a camera arranged above the edge, provided by an embodiment of the utility model;
Fig. 7 is a first structural schematic diagram of a camera arranged obliquely above, provided by an embodiment of the utility model;
Fig. 8 is a second structural schematic diagram of a camera arranged obliquely above, provided by an embodiment of the utility model;
Fig. 9 is a structural schematic diagram of a camera arranged directly above, provided by an embodiment of the utility model;
Fig. 10 is a first schematic diagram of multiple cameras photographing the curved surface of a book, provided by an embodiment of the utility model;
Fig. 11 is a second schematic diagram of multiple cameras photographing the curved surface of a book, provided by an embodiment of the utility model;
Fig. 12 is a third schematic diagram of multiple cameras photographing the curved surface of a book, provided by an embodiment of the utility model;
Fig. 13 is a schematic diagram of a rotatable camera photographing the curved surface of a book before rotation, provided by an embodiment of the utility model;
Fig. 14 is a schematic diagram of a rotatable camera photographing the curved surface of a book after rotation, provided by an embodiment of the utility model;
Fig. 15 is a structural schematic diagram of an application example provided by an embodiment of the utility model.
Detailed description of the embodiments
The technical solution in the embodiments of the utility model is described clearly and completely below with reference to the drawings. The described embodiments are only some of the embodiments of the utility model, not all of them. All other embodiments obtained by a person of ordinary skill in the art on the basis of these embodiments without creative effort fall within the scope of protection of the utility model.
The embodiments of the utility model are described in further detail below with reference to the drawings.
An embodiment of the utility model provides an image-text acquisition and recognition device whose specific structure, as shown in Fig. 1, may comprise a multi-point image information acquisition unit and an image information recognition unit. The multi-point image information acquisition unit is connected to the image information recognition unit and passes the captured image information to it as the basis for recognizing the corresponding image-text content information, so that the image information recognition unit can recognize image-text content information from the image information. The multi-point image information acquisition unit obtains the image information by photographing, i.e. through the cameras it comprises.
Further, to capture multi-point images of the subject from multiple angles and/or multiple positions, the multi-point image information acquisition unit may adopt any of the following structures:
(1) The multi-point image information acquisition unit may comprise at least one movable camera. The movable camera is connected to a drive motor, which controls its rotation and/or movement. The camera's motion is controlled according to a predetermined control mode or received control information so that it photographs the subject from multiple angles and/or multiple positions. For example, the camera may be controlled automatically by feedback (such as automatic adjustment based on the recognition results of the captured images): when the position the finger points to, a passage of text, or the page number falls outside the field of view, the angle and/or position of the movable camera is adjusted automatically, so that the product needs little or no manual intervention during operation. Alternatively, the camera may be rotated or moved according to control information entered by the user (such as a specific body movement performed by the user, or predetermined control information entered with operating buttons). Alternatively, the camera may be rotated or moved automatically at preset time intervals, so as to photograph the subject from multiple angles and/or multiple positions.
Specifically, the movable camera comprises a rotatable camera and/or a translatable camera, i.e. it can rotate and/or move; or the movable camera comprises one or more movable lenses. If there are multiple movable cameras, each of them is used to capture the image-text content information of all or part of the scene. The movable camera may be arranged at positions including: above the edge of the region whose image information is to be captured, and/or obliquely above it, and/or directly above it.
Further, as shown in Fig. 15, the multi-point image information acquisition unit may specifically comprise a camera, together with a steering motor and its associated mechanism and/or a linear motor and its associated mechanism for driving the camera's motion.
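The feedback-based control described in scheme (1) can be sketched as a simple rule: if the recognized target (for example the fingertip) drifts into the border of the frame, command a small pan or tilt step towards it. This is a minimal illustration under stated assumptions, not the patent's control method; the function name, the margin, the step size and the sign convention (positive pan to the right, positive tilt downwards) are all hypothetical.

```python
from typing import Tuple


def pan_tilt_correction(
    target_xy: Tuple[float, float],
    frame_size: Tuple[int, int],
    margin: float = 0.1,
    step_deg: float = 2.0,
) -> Tuple[float, float]:
    """Return (pan, tilt) steps in degrees that move the camera towards a
    target lying in the border band of the frame; (0.0, 0.0) means hold still.
    """
    x, y = target_xy
    width, height = frame_size
    pan = 0.0
    tilt = 0.0
    # Horizontal check: target in the left or right border band.
    if x < margin * width:
        pan = -step_deg
    elif x > (1.0 - margin) * width:
        pan = step_deg
    # Vertical check: target in the top or bottom border band.
    if y < margin * height:
        tilt = -step_deg
    elif y > (1.0 - margin) * height:
        tilt = step_deg
    return pan, tilt


if __name__ == "__main__":
    print(pan_tilt_correction((630, 240), (640, 480)))
```

In a real device the returned steps would be sent to the drive motor, and the loop would repeat on the next recognized frame until the target is back inside the central area.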
(2) The multi-point image information acquisition unit comprises multiple cameras, which together photograph the subject from multiple angles and/or multiple positions. Each camera is a fixed camera or a movable camera. A movable camera is manually controlled or connected to a drive motor that controls its rotation and/or movement; that is, its motion is controlled according to a predetermined control mode, according to received control information, or manually, so that it photographs the subject from multiple angles and/or multiple positions. For example, the camera may be controlled automatically by feedback (such as automatic adjustment based on the recognition results of the captured images): when the position the finger points to, a passage of text, or the page number falls outside the field of view, the angle and/or position of the movable camera is adjusted automatically, so that the product needs little or no manual intervention during operation. Alternatively, the camera may be rotated or moved according to control information entered by the user (such as a specific body movement performed by the user, or predetermined control information entered with operating buttons). Alternatively, the camera may be rotated or moved automatically at preset time intervals, so as to photograph the subject from multiple angles and/or multiple positions.
Specifically, the fixed camera comprises one or more lenses. If the fixed camera comprises multiple lenses, each lens is controlled, according to a predetermined control mode or received control information, to capture the image-text content information; the predetermined control mode includes controlling all or some of the lenses to photograph from multiple angles and/or multiple positions;
Each of the multiple cameras is used to capture the image-text content information of all or part of the scene;
In scheme (2), the cameras may be arranged at positions including: above the edge of the region whose image information is to be captured, and/or obliquely above it, and/or directly above it.
That is, in this image-text acquisition and recognition device the multi-point image information acquisition unit may comprise multiple cameras, as shown in Figs. 3, 4 and 5. The cameras may be arranged, fixed or movable, above the edge of the subject region and/or obliquely above it and/or directly above it, for example above the edge of a book and/or obliquely above it and/or directly above it. When the device is used for reading, the cameras must be positioned so that they do not interfere with the reader; see Figs. 6, 7, 8 and 9. Because multiple cameras are used, the coverage required of each camera is greatly reduced while the overall coverage increases, which ensures that the area needed for recognition is covered. The cameras may each shoot and perform their own recognition separately, or their captured results may be combined before recognition.
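When each camera performs its own recognition, the per-camera results still have to be reconciled. One possible way, shown here purely as an assumption-laden sketch, is to keep for every page region the reading with the highest confidence. The tuple layout, the region identifiers and the confidence scores are hypothetical; the patent does not specify how results are combined.

```python
from typing import Dict, Iterable, Tuple

# (region_id, recognized_text, confidence in [0, 1]); a hypothetical record
# that each camera's recognition step is assumed to produce.
Reading = Tuple[str, str, float]


def merge_by_confidence(per_camera: Iterable[Iterable[Reading]]) -> Dict[str, str]:
    """Keep, for every region, the text reported with the highest confidence."""
    best: Dict[str, Tuple[str, float]] = {}
    for readings in per_camera:
        for region_id, text, confidence in readings:
            if region_id not in best or confidence > best[region_id][1]:
                best[region_id] = (text, confidence)
    return {region_id: text for region_id, (text, _) in best.items()}


if __name__ == "__main__":
    left_camera = [("line1", "Hello", 0.9), ("line2", "wor1d", 0.4)]
    right_camera = [("line2", "world", 0.8)]
    print(merge_by_confidence([left_camera, right_camera]))
```

A region near the book's spine, for instance, may be sharp for only one of the cameras; selecting by confidence lets that camera's reading win for that region.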
(3) The multi-point image information acquisition unit comprises one fixed camera that has multiple lenses. Each lens is controlled, according to a predetermined control mode or received control information, to capture the image-text content information, so that the lenses together photograph the subject from multiple angles and/or multiple positions; the predetermined control mode includes controlling all or some of the lenses to do so. For example, control may be automatic, by feedback (such as automatic adjustment based on the recognition results of the captured images): when the position the finger points to, a passage of text, or the page number falls outside the field of view, the shooting angle and/or position is adjusted automatically, so that the product needs little or no manual intervention during operation. Alternatively, each lens may be controlled to photograph the subject from multiple angles and/or multiple positions according to control information entered by the user (such as a body movement performed by the user, or predetermined control information entered with operating buttons). Alternatively, the lenses may be controlled automatically according to a preset scheme for capturing image information of the subject from multiple angles and/or multiple positions (for example, the lenses may be set to photograph the subject one after another, or all at the same time, or only some of the lenses may be set to photograph the subject, and so on).
Specifically, in scheme (3), the fixed camera may be arranged at positions including: above the edge of the region whose image information is to be captured, and/or obliquely above it, and/or directly above it; see Figs. 6, 7, 8 and 9.
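The three preset lens schemes mentioned in scheme (3), namely one after another, all at once, or only some lenses, can be expressed as a capture plan: a list of groups, where the lenses in one group fire together. The sketch below is illustrative only; the mode names and the function are hypothetical and not taken from the patent.

```python
from typing import List, Optional, Sequence


def capture_plan(
    lens_ids: Sequence[int],
    mode: str,
    subset: Optional[Sequence[int]] = None,
) -> List[List[int]]:
    """Return groups of lens ids; every group is triggered at the same moment."""
    if mode == "sequential":
        # One lens per step, in the given order.
        return [[lens] for lens in lens_ids]
    if mode == "simultaneous":
        # A single step in which every lens fires.
        return [list(lens_ids)]
    if mode == "subset":
        # A single step restricted to the chosen lenses, kept in lens order.
        chosen = set(subset or [])
        return [[lens for lens in lens_ids if lens in chosen]]
    raise ValueError(f"unknown mode: {mode}")


if __name__ == "__main__":
    print(capture_plan([0, 1, 2], "sequential"))
    print(capture_plan([0, 1, 2], "subset", subset=[2, 0]))
```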
A fixed camera usually has a fixed field of view, but multiple cameras together can provide complete coverage. A movable camera has a fixed field of view at any given angle and position, but it changes its field of view by moving and can therefore also provide complete coverage. In a specific implementation, a fixed camera that has merely come loose is not regarded as a movable camera. Likewise, even if a camera is able to move, if its motion is not used during operation to obtain the required special effect, such as complete coverage, the arrangement in fact still belongs to the fixed-camera scheme. For example, if a movable camera is adjusted to a suitable angle and position in advance but does not need to move during actual use, or its movement contributes very little to special effects such as complete coverage, the arrangement in fact still belongs to the fixed-camera scheme.
In the image-text acquisition and recognition device provided by the embodiment of the utility model, the image-text content information may include, but is not limited to: picture or text content information of printed matter, and/or picture information of a still object in space, and/or body movement information, and/or indication information of a reading operation performed on printed matter, and/or motion information of a manipulated object; that is, the image-text content information may be at least one of these. Correspondingly, the image information recognition unit comprises an image-text recognition module and an action recognition module for recognizing the corresponding image-text or action information. Both modules already exist in the prior art, so their specific implementation is not described in detail here.
That is, the image information recognition unit can, from the captured image information, recognize the pictures or text in printed matter; or recognize a picture of a still object in space (for example, determine the picture or explanatory text corresponding to the still object from its captured image information); or recognize body movement information such as gestures performed by the user (for example, identify the instruction that a predetermined body movement stands for); or recognize the motion of an object manipulated by the user; or recognize the reading operation indications the user makes while reading printed matter; and so on. Further, the indication information of a reading operation on printed matter can be conveyed by body movement information or by the motion of a manipulated object; that is, a specific body movement or a specific motion of a manipulated object can serve as the indication of a particular reading operation. In other words, the indication information of a reading operation on printed matter may include reading-indication operations performed on the printed matter by hand or with a hand-held object, such as an instruction to read from an indicated point, an instruction specifying the content to be read, or an instruction confirming whether reading is required; examples are pointing at the printed matter with the hand, clicking, double-clicking, sliding and turning pages.
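Treating a specific gesture as the indication of a specific reading operation amounts to a lookup from recognized gesture to instruction. The gesture names below follow the examples in the text (pointing, clicking, double-clicking, sliding, page turning), but the instruction names and the particular pairing are hypothetical, chosen only to illustrate the idea.

```python
from typing import Optional

# Hypothetical pairing of recognized gestures with reading instructions;
# the patent lists the gestures but does not fix what each one means.
GESTURE_TO_INSTRUCTION = {
    "point": "select_position",
    "click": "read_word",
    "double_click": "read_sentence",
    "slide": "read_range",
    "page_turn": "next_page",
}


def reading_instruction(gesture: str) -> Optional[str]:
    """Return the reading instruction for a gesture, or None if it has no meaning."""
    return GESTURE_TO_INSTRUCTION.get(gesture)


if __name__ == "__main__":
    print(reading_instruction("click"))
    print(reading_instruction("wave"))
```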
Because the image-text acquisition and recognition device provided by the embodiment of the utility model adopts a distinctive camera arrangement, multi-point image information of the subject, i.e. image information of the subject from different angles and/or different positions, can be obtained flexibly during image acquisition. The captured image information can therefore reflect the actual state of the subject truly and accurately, so that the corresponding image-text content information can be recognized accurately in subsequent processing: for example, accurately recognizing the text or pictures in printed matter, or the meaning of the user's body movements, or the meaning of the motion of an object manipulated by the user, or the meaning of a reading operation performed on printed matter such as a book by body movement or with a manipulated object, or the text or picture the user indicates.
In the graphics context collection recognition device that the utility model embodiment provides, for ease of user based on the sound reading of this device realization for printed matter, then as shown in Figure 2, audio unit can also be comprised in the apparatus, described multipoint images information acquisition unit by read page current in the books of collection or printed matter current reading location or reading operations indicating positions or comprise Bibliographical Information or the image information that comprises page number information pass to described image information recognition unit, described image information recognition unit identifies audio-frequency information corresponding to word content information that is that identify this current read page or printed matter current reading location or reading operations indicating positions or Bibliographical Information according to described current read page or printed matter current reading location or reading operations indicating positions or the image information that comprises Bibliographical Information or comprise page number information or page number information and notifies described audio unit, the audio-frequency information of described correspondence exports with audio form by described audio unit, thus can realize reading aloud for the sound of word content in printed matter, be convenient to the inconvenient crowd intuitively books read and obtain content information in general book.
Further, as shown in Figure 15, the image information recognition unit may comprise components such as a CPU (central processing unit) and a memory, and the audio unit may comprise a loudspeaker and a corresponding drive circuit.
When printed matter is read aloud through the audio unit, the multipoint image information acquisition unit further comprises a reading position information acquisition module, which uses the camera to capture text image information at the user's reading operation position (that is, the current reading position in the printed matter specified by the user). The image information recognition unit recognizes the text content contained in that text image information and notifies the audio unit of either the audio information corresponding to the recognized text content or the audio information obtained by converting the text content. The audio information corresponding to the text content may be a reading of that passage, or other audio information associated with it, such as an explanation of or commentary on the text.
Text recognition is now entering practical use. The recognition process may proceed as follows. The captured image is first preprocessed, mainly by binarization, noise removal, and skew correction. Character features are then extracted: after the character image is thinned, the number and positions of the stroke endpoints and stroke intersections are obtained, or stroke segments are used as features. The features are then matched by comparison to recognize the character. Because text recognition is existing technology, it is not described in detail here.
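The binarization and stroke-feature steps described above can be sketched in a few lines. This is only an illustration under stated assumptions, not the method of the utility model: the function names, the fixed threshold of 128, and the use of a crossing-number count to locate endpoints and intersections are all choices made here, and the input is assumed to be already thinned to one-pixel-wide strokes (noise removal, skew correction, and thinning are omitted).

```python
# Hypothetical sketch: binarize a grayscale grid, then find stroke endpoints
# and intersections on a skeleton that is assumed to be one pixel wide.

# The 8 neighbours in clockwise order, starting from the pixel above.
RING = [(-1, 0), (-1, 1), (0, 1), (1, 1), (1, 0), (1, -1), (0, -1), (-1, -1)]


def binarize(gray, threshold=128):
    """Map grayscale values (0-255) to 1 for ink (dark) and 0 for background."""
    return [[1 if px < threshold else 0 for px in row] for row in gray]


def crossing_number(img, r, c):
    """Count 0 -> 1 transitions while walking once around the 8 neighbours."""
    h, w = len(img), len(img[0])
    ring = [
        img[r + dr][c + dc] if 0 <= r + dr < h and 0 <= c + dc < w else 0
        for dr, dc in RING
    ]
    return sum(1 for i in range(8) if ring[i] == 0 and ring[(i + 1) % 8] == 1)


def stroke_features(skeleton):
    """Return (endpoints, intersections) as lists of (row, col) positions."""
    endpoints, intersections = [], []
    for r, row in enumerate(skeleton):
        for c, px in enumerate(row):
            if not px:
                continue
            cn = crossing_number(skeleton, r, c)
            if cn == 1:        # exactly one stroke leaves this pixel
                endpoints.append((r, c))
            elif cn >= 3:      # three or more strokes meet here
                intersections.append((r, c))
    return endpoints, intersections


# A 5x5 plus sign: dark pixels (0) on the middle row and the middle column.
gray = [[0 if r == 2 or c == 2 else 255 for c in range(5)] for r in range(5)]
endpoints, intersections = stroke_features(binarize(gray))
```

For the plus-shaped test pattern, the four stroke tips come out as endpoints and the centre pixel as the single intersection. A recognizer would then compare such counts and positions against stored character templates.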
Because this graphics context collection recognition device can recognize graphic and text content accurately, it can perform image and text recognition on ordinary books and, combined with a sound output function, read them aloud. It thus provides an assisted-reading sounding device for ordinary books, so that people who cannot conveniently read books directly, such as children, blind people, and the elderly, can read with its assistance, which greatly facilitates the reading of ordinary books for these users. The accuracy of recognition also allows reading to proceed smoothly, which further gives the user a better reading experience.
To preserve the recognized graphic and text content, the graphics context collection recognition device provided by the embodiments of the utility model may also include a storage unit, which stores the graphic and text content information recognized by the image information recognition unit so that it can be retrieved later.
In the graphics context collection recognition device provided by the embodiments of the utility model, the multipoint image information acquisition unit may also acquire image information containing the bibliographic information of a book and pass it to the image information recognition unit, which identifies the book title from that image information. Further, the book title may be output as audio or on a display; for example, the audio unit may read the title aloud, or a display screen may show it.
Further, the multipoint image information acquisition unit may use the camera to capture image information of the book cover as the image information containing the bibliographic information of the book. The image information recognition unit may then determine the book title by recognizing the text in the image information of the book cover (including the front cover, the back cover, and so on), by recognizing the cover image itself, or by recognizing a label in the image information of the book cover. The label may be a dedicated label or code, or an existing label or code such as an ISBN (International Standard Book Number) bar code.
Because the front and back cover images of each book are different, a captured image can be compared directly, or features can be extracted and compared, to identify the specific book and thus determine its title. To make identification easier, an easily recognized label may also be provided on the book so that the title of the current book can be determined from the label. The label may be printed on the book or pasted onto it, and its content may be a picture, a code, or text. Because image recognition is existing technology, it is not described in detail here.
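As one illustration of determining a book title from an existing code such as the ISBN, the sketch below checks a decoded ISBN-13 string with the standard check-digit rule (alternating weights 1 and 3, weighted sum divisible by 10) and then looks the number up in a catalogue. The catalogue, its single entry, and the function names are hypothetical; the patent text does not specify how a decoded code is mapped to a title, and bar code decoding itself is assumed to have happened already.

```python
# Hypothetical sketch: validate a decoded ISBN-13 and map it to a title.

# Hypothetical catalogue; a real device would consult its own book database.
CATALOG = {"9780306406157": "Example Book Title"}


def is_valid_isbn13(code: str) -> bool:
    """Apply the ISBN-13 check: weights 1, 3, 1, 3, ... and sum % 10 == 0."""
    digits = [int(ch) for ch in code if ch in "0123456789"]
    if len(digits) != 13:
        return False
    total = sum(d * (3 if i % 2 else 1) for i, d in enumerate(digits))
    return total % 10 == 0


def title_from_isbn(code: str, catalog=CATALOG):
    """Return the catalogued title, or None if the code is invalid or unknown."""
    key = "".join(ch for ch in code if ch in "0123456789")
    if not is_valid_isbn13(key):
        return None
    return catalog.get(key)
```

Rejecting codes that fail the check digit before the lookup guards against a misread bar code silently selecting the wrong book.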
In the graphics context collection recognition device provided by the embodiments of the present invention, the multipoint image information acquisition unit may also acquire image information containing page number information and pass it to the image information recognition unit, which identifies the page number from that image information. Further, the page number may be output as audio or on a display; for example, the audio unit may read the page number aloud, or a display screen may show it.
The page number information acquisition module determines the page currently being read in the book either by recognizing the image information of the page, or by recognizing the text or the printed page number in the image information of the page.
The graphics context collection recognition device may also include a display unit, which displays preset content information, and/or the image and text information obtained during acquisition and recognition, and/or content information obtained from outside. For example, it may display the page number or title of the book being read, explanatory information about the book (such as an introduction to the author), the user's operation command recognized by the image information recognition unit, or video that explains the book.
Specifically, the graphics context collection recognition device may also include any one or more of the following units:
Audio input unit, for obtaining audio information. The obtained audio information may be saved by the storage unit.
Storage unit, for saving audio information, and/or the image and/or text information obtained during acquisition and recognition, and/or content information obtained from outside. The saved audio information can be played by the audio unit when needed; for example, through the cooperation of the audio input unit, the storage unit, and the audio unit, the accuracy of the user's pronunciation during language learning can be checked and corrected.
Communication unit, for communicating with a computer.
Input unit, connected to the image information recognition unit, for obtaining input information such as key input, handwriting input, or screen input.
Moreover, to strengthen the interaction between the user and the graphics context collection recognition device and to improve the user experience, the device may also include an interactive processing module, which obtains the user's interactive operation control information and performs a predetermined interactive operation according to it. The interactive operation control information includes at least one of a body movement, an action performed with an object, voice information, screen input, or an operation button. During interaction, the device may also play specific prompt sounds to the user through the audio unit, or show specific content information through the display unit, and the user may send the corresponding interactive operation control information to the device in response to the prompt sound or the displayed content, in order to interact with the device. Specifically, the interactive operation control information may include reading operation indication information given on the printed matter, so that interaction is carried out by body movement: the reading mode or the reading content can be controlled through the action of the hand or of a hand-held object. For example, a user reading an ordinary book may use a predetermined gesture to have the content at the current position read again. Because the interactive processing module can recognize the reading user's body movements or actions performed with objects, the device and the reading user can interact, which improves the reading experience and turns the book into a sounding and interactive medium.
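The mapping from a recognized gesture to a predetermined interactive operation can be pictured as a lookup table. The sketch below is a minimal illustration only: the gesture names (`tap_twice`, `swipe_left`), the `Reader` class, and the list that stands in for the audio unit are all hypothetical, and gesture recognition itself is assumed to have produced the gesture label already.

```python
# Hypothetical sketch: dispatch a recognized gesture to a reading operation.


class Reader:
    def __init__(self, passages):
        self.passages = passages
        self.position = 0      # index of the current reading position
        self.spoken = []       # stands in for output through the audio unit

    def read_current(self):
        """Read the passage at the current position again."""
        self.spoken.append(self.passages[self.position])

    def next_passage(self):
        """Advance to the next passage, if there is one, and read it."""
        if self.position < len(self.passages) - 1:
            self.position += 1
        self.read_current()


# Predetermined gestures and the operations they trigger (names are invented).
ACTIONS = {
    "tap_twice": Reader.read_current,
    "swipe_left": Reader.next_passage,
}


def handle_gesture(reader, gesture):
    """Run the operation bound to the gesture; return False if none is bound."""
    action = ACTIONS.get(gesture)
    if action is None:
        return False
    action(reader)
    return True
```

Keeping the bindings in a table means new gestures, voice commands, or button presses could be added without touching the dispatch logic.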
In the embodiments of the utility model, multipoint image acquisition allows the required object to be covered at close range without distortion. Specifically, the movable camera or the multiple cameras provide multiple focal distances, so every part of the captured object can be at its best focus, which ensures that the image of each part is sharp.
For example, as shown in Figures 10, 11, 12, 13 and 14, because a multi-camera structure or a movable camera (such as a rotating-head camera) is used, a suitable shooting angle and position are available for curved surfaces (such as the strongly curved area near the middle of an open book), so such surfaces can be photographed and recognized effectively. Whether the subject is shot from directly above or from an oblique side angle, it can be photographed effectively, with each part in good focus and yielding a sharp image with good resolution. Multipoint image acquisition also lowers the resolution required of each camera; put another way, a camera of the same resolution achieves a higher resolution on the photographed object, which favors recognition.
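The resolution argument above can be made concrete with simple arithmetic: if each camera only has to cover part of the subject, the same sensor spreads its pixels over a smaller width. The figures below (a sensor 1600 pixels across, a 400 mm wide open book, two cameras each covering half of it with no overlap) are hypothetical values chosen for illustration and do not come from the patent.

```python
# Hypothetical sketch: pixel density on the page for one camera versus two.


def pixels_per_mm(sensor_pixels_across: int, covered_width_mm: float) -> float:
    """Pixels available per millimetre of the subject along one axis."""
    if covered_width_mm <= 0:
        raise ValueError("covered width must be positive")
    return sensor_pixels_across / covered_width_mm


# One camera covering the whole 400 mm spread.
single = pixels_per_mm(1600, 400.0)

# Two cameras of the same resolution, each covering 200 mm (overlap ignored).
split = pixels_per_mm(1600, 200.0)
```

Under these assumptions the pixel density on the page doubles, from 4 to 8 pixels per millimetre, without any change to the camera itself.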
Further, as shown in Figure 15, in addition to the audio input unit (that is, the voice input module), a handwriting input module, a screen input module, a gesture input module, and the like may also be included. The communication interface may be a USB interface or the like. The storage unit may also include expansion storage, such as an SD card.
The above is only a preferred embodiment of the utility model, but the scope of protection of the utility model is not limited to it. Any change or replacement that a person skilled in the art could readily conceive within the technical scope disclosed by the utility model shall fall within the scope of protection of the utility model. The scope of protection of the utility model shall therefore be determined by the scope of protection of the claims.

Claims (5)

1. A graphics context collection recognition device, characterized by comprising a multipoint image information acquisition unit and an image information recognition unit, wherein the multipoint image information acquisition unit is connected to the image information recognition unit and sends the acquired multi-angle and/or multi-position image information of the subject to the image information recognition unit as the basis on which the image information recognition unit recognizes the graphic and text content information corresponding to the image information; wherein,
the multipoint image information acquisition unit comprises at least one movable camera whose shooting angle and/or position is adjustable, the movable camera being connected to a drive motor that controls the rotation and/or movement of the movable camera; or the multipoint image information acquisition unit comprises multiple cameras, each of which is a fixed camera or a movable camera, the shooting angle and/or position of the movable camera being adjustable, and the movable camera being manually controlled or connected to a drive motor that controls the rotation and/or movement of the movable camera; or the multipoint image information acquisition unit comprises one fixed camera, the fixed camera comprising multiple lenses.
2. The graphics context collection recognition device according to claim 1, characterized in that the movable camera comprises a rotatable camera and/or a translatable camera; or the movable camera comprises one or more movable lenses; and the position at which the movable camera is arranged relative to the region in which the image information is to be acquired comprises: above the edge of the region, and/or obliquely above the region, and/or directly above the region.
3. The graphics context collection recognition device according to claim 1, characterized in that the fixed camera comprises one or more lenses, and the position at which the camera is arranged relative to the region in which the image information is to be acquired comprises: above the edge of the region, and/or obliquely above the region, and/or directly above the region.
4. The graphics context collection recognition device according to claim 1, characterized in that the image information recognition unit comprises an image and text recognition module and an action recognition module.
5. The graphics context collection recognition device according to claim 1, 2, 3 or 4, characterized in that the device further comprises:
an audio unit connected to the image information recognition unit, wherein the multipoint image information acquisition unit passes to the image information recognition unit the acquired image information of the currently read page of a book, of the current reading position in a printed matter, or of the position indicated by a reading operation, or image information containing bibliographic information or page number information; and, after the image information recognition unit has identified from that image information the audio information corresponding to the text content of the currently read page, the current reading position, the position indicated by the reading operation, the bibliographic information, or the page number information, the audio unit obtains the audio information and outputs it as sound through a loudspeaker;
and/or,
an audio input unit connected to the image information recognition unit and used to obtain audio information;
and/or,
a storage unit connected to the image information recognition unit and/or the audio input unit and used to store content information;
and/or,
a display unit connected to the image information recognition unit and used to display content information;
and/or,
a communication unit connected to the image information recognition unit and used to communicate with a computer;
and/or,
an input unit connected to the image information recognition unit and used to obtain input information.
CN201420038757.5U 2013-01-25 2014-01-21 A kind of graphics context collection recognition device Expired - Fee Related CN204046697U (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201420038757.5U CN204046697U (en) 2013-01-25 2014-01-21 A kind of graphics context collection recognition device

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
CN201310028174 2013-01-25
CN201320040638.9 2013-01-25
CN201320040638 2013-01-25
CN201310028174.4 2013-01-25
CN201420038757.5U CN204046697U (en) 2013-01-25 2014-01-21 A kind of graphics context collection recognition device

Publications (1)

Publication Number Publication Date
CN204046697U true CN204046697U (en) 2014-12-24

Family

ID=50529124

Family Applications (7)

Application Number Title Priority Date Filing Date
CN201710012059.6A Withdrawn CN106803864A (en) 2013-01-25 2014-01-21 Certain graphics context collection identifying device
CN201410028668.7A Expired - Fee Related CN103761893B (en) 2013-01-25 2014-01-21 A kind of book reader
CN201420038757.5U Expired - Fee Related CN204046697U (en) 2013-01-25 2014-01-21 A kind of graphics context collection recognition device
CN201710012060.9A Withdrawn CN106791263A (en) 2013-01-25 2014-01-21 One class graphics context collection identifying device
CN201710012058.1A Withdrawn CN106791262A (en) 2013-01-25 2014-01-21 Graphics context collection identifying device
CN201420037927.8U Expired - Fee Related CN203773763U (en) 2013-01-25 2014-01-21 Book reader
CN201410027696.7A Expired - Fee Related CN103763453B (en) 2013-01-25 2014-01-21 A kind of graphics context collection identification device

Country Status (1)

Country Link
CN (7) CN106803864A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111583734A (en) * 2019-02-19 2020-08-25 阿里巴巴集团控股有限公司 Touch reading method and touch reading pen
CN111701883A (en) * 2020-07-01 2020-09-25 湖南德荣医疗器械物流配送服务有限公司 Intelligent sorting equipment and method for intelligent warehousing

Families Citing this family (40)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106803864A (en) * 2013-01-25 2017-06-06 陈旭 Certain graphics context collection identifying device
CN104199834B (en) * 2014-08-04 2018-11-27 徐�明 The method and system for obtaining remote resource from information carrier surface interactive mode and exporting
CN104157171B (en) * 2014-08-13 2016-11-09 三星电子(中国)研发中心 A kind of point-of-reading system and method thereof
CN104217197B (en) * 2014-08-27 2018-04-13 华南理工大学 A kind of reading method and device of view-based access control model gesture
US9973671B2 (en) * 2014-08-27 2018-05-15 Symbol Technologies, Llc Method and apparatus for directing data capture devices in a mobile unit with a single operation
CN104202640B (en) * 2014-08-28 2016-03-30 深圳市国华识别科技开发有限公司 Based on intelligent television intersection control routine and the method for image recognition
CN104793852A (en) * 2015-04-02 2015-07-22 广东小天才科技有限公司 Method and device for detecting page upturning movement on electronic reading machine
CN104766501B (en) * 2015-04-09 2017-07-14 广东小天才科技有限公司 A kind of point reader and its method for correcting coordinate
CN105046253B (en) * 2015-06-24 2018-05-11 山西同方知网数字出版技术有限公司 A kind of paper strip of paper used for sealing automatic recognition system and method based on OCR
CN105139699A (en) * 2015-08-04 2015-12-09 广东小天才科技有限公司 Method and device for automatically turning pages
CN105447499B (en) * 2015-10-23 2018-09-04 北京爱乐宝机器人科技有限公司 A kind of books interactive approach, device and equipment
CN106408560B (en) * 2016-09-05 2020-01-03 广东小天才科技有限公司 Method and device for rapidly acquiring effective image
CN106515258B (en) * 2016-11-10 2017-12-19 深圳市科迈爱康科技有限公司 Notebook, intelligent terminal and notebook content indexing creation method
CN108242233B (en) * 2016-12-26 2020-11-03 腾讯科技(深圳)有限公司 Audio data playing method and device
CN107085724A (en) * 2017-03-31 2017-08-22 上海斐讯数据通信技术有限公司 A kind of book pages information carrying means and method
CN107657747A (en) * 2017-10-17 2018-02-02 梁北洪 A kind of books, system of studying at school on a temporary basis and its encouragement reading method
CN107977694B (en) * 2017-11-28 2021-07-13 倍仪昇智能科技(苏州)有限公司 Automatic analysis system for photographing, inputting and identifying samples
CN108156374B (en) * 2017-12-25 2020-12-08 努比亚技术有限公司 Image processing method, terminal and readable storage medium
CN108257615A (en) * 2018-01-15 2018-07-06 北京物灵智能科技有限公司 A kind of user language appraisal procedure and system
CN108536287B (en) * 2018-03-26 2021-03-02 深圳市同维通信技术有限公司 Method and device for reading according to user instruction
CN108762507B (en) * 2018-05-30 2021-05-04 辽东学院 Image tracking method and device
CN108845786A (en) * 2018-05-31 2018-11-20 北京智能管家科技有限公司 Intelligent reading partner method, apparatus, equipment and storage medium
CN108875694A (en) * 2018-07-04 2018-11-23 百度在线网络技术(北京)有限公司 Speech output method and device
WO2020034519A1 (en) * 2018-08-17 2020-02-20 中国图书进出口(集团)大连有限公司 Spatial audio reading system and method
CN109448453B (en) * 2018-10-23 2021-10-12 昆明微想智森科技股份有限公司 Point reading question-answering method and system based on image recognition tracking technology
CN109703232A (en) * 2018-11-30 2019-05-03 数景智能科技(宁波)有限公司 A kind of intelligence books note
CN109726697B (en) * 2019-01-04 2021-07-20 北京灵优智学科技有限公司 Online video system and method integrating AV video communication and AI real object identification
CN109753554B (en) * 2019-01-14 2021-03-30 广东小天才科技有限公司 Searching method based on three-dimensional space positioning and family education equipment
CN111695372B (en) * 2019-03-12 2023-10-27 阿里巴巴集团控股有限公司 Click-to-read method and click-to-read data processing method
CN110060524A (en) * 2019-04-30 2019-07-26 广东小天才科技有限公司 The method and reading machine people that a kind of robot assisted is read
CN110287881A (en) * 2019-06-26 2019-09-27 上海交通大学 Books identifying system, books recognition methods, electronic device and storage medium
CN110489005B (en) * 2019-06-28 2022-12-27 浙江工业大学 Two-dimensional point display with touch positioning function and two-dimensional contact driving method thereof
CN110992738A (en) * 2019-10-21 2020-04-10 北京十分科技有限公司 Automatic reading method and device for reading material
CN111178348B (en) * 2019-12-09 2024-03-22 广东小天才科技有限公司 Method for tracking target object and sound box equipment
CN110949037A (en) * 2019-12-16 2020-04-03 中国工程物理研究院激光聚变研究中心 Automatic identification and ejection device for file box
CN111027341A (en) * 2019-12-28 2020-04-17 安徽硕威智能科技有限公司 Interaction method, device and system based on OID two-dimensional password identification and storage medium thereof
CN111698384B (en) * 2020-06-22 2022-06-21 上海肇观电子科技有限公司 Image processing apparatus
CN112329563A (en) * 2020-10-23 2021-02-05 复旦大学 Intelligent reading auxiliary method and system based on raspberry pie
CN112258911B (en) * 2020-11-12 2021-11-09 吉林农业科技学院 English word memory device based on time delay detects
CN113496637B (en) * 2021-06-18 2023-01-03 湖南华壹影业有限公司 Auxiliary training system for image information space-time scanning

Family Cites Families (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1202471C (en) * 2001-09-19 2005-05-18 力新国际科技股份有限公司 Book makign system and method
US7660458B1 (en) * 2004-12-14 2010-02-09 Google Inc. Three-dimensional model construction using unstructured pattern
CN2786695Y (en) * 2005-04-06 2006-06-07 极鼎科技股份有限公司 Interactive studying equipment
CN2807364Y (en) * 2005-06-24 2006-08-16 赵舜培 Electronic learning auxiliary device
CN200944633Y (en) * 2006-08-15 2007-09-05 沈洪 Multi-scene imaging apparatus
CN1952974B (en) * 2006-11-10 2010-06-09 中山大学 Automatic pinpoint system and method of books in library
CN201035576Y (en) * 2006-12-20 2008-03-12 北京恒基伟业投资发展有限公司 Device of implementing instant translation using digital camera technique
CN201097383Y (en) * 2007-01-12 2008-08-06 林良锐 Book hearing machine
CN201063025Y (en) * 2007-05-11 2008-05-21 葛宇 Detecting apparatus with multiple angle camera head
CN201194112Y (en) * 2008-05-12 2009-02-11 刘建生 Video shooting type multifunctional click-to-read machine
CN101630193A (en) * 2008-07-15 2010-01-20 张雪峰 Hand induction equipment
CN101676970B (en) * 2008-09-16 2013-03-20 深圳市王菱科技开发有限公司 Reading processing system with multiple electronic read data functions
CN201294001Y (en) * 2008-11-26 2009-08-19 佛山市安讯智能科技有限公司 Recognizer for collecting and processing image words
US8332741B2 (en) * 2008-12-08 2012-12-11 Qurio Holdings, Inc. Method and system for on-demand narration of a customized story
CN201315627Y (en) * 2008-12-30 2009-09-23 蒋清晓 Multifunctional portable-type electronic typoscope
CN201364654Y (en) * 2009-03-11 2009-12-16 中国长城计算机深圳股份有限公司 Book point-reading system
CN101527828B (en) * 2009-04-14 2011-08-10 华为终端有限公司 Image acquisition equipment
CN201540655U (en) * 2009-05-13 2010-08-04 崔伟 Phonic book
CN102110332B (en) * 2009-12-24 2013-01-09 上海阿艾依智控系统有限公司 Book registering and managing device based on computer vision and radio frequency identification technology
CN101776953A (en) * 2009-12-29 2010-07-14 胡世曦 Optical positioning method and finger mouse integrated with keyboard
CN201611477U (en) * 2010-03-12 2010-10-20 北京汇冠新技术股份有限公司 Optical touch screen
CN201622581U (en) * 2010-04-08 2010-11-03 陶懿 Processing device of image taking and character recognition
CN101833663B (en) * 2010-04-21 2012-10-10 北方工业大学 Binocular electronic reader
CN202551228U (en) * 2012-02-16 2012-11-21 北京奥美达科技有限公司 Visual aid
CN102566603A (en) * 2012-02-29 2012-07-11 天津天地伟业数码科技有限公司 Architecture and method for positioning dome camera
CN102682608B (en) * 2012-05-16 2014-02-19 江苏尤特斯新技术有限公司 Method for image forensics of motor vehicle plate covering and staining behaviors
CN106803864A (en) * 2013-01-25 2017-06-06 陈旭 Certain graphics context collection identifying device

Also Published As

Publication number Publication date
CN103761893A (en) 2014-04-30
CN103763453A (en) 2014-04-30
CN203773763U (en) 2014-08-13
CN103763453B (en) 2019-09-10
CN106803864A (en) 2017-06-06
CN106791263A (en) 2017-05-31
CN103761893B (en) 2018-11-16
CN106791262A (en) 2017-05-31

Similar Documents

Publication Publication Date Title
CN204046697U (en) A kind of graphics context collection recognition device
KR102559028B1 (en) Method and apparatus for recognizing handwriting
CN111050017A (en) Picture and text photographing equipment
US8538087B2 (en) Aiding device for reading a printed text
US8154644B2 (en) System and method for manipulation of a digital image
EP2065871B1 (en) Reading device for blind or visually impaired persons
JP2011516924A (en) Multi-mode learning system
DE202014004572U1 (en) Device and graphical user interface for switching between camera interfaces
US8113841B2 (en) Reading device for blind or visually impaired persons
CN107731020B (en) Multimedia playing method, device, storage medium and electronic equipment
CN110111612A (en) A kind of photo taking type reading method, system and point read equipment
CN104835361B (en) A kind of electronic dictionary
WO2018108177A1 (en) Method for teaching painting using robot, device and robot therefor
CN110225202A (en) Processing method, device, mobile terminal and the storage medium of audio stream
CN111078179B (en) Dictation, newspaper and read progress control method and electronic equipment
Saleous et al. Read2Me: A cloud-based reading aid for the visually impaired
CN110971924B (en) Method, device, storage medium and system for beautifying in live broadcast process
Shilkrot et al. FingerReader: A finger-worn assistive augmentation
CN112329563A (en) Intelligent reading auxiliary method and system based on raspberry pie
CN116048254A (en) Content identification method applied to intelligent equipment, intelligent equipment and intelligent pen
CN210955115U (en) Desktop type auxiliary reading equipment
WO2021084761A1 (en) Image reading device
WO2019090525A1 (en) Information recording method and information recording device
CN114359920A (en) Image processing method, device, equipment and storage medium
JP2021081762A (en) Question creation system, image formation device, question creation program, and question creation device

Legal Events

Date Code Title Description
C14 Grant of patent or utility model
GR01 Patent grant
CP02 Change in the address of a patent holder

Address after: 100083 Beijing Chaoyang District 100107 Mailbox 015 sub-box

Patentee after: Chen Xu

Address before: 100083 No. 1702, 2nd Floor, Dormitory Tower, Erlizhuang District, Haidian District, Beijing

Patentee before: Chen Xu

CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20141224

Termination date: 20190121
