CN204046697U

CN204046697U - A kind of graphics context collection recognition device

Info

Publication number: CN204046697U
Application number: CN201420038757.5U
Authority: CN
Inventors: 陈旭
Original assignee: Individual
Current assignee: Individual
Priority date: 2013-01-25
Filing date: 2014-01-21
Publication date: 2014-12-24
Anticipated expiration: 2024-01-21
Also published as: CN103761893A; CN103763453A; CN203773763U; CN103763453B; CN106803864A; CN106791263A; CN103761893B; CN106791262A

Abstract

A kind of graphics context collection recognition device that the utility model embodiment provides, comprise: multipoint images information acquisition unit and image information recognition unit, described multipoint images information acquisition unit is connected with described image information recognition unit, and the multi-angle of the subject of collection and/or multipoint image information is sent to the foundation of described image information recognition unit as graph-text content information corresponding to image information described in the identification of described image information recognition unit; Wherein, institute's multipoint images information acquisition unit comprises at least one movable camera; Or described multipoint images information acquisition unit comprises multiple camera; Or described multipoint images information acquisition unit comprises a fixed camera, described fixed camera comprises multiple camera lens.The realization of the utility model embodiment makes can obtain neatly in image information collecting process the image information needing to gather different angles in image information region or diverse location, thus makes in follow-up picture and text identification processing procedure and accurately can identify corresponding graph-text content information.

Description

A kind of graphics context collection recognition device

Technical field

The utility model relates to IMAQ identifying processing technical field, particularly relates to a kind of graphics context collection recognition device.

Background technology

Along with the development of image processing techniques, the application of corresponding IMAQ recognition technology is also increasingly extensive, but, in current IMAQ recognition technology, be generally and adopt single common camera to be fixedly installed in the acquisition operations of carrying out image in image-region to be collected, the processing mode of this collection image makes the image information collected often cannot react actual conditions in collected region truely and accurately, namely the image really and accurately in defendant's pickup area cannot be obtained, and then cause accurately to identify corresponding graph-text content information in follow-up identification processing procedure.

Particularly, if carry out IMAQ with common camera, in order to obtain the covering to required acquisition target, then need to arrange corresponding camera more at a distance.If attempt with wide-angle lens or flake mirror in the covering of short distance to required acquisition target, larger distortion will be produced.And its single focal length of single fixing camera causes captured object only the clearest in that sub-fraction that focal length is suitable, other parts are not then due at best focus position place, then clear not.And be difficult to realize distortionless shooting to curved surface (such as to the macrobending in the middle part of the book opened).And, the shooting angle of common camera also can only directly over effectively could take whole subject, if from the side or inclined-plane shooting; easily make part not unintelligible at pinpointed focus, and due to projection principle, image objects apart from camera far-end is little, and resolution can decline a lot.Therefore, then require very high to resolution ratio of camera head according to common single fixing camera (i.e. common camera) to comprehensive covering of required acquisition target, and desirable shooting effect cannot be reached.

Moreover current Book Publishing amount is very large.But also there is a part of crowd at present, as children, blind person, the elderly etc., its inconvenience is directly read books, needs aid reading audible device to read for this part crowd, and at present not for the aid reading audible device of general book.

Summary of the invention

The purpose of this utility model is to provide a kind of graphics context collection recognition device, thus can collect the image information of expectation accurately and easily, to improve the accuracy of identification processing procedure.

The purpose of this utility model is achieved through the following technical solutions:

A kind of graphics context collection recognition device, comprise: multipoint images information acquisition unit and image information recognition unit, described multipoint images information acquisition unit is connected with described image information recognition unit, and the multi-angle of the subject of collection and/or multipoint image information is sent to the foundation of described image information recognition unit as graph-text content information corresponding to image information described in the identification of described image information recognition unit; Wherein,

Institute's multipoint images information acquisition unit comprises at least one movable camera, photo angle and/or the position of described movable camera are adjustable, described movable camera is connected with drive motors, and described drive motors controls rotation and/or the movement of described movable camera; Or, described multipoint images information acquisition unit comprises multiple camera, i.e. two or three or four or more camera, and each camera is fixed camera or movable camera, photo angle and/or the position of described movable camera are adjustable, described movable camera is Non-follow control or is connected with drive motors, and described drive motors controls rotation and/or the movement of described movable camera; Or described multipoint images information acquisition unit comprises a fixed camera, described fixed camera comprises multiple camera lens.

Described movable camera comprises rotatable camera head and/or packaged type camera, and namely described movable camera is rotatable removable or removable rotatable; Or described movable camera comprises one or more movable camera lens; Described movable camera is arranged at needs the position in the region gathering described image information to comprise: above the edge in described region and/or oblique upper and/or directly over.

Described fixed camera comprises one or more camera lens, and described camera is arranged at and needs the position in the region gathering described image information to comprise: above the edge in described region and/or oblique upper and/or directly over.

Described image information recognition unit comprises picture and text identification module and action recognition module.

This device also comprises:

Audio unit, be connected with described image information recognition unit, and described multipoint images information acquisition unit by read page current in the books of collection or printed matter current reading location reading operations indicating positions or the image information that comprises Bibliographical Information or comprise page number information pass to described image information recognition unit, and described image information recognition unit according to described current read page or printed matter current reading location or reading operations indicating positions the image information that comprises Bibliographical Information or comprise page number information identifies this current read page or printed matter current reading location or reading operations instruction or after audio-frequency information that the word content information of Bibliographical Information or page number information is corresponding, obtain described audio-frequency information, again the audio-frequency information of described correspondence is exported with audio form by loud speaker,

And/or,

Be connected with described image information recognition unit, and for obtaining the audio input unit of audio-frequency information;

And/or,

Be connected with described image information recognition unit and/or described audio input unit, and for the memory cell of memory content information;

And/or,

Be connected with described image information recognition unit, and for the display unit of displaying contents information;

And/or,

Be connected with described image information recognition unit, and for the communication unit with compunication.

The technical scheme provided as can be seen from above-mentioned the utility model, a kind of graphics context collection recognition device that the utility model embodiment provides is owing to have employed unique camera arrangement, make can obtain neatly in image information collecting process the image information (i.e. the image information of different angles and/or diverse location) needing to gather (i.e. subject) multiple spot in image information region, thus the image information collected can be made can to react actual conditions in collected region truely and accurately, and then make accurately to identify corresponding graph-text content information in follow-up picture and text identification processing procedure.Thus making this device also can be, but not limited to as a kind of picture and text input device etc., picture and text typing is in full typing such as, or divides picture and text to carry out typing etc. along with the selection portion of instruction printed matter being carried out to reading operations.On the other hand, in this graphics context collection recognition device, due to the identification of graph-text content information accurately can be carried out, thus picture and text identifying processing can be carried out for general book, and realize the aid reading sounding process for general book in conjunction with corresponding vocal function, thus provide a kind of aid reading audible device that can carry out auxiliary sounding to general book for people.

Accompanying drawing explanation

In order to be illustrated more clearly in the technical scheme of the utility model embodiment, below the accompanying drawing used required in describing embodiment is briefly described, apparently, accompanying drawing in the following describes is only embodiments more of the present utility model, for those of ordinary skill in the art, under the prerequisite not paying creative work, other accompanying drawings can also be obtained according to these accompanying drawings.

The structural representation one of the graphics context collection recognition device that Fig. 1 provides for the utility model embodiment;

The structural representation two of the graphics context collection recognition device that Fig. 2 provides for the utility model embodiment;

The structural representation one comprising the graphics context collection recognition device of multiple camera that Fig. 3 provides for the utility model embodiment;

The structural representation two comprising the graphics context collection recognition device of multiple camera that Fig. 4 provides for the utility model embodiment;

The structural representation three comprising the graphics context collection recognition device of multiple camera that Fig. 5 provides for the utility model embodiment;

The structural representation being arranged at the camera above edge that Fig. 6 provides for the utility model embodiment;

The structural representation one being arranged at the camera of oblique upper that Fig. 7 provides for the utility model embodiment;

The structural representation two being arranged at the camera of oblique upper that Fig. 8 provides for the utility model embodiment;

Fig. 9 for the utility model embodiment provide be arranged at directly over the structural representation of camera;

The schematic diagram one of multiple camera shooting books top-surface cambers that Figure 10 provides for the utility model embodiment;

The schematic diagram two of multiple camera shooting books top-surface cambers that Figure 11 provides for the utility model embodiment;

The schematic diagram three of multiple camera shooting books top-surface cambers that Figure 12 provides for the utility model embodiment;

The schematic diagram of the rotatable camera head rotation front shooting books top-surface camber that Figure 13 provides for the utility model embodiment;

The schematic diagram of the rotatable camera head rotation rear shooting books top-surface camber that Figure 14 provides for the utility model embodiment;

The Application Example structural representation that Figure 15 provides for the utility model embodiment.

Embodiment

Below in conjunction with the accompanying drawing in the utility model embodiment, be clearly and completely described the technical scheme in the utility model embodiment, obviously, described embodiment is only the utility model part embodiment, instead of whole embodiments.Based on embodiment of the present utility model, those of ordinary skill in the art, not making the every other embodiment obtained under creative work prerequisite, belong to protection range of the present utility model.

Below in conjunction with accompanying drawing, the utility model embodiment is described in further detail.

The utility model embodiment provides a kind of graphics context collection recognition device, its specific implementation structure as shown in Figure 1, can comprise: multipoint images information acquisition unit and image information recognition unit, described multipoint images information acquisition unit is connected with described image information recognition unit, and the image information of collection is passed to the foundation of described image information recognition unit as graph-text content information corresponding to image information described in the identification of described image information recognition unit, so that described image information recognition unit can identify graph-text content information according to described image information, described multipoint images information acquisition unit adopts shooting style to obtain described image information, namely the camera shooting that described multipoint images information acquisition unit is comprised by it obtains described image information.

Further, for realizing for the multi-angle of subject and/or the shooting of multipoint multipoint images, corresponding multipoint images information acquisition unit can adopt following arbitrary structure to realize:

(1) described multipoint images information acquisition unit can comprise at least one movable camera, described movable camera is connected with drive motors, described drive motors controls rotation and/or the movement of described movable camera, and described movable camera controls its activity to carry out multi-angle and/or the shooting of multipoint multipoint images to subject based on the control information of predetermined control mode or reception.Such as, automatically (as done automatic adjustment according to the feedback after photographic images identification) is controlled according to feedback, such as when the finger place of showing goes beyond the scope or segment word has gone beyond the scope or the page number has gone beyond the scope, then automatically adjust angle and/or the position of movable camera, such product in the course of the work without the need to or need manual intervention less, or, as described in controlling according to the control information (predetermined control information etc. that the specific limb action performed as user or user are inputted by operation push-button) of user's input, camera rotates or moves, or, also can automatically control described camera according to the time interval preset rotate or move, to carry out for the multi-angle of subject and/or the shooting of multipoint multiple spot.

Particularly, described movable camera comprises rotatable camera head and/or packaged type camera, and namely described movable camera is rotatable removable or removable rotatable; Or described movable camera comprises one or more movable camera lens; If described movable camera comprises multiple, then each camera of comprising of multiple movable camera is for gathering the graph-text content information of all or part of scene; Described movable camera can be arranged at needs the position in the region gathering described image information to comprise: above the edge in described region and/or oblique upper and/or directly over.

Further, as shown in figure 15, corresponding multipoint images information acquisition unit specifically can comprise camera, and for driving the steer motor of camera activity and corresponding mechanism/or linear electric motors and corresponding mechanism.

(2) described multipoint images information acquisition unit comprises multiple camera, described multiple camera has been used for multi-angle and/or the shooting of multipoint multipoint images, and each camera is fixed camera or movable camera, described movable camera is Non-follow control or is connected with drive motors, described drive motors controls rotation and/or the movement of described movable camera, namely described movable camera be based on predetermined control mode or based on receive control information or its activity of Non-follow control with subject is carried out multi-angle and/or multipoint multipoint images shooting.Such as, automatically (as done automatic adjustment according to the feedback after photographic images identification) is controlled according to feedback, such as when the finger place of showing goes beyond the scope or segment word has gone beyond the scope or the page number has gone beyond the scope, then automatically adjust angle and/or the position of movable camera, such product in the course of the work without the need to or need manual intervention less, or, as described in controlling according to the control information of user's input (the special limbs performed as user determine the predetermined control information etc. that action or user are inputted by operation push-button), camera rotates or movement, or, also can automatically control described camera according to the time interval preset rotate or move, to carry out for the multi-angle of subject and/or the shooting of multipoint multiple spot.

Particularly, described fixed camera comprises one or more camera lens, if and described fixed camera comprises multiple camera lens, then control each camera lens based on the control information of predetermined control mode or reception and gather described graph-text content information, and described predetermined control mode comprises and controls whole camera lens in multiple camera lens or partial lens carries out multi-angle and/or the shooting of multipoint multipoint images;

Each camera that described multiple camera comprises is for gathering the graph-text content information of all or part of scene;

In the program (2), described camera can be arranged at needs the position in the region gathering described image information to comprise: above the edge in described region and/or oblique upper and/or directly over.

That is, in this graphics context collection recognition device, corresponding multipoint images information acquisition unit can comprise multiple camera, as shown in Fig. 3, Fig. 4 and Fig. 5, corresponding multiple camera can be arranged at subject region with fixing or mobilizable mode edge above and/or oblique upper and/or directly over, such as, above the edge that can be arranged at books and/or oblique upper and/or directly over, the position arranged when reading as reader needs not affect reader's read books, specifically can with reference to shown in Fig. 6, Fig. 7, Fig. 8 and Fig. 9.Simultaneously, owing to have employed multiple camera, substantially reduce and the shooting required for each camera is covered, increase overall covering, thus ensure to cover the shooting required for identification, multiple camera can be taken separately and carry out respective identification work, also can shooting results is comprehensive after for identification.

(3) described multipoint images information acquisition unit comprises a fixed camera, described fixed camera comprises multiple camera lens, and control based on the control information of predetermined control mode or reception each camera lens that described multiple case for lense contains and gather described graph-text content information, thus can realize carrying out multi-angle and/or the shooting of multipoint multipoint images to subject by described multiple camera lens, and described predetermined control mode comprises the whole camera lens in the multiple camera lens of control or partial lens carries out multi-angle and/or multipoint multipoint images is taken.Such as, automatically (as done automatic adjustment according to the feedback after photographic images identification) is controlled according to feedback, such as when the finger place of showing goes beyond the scope or segment word has gone beyond the scope or the page number has gone beyond the scope, then automatically adjust angle and/or the position of movable camera, such product in the course of the work without the need to or need manual intervention less, or, according to the control information of user's input (limbs performed as user determine the predetermined control information etc. that action or user are inputted by operation push-button) control, each camera lens is taken the multi-angle of subject and/or multiposition, or, also can automatically control each camera lens described according to each camera lens collection multi-angle of subject preset and/or the mode of multipoint image information to carry out taking (such as, each camera lens can be set and obtain corresponding image information as taking subject successively, also can set each camera lens to take subject simultaneously and obtain corresponding image information, or, also corresponding image information can be obtained by setting section lens shooting subject, etc.).

Particularly, in the program (3), described fixed camera can be arranged at needs the position in the region gathering described image information to comprise: above the edge in described region and/or oblique upper and/or directly over, specifically can with reference to shown in Fig. 6, Fig. 7, Fig. 8 and Fig. 9.

Fixing camera has the fixing visual field usually, but multiple camera can form comprehensive covering, and movable camera has the fixing visual field when a certain angle position, but it changes the visual field by movable, therefore also comprehensive covering can be formed, in concrete enforcement, if fixing camera has loosened, movable camera can not be considered as, equally, even if camera can be movable, if but be not to obtain required special result in its course of work by activity, such as comprehensively cover, then in fact still belong to fixing camera scheme.Such as, if camera that can be movable adjusts to suitable angle position etc. in advance, but do not need to carry out activity in actual use, or activity is very little to effects such as special effect such as comprehensively cover, then in fact still belong to fixing camera scheme.

In the graphics context collection recognition device that the utility model embodiment provides, described graph-text content information specifically can be, but not limited to comprise: the picture of printed matter or word content information, and/or, the pictorial information of space still life, and/or, limb action information, and/or, printed matter is carried out to the indication information of reading operations, and/or, the action message of operation object; Namely described graph-text content information can be the pictorial information of the picture of printed matter or word content information, space still life, limb action information, at least one item carried out printed matter in the indication information of reading operations and the action message of operation object.Corresponding corresponding image information recognition unit comprises picture and text identification module and action recognition module, for the corresponding picture and text of identification or action message, corresponding picture and text identification module and action recognition module are already present module in prior art, therefore are not described in detail its specific implementation at this.

That is, corresponding image information recognition unit can identify picture in printed matter or Word message according to the image information collected, or, also the picture (as determined the content informations such as the corresponding picture of corresponding space still life or explanatory note according to the image information of space still life collected) of space still life can be identified, or, also the limb action information (as identified the execution instruction implication etc. corresponding to predetermined limb action) such as the gesture motion of user's execution can be identified, or, also the action message of user operation object can be identified, or, also reading operations instruction when user reads printed matter can be identified, etc..Further, indication information printed matter being carried out to reading operations can be realized by the action message of limb action information or operation object, namely can using the indication information of the action of specific limb action or operation object as certain reading operations; That is, describedly can to comprise the indication information that printed matter carries out reading operations: the reading instruction operation information carried out on printed matter by hand or hand-held object, indicate the instruction of a bit reading as determined or determine to need the instruction of reading content or determine whether the instruction etc. that needs are read, such as hand is given directions on printed matter, clicking, double-clicking, sliding, page turning etc.

A kind of graphics context collection recognition device that the utility model embodiment provides is owing to have employed unique camera arrangement, make can obtain neatly in image information collecting process the multipoint images information needing to gather subject, namely be taken corresponding different angles and/or the image information of diverse location is gathered, thus the image information collected can be made can to react the actual conditions of subject truely and accurately, and then make accurately to identify corresponding graph-text content information in follow-up picture and text identification processing procedure, as identified word in printed matter or pictorial information exactly, or, identify the implication of the limb action of user, or, identify the implication of the action that user operation object performs, or, identify user by limb action or operation object to the implication of the reading operations of the printed matters such as books, or, identify the word or picture etc. of user's instruction.

In the graphics context collection recognition device that the utility model embodiment provides, for ease of user based on the sound reading of this device realization for printed matter, then as shown in Figure 2, audio unit can also be comprised in the apparatus, described multipoint images information acquisition unit by read page current in the books of collection or printed matter current reading location or reading operations indicating positions or comprise Bibliographical Information or the image information that comprises page number information pass to described image information recognition unit, described image information recognition unit identifies audio-frequency information corresponding to word content information that is that identify this current read page or printed matter current reading location or reading operations indicating positions or Bibliographical Information according to described current read page or printed matter current reading location or reading operations indicating positions or the image information that comprises Bibliographical Information or comprise page number information or page number information and notifies described audio unit, the audio-frequency information of described correspondence exports with audio form by described audio unit, thus can realize reading aloud for the sound of word content in printed matter, be convenient to the inconvenient crowd intuitively books read and obtain content information in general book.

Further, with reference to shown in Figure 15, corresponding image information recognition unit can comprise CPU(central processing unit) and the parts such as memory, corresponding audio unit can comprise loud speaker and corresponding drive circuit.

Carrying out in sound reading operating process by described audio unit to printed matter, described multipoint images information acquisition unit also comprises read location information acquisition module, for the character image information by camera collection user reading operations position (i.e. user specify printed matter current reading location), and the word content that described in the identification of described image information recognition unit, the character image packets of information of user's reading operations position contains, and will identify that the audio-frequency information that the described word content determined is corresponding or the audio-frequency information that the conversion of described word content obtains notify described audio unit.Wherein, audio-frequency information corresponding to described word content can read aloud audio-frequency information for this segment word content, also can be other audio-frequency informations corresponding to this article word content, as audio-frequency informations such as the explanation explanations to this word content.

Corresponding Text region progresses into the practical stage at present, corresponding identification processing procedure can comprise: first to the Image semantic classification of taking pictures, this preliminary treatment mainly comprises the process such as binaryzation, noise remove, inclination calibration, then character features extraction is carried out, comprise after word image graph thinning, obtain the stroke end points of word, the quantity in crosspoint and position, or with stroke section for feature, coordinate comparison method to compare, thus identify word.Because character recognition technology has been prior art, therefore be no longer described in detail at this.

In this graphics context collection recognition device, due to the identification of graph-text content information accurately can be carried out, thus picture and text identifying processing can be carried out for general book, and realize the aid reading sounding process for general book in conjunction with corresponding vocal function, thus provide a kind of aid reading audible device that can carry out auxiliary sounding to general book for people, this just makes children, blind person, the inconvenience such as the elderly can carry out aid reading by this graphics context collection recognition device to the crowd that books are directly read, be very easy to the reading operations of this part crowd to general book.And the accuracy of identifying can also ensure that books reading process can be carried out swimmingly, further ensure reading user and there is preferably reading experience.

In the graphics context collection recognition device that the utility model embodiment provides, for ease of preserving the graph-text content information identified, memory cell can also be comprised in the apparatus, for preserving the described graph-text content information that described image information recognition unit identifies, to facilitate follow-up calling described graph-text content information.

In the graphics context collection recognition device that the utility model embodiment provides, the image information including the Bibliographical Information of books that described multipoint images information acquisition unit can also gather also passes to described image information recognition unit, described image information recognition unit according to described in include the Bibliographical Information of books image information identify book name.Further, described book name can also be exported by the mode of audio frequency or display, such as, book name can be read aloud out by described audio unit, or demonstrate book name by display screen.

Further, described multipoint images information acquisition unit can by the image information of described camera collection figure book envelope as the image information of Bibliographical Information comprising described books, described image information recognition unit then can by identifying that described figure book envelope (comprises front cover, back cover etc.) image information in word determination book name, or, also can by identifying the image information determination book name of described figure book envelope, or, can also by identifying the label determination book name in the image information of described figure book envelope, corresponding label comprises special label or coding, or also can comprise ISBN bar code (International Standard Book Number, International Standard Book Number) etc. the label that existed at present or coding.

Due to the front cover of this book every and back cover image all different, therefore contrast can be carried out by the image information photographed identify, or extraction Characteristic Contrast thus identify to be specially which books, thus determine corresponding book name.And, for ease of identifying, the label being convenient to accordingly identify can also be set in books, make the concrete book name can determining current books according to this label, corresponding label can for being printed on the label on books, also can for being pasted on the label on books, and corresponding label can be picture or the content information such as coding or word.Because concrete image recognition technology has been prior art, therefore be no longer described in detail at this.

In the graphics context collection recognition device that the embodiment of the present invention provides, described multipoint images information acquisition unit can also gather the image information that includes page number information and pass to described image information recognition unit, described image information recognition unit according to described in include page number information image information identify the page number.Further, described book name can also be exported by the mode of audio frequency or display, such as, by the bright reading page number of described audio unit, or the page number can be demonstrated by display screen.

Described page number information acquisition module by identifying that the image information of page determines the page number of current reading in described books, or, by identifying that word in the image information of page in described books or the digital page number determine the page number of current reading.

This graphics context collection recognition device can also comprise display unit, for display setting content information and/or gather in identifying the image and Word message and/or the outside content information obtained that obtain, such as, the information such as the page number or book name of current books reading can be shown, or, show needle is to the explanation descriptive information (as introduction of authors etc.) of books, or, show the operational order of the user that described image information recognition unit identifies, or, play the video information being used for making an explanation to books, etc.

Particularly, this graphics context collection recognition device can also comprise following any one or multinomial unit:

Audio input unit, for obtaining audio-frequency information.Audio-frequency information after corresponding acquisition can be preserved by memory cell.

Memory cell, comprise audio information and/or preserve the image and/or Word message that gather and obtain in identifying and/or the content information preserving outside acquisition, the voice messaging preserved can be play by audio unit when needed, such as, by the cooperation of audio input unit and this memory cell and audio unit, the pronunciation of user in Course of Language Learning can be corrected whether accurate etc.

Communication unit, for communicating with between computer.

Input unit, is connected with described image information recognition unit, for obtaining input information, and such as key-press input, handwriting input, screen input etc.

Moreover, for strengthening the interactive processing between user and this graphics context collection recognition device, promote the experience that user uses this graphics context collection recognition device, interactive processing module can also be comprised in the apparatus, for obtaining the interactive operation control information of user, and perform predetermined interactive operation according to described interactive operation control information, and described interactive operation control information comprises at least one item in limb action, the action of operation object, voice messaging, screen input or operation push-button; In interactive processing process, graphics context collection recognition device can also play particular hint acoustic information by described audio unit to user, or also can show specific content information by described display unit to user, and user can according to the specific content information of corresponding voice prompt information or display to the corresponding interactive operation control information of graphics context collection recognition device transmission, so that carry out interaction with graphics context collection recognition device.Particularly, corresponding interactive operation control information can be included in the reading operations indication information etc. that printed matter carries out, to carry out interactive operation by limb action, can be controlled to carry out interaction to reading method or reading content by interactive between the action of hand or hand-held object and graphics context collection recognition device, as the content etc. by a prearranged gesture control re-reading current location for the user reading general book.Can be identified by this interactive processing module and read the limb action of user or operation object action, so that this device can and be read between user carry out interaction, thus promote the reading experience reading user, books are vocalized media and interactive medium.

In the utility model embodiment, closely just can be undistorted to the covering of required acquisition target by corresponding multipoint images collection.Particularly, corresponding movable camera or multiple camera have multiple focal length, and institute's all parts of acquisition target so just can be made all to be in pinpointed focus, all clear to ensure the image of each several part.

Such as, with reference to shown in Figure 10, Figure 11, Figure 12, Figure 13 and Figure 14, owing to have employed the structure of multi-cam or movable camera (as can rotary head type camera), suitable shooting angle and position can be had for curved surface (the macrobending faces in the middle part of such as books), therefore effectively can carry out shooting to it to identify, corresponding shooting angle no matter directly over or inclined-plane, side effectively can take reference object, each several part all gets a distinct image and good resolution in good focal length.And corresponding multipoint images collection makes resolution ratio of camera head requirement lower, carry out taking the resolution (being more conducive to identifying) that can reach higher to captured thing with the camera of same resolution in other words.

Further, as shown in figure 15, except above-mentioned audio input unit (i.e. voice input module), handwriting input module can also be comprised, screen input module or gesture input module etc.Corresponding communication interface can be USB interface etc.Corresponding memory cell can also comprise extension storage space, as SD card etc.

The above; be only the utility model preferably embodiment; but protection range of the present utility model is not limited thereto; anyly be familiar with those skilled in the art in the technical scope that the utility model discloses; the change that can expect easily or replacement, all should be encompassed within protection range of the present utility model.Therefore, protection range of the present utility model should be as the criterion with the protection range of claims.

Claims

1. a graphics context collection recognition device, it is characterized in that, comprise: multipoint images information acquisition unit and image information recognition unit, described multipoint images information acquisition unit is connected with described image information recognition unit, and the multi-angle of the subject of collection and/or multipoint image information is sent to the foundation of described image information recognition unit as graph-text content information corresponding to image information described in the identification of described image information recognition unit; Wherein,

Institute's multipoint images information acquisition unit comprises at least one movable camera, photo angle and/or the position of described movable camera are adjustable, described movable camera is connected with drive motors, and described drive motors controls rotation and/or the movement of described movable camera; Or, described multipoint images information acquisition unit comprises multiple camera, and each camera is fixed camera or movable camera, photo angle and/or the position of described movable camera are adjustable, described movable camera is Non-follow control or is connected with drive motors, and described drive motors controls rotation and/or the movement of described movable camera; Or described multipoint images information acquisition unit comprises a fixed camera, described fixed camera comprises multiple camera lens.

2. graphics context collection recognition device according to claim 1, is characterized in that, described movable camera comprises rotatable camera head and/or packaged type camera; Or described movable camera comprises one or more movable camera lens; Described movable camera is arranged at needs the position in the region gathering described image information to comprise: above the edge in described region and/or oblique upper and/or directly over.

3. graphics context collection recognition device according to claim 1, it is characterized in that, described fixed camera comprises one or more camera lens, and described camera is arranged at and needs the position in the region gathering described image information to comprise: above the edge in described region and/or oblique upper and/or directly over.

4. graphics context collection recognition device according to claim 1, is characterized in that, described image information recognition unit comprises picture and text identification module and action recognition module.

5. the graphics context collection recognition device according to claim 1,2,3 or 4, is characterized in that, this device also comprises:

Audio unit, be connected with described image information recognition unit, and described multipoint images information acquisition unit by read page current in the books of collection or printed matter current reading location reading operations indicating positions or the image information that comprises Bibliographical Information or comprise page number information pass to described image information recognition unit, and described image information recognition unit according to described current read page or printed matter current reading location or reading operations indicating positions the image information that comprises Bibliographical Information or comprise page number information identifies this current read page or printed matter current reading location or reading operations instruction or Bibliographical Information or after audio-frequency information that the word content information of page number information is corresponding, obtain described audio-frequency information, again the audio-frequency information of described correspondence is exported with audio form by loud speaker,

And/or,

Be connected with described image information recognition unit, and for the communication unit with compunication;

And/or,

Be connected with described image information recognition unit, and for obtaining the input unit of input information.