
CN101615253B - System and method for instantly identifying file contents

Info

Publication number: CN101615253B
Application number: CN 200810130719
Authority: CN
Inventors: 范钦雄; 卢凯杰
Original assignee: Individual
Current assignee: Individual
Priority date: 2008-06-27
Filing date: 2008-06-27
Publication date: 2012-12-05
Anticipated expiration: 2028-06-27
Also published as: CN101615253A

Abstract

The invention discloses a system for instantly identifying file contents, which can instantly identify a file with a specific structure. The system comprises a document structure analysis device, a reading flow setting device, a locating device and an identification device, wherein the document structure analysis device marks a file as a plurality of blocks according to at least one structural feature of the file; the reading flow setting device sets a reading flow for reading the blocks; the locating device locates the block currently being read; and the identification device identifies the block currently being read and outputs its content. The invention can be applied in the field of robotics to design a robot that reads files the way a person does.

Description

System and method for instantly identifying file contents

Technical field

The present invention relates to an identification system and method, and in particular to a system and method for instantly identifying file contents.

Background technology

In daily life, we often need to convert various paper documents into editable files. In general, a document must first be scanned into an image file, and optical character recognition (OCR) software is then used to identify the characters in it. Alternatively, a handheld scanning pen is used to scan and identify the text manually, word by word. For file identification, however, the former lacks mobility, and the latter cannot process large numbers of files automatically.

In the field of robotics, developing robot vision is a trend. A robot capable of instant identification behaves more like a human, and this capability is a target in robot vision applications that urgently awaits a breakthrough. If a robot could read a file as a human does, reading as it sees, it would have business potential in various fields, for example in service robots.

In conventional file reading technology, however, a high-resolution digital camera (or scanner) captures (or scans) the whole file at once, and the resulting image is then identified. This identification method needs a large amount of memory, and the identification process takes a long time.

Another identification method uses a low-resolution video camera to capture the whole file in several shots, applies skew correction to each partial image, stitches the partial images into one large image, and then identifies that large image. The skew correction and image stitching steps require a large amount of computing time. In addition, image quality is hard to control with this method.

Therefore, the conventional identification methods above are not suitable for instantly identifying file contents, nor do they resemble the way a real person reads a file. It is thus necessary to develop a new identification method that, when applied in the field of robotics, gives a robot the ability to read files as a person does.

Summary of the invention

A first object of the present invention is to provide a system and method for instantly identifying file contents, that is, a system or method that can identify file contents in real time.

A second object of the present invention is to provide a system and method for instantly identifying file contents that can identify a file having a specific structure.

A third object of the present invention is to provide a system and method for instantly identifying file contents that can emulate the way a person reads a file.

According to the above objects, the present invention provides a system for instantly identifying file contents, comprising: a document structure analysis device for marking a file as several blocks according to at least one structural feature of the file; a reading flow setting device for setting a reading flow for reading the blocks; a locating device for locating the block currently being read; and an identification device for identifying the block currently being read and outputting its content.

According to the above objects, the present invention also provides a method for instantly identifying file contents, comprising: marking a file as several blocks according to at least one structural feature of the file; setting a reading flow for reading the blocks; locating the block currently being read; and identifying the block currently being read and outputting its content.

With the present invention, the contents of various types of files having a specific structure can be identified instantly, for example books and newspapers, maps, musical scores, engineering blueprints, and piping and wiring diagrams.

In a natural scene, a real document may appear twisted or deformed. The present invention can use visual detection and tracking techniques to determine the position of the file and to account for skewed placement. In addition, the resolution of a block image can be increased by zooming in on the marked block in the file, which makes the block content easier to identify.

The present invention can be applied to a robot that reads different types of files. Because the robot reads as it sees, instant identification is achieved, and with little human intervention the robot can identify a large number of files in sequence and thereby read them. In addition, the identified file content can be converted into a voice signal so that the robot reads the content aloud.

In the field of robotics, the present invention can be applied to, for example, educational robots, entertainment and leisure robots, and medical assistance robots, and may also be applied in other fields.

Description of drawings

Fig. 1 shows a schematic diagram of the system for instantly identifying file contents of the present invention.

Fig. 2 shows a flowchart of the method for instantly identifying file contents of the present invention.

Fig. 3 shows an example of an identification method applied to identifying an English file.

Embodiment

Fig. 1 shows a schematic diagram of the system for instantly identifying file contents of the present invention. The system 10 mainly comprises a document structure analysis device 121, a reading flow setting device 122, a locating device 133 and an identification device 136. A structured file has certain features, for example the paragraphs of an English file or its words separated by blank spaces. The present invention exploits these features: the document structure analysis device 121 marks the file as several blocks, and the reading flow setting device 122 sets a reading flow for reading the blocks marked by the document structure analysis device 121. The locating device 133 receives the reading flow set by the reading flow setting device 122. While the reading flow is being executed, the locating device 133 locates the block currently being read. After the locating device 133 has located that block, the identification device 136 identifies it and outputs its content.
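
The cooperation of the four devices can be sketched as a small pipeline. This is a minimal sketch, not the implementation described for Fig. 1: the `Block` type, the `read_file` function and the four callables are hypothetical names introduced here, each callable standing in for the role of one device (121, 122, 133, 136).

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List


@dataclass(frozen=True)
class Block:
    """A marked region of the file (hypothetical representation)."""
    x: int       # left edge, in page coordinates
    y: int       # top edge, in page coordinates
    width: int
    height: int


def read_file(
    page: object,
    analyze: Callable[[object], List[Block]],             # role of device 121
    set_flow: Callable[[List[Block]], Iterable[Block]],   # role of device 122
    locate: Callable[[Block], None],                      # role of device 133
    identify: Callable[[Block], str],                     # role of device 136
) -> List[str]:
    """Mark blocks, order them, then locate and identify one block at a time."""
    contents = []
    for block in set_flow(analyze(page)):
        locate(block)                      # point at the block being read
        contents.append(identify(block))   # output its content right away
    return contents
```

Because each block is located and identified before the next one is touched, no image of the whole file has to be held in memory, which is the contrast with the conventional methods described in the background.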

Fig. 2 shows a flowchart of the method for instantly identifying file contents of the present invention. Please refer to Fig. 1 and Fig. 2 together. The identification of an English file is described below as an embodiment of the present invention.

First, in step S202, the visual detection and tracking device 110 detects whether a file is present and, if so, determines the position of the file (step S204). The position of the file may change for various reasons. In that case, the visual detection and tracking device 110 searches for the file within a range and, if it finds the file, replaces the previously recorded position with the new position.

In step S206, once the visual detection and tracking device 110 has detected the file, the document structure analysis device 121 marks each word or symbol delimited by blank spaces as a block; these blocks are referred to here as word blocks.
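
One way to mark word blocks in a single line of text is to look for runs of empty pixel columns. The sketch below assumes the line has already been binarized (1 = ink, 0 = background); the function name `mark_word_blocks` and the `min_gap` threshold are assumptions made for illustration, since the text does not say how blank spaces are detected.

```python
from typing import List, Tuple


def mark_word_blocks(line: List[List[int]], min_gap: int = 2) -> List[Tuple[int, int]]:
    """Return (start, end) column spans of the words in one binarized text line.

    `line` is a list of pixel rows with 1 = ink and 0 = background.
    A run of at least `min_gap` empty columns counts as a blank space between
    words; shorter runs are treated as spacing between characters.
    """
    if not line:
        return []
    width = len(line[0])
    # A column contains ink if any row has a 1 in that column.
    ink = [any(row[c] for row in line) for c in range(width)]

    spans = []
    start = None      # first column of the word being built
    last_ink = None   # most recent column that contained ink
    for c, has_ink in enumerate(ink):
        if not has_ink:
            continue
        if start is None:
            start = c
        elif c - last_ink - 1 >= min_gap:
            # The gap is wide enough: close the current word, start a new one.
            spans.append((start, last_ink + 1))
            start = c
        last_ink = c
    if start is not None:
        spans.append((start, last_ink + 1))
    return spans
```

Each returned span is half-open, so `end - start` is the width of the word block in pixels.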

In step S208, the reading flow setting device 122 sets the reading flow for the word blocks marked by the document structure analysis device 121. The simplest reading flow reads the word blocks from left to right and from top to bottom.
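
The left-to-right, top-to-bottom flow can be sketched as a sort over block positions. The `(x, y, width, height)` tuple layout and the `line_tolerance` parameter, which groups blocks of slightly different height into the same text line, are assumptions introduced for this example.

```python
from typing import List, Tuple

Box = Tuple[int, int, int, int]  # (x, y, width, height), hypothetical layout


def reading_flow(blocks: List[Box], line_tolerance: int = 5) -> List[Box]:
    """Order blocks left to right within a line, and lines top to bottom."""
    lines: List[List[Box]] = []
    for box in sorted(blocks, key=lambda b: b[1]):   # top to bottom
        # A block joins the current line if its top edge is close to that
        # of the line's first block; otherwise it starts a new line.
        if lines and abs(box[1] - lines[-1][0][1]) <= line_tolerance:
            lines[-1].append(box)
        else:
            lines.append([box])
    # Within each line, read from left to right.
    return [box for line in lines for box in sorted(line, key=lambda b: b[0])]
```

Other reading flows, such as right-to-left scripts or column layouts, would only change the sort keys.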

In step S230, following the reading flow set in step S208, the locating device 133 locates the word blocks one by one. The locating device 133 controls a motor 144 to point the lens of an image capture device 145 at the next word block to be read. The word block that the lens of the image capture device 145 faces is the word block currently being read. The locating device 133 performs the same locating step for every word block.
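
How the motor is commanded is not detailed in the text. As a rough sketch under a simplified geometry, where the camera faces the page squarely and the pan and tilt axes are treated independently, the angles needed to point the lens at a block center could be computed as follows. The function name and its parameters are hypothetical.

```python
import math
from typing import Tuple


def pan_tilt_for_block(
    block_center: Tuple[float, float],
    camera_axis: Tuple[float, float],
    distance: float,
) -> Tuple[float, float]:
    """Return (pan, tilt) in degrees that point the lens at a block center.

    All lengths share one unit. `camera_axis` is the point on the page that
    the lens faces at pan = tilt = 0, and `distance` is the lens-to-page
    distance. Treating the two axes independently is a simplification.
    """
    if distance <= 0:
        raise ValueError("distance must be positive")
    dx = block_center[0] - camera_axis[0]
    dy = block_center[1] - camera_axis[1]
    pan = math.degrees(math.atan2(dx, distance))
    tilt = math.degrees(math.atan2(dy, distance))
    return pan, tilt
```

A real system would also need calibration between page coordinates and motor steps, which is outside the scope of this sketch.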

In step S232, the image capture device 145 captures an image of each word block being read. The captured image can be stored as a file in any of various image formats, such as an uncompressed BMP file or a compressed JPEG file, or it can be written directly to memory. Because the resolution of the captured image data may be too low, the word block being read can be zoomed in on in this step to obtain higher-resolution image data, which solves the problem of a word being hard to identify because it is made up of too few pixels.
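
The zoom decision can be illustrated with a short calculation. The 20-pixel minimum character height and the 10x zoom limit below are assumed values chosen for the example, not figures from the text, and `zoom_factor` is a hypothetical helper.

```python
def zoom_factor(
    char_height_px: float,
    min_height_px: float = 20.0,
    max_zoom: float = 10.0,
) -> float:
    """Return the magnification needed for a character to reach min_height_px."""
    if char_height_px <= 0:
        raise ValueError("char_height_px must be positive")
    needed = min_height_px / char_height_px
    # Never zoom out below 1x, and never exceed what the lens can deliver.
    return max(1.0, min(max_zoom, needed))
```

For instance, a character that is 5 pixels tall at the current zoom would call for 4x magnification under these assumed thresholds.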

In step S236, the image data captured by the image capture device 145 is sent to the identification device 136. The identification device 136 uses optical character recognition (OCR) to identify the image data of the word block being read and then outputs the content of that word block. The output can be character codes such as ASCII (American Standard Code for Information Interchange), which can be edited directly on an ordinary PC or converted into other signals.

In step S238, the voice conversion device 137 converts the content of the word block into a voice signal.

If the reading flow set in step S208 has been completed, the system returns to step S202 to detect whether another file is present. Otherwise, it returns to step S230 and continues with locating, capturing and identifying the next word block.

Moreover, the locating device 133 can also locate a partial block of the word block being read, for example a character that makes up the word. In that case, the image capture device 145 captures each character separately and the identification device 136 identifies each character. The identified characters are then combined into the word.

Fig. 3 shows an example of an identification method applied to identifying an English file. The image of a word block obtained in steps S230 and S232 can be identified in the following steps. Taking the word "robot" as an example, the position of the target character is determined first, for example the initial character "r", and the image of the character "r" is captured (step S356). The image of the character "r" is normalized, that is, the captured character image is adjusted to a fixed size (step S358). The image of the character "r" is converted to a black-and-white image in which the value of each pixel is 0 or 1, that is, it is binarized (step S360). In step S362, features are extracted from the binarized digital data and, using a database of previously trained sample characters, the features of the captured character "r" are compared with the trained sample character set to identify the character. If all of the characters "r", "o", "b", "o" and "t" have been identified, identification of the word ends; otherwise identification continues with the next character (step S368). In step S370, the next target character position is determined, for example "o". Finally, the identified characters are combined into the word.
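
The per-character steps can be sketched with a toy nearest-sample classifier. The text does not say which features or which comparison are used, so the choices below are assumptions: nearest-neighbour resizing for normalization, a fixed threshold for binarization with dark ink mapped to 1, the flattened binary image as the feature vector, and Hamming distance for the comparison with the trained samples. All function names are hypothetical.

```python
from typing import Dict, List

Image = List[List[float]]  # grayscale pixel rows, values in [0, 1]


def normalize(img: Image, size: int = 8) -> Image:
    """Resize to size x size by nearest-neighbour sampling (step S358)."""
    h, w = len(img), len(img[0])
    return [[img[r * h // size][c * w // size] for c in range(size)]
            for r in range(size)]


def binarize(img: Image, threshold: float = 0.5) -> List[List[int]]:
    """Map each pixel to 0 or 1 (step S360); dark ink becomes 1 (assumed)."""
    return [[1 if p < threshold else 0 for p in row] for row in img]


def features(bits: List[List[int]]) -> List[int]:
    """Flatten the binary image into a feature vector (step S362)."""
    return [b for row in bits for b in row]


def identify_char(img: Image, samples: Dict[str, List[int]], size: int = 8) -> str:
    """Return the label of the trained sample closest in Hamming distance."""
    vec = features(binarize(normalize(img, size)))
    return min(
        samples,
        key=lambda ch: sum(a != b for a, b in zip(vec, samples[ch])),
    )


def identify_word(
    char_images: List[Image], samples: Dict[str, List[int]], size: int = 8
) -> str:
    """Identify each character in turn, then combine the results into a word."""
    return "".join(identify_char(img, samples, size) for img in char_images)
```

Here `samples` plays the role of the trained sample character set: it maps each label to a feature vector of length `size * size` produced by the same three steps.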

It should be noted that when a structured file is marked into blocks in step S206, two or more structural features can be used. For example, an English file can be divided into paragraphs, lines and words, and blocks are marked according to these three structural features. A reading flow over these three structures is then set, for example reading the first word of the first line of the first paragraph first.
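
A reading flow over the three structural levels can be sketched as a nested traversal. The nested-list representation of a file (paragraphs containing lines containing words) and the name `hierarchical_flow` are assumptions made for this example.

```python
from typing import Iterator, List, Tuple

Document = List[List[List[str]]]  # paragraphs -> lines -> words


def hierarchical_flow(doc: Document) -> Iterator[Tuple[int, int, int, str]]:
    """Yield (paragraph, line, word, content) in reading order, 1-based."""
    for p, paragraph in enumerate(doc, start=1):
        for l, line in enumerate(paragraph, start=1):
            for w, word in enumerate(line, start=1):
                yield p, l, w, word
```

The first item yielded is the first word of the first line of the first paragraph, which matches the example flow given above.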

In addition, besides the embodiment above in which the word is the block to be identified, embodiments in which the paragraph or the line is the block can likewise be implemented according to the present invention.

More specifically, in the present invention the image capture device can be a low-resolution PTZ (pan-tilt-zoom) camera of the kind commonly used for video surveillance. Such a camera can pan and tilt over a wide angle, focus automatically and zoom at high magnification, and it can be mounted on a fixed or mobile platform as required, which gives it good mobility and independence.

Claims

1. A system for instantly identifying file contents, characterized in that the system comprises:

a document structure analysis device for marking a file as several blocks according to at least one structural feature of the file;

a reading flow setting device for setting a reading flow for reading the blocks;

a locating device for locating the block currently being read; and

an identification device for identifying the block currently being read and outputting the content of that block.

2. The system for instantly identifying file contents according to claim 1, characterized in that the system further comprises a voice conversion device for converting the content of the block being read into a voice signal.

3. The system for instantly identifying file contents according to claim 1, characterized in that the locating device locates the block being read by controlling a motor.

4. The system for instantly identifying file contents according to claim 1, characterized in that the system further comprises an image capture device for capturing the block being read as image data, wherein the identification device identifies the image data of the block being read to output the content of that block.

5. The system for instantly identifying file contents according to claim 1, characterized in that the locating device locates a partial block of the block being read, and the identification device identifies the partial block to output the content of the partial block.

6. A method for instantly identifying file contents, characterized in that the method comprises:

marking a file as several blocks according to at least one structural feature of the file;

setting a reading flow for reading the blocks;

locating the block currently being read; and

identifying the block currently being read and outputting the content of that block.

7. The method for instantly identifying file contents according to claim 6, characterized in that the method further comprises converting the content of the block being read into a voice signal.

8. The method for instantly identifying file contents according to claim 6, characterized in that the method further comprises capturing the block being read as image data, wherein in the identifying step the image data of the block being read is identified to output the content of that block.

9. The method for instantly identifying file contents according to claim 6, characterized in that the method further comprises locating a partial block of the block being read.

10. The method for instantly identifying file contents according to claim 9, characterized in that the method further comprises identifying the partial block to output the content of the partial block.