CN109462712A - Automatic processing method for book excerpt content - Google Patents


Info

Publication number
CN109462712A
Authority
CN
China
Prior art keywords
books
server
information
processing method
page
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811281802.9A
Other languages
Chinese (zh)
Inventor
段民兴
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual
Priority to CN201811281802.9A
Publication of CN109462712A
Legal status: Pending

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/00127Connection or combination of a still picture apparatus with another apparatus, e.g. for storage, processing or transmission of still picture signals or of information associated with a still picture
    • H04N1/00249Connection or combination of a still picture apparatus with another apparatus, e.g. for storage, processing or transmission of still picture signals or of information associated with a still picture with a photographic apparatus, e.g. a photographic printer or a projector
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/03Arrangements for converting the position or the displacement of a member into a coded form
    • G06F3/041Digitisers, e.g. for touch screens or touch pads, characterised by the transducing means
    • G06F3/046Digitisers, e.g. for touch screens or touch pads, characterised by the transducing means by electromagnetic means
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/14Image acquisition
    • G06V30/148Segmentation of character regions
    • G06V30/153Segmentation of character regions using recognition of characters or words
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/06Protocols specially adapted for file transfer, e.g. file transfer protocol [FTP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/00127Connection or combination of a still picture apparatus with another apparatus, e.g. for storage, processing or transmission of still picture signals or of information associated with a still picture
    • H04N1/00204Connection or combination of a still picture apparatus with another apparatus, e.g. for storage, processing or transmission of still picture signals or of information associated with a still picture with a digital computer or a digital computer system, e.g. an internet server
    • H04N1/00244Connection or combination of a still picture apparatus with another apparatus, e.g. for storage, processing or transmission of still picture signals or of information associated with a still picture with a digital computer or a digital computer system, e.g. an internet server with a server, e.g. an internet server
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/00127Connection or combination of a still picture apparatus with another apparatus, e.g. for storage, processing or transmission of still picture signals or of information associated with a still picture
    • H04N1/00281Connection or combination of a still picture apparatus with another apparatus, e.g. for storage, processing or transmission of still picture signals or of information associated with a still picture with a telecommunication apparatus, e.g. a switched network of teleprinters for the distribution of text-based information, a selective call terminal
    • H04N1/00315Connection or combination of a still picture apparatus with another apparatus, e.g. for storage, processing or transmission of still picture signals or of information associated with a still picture with a telecommunication apparatus, e.g. a switched network of teleprinters for the distribution of text-based information, a selective call terminal with a radio transmission apparatus
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/00519Constructional details not otherwise provided for, e.g. housings, covers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/00519Constructional details not otherwise provided for, e.g. housings, covers
    • H04N1/00559Mounting or support of components or elements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/04Scanning arrangements, i.e. arrangements for the displacement of active reading or reproducing elements relative to the original or reproducing medium, or vice versa
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Human Computer Interaction (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Artificial Intelligence (AREA)
  • Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • Electromagnetism (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The present invention provides an automatic processing method for book excerpt content, comprising the following steps: the reader marks the points of interest in the book being read and logs in to the server; the book to be photographed is placed at the photographing device, and the book's information pages and the corresponding marked pages are photographed in turn and sent to the server; the server stores the received pictures and creates an excerpt text that includes the book's information content; after photographing of the current book is complete, the reader accesses the server with their identity information to read the excerpt text online, modify it, or download it for use. In the invention, the server automatically crops, recognizes, sorts, and combines the photographed pictures according to programmed steps and directly forms a document containing all the information of the current book together with the marked content. The reader can download the document or print it online at any time, which greatly simplifies the excerpting process and saves a great deal of time.

Description

Automatic processing method for book excerpt content
Technical field
The present invention relates to the field of education, and in particular to an automatic processing method for book excerpt content in which the marked content in a book is photographed to obtain the corresponding content, which is then automatically organized and summarized.
Background art
During daily study or reading, many readers mark the content that interests them in the book they are reading so that they can reread it later. Students in particular often need to write reading notes or copy out the questions they answered wrongly on test papers in order to review them.
If a book contains many marks spread widely through it, some may be missed unless the book is browsed carefully, and searching page by page defeats the purpose of marking. Excerpting or summarizing the marked content requires a great deal of copying or repeated browsing, which takes a lot of time. Moreover, if the book is borrowed, there may be no time to copy it at all.
For students, assigned books contain a large amount of content and summaries that need to be copied, and everyday test papers and workbooks require each wrongly answered question to be studied repeatedly. In this process, copying multiple passages or looking up the original questions takes so much time that students abandon their original intention or no longer have the energy to think through the meaning of the copied content.
Summary of the invention
The object of the invention is to provide an automatic processing method for book excerpt content in which the marked content in a book is photographed to obtain the corresponding content, which is then automatically organized and summarized.
In particular, the present invention provides an automatic processing method for book excerpt content, comprising the following steps:
Step 100, the reader marks the points of interest in the book being read;
Step 200, the photographing device is started and the reader logs in to the server; after verifying the reader's identity information, the server creates a storage directory corresponding to that identity information;
Step 300, the book to be photographed is placed at the photographing device, and the book's information pages and the corresponding marked pages are photographed in turn and sent to the server;
Step 400, the server stores the received pictures, recognizes the information pages and creates an excerpt text that includes the book's information content, and at the same time crops and then recognizes the marked parts of the marked-page pictures, sorts the cropped content in a predetermined order, and inserts it into the excerpt text;
Step 500, after photographing of the current book is complete, the reader accesses the server with the identity information to read the excerpt text online, modify it, or download it for use.
In one embodiment of the invention, the server is a remote server, a mobile phone, a computer connected to the photographing device, or a computer integrated with the photographing device.
In one embodiment of the invention, the mark is the content written over with a colored pen whose color differs from the text and page colors of the book; or
is written with a colored pen at the two ends of the excerpted part, to define the start and end of the mark; or
is made by completely circling the excerpted content with a colored pen.
In one embodiment of the invention, the mark is the inner portion delimited by sticky notes that are adhesive, do not damage the page, and are attached at the two ends of the corresponding page content.
In one embodiment of the invention, the information pages include at least the back cover bearing the book's barcode.
In one embodiment of the invention, the process of cropping and then recognizing the marked parts of the marked-page pictures is as follows:
1) when the excerpt text is created, it is divided into two interrelated documents: a picture document that stores pictures from the book, and a text document that stores plain-text information from the book;
2) the server divides the colors on the page into ranges according to brightness value, and then determines the marked passages in the current picture from the preset brightness range of the mark color;
3) the determined passages are cropped, and the cropped pictures, with the unmarked parts removed, are added to the picture document in page-number order; at the same time
4) OCR is performed on the text in each cropped picture, and the result is added to the text document in page-number order;
5) links that allow the specific content to be cross-referenced are provided in the picture document and the text document.
In one embodiment of the invention, the login methods include:
1) photographing the reader's identity information, which is then recognized by the server;
2) logging in through the login interface of the photographing device or the server;
3) logging in to the server with a mobile phone registered in advance.
In one embodiment of the invention, the reader's identity information is expressed on an information board provided with writing positions. The reader writes content that expresses their identity information directly at the corresponding writing positions; after writing, the information board is photographed with the photographing device and uploaded to the server, and the server recognizes the written content by a preset recognition method and converts it into specific identity information.
In one embodiment of the invention, the information board is a magnetic board on which marks are written with a magnetic pen and erased with a magnetic strip; the magnetic strip is arranged on the back of the magnetic board and can move back and forth between two opposite sides of the board. Alternatively, the information board is a rewritable whiteboard whose size is smaller than that of the smallest book that can be photographed.
In one embodiment of the invention, the writing positions on the information board are provided with non-erasable lines that delimit the writing regions, and an identification code representing the information of the current line is provided beside each line.
In one embodiment of the invention, the photographing device includes a camera unit with a camera and a photographing bracket that supports the camera unit above the region to be photographed; the camera unit is provided with a buffer that temporarily stores the content captured by the camera and a transmission module that transfers the buffer content to the server.
In one embodiment of the invention, the region to be photographed is provided with a dark identification pad on which the book to be photographed is placed and whose size is larger than the largest book that can be photographed; the side edge of the dark identification pad is provided with a control switch that controls the operation of the camera unit, and with a clamping structure for clamping the two opposite sides of the opened book.
In one embodiment of the invention, the photographing device is provided with a fixing structure for holding or clamping a pen; the fixing structure is a pen holder into which the pen is inserted directly, a clip that grips the pen body elastically, or an elastic cord attached directly to the pen.
In one embodiment of the invention, the camera unit is provided with an illumination lamp that lights the shooting environment, and with an alarm that gives an audible and visual prompt after a shot is completed.
In one embodiment of the invention, the photographing bracket includes a base for keeping the center of gravity stable and a support rod mounted vertically on the base; the support rod is a height-adjustable lifting structure and can rotate radially relative to the base, and the camera unit is mounted by one end on the upright end of the support rod.
In the present invention, the reader only needs to carry out the corresponding photographing operations to obtain all the marked content easily. The server automatically crops, recognizes, sorts, and combines the photographed pictures according to programmed steps and directly forms a document containing all the information of the current book together with the marked content. The reader can download the document or print it online at any time, which greatly simplifies the excerpting process and saves a great deal of time, while letting the reader devote their full attention to the excerpted and summarized content and increasing the enjoyment of reading and study.
Brief description of the drawings
Fig. 1 is a flow diagram of the automatic processing method for book excerpt content according to one embodiment of the invention;
Fig. 2 is a structural diagram of the photographing device according to one embodiment of the invention;
Fig. 3 is a structural diagram of the information board according to one embodiment of the invention.
Detailed description of the embodiments
As shown in Fig. 1, one embodiment of the invention discloses an automatic processing method for book excerpt content, which generally includes the following steps:
Step 100, the reader marks the points of interest in the book being read;
Here books include, but are not limited to, literary works, extracurricular reading, workbooks, and test papers.
The mark can be made by going over the full text of the corresponding content with a colored pen (marker) that meets certain conditions; or by writing with a colored pen at the two ends of the excerpted part, to define the start and end of the mark; or by completely circling the excerpted content with a colored pen. Meeting certain conditions here means that the color of the marker must be distinguishable from the colors already present in the book to be marked, to avoid confusion during later recognition.
The mark can also be made with sticky notes that are adhesive but do not damage the page, attached at the two ends of the corresponding page content to delimit the content to be marked. Ordinary sticky notes can be used; their color likewise must be distinguishable from the colors already present in the book to be marked.
The mark can also be content that the reader writes directly on the page. There may be multiple marked passages or marked locations.
Step 200, the photographing device is started and the reader logs in to the server; after verifying the reader's identity information, the server creates a storage directory corresponding to that identity information;
Here the photographing device can be a mobile phone, camera, scanner, or similar device that can capture page content, preferably one that has its own network upload function.
The login methods include:
1) photographing the reader's identity information, which is then recognized by the server;
After an item bearing the reader's identity information is captured as an image, the server recognizes the information according to its recognition rules (the specific recognition method is described below). The item may be a badge, a dedicated card, an ID card, a student card, or a similar identifying object that can prove the registration information. It may also be numeric information, alphabetic information, or a mixture of both that represents the login account.
2) logging in through the login interface of the photographing device or the server;
That is, the reader logs in directly by entering an account and password in the login interface provided.
3) logging in to the server with a mobile phone registered in advance.
Although a mobile phone is used as the example here, other smart devices such as tablet computers and personal computers are equally applicable. The device is registered with the server in advance, and when pictures are taken and uploaded, the reader is logged in after automatic identification or verification based on the phone number or account.
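The phone-based login and the per-reader storage directory of step 200 can be sketched as follows. This is a minimal illustration, not the patent's implementation: the registry `REGISTERED_PHONES`, the account name, and the function `login_by_phone` are all hypothetical names introduced here.

```python
from pathlib import Path

# Hypothetical registry of pre-registered phone numbers -> account names.
REGISTERED_PHONES = {"13800000000": "reader_001"}


def login_by_phone(phone_number, storage_root):
    """Return the reader's storage directory, or None if the phone is not registered."""
    account = REGISTERED_PHONES.get(phone_number)
    if account is None:
        return None
    reader_dir = Path(storage_root) / account
    # One storage directory per verified identity, created on first login.
    reader_dir.mkdir(parents=True, exist_ok=True)
    return reader_dir
```

Creating the directory with `exist_ok=True` means a returning reader reuses the same directory instead of triggering an error.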
Here "server" stands for any device that can carry out the processing provided by this method: a remote server, mobile phone, tablet computer, or PC, or a device with processing capability such as an FPGA or microprocessor chip integrated with the photographing device.
The server interprets the identity-information picture by a conventional recognition method and then compares the result with the saved account information to confirm the reader's identity. The conventional recognition method may use binarization to remove the picture background and interference and then extract the corresponding characters, or may recognize the text directly by OCR.
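A minimal sketch of the two parts of this check, binarization and comparison with saved accounts, is given below. The threshold of 128, the function names, and the account set are assumptions made for illustration; the character recognition between the two functions is not shown, and which side of the threshold counts as ink depends on the gray scale in use.

```python
def binarize(gray_rows, threshold=128):
    """Two-value image: 1 where a pixel is at or above the threshold, else 0."""
    return [[1 if px >= threshold else 0 for px in row] for row in gray_rows]


def verify_identity(recognized_text, saved_accounts):
    """Compare recognized text with the saved accounts; return the match or None."""
    candidate = recognized_text.strip()
    return candidate if candidate in saved_accounts else None
```

For example, `verify_identity(" reader_001\n", {"reader_001"})` returns `"reader_001"`, while an unrecognized string returns `None` and the login is refused.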
Transmission between the photographing device and the server can take place over a wired cable, a wired network, or a wireless network.
Step 300, the book to be photographed is placed at the photographing device, and the book's information pages and the corresponding marked pages are photographed in turn and sent to the server;
Here the photographing process captures the content of every marked page separately.
The information pages are the book's cover, the publication information on the inside cover, and the back cover with the barcode; the barcode is one that links to descriptive information about the book. The photographer can choose which of these pages to photograph as needed, and if the book carries other identifying information, that can also be selected, photographed, and uploaded. The information pages make it easier to look up the excerpted content later.
Step 400, the server stores the received pictures, recognizes the information pages and creates an excerpt text that includes the book's information content, and at the same time crops and then recognizes the marked parts of the marked-page pictures, sorts the cropped content in a predetermined order, and inserts it into the excerpt text;
Each transmitted single-page picture contains both marked and unmarked content, as well as the page number. The page number of the current picture is therefore recognized first, and the marked content is then cropped and its text recognized.
The cover among the information pages can be kept as a picture, while the content of the other information pages can be recognized as text to form the excerpt text, so that the reader can directly get a general idea of the book.
The cropping and recognition process for the marks is as follows:
1) when the excerpt text is created, it is divided into two interrelated documents: one is a picture document that stores pictures from the book, and the other is a text document that stores plain-text information from the book;
2) the server divides the colors on the page into ranges according to brightness value. Because the color of marks or handwriting differs from the book's original colors, dividing the color range sorts the different colors into classes; the marked passages in the current picture are then determined from the preset brightness range of the mark color.
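The range division in item 2) can be sketched as a simple classifier over pixel values. The numeric ranges below are borrowed from the description's own grayscale example (marks 1 to 50, printed text 200 to 255); the label names, the `min_fraction` cut-off, and the row-based detection are assumptions made for this illustration only.

```python
# Brightness ranges taken from the description's grayscale example.
RANGES = {"mark": (1, 50), "text": (200, 255)}


def classify(value):
    """Return the label of the range that contains this pixel value."""
    for label, (lo, hi) in RANGES.items():
        if lo <= value <= hi:
            return label
    return "background"


def marked_rows(gray_rows, min_fraction=0.2):
    """Indices of pixel rows where at least min_fraction of pixels are in the mark range."""
    rows = []
    for y, row in enumerate(gray_rows):
        hits = sum(1 for px in row if classify(px) == "mark")
        if row and hits / len(row) >= min_fraction:
            rows.append(y)
    return rows
```

Consecutive row indices returned by `marked_rows` would then be grouped into passages; that grouping step is left out to keep the sketch short.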
Take a grayscale picture as an example. The gray range is 1 to 255. The mark color may be light yellow, with gray values between 1 and 50, while the type in an ordinary book is printed in black, with gray values between 200 and 255. Image recognition selects the regions whose gray values lie between 1 and 50. If the full text of a passage was written over, the text that overlaps those gray values is taken as the marked section and recognized by OCR. If only the head and tail of a passage were marked, the marks are recognized by sequential pairing: all marks are treated as a whole and paired off in order, and the content between each pair of gray marks is recognized by OCR and taken as the marked content. When sticky notes are used for marking, the same pairing method applies, and the content between each pair of gray marks is recognized by OCR and taken as the marked content.
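The sequential pairing of head and tail marks can be sketched as below. The sketch assumes that each mark has already been reduced to the index of the text line it sits on; raising an error for an odd number of marks is a choice made here, since the description does not say how an unpaired mark is handled.

```python
def pair_marks(mark_positions):
    """Pair start/end marks in reading order: (1st, 2nd), (3rd, 4th), ..."""
    ordered = sorted(mark_positions)
    if len(ordered) % 2 != 0:
        # Assumption: an unpaired mark is treated as an error.
        raise ValueError("unpaired mark")
    return [(ordered[i], ordered[i + 1]) for i in range(0, len(ordered), 2)]


def content_between(lines, pairs):
    """Text lines enclosed by each pair of marks, both ends included."""
    return [lines[start:end + 1] for start, end in pairs]
```

With marks detected on lines 2, 4, 7, and 9, `pair_marks` yields the pairs `(2, 4)` and `(7, 9)`, and `content_between` returns the lines inside each pair for OCR.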
For a color image, recognition can use the three RGB channels. The method is the same as for grayscale, except that the gray range is replaced by the range of the corresponding color. For example, if the original type is black, the marks can be yellow or green, and selecting by the ranges of the R, G, and B channels during recognition identifies the corresponding marked section.
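A per-channel range test of this kind might look as follows. The bounds chosen for a yellow marker are an assumption for illustration; the description names yellow and green as possible mark colors but gives no numeric ranges.

```python
def is_mark_color(rgb, lo, hi):
    """True if every channel of rgb lies inside the per-channel range [lo, hi]."""
    return all(l <= c <= h for c, l, h in zip(rgb, lo, hi))


# Assumed range for a yellow marker: high red and green, low blue.
YELLOW_LO, YELLOW_HI = (200, 200, 0), (255, 255, 120)
```

Under these bounds a pixel such as `(250, 240, 60)` counts as a mark, while black type `(10, 10, 10)` and white paper `(255, 255, 255)` do not.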
3) the determined passages are cropped, and the cropped pictures, with the unmarked parts removed, are added to the picture document in page-number order;
After the content marked in the different ways has been identified, the marked or annotated part can be cut unchanged out of the original single-page picture and saved as an independent picture. When it is stored, the page number it came from must be saved with it, to make sorting easier.
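The cropping and page-ordered storage can be sketched with a picture represented as rows of pixel values. The `CroppedMark` record, the box layout `(top, bottom, left, right)`, and the function names are hypothetical and serve only to illustrate the step.

```python
from dataclasses import dataclass


@dataclass
class CroppedMark:
    page_number: int
    pixels: list  # rows of pixel values cut from the page picture


def crop(page_pixels, top, bottom, left, right):
    """Cut the rectangle [top, bottom) x [left, right) out of a page picture, unchanged."""
    return [row[left:right] for row in page_pixels[top:bottom]]


def add_to_picture_document(document, page_number, page_pixels, box):
    """Crop one marked region and keep the picture document ordered by page number."""
    document.append(CroppedMark(page_number, crop(page_pixels, *box)))
    # list.sort is stable, so marks from the same page keep their insertion order.
    document.sort(key=lambda m: m.page_number)
    return document
```

Storing the page number inside each record is what allows pages photographed out of order to end up in the right sequence.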
4) OCR is performed on the text in each cropped picture, and the result is added to the text document in page-number order.
At the same time as the marked content is cropped, or before or after it, OCR can be performed on the marked or annotated content and the result typeset again into an editable text passage. The recognized passage is likewise saved together with its page number. The text recognized from each separate mark or annotation is stored as its own paragraph, so that text from different passages or different pages is not confused.
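Assembling the text document from OCR results could be sketched as follows. The tuple layout `(page_number, position_on_page, text)` and the `[p. N]` prefix are assumptions introduced here; the OCR step itself is taken as already done.

```python
def build_text_document(recognized):
    """Join OCR results into one text, one paragraph per mark.

    recognized: list of (page_number, position_on_page, text) tuples.
    Paragraphs are ordered by page and then by position on the page.
    """
    ordered = sorted(recognized, key=lambda item: (item[0], item[1]))
    return "\n\n".join(f"[p. {page}] {text.strip()}" for page, _, text in ordered)
```

Keeping one paragraph per mark, separated by a blank line, mirrors the requirement that passages from different marks are not merged.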
5) links that allow the specific content to be cross-referenced are provided in the picture document and the text document.
In the picture document and text document finally generated for a book, each marked picture and its corresponding OCR text need to be able to call each other. For example, when viewing a cropped mark picture, a link beside it brings up the corresponding OCR content in the text document. Conversely, when reading content in the text document, a link leads to the corresponding mark picture.
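One way to picture the two-way links is a pair of lookup tables, as in this sketch. The identifiers such as `img-1` and `txt-1` are hypothetical, and the sketch assumes one OCR paragraph per cropped picture, listed in the same order.

```python
def link_documents(picture_ids, text_ids):
    """Build two-way links between cropped pictures and their OCR paragraphs."""
    if len(picture_ids) != len(text_ids):
        raise ValueError("every cropped picture needs exactly one OCR paragraph")
    picture_to_text = dict(zip(picture_ids, text_ids))
    text_to_picture = {t: p for p, t in picture_to_text.items()}
    return picture_to_text, text_to_picture
```

The first table serves the link beside a mark picture, and the second serves the link from a text paragraph back to its picture.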
Step 500, after photographing of the current book is complete, the reader accesses the server with the identity information to read the excerpt text online, modify it, or download it for use.
The processed content is stored on the server. The reader can log in to view it and perform various operations on the generated excerpt text as needed, such as downloading the whole document, printing the whole document, or printing selected parts of it.
In addition, the server can provide decorative templates into which selected marked content is embedded and then printed, for use as a motto or the like. It can also build a keyword index and a search interface, so that after the reader enters a keyword, all mark pictures or mark text in the book that contain the keyword are displayed directly.
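The keyword search over stored marks can be sketched as a plain filter. The dictionary keys `page`, `text`, and `picture` are hypothetical field names, and a real index would likely be prebuilt rather than scanned on every query.

```python
def search_marks(marks, keyword):
    """Return the marks whose recognized text contains the keyword, ignoring case.

    marks: list of dicts with the hypothetical keys "page", "text", and "picture".
    """
    needle = keyword.casefold()
    return [m for m in marks if needle in m["text"].casefold()]
```

Because each result carries both its text and its picture reference, the interface can show either form of the mark.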
In addition, the reader can edit the final excerpt text directly, for example to correct wrongly recognized characters in the original OCR output, add new annotations, or delete existing ones.
In this embodiment, the reader only needs to carry out the corresponding photographing operations. The server automatically crops, recognizes, sorts, and combines the photographed pictures according to programmed steps and directly forms a document containing all the information of the current book together with the marked content. The reader can download the document or print it online at any time, which greatly simplifies the excerpting process and saves a great deal of time, while letting the reader devote their full attention to the excerpted and summarized content and increasing the enjoyment of reading and study.
This embodiment can be used in libraries, schools, or bookstores, where many people with different needs use it. The server can also be a cloud drive or a standalone server with the photographing device attached, serving either the general public or a specific group, for example a school managing the courses and examinations of each class. Users can log in to the server remotely with their account to view, download, or print the captured content in their own storage area.
As shown in Fig. 2, one embodiment of the invention provides a specific photographing device 10. The device includes a camera unit 2 fitted with a camera 21 and a photographing bracket 22 that adjusts the shooting height of the camera unit 2. The camera unit 2 is supported above the region to be photographed by the photographing bracket 22, with the shooting direction facing downward, and contains a wireless transmission module 211 that sends the captured images wirelessly to the remote server (the server). The wireless transmission module 211 can be a Wi-Fi module, a mobile network module, or a Bluetooth module.
The photographing bracket 22 can be of split construction. It includes a base 221 that keeps the center of gravity stable and a support rod 222 mounted vertically on the base 221. The support rod 222 is a height-adjustable lifting structure and can rotate radially relative to the base 221; the camera unit 2 is mounted by one end on the upright end of the support rod 222. The lifting structure can be two sleeves fitted one inside the other, fixed relative to each other by a bolt at the side once the position is set. The position of the camera unit 2 relative to the support rod 222 can be adjusted horizontally and vertically to suit books of different thicknesses. The switch of the camera unit 2 can be placed on the base 221, and the signal cables can run through the support rod 222.
A book that has been marked accordingly is placed below the shooting part 2 and opened to a marked page. The camera 210 in the shooting part 2 takes the picture, which is then sent directly to the server side by the wireless transmission module. For a given book, a single page may be shot, or several marked pages may be shot one after another and then uploaded. The server side cuts, recognizes, and sorts the pictures according to its programmed steps. When shooting of the current book is finished, pressing the stop switch of the shooting part 2 completes the shooting session. Alternatively, a specific picture can serve as the completion signal, such as the barcode page on the back cover of the book: when the server side receives a picture containing that pattern, it sends the filming apparatus an instruction to stop shooting. To improve the shooting result, a camera 210 with automatic focusing can be used, provided it meets the requirements of text recognition, so that the reader only needs to attend to the page being shot.
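The alternative completion signal described above can be illustrated with a minimal sketch. It assumes the server side has already decided whether an uploaded picture shows the barcode page; the class name `ShootingSession`, the `has_barcode` flag, and the command strings are hypothetical and do not come from the original text.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class ShootingSession:
    """Collects uploaded pictures until a terminating picture arrives."""

    pictures: List[str] = field(default_factory=list)
    finished: bool = False

    def receive(self, picture_id: str, has_barcode: bool) -> str:
        """Store one upload and return the command sent back to the device.

        `has_barcode` stands in for whatever detection the server performs;
        the source only says that a barcode page can end the session.
        """
        if self.finished:
            return "ignored"  # uploads after the end of the session are dropped
        self.pictures.append(picture_id)
        if has_barcode:
            self.finished = True
            return "stop"  # instruct the filming apparatus to stop shooting
        return "continue"  # wait for the next marked page
```

Pressing the stop switch on the device would correspond to setting `finished` directly, without waiting for a barcode picture.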
In this embodiment the filming apparatus is a standalone device that only shoots and uploads; the actual processing is carried out by the remote server. This reduces the cost and operating complexity of the filming apparatus and lets many people share one filming apparatus, which makes it particularly suitable for independent use by individual classes in a school.
In one embodiment of the invention, the identity information of the reader can be expressed through a specific information board 1. The information board is a plate that can display written information and has a designated writing position. The reader writes directly at the writing position of the information board 1 with a suitable marker pen. Different writing points or written content represent different codes: the reader may write the digit combination of a registered account number directly, or express specific code information by writing at corresponding positions. After writing, the information board is shot with the filming apparatus 10 and uploaded to the server side, which recognizes the code and converts it into the identity information of the corresponding reader. Recognition on the server side can follow a preset rule in which a mark at a given position relative to a specific point is converted into corresponding content; for example, a mark written to the left of a particular identification point means 1, a mark to its right means 2, a mark above it means 3, and so on.
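The position rule in the example (left means 1, right means 2, above means 3) can be sketched as follows. This is only an illustration under stated assumptions: coordinates are image pixels with y growing downward, the dominant offset decides the direction, and the function name `mark_to_code` is hypothetical. The source leaves the remaining directions open, so they return `None` here.

```python
from typing import Optional, Tuple

Point = Tuple[int, int]  # (x, y) in image coordinates, y grows downward


def mark_to_code(anchor: Point, mark: Point) -> Optional[int]:
    """Convert a mark's position relative to an identification point into a code."""
    dx = mark[0] - anchor[0]
    dy = mark[1] - anchor[1]
    if dx == 0 and dy == 0:
        return None  # mark sits exactly on the identification point
    if abs(dx) >= abs(dy):
        return 1 if dx < 0 else 2  # left of the point -> 1, right -> 2
    if dy < 0:
        return 3  # above the point -> 3
    return None  # below the point: not specified in the source
```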
As shown in Fig. 3, the information board 1 can be a magnetic sheet 11, for which a conventional magnetic writing product can be used. Its principle is as follows: when a magnetic pen 12 approaches or touches the writing surface of the magnetic sheet 11, the iron filings at the back are attracted to the back of the magnetic sheet 11, and a corresponding trace appears on the writing surface. A magnetic stripe 13 erases the content currently written on the magnetic sheet 11 so that the next reader can use it. The magnetic stripe 13 is arranged at the back of the magnetic sheet 11 and can move back and forth between two opposite sides of the magnetic sheet 11. The length of the magnetic stripe 13 is at least equal to the height of the writing surface of the magnetic sheet 11, and its width need only be sufficient to pull back the iron filings attracted to the magnetic sheet 11. When written content on the magnetic sheet 11 needs to be erased, the magnetic stripe 13 is moved from one end of the magnetic sheet 11 to the other; it draws away all the iron filings that form the marks, leaving a fresh writing surface.
In use, the reader writes the corresponding identification information directly on the magnetic sheet 11 with the magnetic pen 12, for example 6054, and then places the sheet in the shooting area below the shooting part 2. The shooting part 2 shoots it and uploads the picture for identity recognition. After recognition, the reader goes on to shoot the book content. Once the magnetic sheet 11 has been used, the written information can be erased directly with the magnetic stripe 13.
To make marks easier to lay out and recognize, a fill-in region 14, in which written content represents corresponding information, can be provided on the magnetic sheet 11. For example, a dot marked at different locations represents different digits, so that different users can be identified by their number codes.
In another embodiment of the invention, the information board 1 can also be an existing erasable whiteboard (not shown in the figures), which is written on with a dedicated whiteboard marker and erased with a corresponding whiteboard eraser. The whiteboard is used in the same way as the magnetic sheet 11 described above, so the details are not repeated here. The whiteboard must also be smaller than the smallest book that can be shot, so that its color does not interfere with recognition of the shot picture. A fill-in region 14, in which written content represents corresponding information, can likewise be provided on the whiteboard.
The fill-in region 14 in each embodiment forms a specific shape. After the reader writes corresponding marks at different positions, the server side can identify the account information represented by the written content according to the meaning preset for each position. In this embodiment the fill-in region 14 is a table drawn with non-erasable lines, and the side of the table carries identification codes representing the information of the current row or column. After marks are entered in different cells of the table, their meaning can be recognized automatically from the identification codes. For example, the table is in a 4×4 format: the first row across is labeled x and the first column down is labeled y, and the remaining 3×3 cells are for the user to write in. The three cells of the second row represent 7, 8, 9; those of the third row represent 4, 5, 6; and those of the fourth row represent 1, 2, 3. Thus 7 corresponds to (x1, y1), 4 corresponds to (x1, y2), 6 corresponds to (x3, y2), and so on, so that the marks in the cells can be converted into the corresponding digit combination.
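The 4×4 table example can be written out as a small lookup. This is a sketch under one assumption the source does not settle: the marked cells are supplied in the order in which the digits should be read. The names `KEYPAD`, `cell_to_digit`, and `decode_marks` are illustrative only.

```python
from typing import Iterable, Tuple

# Layout from the example: row y1 holds 7 8 9, row y2 holds 4 5 6, row y3 holds 1 2 3.
KEYPAD = (
    (7, 8, 9),
    (4, 5, 6),
    (1, 2, 3),
)


def cell_to_digit(x: int, y: int) -> int:
    """Return the digit for the cell at column x and row y (both 1 to 3)."""
    if not (1 <= x <= 3 and 1 <= y <= 3):
        raise ValueError(f"cell (x{x}, y{y}) is outside the 3x3 writing area")
    return KEYPAD[y - 1][x - 1]


def decode_marks(marks: Iterable[Tuple[int, int]]) -> str:
    """Convert a sequence of marked (x, y) cells into a digit string."""
    return "".join(str(cell_to_digit(x, y)) for x, y in marks)
```

With this layout, the cells (x1, y1), (x1, y2), (x3, y2) from the example decode to the digits 7, 4, and 6.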
In one embodiment of the invention, to make it easier to adjust the shooting position, a dark identification pad 3 for holding the book to be captured can be placed at the region to be captured below the shooting part 2. When shooting, the reader only needs to lay the book open on the dark identification pad 3 for the book to be within the standard shooting area.
The dark identification pad 3 contrasts visually with conventional white book pages, which makes later recognition of the shot picture easier. In addition, so that the shot picture has an easily recognized border all around, the dark identification pad 3 is preferably larger than the largest book that can be shot. Range markers for book sizes such as A5, A4, and A3 can also be marked on the dark identification pad 3.
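One way the contrast between a dark pad and light pages could be exploited is a simple brightness threshold that finds the page's bounding box. The sketch below is not taken from the source: it assumes the picture is a rectangular grid of grayscale values from 0 to 255, and the threshold of 128 and the function name `find_page_bounds` are arbitrary choices for illustration.

```python
from typing import List, Optional, Tuple


def find_page_bounds(
    gray: List[List[int]], threshold: int = 128
) -> Optional[Tuple[int, int, int, int]]:
    """Return (top, left, bottom, right), inclusive, of pixels brighter than threshold.

    Returns None when no pixel is brighter than the threshold, i.e. when
    only the dark pad is visible.
    """
    rows = [r for r, row in enumerate(gray) if any(v > threshold for v in row)]
    if not rows:
        return None
    width = len(gray[0])
    cols = [c for c in range(width) if any(row[c] > threshold for row in gray)]
    return rows[0], cols[0], rows[-1], cols[-1]
```

Because the pad is larger than the largest book, the bright region is surrounded by dark pixels on every side, which is what lets a single threshold separate the two.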
The information board 1 can be made in one piece with the dark identification pad 3 or placed separately, as long as it does not interfere with shooting the book.
Further, a control switch that controls the operation of the shooting part 2 can be arranged at the edges of the dark identification pad 3 on the two opposite sides of the open book. Some books close by themselves unless held open by hand, and a reader holding the book open cannot reach over to press the control switch of the shooting part 2. Placing the control switch of the shooting part 2 on the dark identification pad 3 therefore lets the reader touch the switch while holding the book open, so that the picture can be taken immediately.
In other embodiments, a clamping structure (not shown in the figures) for holding the two opposite sides of the open book, such as an elastic strip or an elastic clip, can be installed at a corresponding position on the dark identification pad 3. The elastic strip lies flat against the dark identification pad 3 when not in use; in use, the sides of the open book are clamped between the elastic strip and the dark identification pad 3, which holds the pages in place. Elastic clips, by contrast, each clamp the pages on one side of the open book and keep the book open.
To make it convenient for the reader to record information, a fixing structure 23 for holding or clamping a pen 5 can be provided on the filming apparatus 10, such as a pen holder into which the pen 5 is inserted directly, a clip that grips the pen body elastically, or an elastic cord attached directly to the pen 5. The fixing structure 23 can be located at the shooting bracket 22. The pen 5 may be a writing pen for writing content directly, or an electronic pen for marking the corresponding content in books.
To obtain the best shooting result, a headlamp 212 that lights the shooting environment can be provided on the shooting part 2. The headlamp 212 may adjust its brightness automatically according to ambient brightness, or may simply output a constant brightness. In addition, to tell the reader whether shooting has finished, an alarm 4 that gives an audible or visual prompt after shooting can be provided on the filming apparatus 10. For example, each time a page has been shot and uploaded and the apparatus automatically enters the shooting state for the next page, the alarm 4 gives a light or sound prompt reminding the reader to turn to the next extract page or carry on with further work.
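The automatic brightness adjustment is not specified further in the source. As a purely illustrative sketch, the lamp output could be set to make up the difference between ambient light and a target level; the function name `headlamp_level`, the target of 500 lux, and the lamp's maximum contribution of 400 lux are all assumed values.

```python
def headlamp_level(
    ambient_lux: float, target_lux: float = 500.0, max_output_lux: float = 400.0
) -> float:
    """Return the lamp drive level in [0.0, 1.0] needed to reach the target.

    All numeric defaults are hypothetical; a real device would calibrate them.
    """
    deficit = target_lux - ambient_lux
    return min(1.0, max(0.0, deficit / max_output_lux))
```

A constant-brightness lamp, the other option mentioned above, corresponds to ignoring `ambient_lux` and always returning the same level.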
Thus, those skilled in the art will appreciate that, although several exemplary embodiments of the invention have been shown and described in detail herein, many other variations or modifications consistent with the principles of the invention can be directly determined or derived from the present disclosure without departing from the spirit and scope of the invention. The scope of the invention should therefore be understood and deemed to cover all such other variations or modifications.

Claims (15)

1. An automatic processing method for book extract content, characterized by comprising the following steps:
Step 100: the reader marks points of interest in the book being read;
Step 200: the filming apparatus is started and logs in to the server side; after verifying the identity information of the reader, the server side establishes a storage directory corresponding to the identity information;
Step 300: the book to be captured is placed at the filming apparatus, and the information page of the book and the corresponding marked pages are shot in turn and sent to the server side;
Step 400: the server side stores the received pictures, recognizes the information page, and creates an extract text containing the book information; at the same time it cuts out the marked portions of the marked-page pictures, recognizes them, sorts the cut content in a predetermined order, and inserts it into the extract text;
Step 500: after shooting of the current book is completed, the reader enters the server side using the identity information and reads online, modifies, or downloads the extract text.
2. The automatic processing method for book extract content according to claim 1, characterized in that
the server side is a remote server, a mobile phone, a computer connected to the filming apparatus, or a computer integrated with the filming apparatus.
3. The automatic processing method for book extract content according to claim 1, characterized in that
the marking is done with a colored pen whose color differs from the text and page color of the book, by writing over the corresponding content, by circling the corresponding content completely, or by writing at the two ends of the corresponding content to delimit its range.
4. The automatic processing method for book extract content according to claim 1, characterized in that
the marking is done by attaching colored sticky notes, whose color differs from the text and page color of the teaching-aid book and which do not damage the page, at the two ends of the corresponding content respectively.
5. The automatic processing method for book extract content according to claim 1, characterized in that
the information page includes at least the back cover of the book bearing the barcode.
6. The automatic processing method for book extract content according to claim 1, characterized in that
the process of cutting out the marked portions of the marked-page pictures and then recognizing them is as follows:
1) when the extract text is created, it is divided into two mutually associated documents: a picture document that stores pictures from the book, and a text document that stores the plain text information from the book;
2) the server side divides the colors on the page into ranges by brightness value, and then determines the paragraphs marked in the current picture from the preset brightness range of the marking color;
3) the determined paragraphs are cut out, and the pictures remaining after the unmarked portions are removed are added to the picture document in page-number order; at the same time
4) OCR is performed on the text in each cut picture, and the results are added to the text document in page-number order;
5) links through which each document can reference specific content in the other are provided in the picture document and the text document.
7. The automatic processing method for book extract content according to claim 1, characterized in that the login modes include:
1) the reader's identity information is entered by shooting and is recognized by the server side;
2) logging in through the login interface of the filming apparatus or of the server side;
3) logging in to the server side through a pre-registered mobile phone.
8. The automatic processing method for book extract content according to claim 7, characterized in that
the reader's identity information is expressed through an information board on which a writing position is provided; the reader writes content expressing his or her identity information directly at the corresponding place in the writing position; after writing, the information board is shot with the filming apparatus and uploaded to the server side; and the server side recognizes the written content by a preset identification method and converts it into specific identity information.
9. The automatic processing method for book extract content according to claim 8, characterized in that
the information board is a magnetic sheet on which marks are written with a magnetic pen and, after writing, erased with a magnetic stripe, the magnetic stripe being arranged at the back of the magnetic sheet and movable back and forth between two opposite sides of the magnetic sheet; or the information board is an erasable whiteboard whose size is smaller than that of the smallest book that can be shot.
10. The automatic processing method for book extract content according to claim 8, characterized in that
the writing position on the information board is provided with a writing region that lays out the writing and with non-erasable lines, and identification codes representing the information of the current row are provided beside the lines.
11. The automatic processing method for book extract content according to claim 1, characterized in that
the filming apparatus includes a shooting part with a camera and a shooting bracket that supports the shooting part above the region to be captured; the shooting part is provided with a buffer that temporarily stores the content shot by the camera and a transmission module that transfers the buffer content to the server side.
12. The automatic processing method for book extract content according to claim 11, characterized in that
a dark identification pad, which holds the book to be captured and is larger than the largest book that can be shot, is provided at the region to be captured of the filming apparatus; the side edges of the dark identification pad are provided with a control switch that controls the operation of the shooting part and a clamping structure that holds the two opposite sides of the open book.
13. The automatic processing method for book extract content according to claim 12, characterized in that
a fixing structure for holding or clamping a pen is provided on the filming apparatus; the fixing structure is a pen holder into which the pen is inserted directly, a clip that grips the pen body elastically, or an elastic cord attached directly to the pen.
14. The automatic processing method for book extract content according to claim 13, characterized in that
a headlamp that lights the shooting environment is provided on the shooting part; an alarm that gives an audible and visual prompt after shooting is completed is provided on the shooting part.
15. The automatic processing method for book extract content according to claim 14, characterized in that
the shooting bracket includes a pedestal for keeping the center of gravity stable and a support rod mounted vertically on the pedestal; the support rod is a height-adjustable lifting structure and can rotate radially relative to the pedestal; the shooting part is mounted by one end on the upright end of the support rod.
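Steps 2) and 3) of claim 6 describe locating marked paragraphs by the brightness range of the marking color and cutting them out. A minimal sketch of that idea follows. It assumes a grayscale picture given as rows of values from 0 to 255 and treats a row as marked when a sufficient fraction of its pixels falls inside the preset range; the function names and the `min_fraction` parameter are hypothetical, and an actual implementation may segment the page quite differently.

```python
from typing import List, Tuple

Image = List[List[int]]  # rows of grayscale values, 0 (black) to 255 (white)


def marked_row_spans(
    gray: Image, lo: int, hi: int, min_fraction: float = 0.2
) -> List[Tuple[int, int]]:
    """Return inclusive (start_row, end_row) spans whose pixels fall in [lo, hi]."""
    spans: List[Tuple[int, int]] = []
    start = None
    for r, row in enumerate(gray):
        hits = sum(1 for v in row if lo <= v <= hi)
        marked = bool(row) and hits / len(row) >= min_fraction
        if marked and start is None:
            start = r  # a marked paragraph begins
        elif not marked and start is not None:
            spans.append((start, r - 1))  # the paragraph ended on the previous row
            start = None
    if start is not None:
        spans.append((start, len(gray) - 1))
    return spans


def crop_spans(gray: Image, spans: List[Tuple[int, int]]) -> List[Image]:
    """Cut each marked span out of the picture, discarding unmarked rows."""
    return [gray[a : b + 1] for a, b in spans]
```

Each cropped image would then go to the picture document and, after OCR, to the text document, both in page-number order as the claim requires.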
CN201811281802.9A 2018-10-31 2018-10-31 A kind of books extracts content automatic processing method Pending CN109462712A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811281802.9A CN109462712A (en) 2018-10-31 2018-10-31 A kind of books extracts content automatic processing method

Publications (1)

Publication Number Publication Date
CN109462712A true CN109462712A (en) 2019-03-12

Family

ID=65608980

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811281802.9A Pending CN109462712A (en) 2018-10-31 2018-10-31 A kind of books extracts content automatic processing method

Country Status (1)

Country Link
CN (1) CN109462712A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112137287A (en) * 2020-09-25 2020-12-29 华北水利水电大学 Device for extracting language characters
CN113361404A (en) * 2021-06-02 2021-09-07 北京百度网讯科技有限公司 Method, apparatus, device, storage medium and program product for recognizing text

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101237500A (en) * 2006-12-14 2008-08-06 三星电子株式会社 Image forming apparatus and method of controlling the same
CN101916327A (en) * 2010-07-09 2010-12-15 北京商纳科技有限公司 Method and system for generating wrong answer list
CN102811303A (en) * 2011-05-30 2012-12-05 苏州巴米特信息科技有限公司 Multifunctional scanning pen
CN103514297A (en) * 2013-10-16 2014-01-15 上海合合信息科技发展有限公司 Method and device for increasing annotation data in text and method and device for querying annotation data in text
CN104735205A (en) * 2015-04-23 2015-06-24 刘贝 Mobile phone holder and utilization method thereof
CN205792932U (en) * 2016-06-03 2016-12-07 北京好运到信息科技有限公司 A kind of portable unit for shooting file and picture
CN206850879U (en) * 2017-07-13 2018-01-05 成都航空职业技术学院 A kind of reading note scanning means

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20190312