CN1319817A - System and method for establishing personalized file in electronic form - Google Patents

System and method for establishing personalized file in electronic form Download PDF

Info

Publication number
CN1319817A
CN1319817A CN01112120A CN01112120A CN1319817A CN 1319817 A CN1319817 A CN 1319817A CN 01112120 A CN01112120 A CN 01112120A CN 01112120 A CN01112120 A CN 01112120A CN 1319817 A CN1319817 A CN 1319817A
Authority
CN
China
Prior art keywords
document
individualized
user
search
electronic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN01112120A
Other languages
Chinese (zh)
Other versions
CN1127031C (en
Inventor
安·纽曼-科林斯
唐·鲁特勒支·戴
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
International Business Machines Corp
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corp filed Critical International Business Machines Corp
Publication of CN1319817A publication Critical patent/CN1319817A/en
Application granted granted Critical
Publication of CN1127031C publication Critical patent/CN1127031C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • G06F40/169Annotation, e.g. comment data or footnotes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Document Processing Apparatus (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The invention provides a method, a system and a program product for electronically preparing a customized document from at least one electronic reference document. In this method, first of all, an electronic reference document is selected. Next, the electronic reference document is analyzed into subcomponents automatically. Next in this method, a customized document is prepared by automatically summarizing similar items out of the subcomponents and the document is automatically outputted, together with one set of navigation affordance based on the selection of a user before document generation. In this method, notes inputted by the user before document generation are automatically contained in the customized document and indexes for customized document are prepared on the basis of the index option and index depth of user selection.

Description

Create the system and method for individualized document in the electronics mode
Generally speaking, the present invention relates to create electronic document, particularly the combination as the reference document produces electronic document.More particularly, the present invention relates to a kind of method, system and program product, be used to analyze some segments of reference material or reference material, and these segments are gathered together by certain sortord, have the individualized document of homing capability with establishment.The invention still further relates to a kind of method, system and program product, be used for the individual electronic document being indexed and massaging according to the keyword of user's suggestion.
Utilizing one or more other information sources is a general function in current science or the business environment as the synthetic individualized document of reference material.Student or professional etc. often wish by the information source edit file of having delivered (or data) to create they individual's works.For example, one can be utilized historical book, newspaper or magazine article that finds in the electronic databank (as the Internet) and the report of writing recently as a reference the student who studies history who writes about the comprehensive article of American Revolution.In typical document combined process, in the document of being created, include only the relevant portion in the reference material.The document of being created is linked at together with a kind of form of adhesive aggregation normally title, index, with reference to individual's notes and commentary of paragraph and author and finishes.
Developing rapidly of the Internet partly because current available bulk information is arranged on it, makes the Internet become one of data retrieval resource that is widely used most.The current the Internet that uses allows the user to squeeze into a search inquiry, and as the response of this inquiry is received to the hypertext link of some websites on the Internet, has the information relevant with searching request on those websites.In current internet environment, this information finds on WWW (Web) website that uses hypertext markup language (HTML) to create mostly.This information finds in web documents, and they tend to show one piece of article or page-level piecemeal, that is to say, is editing or is remaining single with fashionable this entire document of other sets of documentation.Usually can cause at the enterprising line search of creating by HTML of website and to hit, some of them hit just according to website that this hypertext link is associated in a single speech has appearred.Containing the document that single speech hits is not the material that will search for usually, has however but been sent back.Also have, if a user wishes to visit the one section html document that comprises search query term, this user has to download (promptly opening) entire document and search in the whole text in his computer system.Have again, if a user wishes by the synthetic document of two document creation that contains this search inquiry, the user has to these two documents are entirely linked, perhaps the some parts of artificial these documents of cut and paste in word-processing application.
In artificial document building-up process, the document draftsman reads over whole reference material, selects or the interested part of highlight, those parts is copied in his notebook or on his computing machine.Then, the draftsman repeats this process to next reference material.In some cases, when the draftsman creates his document, at first to read all reference materials, from each reference material, carefully select each single part then.
What carry out in electronic media is similar process, just can obtain reference material in the electronics mode.The draftsman reads over some online information source and selects relevant part with in the individualized document that is included in him.Then, these relative sections can be sheared and paste or copy in the word-processing application in certain other modes, by the draftsman they are carried out manual link there.
Above-mentioned two kinds of individualized document synthetic methods all special charges time and efficient are low, particularly only wish the relevant information combination in the future can be all the more so when easily carrying out reference by visiting single document as the user.
In the general classes dramatic growth of information distribution site on the Internet, yet do not finish being correlated with, assembling and re-using of resource at these websites.Some have work that the conception that a plurality of documents on the WWW is gathered into a single document had been discussed earlier.For example, United States Patent (USP) 5,924,090 discloses a kind of categorizing system, utilize this system according to its content use metadata attributes every be put into of all categories or subclass in.This categorizing system is utilized a kind of device, and it is searched for a database and Search Results is put in one group of maximally related classification, thereby makes the user can have to those relevant records.
At " can expect " (The Atlantic Monthly (Atlantic Monthly) as us, in July, 1945, the 101-108 page or leaf, it reprints in Sloan.stanford.edu/mousesite/Secondary/Bush.html with the http agreement), this author has described the conception of memex device.The memex device helps INFORMATION DISCOVERY and information to synthesize for re-using.This article has also been discussed association index, can couple together two or more items of information by the conjunctive word that the user determines with it, and create the part that a tail tag is represented the corresponding information item.
Neither one is talked about permission from the synthetic homing capability of property document one by one of coherent reference material (but be a kind of physical indicator with apperceive characteristic, how its indication utilizes or finish whatsit) in the above-mentioned list of references.
The present invention recognizes, if having a kind of method, system and program product to be used for that the electronic reference material breakdown is become ingredient and by the synthetic effectively property electronic document one by one of each ingredient of this electronic reference material, that will be good.A kind of method, system and program product, it allows user's searching for reference material, the guideline of sending in the document generation utility routine according to the user produces a synthetic document that only comprises relevant information automatically by selected reference material then, and such method, system and program product will be welcome improvement.If such method, system and program product allow automatically a document (for example individualized document that is produced) to be carried out formatting and index, also will be good.These and other benefits have been recognized in the present invention.
Disclosed a kind of method, system and program product, be used for creating property document one by one in the electronics mode by at least one electronic reference material.This method is at first selected the electronic reference material.Then, the resolved one-tenth subconstiuent of this electronic reference material.This method is assembled similar in the middle of described subconstiuent then, to create individualized document, has one group of homing capability when it is output automatically, and this group homing capability is to form according to the selection that the user was done before the generation document.
In a most preferred embodiment, the notes and commentary that the user sent into before this method, system and program product also produced with document are come this individualized document of note, and create an index according to the Yellow Book and the index degree of depth that the user selects for this individualized document.This index is to utilize to take from the central keyword establishment of subconstiuent.
Above-mentioned and other purposes, characteristics and advantage of the present invention will become obvious in the detailed written description hereinafter.
In claims, proposed to believe and to have characterized new features of the present invention.Yet, consult the following detailed description of illustrated embodiment in conjunction with the accompanying drawings, will understand the present invention itself and best using method, further purpose and advantage best, here,
Figure 1A is for realizing the block scheme of the employed data handling system of most preferred embodiment of the present invention;
Figure 1B is for realizing the block scheme of the employed client-server-data bank network of most preferred embodiment of the present invention;
Fig. 2 is graphic user interface (GUI) figure of information center's application program, and according to one embodiment of present invention, in this application program, the user can select to produce electronic document and the option of indexing;
Fig. 3 A is the search GUI that is used for the retrieving reference material according to one embodiment of present invention;
Fig. 3 B is the note GUI that is used for the new individualized document of creating of note according to one embodiment of present invention;
Fig. 4 is the logical flow chart that produces the individual electronic document process according to one embodiment of present invention;
Fig. 5 is the logical flow chart that according to one embodiment of present invention a document is carried out electronic editing index process; And
Fig. 6 is the block flow diagram that produces the individual electronic document process according to one embodiment of present invention.
With reference now to accompanying drawing,,, described to be used for the basic structure of the data handling system 20 of most preferred embodiment of the present invention among the figure particularly with reference to Figure 1A.Data handling system 20 has at least one CPU (central processing unit) (CPU) or processor to be contained in the system unit 22.System unit 22 links to each other with the plurality of peripheral device, comprises input/output device such as display monitor 96, keyboard 82, figure indicating device 84 and printer 94, uses for user interface.The permanent memory device (as hard disk) that also has that is contained in the system unit 22 is used to store operating system and user's programs/applications program of data handling system, and scratchpad memory device (as random access memory or RAM), it is used to realize programmed instruction by CPU.System unit 22 is communicated by letter with peripheral unit by various devices, comprises by bus or direct channel (utilizing bus bridge that not only bus can be provided).
Data handling system 20 can have many additional parts, and these do not draw in the drawings, as is used for the serial port, parallel port and the USB port that are connected with modulator-demodular unit 92 or CD ROM78 etc.In this embodiment of the present invention, can carry out and the communicating by letter of data handling system 20 via linking modulator-demodular unit 92 on wire over ground or the system for wireless cellular telephony, conversely, modulator-demodular unit 92 links to each other with local network supplier (as Internet service provider (ISP)) again.In addition, data handling system 20 also can be linked a network via network adapter.The data that are transmitted arrive modulator-demodular unit or network card, and processed, so that received by CPU or other software application of data handling system.In this most preferred embodiment, Internet service provider provides reference data, and these reference datas can download in the data handling system 20 via modulator-demodular unit 92.Modulator-demodular unit 92 also can provide and being connected of other source of reference data, as server, BBBS (Bulletin Board System)BS (BBS) or the Internet (comprising WWW).
Those skilled in the art can further understand, what may be used in combination with those parts shown in Figure 1A also has a miscellaneous part, for example, the display adapter that links to each other with processor can be used to control 30, one Memory Controllers of a video display monitor and can be used as interface between temporary storage device and the CPU.Data handling system 20 also comprises a firmware, and its fundamental purpose is to be used for finding out and the load operation system from one of peripheral unit (a normally permanent storage apparatus) when data handling system 20 is connected first.In this most preferred embodiment, data handling system contains a fast relatively CPU and enough big temporary storage device and the space on permanent storage, and needed other hardware componenies.
Traditional data handling system often utilizes a graphic user interface (GUI) to user's presenting information.GUI is by being loaded into software creation on the data handling system, specifically, is this data handling system and operating system application program teamwork.Most preferred embodiment of the present invention is by realizing that based on the application program of GUI this application program has several user interfaces, and supports to be stored in functional part on the medium as program code, and this medium links to each other with processor and also can be read by this processor.
Realization of the present invention is taking place on the data handling system as mentioned above.Yet, should be appreciated that the data handling system of other types is possible, they can have some or more some more above-mentioned basic element of character.For example, can utilize single-use document synthesis system to replace the conventional data disposal system.
The present invention can realize in the network environment as shown in Figure 1B.Network environment comprises a client computer and a server 153, and the present invention realizes as information center's application 151 on client computer, and server 153 is as the source or the pipeline of the synthetic used reference data 155 of individualized document.Network environment can be a Local Area Network or wide area network (WAN), as the Internet.Most preferred embodiment of the present invention be with data handling system that wide area network links to each other on realize that it has the explorer ability to be used to search for the Internet to obtain relevant reference material.Here will with the data handling system that is connected the present invention be described with reference to a wide area network (WAN).
WWW (Web) is a graphical interaction interface that is used for the Internet, noun the Internet and Web conversion use mutually in this whole instructions.With data handling system that Web links to each other on have different computer program application (be the Web browser client computer, hereinafter be called Web browser) to be used to visit the server that links to each other with Web.Information be as web storage on a Web server.A webpage comprises one or more figures and/or text display, and they can be linked at together and can utilize Web browser to download to the client data disposal system.Each webpage in Web has a unique address, or uniform resource locator (URL), and it can utilize TCP (TCP/IP) visit.Webpage often fetches expression by a corresponding hypertext link in client browser, and this link also can provide the information about content of pages.
Current webpage design carries out the transition to extensible markup language (XML) form from the html format that use represents the page-level piecemeal, and it represents the dynamic extensible mechanism of describing document content, meticulousr piecemeal and other functional elements that can not obtain in HTML.XML was developed by WWW advisory committee in 1996.It is a file specification, is used for there being the data of structure to be put into a text, and it allows the single composition of visit text file/data then.The text that uses the XML form to prepare can be viewed thereafter, need not to use in order to produce the program of this document.The text formatting of XML file is easy to be produced and read in a kind of mode of not obscuring by a computing machine, and is independent of platform.XML utilizes mark (promptly by '<' and '〉' bracket speech) and attribute partition data piece.XML comprises the sentence structure that is used in reference to an XML document each several part (data block).XML allows the WWW author to increase mark to web documents, with the implication of appointment search inquiry, thereby makes inquiry more accurate.XML also provides the viewing information of customization by handling corresponding data.The present invention when some steps shown in the process flow diagram of realizing Fig. 4 and Fig. 5 with the XML function as a kind of means.
The present invention utilizes the function of XML language, to allow by a plurality of XML document establishments of finding in the database or to synthesize individualized document.For the purposes of the present invention, database one speech is meant any set of creating one or more reference materials of being selected by the user in the individualized document process.The invention provides a system that finds and reuse information, it produces the individualized document that a quilt is correlated with and is watched, fully understood.The present invention relies on XML document type definition (DTD) to force semanteme tissue to data, and utilizes XSL as the data filter technology, and it provides the transition coding service for synthetic result shared.
The present invention is mainly realizing in the GUI of information center shown in Fig. 3 A.For the purposes of the present invention, the GUI of information center is meant an inlet towards product or territory.The GUI of information center also can be called resource center or document and produce the center.The element that exists in the GUI of information center utilizes Widget (Widget) to create, and they add user interface to and provide more facility selective to the user.In this most preferred embodiment, the GUI300 of information center has browser function, and it can searched on the Internet according to the inquiry that the user sends into.The GUI300 of information center uses search GUI201 visit WWW shown in Figure 2.In Fig. 2, a search inquiry is admitted to inquiry field 207, and can replenish the contextual search item of sending in context field 205 and the classification field 203.Latter two field is to be used for by further determining accurately to be located this search by the general area of reference.Because the meticulous partitioned searching ability of XML format file, make this characteristic utilize the search utility routine that strengthens and enable to take place more accurate hitting.The user selects to submit to inquire button 209 that searching request is sent to the Internet.When hitting, an advertised window 211 is reminded his search of user success already.
Forward Fig. 3 A now to, when the user sent into a search terms in the search field 321, the Web-browser function of the GUI300 of information center was activated, and it opens the search GUI201 of Fig. 2.Relevant hitting as hypertext link is transmitted back in first frame 323 of the GUI of information center.Then, the user can select " vendors' cart " of some articles from finishing here.The user selects the there, and he believes the article that is included as the required good reference material of generation individualized document, and they are copied to reference field 303 (by drag-and-drop operation or double-click selection etc.).Generate mirror image to selected being linked in the reference field 303 of reference.When selected link, the actual text of these documents (promptly being not only hypertext link) is downloaded to the reference memory block of the GUI300 of information center, and temporarily is stored in the there when analysis and synthesis step generation.For ease of demonstration, demonstrate 3 reference documents as the selected synthetic document of document that is used for.In case selected desirable reference documents, the user can send into him and wish the form, index and the annotation information that reflect in the individualized document that will produce.
In form, index and the comment field of the GUI300 of information center, the user can be new individualized document input customized information, for example homing capability and note.For the purposes of the present invention, homing capability is defined as being meant title, index, reference field, topic head and/or the subtitle head that comprises in this individualized document, the hypertext link that arrives reference source etc.Like this, customized information comprises the new individualized document title of sending in the header field 305, be the individualized document desirable index level of indexing what the index degree of depth was selected to send in the district 311, and the individualized document that the Optional Form from form drop-down menu 307 is selected in tabulating represents form.Should be appreciated that the customized information of other types is available, above-mentioned tabulation does not mean that limitation of the invention.The form style comprises for example Lotus style and IBM style etc.
The user can send into individual's notes and commentary that will comprise in individualized document in notes and commentary district 309.Certainly, further notes and commentary can be in being added to this individualized document in the future, and the present invention only provides the descriptive summary of a weak point or to the introduction of synthetic material.In another embodiment, the user is allowed to according to the criterion of indexing note be carried out in whole individualized document segmentation.Like this, but each mainly beginning and the ending of retrieval topic head in the individualized document that user's note is finished.For example, the user can write a summary sentences or paragraph at the place that begins of each major part the beginning the place and write an introductory paragraph of this individualized document, adds conclusion in the ending of individualized document.Carry out the not only input and the selection of user's notes and commentary among the notes and commentary GUI that describes in Fig. 3 B, this will be described below.
At last, the user selects to select output intent in the district 313 in output.The user selects one of multiple output intent, this comprises the new XML source of browser, PDF, download, submit to print, as the document files preservation etc.In one embodiment, individualized document is to export as a new XML document, and it can outwards send on the Internet.
In another embodiment, the user also can be chosen in the catalogue that a reference material that will comprise is created at the individualized document end.The reference material that comprises in catalogue fetches with hypertext link and represents, and hauls out the whole text or the relevant portion of reference material with permission user afterwards.Once all these zones all complete, then the user selects submit button 315, to begin to produce document.
Fig. 3 B shows the graphic user interface 350 that is used to import user's comment, and these comments are as the note of the selected part of the individual electronic document that is about to build up.GUI350 comprises two frames, segment frame 351 and annotation box 353.The segment frame comprises the contents list of desirable individualized document layout.But each in contents list is an options, and when it was chosen, it was coupled together the notes and commentary that write in the annotation box 353 and this characteristic item.Like this, for example, the user selects foreword-summary item, sends into the notes and commentary of being write then in annotation box 353.When he had finished his notes and commentary, he selected another again and sends into corresponding notes and commentary.Another selection is caused opening another page or leaf in the annotation box 353, perhaps, if the formerly selected mistake of this item is then opened the previous frame corresponding with this.Below this frame be position (or arrangement) icon 357 that is used to determine this notes and commentary position in this individualized document segment.Demonstrate 3 possible positions among the figure, they have corresponding button.These buttons be before button 359A, button 359B and selector button 359C afterwards.Various modifications to this arrangement function are possible.These buttons can perhaps be applied to whole individualized document by single selection after each selection.In case the notes and commentary of finishing, the user selects return push-button 361, and it closes GUI350.
In another most preferred embodiment, GUI350 is used as the framework of creating individualized document.For each writes in the contents list notes and commentary are used to search for relevant reference information.Like this, when selecting return push-button 361, the keyword in each notes and commentary is selected as the search inquiry district that Fig. 3 A sent in search word or phrase.These search words are sending out on the Internet, and return hitting at the predetermined search word of particular sequence.Then the user for each segment select relevant hit and these hit to submit to produce the document utility routine.In one embodiment, by keyword and the information creating index terms or the indexing head that from each coherent reference material, extract.
Fig. 4 shows that the user carries out search database and submits the reciprocal process of coherent reference material to producing the document utility routine thereafter.This process enters piece 403 then in piece 401 beginnings, and the user sends into search inquiry there.In case return Search Results, then in piece 405, select suitable coupling with for referencial use.Then, the user sends into title, index, note, form and output information in piece 407.In case the user has sent into all necessary informations, then submit to by the user and produce the individualized document request, so process finishes at piece 411 at piece 409.As discussing, can finish other realizations of the present invention with different procedural orders with reference to figure 3B.Here just explanation for example of the order that represents.
With reference now to Fig. 6,, shows the block flow diagram of major function of the present invention among the figure.As shown in FIG., 3 reference documents doc.A601, doc.B602 and doc.C603 are as the input of creating individualized document doc.D615.Each reference documents has a plurality of subdivisions based on its XML form.The selection of similar terms in the reference documents is based on the semantic marker of their XML form.This information has metadata, and they can come separately according to theme, title and author.So may carry out by contextual search.Grammatical analysis utility routine 607 is isolated each single ingredient of reference material respectively, and the part that these are separated is delivered to editor utilities 609.Editor utilities 609 combines the similar segment in each reference documents (for example W among the Doc.A601 and the W among the Doc.B603 etc.).The order that can determine by the user or carry out this combination by default order.Then, utilize other bottom ingredients in XML form, indexing head and the reference documents, edit out individualized document, will be combined to from the similar area in whole 3 sources here under the specific indexing head by reference documents.Then edited segment is delivered to the utility routine of indexing, its creates the index of the segment edited, as hereinafter with reference to as described in the figure 6.613 pairs of individualized document formattings of formatter utility routine then, comprise insert index, individual notes and commentary, title, with reference to segment etc.The output intent of selecting according to the user is exported individualized document doc.D615 then.Though each utility routine be describe by a particular order and show that with connection chain the order of describing these utility routines is unessential for various embodiment of the present invention.In some applications, specific utility routine, as the utility routine 611 of indexing can be used as independent utility.Have again, realize that with other functional parts each specific function of the present invention described herein is possible.
Fig. 5 shows the process of document being indexed according to the present invention.The process of indexing enters piece 503 then in piece 501 beginnings.At piece 503, the user is prompted to send into the desirable index degree of depth.In case receive the index depth information, then in piece 505 loadings topic head and crosshead head list of categories, these topic heads can be provided by the user, or extract from index data base according to related subject.Then, at piece 507, utilization topic head and depth information search the document.The complexity that depends on the document, this search can be finished in paragraph level or segment stages.Next, produce main some distribution plan at piece 509 based on search.Then, at piece 511, be mapped to their main points separately with the point of each main spot correlation.Determine whether to reach the needed degree of depth at piece 513 then.If do not reach the needed degree of depth as yet, then continue to carry out seeking the step of next point at piece 511, just repeat to produce the more depth indexing of more tiny point at every turn in the next degree of depth/rank.In case reached the desirable degree of depth, then exported this index to the user at piece 515.At piece 517, this index is added in the place that begins of the document then.This process finishes at piece 519 then.
Should be appreciated that within the scope of the present invention, for finishing the function of indexing, other procedure blocks may be necessary, perhaps these procedure blocks can be with different series arrangement.For example, can in document, sequentially assess every class topic head and crosshead head to ending from document is initial.Have again, index and on original document, to finish, also can on newly-built synthetic document (i.e. an individualized document), finish.The index of the GUI300 of information center is selected Qu Haike to expand to allow the user to send into desirable individualized document internal information to represent sequence, and it also directly influences the layout of index.
Like this, operational format of the present invention turns to the information of semantic XML unit, and this helps itself to carry out OO discovery.The present invention realizes a query interface, and it is mapped to user's intention the semanteme of information database.The present invention has also realized an interactive interface, and it makes the user can add note, selects homing capability (contents table, general index or be subjected to the index of subject matter restricted, to other sources or from the set of links in other sources).This interface also enables to select show style (usually, the appearance and the sensation of the information of table retransmitted in the enterprise business rule influence), and select desirable form as a result (to be reintroduced in the document or to output in other databases as a new message unit, use for the individual as readable line format, as the page format result to be suitable for printing etc.).
The present invention realizes in the document or the search in the document database, and filters out the continuous item corresponding with search inquiry, returns a synthetic document ready-made, that highly link then, and it is note and related those projecting points neatly.This synthetic document or can be used as the replenishing of existing research information set perhaps can be used as a regeneration document and transfers out other databases then, should synthetic sharing of documents thereby realize.
At last, importantly, although exemplifying embodiment of the present invention is to describe in the environment of global function data handling system, but it will be understood to those of skill in the art that, the software aspect of the embodiment of the invention can be as various forms of program product issues, no matter the actual particular type that carries out the used signal bearing medium of this issue how, exemplifying embodiment of the present invention similarly is suitable for.The example of signal bearing medium comprises recordable-type media, as floppy disk, hard disk drive, CDROM, and transmission type media, as numeral and communication link simulation.
Although specifically show and described the present invention, it will be understood to those of skill in the art that the various changes that to carry out on form and the details here, and do not leave the spirit and scope of the present invention with reference to most preferred embodiment.

Claims (27)

1. create the method for individualized document by at least one electronic reference document in the electronics mode for one kind in data handling system, described method comprises following steps:
Described at least one electronic reference document is selected in response user's input;
Automatically described at least one electronic reference document is resolved to the plurality of sub composition;
Respond finishing of described analyzing step, automatically from described subconstiuent, assemble similar terms, to create described individualized document; And
According to the selection that the user is done before creating document, output has the described individualized document of one group of homing capability that produces automatically.
2. the method for claim 1 further comprises following steps:
Produce the preceding notes and commentary of sending into by described user with document and come the described individualized document of note; And
According to the Yellow Book and the index degree of depth that the user selects, described individualized document is indexed the keyword that the wherein said step utilization of indexing is extracted from described subconstiuent.
3. the process of claim 1 wherein that described selection step comprises the step among the temporary that the content replication of described at least one electronic reference document is arrived with described data handling system links to each other.
4. the method for claim 1 further comprises the step of described at least one electronic reference document being carried out contextual search, and wherein said search is to carry out according to inquiry and contextual information that the user sends into.
5. the method for claim 4, wherein set up with extensible markup language at described at least one electronic document that carries out in the search step, have meta-tag to be used to distinguish the relevant segment of described at least one electronic document, and wherein said carry out material database of search step search with obtain described at least one have the electronic document of described meta-tag.
6. the method for claim 5, wherein said search is being carried out on the Internet, described material database comprises at least one webpage by relevant hypertext link representative, wherein said at least one electronic document is positioned on described at least one webpage, and described selection step comprises the hypertext link of selecting described at least one electronic document.
7. the described subconstiuent of described at least one electronic document of sending into by the user before the method for claim 5, wherein said analyzing step are utilized described meta-tag and produced document of the incompatible description of criteria set.
8. the method for claim 7, wherein said agglomeration step comprises the following steps:
Described subconstiuent is matched each other;
Described subconstiuent with analogous element is made up, to create the subclass group; And
Link described subclass group, to produce described individualized document.
9. the method for claim 8, wherein said output step comprises the following steps:
The form preference that the user was selected before the generation document is applied to described individualized document;
In described individualized document, pre-determine title and summary; And
Described index in the described individualized document is placed on predetermined position.
10. create the computer program of individualized document by at least one electronic reference document in the electronics mode, described program product comprises:
Computer-readable medium; And
Programmed instruction on described computer-readable medium is used for:
Described at least one electronic reference document is selected in response user's input;
Automatically described at least one electronic reference document is resolved to the plurality of sub composition;
Respond finishing of described analyzing step, automatically from described subconstiuent, assemble similar terms, to create described individualized document; And
According to the selection of user before executive routine, output has the described individualized document of one group of homing capability that produces automatically.
11. the computer program of claim 10 further comprises programmed instruction, is used for:
Produce the preceding notes and commentary of sending into by described user with document and come the described individualized document of note; And
According to the Yellow Book and the index degree of depth that the user selects, described individualized document is indexed the keyword that the wherein said step utilization of indexing is extracted from described subconstiuent.
12. the program product of claim 10, wherein said option program instruction comprises used programmed instruction among the temporary that the content replication of described at least one electronic reference document is arrived with described data handling system links to each other.
13. the computer program of claim 10 further comprises described at least one electronic reference document is carried out the used programmed instruction of contextual search, wherein said search is to carry out according to inquiry and contextual information that the user sends into.
14. the computer program of claim 13, wherein at least one electronic document in the described programmed instruction of searching for is set up with extensible markup language, have meta-tag to be used to distinguish the relevant segment of described at least one electronic document, and wherein said carry out material database of search utility instruction search with obtain described at least one have the electronic document of described meta-tag.
15. the computer program of claim 14, wherein said search is being carried out on the Internet, described material database comprises at least one webpage by relevant hypertext link representative, wherein said at least one e-file is positioned on described at least one webpage, and described option program instruction comprises the used programmed instruction of hypertext link of selecting described at least one electronic document.
16. the computer program of claim 14, the described subconstiuent of described at least one electronic document of sending into by the user before wherein said analysis program instruction utilizes described meta-tag and produces document of the incompatible description of criteria set.
17. the computer program of claim 16, wherein said agglomerative procedure instruction comprises that programmed instruction is used for:
Described subconstiuent is matched each other;
Described subconstiuent with analogous element is made up, to create the subclass group; And
Link described subclass group, to produce described individualized document.
18. the computer program of claim 17, wherein said written-out program instruction comprises that programmed instruction is used for:
The form preference that the user was selected before the generation document is applied to described individualized document;
In described individualized document, pre-determine title and summary; And
Described index in the described individualized document is placed on predetermined position.
19. the data handling system by online reference material generation individual electronic document, described disposal system comprises:
Processor and data storage area;
Connect medium, be used for described processor is linked the coherent reference material database;
First graphic user interface (GUI) that is stored in described data storage area and can be carried out by described processor is used to send into a search terms and at the enterprising line search of described database;
The 2nd GUI that is stored in described data storage area and can be carried out by described processor is used for selecting the coherent reference material that returns from the search that a described GUI carries out and sends into the relevant homing capability of individualized document form of wishing with the user;
Produce utility routine with described the 2nd GUI linked document, be used for by the synthetic property document one by one of the ingredient of described coherent reference material, wherein said individualized document has described homing capability, and described document generation utility routine is stored in described data storage area and can be carried out by described processor.
20. create the method for individualized document in the electronics mode for one kind, comprise the following step:
Establishment comprises the individualized document note summary of keyword, is used for being included in described individualized document;
Described note summary is submitted to search engine, enable to search for and extract the reference material that has about the ingredient of described keyword;
Receive one group of described reference material; And
Utilize the described ingredient of described note summary and described one group of reference material, synthetic automatically described individualized document.
21. the method for claim 20, wherein the described note summary in described foundation step comprises title, foreword and to the notes and commentary of one or more described keywords.
22. the method for claim 20, wherein said automatic synthesis step comprises the following steps:
For described individualized document is selected the formatting style;
For described individualized document is selected output intent; And
After being synthesized, creates described individualized document described individualized document index.
23. the method for claim 22, wherein said foundation step comprise the step of selecting the described index degree of depth.
24. create the system of individualized document in the electronics mode for one kind, comprise:
The note utility routine is used to send into the user's notes and commentary about described individualized document;
The search utility routine is used in data for electronic documents storehouse search and the relevant reference material of those keywords during described user makes commentary and annotation;
User interface is used to allow the user to select to be included in coherent reference material and homing capability among the described individualized document;
Document produces utility routine, is used to utilize described coherent reference material, described user notes and commentary and described homing capability, produces described document automatically; And
The output utility routine is used to export described individualized document.
25. the system of claim 24 further comprises the utility routine of indexing, and is used for creating automatically the index of described individualized document.
26. a computer program that is used for creating in the electronics mode individualized document comprises:
Computer-readable medium; And
Be stored in the programmed instruction on the described computer-readable medium, comprise:
The note utility routine is used to send into the user's notes and commentary about described individualized document;
The search utility routine is used in data for electronic documents storehouse search and the relevant reference material of those keywords during described user makes commentary and annotation;
User interface is used to allow the user to select to be included in coherent reference material and homing capability among the described individualized document;
Document produces utility routine, is used to utilize described coherent reference material, described user notes and commentary and described homing capability, produces described document automatically; And
The output utility routine is used to export described individualized document.
27. the computer program of claim 26 further comprises the utility routine of indexing, and is used for creating automatically the index of described individualized document.
CN01112120A 2000-03-31 2001-03-29 System and method for establishing personalized file in electronic form Expired - Fee Related CN1127031C (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US54043500A 2000-03-31 2000-03-31
US09/540,435 2000-03-31

Publications (2)

Publication Number Publication Date
CN1319817A true CN1319817A (en) 2001-10-31
CN1127031C CN1127031C (en) 2003-11-05

Family

ID=24155457

Family Applications (1)

Application Number Title Priority Date Filing Date
CN01112120A Expired - Fee Related CN1127031C (en) 2000-03-31 2001-03-29 System and method for establishing personalized file in electronic form

Country Status (5)

Country Link
JP (1) JP2001306552A (en)
KR (1) KR100403947B1 (en)
CN (1) CN1127031C (en)
AU (1) AU781901B2 (en)
SG (1) SG96607A1 (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100538608C (en) * 2004-02-10 2009-09-09 微软公司 Be coated with the system and method that China ink uses dynamic digital zooming interface in conjunction with numeral
CN101539905A (en) * 2009-04-27 2009-09-23 浙江大学 Embedded multi-format electronic document marking method
CN101408876B (en) * 2007-10-09 2011-03-16 中兴通讯股份有限公司 Method and system for searching full text of electric document
US8301631B2 (en) 2009-05-30 2012-10-30 Edmond Kwok-Keung Chow Methods and systems for annotation of digital information
CN103492996A (en) * 2011-02-24 2014-01-01 谷歌公司 Electronic book interface system and method
US9015166B2 (en) 2009-05-30 2015-04-21 Edmond Kwok-Keung Chow Methods and systems for annotation of digital information
CN105608227A (en) * 2016-01-26 2016-05-25 唐山新质点科技有限公司 Document data retrieval method and device
WO2022184012A1 (en) * 2021-03-01 2022-09-09 北京字跳网络技术有限公司 Document creation method and apparatus, and device and storage medium
CN113157996B (en) * 2020-01-23 2022-09-16 久瓴(上海)智能科技有限公司 Document information processing method and device, computer equipment and readable storage medium

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7356537B2 (en) * 2002-06-06 2008-04-08 Microsoft Corporation Providing contextually sensitive tools and help content in computer-generated documents
KR100980575B1 (en) * 2008-04-07 2010-09-06 송영주 Developing multiple-coninuous guide linked information block system and its personalized utilization method
US8352514B2 (en) * 2008-12-10 2013-01-08 Ck12 Foundation Association and extraction of content artifacts from a graphical representation of electronic content
EP2620748A3 (en) * 2012-01-26 2016-04-20 Hyundai Motor Company Device for providing or generating intertwined information related to a space of interest.
CN104021131B (en) * 2013-03-01 2017-08-08 中国移动通信集团浙江有限公司 A kind of dissemination method, the apparatus and system of the various dimensions page
KR102183815B1 (en) * 2019-02-15 2020-11-27 리걸테크 주식회사 Data Management System and Data Management Method
KR102633515B1 (en) * 2020-12-23 2024-02-06 정신호 System for mobile contents generation

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE69531599T2 (en) * 1994-12-20 2004-06-24 Sun Microsystems, Inc., Mountain View Method and device for finding and obtaining personalized information
US5708825A (en) * 1995-05-26 1998-01-13 Iconovex Corporation Automatic summary page creation and hyperlink generation
US6029182A (en) * 1996-10-04 2000-02-22 Canon Information Systems, Inc. System for generating a custom formatted hypertext document by using a personal profile to retrieve hierarchical documents

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100538608C (en) * 2004-02-10 2009-09-09 微软公司 Be coated with the system and method that China ink uses dynamic digital zooming interface in conjunction with numeral
CN101408876B (en) * 2007-10-09 2011-03-16 中兴通讯股份有限公司 Method and system for searching full text of electric document
CN101539905A (en) * 2009-04-27 2009-09-23 浙江大学 Embedded multi-format electronic document marking method
US8301631B2 (en) 2009-05-30 2012-10-30 Edmond Kwok-Keung Chow Methods and systems for annotation of digital information
US9015166B2 (en) 2009-05-30 2015-04-21 Edmond Kwok-Keung Chow Methods and systems for annotation of digital information
CN103492996A (en) * 2011-02-24 2014-01-01 谷歌公司 Electronic book interface system and method
US9501461B2 (en) 2011-02-24 2016-11-22 Google Inc. Systems and methods for manipulating user annotations in electronic books
US10067922B2 (en) 2011-02-24 2018-09-04 Google Llc Automated study guide generation for electronic books
CN105608227A (en) * 2016-01-26 2016-05-25 唐山新质点科技有限公司 Document data retrieval method and device
CN105608227B (en) * 2016-01-26 2019-02-19 唐山新质点科技有限公司 Document data search method and device
CN113157996B (en) * 2020-01-23 2022-09-16 久瓴(上海)智能科技有限公司 Document information processing method and device, computer equipment and readable storage medium
WO2022184012A1 (en) * 2021-03-01 2022-09-09 北京字跳网络技术有限公司 Document creation method and apparatus, and device and storage medium

Also Published As

Publication number Publication date
KR20010094955A (en) 2001-11-03
CN1127031C (en) 2003-11-05
AU7186600A (en) 2001-10-04
AU781901B2 (en) 2005-06-23
SG96607A1 (en) 2003-06-16
KR100403947B1 (en) 2003-10-30
JP2001306552A (en) 2001-11-02

Similar Documents

Publication Publication Date Title
Ovsiannikov et al. Annotation technology
Denoue et al. An annotation tool for Web browsers and its applications to information retrieval.
US6968332B1 (en) Facility for highlighting documents accessed through search or browsing
Travis et al. The SGML implementation guide: a blueprint for SGML migration
Hammer et al. Semistructured data: The TSIMMIS experience
US6654737B1 (en) Hypertext-based database architecture
Alexa et al. A review of software for text analysis
CN1127031C (en) System and method for establishing personalized file in electronic form
US20030050927A1 (en) System and method for location, understanding and assimilation of digital documents through abstract indicia
US20070231781A1 (en) Estimation of adaptation effort based on metadata similarity
US6654758B1 (en) Method for searching multiple file types on a CD ROM
US20020013792A1 (en) Virtual tags and the process of virtual tagging
Ohene-Djan et al. Personalising electronic books
JP4469432B2 (en) INTERNET INFORMATION PROCESSING DEVICE, INTERNET INFORMATION PROCESSING METHOD, AND COMPUTER-READABLE RECORDING MEDIUM CONTAINING PROGRAM FOR CAUSING COMPUTER TO EXECUTE THE METHOD
Harper et al. Middleware to expand context and preview in hypertext
Browne et al. Website Indexing: enhancing access to information within websites
Nagao et al. Semantic Transcoding: Making the WWW More Understandable and Usable with External Annotations
Gueguen Digitized special collections and multiple user groups
JP2000250908A (en) Support device for production of electronic book
WO2001029709A1 (en) System and method for location, understanding and assimilation of digital documents through abstract indicia
Ahonen et al. Design and implementation of a document assembly workbench
Chang An electronic finding aid using extensible markup language (XML) and encoded archival description (EAD)
Browne et al. Website indexing
Broady et al. Internet and the humanities: the promises of Integrated Open Hypermedia
Ford et al. Interactive multimedia publishing systems

Legal Events

Date Code Title Description
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C06 Publication
PB01 Publication
C14 Grant of patent or utility model
GR01 Patent grant
C19 Lapse of patent right due to non-payment of the annual fee
CF01 Termination of patent right due to non-payment of annual fee