CN103914487A - Document collection, identification and association system - Google Patents

Document collection, identification and association system Download PDF

Info

Publication number
CN103914487A
CN103914487A CN201310006234.2A CN201310006234A CN103914487A CN 103914487 A CN103914487 A CN 103914487A CN 201310006234 A CN201310006234 A CN 201310006234A CN 103914487 A CN103914487 A CN 103914487A
Authority
CN
China
Prior art keywords
document
relation
mark
module
graph
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201310006234.2A
Other languages
Chinese (zh)
Other versions
CN103914487B (en
Inventor
邓寅生
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN201310006234.2A priority Critical patent/CN103914487B/en
Publication of CN103914487A publication Critical patent/CN103914487A/en
Application granted granted Critical
Publication of CN103914487B publication Critical patent/CN103914487B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/93Document management systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • General Business, Economics & Management (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a document collection, identification and association system and accordingly a computer system based knowledge management system in a certain professional field is built and the learning efficiency and the utilization efficiency of professional field knowledge are improved. According to the technical scheme, a series of documents which have the logical relation between the documents with a group of keywords are identified and associated in an unspecific document in a keyword search mode and the group of keywords are combined according to certain logic to name a relation graph which is formed by the series of documents.

Description

Collection, mark and the associated system of document
Technical field
The present invention relates to document system, relate in particular to collection, mark and associated efficient disposal system to online or unit document (containing handheld device) in a certain particular professional field.
Background technology
By the search of existing many documents and the system of displaying in the world, professional and technical personnel obtains, learns and study document, and need to from the document of many parts of date of formation differences, author's difference (independent author or associating author), obtain effective information as the relevant reference frame of acting criterion.The a certain knowledge content that may finally need only accounts for its document content of inquiring about below 5%, and these knowledge contents may be dispersed among several not obvious relevant documents.
The applicant recognizes, need to be these professional and technical personnel, searches out meet they require, customizing messages accurately from them the field of being concerned about, the answer that need to extract corresponding information at magnanimity document is very consuming time.And can provide the personnel of relevant similar service very rare for these professionals.
Therefore, the applicant recognizes to set up and a kind ofly better gathers, identifies for document and associated systems approach.
Summary of the invention
The object of the invention is to address the above problem, a kind of collection, mark and associated system of document is provided, built the Knowledge Management System based on computer system of a certain professional domain, improved learning efficiency and utilization ratio to professional domain knowledge.
Technical scheme of the present invention is: the present invention has disclosed a kind of collection, mark and associated system of document, comprise document classification storage administration Platform Server and document library Platform Server, wherein document classification storage administration Platform Server comprises graph of a relation apparatus for establishing between the mark of the harvester of single document, single document and associated apparatus and document, the host node of document memory storage is deployed on document classification storage administration Platform Server, the image release of the host node of document memory storage is deployed on document library Platform Server, wherein:
The harvester of single document, for collecting the document of required management type, preparatory processing and system introducing;
The mark of single document and associated apparatus, according to different dimensions and level, default technical term is classified and defined, set up and safeguard the lists of keywords of corresponding professional domain, single document is defined according to different attributes and level, several document element are set in single document, document element is carried out to the system banner of several keywords, define issuable logical relation list between any two single documents or document element, and realize the association setting of two logical relations between single document by the logical relation kind of having set,
Graph of a relation apparatus for establishing between document, defines graph of a relation between document, and the relation between each ingredient of graph of a relation between document is defined;
Document memory storage, according to calling of graph of a relation apparatus for establishing between the mark of the harvester of single document, single document and associated apparatus, document, relevant information is stored in the database of document classification storage administration Platform Server, the formatted file of appointment is filed in document library Platform Server, and by data switch engine, related data information is transmitted to data between document classification storage administration Platform Server and document library Platform Server.
According to an embodiment of the system of the collection of document of the present invention, mark and association, the harvester of single document further comprises:
Form sorting module, is organized into document the form of appointment;
Classified information identification module, linking format sorting module, adds preliminary classified information mark on request by formatted file;
File imports module, and link sort message identification module imports to the formatted file that has added classified information mark in system.
According to an embodiment of the system of the collection of document of the present invention, mark and association, mark and the associated apparatus of single document further comprise:
Keyword dimension setting module, sets the dimension of keyword;
Key definition module, connects keyword dimension setting module, and the corresponding keyword of the each dimension of keyword is defined;
Document classification setting module, according to keyword to the setting of classifying of single document;
Document fragment setting module, according to keyword to the setting of classifying of each document fragment of document.
According to an embodiment of the system of the collection of document of the present invention, mark and association, mark and the associated apparatus of single document also comprise:
Document element arranges module, is several document element by the document fragment combination with same keyword mark of single document;
Document element identification module, carries out the system banner of several keywords to document element;
Logic association module, defines issuable logical relation list between any two single documents, realizes the association of the logical relation between two single documents or document element by the logical relation kind of having set in system.
According to an embodiment of the system of the collection of document of the present invention, mark and association, between document, graph of a relation apparatus for establishing further comprises:
Keyword name module, names by specific one group of keyword graph of a relation between arbitrary concrete document;
Graph of a relation generation module between document, generate graph of a relation between document, comprise representing of the pattern identification of the logical relation between the representing of a series of document unit arranged by the certain logic relation between document element in graph of a relation between document, document element, single document element.
According to an embodiment of the system of the collection of document of the present invention, mark and association, document memory storage further comprises:
Relational DBMS, for setting up document classification storage administration platform;
Document library management system, for setting up document library platform;
Write operation module, to the write operation of calling performing database of each device;
Preserve operational module, to each device call file function and preserve graph of a relation file between corresponding single document files or document;
Platform data transport module transmits related data by data switch engine between document classification storage administration Platform Server and document library Platform Server.
According to an embodiment of the system of the collection of document of the present invention, mark and association, system also comprises document textual research and explain acquisition platform server, comprising:
Document textual research and explain harvester, gathers user's input data relevant to the explanatory content of document;
Data acquisition audit device, examines the input data that collect;
Document textual research and explain memory storage will be stored by the relevant input data link of document explanatory content of audit in graph of a relation between corresponding document or document.
According to an embodiment of the system of the collection of document of the present invention, mark and association, logical relation between document includes but not limited to derived relation, parallel relation or relation, logical relation with relation, relation of inclusion, revision relation, covering relation, uncertainty relation, wherein unique icon in the logical relation correspondence system between each document.
According to an embodiment of the collection of document of the present invention, mark and associated system, concrete implementation also comprises the service architecture system building based on cloud, realizes services such as data query, program updates and the file update processing in high in the clouds.
According to an embodiment of the system of the collection of document of the present invention, mark and association, document includes but not limited to paper, teaching material, historical document, laws and regulations, training courseware, news and bulletin, includes but not limited to the multimedia medium of word, audio frequency, video, webpage.
The present invention has also disclosed a kind of collection, mark and associated system of document, moves in single device in the mode of standalone version, comprising:
The harvester of single document, for collecting the document of required management type, preparatory processing and system introducing;
The mark of single document and associated apparatus, according to different dimensions and level, default technical term is classified and defined, set up and safeguard the lists of keywords of corresponding professional domain, single document is defined according to different attributes and level, several document element are set in single document, document element is carried out to the system banner of several keywords, define issuable logical relation list between any two single documents or document element, and realize the association setting of two logical relations between single document by the logical relation kind of having set,
Graph of a relation apparatus for establishing between document, defines graph of a relation between document, and the relation between each ingredient of graph of a relation between document is defined;
Document memory storage, according to calling of graph of a relation apparatus for establishing between the mark of the harvester of single document, single document and associated apparatus, document, relevant information is stored in the database of single device, the formatted file of appointment is filed in the database of single device;
Standalone version packing and issuing device, the file of specified format after the data of finally preserving by document memory storage and filing is packaged into a complete issue parcel, and generates targetedly distributing device program executable file and supporting ancillary documents according to the difference of target platform;
Client erecting device, by carrying out the program executable file of distributing device, will issue complete being deployed in single device of parcel, comprising: the file of specified format after the data of finally preserving by document memory storage and filing.
According to an embodiment of the system of the collection of document of the present invention, mark and association, the harvester of single document further comprises:
Form sorting module, is organized into document the form of appointment;
Classified information identification module, linking format sorting module, adds preliminary classified information mark on request by formatted file;
File imports module, and link sort message identification module imports to the formatted file that has added classified information mark in system.
According to an embodiment of the system of the collection of document of the present invention, mark and association, mark and the associated apparatus of single document further comprise:
Keyword dimension setting module, sets the dimension of keyword;
Key definition module, connects keyword dimension setting module, and the corresponding keyword of the each dimension of keyword is defined;
Document classification setting module, according to keyword to the setting of classifying of single document;
Document fragment setting module, according to keyword to the setting of classifying of each document fragment of document.
According to an embodiment of the system of the collection of document of the present invention, mark and association, mark and the associated apparatus of single document also comprise:
Document element arranges module, is several document element by the document fragment combination with same keyword mark of single document;
Document element identification module, carries out the system banner of several keywords to document element;
Logic association module, defines issuable logical relation list between any two single documents, realizes the association of the logical relation between two single documents or document element by the logical relation kind of having set in system.
According to an embodiment of the system of the collection of document of the present invention, mark and association, between document, graph of a relation apparatus for establishing further comprises:
Keyword name module, names by specific one group of keyword graph of a relation between arbitrary concrete document;
Graph of a relation generation module between document, generate graph of a relation between document, comprise representing of the pattern identification of the logical relation between the representing of a series of document unit arranged by the certain logic relation between document element in graph of a relation between document, document element, single document element.
According to an embodiment of the system of the collection of document of the present invention, mark and association, system also comprises document textual research and explain acquisition subsystem, comprising:
Document textual research and explain harvester, gathers user's input data relevant to the explanatory content of document;
Data acquisition audit device, examines the input data that collect;
Document textual research and explain memory storage will be stored by the relevant input data link of document explanatory content of audit in graph of a relation between corresponding document or document.
According to an embodiment of the system of the collection of document of the present invention, mark and association, logical relation between document includes but not limited to derived relation, parallel relation or relation, logical relation with relation, relation of inclusion, revision relation, covering relation, uncertainty relation, wherein unique icon in the logical relation correspondence system between each document.
According to an embodiment of the system of the collection of document of the present invention, mark and association, document includes but not limited to paper, teaching material, historical document, laws and regulations, training courseware, news and bulletin, includes but not limited to the multimedia medium of word, audio frequency, video, webpage.
The present invention contrasts prior art following beneficial effect: the solution of the present invention is to search out in keyword search mode a series of document that has document logical relation with one group of specific keyword in magnanimity document, and to the graph of a relation between this specific a series of document with one group with it the keyword of strong correlation name.Particularly, the solution of the present invention is the increasing document of a certain professional domain of sortord Collection and conservation with agreement by some station server groups, and is positioned over corresponding database and preserves.After up-to-date document being gathered by document classification storage administration platform, by the keyword of multiple dimensions, document is carried out to the classification of document fragment, become document element according to the document slice groups of all correspondences of keyword abstraction of specifying, set up keyword index, and produce the graph of a relation that meets human brain thinking logic by document element, and with the crucial phrase of multiple dimensions, the document graph of a relation is named simultaneously.By data switch engine, by the sorted document of key definition document fragment and the index thereof of multiple dimensions, and the document relationships figure that meets human brain thinking logic is sent to document library platform.
By building of this system, can help user from the magnanimity document of database, to search graph of a relation complete content and relevant information between a certain concrete document with the fastest speed, improve learning efficiency and utilization ratio to this professional domain knowledge.
Accompanying drawing explanation
Fig. 1 is the block diagram corresponding to an embodiment of the system of the collection of document of the present invention, mark and association.
Fig. 2 A-2D shows respectively the refined structure of each device in system.
Fig. 3 is the block diagram corresponding to application drawing 1 system of the present invention.
Fig. 4 realizes schematic diagram corresponding to the database aspect of the harvester of single document of the present invention.
Fig. 5 is the block diagram corresponding to graph of a relation definition between the document collection processing in the present invention and document.
Fig. 6 realizes schematic diagram corresponding to graph of a relation apparatus for establishing database aspect between single document identification associated apparatus of the present invention and document.
Fig. 7 realizes block diagram corresponding to data circulation part between the document classification storage administration platform in the present invention, document library platform.
Fig. 8 is the block diagram of another embodiment of the system of collection, mark and the association of document of the present invention.
Fig. 9 is the system operational flow diagram of the embodiment shown in Fig. 8.
Figure 10 is the refined structure figure of document textual research and explain acquisition platform server.
Embodiment
Below in conjunction with drawings and Examples, the invention will be further described.
Fig. 1 shows the structure of an embodiment of the system of collection, mark and the association of document of the present invention.System of the present invention had both been applicable to online document, was also applicable to unit document (comprising handheld device).Embodiments of the invention illustrate as an example of online document example, and seemingly, difference is only to make into standalone version to the application class of unit document, and this is well known to those skilled in the art.Refer to Fig. 1, the system of the present embodiment comprises document classification storage administration Platform Server 10, document library Platform Server 12.
Document classification storage administration Platform Server 10, except common central processing unit, operating system and data switch engine, also comprises control applying portion: graph of a relation apparatus for establishing 104 and be deployed in the host node 106 of the document memory storage on document classification storage administration Platform Server between the mark of the harvester 100 of single document, single document and associated apparatus 102, document.
Document library Platform Server 12, except common central processing unit, operating system and data switch engine, also comprises control applying portion: the image release 124 that is deployed in the host node of the document memory storage on document library Platform Server.
The harvester 100 of single document has totally been realized the function of collection, preparatory processing and the import system of required management type document.Fig. 2 A shows the inner structure of the harvester 100 of single document, in conjunction with Fig. 2 A, the harvester 100 of single document is deployed on document classification storage administration Platform Server 10, and device 100 comprises: form sorting module 1000, classified information identification module 1002, file import module 1004.
Form sorting module 1000 is organized into document the formatted file of appointment outside system.
Classified information identification module 1002 adds as requested preliminary classification information by formatted file outside system, includes but not limited to: heading message, identification number information, document header, document text message, accessory information, multi-language version information etc.
File import module 1004 by formatted file by system introducing to document classification storage administration Platform Server 10.
Fig. 3 shows the operational scheme of system of the present invention, illustrates that the operational scheme of harvester 100 of single document is as follows in conjunction with Fig. 3.
First, provide the knowledge base that comprises at least one data structure that the document files of specified format and document data are associated (document information underlying table, author's table, document be contents table, document antistop list in full in full).Fig. 4 shows relation between the table of database aspect of the harvester 100 of single document.
System of the present invention offers system tool and its implementation of a set of complete collection specified documents of user, and user can initiate the flow process that a document gathers.The 1st row part that flow process is shown in Figure 5.
User can judge the document of being collected by previous step judge whether it has the value of including, if not, this flow process stops, otherwise proceeds subsequent treatment.
Then, upload in system temporary library after document being organized into the specified file format that system can identify.In response to the upload request receiving from requestor, used upload file is sent to server end by the mode of document flow, the file branch that meets call format specifying is read and resolved.
The document of submitting to is examined, judged whether its form and content meet the requirements, if undesirable, return file and upload this step requirement processing again of temporary library.If after audit is passed through, be deposited in document information underlying table by point field of the information in specified format file and after conversion, the author of the document is deposited into in author table, (document can have multiple authors, therefore many records have been allowed), wherein underlying table id field is the external key of document information underlying table, and in full (document text can have multiple keywords in antistop list to deposit keyword corresponding document text in document, therefore allowed many records), wherein contents table ID is the outer strong of contents table in full in full.
After aforesaid operations is all successful, specified format file is deposited in document library, and execution result is fed back to requestor.The operation of above-mentioned write into Databasce and document library is all called document memory storage 106 and is realized.
The mark of single document and associated apparatus 102 are one of important component parts of the present invention, be deployed on document classification storage administration Platform Server 10, it mainly realizes following functions: 1, according to different dimensions, default technical term classified and defined, setting up and safeguard the lists of keywords of corresponding professional domain; 2, single document is defined according to different attributes, these association attributeses become the querying condition of system; 3, several document element are set in single document; 4, define issuable logical relation list between any two single documents or document element; 5, realize the association setting of two logical relations between single document by the logical relation kind of having set.
Fig. 2 B shows the mark of single document and the inner structure of associated apparatus 102.In conjunction with Fig. 2 B, the mark of single document and associated apparatus 102 comprise: keyword dimension setting module 1020, key definition module 1022, document element identification module 1023, document classification setting module 1024, document fragment setting module 1026.
In addition, the mark of single document and associated apparatus 102 also comprise: document element arranges module 1021, logic association module 1025.It is several document element by the document fragment combination with same keyword mark of single document that document element arranges module 1021.Logic association module 1025 defines issuable logical relation list between any two single documents, realizes the association of the logical relation between two single documents or document element by the logical relation kind of having set in system.
Keyword dimension setting module 1020 is set the dimension of keyword.Key definition module 1022 connects keyword dimension setting module 1022, and the corresponding keyword of the each dimension of keyword is defined.Document element identification module 1023 carries out the system banner of several keywords to document element.Document classification setting module 1024 according to keyword to the setting of classifying of single document.Document fragment setting module 1026 according to keyword to the setting of classifying of each document fragment of document.
When single document carries out attribute-bit, set up the keyword classification system of multiple dimensions, use keyword document to be carried out to the division of Multi-angle omnibearing.Concrete grammar comprises: each the document fragment for document identifies respectively one group of keyword; In same document, by having, implication is similar, the document fragment of the close keyword of concept is defined as several document element from different dimensions; According to classifying, the thinking of destructing construction sets the logical relation between these document element, and each logical relation is set to an exclusive icon identifies, the picture that the most substantially represents of composition is referred to as graph of a relation between document and represents.For example, derived relation represents: document B writes according to a certain document fragment of document A.Parallel relation represents: the relation between two or more documents of writing for the common a certain document fragment based on document A is parallel document.While setting parallel document, an issuing time left side, residence early, the right side, residence in issuing time evening.
Between document, graph of a relation generally can define respectively in the keyword of several different dimensions and at least select to be no less than the keyword composition of 2 according to different professional domains.
The mark of single document is being received single document being identified and carrying out associative operation after associated order and carry out alternately with user of operator with associated apparatus 102, and the relation between internal database table refers to Fig. 6.
The internal operation flow process of the mark of single document and associated apparatus 102 is referring to shown in Fig. 5 the 2nd row.The knowledge base that comprises at least one data structure that keyword data and document data, document fragment data are associated (antistop list, document information underlying table, document be contents table, document antistop list, document segment contents table, document segment antistop list in full in full) is provided.
System provides the function that keyword dimension is defined, the keyword kind field in correspondence database antistop list.System provides the function that confirmed keyword dimension is edited to concrete keyword, includes but not limited to: the attributes such as keyword title, keyword dimension (kind) are edited, and initiate the flow process of a key definition.
System provides keyword necessity and the each setup of attribute situation function of examining thereof to submitting to, if audit not by, the step of returning concrete keyword editor, if audit by; data are preserved in the antistop list of database.
System provides the function that document is defined respectively to corresponding keyword by document fragment.This function deposits data in document segment contents table, document segment antistop list.The document fragment wherein underlying table id field of document segment contents table is that the paragraph Table I D of the external key document segment antistop list of the self-propagation id field of document information underlying table is the external key of the self-propagation id field of document segment contents table, and keyword id field is the external key of the self-propagation id field of antistop list.
System provides and formally deposits document in document classification storage administration platform database and document library, and carries out the function of issuing operation.
The operation of above-mentioned write into Databasce, document library is all finally to call document memory storage 106 to realize.
Between document, graph of a relation apparatus for establishing 104 is deployed on document classification storage administration Platform Server 10, and it defines graph of a relation between document, and the relation between each ingredient of graph of a relation between document is defined.
As shown in Figure 2 C, between document, graph of a relation apparatus for establishing 104 comprises graph of a relation generation module 1042 between keyword name module 1040 and document.Keyword name module 1040 is named by specific one group of keyword graph of a relation between arbitrary concrete document.Between document, graph of a relation generation module 1042 is for generating graph of a relation between document, comprises between document representing of the pattern identification of the logical relation between the representing of relevant documentation one-element group in graph of a relation, document element, single document element.
Between document, graph of a relation apparatus for establishing 104 is receiving the rear execution of graph of a relation foundation name associative operation between operator's document, and carries out alternately with user, and flow process refers to shown in Fig. 5 the 3rd row, and between database table, relation refers to Fig. 6.
Provide and comprise at least one data structure that graph of a relation data between keyword data and document data, document are associated knowledge base of (antistop list, document information underlying table, document in full contents table, document are related between header table, document the corresponding paragraph table of graph of a relation between graph of a relation base table, document in full between antistop list, document segment contents table, document segment antistop list, document).
In system, providing a set of complete creates and the function of maintenance process graph of a relation between document.System provides a kind of function that defines the involved keyword dimension of graph of a relation between this document.And define and in designed keyword dimension, need corresponding concrete keyword.Deposit data in the document knowledge table of nodding, wherein keyword dimension 1ID~keyword dimension [N] ID is respectively the external key of antistop list self-propagation id field.
System will be listed all qualified documents according to the keyword setting, and by meet several quantity descending sort simultaneously.
System provides a kind of being listed in all qualified documents to filter out the function that meets the document element of graph of a relation concept between this document most.With regard to the logical relation between the document in a certain particular professional field, can be divided into N class (N is natural number) logical relation, such as: derived relation (being that A derives from B), parallel relation/with relation (being that A is parallel with B) or relation (being that A or B all set up), relation of inclusion (being that A comprises B), revision relation (be B to the part of A explain, content revises), (content of B comprises A to covering relation completely, but obviously complete than A, extensively admit in the industry B rather than A, A is covered by B), uncertainty relation (A is contrary with B).
For instance, be divided into 10 chapters in A teaching material, every chapter divides 10 joints.The 4th chapter and the 5th chapter are explained respectively two different attributes of same thing, belong to parallel relation.The 1st chapter and the 1st chapter Section 2 belong to relation of inclusion, and the 1st chapter comprises the 1st chapter Section 2.The 8th chapter Section 3 and the 8th chapter Section 4 set forth two of same thing contrary but not confirmed theory hypothesis all, the former sets up, and the latter is untenable, on the contrary also in this way, both are uncertainty relations.The 9th chapter Section 7 and the 9th chapter Section 8 set forth two of same thing parallel but not confirmed theory hypothesis all, the former sets up with the latter and sets up and do not have to be related to, both are or relation.X chapter in B teaching material is the textual research and explain to A teaching material the 5th chapter, and the former with the latter is derived relation.
At this, system will invest unique pattern identification for the logical relation between each document, and the mark using this specific identifier as the logical relation between two document element in the time showing, so that system user directly understands and identification.
Each single document can be broken down into several document fragments, and each document fragment can be defined as a document element.For any document of a certain professional domain, must have the attribute of more than one technical term in this field, this technical term can be the keyword corresponding with the document unit document fragment by the formal definition of computer system assignment.
For instance, document fragment X and Y are parallel relation, and the keyword that document fragment X is corresponding is A, B, C, D, and the keyword that document fragment Y is corresponding is B, C, D, E, and when searching for B-C-D keyword, system shows that result is B-C-D.
Each document element of choosing can embody with the form recording in graph of a relation base table between document, wherein the knowledge Table I D field of nodding is the external key that is related to the self-propagation id field of header table between document, document underlying table ID is the external key of the self-propagation id field of document information underlying table, and element id field produces automatically according to rule.Specific rules is:
Document element: the numeral that when " PF "+selection element, timestamp is changed;
Derived relation: the numeral that when " PL "+selection element, timestamp is changed;
Parallel relation: the numeral that when " PE "+selection element, timestamp is changed;
Revision relation: the numeral that when " PM "+selection element, timestamp is changed;
Covering relation: the numeral that when " PN "+selection element, timestamp is changed;
Relation of inclusion: the numeral that when " PQ "+selection element, timestamp is changed;
Uncertainty relation: the numeral that when " PT "+selection element, timestamp is changed.
Between sublist document, in the corresponding paragraph table of graph of a relation, need to insert the concrete corresponding paragraph of selected document element simultaneously, wherein between document, graph of a relation base table id field is the external key of graph of a relation base table self-propagation id field between document, and paragraph sequence number field is the external key of the paragraph row sequence number field of document segment contents table.
System provides carries out layout to filtered out document element, the function of the logical relation between these document element is set simultaneously, and this logical relation includes but are not limited to: derived relation, parallel relation, revision relation, covering relation, relation of inclusion, uncertainty relation etc.
The method realizing for: first add the document element of the annexation of wanting to designing in district, adjustment coordinate position; Add being related in design district of required design, system will draw relational graph effect in real time again, and can be according to details such as the position of the mobile adjustment relationship elements of pulling of user, size, thicknesses; The document element element of setting respectively the connection two ends of relationship elements, document element element can only be selected in the two ends of relationship elements, and document element element also can only be coupled together by relationship elements.
Take derived relation as example, between document, in graph of a relation base table, derived relation element need to arrange respectively its upper element ID, lower element ID.Two document element simultaneously that chosen by upper element ID, lower element ID, under will upgrading equally in this table element ID and corresponding on element ID, and skip to it need to be set by the corresponding document element of upper element ID the lower element entity ID that relationship elements is directly connected to, to need to being set by the corresponding document element of lower element ID, it skips the upper element entity ID that relationship elements is directly connected to.The upper element ID here, lower element ID, upper element entity ID, lower element entity ID are the external keys of the element id field in graph of a relation base table between document.
Other as parallel relation, revision relation, covering relation, relation of inclusion, uncertainty relation be all to process by the disposal route identical with derived relation;
Between document, in graph of a relation base table, need for document element to record that its element in design district starts X coordinate, element starts Y coordinate, to design the upper left corner, district as (0,0) point simultaneously.
Between document, in graph of a relation base table, need to record its element in design district starts X coordinate, element and starts that Y coordinate, element finish X coordinate, element finishes Y coordinate for each relationship elements simultaneously, take the design upper left corner, district as (0,0) point, and lines picture flow data.
Wherein lines picture flow data is the details such as size, thickness of finally deciding relationship elements in design district to be converted to very general polar plot png picture format and again converts binary picture flow data to store database into.
System provides the function to graph of a relation is examined between submitted to document, judge whether logical relation between definition and the document element of graph of a relation between the document arranges etc. correct, if incorrect, this step of the establishment of returning graph of a relation between document is re-executed, otherwise audit is by proceeding subsequent treatment.
System provides and formally deposits graph of a relation between document in document classification storage administration platform database and document library, and carries out the function of issuing operation.
The operation of above-mentioned write into Databasce, document library is all finally to call document memory storage 106 to realize.
The host node 106 of document memory storage is deployed on document classification storage administration Platform Server 10, and at the image release 124 of document library platform deploy host node.Document memory storage 106 stores relevant information in the database of document classification storage administration Platform Server into according to calling of graph of a relation apparatus for establishing between the mark of the harvester of single document, single document and associated apparatus, document, the formatted file of appointment is filed in document library Platform Server, and by data switch engine, related data information is transmitted to data between document classification storage administration Platform Server and document library Platform Server.
As shown in Figure 2 D, document memory storage 106 comprises Relational DBMS 1060, document library management system 1061, write operation module 1062, preserves the peaceful platform data transmission module 1064 of operational module 1063.Relational DBMS 1060 is for setting up document classification storage administration platform.Document library management system 1061 is for setting up document library platform.The write operation of calling performing database of write operation module 1062 to each device.Preserve operational module 1063 to each device call file function and preserve graph of a relation file between corresponding single document files or document.Platform data transport module 1064 transmits related data by data switch engine between document classification storage administration Platform Server 10 and document library Platform Server 12.
The interactive approach of document memory storage 106 executing data library storage and document library filing after receiving from the request of other devices.
Refer to Fig. 7, document memory storage 106 provides the knowledge base that comprises at least one data structure that all data of all native system platforms are all associated.Provide and comprise the document library that at least one can file by version specified format file through configuration.Provide and comprise at least a set of complete database call interface, for graph of a relation apparatus for establishing 104 between the mark of the harvester 100 of single document, single document and associated apparatus 102, document, as required.Provide and comprise at least a set of complete document library calling interface, the harvester 100 of the single document of confession, the mark of single document and associated apparatus 102 are used for filing and renewal specified format file.
Provide and comprise at least a set of complete data synchronization mechanism, and calling data switching engine can circulate appropriate data in time between the two at document classification storage administration platform, document library platform.
Document classification storage administration Platform Server 10 carries out the mutual transmission of data by interface routine and document library Platform Server 12, part realizes and refers to Fig. 7.The mode that the data that it sends needs write by far-end is written to document library platform and treats synchronizing signal table and relevant temporary table, then carries out relevant subsequent processing by the interface routine of document library platform.It also also initiatively captures basis the data in return path signal table and synchronous temporary table for the treatment of in document library platform simultaneously.
When carry out various issue operations on document classification storage administration Platform Server 10, include but are not limited to: keyword is issued, single document is issued, between document when graph of a relation issue etc., first will treat that synchronizing signal is written to temporary table, when the performance period starts so that interface routine circulates, carry out follow-up relevant treatment.
Scheduling timing device on document classification storage administration Platform Server 10, according to the time step vector setting, timing cycle executive's interface program, once because interface routine does not complete data transmission work in a time step vector, or because the situations such as abnormal appear in interface routine, possesses the function of intelligent restoration.
Document library Platform Server 12 obtains by interface routine the data that document classification storage administration storehouse Platform Server 10 passes over, and part realizes and refers to Fig. 6.To the related data for the treatment of synchronizing signal table and synchronous temporary table of this platform, the data of target database are upgraded to processing according to the interface routine active push of document classification storage administration Platform Server 10.Simultaneously for including but are not limited to by this platform: the data that the operations such as user behavior information produce capture afterwards and write treats return path signal table and synchronous temporary table, so that the interface routine of document classification storage administration platform carries out subsequent treatment.
In the time that document library Platform Server 12 receives between document the data such as graph of a relation by interface routine, the function that can trigger full-text search engine and rebuild index.
Fig. 8 shows the structure of another embodiment of system of the present invention.The system of the present embodiment is except the document classification storage administration Platform Server and document library Platform Server of the embodiment shown in Fig. 1, also comprised document textual research and explain acquisition platform server, this server and document classification storage administration Platform Server, client access device all have alternately.Figure 10 shows the refined structure of document textual research and explain acquisition platform server, and document textual research and explain acquisition platform server comprises document textual research and explain harvester 160, data acquisition audit device 162, document textual research and explain memory storage 164.And the module identical with Fig. 1 embodiment do not repeat them here.
Document textual research and explain harvester 160 gathers the input data that user is relevant to the explanatory content of document.Data acquisition audit device 162 is examined the input data that collect.Document textual research and explain memory storage 164 will be joined in corresponding original text and be stored by the relevant input data of document explanatory content of audit.
Fig. 9 shows the operational scheme of system.The displaying of document library platform derives from two aspects, is that index is set up in various dimensions key definition and maintenance, up-to-date document collection, document arrangement and the various dimensions definition identical with Fig. 1 embodiment, graph of a relation is set up and maintenance on the one hand; The setting of document textual research and explain collection, document textual research and explain audit and corresponding relation on the other hand.
It should be noted that, in the present invention, can be collected, definition, associated, search for and the document that represents includes but not limited to paper, teaching material, historical document, laws and regulations, training courseware, news and bulletin etc., include but not limited to the multimedia mediums such as word, audio frequency, video, webpage, the knowledge that includes but not limited to a certain particular professional field (can be natural science knowledge, also can be social science knowledge), be also not limited to Chinese or other word.
In addition, the concrete implementation of such scheme also comprises the service architecture system building based on cloud, for example, be deployed in the service such as data query, program updates and file update processing in high in the clouds.
Above embodiment is all based on that on-line documentation describes, and after such scheme of the present invention also can slightly make an amendment, is applied to unit document.System is such as, above to move in single device (unit mode is moved computing machine, handheld device etc.) in the mode of standalone version.Standalone version system comprises: graph of a relation apparatus for establishing, document memory storage, standalone version packing and issuing device and client erecting device between the harvester of single document, the mark of single document and associated apparatus, document.
The harvester of single document for the document of required management type is collected, preparatory processing and system introducing.The harvester of single document further comprises: form sorting module, classified information identification module, file import module.Form sorting module is organized into document the form of appointment.Classified information identification module linking format sorting module, adds preliminary classified information mark on request by formatted file.File imports module link sort message identification module, and the formatted file that has added classified information mark is imported in system.
The mark of single document and associated apparatus are classified and define default technical term according to different dimensions and level, set up and safeguard the lists of keywords of corresponding professional domain, single document is defined according to different attributes and level, several document element are set in single document, document element is carried out to the system banner of several keywords, define issuable logical relation list between any two single documents or document element, and realize the association setting of two logical relations between single document by the logical relation kind of having set.Mark and the associated apparatus of single document further comprise: keyword dimension setting module, key definition module, document classification setting module, document fragment setting module.Keyword dimension setting module is set the dimension of keyword.Key definition module connects keyword dimension setting module, and the corresponding keyword of the each dimension of keyword is defined.Document classification setting module according to keyword to the setting of classifying of single document.Document fragment setting module according to keyword to the setting of classifying of each document fragment of document.In addition, the mark of single document and associated apparatus also comprise: document element arranges module, document element identification module, logic association module.It is several document element by the document fragment combination with same keyword mark of single document that document element arranges module.Document element identification module carries out the system banner of several keywords to document element.Issuable logical relation list between any two single documents of logic association module definition realizes the association of the logical relation between two single documents or document element in system by the logical relation kind of having set.
Between document, graph of a relation apparatus for establishing defines graph of a relation between document, and the relation between each ingredient of graph of a relation between document is defined.Between document, graph of a relation apparatus for establishing further comprises: graph of a relation generation module between keyword name module, document.Keyword name module is named by specific one group of keyword graph of a relation between arbitrary concrete document.Between document, graph of a relation generation module generates graph of a relation between document, comprises representing of the pattern identification of the logical relation between the representing of a series of document unit arranged by the certain logic relation between document element in graph of a relation between document, document element, single document element.
Document memory storage stores relevant information in the database of single device into according to calling of graph of a relation apparatus for establishing between the mark of the harvester of single document, single document and associated apparatus, document, the formatted file of appointment is filed in the database of single device.
The file of specified format after the data of finally preserving by document memory storage and filing is packaged into a complete issue parcel by standalone version packing and issuing device, and generate targetedly distributing device program executable file and supporting ancillary documents according to the difference of target platform.
Client erecting device, by carrying out the program executable file of distributing device, will be issued complete being deployed in single device of parcel, comprising: the file of specified format after the data of finally preserving by document memory storage and filing.
System also comprises document textual research and explain acquisition subsystem, and document textual research and explain acquisition subsystem comprises: document textual research and explain harvester, data acquisition audit device, document textual research and explain memory storage.Document textual research and explain harvester gathers user's input data relevant to the explanatory content of document.Data acquisition audit device is examined the input data that collect.Document textual research and explain memory storage will be stored by the relevant input data link of document explanatory content of audit in graph of a relation between corresponding document or document.
In the embodiment of standalone version, logical relation between document includes but not limited to derived relation, parallel relation or relation, logical relation with relation, relation of inclusion, revision relation, covering relation, uncertainty relation, wherein unique icon in the logical relation correspondence system between each document.Document includes but not limited to paper, teaching material, historical document, laws and regulations, training courseware, news and bulletin, includes but not limited to the multimedia medium of word, audio frequency, video, webpage.
Above-described embodiment is available to those of ordinary skills and realizes and use of the present invention; those of ordinary skills can be without departing from the present invention in the case of the inventive idea; above-described embodiment is made to various modifications or variation; thereby protection scope of the present invention do not limit by above-described embodiment, and it should be the maximum magnitude that meets the inventive features that claims mention.

Claims (18)

1. the collection of a document, mark and associated system, comprise document classification storage administration Platform Server and document library Platform Server, wherein document classification storage administration Platform Server comprises graph of a relation apparatus for establishing between the mark of the harvester of single document, single document and associated apparatus and document, the host node of document memory storage is deployed on document classification storage administration Platform Server, the image release of the host node of document memory storage is deployed on document library Platform Server, wherein:
The harvester of single document, for collecting the document of required management type, preparatory processing and system introducing;
The mark of single document and associated apparatus, according to different dimensions and level, default technical term is classified and defined, set up and safeguard the lists of keywords of corresponding professional domain, single document is defined according to different attributes and level, several document element are set in single document, document element is carried out to the system banner of several keywords, define issuable logical relation list between any two single documents or document element, and realize the association setting of two logical relations between single document by the logical relation kind of having set,
Graph of a relation apparatus for establishing between document, defines graph of a relation between document, and the relation between each ingredient of graph of a relation between document is defined;
Document memory storage, according to calling of graph of a relation apparatus for establishing between the mark of the harvester of single document, single document and associated apparatus, document, relevant information is stored in the database of document classification storage administration Platform Server, the formatted file of appointment is filed in document library Platform Server, and by data switch engine, related data information is transmitted to data between document classification storage administration Platform Server and document library Platform Server.
2. the collection of document according to claim 1, mark and associated system, is characterized in that, the harvester of single document further comprises:
Form sorting module, is organized into document the form of appointment;
Classified information identification module, linking format sorting module, adds preliminary classified information mark on request by formatted file;
File imports module, and link sort message identification module imports to the formatted file that has added classified information mark in system.
3. the collection of document according to claim 2, mark and associated system, is characterized in that, mark and the associated apparatus of single document further comprise:
Keyword dimension setting module, sets the dimension of keyword;
Key definition module, connects keyword dimension setting module, and the corresponding keyword of the each dimension of keyword is defined;
Document classification setting module, according to keyword to the setting of classifying of single document;
Document fragment setting module, according to keyword to the setting of classifying of each document fragment of document.
4. the collection of document according to claim 3, mark and associated system, is characterized in that, mark and the associated apparatus of single document also comprise:
Document element arranges module, is several document element by the document fragment combination with same keyword mark of single document;
Document element identification module, carries out the system banner of several keywords to document element;
Logic association module, defines issuable logical relation list between any two single documents, realizes the association of the logical relation between two single documents or document element by the logical relation kind of having set in system.
5. the collection of document according to claim 4, mark and associated system, is characterized in that, between document, graph of a relation apparatus for establishing further comprises:
Keyword name module, names by specific one group of keyword graph of a relation between arbitrary concrete document;
Graph of a relation generation module between document, generate graph of a relation between document, comprise representing of the pattern identification of the logical relation between the representing of a series of document unit arranged by the certain logic relation between document element in graph of a relation between document, document element, single document element.
6. the collection of document according to claim 5, mark and associated system, is characterized in that, document memory storage further comprises:
Relational DBMS, for setting up document classification storage administration platform;
Document library management system, for setting up document library platform;
Write operation module, to the write operation of calling performing database of each device;
Preserve operational module, to each device call file function and preserve graph of a relation file between corresponding single document files or document;
Platform data transport module transmits related data by data switch engine between document classification storage administration Platform Server and document library Platform Server.
7. the collection of document according to claim 1, mark and associated system, is characterized in that, system also comprises document textual research and explain acquisition platform server, comprising:
Document textual research and explain harvester, gathers user's input data relevant to the explanatory content of document;
Data acquisition audit device, examines the input data that collect;
Document textual research and explain memory storage will be stored by the relevant input data link of document explanatory content of audit in graph of a relation between corresponding document or document.
8. the collection of document according to claim 1, mark and associated system, it is characterized in that, logical relation between document includes but not limited to derived relation, parallel relation or relation, logical relation with relation, relation of inclusion, revision relation, covering relation, uncertainty relation, wherein unique icon in the logical relation correspondence system between each document.
9. the collection of document according to claim 1, mark and associated system, it is characterized in that, the services such as concrete implementation also comprises the service architecture system building based on cloud, data query, program updates and the file update processing in realization such as high in the clouds.
10. the collection of document according to claim 1, mark and associated system, it is characterized in that, document includes but not limited to paper, teaching material, historical document, laws and regulations, training courseware, news and bulletin, includes but not limited to the multimedia medium of word, audio frequency, video, webpage.
Collection, mark and the associated system of 11. 1 kinds of documents, move in single device in the mode of standalone version, comprising:
The harvester of single document, for collecting the document of required management type, preparatory processing and system introducing;
The mark of single document and associated apparatus, according to different dimensions and level, default technical term is classified and defined, set up and safeguard the lists of keywords of corresponding professional domain, single document is defined according to different attributes and level, several document element are set in single document, document element is carried out to the system banner of several keywords, define issuable logical relation list between any two single documents or document element, and realize the association setting of two logical relations between single document by the logical relation kind of having set,
Graph of a relation apparatus for establishing between document, defines graph of a relation between document, and the relation between each ingredient of graph of a relation between document is defined;
Document memory storage, according to calling of graph of a relation apparatus for establishing between the mark of the harvester of single document, single document and associated apparatus, document, relevant information is stored in the database of single device, the formatted file of appointment is filed in the database of single device;
Standalone version packing and issuing device, the file of specified format after the data of finally preserving by document memory storage and filing is packaged into a complete issue parcel, and generates targetedly distributing device program executable file and supporting ancillary documents according to the difference of target platform;
Client erecting device, by carrying out the program executable file of distributing device, will issue complete being deployed in single device of parcel, comprising: the file of specified format after the data of finally preserving by document memory storage and filing.
Collection, mark and the associated system of 12. documents according to claim 11, is characterized in that, the harvester of single document further comprises:
Form sorting module, is organized into document the form of appointment;
Classified information identification module, linking format sorting module, adds preliminary classified information mark on request by formatted file;
File imports module, and link sort message identification module imports to the formatted file that has added classified information mark in system.
Collection, mark and the associated system of 13. documents according to claim 12, is characterized in that, mark and the associated apparatus of single document further comprise:
Keyword dimension setting module, sets the dimension of keyword;
Key definition module, connects keyword dimension setting module, and the corresponding keyword of the each dimension of keyword is defined;
Document classification setting module, according to keyword to the setting of classifying of single document;
Document fragment setting module, according to keyword to the setting of classifying of each document fragment of document.
Collection, mark and the associated system of 14. documents according to claim 13, is characterized in that, mark and the associated apparatus of single document also comprise:
Document element arranges module, is several document element by the document fragment combination with same keyword mark of single document;
Document element identification module, carries out the system banner of several keywords to document element;
Logic association module, defines issuable logical relation list between any two single documents, realizes the association of the logical relation between two single documents or document element by the logical relation kind of having set in system.
Collection, mark and the associated system of 15. documents according to claim 14, is characterized in that, between document, graph of a relation apparatus for establishing further comprises:
Keyword name module, names by specific one group of keyword graph of a relation between arbitrary concrete document;
Graph of a relation generation module between document, generate graph of a relation between document, comprise representing of the pattern identification of the logical relation between the representing of a series of document unit arranged by the certain logic relation between document element in graph of a relation between document, document element, single document element.
Collection, mark and the associated system of 16. documents according to claim 11, is characterized in that, system also comprises document textual research and explain acquisition subsystem, comprising:
Document textual research and explain harvester, gathers user's input data relevant to the explanatory content of document;
Data acquisition audit device, examines the input data that collect;
Document textual research and explain memory storage will be stored by the relevant input data link of document explanatory content of audit in graph of a relation between corresponding document or document.
Collection, mark and the associated system of 17. documents according to claim 11, it is characterized in that, logical relation between document includes but not limited to derived relation, parallel relation or relation, logical relation with relation, relation of inclusion, revision relation, covering relation, uncertainty relation, wherein unique icon in the logical relation correspondence system between each document.
Collection, mark and the associated system of 18. documents according to claim 11, it is characterized in that, document includes but not limited to paper, teaching material, historical document, laws and regulations, training courseware, news and bulletin, includes but not limited to the multimedia medium of word, audio frequency, video, webpage.
CN201310006234.2A 2013-01-08 2013-01-08 The collection of document, the system identifying and associating Expired - Fee Related CN103914487B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310006234.2A CN103914487B (en) 2013-01-08 2013-01-08 The collection of document, the system identifying and associating

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310006234.2A CN103914487B (en) 2013-01-08 2013-01-08 The collection of document, the system identifying and associating

Publications (2)

Publication Number Publication Date
CN103914487A true CN103914487A (en) 2014-07-09
CN103914487B CN103914487B (en) 2016-12-28

Family

ID=51040178

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310006234.2A Expired - Fee Related CN103914487B (en) 2013-01-08 2013-01-08 The collection of document, the system identifying and associating

Country Status (1)

Country Link
CN (1) CN103914487B (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104765830A (en) * 2015-04-13 2015-07-08 天脉聚源(北京)传媒科技有限公司 Information searching method and device
CN104765833A (en) * 2015-04-13 2015-07-08 天脉聚源(北京)传媒科技有限公司 Word association table generating method and device
CN104765828A (en) * 2015-04-13 2015-07-08 天脉聚源(北京)传媒科技有限公司 Dictionary data sheet generating method and device and dictionary data sheet application method and device
CN104765831A (en) * 2015-04-13 2015-07-08 天脉聚源(北京)传媒科技有限公司 Dictionary sheet generating method and device and dictionary sheet application method and device
WO2015176526A1 (en) * 2014-05-23 2015-11-26 邓寅生 Superimposed-relationship-based document identification, association, search, and display system
CN106469214A (en) * 2016-09-06 2017-03-01 北京百度网讯科技有限公司 Information demonstrating method based on artificial intelligence and device
CN109101512A (en) * 2017-06-21 2018-12-28 北京国双科技有限公司 The construction method of law databases, law data query method and device

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090144279A1 (en) * 2007-12-03 2009-06-04 Fast Search & Transfer Asa Method for improving search efficiency in enterprise search system
CN101566991A (en) * 2008-04-25 2009-10-28 张宝永 Method and system for improving function of computer for searching professional information
CN102855227A (en) * 2012-09-12 2013-01-02 汉柏科技有限公司 Document processing system and method

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090144279A1 (en) * 2007-12-03 2009-06-04 Fast Search & Transfer Asa Method for improving search efficiency in enterprise search system
CN101566991A (en) * 2008-04-25 2009-10-28 张宝永 Method and system for improving function of computer for searching professional information
CN102855227A (en) * 2012-09-12 2013-01-02 汉柏科技有限公司 Document processing system and method

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
张明宝等: "一种改进的基于文档结构的信息检索方法", 《信息系统》 *
贾西平等: "基于主题的文档检索模型", 《华南理工大学学报(自然科学版)》 *

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2015176526A1 (en) * 2014-05-23 2015-11-26 邓寅生 Superimposed-relationship-based document identification, association, search, and display system
US10719560B2 (en) 2014-05-23 2020-07-21 Yinsheng DENG System for identifying, associating, searching and presenting documents based on relation combination
CN104765828B (en) * 2015-04-13 2018-06-19 天脉聚源(北京)传媒科技有限公司 A kind of generation of dictionary data table and application process and device
CN104765831A (en) * 2015-04-13 2015-07-08 天脉聚源(北京)传媒科技有限公司 Dictionary sheet generating method and device and dictionary sheet application method and device
CN104765828A (en) * 2015-04-13 2015-07-08 天脉聚源(北京)传媒科技有限公司 Dictionary data sheet generating method and device and dictionary data sheet application method and device
CN104765830A (en) * 2015-04-13 2015-07-08 天脉聚源(北京)传媒科技有限公司 Information searching method and device
CN104765831B (en) * 2015-04-13 2018-06-19 天脉聚源(北京)传媒科技有限公司 A kind of generation of dictionary sheet and its application process and device
CN104765833B (en) * 2015-04-13 2018-06-19 天脉聚源(北京)传媒科技有限公司 A kind of generation method and device of word association table
CN104765830B (en) * 2015-04-13 2018-11-20 天脉聚源(北京)传媒科技有限公司 A kind of information search method and device
CN104765833A (en) * 2015-04-13 2015-07-08 天脉聚源(北京)传媒科技有限公司 Word association table generating method and device
CN106469214A (en) * 2016-09-06 2017-03-01 北京百度网讯科技有限公司 Information demonstrating method based on artificial intelligence and device
CN106469214B (en) * 2016-09-06 2019-10-15 北京百度网讯科技有限公司 Information demonstrating method and device based on artificial intelligence
CN109101512A (en) * 2017-06-21 2018-12-28 北京国双科技有限公司 The construction method of law databases, law data query method and device

Also Published As

Publication number Publication date
CN103914487B (en) 2016-12-28

Similar Documents

Publication Publication Date Title
CN105095320B (en) The mark of document based on relationship stack combinations, association, the system searched for and showed
CN105095319B (en) The mark of document based on time series, association, the system searched for and showed
CN103914487A (en) Document collection, identification and association system
US11899681B2 (en) Knowledge graph building method, electronic apparatus and non-transitory computer readable storage medium
CN103914488A (en) Document collection, identification, association, search and display system
CN102279894A (en) Method for searching, integrating and providing comment information based on semantics and searching system
CN108509405A (en) A kind of generation method of PowerPoint, device and equipment
CN100576209C (en) Associated data index, retrieval, store and present the control information system constituting method
CN103914486A (en) Document search and display system
CN105930479A (en) Data skew processing method and apparatus
CN102567423B (en) Method and system for associated search of poetry
CN103942268A (en) Method and device for combining search and application and application interface
US11928083B2 (en) Determining collaboration recommendations from file path information
KR101955376B1 (en) Processing method for a relational query in distributed stream processing engine based on shared-nothing architecture, recording medium and device for performing the method
US20200311151A1 (en) Document structures for searching within and across messages
CN103809915B (en) The reading/writing method of a kind of disk file and device
CN113407678B (en) Knowledge graph construction method, device and equipment
CN105183736A (en) Universal searching system according to network equipment configuration and state information, and universal searching method thereof
Castellano et al. A web text mining flexible architecture
CN102508828A (en) Method for finding path relationship of graph based on multiple agent routes
CN114218114B (en) Full-automatic test data generation method based on interface flow arrangement
CN104298685A (en) Method and device for achieving heterogeneous system unified searching
Lang et al. The next-generation search engine: Challenges and key technologies
Rettberg et al. Mining the Knowledge Base: Exploring Methodologies for Analysing the Field of Electronic Literature
CN113987146B (en) Dedicated intelligent question-answering system of electric power intranet

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20161228

Termination date: 20220108

CF01 Termination of patent right due to non-payment of annual fee