EP0985184A1 - Knowledge brokers using signed feature constraints - Google Patents

Knowledge brokers using signed feature constraints

Info

Publication number
EP0985184A1
EP0985184A1 EP98917545A EP98917545A EP0985184A1 EP 0985184 A1 EP0985184 A1 EP 0985184A1 EP 98917545 A EP98917545 A EP 98917545A EP 98917545 A EP98917545 A EP 98917545A EP 0985184 A1 EP0985184 A1 EP 0985184A1
Authority
EP
European Patent Office
Prior art keywords
document
feature constraint
feature
entity
data processing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
EP98917545A
Other languages
German (de)
French (fr)
Inventor
Jean-Marc Andreoli
Uwe Borghoff
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xerox Corp
Original Assignee
Xerox Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xerox Corp filed Critical Xerox Corp
Publication of EP0985184A1 publication Critical patent/EP0985184A1/en
Ceased legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F15/00Digital computers in general; Data processing equipment in general
    • G06F15/02Digital computers in general; Data processing equipment in general manually operated with input through keyboard and computation using a built-in program, e.g. pocket calculators
    • G06F15/0225User interface arrangements, e.g. keyboard, display; Interfaces to other computer systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F16/3322Query formulation using system suggestions

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • Computer Hardware Design (AREA)
  • Human Computer Interaction (AREA)
  • Mathematical Physics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A method of implementing a knowledge broker software agent which processes knowledge requests expressed in terms of feature constraints, if necessary decomposing the requests into subrequests. The invention employs a subset of feature constraints, called 'signed feature constraints' (SFC), each feature constraint having at least a positive component and a negative component, each of the positive component and the negative component including one or more relations, such as sort relations and feature relations. An advantage is that SFCs can be used in knowledge retrieval engines to specify, in a common language, (i) knowledge search requests, (ii) the answers to these requests and (iii) the state of the knowledge retrieving agents (referred to herein as knowledge brokers). A method for solving SFCs is also disclosed.

Description

KNOWLEDGE BROKERS USING SIGNED FEATURE CONSTRAINTS
The present invention relates to data processing, and more particularly relates to the transfer between computing devices, and the retrieval by such devices, of information or knowledge using signed feature constraints.
With widespread availability of new electronic sources of information, such as e-mail, internet access and on-line information repositories, the number of electronic documents available to a computer user is multiplying. Documents can also be built dynamically by accessing and combining information existing over distributed sources. Hierarchical mark-up languages such as SGML can be used to define document templates that can be dynamically filled in with heterogeneous components. These documents can in turn be made permanent by storing them in document management systems, thus entering them in the normal document lifecycle.
Attempts at standardisation have led to the Document Management Alliance (DMA) industry standard, concerned with search, retrieval, storage and conversion of electronic documents on heterogeneous document management systems.
In implementing a system a knowledge brokerage system for carrying out search and retrieval in accordance with such a standard, knowledge brokers may be used. Brokers are software agents, which can process knowledge search requests. Knowledge is taken here to be any piece of electronic information intended to be publicly accessible. Different, possibly distributed, information sources are assumed to be available, from a simple file in a user's directory to a database local to a site, up to a wide area information service (WAIS) on the internet, for example.
When receiving a request, a broker may have sufficient knowledge to process it, or may need to retrieve more knowledge. For that purpose, it releases sub-requests, aimed at other brokers. Thus, knowledge retrieval is achieved by the collaboration of all thebrokers that are alternatively service providers processing requests and clients of these services generating sub-requests.
In order to collaborate, the brokers must at least understand each other. This means that all the requests must be formulated in a common language (and also all the answers to the requests), even if the brokers may perform local translations. Logic provides the adequate language for such a purpose. A request can be expressed by a pair (x, P) where x is a logical variable and P a logical formula involving x, meaning "Retrieve knowledge objects x such that the property expressed by formula P holds". Interestingly, an answer to such a request can be expressed in the same formalism, i.e. a pair(x, Q) meaning "77?ere exists a knowledge object x satisfying the property expressed by formula Q". The requirement here is that P must be a logical consequence of Q, so that the answer contains at least as much knowledge as the request. Moreover, the same logical formalism can be used to capture the scope of a broker, i.e. the area of knowledge it is concerned with: a broker with scope (x, R) means "/ am not capable of retrieving knowledge objects x which do not satisfy the property expressed by formula R". In many situations, the scope of a broker may vary, because it gets specialised or, on the contrary, expands its capacities, either externally or due to the knowledge retrieval process itself. In other words, logic provides a common languagewhere requests, answers and scopes can be expressed. Brokers then perform logical operations on these three components. The most important logical operation, from which all the others can be reconstructed, is satisfiability checking, i.e. deciding whether some object could satisfy the property expressed by a formula, or, on the contrary, whether it is intrinsically contradictory. Unfortunately, it is well known that this operation, forft// Classical Logic, is not algorithmic, i.e. it is provabiy impossible to write a program which implements it and always terminates. Given this limitation, a lot of research in knowledge representation has been focused on identifying fragments of Classical Logic in which satisfiability is algorithmically decidable. The trade-off here is between expressive power and tractability: the empty fragment, for example, is obviously tractable, but it is not very expressive.
The most popular fragment which emerged is known as "feature constraints" (FCs). The satisfiability problem in this case is also known as "feature constraint solving".
As is known, feature constraints are built from atomic constraints, which are either sorts or features. A sort is a unary relation, expressing a property of a single entity. For example, P: person expresses that an entity P is of sort person. A feature is a binary relation expressing a property linking two entities. For example, P: emploγer->E expresses that entity p has an employer, which is an entity E. Apart from sorts and features, most feature systems also allow built-in relations such as equality and disequality.
The full fragment of feature constraints, where the atomic components mentioned above are allowed to be combined by all the logical connectives (conjunction, disjunction, negation and quantifiers), although very expressive, is hardly tractable. A subfragment called "basic feature constraints" (BFC) has been considered, where negation and disjunction are simply forbidden. Efficient constraint solving algorithms have been proposed for this sub-fragment. However, a drawback is that the complete absence of negation puts strong limitations on the kind of operations a knowledge broker may wish to perform.
There is therefore a need for techniques that avoid the abovementioned problems and provide tractable solutions. It would be desirable to have a system that provided brokers with a more powerful set of available operations.
The present invention provides a method carried out in a data processing device including a processor, memory, and a user interface, the data processing device being couple in a network to one or more other data processing devices, at least one of the data processing devices including means for storing a repository of electronic documents, comprising: (a) receiving at least one user input designating a feature constraint, said feature constraint comprising at least a positive component and a negative component, each of the positive component and the negative component including one or more relations, the or each relation defining a document related entity and a property of the entity, (b) solving the feature constraint to determine from the positive and negative components one or more document references, the or each document reference corresponding to a document within said repository satisfying said feature constraint.
The invention further provides a method carried out in a data processing device including a processor, memory, and a user interface, comprising: (i) receiving a first user input designating a graphical object corresponding to a stored feature constraint, said feature constraint comprising at least a positive component and a negative component, each of the positive component and the negative component including one or more relations, the or each relation defining a document related entity and a property of the entity, ) receiving a second user input indicating that the feature constraint is to be sent to another data processing device, (k) encoding the feature constraint in a data packet, and (I) transmitting the data packet.
In each case, the method may include retrieving knowledge from a repository stored in a data processing device at each of a plurality of locations, and optionally combining each piece of knowledge so obtained to generate a new document.
The invention further provides a data processing device when suitably programmed for carrying out the methods as set forth above, or according to any of the appended claims, the device comprising a processor, a memory, and a user interface.
The invention further provides a data processing device comprising: a processor, a memory coupled to the processor, and a user interface coupled to the processor and to the memory and adapted to be operable by a user to generate user inputs, the data processing device being couple in a network to one or more other data processing devices, at least one of the data processing devices including means for storing a repository of electronic documents, the data processing device further comprising means for receiving at least one user input designating a feature constraint, said feature constraint comprising at least a positive component and a negative component, each of the positive component and the negative component including one or more relations, the or each relation defining a document related entity and a property of the entity, means for solving the feature constraint to determine from the positive and negative components one or more document references, the or each document reference corresponding to a document within said repository satisfying said feature constraint.
The invention further provides a system for accessing or distributing electronic documents, according to claim 19 of the appended claims.
The invention further provides a portable device for accessing or distributing electronic documents, according to claim 20 of the appended claims.
The invention further provides a apparatus for scanning, copying and/or printing documents, according to claim 21 of the appended claims.
The invention employs a subset of feature constraints — "signed feature constraints" (SFC) — and a method for solving SFCs. An advantage is that SFCs can be used in knowledge retrieval engines to specify, in a common language, (i) knowledge search requests, (ii) the answers to these requests and (iii) the state of the knowledge retrieving agents (referred to herein as knowledge brokers).
The infra-structure required to support collaboration, and the way knowledge is stored locally within each broker, may be in accordance with the model disclosed in Andreoli et al (1996), The Constraint-Based Knowledge Broker Model: Semantics, Implementation and Analysis, J. Symbolic Computation). The following discussion addresses rather the knowledge manipulations occurring within each broker. Embodiments of the invention will now be described, by way of example, with reference to the accompanying drawings, in which:
Figure 1 illustrates schematically a data processing network that may be used to implement an embodiment of the invention;
Figure 2 illustrates schematically the scope defined by a signed feature constraint;
Figure 3 is a view of the user interface of a fixed computing device at one instant during the entry by a user of a query;
Figure 4 shows a schematic flow chart of the steps in entering elements of a query using the interface of Fig. 3;
Figure 5 illustrates a paper form suitable for use by a user in an alternative embodiment of the invention, for entering a query;
Figure 6 shows a schematic flow chart of the steps in entering elements of a query using the paper form of Fig. 5;
Figure 7 is a schematic flow chart of the steps in using a feature constraint to retrieve document references and display or print corresponding documents;
Figure 8 illustrates a portion of a list of hits obtained during the process of Fig. 7;
Figure 9 shows selected hits from the list of Fig. 8 after transformation into HTML format; and
Figure 10 illustrates a more detailed presentation of a single selected hit.
1. System hardware
It will be appreciated that the present invention may be implemented using conventional computer network technology, either using a local area network (LAN) or, more suitably, a wide area network (WAN). The invention has been implemented using conventional web browser software (e.g. Netscape) providing cross-platform communication and document transfer over the internet. This is schematically illustrated in Fig. 1. It will be appreciated that each machine 2, 4, 6 forming part of the network 21 may be a PC running Windows™, a Mac running MacOS, or a minicomputer running UNIX, which are well known in the art. For example, the PC hardware configuration is discussed in detail inTAie Art of Electronics, 2nd Edn, Ch. 10, P. Horowitz and W. Hill, Cambridge University Press, 1989. However, it will be appreciated that the invention may be implemented using different system configurations: see EP-A-691 ,619 (hereafter "EP'619"). [Exemplary network configurations are discussed in detail in, for example, EP-A-772,857 and EP-A- (corresponding to US application S.N. 08/668,704).]
Upon request of a user at machine 22 a document stored on machine 26 may be retrieved and sent from machine 26 over the internet, via any number of intermediate machine 24 to machine 22. As is well known, the document may be retrieved using as a unique identifier its World Wide Web URL, as discussed in EP'619. Preferably also connected to the network 21 is any number of printers or multifunction devices (capable of scanning/printing/faxing, etc.) (not shown), as discussed in EP'619. Multifunction devices are discussed in more detail in EP-A-741 ,487. Each machine coupled to the network may be equipped with appropriate hardware and software, which is known in the art, for communication with portable computing devices, such as personal digital assistants (PDAs), handheld PCs, or pocket or wristwatch computers. In this way, the requesting machine may generate a request in response to receiving a data packet from a user of a portable computing device, as discussed in further detail in international patent application WO-A- , based on British patent application 9708175.6
(reference R/97005), filed concurrently herewith.
2. Principles of feature constraints
As mentioned above, the full fragment of feature constraints, where the atomic components mentioned above are allowed to be combined by all the logical connectives (conjunction, disjunction, negation and quantifiers), although very expressive, is hardly tractable. A subfragment called "basic feature constraints" (BFC) has been considered, where negation and disjunction are simply forbidden. Efficient constraint solving algorithms have been proposed for this sub-fragment. However, a drawback is that the complete absence of negation puts strong limitations on the kind of operations a knowledge broker may wish to perform.
In preferred embodiments, the present invention makes use of a powerful operation, referred to as "scope-splitting", which relies on the use of negation. Indeed, a broker may wish to split its scope, specified by a pair (x, P) according to a criterion expressed by a formula F, thus creating two brokers with scope P Λ F and P Λ-, F. Thus, a broker in charge of bibliographic information may wish to split its scope into two new scopes: "books written after 1950", which can be represented by the BFC x
X : book
X : year -> ϊ
Y > 1950 and its complement, i.e. "books written before 1950 or documents which are not books"; this latter scope cannot be expressed using BFC, because negation and disjunction cannot be dispensed with. It has been discovered that the scope splitting operation is useful in many situations, for example to implement brokers capable of memorising and re-using information gathered during their lifetime. Embodiments of the present invention make use of, on the one hand, a fragment of feature constraints, called "signed feature constraints" (SFC), which allows limited use of negation, precisely capable of expressing the kind of split scope mentioned above, and on the other hand, an efficient constraint solving method for SFC.
2.1 Signed Feature Constraints
A signed feature constraint is composed of a positive part and a list of negative parts, both of them being basic feature constraints. For example, the following signed feature constraint p
+ P : : person ,
P : : eπιployer-> E ,
Ξ . : "Xerox"
- P : : nationality-> N,
N : : "American" - P : spouse-> '
P' : person
P' : employer-> E1
E1 : "Xerox" specifies a Xerox employee who is not American and is not married to another Xerox employee. This is represent it graphically as in Fig. 2. The round boxes denote the entities (logical variables), the sort relations (unary) are represented by dashed arrows labelled by the name of the sort in a square box, the feature relations (binary) are represented by plain arrows labelled by the name of the feature in a square box. The built-in predicates (not present in the example) are represented by rhombuses. The positive part of the SFC is contained in the top box and marks the distinguished entity of the scope (p in the example) by a double round box. The negative parts of the SFC are contained in the lower boxes in grey.
The main interest of SFC comes from the following property:
If the scope of a broker is represented by an SFC e0, and this scope is split by a BFC e, then the two resulting split scopes e*, e' are both SFC. Indeed, e+ is obtained by merging the positive part of e0 with the BFC e; and e" is obtained by extending e0 with a new negative part containing e alone. For example, assume a broker in charge of a bibliographical database containing various documents (books, videos etc.) about Art, but not authored by an American. It is represented by the SFC x
+X : topic-> T
T : "Art"
-X : author-> A
A : nationality-> N
N : "American"
It may be split by the constraint "books written after 1950", expressed by the BFC x x : book
X : year-> Y
Y > 1950
The resulting scopes are simply x
+X : book X : topic-> T X : year-> Y T : "Art"
Y > 1950
-X : author-> A A : nationality-> N N : "American" i.e. "Art books written after 1950 but not by an American author" and X
+X : topic-> T
T : "Art"
-X : author-> A
A : nationality-> N
N : "American"
-X : book
X : year-> Y
Y > 1950 i.e. "Art documents not authored by an American but not books subsequent to 1950".
2.2 Solving Signed Feature Constraints
Most constraint systems make a number of assumptions on the nature of sorts and features, called the axioms of the systems. These axioms are crucial to the satisfiability algorithm, since they determine when a feature constraint is contradictory and when it is satisfiable.
2.2.1 Feature Axioms
For the purpose of simplicity, the embodiment disclosed here makes use of a slight variant of the basic axiom system used in Aϊt-Kaci H. et a/. (1994), A Feature-Based Constraint-System for Logic Programming with -Entailment, Theoretical Computer Science 122, pp. 263-283, although it will be appreciated by persons skilled in the art that the principles of the method apply to other sets of axioms as well.
1. Features are functional: this means that if two pairs of entities that are constrained by the same feature have the same first term, they also have the same second term. For example, it can be considered that the feature spouse is functional (within a specific cultural setting), meaning that a person cannot have two spouses: if, for a person x, we have x: spouse->γ and x: spouse->z, then the entities Y and z coincide (i.e. denote the same person). Other systems allow multi-valued features.
2. Sorts are disjoint: this means that no entity can be of two distinct sorts. For example, a book is not a person: we cannot have an entity x with x:book and X: erson. Other systems consider hierarchies of sorts where some entities can have multiple sorts as long as they have a common denominator in the hierarchy.
3. There is a distinguished subset of sorts, called "value" sorts, so that no two distinct entities can be of the same value sort. Traditional basic elements (strings, numbers, etc.) are typical value sorts: for example, the string "Xerox" or the number 1950 are value sorts. Value sorts are not allowed to have features: this is the only axiom connecting sorts and features. Other systems consider more refined connections between sorts and features.
4. There is a distinguished built-in binary predicate, equality, with the traditional congruence axioms (which involve sorts and features). The axioms describing all the other built-in predicates are assumed to contain no mention of sorts and features. These axioms are formally written in section A: Axioms in the Appendix at the end of this disclosure. They form a theory T.
2.2.2 Constraint Satisfaction
First, it is assumed that satisfiability over built-in predicates is decidable. This means that there is an algorithm which, given a formula F using only built-in predicates (F is also called a built-in constraint), can decide whether F is a logical consequence of the theory T (written \-τ F).
Constraint satisfaction over BFCs is defined by a set of conditional rewrite rules over BFCs (section B.1 of the Appendix) which have the following properties
(a) The system of rules is convergent and hence defines a "normal form" for BFCs. This can be shown in a classical way by proving that the system is "Church-Rosser" (critical pairs converge) and "Noetherian" (the size of the terms strictly decrease by rewriting).
(b) A BFC is satisfiable if and only if its normal form is not reduced to the contradiction. One implication can be proved by showing that rewrite steps preserve satisfiability. The reverse implication can be proved by displaying a model that satisfies BFCs whose normal form is not reduced to the contradiction.
Thus the rewrite rules describe the steps of the constraint satisfaction algorithm. This algorithm always terminates because the system of rewrite rules is convergent. It is to be noted that the definition of the rules relies on satisfiability tests of built-in constraints, which has been assumed decidable. This means that the algorithm is modular and can accommodate any kind of built-in constraints as long as a proper built-in constraint satisfaction algorithm is provided.
Using rewrite rules for constraint satisfaction algorithm can be implemented in a naive way in some symbolic language like Lisp or Prolog, or can be optimised, taking into account the properties of the specific built-in constraints which are used.
The algorithm for constraint satisfaction over SFCs (section B.2 of the Appendix) can informally be described as follows. Given an SFC, its positive component is first normalised by the algorithm for BFCs. If the result is a contradiction, the whole SFC is unsatisfiable. Otherwise, the positive component (normalised) is inserted in each of the negative components, which are then normalised by the algorithm for BFCs. If a resulting negative component has a contradictory normal form, it is eliminated, and if it has a tautological normal form the whole SFC is unsatisfiable. The normal form for SFCs thus obtained has the following property:
An SFC is satisfiable if and only if its normal form is not reduced to the contradiction. As in the previous case, the difficult part of the implication can be proved using model theory.
3. User transactions with feature constraints
Figure 3 is a view of the user interface of a fixed computing device at one instant during the entry by a user of a query. Within a dedicated window a main query entry box 30 is displayed, in a form well known in the art. For example, the user may be interested in entering a query along the lines 'books or articles after 1990 in which the title contains "constraints" but does not contain "internet"'. The query box 30 includes boxes 31 , 32 which the user can select by mouse inputs, and can use to type in, or complete elements (e.g. "books/articles") of the query. Buttons 33 may be used to select document- related entities, such as "title" and a constraint applying to it, such as "contains not". Additional buttons 34, 36 allow the user to restart, add to, edit and build up a query. Each element of the query is gradually added to the current specification of the query, which is displayed in its current state in box 37. When the specification of the query is completed, button 38 is pressed to launch the search.
Figure 4 shows a schematic flow chart of the steps in entering elements (e.g. date after 90) of a query using the interface of Fig. 3. Initially, the knowledge broker main query window is displayed (step s41). The elements of the query are then received in turn as they are keyed in by the user (s42). For each query element confirmed by the user, the "current specification" is updated to include it any displayed (s43). On completion of the specification of the query, and selection by the user of the "Submit" button in Fig. 3, each query element is converted (s44) to its corresponding logical relations) - see section 2 above. The feature constraint is then compiled (s45) from the set of logical relations.
Figure 5 illustrates a paper form 50 suitable for use for entering a query by a user in an alternative embodiment of the invention. This embodiment is suitable for the user of a multifunction device, or a user having a scanner coupled to a computer. The form 50 used has several sections 51 , 52, 53, 54, enabling the user to enter information about the type of document, author's name, date, and topic; however, it will be appreciated that any number of sections may be used, for entering any kind of information that a user may expect to have about a document. In this case, next to each option indicated by human readable text is a box which, when checked by a user, enables the choice to be determined by machine reading, as is known in the art. Certain boxes (55, 56, and 57) may be used to enter handwritten information, alternatively, the query may be entered on a sheet entirely in typewritten or handwritten form, with the content of the query being determined by OCR and, where necessary, handwriting recognition.
Figure 6 shows a schematic flow chart of the steps in entering elements of a query using the paper form of Fig. 5. First, the sheet is scanned and a bitmapped image data file corresponding to the content of the sheet is stored (step s60). Then, (s61) an analysis is made of the image data at the locations corresponding the boxes 55-58, either as to whether the box was checked, or to extract the information written in the box. Then, for each section 51-54, the specified query element is determined (s62), where necessary by applying handwriting recognition and OCR (s63). Each query element is then converted to the corresponding logical reiation(s) - see section 2 above. The feature constraint is then compiled (s45) from the set of logical relations.
Figure 7 is a schematic flow chart of the steps in using a feature constraint to retrieve document references and display or print corresponding documents. This may be performed by a conventional computer device, or by a multifunction device or printer equipped with a user interface. Initially, a FC is received from a user, for example in a data packet from a user of a portable device, as illustrated in Fig. 8, or by input directly into the machine by a user operating a keyboard and mouse, or touch screen, as is well known in the art. Upon the instigation of the user by appropriate input, the FC is solved as described in section 3 above, and the resulting request in the appropriate form passed to the search engine (s92). The search request is used to search all available repositories for documents satisfying the FC (s93); and if necessary, the request may be broken down into subrequests as discussed in more detail in Andreoli et al (1996), The Constraint-Based Knowledge Broker Model: Semantics, Implementation and Analysis, J. Symbolic Computation).
Once obtained by the search engine, a list of hits — of documents satisfying the FC — is displayed (s94), as shown in Fig. 8. Then, in response to appropriate user input, operations may be performed to display individual hits with expanded detail of the document, to convert the document information to HTML format, or to download the document from the repository (s95).
Figure 9 shows selected hits from the list of Fig. 8 after transformation into HTML format. As can be seen, for each hit there is displayed further information, such as author name, http_url, information source and title. If desired, the user can view the document for hit 1 by mouse clicking on the http_uri displayed. The document can then be printed, if needed (s96).
Figure 10 illustrates a more detailed presentation of a single selected hit, i.e. with a set of attributes of the document. It can be seen that against one or more of the attributes are displayed URLs providing links to further pages providing information related to those attributes.
Appendix
A Axioms of the System
There are three sets of axioms.
Specific axioms for features and sorts :
Let r, T1 denote any sorts, and / denote any feature.
Vr, y, 2 I->JΛI- Z
Vr -<(x :τAx:τ') if r φ r'
Vr, y ι:rΛy:rDι = !/ ifrisa value sort
Vr, y -.(x :τΛx ~y) if r is a value sort
Congruence axioms for equality :
Let p denote any built-in predicate. The traditional congruence axioms are:
Vr r = x
Vx,y x=yD =x
Vr;'y, z ι = jΛy = :D- = -
"ix,y x:τAx = y y:τ
Vx.y.z r — yΛx = zDz — y
Vr, y, -r ι-!/Λy= ι-- Vr, y p(x) Λxi=yD p(y) where i is some index in the list of variable x and y is identical to £ except that y,- = y.
Built-in predicate axioms :
They must not mention sorts and features. For-example, disequality can be axiomatized by
Vr,y yV = y
Vr -i*≠x)
Precedence constraints are axiomatized by
Vr ->(r < x)
Vr, y, z x <yΛy< zD x <z
The built-in predicates >, <, > can then be defined from < and equality.
B Constraint Satisfaction
B.l The BFC case
We represent a BFC as a pair (3 | T) where B is a built-in constraint and T an unordered list of sort and feature constraints (read conjunctively). ±. denotes the contradiction. There are two sets of rewrite rules. The following rules correspond to simplifications of the BFCs.
{B \ x ~ y, z - t, T) ~ (S Λ y = - | r — y, T) if \-τ B D x = z
(B I x : r, y : r, T) >— (B | r : r, T) if H - S D r = y and r is not a value sort
(S I 1 : r, y : r, T) •— (B Λ x = y | r : r, T) if r is a value sort
The following rules correspond to the detection of inconsistencies.
(5 I T) . — J_ if hr -B
(S I r : r, y : r1 , T) — 1 if r-r S D ι = y and r jέ r'
(B I x : r, y → z, V) >— ±. if r-7- B D ar = y and r is a value sort The following property justifies the algorithm
(5 I r> H-. 1 if and only if -r V-(5 Λ /\ c)
-er
B.2 The SFC case
We represent an SFC as an unordered list of BFCs prefixed with a sign (+ or -); by definition, one and only one component is positive. Let S be an SFC. The SFC-normal form of S is written S" and is obtamed by the following algorithm:
Let c be the BFC-normal form of the positive component of S. If c, s= ± Then
Return A. Else c„ is of the form (5, | rβ)
Let {(Bi I r,-)},-=1 „ be the list of negative components of S.
Foreach t = 1, ... , n
Let a be the BFC normal form of [B„ A Bi \ Va, V_). If there exists t" € 1, .. . , n such that a — (B | T) and I-7- B and T is empty Then
Return i. Else
Let I = {i € 1, . . . , n such that c< φ } Return {+c„, {-< }<<=/}
The following property justifies the algorithm n [+(B0 I rβ) , {-{5i I r }?al]- = 1 if and only if Hr V-.[(B.Λ c)]

Claims

Claims:
1. A method carried out in a data processing device including a processor, memory, and a user interface, the data processing device being couple in a network to one or more other data processing devices, at least one of the data processing devices including means for storing a repository of electronic documents, comprising:
(a) receiving at least one user input designating a feature constraint, said feature constraint comprising at least a positive component and a negative component, each of the positive component and the negative component including one or more relations, the or each relation defining a document related entity and a property of the entity,
(b) solving the feature constraint to determine from the positive and negative components one or more document references, the or each document reference corresponding to a document within said repository satisfying said feature constraint.
2. The method of claim 1 , further comprising the step of:
(c) displaying the or each document reference determined in step (b).
3. The method of claim 2, further comprising the step of:
(d) in response to a second user input designating one of the displayed document references, retrieving the document corresponding to said document reference from said repository and, optionally, displaying said document or a portion thereof.
4. The method of claim 2 or 3, further comprising the step of:
(e) in response to a third user input, causing the document corresponding to said document reference to be printed.
5. The method of any of the preceding claims, wherein step (a) comprises:
(a1) receiving at least one user input, the user input(s) defining at least one relation, the or each relation defining a document related entity and a property of the entity,
(a2) compiling a feature constraint, the feature constraint including the relation(s) received in step 01).
6. The method of any of the preceding claims, wherein, prior to step (a2), step (a1) is repeated a plurality of times.
7. The method of claim 5 or 6, wherein the data processing device comprising device comprises a portable computing device, and step (a1) comprises receiving user inputs via touchscreen or a keyboard.
8. The method of claim 5 or 6, wherein the data processing device comprising device comprises a fixed computing device, and step (a1) comprises receiving user inputs via touchscreen, a keyboard, and/or mouse.
9. The method of claim 8, wherein the fixed computing device comprises a multifunction device including scanning means, wherein step (a1) comprises:
(a1 i) scanning an image-bearing portable medium to produce electronic signals corresponding to at least one predetermined portion of the medium; and
(alii) determining from said signals a relation corresponding thereto.
10. The method of claim 9, wherein said medium bears, for each said predetermined portion, a human readable indication of the disposition of said predetermined portion and, associated therewith, a human readable designator of the relation corresponding to said predetermined portion.
11. The method of any of claims 1 to 4, wherein step (a) comprises:
(f) receiving a data packet
(g) decoding the data packet to derive a feature constraint, the feature constraint comprising one or more relations, the or each relation defining a document related entity and a property of the entity,
(h) storing the feature constraint derived in step g).
12. A method carried out in a data processing device including a processor, memory, and a user interface, comprising:
(i) receiving a first user input designating a graphical object corresponding to a stored feature constraint, , said feature constraint comprising at least a positive component and a negative component, each of the positive component and the negative component including one or more relations, the or each relation defining a document related entity and a property of the entity,
(j) receiving a second user input indicating that the feature constraint is to be sent to another data processing device,
(k) encoding the feature constraint in a data packet, and
(I) transmitting the data packet.
13. The method of any of the preceding claims, wherein one of said relations is a sort relation, the sort relation indicating the sort of which the respective entity comprises.
14. The method of any of the preceding claims, wherein one of said relations is a feature relation, the feature relation indicating that the respective entity has an attribute, the value of said attribute being a second entity.
15. The method of any of the preceding claims, wherein the property is a value, or range of values, which said entity may have.
16. A data processing device when suitably programmed for carrying out the method of any of the preceding claims, the device comprising a processor, a memory, and a user interface.
17. A data processing device comprising: a processor, a memory coupled to the processor, and a user interface coupled to the processor and to the memory and adapted to be operable by a user to generate user inputs, the data processing device being couple in a network to one or more other data processing devices, at least one of the data processing devices including means for storing a repository of electronic documents, the data processing device further comprising means for receiving at least one user input designating a feature constraint, said feature constraint comprising at least a positive component and a negative component, each of the positive component and the negative component including one or more relations, the or each relation defining a document related entity and a property of the entity, means for solving the feature constraint to determine from the positive and negative components one or more document references, the or each document reference corresponding to a document within said repository satisfying said feature constraint.
18. The method of claim 17, wherein said means for receiving a feature constraint comprises means for receiving at least one user input, the user input(s) defining at least one relation, and for compiling the feature constraint, the feature constraint including the received relations.
19. A system for accessing or distributing electronic documents, including: means storing a repository of electronic documents, each document having a corresponding document reference, and a plurality of objects, at least one of said objects being portable or mobile, each object including means for communicating with the or each other object and with a user interface, and means for receiving, storing and/or transmitting a feature constraint, said feature constraint comprising at least a positive component and a negative component, each of the positive component and the negative component including one or more relations, the or each relation defining a document related entity and a property of the entity.
20. A portable device for accessing or distributing electronic documents, including: means for communicating with fixed or mobile electronic devices and with a user interface, at least one of said devices including means storing a repository of electronic documents, each document having a corresponding document reference, and means for receiving, storing and/or transmitting a feature constraint, said feature constraint comprising at least a positive component and a negative component, each of the positive component and the negative component including one or more relations, the or each relation defining a document related entity and a property of the entity.
21. An apparatus for scanning, copying and/or printing documents, including: means for accessing a repository of electronic documents, each electronic document having a corresponding document reference, means for communicating with one or more of a plurality of objects, at least one of said objects being portable or mobile, and with a user interface, and means for receiving, storing and/or transmitting a feature constraint, said feature constraint comprising at least a positive component and a negative component, each of the positive component and the negative component including one or more relations, the or each relation defining a document related entity and a property of the entity.
EP98917545A 1997-04-23 1998-04-23 Knowledge brokers using signed feature constraints Ceased EP0985184A1 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
GB9708172 1997-04-23
GBGB9708172.3A GB9708172D0 (en) 1997-04-23 1997-04-23 Knowledge brokers using signed feature constraints
PCT/IB1998/000758 WO1998048361A1 (en) 1997-04-23 1998-04-23 Knowledge brokers using signed feature constraints

Publications (1)

Publication Number Publication Date
EP0985184A1 true EP0985184A1 (en) 2000-03-15

Family

ID=10811192

Family Applications (1)

Application Number Title Priority Date Filing Date
EP98917545A Ceased EP0985184A1 (en) 1997-04-23 1998-04-23 Knowledge brokers using signed feature constraints

Country Status (4)

Country Link
EP (1) EP0985184A1 (en)
JP (1) JP2001521664A (en)
GB (1) GB9708172D0 (en)
WO (1) WO1998048361A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6381598B1 (en) 1998-12-22 2002-04-30 Xerox Corporation System for providing cross-lingual information retrieval
US6434546B1 (en) 1998-12-22 2002-08-13 Xerox Corporation System and method for transferring attribute values between search queries in an information retrieval system

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5862321A (en) * 1994-06-27 1999-01-19 Xerox Corporation System and method for accessing and distributing electronic documents

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of WO9848361A1 *

Also Published As

Publication number Publication date
JP2001521664A (en) 2001-11-06
WO1998048361A1 (en) 1998-10-29
GB9708172D0 (en) 1997-06-11

Similar Documents

Publication Publication Date Title
US11790009B2 (en) Knowledge operating system
US20210149980A1 (en) Systems and method for investigating relationships among entities
US6778979B2 (en) System for automatically generating queries
US6732090B2 (en) Meta-document management system with user definable personalities
US7133862B2 (en) System with user directed enrichment and import/export control
US8219557B2 (en) System for automatically generating queries
US7117432B1 (en) Meta-document management system with transit triggered enrichment
US20030061201A1 (en) System for propagating enrichment between documents
EP1143356A2 (en) Meta-document and method of managing meta-documents
US20210342541A1 (en) Stable identification of entity mentions
WO2005099381A2 (en) Expression and time-based data creation and creator-controlled organization
US7020670B1 (en) Document constraint descriptors obtained from user signals indicating attribute-value relations
KR20120058544A (en) Image element searching
Völkel Personal knowledge models with semantic technologies
WO1998048359A1 (en) Feature constraint based document information retrieval and distribution
Kettler et al. A template-based markup tool for semantic web content
Chan Beyond keyword and cue-phrase matching: A sentence-based abstraction technique for information extraction
EP0985184A1 (en) Knowledge brokers using signed feature constraints
Müller Inducing conceptual user models
Carr et al. Towards a knowledge-aware office environment
Troyer et al. Exploiting link types during the conceptual design of websites
Snyder et al. An overview of CARAT: a computer assisted research and analysis tool
Rohmer Lessons for the future of Semantic Desktops learnt from 10 years of experience with the IDELIANCE Semantic Networks Manager.
Kratkiewicz et al. An adaptive semantic approach to personal information management
Decker Simplifying Semantic Annotations in Text

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 19991123

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): DE FR GB

REG Reference to a national code

Ref country code: DE

Ref legal event code: R003

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION HAS BEEN REFUSED

18R Application refused

Effective date: 20111122