WO2005094158A2 - Sistema y programa para clasificar categorizaciones complejas - Google Patents
Sistema y programa para clasificar categorizaciones complejas Download PDFInfo
- Publication number
- WO2005094158A2 WO2005094158A2 PCT/ES2005/000165 ES2005000165W WO2005094158A2 WO 2005094158 A2 WO2005094158 A2 WO 2005094158A2 ES 2005000165 W ES2005000165 W ES 2005000165W WO 2005094158 A2 WO2005094158 A2 WO 2005094158A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- entities
- entity
- instance
- category
- tree structure
- Prior art date
Links
- 230000007246 mechanism Effects 0.000 claims abstract description 6
- 238000000034 method Methods 0.000 claims description 24
- 238000007373 indentation Methods 0.000 claims description 9
- 230000008859 change Effects 0.000 claims description 8
- 230000014509 gene expression Effects 0.000 claims description 3
- 238000004590 computer program Methods 0.000 claims 3
- 238000011161 development Methods 0.000 abstract description 2
- 230000009897 systematic effect Effects 0.000 abstract description 2
- 230000015572 biosynthetic process Effects 0.000 abstract 1
- 230000009471 action Effects 0.000 description 21
- 238000010586 diagram Methods 0.000 description 9
- 230000006870 function Effects 0.000 description 7
- 230000000007 visual effect Effects 0.000 description 6
- 230000009885 systemic effect Effects 0.000 description 5
- 230000007935 neutral effect Effects 0.000 description 4
- 230000008901 benefit Effects 0.000 description 3
- 230000001419 dependent effect Effects 0.000 description 3
- 238000013461 design Methods 0.000 description 3
- 239000012634 fragment Substances 0.000 description 3
- 238000007726 management method Methods 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 230000000670 limiting effect Effects 0.000 description 2
- 230000008520 organization Effects 0.000 description 2
- 241000167854 Bourreria succulenta Species 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 235000019693 cherries Nutrition 0.000 description 1
- 239000003086 colorant Substances 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 238000013523 data management Methods 0.000 description 1
- 238000007418 data mining Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000004088 simulation Methods 0.000 description 1
- 235000014101 wine Nutrition 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
- G06F16/353—Clustering; Classification into predefined classes
Definitions
- the present invention corresponds to the sector of computer tools to facilitate the classification of information.
- the present invention presents a type of organization for sets of entities, where said entities can be for example objects, concepts, ideas, terms or others, which facilitates the conceptualization of classifications and the conduct of searches.
- the invention facilitates the realization of systematic categorizations in which there are different criteria for organizing the information, and facilitates its inspection and use by the user.
- the invention unites in the same tree the categories that are used to classify the instances that are being classified and the different criteria that define the different hierarchies of categories. That is, in the same tree it creates a multicriteria classification, in which different hierarchies of categories belonging to different criteria coexist. This tree can be displayed graphically in an alborea structure.
- Figure 1 shows a simple example of a possible tree structure for a multi-criteria classification of words.
- the logical organization of entities through parent-child relations will be called a tree, and the tree structure will be called the representation of said tree in a graphical interface.
- the following agreement will be followed: - the different instances are marked between points, such as ".martillo.” - in Illustration 1 these instances would be the specific words,
- Illustration 1 Words Name According to nature Entity .martillo. .brother, .writer, .cereza. Attribute. Height. .honesty. Event According to duration Punctual. Durative .concert, .storm. According to action Action. Concert,. No action. Storm. Other .metro, .field. According to meaning It has use .martillo. It has a .writer function. It has a relationship. Other. Height.
- the tree structure can show the different instances in duplicate or repeated manner in the different positions that correspond to the different categories. That is, ". Arrive.” it appears both in “According to duration>Punctual” and in “According to action>Action”
- the invention refers both to the case in which there are still no instances, and therefore only the criteria and categories appear, as to the case in which there are instances that are shown. Thus, it could also be used in a case where only criteria and categories are shown, and instances are not shown. In this case, the categories and criteria shown could be used to perform searches against a database in which the instances were stored.
- OPTIONAL ASPECTS The invention allows to construct different embodiments with different optional aspects. To facilitate the explanation of the advantages of the invention and without limiting effects, some of these optional aspects are described below: 1.
- a relational database is used in which there are two tables. One table is used to store the instances and the other table is used to store the categories and criteria.
- a different code is assigned to each record in the categories and criteria table (that is, a different code is assigned to each category and each criterion), for example a numerical code where each code is an integer.
- the user can select a set of categories, which may or may not belong to different classification criteria, and the system searches for instances that have certain relationships with those categories.
- the user could only select the "Entity” category (dependent on Name> According to Nature and the system could return the instances: “.martillo.”, “. Brother.”, “. writer. “and” .cereza. ", that is, all those instances that have the category” Entity. "If the user instead simultaneously selects” Entity "and” Has use "(where” Has use "is dependent on Name> According to Meaning), the system could only return ".martillo.”, Since it is the only instance that belongs to both categories.
- a summary tree structure is characterized by being a tree structure that contains only the nodes selected in the main tree structure at a given time. For example, in Figure 1 you can select certain nodes, for example those that are shown in bold letters in Figure 2. A possible tree structure summary for this situation would be the structure shown in Figure 3.
- summary tree structure information could also be shown as in Figure 4.
- ADVANTAGES OF THE INVENTION 1 It allows the simple merging of categorizations based on different criteria, so that the user can understand the effects of multi-categorization easily. 2. It allows the simple realization of sophisticated searches, because the user only has to select in the same control all the categories that he wants and then he will have to combine in some way to perform the search. 3. It allows the creation of simple user interfaces, as it allows multicriteria classifications to be carried out with a single tree-type computer control.
- Two nodes that could be criteria ie "Application” and “Catalog” are parents of nodes that are not categories, but are apparently products (Programming, Typing, Editing, Text Processing).
- UVW printer Company B .XYZ-890.
- Figure 1 shows a typical graph used in systemic linguistics work.
- Figure 2 shows a block diagram of the preferred embodiment.
- Figure 3 shows a schematic example of what the preferred embodiment would look like for a classification fragment,
- Figure 4 shows a block diagram of an alternative embodiment.
- the invention is constructed with a computerized system.
- a computerized system which may be based, for example, on the PC Dell Dimension XPS ® ®, to which he also adds a mouse and keyboard user to interact with the system.
- an operating system can be, for example, Microsoft ® Windows
- Figure 2 shows a block diagram of the preferred embodiment, in which a screen 2001 is observed to observe the behavior of the invention; a processing unit 2002 that produces the functionality of the invention; interaction means 2003, which could be for example a mouse, a keyboard, a stylus or other means; and 2004 data containing the categories, criteria and instances that are being classified by the invention.
- the invention uses a tree-type computer control, such as the Microsoft TreeView ® control.
- Figure 3 schematically shows how a tree structure according to the present invention could be made for a fragment of the classification of Figure 1.
- the following means are used to distinguish the criterion nodes from the other nodes.
- the invention is used to perform searches on a set of categorized instances. For this, it is necessary first to have categorized these instances, that is, to have assigned the categories to which the instances belong within the different criteria.
- two special procedures are used to facilitate the categorization of instances. For this, the concept of DOMAIN is introduced.
- a domain is a set of sibling criteria that includes all siblings of those criteria. In these circumstances, if a given instance belongs to a category of one of the criteria, it must also belong to some category of each of the other criteria in the domain.
- a neutral criterion is that for which it is not necessary to select any category, and for which there is indeed no selected category, for example, in Figure 1, if "hammer” is being categorized and the "Seg” category has been selected n Nature> entity “must also select a category belonging to the criterion” According meaning "they belong to the same domain. However, it is not necessary to select any category of the criteria “According to duration” or “According to action”. However, if the category "According to nature>event> According to duration>Punctual" has been selected. It is necessary to select a category belonging to the "According to action” criteria, as they belong to the same domain.
- the procedure to perform the categorization of instances includes the following steps: 1. Select the instance to be categorized
- each database there is a single database for each type of object (for example, a database for words, a database for books etc.), and in each database there are two tables. Instances are stored in one table and categories and criteria are stored in the other table. In both tables, the database system assigns correlative numerical codes to the entities that are created (be they instances, categories, or criteria). To create the classifications of the instances, hyphens are used around the codes of the categories to which the instance belongs, such as in "-1 -23-22-"
- FIG. 4 shows another possible embodiment of the invention, comprising a processing unit 4001 that executes a program capable of organizing entities in the manner set forth in this invention. This would be the case, for example, of a company providing a data access service through the Internet, which users would access remotely through personal computers.
- the invention can be used through an independent computerized system 4002 to which it is linked by a telecommunication system 4003.
- the data managed by the unit 4001 may be integrated together with the unit 4001 or may be dispersed, such as the data 4005, 4006, 4007, 4008, to which the unit 4001 would be linked by a system of Telecommunication 4004.
- the most effective tree structures are tower-like structures, which are characterized in that the different nodes are placed one above the other, and are distinguished fundamentally by the level of indentation.
- the illustrations shown in this document and the Microsoft Treeview® control are examples of tone structures. These structures are much more comfortable than those used for example in systemic linguistics, such as the one shown in Figure 1.
- tree structures can also be created using text controls, and placing them over each other, and applying different levels of indentation to each other.
- An example of these structures are those created on the Internet pages with HTML language, and would be similar to the structures shown in the Illustrations in this document.
- tree structures can be created that do not have added functionality to expand and contract nodes, but are permanently open.
- the main advantage of the invention is the separation between criteria and categories and the procedures of search and categorization management.
- the invention can also be carried out with other tree structure designs.
- One of these designs is shown in Figure 15.
- the criteria nodes are not at a higher level than the categories that dominate directly, but simply differ in text and format but have the Same level of indentation.
- This tree structure design facilitates the relationship of some categories with their parent categories, as can be seen by inspecting "Name” and "Entity”, where it is clear that "Entity” is a category directly dependent on "Name”.
- a criterion can be expanded or contracted, and it would result in the categories that depend on it appearing or disappearing without the criterion node itself appearing or disappearing. For example, if the criterion "According to nature" is contracted, the result would be that shown in Figure 16.
- Illustration 15 Words Name According to nature Entity .martillo. .brother, .writer, .cereza. Attribute. Height. .honesty. Event According to duration Punctual. Durative .concert, .storm. According to action Action. Concert,. No action. Storm. Other .metro, .field. According to meaning It has use .martillo. It has a .writer function. It has a relationship. Other. Height. Adjective Adverb Verb Closed Class
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Exhaust Gas After Treatment (AREA)
- Measurement Of Resistance Or Impedance (AREA)
- Nitrogen Condensed Heterocyclic Rings (AREA)
Abstract
Description
Claims
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2007510050A JP2008547065A (ja) | 2004-03-30 | 2005-03-29 | 複雑なカテゴリー化のための分類ツール |
US10/599,384 US20070150519A1 (en) | 2004-03-30 | 2005-03-29 | Organiser for complex categorisations |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
ESP200400776 | 2004-03-30 | ||
ES200400776 | 2004-03-30 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2005094158A2 true WO2005094158A2 (es) | 2005-10-13 |
WO2005094158A3 WO2005094158A3 (es) | 2005-11-10 |
Family
ID=35064172
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/ES2005/000165 WO2005094158A2 (es) | 2004-03-30 | 2005-03-29 | Sistema y programa para clasificar categorizaciones complejas |
Country Status (3)
Country | Link |
---|---|
US (1) | US20070150519A1 (es) |
JP (1) | JP2008547065A (es) |
WO (1) | WO2005094158A2 (es) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8452790B1 (en) * | 2008-06-13 | 2013-05-28 | Ustringer LLC | Method and apparatus for distributing content |
US9367609B1 (en) | 2010-03-05 | 2016-06-14 | Ustringer LLC | Method and apparatus for submitting, organizing, and searching for content |
KR101139238B1 (ko) * | 2011-06-20 | 2012-05-14 | 유택상 | 아이디어 창출 지원 방법 및 시스템 |
CN103049444B (zh) | 2011-10-12 | 2016-09-28 | 阿里巴巴集团控股有限公司 | 一种数据信息分类结构的存储方法和系统 |
US20140280042A1 (en) * | 2013-03-13 | 2014-09-18 | Sap Ag | Query processing system including data classification |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5696916A (en) * | 1985-03-27 | 1997-12-09 | Hitachi, Ltd. | Information storage and retrieval system and display method therefor |
US6055515A (en) * | 1996-07-30 | 2000-04-25 | International Business Machines Corporation | Enhanced tree control system for navigating lattices data structures and displaying configurable lattice-node labels |
WO2001079964A2 (en) * | 2000-04-14 | 2001-10-25 | Realnetworks, Inc. | System and method of managing metadata |
US20020107893A1 (en) * | 2001-02-02 | 2002-08-08 | Hitachi, Ltd. | Method and system for displaying data with tree structure |
Family Cites Families (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5838965A (en) * | 1994-11-10 | 1998-11-17 | Cadis, Inc. | Object oriented database management system |
US6112201A (en) * | 1995-08-29 | 2000-08-29 | Oracle Corporation | Virtual bookshelf |
US5953724A (en) * | 1997-11-24 | 1999-09-14 | Lowry Software, Incorporated | Global database library data structure for hierarchical graphical listing computer software |
US6397221B1 (en) * | 1998-09-12 | 2002-05-28 | International Business Machines Corp. | Method for creating and maintaining a frame-based hierarchically organized databases with tabularly organized data |
US6868525B1 (en) * | 2000-02-01 | 2005-03-15 | Alberti Anemometer Llc | Computer graphic display visualization system and method |
US8396859B2 (en) * | 2000-06-26 | 2013-03-12 | Oracle International Corporation | Subject matter context search engine |
US6834282B1 (en) * | 2001-06-18 | 2004-12-21 | Trilogy Development Group, Inc. | Logical and constraint based browse hierarchy with propagation features |
US7324447B1 (en) * | 2002-09-30 | 2008-01-29 | Packeteer, Inc. | Methods, apparatuses and systems facilitating concurrent classification and control of tunneled and non-tunneled network traffic |
JP2004177996A (ja) * | 2002-11-22 | 2004-06-24 | Toshiba Corp | 階層型データベース装置及び階層型データベースの構築方法 |
CA2536179A1 (en) * | 2003-08-27 | 2005-03-10 | Sox Limited | Method of building persistent polyhierarchical classifications based on polyhierarchies of classification criteria |
-
2005
- 2005-03-29 WO PCT/ES2005/000165 patent/WO2005094158A2/es not_active Application Discontinuation
- 2005-03-29 US US10/599,384 patent/US20070150519A1/en not_active Abandoned
- 2005-03-29 JP JP2007510050A patent/JP2008547065A/ja active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5696916A (en) * | 1985-03-27 | 1997-12-09 | Hitachi, Ltd. | Information storage and retrieval system and display method therefor |
US6055515A (en) * | 1996-07-30 | 2000-04-25 | International Business Machines Corporation | Enhanced tree control system for navigating lattices data structures and displaying configurable lattice-node labels |
WO2001079964A2 (en) * | 2000-04-14 | 2001-10-25 | Realnetworks, Inc. | System and method of managing metadata |
US20020107893A1 (en) * | 2001-02-02 | 2002-08-08 | Hitachi, Ltd. | Method and system for displaying data with tree structure |
Also Published As
Publication number | Publication date |
---|---|
JP2008547065A (ja) | 2008-12-25 |
US20070150519A1 (en) | 2007-06-28 |
WO2005094158A3 (es) | 2005-11-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Martin | Meaning matters: A short history of systemic functional linguistics | |
US6658406B1 (en) | Method for selecting terms from vocabularies in a category-based system | |
US7383503B2 (en) | Filtering a collection of items | |
Bateman et al. | Multimodality and empiricism | |
US6816175B1 (en) | Orthogonal browsing in object hierarchies | |
Fluit et al. | Ontology-based information visualization: toward semantic web applications | |
US20150026159A1 (en) | Digital Resource Set Integration Methods, Interfaces and Outputs | |
Piasecki et al. | WordNetLoom: a WordNet development system integrating form-based and graph-based perspectives | |
WO2005094158A2 (es) | Sistema y programa para clasificar categorizaciones complejas | |
Hlava | The Taxobook: History, Theories, and Concepts of Knowledge Organization, Part 1 of a Part-3 Series | |
US9002851B2 (en) | Accessing stored electronic resources | |
Ridi | Hypertext | |
Collins | Docuburst: Document content visualization using language structure | |
Mírovský | Netgraph: A tool for searching in prague dependency treebank 2.0 | |
Herrero-Solana et al. | Bibliographic displays of Web-based OPACs: Multivariate analysis applied to Latin-American catalogues | |
Mastrodomenico | The Python Book | |
Sciore | Understanding Oracle APEX 5 Application Development | |
Piasecki et al. | WordnetLoom: a graph-based visual wordnet development framework | |
Nualart et al. | Texty, a visualization tool to aid selection of texts from search outputs | |
Revere et al. | Transhierarchy: A stable tree view with transclusion for hypertext navigation | |
Gayoso-Cabada et al. | Learning object repositories with dynamically reconfigurable metadata schemata | |
Crofts | Museum informatics: the challenge of integration | |
Tyrkkö | Network graphs to the rescue, or how to visualise distributions and networks in corpora and language | |
Weiner | Modelling Lexical Structures in the Oxford English Dictionary | |
Limeback | Simply SQL: The Fun and Easy Way to Learn Best-Practice SQL |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A2 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A2 Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2007150519 Country of ref document: US Ref document number: 10599384 Country of ref document: US |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2007510050 Country of ref document: JP |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWW | Wipo information: withdrawn in national office |
Country of ref document: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2005729483 Country of ref document: EP |
|
WWW | Wipo information: withdrawn in national office |
Ref document number: 2005729483 Country of ref document: EP |
|
WWP | Wipo information: published in national office |
Ref document number: 10599384 Country of ref document: US |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: COMMUNICATION UNDER RULE 69 EPC ( EPO FORM 1205A DATED 21/05/07 ) |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 05729483 Country of ref document: EP Kind code of ref document: A2 |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 05729483 Country of ref document: EP Kind code of ref document: A2 |
|
WWW | Wipo information: withdrawn in national office |
Ref document number: 5729483 Country of ref document: EP |