KR101447526B1 - Method and apparatus for sorting personal information database based on an address and for grouping information from the sorted database - Google Patents
Method and apparatus for sorting personal information database based on an address and for grouping information from the sorted database Download PDFInfo
- Publication number
- KR101447526B1 KR101447526B1 KR1020130011879A KR20130011879A KR101447526B1 KR 101447526 B1 KR101447526 B1 KR 101447526B1 KR 1020130011879 A KR1020130011879 A KR 1020130011879A KR 20130011879 A KR20130011879 A KR 20130011879A KR 101447526 B1 KR101447526 B1 KR 101447526B1
- Authority
- KR
- South Korea
- Prior art keywords
- address
- database
- entry
- sub
- information
- Prior art date
Links
Images
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
Abstract
The present invention arranges and processes a database based on an address. An apparatus according to the present invention includes a classification unit configured to classify entries in a database into entries divided into a plurality of sub-databases different from each other according to a classification level designated as an address element; For each of a plurality of sub-databases, an entry obtained from the database in a manner that the entries of the sub-database are arranged in a specified order based on the address information of each entry, To the network. According to the classification, each of the sub-databases is composed of entries having the same information for the designated address element, and the sub-databases are entries having different information for the designated address element.
Description
The present invention is directed to sorting personal information databases obtained through various methods, particularly large databases, based on their address fields and using the aligned databases.
Companies that sell products or provide services to customers collect their personal information through various channels for marketing or publicity. For example, when a customer contacts with a customer, such as when purchasing a product, when receiving a service, or when receiving an after-sales service for a product that is sold, the customer is actively involved with the home or work address, telephone number, We collect the same personal information. This is because personal information such as a customer's address is expected to greatly contribute to the company's business activities, or to play a positive role in future corporate profits.
According to this background, when individual information is available, companies gather it separately and indiscriminately and database it, and the size of the personal information database is quite large. For example, the number of personal information collected can range from millions to tens of millions. Therefore, it is possible to reduce the expense of the enterprise by using the collected large-capacity personal information database more efficiently. For example, if you can extract the personal information of the family members listed in the personal information database, do not send the company's publicity or event guide to them individually, For example, a householder or a housewife. In this way, it is possible to reduce the cost of mailing the mail while achieving almost the same effect as the publicity.
However, as mentioned above, the personal information database collected by companies and the like, by registering the addresses, names, telephone numbers, and ages individually acquired from individuals in the personal information database at the time of their acquisition, Are not sorted according to a specific field in the database, for example, in the order of addresses (i.e., in alphabetical order). Typically, a particular address element of the address field (the address element is an element of the address system, such as "city / city", "city / county", "town / county / Herein, the elements of "city / province", "city / county / district" and "town / county / province" are generally referred to as "upper address" (Hereinafter referred to as a " sub address ") of a specific address element is requested from a personal information database, The database entries are used in a way to obtain database entries that match the information, where all the entries are searched. For example, according to a query designating "Seoul" for the address field, all entries of the personal information database are searched to obtain personal information entries including "Seoul " in the address field.
In this way, if you want to know the personal information that appears to belong to a particular group, for example, individuals belonging to the same household, it takes a considerable amount of time to retrieve the addresses listed in the database. For example, with respect to a personal information database having N listed entries, which is constructed as illustrated in FIG. 1, an entry estimated to belong to the same household as an individual corresponding to the registered
Therefore, according to the above-described method, it takes a great amount of time to extract entries belonging to a specific group, for example, the same household, from the personal information database.
In order to extract entries belonging to a specific group according to the above-described method, the personal information database must be loaded on the memory of the computing device. Since the amount of information is very large, In general, since an execution object other than a database, for example, a script for database processing, a computer operating system (O / S), and a basic process occupies memory resources, the entire personal information database is usually loaded into memory I can not. In this case, a portion of the personal information database that is compared to a particular entry is loaded into memory and the rest is stored on the hard disk, causing a swap of data between the memory and the hard disk do. This further increases the amount of time it takes to identify a particular group of entries.
The present invention provides a method and an apparatus for sorting a database based on an address field in order to distinguish entries corresponding to a specific group from a collected large-capacity personal information database at a higher speed than conventional ones The purpose.
It is another object of the present invention to provide a method and apparatus for adaptively classifying entries of a database in a personal information database adaptively to a memory resource of a computing device used for distinguishing entries corresponding to a specific group.
It is still another object of the present invention to provide a method of distinguishing entries according to various grouping conditions when distinguishing entries corresponding to a specific group on the basis of an address field in a personal information database in which entries are sorted according to the above object And a device.
It is a further object of the present invention to allow a user to freely designate and extract desired information for an inputted personal information database and also to identify a desired specific group based on freely specified information about the extracted information And to provide a device and method for enabling the device to be used.
It is to be understood that the object of the present invention is not limited to the explicitly stated objects, but, of course, it is an object of the present invention to achieve the effect which can be derived from the following specific and exemplary description of the present invention.
According to an aspect of the present invention, an apparatus for sorting and processing a database on the basis of an address includes a plurality of sub-databases different from each other according to a classification level designated as an address element, A sorting unit configured to classify the sub-databases into a plurality of sub-databases, and a sorting unit configured to sort entries of the sub-databases corresponding to the plurality of sub- , And an organization configured to add an entry obtained from the database to the corresponding sub-database. And, with this apparatus, each of the sub-databases is composed of entries having the same information for the designated address element, and the sub-databases are entries having different information for the designated address element .
In one embodiment of the present invention, the classification level designates at least one address element starting from the highest address element in the address system through an external input or an information file.
In one embodiment according to the present invention, the address element includes elements of "city / city "," city / district ", and "east / The designated address element includes at least the elements of the " state / city ".
In one embodiment of the present invention, the address information of each entry of each sub-database may be an address code converted from the address data of the corresponding entry in the database. In this case, , And a numeric string uniquely assigned to the upper address of the address data.
According to an embodiment of the present invention, information corresponding to each entry of the input database is extracted in accordance with a specified format, and each extracted information is arranged in a designated order to constitute each personal information entry And a processing unit. In the present embodiment, the classification section causes the classification to be performed on each of the personal information entries configured by the processing section. Here, the specified format and order may be defined by an arbitrary name variable designating corresponding information in the entry of the input database, and an order of listing the variables.
According to an exemplary embodiment of the present invention, the apparatus further includes a grouping unit for comparing the entries of the sub-databases among the sub-databases to identify entries matching the set grouping requirement. In the present embodiment, in the mutual comparison of the entries, the grouping unit compares the entry with the contiguous entries from the next entry to the mismatched entry in the matching element of the address specified by the grouping requirement .
In an embodiment of the present invention in which the grouping unit is included, the grouping unit may add, to the entries classified according to the set grouping requirement, indication information indicating a result according to the set grouping requirement. In addition, in the present embodiment, the grouping requirement is defined by variables having an arbitrary name designating corresponding information in the entry of the sub-database and logical functions applied to the variables, At least a matching requirement for a telephone number and a matching requirement for a telephone number. The grouping requirement may be specified via external input or through an information file.
According to an embodiment of the present invention, the apparatus further comprises a merging unit for merging the plurality of sub-databases into a single database. The merging unit may select an arbitrary entry in each of the sub-databases and associate the sub-databases with each other in accordance with the order relationship between the sub-databases, So that the single database is constructed.
According to another aspect of the present invention, a method of processing a database according to an address comprises the steps of: identifying information corresponding to an address element specified as a classification level for an arbitrary entry of a database; , Sorting the arbitrary entry based on the address information of the entry, registering the arbitrary entry in a sub-database specified for the classification, and registering the arbitrary entry in a position in the sub-database based on the address information of the entry It consists of two steps. And, in the method, each of the sub-databases in which each entry of the database is classified and registered by the two steps is configured with entries having the same information for the specified address element.
In an embodiment according to the present invention, the method further comprises the steps of: selecting an entry in one of the sub-databases; determining, for the selected entry, And ending the confirmation of the selected entry in the entry that is inconsistent with the matching element of the address specified by the grouping requirement.
Further, in an embodiment according to the present invention, the method further comprises: selecting an arbitrary entry in each of the sub-databases, and comparing the address information of the selected entries with each other, And configuring the sub-databases into a single database by sequentially arranging the sub-databases according to the determined mutual contexts.
According to another aspect of the present invention, a storage medium on which data is recorded includes an address processing program recorded in the medium for sorting and processing entries of the database by addresses. When the address processing program is loaded into a computing device capable of reading the program and executed, the address processing program may be configured to execute, based on the classification level designated as the address element, Sub-databases into entries having the same information for the specified address element by classifying the sub-databases into entries divided into sub-databases, Database, an entry obtained from the database is added to the corresponding sub-database in such a manner that the entries of the corresponding sub-database are arranged in the order specified based on the address information of each entry .
At least one embodiment of the present invention described above or described in detail below with reference to the accompanying drawings is to provide a method of classifying addresses from a large volume database based on listed address information, Allows the database to be sized appropriately. Accordingly, when a computing device provided for database processing has limited memory resources, each database having the classified entries is made to have a size appropriate for the capacity of the resource, thereby eliminating restrictions on the database processing, It is possible to eliminate the factor of swapping. In the latter case, the processing time is shortened.
In addition, according to the present invention, each partitioned database is arranged such that its entries are sorted in order based on the address, so that it is possible to distinguish entries belonging to a specific group, for example, the same household, Since the area can also be limited to the adjacent entry (s) of the entry, the time required for the search is also significantly shortened. And, in the entry grouping for the database, if the grouping requirement is not satisfied satisfactorily by the specified grouping requirement, that is, if the number of grouped entries is too large or too small, The time for obtaining the same result is reduced to a ratio of several tens of times as compared with the case of performing a database operation in which a randomly inserted personal information is inserted.
Figure 1 is a simplified illustration of an example of a generic database built with collected personal information,
2 is a block diagram illustrating a configuration of an apparatus for performing sorting of a personal information database according to an address and an information grouping method using the sorted database according to an embodiment of the present invention,
3 is a flowchart of a method for sorting a personal information database based on an address, according to an embodiment of the present invention,
4A is a definition example of a field input / output format that allows a user to freely select required field information from an existing database and configure a database having information suitable for use according to an embodiment of the present invention,
4B schematically illustrates the construction of a new entry by arranging the information of an arbitrary entry of an inputted existing database in a specified order according to a prescribed field input / output format according to an embodiment of the present invention And,
FIG. 5A illustrates an example of designating the depth of an address based on an address, in classifying entries of a database according to an embodiment of the present invention,
5B schematically shows a method of operating and referencing a classification reference table so that the databases can be classified and classified based on classification levels according to an embodiment of the present invention,
Fig. 6 shows an example in which, for the inputted database entry, a unique code for classifying the entry is added according to the embodiment of Fig. 5B,
7 illustrates an example in which an existing database is divided into a plurality of sub-databases based on a classification level and constructed according to an embodiment of the present invention,
8A and 8B are diagrams illustrating a method of configuring a sub-database according to an exemplary embodiment of the present invention so as to enable binary search for registered entries, ≪ RTI ID = 0.0 > - < / RTI > in an ordering of addresses in the database,
9A to 9C are various examples for specifying a condition of grouping in grouping entries belonging to a specific group for a database according to embodiments of the present invention,
FIG. 9D shows an example showing that the field information can be freely selected for a database for which grouping of entries is desired, and the grouping condition can be freely defined according to the promised syntax as the selected information, according to an embodiment of the present invention ego,
10 illustrates an example of a database in which entries are arranged in order based on addresses according to an embodiment of the present invention,
11 shows an example of converting address data in a text format of personal information into a unique numeric code corresponding thereto according to another embodiment of the present invention,
Fig. 12 is an example of a database obtained by applying the conversion process according to the embodiment of Fig. 11 to the information described in the database illustrated in Fig. 10,
Figure 13 is a diagram showing, together with its execution components, that each partitioned sub-database can be merged in an address-based manner in accordance with one embodiment of the present invention,
FIG. 14 schematically illustrates the process of merging each sub-database in an order-based manner in accordance with the embodiment of FIG.
Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings.
FIG. 2 is a block diagram illustrating an apparatus for performing an information grouping method using an ordered list and sorting of personal information lists according to an address according to an embodiment of the present invention. Referring to FIG. 2,
Each
In an embodiment of the present invention, the
3 is a flow diagram of a method for sorting personal information listings based on an address, in accordance with an embodiment of the present invention. Hereinafter, the operation of the apparatus having the configuration illustrated in FIG. 2 to mechanically perform the sorting of the personal information list according to a given condition will be described in detail and with reference to the flowchart illustrated in FIG.
The preprocessing
The field input /
On the other hand, in the case where the information for enabling the
In another embodiment according to the present invention, the
The
The classifying
When the personal information entry is input, the
When the personal information entry to which the classification code is added is inputted from the
If it is confirmed in step S320 that the sub-database for the same code as the classification code of the currently input personal information entry is constructed, the
The
First, the
Upon completion of the processing (S321, S322, or S323) for the entered personal information entry as described above, the
The classification of the personal information entries according to the specified
According to the embodiment of the present invention, when the construction of the sub-database having the index table is completed according to the above-described embodiment, the entries indicated by the items are rearranged So that the entries themselves can be arranged in the order of their address data. Of course, in the present embodiment, the index table and the information for indexing the entry are unnecessary and thus removed from the sub-database.
When all the above processes are completed, each sub-database (200 k , k = 1, 2, 3, ..) constructed by the
Each of the sub-databases 200 constructed by the
Meanwhile, as described above, when the
The
9A to 9C are partial examples of the
The requirements of the grouping condition (3a) defined by the equation as illustrated in Figs. 9A to 9C can be freely defined through a variable arbitrarily defined by the user. FIG. 9D is an example created to specify the requirement of FIG. 9A as an example of this. As illustrated in FIG. 9D, functions 910 for specifying corresponding information in an entry of a sub-database to be used for grouping, and
That is, the
The
When the search for the
In the embodiments according to the present invention described so far, the text information of the address field of the pre-built database 1a is used as it is. In another embodiment of the present invention, the address data of the personal information entry can be converted into a code by a series of numbers and used. If the address data is used as a numeric code according to the present embodiment, an operation of comparing a part of the address data of the input entry performed by the classifying
In the embodiment according to the present invention, the classifying
According to the present embodiment, after the address data of the personal information entry is replaced by the corresponding numeric code (hereinafter also referred to as "address code") by the classifying
In an embodiment according to the present invention, an apparatus for performing the sorting and grouping method of the personal information database according to the present invention is characterized in that, as illustrated in FIG. 13, 200) into a single personal information list. The merging
The merging
In the above-described embodiment, the first entry is selected for each sub-database and the address information is used for mutual comparison to determine the placement order of each sub-database. However, in another embodiment according to the present invention, An entry at any position in the sub-database may be selected. Of course, the selection position may not be the same for each sub-divisor.
As described above, a personal information database in which all the entries are arranged in order based on addresses can be used for the above-described grouping as well as for other purposes. Of course, in this case too, an order-based feature based on the address ensures fast searching.
The embodiments of the present invention described above can be combined and implemented together if they are not mutually incompatible.
It will be apparent to those skilled in the art that various modifications and variations can be made in the present invention without departing from the spirit or scope of the invention as defined in the appended claims. , Alteration, substitution, addition, or the like.
21: pre-processor 22: address parser
23: classification section 24: DB organization section
25: grouping unit 26: merging unit
200 k : sub-database
Claims (15)
A classification unit configured to classify entries of the database into entries divided into a plurality of sub-databases different from each other according to the classification level designated by the address element;
For each of the plurality of sub-databases, an entry obtained from the database in a manner that the entries of the sub-database are arranged in a specified order based on the address information of each entry, An organizational unit configured to add to the database,
And a merging unit for merging the plurality of sub-databases into a single database,
Wherein each of the plurality of sub-databases is configured with entries having the same information for the designated address element,
The merging unit selects an arbitrary entry in each of the sub-databases, and according to the mutual relationship between the plurality of sub-databases identified through the mutual comparison of the address information of each selected entry, And arranging the databases in the order of the addresses of the databases.
Wherein the classification level arranges and processes the database according to an address that specifies at least one address element starting from a highest address element in an address scheme through an external input or an information file.
The address element includes elements of "city / city "," city / district ", and "east / A device that sorts and processes a database according to an address that contains an element of "time".
Wherein the address information of each entry of each sub-database base is an address code converted from the address data of the corresponding entry in the database, and the address code is a number uniquely allocated to the upper address of the address data A device that sorts and processes a database according to an address that contains a column.
Further comprising a processing unit for extracting information corresponding to information of each entry of the input database in accordance with a specified format and arranging the extracted information in a specified order so as to form individual information entries,
Wherein said classifying unit arranges and processes the database according to an address that causes said classification to be performed for each of said personal information entries constituted by said processing unit.
Wherein the specified format and order are arranged according to an address that is defined by variables of an arbitrary name designating corresponding information in an entry of the input database and an order of enumeration of the variables.
Further comprising a grouping unit for comparing entries of any sub-database among the sub-databases to distinguish entries matching the set grouping requirement,
Wherein the grouping unit compares, in the mutual comparison of the entries, an address for comparing an arbitrary entry with consecutive entries from the next entry to an inconsistent entry in a matching element of an address specified by the grouping requirement A device that sorts and processes a database accordingly.
Wherein the grouping unit arranges the database according to an address that is further configured to add, to the entries classified according to the set grouping requirement, instruction information indicating a result according to the set grouping requirement.
The grouping requirement may be such that the database is aligned according to an address that is defined by variables having an arbitrary name designating corresponding information in the entry of the sub-database and logical functions applied to the variables Processing device.
Wherein the grouping requirement aligns and processes the database according to an address that is specified via an external input or through an information file, including at least a matching requirement for the address and a matching requirement for the telephone number.
A step of confirming information corresponding to an address element specified as a classification level for an arbitrary entry in the database;
Classifying the arbitrary entry based on the identified information, registering the arbitrary entry in a sub-database designated for the classification among a plurality of sub-databases, A second step of registering in a position corresponding to the order in the sub-database,
Merging the plurality of sub-databases into a single database,
Wherein each of the plurality of sub-databases into which each entry of the database is classified by the step 2 is configured as entries having the same information with respect to the designated address element,
In the step 3, an arbitrary entry is selected from each of the sub-databases, and according to the mutual relationship between the plurality of sub-databases identified through the mutual comparison of the address information of each selected entry, And arranging the databases in order to configure the single database.
Selecting any entry in one of the sub-databases;
Confirming whether the selected entry satisfies the set grouping requirement sequentially from the next entry, and ending the confirmation for the selected entry in the entry that is inconsistent with the matching element of the address specified by the grouping requirement The method comprising the steps of:
An address processing program for sorting and processing the entries of the database based on the address is recorded,
When the address processing program is loaded into a computing device capable of reading the program and executed,
By subdividing the entries of the input database into entries divided into a plurality of different sub-databases in accordance with the classification level designated by the address element, each of the sub-databases finally assigns the specified A function to configure the address element as entries having the same information,
For each of the plurality of sub-databases, an entry obtained from the database in a manner that entries of the corresponding sub-database are arranged in the order specified based on the address information of each entry, And the like,
Selecting an arbitrary entry in each of the plurality of sub-databases, and selecting the entry in the plurality of sub-databases according to the mutual relationship between the plurality of sub- Wherein the program is a program capable of performing a function of constituting a single database by disposing and merging in order.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020130011879A KR101447526B1 (en) | 2013-02-01 | 2013-02-01 | Method and apparatus for sorting personal information database based on an address and for grouping information from the sorted database |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020130011879A KR101447526B1 (en) | 2013-02-01 | 2013-02-01 | Method and apparatus for sorting personal information database based on an address and for grouping information from the sorted database |
Publications (2)
Publication Number | Publication Date |
---|---|
KR20140099083A KR20140099083A (en) | 2014-08-11 |
KR101447526B1 true KR101447526B1 (en) | 2014-10-08 |
Family
ID=51745613
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020130011879A KR101447526B1 (en) | 2013-02-01 | 2013-02-01 | Method and apparatus for sorting personal information database based on an address and for grouping information from the sorted database |
Country Status (1)
Country | Link |
---|---|
KR (1) | KR101447526B1 (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106269528B (en) * | 2015-05-20 | 2018-12-21 | 菜鸟智能物流控股有限公司 | Method and device for providing address sorting information |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20010044398A (en) * | 2001-02-16 | 2001-06-05 | 조덕영 | SLAAM System for list merge/purge |
JP2007179329A (en) * | 2005-12-28 | 2007-07-12 | Alps Sha:Kk | Address character string acquisition method and address character string acquisition system |
-
2013
- 2013-02-01 KR KR1020130011879A patent/KR101447526B1/en active IP Right Grant
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20010044398A (en) * | 2001-02-16 | 2001-06-05 | 조덕영 | SLAAM System for list merge/purge |
JP2007179329A (en) * | 2005-12-28 | 2007-07-12 | Alps Sha:Kk | Address character string acquisition method and address character string acquisition system |
Also Published As
Publication number | Publication date |
---|---|
KR20140099083A (en) | 2014-08-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR101276602B1 (en) | System and method for searching and matching data having ideogrammatic content | |
AU2010249253B2 (en) | A method for automatically indexing documents | |
US7672958B2 (en) | Method and system to identify records that relate to a pre-defined context in a data set | |
KR102063384B1 (en) | Method for Registering Product Information by using Image | |
JP2005519370A (en) | Effective storage of indented threads in threaded discussion applications | |
CN109947949A (en) | Knowledge information intelligent management, device and server | |
KR100491250B1 (en) | Method and System for registering goods information | |
CN104636429A (en) | Trademark category retrieval method and device | |
CN110659282A (en) | Data route construction method and device, computer equipment and storage medium | |
AU2002331728A1 (en) | A method for automatically indexing documents | |
CN110727857A (en) | Method and device for identifying key features of potential users aiming at business objects | |
KR101803150B1 (en) | Important precedents extraction and sorting method using Big Data | |
CN110956271B (en) | Multi-stage classification method and device for mass data | |
KR100991661B1 (en) | System for automatic classification dewey decimal classification/korea decimal classification of book and method for the same | |
CN104933096B (en) | Abnormal key recognition methods, device and the data system of database | |
US20140149341A1 (en) | System and method for refining address database for improving performance of automated mail sorting machine | |
CN110313159B (en) | Communication data modeling method | |
KR101447526B1 (en) | Method and apparatus for sorting personal information database based on an address and for grouping information from the sorted database | |
CN111104476B (en) | Archive data generation method, archive data generation device, and readable storage medium | |
CN112579629A (en) | Method for helping purchasers of electronic component enterprises to accurately find products | |
CN115510289B (en) | Data cube configuration method and device, electronic equipment and storage medium | |
JP2006065706A (en) | System and method for supporting application requirement design | |
JP5148157B2 (en) | Customer information management device | |
CN110781211A (en) | Data analysis method and device | |
CN113031835A (en) | Menu data processing method and device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A201 | Request for examination | ||
E902 | Notification of reason for refusal | ||
E701 | Decision to grant or registration of patent right | ||
GRNT | Written decision to grant | ||
FPAY | Annual fee payment |
Payment date: 20170802 Year of fee payment: 4 |
|
FPAY | Annual fee payment |
Payment date: 20180813 Year of fee payment: 5 |
|
FPAY | Annual fee payment |
Payment date: 20190903 Year of fee payment: 6 |