KR101134073B1 - Search Method for using word association between search keyword and search result and system thereof - Google Patents

Search Method for using word association between search keyword and search result and system thereof Download PDF

Info

Publication number
KR101134073B1
KR101134073B1 KR1020090129156A KR20090129156A KR101134073B1 KR 101134073 B1 KR101134073 B1 KR 101134073B1 KR 1020090129156 A KR1020090129156 A KR 1020090129156A KR 20090129156 A KR20090129156 A KR 20090129156A KR 101134073 B1 KR101134073 B1 KR 101134073B1
Authority
KR
South Korea
Prior art keywords
search
user
content
word
related
Prior art date
Application number
KR1020090129156A
Other languages
Korean (ko)
Other versions
KR20110072296A (en
Inventor
최진근
Original Assignee
최진근
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 최진근 filed Critical 최진근
Priority to KR1020090129156A priority Critical patent/KR101134073B1/en
Publication of KR20110072296A publication Critical patent/KR20110072296A/en
Application granted granted Critical
Publication of KR101134073B1 publication Critical patent/KR101134073B1/en

Links

Images

Abstract

The present invention relates to a search method and a search system for providing a search result page by classifying search results in real time by receiving a search request from a user. According to an aspect of the present invention, there is provided a search method comprising: selecting a plurality of first content links ranked above in a first search result searched using a user search keyword; Analyzing first content information and selecting first related words having a high frequency of appearance; Selecting a plurality of second content links ranked above in a second search result searched by a user search keyword and a first related word; Selecting second related words having a high frequency of appearance in the second content information; Extracting a multimedia information link ranked above in a multimedia search result using the second related word as a search keyword; And generating, in real time, a search result page including the user search keyword, the first related word, the URL of the second content link, the second related word, and the multimedia information link for the second content information to the user terminal. .

Description

Search method for using word association between search terms and search results and system

The present invention relates to a search method and a search system for analyzing a search result searched by a search word, extracting a related word, and performing a re-search using a combination of a search word and a related word to provide a user with search results classified by related words.

Existing search systems provide a search result by searching a search database with a search word received from a user, and in some cases, provide searched data classified into categories. Categories that categorize search results include sites, web documents, knowledge, videos, music, images, dictionaries, and news.

Here, the category of the search result is defined by the external identification of the data and indicates a field (site, web document, knowledge, blog, news, shopping) or a format (image, video, music). If a user searches with a specific field or format of the search result in mind, the search results classified into categories are helpful. On the other hand, categorized search results are not helpful if a user searches because they do not know what the search term is or if they search with a specific knowledge content in mind. This is because the user must directly grasp the contents of all the data classified into the categories to determine whether the information is the desired information.

Therefore, when a user searches to obtain the contents of a search word or related contents, a process of individually checking the individual information of the search result is involved regardless of the category classification. The actual success of the search is determined based on the contents of the information confirming the search result, not the presence or absence of the search result. However, it is very difficult to provide a search service that defines the contents (topics), relations, and the like of the search results and classifies them by the defined contents.

Accordingly, the applicant has applied for patent applications Nos. 10-2005-104668 and 10-2008-118067 to provide information retrieval technology of graphic topology structure using knowledge nodes (word nodes) that can grasp the meaning of individual information at a glance. have. This technology provides the content relevance of information as a search result so that a user can immediately specify a search target. That is, when the search server receives a search word from the user, the search server presents a node graph showing the relation of related words, and the user intuitively judges the content, and when the user selects a specific word node according to the determined correlation, the search server responds to the search. It was to provide information. This saves users the trouble of reviewing the search results and repeating additional searches, or reviewing the search results to discard unnecessary information and collect necessary information.

Furthermore, the applicant has applied for the patent application Nos. 10-2009-33995, 10-2009-33996 and 10-2009-107536 associated with the above technology, and the search service of audiovisual information and the cross-search service of content information and audiovisual information and its database Has provided automatic build technology. What is described in these three applications is that the information search results are not directly provided to the user's keyword search request, but the commonality of the words related to the search results is provided first. Therefore, the user can grasp the contents in advance from the relation of the words and then receive and review the search results corresponding to the desired contents.

Therefore, Applicant applies the technical mechanism of the above application to the existing search system and does not immediately present the search result to the user's search request, but defines the word association between the search term and the search result and searches the content classified according to the defined association. We want to provide a result.

The present invention has been made in view of the above-described points, and upon receiving a user's search request, real-time analysis of the search results searched for by the user's search term defines a relationship between the search word and related words, and for each related word of the relationship. An object of the present invention is to provide a search method and a search system for classifying and providing search results.

According to a search method using the word association between the search word and the search result of the present invention for achieving the above object, by receiving a search request from the user terminal using a wired, wireless network to classify the contents of the search result in real time In a search method of a search server provided, (S21) the search server searches a plurality of first content links ranked higher in a first search result received through a search site using a user search keyword requested from a user terminal. Screening; (S22) analyzing first content information corresponding to each first content link, and selecting a plurality of first related words by a number set by a user in order of appearance frequency across all first content information; (S23) selecting a plurality of second content links ranked higher in a second search result received from the search site by using a combination of the user search keyword and the first related word as a search keyword; (S24) selecting a plurality of second related words by a number set by a user in order of appearance frequency in second content information corresponding to each second content link; (S25) extracting one or more multimedia information links ranked higher in the multimedia search result using one or more second related words as search keywords; And (S26) real-time generation of a search result page including a user search keyword, a first related word, a URL of a second content link, a second related word, and a multimedia information link for each second content information and providing the same to a user terminal. And the search server analyzes the word association between the user search keyword and the search result and provides search results classified by the first related word.

In addition, the search server is a search keyword input method, a search site, a multimedia information search site, the number of first content links, the number of second content links, the number of first related words, the number of second related words, and the multimedia information link from a user terminal. At least one of the numbers may be set as a search option.

The search server may receive a search request including a search keyword from a user terminal.

In addition, the search server is characterized in that the weight of the frequency for the corresponding word is differentially assigned according to the region in which the word appears when calculating the frequency of the first or second related words.

According to a preferred feature of the invention, it is characterized in that the URL of the audiovisual information linked to each of the second content information is extracted and determined to be a multimedia information link corresponding to the second content information.

Furthermore, the search server may search for at least one or more of a video, an image, music, and a web map with the multimedia information.

Preferably, the search server extracts one or more multimedia information links ranked higher in the multimedia search results collected from the search site using the second related word as a search keyword in the order of high frequency.

Further, the search server may extract one or more multimedia information links ranked higher in the multimedia search results collected from the search site by using search keywords combining two or more words among the second related words.

The search server may generate the search keyword by combining two or more words in the order of appearance frequency among the second related words.

Further, the search server may display word association relations of a plurality of first related words with respect to a user search keyword in order of frequency of the first related words, and correspond to the respective first related words (the second related words of the related words). Multimedia information) and (individual search result) are generated and provided to the user terminal.

The search server may generate a second search result by classifying the entire first content information into first related words.

On the other hand, according to the search system of the present invention, in a search system that receives a user's search request using a wired, wireless network and classifies and provides the contents of the search result in real time, the search request received from the user terminal is extracted Analyze the contents of the first search result searched by the user search keyword and the contents of the second search result searched by the combination of the user search keyword and the first related word extracted in the order of the high frequency of the first search result. The user terminal is generated by extracting a second related word of the order, generating a search result page including a second search result arranged in the order of the frequency of the first related word and the second related word and a multimedia information link searched by the second related word. Providing search server; And requesting a search by transmitting a user search keyword to the search server, receiving the search result page including multimedia information of a search result and an individual search result sorted by the frequency of first and second related words, and receiving the search result page. Including a user terminal to display, the search server analyzes the word association between the user search keyword and the search result and provides search results classified by the first related word.

According to the present invention, when providing a search result searched in real time with a user search keyword, the relevant words having a word association with the user search keyword are extracted from the search result, and a search result classified into individual related words is provided.

In particular, we categorize individual content into relevant words that are relevant to your search keywords on the various themes of your search results. You can visit

In addition, the user may be provided with multimedia data classified separately from the search results classified as the related words of the word association relationship, so that the user can play the multimedia data in advance in the state of knowing what the content is.

Hereinafter, the configuration of a preferred embodiment of the present invention with reference to the accompanying drawings. Prior to this, terms or words used in the specification and claims should not be construed as having a conventional or dictionary meaning, and the inventors should properly explain the concept of terms in order to best explain their own invention. Based on the principle that can be defined, it should be interpreted as meaning and concept corresponding to the technical idea of the present invention.

<1. System configuration>

1 shows a schematic configuration of a retrieval system 1 according to an embodiment of the present invention.

Search system 1 according to an embodiment of the present invention is a search server for providing a search result page that the content classification of the search result data for each word appearing at the highest frequency after analyzing the search results searched by the user search keyword in real time ( 2) and a user terminal 3 that requests a search by transmitting a user search keyword to the search server 2 using a wired or wireless network, and receives a search result page in which the contents of the search result are classified from the search server 2. It is configured to include.

In the present invention, the wired / wireless network typically includes all communication networks capable of Internet communication using various protocols such as mobile communication networks, wired, wireless public networks or dedicated networks.

The user terminal 3 includes a computer terminal, a mobile communication terminal, and other portable terminals, which are provided with a web browser and which can access a home page. That is, the user terminal 3 requests a search to the search server 2 by inputting a user search keyword into a search box of a web page regardless of the type of paper or the communication method, and searches the content by the search server 2. It is a terminal that receives a page and displays it on the screen.

The search server 2 analyzes the words of the search results searched by the user search keyword and classifies the search results according to the word association of the analyzed words and the user search keyword. The search result page in which the classified search results are arranged is generated and provided to the user terminal 3.

First, the search server 2 provides a search page to the user terminal 3 and receives a user search keyword input from the search window to receive a search request. When the search request is received, the first search result is searched by the user search keyword. When the first search is completed, the first search result is analyzed to extract the first related words and sorted by high extraction frequency. Once the first related words are sorted, the secondary search results are searched for by the combination of the user search keyword and each first related word. Accordingly, the second search result is a content classification of the first search result by the first related word. After the secondary search is completed, the secondary search results are analyzed to extract the second related words and sort them by high extraction frequency. Then, the multimedia information is searched in the third ordered second related word.

Next, when the third search is completed, the search server 3 generates a search result page of the multimedia information search result that is third searched from the user search keyword, the first search result related to the first relevant word, and the second search result. To provide to the user terminal (3).

Here, the user search keyword and the first related word used as the search keyword in the second search appear in the order of high frequency with respect to the user search keyword, and there is a word association of the first relevant word. Therefore, when the search user is provided with the secondary search results classified by the first related words having the word association relationship, the contents of the search result can be grasped in advance from the word association.

In the present invention, the search results are not limited to specific types or formats of data such as sites, knowledge, dictionaries, news, blogs, and multimedia. However, the search results of the multimedia data format are classified into third search results.

2 shows the internal structure of a search server 2 according to an embodiment of the present invention.

The search server 2 according to an embodiment of the present invention analyzes the first content information by the first content selection means 21 for selecting the first content link from the first search result searched by the user search keyword, and the first content information. A first related word selecting means 22 for selecting a related word, a second content selecting means for selecting a second content link from a second search result searched using a combination of a user search keyword and the first related word as a search keyword ( 23), a second related word selecting means 24 for extracting a second related word having a high frequency of appearance from the second content information, and a multimedia for extracting a multimedia information link from a multimedia search result using the second related word as a search keyword. A search result page including the user search keyword, the first related word, the URL of the second content link, the second related word and the multimedia information link for the information extraction means 25 and the second content information. It is characterized in that it comprises a search result page providing means (26) for providing the user to the user terminal (3).

Detailed functions and operations of the individual components constituting the above-described search server 2 will be described through a search method described below.

<2. Method composition>

The search method using the word association between the search word and the search result according to the embodiment of the present invention can be preferably realized through the construction of the search system 1 described above.

3 shows a schematic sequence of a search method according to an embodiment of the present invention. Reference is appropriately made to FIGS. 4 to 7 for explanation of the individual steps of FIG. 3. 4-7 illustrate a model that defines word associations of a search result according to an embodiment of the invention.

First, the user accesses the home page of the search server 2 using the user terminal 3. The user terminal 3 receives a web page provided by the search server 2 and displays it on the screen. At this time, a search box is displayed on the web page, and the user inputs a user search keyword and presses a search button. The search server 2 receives a search request including a user search keyword from the user terminal 3.

Next, the first content selection means 21 of the search server 2 extracts the user search keyword from the received search request, transmits the user search keyword to the search site, and ranks higher in the received first search results. A plurality of first content links are selected (S21).

4 schematically illustrates a process of the search server 2 selecting the first content link from the first search result searched by the user search keyword.

The first content selection means 21 performs a first search with a user search keyword. FIG. 4 illustrates that 300 content links of the content link A1 to the content link A300 have been searched as the first search result. In this case, the search server 2 may perform the search on at least one or more company or third party search sites (eg, Naver, Yahoo, Google, etc.) previously designated by the search user or the operator of the search server 2.

When searching using a database of a third-party search site, the public API provided by the search site can be used. The open API refers to an interface for requesting a search from the search site. The search server 2 may request a search by transmitting a user search keyword to a program coded according to an API protocol proposed by the search site.

Then, the first content screening means 21 selects the top 10 in order of the number of user search keywords included in the title, tag, text, and the like of the content information corresponding to the respective content links, for the first 300 searched content links. First content links (first content link 1 to first content link 10) are selected. At this time, the selection number of the top 10 first content links is only one embodiment, and it is apparent to those skilled in the art that the service provider defaults or the various numbers may be specified by the user. In addition, the top 10 orders may be applied to various criteria such as the number of visits (hits), the latest registration date of the user.

When the selection of the first related word is completed, the first related word selecting means 22 analyzes the corresponding first content information for each first content link to search for a plurality of first occurrences of a plurality of first high frequency items across all the first content information. 1 extract the related words (S22)

FIG. 5 schematically illustrates a process of selecting a first related word for a first content link selected from the first search result of FIG. 4.

First, the following word extraction method is applied to the first relevant word selection means 22 to extract a word by analyzing a search result.

When ten first content links are selected, the first related word selection means 22 analyzes the frequency for each word that can be identified for each content link. That is, a title, a tag, a text, and the like are analyzed for each piece of first content information, and a list of identifiable words is extracted, and then frequency is analyzed for each word. Here, the frequency means the number of times a word appears in a title, tag, text, and the like.

Here, as an example of a method of analyzing the frequency of each word for each first content information, a morpheme analysis method may be used. That is, frequency can be calculated for each word through morphological analysis using a commercially available morphological analysis tool. As another example, the techniques disclosed in Korean Patent Publication No. 2001-0055114, Korean Patent Publication No. 2004-0101678, Korean Patent Publication No. 2002-0054254 and the like may be used.

Preferably, in the process of calculating the frequency of the word, the weight for each word appearance region may be differentially assigned to the frequency. For example, if a particular word appears in the text, the frequency may be counted as 1, and if it appears in the title, the frequency may be calculated as 2 by applying a weight. This is because, when a specific word is located at the same place as the title than when it is located in the text, it may be determined that the relatedness of the content of the word to the corresponding content information is relatively high. The weighting form for such a frequency is only one embodiment, and the weighting method may be variously modified.

When the frequency analysis for each word is completed for each first content information as described above, the search server 2 extracts a plurality of high frequency words with high frequency. That is, the frequency of each word is compared with respect to each piece of first content information, and a certain number of words having a high frequency are selected.

Then, the first related word selecting means 22 compares the total frequency of the extracted words, and selects a plurality of first related words having a high total frequency. Here, the total frequency means the sum of all the frequencies in the first content information with respect to each of the higher frequency words analyzed from the first content information. For example, a word to A n 1 times in the first content information A1, a first n 2 times in the content information A2, the first content information in the A3 n 3 times, ..., 300 n in the first content information A300 If times were exposed, then the total frequency of the particular word A is calculated as n 1 + n 2 + n 3 + ... + n 300 . This overall frequency calculation process is repeated for higher frequency words extracted from each content information.

The above word extraction method will be described with reference to FIG. 5. For each of the ten content links (first content links 1 to 1 content link 10) selected by the search server 2, the corresponding word content extraction methods correspond to individual first content links. 15 high frequency words are extracted for each piece of first content information. Therefore, 150 high frequency words are extracted for all ten pieces of first content information.

Then, the first related word selecting means 22 removes duplication from the extracted total 150 high frequency words. In this embodiment, it is assumed that the total number of high frequency words from which duplicates are removed is 80. Then, for each of the 80 upper frequency words, the frequency in the first content information 1 to 10 is summed to obtain the total frequency. As described above, if the frequency of the extracted specific word is 1 to 5, the first content information 4 to 2, and the first content information 7 to 3, the total frequency of the word becomes 10. In this way, when the total frequency for the 80 extracted words is calculated, five words are selected as the first related words in order of the highest frequency words.

At this time, like the embodiment of FIG. 4, the number of extraction of the 15 high frequency words and the number of selection of the first related word of 5 are just one embodiment, and the present invention is not limited to this specific number. .

When the selection of the first related word is completed, the second content selection means 23 searches for the second search result from the search site using the mutual combination of the user search keyword and the first related word as a search keyword, and the plurality of ranks ranked at the top. Select the second content link (S23). Here, the number of the selected second content links may also be predetermined by the user or the search service provider. Meanwhile, it is also possible to generate the second search result by classifying each of the 300 first search results of FIG. 4 searched by the user search keyword into respective first related words.

FIG. 6 is a diagram schematically illustrating a process of searching and selecting a second content link by a search server 2 using a combination of a user search keyword and a first related word.

As described in the embodiments of FIGS. 4 and 5, five words of “first related word-1” to “first related word-5” are the first related words from the first search result searched by the user search keyword. When a word is selected, a total of five combinations of the user search keyword and the first related word may be made.

Then, the second content screening means 23 secondly searches the content link through the search site for each of the five word combinations and receives the result to select a predetermined number of higher content links from the search results. For example, as shown in FIG. 6, a search result page is transmitted by searching for a content link including a word combination of "user search keyword * first related word-1" from a search site. In addition, the fifty second content links (second content links 1 to 2 content links 50) ranked above are selected from among the content links included in the received search result page. Of course, the number of 50 screening may also be preset.

Although, in the embodiment of FIG. 6, the second content link is selected only for one word combination "user search keyword * first related word-1" for convenience of description, the content link for each of the remaining four word combinations. 50 screens are selected. Therefore, the total number of second content links selected by the search server 2 for the five word combinations is 250.

When the search for the second content link is completed, the second related word selecting means 24 extracts a plurality of second related words having a high frequency of appearance from the second content information corresponding to each second content link (S24). ). Since the total number of the second content links is 250, the extraction of the second related word is performed 250 times.

That is, the second related word selecting means 24 downloads the second content information from the server providing the corresponding content information by using the respective second content links, and then displays the title, tag, text, etc. of the second content information. After analyzing and extracting a list of identifiable words, frequency is analyzed for each word to extract a predetermined number of second related words. The extraction of the second related word and the frequency analysis for each word are substantially the same as the extraction of the first related word and the frequency analysis for each word. Therefore, it will be apparent to those skilled in the art that the above-mentioned "weight for each word appearance area" may be reflected in the frequency when calculating the frequency of each second related word.

When the extraction of the second related word is completed, the multimedia information extracting means 25 extracts one or more multimedia information links ranked higher in the multimedia search result using the one or more second related words as a search keyword (S25).

Here, the multimedia information link may be a link to one or more multimedia data of a video, music, image, and web map. In addition, the number of extraction of the multimedia information link can be appropriately adjusted.

The multimedia information extracting means 25 may extract one or more multimedia information links ranked higher in the multimedia search results collected from the search site using the word having the highest frequency of appearance among the second related words. For example, when three words P, Q, and R exist in the order of high frequency as the second related word, the multimedia search may be performed on only P, and the multimedia information link may be extracted from the search results.

In addition, the search server 2 may extract one or more multimedia information links ranked higher in the multimedia search results collected from the search site using a search keyword combining two or more words among the second related words. For example, when P, Q, and R exist as the second related words as in the above-described embodiment, multimedia may be generated using a word combination of P and Q, a word combination of P and R, or a word combination of P, Q, and R. A search may be performed and the multimedia information link may be extracted from the search result accordingly.

In this case, the search keyword may be generated by combining two or more words in the order of appearance frequency among the second related words. That is, it is good to construct a word combination using the second related word having a relatively high frequency.

Further, when the number of multimedia information links is n, the search server 2 designates related words as search keywords in the order of appearance frequency among the second related words, and when the number of extracted multimedia information links becomes n, Until the multimedia search results corresponding to the relevant words can be collected. For example, if it is designated to extract three multimedia information links, if the number of the multimedia information links extracted by using the second related word having the highest frequency of appearance among the second related words is three or more, the other second related information You no longer need to perform a multimedia search on the words. However, if the number of extracted multimedia information links is less than three, the multimedia information links are extracted using a second related word having a high frequency of appearance, and the number of multimedia information links is insufficient.

FIG. 7 illustrates a data structure and actual data defining word associations of the 50 selected second content links after searching with the user search keyword and the first related word-1 in FIG. 6.

For convenience of explanation, it is assumed that the user search keyword is "Radiostar", and the first related words extracted in order of frequency from the first content link are "Park Joong-hun", "Anseong", "Singer", "Musical", and "Gangwon-do". . This assumption is shown in Table 1.

User search keywords First Related Words (Frequency)

radio star

Jung Hoon Park
Ahn Sung-ki Singer musical Gangwon-do

The combination of "search keyword * first related word-1" in FIG. 6 is "radio star * Park Joong Hoon". In addition, "serial numbers" 1 to 50 of FIG. 7 correspond to a data structure of second content information corresponding to the 50 second content links (second content links 1 to 2 content links 50) selected in FIG. 6. That is, the related data for "second content link 1" of FIG. 6 corresponds to the serial number 1 sector of FIG. In FIG. 7, although two pieces of second content information exist, substantially 50 pieces of information exist.

A search result that defines word association according to an embodiment of the present invention is a "search keyword", "first related word", "second content link", "second related word" "multimedia information link" and "audio-visual information link" Data structure.

First, the search server 2 first searches for a search result corresponding to "radiostar" and selects first content information. Thereafter, the first content information is analyzed to extract a plurality of "first related words" in order of frequency of word appearance.

Next, the second content information is secondary searched and selected by combining "Radio Star" and "Park Joong Hoon". A plurality of second "words related to words" are extracted by analyzing word occurrence frequencies of the second content information, and a third search for multimedia information is performed using the second words.

Here, the "multimedia information link" is the second related word and the result of searching the multimedia information in the search site, and the "audio-visual information link" is the internal link or attached to the second content information. Although two multimedia information links and one audiovisual information link are illustrated in FIG. 7, two and one are merely examples, and it is apparent to those skilled in the art that URLs for a plurality of videos, images, music, web maps, and the like may be stored.

In addition, the second content information (second search result) corresponds to the content classification of the first content information (first search result) by "first related word". That is, since the frequency of the first related word "Park Joong Hoon" is the highest among the first search results searched for by the user search keyword "Radio Star", the second content link searched by the word combination of "Radio Star * Park Joong Hoon" is "Radio Star". It can be judged that the content is most relevant. That is, when the search server 2 sorts the total search results of "Radio Star" by the five first related words in Table 1, it is classified into five groups according to the content relevance. Thus, if the user is provided with a search results page that includes a second content link categorized into five groups, the information content of the second content link can be estimated from the word association of the user search keyword with the respective first related word. That is, in the present invention, since the user is provided with a content classified search result page, the user can know in advance what the content of the subject is without visiting by clicking an individual link.

Meanwhile, as described above, when the number of extraction of the multimedia information link is predetermined, the search server 2 may search and select the multimedia information link by sequentially using a word having a high frequency among the second related words.

For example, it is assumed that among the second related words of the second content information about the serial number 1 of FIG. 5, the order of 'highest', 'best' and 'junjun' are in the order of high frequency. In this case, if two multimedia information links are searched for content information No. 1, first, the most frequent word 'Bee and You' is designated as a search word and the multimedia information link is searched through a search site, and a search result page is received. . At this time, if the number of links to the multimedia information included in the search results page is two or more, the top of the multimedia information links included in the search results page without searching the multimedia information link by using the 'lowest' and 'Lee Jun-ik' which are relatively low in frequency. The URL information of the two pieces of multimedia information ranked at is extracted and stored as a multimedia information link corresponding to the first content information.

However, if the number of multimedia information links included in the search result page is one or less, the next highest frequency word 'high' is designated as a search word, and the search site receives a search result page for the multimedia information link. The highest ranked multimedia information ranked in the search result page is designated as the remaining multimedia information, and the URL of the designated multimedia information link is extracted and stored as multimedia data corresponding to the first content information. If the multimedia information is not searched even by the search for the next higher frequency word, the above-described process is repeated with the lowest frequency 'Lee Jun-ik' as a search word and the URL of the multimedia information link is stored in the database. When the search server 2 extracts the URL of the multimedia information, it has already been mentioned that the public API provided by the existing search site such as YouTube or Google can be used.

On the other hand, Figure 7 shows a data structure for the 50 second content information for the word combination 'Radio Star, Park Joong Hoon', but referring to Table 1, such data is' Radio Star, Ahn Sung Ki ',' Radio Star, A data structure for 50 pieces of second content information may also be constructed for word combinations of 'singer', 'radio star, musical' and 'radio star, Gangwon-do'. Accordingly, a data structure for 250 pieces of second content information that is relevant as a user search keyword of 'radio star' may be constructed. In other words, the data structure classified by the related words by analyzing the word frequency with "Park Joong-Hoon", "Anseong", "Singer", "Musical", and "Gangwon-do" with hundreds of related content information with one user search keyword "Radiostar" Can be generated in real time. In addition, since the number of the related content information can be expanded according to the number of the first and second related words, the selection number of the first and second content links, and the like, a data structure for providing relevant content information, audiovisual information, and the like. Can be easily built and expanded.

6 and 7, only the case of combining two words such as 'radio star, Park Jung-hoon', or 'radio star, Ahn Sung-ki' has been described. However, the number of words that can be combined can be extended beyond that. That is, when the search keyword and the two or more first related words are combined, the number of combination words used when searching for the second content information may be extended to three or more.

For example, in the embodiment of FIG. 7, if one of the second related words 'bee and you' is added to generate a word combination of 'radio star, Park Joong-hun, rain and you', the second content for the word combination is generated. The sorting means 23 receives a search result page for the second content information from the search site, and sorts the plurality of second content links ranked above. In this case, 'radio star' may be stored as a search keyword in the data structure, 'Park Joong Hoon' as the first related word-1, and 'rain and you' as the first related word-2. Therefore, according to this embodiment, it is possible to continue to expand the additional connection structure between the relevant words.

Of course, it is obvious that the above-described data structure can be constructed as a database and utilized as a search DB. That is, it generates a search result that defines word relevance by real-time search for the user's search request, responds to the search result page, saves the real-time generated search result in the search DB, and then searches the search result from the search DB the next time a user search request is made. It is possible to provide as a search results page. If the search results are provided using the search DB, the remaining data should be periodically updated with the latest data using the user search keyword.

8 illustrates a screen in which a user sets a search option according to an embodiment of the present invention.

The search server 2 receives data extraction information necessary to generate a search result having the data structure of FIG. 7 in real time from the user terminal 3 and stores it as a search option. The initial default value is a value set by the service provider and is displayed only when the user wants to change the search option.

The screen interface includes a search keyword input method, search site access information, search site access information to collect multimedia search results, the number of first and second content links, the number of first and second related words, and the multimedia information link to be extracted. It is provided with an interface for entering the number of.

When data extraction information is input through the interface screen, the search database construction apparatus 100 according to the present invention performs the above-described operation according to the input data extraction information.

For example, as shown in FIG. 8, when 'direct input' is input as the search keyword input method, the search keyword is directly input from the user terminal 3. In addition, when 'Naver' is input to the search site, the first content sorting means 21 and the second content sorting means 23 designate a search engine of the 'Naver' search site, request a search, and receive a search result. do. Then, when the 'youtube' is input to the multimedia information search site, the multimedia information extracting means 25 searches for and extracts the multimedia information link from the YouTube site.

In addition, when '10' and '50' are input as the number of the first and second content links, the first and second content selection means 21 and 23 may generate 10 and 50 first and second content links, respectively. Dogs are screened. As shown in the figure, when '5' and '3' are input as the number of first and second related words, the first related word selecting means 22 selects five first related words, and the second related words. The selecting means 24 selects three second related words. In addition, if the number of multimedia information links is input as '2', two multimedia information links corresponding to the second content information are extracted.

Although not shown in the figure, a selection category may be input for the first and second content links such as web pages, sites, intellectuals, videos, images, blogs, and the like as search option information for data extraction. In this case, the first and second content links are searched and selected from the input category.

Meanwhile, it will be apparent to those skilled in the art that the data extraction information input screen shown in FIG. 8 is only an example and may be configured in various forms.

When the generation of the search result is completed by the real-time search, the search result page providing means 26 provides the user search keyword, the first related word, the URL of the second content link, the second related word and the multimedia for each second content information. The search result page including the information link is generated in real time and provided to the user terminal 3 (S26).

9 illustrates a search result page categorized by word relevance according to an embodiment of the present invention.

The search result page includes a search box 201 and a search button 202, in which a user search keyword is displayed.

The first related words tab 203 lists the first related words categorized by the word association with respect to the user search keyword. The search box 201, the search button 202, and the first related word tab 203 are always displayed at the top of the screen of the search results page, and the user selects a specific tab at a position below the first related word tab 203. Each time a search result page of the corresponding first related word is displayed. For example, when the user presses the tab of "first related word 1", the combination of "user search keyword + first related word 1" is sent to the search server 2 to receive a corresponding search result page. Of course, when the user presses the "first related word 2" tab, the user may receive and display a search result page corresponding to "user search keyword + second related word 2".

On the other hand, it is also possible to configure the first related word tab 203 as a button, to display all 250 contents on a search result page, and to move to the corresponding position whenever a button of a specific first related word is selected. For example, when the user presses the button "first related word 2", the user moves to the position of the 51st second content link and displays the screen.

Below the first related word tab 203, the second content links searched using the respective first related word are displayed respectively. Next to each individual second content link is a second related word 204. A plurality of second related words 204 may be listed, and next to them, a multimedia button 205 for inquiring multimedia data searched by the second related words is displayed. The multimedia button 205 displays multimedia classification titles such as "video", "music", "image", and "web map", and may include the number of data in parentheses next to the button title.

Accordingly, when a search result page provided from the search server 2 is displayed after a user inputs a search keyword, five first related words arranged according to content relevance through the first related word tab 203 are displayed. The words are presented. The display of the relationship between the user search keyword and the first related word may be interfaced using various structures such as graphs, trees, or tables and symbols (parentheses, arrows, etc.).

In addition, the user may view the content information (second content link) classified as the first related word of interest and determine whether to visit by selecting a specific content link while recognizing the content in advance. In addition, it is possible to determine whether to query the multimedia data while receiving the most relevant second related words related to the individual second content links and also recognizing the content as the second related words.

The search result page screen shown in FIG. 9 is merely an example, and it is apparent to those skilled in the art that various forms can be configured.

As described above, an embodiment of a search method and a search system using a word association between a search word and a search result according to the present invention is constructed. While the present invention has been particularly shown and described with reference to exemplary embodiments thereof, it is to be understood that the invention is not limited to the disclosed exemplary embodiments. It goes without saying that various modifications and variations are possible within the scope of equivalence of the scope.

The following drawings, which are attached to this specification, illustrate preferred embodiments of the present invention, and together with the detailed description of the present invention serve to further understand the technical spirit of the present invention. It should not be construed as limited to.

1 is a schematic structural diagram of a search system according to an embodiment of the present invention;

2 is a schematic internal structural diagram of a search server according to an embodiment of the present invention;

3 is a schematic flowchart of a search method according to an embodiment of the present invention;

4 to 7 illustrate example word definitions of a search result according to an exemplary embodiment of the present invention.

8 is an exemplary view of a search option setting screen using word relevance according to an embodiment of the present invention.

9 is an exemplary view of a search result page categorized by word relevance according to an embodiment of the present invention.

Claims (13)

  1. In a search method of a search server for receiving a search request from a user terminal using a wired, wireless network to classify and provide the contents of the search results in real time,
    (S21) the search server selecting a plurality of first content links ranked higher in the first search result received through the search site by using the user search keyword requested from the user terminal;
    (S22) analyzing first content information corresponding to each first content link and selecting a plurality of first related words by a number set by a user in order of appearance frequency across all first content information;
    (S23) selecting a plurality of second content links ranked higher in a second search result received from the search site by using a combination of the user search keyword and the first related word as a search keyword;
    (S24) selecting a plurality of second related words by a number set by a user in order of appearance frequency in second content information corresponding to each second content link;
    (S25) extracting one or more multimedia information links ranked higher in the multimedia search result using one or more second related words as search keywords; And
    (S26) generating, in real time, a search result page including a user search keyword, a first related word, a URL of a second content link, a second related word, and a multimedia information link for each second content information to a user terminal; step
    And a search server analyzing the word association between the user search keyword and the search result and providing search results classified by the first related word.
  2. The method of claim 1,
    The search server,
    Search for at least one of a search keyword input method, a search site, a multimedia information search site, a first content link number, a second content link number, a first related word number, a second related word number, and a multimedia information link number from a user terminal; Search method, which can be set as an option.
  3. The method of claim 1,
    The search server,
    A search method comprising receiving a search request including a search keyword from a user terminal.
  4. The method according to any one of claims 1 to 3,
    The search server,
    The method of claim 1, wherein the weighting of the frequency of the first or second related words is differentially assigned according to the region in which the word appears.
  5. The method of claim 1,
    The search server,
    And extracting the URL of the audiovisual information linked to the respective second content information to determine the multimedia information link corresponding to the second content information.
  6. The method according to claim 1 or 5,
    The search server,
    And searching for at least one of a video, an image, a music, and a web map as the multimedia information.
  7. The method of claim 1,
    The search server,
    And searching for one or more multimedia information links ranked higher in the multimedia search results collected from the search site using the second related word as the search keyword in the order of high frequency.
  8. The method according to claim 1 or 7,
    The search server,
    A search method comprising extracting one or more multimedia information links ranked at the top from a multimedia search result collected from a search site by using a search keyword combining two or more words among the second related words.
  9. The method of claim 8,
    The search server,
    The search method of claim 2, wherein the search keyword is generated by combining two or more words in the order of appearance frequency among the second related words.
  10. The method of claim 9,
    The search server,
    The word association relation of the plurality of first related words is displayed in the frequency order of the first related words with respect to the user search keyword, and the corresponding (multimedia information of the second related words) and (individual search results) corresponding to the respective first related words are displayed. The search method, characterized in that to generate the search results page that is displayed to provide to the user terminal.
  11. The method of claim 10,
    And the search server generates a second search result by classifying the first search result searched by the user search keyword into the first related word.
  12. In a search system that receives a user's search request using a wired, wireless network and classifies the content of the search result in real time,
    Receives a search request from a user terminal, analyzes the contents of the first search result searched with the extracted user search keyword, and searches with a combination of the user search keyword and the first related words extracted in the order of higher frequency of the first search result. Analyze the contents of the secondary search results to extract the second related words in the higher frequency order, and the second search results arranged in the frequency order of the first related words and the second related words and the multimedia information links searched by the second related words. A search server generating a search result page including the search result page and providing the search result page to the user terminal; And
    Send a user search keyword to the search server to request a search, and receive and display the search result page including multimedia information of individual search results and search results sorted by the frequency of first and second related words. User terminal
    And a search server analyzing the word association between the user search keyword and the search result and providing search results classified by the first related word.
  13. The method of claim 12,
    The search server,
    Means for receiving a search request from a user terminal to extract a user search keyword, and using the user search keyword to select a plurality of first content links ranked above in a first search result received through a search site;
    Means for analyzing first content information corresponding to each first content link and selecting a plurality of first related words by a number set by a user in order of appearance frequency across all first content information;
    Means for selecting a plurality of second content links ranked higher in a second search result received from the search site using the mutual combination of the user search keyword and the first related word as a search keyword;
    Means for selecting a plurality of second related words by a number set by a user in order of appearance frequency in corresponding second content information for each second content link;
    Means for extracting at least one multimedia information link ranked above in a multimedia search result using at least one second related word as a search keyword; And
    Means for generating in real time a search result page including a user search keyword, a first related word, a URL of a second content link, a second related word, and a multimedia information link for each second content information to a user terminal;
    Search system comprising a.
KR1020090129156A 2009-12-22 2009-12-22 Search Method for using word association between search keyword and search result and system thereof KR101134073B1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
KR1020090129156A KR101134073B1 (en) 2009-12-22 2009-12-22 Search Method for using word association between search keyword and search result and system thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
KR1020090129156A KR101134073B1 (en) 2009-12-22 2009-12-22 Search Method for using word association between search keyword and search result and system thereof

Publications (2)

Publication Number Publication Date
KR20110072296A KR20110072296A (en) 2011-06-29
KR101134073B1 true KR101134073B1 (en) 2012-04-13

Family

ID=44403231

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020090129156A KR101134073B1 (en) 2009-12-22 2009-12-22 Search Method for using word association between search keyword and search result and system thereof

Country Status (1)

Country Link
KR (1) KR101134073B1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101331453B1 (en) * 2011-08-10 2013-11-20 (주)다음소프트 A method of extend keyword advertisement based on associative word
KR101458140B1 (en) * 2012-05-10 2014-11-12 최진근 System for gathering information using word association and method thereof

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070027869A1 (en) 2005-07-29 2007-02-01 Collins Robert J System and method for reordering a result set copyright notice
KR20080093605A (en) * 2007-04-17 2008-10-22 (주)야긴스텍 Intelligent ecm system based on the ontology
KR20090081270A (en) * 2008-01-23 2009-07-28 삼성전자주식회사 Method and system for searching contents

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070027869A1 (en) 2005-07-29 2007-02-01 Collins Robert J System and method for reordering a result set copyright notice
KR20080093605A (en) * 2007-04-17 2008-10-22 (주)야긴스텍 Intelligent ecm system based on the ontology
KR20090081270A (en) * 2008-01-23 2009-07-28 삼성전자주식회사 Method and system for searching contents

Also Published As

Publication number Publication date
KR20110072296A (en) 2011-06-29

Similar Documents

Publication Publication Date Title
JP5175339B2 (en) Method and system for providing appropriate information to users of devices in a local network
CA2648269C (en) Information analyzing method and apparatus
JP5114380B2 (en) Reranking and enhancing the relevance of search results
US9192684B1 (en) Customization of search results for search queries received from third party sites
US9690786B2 (en) Systems and methods for dynamically creating hyperlinks associated with relevant multimedia content
KR101278406B1 (en) System and method for assisting search requests with vertical suggestions
US7099861B2 (en) System and method for facilitating internet search by providing web document layout image
US8176068B2 (en) Method and system for suggesting search queries on electronic devices
US6256648B1 (en) System and method for selecting and displaying hyperlinked information resources
US20110252016A1 (en) Providing Relevance-Ordered Categories of Information
JP4623820B2 (en) Network-based information retrieval system and document search promotion method
JP4648455B2 (en) Personalized search method and personalized search system
US8577856B2 (en) System and method for enabling search of content
US20130238613A1 (en) Blending Mobile Search Results
US6490579B1 (en) Search engine system and method utilizing context of heterogeneous information resources
JP2005500624A (en) Strategic information hub
JP5313931B2 (en) A framework for correlating content on the local network with information on the external network
US7917840B2 (en) Dynamic aggregation and display of contextually relevant content
JP5608286B2 (en) Infinite browsing
US7290061B2 (en) System and method for internet content collaboration
US20060288015A1 (en) Electronic content classification
US8352396B2 (en) Systems and methods for improving web site user experience
CN101563691B (en) Techniques for including collection items in search results
JP2011530118A (en) Providing posts to discussion threads in response to search queries
JP2009140444A (en) Merchandise retrieval device and merchandise retrieval method

Legal Events

Date Code Title Description
A201 Request for examination
E902 Notification of reason for refusal
E701 Decision to grant or registration of patent right
GRNT Written decision to grant
LAPS Lapse due to unpaid annual fee