KR101272254B1 - System for providing searching service and method for generating related keyword - Google Patents

System for providing searching service and method for generating related keyword Download PDF

Info

Publication number
KR101272254B1
KR101272254B1 KR1020110088071A KR20110088071A KR101272254B1 KR 101272254 B1 KR101272254 B1 KR 101272254B1 KR 1020110088071 A KR1020110088071 A KR 1020110088071A KR 20110088071 A KR20110088071 A KR 20110088071A KR 101272254 B1 KR101272254 B1 KR 101272254B1
Authority
KR
South Korea
Prior art keywords
search
search word
candidate
original
word
Prior art date
Application number
KR1020110088071A
Other languages
Korean (ko)
Other versions
KR20130024554A (en
Inventor
김찬주
Original Assignee
주식회사 다음커뮤니케이션
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 주식회사 다음커뮤니케이션 filed Critical 주식회사 다음커뮤니케이션
Priority to KR1020110088071A priority Critical patent/KR101272254B1/en
Publication of KR20130024554A publication Critical patent/KR20130024554A/en
Application granted granted Critical
Publication of KR101272254B1 publication Critical patent/KR101272254B1/en

Links

Images

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Computational Linguistics (AREA)

Abstract

The search service providing system according to the present invention selects two search terms input by a user terminal as a pair of search terms consisting of original search terms and candidate search terms, and a candidate search term selector which determines the validity of the search term pairs, the original search terms and the candidates. And a feature extracting unit extracting a feature value of the search word, and an identical intention search word determining unit determining the candidate search word as the same intention search word of the original search word based on the feature value.

Description

Search service providing system and the method of generating the same intention query {SYSTEM FOR PROVIDING SEARCHING SERVICE AND METHOD FOR GENERATING RELATED KEYWORD}

The present invention relates to a search service providing system and a method of generating the same intention search word.

Internet users search to find desired information on portal sites. In this case, the user's search may be performed by variously inputting a keyword related to the information to be searched for. In particular, portal sites may create specialized content or search results in order to provide more useful information to the user, and may display the corresponding content or search results in a predetermined search word.

The search word input by the user to receive the search service may have various forms. In this case, storing information about several search terms having one purpose, that is, search terms having the same intention and providing accurate search results, is an important factor in determining the quality of a search service.

However, it is not easy to manually build accurate information on search terms with the same intention, and it is difficult for the operator to respond quickly to user demands because there are many parts that the operator does not know.

The technical problem to be achieved by the present invention is to improve the search quality by efficiently database the same intention search words having the same or similar meaning.

The search service providing system according to an exemplary embodiment of the present invention selects two search terms input by a user terminal as a pair of search terms consisting of original search terms and candidate search terms, and a candidate search term selector which determines the validity of the search term pairs. And a feature extractor which extracts a feature value of the candidate search word, and an identical intention search word determiner that determines the candidate search word as a search word having the same intention of the original search word based on the feature value.

The apparatus may further include a search result generator that generates and stores a specific search result corresponding to the original search word in advance.

The validity may be determined based on whether or not the specific search result is exposed.

The candidate search word selector may recognize the validity when the specific search result is exposed to the original search word and the specific search result is not exposed to the candidate search word.

The original search word and the candidate search word may be continuously input by the same user terminal within a predetermined time.

The feature value may include an edit distance between the candidate search word and the original search word, the number of pairings of the candidate search word and the original search word, and the number of pairs of the candidate search word and the other search word where the candidate search word was first searched. , The number of searches for the candidate search term, the number of searches for the original search term, the ratio of the number of searches for the candidate search term and the number of searches for the original search term, a value of whether the first search word of the candidate search word and the original search word is the same, The candidate search word and the original search word may include at least one of a ratio in which the candidate search word is included in the candidate search word and the candidate search word and a common letter of the original search word in the original search word.

The feature value is a value obtained by dividing the number of pairings of the candidate search term and the original search term by the number of pairings of the candidate search term and the other search term where the candidate search term was first searched, and the number of pairings of the candidate search term and the original search term. At least one of a value obtained by dividing the number of searches of the candidate search word and a number of pairings of the candidate search word and the original search word by the number of searches of the original search word may be used.

The same intention search word determining unit may determine the same intention search word according to a score by calculating a score by combining the feature values.

The same intention search word determiner may determine the candidate search word as one of a short search word, a similar search word, and an extended search word according to the method of combining the feature values and the score.

According to another aspect of the present invention, there is provided a method of generating a search query of the same intention, wherein the search service providing system generates a search word having the same intention with respect to the original search word, selecting a search word pair consisting of the original search word and the candidate search word. Determining a validity of a pair of search words, extracting feature values of the original search word and the candidate search word, and determining the candidate search word as the same intention search word of the original search word based on the feature value; .

The determining of the validity may include determining whether a predetermined specific search result is exposed to the original search word, determining whether the specific search result is exposed to the candidate search word, and the specific search result to the original search word. May be exposed, and acknowledging the validity when the specific search result is not exposed to the candidate search word.

The original search word and the candidate search word may be received from the same user terminal.

The original search word and the candidate search word may be continuously input within a predetermined time.

An edit distance of the candidate search word and the original search word, the number of pairing of the candidate search word and the original search word, the number of pairing of the candidate search word and the other search word where the candidate search word was first searched, The number of searches, the number of searches for the original search term, the ratio of the number of searches for the candidate search term and the number of searches for the original search term, a value of whether the first search word of the candidate search word and the original search word is the same, the candidate search word and the circle The common letter of the search word may include at least one of a ratio included in the candidate search word and a ratio in which the common search word and the common letter of the original search word are included in the original search word.

The feature value is a value obtained by dividing the number of pairings of the candidate search term and the original search term by the number of pairings of the candidate search term and the other search term where the candidate search term was first searched, and the number of pairings of the candidate search term and the original search term. At least one of a value obtained by dividing the number of searches of the candidate search word and a number of pairings of the candidate search word and the original search word by the number of searches of the original search word may be used.

The determining of the same intention search word may include calculating a score by combining the feature values and determining the same intention search word according to the score.

The same intention search word may include a short search word, a similar search word, and an extended search word.

According to an embodiment of the present invention, the search quality can be improved by generating the same intention search word with the same intention that meets the needs of the user accurately and efficiently.

1 is a block diagram of a search service providing system according to an exemplary embodiment of the present invention.
2 is a flowchart illustrating a method of generating the same intention search word according to another exemplary embodiment of the present invention.
3 is a diagram illustrating a method of generating the same intention search word according to another exemplary embodiment of the present invention.
4 is a flowchart illustrating a method for generating the same intention search word according to another exemplary embodiment of the present invention.

DETAILED DESCRIPTION Hereinafter, exemplary embodiments of the present invention will be described in detail with reference to the accompanying drawings so that those skilled in the art may easily implement the present invention. The present invention may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein. In the drawings, parts irrelevant to the description are omitted in order to clearly describe the present invention, and like reference numerals designate like parts throughout the specification.

Throughout the specification, when a part is said to "include" a certain component, it means that it can further include other components, without excluding other components unless specifically stated otherwise. Also, the terms " part, "" module," and " module ", etc. in the specification mean a unit for processing at least one function or operation and may be implemented by hardware or software or a combination of hardware and software have.

A search service providing system and a method of generating the same intention search word according to an embodiment of the present invention will be described in detail with reference to the accompanying drawings.

1 is a block diagram of a search service providing system according to an exemplary embodiment of the present invention.

Referring to FIG. 1, the search service providing system 100 is a server that provides a search service to the user terminal 200. The search service providing system 100 may be configured based on a search term of the user terminal 200 and a predetermined search result. Also determine the search word.

In this case, the user terminal 200 is a device in which a user connects to and communicates with the publication service system 100 through a network. For example, the user terminal 200 may be a computer, a personal digital assistant (PDA), a mobile communication terminal, and a television. Various communication devices such as TV, etc. may be used.

The search service providing system 100 includes a search result generator 110, a candidate search term selector 120, a feature extractor 130, an identical intention search term determiner 140, and a provider 150.

The search result generator 110 generates a specific search result for a specific search word. The specific search result includes necessary contents according to the classification of the information provided by the search service providing system 100.

The candidate keyword selecting unit 120 selects a keyword to be determined as the same intention search word as a candidate, determines the validity as a candidate, and includes a continuous keyword selecting unit 121 and a validity determining unit 122.

The continuous search term selector 121 selects a plurality of search term pairs each consisting of two search terms continuously input by one user terminal 200. At this time, the continuous search term selector 121 may limit the difference between the two search terms input time within a predetermined time.

The validity determination unit 122 exposes a specific search result generated by the search result generator 110 to one search term from a plurality of search term pairs selected by the continuous search term selector 121, and the search result to another search term. The searcher pairs for which the specific search result generated by the generator 110 is not exposed are selected. In this case, a search term for which a specific search result is not exposed is called a candidate search term, and a search term to which a specific search result is exposed is called an original search term. As described above, the validity determination unit 122 determines whether the pair of search terms selected by the continuous search term selecting unit 121 is valid as a candidate for determining the same intention search term based on whether the specific search result is exposed or not. That is, the validity determination unit 122 is a candidate included in the search term pair when one search term among the search term pairs selected by the continuous search term selector 121 is exposed and the other search term is not exposed. It is determined that the search term is valid.

The feature extractor 130 extracts feature values of the original search word and the candidate search word.

The feature value extracted by the feature extractor 130 may be, for example, an edit distance cpl of the candidate search word and the original search word. The edit distance is the number of phonemes different from the candidate search word and the original search word. If the edit distance is less than 4, for example, it can be determined that there are many typos. If the edit distance is large, the candidate search word and the original search word may be irrelevant. high.

The feature value extracted by the feature extractor 130 may be a number cnt of pairing pairs of the candidate search word and the original search word.

The feature value extracted by the feature extractor 130 may be a pairing number (pair_cnt) of a candidate search term and another search term. That is, the number of pairing of the candidate search word and the search word other than the original search word as well as the pairing of the candidate search word and the original search word among pairings in which the candidate search word is searched first.

The feature value extracted by the feature extractor 130 is a value obtained by dividing the number of pairs of the candidate query and the original search query (cnt) by the number of pairs of the candidate search query and the other search query pair (cnt / pair_cnt). Can be.

The feature value extracted by the feature extractor 130 may be the number of searches for the candidate search word A_cnt.

The feature value extracted by the feature extractor 130 may be a value (cnt / A_cnt) obtained by dividing the number of pairs of the candidate search word and the original search word cnt by the number of searches of the candidate search word A_cnt.

The feature value extracted by the feature extractor 130 may be a search number B_cnt of the original search word.

The feature value extracted by the feature extractor 130 may be a value (cnt / B_cnt) obtained by dividing the number of pairs of the candidate search word and the original search word cnt by the number of search of the original search word B_cnt.

The feature value extracted by the feature extractor 130 may be a ratio (query_ratio) of the number of searches A_cnt of the candidate search word and the number of searches B_cnt of the original search word.

The feature value extracted by the feature extractor 130 may be a value (first_c) for whether the first search word of the candidate search word and the original search word are the same, and the value may be expressed as 0 if not equal.

The feature value extracted by the feature extractor 130 may be a ratio (inter_A_query) in which the common letters of the candidate search word and the original search word are included in the candidate search word, and the ratio in which the common letters of the candidate search word and the original search word are included in the original search word ( inter_B_query).

The same intention search word determiner 140 determines the candidate search word as the same intention search word based on at least one of the feature values extracted by the feature extractor 130. That is, the same intention search word determiner 140 combines the feature values determined by the feature extractor 130 to determine a candidate search word having a reference value or more as the same intention search word. The same intention search term determining unit 140 may determine the candidate search term as one of a short term search term, a similar search term, and an extended search term.

The provider 150 may provide the determined intention search word as additional information to the user terminal 200 or provide a search result for the original search word as a search result for the same intention search word. Meanwhile, the provider 150 may classify the same intention search word into categories such as a place and a group, store the same, and provide the same to the user terminal 200.

Now, a method of generating the same intention search word according to another embodiment of the present invention will be described in detail with reference to FIG. 2.

2 is a flowchart illustrating a method for generating the same intention search word according to another embodiment of the present invention, and FIG. 3 is a view for explaining a method for generating the same intention search word according to another embodiment of the present invention.

Referring to FIG. 2, first, the search service providing system 100 generates and stores a specific search result for a specific search word (S210).

Thereafter, the user terminal 200 inputs a search word to the search service providing system 100 (S220). Then, the search service providing system 100 provides the search result to the user terminal 200 according to the search word (S230). In this case, the search service providing system 100 provides the user terminal 200 with a specific search hook when the search word input by the user terminal 200 corresponds to a specific search word for a specific search result previously generated and stored. If the search word does not correspond to the search word, the user terminal 200 provides the search result according to the search word input to the user terminal 200.

Subsequently, the search service providing system 100 selects the original search word and the candidate search word as continuous search words according to a predetermined condition (S250). In this case, the predetermined conditions are two search words that are continuously input within a predetermined time from the same user terminal 200.

The search service providing system 100 displays a search result generated by the search result generator 110 in one of two selected search terms, and searches generated by the search result generator 110 in another search term. By selecting and filtering only consecutive search terms when no results are exposed, the validity of the search term pairs as candidates for determining the same intention search terms is determined (S260).

Subsequently, the search service providing system 100 extracts feature values of the continuous search word (S270). Thereafter, the search service providing system 100 determines the same intention search word based on the feature value (S280). An example of calculating the score based on the feature value is shown in FIG. 3.

Referring to FIG. 3, examples of search term pairs consisting of various candidate search terms and original search terms are shown. The feature value of each pair of search terms may be extracted as shown in FIG. 3, and a score may be calculated accordingly to determine a candidate search term as the same intention search term according to the score.

Referring back to FIG. 2, the search service providing system 100 stores a specific search result for the original search word as a search result for the same intention search word (S290).

As such, if the user automatically generates the same intention query according to the exposure history of the specific search result for the user's consecutive search terms, the same intention search query database can be quickly and accurately. Because it is extracted, it can generate the same intention search word with high actual use and can quickly respond to user's request.

Now, a method of generating the same intention search word according to another exemplary embodiment of the present invention will be described in detail with reference to FIG. 4.

4 is a flowchart illustrating a method for generating the same intention search word according to another exemplary embodiment of the present invention.

Referring to FIG. 4, the user terminal 200 inputs a search word to the search service providing system 100 in operation S410. Then, the search service providing system 100 provides a search result to the user terminal 200 (S420). In this case, unlike the embodiment of FIG. 2, the search service providing system 100 provides a general search result without generating a specific search result for a specific search word in advance.

Thereafter, the search service providing system 100 selects the original search word and the candidate search word as consecutive search terms among the search terms input by the user terminal 200 (S430). In this case, the predetermined conditions are two search words that are continuously input within a predetermined time from the same user terminal 200.

Subsequently, the search service providing system 100 extracts feature values of the continuous search word (S450). Thereafter, the search service providing system 100 determines the same intention search word based on the feature value (S460).

Thereafter, the search service providing system 100 classifies and stores the same intention search word for each category (S470). For example, it may be a category such as a school, a hospital, a terminal, an association, a research society, and a fellowship.

The embodiments of the present invention described above are not implemented only by the apparatus and method, but may be implemented through a program for realizing the function corresponding to the configuration of the embodiment of the present invention or a recording medium on which the program is recorded.

While the present invention has been particularly shown and described with reference to exemplary embodiments thereof, it is to be understood that the invention is not limited to the disclosed exemplary embodiments, It belongs to the scope of right.

Claims (17)

A candidate search term selector which selects two search terms input by a user terminal as a search term pair consisting of original search terms and candidate search terms, and determines the validity of the search term pairs;
A feature extractor which extracts feature values of the original search word and the candidate search word,
An identical intention search word determining unit configured to determine the candidate search word as the same intention search word of the original search word based on the feature value; and
A search result generator that generates and stores a specific search result corresponding to the original search term in advance
Lt; / RTI >
The validity is determined based on the exposure of the specific search result,
The candidate search term selector recognizes the validity when the specific search result is exposed to the original search word and the specific search result is not exposed to the candidate search word.
The same intention search word determining unit determines the same intention search word according to a score by calculating a score by combining the feature values.
The same intention search term determining unit determines the candidate search term as one of a short term search term, a similar search term, and an extended search term according to the score.
Search service provision system.
delete delete delete In claim 1,
And the original search word and the candidate search word are continuously input by the same user terminal within a predetermined time.
In claim 1,
The feature value is,
An edit distance of the candidate search word and the original search word, the number of pairing of the candidate search word and the original search word, the number of pairing of the candidate search word and the other search word where the candidate search word was first searched, The number of searches, the number of searches for the original search term, the ratio of the number of searches for the candidate search term and the number of searches for the original search term, a value for whether the first search word of the candidate search word and the original search word is the same, the candidate search word and the circle A search service providing system comprising at least one of a ratio in which common letters of a search word are included in the candidate search word, and a ratio in which common letters of the candidate search word and the original search word are included in the original search word.
The method of claim 6,
The feature value is,
A value obtained by dividing the number of pairings of the candidate search term and the original search term by the number of pairings of the candidate search term and the other search term in which the candidate search term was first searched,
A value obtained by dividing the number of pairings of the candidate search term and the original search term by the number of searches of the candidate search term, and
A value obtained by dividing the number of pairings of the candidate search term and the original search term by the number of searches of the original search term
At least one of the search service providing system.
delete delete The search service providing system generates the same intention query for the original query,
Selecting a search word pair consisting of the original search word and the candidate search word,
Determining the validity of the pair of search terms,
Extracting feature values of the original search word and the candidate search word, and
Determining the candidate search word as the same intention search word of the original search word based on the feature value;
Lt; / RTI >
The step of determining the validity may comprise:
Determining whether a predetermined specific search result is exposed to the original search word,
Determining whether the specific search result is exposed to the candidate search word, and
Acknowledging the validity when the specific search result is exposed to the original search term and the specific search result is not exposed to the candidate search term.
Lt; / RTI >
Determining the same intention search word,
Calculating a score by combining the feature values, and determining the same intention search word according to the score
Lt; / RTI >
The same intention search word includes a short term search query, a similar search word, and an extended search word.
delete 11. The method of claim 10,
And the original search term and the candidate search term are received from the same user terminal.
11. The method of claim 10,
And the original search word and the candidate search word are continuously input within a predetermined time.
11. The method of claim 10,
An edit distance of the candidate search word and the original search word, the number of pairing of the candidate search word and the original search word, the number of pairing of the candidate search word and the other search word where the candidate search word was first searched, The number of searches, the number of searches for the original search term, the ratio of the number of searches for the candidate search term and the number of searches for the original search term, a value of whether the first search word of the candidate search word and the original search word is the same, the candidate search word and the circle And generating at least one of a ratio in which common letters of a search word are included in the candidate search word and a ratio in which common letters of the candidate search word and the original search word are included in the original search word.
The method of claim 14,
The feature value is,
A value obtained by dividing the number of pairings of the candidate search term and the original search term by the number of pairings of the candidate search term and the other search term in which the candidate search term was first searched,
A value obtained by dividing the number of pairings of the candidate search term and the original search term by the number of searches of the candidate search term, and
A value obtained by dividing the number of pairings of the candidate search term and the original search term by the number of searches of the original search term
At least one of the same intention query generating method.
delete delete
KR1020110088071A 2011-08-31 2011-08-31 System for providing searching service and method for generating related keyword KR101272254B1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
KR1020110088071A KR101272254B1 (en) 2011-08-31 2011-08-31 System for providing searching service and method for generating related keyword

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
KR1020110088071A KR101272254B1 (en) 2011-08-31 2011-08-31 System for providing searching service and method for generating related keyword

Publications (2)

Publication Number Publication Date
KR20130024554A KR20130024554A (en) 2013-03-08
KR101272254B1 true KR101272254B1 (en) 2013-06-13

Family

ID=48176603

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020110088071A KR101272254B1 (en) 2011-08-31 2011-08-31 System for providing searching service and method for generating related keyword

Country Status (1)

Country Link
KR (1) KR101272254B1 (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001043236A (en) * 1999-07-30 2001-02-16 Matsushita Electric Ind Co Ltd Synonym extracting method, document retrieving method and device to be used for the same
KR20100083614A (en) * 2009-01-14 2010-07-22 오의진 Intension search method based on search intension of user

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001043236A (en) * 1999-07-30 2001-02-16 Matsushita Electric Ind Co Ltd Synonym extracting method, document retrieving method and device to be used for the same
KR20100083614A (en) * 2009-01-14 2010-07-22 오의진 Intension search method based on search intension of user

Also Published As

Publication number Publication date
KR20130024554A (en) 2013-03-08

Similar Documents

Publication Publication Date Title
US11868389B2 (en) Search method and apparatus, and electronic device and storage medium
CN109033229B (en) Question and answer processing method and device
JP5575902B2 (en) Information retrieval based on query semantic patterns
CN102043833B (en) Search method and device based on query word
US10783885B2 (en) Image display device, method for driving the same, and computer readable recording medium
US9767409B1 (en) Latent feature based tag routing
US10885107B2 (en) Music recommendation method and apparatus
CN108446316B (en) association word recommendation method and device, electronic equipment and storage medium
CN110232137A (en) A kind of data processing method, device and electronic equipment
KR102601545B1 (en) Geographic position point ranking method, ranking model training method and corresponding device
KR20120037841A (en) Method for personalized searching of mobile terminal and mobile terminal performing the same
CN110968801A (en) Real estate product searching method, storage medium and electronic device
CN103136213A (en) Method and device for providing related words
JP2018518764A (en) Object search method, apparatus and server
CN111159334A (en) Method and system for house source follow-up information processing
CN112015918A (en) Data processing method and device
CN110543484A (en) prompt word recommendation method and device, storage medium and processor
JP2014085862A (en) Prediction server, program, and method for predicting number of future comments on prediction target content
CN115309954A (en) Data retrieval method, device, equipment and storage medium
US20130304370A1 (en) Method and apparatus to provide location information
CN106407332B (en) Search method and device based on artificial intelligence
CN111666417B (en) Method, device, electronic equipment and readable storage medium for generating synonyms
CN110351183B (en) Resource collection method and device in instant messaging
KR20090010752A (en) System and method for generating relating data class
KR101272254B1 (en) System for providing searching service and method for generating related keyword

Legal Events

Date Code Title Description
A201 Request for examination
E902 Notification of reason for refusal
AMND Amendment
E601 Decision to refuse application
AMND Amendment
X701 Decision to grant (after re-examination)
GRNT Written decision to grant
FPAY Annual fee payment

Payment date: 20160412

Year of fee payment: 4

FPAY Annual fee payment

Payment date: 20170322

Year of fee payment: 7

FPAY Annual fee payment

Payment date: 20190329

Year of fee payment: 9