CN105589863B - Searching method, data processing method, device and system - Google Patents

Searching method, data processing method, device and system Download PDF

Info

Publication number
CN105589863B
CN105589863B CN201410566247.XA CN201410566247A CN105589863B CN 105589863 B CN105589863 B CN 105589863B CN 201410566247 A CN201410566247 A CN 201410566247A CN 105589863 B CN105589863 B CN 105589863B
Authority
CN
China
Prior art keywords
name
same
unique identifier
search
searching
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201410566247.XA
Other languages
Chinese (zh)
Other versions
CN105589863A (en
Inventor
陈祺
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd filed Critical Tencent Technology Shenzhen Co Ltd
Priority to CN201410566247.XA priority Critical patent/CN105589863B/en
Publication of CN105589863A publication Critical patent/CN105589863A/en
Application granted granted Critical
Publication of CN105589863B publication Critical patent/CN105589863B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

A searching method and a data processing method, device and system are provided, wherein the searching method comprises the following steps: receiving a search request of a user, wherein the search request comprises a target name; searching a preset homonymy corresponding relation, and determining a unique identifier corresponding to the target name, wherein the homonymy corresponding relation comprises the corresponding relation between the name and the unique identifier; and searching according to the unique identifier, and outputting a search result, wherein the identifier of the item in the search result comprises the unique identifier. By setting the corresponding relation between the names and the unique identifiers, the same two or more names can be effectively distinguished by using the unique identifiers, and by setting the unique identifiers on the items, the same names can be searched by using the unique identifiers, so that the searching accuracy is effectively improved, the item confusion of the same names is avoided, and the searching experience of a user is improved.

Description

Searching method, data processing method, device and system
Technical Field
The present invention relates to the field of search technologies, and in particular, to a search method, a data processing method, an apparatus, and a system.
Background
An important search content in video search is name search, and the name search effect is good or bad, which has great influence on user experience. In video data, the names of people and movie works are the same, and if the names of people and movie works are not distinguished, confusion occurs, and users cannot search for expected results.
At present, video name searching is supported by famous food websites such as hundredths, search libraries and the like. Most websites of the same-name stars and same-name film and television works are not distinguished, and all works and related information which have the same name but are not actually the same star are put together for display only according to the relevance of the text. The Baidu and the database searching are better in this respect, when a user searches for a name, the name can be distinguished by clustering related documents obtained by searching and mining the relationship between people, however, in actual operation, this method still cannot completely distinguish the data of two stars with the same name, for example: when searching for "big Songjia" in a hundred-degree video, the results displayed in the popular information area actually include the information of "big Songjia" and "small Songjia", and the information of the two persons cannot be really and completely distinguished. Therefore, in the prior art, the name searching mode is difficult to effectively distinguish the same name stars, and the searching experience of the user is influenced.
Disclosure of Invention
In view of this, the present invention provides a searching method, a data processing method, an apparatus and a system, which are used for searching for data with the same name to effectively distinguish the data with the same name, so that the searched data is more accurate, and the searching experience of the user is improved.
The searching method provided by the embodiment of the invention comprises the following steps:
receiving a search request of a user, wherein the search request comprises a target name;
searching a preset homonymy corresponding relation, and determining a unique identifier corresponding to the target name, wherein the homonymy corresponding relation comprises the corresponding relation between the name and the unique identifier;
and searching according to the unique identifier, and outputting a search result, wherein the identifier of the item in the search result comprises the unique identifier.
The data processing method provided by the embodiment of the invention comprises the following steps:
carrying out name data mining to obtain all the same names;
determining an item associated with each of said all of the same names;
and setting a unique identifier for each name in all the same names to obtain a corresponding relation of the same name, and setting the unique identifier which is the same as the name for the item associated with the name, wherein the corresponding relation of the same name comprises the corresponding relation between the name and the unique identifier.
The online search system provided by the embodiment of the invention comprises:
the receiving module is used for receiving a search request of a user, wherein the search request comprises a target name;
the searching module is used for searching a preset homonymy corresponding relation after the receiving module receives the search request, and determining a unique identifier corresponding to the target name, wherein the homonymy corresponding relation comprises the corresponding relation between the name and the unique identifier;
and the output module is used for searching according to the unique identifier after the search module determines the unique identifier, and outputting a search result, wherein the identifier of the item in the search result comprises the unique identifier.
The data processing device provided in the embodiment of the invention comprises:
the mining module is used for mining the data of the names to obtain all names with the same name;
the association module is used for determining an item associated with each of all the same names after the mining module obtains all the same names;
the setting module is used for setting a unique identifier for each name in all the same names after the association module determines the items associated with each name in all the same names to obtain a corresponding relationship with the same name, and setting the unique identifier which is the same as the name for the items associated with the name, wherein the corresponding relationship with the same name comprises the corresponding relationship between the name and the unique identifier.
According to the technical scheme, the embodiment of the invention has the following advantages:
the online search system receives a search request of a user, the search request comprises a target name, a preset corresponding relation with the same name is searched, the corresponding relation with the same name comprises a corresponding relation between the name and a unique identifier, the unique identifier corresponding to the target name is determined, searching is carried out according to the unique identifier, a search result is output, and the identifier of an item in the search result comprises the unique identifier. By presetting the homonymy corresponding relation containing the corresponding relation between the name and the unique mark, the same name can be effectively distinguished through the unique mark, the unique mark of the target name is used for searching, all items containing the unique mark can be obtained to serve as items related to the target name, the conditions of confusion and inaccuracy of the searching result are avoided, the searching result is more accurate, and the searching experience of a user is improved.
In order to make the aforementioned and other objects, features and advantages of the invention comprehensible, preferred embodiments accompanied with figures are described in detail below.
Drawings
Fig. 1 is a schematic diagram of a structure of a server in an embodiment of the present invention;
FIG. 2 is a diagram of an embodiment of a data processing method according to the present invention;
FIG. 3 is a diagram illustrating an embodiment of a search method according to an embodiment of the present invention;
FIG. 4 is another diagram illustrating an embodiment of a search method according to an embodiment of the present invention;
FIG. 5 is a diagram illustrating the structure of a data processing apparatus according to an embodiment of the present invention;
FIG. 6 is a diagram illustrating the structure of an online search system according to an embodiment of the present invention;
FIG. 7 is another diagram illustrating the structure of the online search system according to an embodiment of the present invention.
Detailed Description
To further illustrate the technical means and effects of the present invention adopted to achieve the predetermined objects, the following detailed description of the embodiments, structures, features and effects according to the present invention will be made with reference to the accompanying drawings and preferred embodiments.
In an embodiment of the present invention, the search method and the data processing method are executed by a server, and for better understanding of the technical solution in the embodiment of the present invention, please refer to fig. 1, which is a schematic diagram of the structure of the server in the embodiment of the present invention, the server 100 may have a relatively large difference due to different configurations or performances, and may include one or more Central Processing Units (CPUs) 122 (e.g., one or more processors) and a memory 132, one or more storage media 130 (e.g., one or more mass storage devices) storing an application program 142 or data 144, wherein the memory 132 and the storage media 130 may be temporarily stored or permanently stored, the program stored in the storage media 130 may include one or more modules (not shown), each of which may include a series of instruction operations in the server, further, the central processing unit 122 may be configured to communicate with the storage media 130, execute a series of instruction operations in the storage media 130 on the server 100, the server 100 may further include one or more power supplies 126, one or more wireless network interfaces 150, one or more wireless network input/output ports 141, such as a wireless network interface 141, a wireless network interface 158, a wireless network interface, and/or the like.
In the embodiment of the invention, a corresponding relation of the same name is preset in the online searching system, the corresponding relation of the same name comprises a corresponding relation between the name and the unique identification, wherein the name in the corresponding relation of the same name has at least one same or similar name, and in addition, the unique identification of the item in the database is also preset, so that the same name can be searched by using the unique identification.
It should be noted that, in the embodiment of the present invention, the method for searching for the target name by setting the corresponding relationship of the same name in advance is suitable for searching for the same-name stars, the same-name dramas, and the like.
In order to better understand the technical solution in the embodiment of the present invention, a data processing method before performing the same name search is described below, with reference to fig. 2, including:
201. carrying out name data mining to obtain all the same names;
in the embodiment of the invention, the data processing device carries out off-line data processing on the data, firstly carries out name data mining to obtain all the same names.
The data processing device can capture data such as a database, news data, video data and the like through a webpage crawler, perform data mining on names to obtain a name table containing all names, and perform data mining on the same names except a conventional name processing flow to obtain all the same names in the name table, wherein the same names refer to that each name in the same name table contains at least one name same as the name.
In an embodiment of the present invention, the data processing apparatus may obtain a plurality of different types of the same name, for example: the same name of star, the same name of movie and television, the same name of plants, the same name of animals and the like. In practical application, data mining can be performed on the same name of a certain type according to specific needs to obtain the same name of all types.
It should be noted that, in order to better distinguish the same names, the same names of all names can be determined by manual editing.
202. Determining an item associated with each of all of the same names;
in an embodiment of the invention, the data processing apparatus will determine the items associated with each of all of the same names. Wherein the items may be news, pictures, videos, libraries, and the like.
Taking the example that the same name includes stars with the same name, the following concrete steps are as follows: for each name of the same-name star and the corresponding alias, the related works, related information and the like can be associated with the name to distinguish different data of the same-name star. For example: the character names of the first star A and the second star B are both C, the first star A participates in a hot new work in the near future, and simultaneously, a large amount of related information, news and videos containing the new work and the character name C appear.
It should be noted that, in order to improve the accuracy of the association, the association may also be performed in a manual association manner.
203. And setting a unique identifier for each name in all the same names to obtain the corresponding relation of the same names, and setting the unique identifier which is the same as the name for the item associated with the name.
In the embodiment of the present invention, after determining the item associated with each name in all the same names, the data processing apparatus sets a unique identifier for each name in all the same names to obtain a corresponding relationship with the same name, where the corresponding relationship with the same name includes the corresponding relationship between the name and the unique identifier, and sets a unique identifier that is the same as the name for the item associated with the name, so that the name and the unique identifier have a corresponding relationship, and the unique identifier and the item have a corresponding relationship.
It should be noted that, in the embodiment of the present invention, one item may have a plurality of unique identifiers, for example: the lead actor for a television show contains actors A, B and C, and the lead actor for that television show contains a unique identification of actors A, B and C.
In the embodiment of the present invention, the data processing method and the search method may be executed by the same server, or may be executed by different servers, and if the data processing method and the search method are executed by different servers, the data processing apparatus sets unique identifiers for all names in the same name and items associated with the names, and then introduces the obtained correspondence between the name and the unique identifier in the correspondence with the same name and the items identified with the unique identifier into another server, where the server includes an online search system.
The data processing apparatus further updates the corresponding relationship of the same name in real time or at regular time, taking the corresponding relationship of the same name as the corresponding relationship of the same name star as an example: the data processing device also carries out data mining on items such as news, videos and pictures of newly generated stars in real time or at regular time, adds the name of the new same-name star to the same-name corresponding relation of the star if the new same-name star exists, sets a unique identifier of the name of the new same-name star, determines the items related to the new same-name star and sets the unique identifier with the same name. If some stars in the same-name corresponding relationship have new news, videos, pictures and other items, the newly added items are identified according to the unique identification of the name of the star in the same-name corresponding relationship.
In the embodiment of the invention, the data processing device performs data mining on the names to obtain all the same names, associates each name in all the same names with the related item, sets the unique identifier of the name to obtain the corresponding relation of the same name, and sets the unique identifier which is the same as the item associated with the name, so that the corresponding relation between the name and the unique identifier in the corresponding relation of the same name can be effectively used for distinguishing the same name, for example, two stars with the same name, and the unique identifier is set on the item to search the name by using the unique identifier, thereby effectively improving the accuracy of searching the same name, avoiding the confusion of the items with the same name and improving the search experience of a user.
Referring to fig. 3, an embodiment of a search method according to an embodiment of the present invention includes:
301. receiving a search request of a user, wherein the search request comprises a target name;
in the embodiment of the invention, a user can use an online search function on a display interface of a client to input a target name on an online search page and determine a search, the client generates a search request and sends the search request to an online search system, and the online search system is borne on a server.
The online search system receives a search request of a user, wherein the search request comprises a target name.
It should be noted that, in the embodiment of the present invention, after receiving a search request from a user, an online search system parses the search request, determines a keyword in the search request, and determines a type of the keyword, where the type may be names of plants, animals, people, movie and television shows, and if the keyword is determined to be a name of a person, it is further determined whether the name of the person is a name of a star, the online search system performs a search in a name table of the star, if the name of the person is determined to be a name of a star, the online search system further searches for a corresponding relationship of the star with the same name, determines whether the name of the person is a name of a same name, and if the name of the person is a name of a same name, the online search is performed according to the technical solution in the embodiment of the present invention, and a search result of the name of the person is obtained. Or, if it is determined that the keyword is a movie title, further determining whether the movie title is a movie with the same or similar title, if so, searching for the corresponding relationship of the same title of the movie title, determining the unique identifier corresponding to the movie title, and executing according to the technical solution of the embodiment of the present invention to obtain the search result of the movie title.
302. Searching a preset homonymy corresponding relation, and determining a unique identifier corresponding to the target name, wherein the homonymy corresponding relation comprises the corresponding relation between the name and the unique identifier;
in the embodiment of the invention, the online searching system searches the preset corresponding relation of the same name, the corresponding relation of the same name comprises the corresponding relation between the name and the unique identifier, and the unique identifier corresponding to the target name is determined. It should be noted that the target name may be a star name, an animal name, a plant name, a movie name, or the like. If the target name is the name of a star, the target name may be the original name of the star, or may be an alias or a past name of the star.
In the embodiment of the invention, if only one name which is the same as the target name exists in the preset homonymy corresponding relation, the preset homonymy corresponding relation is searched, the homonymy corresponding relation comprises the corresponding relation between the name and the unique identifier, and the unique identifier corresponding to the target name is determined; for example: the target name is the name of the star, and is "small songjia", the same name corresponding relationship of the star contains the same names of two stars, which are both "songjia", one of the "songjia" is named "small songjia", the other "songjia" is named "large songjia", therefore, if the target name is "small songjia", a unique identifier corresponding to the "small songjia" is searched in the same name corresponding relationship.
If the preset homonymy corresponding relation comprises at least two names which are the same as the target name, searching the preset homonymy corresponding relation comprising the corresponding relation between the names and the unique identification, and determining one unique identification which is corresponding to the name meeting the condition in the at least two names which are the same as the target name. The condition may be that the search volume is the largest, or the click rate of the user is the largest. Taking the search of the same-name stars as an example, the same-name correspondence relationship of the same-name stars includes that the names of two same stars are both "songjia", one of the "songjias" is named "small songjia", the other "songjia" is named "large songjia", if the target name is "songjia", the on-line search system can determine that two names same as the target name exist, look up the correspondence relationship between the preset name and the unique identifier, determine that the unique identifier of the "songjia" with the name of "small songjia" is a, the unique identifier of the "songjia" with the name of "large songjia" is B, and select the unique identifier B of the "songjia" with the name of "large songjia" satisfying the search quantity from the unique identifiers a and B as the unique identifier of the target name of "songjia".
It should be noted that, in the embodiment of the present invention, the data processing apparatus may obtain a plurality of homonymous correspondences of different types, for example: the corresponding relation of the same name of the star, the corresponding relation of the same name of the movie and television play, the corresponding relation of the same name of the plant, the corresponding relation of the same name of the animal, etc. When the online search system searches, the type of the keyword can be determined, wherein the type of the keyword can be animal, human, movie and television drama, plant name and the like. And according to the type of the keyword, using the same-name corresponding relation of the same type to determine the corresponding unique identifier.
303. And searching according to the unique identifier, and outputting a search result, wherein the identifier of the item in the search result comprises the unique identifier.
In the embodiment of the invention, the online search system searches according to the determined unique identifier and outputs the search result, wherein the identifier of the item in the search result comprises the unique identifier.
In the embodiment of the invention, an online search system receives a search request of a user sent by a client, the search request comprises a target name, a preset homonymy corresponding relation comprising a corresponding relation between the name and a unique identifier is searched, a unique identifier corresponding to the target name is determined, searching is carried out according to the unique identifier, a search result is output, and the identifier of an item in the search result comprises the unique identifier, wherein the corresponding relation between the preset name and the unique identifier is searched, so that the unique identifier can be used for distinguishing two stars with the same name, such as the same name, and the unique identifier is used for searching to obtain the item comprising the unique identifier, so that the search result is more accurate, and the search experience of the user can be effectively improved.
For better understanding of the technical solution in the embodiment of the present invention, please refer to fig. 4, which is an embodiment of a search method in the embodiment of the present invention, including:
401. receiving a search request of a user, wherein the search request comprises a target name;
in the embodiment of the invention, after the user inputs the keywords into the online search page of the client, a search request is generated and sent to the online search system of the server.
After the online search system receives the search request, the keyword contained in the search request is analyzed, if the keyword is determined to be the name, a preset name table is searched, the type of the keyword is determined, the type can be the name of a person, the name of an object, the name of a movie and television, the name of a plant, the name of an animal and the like, if the keyword is determined to be the name of a star, whether the keyword is the name of the star or not is determined, if the keyword is determined to be the name of the star, a preset corresponding relation of the same name of the star is further searched, if the corresponding relation of the same name contains the name of the star, the search is determined to be the search of the same name, and the keyword contained in the search request is the name of the target person.
402. Searching a preset homonymy corresponding relation, and determining a unique identifier corresponding to the target name, wherein the homonymy corresponding relation comprises the corresponding relation between the name and the unique identifier;
in the embodiment of the present invention, the online search system searches for a correspondence between a name and a unique identifier in a preset homonymy correspondence, and determines a unique identifier corresponding to a target name, specifically: if only one name which is the same as the target name exists in the preset homonymy corresponding relation, searching the corresponding relation between the name and the unique identifier contained in the homonymy corresponding relation, and determining the unique identifier corresponding to the target name; if at least two names which are the same as the target name are contained in the preset homonym corresponding relation, searching the corresponding relation between the name contained in the homonym corresponding relation and the unique identifier, and determining the unique identifier corresponding to the name which meets the condition in the at least two names which are the same as the target name, wherein the meeting condition can be that the search amount is the maximum or the click rate of the user is the maximum.
403. Searching according to the search type and the unique identifier, and outputting a search result, wherein the type of the item in the search result is the same as the search type, the identifier of the item comprises the unique identifier, and the search type is contained in the search request;
in the embodiment of the present invention, when a user inputs a search keyword on a search page, the search type may be selected, for example, the search type may be: news, web pages, pictures, music, microblogs, maps, questions and answers, shopping, and the like. And the search type is included in a search request sent to the online search system so that data of the same type as the search type can be obtained.
In the embodiment of the invention, the online search system searches according to the search type and the unique identifier and outputs the search result, wherein the type of the item in the search result is the same as the search type, and the identifier of the item comprises the unique identifier.
The searching by the online searching system according to the searching type and the unique identifier may specifically be: the online searching system searches according to the unique identifier to obtain the item containing the unique identifier in the identifier; and acquiring items with the same type as the search type from the items containing the unique identification and outputting the items as search results. It should be noted that, the step of outputting the search result by the online search system may be: and the online search system sends the search result to the client of the user, and the client loads the search result and displays the search result on a display interface of the client according to a preset mode. For example: taking the same-name person search as an example, if the target name is 'small songjia', the determined unique identifier is a, and the search type is a picture, the online search system searches all items, acquires the items including the unique identifier a in the identifier, acquires the items of which the type is the picture from all the items including the unique identifier a, outputs the searched picture to the client in a preset mode, and displays the picture on the client.
404. And outputting first prompt information, wherein the first prompt information comprises other names which are similar or identical to the target name in the same-name corresponding relation.
In the embodiment of the present invention, after obtaining a search result searched according to a search type and a unique identifier, the online search system further outputs first prompt information, where the first prompt information includes other names similar to or the same as a target name in a same-name correspondence relationship, and a same-name star search is taken as an example, for example: if the target task name is 'small Songjia' and the search type is a picture, the online search system determines a name 'large Songjia' similar to the 'small Songjia' after obtaining the picture of the 'small Songjia', the name 'large Songjia' is contained in the first prompt message and is output to the client of the user, and a search link of the 'large Songjia' is displayed on the client to prompt the user that other stars which are the same as or similar to the target name of the target searched by the user exist. If the star clicks the name of the prompt, namely 'big Songjia', the online search system searches according to the name of the prompt, so that the user can conveniently search the people with the same name.
In the embodiment of the present invention, if the online search system searches for a different type of name that is exactly the same as the target name, the second prompt information is output, and the second prompt information includes information of the different type of name. Specifically, the method comprises the following steps: after receiving a search request, the online search system determines the type of a keyword in the search request, and respectively searches under a classification table such as a plant, an animal, a person name, a movie and television series name, and if the keyword is determined to be the name of a same-name star and also to be the name of a medicinal material, the online search system searches for the same-name character according to the technical scheme in the embodiment of the invention to obtain a search result, outputs the search result to a client and displays the search result, and also outputs second prompt information to the client, wherein the second prompt information comprises the names of the medicinal materials.
It should be noted that, in the embodiment of the present invention, the data processing apparatus may set the homonymy correspondence relationship for different types.
In the embodiment of the invention, after an online search system receives a search request containing a target name from a user, a same-name corresponding relation which is preset and contains a corresponding relation between the name and a unique identifier is searched, a unique identifier corresponding to the target name is determined, searching is carried out according to a search type and the unique identifier, a search result is output, the type of an item in the search result is the same as the search type, the identifier of the item contains the unique identifier, and the search type is contained in the search request. The unique identifier is used for searching for the corresponding relation between the preset name and the unique identifier, so that the same name, such as two stars with the same name, can be distinguished by the unique identifier, and the item containing the unique identifier is obtained by searching through the unique identifier, so that the searching result is more accurate, and the searching experience of a user can be effectively improved. In addition, the online search system also outputs first prompt information, the first prompt information comprises other names similar to or identical to the target name in the same name corresponding relationship, and if the names of different types identical to the target name are searched, second prompt information is output, the second prompt information comprises information of the names of different types, so that the search experience of a user can be effectively improved, the same names under the same type can be effectively distinguished, for example, the same-name stars can be distinguished, or the same names of different types can be effectively distinguished, for example, the stars and the non-person names with the same name can be distinguished, and the search result is more accurate.
Referring to fig. 5, a schematic structural diagram of a data processing apparatus according to an embodiment of the present invention includes:
the mining module 501 is configured to mine data of names to obtain all the same names;
an association module 502, configured to determine, after the mining module 501 obtains all the same names, an item associated with each of the all the same names;
a setting module 503, configured to set a unique identifier for each name in all the same names after the association module 502 determines the item associated with each name in all the same names, so as to obtain a corresponding relationship with the same name, and set a unique identifier that is the same as the name for the item associated with the name, where the corresponding relationship with the same name includes a corresponding relationship between the name and the unique identifier.
The mining module 501 includes:
the first mining module 504 is configured to capture data by using a web crawler, and perform data mining to obtain a name table;
the second mining module 505 is configured to mine the names in the name table with the same name, so as to obtain all the same names.
In an embodiment of the present invention, the data processing apparatus further includes:
an importing module 506, configured to import, after the setting module 503 sets the unique identifier, an item that includes a same-name correspondence relationship between a name and the unique identifier and the identified unique identifier into the online search system.
In order to better distinguish the same name, the same name in the name table can be determined in a manual editing mode.
It should be noted that, in the embodiment of the present invention, one item may have a plurality of unique identifiers, for example: the lead actor for a television show contains actors A, B and C, and the lead actor for that television show contains a unique identification of actors A, B and C.
It should be noted that, in the embodiment of the present invention, the data processing apparatus may be a server.
In the embodiment of the present invention, the mining module 501 in the data processing apparatus performs name data mining to obtain all the same names, specifically: a first mining module 504 in the mining module 501 captures data by using a web crawler, and performs data mining to obtain a name table; and then the second mining module 505 mines the same name for the names in the name table to get all the same names. Next, the association module 502 determines the items associated with each of the all of the same names; and the setting module 503 sets a unique identifier for each of all the same names to obtain the corresponding relationship of the same name, and sets a unique identifier which is the same as the name for the item associated with the name. Finally, the import module 506 imports the item containing the same-name correspondence of the correspondence between the name and the unique identifier and the identified unique identifier into the online search system.
In the embodiment of the invention, the data processing device performs data mining on the names to obtain all the same names, associates the names in all the same names with the related items, sets the unique identifiers corresponding to the names to obtain the corresponding relation of the same names, and sets the unique identifiers corresponding to the names to have the unique identifiers same as the names, so that the corresponding relation between the names and the unique identifiers in the corresponding relation of the same names can be effectively used for distinguishing the same names, such as distinguishing same-name stars, and the unique identifiers are set on the items to enable the names to be searched by using the unique identifiers, thereby effectively improving the accuracy of searching, avoiding the confusion of the items with the same names, and improving the searching experience of users.
Please refer to fig. 6, which is a schematic diagram illustrating a structure of an online search system according to an embodiment of the present invention, including:
a receiving module 601, configured to receive a search request of a user, where the search request includes a target name;
a searching module 602, configured to search a preset homonymy correspondence after the receiving module 601 receives the search request, and determine a unique identifier corresponding to the target name, where the homonymy correspondence includes a correspondence between the name and the unique identifier;
and the output module 603 is configured to, after the finding module 602 determines the unique identifier, perform a search according to the unique identifier, and output a search result, where the identifier of the item in the search result includes the unique identifier.
In the embodiment of the present invention, a receiving module 601 in an online search system receives a search request of a user, where the search request includes a target name; then, the searching module 602 searches for a preset homonymy correspondence relationship containing a correspondence relationship between the name and the unique identifier, and determines a unique identifier corresponding to the target name; and finally, the output module 603 searches according to the unique identifier and outputs a search result, wherein the identifier of the item in the search result comprises the unique identifier.
In the embodiment of the invention, an online search system receives a search request of a user sent by a client, the search request comprises a target name, a preset homonymy corresponding relation comprising a corresponding relation between the name and a unique identifier is searched, a unique identifier corresponding to the target name is determined, searching is carried out according to the unique identifier, a search result is output, and the identifier of an item in the search result comprises the unique identifier, wherein the unique identifier is searched, so that the unique identifier can be used for distinguishing the same name, such as two stars with the same name, and the unique identifier is used for searching to obtain the item comprising the unique identifier, so that the search result is more accurate, and the search experience of the user can be effectively improved.
For better understanding of the online search system in the embodiment of the present invention, please refer to fig. 7, which is a schematic diagram of the structure of the online search system in the embodiment of the present invention, including: the receiving module 601, the searching module 602, and the outputting module 603 in the embodiment shown in fig. 6 are similar to those in the embodiment shown in fig. 6, and are not repeated here.
In this embodiment of the present invention, the searching module 602 specifically includes:
a first searching module 701, configured to, after the receiving module 601 receives the search request, if only one name that is the same as the target name exists in the preset homonymy correspondence, search for a correspondence between a name included in the homonymy correspondence and the unique identifier, and determine a unique identifier corresponding to the target name; take the same star as an example, for example: the target name is 'small Songjia', the same name corresponding relation contains the same names of two stars which are 'Songjia', one of the 'Songjia' is named 'small Songjia', the other 'Songjia' is named 'big Songjia', therefore, if the target name is 'small Songjia', a unique identifier corresponding to the 'small Songjia' is searched in the same name corresponding relation.
A second searching module 702, configured to, after the receiving module 601 receives the search request, if at least two names that are the same as the target name are included in the preset homonymy correspondence, search for a correspondence between the name and the unique identifier included in the homonymy correspondence, and determine a unique identifier corresponding to a name that meets the condition, among the at least two names that are the same as the target name. The condition may be that the search volume is the largest, or the click rate of the user is the largest, taking the same star as an example, for example: the same name corresponding relation comprises the same names of two stars which are 'songjia', one of the 'songjias' is named as 'small songjia', the other 'songjia' is named as 'large songjia', if the target name is 'songjia', the online search system can determine that two names which are the same as the target name exist, then the corresponding relation between the preset name and the unique identification is searched, the unique identification of the 'songjia' which is named as 'small songjia' is determined to be A, the unique identification of the 'songjia' which is named as 'large songjia' is determined to be B, and the unique identification B of the 'songjia' which is named as 'large songjia' and meets the maximum search quantity is selected from the unique identifications A and B to be used as the unique identification of the target name 'songjia'.
In this embodiment of the present invention, the output module 603 is specifically configured to, after the search module 601 determines the unique identifier, perform a search according to the search type and the unique identifier, and output a search result, where the type of an item in the search result is the same as the search type, and the identifier of the item includes the unique identifier, and the search type is included in the search request.
In the embodiment of the present invention, the output module 603 includes:
the obtaining module 703 is configured to, after the finding module 602 determines the unique identifier, perform a search according to the unique identifier, and obtain an item that includes the unique identifier in the identifier;
an obtaining and outputting module 704, configured to, after the obtaining module 703 obtains the items, obtain the items of the same type as the search type from the items and output the items as the search result.
In an embodiment of the present invention, the online search system further includes:
the first output module 705 is configured to output first prompt information after the output module 603 outputs the search result, where the first prompt information includes other names similar to or the same as the target name in the same-name correspondence relationship. For example: if the target task name is 'small Songjia' and the search type is a picture, the online search system determines a name 'large Songjia' similar to the 'small Songjia' after obtaining the picture of the 'small Songjia', the name 'large Songjia' is contained in the first prompt message and is output to the client of the user, and a search link of the 'large Songjia' is displayed on the client to prompt the user that other stars which are the same as or similar to the target name of the target searched by the user exist. If the star clicks the name of the prompt, namely 'big Songjia', the online search system searches according to the name of the prompt, so that the user can conveniently search the people with the same name.
It should be noted that, in the embodiment of the present invention, when a user enters a search keyword in a search page, a search type may be selected, for example, the search type may be: news, web pages, pictures, music, microblogs, maps, questions and answers, shopping, and the like. And the search type is included in a search request sent to the online search system so that data of the same type as the search type can be obtained.
In the embodiment of the present invention, a receiving module 601 in an online search system receives a search request of a user, where the search request includes a target name; then, the searching module 602 searches for a corresponding relationship of the same name, which includes a preset corresponding relationship between the name and the unique identifier, and determines a unique identifier corresponding to the target name, specifically, if only one name identical to the target name exists in the preset corresponding relationship of the same name, the first searching module 701 searches for a corresponding relationship between the name and the unique identifier included in the corresponding relationship of the same name, and determines a unique identifier corresponding to the target name; if at least two names identical to the target name are included in the preset identical-name correspondence, the second searching module 702 searches for a correspondence between the name included in the identical-name correspondence and the unique identifier, and determines a unique identifier corresponding to the name satisfying the condition among the at least two names identical to the target name. Then, the output module 603 searches according to the unique identifier, and outputs a search result, where the identifier of the item in the search result includes the unique identifier, specifically: an acquisition module 703 in the output module 603 searches according to the unique identifier to acquire an item containing the unique identifier in the identifier; the acquisition output module 704 in the output module 603 acquires an item of the same type as the search type from the items and outputs as a search result. Next, the first output module 705 outputs first prompt information, where the first prompt information includes other names similar to or the same as the target name in the same-name correspondence relationship.
In the embodiment of the invention, after an online search system receives a search request containing a target name from a user, a same-name corresponding relation which is preset and contains a corresponding relation between the name and a unique identifier is searched, a unique identifier corresponding to the target name is determined, searching is carried out according to a search type and the unique identifier, a search result is output, the type of an item in the search result is the same as the search type, the identifier of the item contains the unique identifier, and the search type is contained in the search request. The unique identifier is used for searching for the corresponding relation between the preset name and the unique identifier, so that the same name, such as two stars with the same name, can be distinguished by the unique identifier, and the item containing the unique identifier is obtained by searching through the unique identifier, so that the searching result is more accurate, and the searching experience of a user can be effectively improved. In addition, the online search system also outputs first prompt information which comprises other names similar to or identical to the target name in the same-name corresponding relationship, so that the search experience of the user can be effectively improved.
Although the present invention has been described with reference to the preferred embodiments, it should be understood that various changes, substitutions and alterations can be made herein without departing from the spirit and scope of the invention as defined by the appended claims.

Claims (17)

1. A method of searching, comprising:
receiving a search request sent by a client of a user, wherein the search request comprises a target name;
when the target name is a same name, searching a preset same name corresponding relation, and determining a unique identifier corresponding to the target name, wherein the same name corresponding relation comprises the corresponding relation between the name and the unique identifier;
searching according to the unique identifier, outputting a search result, and sending the search result to the client of the user so as to enable the user to search according to the unique identifier
The client of the user loads and displays the search result, wherein the identification of the item in the search result comprises the unique identification;
and when the names of different types which are completely the same as the target name are searched, sending prompt information containing the names of different types to the client of the user.
2. The method according to claim 1, wherein the searching for a preset homonym correspondence and determining a unique identifier corresponding to the target name comprises:
if only one name which is the same as the target name exists in the preset homonymy corresponding relation, searching the corresponding relation between the name in the homonymy corresponding relation and the unique identifier, and determining the unique identifier corresponding to the target name;
if at least two names which are the same as the target name are contained in the preset homonymy corresponding relation, searching the corresponding relation between the name in the homonymy corresponding relation and the unique identifier, and determining the unique identifier corresponding to the name which meets the condition in the at least two names which are the same as the target name.
3. The method according to claim 2, wherein the searching according to the unique identifier and outputting a search result, where the identifier of the item in the search result includes the unique identifier, includes:
and searching according to the search type and the unique identifier, and outputting a search result, wherein the type of the item in the search result is the same as the search type, the identifier of the item comprises the unique identifier, and the search type is contained in the search request.
4. The method of claim 3, wherein the searching according to the search type and the unique identifier and outputting the search result comprises:
searching according to the unique identifier to obtain an item containing the unique identifier in the identifier;
and acquiring the items with the same type as the search type from the items and outputting the items as the search result.
5. The method according to any one of claims 2 to 4, further comprising:
and outputting first prompt information, wherein the first prompt information comprises other names which are similar or identical to the target name in the corresponding relation of the same name.
6. A data processing method, comprising:
carrying out name data mining to obtain all the same names;
determining an item associated with each of said all of the same names;
setting a unique identifier for each name in all the same names to obtain a corresponding relationship of the same name, and setting a unique identifier which is the same as the name for the item associated with the name, wherein the corresponding relationship of the same name comprises the corresponding relationship between the name and the unique identifier,
the corresponding relation of the same name is used for searching the corresponding relation of the same name when a target name contained in a search request is the same name after the online search system receives the search request sent by a client of a user, so as to determine a unique identifier corresponding to the target name.
7. The method of claim 6, wherein said performing name data mining for all names that are the same comprises:
capturing data by using a webpage crawler, and mining the data to obtain a name table;
and mining the same name of the name in the name table to obtain all the same names in the name table.
8. The method according to claim 6 or 7, characterized in that the method further comprises:
and importing the same-name corresponding relation containing the corresponding relation between the name and the unique identification and the items of the identified unique identification into the online search system.
9. An online search system, comprising:
the system comprises a receiving module, a searching module and a searching module, wherein the receiving module is used for receiving a searching request sent by a client of a user, and the searching request comprises a target name;
the searching module is used for searching a preset homonym corresponding relation after the receiving module receives the searching request when the target name is the homonym name, and determining a unique identifier corresponding to the target name, wherein the homonym corresponding relation comprises the corresponding relation between the name and the unique identifier;
and the output module is used for searching according to the unique identifier after the search module determines the unique identifier, outputting a search result, and sending the search result to the client of the user so as to load and display the search result by the client of the user, wherein the identifier of the item in the search result comprises the unique identifier, and when a non-homogeneous name which is completely the same as the target name is searched, the prompt information comprising the non-homogeneous name is sent to the client of the user.
10. The online search system of claim 9, wherein the lookup module specifically comprises:
the first searching module is used for searching the corresponding relation between the name and the unique identifier in the same-name corresponding relation and determining the unique identifier corresponding to the target name if only one name which is the same as the target name exists in the preset same-name corresponding relation after the receiving module receives the search request;
and a second searching module, configured to, after the receiving module receives the search request, if at least two names that are the same as the target name are included in a preset homonymy correspondence, search for a correspondence between a name in the homonymy correspondence and a unique identifier, and determine a unique identifier corresponding to a name that satisfies a condition, among the at least two names that are the same as the target name.
11. The online search system of claim 10, wherein the output module is specifically configured to, after the lookup module determines the unique identifier, perform a search according to a search type and the unique identifier, and output a search result, where a type of an item in the search result is the same as the search type, and the identifier of the item includes the unique identifier, and the search type is included in the search request.
12. The online search system of claim 11, wherein the output module comprises:
the acquisition module is used for searching according to the unique identifier after the search module determines the unique identifier, and acquiring the item containing the unique identifier in the identifier;
and the acquisition output module is used for acquiring the items with the same type as the search type from the items and outputting the items as the search result after the acquisition module acquires the items.
13. The online search system according to any one of claims 10 to 12, further comprising:
and the first output module is used for outputting first prompt information after the output module outputs the search result, wherein the first prompt information comprises other names which are similar to or identical to the target name in the corresponding relation of the same name.
14. A data processing apparatus, comprising:
the mining module is used for mining the data of the names to obtain all the same names;
the association module is used for determining an item associated with each of all the same names after the mining module obtains all the same names;
the setting module is used for setting a unique identifier for each name in all the same names after the association module determines the item associated with each name in all the same names to obtain a corresponding relationship with the same name, and setting the unique identifier which is the same as the name for the item associated with the name, wherein the corresponding relationship with the same name comprises the corresponding relationship between the name and the unique identifier, and after the corresponding relationship with the same name is used for an online search system to receive a search request sent by a client of a user, when a target name contained in the search request is the same name, the corresponding relationship with the same name is searched to determine the unique identifier corresponding to the target name.
15. The data processing apparatus of claim 14, wherein the mining module comprises:
the first mining module is used for capturing data by using a webpage crawler and mining the data to obtain a name table;
and the second mining module is used for mining the same name of the name in the name table to obtain all the same name in the name table.
16. A data processing apparatus according to claim 14 or 15, characterized in that the apparatus further comprises:
and the importing module is used for importing the same-name corresponding relation containing the corresponding relation between the name and the unique identifier and the items of the identified unique identifier into the online search system after the unique identifier is set by the setting module.
17. A computer-readable storage medium, wherein the storage medium has stored therein executable instructions that, when executed, implement the search method of any one of claims 1 to 5 or the data processing method of any one of claims 6 to 8.
CN201410566247.XA 2014-10-22 2014-10-22 Searching method, data processing method, device and system Active CN105589863B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410566247.XA CN105589863B (en) 2014-10-22 2014-10-22 Searching method, data processing method, device and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410566247.XA CN105589863B (en) 2014-10-22 2014-10-22 Searching method, data processing method, device and system

Publications (2)

Publication Number Publication Date
CN105589863A CN105589863A (en) 2016-05-18
CN105589863B true CN105589863B (en) 2020-07-14

Family

ID=55929451

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410566247.XA Active CN105589863B (en) 2014-10-22 2014-10-22 Searching method, data processing method, device and system

Country Status (1)

Country Link
CN (1) CN105589863B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108268582B (en) * 2017-07-14 2021-05-07 阿里巴巴(中国)有限公司 Information query method and device
CN112434127B (en) * 2020-11-03 2023-10-17 咪咕文化科技有限公司 Text information searching method, apparatus and readable storage medium

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1687924A (en) * 2005-04-28 2005-10-26 中国科学院计算技术研究所 Method for producing internet personage information search engine
CN101158964A (en) * 2007-11-09 2008-04-09 深圳市迅雷网络技术有限公司 System, device and method for automatically modifying target downloading file information
CN101454750A (en) * 2006-03-31 2009-06-10 谷歌公司 Disambiguation of named entities
CN102103640A (en) * 2011-02-28 2011-06-22 纽海信息技术(上海)有限公司 Method and device for providing article lists
CN102831128A (en) * 2011-06-15 2012-12-19 富士通株式会社 Method and device for sorting information of namesake persons on Internet
CN102867060A (en) * 2012-09-19 2013-01-09 广东欧珀移动通信有限公司 Method, device and system for data fast search
CN103559270A (en) * 2013-11-04 2014-02-05 北京中搜网络技术股份有限公司 Method for storing and managing entries

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1687924A (en) * 2005-04-28 2005-10-26 中国科学院计算技术研究所 Method for producing internet personage information search engine
CN101454750A (en) * 2006-03-31 2009-06-10 谷歌公司 Disambiguation of named entities
CN101158964A (en) * 2007-11-09 2008-04-09 深圳市迅雷网络技术有限公司 System, device and method for automatically modifying target downloading file information
CN102103640A (en) * 2011-02-28 2011-06-22 纽海信息技术(上海)有限公司 Method and device for providing article lists
CN102831128A (en) * 2011-06-15 2012-12-19 富士通株式会社 Method and device for sorting information of namesake persons on Internet
CN102867060A (en) * 2012-09-19 2013-01-09 广东欧珀移动通信有限公司 Method, device and system for data fast search
CN103559270A (en) * 2013-11-04 2014-02-05 北京中搜网络技术股份有限公司 Method for storing and managing entries

Also Published As

Publication number Publication date
CN105589863A (en) 2016-05-18

Similar Documents

Publication Publication Date Title
US11921805B2 (en) Web document enhancement
US11899681B2 (en) Knowledge graph building method, electronic apparatus and non-transitory computer readable storage medium
US9304979B2 (en) Authorized syndicated descriptions of linked web content displayed with links in user-generated content
US20130268597A1 (en) Relevance-Based Aggregated Social Feeds
US10503803B2 (en) Animated snippets for search results
Qian et al. Social media based event summarization by user–text–image co-clustering
US10210181B2 (en) Searching and annotating within images
JP2015204103A (en) Interactive search and recommendation method and device thereof
US9069794B1 (en) Determining location information for images using landmark, caption, and metadata location data
JP2019511065A (en) Information retrieval method and apparatus
CN107145497B (en) Method for selecting image matched with content based on metadata of image and content
US10061806B2 (en) Presenting previously selected search results
US10691746B2 (en) Images for query answers
US11249993B2 (en) Answer facts from structured content
KR102361112B1 (en) Extracting similar group elements
US20180285444A1 (en) Rewriting contextual queries
CN101894109A (en) Database building method and device
CN111104583B (en) Live broadcast room recommendation method, storage medium, electronic equipment and system
CN105786858A (en) Information search system and method
CN105589863B (en) Searching method, data processing method, device and system
KR20150045236A (en) Method and apparatus for managing scrap information
CN111259225B (en) New media information display method and device, electronic equipment and computer readable medium
CN104850608A (en) Method for searching keywords on information exhibiting page
CN106776862B (en) Game video searching method and device
US10878054B2 (en) Data processing method and apparatus

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant