CN105574175A - Processing method and device for optimizing search result title - Google Patents

Processing method and device for optimizing search result title Download PDF

Info

Publication number
CN105574175A
CN105574175A CN201510964509.2A CN201510964509A CN105574175A CN 105574175 A CN105574175 A CN 105574175A CN 201510964509 A CN201510964509 A CN 201510964509A CN 105574175 A CN105574175 A CN 105574175A
Authority
CN
China
Prior art keywords
title
data section
search
title data
search result
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510964509.2A
Other languages
Chinese (zh)
Inventor
郑思晴
王洁
王艳丽
吴凯
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Qihoo Technology Co Ltd
Qizhi Software Beijing Co Ltd
Original Assignee
Beijing Qihoo Technology Co Ltd
Qizhi Software Beijing Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Qihoo Technology Co Ltd, Qizhi Software Beijing Co Ltd filed Critical Beijing Qihoo Technology Co Ltd
Priority to CN201510964509.2A priority Critical patent/CN105574175A/en
Publication of CN105574175A publication Critical patent/CN105574175A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/957Browsing optimisation, e.g. caching or content distillation
    • G06F16/9577Optimising the visualization of content, e.g. distillation of HTML documents
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a processing method and a processing device for optimizing a search result title. The processing method comprises the following steps: according to a search term input by a user, obtaining a search result item matched with the search term, and extracting a title data segment corresponding to the search result item; removing redundant information from the title data segment, and optimizing the title data segment to obtain the optimized search result title; generating the search result item based on the optimized search result title, and loading the search result item into a search result page. By removing the redundant information from the title data segment corresponding to the search result item and optimizing the title data segment, the title data segment of a search result is not redundant any more, and after unnecessary fields are removed, a theme to be expressed by the title data segment is more simplified and highlighted; meanwhile, the screen space occupied when the title data segment is displayed is saved, the user can read the title data segment at a glance, the reading efficiency of the user is improved, and the inquiring and reading time of the user is shortened.

Description

The disposal route of Optimizing Search result title and device
Technical field
The present invention relates to computer software fields, be specifically related to a kind of disposal route and device of Optimizing Search result title.
Background technology
When user's inputted search word is searched for, under existing search pattern, the web page title that title in Search Results directly uses search engine collecting to arrive mostly, too many process is not done for search engine optimization and other reasons, therefore the title of Search Results there will be redundancy, unnecessary field, make title long, word is intensive to be piled up, both the theme that fuzzy title will be expressed, more screen space is taken again when title shows, caption text needs folds to show, and affects the reading experience of user.
Summary of the invention
In view of the above problems, the present invention is proposed to provide a kind of overcoming the problems referred to above or the disposal route of Optimizing Search result title solved the problem at least in part and device.
According to an aspect of the present invention, provide a kind of disposal route of Optimizing Search result title, it comprises: according to the search word of user's input, obtain the search result items of mating with described search word, extract the title data section that described search result items is corresponding; Remove the redundant information in described title data section, then process is optimized to described title data section, the Search Results title after being optimized; Generate search result items based on the Search Results title after described optimization to be loaded in search results pages.
Further, the redundant information in described removal title data section comprises further: remove the Repeating Field in described title data section; And/or, remove site name suffix content in described title data section; And/or, remove described title data section mid band and typonym.
Further, the redundant information in described removal title data section also comprises: for the video title data section of showing with intelligence summary form, remove attribute description field.
Further, the redundant information in described removal title data section also comprises: for the title data section comprising Search Results source icon, remove the source description field in described title data section.
Further, describedly process is optimized to title data section comprises further: all redundant symbols in described title data section are replaced with space.
Further, described redundant symbol comprises: separator, underscore, bracket, punctuation mark, and/or " " symbol.
Further, describedly process is optimized to title data section comprises further: for only having a word before suspension points in title data section and be not the word that search is relevant, this word is removed together with suspension points.
Further, described method also comprises: grab the Search Results thumbnail that described search result items is corresponding; Describedly Search Results title after search result items and described process is loaded into search results pages comprises further: the Search Results title after described search result items, described process and described Search Results thumbnail are loaded in search results pages.
According to a further aspect in the invention, provide a kind for the treatment of apparatus of Search Results title, it comprises: search module, is suitable for the search word according to user's input, obtain the search result items of mating with described search word, and grab title data section corresponding to described search result items; Processing module, is suitable for removing the redundant information in described title data section, is then optimized process to described title data section, obtains the Search Results title after processing; Load-on module, is suitable for the Search Results title after by described search result items and described process and is loaded in search results pages.
Further, described processing module comprises further: removal unit, is suitable for removing the Repeating Field in described title data section; And/or, remove site name suffix content in described title data section; And/or, remove described title data section mid band and typonym.
Further, described removal unit is also suitable for: for the video title data section of showing with intelligence summary form, remove attribute description field.
Further, described removal unit is also suitable for: for the title data section comprising Search Results source icon, remove the source description field in described title data section.
Further, described processing module comprises further: optimize unit, be suitable for all redundant symbols in described title data section to replace with space.
Further, described redundant symbol comprises: separator, underscore, bracket, punctuation mark, and/or " " symbol.
Further, described optimization unit is also suitable for: for only having a word before suspension points in title data section and be not the word that search is relevant, is removed by this word together with suspension points.
Further, described search module is also suitable for: grab the Search Results thumbnail that described search result items is corresponding; Described load-on module is further adapted for: the Search Results title after described search result items, described process and described Search Results thumbnail are loaded in search results pages.
According to disposal route and the device of Optimizing Search result title of the present invention, the redundant information in title data section corresponding for search result items can be removed, and carry out optimization process.Search Results title after optimization generates search result items and is loaded in search results pages.Make the title no longer redundancy of Search Results like this, after getting rid of unnecessary field, the theme that title can be made to express is simplified more, outstanding, the screen space taken when simultaneously saving title display, can be very clear when user reads, improve the reading efficiency of user, the time saved user's inquiry and read.
Above-mentioned explanation is only the general introduction of technical solution of the present invention, in order to technological means of the present invention can be better understood, and can be implemented according to the content of instructions, and can become apparent, below especially exemplified by the specific embodiment of the present invention to allow above and other objects of the present invention, feature and advantage.
Accompanying drawing explanation
By reading hereafter detailed description of the preferred embodiment, various other advantage and benefit will become cheer and bright for those of ordinary skill in the art.Accompanying drawing only for illustrating the object of preferred implementation, and does not think limitation of the present invention.And in whole accompanying drawing, represent identical parts by identical reference symbol.In the accompanying drawings:
Fig. 1 shows the process flow diagram of the disposal route of Optimizing Search result title according to an embodiment of the invention;
Fig. 2 shows the process flow diagram of the disposal route of Optimizing Search result title in accordance with another embodiment of the present invention;
Fig. 3 shows the structured flowchart of the treating apparatus of Optimizing Search result title according to an embodiment of the invention.
Embodiment
Below with reference to accompanying drawings exemplary embodiment of the present disclosure is described in more detail.Although show exemplary embodiment of the present disclosure in accompanying drawing, however should be appreciated that can realize the disclosure in a variety of manners and not should limit by the embodiment set forth here.On the contrary, provide these embodiments to be in order to more thoroughly the disclosure can be understood, and complete for the scope of the present disclosure can be conveyed to those skilled in the art.
Fig. 1 shows the process flow diagram of an embodiment of the disposal route of Optimizing Search result title provided by the invention, and as shown in Figure 1, the method for the present embodiment specifically comprises the steps:
Step S101, according to the search word of user's input, obtains the search result items of mating with search word, extracts the title data section that search result items is corresponding.
According to the search word of user's input, obtained the search result items of mating with search word by search engine.Contain each webpage with this search word related content in search result items, each webpage needs the title data section extracting its correspondence.Title data section point understands the content of this webpage, also matches with search word simultaneously.Search word, web page source, web site name, type of webpage etc. are generally comprised in title data section.In title data section, search word may occur more than once.
Step S102, removes the redundant information in title data section, is then optimized process to title data section, the Search Results title after being optimized.
The title data section obtained is processed, removes the redundant information in title data section, comprise further: remove the Repeating Field in title data section; And/or, remove site name suffix content in title data section; And/or, remove title data section mid band and typonym.
When including Repeating Field in title, one time is retained for Repeating Field.During as inquired about Beijing weather forecast, web page title display " Beijing weather forecast 30 days _ Beijing weather forecast inquiry _ Beijing weather forecast in 30 days one month _ Beijing ", wherein " Beijing weather forecast " occurred 3 times, the meaning that duplicate contents is too not large when title shows, and occupy longer length.Title only shows " Beijing weather forecast 30 days " just can describe the key content of webpage for 1 time very accurately.After removing Repeating Field, title is shown as " Beijing weather forecast 30 days ".
Site name suffix content is contained, as " XXX_ searches question and answer well ", " XXX-Beijing weather net ", " literature city, XXX-Jinjiang " etc. toward contact in title.Site name suffix content designates web page contents and derives from which website, concrete source web does not associate with web page contents, and the meaning that will show with title is too not large yet to be associated, therefore removal station roll-call suffix content, title content is not affected, also makes title content more simple and clear.In addition web page source arranges position display in search results pages, does not need in title, repeat display.
Input certain video or certain song, this clear and definite type of certain novel search word time, often containing the typonym such as " TV play ", " variety ", " film ", " song ", " novel " in the title obtained, the title of the channel such as " liking strange skill ", " PPS " sometimes also can be contained.The field removing these channels and type does not affect title content, after removal the subject content of the understanding title that user still can be clear and definite.
The redundant information removed in title data section also comprises: for the video title data section of showing with intelligence summary form, remove attribute description field.The intelligence summary of video search includes picture and the duration of video, clicks video pictures and can play video.If this video is TV play, intelligence summary also show the collection number of current video; If this video is album, intelligence summary also show the number of songs of special edition.The field such as " watching online ", " high definition is watched online " is generally also comprised in the title data section of video, these field descriptions attribute of this video, from intelligence summary, clearly can demonstrate the attribute that this video " is watched " online, the attribute field such as " watching online ", " high definition is watched online " in title data section can be removed.
The redundant information removed in title data section also comprises: for the title data section comprising Search Results source icon, remove the source description field in title data section.As Search Results comes from " Jingdone district " website, the red icon of " JD " can be shown in the foremost of title data section, clearly represent that this search derives from " Jingdone district " website, removed the field in " Jingdone district " in title data section, do not affect the content of title data section.Other websites, as the website such as " bean cotyledon ", " Suning easily purchases ", all can show the icon of oneself website, remove corresponding website sources field in title data section before title data section.
Be optimized process to title data section to comprise further: redundant symbols all in title data section is replaced with space.Sometimes comprise a lot of redundancies in search title to meet, as "! ", ", ", ", ", "? ", "-(separator) ", " _ (underscore) " etc. symbol.As " 2011MAMA awards ceremony-MissA [GoodBye.Baby]-high definition is watched-PPTV online and is gathered power ... ".Search title is that search engine directly extracts from Search Results site title a bit, some uses SEO (SearchEngineOptimization) search engine optimization, after site title is improved, the title data section extracted by search engine.In these titles, redundant symbol and word mix, user is affected to the reading of title and understanding from typesetting format, redundancy is met and replaces with space, as the title " 2011MAMA awards ceremony MissAGoodByeBaby " after replacement, title can be made more neat from typesetting format, easy-to-read and understanding.The redundant symbol of replacing is needed to comprise: separator, underscore, bracket, punctuation mark, and/or " " symbol.Above-mentioned mention "! ", ", ", ", ", "? ", be and illustrate, punctuation mark is not limited only to this.
Be optimized process to title data section to comprise further: for only having a word before suspension points in title data section and be not the word that search is relevant, this word is removed together with suspension points.As search " a cool breeze blows gently ", display title " Wang Fei-a cool breeze blows gently, and film " port Embarrassing " theme song-high definition MV plays online ... _ sound ... ", " sound ... " at title end is after suspension points is replaced, it is incomplete that independent " sound " word makes title read statement, and " sound " does not associate with search word, easilier cause understanding problem to user, therefore after suspension points is replaced, " sound " word irrelevant with search that this is independent is removed.Title after optimization is " Wang Fei a cool breeze blows gently film " port Embarrassing " theme song ", and statement completes, and user can not be made to produce ambiguity.
Step S103, is loaded in search results pages based on the Search Results title generation search result items after optimizing.
By performing step S102, Search Results title after optimization being combined with the content in former search result items, generates new search result items, and be loaded in search results pages.
According to the disposal route of Optimizing Search result title of the present invention, title data section in search result items is removed redundant information, to go forward side by side one-step optimization process, make the title data section no longer redundancy of Search Results, after getting rid of unnecessary field, the theme that title data section can be made to express is simplified more, give prominence to, the screen space taken when simultaneously saving the display of title data section, can be very clear when user reads, improve the reading efficiency of user, the time saved user's inquiry and read.
Fig. 2 shows the process flow diagram of another embodiment of the disposal route of Optimizing Search result title provided by the invention, and as shown in Figure 2, the method for the present embodiment specifically comprises the steps:
Step S201, according to the search word of user's input, obtains the search result items of mating with search word, extracts the title data section that search result items is corresponding.
According to the search word of user's input, obtained the search result items of mating with search word by search engine.Contain each webpage with this search word related content in search result items, each webpage needs the title data section extracting its correspondence.Title data section point understands the content of this webpage, also matches with search word simultaneously.Search word, web page source, web site name, type of webpage etc. are generally comprised in title data section.In title data section, search word may occur more than once.
Step S202, captures the Search Results thumbnail that search result items is corresponding.
In search result items except the title data section mentioned, also comprise the picture of Search Results, step S202 captures thumbnail from search result items.If this search is video search, thumbnail can be the picture of this video, shows the duration of this video on thumbnail simultaneously, clicks thumbnail and can play this video.If this video is TV play, thumbnail also should show the collection number of current video; If this video is album, thumbnail also should show the number of songs of special edition.If search is novel, caricature search, thumbnail can be novel front cover.If search for as personage's search, thumbnail can be personal portrait etc.If time in search result items without picture, corresponding intelligence summary can be increased, be equipped with pure background color picture for thumbnail with the word of simplifying, giving top priority to what is the most important content.More than be and illustrate, during actual enforcement, be not limited only to this.
Step S203, removes the redundant information in title data section, is then optimized process to title data section, the Search Results title after being optimized.
This step S203 is identical with the step S102 of Fig. 1 embodiment, please refer to the description of step S102 in Fig. 1 embodiment.
Step S204, is loaded into the Search Results title after search result items, process and Search Results thumbnail in search results pages.
Search Results title after search result items and process is loaded into search results pages comprise further: the Search Results title after search result items, process and Search Results thumbnail are loaded in search results pages.By performing step S202 and step S203, the Search Results thumbnail that after former search result items, step S203 being optimized, Search Results title, step S202 capture, is loaded in search results pages.
According to the disposal route of Optimizing Search result title of the present invention, except removing redundant information to title data section, to go forward side by side one-step optimization process, further increase search thumbnail, make the title data section no longer redundancy of Search Results, its theme of expressing is simplified more, outstanding, the adding of picture simultaneously, make user get more information about search content.Improve the reading efficiency of user, the time saving user's inquiry and read.
Fig. 3 shows the functional block diagram of an embodiment of the treating apparatus of Optimizing Search result title provided by the invention, and as shown in Figure 3, the device of the present embodiment comprises with lower module:
Search module 301, is suitable for, according to the search word of user's input, obtaining the search result items of mating with search word, and grabbing title data section corresponding to search result items.
According to the search word of user's input, obtained the search result items of mating with search word by search module 301.Contain each webpage with this search word related content in search result items, each webpage needs the title data section being extracted its correspondence by search module 301.Title data section point understands the content of this webpage, also matches with search word simultaneously.Search word, web page source, web site name, type of webpage etc. are generally comprised in title data section.In title data section, search word may occur more than once.
Search module 301 is also suitable for: grab the Search Results thumbnail that described search result items is corresponding.
In search result items except the title data section mentioned, also comprise the picture of Search Results, search module 301 captures thumbnail from search result items.If this search is video search, thumbnail can be the picture of this video, shows the duration of this video on thumbnail simultaneously, clicks thumbnail and can play this video.If this video is TV play, thumbnail also should show the collection number of current video; If this video is album, thumbnail also should show the number of songs of special edition.If search is novel, caricature search, thumbnail can be novel front cover.If search for as personage's search, thumbnail can be personal portrait etc.If time in search result items without picture, corresponding intelligence summary can be increased, be equipped with pure background color picture for thumbnail with the word of simplifying, giving top priority to what is the most important content.More than be and illustrate, during actual enforcement, be not limited only to this.
Processing module 302, is suitable for removing the redundant information in title data section, is then optimized process to title data section, obtains the Search Results title after processing.
Processing module 302 comprises further: removal unit 3021, is suitable for removing the Repeating Field in title data section; And/or, remove site name suffix content in title data section; And/or, remove title data section mid band and typonym.
When including Repeating Field in title, removal unit 3021 only retains one time for Repeating Field.During as inquired about Beijing weather forecast, web page title display " Beijing weather forecast 30 days _ Beijing weather forecast inquiry _ Beijing weather forecast in 30 days one month _ Beijing ", wherein " Beijing weather forecast " occurred 3 times, the meaning that duplicate contents is too not large when title shows, and occupy longer length.Title only shows " Beijing weather forecast 30 days " just can describe the key content of webpage for 1 time very accurately.After removal unit 3021 removes Repeating Field, title is shown as " Beijing weather forecast 30 days ".
Site name suffix content is contained, as " XXX_ searches question and answer well ", " XXX-Beijing weather net ", " literature city, XXX-Jinjiang " etc. toward contact in title.Site name suffix content designates web page contents and derives from which website, concrete source web does not associate with web page contents, the meaning that will show with title is too not large yet to be associated, therefore removal unit 3021 removal station roll-call suffix content, title content is not affected, also makes title content more simple and clear.In addition web page source arranges position display in search results pages, does not need in title, repeat display.
Input certain video or certain song, this clear and definite type of certain novel search word time, often containing the typonym such as " TV play ", " variety ", " film ", " song ", " novel " in the title obtained, the title of the channel such as " liking strange skill ", " PPS " sometimes also can be contained.The field that removal unit 3021 removes these channels and type does not affect title content, after removal the subject content of the understanding title that user still can be clear and definite.
Removal unit 3021 is also suitable for: for the video title data section of showing with intelligence summary form, remove attribute description field.The intelligence summary of video search includes picture and the duration of video, clicks video pictures and can play video.If this video is TV play, intelligence summary also show the collection number of current video; If this video is album, intelligence summary also show the number of songs of special edition.The field such as " watching online ", " high definition is watched online " is generally also comprised in the title data section of video, these field descriptions attribute of this video, from intelligence summary, clearly can demonstrate the attribute that this video " is watched " online, the attribute field such as " watching online ", " high definition is watched online " in title data section can be removed.
Removal unit 3021 is also suitable for: for the title data section comprising Search Results source icon, remove the source description field in described title data section.As Search Results comes from " Jingdone district " website, the red icon of " JD " can be shown in the foremost of title data section, clearly represent that this search derives from " Jingdone district " website, removed the field in " Jingdone district " in title data section, do not affect the content of title data section.Other websites, as the website such as " bean cotyledon ", " Suning easily purchases ", all can show the icon of oneself website, remove corresponding website sources field in title data section before title data section.
Processing module comprises further: optimize unit 3022, be suitable for all redundant symbols in described title data section to replace with space.Sometimes comprise a lot of redundancies in search title to meet, as "! ", ", ", ", ", "? ", "-(separator) ", " _ (underscore) " etc. symbol.As " 2011MAMA awards ceremony-MissA [GoodBye.Baby]-high definition is watched-PPTV online and is gathered power ... ".Search title is that search engine directly extracts from Search Results site title a bit, some uses SEO (SearchEngineOptimization) search engine optimization, after site title is improved, the title data section extracted by search engine.In these titles, redundant symbol and word mix, user is affected to the reading of title and understanding from typesetting format, redundancy is met and replaces with space, the title " 2011MAMA awards ceremony MissAGoodByeBaby " after unit 3022 replacement is optimized as performed, title can be made more neat from typesetting format, easy-to-read and understanding.The redundant symbol of replacing is needed to comprise: separator, underscore, bracket, punctuation mark, and/or " " symbol.Above-mentioned mention "! ", ", ", ", ", "? ", be and illustrate, punctuation mark is not limited only to this.
Optimize unit 3022 to be also suitable for: for only having a word before suspension points in title data section and be not the word that search is relevant, this word is removed together with suspension points.As search " a cool breeze blows gently ", display title " Wang Fei-a cool breeze blows gently, and film " port Embarrassing " theme song-high definition MV plays online ... _ sound ... ", " sound ... " at title end is after suspension points is replaced, it is incomplete that independent " sound " word makes title read statement, and " sound " does not associate with search word, easilier cause understanding problem to user, therefore after suspension points is replaced, " sound " word irrelevant with search that this is independent is removed.Perform optimize unit 3022 optimize after title be " Wang Fei a cool breeze blows gently film " port Embarrassing " theme song ", statement completes, and user can not be made to produce ambiguity.
Load-on module 303, is suitable for the Search Results title after by search result items and process and is loaded in search results pages.
By performing processing module 302, Search Results title after optimization being combined with the content in former search result items, generates new search result items, and be loaded in search results pages.
Load-on module 303 is further adapted for: the Search Results title after search result items, process and Search Results thumbnail are loaded in search results pages.
Search Results title after search result items and process is loaded into search results pages comprise further: the Search Results title after search result items, process and Search Results thumbnail are loaded in search results pages.By performing and, by former search result items, processing module 302 optimizes rear Search Results title, search module 301 captures Search Results thumbnail, be loaded in search results pages.
According to the treating apparatus of Optimizing Search result title of the present invention, title data section in search result items is removed redundant information, to go forward side by side one-step optimization process, make the title data section no longer redundancy of Search Results, after getting rid of unnecessary field, the theme that title data section can be made to express is simplified more, give prominence to, the screen space taken when simultaneously saving the display of title data section, can be very clear when user reads, improve the reading efficiency of user, the time saved user's inquiry and read.And the increase of Search Results thumbnail, user can be made to get more information about search content.
Intrinsic not relevant to any certain computer, virtual system or miscellaneous equipment with display at this algorithm provided.Various general-purpose system also can with use based on together with this teaching.According to description above, the structure constructed required by this type systematic is apparent.In addition, the present invention is not also for any certain programmed language.It should be understood that and various programming language can be utilized to realize content of the present invention described here, and the description done language-specific is above to disclose preferred forms of the present invention.
In instructions provided herein, describe a large amount of detail.But can understand, embodiments of the invention can be put into practice when not having these details.In some instances, be not shown specifically known method, structure and technology, so that not fuzzy understanding of this description.
Similarly, be to be understood that, in order to simplify the disclosure and to help to understand in each inventive aspect one or more, in the description above to exemplary embodiment of the present invention, each feature of the present invention is grouped together in single embodiment, figure or the description to it sometimes.But, the method for the disclosure should be construed to the following intention of reflection: namely the present invention for required protection requires feature more more than the feature clearly recorded in each claim.Or rather, as claims below reflect, all features of disclosed single embodiment before inventive aspect is to be less than.Therefore, the claims following embodiment are incorporated to this embodiment thus clearly, and wherein each claim itself is as independent embodiment of the present invention.
Those skilled in the art are appreciated that and adaptively can change the module in the equipment in embodiment and they are arranged in one or more equipment different from this embodiment.Module in embodiment or unit or assembly can be combined into a module or unit or assembly, and multiple submodule or subelement or sub-component can be put them in addition.Except at least some in such feature and/or process or unit be mutually repel except, any combination can be adopted to combine all processes of all features disclosed in this instructions (comprising adjoint claim, summary and accompanying drawing) and so disclosed any method or equipment or unit.Unless expressly stated otherwise, each feature disclosed in this instructions (comprising adjoint claim, summary and accompanying drawing) can by providing identical, alternative features that is equivalent or similar object replaces.
In addition, those skilled in the art can understand, although embodiments more described herein to comprise in other embodiment some included feature instead of further feature, the combination of the feature of different embodiment means and to be within scope of the present invention and to form different embodiments.Such as, in the following claims, the one of any of embodiment required for protection can use with arbitrary array mode.
All parts embodiment of the present invention with hardware implementing, or can realize with the software module run on one or more processor, or realizes with their combination.It will be understood by those of skill in the art that the some or all functions that microprocessor or digital signal processor (DSP) can be used in practice to realize according to the some or all parts in the treating apparatus of the Optimizing Search result title of the embodiment of the present invention.The present invention can also be embodied as part or all equipment for performing method as described herein or device program (such as, computer program and computer program).Realizing program of the present invention and can store on a computer-readable medium like this, or the form of one or more signal can be had.Such signal can be downloaded from internet website and obtain, or provides on carrier signal, or provides with any other form.
The present invention will be described instead of limit the invention to it should be noted above-described embodiment, and those skilled in the art can design alternative embodiment when not departing from the scope of claims.In the claims, any reference symbol between bracket should be configured to limitations on claims.Word " comprises " not to be got rid of existence and does not arrange element in the claims or step.Word "a" or "an" before being positioned at element is not got rid of and be there is multiple such element.The present invention can by means of including the hardware of some different elements and realizing by means of the computing machine of suitably programming.In the unit claim listing some devices, several in these devices can be carry out imbody by same hardware branch.Word first, second and third-class use do not represent any order.Can be title by these word explanations.

Claims (10)

1. a disposal route for Optimizing Search result title, it comprises:
According to the search word of user's input, obtain the search result items of mating with described search word, extract the title data section that described search result items is corresponding;
Remove the redundant information in described title data section, then process is optimized to described title data section, the Search Results title after being optimized;
Generate search result items based on the Search Results title after described optimization to be loaded in search results pages.
2. method according to claim 1, wherein, the redundant information in described removal title data section comprises further:
Remove the Repeating Field in described title data section;
And/or, remove site name suffix content in described title data section;
And/or, remove described title data section mid band and typonym.
3. method according to claim 2, wherein, the redundant information in described removal title data section also comprises: for the video title data section of showing with intelligence summary form, remove attribute description field.
4. method according to claim 2, wherein, the redundant information in described removal title data section also comprises: for the title data section comprising Search Results source icon, remove the source description field in described title data section.
5. the method according to any one of claim 1-4, wherein, is describedly optimized process to title data section and comprises further: all redundant symbols in described title data section are replaced with space.
6. method according to claim 5, wherein, described redundant symbol comprises: separator, underscore, bracket, punctuation mark, and/or " " symbol.
7. the method according to any one of claim 1-6, wherein, is describedly optimized process to title data section and comprises further: for only having a word before suspension points in title data section and be not the word that search is relevant, is removed by this word together with suspension points.
8. the method according to any one of claim 1-7, wherein, described method also comprises: grab the Search Results thumbnail that described search result items is corresponding;
Describedly Search Results title after search result items and described process is loaded into search results pages comprises further: the Search Results title after described search result items, described process and described Search Results thumbnail are loaded in search results pages.
9. a treating apparatus for Search Results title, it comprises:
Search module, is suitable for, according to the search word of user's input, obtaining the search result items of mating with described search word, and grabbing title data section corresponding to described search result items;
Processing module, is suitable for removing the redundant information in described title data section, is then optimized process to described title data section, obtains the Search Results title after processing;
Load-on module, is suitable for the Search Results title after by described search result items and described process and is loaded in search results pages.
10. device according to claim 9, wherein, described processing module comprises further: removal unit, is suitable for removing the Repeating Field in described title data section; And/or, remove site name suffix content in described title data section; And/or, remove described title data section mid band and typonym.
CN201510964509.2A 2015-12-21 2015-12-21 Processing method and device for optimizing search result title Pending CN105574175A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510964509.2A CN105574175A (en) 2015-12-21 2015-12-21 Processing method and device for optimizing search result title

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510964509.2A CN105574175A (en) 2015-12-21 2015-12-21 Processing method and device for optimizing search result title

Publications (1)

Publication Number Publication Date
CN105574175A true CN105574175A (en) 2016-05-11

Family

ID=55884306

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510964509.2A Pending CN105574175A (en) 2015-12-21 2015-12-21 Processing method and device for optimizing search result title

Country Status (1)

Country Link
CN (1) CN105574175A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106484660A (en) * 2016-10-21 2017-03-08 合网络技术(北京)有限公司 Title treating method and apparatus

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101599058A (en) * 2009-06-01 2009-12-09 杨马起 Handle and filter the method and system of contact method in the computer character information
CN102023998A (en) * 2009-09-21 2011-04-20 创新科技有限公司 Method and device for processing webpage so as to display on handheld equipment
CN104317931A (en) * 2014-10-31 2015-01-28 北京奇虎科技有限公司 Webpage title determining method and device
CN104331458A (en) * 2014-10-31 2015-02-04 北京奇虎科技有限公司 Method and device using anchor text as webpage title
CN104915443A (en) * 2015-06-29 2015-09-16 北京信息科技大学 Extraction method of Chinese Microblog evaluation object
CN105095175A (en) * 2014-04-18 2015-11-25 北京搜狗科技发展有限公司 Method and device for obtaining truncated web title

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101599058A (en) * 2009-06-01 2009-12-09 杨马起 Handle and filter the method and system of contact method in the computer character information
CN102023998A (en) * 2009-09-21 2011-04-20 创新科技有限公司 Method and device for processing webpage so as to display on handheld equipment
CN105095175A (en) * 2014-04-18 2015-11-25 北京搜狗科技发展有限公司 Method and device for obtaining truncated web title
CN104317931A (en) * 2014-10-31 2015-01-28 北京奇虎科技有限公司 Webpage title determining method and device
CN104331458A (en) * 2014-10-31 2015-02-04 北京奇虎科技有限公司 Method and device using anchor text as webpage title
CN104915443A (en) * 2015-06-29 2015-09-16 北京信息科技大学 Extraction method of Chinese Microblog evaluation object

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
南京航空航天大学图书馆组: "《网络信息采集与应用》", 30 September 2005, 清华大学出版社 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106484660A (en) * 2016-10-21 2017-03-08 合网络技术(北京)有限公司 Title treating method and apparatus

Similar Documents

Publication Publication Date Title
US20180101614A1 (en) Machine Learning-Based Data Aggregation Using Social Media Content
US8868609B2 (en) Tagging method and apparatus based on structured data set
US8010344B2 (en) Dictionary word and phrase determination
CN102831127B (en) Method, device and system for processing repeating data
CN101187941B (en) Apparatus and method for optimized index search
CN104765809A (en) Preview method and device of search pictures of mobile terminal
CN104077388A (en) Summary information extraction method and device based on search engine and search engine
CN104462506A (en) Method and device for establishing knowledge graph based on user annotation information
CN105095168A (en) Automatic generation method and device for contract files
CN104699751A (en) Search recommending method and device based on search terms
CN103870461A (en) Topic recommendation method, device and server
CN105095391A (en) Device and method for identifying organization name by word segmentation program
CN105512104A (en) Dictionary dimension reducing method and device and information classifying method and device
CN102982118A (en) Searching method and device based on favorites
WO2014000130A1 (en) Method or system for automated extraction of hyper-local events from one or more web pages
CN104462504A (en) Method and device for providing reasoning process data in search
CN104331438A (en) Method and device for selectively extracting content of novel webpage
Skare The paratext of digital documents
JP6868576B2 (en) Event presentation system and event presentation device
CN103761231A (en) Method and device for providing media content information of page by search engine
CN105574175A (en) Processing method and device for optimizing search result title
CN105159921A (en) Method and apparatus for de-duplicating point-of-interest (POI) data in map
CN104778232A (en) Searching result optimizing method and device based on long query
JP5423470B2 (en) Name identification check support device, name identification check support program, and name identification check support method
CN113743432A (en) Image entity information acquisition method, device, electronic device and storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20160511