CN101566995A - Method and system for integral release of internet information - Google Patents

Publication number
CN101566995A
Authority
CN
China
Prior art keywords
neologisms
new words
internet
internet new
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CNA2008101050657A
Other languages
Chinese (zh)
Inventor
张扬
林凡
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Sogou Technology Development Co Ltd
Original Assignee
Beijing Sogou Technology Development Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Sogou Technology Development Co Ltd
Priority to CNA2008101050657A
Publication of CN101566995A

Landscapes

  • Information Transfer Between Computers (AREA)

Abstract

The invention provides a method, a device and a system for the integrated release of internet information. The method comprises the following steps: acquiring internet new words; acquiring description information about the attributes of the internet new words themselves; acquiring service resources related to the internet new words; displaying the internet new words; and receiving a user's information acquisition request for one internet new word, and releasing the description information about that word's own attributes together with the related service resources or links to the related service resources. The method, device and system mine new words and the various information related to them from the various resources of the internet, and present that information, together with the discovery process, to users in several ways. By presenting fashionable hot words and their related information to end users promptly, they help users keep up with the latest trends on the internet and increase the speed at which fresh information is acquired and spread on the internet.

Description

Method and system for the integrated release of internet information
Technical field
The present invention relates to the technical field of internet information dissemination, and in particular to a method and system for the integrated release of internet information.
Background art
As internet technology is used ever more widely, much of people's daily work and entertainment takes place online, and the amount of internet information available to them has grown explosively. In their information-gathering habits, people generally want to obtain fresh information promptly.
Fresh information here includes news, commentary and similar content as of the current point in time. It can be obtained through internet access points such as portal websites; for example, portals such as Sohu and Sina update the information on their sites promptly to meet this demand. By visiting a portal, a user can obtain the information it offers as of the current point in time.
However, this approach fails when the user wants depth as well as breadth: not only the range of fresh information available at the current point in time, but also the various information about a particular topic over a recent period, or about an event that is new to this user together with how that event has developed over time. In practice, demand for the latter kind of information is the stronger. Internet new words are a case in point: while browsing, or from a friend, a user comes across an internet new word (new to this user) without knowing what it means, and so wants to find the various information related to it in order to understand it better. New words in the sense of the present invention include entries that are coined as people encounter new things and that are used and spread widely in daily life, including fashionable vocabulary, people in the news, major events, and specific terms for a certain group of people. New words are generally widely used, popular, colloquial and often abbreviated; examples are "Eight Do's and Eight Don'ts", "roadshow", "subprime mortgage", "hongpa" (轰趴, home party), "blog fight" and "Free-Hug Campaign".
To meet this demand, the user can follow information links (for example, related-news or related-article recommendations) from one item to other related items and so collect information about the new word. However, the number of such links is usually limited, and beyond two levels of links the relevance to the original information drops noticeably. The user therefore has to spend considerable effort on the retrieval process, and efficiency is low.
With the development of internet search engines, people increasingly use keyword search to obtain information about an internet new word. This approach also has significant drawbacks. First, the user must already know that the new word exists before searching; in many cases the user does not, so this approach still limits how fast fresh information related to the new word spreads on the internet. Second, although the search results have some relevance to the new word, the degree of relevance varies from result to result and the results describe the word along different dimensions, so the user must read a large number of results to gain a reasonably complete understanding. Information acquisition efficiency remains low.
In short, a technical problem that those skilled in the art urgently need to solve is how to devise an information release scheme that increases the speed at which fresh internet information spreads.
Summary of the invention
The technical problem to be solved by the present invention is to provide a solution for the integrated release of internet information. The solution mines the relationships between internet new words and the various information and services on the internet, and integrates and releases the related information centered on the internet new word. This increases the speed at which fresh information centered on internet new words spreads and improves the user's information acquisition efficiency.
To solve the above problem, the invention discloses a method for the integrated release of internet information, comprising: acquiring internet new words; acquiring description information about the attributes of each internet new word itself; acquiring service resources related to the internet new words; displaying the internet new words; and receiving a user's information acquisition request for one internet new word, and releasing the description information about that word's own attributes together with the related service resources or links to the related service resources.
Preferably, internet new words are acquired as follows: obtaining new-word candidates; and screening the candidates according to preset new-word features to obtain new words.
Preferably, the new-word features include frequency features, which comprise any one or any combination of the following three: the number of times the candidate appears as a query word in search engine logs; statistics on input-method users' use of the candidate; and statistical features of the candidate in web pages.
Preferably, the new-word features also include temporal features, which comprise a sudden-appearance characteristic and a steadily-rising-usage characteristic.
Preferably, service resources related to an internet new word are acquired by querying service resource collections of various types for resources related to that word. The service resource types include search services, desktop products, news, blogs, games, relationship networks, tags, aggregated content, online dictionaries and wireless value-added services.
Preferably, internet new words are displayed through a client application or an application plug-in, or alternatively through website pages.
Preferably, the description information about the internet new word's own attributes includes a definition of the new word, obtained by mining web page information.
Preferably, the description information also includes a statistical trend chart or a rating of the number of occurrences of the new word along some dimension over a period of time. The dimensions include web page information, query logs, user input and user clicks on the new word.
Preferably, the description information also includes the correct code string for the new word under a specific input method, obtained by collecting and analyzing the code strings that users enter.
According to another preferred embodiment of the invention, a system for the integrated release of internet information is also disclosed, comprising:
a unit for acquiring internet new words;
a unit for acquiring description information about the attributes of the internet new words themselves;
a unit for acquiring service resources related to the internet new words;
a unit for displaying internet new words;
a release unit for receiving a user's information acquisition request for one internet new word, and releasing the description information about that word's own attributes together with the related service resources or links to the related service resources.
Preferably, the unit for acquiring internet new words further comprises: a new-word candidate unit for obtaining new-word candidates; and a screening unit for screening the candidates according to preset new-word features to obtain new words.
Preferably, the new-word features include frequency features, which comprise any one or any combination of the following three: the number of times the candidate appears as a query word in search engine logs; statistics on input-method users' use of the candidate; and statistical features of the candidate in web pages.
Preferably, the new-word features also include temporal features, which comprise a sudden-appearance characteristic and a steadily-rising-usage characteristic.
Preferably, the service resources related to an internet new word are obtained by querying service resource collections of various types with that word. The service resource types include search services, desktop products, news, blogs, games, relationship networks, tags, aggregated content, online dictionaries and wireless value-added services.
Preferably, the unit for displaying internet new words uses a client application or an application plug-in, or alternatively website pages.
Preferably, the description information about the internet new word's own attributes includes a definition of the new word, obtained by mining web page information.
Preferably, the description information also includes a statistical trend chart or a rating of the number of occurrences of the new word along some dimension over a period of time. The dimensions include web page information, query logs, user input and user clicks on the new word.
Preferably, the description information also includes the correct code string for the new word under a specific input method, obtained by collecting and analyzing the code strings that users enter.
According to another preferred embodiment of the invention, a device for the integrated release of internet information is also disclosed, comprising:
a new-word information database for storing internet new words, the description information about the attributes of each internet new word itself, information on the service resources related to each internet new word, and the mapping relations among the three;
an interface module for displaying internet new words and receiving a user's information acquisition request for one internet new word;
a release module for, on receiving an information acquisition request for one internet new word, retrieving from the new-word information database and releasing the description information about that word's own attributes together with the related service resources or links to the related service resources.
Preferably, the service resources related to an internet new word are obtained by querying service resource collections of various types with that word. The service resource types include search services, desktop products, news, blogs, games, relationship networks, tags, aggregated content, online dictionaries and wireless value-added services.
Preferably, the interface module displays internet new words through a client application or an application plug-in, or alternatively through website pages.
Preferably, the description information about the internet new word's own attributes includes a definition of the new word, obtained by mining web page information.
Preferably, the description information also includes a statistical trend chart or a rating of the number of occurrences of the new word along some dimension over a period of time. The dimensions include web page information, query logs, user input and user clicks on the new word.
Preferably, the description information also includes the correct code string for the new word under a specific input method, obtained by collecting and analyzing the code strings that users enter.
Compared with the prior art, the present invention has the following advantages:
First, the invention mines new words and the various information related to them from the various resources of the internet, and presents that information, together with the discovery process, to the user in several ways. Presenting these fashionable, popular words and their related information to end users promptly helps users keep up with the latest trends on the internet and speeds up the acquisition and spread of fresh internet information.
Second, in presenting the related information, the invention integrates the various services offered by the service provider around the internet new word, giving the user a combined information interface that closely matches the user's needs. The invention can therefore improve service quality, increase user visits, strengthen user loyalty, speed up the promotion of services and raise user satisfaction.
Description of drawings
Fig. 1 is a flow chart of the steps of an embodiment of the method for the integrated release of internet information according to the invention;
Fig. 2 is a structural block diagram of an embodiment of the system for the integrated release of internet information according to the invention;
Fig. 3 is a structural relationship diagram of a preferred specific implementation of the system for the integrated release of internet information according to the invention;
Fig. 4 is a schematic diagram of a new-word prompt interface according to the invention;
Fig. 5 is a structural block diagram of an embodiment of the device for the integrated release of internet information according to the invention;
Fig. 6 is a schematic diagram of an interface that releases related information for the specific new word "the king oak is prosperous" according to the invention.
Detailed description
To make the above objects, features and advantages of the invention easier to understand, the invention is described in further detail below with reference to the drawings and specific embodiments.
The invention can be described in the general context of computer-executable instructions executed by a computer, such as program modules. Generally, program modules include routines, programs, objects, components, data structures and the like that perform particular tasks or implement particular abstract data types. The invention can also be practiced in distributed computing environments, in which tasks are performed by remote processing devices connected through a communication network. In a distributed computing environment, program modules can be located in both local and remote computer storage media, including storage devices.
Referring to Fig. 1, an embodiment of the method for the integrated release of internet information according to the invention is shown, which may specifically comprise the following steps:
Step 101: acquire internet new words.
Step 102: acquire description information about the attributes of the internet new words themselves.
Step 103: acquire service resources related to the internet new words.
Specifically, the service resources related to an internet new word can be acquired by querying service resource collections of various types for resources related to that word. The service resource types may include search services, desktop products, news, blogs, games, relationship networks, tags, aggregated content, online dictionaries and wireless value-added services. In short, steps 101, 102 and 103 establish the mapping relations among an internet new word, the description information about its own attributes, and the various service resources related to it.
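As an illustration only, the mapping relations established by steps 101 to 103 could be held in a structure like the following Python sketch. The class names, field names, sample values and the example.com link are assumptions made for this example and do not appear in the disclosure.

```python
from dataclasses import dataclass, field

@dataclass
class ServiceResource:
    kind: str    # service resource type, e.g. "news", "blog", "search"
    title: str
    link: str

@dataclass
class NewWordRecord:
    word: str
    description: dict = field(default_factory=dict)  # attribute name -> value
    resources: list = field(default_factory=list)    # ServiceResource items

def build_index(records):
    """Map each new word to its record so a later request can look it up directly."""
    return {record.word: record for record in records}

# Hypothetical sample data.
record = NewWordRecord(
    word="subprime mortgage",
    description={"definition": "(placeholder definition)"},
    resources=[ServiceResource("news", "Example headline", "https://example.com/news/1")],
)
index = build_index([record])
```

Keying the index by the word itself keeps the three kinds of data (word, description, resources) reachable from a single lookup.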
Step 104: display internet new words.
Specifically, internet new words can be displayed through a client application or an application plug-in, or through website pages.
For example, a new-word release homepage can be set up that displays the internet new words of the past two years, indexed by time, activity, category, pinyin initial and so on (the PULL mode). Each new word is mapped to the description information about its own attributes and to its related service resources. Through this homepage the user can browse or look up the new words of interest and then obtain the various related information for a word through step 105. The user no longer has to gather that information by manual searching, screening and analysis, which speeds up the spread of information.
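A minimal sketch of how such a homepage might build its indexes is given below, assuming each new word is stored with a first-seen date, an activity count and a category. All field names and sample values are hypothetical; an index by pinyin initial would additionally need a transliteration step that is not shown.

```python
from collections import defaultdict
from datetime import date

# Hypothetical entries; field names and values are illustrative only.
entries = [
    {"word": "roadshow", "first_seen": date(2007, 5, 1), "activity": 40, "category": "finance"},
    {"word": "subprime mortgage", "first_seen": date(2007, 8, 15), "activity": 95, "category": "finance"},
    {"word": "blog fight", "first_seen": date(2008, 1, 10), "activity": 60, "category": "internet"},
]

def group_by(entries, key):
    """Group words under each distinct value of `key` (for example, category)."""
    groups = defaultdict(list)
    for entry in entries:
        groups[entry[key]].append(entry["word"])
    return dict(groups)

def ordered_by(entries, key, descending=True):
    """Return the words sorted by `key` (for example, activity or first_seen)."""
    ranked = sorted(entries, key=lambda entry: entry[key], reverse=descending)
    return [entry["word"] for entry in ranked]

by_category = group_by(entries, "category")
by_activity = ordered_by(entries, "activity")
newest_first = ordered_by(entries, "first_seen")
```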
As another example, the internet new words of, say, the current month can be displayed through a system desktop pop-up bar, new-word update prompts in input method, instant chat or WDS software, or pushed aggregated-content (RSS) prompts (the PUSH mode). Because this approach is mainly used for new-word prompts and cannot display a large number of new words, it is generally used to recommend the latest new words, or the new words the user has subscribed to. It has the advantages of timely updates and a concise form. The user can glance at the displayed new words and decide whether to learn more about the information behind them; if so, clicking the link leads to step 105.
In addition to the new word itself, step 104 can, where space allows, display some brief description information, such as the tags or category of the new word.
RSS is briefly introduced below.
RSS (Really Simple Syndication) is a format for describing and syndicating website content, and is currently one of the most widely used XML applications. RSS readers used with the present invention fall roughly into the following two types:
The first type of RSS reader is an application running on the computer desktop. By subscribing to a new-word feed, it can update new-word prompts automatically and periodically. RSS readers for reading news already exist in the prior art, such as Awasu, FeedDemon and RSSReader, as well as Chinese-language readers such as Zhou Botong, Kantianxia and Boyue, so the implementation details are not repeated here.
The second type of RSS reader is embedded in an application already running on the computer. For example, the invention can embed RSS reader functions in input method, instant chat or WDS software, or in a browser. When background analysis yields new words that the user has subscribed to (such as new words in the entertainment field), or when the server-side new words are updated automatically in the default mode, a floating window pops up with the prompt.
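For illustration, a new-word feed that either type of reader could poll might be generated as follows. This is a sketch only: the element names follow RSS 2.0 (`channel`, `item`, `title`, `link`, `description`, `category`), while the function name, the sample words and the example.com URLs are assumptions.

```python
import xml.etree.ElementTree as ET

def build_feed(title, link, words):
    """Serialize a list of new words as a minimal RSS 2.0 document."""
    rss = ET.Element("rss", version="2.0")
    channel = ET.SubElement(rss, "channel")
    ET.SubElement(channel, "title").text = title
    ET.SubElement(channel, "link").text = link
    ET.SubElement(channel, "description").text = "Newly detected internet words"
    for entry in words:
        item = ET.SubElement(channel, "item")
        ET.SubElement(item, "title").text = entry["word"]
        ET.SubElement(item, "link").text = entry["link"]
        ET.SubElement(item, "category").text = entry["category"]
    return ET.tostring(rss, encoding="unicode")

# Hypothetical sample data.
feed_xml = build_feed(
    "New words",
    "https://example.com/newwords",
    [
        {"word": "roadshow", "link": "https://example.com/newwords/roadshow", "category": "finance"},
        {"word": "blog fight", "link": "https://example.com/newwords/blog-fight", "category": "internet"},
    ],
)
```

A reader that filters on the `category` element could restrict prompts to the fields a user has subscribed to, such as entertainment.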
Step 105: receive a user's information acquisition request for one internet new word, and release the description information about that word's own attributes together with the related service resources or links to the related service resources.
Generally, the further information released in step 105 can be shown in a separate new window or a new web page. If a related service resource can be released directly in that page, it is released directly, for example a search result list or an input window for customizing a service. If it cannot be released directly, a link to it is released instead, for example for related blogs, games or desktop products.
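The choice between releasing a resource directly and releasing only its link can be sketched as below. The set of directly releasable types, the dictionary layout and the sample data are assumptions for the example, not part of the disclosure.

```python
# Assumed: resource types that can be rendered inside the release page itself.
EMBEDDABLE = {"search", "custom_service"}

def publish(word, database):
    """Assemble the release page for one new word, or None if the word is unknown."""
    record = database.get(word)
    if record is None:
        return None
    page = {"word": word, "description": record["description"], "embedded": [], "links": []}
    for resource in record["resources"]:
        if resource["kind"] in EMBEDDABLE:
            page["embedded"].append(resource)      # released directly in the page
        else:
            page["links"].append(resource["link"])  # released as a link only
    return page

# Hypothetical sample data.
database = {
    "roadshow": {
        "description": {"definition": "(placeholder definition)"},
        "resources": [
            {"kind": "search", "title": "Search results", "link": "https://example.com/search?q=roadshow"},
            {"kind": "blog", "title": "A related blog", "link": "https://example.com/blog/1"},
        ],
    }
}
page = publish("roadshow", database)
```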
How step 101 acquires internet new words is briefly described below.
The invention can acquire new words in various ways, for example by any of the acquisition methods described in published literature, which are not detailed here. One possible implementation is as follows:
A. Obtain new-word candidates.
New-word candidates can likewise be obtained in various ways.
For example, an internet corpus can be collected, cleaned of noise and segmented into words. Each segmentation result is then looked up in a standard dictionary; if it is not in the standard dictionary, it is taken to be a new-word candidate.
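The dictionary check described above can be sketched as follows. Word segmentation and noise removal are assumed to have been done already by some other component; the function name and sample data are hypothetical.

```python
def find_candidates(tokens, standard_dictionary):
    """Return tokens absent from the standard dictionary, in first-seen order."""
    candidates = []
    seen = set()
    for token in tokens:
        if token in standard_dictionary or token in seen:
            continue
        seen.add(token)
        candidates.append(token)
    return candidates

# Stand-in for a real segmenter: the tokens are assumed to be already
# segmented and cleaned of noise.
tokens = ["the", "bank", "held", "a", "roadshow", "after", "the", "roadshow"]
standard_dictionary = {"the", "bank", "held", "a", "after"}
candidates = find_candidates(tokens, standard_dictionary)
```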
As another example, users' query words can be collected from query logs and screened to obtain a batch of query words whose query frequency meets a requirement. Each of these query words is then looked up in the standard dictionary; if it is not in the standard dictionary, it is taken to be a new-word candidate.
As a further example, the new words in input-method user lexicons (which, in general, are all absent from the standard dictionary) can be collected together with the number of times each has been entered. If the number of entries exceeds a certain threshold, the word is taken to be a new-word candidate.
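Both count-based routes, query logs and input-method user lexicons, reduce to counting terms and keeping those that clear a threshold and are missing from the standard dictionary. A minimal sketch, with a hypothetical function name, threshold and data:

```python
from collections import Counter

def frequent_unknown_terms(terms, standard_dictionary, min_count):
    """Count terms and keep those seen at least min_count times and not in the dictionary."""
    counts = Counter(terms)
    return {
        term: n
        for term, n in counts.items()
        if n >= min_count and term not in standard_dictionary
    }

# Hypothetical observations from a query log or a user lexicon.
observed = ["weather", "hongpa", "hongpa", "weather", "hongpa", "zzqx"]
result = frequent_unknown_terms(observed, {"weather"}, min_count=3)
```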
B. Screen the new-word candidates according to preset new-word features to obtain new words.
In a specific implementation, the new-word features relied on may differ because the factors considered in screening differ. Some of the new-word features that the invention may involve are given below.
In a preferred embodiment of the invention, the new-word features include frequency features, which comprise any one or any combination of the following three: the number of times the candidate appears as a query word in search engine logs; statistics on input-method users' use of the candidate; and statistical features of the candidate in web pages. Preferably, a candidate is determined to be a qualified new word only when all three meet certain conditions.
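A sketch of the frequency screening is shown below. The disclosure does not state the conditions, so the feature names and threshold values here are purely illustrative assumptions; the `required` argument models "any one or any combination" of the three features.

```python
# Hypothetical thresholds, one per frequency feature.
THRESHOLDS = {"query_log_count": 500, "ime_user_count": 200, "web_page_count": 1000}

def meets_frequency_features(stats, thresholds=THRESHOLDS, required=None):
    """Check the selected frequency features; by default all three must pass."""
    names = list(thresholds) if required is None else required
    return all(stats.get(name, 0) >= thresholds[name] for name in names)

# Hypothetical statistics for one candidate.
stats = {"query_log_count": 800, "ime_user_count": 150, "web_page_count": 4000}
passes_all = meets_frequency_features(stats)
passes_query_and_web = meets_frequency_features(
    stats, required=["query_log_count", "web_page_count"]
)
```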
Temporal features may also need to be considered; these comprise a sudden-appearance characteristic and a steadily-rising-usage characteristic. New words generally appear suddenly and then spread within a certain scope, so over a period of time their usage rises steadily. The invention can therefore use these temporal features to screen new words.
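One way to test the two temporal features on a series of daily usage counts is sketched below. The disclosure does not define the tests; the quiet-period length and the rising-share threshold are assumptions chosen for illustration.

```python
def appeared_suddenly(daily_counts, quiet_days=7, quiet_max=0):
    """True if usage first exceeds quiet_max only after at least quiet_days quiet days."""
    for day, count in enumerate(daily_counts):
        if count > quiet_max:
            return day >= quiet_days
    return False  # the word never appeared at all

def grows_steadily(daily_counts, min_rising_share=0.7):
    """True if, from the first active day on, most day-to-day changes are increases."""
    start = next((i for i, c in enumerate(daily_counts) if c > 0), None)
    if start is None:
        return False
    active = daily_counts[start:]
    steps = list(zip(active, active[1:]))
    if not steps:
        return False
    rising = sum(1 for before, after in steps if after > before)
    return rising / len(steps) >= min_rising_share

# Hypothetical series: a week of silence, then mostly rising usage.
new_word_counts = [0] * 7 + [3, 5, 9, 8, 14, 20]
old_word_counts = [10, 11, 10, 12, 11, 10, 12, 11]
```

With these settings the first series passes both tests, while the second fails both because it was already in use on the first day and its usage merely fluctuates.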
It should be noted that the new-word features of the invention may also involve grammatical features, information-science features, headline hits, spam-word hits and similar features, which are described in detail in the specific example later.
The description information obtained in step 102 is briefly described below.
The purpose of the description information about the internet new word's own attributes is to help the user understand the new word better. Because a word can be described from different angles, the description information can take various forms, as the following examples show.
In a preferred embodiment of the invention, the description information about the internet new word's own attributes includes a definition of the new word, obtained by mining web page information. For example, if the text immediately before or after the new word contains wording such as "refers to", "originates from", "definition" or "is", the whole sentence or whole paragraph can be extracted as the definition of the new word. For accuracy, the definition can also be corrected manually or uploaded by users.
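The cue-word extraction can be sketched as follows. English cue phrases stand in for the original Chinese ones, only cues that immediately follow the new word are checked, and sentences are split naively on end punctuation; all of these simplifications, and the names used, are assumptions for the example.

```python
import re

# Assumed English stand-ins for the cue words.
CUES = ("refers to", "is defined as", "originates from", "means")

def extract_definitions(word, text):
    """Return sentences in which the new word is immediately followed by a cue phrase."""
    sentences = re.split(r"(?<=[.!?])\s+", text)
    found = []
    for sentence in sentences:
        lowered = sentence.lower()
        position = lowered.find(word.lower())
        if position == -1:
            continue
        tail = lowered[position + len(word):].lstrip()
        if tail.startswith(CUES):
            found.append(sentence.strip())
    return found

# Hypothetical page text.
page_text = (
    "Many people mention roadshow events. "
    "A roadshow refers to a series of promotional presentations. "
    "It was fun."
)
definitions = extract_definitions("roadshow", page_text)
```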
In another preferred embodiment of the invention, the description information about the internet new word's own attributes also includes a statistical trend chart or a rating of the number of occurrences of the new word along some dimension over a period of time. The dimensions include web page information, query logs, user input and user clicks on the new word.
For example, with time on the X axis and the number of user queries on the Y axis, the resulting trend chart shows how the new word's usage has changed over a period of time along the search query dimension. User clicks on the new word are a user feedback dimension of the invention: once the invention is in use, the number or frequency of clicks by users to view a new word's related information reflects, to some extent, how much attention users pay to that word, so this feedback can also be recorded in the attribute description information for the new word.
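The data behind such a trend chart is a count per day along one dimension. A minimal sketch that turns raw event dates (for example, the dates of queries containing the word) into a gap-free daily series; the function name and sample dates are hypothetical, and the plotting step is omitted.

```python
from collections import Counter
from datetime import date, timedelta

def daily_series(event_dates, start, end):
    """Return (day, count) pairs for every day from start to end inclusive."""
    counts = Counter(event_dates)
    total_days = (end - start).days + 1
    series = []
    for offset in range(total_days):
        day = start + timedelta(days=offset)
        series.append((day, counts.get(day, 0)))  # days without events count as 0
    return series

# Hypothetical query dates for one new word.
events = [date(2008, 4, 1), date(2008, 4, 1), date(2008, 4, 3)]
series = daily_series(events, date(2008, 4, 1), date(2008, 4, 3))
```

The first element of each pair supplies the X axis and the second the Y axis.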
To show the new word's attributes under each dimension more intuitively, ratings can also be used, for example: dimension A (8 points); dimension B (9 points); dimension C (8.5 points). The user can then see at a glance how the new word stands on every dimension at once.
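The disclosure does not say how the points are derived. One simple possibility, shown here purely as an assumption, is to scale each dimension's occurrence count against a fixed ceiling and cap the result at 10 points.

```python
def rating(count, ceiling, scale=10):
    """Map an occurrence count to a 0..scale score, capped at the ceiling."""
    if ceiling <= 0:
        raise ValueError("ceiling must be positive")
    return round(scale * min(count / ceiling, 1.0), 1)

# Hypothetical occurrence counts per dimension, with an assumed ceiling of 1000.
counts = {"web pages": 800, "query log": 900, "user input": 850}
scores = {dimension: rating(n, ceiling=1000) for dimension, n in counts.items()}
```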
In another preferred embodiment of the invention, the description information about the internet new word's own attributes also includes the correct code string for the new word under a specific input method, obtained by collecting and analyzing the code strings that users enter. For an input-method user base with a very large sample size, the correct code string is generally entered far more often than any incorrect one, so the correct code string for a new word can be obtained by collecting and analyzing what users enter.
For example, a user may not know the correct code string for a new word when typing it on a keyboard (say, when exchanging messages with other users); with a pinyin input method, for instance, the user may not know how the word is pronounced. Displaying this attribute helps the user enter the word correctly. This attribute is mainly intended for languages not written in the Latin script, such as Chinese, Japanese and Korean; if the invention is applied to a language written in the Latin script, the attribute need not be used.
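Since the correct code string is expected to dominate in a large sample, it can be picked by frequency. The sketch below takes the most common observed string and accepts it only if it accounts for a minimum share of the observations; the share threshold and the sample pinyin strings are assumptions.

```python
from collections import Counter

def most_likely_code(observed_codes, min_share=0.5):
    """Return the dominant code string, or None if no string is dominant enough."""
    if not observed_codes:
        return None
    code, count = Counter(observed_codes).most_common(1)[0]
    if count / len(observed_codes) >= min_share:
        return code
    return None

# Hypothetical code strings typed by users for one new word.
observed = ["hongpa"] * 8 + ["hongba"] * 2
best = most_likely_code(observed)
```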
With reference to Fig. 2, show the system embodiment of a kind of internet information integrate release of the present invention, specifically can comprise with lower member:
Neologisms acquiring unit 201 is used to obtain internet new words;
Descriptor acquiring unit 202 is used to obtain the descriptor at the internet new words self attributes;
Service Source acquiring unit 203 is used to obtain the Service Source relevant with internet new words; The described Service Source relevant with internet new words can obtain by inquiring about in various types of Service Source set according to corresponding neologisms; Described Service Source type can comprise search service, Desktop Product, news, blog, recreation, relational network, label, aggregated content, online dictionary or wireless value-added service etc.;
Neologisms represent unit 204, are used to represent internet new words; Can adopt user side application program or application plug to represent internet new words; Perhaps, adopt the Website page mode to represent internet new words; Be that the present invention promptly can the application server end pushes the mode of (push), also can adopt the situation (pull pattern) of user's active inquiry;
Release unit 205 is used to receive the information acquisition request of user at an internet new words, and issue is at the descriptor of this internet new words self attributes, and the link of related service resource or related service resource.
In another preferred embodiment of the present invention, the described unit 201 that is used to obtain internet new words further comprises: the neologisms candidate unit is used to obtain the neologisms candidate; The screening unit is used for according to presetting the neologisms feature described neologisms candidate being screened, and obtains neologisms.Wherein, described neologisms feature can comprise frequecy characteristic, and described frequecy characteristic comprises: these neologisms candidate uses these neologisms candidate's situation statistics, any one or combination in any among the statistical nature three of these neologisms candidate in webpage as counting, the input method user of query word in search engine logs.Preferably, described neologisms feature can also comprise temporal characteristics, and described temporal characteristics comprises that characteristic and the utilization rate characteristic that grows steadily appears in burst.
In another preferred embodiment of the invention, the description information about the internet new word's own attributes comprises a definition of the new word, obtained by mining web page information. Further, the description information may also comprise a statistical trend graph or evaluation parameters for the number of occurrences of the new word on a given dimension over a period of time; the dimensions include web page information, query logs, user input, and user clicks on the new word. In some cases, the description information may also comprise the correct coding string of the new word for a specific input method; the correct coding string is obtained by collecting and analyzing the coding strings that users actually input.
Because the system embodiment shown in Fig. 2 corresponds to the method embodiment shown in Fig. 1, for anything not described in detail here, refer to the description of the foregoing method embodiment.
Referring to Fig. 3, a preferred specific implementation of an integrated internet information release system according to the invention is shown. It may comprise the following components:
Directed corpus crawling module 301: obtains text data from targeted sources, for example internet information sources in which new words are likely to appear. Specifically, these may include web corpora such as web pages, news, forums, and blogs; user query logs; input method user dictionaries; samples of user voice chat (which require speech-to-text conversion); chat record corpora; and so on. It should be noted that crawling that involves user data must not touch the privacy of any specific user.
The directed corpus crawling module 301 can obtain the corpus by using a focused crawler, or by reading from storage servers that hold anonymized data (such as input method user dictionaries and chat records). For focused crawling, the sites can be chosen by specifying particular websites, or filtered according to the classification of the crawled page content. Because this is not the focus of the invention, it is not described in detail here.
Data cleaning and preprocessing module 302: removes format information, interfering information, and other data unrelated to new word discovery (that is, noise) from the raw corpus crawled by module 301. For example, it strips the HTML tags from web pages, filters out invalid page content, and removes noise from voice chat records, in preparation for generating new word candidates.
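As an illustration of the tag-stripping step, the following sketch uses Python's standard `html.parser` module. The class and function names are hypothetical, and treating `script` and `style` content as noise is an assumption; the text only says that HTML tags and invalid content are removed.

```python
from html.parser import HTMLParser

class _TextExtractor(HTMLParser):
    """Collects visible text and skips script/style content."""

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth:
            self.parts.append(data)

def clean_html(raw):
    """Strip tags from `raw` and collapse runs of whitespace."""
    parser = _TextExtractor()
    parser.feed(raw)
    parser.close()
    return " ".join(" ".join(parser.parts).split())
```

The cleaned text would then be passed on to candidate generation; a production system would also need encoding detection and boilerplate removal, which this sketch omits.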
New word candidate generation module 303: generates new word candidates according to certain rules and methods, so that modules 304 and 305 can carry out verification. In effect, module 303 performs a rough selection of new words. Several feasible ways of obtaining candidates were introduced in the foregoing embodiments and are not repeated here.
Automatic new word verification module 304. This module performs further filtering after the candidate generation module 303, picking the higher-quality entries from the candidates and outputting them as new words. The module can set decision rules according to the needs of the actual application in order to screen and verify new words, for example using rule-based or statistics-based discrimination. New words confirmed by module 304 can be handed over to new word data storage module 306 for management. The new word features used by module 304 include, but are not limited to, frequency, temporal distribution characteristics, grammar rules, contextual keywords, and the breadth and frequency of user use. See Table 1; details follow.
Table 1: New word verification template
(Table 1 is reproduced as image A20081010506500171 in the original publication.)
Specifically, for the search engine query count within the frequency feature dimension, a preferred implementation considers the number of times the candidate occurs in the query log and also the number of times the candidate is used on its own as a complete query string; the latter increases the likelihood that the candidate is a genuine new word.
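The two query-log counts described above can be sketched as follows. The function names and the `standalone_bonus` weighting are assumptions for illustration; the text states only that standalone use raises the likelihood that a candidate is a new word, not how the two counts are combined.

```python
def query_log_counts(candidate, queries):
    """Return (queries containing the candidate, queries equal to it)."""
    contained = sum(1 for q in queries if candidate in q)
    standalone = sum(1 for q in queries if q.strip() == candidate)
    return contained, standalone

def query_score(candidate, queries, standalone_bonus=2.0):
    """Combine both counts; standalone queries earn an extra bonus
    (hypothetical weighting)."""
    contained, standalone = query_log_counts(candidate, queries)
    return contained + standalone_bonus * standalone
```

Substring matching is used here for simplicity; a real system working on Chinese text would more likely match against segmented queries.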
As another example, for the user dictionary statistics (from an input method or similar) within the frequency feature dimension, a preferred implementation considers the number of times users input the candidate, the regions in which users use it, and the absolute frequency of user input, so as to reduce screening bias as far as possible.
As a further example, for the web page statistics within the frequency feature dimension, a preferred implementation needs to consider whether the candidate appears more often in categorized corpora such as forums and blogs or more often in general web pages, because different page categories carry different statistical weights. It also needs to consider whether to distinguish the candidate's corpus sources by time, for example by giving a higher new word weight to recently crawled pages.
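A minimal sketch of category-weighted and recency-weighted page statistics follows. The weight values, the 30-day recency window, and the names `CATEGORY_WEIGHTS` and `web_page_score` are illustrative assumptions; the text specifies only that categories are weighted differently and that recent pages may be weighted higher.

```python
from datetime import date

# Hypothetical weights: forum and blog hits count more than general pages.
CATEGORY_WEIGHTS = {"forum": 1.5, "blog": 1.5, "general": 1.0}

def web_page_score(hits, today, recent_days=30, recent_boost=2.0):
    """Score a candidate from its page hits.

    `hits` is an iterable of (category, crawl_date) pairs, one per page
    in which the candidate appears.
    """
    score = 0.0
    for category, crawled in hits:
        weight = CATEGORY_WEIGHTS.get(category, 1.0)
        if (today - crawled).days <= recent_days:
            weight *= recent_boost  # recently crawled pages weigh more
        score += weight
    return score
```

For example, one recent forum hit and one old general-page hit would score 1.5 × 2.0 + 1.0 = 4.0 under these assumed weights.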
In short, verifying a new word candidate is a process of weighing many features together, because each candidate may simultaneously have several features that count for, and several that count against, its being judged a new word. The strategy can be rule-based or based on statistical discrimination, and it determines the weight of each new word feature. If necessary, manual intervention can be added to the verification process to improve processing efficiency and quality.
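One simple way to weigh favourable and unfavourable features together is a weighted sum compared against a threshold. This is a sketch of that idea only, not the method claimed in the text; the feature names, weights, and threshold below are hypothetical.

```python
def verify_candidate(features, weights, threshold):
    """Return (accepted, score) for a candidate.

    `features` maps feature names to values; `weights` maps the same
    names to weights, where a negative weight marks a feature that
    counts against the candidate. Unknown features get weight 0.
    """
    score = sum(weights.get(name, 0.0) * value
                for name, value in features.items())
    return score >= threshold, score
```

A usage example: with features `{"query_freq": 0.8, "burst": 1.0, "grammar_violation": 1.0}` and weights `{"query_freq": 0.5, "burst": 0.4, "grammar_violation": -0.3}`, the grammar violation lowers the score but the candidate still passes a threshold of 0.45. In a statistical variant the weights could be learned from manually verified examples.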
Manual verification module 305 (optional). Human participation in verification can considerably improve the precision of new word discovery, and can feed back to the automatic verification module 304 to improve its pattern rules. Given the massive volume of internet information, however, the main work still has to rely on module 304.
New word data storage module 306. This module stores and organizes new word data for the practical application, including functions such as storage, distribution, and backup. Specifically, module 306 can store and organize the confirmed new words. Preferably, it can also store the attribute description information and the corresponding service resource information obtained for each new word from description information acquisition module 307 and service resource acquisition module 308, so that they can be retrieved later when presenting to the user.
Description information acquisition module 307 can obtain the various kinds of description information about a new word's own attributes using the implementations described above, which are not repeated here.
Service resource acquisition module 308 can likewise obtain the various service resources that may be offered for a new word using the implementations described above, which are not repeated here.
A new word on its own is often obscure and hard for ordinary users to understand. Modules 307 and 308 are therefore needed to process and integrate additional information that users can understand and service resources that they may need, such as the definition, origin, example sentences, category, active period, and spelling of the new word, and a list of web pages in which it has appeared. The user can then be offered an integrated service interface that is centered on the new word and brings together the related information, with the inner links among these pieces of information mined out for the user, which improves the user's efficiency in acquiring information.
New word internal application module 309. The new word data stored in module 306 can also be applied to internal services (as opposed to external services that face the user directly), serving various internal applications. Examples include the dictionary resources used by a search engine's word segmentation module or used when judging new word candidates; adding new words can improve the results of these internal applications.
Client-side new word update prompt module 310. In this example, new word data collected by the back end is delivered to the user as update prompts through periodic synchronization, prompting the user to click through to the new word presentation modules 311-313. Preferably, the update prompt module 310 can be embedded in an input method application, so that an update prompt is shown to the user when new words are added to the input method dictionary.
Centralized new word presentation module 311. By operating on the update prompt module 310, the user triggers module 311, which is responsible for presenting the detailed information related to a new word. This module mainly provides the description information about the new word's own attributes, and can also provide related service resources or links to them.
Online dictionary / wiki dictionary module 312. The user can trigger module 312 through the service resources or links shown by the centralized presentation module 311. Module 312 can present new words in the manner of a wiki or encyclopedic dictionary; users can promptly give feedback on and correct errors or omissions in an entry, and can create custom tags, making it easier for users with the same interests to connect.
In one embodiment of the invention, part of the functionality of the online dictionary / wiki dictionary module 312 can be integrated directly into the presentation interface of the centralized presentation module 311, so that the user can invoke it directly.
Dedicated service resource presentation module 313. For example, the centralized presentation module 311 can directly offer related news, personal home pages, blog lists, and so on, and the dedicated service resource presentation module 313 then provides the corresponding service to the user.
Search results page presentation module 314. Working on the same principle as module 313, module 314 provides the search results page for search-type new word queries forwarded from the centralized presentation module 311. Preferably, the search services involved can be web search or specialized searches such as music, picture, video, or map search. For example, for the name of a new building, providing map search lets the user easily find where the building is.
User account module 315. In this example, the user may switch between several services. The user account module 315 can provide the user with a single identity across the services, so that one-stop service is possible without repeated authentication. For example, through this module the user can also customize or cancel the new word discovery service in the user-defined settings, and can promptly submit feedback for service improvement, raising the quality of the corresponding services.
A typical application scenario of the invention is given below:
1. User A opens a chat window and activates the input method software, ready to type. The input method's network monitoring program (a monitoring program resident in the background, often called a daemon) calls a system interface to receive update information from a remote monitoring server. The remote monitoring server sends a request to the new word server, asking for the new words within a fixed time period. The result returned by the new word server is not empty, so the input method monitoring program learns that there are new word updates; it then downloads the new word list and related information through the remote monitoring server and organizes the content to be displayed.
2. The input method software pops up a new word update prompt (see Fig. 4) in a conspicuous position (for example, the lower right corner of the desktop), showing the list of new words updated this week, their categories, and settings, with links that the user can click. Fig. 4 shows this week's recommended new words together with information such as the tags of each new word, so that the user can roughly judge whether a given new word is worth looking into; Fig. 4 also shows the update prompt for the RSS service that the user customized earlier. User A glances over the list and is very interested in the new word "hongpa" (轰趴), but does not know what it means, so he clicks it.
3. The user clicks "hongpa", which opens the browser and jumps to the new word presentation page. On this page the user can see the following for this entry:
A) Definition: "'Hongpa' is in fact a Chinese phonetic abbreviation of the English 'home party'; its actual meaning is a privately held house party. It originated in the United States, has been very popular in Taiwan in recent years, and is gradually spreading to the mainland."
B) Entry usage statistics: in the anonymous input method dictionary statistics, 170 users entered the word a total of 295 times over the last 100 days; in the search engine query log statistics, 233 queries contain "hongpa"; the time attribute of the query term "hongpa" shows a sudden increase three months ago, forming a peak, followed by a leveling off; the statistics have accumulated to a significant level in recent days, so the back-end new word discovery process identified this new word.
4. Wanting to know more, User A clicks the "related news" link on the new word presentation page and browses the news related to "hongpa". He finds that most items stop abruptly after giving a definition, and none covers what he is looking for, such as how hongpa events are run or what other netizens have learned from holding them.
User A also notices that the new word presentation page indicates that the wiki dictionary does not yet include this entry, so he creates the entry from the information he has gathered and adds a web link that he found useful.
Further, because he is dissatisfied with the search results, User A opens the feedback window on the new word presentation page and writes down his suggestions.
Further, through the customization service offered on the new word presentation page, User A subscribes to an RSS feed of the search results for the query term "hongpa", so that he is notified promptly whenever the search results are updated.
Preferably, the input method, the new word presentation page, search, the wiki, and the other services may all come from the same service provider, and the user has already logged in with a unified passport account when using the input method, which saves the trouble of logging in to each service separately.
5. After receiving this user's feedback, the search system can promptly improve the search results by automatic or manual means.
6. A few days later, the search system has crawled new web pages related to "hongpa". The polling mechanism of the subscription management server finds that the search results page for the query term "hongpa" has been updated, promptly notifies all users who subscribed to updates for that query term (including User A), and pops up a desktop window telling them that their RSS subscription has an update (see Fig. 4).
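The update check in step 1 of the scenario, where the client asks for new words within a fixed time period and acts only on a non-empty result, can be sketched as below. The in-memory `NewWordServer` class, its method names, and the seven-day window are stand-ins assumed for illustration; the text does not describe the actual protocol between the monitoring server and the new word server.

```python
from datetime import datetime, timedelta

class NewWordServer:
    """In-memory stand-in for the new word server (hypothetical API)."""

    def __init__(self):
        self._entries = []  # list of (word, added_at)

    def add(self, word, added_at):
        self._entries.append((word, added_at))

    def words_since(self, since):
        """Return the words added at or after `since`."""
        return [w for w, t in self._entries if t >= since]

def check_for_updates(server, now, window=timedelta(days=7)):
    """Return the new words from the last `window`, or None if the
    result is empty (meaning there is nothing to prompt the user about)."""
    words = server.words_since(now - window)
    return words if words else None
```

A client daemon would call `check_for_updates` on a timer and show the pop-up prompt only when the return value is not None. The RSS polling in step 6 follows the same pattern, with search result pages in place of the new word list.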
Referring to Fig. 5, a device embodiment for integrated release of internet information according to the invention is shown. It may comprise the following components:
New word information database 501, which stores internet new words, the description information about each new word's own attributes, the service resource information related to each new word, and the mapping relations among the three;
Interface module 502, which presents internet new words and receives a user's information acquisition request for an internet new word. Interface module 502 can present internet new words through a client application or an application plug-in, or through a website page;
Release module 503, which, on receiving an information acquisition request for an internet new word, retrieves from the new word information database and releases the description information about that new word's own attributes together with the related service resources or links to them. The service resources related to an internet new word are obtained by querying various types of service resource collections with the corresponding new word; the service resource types include search services, desktop products, news, blogs, games, social networks, tags, aggregated content, online dictionaries, and wireless value-added services.
The description information about the internet new word's own attributes may comprise a definition of the new word, obtained by mining web page information. Preferably, the description information may also comprise a statistical trend graph or evaluation parameters for the number of occurrences of the new word on a given dimension over a period of time; the dimensions include web page information, query logs, user input, and user clicks on the new word. In some cases, the description information may also comprise the correct coding string of the new word for a specific input method; the correct coding string is obtained by collecting and analyzing the coding strings that users actually input.
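The relationship between the new word information database and the release module can be illustrated with a small in-memory sketch. The record fields, class names, and the shape of the released result are assumptions; the text specifies only that the database holds the new word, its description information, its related service resources, and the mappings among them.

```python
from dataclasses import dataclass, field

@dataclass
class NewWordRecord:
    """One new word with its description and service links (hypothetical schema)."""
    word: str
    description: dict = field(default_factory=dict)    # e.g. definition, spelling
    service_links: dict = field(default_factory=dict)  # e.g. news, blog, search URLs

class NewWordDatabase:
    """Maps each new word to its description and service resources."""

    def __init__(self):
        self._records = {}

    def put(self, record):
        self._records[record.word] = record

    def get(self, word):
        return self._records.get(word)

def release(db, word):
    """Handle an information acquisition request for `word`.

    Returns the description information and service links, or None if
    the word is not in the database.
    """
    record = db.get(word)
    if record is None:
        return None
    return {
        "word": record.word,
        "description": dict(record.description),
        "services": dict(record.service_links),
    }
```

In this sketch the interface module would call `release` when the user clicks a new word and render the returned dictionary as the columns of the presentation page.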
Referring to Fig. 6, a schematic is shown of interface 600, through which release module 503 releases the information related to the specific new word "Wang Yuexin" (王栎鑫). The interface schematic comprises the following columns:
Definition column 601. This column can directly give the definition of the new word "Wang Yuexin" obtained by mining web page information. Alternatively, the definition can be taken only from the wiki dictionary: if the new word has a definition in the wiki dictionary, the definition column provides the wiki link; otherwise it tells the user that the entry can be added to the wiki and provides a link for editing the definition.
Selection basis column 602. The back-end system evaluates the new word "Wang Yuexin" on each dimension and gives a specific score, so this column presents the basis for selecting the word in a user-friendly, rating-like form.
Trend graph column 603. This column shows the number of times the new word "Wang Yuexin" appears in user input, queries, or media reports; each dimension can correspond to one curve. The schematic embeds an illustrative trend graph of "Wang Yuexin" as a query term.
Spelling column 604. This column shows the proportions in which users spell Wang Yuexin as "wangyuexin" and as "wanglixin"; in general, the spelling that users input most frequently is the correct one. Exceptions require manual intervention. The word "Zhang Baizhi" is one such exception: in the user input statistics, its incorrect spelling (zhangbozhi) is more frequent than its correct spelling (zhangbaizhi).
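The spelling statistics and the manual exception handling described above can be sketched as follows. The function names and the `manual_overrides` dictionary are hypothetical; the text says only that the most frequent spelling is normally taken as correct and that exceptions are handled by manual intervention.

```python
from collections import Counter

def spelling_shares(inputs):
    """Return the share of each spelling among the users' inputs."""
    counts = Counter(inputs)
    total = sum(counts.values())
    return {spelling: n / total for spelling, n in counts.items()}

def correct_spelling(word, inputs, manual_overrides=None):
    """Pick the most frequent spelling unless a manual override exists."""
    if manual_overrides and word in manual_overrides:
        return manual_overrides[word]  # manual intervention wins
    counts = Counter(inputs)
    if not counts:
        return None
    return counts.most_common(1)[0][0]
```

For the "Zhang Baizhi" case, passing `manual_overrides={"Zhang Baizhi": "zhangbaizhi"}` returns the correct spelling even when "zhangbozhi" is the more frequent input.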
Usage column 605, showing how the word is used in user input and queries. According to the statistics, the new word "Wang Yuexin" has no usage information worth presenting, so in the actual interface this column should be empty, or be used to display other column information. For clarity of explanation, however, in the interface schematic of Fig. 6 this column is illustrated with the usage information of a different new word, "leiben" (泪奔), and gives statistics such as the frequency and the number of users for each usage of "leiben".
The five interface columns above all present description information about the new word "Wang Yuexin" itself. The following columns present related service resource information.
Blog column 606. The new word "Wang Yuexin" belongs to the "star" tag. This column provides the link to the related blog blog.sina.com.cn/wangyuexin911 and can show the blog's recent updates. The blog may be Wang Yuexin's own, or it may belong to one of his fans.
Web search results column 607. In the schematic of Fig. 6, this column provides a miniature result list (the top 3), including title, link, and summary. Personalized search results can of course also be provided according to the user's personalization information.
New word submission interface 608. This interface lets users actively submit new words, for example the names of other "kuainan" (short for singers taking part in the entertainment show "Happy Male Voice"), such as Yu Haoming, Su Xing, and Ji Jie.
User account information column 609. This column displays the user's identity, and makes it convenient to collect user information and implement unified service login.
The above is only one schematic interface structure of the invention; the interface may also include an RSS subscription column, a new word information management column, and so on. It should be noted that the actual layout and column content of the presentation page can be arranged freely, and may also be personalized for different users. Each column module can be dragged and resized, and can even be added or removed according to the user's preferences. Moreover, because different new words carry different tags, the types of column modules displayed can also differ, although the trend graph, the entry definition, and the selection basis are generally necessary. For example, an entry made up entirely of characters with a single pronunciation will not be given the "polyphonic character" tag, and so will not have a "spelling" column.
In summary, as internet information develops rapidly, new concepts, hot topics, and public figures emerge in an endless stream. These new concepts and new things become topics of everyday conversation, such as "Zhou Laohu", "subprime mortgage", and "the Water Cube". Because they spread by word of mouth, they are often abbreviated. In some specialized fields in particular, such terms are incomprehensible to ordinary people, for example "state's war", "asking group", "method difficult to understand", and "Hui Lan" in online games; other new words, such as "blog fight" and "rod rod halls", look as if they contain miswritten characters.
On the other hand, while traditional internet services keep improving their performance, new and more user-friendly applications also keep appearing. How to increase the speed and reach with which these new application services spread to each user is a problem that needs to be solved as soon as possible.
The invention uses new words as the medium for integrating information from all sides. It can give the user various kinds of description information about a new word, helping the user understand it, and can also offer the user new applications closely related to the new word, so that as many of the user's needs as possible are met through one integrated interface and the user can find the information resources and service resources for a given new word more efficiently. With the invention, the user can quickly grasp internet trends and obtain information of interest in time; the service provider can accelerate the integration of its own resources and services, increase the exposure of its products to users, potentially improve user stickiness, and gain more business opportunities and revenue.
The embodiments in this specification are described progressively. Each embodiment focuses on its differences from the others; for the parts that are the same or similar, the embodiments can be referred to one another.
The method, device, and system for integrated release of internet information provided by the invention have been described in detail above. Specific examples have been used to explain the principle and implementations of the invention, and the description of the above embodiments is intended only to help in understanding the method of the invention and its core idea. At the same time, a person of ordinary skill in the art may, following the idea of the invention, make changes to the specific implementations and the scope of application. In summary, the contents of this specification should not be construed as limiting the invention.

Claims (24)

1. A method for integrated release of internet information, characterized by comprising:
obtaining internet new words;
obtaining description information about the internet new words' own attributes;
obtaining service resources related to the internet new words;
presenting the internet new words;
receiving a user's information acquisition request for an internet new word, and releasing the description information about that internet new word's own attributes, together with the related service resources or links to the related service resources.
2. The method of claim 1, characterized in that internet new words are obtained as follows:
obtaining new word candidates;
screening the new word candidates according to preset new word features to obtain new words.
3. The method of claim 2, characterized in that the new word features comprise frequency features, the frequency features comprising any one or any combination of the following three: the count of the new word candidate as a query term in search engine logs, statistics on how input method users use the new word candidate, and statistical features of the new word candidate in web pages.
4. The method of claim 2 or 3, characterized in that the new word features comprise temporal features, the temporal features comprising a burst-appearance characteristic and a steadily-rising-usage characteristic.
5. The method of claim 1, characterized in that the service resources related to the internet new words are obtained as follows:
querying various types of service resource collections to obtain the service resources related to the corresponding internet new word; the service resource types comprise search services, desktop products, news, blogs, games, social networks, tags, aggregated content, online dictionaries, or wireless value-added services.
6. The method of claim 1, characterized in that
the internet new words are presented through a client application or an application plug-in;
or, the internet new words are presented through a website page.
7. The method of claim 1, characterized in that the description information about the internet new words' own attributes comprises a new word definition; the new word definition is obtained by mining web page information.
8. The method of claim 1, characterized in that
the description information about the internet new words' own attributes comprises a statistical trend graph or evaluation parameters for the number of occurrences of the corresponding new word on a given dimension over a period of time; the dimensions comprise web page information, query logs, user input, or user clicks on the new word.
9. The method of claim 1, characterized in that
the description information about the internet new words' own attributes comprises the correct coding string of the corresponding new word for a specific input method; the correct coding string is obtained by collecting and analyzing the coding strings input by users.
10, a kind of system of internet information integrate release is characterized in that, comprising:
Be used to obtain the unit of internet new words;
Be used to obtain unit at the descriptor of internet new words self attributes;
Be used to obtain the unit of the Service Source relevant with internet new words;
Be used to represent the unit of internet new words;
Release unit is used to receive the information acquisition request of user at an internet new words, and issue is at the descriptor of this internet new words self attributes, and the link of related service resource or related service resource.
11, system as claimed in claim 10 is characterized in that, the described unit that is used to obtain internet new words further comprises:
The neologisms candidate unit is used to obtain the neologisms candidate;
The screening unit is used for according to presetting the neologisms feature described neologisms candidate being screened, and obtains neologisms.
12, system as claimed in claim 11, it is characterized in that, described neologisms feature comprises frequecy characteristic, and described frequecy characteristic comprises: these neologisms candidate uses these neologisms candidate's situation statistics, any one or combination in any among the statistical nature three of these neologisms candidate in webpage as counting, the input method user of query word in search engine logs.
As claim 11 or 12 described systems, it is characterized in that 13, described neologisms feature comprises temporal characteristics, described temporal characteristics comprises that characteristic and the utilization rate characteristic that grows steadily appears in burst.
14, system as claimed in claim 10 is characterized in that, the described Service Source relevant with internet new words obtains by inquiring about in various types of Service Source set according to corresponding neologisms; Described Service Source type comprises search service, Desktop Product, news, blog, recreation, relational network, label, aggregated content, online dictionary or wireless value-added service.
15, system as claimed in claim 10 is characterized in that, the described unit that is used to represent internet new words adopts user side application program or application plug to represent internet new words; Perhaps, adopt the Website page mode to represent internet new words.
16, system as claimed in claim 10 is characterized in that, described descriptor at the internet new words self attributes comprises the neologisms definition; Described neologisms definition is obtained by info web is excavated.
17, system as claimed in claim 10 is characterized in that,
Described descriptor at the internet new words self attributes comprises corresponding neologisms in a period of time, the statistical trend graph or the evaluating of occurrence number on certain dimension; Described dimension comprises info web, inquiry log, user's input or the click of user's neologisms.
18. The system as claimed in claim 10, characterized in that
the description information about the internet new word's own attributes comprises the correct code string of the corresponding new word for a specific input method; the correct code string is obtained by collecting and analyzing the code strings that users input.
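Claim 18 does not say how the collected input is analyzed. One plausible reading, sketched below purely as an assumption, is to take the code string that users most often typed before committing a given word. The record format `(code_string, committed_word)` and the pinyin examples are hypothetical.

```python
from collections import Counter, defaultdict


def correct_code_strings(input_records):
    """Map each word to the code string users most frequently typed for it.

    input_records is an iterable of (code_string, committed_word) pairs.
    """
    by_word = defaultdict(Counter)
    for code, word in input_records:
        by_word[word][code] += 1
    return {word: codes.most_common(1)[0][0] for word, codes in by_word.items()}


# Hypothetical records from a pinyin input method, including one mistyped code.
records = [
    ("jiong", "囧"),
    ("jiong", "囧"),
    ("jong", "囧"),
    ("leiren", "雷人"),
]
```

Majority voting tolerates occasional typos such as `jong`, but it would need a tie-breaking rule and a minimum sample size before being relied on.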
19. A device for integral release of internet information, characterized by comprising:
a new-word information database, used to store internet new words, description information about the internet new words' own attributes, service resource information related to the internet new words, and the mapping relations among the three;
an interface module, used to exhibit internet new words and to receive a user's information acquisition request for an internet new word;
a release module, used to obtain from the new-word information database, upon receiving an information acquisition request for an internet new word, the description information about that internet new word's own attributes and the related service resources or links to the related service resources, and to release them.
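The three components of claim 19 can be sketched as three small classes. This is an in-memory illustration under stated assumptions: the class and method names are invented here, and a real device would back the database with persistent storage.

```python
class NewWordDatabase:
    """Stores each new word with its description and related service resources."""

    def __init__(self):
        self._records = {}

    def store(self, word, description, resources):
        # The dict entry is the mapping relation among the three stored items.
        self._records[word] = {"description": description, "resources": resources}

    def words(self):
        return list(self._records)

    def lookup(self, word):
        return self._records.get(word)


class ReleaseModule:
    """Fetches and releases the information for one requested new word."""

    def __init__(self, database):
        self.database = database

    def release(self, word):
        record = self.database.lookup(word)
        if record is None:
            return None
        return {"word": word, **record}


class InterfaceModule:
    """Exhibits new words and forwards a user's information acquisition request."""

    def __init__(self, database, release_module):
        self.database = database
        self.release_module = release_module

    def exhibit(self):
        return self.database.words()

    def request(self, word):
        return self.release_module.release(word)
```

Keeping the release step separate from the interface mirrors the claim: the same database and release module could sit behind a client plug-in or a website page.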
20. The device as claimed in claim 19, characterized in that the service resources related to the internet new words are obtained by querying service resource sets of various types according to the corresponding new word; the service resource types comprise search services, desktop products, news, blogs, games, relationship networks, tags, aggregated content, online dictionaries, or wireless value-added services.
21. The device as claimed in claim 19, characterized in that the interface module exhibits internet new words by means of a client-side application or an application plug-in, or by means of a website page.
22. The device as claimed in claim 19, characterized in that the description information about the internet new word's own attributes comprises a new-word definition; the new-word definition is obtained by mining web page information.
23. The device as claimed in claim 19, characterized in that
the description information about the internet new word's own attributes comprises a statistical trend graph or an evaluation parameter of the number of occurrences of the corresponding new word on a given dimension over a period of time; the dimension comprises web page information, query logs, user input, or user clicks on the new word.
24. The device as claimed in claim 19, characterized in that
the description information about the internet new word's own attributes comprises the correct code string of the corresponding new word for a specific input method; the correct code string is obtained by collecting and analyzing the code strings that users input.
CNA2008101050657A 2008-04-25 2008-04-25 Method and system for integral release of internet information Pending CN101566995A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNA2008101050657A CN101566995A (en) 2008-04-25 2008-04-25 Method and system for integral release of internet information

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNA2008101050657A CN101566995A (en) 2008-04-25 2008-04-25 Method and system for integral release of internet information

Publications (1)

Publication Number Publication Date
CN101566995A true CN101566995A (en) 2009-10-28

Family

ID=41283149

Family Applications (1)

Application Number Title Priority Date Filing Date
CNA2008101050657A Pending CN101566995A (en) 2008-04-25 2008-04-25 Method and system for integral release of internet information

Country Status (1)

Country Link
CN (1) CN101566995A (en)

Cited By (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101908061A (en) * 2010-07-02 2010-12-08 互动在线(北京)科技有限公司 Method and device for synchronizing entries
CN101984420A (en) * 2010-09-03 2011-03-09 百度在线网络技术(北京)有限公司 Method and equipment for searching pictures based on word segmentation processing
CN102779130A (en) * 2011-05-11 2012-11-14 腾讯科技(深圳)有限公司 Method and device for automatically updating microblog page skin
CN103164427B (en) * 2011-12-13 2016-03-02 中国移动通信集团公司 News aggregation method and device
CN103164427A (en) * 2011-12-13 2013-06-19 中国移动通信集团公司 Method and device of news aggregation
CN103248551A (en) * 2012-02-03 2013-08-14 腾讯科技(深圳)有限公司 Information presentation method and system
CN103116653B (en) * 2013-03-05 2016-03-23 清华大学 Service resource searching method and system based on attribute matching
CN103116653A (en) * 2013-03-05 2013-05-22 清华大学 Service resource searching method and system based on attribute matching
WO2014206186A1 (en) * 2013-06-28 2014-12-31 百度在线网络技术(北京)有限公司 Method and device for generating entry information
CN103399890B (en) * 2013-07-22 2016-10-26 百度在线网络技术(北京)有限公司 Method and equipment for collecting words on input method client side
CN103399890A (en) * 2013-07-22 2013-11-20 百度在线网络技术(北京)有限公司 Method and equipment for collecting words on input method client side
CN103902708A (en) * 2014-03-31 2014-07-02 安徽新华博信息技术股份有限公司 Method for querying data
CN103955453A (en) * 2014-05-23 2014-07-30 清华大学 Method and device for automatically discovering new words from document set
CN107544685A (en) * 2016-06-29 2018-01-05 百度在线网络技术(北京)有限公司 Information-pushing method and device
CN107229724A (en) * 2017-06-05 2017-10-03 成都知道创宇信息技术有限公司 Link scoring method based on browsing records
CN107229724B (en) * 2017-06-05 2020-07-21 成都知道创宇信息技术有限公司 Link scoring method based on browsing records
CN109120500A (en) * 2017-06-23 2019-01-01 北京搜狗科技发展有限公司 Information processing method and input method system
CN111580786A (en) * 2020-05-06 2020-08-25 厦门理工学院 Internet + -based software engineering development system
CN116340469A (en) * 2023-05-29 2023-06-27 之江实验室 Synonym mining method and device, storage medium and electronic equipment
CN116340469B (en) * 2023-05-29 2023-08-11 之江实验室 Synonym mining method and device, storage medium and electronic equipment

Similar Documents

Publication Publication Date Title
CN101566995A (en) Method and system for integral release of internet information
CN100568241C (en) Method and system for centralized content management
CN101199122B (en) Using language models to expand wildcards
CN102368788B (en) Information pushing method and apparatus thereof
WO2020140360A1 (en) Clipboard-based information pushing method and system, and terminal device
US9218414B2 (en) System, method, and user interface for a search engine based on multi-document summarization
US10198776B2 (en) System and method for delivering an open profile personalization system through social media based on profile data structures that contain interest nodes or channels
US8429099B1 (en) Dynamic gazetteers for entity recognition and fact association
CN102708174B (en) Method and device for displaying rich media information in browser
JP6224731B2 (en) Method and apparatus for enriching social media to improve personal user experience
US20080312910A1 (en) Dictionary word and phrase determination
US20150154303A1 (en) System and method for providing content recommendation service
CN110377908B (en) Semantic understanding method, semantic understanding device, semantic understanding equipment and readable storage medium
CN102349087A (en) Automatically providing content associated with captured information, such as information captured in real-time
CN106354861A (en) Automatic film label indexing method and automatic indexing system
CN101329674A (en) System and method for providing personalized searching
WO2013170344A1 (en) Method and system relating to sentiment analysis of electronic content
CN104969254A (en) Personalized summaries for content
CN103092962B (en) Method and system for releasing internet information
CN102779114A (en) Unstructured data support generated by utilizing automatic rules
JP2008529179A (en) Method and apparatus for accessing mobile information in natural language
CN102831229A (en) Web page browsing method suitable for blind persons
WO2022262487A1 (en) Form generation method, apparatus and device, and medium
CN101354711A (en) Method, apparatus and system for searching information
CN103678362A (en) Search method and search system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20091028