CN102541918A - Method and equipment for identifying repeated information - Google Patents

Method and equipment for identifying repeated information

Publication number
CN102541918A
Authority
CN
China
Prior art keywords
information
server
feature coding
internal memory
issue
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2010106127421A
Other languages
Chinese (zh)
Inventor
陈斌
胡怀文
初永光
韩灵叶
苏磊
李乐
林朝森
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba Group Holding Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd
Priority to CN2010106127421A
Publication of CN102541918A
Legal status: Pending

Landscapes

  • Information Transfer Between Computers (AREA)

Abstract

The embodiments of the invention disclose a method and equipment for identifying repeated information. Under the technical solution of the embodiments, published information is stored in memory in the form of feature codes. When a new information publishing request is received, the memory is searched for the corresponding feature codes, and whether the information the publisher asks to publish duplicates already published information is determined by comparing feature codes for a match. In this way, while the accuracy of duplicate identification is maintained, the database access traffic caused by searching the database for duplicates is reduced, the efficiency of duplicate identification is improved, and the time spent on duplicate checking during publication is shortened.

Description

Method and device for identifying duplicate information
Technical field
The present application relates to the field of communication technology, and in particular to a method and a device for identifying duplicate information.
Background
Electronic commerce (EC) generally refers to a new mode of commercial operation in which, across a wide range of commercial and trade activities around the world, the two parties conduct business without meeting in person, in the open network environment of the Internet and on the basis of a browser/server application model. It covers online shopping by consumers, online transactions between merchants, online electronic payment, and the associated commercial, trading, financial, and integrated service activities.
E-commerce covers a wide range and can generally be divided into two kinds: business-to-business (B2B) and business-to-consumer (B2C). In addition, the consumer-to-consumer (C2C) model is growing rapidly. As the number of domestic Internet users increases, shopping online and paying by bank card has become increasingly popular, its market share is growing quickly, and new e-commerce websites keep emerging.
With the rapid development of e-commerce, the number of products listed on e-commerce websites is also growing quickly, and within this massive volume of product information there is a large amount of identical or similar information. For the administrators of an e-commerce website, repeatedly reviewing large amounts of identical or similar product information greatly reduces review efficiency, consumes storage space, and wastes review and storage resources. If many products with identical or similar content are displayed on the front end, the repeated display also seriously degrades users' product search experience and search efficiency.
Therefore, in the prior art, the information management side of an e-commerce website applies source-level deduplication of product information: before product information enters review, product information with identical or similar content is automatically returned by technical means, thereby improving the overall quality of the website's information.
In existing technical solutions, this source-level deduplication is typically implemented as a database query comparison method: the product data about to enter the review flow is taken out, the database is accessed to query all product information of the corresponding user, and a program compares the new product information against that user's other product information one by one. If the comparison finds it to be duplicate information, the review of this product information is rolled back and the information is returned; otherwise the remaining review steps continue.
In the course of implementing the embodiments of the present application, the applicant found that the prior art has at least the following problems:
1. Heavy database access pressure
With the existing database query comparison method, every product information item under review requires one query against the database. When the volume of information to review is small, this has little impact, but once the volume grows, it puts great access pressure on the database.
For example, if an e-commerce website must review several hundred thousand product information items per day, its database must handle at least several hundred thousand additional queries per day, which wastes system resources and places heavy traffic pressure on the database.
2. Low comparison efficiency for product information
With the database query comparison method, querying the database involves a series of operations such as SQL (Structured Query Language) parsing, I/O (input/output) operations, and network transmission. Therefore, when the volume of product information to query is large, comparison efficiency suffers badly.
Summary of the invention
The embodiments of the present application provide a method and a device for identifying duplicate information, to address the problem in the prior art that large amounts of duplicate data exist and degrade the experience of using product information.
To achieve the above object, in one aspect the embodiments of the present application provide a method for identifying duplicate information, comprising:
when a server receives an information publishing request sent by an information publisher, the server generates a feature code from the information that the request asks to publish;
the server looks up, in a memory that stores publisher identification information in correspondence with the feature codes of published information, the feature codes of the published information corresponding to the identification information of the information publisher;
the server determines whether the generated feature code duplicates any feature code of published information obtained from the memory;
if the server determines that a duplicate exists, the server refuses to publish the information that the request asks to publish.
In another aspect, the embodiments of the present application also provide a server, comprising:
a receiving module, configured to receive information publishing requests;
a generating module, configured to generate, when the receiving module receives an information publishing request sent by an information publisher, a feature code from the information that the request asks to publish;
an obtaining module, configured to look up, in a memory that stores publisher identification information in correspondence with the feature codes of published information, the feature codes of the published information corresponding to the identification information of the information publisher;
a judging module, configured to determine whether the feature code generated by the generating module duplicates the feature code of any published information that the obtaining module obtained from the memory;
a processing module, configured to refuse, when the judging module determines that a duplicate exists, to publish the information that the request asks to publish.
Compared with the prior art, the embodiments of the present application have the following advantages:
By applying the technical solution of the embodiments, published information is stored in memory in the form of feature codes. When a new information publishing request is received, the corresponding feature codes are queried in memory, and whether the information the publisher asks to publish duplicates published information is determined by checking for a matching feature code. Thus, while the accuracy of duplicate checking is maintained, the database access traffic caused by duplicate queries against the database is reduced, the efficiency of duplicate queries is improved, and the duplicate-checking time in the publishing process is shortened.
Description of drawings
Fig. 1 is a schematic flowchart of a method for identifying duplicate information proposed by an embodiment of the present application;
Fig. 2 is a schematic flowchart of the main idea of a method for identifying duplicate information proposed by an embodiment of the present application;
Fig. 3 is a schematic system architecture diagram of a specific application scenario of a method for identifying duplicate information proposed by an embodiment of the present application;
Fig. 4 is a schematic system architecture diagram of a specific application scenario of a method for identifying duplicate information proposed by an embodiment of the present application;
Fig. 5 is a schematic structural diagram of a server proposed by an embodiment of the present application.
Detailed description of the embodiments
As described in the background section, prior-art solutions that query a database for duplicate information suffer from a heavy traffic burden and inefficient comparison. How to reduce the traffic burden caused by duplicate queries and improve their efficiency has become an important topic in the field of information publishing.
On this basis, the embodiments of the present application propose a method that checks for duplicates using in-memory codes of published information. The comparison obtains its data directly from memory and does not access the database, which avoids the database access traffic caused by duplicate checking, removes the need for database-related operations, and greatly improves comparison efficiency.
Fig. 1 is a schematic flowchart of a method for identifying duplicate information proposed by an embodiment of the present application. The method specifically comprises the following steps:
Step S101: when the server receives an information publishing request sent by an information publisher, the server generates a feature code from the information that the request asks to publish.
The feature code is generated as follows:
First, the server obtains the feature data contained in the information that the request asks to publish. In a practical application scenario, the item types of this feature data can be preset according to the information type; this approach works for information types with large volumes and for information to be published that has a similar structure.
For example, when the request is to publish information about products for sale, the preset item types of the feature data can be the product name, product category, picture features (for example, similarity comparison parameters of the product picture), product description, product price, and production information. In another case, such as a request to publish news, the item types of the feature data under the technical solution of the embodiments can include the news title, time of occurrence, keywords, news summary, news source, news media content, and news copyright information. With item types set up in this way, the features of the product information or news to be published can be identified for the subsequent duplicate identification.
Specifically, the above examples only illustrate, for requests to publish product information and news, how to set item types that capture the differences between pieces of information. In practice the item types can be selected as the scenario requires, and changes to the specific item types do not affect the protection scope of the present application.
After the item types of the feature data have been set, when the server receives a corresponding information publishing request, it obtains the feature data of the requested information according to the item type settings and converts that feature data, using a preset algorithm, into the feature code of the information that the request asks to publish. The algorithm can be the MD5 algorithm, another message digest algorithm, or any other algorithmic rule that can encode the corresponding information data and achieve the technical purpose of the embodiments; changes in the type of algorithm do not affect the protection scope of the present application.
The purpose of this conversion is, on the one hand, to improve comparison efficiency in the subsequent comparison by comparing encoded data and, on the other hand, to reduce the storage space occupied by the obtained feature data. For published information in particular, encoding with the preset algorithm greatly shrinks the space the feature data occupies, so the server can store the feature data of more published information, or feature data of more item types, which improves the accuracy and comprehensiveness of duplicate identification.
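As an illustration of step S101, the following minimal sketch derives an MD5 feature code from a preset list of item types. The item names, the `feature_code` function, and the whitespace and case normalisation are assumptions made for the example; the source only states that item types are preset and that MD5 or a similar algorithm may be used.

```python
import hashlib

# Hypothetical item types for a product listing; the source only says that
# item types are preset per information type.
PRODUCT_ITEMS = ("name", "category", "description", "price")


def feature_code(info, items=PRODUCT_ITEMS):
    """Return the MD5 hex digest of the selected feature items."""
    # Collapse whitespace and lowercase each value (an assumed normalisation)
    # so that trivial formatting differences do not change the code.
    parts = [" ".join(str(info.get(key, "")).split()).lower() for key in items]
    # Join with the ASCII unit separator so adjacent fields cannot run together.
    payload = "\x1f".join(parts).encode("utf-8")
    return hashlib.md5(payload).hexdigest()
```

Each code is a fixed 32-character string regardless of how long the description is, which is the storage saving the paragraph above describes.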
Step S102: the server looks up, in a memory that stores publisher identification information in correspondence with the feature codes of published information, the feature codes of the published information corresponding to the identification information of the information publisher.
Here, the feature code of each piece of published information is the code produced when the feature data of that previously published information was converted by the preset algorithm. After encoding, this data occupies very little storage space, and it is called and updated relatively frequently. The server therefore stores it directly in memory, which on the one hand makes calling and updating convenient and on the other hand avoids the data communication burden of frequently calling data in the database.
In memory, the publisher's identification information serves as the key and the feature codes of the information that this publisher has published serve as the value. This establishes the correspondence between publisher identification information and the feature codes of published information, so that published information can be grouped by source and compared for duplicates according to the publisher's identification information.
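The key/value layout described for step S102 could look roughly like the sketch below. The `FeatureStore` class and its method names are hypothetical; the source only specifies that the publisher identifier is the key and that publisher's feature codes are the value.

```python
from collections import defaultdict


class FeatureStore:
    """In-memory map: publisher identifier -> set of feature codes."""

    def __init__(self):
        self._codes = defaultdict(set)

    def add(self, publisher_id, code):
        # Record one published item's feature code under its publisher.
        self._codes[publisher_id].add(code)

    def codes_for(self, publisher_id):
        # Return a copy so callers cannot mutate the store by accident;
        # unknown publishers yield an empty set without creating an entry.
        return set(self._codes.get(publisher_id, ()))
```

Using a set as the value makes the later membership check a constant-time lookup on average, instead of a comparison against each stored code in turn.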
It should be noted that, depending on the application scenario, the way the feature codes of published information and the corresponding publisher identification information are stored in memory also differs:
Situation one: the information published by each publisher is independent. The information of different publishers does not interfere with each other, and different publishers are allowed to publish duplicate information, for example in online stores or personal blogs.
The feature codes obtained in this step should be limited to those of the information published by the same publisher; that is, the server determines whether the publisher that sent the request has previously published the same information.
In this case, to tell whether a feature code stored in memory was published by the same publisher that sent the request, the server relies on the publisher identification information stored in memory alongside the feature codes of published information. This step is therefore implemented as follows:
The server determines the identification information of the publisher that sent the information publishing request;
According to this identification information, the server obtains from the feature codes stored in memory all feature codes corresponding to that identification information, that is, it queries memory for the information already published by the corresponding publisher.
In this case, when the feature code of a piece of published information is stored in memory, the identification information of the publisher that published it is stored with it.
Situation two: the information published by the publishers is shared. The information of different publishers is shared among them, and different publishers are not allowed to publish duplicate information, for example on news websites or resource sharing platforms.
The feature codes obtained in this step include not only those of the information published by this publisher but also those of all information published by other publishers; that is, the server determines whether any published information has the same content as the information that the request asks to publish.
In this case, the publisher identification can still be stored alongside each feature code in memory. The identification may cover only the publisher that actually published the information, or it may also cover every publisher that has requested to publish it. In other words, besides recording the publisher that successfully published the information, the server can also record every publisher whose request to publish it was refused because it was identified as duplicate, so that other publishers requesting to publish the same or similar information can learn which publisher or publishers have requested to publish the same or similar information.
In practice, which of the above ways of storing publisher identification information to use can be chosen as needed, and changes to the specific choice do not affect the protection scope of the present application.
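The difference between the two situations comes down to which feature codes are considered when checking a request. A minimal sketch, assuming the in-memory store is a plain dictionary from publisher identifier to a set of codes and using a hypothetical `shared` flag to select the situation:

```python
def candidate_codes(store, publisher_id, shared):
    """Return the feature codes a new request must be compared against.

    store  -- dict mapping publisher identifier -> set of feature codes
    shared -- False for situation one (per-publisher scope),
              True for situation two (all publishers share one scope)
    """
    if shared:
        # Situation two: any publisher's earlier item counts as a duplicate.
        return set().union(*store.values())
    # Situation one: only this publisher's own earlier items count.
    return set(store.get(publisher_id, ()))
```

For example, with `store = {"u1": {"a"}, "u2": {"b"}}`, publisher `u1` is checked against `{"a"}` on an online store but against `{"a", "b"}` on a news site.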
Step S103: the server determines whether the generated feature code duplicates the feature code of any published information obtained from memory.
If the server determines that a duplicate exists, step S104 is executed;
If the server determines that no duplicate exists, step S105 is executed.
In practice, for the subsequent comparison to work, the feature code generated by the server in step S101 and the feature codes stored in memory as mentioned in step S102 must be codes of the same type generated by the same preset algorithm, for example all MD5 codes or all codes of one other type.
Step S104: the server refuses to publish the information that the request asks to publish.
In practice, to give priority handling and special management to requests from special services or premium customers, the processing of this step also includes:
The server determines whether the information publisher has subscribed to a corresponding special publishing service;
If so, the server publishes the information that the request asks to publish; if not, the server refuses to publish it.
With this processing, users who have subscribed to a special service, and advanced users with higher priority or publishing authority, can be offered a higher level of service than the general case. That is, such special services and advanced users are no longer subject to the restriction on duplicate publishing, or are subject to fewer restrictions, which improves their experience and provides them with a high level of personalized service.
In the above processing, the server refuses publication by returning a publication refusal indication to the publisher that sent the request. To help the publisher better understand how its request was handled, the refusal indication can also carry the reason for the refusal.
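The refusal logic of step S104, including the special-service exemption and the reason carried in the refusal indication, might be sketched as follows. The `Decision` type, the `decide` function, and the reason text are illustrative assumptions rather than anything specified in the source.

```python
from dataclasses import dataclass


@dataclass
class Decision:
    accepted: bool    # True if the request passes the duplicate check
    reason: str = ""  # filled in only when the request is refused


def decide(code, known_codes, has_special_service):
    """Apply the duplicate check with the special-service exemption."""
    if code in known_codes and not has_special_service:
        # Refusal indication that also carries the cause of the refusal.
        return Decision(False, "duplicate of previously published information")
    # Not a duplicate, or the publisher is exempt from the restriction.
    return Decision(True)
```

An accepted request here only means the duplicate check did not block it; the remaining verifications still apply.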
Further, the technical solution of the embodiments also includes:
The server generates, from the feature codes of published information stored in memory, a sample library containing multiple benchmark records.
In this process, each feature code of published information stored in memory produces one benchmark record stored in the sample library. Besides the feature code itself, a benchmark record also holds other related information, for example the update time of the corresponding published information, the number of updates, the number of times a publishing request was identified as a duplicate of it, and its current status. This allows the handling of publishing requests to be recorded more accurately and provides a record base for back-end maintenance.
Further, because the information in the sample library is called less frequently and with lower speed requirements than the feature codes stored in memory, the sample library does not need to be kept in memory. It can be stored on the server's local disk or in a database, and the specific storage location can be chosen as needed.
After the sample library has been generated, if the server determines that a feature code of published information stored in memory duplicates the feature code of the information that a request asks to publish, the server updates the operation time and count of the benchmark record corresponding to that feature code in the sample library, and deletes the code as it was before the modification together with its corresponding benchmark record.
The server sends the address information of this benchmark record to the publisher that sent the request, so that the publisher can view the benchmark record at the received address and thereby confirm how the published information and the requested information overlap. This lets the publisher know accurately about the duplicate publication and, because only address information is transmitted, it does not place an excessive transmission burden on the server or affect its service performance.
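A benchmark record and the bookkeeping performed when a duplicate is detected could be modelled as in the sketch below. The field names, the `SampleLibrary` class, and the address format built from a placeholder base URL are all assumptions; the source lists the kinds of data recorded but not their layout. The sketch keeps records in a dictionary for brevity, whereas the source places the sample library on disk or in a database.

```python
import time
from dataclasses import dataclass, field


@dataclass
class BenchmarkRecord:
    feature_code: str
    updated_at: float = field(default_factory=time.time)  # operation time
    update_count: int = 0
    duplicate_hits: int = 0      # requests identified as duplicates of this item
    status: str = "published"


class SampleLibrary:
    def __init__(self, base_url="https://example.invalid/benchmarks/"):
        self.records = {}
        self.base_url = base_url  # placeholder address prefix

    def record_duplicate(self, code, now=None):
        """Update the record for `code` and return its address."""
        rec = self.records.setdefault(code, BenchmarkRecord(code))
        rec.duplicate_hits += 1
        rec.updated_at = time.time() if now is None else now
        # Only the short address is sent back to the publisher.
        return self.base_url + code
```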
Step S105: the server continues with the other verifications of the information publishing request according to preset verification rules.
If the other verifications pass, step S106 is executed;
If the other verifications do not pass, the subsequent verification process is terminated and the reason why the request failed verification is recorded.
Step S106: the server publishes the requested information and stores the feature code corresponding to the information in memory.
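Putting steps S101 to S106 together, the overall flow can be sketched in a few lines. This is a simplified illustration under stated assumptions: the whole text is hashed instead of selected feature items, the per-publisher scope of situation one is used, and `other_checks` is a hypothetical stand-in for the preset verification rules.

```python
import hashlib


def handle_publish_request(memory, publisher_id, text,
                           other_checks=lambda t: True):
    """Run one request through the S101-S106 flow; return the outcome."""
    code = hashlib.md5(text.encode("utf-8")).hexdigest()   # S101
    known = memory.setdefault(publisher_id, set())         # S102
    if code in known:                                      # S103
        return "rejected: duplicate"                       # S104
    if not other_checks(text):                             # S105
        return "rejected: failed other verification"
    known.add(code)                                        # S106
    return "published"
```

Note that the feature code is stored only after every check has passed, so a request that fails later verification leaves the in-memory store unchanged.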
It should be noted that, after the feature code has been stored in memory, if the feature data is important enough the server can also back up the feature codes in memory according to a backup policy, to prevent the feature codes in memory from being lost when the server meets an emergency and cannot provide service. For example, a small database can be set up on the server's local disk and the feature codes in memory backed up to it at a preset backup interval, so that the feature codes can be restored into memory after an emergency.
The specific backup policy can vary as needed. For example, besides the periodic backup above, backups can also be triggered at scheduled times or by events, and the backup data is not limited to the server's local disk: it can also be stored on other servers in the same server cluster or in the storage resources of a database.
It should be pointed out that the backup operation is optional, because it consumes storage resources and generates a certain amount of traffic load. Whether to pay this cost can be decided according to the importance of the feature data in memory. If the system performs no such backup and the server meets an emergency, then after the server resumes work, or after a backup server takes over, the feature codes can be reloaded into memory from the aforementioned sample library or from the database.
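One way to realise the small local database mentioned above is SQLite, which ships with Python. The choice of SQLite, the `codes` table, and the `backup` and `restore` function names are assumptions made for this sketch; the source does not name a database. Scheduling (periodic, timed, or event-triggered) is left to the caller.

```python
import sqlite3
from contextlib import closing


def backup(memory, db_path):
    """Write the in-memory map (publisher -> set of codes) to a local file."""
    rows = [(p, c) for p, codes in memory.items() for c in codes]
    # closing() releases the file handle; the inner `conn` context
    # commits the transaction on success and rolls back on error.
    with closing(sqlite3.connect(db_path)) as conn, conn:
        conn.execute(
            "CREATE TABLE IF NOT EXISTS codes ("
            "publisher TEXT, code TEXT, PRIMARY KEY (publisher, code))"
        )
        conn.execute("DELETE FROM codes")  # replace the previous snapshot
        conn.executemany("INSERT INTO codes VALUES (?, ?)", rows)


def restore(db_path):
    """Rebuild the in-memory map from the last backup."""
    memory = {}
    with closing(sqlite3.connect(db_path)) as conn:
        for publisher, code in conn.execute("SELECT publisher, code FROM codes"):
            memory.setdefault(publisher, set()).add(code)
    return memory
```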
While implementing the above technical solution, the server can also maintain the information in memory in the following ways:
Mode one: if published information is deleted, the server deletes the stored feature code corresponding to the deleted information from memory.
Mode two: if published information is modified, the server returns to step S103 with the modified content and determines whether the feature code of the modified content duplicates the feature code of any information published by the same publisher and stored in memory. If there is no duplicate, the feature code stored in memory for the modified information is updated; if there is a duplicate, further processing follows the steps described above.
Mode three: if the feature codes stored in memory reach a preset cleanup trigger condition, the server deletes feature codes of stored information from memory according to a predetermined policy.
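Modes one and two amount to small updates on a publisher's set of feature codes. The sketch below assumes that set is a plain Python `set`; the function names are hypothetical. Mode three is the cleanup process described next in the text.

```python
def on_delete(codes, old_code):
    """Mode one: drop the feature code of deleted information."""
    codes.discard(old_code)  # no error if the code is already gone


def on_modify(codes, old_code, new_code):
    """Mode two: re-check the modified content, then swap the codes.

    Returns False if the modified content duplicates another item of the
    same publisher, in which case the store is left unchanged.
    """
    if new_code in codes - {old_code}:
        return False
    codes.discard(old_code)
    codes.add(new_code)
    return True
```

Excluding `old_code` from the check means an edit that leaves the feature items unchanged is not mistaken for a duplicate of itself.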
In a practical application scenario, the preset cleanup trigger conditions can include:
1. If the total number of feature codes stored in memory, or of benchmark records in the sample library, reaches a preset quantity threshold, the feature codes stored in memory are cleaned up and the corresponding benchmark records in the sample library are cleaned up accordingly.
2. If the storage space occupied by the feature codes stored in memory, or by the benchmark records in the sample library, reaches a preset space threshold, the feature codes stored in memory are cleaned up and the corresponding benchmark records in the sample library are cleaned up accordingly.
3. If the time elapsed between the operation time of a feature code stored in memory, or of a benchmark record in the sample library, and the current time reaches a preset time threshold, the feature codes stored in memory are cleaned up and the corresponding benchmark records in the sample library are cleaned up accordingly.
The cleanup itself can follow a policy customized in advance. For example, in order of operation time (generation time or update time), the feature information with the earliest operation time is cleaned up first, and the corresponding benchmark records in the sample library are cleaned up accordingly.
The specific policy can be adjusted as needed, and such changes do not affect the protection scope of the present application.
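The quantity and time triggers, combined with an oldest-first policy, can be sketched as below. The `cleanup` function, its parameters, and the choice to trim down to exactly `max_count` entries are assumptions; the space-based trigger is omitted for brevity, and the returned list is what a caller would use to remove the matching benchmark records.

```python
def cleanup(op_times, now, max_count, max_age):
    """Evict feature codes in place and return the removed codes.

    op_times  -- dict mapping feature code -> last operation time
    max_age   -- time threshold: entries at least this old are removed
    max_count -- quantity threshold: oldest entries go first until the
                 store holds at most this many codes
    """
    # Time trigger: drop every code whose last operation is too old.
    removed = [c for c, t in op_times.items() if now - t >= max_age]
    for code in removed:
        del op_times[code]
    # Quantity trigger: drop the oldest remaining codes first.
    overflow = len(op_times) - max_count
    if overflow > 0:
        oldest = sorted(op_times, key=op_times.get)[:overflow]
        for code in oldest:
            del op_times[code]
        removed.extend(oldest)
    return removed
```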
The main purpose of this cleanup is to release memory space and keep feature codes that have gone unused for a long time from occupying memory. It does mean that the feature codes of some published information will be missing, which affects duplicate identification for that data. However, the feature codes targeted by the cleanup policy are those with a low probability of being used again in the scenario, so the identification error they introduce is expected to be very small. Conversely, avoiding this error requires storing more feature codes, and the only available measure is to increase memory capacity. In a specific application scenario, whether to apply the cleanup policy can be decided by weighing the cost of adding memory against the impact of this error on system operation; such variations likewise fall within the protection scope of the present application.
Compared with prior art, the application embodiment has the following advantages:
By applying the technical solution of the embodiments of the present application, published information is stored in memory in the form of feature codes. When a new information publishing request is received, the corresponding feature codes are looked up in memory, and whether the information the publisher asks to publish duplicates already published information is determined by checking for a matching feature code. In this way, while the accuracy of duplicate checking is preserved, the database access load caused by querying the database for duplicates is reduced, the efficiency of duplicate lookup is improved, and the time spent on duplicate checking during publication is shortened.
The technical solution proposed by the embodiments of the present application is described below with reference to specific application scenarios.
A schematic flow chart of the main idea of the duplicate information identification method proposed by the embodiments of the present application is shown in Figure 2. Its key points are as follows:
The local client device used by the information publisher is responsible for organizing and extracting the data to be compared (that is, determining the information to be published). For example, it obtains the information to be published from the information publisher's operations and the instructions entered.
The client device then selects a server according to a distribution strategy and reports the information publishing request to it. The server computes the feature code of the data using the corresponding algorithm, compares the computed feature code with the feature codes already present in memory, and provides the corresponding service to the information publisher using the client device according to the comparison result.
To cope with a large volume of service interactions, the server is generally implemented as a server cluster made up of multiple servers. In this arrangement, the client device can handle operations such as exception recovery and retry on error, while the servers in the cluster provide horizontal scaling and load balancing for the requests.
In practice, the comparison is initiated by the client device. After collecting the information to be compared, the client device looks at the total number of information publishing requests handled by each server in the cluster over a certain time interval (for example, the most recent 1 minute), selects the server that handled the fewest requests (thereby achieving load balancing), and requests the comparison service from it. If that server is abnormal (for example, the service provider cannot accept new comparison requests), the client device chooses again from the remaining servers by the same rule. In the extreme case where every server in the cluster is abnormal and the comparison service cannot be completed, the client device returns a default result directly to the information publisher (for example, that the information currently requested for publication does not duplicate published information).
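The client-side selection rule described above (fewest recently handled requests first, fall back to the remaining servers on failure, default result if all fail) might look like the sketch below. The function name `request_comparison`, the `send` callback, and the use of `ConnectionError` to signal an abnormal server are all assumptions made for illustration.

```python
from typing import Callable, Dict

# Assumed default when no server can answer: treat the information as not duplicated.
DEFAULT_RESULT = False


def request_comparison(
    recent_counts: Dict[str, int],
    send: Callable[[str, str], bool],
    payload: str,
) -> bool:
    """Ask the least-loaded reachable server whether `payload` is a duplicate.

    `recent_counts` maps a server name to the number of publishing requests
    it handled in the recent interval; `send` is a hypothetical transport call
    that raises ConnectionError when the server is abnormal.
    """
    # Try servers in ascending order of recently handled requests.
    for server in sorted(recent_counts, key=recent_counts.get):
        try:
            return send(server, payload)
        except ConnectionError:
            continue  # abnormal server: choose again from the remaining ones
    # Every server failed: return the default result to the publisher.
    return DEFAULT_RESULT
```

Returning "not duplicated" as the default favors availability of publishing over strictness of duplicate checking; a deployment could choose the opposite default.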
Specifically, Figure 3 is a schematic diagram of the system architecture for a specific application scenario of the duplicate information identification method proposed by the embodiments of the present application.
The information publisher accesses the network through a client device and submits an information publishing request. The client device submits a comparison request to a server in the server cluster according to the distribution strategy. That server compares the information to be published with the published information stored in memory and carries out subsequent operations according to the comparison result.
In practice, for better centralized scheduling and management, a centralized processing server may additionally be placed between the client devices and the server cluster. It receives the comparison requests reported by the clients, distributes them according to the current request-handling load of each server in the cluster, and hands them to the corresponding server for processing. Such centralized management can effectively improve processing efficiency and avoids the increase in network traffic caused by client devices communicating directly with multiple servers in the cluster.
Moreover, if accessing the server cluster is a remote call for the client device, centralized management can effectively save network resources. The remote call may use Remote Method Invocation (RMI) technology.
In addition, if the centralized processing server fails to query the status or obtain the configuration information of one or more servers in the cluster, it may continue to use the previous configuration information, which prevents hardware faults or communication delays from affecting system stability.
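The fallback to the previous configuration can be sketched as a refresh routine that only overwrites a cached entry when the query succeeds. The names `refresh_status` and `fetch`, and the exception types used to represent a failed query, are hypothetical.

```python
from typing import Callable, Dict, Iterable


def refresh_status(
    cache: Dict[str, int],
    fetch: Callable[[str], int],
    servers: Iterable[str],
) -> Dict[str, int]:
    """Refresh per-server status, keeping the last known value on failure.

    `fetch` is an assumed query function that raises ConnectionError or
    TimeoutError when a server cannot be reached in time.
    """
    for server in servers:
        try:
            cache[server] = fetch(server)
        except (ConnectionError, TimeoutError):
            # Query failed: continue to use the previously obtained value.
            pass
    return cache
```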
It should further be noted that the above technical solution also requires synchronization among the servers in the cluster to keep the sample library information on every server consistent, thereby ensuring that comparison results are accurate.
In specific implementation scenarios, the following points of the technical solution proposed by the embodiments of the present application need to be explained:
1. The comparison service uses distributed computing
Multiple servers in the server cluster provide the comparison service at the same time. The client device or a server can distribute information publishing requests according to the information in the status file of each server in the cluster, thereby achieving load balancing.
With this approach, the comparison service for large volumes of data scales well: higher throughput can be achieved simply by adding servers dynamically.
2. The innovation in the sample data structure
Sample data (that is, the feature codes described above) is kept in memory in the form of a two-way Map. For example, for product information to be sold, the company ID or the information publisher's identification information and the MD5 value of the company's product-related information may be placed in the two-way Map as key and value respectively.
(1) In practice, because only the MD5 values or other codes of published information are stored in memory, the storage they occupy in memory is very small, and the memory footprint of each object can be predicted.
(2) For product information to be sold, the company ID or the information publisher's identification information and the MD5 value or other code of the product-related information are stored in the two-way Map data structure, so the sample data is split horizontally. The data belonging to the same company or the same information publisher can be located quickly, and whether information is duplicated is judged by checking whether the retrieved sample data contains an MD5 value or other code identical to the feature code of the information requested for publication. This gives the identification process both high performance and high accuracy. Compared with comparing the information itself, comparing feature codes is clearly more efficient.
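A minimal sketch of such a structure is given below. It interprets the Map as a mapping from publisher (or company) ID to the set of MD5 feature codes of that publisher's published information, which is an assumption about the intended layout. The names `feature_code` and `PublishedIndex`, the field separator, and the whitespace stripping are likewise illustrative choices rather than details taken from the application.

```python
import hashlib
from collections import defaultdict
from typing import DefaultDict, Iterable, Set


def feature_code(fields: Iterable[str]) -> str:
    # Join the feature data with a separator and hash it with MD5.
    joined = "\x1f".join(field.strip() for field in fields)
    return hashlib.md5(joined.encode("utf-8")).hexdigest()


class PublishedIndex:
    """Hypothetical index: publisher ID -> feature codes of published information."""

    def __init__(self) -> None:
        self._by_publisher: DefaultDict[str, Set[str]] = defaultdict(set)

    def add(self, publisher_id: str, code: str) -> None:
        self._by_publisher[publisher_id].add(code)

    def is_duplicate(self, publisher_id: str, code: str, all_publishers: bool = False) -> bool:
        # First check only the codes belonging to the same publisher.
        if code in self._by_publisher.get(publisher_id, ()):
            return True
        # Optionally widen the check to every publisher's codes.
        if all_publishers:
            return any(code in codes for codes in self._by_publisher.values())
        return False
```

Because each stored value is a fixed-length 32-character digest, the memory used per published item is predictable regardless of how long the original product description is.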
3. Data synchronization within the comparison server cluster
Synchronization between the servers in the cluster uses active push: when the sample information stored in the memory of one server changes, that server directly notifies the other servers in the cluster to update accordingly. This avoids the extra system overhead that cluster-wide synchronization operations would otherwise require. In addition, data synchronization between servers is carried out asynchronously, which greatly improves server responsiveness and makes the configuration of the server cluster more flexible.
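One way to picture asynchronous active push is a background thread that drains an outbox of local changes and forwards them to peers, so that the request path never waits on synchronization. The sketch below runs all nodes in one process for simplicity; the class `CompareNode`, its methods, and the in-process "push" are assumptions, and a real cluster would push over the network instead.

```python
import queue
import threading
from typing import List, Set, Tuple


class CompareNode:
    """Hypothetical comparison server that pushes local changes to its peers."""

    def __init__(self, name: str) -> None:
        self.name = name
        self.codes: Set[Tuple[str, str]] = set()
        self.peers: List["CompareNode"] = []
        self._lock = threading.Lock()
        self._outbox: "queue.Queue[Tuple[str, str]]" = queue.Queue()
        # Background worker so that pushing never blocks request handling.
        threading.Thread(target=self._push_loop, daemon=True).start()

    def add_local(self, publisher_id: str, code: str) -> None:
        with self._lock:
            self.codes.add((publisher_id, code))
        self._outbox.put((publisher_id, code))  # returns immediately

    def apply_remote(self, publisher_id: str, code: str) -> None:
        with self._lock:
            self.codes.add((publisher_id, code))

    def _push_loop(self) -> None:
        while True:
            item = self._outbox.get()
            for peer in self.peers:
                peer.apply_remote(*item)
            self._outbox.task_done()

    def flush(self) -> None:
        # Block until every queued change has been pushed (useful in tests).
        self._outbox.join()
```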
4. Expansion of the server cluster
When a server's memory is insufficient, memory capacity can be expanded not only by adding memory to the server itself but also by increasing the number of servers in the cluster, which effectively improves the scalability of the system.
5. Reliability of the server cluster
When a server in the cluster fails and its in-memory data is lost, the data can be restored in two ways:
(1) Synchronous recovery from the data on other servers in the cluster. This is fast, but the synchronization strategy between the servers must be configured in advance, and data must be backed up between the servers in a timely manner during normal operation.
(2) Recovery from comparison data persisted to a database. This approach persists the data in server memory and stores it on other devices. Recovery is slower and the data may lag behind, but it is easy to operate.
The data can of course also be obtained again directly from the published information, but the load cost of doing so is very high.
In each of the above strategies, the synchronization strategy between the servers in the cluster can be configured for different application scenarios. In practice, configuring the synchronization strategy involves a trade-off between memory utilization and disaster tolerance: the higher the memory utilization, the lower the disaster tolerance, and the higher the disaster tolerance, the lower the memory utilization.
Compared with prior art, the application embodiment has the following advantages:
By applying the technical solution of the embodiments of the present application, published information is stored in memory in the form of feature codes. When a new information publishing request is received, the corresponding feature codes are looked up in memory, and whether the information the publisher asks to publish duplicates already published information is determined by checking for a matching feature code. In this way, while the accuracy of duplicate checking is preserved, the database access load caused by querying the database for duplicates is reduced, the efficiency of duplicate lookup is improved, and the time spent on duplicate checking during publication is shortened.
To implement the technical solution of the embodiments of the present application, the embodiments also provide a server, whose schematic structure is shown in Figure 5. It specifically comprises:
a receiving module 51, configured to receive information publishing requests;
a generation module 52, configured to generate a feature code from the information that the request asks to publish when the receiving module 51 receives an information publishing request sent by an information publisher;
an acquisition module 53, configured to look up, in a memory that stores identification information of information publishers in correspondence with feature codes of published information, the feature codes of the published information corresponding to the identification information of the information publisher;
a judging module 54, configured to judge whether the feature code generated by the generation module 52 duplicates any feature code of the published information that the acquisition module 53 obtained from memory;
a processing module 55, configured to refuse to publish the information that the information publishing request asks to publish when the judgment result of the judging module 54 is a duplicate.
The generation module 52 is specifically configured to:
obtain the feature data contained in the information that the information publishing request received by the receiving module 51 asks to publish, and convert the feature data into the feature code of that information according to a preset algorithm.
In a specific application scenario, corresponding to case one mentioned in step S102 above, if the scope within which duplicates need to be judged is the information published by the same information publisher, the acquisition module 53 is specifically configured to:
determine the identification information of the information publisher corresponding to the information publishing request received by the receiving module 51 and, according to that identification information, obtain from the feature codes stored in memory all the feature codes corresponding to the identification information. The judging module 54 then compares the feature codes obtained by the acquisition module 53 with the feature code generated by the generation module 52 and judges whether there is a duplicate.
On the other hand, corresponding to case two mentioned in step S102 above, if the scope within which duplicates need to be judged is all published information, the acquisition module 53 is specifically configured to:
after obtaining from memory the feature codes of the published information corresponding to the identification information of the information publisher of the request, further obtain the feature codes of the other published information.
Further, the processing module 55 is also configured to:
when the judgment result of the judging module 54 is no duplicate, continue to perform other verification on the information publishing request according to preset verification rules. If the other verification passes, publish the information that the request asks to publish and store the corresponding feature code in memory. If the other verification fails, stop the subsequent verification process and record the reason why the information publishing request failed.
Furthermore, the processing module 55 is also configured to:
when the judgment result of the judging module 54 is a duplicate, judge whether the information publisher has subscribed to a corresponding special publishing service. If so, publish the information that the request asks to publish; if not, refuse to publish it.
When the processing module 55 decides to refuse to publish the information that the request asks to publish, it is specifically configured to return a publishing refusal indication to the information publisher, carrying the reason for the refusal.
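The interaction between the modules can be summarized in a single function, as in the sketch below. It is not the server's actual implementation: `handle_request`, `Result`, the two callbacks, the plain dictionary standing in for memory, and the use of MD5 over the raw content as the preset algorithm are all assumptions introduced for illustration.

```python
import hashlib
from dataclasses import dataclass
from typing import Callable, Dict, Set


@dataclass
class Result:
    published: bool
    reason: str = ""


def handle_request(
    memory: Dict[str, Set[str]],
    publisher_id: str,
    content: str,
    has_special_service: Callable[[str], bool],
    other_checks: Callable[[str], str],
) -> Result:
    """Process one publishing request against the in-memory feature codes.

    `other_checks` is an assumed hook that returns an empty string on success
    or the failure reason otherwise.
    """
    code = hashlib.md5(content.encode("utf-8")).hexdigest()  # generation module
    known = memory.setdefault(publisher_id, set())           # acquisition module
    if code in known:                                        # judging module
        if has_special_service(publisher_id):
            return Result(True)
        return Result(False, "duplicate of already published information")
    failure = other_checks(content)                          # other verification
    if failure:
        return Result(False, failure)
    known.add(code)  # store the feature code only after successful publication
    return Result(True)
```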
In a specific application scenario, the generation module 52 is also configured to generate, from the feature codes of published information stored in memory, a sample library containing multiple items of benchmark information ("mark post" information).
Correspondingly, the processing module 55 is also configured to, when the judgment result of the judging module 54 is a duplicate, update the operation time and count of the benchmark information corresponding to the feature code in the sample library generated by the generation module 52, and send the address information of that benchmark information to the information publisher, so that the information publisher can view the benchmark information according to the address information.
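A sample library entry and its update on a duplicate hit could be modeled as below. The `Benchmark` fields, the function `record_duplicate_hit`, and the form of the address are hypothetical; the application only states that an operation time, a count, and address information exist.

```python
import time
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class Benchmark:
    address: str             # where the publisher can view the benchmark information
    hit_count: int = 0       # how many duplicate requests matched this entry
    last_op_time: float = 0.0


def record_duplicate_hit(
    sample_library: Dict[str, Benchmark],
    code: str,
    now: Optional[float] = None,
) -> str:
    """Update the benchmark entry for `code` and return its address."""
    entry = sample_library[code]
    entry.hit_count += 1
    entry.last_op_time = time.time() if now is None else now
    return entry.address
```

The count and operation time updated here are the same values a cleanup strategy could later consult when deciding which entries to evict.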
In practical application scenarios, the processing module 55 is also configured to:
when published information is deleted, delete the feature code stored in memory that corresponds to the deleted information; and/or,
when published information is modified, judge, according to the modified content, whether the feature code corresponding to the modified content duplicates any feature code of the published information stored in memory. If there is no duplicate, modify the feature code stored in memory for the modified information; if there is a duplicate, update the related information of the duplicated feature code and delete the pre-modification code; and/or,
when published information has reached a preset cleanup trigger condition, delete the feature codes stored in memory for that information according to a predetermined strategy.
For the content of the preset strategy, refer to the description above; it is not repeated here.
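The delete and modify cases can be sketched with two small helpers operating on a publisher-to-codes mapping. The function names and the plain dictionary are assumptions; updating the "related information" of a duplicated code is left to the caller, which is signaled by the return value.

```python
from typing import Dict, Set


def on_deleted(memory: Dict[str, Set[str]], publisher_id: str, code: str) -> None:
    # Published information was deleted: drop its feature code from memory.
    memory.get(publisher_id, set()).discard(code)


def on_modified(
    memory: Dict[str, Set[str]],
    publisher_id: str,
    old_code: str,
    new_code: str,
) -> bool:
    """Apply a modification; return True if the new content is a duplicate."""
    codes = memory.setdefault(publisher_id, set())
    duplicate = new_code in codes and new_code != old_code
    codes.discard(old_code)      # the pre-modification code is removed either way
    if not duplicate:
        codes.add(new_code)      # replace the stored code with the new one
    return duplicate
```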
Compared with prior art, the application embodiment has the following advantages:
By applying the technical solution of the embodiments of the present application, published information is stored in memory in the form of feature codes. When a new information publishing request is received, the corresponding feature codes are looked up in memory, and whether the information the publisher asks to publish duplicates already published information is determined by checking for a matching feature code. In this way, while the accuracy of duplicate checking is preserved, the database access load caused by querying the database for duplicates is reduced, the efficiency of duplicate lookup is improved, and the time spent on duplicate checking during publication is shortened.
From the description of the above embodiments, those skilled in the art can clearly understand that the embodiments of the present application can be implemented in hardware, or in software together with a necessary general-purpose hardware platform. On this understanding, the technical solution of the embodiments of the present application can be embodied in the form of a software product. The software product can be stored in a non-volatile storage medium (such as a CD-ROM, a USB flash drive, or a portable hard drive) and includes instructions that cause a computer device (such as a personal computer, a server, or a network device) to execute the methods described in the implementation scenarios of the embodiments of the present application.
Those skilled in the art will understand that the accompanying drawings are schematic diagrams of preferred implementation scenarios, and that the modules or flows in the drawings are not necessarily required to implement the embodiments of the present application.
Those skilled in the art will understand that the modules of the devices in an implementation scenario may be distributed among the devices of that scenario as described, or may be changed accordingly and located in one or more devices different from those of the scenario. The modules of the above implementation scenarios may be combined into one module or further split into multiple submodules.
The sequence numbers of the above embodiments are for description only and do not indicate the relative merits of the implementation scenarios.
The above discloses only several specific implementation scenarios of the embodiments of the present application. The embodiments are not limited to these, however, and any variation that a person skilled in the art can conceive shall fall within the scope of protection of the embodiments of the present application.

Claims (21)

1. A method for identifying duplicate information, characterized by comprising:
when a server receives an information publishing request sent by an information publisher, the server generates a feature code from the information that the information publishing request asks to publish;
the server looks up, in a memory that stores identification information of information publishers in correspondence with feature codes of published information, the feature codes of the published information corresponding to the identification information of the information publisher;
the server judges whether the generated feature code duplicates a feature code of the published information obtained from the memory;
if the judgment result of the server is a duplicate, the server refuses to publish the information that the information publishing request asks to publish.
2. The method of claim 1, characterized in that the server generating a feature code from the information that the information publishing request asks to publish is specifically:
the server obtains the feature data contained in the information that the information publishing request asks to publish;
the server converts the obtained feature data, according to a preset algorithm, into the feature code of the information that the information publishing request asks to publish.
3. The method of claim 1 or 2, characterized in that, when the information requested for publication is product information, the feature data comprises at least one or more of the following:
product name, product classification, picture features, product description, product price, and production information.
4. The method of claim 1 or 2, characterized in that, when the information requested for publication is news information, the feature data comprises at least one or more of the following:
news title, time of occurrence, news keywords, news summary, news source, news media articles, and news copyright information.
5. The method of claim 1 or 2, characterized in that the feature code generated by the server and the feature codes stored in the memory are codes of the same type generated according to the same preset algorithm.
6. the method for claim 1; It is characterized in that; Said server correspondence preserve information publisher's identification information and the internal memory of the feature coding that released news in search the feature coding that obtains with said information publisher's the pairing announced information of identification information, specifically comprise:
Said server is confirmed the pairing identification information of said information issue information requested publisher;
Said server is according to said identification information, in said internal memory, in the feature coding of canned data, obtains and the pairing feature coding of said identification information.
7. the method for claim 1 is characterized in that, whether the feature coding that said server is judged said generation exists after the repetition with the feature coding of announced each information of in internal memory, obtaining, also comprises:
Said server judges whether the feature coding of all announced each information of storing in said feature coding and the internal memory exists repetition.
8. like claim 1 or 7 described methods, it is characterized in that said server judges that whether the feature coding of announced each information of storing in said feature coding and the internal memory exists after the repetition, also comprises:
If the judged result of said server is not for repeating, said server continues said information issue request is carried out other checkings according to preset proof rule;
If other checkings are passed through, said server is issued the said information issue request information releasing of asking, and the pairing feature coding of said information is stored to said internal memory; If other checkings are not passed through, then stop subsequent authentication process, and write down the reason that said information issue request is not passed through said information issue request.
9. the method for claim 1 is characterized in that, if the judged result of said server is repetition, also comprises:
Said server judges whether said information publisher has customized corresponding special issuing service;
If said server is issued the said information issue request information releasing of asking, if not, the said information issue of the said server refusal issue request information releasing of asking.
10. like claim 1 or 9 described methods, it is characterized in that the said information issue of the said server refusal issue request information releasing of asking specifically comprises:
Said server returns the indication of issue refusal to said information publisher, and carries the cause information of refusal issue.
11. The method of claim 10, characterized by further comprising:
the server generates, from the feature codes of published information stored in the memory, a sample library containing multiple items of benchmark information;
when the server judges that a feature code of published information stored in the memory duplicates the feature code of the information that the information publishing request asks to publish, the server updates the operation time and count of the benchmark information corresponding to the feature code in the sample library;
the server sends the address information of the benchmark information to the information publisher, so that the information publisher can view the benchmark information according to the address information.
12. The method of claim 1, characterized by further comprising:
if published information is deleted, the server deletes the feature code stored in the memory that corresponds to the deleted information; and/or,
if published information is modified, the server judges, according to the modified content, whether the feature code corresponding to the modified content duplicates any feature code of the published information stored in the memory; if there is no duplicate, the feature code stored in the memory for the modified information is modified; if there is a duplicate, the related information of the duplicated feature code is updated and the pre-modification code is deleted; and/or,
if a feature code stored in the memory has reached a preset cleanup trigger condition, the server deletes the feature code stored in the memory according to a predetermined strategy.
13. A server, characterized by comprising:
a receiving module, configured to receive information publishing requests;
a generation module, configured to generate a feature code from the information that the information publishing request asks to publish when the receiving module receives an information publishing request sent by an information publisher;
an acquisition module, configured to look up, in a memory that stores identification information of information publishers in correspondence with feature codes of published information, the feature codes of the published information corresponding to the identification information of the information publisher;
a judging module, configured to judge whether the feature code generated by the generation module duplicates any feature code of the published information that the acquisition module obtained from the memory;
a processing module, configured to refuse to publish the information that the information publishing request asks to publish when the judgment result of the judging module is a duplicate.
14. The server of claim 13, characterized in that the generation module is specifically configured to:
obtain the feature data contained in the information that the information publishing request received by the receiving module asks to publish, and convert the feature data, according to a preset algorithm, into the feature code of the information that the information publishing request asks to publish.
15. The server of claim 13, characterized in that the acquisition module is specifically configured to:
determine the identification information of the information publisher corresponding to the information publishing request received by the receiving module, and obtain, according to the identification information, the feature codes corresponding to the identification information from the feature codes of the information stored in the memory.
16. The server of claim 13, characterized in that the acquisition module is specifically configured to:
after obtaining from the memory the feature codes of the published information corresponding to the identification information of the information publisher of the information publishing request, obtain the feature codes of the other published information.
17., it is characterized in that said processing module also is used for like claim 13 or 16 described servers:
In the judged result of said judge module when not repeating; Continuation is carried out other checkings to said information issue request according to preset proof rule; If other checkings are passed through; Then issue the said information issue request information releasing of asking, and the pairing feature coding of said information is stored to said internal memory; If other checkings are not passed through, then stop subsequent authentication process, and write down the reason that said information issue request is not passed through said information issue request.
18. server as claimed in claim 13 is characterized in that, said processing module also is used for:
When the judged result of said judge module is repetition; Judge whether said information publisher has customized corresponding special issuing service, if issue the said information issue request information releasing of asking; If not, the said information issue of the refusal issue request information releasing of asking.
19. like claim 13 or 18 described servers, it is characterized in that, when said processing module confirm the said information issue of refusal issue request when asking information releasing, said processing module specifically is used for:
Return the indication of issue refusal to said information publisher, and carry the cause information of refusal issue.
20. server as claimed in claim 19 is characterized in that,
Said generation module also is used for the feature coding that has released news stored according to internal memory, generates the sample storehouse that comprises a plurality of mark post information;
Said processing module; Also be used for when the judged result of said judge module is repetition; Upgrade the running time and the counting of the pairing mark post information of feature coding described in the sample storehouse that said generation module generates; And the address information of the mark post information that said generation module generated sent to said information publisher, so that said information publisher checks said mark post information according to said address information.
21. The server as claimed in claim 13, characterized in that said processing module is also used for:
When issued information is deleted, deleting the feature code stored in said internal memory that corresponds to the deleted information; and/or,
When issued information is modified, judging, according to the modified information content, whether the feature code corresponding to the modified content repeats the feature code of any issued information stored in the internal memory; if there is no repetition, modifying the feature code stored in the internal memory for the modified information; if there is a repetition, updating the related information corresponding to the repeated feature code and deleting the coded information from before the modification; and/or,
When issued information meets a preset cleanup trigger condition, deleting the feature code of said information stored in said internal memory according to a predetermined policy.
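The three maintenance cases of claim 21 (deletion, modification, cleanup) can be sketched as operations on an in-memory store. This is a hedged illustration under stated assumptions: the class `FeatureCodeStore` and its methods are hypothetical, a SHA-256 digest stands in for whatever feature-code algorithm is actually used, and "older than `max_age`" stands in for the unspecified cleanup trigger condition and policy.

```python
import hashlib
from typing import Dict


def feature_code(content: str) -> str:
    # Assumption: a SHA-256 digest stands in for the real feature code.
    return hashlib.sha256(content.encode("utf-8")).hexdigest()


class FeatureCodeStore:
    def __init__(self) -> None:
        self._codes: Dict[str, dict] = {}  # feature code -> related information

    def has(self, content: str) -> bool:
        return feature_code(content) in self._codes

    def add(self, content: str, now: float) -> None:
        self._codes[feature_code(content)] = {"updated": now, "hits": 0}

    def on_delete(self, content: str) -> None:
        # Issued information was deleted: drop its feature code.
        self._codes.pop(feature_code(content), None)

    def on_modify(self, old_content: str, new_content: str, now: float) -> bool:
        """Return True if the modified content repeats a stored feature code."""
        old_code, new_code = feature_code(old_content), feature_code(new_content)
        if old_code == new_code:
            return False  # code unchanged, nothing to maintain
        if new_code in self._codes:
            # Repetition: update the repeated entry, delete the old code.
            self._codes[new_code]["updated"] = now
            self._codes[new_code]["hits"] += 1
            self._codes.pop(old_code, None)
            return True
        # No repetition: replace the old code with the new one.
        info = self._codes.pop(old_code, {"updated": now, "hits": 0})
        info["updated"] = now
        self._codes[new_code] = info
        return False

    def cleanup(self, now: float, max_age: float) -> int:
        # Assumed policy: remove codes not updated within max_age.
        stale = [c for c, i in self._codes.items() if now - i["updated"] > max_age]
        for code in stale:
            del self._codes[code]
        return len(stale)
```

Keeping the store keyed by feature code means each of the three cases is a constant-time dictionary operation, apart from the cleanup scan.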
CN2010106127421A 2010-12-30 2010-12-30 Method and equipment for identifying repeated information Pending CN102541918A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2010106127421A CN102541918A (en) 2010-12-30 2010-12-30 Method and equipment for identifying repeated information

Publications (1)

Publication Number Publication Date
CN102541918A true CN102541918A (en) 2012-07-04

Family

ID=46348834

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2010106127421A Pending CN102541918A (en) 2010-12-30 2010-12-30 Method and equipment for identifying repeated information

Country Status (1)

Country Link
CN (1) CN102541918A (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6658423B1 (en) * 2001-01-24 2003-12-02 Google, Inc. Detecting duplicate and near-duplicate files
CN1567821A (en) * 2003-06-24 2005-01-19 华为技术有限公司 Call ticket repetition removing method
CN101159795A (en) * 2007-10-25 2008-04-09 中兴通讯股份有限公司 Calling list rearrangement method and device
CN101442731A (en) * 2008-12-12 2009-05-27 中国移动通信集团安徽有限公司 Method and apparatus for removing call ticket repeat
CN101500199A (en) * 2009-02-19 2009-08-05 广东创我科技发展有限公司 Message receiving apparatus, processing method and communication terminal
US20100005048A1 (en) * 2008-07-07 2010-01-07 Chandra Bodapati Detecting duplicate records
US20100235333A1 (en) * 2009-03-16 2010-09-16 International Business Machines Corporation Apparatus and method to sequentially deduplicate data

Cited By (37)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102929698A (en) * 2012-09-29 2013-02-13 北京百度网讯科技有限公司 Task re-judgment method and system
CN104063374A (en) * 2013-03-18 2014-09-24 阿里巴巴集团控股有限公司 Data deduplication method and equipment
TWI574168B (en) * 2013-07-26 2017-03-11 網路家庭國際資訊股份有限公司 Database system
CN103530090B (en) * 2013-10-15 2016-02-03 福建榕基软件股份有限公司 Data rename method and device thereof
CN103530090A (en) * 2013-10-15 2014-01-22 福建榕基软件股份有限公司 Data renaming method and device
CN104660635A (en) * 2013-11-20 2015-05-27 腾讯科技(北京)有限公司 Message synchronizing method, device and system
US10313434B2 (en) 2013-11-20 2019-06-04 Tencent Technology (Shenzhen) Company Limited Method and device for message synchronization
CN104702486A (en) * 2013-12-09 2015-06-10 腾讯科技(深圳)有限公司 Information publication method and device
CN104702486B (en) * 2013-12-09 2019-08-13 腾讯科技(深圳)有限公司 A kind of method and device that message is delivered
CN104715374A (en) * 2013-12-11 2015-06-17 世纪禾光科技发展(北京)有限公司 Method and system for governing repetition products of e-commerce platform
CN103634410B (en) * 2013-12-12 2017-01-11 北京奇安信科技有限公司 Data synchronization method based on content distribution network (CDN), client end and server
CN103634410A (en) * 2013-12-12 2014-03-12 北京奇虎科技有限公司 Data synchronization method based on content distribution network (CDN), client end and server
CN105528453B (en) * 2015-12-24 2018-10-23 浪潮软件集团有限公司 Method, device and system for updating recruitment information
CN105528453A (en) * 2015-12-24 2016-04-27 浪潮软件集团有限公司 Method, device and system for updating recruitment information
CN107346310A (en) * 2016-05-05 2017-11-14 腾讯科技(深圳)有限公司 A kind of account complaint processing method and server
CN107818390A (en) * 2016-09-12 2018-03-20 方正国际软件(北京)有限公司 A kind of check requirements generation method and device
CN106408020A (en) * 2016-09-14 2017-02-15 东莞盛翔精密金属有限公司 Product two-dimensional code marking anti-repeated code detection system and method
CN106446119A (en) * 2016-09-18 2017-02-22 深圳信壹网络有限公司 Method for processing media files
CN106789990A (en) * 2016-12-09 2017-05-31 天脉聚源(北京)传媒科技有限公司 A kind of news push method and device
CN106682870A (en) * 2016-12-12 2017-05-17 武汉图灵创客科技有限公司 Social platform system for 'maker' education
CN108665331A (en) * 2017-03-31 2018-10-16 上海吉会得实业有限公司 The e-commerce platform and method that multiple users share product releases news
CN109302300B (en) * 2017-07-25 2022-03-15 阿里巴巴集团控股有限公司 Data distribution and processing method, system and computer readable recording medium
CN109302300A (en) * 2017-07-25 2019-02-01 阿里巴巴集团控股有限公司 Data distributing method and device, data processing method and server
CN107704613A (en) * 2017-10-23 2018-02-16 深圳市金立通信设备有限公司 A kind of approaches to IM, terminal and computer-readable recording medium
CN107908775A (en) * 2017-11-30 2018-04-13 掌阅科技股份有限公司 The dynamic of merchandise news shows method, electronic equipment, storage medium
CN110457971A (en) * 2018-05-07 2019-11-15 腾讯科技(深圳)有限公司 Repeat the detection method and device of information of identification code
CN110457971B (en) * 2018-05-07 2022-09-16 腾讯科技(深圳)有限公司 Method and device for detecting repeated identification code information
CN108829726A (en) * 2018-05-09 2018-11-16 麒麟合盛网络技术股份有限公司 A kind of information issuing method and device
CN108805518A (en) * 2018-05-14 2018-11-13 北京车和家信息技术有限公司 Information processing method, device, system and electronic equipment
CN109582871A (en) * 2018-11-30 2019-04-05 苏州达家迎信息技术有限公司 Information issuing method, device, equipment and storage medium
CN109582905A (en) * 2018-11-30 2019-04-05 苏州达家迎信息技术有限公司 Information issuing method, device, equipment and storage medium
CN109582870A (en) * 2018-11-30 2019-04-05 苏州达家迎信息技术有限公司 Information issuing method, device, equipment and storage medium
CN109658288A (en) * 2018-12-10 2019-04-19 泰康保险集团股份有限公司 The processing method and equipment of service item coding
CN109803022A (en) * 2019-01-30 2019-05-24 浙江蓝鸽科技有限公司 A kind of digitalization resource shared system and its method of servicing
CN109803022B (en) * 2019-01-30 2022-02-18 浙江蓝鸽科技有限公司 Digital resource sharing system and service method thereof
WO2022193447A1 (en) * 2021-03-17 2022-09-22 网宿科技股份有限公司 Data packet deduplication and transmission method, electronic device, and storage medium
CN113112335A (en) * 2021-05-08 2021-07-13 拉扎斯网络科技(上海)有限公司 Commodity information processing method and device for shop and computer equipment

Similar Documents

Publication Publication Date Title
CN102541918A (en) Method and equipment for identifying repeated information
AU2019295818B2 (en) Block chain-based data processing method and device
US9466063B2 (en) Cluster processing of an aggregated dataset
CN111507709B (en) Data tracing system
CN108446975B (en) Quota management method and device
US20080288522A1 (en) Creating and storing a data field alteration datum using an analytic platform
CN105446991A (en) Data storage method, query method and device
US20130111010A1 (en) Application scope adjustment based on resource consumption
CN106844372B (en) Logistics information query method and device
CN102214187A (en) Complex event processing method and device
CN106326243B (en) Data processing method and device
CN112016921A (en) Transaction processing method, device and equipment
CN110389989B (en) Data processing method, device and equipment
CN105574051A (en) Method for updating user satisfaction rule and processing system
CN116433198A (en) Intelligent supply chain management platform system based on cloud computing
US20140317156A1 (en) Data management for data aggregation
KR20130082719A (en) Apparatus and method for financial data inquiry
CN109741140A (en) A kind of e-commerce system
CN107239962B (en) Method and system for matching multi-dimensional data units in electronic information system
CN113792039A (en) Data processing method and device, electronic equipment and storage medium
CN114153860A (en) Business data management method and device, electronic equipment and storage medium
US20040122695A1 (en) System and method for management of quotations
Wust et al. Xsellerate: supporting sales representatives with real-time information in customer dialogs
CN111274255A (en) Service data monitoring method and system, monitoring architecture, equipment and storage medium
CN100507906C (en) Redundancy-free provision of multi-purpose data

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
REG Reference to a national code (Ref country code: HK; Ref legal event code: DE; Ref document number: 1168915; Country of ref document: HK)
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication (Application publication date: 20120704)
Application publication date: 20120704