CN106557896A - Network data processing method, apparatus and system - Google Patents

Network data processing method, apparatus and system

Info

Publication number
CN106557896A
Authority
CN
China
Prior art keywords
address
pending
subordinate
site
mark
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510623228.0A
Other languages
Chinese (zh)
Inventor
凌宇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Cainiao Smart Logistics Holding Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd
Priority to CN201510623228.0A
Publication of CN106557896A
Legal status: Pending

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The application provides a network data processing method, apparatus and system. The method includes: receiving pending data sent by a current site terminal, where the pending data includes the pending address of an article, the current site level, the current site identifier and the current main-body identifier; determining, as the pending address keyword, the administrative division information in the pending address that corresponds to the subordinate site level of the current site level; determining, in the current data set that corresponds to the current main-body identifier in a preset database, the subordinate data set that corresponds to the subordinate site identifiers of the current site identifier; and, if a target address keyword that matches the pending address keyword is found in the subordinate data set, pushing the target subordinate site that corresponds to the target address keyword. Because the subordinate site of an article is determined on the server rather than by hand, classification efficiency and accuracy can be improved.

Description

Network data processing method, apparatus and system
Technical field
The application relates to the technical field of data processing, and in particular to a network data processing method, apparatus and system.
Background
With the rapid development of network technology, online shopping businesses have grown quickly, and the number of logistics companies has grown with them. As logistics companies have expanded in recent years, their sites have come to cover most regions of China.
To deliver articles, a site of a logistics company needs to classify them so that each article can be distributed to a subordinate site of that site. At present, logistics companies classify articles mainly by hand: a worker memorizes in advance the delivery range of every subordinate site of the site, then decides from the address of each article which target subordinate site it belongs to and distributes the article to that site. This completes the classification of the article.
Because manual classification is slow and error-prone, article classification has become a bottleneck that keeps logistics companies from improving their level of service. A method that classifies articles automatically is therefore needed, so that a site can determine the subordinate site for an article accurately and quickly, improving both the efficiency and the accuracy of classification.
Summary of the invention
The application provides a network data processing method, apparatus and system that can help a site determine the subordinate site for an article accurately and quickly, thereby improving the efficiency and accuracy of article classification.
To achieve this, the application adopts the following technical means:
A network data processing method, including:
receiving pending data sent by a current site terminal, where the pending data includes the pending address of an article, the current site level, the current site identifier and the current main-body identifier, and the current main-body identifier is the identifier of the main body to which the current site belongs;
determining, as the pending address keyword, the administrative division information in the pending address that corresponds to the subordinate site level of the current site level, where the pending address includes multiple pieces of administrative division information and each piece corresponds to one site level;
determining, in the current data set that corresponds to the current main-body identifier in a preset database, the subordinate data set that corresponds to the subordinate site identifiers of the current site identifier, where the preset database includes multiple main-body identifiers and a data set for each main-body identifier, each data set includes the site identifiers of every level and the address keyword that corresponds to each site identifier, and the subordinate data set includes several address keywords and the subordinate site identifier that corresponds one-to-one to each address keyword;
if a target address keyword that matches the pending address keyword is found in the subordinate data set, sending the target subordinate site that corresponds to the target address keyword to the current site terminal, where the target subordinate site is the subordinate site in the subordinate data set that corresponds one-to-one to the target address keyword.
Preferably, determining, as the pending address keyword, the administrative division information in the pending address that corresponds to the subordinate site level of the current site level includes:
where the current site level is the province level, determining the city-level administrative division information in the pending address as the pending address keyword;
where the current site level is the city level, determining the county- or district-level administrative division information in the pending address as the pending address keyword;
where the current site level is the county or district level, determining the township-level administrative division information in the pending address as the pending address keyword, or determining the contracted-area information in the pending address that corresponds to the contracted-area level as the pending address keyword.
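The level-by-level selection above can be sketched in Python as follows. This is a minimal illustration, not the patented implementation: the level labels, the dictionary layout of a parsed address, and the choice to prefer township information over contracted-area information are all assumptions made for the example.

```python
from typing import Dict, Optional

# Site levels from highest to lowest; these labels are illustrative assumptions.
LEVELS = ["province", "city", "county", "township"]


def select_pending_keyword(address: Dict[str, str], current_level: str) -> Optional[str]:
    """Return the administrative division one level below the current site level."""
    idx = LEVELS.index(current_level)
    if idx + 1 >= len(LEVELS):
        return None  # lowest level: there is no subordinate level to select
    subordinate_level = LEVELS[idx + 1]
    if subordinate_level == "township":
        # A county-level site may use township or contracted-area information;
        # preferring the township here is an assumption of this sketch.
        return address.get("township") or address.get("contract_area")
    return address.get(subordinate_level)


# Hypothetical parsed address.
address = {
    "province": "Zhejiang",
    "city": "Hangzhou",
    "county": "Yuhang",
    "township": "Wuchang",
}
print(select_pending_keyword(address, "province"))
```

A province-level site therefore routes on the city part of the address, a city-level site on the county or district part, and so on down the hierarchy.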
Preferably, finding, in the subordinate data set, a target address keyword that matches the pending address keyword includes:
where the pending address keyword is city-level, county- or district-level, or township-level administrative division information, searching the subordinate data set for an address keyword that is identical to the pending address keyword;
determining the address keyword in the subordinate data set that is identical to the pending address keyword as the target address keyword that matches the pending address keyword.
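For administrative division keywords, the match is an exact comparison, so a plain dictionary lookup is enough. The sketch below assumes a subordinate data set stored as a mapping from address keyword to subordinate site identifier; the identifiers are hypothetical.

```python
from typing import Dict, Optional


def find_exact_match(subordinate_set: Dict[str, str], pending_keyword: str) -> Optional[str]:
    """Return the subordinate site whose address keyword equals the pending keyword."""
    return subordinate_set.get(pending_keyword)


# Hypothetical subordinate data set of a province-level site.
subordinate_set = {"Hangzhou": "site-hz", "Ningbo": "site-nb"}
print(find_exact_match(subordinate_set, "Hangzhou"))
```

When no keyword is identical, the function returns `None`, which corresponds to the case in which no target subordinate site is sent back.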
Preferably, determining the contracted-area information in the pending address that corresponds to the contracted-area level includes:
determining the pending road information in the pending address as the contracted-area information that corresponds to the contracted-area level.
Preferably, the pending road information includes a pending road name and a pending road number, and the address keywords in the subordinate data set include road information that corresponds one-to-one to each subordinate site identifier; finding, in the subordinate data set, a target address keyword that matches the pending address keyword then includes:
searching the subordinate data set for an address keyword whose road name is identical to the pending road name and which contains the pending road number;
determining the address keyword in the subordinate data set whose road name is identical to the pending road name and which contains the pending road number as the target address keyword that matches the pending address keyword.
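One way to read "contains the pending road number" is that each road keyword covers a range of house numbers, as in the "Yuhua Road No. 001-No. 100" example used later in the description. The sketch below works under that assumption; the keyword format, the regular expression and the site identifiers are illustrative only.

```python
import re
from typing import Dict, Optional, Tuple


def parse_road_keyword(keyword: str) -> Tuple[str, int, int]:
    """Split e.g. 'Yuhua Road No. 001-No. 100' into (road name, low, high)."""
    m = re.fullmatch(r"(.+?)\s+No\.\s*(\d+)\s*-\s*No\.\s*(\d+)", keyword)
    if m is None:
        raise ValueError(f"unrecognized road keyword: {keyword!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def match_road(subordinate_set: Dict[str, str], road_name: str, road_number: int) -> Optional[str]:
    """Return the site whose keyword has the same road name and covers the number."""
    for keyword, site_id in subordinate_set.items():
        name, low, high = parse_road_keyword(keyword)
        if name == road_name and low <= road_number <= high:
            return site_id
    return None


# Hypothetical contracted-area data set of a county-level site.
roads = {
    "Yuhua Road No. 001-No. 100": "site-a",
    "Yuhua Road No. 101-No. 200": "site-b",
}
print(match_road(roads, "Yuhua Road", 57))
```

The road name must match exactly, while the number only has to fall inside the stored range, which mirrors the two conditions in the step above.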
Preferably, determining the contracted-area information in the pending address that corresponds to the contracted-area level includes:
determining the pending point-of-interest information in the pending address as the contracted-area information that corresponds to the contracted-area level.
Preferably, the address keywords in the subordinate data set include point-of-interest information that corresponds one-to-one to each subordinate site identifier; finding, in the subordinate data set, a target address keyword that matches the pending address keyword then includes:
calculating the similarity between the pending point-of-interest information and each piece of point-of-interest information in the subordinate data set;
determining the point-of-interest information in the subordinate data set that has the highest similarity to the pending point-of-interest information as the target address keyword.
Preferably, calculating the similarity between the pending point-of-interest information and the point-of-interest information in the subordinate data set includes:
calculating the edit distance between the pending point-of-interest information and the point-of-interest information, where the edit distance is the minimum number of edit operations needed to convert the pending point-of-interest information into the point-of-interest information, and the edit operations are replacing one character with another, inserting a character, and deleting a character;
calculating the number of common substrings of the pending point-of-interest information and the point-of-interest information, where, among the characters that are identical in the two, two or more adjacent characters form one common substring;
calculating, by a preset formula, the similarity between the pending point-of-interest information and the point-of-interest information from the edit distance and the number of common substrings.
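The similarity steps can be sketched as follows. The edit distance is the standard Levenshtein distance described above. The text does not give the preset formula or an exact counting rule for common substrings, so the greedy counting and the formula `common / (1 + distance)` used here are assumptions chosen only to make the example concrete.

```python
from typing import Dict, Optional


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum substitutions, insertions and deletions."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute (or keep)
        prev = curr
    return prev[-1]


def count_common_substrings(a: str, b: str) -> int:
    """Greedily count runs of two or more consecutive characters of a found in b."""
    count, i = 0, 0
    while i < len(a):
        length = 0
        # Extend the run starting at i for as long as it still occurs in b.
        while i + length < len(a) and a[i:i + length + 1] in b:
            length += 1
        if length >= 2:
            count += 1
            i += length
        else:
            i += 1
    return count


def similarity(a: str, b: str) -> float:
    """Hypothetical stand-in for the preset formula: more shared runs, fewer edits."""
    return count_common_substrings(a, b) / (1 + edit_distance(a, b))


def best_poi_match(subordinate_set: Dict[str, str], pending_poi: str) -> Optional[str]:
    """Return the site whose point-of-interest keyword is most similar."""
    if not subordinate_set:
        return None
    best = max(subordinate_set, key=lambda poi: similarity(pending_poi, poi))
    return subordinate_set[best]


# Hypothetical point-of-interest keywords.
pois = {"West Lake Cultural Square": "site-1", "East Railway Station": "site-2"}
print(best_poi_match(pois, "West Lake Culture Square"))
```

Unlike the exact match used for administrative divisions, this approach tolerates small spelling differences in the point-of-interest name, at the cost of always returning the closest candidate even when it is a poor one; a minimum-similarity threshold would be a natural extension.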
Preferably, the method further includes:
receiving a main-body identifier and the latest data set that corresponds to the main-body identifier;
determining, in the preset database, the existing data set that corresponds to the main-body identifier;
updating the existing data set with the latest data set.
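A minimal sketch of this update step, assuming the preset database is an in-memory mapping keyed by main-body identifier and that "updating" means replacing the stored data set wholesale (the text does not say whether a merge is intended):

```python
from typing import Dict

# site identifier -> {address keyword: subordinate site identifier}; layout is hypothetical.
Dataset = Dict[str, Dict[str, str]]


def update_dataset(preset_db: Dict[str, Dataset], body_id: str, latest: Dataset) -> None:
    """Replace the stored data set of one main body with the latest data set."""
    # Assumption: a main body that is not yet stored is simply added.
    preset_db[body_id] = latest


preset_db = {"company-1": {"site-hz": {"Yuhang": "site-yh"}}}
update_dataset(preset_db, "company-1", {"site-hz": {"Yuhang": "site-yh-new"}})
print(preset_db["company-1"]["site-hz"]["Yuhang"])
```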
A network data processing method, including:
obtaining the pending address of an article at the current site;
building pending data from the pending address, the current site level, the current site identifier and the current main-body identifier, where the current main-body identifier is the identifier of the main body to which the current site belongs;
sending the pending data to a server;
where the pending data is used by the server to: receive the pending data sent by the current site terminal; determine, as the pending address keyword, the administrative division information in the pending address that corresponds to the subordinate site level of the current site level, where the pending address includes multiple pieces of administrative division information and each piece corresponds to one site level; determine, in the current data set that corresponds to the current main-body identifier in a preset database, the subordinate data set that corresponds to the subordinate site identifiers of the current site identifier, where the preset database includes multiple main-body identifiers and a data set for each main-body identifier, each data set includes the site identifiers of every level and the address keyword that corresponds to each site identifier, and the subordinate data set includes several address keywords and the subordinate site identifier that corresponds one-to-one to each address keyword; and, if a target address keyword that matches the pending address keyword is found in the subordinate data set, send the target subordinate site that corresponds to the target address keyword to the current site terminal, where the target subordinate site is the subordinate site in the subordinate data set that corresponds one-to-one to the target address keyword.
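On the terminal side, building the pending data amounts to bundling four fields into one message. The sketch below serializes them as JSON; the field names, the JSON encoding and the sample values are assumptions, since the text does not specify a wire format.

```python
import json
from dataclasses import asdict, dataclass


@dataclass
class PendingData:
    pending_address: str      # address of the article to classify
    current_site_level: str   # level of the site that sends the request
    current_site_id: str      # identifier of that site
    current_body_id: str      # identifier of the main body (logistics company)


def build_pending_data(address: str, level: str, site_id: str, body_id: str) -> str:
    """Serialize the four fields into one message for the server."""
    return json.dumps(asdict(PendingData(address, level, site_id, body_id)),
                      ensure_ascii=False)


# Hypothetical values for a city-level site.
message = build_pending_data("Zhejiang Hangzhou Yuhang Wuchang", "city", "site-hz", "company-1")
print(message)
```

Sending the site level and both identifiers with every request lets the server pick the right data set without keeping any per-terminal state.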
Preferably, obtaining the pending address includes:
receiving the pending address entered by a user through an input device; or
receiving the pending address sent by a scanning device, where the scanning device obtains the pending address by scanning a two-dimensional code or a bar code.
Preferably, the method further includes:
receiving the target subordinate site, sent by the server, that corresponds to the target address keyword.
Preferably, the method further includes:
sending to the server the current main-body identifier and the latest data set that corresponds to the current main-body identifier.
A network data processing apparatus, including:
a first receiving unit, configured to receive pending data sent by a current site terminal, where the pending data includes the pending address of an article, the current site level, the current site identifier and the current main-body identifier, and the current main-body identifier is the identifier of the main body to which the current site belongs;
a first determining unit, configured to determine, as the pending address keyword, the administrative division information in the pending address that corresponds to the subordinate site level of the current site level, where the pending address includes multiple pieces of administrative division information and each piece corresponds to one site level;
a second determining unit, configured to determine, in the current data set that corresponds to the current main-body identifier in a preset database, the subordinate data set that corresponds to the subordinate site identifiers of the current site identifier, where the preset database includes multiple main-body identifiers and a data set for each main-body identifier, each data set includes the site identifiers of every level and the address keyword that corresponds to each site identifier, and the subordinate data set includes several address keywords and the subordinate site identifier that corresponds one-to-one to each address keyword;
a target address keyword lookup unit, configured to find, in the subordinate data set, a target address keyword that matches the pending address keyword;
a first sending unit, configured to send, if a target address keyword that matches the pending address keyword is found in the subordinate data set, the target subordinate site that corresponds to the target address keyword to the current site terminal, where the target subordinate site is the subordinate site in the subordinate data set that corresponds one-to-one to the target address keyword.
Preferably, the first determining unit includes:
a province-level determining unit, configured to determine, where the current site level is the province level, the city-level administrative division information in the pending address as the pending address keyword;
a city-level determining unit, configured to determine, where the current site level is the city level, the county- or district-level administrative division information in the pending address as the pending address keyword;
a county-level determining unit, configured to determine, where the current site level is the county or district level, the township-level administrative division information in the pending address as the pending address keyword, or to determine the contracted-area information in the pending address that corresponds to the contracted-area level as the pending address keyword.
Preferably, the target address keyword lookup unit includes:
a first lookup unit, configured to search, where the pending address keyword is city-level, county- or district-level, or township-level administrative division information, the subordinate data set for an address keyword that is identical to the pending address keyword;
a first target address keyword determining unit, configured to determine the address keyword in the subordinate data set that is identical to the pending address keyword as the target address keyword that matches the pending address keyword.
Preferably, the contracted-area information that corresponds to the contracted-area level is the pending road information in the pending address, the pending road information includes a pending road name and a pending road number, and the address keywords in the subordinate data set include road information that corresponds one-to-one to each subordinate site identifier; the target address keyword lookup unit then includes:
a second lookup unit, configured to search the subordinate data set for an address keyword whose road name is identical to the pending road name and which contains the pending road number;
a second target address keyword determining unit, configured to determine the address keyword in the subordinate data set whose road name is identical to the pending road name and which contains the pending road number as the target address keyword that matches the pending address keyword.
Preferably, the contracted-area information that corresponds to the contracted-area level is the pending point-of-interest information in the pending address, and the address keywords in the subordinate data set include point-of-interest information that corresponds one-to-one to each subordinate site identifier; the target address keyword lookup unit then includes:
a similarity calculating unit, configured to calculate the similarity between the pending point-of-interest information and each piece of point-of-interest information in the subordinate data set;
a third target address keyword determining unit, configured to determine the point-of-interest information in the subordinate data set that has the highest similarity to the pending point-of-interest information as the target address keyword.
Preferably, the similarity calculating unit includes:
an edit distance calculating unit, configured to calculate the edit distance between the pending point-of-interest information and the point-of-interest information, where the edit distance is the minimum number of edit operations needed to convert the pending point-of-interest information into the point-of-interest information, and the edit operations are replacing one character with another, inserting a character, and deleting a character;
a common substring calculating unit, configured to calculate the number of common substrings of the pending point-of-interest information and the point-of-interest information, where, among the characters that are identical in the two, two or more adjacent characters form one common substring;
a calculating unit, configured to calculate, by a preset formula, the similarity between the pending point-of-interest information and the point-of-interest information from the edit distance and the number of common substrings.
Preferably, the apparatus further includes:
an updating unit, configured to receive a main-body identifier and the latest data set that corresponds to the main-body identifier; determine, in the preset database, the existing data set that corresponds to the main-body identifier; and update the existing data set with the latest data set.
A network data processing apparatus, including:
an obtaining unit, configured to obtain the pending address of an article at the current site;
a building unit, configured to build pending data from the pending address, the current site level, the current site identifier and the current main-body identifier, where the current main-body identifier is the identifier of the main body to which the current site belongs;
a second sending unit, configured to send the pending data to a server;
where the pending data is used by the server to: receive the pending data sent by the current site terminal; determine, as the pending address keyword, the administrative division information in the pending address that corresponds to the subordinate site level of the current site level, where the pending address includes multiple pieces of administrative division information and each piece corresponds to one site level; determine, in the current data set that corresponds to the current main-body identifier in a preset database, the subordinate data set that corresponds to the subordinate site identifiers of the current site identifier, where the preset database includes multiple main-body identifiers and a data set for each main-body identifier, each data set includes the site identifiers of every level and the address keyword that corresponds to each site identifier, and the subordinate data set includes several address keywords and the subordinate site identifier that corresponds one-to-one to each address keyword; and, if a target address keyword that matches the pending address keyword is found in the subordinate data set, send the target subordinate site that corresponds to the target address keyword to the current site terminal, where the target subordinate site is the subordinate site in the subordinate data set that corresponds one-to-one to the target address keyword.
Preferably, the obtaining unit is specifically configured to receive the pending address entered by a user through an input device, or to receive the pending address sent by a scanning device, where the scanning device obtains the pending address by scanning a two-dimensional code or a bar code.
Preferably, the apparatus further includes a second receiving unit, configured to receive the target subordinate site, sent by the server, that corresponds to the target address keyword.
Preferably, the apparatus further includes:
a data sending unit, configured to send to the server the current main-body identifier and the latest data set that corresponds to the current main-body identifier.
A network data processing system, including:
a server and several site terminals connected to the server, where any one of the several site terminals is the current site terminal;
the current site terminal is configured to obtain the pending address of an article at the current site; build pending data from the pending address, the current site level, the current site identifier and the current main-body identifier, where the current main-body identifier is the identifier of the main body to which the current site belongs; and send the pending data to the server;
the server is configured to receive the pending data sent by the current site terminal; determine, as the pending address keyword, the administrative division information in the pending address that corresponds to the subordinate site level of the current site level, where the pending address includes multiple pieces of administrative division information and each piece corresponds to one site level; determine, in the current data set that corresponds to the current main-body identifier in a preset database, the subordinate data set that corresponds to the subordinate site identifiers of the current site identifier, where the preset database includes multiple main-body identifiers and a data set for each main-body identifier, each data set includes the site identifiers of every level and the address keyword that corresponds to each site identifier, and the subordinate data set includes several address keywords and the subordinate site identifier that corresponds one-to-one to each address keyword; and, if a target address keyword that matches the pending address keyword is found in the subordinate data set, send the target subordinate site that corresponds to the target address keyword to the current site terminal, where the target subordinate site is the subordinate site in the subordinate data set that corresponds one-to-one to the target address keyword.
As can be seen from the above, the application has the following beneficial effects:
The application stores the data set of each main body in the preset database of the server. The data set includes the site identifiers of every level and the address keyword that corresponds to each site identifier; this replaces the manual memorization of the delivery ranges of subordinate sites. After receiving a piece of pending data, the server determines, within the pending address, the address keyword to be queried that can be matched against the data set, and matches it against the subordinate data set to obtain the target subordinate site that corresponds to it; this replaces the worker's mental judgment of which subordinate site an article belongs to.
Because the subordinate site of an article is determined on the server rather than by hand, classification efficiency and accuracy can be improved.
Brief description of the drawings
To explain the technical solutions of the embodiments of the application or of the prior art more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. The drawings described below are only some embodiments of the application; a person of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is a schematic structural diagram of the network data processing system disclosed in an embodiment of the application;
Fig. 2 is a flowchart of building the preset database in the network data processing system disclosed in an embodiment of the application;
Fig. 3 is a flowchart of updating the preset database in the network data processing system disclosed in an embodiment of the application;
Fig. 4 is a flowchart of a network data processing method disclosed in an embodiment of the application;
Fig. 5 is a flowchart of another network data processing method disclosed in an embodiment of the application;
Fig. 6 is a flowchart of another network data processing method disclosed in an embodiment of the application;
Fig. 7 is a flowchart of another network data processing method disclosed in an embodiment of the application;
Fig. 8 is a flowchart of another network data processing method disclosed in an embodiment of the application;
Fig. 9 is a flowchart of another network data processing method disclosed in an embodiment of the application;
Fig. 10 is a schematic structural diagram of a network data processing apparatus disclosed in an embodiment of the application;
Fig. 11 is a schematic structural diagram of the first determining unit in the network data processing apparatus disclosed in an embodiment of the application;
Fig. 12 is a schematic structural diagram of the target address keyword lookup unit in the network data processing apparatus disclosed in an embodiment of the application;
Fig. 13 is a schematic structural diagram of another target address keyword lookup unit in the network data processing apparatus disclosed in an embodiment of the application;
Fig. 14 is a schematic structural diagram of another target address keyword lookup unit in the network data processing apparatus disclosed in an embodiment of the application;
Fig. 15 is a schematic structural diagram of the similarity calculating unit in the network data processing apparatus disclosed in an embodiment of the application;
Fig. 16 is a schematic structural diagram of another network data processing apparatus disclosed in an embodiment of the application.
Detailed description
The technical solutions in the embodiments of the application are described clearly and completely below with reference to the drawings of those embodiments. The described embodiments are only some of the embodiments of the application, not all of them. All other embodiments that a person of ordinary skill in the art obtains from the embodiments of the application fall within the scope of protection of the application.
Take a logistics company as an example. A logistics company has sites at several levels, for example province-level sites, city-level sites, county- or district-level sites, contracted-area sites and township-level sites, where contracted-area sites are at the same level as township-level sites. Each level can have many sites; for example, the country has 23 provinces, so a logistics company may have 23 province-level sites.
In a logistics company, classifying an article means determining, at a site of a given level, the subordinate site for that article. To automate this, the application sets up one or more terminals for each site of each logistics company. The terminal that corresponds to a site is called a site terminal, and the set of site terminals of all sites of one logistics company is called a main body. Because the main body is the set of all site terminals of the logistics company, it reflects the company indirectly; that is, the main body is equivalent to the logistics company. In addition, the site terminal that corresponds to a site has all the attributes of that site.
For example, if a site has the attributes site level 1, site identifier 1 and identifier 1 of the logistics company it belongs to, then the corresponding site terminal has a site terminal level (the same as site level 1), a site identifier (the same as site identifier 1) and a main-body identifier (the same as identifier 1 of the logistics company the site belongs to).
To automate sorting, the present application provides a network data processing system. Referring to Fig. 1, an embodiment of the network data processing system includes:
A server 100 and several site terminals 200 connected to the server 100.
Because one main body includes several site terminals (that is, one logistics company has multiple sites), the site terminals of each main body can be connected to the server 100 so that article sorting is carried out by the server 100. The several site terminals 200 connected to the server 100 are therefore the site terminals of the individual main bodies.
To automate sorting, the server 100 first stores the delivery scope of the sites at each level of multiple logistics companies. In the present application, the delivery scope of a logistics company is represented by address keywords. For example, if the delivery scope of a site is "Beijing", the address keyword "Beijing" represents the delivery scope of that site; likewise, if the delivery scope of a site is "Yuhua Road No. 001 to No. 100", the address keyword "Yuhua Road No. 001 to No. 100" represents the delivery scope of that site.
The present application can thus use the address keyword of each site to represent the delivery scope of that site, so the delivery scope of a logistics company can be represented by the set of address keywords of its sites. That is, the site identifiers and the address keywords corresponding to each site identifier together form the data set of one main body, and the data set of the main body serves as the delivery scope of the logistics company.
The process of storing the delivery scope of each logistics company on the server is described in detail below. As shown in Fig. 2, it includes the following steps:
Step S201: Receive, in advance, multiple main body identifiers and the data set corresponding to each main body identifier; each data set includes the site identifiers of each level and the address keywords corresponding to each site identifier.
The server can receive the data set of each main body from a site terminal or from another server. The data set of each main body includes a data set for each level, for example a provincial data set corresponding to the provincial level, a city data set corresponding to the city level, a county/district data set corresponding to the county/district level, and so on. The data set of each level contains the address keywords of the delivery scopes at that level and the site identifier corresponding to each address keyword.
Step S202: Store the data set of each main body in correspondence with its main body identifier.
Because the server stores the data sets of multiple main bodies, the data set of each main body is stored in correspondence with its main body identifier so that the data sets of different logistics companies can be distinguished. The server can use the main body identifiers and the data sets corresponding to them as a preset database that represents the delivery scope of each logistics company.
In this embodiment, the data set of each main body in the preset database is stored in a tree structure. Taking the data set of one main body as an example, the top-level parent nodes of the data set are the provincial data sets, the second-level leaf nodes are city data sets, the third-level leaf nodes are county/district data sets, and the fourth-level leaf nodes are township data sets or contract-area data sets. A contract-area data set can be a set of road information or a set of point-of-interest information.
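As an illustration of the tree structure described above, the following minimal Python sketch keeps one tree per main body identifier, with each node's children keyed by address keyword. The class name `SiteNode`, the helper `add_child` and all identifiers such as "company-A" and "site-baoding" are hypothetical and are not taken from the application.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class SiteNode:
    """One site in the tree; children are keyed by address keyword."""
    site_id: str
    level: str
    children: Dict[str, "SiteNode"] = field(default_factory=dict)


def add_child(parent: SiteNode, keyword: str, child: SiteNode) -> SiteNode:
    """Attach a subordinate site under the address keyword that covers it."""
    parent.children[keyword] = child
    return child


# Hypothetical data for one main body (one logistics company).
root = SiteNode("company-A", "company")
hebei = add_child(root, "Hebei Province", SiteNode("site-hebei", "province"))
baoding = add_child(hebei, "Baoding City", SiteNode("site-baoding", "city"))
mancheng = add_child(baoding, "Mancheng County", SiteNode("site-mancheng", "county"))
add_child(mancheng, "Shenxing Town", SiteNode("site-shenxing", "town"))

# Preset database: main body identifier -> data set (tree) of that main body.
preset_database = {"company-A": root}
```

In this layout the `children` mapping of a node plays the role of the subordinate data set of that site: address keywords on one side, subordinate sites on the other.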
After the process shown in Fig. 2 has been performed, the server stores a data set for each main body, and each data set includes the site identifiers of each level and the address keywords corresponding to each site identifier. In subsequent processing, the subordinate site of an article can be determined on the server from the data sets of the sites at each level.
Because the server later determines the subordinate site of an article from the preset database, the preset database must be kept accurate to ensure accurate sorting. The preset database consists of the data sets corresponding to the main body identifiers. However, the delivery scope of each level within a logistics company is not fixed; that is, the data set of each main body can change. The server can therefore update the existing data sets so that the data set of each main body always remains up to date.
The process by which the server updates a data set is described below. Referring to Fig. 3, it includes the following steps:
Step S301: Receive a main body identifier and the latest data set corresponding to that main body identifier.
When the data set of a main body changes, a site terminal can obtain the latest data set and then send the latest data set and the main body identifier to the server. The server receives the main body identifier and the latest data set sent by the site terminal.
Step S302: In the preset database, determine the existing data set corresponding to the main body identifier.
The preset database holds multiple main body identifiers and the data set corresponding to each main body identifier. After receiving the main body identifier and the latest data set, the server can therefore use the main body identifier to determine the corresponding existing data set in the preset database.
Step S303: Update the existing data set with the latest data set.
After the existing data set corresponding to the main body identifier has been determined, it is replaced with the latest data set, which updates the preset database.
Through the process shown in Fig. 3, the data set of each main body can always be kept up to date.
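Steps S301 to S303 can be sketched as a lookup followed by a replacement. The function name `update_data_set` and the dictionary layout are assumptions for illustration; the application does not say what happens when the main body identifier is unknown, so raising `KeyError` here is also an assumption.

```python
def update_data_set(preset_database, main_body_id, latest_data_set):
    """Replace the stored data set of one main body with the latest one.

    Returns the data set that was replaced.
    """
    # Step S302: locate the existing data set by main body identifier.
    if main_body_id not in preset_database:
        # Assumption: an unknown main body is treated as an error.
        raise KeyError(f"unknown main body identifier: {main_body_id}")
    existing = preset_database[main_body_id]
    # Step S303: replace the existing data set with the latest data set.
    preset_database[main_body_id] = latest_data_set
    return existing


# Hypothetical data: address keyword -> site identifier.
db = {"company-A": {"Beijing": "site-1"}}
old = update_data_set(db, "company-A", {"Beijing": "site-1", "Baoding City": "site-2"})
```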
The above describes the network data processing system and the process of storing the data set of each main body on the server. In subsequent processing, a site terminal can send the pending data information of an article to the server, and the server can then determine the subordinate site of the article from the pending data information.
Because the server performs the same process for every site terminal, one of the several site terminals is taken as the current site terminal, and the process of the present application is described in detail using the current site terminal as an example. It is understood that the process for the other site terminals is the same as that for the current site terminal.
On the basis of the above network data processing system, the present application also provides a network data processing method applied to the server. Referring to Fig. 4, the method specifically includes:
Step S401: Receive the pending data information sent by the current site terminal, where the pending data information includes the pending address of an article, the current site level, the current site identifier and the current main body identifier, and the current main body identifier is the identifier of the main body to which the current site belongs.
The server receives the pending data information sent by the current site terminal and uses it to determine the subordinate site of the article.
Step S402: Determine, as the pending address keyword, the administrative division information in the pending address that corresponds to the subordinate site level of the current site level, where the pending address includes multiple items of administrative division information and each item corresponds to one site level.
To determine the subordinate site of the article, a pending address keyword is determined from the pending address of the article, and the subordinate site of the article is then determined from the pending address keyword.
Because a main body has sites at multiple levels and sites at different levels use different address keywords, the corresponding address keyword can be extracted from the pending address according to the level of the current site, and the subordinate site is then determined on the server from that address keyword.
The step of determining the pending address keyword can be divided into the following three cases:
The first case: when the current site level is the provincial level, the city administrative division information in the pending address that corresponds to the city level is determined as the pending address keyword.
The current site level of the current site terminal can be obtained from the pending data information. When the current site level is the provincial level, the article should be sorted to a subordinate site of the provincial level, namely a city-level site. The present application therefore determines the administrative division information corresponding to the city level in the pending address.
For example, if the pending address is "Hebei Province Baoding City Beishi District People's Procuratorate", "Baoding City" can be extracted from the pending address and determined as the pending address keyword.
The second case: when the current site level is the city level, the county/district administrative division information in the pending address that corresponds to the county/district level is determined as the pending address keyword.
The current site level of the current site terminal is obtained from the pending data information. When the current site level is the city level, the article should be sorted to a subordinate site of the city level, namely a county/district-level site. The present application therefore determines the administrative division information corresponding to the county/district level in the pending address.
For example, if the pending address is "Hebei Province Baoding City Beishi District People's Procuratorate", "Beishi District" can be extracted from the pending address and determined as the pending address keyword.
The third case: when the current site level is the county/district level, either the township administrative division information in the pending address that corresponds to the township level, or the contract-area information in the pending address that corresponds to the contract-area level, is determined as the pending address keyword.
The current site level of the current site terminal is obtained from the pending data information. When the current site level is the county/district level, the article should be sorted to a subordinate site of the county/district level, namely a township-level site or a contract-area site.
The subordinate sites of a county/district-level site are thus of two kinds: township-level sites and contract-area sites. The pending address keyword is determined differently for each kind, as described below.
For a township-level site, the article needs to be delivered to a township under the county/district-level site, so the township administrative division information corresponding to the township level can be extracted from the pending address and determined as the pending address keyword. For example, if the pending address is "Hebei Province Baoding City Mancheng County Shenxing Town", "Shenxing Town" can be extracted from the pending address and determined as the pending address keyword.
For a contract-area site, the article needs to be delivered to a contract area under the county/district-level site, so the contract-area information corresponding to the contract-area level can be extracted from the pending address and determined as the pending address keyword.
The contract-area information corresponding to the contract-area level can be determined in two ways:
The first way: the pending road information in the pending address is determined as the contract-area information corresponding to the contract-area level.
Because contract areas are essentially divided by road, the road information in the pending address can be taken as the pending road information, and the pending road information can be determined as the contract-area information.
For example, if the pending address is "Hebei province Baoding Xinshi District PiceameyeriRehd. Et Wils. road 86 high aim electrical equipment", then "PiceameyeriRehd. Et Wils. road 86" in the pending address can be determined as the contract-area information corresponding to the contract-area level, and the pending road information "PiceameyeriRehd. Et Wils. road 86" can then be determined as the pending address keyword.
The second way: the pending point-of-interest information in the pending address is determined as the contract-area information corresponding to the contract-area level.
Because contract areas can in some cases be divided by point of interest, the point-of-interest information in the pending address can be taken as the pending point-of-interest information, and the pending point-of-interest information can be determined as the contract-area information.
For example, if the pending address is "Hebei province Baoding Xinshi District PiceameyeriRehd. Et Wils. road 86 high aim electrical equipment", then the point-of-interest information "high aim electrical equipment" in the pending address can be determined as the contract-area information corresponding to the contract-area level, and the pending point-of-interest information "high aim electrical equipment" can then be determined as the pending address keyword.
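The three cases of step S402 can be sketched as a table from the current site level to the address components of the next level down. The sketch assumes the pending address has already been split into named components; the application does not describe how that parsing is done. The field names, the level names and the order in which township, road and point of interest are tried are all hypothetical.

```python
# Current site level -> candidate components of the subordinate level, in the
# (assumed) order in which they are tried.
SUBORDINATE_FIELDS = {
    "province": ["city"],
    "city": ["county"],
    "county": ["town", "road", "poi"],
}


def pending_keyword(parsed_address, current_level):
    """Return (component name, keyword) for the subordinate level, or None."""
    for name in SUBORDINATE_FIELDS[current_level]:
        value = parsed_address.get(name)
        if value:
            return name, value
    return None


# Hypothetical pre-parsed form of the example address used above.
address = {
    "province": "Hebei Province",
    "city": "Baoding City",
    "county": "Mancheng County",
    "town": "Shenxing Town",
}
first_case = pending_keyword(address, "province")
second_case = pending_keyword(address, "city")
third_case = pending_keyword(address, "county")
```

Returning the component name together with the keyword lets the caller choose the matching strategy later: exact matching for administrative divisions, number-range matching for roads, and similarity matching for points of interest.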
Step S403: In the current data set that corresponds to the current main body identifier in the preset database, determine the subordinate data set corresponding to the subordinate site identifiers of the current site identifier, where the preset database includes multiple main body identifiers and the data set corresponding to each main body identifier, each data set includes the site identifiers of each level and the address keywords corresponding to each site identifier, and the subordinate data set includes several address keywords and the subordinate site identifiers in one-to-one correspondence with the address keywords.
The preset database holds data sets corresponding to multiple main body identifiers, and the data sets of different main body identifiers are the delivery scopes of different logistics companies. The current data set corresponding to the current main body identifier therefore has to be found in the preset database; the current data set represents the delivery scope of the logistics company to which the current site terminal belongs.
Each data set contains the data sets corresponding to the site identifiers of each level. Based on the current site level and the current site identifier in the pending data information, the subordinate data set corresponding to the subordinate site identifiers of the current site identifier is determined. The subordinate data set includes the subordinate site identifiers and several address keywords corresponding to them; each address keyword corresponds to one subordinate site identifier.
Step S404: If a target address keyword that successfully matches the pending address keyword is found in the subordinate data set, send the target subordinate site corresponding to the target address keyword to the current site terminal, where the target subordinate site is the subordinate site in the subordinate data set that is in one-to-one correspondence with the target address keyword.
The subordinate data set obtained in step S403 is searched for an address keyword that matches the pending address keyword. If an address keyword that successfully matches the pending address keyword is found, that address keyword is determined as the target address keyword.
The subordinate site corresponding to the successfully matched address keyword is determined in the subordinate data set, and that subordinate site is determined as the target subordinate site of the article, that is, the target subordinate site finally determined by the present application. After the target subordinate site has been determined, the server can push it to the current site terminal so that the current site terminal displays it and the user can check the subordinate site of the article, which completes the sorting of the article.
As can be seen from the above, the present application has the following advantages. The preset database of the server stores the data set of each main body, and each data set includes the site identifiers of each level and the address keywords corresponding to each site identifier; this is analogous to a person memorizing the delivery scopes of the subordinate sites. After the server receives a piece of pending data information, it determines from the pending address the pending address keyword to be matched against the data set and matches the pending address keyword against the subordinate data set, thereby obtaining the target subordinate site corresponding to the pending address keyword; this is analogous to a person mentally judging the subordinate site of an article.
The present application carries out the process of determining the subordinate site of an article on the server. Because manual operation is avoided, sorting efficiency and accuracy can be improved.
For step S404 shown in Fig. 4, several cases of searching for an address keyword that matches the pending address keyword are described below:
The first case: exact query.
As shown in Fig. 5, the first case can include the following steps:
Step S501: When the pending address keyword is city administrative division information, county/district administrative division information or township administrative division information, search the subordinate data set for an address keyword that is exactly the same as the pending address keyword.
From the cases described for step S402, the pending address keyword of the present application can be one of four kinds of information: city administrative division information, county/district administrative division information, township administrative division information and contract-area information. For the first three kinds, exact matching is generally used when matching against the address keywords in the subordinate data set, because fuzzy matching could produce inaccurate matching results and lead to incorrect sorting.
Step S502: Determine the address keyword in the subordinate data set that is exactly the same as the pending address keyword as the target address keyword that successfully matches the pending address keyword.
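Steps S501 and S502 amount to an exact lookup in the subordinate data set. In the sketch below the subordinate data set is assumed to be a mapping from address keyword to subordinate site identifier; the function name `exact_match` and the sample identifiers are hypothetical.

```python
from typing import Dict, Optional, Tuple


def exact_match(pending_keyword: str,
                subordinate_data_set: Dict[str, str]) -> Optional[Tuple[str, str]]:
    """Return (target address keyword, target subordinate site id), or None."""
    site_id = subordinate_data_set.get(pending_keyword)
    if site_id is None:
        return None
    return pending_keyword, site_id


# Hypothetical subordinate data set of a city-level site in Baoding.
baoding_subordinates = {
    "Beishi District": "site-beishi",
    "Xinshi District": "site-xinshi",
    "Mancheng County": "site-mancheng",
}
hit = exact_match("Beishi District", baoding_subordinates)
miss = exact_match("Beishi", baoding_subordinates)
```

Here `"Beishi"` deliberately returns no match, reflecting the point made above that administrative divisions are matched exactly rather than fuzzily.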
The first case covers a pending address keyword that is city administrative division information, county/district administrative division information or township administrative division information. The case in which the pending address keyword is contract-area information is described next. Because contract-area information can be either pending road information or pending point-of-interest information, the query process for contract-area information is described in two parts:
The second case: the pending address keyword is pending road information. The pending road information includes a pending road name and a pending road number, and the address keywords in the subordinate data set include road information in one-to-one correspondence with the subordinate site identifiers.
As shown in Fig. 6, the second case includes the following steps:
Step S601: Search the subordinate data set for an address keyword whose road name is exactly the same as the pending road name and whose road number range contains the pending road number.
The address keywords of the subordinate data set include road information in one-to-one correspondence with the subordinate site identifiers. In general, the road information corresponding to a site identifier includes a road name and a road number range, for example "Yuhua Road No. 001 to No. 100".
The name in the pending road information can be matched against each road name in the subordinate data set to find an address keyword whose road name is exactly the same as the pending road name. Once such an address keyword has been found, it is then judged whether the pending road number falls within the road number range of that address keyword. If it does, that address keyword is determined to be the address keyword that successfully matches the pending address keyword.
Step S602: Determine the address keyword in the subordinate data set whose road name is exactly the same as the pending road name and whose road number range contains the pending road number as the target address keyword that successfully matches the pending address keyword.
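The road lookup of steps S601 and S602 can be sketched as an exact comparison of road names followed by a range check on the road number. The record type `RoadEntry`, its field names and the sample data are hypothetical; the sketch also assumes road numbers have already been parsed into integers.

```python
from typing import List, NamedTuple, Optional


class RoadEntry(NamedTuple):
    """One address keyword of the form 'road name, No. first to No. last'."""
    road_name: str
    first_number: int
    last_number: int
    site_id: str


def match_road(pending_name: str, pending_number: int,
               entries: List[RoadEntry]) -> Optional[RoadEntry]:
    """Return the entry with the same road name whose range holds the number."""
    for entry in entries:
        if entry.road_name != pending_name:
            continue  # the road name must be exactly the same
        if entry.first_number <= pending_number <= entry.last_number:
            return entry
    return None


# Hypothetical subordinate data set: one road split between two sites.
roads = [
    RoadEntry("Yuhua Road", 1, 100, "site-A"),
    RoadEntry("Yuhua Road", 101, 200, "site-B"),
]
target = match_road("Yuhua Road", 86, roads)
```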
The third case: the pending address keyword is pending point-of-interest information. The address keywords in the subordinate data set include point-of-interest information in one-to-one correspondence with the subordinate site identifiers, and the target address keyword that successfully matches the pending address keyword is then found in the subordinate data set.
As shown in Fig. 7, the third case specifically includes the following steps:
Step S701: Calculate the similarity between the pending point-of-interest information and the point-of-interest information in the subordinate data set.
Point-of-interest information can be the name of a building, and people commonly use abbreviations for building names. For example, "XX scientific and technical research institutes" may be shortened to "XX academy" or "XX Research Center". For point-of-interest information, exact matching is therefore not used, and fuzzy matching is used instead.
To ensure the accuracy of the matching, the present application calculates the similarity between the pending point of interest and every point of interest in the subordinate data set. It is understood that the higher the similarity, the better the pending point-of-interest information matches the point-of-interest information.
Both the pending point-of-interest information and the point-of-interest information are character strings. Existing methods for calculating string similarity include calculating the cosine distance between two strings, where a smaller distance indicates a higher similarity, and calculating the edit distance between two strings, where a smaller edit distance indicates a higher similarity. Other methods of calculating the similarity between two strings can of course also be used to calculate the similarity between the pending point-of-interest information and the point-of-interest information.
Step S702: Determine the point-of-interest information in the subordinate data set that has the highest similarity to the pending point-of-interest information as the target address keyword.
The point-of-interest information in the subordinate data set with the highest similarity to the pending point-of-interest information is the one closest to the pending point of interest, so the two can be considered the same. That point-of-interest information is then determined as the target address keyword corresponding to the pending point-of-interest information.
One process for determining the similarity is described below, as shown in Fig. 8.
Step S801: Calculate the edit distance between the pending point-of-interest information and the point-of-interest information, where the edit distance is the minimum number of edit operations required to convert the pending point-of-interest information into the point-of-interest information, and the edit operations are replacing one character with another character, inserting one character and deleting one character.
The process of calculating the similarity between pending point-of-interest information and point-of-interest information is described in detail below, taking the pending point-of-interest information "浙江工业大学" (Zhejiang University of Technology) and the point-of-interest information "浙江理工大学" (Zhejiang Sci-Tech University) as an example.
First, the edit distance between the pending point-of-interest information and the point-of-interest information is calculated, where the edit distance is the minimum number of edit operations required to convert the pending point-of-interest information into the point-of-interest information. Converting "浙江工业大学" into "浙江理工大学" requires replacing "工" with "理" and replacing "业" with "工". The edit distance between the pending point-of-interest information and the point-of-interest information is therefore 2.
Step S802: Calculate the number of common substrings of the pending point-of-interest information and the point-of-interest information, where two or more adjacent characters that are identical in the pending point-of-interest information and the point-of-interest information form one common substring.
The common substrings of the pending point-of-interest information and the point-of-interest information are determined. Comparing the pending point of interest "浙江工业大学" with the point-of-interest information "浙江理工大学" shows that "浙江" is one common substring and "大学" is another. The number of common substrings of the pending point-of-interest information and the point-of-interest information is therefore 2.
Step S803: From the edit distance and the number of common substrings, calculate the similarity between the pending point-of-interest information and the point-of-interest information using a preset formula.
Assume the preset formula is: similarity = S2 * [1 + p / (S1 + 1)], where S1 is the edit distance, S2 is the number of common substrings, and p is an adjustable parameter whose value can be chosen according to the circumstances. Other formulas that take both the edit distance and the number of common substrings into account can of course also be used to obtain the similarity between the pending point-of-interest information and the point-of-interest information.
Following the steps shown in Fig. 8, the similarity between the pending point-of-interest information and the point-of-interest information can be calculated.
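The steps of Fig. 8, together with the selection in step S702, can be sketched as follows. The edit distance is the standard Levenshtein distance. The application defines a common substring only informally, so the left-to-right greedy count below is one possible reading, not necessarily the applicant's; the function names and the default `p = 1.0` are likewise assumptions.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum replacements, insertions and deletions."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # delete ca
                           cur[j - 1] + 1,             # insert cb
                           prev[j - 1] + (ca != cb)))  # replace or keep
        prev = cur
    return prev[-1]


def count_common_substrings(a: str, b: str) -> int:
    """Greedy count of runs of two or more characters of a that occur in b."""
    count, i = 0, 0
    while i < len(a) - 1:
        if a[i:i + 2] not in b:
            i += 1
            continue
        k = 2
        while i + k < len(a) and a[i:i + k + 1] in b:
            k += 1  # extend the run while it still occurs in b
        count += 1
        i += k
    return count


def similarity(pending: str, poi: str, p: float = 1.0) -> float:
    """similarity = S2 * [1 + p / (S1 + 1)] as given in the text."""
    s1 = edit_distance(pending, poi)
    s2 = count_common_substrings(pending, poi)
    return s2 * (1 + p / (s1 + 1))


def best_match(pending: str, pois):
    """Step S702: pick the point of interest with the highest similarity."""
    return max(pois, key=lambda poi: similarity(pending, poi))


s1 = edit_distance("浙江工业大学", "浙江理工大学")            # 2
s2 = count_common_substrings("浙江工业大学", "浙江理工大学")  # 2
```

With `p = 1.0` the example pair scores 2 * (1 + 1/3), roughly 2.67. Unlike a raw edit distance, a larger value here means a closer match.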
The above describes how the server determines the target subordinate site from the pending data information. The process by which the current site terminal builds the pending data information is described below. Referring to Fig. 9, the present application also provides a network data processing method applied to the current site terminal, the method including:
Step S901: Obtain the pending address of an article at the current site.
The user can select an article at the current site that needs to be assigned to the target subordinate site corresponding to the address of the article. To automate the determination of the target subordinate site, the current site terminal corresponding to the current site determines the pending data information of the article, and the server then determines the subordinate site of the article from the pending data information.
From the processing performed by the server, it is known that the pending data information includes the pending address, the current site level, the current site identifier and the current main body identifier, where the current main body identifier is the identifier of the main body to which the current site belongs.
The pending address of the article is therefore obtained first. The current site terminal can obtain the pending address of the article in the following two ways:
The first way: receive the pending address entered by the user through an input device.
The user can enter the pending address into the current site terminal using an input device connected to the current site terminal or an input device of the current site terminal itself (for example, a keyboard). This way is relatively slow and error-prone.
The second way: receive the pending address obtained by a scanning device scanning a QR code or barcode.
The packaging of articles now generally carries a QR code or barcode that contains information about the article. When the QR code or barcode contains the address information of the article, the pending address can be obtained by scanning the QR code or barcode.
Step S902: Build the pending data information from the pending address, the current site level, the current site identifier and the current main body identifier, where the current main body identifier is the identifier of the main body to which the current site belongs.
The current site terminal is configured with the current site level and current site identifier of the current site and with the current main body identifier of the logistics company to which the current site belongs. After the pending address of the article has been obtained, the pending data information can therefore be built from the pending address, the current site level, the current site identifier and the current main body identifier.
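Step S902 can be sketched as combining the scanned or typed address with values preconfigured on the terminal. The record type `PendingDataInfo`, its field names, the sample values and the use of JSON as the wire format are all assumptions; the application does not specify a message format.

```python
import json
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class PendingDataInfo:
    """The four fields the text lists for pending data information."""
    pending_address: str
    current_site_level: str
    current_site_id: str
    current_main_body_id: str


def build_pending_data(pending_address: str, terminal_config: dict) -> PendingDataInfo:
    """Combine the article's address with the terminal's preset attributes."""
    return PendingDataInfo(
        pending_address=pending_address,
        current_site_level=terminal_config["site_level"],
        current_site_id=terminal_config["site_id"],
        current_main_body_id=terminal_config["main_body_id"],
    )


# Hypothetical configuration stored on a city-level site terminal.
config = {"site_level": "city", "site_id": "site-baoding", "main_body_id": "company-A"}
info = build_pending_data(
    "Hebei Province Baoding City Beishi District People's Procuratorate", config)

# Serialized form that could be sent to the server (format is an assumption).
payload = json.dumps(asdict(info), ensure_ascii=False)
```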
Step S903: Send the pending data information to the server.
The current site terminal can send the pending data information to the server over a communication connection, so that the server can process the pending data information further in the manner of the embodiment shown in Fig. 4, determine the target subordinate site of the article, and then send the target subordinate site to the current site terminal.
After receiving the target subordinate site sent by the server, the current site terminal displays it, so that the user can check the target subordinate site and sort the article to that site.
Corresponding to the embodiment shown in Fig. 4, and as shown in Fig. 10, the present application also provides a network data processing apparatus, including:
a first receiving unit 101, configured to receive the pending data information sent by the current site terminal, wherein the pending data information includes the pending address of an article, the current site level, the current site identifier and the current subject identifier, and the current subject identifier identifies the entity to which the current site belongs;
a first determining unit 102, configured to determine, as the pending address keyword, the administrative division information in the pending address that corresponds to the site level subordinate to the current site level, wherein the pending address includes multiple pieces of administrative division information, each corresponding to one site level;
a second determining unit 103, configured to determine, in the current data set corresponding to the current subject identifier in a preset database, the subordinate data set corresponding to the subordinate site identifiers of the current site identifier, wherein the preset database includes multiple subject identifiers and a data set corresponding to each subject identifier, each data set includes the site identifiers of every level and the address keywords corresponding to each site identifier, and the subordinate data set includes several address keywords and the subordinate site identifiers in one-to-one correspondence with those address keywords;
a target address keyword lookup unit 104, configured to find, in the subordinate data set, a target address keyword that successfully matches the pending address keyword;
a first sending unit 105, configured to send, if a target address keyword that successfully matches the pending address keyword is found in the subordinate data set, the target subordinate site corresponding to the target address keyword to the current site terminal, wherein the target subordinate site is the subordinate site in one-to-one correspondence with the target address keyword in the subordinate data set.
In addition, the network data processing apparatus provided by the present application further includes:
an updating unit 106, configured to receive a subject identifier and the latest data set corresponding to that subject identifier; determine, in the preset database, the existing data set corresponding to that subject identifier; and update the existing data set using the latest data set.
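The behavior of the updating unit can be sketched as follows. Modeling the preset database as a dictionary keyed by subject identifier, and treating "update" as wholesale replacement of the existing data set, are both assumptions; the application does not say whether the latest data set replaces or is merged into the existing one.

```python
from typing import Dict, Optional


def update_data_set(preset_db: Dict[str, dict], subject_id: str,
                    latest: dict) -> Optional[dict]:
    """Replace the data set stored for subject_id and return the one it replaced."""
    existing = preset_db.get(subject_id)   # the "existing data set"; None if absent
    preset_db[subject_id] = dict(latest)   # store a copy of the latest data set
    return existing


# Hypothetical data: one company whose city site gains a second subordinate site.
db = {"CO-A": {"HZ-01": {"Xihu District": "HZ-XH-01"}}}
old = update_data_set(db, "CO-A", {"HZ-01": {"Xihu District": "HZ-XH-01",
                                              "Binjiang District": "HZ-BJ-01"}})
```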
As shown in Fig. 11, the first determining unit 102 includes:
a provincial-level determining unit 111, configured to determine, when the current site level is the provincial level, the city administrative division information in the pending address that corresponds to the city level as the pending address keyword;
a city-level determining unit 112, configured to determine, when the current site level is the city level, the district administrative division information in the pending address that corresponds to the county/district level as the pending address keyword;
a county/district-level determining unit 113, configured to determine, when the current site level is the county/district level, either the township administrative division information in the pending address that corresponds to the township level, or the contract area information in the pending address that corresponds to the contract area level, as the pending address keyword.
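The level-to-keyword selection performed by units 111 to 113 can be sketched as a simple lookup. The sketch assumes the pending address has already been parsed into named fields, and that a county/district-level site tries the township field before the contract area field; both the field names and that ordering are assumptions, since the application only says "or".

```python
from typing import Dict, Optional

# Current site level -> address fields of the subordinate level, in lookup order.
SUBORDINATE_FIELDS = {
    "province": ("city",),
    "city": ("district",),
    "district": ("township", "contract_area"),
}


def pick_pending_keyword(address: Dict[str, str], site_level: str) -> Optional[str]:
    """Return the administrative division information one level below site_level."""
    for field in SUBORDINATE_FIELDS[site_level]:
        value = address.get(field)
        if value:
            return value
    return None


address = {"province": "Zhejiang", "city": "Hangzhou",
           "district": "Xihu District", "contract_area": "Wenyi Road 10"}
keyword = pick_pending_keyword(address, "city")
```

A provincial site therefore sorts by city, a city site by district, and a county/district site by township or contract area.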
The target address keyword lookup unit 104 in Fig. 10 can be implemented in the following three specific ways:
As shown in Fig. 12, the first implementation of the target address keyword lookup unit 104 includes:
a first lookup unit 121, configured to search the subordinate data set for an address keyword that is exactly identical to the pending address keyword, when the pending address keyword is city administrative division information, district administrative division information or township administrative division information;
a first target address keyword determining unit 122, configured to determine the address keyword in the subordinate data set that is exactly identical to the pending address keyword as the target address keyword that successfully matches the pending address keyword.
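The first implementation amounts to an exact-match lookup. The sketch below assumes a simplified layout of the preset database (subject identifier, then current site identifier, then a mapping from address keyword to subordinate site identifier); this nesting and all identifiers are hypothetical and chosen only to make the lookup concrete.

```python
from typing import Dict, Optional

# subject id -> current site id -> {address keyword: subordinate site id}
PresetDb = Dict[str, Dict[str, Dict[str, str]]]

preset_db: PresetDb = {
    "CO-A": {
        "HZ-01": {
            "Xihu District": "HZ-XH-01",
            "Binjiang District": "HZ-BJ-01",
        },
    },
}


def find_target_site(db: PresetDb, subject_id: str, site_id: str,
                     pending_keyword: str) -> Optional[str]:
    """Return the subordinate site whose keyword is exactly the pending keyword."""
    subordinate_set = db.get(subject_id, {}).get(site_id, {})
    return subordinate_set.get(pending_keyword)  # None when no keyword matches


target = find_target_site(preset_db, "CO-A", "HZ-01", "Xihu District")
```

Because the keywords are in one-to-one correspondence with subordinate sites, a successful match identifies exactly one target subordinate site.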
As shown in Fig. 13, the second implementation of the target address keyword lookup unit 104 is as follows:
In the second implementation, the contract area information corresponding to the contract area level is the pending road information in the pending address. The pending road information includes a pending road name and a pending road number, and the address keywords in the subordinate data set include road information in one-to-one correspondence with each subordinate site identifier.
The target address keyword lookup unit 104 then specifically includes:
a second lookup unit 131, configured to search the subordinate data set for an address keyword whose road name is exactly identical to the pending road name and which contains the pending road number;
a second target address keyword determining unit 132, configured to determine the address keyword in the subordinate data set whose road name is exactly identical to the pending road name and which contains the pending road number as the target address keyword that successfully matches the pending address keyword.
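A minimal sketch of the second implementation follows. It assumes each road entry in the subordinate data set stores a road name together with a contiguous range of road numbers; the application only requires that the keyword "contains" the pending road number, so the range representation and the sample data are assumptions.

```python
from typing import List, Optional, Tuple

# (road name, road numbers covered, subordinate site id) -- hypothetical entries
RoadEntry = Tuple[str, range, str]

road_entries: List[RoadEntry] = [
    ("Wenyi Road", range(1, 101), "S-01"),     # numbers 1..100
    ("Wenyi Road", range(101, 301), "S-02"),   # numbers 101..300
    ("Wensan Road", range(1, 501), "S-03"),    # numbers 1..500
]


def match_road(entries: List[RoadEntry], road_name: str,
               road_number: int) -> Optional[str]:
    """Return the site whose entry has the same road name and covers the number."""
    for name, numbers, site_id in entries:
        if name == road_name and road_number in numbers:
            return site_id
    return None


site = match_road(road_entries, "Wenyi Road", 150)
```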
As shown in Fig. 14, the third implementation of the target address keyword lookup unit 104 is as follows:
In the third implementation, the contract area information corresponding to the contract area level is the pending point-of-interest information in the pending address, and the address keywords in the subordinate data set include point-of-interest information in one-to-one correspondence with each subordinate site identifier. The target address keyword lookup unit 104 then includes:
a similarity calculating unit 141, configured to calculate the similarity between the pending point-of-interest information and the point-of-interest information in the subordinate data set;
a third target address keyword determining unit 142, configured to determine the point-of-interest information in the subordinate data set that has the highest similarity to the pending point-of-interest information as the target address keyword.
As shown in Fig. 15, the similarity calculating unit 141 includes:
an edit distance calculating unit 151, configured to calculate the edit distance between the pending point-of-interest information and the point-of-interest information, wherein the edit distance is the minimum number of edit operations needed to convert the pending point-of-interest information into the point-of-interest information, and the edit operations include replacing one character with another character, inserting a character, and deleting a character;
a common substring calculating unit 152, configured to calculate the number of common substrings of the pending point-of-interest information and the point-of-interest information, wherein, among the characters that are identical in the pending point-of-interest information and the point-of-interest information, two or more adjacent characters form one common substring;
a calculating unit 153, configured to calculate, based on a preset formula, the similarity between the pending point-of-interest information and the point-of-interest information from the edit distance and the number of common substrings.
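The similarity calculation of units 151 to 153 can be sketched as follows. The edit distance is the standard Levenshtein distance. Counting common substrings via the matching blocks of Python's `difflib.SequenceMatcher`, and the scoring formula `count / (1 + distance)`, are assumptions: the application refers only to a "preset formula" and does not disclose it.

```python
from difflib import SequenceMatcher
from typing import Iterable


def edit_distance(a: str, b: str) -> int:
    """Minimum number of single-character replacements, insertions and deletions."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # delete ca
                           cur[j - 1] + 1,             # insert cb
                           prev[j - 1] + (ca != cb)))  # replace, or keep if equal
        prev = cur
    return prev[-1]


def count_common_substrings(a: str, b: str) -> int:
    """Count matching runs of two or more adjacent identical characters."""
    blocks = SequenceMatcher(None, a, b, autojunk=False).get_matching_blocks()
    return sum(1 for block in blocks if block.size >= 2)


def similarity(a: str, b: str) -> float:
    """Hypothetical stand-in for the preset formula: more shared runs and
    fewer edits give a higher score."""
    return count_common_substrings(a, b) / (1 + edit_distance(a, b))


def best_match(pending_poi: str, candidates: Iterable[str]) -> str:
    """Return the candidate point of interest with the highest similarity."""
    return max(candidates, key=lambda poi: similarity(pending_poi, poi))


best = best_match("lakeside plaza", ["north station", "lakeside plaza tower"])
```

Unlike the exact-match implementations, this approach always returns some candidate, so a deployment would likely also want a minimum similarity threshold; that threshold is not part of the application as described here.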
Corresponding to the network data processing method shown in Fig. 9, the present application also provides a network data processing apparatus. As shown in Fig. 16, the apparatus specifically includes:
an acquiring unit 161, configured to obtain the pending address of an article at the current site,
wherein the acquiring unit 161 is specifically configured to receive the pending address entered by a user through an input device, or to receive the pending address sent by a scanning device, the pending address being obtained by the scanning device by scanning a two-dimensional code or a barcode;
a building unit 162, configured to build the pending data information from the pending address, the current site level, the current site identifier and the current subject identifier, wherein the current subject identifier identifies the entity to which the current site belongs;
a second sending unit 163, configured to send the pending data information to the server;
wherein the pending data information is used by the server to: receive the pending data information sent by the current site terminal; determine, as the pending address keyword, the administrative division information in the pending address that corresponds to the site level subordinate to the current site level, wherein the pending address includes multiple pieces of administrative division information, each corresponding to one site level; determine, in the current data set corresponding to the current subject identifier in a preset database, the subordinate data set corresponding to the subordinate site identifiers of the current site identifier, wherein the preset database includes multiple subject identifiers and a data set corresponding to each subject identifier, each data set includes the site identifiers of every level and the address keywords corresponding to each site identifier, and the subordinate data set includes several address keywords and the subordinate site identifiers in one-to-one correspondence with those address keywords; and, if a target address keyword that successfully matches the pending address keyword is found in the subordinate data set, send the target subordinate site corresponding to the target address keyword to the current site terminal, wherein the target subordinate site is the subordinate site in one-to-one correspondence with the target address keyword in the subordinate data set.
In addition, the network data processing apparatus provided by the present application further includes:
a second receiving unit 164, configured to receive the target subordinate site corresponding to the target address keyword sent by the server;
a data sending unit 165, configured to send to the server the current subject identifier and the latest data set corresponding to the current subject identifier.
Referring to Fig. 1, the present application provides a network data processing system, including:
a server 100 and several site terminals 200 connected to the server, any one of the several site terminals being the current site terminal;
the current site terminal 200 is configured to obtain the pending address of an article at the current site; build the pending data information from the pending address, the current site level, the current site identifier and the current subject identifier, wherein the current subject identifier identifies the entity to which the current site belongs; and send the pending data information to the server;
the server 100 is configured to: receive the pending data information sent by the current site terminal; determine, as the pending address keyword, the administrative division information in the pending address that corresponds to the site level subordinate to the current site level, wherein the pending address includes multiple pieces of administrative division information, each corresponding to one site level; determine, in the current data set corresponding to the current subject identifier in a preset database, the subordinate data set corresponding to the subordinate site identifiers of the current site identifier, wherein the preset database includes multiple subject identifiers and a data set corresponding to each subject identifier, each data set includes the site identifiers of every level and the address keywords corresponding to each site identifier, and the subordinate data set includes several address keywords and the subordinate site identifiers in one-to-one correspondence with those address keywords; and, if a target address keyword that successfully matches the pending address keyword is found in the subordinate data set, send the target subordinate site corresponding to the target address keyword to the current site terminal, wherein the target subordinate site is the subordinate site in one-to-one correspondence with the target address keyword in the subordinate data set.
If the functions described in the methods of the present embodiments are implemented in the form of software functional units and sold or used as independent products, they can be stored in a storage medium readable by a computing device. Based on this understanding, the part of the embodiments of the present application that contributes to the prior art, or a part of the technical solution, can be embodied in the form of a software product. The software product is stored in a storage medium and includes several instructions that cause a computing device (which may be a personal computer, a server, a mobile computing device, a network device or the like) to perform all or some of the steps of the methods described in the embodiments of the present application. The aforementioned storage medium includes various media that can store program code, such as a USB flash drive, a portable hard drive, read-only memory (ROM), random access memory (RAM), a magnetic disk or an optical disc.
The embodiments in this specification are described in a progressive manner. Each embodiment focuses on its differences from the other embodiments, and the same or similar parts of the embodiments can be understood by reference to one another.
The foregoing description of the disclosed embodiments enables those skilled in the art to implement or use the present application. Various modifications to these embodiments will be apparent to those skilled in the art, and the general principles defined herein can be implemented in other embodiments without departing from the spirit or scope of the present application. Therefore, the present application is not to be limited to the embodiments shown herein, but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims (25)

1. A network data processing method, characterized by comprising:
receiving pending data information sent by a current site terminal, wherein the pending data information includes the pending address of an article, a current site level, a current site identifier and a current subject identifier, and the current subject identifier identifies the entity to which the current site belongs;
determining, as a pending address keyword, the administrative division information in the pending address that corresponds to the site level subordinate to the current site level, wherein the pending address includes multiple pieces of administrative division information, each corresponding to one site level;
determining, in the current data set corresponding to the current subject identifier in a preset database, the subordinate data set corresponding to the subordinate site identifiers of the current site identifier, wherein the preset database includes multiple subject identifiers and a data set corresponding to each subject identifier, each data set includes the site identifiers of every level and the address keywords corresponding to each site identifier, and the subordinate data set includes several address keywords and the subordinate site identifiers in one-to-one correspondence with those address keywords;
if a target address keyword that successfully matches the pending address keyword is found in the subordinate data set, sending the target subordinate site corresponding to the target address keyword to the current site terminal, wherein the target subordinate site is the subordinate site in one-to-one correspondence with the target address keyword in the subordinate data set.
2. The method of claim 1, characterized in that determining, as the pending address keyword, the administrative division information in the pending address that corresponds to the site level subordinate to the current site level comprises:
when the current site level is the provincial level, determining the city administrative division information in the pending address that corresponds to the city level as the pending address keyword;
when the current site level is the city level, determining the district administrative division information in the pending address that corresponds to the county/district level as the pending address keyword;
when the current site level is the county/district level, determining either the township administrative division information in the pending address that corresponds to the township level, or the contract area information in the pending address that corresponds to the contract area level, as the pending address keyword.
3. The method of claim 2, characterized in that finding, in the subordinate data set, a target address keyword that successfully matches the pending address keyword comprises:
when the pending address keyword is city administrative division information, district administrative division information or township administrative division information, searching the subordinate data set for an address keyword that is exactly identical to the pending address keyword;
determining the address keyword in the subordinate data set that is exactly identical to the pending address keyword as the target address keyword that successfully matches the pending address keyword.
4. The method of claim 2, characterized in that determining the contract area information in the pending address that corresponds to the contract area level comprises:
determining the pending road information in the pending address as the contract area information corresponding to the contract area level.
5. The method of claim 4, characterized in that the pending road information includes a pending road name and a pending road number, and the address keywords in the subordinate data set include road information in one-to-one correspondence with each subordinate site identifier; finding, in the subordinate data set, a target address keyword that successfully matches the pending address keyword then comprises:
searching the subordinate data set for an address keyword whose road name is exactly identical to the pending road name and which contains the pending road number;
determining the address keyword in the subordinate data set whose road name is exactly identical to the pending road name and which contains the pending road number as the target address keyword that successfully matches the pending address keyword.
6. The method of claim 2, characterized in that determining the contract area information in the pending address that corresponds to the contract area level comprises:
determining the pending point-of-interest information in the pending address as the contract area information corresponding to the contract area level.
7. The method of claim 6, characterized in that the address keywords in the subordinate data set include point-of-interest information in one-to-one correspondence with each subordinate site identifier; finding, in the subordinate data set, a target address keyword that successfully matches the pending address keyword then comprises:
calculating the similarity between the pending point-of-interest information and the point-of-interest information in the subordinate data set;
determining the point-of-interest information in the subordinate data set that has the highest similarity to the pending point-of-interest information as the target address keyword.
8. The method of claim 7, characterized in that calculating the similarity between the pending point-of-interest information and the point-of-interest information in the subordinate data set comprises:
calculating the edit distance between the pending point-of-interest information and the point-of-interest information, wherein the edit distance is the minimum number of edit operations needed to convert the pending point-of-interest information into the point-of-interest information, and the edit operations include replacing one character with another character, inserting a character, and deleting a character;
calculating the number of common substrings of the pending point-of-interest information and the point-of-interest information, wherein, among the characters that are identical in the pending point-of-interest information and the point-of-interest information, two or more adjacent characters form one common substring;
calculating, by a preset formula, the similarity between the pending point-of-interest information and the point-of-interest information from the edit distance and the number of common substrings.
9. The method of any one of claims 1 to 8, characterized by further comprising:
receiving a subject identifier and the latest data set corresponding to that subject identifier;
determining, in the preset database, the existing data set corresponding to that subject identifier;
updating the existing data set using the latest data set.
10. A network data processing method, characterized by comprising:
obtaining the pending address of an article at a current site;
building pending data information from the pending address, a current site level, a current site identifier and a current subject identifier, wherein the current subject identifier identifies the entity to which the current site belongs;
sending the pending data information to a server;
wherein the pending data information is used by the server to: receive the pending data information sent by the current site terminal; determine, as a pending address keyword, the administrative division information in the pending address that corresponds to the site level subordinate to the current site level, wherein the pending address includes multiple pieces of administrative division information, each corresponding to one site level; determine, in the current data set corresponding to the current subject identifier in a preset database, the subordinate data set corresponding to the subordinate site identifiers of the current site identifier, wherein the preset database includes multiple subject identifiers and a data set corresponding to each subject identifier, each data set includes the site identifiers of every level and the address keywords corresponding to each site identifier, and the subordinate data set includes several address keywords and the subordinate site identifiers in one-to-one correspondence with those address keywords; and, if a target address keyword that successfully matches the pending address keyword is found in the subordinate data set, send the target subordinate site corresponding to the target address keyword to the current site terminal, wherein the target subordinate site is the subordinate site in one-to-one correspondence with the target address keyword in the subordinate data set.
11. The method of claim 10, characterized in that obtaining the pending address comprises:
receiving the pending address entered by a user through an input device; or
receiving the pending address sent by a scanning device, wherein the pending address is obtained by the scanning device by scanning a two-dimensional code or a barcode.
12. The method of claim 11, characterized by further comprising:
receiving the target subordinate site corresponding to the target address keyword sent by the server.
13. The method of claim 11, characterized by further comprising:
sending to the server the current subject identifier and the latest data set corresponding to the current subject identifier.
14. A network data processing apparatus, characterized by comprising:
a first receiving unit, configured to receive pending data information sent by a current site terminal, wherein the pending data information includes the pending address of an article, a current site level, a current site identifier and a current subject identifier, and the current subject identifier identifies the entity to which the current site belongs;
a first determining unit, configured to determine, as a pending address keyword, the administrative division information in the pending address that corresponds to the site level subordinate to the current site level, wherein the pending address includes multiple pieces of administrative division information, each corresponding to one site level;
a second determining unit, configured to determine, in the current data set corresponding to the current subject identifier in a preset database, the subordinate data set corresponding to the subordinate site identifiers of the current site identifier, wherein the preset database includes multiple subject identifiers and a data set corresponding to each subject identifier, each data set includes the site identifiers of every level and the address keywords corresponding to each site identifier, and the subordinate data set includes several address keywords and the subordinate site identifiers in one-to-one correspondence with those address keywords;
a target address keyword lookup unit, configured to find, in the subordinate data set, a target address keyword that successfully matches the pending address keyword;
a first sending unit, configured to send, if a target address keyword that successfully matches the pending address keyword is found in the subordinate data set, the target subordinate site corresponding to the target address keyword to the current site terminal, wherein the target subordinate site is the subordinate site in one-to-one correspondence with the target address keyword in the subordinate data set.
15. The apparatus of claim 14, characterized in that the first determining unit comprises:
a provincial-level determining unit, configured to determine, when the current site level is the provincial level, the city administrative division information in the pending address that corresponds to the city level as the pending address keyword;
a city-level determining unit, configured to determine, when the current site level is the city level, the district administrative division information in the pending address that corresponds to the county/district level as the pending address keyword;
a county/district-level determining unit, configured to determine, when the current site level is the county/district level, either the township administrative division information in the pending address that corresponds to the township level, or the contract area information in the pending address that corresponds to the contract area level, as the pending address keyword.
16. The apparatus of claim 15, characterized in that the target address keyword lookup unit comprises:
a first lookup unit, configured to search the subordinate data set for an address keyword that is exactly identical to the pending address keyword, when the pending address keyword is city administrative division information, district administrative division information or township administrative division information;
a first target address keyword determining unit, configured to determine the address keyword in the subordinate data set that is exactly identical to the pending address keyword as the target address keyword that successfully matches the pending address keyword.
17. The apparatus of claim 15, characterized in that the contract area information corresponding to the contract area level is the pending road information in the pending address, the pending road information includes a pending road name and a pending road number, and the address keywords in the subordinate data set include road information in one-to-one correspondence with each subordinate site identifier; the target address keyword lookup unit then comprises:
a second lookup unit, configured to search the subordinate data set for an address keyword whose road name is exactly identical to the pending road name and which contains the pending road number;
a second target address keyword determining unit, configured to determine the address keyword in the subordinate data set whose road name is exactly identical to the pending road name and which contains the pending road number as the target address keyword that successfully matches the pending address keyword.
18. The apparatus of claim 15, characterized in that the contract area information corresponding to the contract area level is the pending point-of-interest information in the pending address, and the address keywords in the subordinate data set include point-of-interest information in one-to-one correspondence with each subordinate site identifier; the target address keyword lookup unit then comprises:
a similarity calculating unit, configured to calculate the similarity between the pending point-of-interest information and the point-of-interest information in the subordinate data set;
a third target address keyword determining unit, configured to determine the point-of-interest information in the subordinate data set that has the highest similarity to the pending point-of-interest information as the target address keyword.
19. The device of claim 18, characterized in that the similarity calculating unit comprises:
An edit distance calculating unit, configured to calculate the edit distance between the pending POI information and the POI information; wherein the edit distance is the minimum number of edit operations needed to convert the pending POI information into the POI information, and the edit operations are replacing one character with another character, inserting a character, and deleting a character;
A common substring calculating unit, configured to calculate the number of common substrings of the pending POI information and the POI information; wherein, among the characters that are identical in the pending POI information and the POI information, two or more adjacent characters form one common substring;
A computing unit, configured to calculate, by a preset formula, the similarity between the pending POI information and the POI information from the edit distance and the number of common substrings.
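A minimal sketch of the similarity calculation in claims 18 and 19 is given below. The claims do not disclose the preset formula, so the formula in `similarity` is a hypothetical stand-in chosen only so that identical strings score 1.0. Counting common substrings through the matching blocks of `difflib.SequenceMatcher` is likewise one possible reading of "two or more adjacent identical characters", not the patent's stated procedure.

```python
from difflib import SequenceMatcher
from typing import Dict, Tuple


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum substitutions, insertions, deletions."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or keep
        prev = curr
    return prev[-1]


def common_substring_count(a: str, b: str) -> int:
    """Count matching blocks of at least two adjacent characters."""
    blocks = SequenceMatcher(None, a, b, autojunk=False).get_matching_blocks()
    return sum(1 for block in blocks if block.size >= 2)


def similarity(a: str, b: str) -> float:
    """Hypothetical formula: more common substrings and fewer edits score higher."""
    d = edit_distance(a, b)
    n = common_substring_count(a, b)
    return (1 + n) / (1 + n + d)


def best_match(pending_poi: str, pois: Dict[str, str]) -> Tuple[str, str]:
    """Return (POI keyword, subordinate site ID) with the highest similarity.

    `pois` maps each POI keyword to its subordinate site ID (illustrative).
    """
    poi = max(pois, key=lambda candidate: similarity(pending_poi, candidate))
    return poi, pois[poi]
```

For instance, `best_match("West Lake Plza", {"West Lake Plaza": "site-A", "East Lake Tower": "site-B"})` selects the first entry, since it needs far fewer edits and shares more multi-character runs with the misspelled input.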
20. The device of any one of claims 14-19, characterized by further comprising:
An updating unit, configured to receive a subject identifier and a latest data set corresponding to the subject identifier; determine, in the preset database, the existing data set corresponding to the subject identifier; and update the existing data set with the latest data set.
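The updating unit of claim 20 might look like the sketch below. The claim does not say whether "update" means replacing or merging, so full replacement is assumed here. The in-memory dictionary and the names `preset_db` and `update_data_set` are illustrative only.

```python
from typing import Dict

# Hypothetical shape: a data set maps site ID -> address keyword.
DataSet = Dict[str, str]

# Hypothetical in-memory preset database: subject ID -> data set.
preset_db: Dict[str, DataSet] = {
    "carrier-X": {"HZ": "Hangzhou", "NB": "Ningbo"},
}


def update_data_set(db: Dict[str, DataSet], subject_id: str,
                    latest: DataSet) -> DataSet:
    """Replace the existing data set of `subject_id` with `latest`.

    Returns the data set that was replaced (empty if none existed).
    Assumption: "update" is treated as a full replacement, not a merge.
    """
    existing = db.get(subject_id, {})
    db[subject_id] = dict(latest)  # copy so later caller edits do not leak in
    return existing
```

Keying the database by subject identifier means one subject's refresh never touches another subject's data set.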
21. A network data processing device, characterized by comprising:
An acquiring unit, configured to obtain the pending address of an article at the current site;
A constructing unit, configured to build pending data information from the pending address, the current site level, the current site identifier and the current subject identifier; wherein the current subject identifier is the identifier of the subject to which the current site belongs;
A second sending unit, configured to send the pending data information to a server;
Wherein the pending data information is used by the server to: receive the pending data information sent by the current site terminal; determine the administrative division information in the pending address that corresponds to the site level subordinate to the current site level as the pending address keyword, wherein the pending address comprises multiple items of administrative division information, each corresponding to one site level; determine, in the current data set corresponding to the current subject identifier in the preset database, the subordinate data set corresponding to the subordinate site identifiers of the current site identifier, wherein the preset database comprises multiple subject identifiers and a data set corresponding to each subject identifier, each data set comprises the site identifiers of every level and the address keyword corresponding to each site identifier, and the subordinate data set comprises several address keywords and the subordinate site identifiers in one-to-one correspondence with them; and, if a target address keyword that successfully matches the pending address keyword is found in the subordinate data set, send the target subordinate site corresponding to the target address keyword to the current site terminal, wherein the target subordinate site is the subordinate site in one-to-one correspondence with the target address keyword in the subordinate data set.
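On the terminal side, the constructing unit of claim 21 amounts to bundling four fields into one message. The sketch below assumes a JSON encoding and invents the field names. The patent specifies neither a wire format nor a transport, so the actual sending step is omitted.

```python
import json
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class PendingDataInfo:
    """The four fields named in claim 21; the field names are hypothetical."""
    pending_address: str
    current_site_level: int
    current_site_id: str
    current_subject_id: str


def build_payload(address: str, level: int, site_id: str,
                  subject_id: str) -> str:
    """Build the pending data information and serialize it as JSON."""
    info = PendingDataInfo(address.strip(), level, site_id, subject_id)
    # ensure_ascii=False keeps non-ASCII address text readable in the payload.
    return json.dumps(asdict(info), ensure_ascii=False)
```

The resulting string would then be handed to whatever channel connects the site terminal to the server.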
22. The device of claim 21, characterized in that
The acquiring unit is specifically configured to receive the pending address entered by a user through an input device; or to receive the pending address sent by a scanning device, wherein the scanning device obtains the pending address by scanning a two-dimensional code or a barcode.
23. The device of claim 21, characterized by further comprising:
A second receiving unit, configured to receive the target subordinate site, corresponding to the target address keyword, that is sent by the server.
24. The device of claim 21, characterized by further comprising:
A data sending unit, configured to send to the server the current subject identifier and the latest data set corresponding to the current subject identifier.
25. A network data processing system, characterized by comprising:
A server and several site terminals connected to the server; any one of the several site terminals is the current site terminal;
The current site terminal is configured to obtain the pending address of an article at the current site; build pending data information from the pending address, the current site level, the current site identifier and the current subject identifier, wherein the current subject identifier is the identifier of the subject to which the current site belongs; and send the pending data information to the server;
The server is configured to receive the pending data information sent by the current site terminal; determine the administrative division information in the pending address that corresponds to the site level subordinate to the current site level as the pending address keyword, wherein the pending address comprises multiple items of administrative division information, each corresponding to one site level; determine, in the current data set corresponding to the current subject identifier in the preset database, the subordinate data set corresponding to the subordinate site identifiers of the current site identifier, wherein the preset database comprises multiple subject identifiers and a data set corresponding to each subject identifier, each data set comprises the site identifiers of every level and the address keyword corresponding to each site identifier, and the subordinate data set comprises several address keywords and the subordinate site identifiers in one-to-one correspondence with them; and, if a target address keyword that successfully matches the pending address keyword is found in the subordinate data set, send the target subordinate site corresponding to the target address keyword to the current site terminal, wherein the target subordinate site is the subordinate site in one-to-one correspondence with the target address keyword in the subordinate data set.
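The server-side flow of claim 25 can be sketched as below, under several stated assumptions: the pending address arrives already split into administrative divisions ordered from the highest level down, site levels are consecutive integers starting at 1, and each site record stores its parent site. The database contents and the names `PRESET_DB` and `route` are hypothetical.

```python
from typing import Any, Dict, Optional

# Hypothetical preset database:
# subject ID -> site ID -> {"level", "parent", "keyword"}.
PRESET_DB: Dict[str, Dict[str, Dict[str, Any]]] = {
    "carrier-X": {
        "ZJ": {"level": 1, "parent": None, "keyword": "Zhejiang"},
        "HZ": {"level": 2, "parent": "ZJ", "keyword": "Hangzhou"},
        "NB": {"level": 2, "parent": "ZJ", "keyword": "Ningbo"},
        "XH": {"level": 3, "parent": "HZ", "keyword": "Xihu"},
    }
}


def route(info: Dict[str, Any],
          db: Dict[str, Dict[str, Dict[str, Any]]] = PRESET_DB) -> Optional[str]:
    """Return the target subordinate site ID, or None if nothing matches."""
    divisions = info["pending_address"]  # e.g. ["Zhejiang", "Hangzhou", "Xihu"]
    next_level = info["current_site_level"] + 1
    if next_level > len(divisions):
        return None  # the address has no division for the subordinate level
    # The division for the subordinate level is the pending address keyword.
    pending_keyword = divisions[next_level - 1]
    # Current data set: all sites of the subject the current site belongs to.
    current_data_set = db.get(info["current_subject_id"], {})
    # Subordinate data set: keyword -> site ID for children of the current site.
    subordinate = {
        site["keyword"]: site_id
        for site_id, site in current_data_set.items()
        if site["parent"] == info["current_site_id"]
    }
    return subordinate.get(pending_keyword)
```

Restricting the lookup to the children of the current site keeps the candidate set small and prevents a keyword belonging to another branch, or another subject, from matching by accident.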
CN201510623228.0A 2015-09-25 2015-09-25 Network data processing method, apparatus and system Pending CN106557896A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510623228.0A CN106557896A (en) 2015-09-25 2015-09-25 Network data processing method, apparatus and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510623228.0A CN106557896A (en) 2015-09-25 2015-09-25 Network data processing method, apparatus and system

Publications (1)

Publication Number Publication Date
CN106557896A true CN106557896A (en) 2017-04-05

Family

ID=58416319

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510623228.0A Pending CN106557896A (en) 2015-09-25 2015-09-25 Network data processing method, apparatus and system

Country Status (1)

Country Link
CN (1) CN106557896A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112364114A (en) * 2020-11-16 2021-02-12 深圳壹账通智能科技有限公司 Address standardization method and device, computer equipment and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030085276A1 (en) * 2001-11-07 2003-05-08 Hitachi, Ltd. Distribution management method and system
CN101030274A (en) * 2007-02-14 2007-09-05 河南万和科技有限公司 Transporting commodities-circulation information management
CN101101647A (en) * 2007-06-08 2008-01-09 刘礼维 Logistic combined transportation network data processing system and its data processing method
CN102314645A (en) * 2011-09-26 2012-01-11 深圳市络道科技有限公司 Address matching method and system
CN102799972A (en) * 2012-04-26 2012-11-28 杭州新锐信息技术有限公司 Physical distribution consignment supervisory system and supervisory method thereof

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20180328

Address after: Fourth floor, Capital Building, P.O. Box 847, Grand Cayman, Cayman Islands

Applicant after: CAINIAO SMART LOGISTICS HOLDING Ltd.

Address before: Fourth floor, Capital Building, P.O. Box 847, Grand Cayman, Cayman Islands

Applicant before: ALIBABA GROUP HOLDING Ltd.

RJ01 Rejection of invention patent application after publication

Application publication date: 20170405