CN105577830A - Optimal selection method and system for downloading list based on statistics - Google Patents

Publication number: CN105577830A
Authority: CN (China)
Prior art keywords: download, user, resource, list, downloading
Legal status: Pending (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Application number: CN201610072896.3A
Other languages: Chinese (zh)
Inventor: 李海兴
Current Assignee: MAINBO EDUCATION TECHNOLOGY Co Ltd (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Original Assignee: MAINBO EDUCATION TECHNOLOGY Co Ltd
Priority date: 2016-02-02 (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date: 2016-02-02
Publication date: 2016-05-11
Application filed by MAINBO EDUCATION TECHNOLOGY Co Ltd
Priority to CN201610072896.3A
Publication of CN105577830A

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00 Network arrangements or protocols for supporting network services or applications
    • H04L67/01 Protocols
    • H04L67/10 Protocols in which an application is distributed across nodes in the network
    • H04L67/1001 Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers
    • H04L67/1004 Server selection for load balancing
    • H04L67/1023 Server selection for load balancing based on a hash applied to IP addresses or costs
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G06F16/2458 Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2462 Approximate or statistical queries
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00 Network arrangements or protocols for supporting network services or applications
    • H04L67/01 Protocols
    • H04L67/06 Protocols specially adapted for file transfer, e.g. file transfer protocol [FTP]

Abstract

The invention relates to a statistics-based optimal selection method and system for download lists, and belongs to the field of network data transmission. The invention provides a scheme for optimizing resource downloading in the Internet environment. The method comprises the following steps: recording the relevant information of a user's resource download and storing it in a database as the basic data for statistical analysis; and, during subsequent resource downloads by users, performing statistical analysis on the records stored in the database and calculating an optimal download list for the user to download from. The method can quickly match the optimal download path, increase the utilization of network resources, effectively improve resource download speed, and shorten resource download time.

Description

Statistics-based optimal selection method and system for download lists
Technical field
The present invention relates to a statistics-based optimal selection method and system for download lists. It belongs to the field of network data transmission and is mainly intended for information systems that integrate network resource downloading.
Background
With the advance of computer technology and the network era, network resource downloading has come into wide use in work and daily life. At the same time, the volume of network resources is growing rapidly. How to download and transmit these resources quickly and efficiently across so vast a network, so that users can obtain them faster and more conveniently, has become a major challenge in the current era of resource explosion.
Common techniques at present include providing multiple resource servers, multiple download addresses, and the like. For example, Chinese invention patent application No. 201180029787.9 discloses a system for automated resource allocation that comprises: a client computer provided with a network interface; a plurality of resource server systems, each provided with a network interface; and an operations server system provided with a network interface. The operations server system can be configured, for an activity to be carried out, to obtain client parameters from the client computer, obtain resource parameters from the plurality of resource server systems, obtain operating parameters related to the activity, and allocate the carrying out of the activity among the plurality of resource server systems according to at least some of the client parameters, the resource parameters, and the operating parameters.
Such techniques tend to cause problems such as uneven resource allocation and slow download speeds, and cannot effectively help users obtain resources quickly.
Summary of the invention
To solve the above problems, the present invention uses big-data statistics to obtain a dynamic resource list and improve users' resource download speed.
Specifically, the invention provides a statistics-based optimal selection method for download lists, the method comprising the following steps:
Step 1: record the relevant information of a user's resource download and store it in a database;
Step 2: during subsequent resource downloads by users, perform statistical analysis on the records stored in the database;
Step 3: calculate an optimal download list for the user to download from.
Further, in the method described above, step 1 is specifically: when a resource is downloaded by a user for the first time, a default download list is returned; after the user has downloaded the resource, a server interface is called to store the start time, end time, and download speed of the download task, the user's network information, and the resource server's network information in the database.
Further, in the method described above, step 2 is specifically: statistics are computed from the user network information recorded in the database and the corresponding resource server network information, and the results are grouped and stored in the database to provide the basic data for compiling the download list.
Further, in the method described above, step 3 is specifically: when a user issues a resource download request, the server, on receiving the request, uses the client network information of the current request to query the database for user download records with similar network information. If such records exist, the download records are sorted, the records with higher speed and stability are selected to form a list, and the list is returned to the current user for downloading. If none exist, the resource download records of other users are queried and sorted by resource transmission speed, and the sorted list is returned to the current user for downloading.
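The two branches of step 3 can be sketched in Python as follows. This is only an illustration under stated assumptions, not the applicant's implementation: the `Record` fields, the use of an exact match on a network label as the test for "similar network information", and the use of the success flag as a stand-in for "stability" are all hypothetical choices that the source does not specify.

```python
from dataclasses import dataclass


@dataclass
class Record:
    client_network: str  # hypothetical label, e.g. the ISP or subnet of the user
    address: str         # resource address that was used for the download
    speed: float         # observed transfer speed (units are illustrative)
    succeeded: bool      # whether the download completed


def build_download_list(records, client_network):
    """Return resource addresses, best first, for the requesting client."""
    # Assumption: "similar network information" means an identical label.
    similar = [r for r in records if r.client_network == client_network]
    if similar:
        # Records from a similar network exist: rank successful ones first
        # (a stand-in for "stability"), then by speed.
        ranked = sorted(similar, key=lambda r: (r.succeeded, r.speed), reverse=True)
    else:
        # No similar records: fall back to all users' records, by speed only.
        ranked = sorted(records, key=lambda r: r.speed, reverse=True)
    addresses = []
    for r in ranked:
        if r.address not in addresses:  # keep the first (best) occurrence
            addresses.append(r.address)
    return addresses
```

A real system would presumably use a looser similarity test (for example, same carrier or region) and a richer stability measure; the sketch only shows the control flow of the lookup and the fallback.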
Further, in the method described above, sorting the download records, selecting the records with higher speed and stability to form a list, and returning the list to the current user for downloading is specifically done as follows:
Among the records found, a download priority value is calculated for the resource. For each record, the weight R is
R = F(success probability, download speed, download time, current time)
  = success probability * 60% + download speed * 30% + 1 / (current time - download time) * 10%
where: success probability = number of successful downloads over this protocol / total number of downloads over this protocol
download speed = resource size / resource download duration
The larger the R value, the higher the download priority;
The corresponding resource addresses are selected in descending order of R to form the resource download list returned to the user.
In addition, the present invention provides a statistics-based optimal selection system for download lists, the system comprising the following modules:
a recording module, for recording the relevant information of a user's resource download and storing it in a database;
a statistical analysis module, for performing statistical analysis on the records stored in the database during subsequent resource downloads by users;
a computing module, for calculating an optimal download list for the user to download from.
Further, in the system described above, the recording module records the relevant information as follows: when a resource is downloaded by a user for the first time, a default download list is returned; after the user has downloaded the resource, a server interface is called to store the start time, end time, and download speed of the download task, the user's network information, and the resource server's network information in the database.
Further, in the system described above, the statistical analysis module computes statistics from the user network information recorded in the database and the corresponding resource server network information, and groups and stores the results in the database to provide the basic data for compiling the download list.
Further, in the system described above, the computing module works as follows: when a user issues a resource download request, the server, on receiving the request, uses the client network information of the current request to query the database for user download records with similar network information. If such records exist, the download records are sorted, the records with higher speed and stability are selected to form a list, and the list is returned to the current user for downloading. If none exist, the resource download records of other users are queried and sorted by resource transmission speed, and the sorted list is returned to the current user for downloading.
Further, in the system described above, the computing module sorts the download records, selects the records with higher speed and stability to form a list, and returns the list to the current user for downloading as follows:
Among the records found, a download priority value is calculated for the resource. For each record, the weight R is
R = F(success probability, download speed, download time, current time)
  = success probability * 60% + download speed * 30% + 1 / (current time - download time) * 10%
where: success probability = number of successful downloads over this protocol / total number of downloads over this protocol
download speed = resource size / resource download duration
The larger the R value, the higher the download priority;
The corresponding resource addresses are selected in descending order of R to form the resource download list returned to the user.
The design proposed by the invention can provide the users of an information system with a simple and effective download list and improve the speed at which users obtain resources.
Brief description of the drawings
Fig. 1 is a flow chart of the statistics-based optimal selection method and system for download lists of the present invention.
Fig. 2 is a structural diagram of the statistics-based optimal selection system for download lists of the present invention.
Embodiments
The present invention is described in detail below with reference to the drawings and embodiments.
Through big-data statistics, the present invention obtains a dynamic resource list and improves users' resource download speed. For example, suppose multiple users have downloaded resource A, which is stored on server 1, server 2, and server 3. When user B needs to download resource A, the system sorts the resource addresses on each server according to the recorded data of every user who has downloaded resource A, and preferentially returns the addresses with faster download speeds to user B.
As shown in Fig. 1, the statistics-based optimal selection method for download lists provided by the present invention comprises the following steps:
Step 1: when a resource is downloaded by a user for the first time, a default download list is returned. After the user has downloaded the resource, a server interface is called to store the start time, end time, and download speed of the download task, the user's network information, and the resource server's network information in the database.
In this process, a logical structure is established for storing resource download information. This structure must be able to reflect the relevant information of a resource download, for example: the network information of the client and of the resource server used for the download, the download list information of each resource, and so on.
Step 2: after a resource has been downloaded, each item of information from the download process is stored in the database, including success information and the process of retrying failed downloads. Records are kept separately for the different download protocols and written to the database. That is, each DownloadDetail is recorded in the database as one or more records.
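A minimal sketch of what such a storage structure might look like is given below, using Python's standard `sqlite3` module. Only the name `DownloadDetail` and the listed kinds of information (start time, end time, speed, user and server network information, protocol, success or failure) come from the text; the field names, the table name `download_detail`, and the column types are assumptions made for illustration.

```python
import sqlite3
from dataclasses import dataclass


@dataclass
class DownloadDetail:
    resource_id: str      # hypothetical identifier of the downloaded resource
    protocol: str         # e.g. "ftp", "http", or "cloud"
    address: str          # resource address that was tried
    client_network: str   # user network information
    server_network: str   # resource server network information
    start_time: float     # seconds since the epoch
    end_time: float
    speed: float
    succeeded: bool       # failed attempts are recorded too


SCHEMA = """CREATE TABLE IF NOT EXISTS download_detail (
    resource_id TEXT, protocol TEXT, address TEXT,
    client_network TEXT, server_network TEXT,
    start_time REAL, end_time REAL, speed REAL, succeeded INTEGER)"""


def save_detail(conn, d):
    """Write one attempt (successful or not) as one row."""
    conn.execute(SCHEMA)
    conn.execute(
        "INSERT INTO download_detail VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?)",
        (d.resource_id, d.protocol, d.address, d.client_network,
         d.server_network, d.start_time, d.end_time, d.speed, int(d.succeeded)),
    )
    conn.commit()
```

Storing one row per attempt, rather than one row per finished download, is what allows the per-protocol success probability used later to be computed from the same table.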
Step 3: when other users download the resource, the download records in the statistics database are queried, and the resource download list that is optimal for the current user is returned.
In the present invention, N denotes the records of the most recent n downloads of a resource. In practice, the N records are found as follows:
Search backward from the most recent record. For each of the 3 download protocols (FTP, HTTP, cloud storage), find at least 1 record and at most 5, with the total number of records searched not exceeding 30.
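One possible reading of this search rule is sketched below in Python. The source does not say what happens when a protocol has no record among the 30 most recent ones, so the sketch treats "at least 1 per protocol" as a goal rather than a guarantee and simply stops at the cap; the function name, the constants' names, and the `(protocol, payload)` record shape are hypothetical.

```python
from collections import defaultdict

PROTOCOLS = ("ftp", "http", "cloud")  # the three protocols named in the text
MAX_PER_PROTOCOL = 5                  # keep at most 5 records per protocol
MAX_SCANNED = 30                      # never look at more than 30 records


def select_recent(records):
    """records: list of (protocol, payload) tuples ordered oldest to newest.

    Returns a dict mapping each protocol seen to its kept payloads,
    newest first.
    """
    kept = defaultdict(list)
    # Walk backward from the most recent record, within the scan cap.
    for protocol, payload in reversed(records[-MAX_SCANNED:]):
        if protocol in PROTOCOLS and len(kept[protocol]) < MAX_PER_PROTOCOL:
            kept[protocol].append(payload)
    return dict(kept)
```

For example, with ten FTP records followed by one HTTP record, the function keeps the single HTTP record and the five newest FTP records.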
The reason for including different download protocols is as follows. For a single transfer of one static file, it is generally hard to compare the efficiency of the three download protocols. FTP is usually considered faster for a single binary file, while HTTP and cloud storage have the advantage when multiple files are transferred. Many factors affect download speed, so the choice of protocol can be made with reference to the download statistics of the resource in question.
For FTP downloads, network administrators sometimes assign specific ports to FTP, and at times these ports are temporarily closed for reasons such as security. The information on failed downloads should therefore also be taken into account. For example, an address may be very fast when a download succeeds, but if its probability of failure is high for various reasons (such as a blocked port), its weight at download time will not be high either.
Among the records found, a download priority value is calculated for the resource. For each record, the weight R is
R = F(success probability, download speed, download time, current time)
  = success probability * 60% + download speed * 30% + 1 / (current time - download time) * 10%
where: success probability = number of successful downloads over this protocol / total number of downloads over this protocol
download speed = resource size / resource download duration
The larger the R value, the higher the download priority.
The corresponding resource addresses are selected in descending order of R to form the resource download list returned to the user.
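The weight formula and the descending sort can be written out as a short Python sketch. It follows the formula literally; the source gives no units or normalization for the speed and time terms, so the units used here (megabytes, seconds) and the function and parameter names are assumptions for illustration only.

```python
def weight(success_count, total_count, size, duration, downloaded_at, now):
    """Weight R of one record, following the formula in the text literally."""
    success_probability = success_count / total_count
    download_speed = size / duration
    # Assumes now > downloaded_at; otherwise the recency term is undefined.
    recency = 1.0 / (now - downloaded_at)
    return success_probability * 0.6 + download_speed * 0.3 + recency * 0.1


def rank_addresses(records, now):
    """records: iterable of
    (address, success_count, total_count, size, duration, downloaded_at).

    Returns the addresses in descending order of R.
    """
    scored = [(weight(*rest, now), address) for address, *rest in records]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [address for _, address in scored]
```

As a worked case, a record with 3 successes out of 4, 10 MB downloaded in 5 s, and an age of 10 s gives R = 0.75 * 0.6 + 2.0 * 0.3 + 0.1 * 0.1 = 1.06. Because the speed term is unbounded while the success probability is at most 1, a fast but unreliable address can still outrank a reliable one unless the speed is scaled to a comparable range; the source does not say whether such scaling is intended.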
On receiving the resource download list, the user downloads the resource from the first address in the list. If that download fails, the second address is tried, and so on, until an address is found from which the resource can be downloaded successfully. If none of the addresses in the list can be downloaded successfully, a server interface is called to report to the server that the resource cannot be downloaded. The server can then use these records to analyze the availability of the resource and maintain it.
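The client-side behavior just described amounts to a try-in-order loop with a final report, which might be sketched in Python as below. The callables `fetch` and `report_unavailable` are hypothetical stand-ins for the actual transfer code and the server interface, and treating `OSError` as the failure signal is likewise an assumption.

```python
def download_with_fallback(addresses, fetch, report_unavailable):
    """Try each address in list order and return (address, data) on success.

    fetch(address) is assumed to return the data or raise OSError on failure.
    report_unavailable(addresses) stands in for the server interface that is
    called when every address in the list has failed.
    """
    for address in addresses:
        try:
            return address, fetch(address)
        except OSError:
            continue  # this address failed; move on to the next one
    # Every address failed: tell the server the resource cannot be downloaded.
    report_unavailable(list(addresses))
    return None
```

The sketch covers only the failure report described in this paragraph; a fuller client would also report each successful download back to the server so that new records keep entering the statistics.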
Correspondingly, as shown in Fig. 2, the present invention also provides a statistics-based optimal selection system for download lists, the system comprising the following modules:
a recording module 1, for recording the relevant information of a user's resource download and storing it in a database;
a statistical analysis module 2, for performing statistical analysis on the records stored in the database during subsequent resource downloads by users;
a computing module 3, for calculating an optimal download list for the user to download from.
Each module implements the corresponding one of the three steps of the method.
In summary, the gist of the invention lies in two key points. First, the resource download addresses are derived statistically from download records. Second, whether or not a resource download succeeds, the relevant information from the download process must be reported to the server.
Obviously, those skilled in the art can make various changes and modifications to the present invention without departing from its spirit and scope. If such changes and modifications fall within the scope of the claims of the present invention and their technical equivalents, the present invention is intended to include them as well.

Claims (10)

1. A statistics-based optimal selection method for download lists, characterized in that the method comprises the following steps:
Step 1: record the relevant information of a user's resource download and store it in a database;
Step 2: during subsequent resource downloads by users, perform statistical analysis on the records stored in the database;
Step 3: calculate an optimal download list for the user to download from.
2. The statistics-based optimal selection method for download lists as claimed in claim 1, characterized in that:
step 1 is specifically: when a resource is downloaded by a user for the first time, a default download list is returned; after the user has downloaded the resource, a server interface is called to store the start time, end time, and download speed of the download task, the user's network information, and the resource server's network information in the database.
3. The statistics-based optimal selection method for download lists as claimed in claim 1, characterized in that:
step 2 is specifically: statistics are computed from the user network information recorded in the database and the corresponding resource server network information, and the results are grouped and stored in the database to provide the basic data for compiling the download list.
4. The statistics-based optimal selection method for download lists as claimed in claim 1, characterized in that:
step 3 is specifically: when a user issues a resource download request, the server, on receiving the request, uses the client network information of the current request to query the database for user download records with similar network information; if such records exist, the download records are sorted, the records with higher speed and stability are selected to form a list, and the list is returned to the current user for downloading; if none exist, the resource download records of other users are queried and sorted by resource transmission speed, and the sorted list is returned to the current user for downloading.
5. The statistics-based optimal selection method for download lists as claimed in claim 4, characterized in that:
sorting the download records, selecting the records with higher speed and stability to form a list, and returning the list to the current user for downloading is specifically done as follows:
Among the records found, a download priority value is calculated for the resource. For each record, the weight R is
R = F(success probability, download speed, download time, current time)
  = success probability * 60% + download speed * 30% + 1 / (current time - download time) * 10%
where: success probability = number of successful downloads over this protocol / total number of downloads over this protocol
download speed = resource size / resource download duration
The larger the R value, the higher the download priority;
The corresponding resource addresses are selected in descending order of R to form the resource download list returned to the user.
6. A statistics-based optimal selection system for download lists, characterized in that the system comprises the following modules:
a recording module, for recording the relevant information of a user's resource download and storing it in a database;
a statistical analysis module, for performing statistical analysis on the records stored in the database during subsequent resource downloads by users;
a computing module, for calculating an optimal download list for the user to download from.
7. The statistics-based optimal selection system for download lists as claimed in claim 6, characterized in that:
the recording module records the relevant information as follows: when a resource is downloaded by a user for the first time, a default download list is returned; after the user has downloaded the resource, a server interface is called to store the start time, end time, and download speed of the download task, the user's network information, and the resource server's network information in the database.
8. The statistics-based optimal selection system for download lists as claimed in claim 6, characterized in that:
the statistical analysis module computes statistics from the user network information recorded in the database and the corresponding resource server network information, and groups and stores the results in the database to provide the basic data for compiling the download list.
9. The statistics-based optimal selection system for download lists as claimed in claim 6, characterized in that:
the computing module works as follows: when a user issues a resource download request, the server, on receiving the request, uses the client network information of the current request to query the database for user download records with similar network information; if such records exist, the download records are sorted, the records with higher speed and stability are selected to form a list, and the list is returned to the current user for downloading; if none exist, the resource download records of other users are queried and sorted by resource transmission speed, and the sorted list is returned to the current user for downloading.
10. The statistics-based optimal selection system for download lists as claimed in claim 9, characterized in that:
the computing module sorts the download records, selects the records with higher speed and stability to form a list, and returns the list to the current user for downloading as follows:
Among the records found, a download priority value is calculated for the resource. For each record, the weight R is
R = F(success probability, download speed, download time, current time)
  = success probability * 60% + download speed * 30% + 1 / (current time - download time) * 10%
where: success probability = number of successful downloads over this protocol / total number of downloads over this protocol
download speed = resource size / resource download duration
The larger the R value, the higher the download priority;
The corresponding resource addresses are selected in descending order of R to form the resource download list returned to the user.

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610072896.3A CN105577830A (en) 2016-02-02 2016-02-02 Optimal selection method and system for downloading list based on statistics

Publications (1)

Publication Number Publication Date
CN105577830A true CN105577830A (en) 2016-05-11

Family

ID=55887474

Country Status (1)

Country Link
CN (1) CN105577830A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106850778A (en) * 2017-01-17 2017-06-13 无锡清华信息科学与技术国家实验室物联网技术中心 A kind of multi-source download performance optimization method and device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101247405A (en) * 2008-03-27 2008-08-20 深圳市迅雷网络技术有限公司 Method, system and device for calculating download time and resource downloading
US20080201488A1 (en) * 1997-06-18 2008-08-21 Brian Kenner System and method for server-side optimization of data delivery on a distributed computer network
CN102685075A (en) * 2011-03-15 2012-09-19 腾讯科技(深圳)有限公司 Network transmission system, server and client
CN102855238A (en) * 2011-06-28 2013-01-02 腾讯科技(深圳)有限公司 Method and system for downloading resource data

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20160511