CN105760504A - Resource retrieval method based on browser - Google Patents

Resource retrieval method based on browser Download PDF

Info

Publication number
CN105760504A
CN105760504A CN201610097763.1A CN201610097763A CN105760504A CN 105760504 A CN105760504 A CN 105760504A CN 201610097763 A CN201610097763 A CN 201610097763A CN 105760504 A CN105760504 A CN 105760504A
Authority
CN
China
Prior art keywords
browser
method based
resource
retrieval method
farther includes
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610097763.1A
Other languages
Chinese (zh)
Inventor
鲁志军
于晓滨
张俊冯
廖雯
徐佳男
鲍淼
任俊强
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Unionpay Co Ltd
Original Assignee
China Unionpay Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Unionpay Co Ltd filed Critical China Unionpay Co Ltd
Priority to CN201610097763.1A priority Critical patent/CN105760504A/en
Publication of CN105760504A publication Critical patent/CN105760504A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques

Abstract

The invention provides a resource retrieval method based on a browser.The method includes the steps that a keyword input by a user is received in a search box in the browser; the keyword is analyzed, and data and/or an application program in a resource pool are retrieved based on preset correlation and a matching algorithm; retrieval results are displayed in the browser so that the user can check and select the retrieval results, wherein the retrieval results include data retrieval results and program retrieval results, and the program retrieval results are presented in the form of an application box.An application program source and a data resource can be efficiently and accurately retrieved through the disclosed resource retrieval method based on the browser.

Description

Resource retrieval method based on browser
Technical field
The present invention relates to resource retrieval method, more particularly, to the resource retrieval method based on browser.
Background technology
At present, along with the becoming increasingly abundant of class of business of the increasingly extensive and different field of cyber-net application, become more and more important for target data and application resource are carried out precise search.
In existing technical scheme, generally it is only capable of the key word according to user's input and carries out the lookup of data resource, and for application program, the linking inlet ports only by application program carrys out artificial selection.
Therefore, there are the following problems for above-mentioned existing technical scheme: is difficult to realize efficient, the precise search of application programs or data resource.
Accordingly, there exist following demand: providing can efficiently and correspond exactly to carry out, with program resource and data resource, the resource retrieval method based on browser retrieved.
Summary of the invention
In order to solve the problem existing for above-mentioned prior art, the present invention proposes can efficiently and correspond exactly to carry out, with program resource and data resource, the resource retrieval method based on browser retrieved.
It is an object of the invention to be achieved through the following technical solutions:
A kind of resource retrieval method based on browser, the described resource retrieval method based on browser comprises the following steps:
(A1) search box from browser receives the key word of user's input;
(A2) analyze described key word, and retrieve the data in resource pool and/or application program based on predetermined dependency and matching algorithm;
(A3) showing in described browser that retrieval result is checked for user and selects, wherein, described retrieval result includes data research result and application program retrieval result, and described application program retrieval result presents with the form applying frame.
In scheme disclosed herein above, it is preferable that described step (A3) farther includes: user directly uses the application frame of selected application program to perform the business function that this application program provides.
In scheme disclosed herein above, it is preferable that described application frame is embedded light application framework.
In scheme disclosed herein above, it is preferable that described step (A2) farther includes: based on one or more in following manner, described key word is analyzed: semantic analysis, behavior analysis, privilege analysis and the analysis based on man-machine interaction.
In scheme disclosed herein above, preferably, described resource pool is built as follows: be uniformly accessed into specification according to predetermined data source and gather data from target data source, be uniformly accessed into the specification one or more application programs of access according to predetermined application program, gather target web by spiders mode.
In scheme disclosed herein above, it is preferable that described method farther includes: before user triggers retrieval, the mode by logging in verifies that user identity is to determine user right.
In scheme disclosed herein above, preferably, described step (A3) farther includes: show described retrieval result according to the value of sequence benchmark in the way of sequence, wherein, described sequence benchmark includes single field or multiple field, and when using multiple fields as sequence benchmark, the plurality of field groups is become ordering vector, and Sort Priority from left to right reduces successively in ordering vector.
In scheme disclosed herein above, it is preferable that described step (A3) farther includes: described sequence benchmark farther includes " degree of association ", and namely " degree of association " is comprised in described ordering vector as a field.
In scheme disclosed herein above, it is preferable that described step (A3) farther includes: for Multiple Value Field, multiple field values are spliced into single value according to its storage order separator and participate in sequence.
In scheme disclosed herein above, it is preferable that described step (A3) farther includes: sort described retrieval result for display according to the mode that last in, first out.
In scheme disclosed herein above, it is preferable that described step (A3) farther includes: based on the degree of association of the record retrieved described in the standardization length computation of word frequency, inverse document frequency and document in result.
In scheme disclosed herein above, it is preferable that described step (A3) farther includes: based on one of the following or various ways, the degree of association calculated is weighted: (1) title weighting;(2) relative position weighting;(3) tap weights;(4) record level weighting.
In scheme disclosed herein above, it is preferable that described step (A3) farther includes: based on hit word unit vector length, the degree of association calculated is weighted, in namely recording with one, hit lexeme vector length is as the degree of association of record.
In scheme disclosed herein above, preferably, described step (A3) farther includes: for multilevel sort benchmark, the unique component being weighted by the degree of association of the multiple fields in this sequence benchmark in the integrated value correspondence ordering vector after summation is ranked up operation.
In scheme disclosed herein above, it is preferable that described step (A3) farther includes: consider the time factor of record when the degree of association calculated is weighted.
In scheme disclosed herein above, preferably, described method is capable of the TOPN pattern sequence for retrieval result, it comprises the partial ordered pattern of TOPN, and namely the result set after sequence only has top n record to be ordered into, although remaining record is also retained, but it is unordered, and TOPN cuts out sequencing model, namely the result set after sequence only retains the record that top n is orderly, and remaining record is dropped.
In scheme disclosed herein above, it is preferable that described method is capable of the Chinese sorting based on the Chinese phonetic alphabet and Chinese-character stroke for retrieval result.
Resource retrieval method based on browser disclosed in this invention has the advantage that efficiently and corresponds exactly to carry out retrieving and be ranked up to analyze with program resource and data resource.
Accompanying drawing explanation
Will be more fully understood that by those skilled in the art in conjunction with accompanying drawing, the technical characteristic of the present invention and advantage, wherein:
Fig. 1 is the flow chart of resource retrieval method based on browser according to an embodiment of the invention.
Detailed description of the invention
Fig. 1 is the flow chart of resource retrieval method based on browser according to an embodiment of the invention.As it is shown in figure 1, the resource retrieval method based on browser disclosed in this invention comprises the following steps: to receive in (A1) search box from browser the key word of user's input;(A2) analyze described key word, and retrieve the data in resource pool and/or application program based on predetermined dependency and matching algorithm;(A3) showing in described browser that retrieval result is checked for user and selects, wherein, described retrieval result includes data research result and application program retrieval result, and described application program retrieval result presents with the form applying frame.
Preferably, disclosed in this invention based in the resource retrieval method of browser, described step (A3) farther includes: user directly uses the application frame of selected application program to perform the business function that this application program provides.
Preferably, disclosed in this invention based in the resource retrieval method of browser, described application frame is embedded light application framework (i.e. the application framework that can implement basic service function in embedding browser).
Preferably, disclosed in this invention based in the resource retrieval method of browser, described step (A2) farther includes: based on one or more in following manner, described key word is analyzed: semantic analysis, behavior analysis, privilege analysis and the analysis based on man-machine interaction.
Preferably, disclosed in this invention based on, in the resource retrieval method of browser, building described resource pool as follows: be uniformly accessed into specification according to predetermined data source and gather data (namely setting up data directory) from target data source (such as data base), be uniformly accessed into the specification one or more application programs of access (namely setting up application framework index) according to predetermined application program, gathered target web (namely setting up target web index) by spiders mode.
Preferably, the resource retrieval method based on browser disclosed in this invention farther includes: before user triggers retrieval, the mode by logging in verifies that user identity is to determine user right.
Preferably, disclosed in this invention based in the resource retrieval method of browser, described step (A3) farther includes: show described retrieval result according to the value of sequence benchmark in the way of sequence, wherein, described sequence benchmark includes single field or multiple field, and when using multiple fields as sequence benchmark, the plurality of field groups is become ordering vector, and in ordering vector Sort Priority from left to right successively reduce (namely leftmost field sorts at first, and rightmost field finally sorts.Only when a field has multiple record to have identical value, just the field on the right of this field can be continued sequence).
Preferably, disclosed in this invention based in the resource retrieval method of browser, described step (A3) farther includes: described sequence benchmark farther includes " degree of association ", namely " degree of association " is comprised in described ordering vector (owing to seldom having identical degree of association between each record, so it is possible little to come the role when sequence of the field on the right of degree of association as a field.Exemplarily, the fixing employing of dependency arranges in descending order, and it is arrange by ascending order that field value then can be specified, still arrangement in descending order).
Preferably, based in the resource retrieval method of browser, described step (A3) farther includes: for Multiple Value Field disclosed in this invention, according to its storage order separator, multiple field values is spliced into single value and participates in sequence.
Alternatively, disclosed in this invention based in the resource retrieval method of browser, described step (A3) farther includes: sort described retrieval result for display according to the mode that last in, first out (LIFO).
Preferably, disclosed in this invention based in the resource retrieval method of browser, described step (A3) farther includes: based on the degree of association of the record in retrieval result described in word frequency (number of times that word occurs at document), inverse document frequency (total number of files and the ratio hitting number of files) and the standardization length computation of document.
Preferably, disclosed in this invention based in the resource retrieval method of browser, described step (A3) farther includes: based on one of the following or various ways, the degree of association calculated is weighted: (title of indication is not a field herein in (1) title weighting, but the beginning of data resource (such as a word), namely to the point of impact occurred in title, application can specify corresponding weight coefficient);(2) relative position weighting (namely considers the distance factor between the point of impact so that the point of impact coming from phrase has higher relevance weight);(3) (namely to branch's (leaf node) hit situation in expression formula, and " goodness of fit " of LIKE and INCLUDE function is weighted tap weights;(4) (namely each record imparts dependency weights in advance in record level weighting, it is stored in data base with the form of a field value together with other field data of record, and require when carrying out relevance ranking, these weights are multiplied by the degree of association as this record after the result of calculation of relevance model).
Alternatively, disclosed in this invention based in the resource retrieval method of browser, described step (A3) farther includes: based on hit word unit vector length, the degree of association calculated is weighted, namely in recording with one, hit lexeme vector length (each hit word only calculates once) is as the degree of association recorded (thus, containing the record that the number (not being the number of times of word appearance) hitting word is more many, its degree of association is more big).
Preferably, disclosed in this invention based in the resource retrieval method of browser, described step (A3) farther includes: for multilevel sort benchmark, unique (dependency) component being weighted by the degree of association of the multiple fields in this sequence benchmark in the integrated value correspondence ordering vector after summation is ranked up operation (wherein, deferrization is economized outside search field, other field to participate in relevance ranking, it is switched on " the dependency switch " of field, and the weight of this field is specified when retrieval, even if retrieval pertains only to a field, it is also necessary to specify.If but the field related to during retrieval is all not explicitly specified weight, and default search field is again one of field involved by retrieval, then only calculate the degree of association in default search field, and carry out relevance ranking.For the relevance ranking of multi-field, sometimes also it is insufficient for the demand of all application only by the method that field is weighted, is thus provided that another kind of selection, namely allows the Sort Priority of regulation field, it is achieved carry out relevance ranking respectively by field.The using method of this function is as follows: specify the parameter of " carrying out relevance ranking by field respectively " when submitting retrieval request to, simultaneously " weight " of specific field in a manner described, the sequence priority that to be merely representative of between field relative of " weight " herein, no longer there is absolute value meaning, " weight " is more big, sorts more preferential).
Preferably, disclosed in this invention based in the resource retrieval method of browser, described step (A3) farther includes: consider the time factor (As time goes on the importance of a data record can be gradually lowered) of record when the degree of association calculated is weighted.
Preferably, resource retrieval method based on browser disclosed in this invention is capable of the TOPN pattern sequence for retrieval result, it comprises the partial ordered pattern of TOPN, namely the result set after sequence only has (order specified) that top n record is ordered into, although remaining record is also retained, but it is unordered, and TOPN cuts out sequencing model, namely the result set after sequence only retains (order specified) record that (i.e. storage) top n is orderly, remaining record is dropped (wherein, the partial ordered pattern of TOPN is controlled by " the maximum order recording number " of system, namely when retrieving result record number beyond " maximum order recording number ", the partial ordered pattern of TOPN is inoperative;And when retrieving result record number less than " maximum order recording number ", ranking results only ensures (record number does not change) that top n record is ordered into.In addition, TOPN cuts out sequencing model and is divided into the following two kinds situation: the first cuts out sequence is the sequence submitted to together with search operaqtion, if now retrieval result record number is beyond " maximum order recording number ", " maximum order recording number " the individual record then only formerly retrieved is cut out and is retained the record of top n (order specified) in order, (therefore in this case remaining record does not participate in sequencer procedure, different " maximum order recording number " is possible to obtain different ranking results, different search complete order be also possible to obtain different ranking results);It is the independent sequence after search operaqtion that the second cuts out sequence, this situation the same with the partial ordered pattern of TOPN " maximum order recording number " by system controls, namely when retrieving result record number beyond " maximum order recording number ", TOPN cuts out sequencing model and will not work, and otherwise the result set after sequence only retains the orderly record of top n (record number will change)).
Preferably, the resource retrieval method based on browser disclosed in this invention is capable of the Chinese sorting based on the Chinese phonetic alphabet and Chinese-character stroke for retrieval result.
Therefore, the resource retrieval method based on browser disclosed in this invention has the advantage that can efficiently and correspond exactly to carry out retrieving and be ranked up analyzing with program resource and data resource.
Although the present invention is described by above-mentioned preferred implementation, but its way of realization is not limited to above-mentioned embodiment.It will be appreciated that when without departing from present subject matter and scope, the present invention can be made different changing and modifications by those skilled in the art.

Claims (17)

1., based on a resource retrieval method for browser, the described resource retrieval method based on browser comprises the following steps:
(A1) search box from browser receives the key word of user's input;
(A2) analyze described key word, and retrieve the data in resource pool and/or application program based on predetermined dependency and matching algorithm;
(A3) showing in described browser that retrieval result is checked for user and selects, wherein, described retrieval result includes data research result and application program retrieval result, and described application program retrieval result presents with the form applying frame.
2. the resource retrieval method based on browser according to claim 1, it is characterised in that described step (A3) farther includes: user directly uses the application frame of selected application program to perform the business function that this application program provides.
3. the resource retrieval method based on browser according to claim 2, it is characterised in that described application frame is embedded light application framework.
4. the resource retrieval method based on browser according to claim 3, it is characterized in that, described step (A2) farther includes: based on one or more in following manner, described key word is analyzed: semantic analysis, behavior analysis, privilege analysis and the analysis based on man-machine interaction.
5. the resource retrieval method based on browser according to claim 4, it is characterized in that, build described resource pool as follows: be uniformly accessed into specification according to predetermined data source and gather data from target data source, be uniformly accessed into the specification one or more application programs of access according to predetermined application program, gather target web by spiders mode.
6. the resource retrieval method based on browser according to claim 5, it is characterised in that described method farther includes: the mode by logging in verifies that user identity is to determine user right before user triggers retrieval.
7. the resource retrieval method based on browser according to claim 6, it is characterized in that, described step (A3) farther includes: show described retrieval result according to the value of sequence benchmark in the way of sequence, wherein, described sequence benchmark includes single field or multiple field, and when using multiple fields as sequence benchmark, the plurality of field groups is become ordering vector, and Sort Priority from left to right reduces successively in ordering vector.
8. the resource retrieval method based on browser according to claim 7, it is characterized in that, described step (A3) farther includes: described sequence benchmark farther includes " degree of association ", and namely " degree of association " is comprised in described ordering vector as a field.
9. multiple field values are spliced into single value according to its storage order separator and participate in sequence by the resource retrieval method based on browser according to claim 8, it is characterised in that described step (A3) farther includes: for Multiple Value Field.
10. the resource retrieval method based on browser according to claim 1, it is characterised in that described step (A3) farther includes: sort described retrieval result for display according to the mode that last in, first out.
11. the resource retrieval method based on browser according to claim 10, it is characterised in that described step (A3) farther includes: based on the degree of association of the record retrieved described in the standardization length computation of word frequency, inverse document frequency and document in result.
12. the resource retrieval method based on browser according to claim 11, it is characterised in that described step (A3) farther includes: based on one of the following or various ways, the degree of association calculated is weighted: (1) title weighting;(2) relative position weighting;(3) tap weights;(4) record level weighting.
13. the resource retrieval method based on browser according to claim 12, it is characterized in that, described step (A3) farther includes: based on hit word unit vector length, the degree of association calculated is weighted, and in namely recording with one, hit lexeme vector length is as the degree of association of record.
14. the resource retrieval method based on browser according to claim 13, it is characterized in that, described step (A3) farther includes: for multilevel sort benchmark, the unique component being weighted by the degree of association of the multiple fields in this sequence benchmark in the integrated value correspondence ordering vector after summation is ranked up operation.
15. the resource retrieval method based on browser according to claim 14, it is characterised in that described step (A3) farther includes: consider the time factor of record when the degree of association calculated is weighted.
16. the resource retrieval method based on browser according to claim 15, it is characterized in that, described method is capable of the TOPN pattern sequence for retrieval result, it comprises the partial ordered pattern of TOPN, and namely the result set after sequence only has top n record to be ordered into, although remaining record is also retained, but it is unordered, and TOPN cuts out sequencing model, namely the result set after sequence only retains the record that top n is orderly, and remaining record is dropped.
17. the resource retrieval method based on browser according to claim 16, it is characterised in that described method is capable of the Chinese sorting based on the Chinese phonetic alphabet and Chinese-character stroke for retrieval result.
CN201610097763.1A 2016-02-23 2016-02-23 Resource retrieval method based on browser Pending CN105760504A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610097763.1A CN105760504A (en) 2016-02-23 2016-02-23 Resource retrieval method based on browser

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610097763.1A CN105760504A (en) 2016-02-23 2016-02-23 Resource retrieval method based on browser

Publications (1)

Publication Number Publication Date
CN105760504A true CN105760504A (en) 2016-07-13

Family

ID=56331056

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610097763.1A Pending CN105760504A (en) 2016-02-23 2016-02-23 Resource retrieval method based on browser

Country Status (1)

Country Link
CN (1) CN105760504A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018103585A1 (en) * 2016-12-07 2018-06-14 潘岩 Method and apparatus for sorting webpage information articles
CN109766497A (en) * 2019-01-22 2019-05-17 网易(杭州)网络有限公司 Ranking list generation method and device, storage medium, electronic equipment
CN109766360A (en) * 2019-01-09 2019-05-17 北京一览群智数据科技有限责任公司 A kind of list screening method and device
CN110020227A (en) * 2017-10-31 2019-07-16 北京国双科技有限公司 A kind of data reordering method and device
CN110287288A (en) * 2019-06-18 2019-09-27 北京百度网讯科技有限公司 Recommend the method and apparatus of document
CN110674320A (en) * 2019-09-27 2020-01-10 百度在线网络技术(北京)有限公司 Retrieval method and device and electronic equipment

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101510159A (en) * 2009-03-30 2009-08-19 腾讯科技(深圳)有限公司 Application program start-up method and browser
CN101520785A (en) * 2008-02-29 2009-09-02 富士通株式会社 Information retrieval method and system therefor
CN102467502A (en) * 2010-10-29 2012-05-23 北大方正集团有限公司 Retrieval method and system
CN103186572A (en) * 2011-12-29 2013-07-03 腾讯科技(深圳)有限公司 Application program search method, mobile application platform and system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101520785A (en) * 2008-02-29 2009-09-02 富士通株式会社 Information retrieval method and system therefor
CN101510159A (en) * 2009-03-30 2009-08-19 腾讯科技(深圳)有限公司 Application program start-up method and browser
CN102467502A (en) * 2010-10-29 2012-05-23 北大方正集团有限公司 Retrieval method and system
CN103186572A (en) * 2011-12-29 2013-07-03 腾讯科技(深圳)有限公司 Application program search method, mobile application platform and system

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018103585A1 (en) * 2016-12-07 2018-06-14 潘岩 Method and apparatus for sorting webpage information articles
CN110020227A (en) * 2017-10-31 2019-07-16 北京国双科技有限公司 A kind of data reordering method and device
CN110020227B (en) * 2017-10-31 2021-10-15 北京国双科技有限公司 Data sorting method and device
CN109766360A (en) * 2019-01-09 2019-05-17 北京一览群智数据科技有限责任公司 A kind of list screening method and device
CN109766497A (en) * 2019-01-22 2019-05-17 网易(杭州)网络有限公司 Ranking list generation method and device, storage medium, electronic equipment
CN110287288A (en) * 2019-06-18 2019-09-27 北京百度网讯科技有限公司 Recommend the method and apparatus of document
CN110674320A (en) * 2019-09-27 2020-01-10 百度在线网络技术(北京)有限公司 Retrieval method and device and electronic equipment

Similar Documents

Publication Publication Date Title
CN105760504A (en) Resource retrieval method based on browser
US8108395B2 (en) Automatic arrangement of portlets on portal pages according to semantical and functional relationship
US8108398B2 (en) Auto-summary generator and filter
US7546294B2 (en) Automated relevance tuning
TWI486800B (en) System and method for search results ranking using editing distance and document information
KR101301380B1 (en) Ranking functions using a biased click distance of a document on a network
US7065521B2 (en) Method for fuzzy logic rule based multimedia information retrival with text and perceptual features
US7107191B2 (en) Modular architecture for optimizing a configuration of a computer system
CN106021374A (en) Underlay recall method and device for query result
US20070203865A1 (en) Apparatus and methods for an item retrieval system
CN104133855B (en) A kind of method and device of input method intelligent association
CN102693309A (en) Candidate phrase querying method and aided translation system for computer aided translation
CN103488475B (en) Multidimensional data analysis system and multidimensional data analysis method
US20060224359A1 (en) Method and system for optimizing configuration classification of software
CA2562779A1 (en) Data storage and retrieval
CN106599299A (en) Determining method and device of website key words
CN105653701A (en) Model generating method and device as well as word weighting method and device
CN106156114A (en) Patent retrieval method and device
US5742776A (en) Decision support system
Thang et al. An evaluation of diversification techniques
CN106227510A (en) Method and device is recommended in application
CN107239549A (en) Method, device and the terminal of database terminology retrieval
CN110162778A (en) The generation method and device of text snippet
CN106156111A (en) Patent document search method, device and system
CN107562966A (en) The optimization system and method based on intelligence learning for web page interlinkage retrieval ordering

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20160713