CN103853784B - A kind of webpage matching process of mobile terminal, device and system - Google Patents

A kind of webpage matching process of mobile terminal, device and system Download PDF

Info

Publication number
CN103853784B
CN103853784B CN201210518026.6A CN201210518026A CN103853784B CN 103853784 B CN103853784 B CN 103853784B CN 201210518026 A CN201210518026 A CN 201210518026A CN 103853784 B CN103853784 B CN 103853784B
Authority
CN
China
Prior art keywords
web page
page contents
mobile terminal
webpage
expected
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201210518026.6A
Other languages
Chinese (zh)
Other versions
CN103853784A (en
Inventor
林晓丹
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd filed Critical Tencent Technology Shenzhen Co Ltd
Priority to CN201210518026.6A priority Critical patent/CN103853784B/en
Publication of CN103853784A publication Critical patent/CN103853784A/en
Application granted granted Critical
Publication of CN103853784B publication Critical patent/CN103853784B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/957Browsing optimisation, e.g. caching or content distillation

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Transfer Between Computers (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Embodiment of the present invention proposes a kind of webpage matching process of mobile terminal, device and system.Method includes:It determines that mobile terminal it is expected the web page contents preserved, and extracts the attribute information that the expectation preserves web page contents;In the web page contents of preservation positioned at network side, the corresponding web page contents of attribute information that web page contents are preserved with the expectation are retrieved, and determine the storage attribute of the web page contents retrieved;According to the storage attribute of the web page contents retrieved, in the web page contents for it is expected to preserve described in the network side preservation.Embodiment of the present invention improves the storage efficiency of mobile terminal webpage editing and label setting efficiency.It can be applied in various terminals and cross-platform cross terminal use embodiment of the present invention, the scope of application is very extensive.

Description

A kind of webpage matching process of mobile terminal, device and system
Technical field
Embodiment of the present invention is related to technical field of information processing, more particularly, to a kind of webpage of mobile terminal Method of completing the square, device and system.
Background technology
In the current information age, various information equipments come into being:It is useful for fixed-line telephone, the movement of Tone Via Phone;It is useful for information resources share, the server and PC that handle;It is useful for the various television sets that video data is shown Etc..These equipment generate all in specific area to solve actual demand.With E-consumer, computer, communication (3C)Attention has been put into comprehensive profit is carried out to the information equipment of each different field more and more by the arrival of fusion, people In research, preferably serviced for people with making full use of existing resource equipment.
In existing information of mobile terminal acquiring technology, mainly pass through the wap website orientation information of network side, Ran Houyi The browser module that dynamic terminal is embedded by browser or application program accesses wap websites to obtain related information content.Mesh Before, network notepad plug-in unit has been had on browser of mobile terminal, the mobile terminal webpage browsed can have been cut Series protection is deposited to the notepad memory space of network side.
However, this prior art is only simply to preserve the webpage editing that mobile terminal browsed in network side, when When mobile terminal subsequently has the new webpage editing it is expected to preserve, can not awareness network side whether preserved similar webpage Editing, so as to realize webpage editing classification storage, therefore the storage efficiency of webpage editing is not high.
Moreover, this prior art can not be the content of new web page editing according to the label information for having preserved webpage editing Automatic setting label, so as to also reduce the label of webpage editing setting efficiency.
Invention content
Embodiment of the present invention proposes a kind of mobile terminal webpage matching process, so as to improve the webpage editing of mobile terminal Storage efficiency.
Embodiment of the present invention also proposes a kind of mobile terminal webpage coalignment, so as to which the webpage for improving mobile terminal is cut The storage efficiency collected.
Embodiment of the present invention also proposes a kind of mobile terminal webpage matching system, so as to which the webpage for improving mobile terminal is cut The storage efficiency collected.
The concrete scheme of embodiment of the present invention is as follows:
A kind of webpage matching process of mobile terminal, this method include:
It determines that mobile terminal it is expected the web page contents preserved, and extracts the attribute information that the expectation preserves web page contents;
In the web page contents of preservation positioned at network side, the attribute information phase that web page contents are preserved with the expectation is retrieved Corresponding web page contents, and determine the storage attribute of the web page contents retrieved;
According to the storage attribute of the web page contents retrieved, in the webpage for it is expected to preserve described in the network side preservation Hold.
A kind of webpage coalignment of mobile terminal, the device include web page contents determination unit, corresponding web page contents inspection Cable elements and web page contents storage unit, wherein:
Web page contents determination unit, the web page contents preserved for mobile terminal to be determined it is expected, and extract expectation preservation The attribute information of web page contents;
Corresponding web content retrieval unit, in the web page contents of preservation positioned at network side, retrieving and the phase It hopes the corresponding web page contents of attribute information for preserving web page contents, and determines the storage attribute of the web page contents retrieved;
Web page contents storage unit for the storage attribute of web page contents retrieved according to, is protected in the network side Deposit the web page contents for it is expected to preserve.
A kind of webpage matching system of mobile terminal, which includes mobile terminal and the webpage positioned at network side matches clothes Business device, wherein:
Mobile terminal for determining the web page contents for it is expected to preserve, and extracts the attribute letter that the expectation preserves web page contents Breath, and the attribute information of the web page contents that the expectation is preserved and the web page contents is sent to webpage match server;
Webpage match server preserves in the web page contents of preservation positioned at network side, retrieving with the expectation The corresponding web page contents of attribute information of web page contents, and determine the storage attribute of the web page contents retrieved, and according to The storage attribute of the web page contents retrieved, in the web page contents for it is expected to preserve described in the network side preservation.
It can be seen from the above technical proposal that in embodiments of the present invention, determine that mobile terminal it is expected the webpage preserved Content, and extract the attribute information that the expectation preserves web page contents;In the web page contents of preservation positioned at network side, retrieve The corresponding web page contents of attribute information of web page contents are preserved with the expectation, and determine the storage of the web page contents retrieved Attribute;According to the storage attribute of the web page contents retrieved, in the web page contents for it is expected to preserve described in the network side preservation. It can be seen that after using embodiment of the present invention, different from the webpage that simply preservation mobile terminal is browsed in the prior art Editing, but when mobile terminal has new webpage editing it is expected to preserve, whether can be preserved with retrieval network side similar Webpage editing, and according to retrieval result realize for the webpage editing classification storage, so as to webpage editing storage efficiency It is improved.
Moreover, embodiment of the present invention can be new web page editing according to similar webpage clip tag information has been preserved Content label is set automatically, so as to also improve the label of webpage editing setting efficiency.
Furthermore it is possible to which embodiment of the present invention is applied in various terminals, can the present invention be used with cross-platform cross terminal Embodiment, the scope of application are very extensive.
Description of the drawings
Fig. 1 is the webpage matching process flow chart according to the mobile terminal of embodiment of the present invention;
Fig. 2 is the mobile terminal logging in network notepad flow chart according to embodiment of the present invention;
Fig. 3 is that the mobile terminal webpage of embodiment of the present invention preserves the matching process flow of content with network notepad Figure;
Fig. 4 is the webpage coalignment structure chart according to the mobile terminal of embodiment of the present invention;
Fig. 5 is the webpage matching system structure chart according to the mobile terminal of embodiment of the present invention.
Specific embodiment
To make the object, technical solutions and advantages of the present invention clearer, the present invention is made below in conjunction with the accompanying drawings further Detailed description.
At present, usually there is network notepad plug-in unit on browser of mobile terminal, it can be achieved that the mobile terminal that will be browsed to Webpage editing is preserved to the function of network side notepad.
Embodiment of the present invention can be based on network notepad pin function, realize user in webpage editing, by Has the uniform resource locator of webpage editing in distribution network notepad(URL)Or the information such as keyword, find out network account Similar webpage editing in this, and classified catalogue and/or label information are stored for current web page editing intelligent Matching notebook, etc. Deng.
In embodiments of the present invention, based on wireless network environment, mobile terminal easily subscription information and general are realized Related information content is clipped to network notepad.For example, the browser of mobile terminal and the notepad of network side can carry respectively Network connection is established for application programming interfaces, and completes the access mandate of network notepad.
In embodiments of the present invention, mobile terminal can obtain the URL of current web page editing, and pass through spiders (spider)The element of the current page of technology crawl user's browsing(Including the keyword under meta labels(keywords)With retouch It states(description)Deng), then the interface provided by network notepad, it is searched from network side notepad with similar The webpage editing of URL and/or keyword, then matching result is fed back into browser side, browser-presented gives user final With as a result, simultaneously intelligently matching notebook classification and label for current web page editing.Embodiment of the present invention may need to be related to skill Art includes plug-in part technology, webpage editing technology, keyword crawler technology etc..
Fig. 1 is the webpage matching process flow chart according to the mobile terminal of embodiment of the present invention.
As shown in Figure 1, that the method comprising the steps of is as follows:
Step 101:It determines that mobile terminal it is expected the web page contents preserved, and extracts the attribute that the expectation preserves web page contents Information.
Herein, mobile terminal can include but is not limited to:Functional mobile phone, smart mobile phone, palm PC, personal electricity Brain(PC), tablet computer or personal digital assistant(PDA), etc..
When mobile terminal browses webpage by browser, it will usually it is expected the net for preserving a part of web page contents or whole Page content.The entire Web page that mobile terminal is browsing can be determined as to the mobile terminal at this time it is expected in the webpage preserved Hold;Or the part webpage that mobile terminal is browsing is determined as mobile terminal and it is expected the web page contents preserved.
Also, it is preferred that extract the uniform resource locator corresponding to expectation preservation web page contents(URL)And/or extraction should It is expected to preserve the keyword in web page contents(Keyword).
Step 102:In the web page contents of preservation positioned at network side, the category that web page contents are preserved with the expectation is retrieved Property the corresponding web page contents of information, and determine the web page contents retrieved storage attribute.
Herein, the webpage editing kept before being stored in the server positioned at network side, Er Qie It is retrieved in the server and it is expected to preserve in the corresponding webpage of attribute information of web page contents with identified in the step 101 Hold, and determine the storage attribute of the web page contents retrieved.
In one embodiment, when extraction has expectation to preserve the URL corresponding to web page contents in a step 101, this When retrieve web page contents in the web page contents of preservation positioned at network side, wherein this is retrieved corresponding to web page contents The domain name for presetting rank of URL presets level domain famous prime minister with this of the URL corresponding to expectation preservation web page contents Together.
Domain name(Domain Name), a certain computer on the Internet that forms of name that is separated by a string with point Or the title of unit is calculated, for identifying the electronic bearing of computer in data transmission(Sometimes referred to as geographical location.
Second level domain refers to the domain name under top level domain, and under international top level domain, it refers to the net of domain name registration people Upper title, such as ibm, yahoo, microsoft etc.;Under national top level domain, it is the symbol for representing registered enterprise's classification, Such as com, edu, gov, net etc..Second level domain is divided into as two class of classification domain name and administrative area domain name.Classification domain name totally 6, packet Include the ac for scientific research institution;For the com of industrial, commercial and banking communities enterprise;For the edu of educational institution;For the gov of government department; For inter network information center and the net of Operation Centre;For the org of non-profit organization.
Three-level domain name is generally with letter(A~Z, a~z, capital and small letter etc.), number(0~9)And connector(-)Composition, it is at different levels Real point is used between domain name(.)Connection, the length of three-level domain name is generally no more than 20 characters.
By presetting the correspondence of appropriate level domain name, can be retrieved in the web page contents of preservation of network side Preserving web page contents with the expectation has the web page contents of same levels domain name.
Such as:When the web page contents URL for it is expected to preserve ishttp://tech.qq.com/a/20120910/ 000045.htmWhen;And assume to preset corresponding level domain name for three-level domain name;
At this time in traverses network notepad all saved webpage editings URL, see if there is with it is expected preserve net The identical webpage editing of page content URL three-level domain names, it is opposite with it is expected the attribute information for preserving web page contents if there is being determined as The web page contents answered.
Assuming that the URL for retrieving certain webpage editing is http://tech.qq.com/a/20120910/000071.htm, So to be learnt by URL matchings, the three-level domain name for retrieving web page contents of the URL of webpage editing with it is expected to preserve is identical, It is thus determined that for the corresponding web page contents of attribute information with it is expected preservation web page contents, then determine in the webpage retrieved The storage attribute of appearance(Such as storage catalogue and/or label in network side, etc.).
In another embodiment, when extracting the keyword in expectation preservation web page contents in step 101, at this time Web page contents are retrieved in the web page contents of preservation positioned at network side, wherein this retrieves the key that web page contents are included In word, the keyword number identical with the keyword that the expectation preserves web page contents is more than predetermined threshold value.
Usually, mega elements are had in Webpage, include the information such as keyword inside mega elements.
Signal code for keyword in web interface below:
<html>
<head>
<title>Title content</title>
<Meta name=" description " content=" your description contents ">
<Meta name=" keyword " content=" keywords 1, keyword 2, --- ">
</head>
</html>
By traversing all webpage editings in notepad, then the keyword in these elements is extracted, can obtain every The keyword of a webpage editing.In this way, the keyword first in the webpage of extraction expectation preservation web page contents, then The webpage editing in notepad is traversed, matching has the webpage editing of preservation of predetermined threshold value same keyword, really It is set to the corresponding web page contents of attribute information with it is expected preservation web page contents, then determines the storage of the web page contents retrieved Deposit attribute(Such as storage catalogue and/or label in network side, etc.).
Step 103:According to the storage attribute of the web page contents retrieved, it is expected to preserve described in the network side preservation Web page contents.
In one embodiment, when step 102 determines the store files clip directory of the web page contents retrieved; It can be preserved in the webpage for it is expected to preserve in the store files clip directory of the web page contents retrieved of the network side Hold.
In one embodiment, when step 102 determines the label information of the web page contents retrieved;It can be with needle This is set to retrieve the label information of web page contents the web page contents for it is expected to preserve, and preserve the net for it is expected to preserve Page content.
It is highly preferred that when step 102 determines the store files clip directory and label information of the web page contents retrieved When;The web page contents for it is expected to preserve can be directed to, the label information for retrieving web page contents is set, and in the network side The web page contents retrieved store files clip directory in, preserve it is described it is expected preserve web page contents.
In embodiments of the present invention, it is big to analyze webpage editing in webpage editing that can be similar from network notepad Mostly it is which notebook classification belonged to(That is folder content)And label, intelligence match notebook for the webpage editing currently preserved Classify and stamp relevant label.
Notebook classification is usually provided in network notepad(That is file or catalogue)Function, user can be by different pens Note, which classifies, to be placed in different notebook classifications.After user's editing webpage, can first it match currently with user account Notebook classification of similar webpage editing, can select these highest classification of editing general character, be cut as current web page in this The candidate classification collected.If the common category that similar several editings preserve can not be selected(Such as, the notes one's duty of several webpage editings Class is all different, so as to select), then current web page editing can be placed in the default categories of notebook.
Usually also provided in network notepad to the function that labels of notes, user can by different notes according to oneself Demand is tagged to taking down notes.After user's editing webpage, can first it match currently with similar several in user account sheet The label of a webpage editing selects general character highest predetermined number(Such as two)Label, the time as current web page editing Select label.If the common tag that similar several editings preserve can not be selected(As webpage editing is not all tagged), then Just current web page editing is then given tacit consent to and is not labelled.
It in embodiments of the present invention, can be by a variety of wireless between mobile terminal and the account book server of network side Communication network is communicatively coupled.Direction and time relationship by information transmission, the communication party between mobile terminal and server Formula can be divided into simplex communication, half-duplex operation and full-duplex communication, etc..
In embodiments of the present invention, specific communication protocol can be arranged between mobile terminal and account book server, The form that good data cell uses defined in these communication protocols, the information that information unit should include and meaning, connection side The sequential that formula, information send and receive, so that it is guaranteed that data are successfully transmitted to determining place in network.
For example, the communication protocol that embodiment of the present invention may be used includes but is not limited to:Transmission control protocol/net Border agreement(TCP/IP), hypertext transfer protocol(HTTP), Simple Mail Transfer protocol (SMTP), the 3rd version of post office protocol This(POP3), etc..
Moreover, mobile terminal can perform letter by a variety of communication standards and account book server in embodiment of the present invention Breath interaction.Such as:Global system for mobile communications may be used(GSM), wideband code division multiple access(WCDMA), CDMA 2000 (CDMA-2000), TD SDMA(TD-SCDMA)Etc. various communication standards.
Information exchange form between mobile terminal and account server can have diversified forms.For example, information format It can include but is not limited to:Short message(SMS), Email, instant messaging(IM)Information, multimedia messages(MMS)Or Voice messaging, etc..
Preferably, mobile terminal further comprises before the account book server of logging in network side:
Whether notepad server authentication mobile terminal identity is legal, and and if only if just allows when mobile terminal identity is legal The login of the mobile terminal;Wherein:Notepad server authentication mobile terminal identity it is whether legal including:Verify that mobile terminal is used Whether family fingerprint is with authorizing whether fingerprint match, the verify iris of mobile terminal user with mandate iris matches, verifies shifting The international mobile equipment identification code of dynamic terminal(IEMI)Whether whether just legal or mobile terminal screen protection sets password Really.
In embodiments of the present invention, mobile terminal can be by plug-in unit logging in network notepad, so as to fulfill corresponding The function of the similar webpage editing of retrieval.After mobile terminal installs plug-in unit, browser can jump to stepping on for network notepad Record or enrollment page will start identification authorization program after successfully logining, ask the user whether that the mandate of network notepad is allowed to move Terminal browser creates notebook, accesses notebook classification and label etc..
Fig. 2 is the mobile terminal logging in network notepad flow chart according to embodiment of the present invention.
As shown in Fig. 2, this method includes:
Step 201:Mobile terminal passes through browser login interface logon attempt network notepad.
Step 202:Network notepad judges whether the user identity is correct, if it is performs step 203 and its follow-up Otherwise step performs step 205 and terminates this flow.
Step 203:Judge user's whether on a web browser first time logging in network notepad, if it is perform step 206 and terminate this flow, otherwise perform step 204 and its subsequent step.
Step 204:The mobile terminal logs on to network notepad.
After mobile terminal logs on to network notepad, user access some webpage need preserve webpage editing when, Point plug-in unit icon can pop up the window shared to plug-in unit, and user can select to preserve webpage according to self-demand, then It can be according to the dimensions such as URL and/or page keyword, by the webpage of current clip and the webpage being stored in notepad progress Match.
Based on above-mentioned detailed analysis, Fig. 3 is in the mobile terminal webpage and the preservation of network notepad of embodiment of the present invention The matching process flow chart of appearance.
As shown in figure 3, this method includes:
Step 301:Webpage is opened with mobile phone browser.
Step 302:Mobile phone browser triggers webpage editing function.
Step 303:Judge whether to log in and obtain the mandate of third party's network notepad, if it is perform step 304 And its subsequent step, it otherwise performs step 305 and terminates this flow.
Step 304:Browser triggering editing instruction, and current web page editing is sent to third party's notepad.
Step 305:Log on to third party's network notepad page, obtain corresponding authorize, and terminate this flow.
At this point, flow of the present invention can perform two independent branches respectively, a branch is the sequence since step 306 It performs(Step 306~step 311), another branch is that sequence performs since step 312(Step 312~step 317).
Step 306:Obtain the keyword of current web page tag element(Keyword).
Step 307:Traverse the page-tag element of all webpage editings in third party's network notepad.
Step 308:, similar tag element is matched, and pass through in all webpage editings in third party's network notepad The number of same keyword judges the similarity degree of webpage editing in tag element.Including:Traverse all webpage editings Keyword, to find the keyword identical with current web page, be ranked up according still further to matched keyword number, and according to The sequence of keyword number size is matched, determines the more predetermined number webpage editing of matching keyword number.
Step 309:Judge whether to find the webpage editing identical with current web page keyword, if so, performing step 310, it otherwise performs step 311 and terminates this flow.
Step 310:The similar webpage editing of matching degree highest predetermined number is sent to browser of mobile terminal, at this time These can be shown similar to webpage editing in browser of mobile terminal.Moreover, based on these similar to the common storage of webpage editing Attribute, in the web page contents that the network side preservation it is expected to preserve.
Step 311:Prompting browser does not find similar editing, and terminates this flow.
Step 312:Obtain the URL of current web page editing.
Step 313:Traverse the URL of all saved webpage editings in third party's network notepad.
Step 314:Similar URL is matched, and passes through multilevel field name similarity and determines similar webpage editing.Wherein:It is first Top-level domain is first traversed, if the webpage editing number found is more than the threshold value M of the similar number of predetermined editing, identical one Traversal second level domain is further continued between several network address under grade domain name, and so on, until webpage editing number for finding etc. In or less than M values, so as to obtain similar webpage editing.
Step 315:Judge whether to find the N grades of identical webpages of domain name, if it is perform step 317 and terminate this stream Otherwise journey performs step 316 and terminates this flow.
Step 316:Prompting browser does not find similar editing.
Step 317:The similar webpage editing of matching degree highest predetermined number is sent to browser of mobile terminal, at this time These can be shown similar to webpage editing in browser of mobile terminal.Moreover, based on these similar to the common storage of webpage editing Attribute, in the web page contents that the network side preservation it is expected to preserve.
Based on above-mentioned detailed analysis, embodiment of the present invention also proposed a kind of webpage coalignment of mobile terminal.
Fig. 4 is the webpage coalignment structure chart according to the mobile terminal of embodiment of the present invention.
As shown in figure 4, the device includes web page contents determination unit 401, corresponding web content retrieval unit 402 and webpage Content storage unit 403, wherein:
Web page contents determination unit 401, the web page contents preserved for mobile terminal to be determined it is expected, and extract expectation guarantor Deposit the attribute information of web page contents;
Corresponding web content retrieval unit 402, in the web page contents of preservation positioned at network side, retrieving with being somebody's turn to do It is expected the corresponding web page contents of attribute information of preservation web page contents, and determine the storage category of the web page contents retrieved Property;
Web page contents storage unit 403, for the storage attribute of web page contents retrieved according to, in the network side Preserve the web page contents for it is expected to preserve.
In one embodiment, web page contents determination unit 401 is preserved for extracting the expectation corresponding to web page contents Uniform resource locator;
Corresponding web content retrieval unit 402, for retrieving webpage in the web page contents of preservation positioned at network side Content, the wherein domain name for presetting rank for retrieving the uniform resource locator corresponding to web page contents, with the expectation Preserving this of the uniform resource locator corresponding to web page contents, to preset level domain name identical.
In one embodiment, web page contents determination unit 401, for extracting the pass in expectation preservation web page contents Key word;
Corresponding web content retrieval unit 402, for retrieving webpage in the web page contents of preservation positioned at network side Content, wherein this retrieve in the keyword that web page contents are included, with the expectation preserve web page contents keyword it is identical Keyword number is more than predetermined threshold value.
In one embodiment, corresponding web content retrieval unit 402 is used to determine the web page contents retrieved Store files clip directory;
Web page contents storage unit 403 in the store files clip directory of the web page contents retrieved of the network side, Preserve the web page contents for it is expected to preserve.
Preferably, corresponding web content retrieval unit 402, for determining the label information of the web page contents retrieved;
Web page contents storage unit 403 sets this to retrieve in webpage for being directed to the web page contents for it is expected to preserve The label information of appearance, and preserve the web page contents for it is expected to preserve.
Preferably, corresponding web content retrieval unit 402, for determining that the store files of the web page contents retrieved are pressed from both sides Catalogue and label information;
Web page contents storage unit 403 sets this to retrieve web page contents for the web page contents for it is expected to preserve Label information, and in the store files clip directory of the web page contents retrieved of the network side, preserve the expectation and preserve Web page contents.
Based on above-mentioned detailed analysis, embodiment of the present invention also proposed a kind of webpage matching system of mobile terminal.
Fig. 5 is the webpage matching system structure chart according to the mobile terminal of embodiment of the present invention.
As shown in figure 5, the system includes mobile terminal 501 and the webpage match server 502 positioned at network side, wherein:
Mobile terminal 501 for determining the web page contents for it is expected to preserve, and extracts the attribute that the expectation preserves web page contents Information, and the attribute information of the web page contents that the expectation is preserved and the web page contents is sent to webpage match server;
Webpage match server 502 is protected in the web page contents of preservation positioned at network side, retrieving with the expectation The corresponding web page contents of attribute information of web page contents are deposited, and determine the storage attribute of the web page contents retrieved, and root According to the storage attribute of the web page contents retrieved, in the web page contents for it is expected to preserve described in the network side preservation.
In one embodiment, mobile terminal 501 provide for extracting the unification that the expectation is preserved corresponding to web page contents Source finger URL;
Webpage match server 502, for retrieving web page contents in the web page contents of preservation positioned at network side, In this retrieve the domain name for presetting rank of the uniform resource locator corresponding to web page contents, preserve webpage with the expectation To preset level domain name identical for this of uniform resource locator corresponding to content.
In one embodiment, mobile terminal 501, for extracting the keyword in expectation preservation web page contents;
Webpage match server 502, for retrieving web page contents in the web page contents of preservation positioned at network side, In this retrieve in the keyword that web page contents are included, the identical number of keyword of keyword of web page contents is preserved with the expectation Mesh is more than predetermined threshold value.
Preferably, webpage match server 502, for determining the store files clip directory of the web page contents retrieved, And in the store files clip directory of the web page contents retrieved of the network side, preserve in the webpage for it is expected to preserve Hold.
Preferably, webpage match server 502, for determining the label information of the web page contents retrieved, for institute The label information for it is expected that the web page contents preserved set this to retrieve web page contents is stated, and is preserved in the webpage for it is expected to preserve Hold.
It is highly preferred that webpage match server 502, for determining the store files clip directory of the web page contents retrieved And label information, the label information for retrieving web page contents is set, and in the net for the web page contents for it is expected to preserve In the store files clip directory of the web page contents retrieved of network side, the web page contents for it is expected to preserve are preserved.
Although some concrete forms for the information exchange form being set out in detail above between mobile terminal and server, this Field technology personnel are it is to be appreciated that this enumerate only is exemplary rather than embodiment of the present invention is defined.
Fig. 4 shown devices can be integrated into the hardware entities of various communication networks.It for example, can be by mobile terminal Webpage coalignment be integrated into:Functional mobile phone, smart mobile phone, palm PC, PC(PC), tablet computer or a number Word assistant(PDA), etc. among equipment.
Indeed, it is possible to the webpage coalignment that embodiment of the present invention is proposed is embodied by diversified forms. For example, the application programming interfaces of certain specification can be followed, webpage coalignment is written as being installed to inserting in mobile terminal Part program can also be encapsulated as application program so that user voluntarily downloads use.When being written as plug-in card program, can incite somebody to action It is implemented as a variety of card formats such as ocx, dll, cab.Can also by Flash plug-in units, RealPlayer plug-in units, MMS plug-in units, The particular techniques such as MIDI staffs plug-in unit, ActiveX plug-in units implement the webpage coalignment that embodiment of the present invention is proposed.
For example, the webpage coalignment that embodiment of the present invention is proposed can be arranged on movement eventually by card format In the various application software at end, for example set in communication recording software, and user can facilitate startup, close the plug-in unit. When communicating recording software publication, it is only necessary to built-in plug-in component switch, subscribe to the list interface of catalogue, browser html parsing modules, It is equivalent to and increases a shell to communication recording software, be not related to any content in logic, after user opens plug-in unit, can just go to draw It takes and subscribes to catalogue and subscribed content.
The mobile terminal that the storing mode that instruction or instruction set store is proposed embodiment of the present invention can be passed through Webpage matching process is stored on various storage mediums.These storage mediums include but is not limited to:Floppy disk, CD, DVD, Hard disk, flash memory, USB flash disk, CF cards, SD card, mmc card, SM cards, memory stick(Memory Stick), xD cards etc..
Furthermore it is also possible to the webpage matching process for the mobile terminal that embodiment of the present invention is proposed is applied to based on sudden strain of a muscle It deposits(Nand flash)Storage medium in, such as USB flash disk, CF cards, SD card, SDHC cards, mmc card, SM cards, memory stick, xD cards etc..
In conclusion in embodiments of the present invention, determine that mobile terminal it is expected the web page contents preserved, and extracts the phase Hope the attribute information for preserving web page contents;In the web page contents of preservation positioned at network side, retrieve and preserve net with the expectation The corresponding web page contents of attribute information of page content, and determine the storage attribute of the web page contents retrieved;According to described The storage attribute of the web page contents retrieved, in the web page contents for it is expected to preserve described in the network side preservation.It can be seen that using After embodiment of the present invention, different from the webpage editing that simply preservation mobile terminal is browsed in the prior art, but when shifting When dynamic terminal has new webpage editing expectation preservation, similar webpage editing whether can have been preserved with retrieval network side, and Classification storage for the webpage editing is realized according to retrieval result, is improved so as to the storage efficiency of webpage editing.
Moreover, embodiment of the present invention can be new web page editing according to similar webpage clip tag information has been preserved Content label is set automatically, so as to also improve the label of webpage editing setting efficiency.
Furthermore it is possible to which embodiment of the present invention is applied in various terminals, can the present invention be used with cross-platform cross terminal Embodiment, the scope of application are very extensive.
The foregoing is only a preferred embodiment of the present invention, is not intended to limit the scope of the present invention.It is all Within the spirit and principles in the present invention, any modification, equivalent replacement, improvement and so on should be included in the protection of the present invention Within the scope of.

Claims (15)

1. the webpage matching process of a kind of mobile terminal, which is characterized in that this method includes:
It determines that mobile terminal it is expected the web page contents preserved, and extracts the attribute information that the expectation preserves web page contents;
In the web page contents of preservation positioned at network side, retrieve corresponding with the attribute information of expectation preservation web page contents Web page contents, and determine the web page contents retrieved storage attribute;
According to the storage attribute of the web page contents retrieved, in the web page contents for it is expected to preserve described in the network side preservation;
The web page contents that the determining mobile terminal it is expected to preserve include:The entire Web page that mobile terminal is browsing is determined as The mobile terminal it is expected the web page contents preserved;Or the part webpage that mobile terminal is browsing is determined as mobile terminal and it is expected The web page contents of preservation;
It is described extract the expectation preserve web page contents attribute information be:Extract the keyword in expectation preservation web page contents;
In the web page contents of preservation positioned at network side, the attribute information phase that web page contents are preserved with the expectation is retrieved Corresponding web page contents include:
Web page contents are retrieved in the web page contents of preservation positioned at network side, wherein this retrieves what web page contents were included In keyword, the keyword number identical with the keyword that the expectation preserves web page contents is more than predetermined threshold value.
2. the webpage matching process of mobile terminal according to claim 1, which is characterized in that described extraction expectation preserves The attribute information of web page contents is:Extract the uniform resource locator corresponding to expectation preservation web page contents;
In the web page contents of preservation positioned at network side, the attribute information phase that web page contents are preserved with the expectation is retrieved Corresponding web page contents include:
Web page contents are retrieved in the web page contents of preservation positioned at network side, wherein this is retrieved corresponding to web page contents The domain name for presetting rank of uniform resource locator preserves the uniform resource locator corresponding to web page contents with the expectation This to preset level domain name identical.
3. the webpage matching process of mobile terminal according to claim 1, which is characterized in that described to determine what this was retrieved The storage attribute of web page contents includes:Determine the store files clip directory of the web page contents retrieved;
The storage attribute of the web page contents retrieved described in the basis, in the webpage for it is expected to preserve described in the network side preservation Appearance includes:
In the store files clip directory of the web page contents retrieved of the network side, preserve in the webpage for it is expected to preserve Hold.
4. the webpage matching process of mobile terminal according to claim 1, which is characterized in that described to determine what this was retrieved The storage attribute of web page contents includes:Determine the label information of the web page contents retrieved;
The storage attribute of the web page contents retrieved described in the basis, in the webpage for it is expected to preserve described in the network side preservation Appearance includes:
This is set to retrieve the label information of web page contents for the web page contents for it is expected to preserve, and preserve the expectation and protect The web page contents deposited.
5. the webpage matching process of mobile terminal according to claim 1, which is characterized in that described to determine what this was retrieved The storage attribute of web page contents includes:Determine the store files clip directory and label information of the web page contents retrieved;
The storage attribute of the web page contents retrieved described in the basis, in the webpage for it is expected to preserve described in the network side preservation Appearance includes:
For the web page contents for it is expected to preserve, the label information for retrieving web page contents, and being somebody's turn to do in the network side are set In the store files clip directory of the web page contents retrieved, the web page contents for it is expected to preserve are preserved.
6. the webpage coalignment of a kind of mobile terminal, which is characterized in that the device includes web page contents determination unit, corresponding net Page content retrieval unit and web page contents storage unit, wherein:
Web page contents determination unit, the web page contents preserved for mobile terminal to be determined it is expected, and extract the expectation and preserve webpage The attribute information of content;
Corresponding web content retrieval unit, is protected in the web page contents of preservation positioned at network side, retrieving with the expectation The corresponding web page contents of attribute information of web page contents are deposited, and determine the storage attribute of the web page contents retrieved;
Web page contents storage unit, for the storage attribute of web page contents retrieved according to, in the network side preservation institute State the web page contents for it is expected to preserve;
The web page contents that the determining mobile terminal it is expected to preserve include:The entire Web page that mobile terminal is browsing is determined as The mobile terminal it is expected the web page contents preserved;Or the part webpage that mobile terminal is browsing is determined as mobile terminal and it is expected The web page contents of preservation;Wherein:
Web page contents determination unit, for extracting the keyword in expectation preservation web page contents;
Corresponding web content retrieval unit, for retrieving web page contents in the web page contents of preservation positioned at network side, In this retrieve in the keyword that web page contents are included, the identical number of keyword of keyword of web page contents is preserved with the expectation Mesh is more than predetermined threshold value.
7. the webpage coalignment of mobile terminal according to claim 6, which is characterized in that
Web page contents determination unit, for extracting the uniform resource locator corresponding to expectation preservation web page contents;
Corresponding web content retrieval unit, for retrieving web page contents in the web page contents of preservation positioned at network side, In this retrieve the domain name for presetting rank of the uniform resource locator corresponding to web page contents, preserve webpage with the expectation To preset level domain name identical for this of uniform resource locator corresponding to content.
8. the webpage coalignment of mobile terminal according to claim 6, which is characterized in that
Corresponding web content retrieval unit, for determining the store files clip directory of the web page contents retrieved;
Web page contents storage unit, in the store files clip directory of the web page contents retrieved of the network side, protecting Deposit the web page contents for it is expected to preserve.
9. the webpage coalignment of mobile terminal according to claim 6, which is characterized in that
Corresponding web content retrieval unit, for determining the label information of the web page contents retrieved;
Web page contents storage unit, for being directed to the label that the web page contents for it is expected to preserve set this to retrieve web page contents Information, and preserve the web page contents for it is expected to preserve.
10. the webpage coalignment of mobile terminal according to claim 6, which is characterized in that
Corresponding web content retrieval unit, for determining the store files clip directory of the web page contents retrieved and label letter Breath;
Web page contents storage unit sets this to retrieve the label of web page contents letter for the web page contents for it is expected to preserve Breath, and in the store files clip directory of the web page contents retrieved of the network side, preserve the webpage for it is expected to preserve Content.
11. the webpage matching system of a kind of mobile terminal, which is characterized in that the system includes mobile terminal and positioned at network side Webpage match server, wherein:
Mobile terminal for determining the web page contents for it is expected to preserve, and extracts the attribute information that the expectation preserves web page contents, and The web page contents and the attribute information of the web page contents that the expectation is preserved are sent to webpage match server;
Webpage match server preserves webpage in the web page contents of preservation positioned at network side, retrieving with the expectation The corresponding web page contents of attribute information of content, and determine the storage attribute of the web page contents retrieved, and according to described The storage attribute of the web page contents retrieved, in the web page contents for it is expected to preserve described in the network side preservation;
The web page contents that the determining mobile terminal it is expected to preserve include:The entire Web page that mobile terminal is browsing is determined as The mobile terminal it is expected the web page contents preserved;Or the part webpage that mobile terminal is browsing is determined as mobile terminal and it is expected The web page contents of preservation;Wherein:
Mobile terminal, for extracting the keyword in expectation preservation web page contents;
Webpage match server, for retrieving web page contents in the web page contents of preservation positioned at network side, the wherein inspection Rope goes out in the keyword that web page contents are included, and the keyword number identical with the keyword that the expectation preserves web page contents is more than Predetermined threshold value.
12. the webpage matching system of mobile terminal according to claim 11, which is characterized in that
Mobile terminal, for extracting the uniform resource locator corresponding to expectation preservation web page contents;
Webpage match server, for retrieving web page contents in the web page contents of preservation positioned at network side, the wherein inspection Rope goes out the domain name for presetting rank of the uniform resource locator corresponding to web page contents, and web page contents institute is preserved with the expectation To preset level domain name identical for this of corresponding uniform resource locator.
13. the webpage matching system of mobile terminal according to claim 11, which is characterized in that
Webpage match server, for determining the store files clip directory of the web page contents retrieved, and in the network side In the store files clip directory of the web page contents retrieved, the web page contents for it is expected to preserve are preserved.
14. the webpage matching system of mobile terminal according to claim 11, which is characterized in that
Webpage match server, for determining the label information of the web page contents retrieved, for the net for it is expected to preserve This retrieves the label information of web page contents to page curriculum offering, and preserves the web page contents for it is expected to preserve.
15. the webpage matching system of mobile terminal according to claim 11, which is characterized in that
Webpage match server, for determining the store files clip directory of the web page contents retrieved and label information, for The web page contents for it is expected to preserve set the label information for retrieving web page contents, and this in the network side retrieves In the store files clip directory of web page contents, the web page contents for it is expected to preserve are preserved.
CN201210518026.6A 2012-12-06 2012-12-06 A kind of webpage matching process of mobile terminal, device and system Active CN103853784B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210518026.6A CN103853784B (en) 2012-12-06 2012-12-06 A kind of webpage matching process of mobile terminal, device and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210518026.6A CN103853784B (en) 2012-12-06 2012-12-06 A kind of webpage matching process of mobile terminal, device and system

Publications (2)

Publication Number Publication Date
CN103853784A CN103853784A (en) 2014-06-11
CN103853784B true CN103853784B (en) 2018-06-15

Family

ID=50861450

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210518026.6A Active CN103853784B (en) 2012-12-06 2012-12-06 A kind of webpage matching process of mobile terminal, device and system

Country Status (1)

Country Link
CN (1) CN103853784B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107992556B (en) * 2017-11-28 2020-08-21 福建中金在线信息科技有限公司 Site management method and device, electronic equipment and storage medium
JP7127601B2 (en) * 2019-04-10 2022-08-30 日本電信電話株式会社 Similar Transition Identifying Device, Similar Transition Identifying Method, and Program
CN114281464A (en) * 2021-12-31 2022-04-05 瀚云科技有限公司 Multi-tenant dynamic login page generation method and system

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102236656A (en) * 2010-04-23 2011-11-09 英业达股份有限公司 System and method for providing target data through turning page
CN102737116A (en) * 2012-05-29 2012-10-17 深圳市同洲电子股份有限公司 Method and device for storing webpage resources

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE60045824D1 (en) * 1999-11-03 2011-05-19 Sublinks Aps METHOD, SYSTEM AND COMPUTER READABLE MEDIUM TO VE
US7421438B2 (en) * 2004-04-29 2008-09-02 Microsoft Corporation Metadata editing control
CN101291367A (en) * 2008-05-22 2008-10-22 德信无线通讯科技(北京)有限公司 Browser bookmark displaying method of mobile communication terminal, and mobile communication terminal thereof
CN101459571B (en) * 2008-12-16 2011-04-06 北京大学 Method, system and apparatus for website mirroring

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102236656A (en) * 2010-04-23 2011-11-09 英业达股份有限公司 System and method for providing target data through turning page
CN102737116A (en) * 2012-05-29 2012-10-17 深圳市同洲电子股份有限公司 Method and device for storing webpage resources

Also Published As

Publication number Publication date
CN103853784A (en) 2014-06-11

Similar Documents

Publication Publication Date Title
CN104199851B (en) The method and cloud server of telephone number are extracted by yellow page information
KR101322679B1 (en) An assistant―adviser using the semantic analysis of community exchanges
WO2015196907A1 (en) Search pushing method and device which mine user requirements
CN106933991A (en) A kind of depth analysis towards intelligent terminal and user&#39;s portrait system and method
WO2014180130A1 (en) Method and system for recommending contents
CN107526776A (en) The Computerized method and system of search result is presented
CN104063455A (en) Method and device for acquiring counseling messages of disease based on searching
CN105426759A (en) URL legality determining method and apparatus
CN103853767A (en) Method and device for sharing social circle based on browser
CN105391674A (en) Information processing method and system, server, and client
CN107958078A (en) Information generating method and device
CN110808868B (en) Test data acquisition method and device, computer equipment and storage medium
CA2977847A1 (en) Automated extraction tools and their use in social content tagging systems
CN103853784B (en) A kind of webpage matching process of mobile terminal, device and system
CN108763500A (en) Voice-based Web browser method, device, equipment and storage medium
CN101751462A (en) Network information storage and access methods, equipment and terminals
CN103729178A (en) Method and system for processing multiple tabs of browsers
CN115757991A (en) Webpage identification method and device, electronic equipment and storage medium
CN101470752A (en) Search engine method based on keyword resolution scheduling
CN103617043B (en) A kind of method and system uploaded with picture web data
CN108197112A (en) A kind of method that event is extracted from news
Prusty et al. SMS Fraud detection using machine learning
CN105740453B (en) Information-pushing method and device
US20130230248A1 (en) Ensuring validity of the bookmark reference in a collaborative bookmarking system
CN104090878A (en) Multimedia checking method, terminal, server and system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant