CN110020361A - A kind of web page processing method, device, storage medium and electronic equipment - Google Patents

A kind of web page processing method, device, storage medium and electronic equipment Download PDF

Info

Publication number
CN110020361A
CN110020361A CN201711100114.3A CN201711100114A CN110020361A CN 110020361 A CN110020361 A CN 110020361A CN 201711100114 A CN201711100114 A CN 201711100114A CN 110020361 A CN110020361 A CN 110020361A
Authority
CN
China
Prior art keywords
column
web page
webpage
target webpage
selected column
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201711100114.3A
Other languages
Chinese (zh)
Inventor
侯柏岑
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Sogou Technology Development Co Ltd
Original Assignee
Beijing Sogou Technology Development Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Sogou Technology Development Co Ltd filed Critical Beijing Sogou Technology Development Co Ltd
Priority to CN201711100114.3A priority Critical patent/CN110020361A/en
Publication of CN110020361A publication Critical patent/CN110020361A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking

Abstract

The embodiment of the invention provides a kind of web page processing method, device, storage medium and electronic equipments, efficiently to export required information from webpage.The method includes: to carry out structural analysis to Webpage, positions the corresponding web page element of each column in the Webpage;According to specified operation, target webpage element in selected column is extracted;Target webpage element executive editor in the selected column is exported.Can convenient, accurately selection target web page element, reduce the operation of selection course, improve the efficiency of target selection and the accuracy of target output.

Description

A kind of web page processing method, device, storage medium and electronic equipment
Technical field
The present invention relates to field of computer technology, more particularly to a kind of web page processing method, device, storage medium and electricity Sub- equipment.
Background technique
Current browser often shows many information, such as top navigation, webpage master when browsing webpage on webpage Topic, Web page text, advertisement etc., full page have information very rich.
When user is interested in the content in webpage, it is desirable to when executing the processing such as duplication, printing, can be selected in webpage Corresponding content is then copied in local document.But during content selection on webpage, user is usually required A starting point is selected to start according to left mouse button on webpage, then mobile mouse, Zhi Daoda in the case where keeping left button selected Left mouse button can be lifted to terminal, then clicks right mouse button on the chosen content again, in a menu selection duplication, ability Obtain the content chosen.
Aforesaid way is not only cumbersome, and during mouse mobile selection, user may be chosen not need Text or information, the accuracy and efficiency such as picture it is lower.
Summary of the invention
The embodiment of the present invention provides a kind of web page processing method, efficiently to export required information from webpage.
Correspondingly, the embodiment of the invention also provides a kind of page processor, a kind of electronic equipment, a kind of readable storages Medium, to guarantee the implementation and application of the above method.
To solve the above-mentioned problems, the embodiment of the invention discloses a kind of web page processing methods, comprising: to Webpage into Row structural analysis positions the corresponding web page element of each column in the Webpage;According to specified operation, extract in selected column Target webpage element;Target webpage element executive editor in the selected column is exported.
Optionally, described that target webpage element executive editor in the selected column is exported, it comprises at least one of the following: Target webpage element in the selected column is copied in shear plate;Print target webpage element in the selected column;It protects Target webpage element is deposited in the selected column to specified address;Target webpage element in the selected column is shared to specified Using.
Optionally, described that structural analysis is carried out to Webpage, position the corresponding webpage of each column in the Webpage Element, comprising: obtain the corresponding web page code of Webpage;Based on the web page code, the corresponding code block of each column is obtained; According to the corresponding web page element of column each in the code block locating web-pages page.
Optionally, described according to the corresponding web page element of column each in the code block locating web-pages page, comprising: foundation The code block determines corresponding node, wherein the node includes: father node and/or child node;It is corresponding according to each node Nodal information determines the corresponding column of each code block;According to keyword locating web-pages element in the column, and record corresponding Location information;The specified operation of the foundation, extracts target webpage element in selected column, comprising: according to the specified operation Select column, and the selection target web page element in the selected column;According to the location information of the target webpage element, Extract target webpage element in the selected column.
Optionally, the web page element comprises at least one of the following: text, picture, audio, animation, video.
Optionally, further includes: each column and the corresponding webpage of the column are shown by window in a browser Element, to select the target webpage element for needing to edit output;Wherein, the window includes editor's output control, the volume Volume output control include it is following at least one: copy control, word depghi, save control, share control.
Optionally, it prints in the selected column before target webpage element, further includes: to target in the selected column Web page element is edited, and the editor comprises at least one of the following operation: modification operation, insertion operation, delete operation.
The embodiment of the invention also discloses a kind of page processors, comprising: analyzing and positioning module, for Webpage Structural analysis is carried out, the corresponding web page element of each column in the Webpage is positioned;Element extraction module, for according to specified Target webpage element in selected column is extracted in operation;Output module is edited, for target webpage element in the selected column Executive editor's output.
Optionally, editor's output module, comprising: duplication submodule is used for target webpage in the selected column Element copies in shear plate;Submodule is printed, for printing target webpage element in the selected column;Submodule is saved, For saving in the selected column target webpage element to specified address;Share submodule, being used for will be in the selected column Target webpage element is shared to specified application.
Optionally, the analyzing and positioning module, comprising: acquisition submodule, for obtaining Webpage corresponding webpage generation Code;Submodule is analyzed, for being based on the web page code, obtains the corresponding code block of each column;Element positioning submodule, is used for According to the corresponding web page element of column each in the code block locating web-pages page.
Optionally, the element positioning submodule, for determining corresponding node according to the code block, wherein described Node includes: father node and/or child node;According to the corresponding nodal information of each node, the corresponding column of each code block is determined;? According to keyword locating web-pages element in the column, and record corresponding location information;The element extraction module includes: choosing Submodule is selected, for selecting column, and the selection target web page element in the selected column according to the specified operation;It mentions Submodule is taken, for the location information according to the target webpage element, extracts target webpage element in the selected column.
Optionally, the web page element comprises at least one of the following: text, picture, audio, animation, video.
Optionally, further includes: display module, for showing each column and described by window in a browser The corresponding web page element of column, to select the target webpage element for needing to edit output;Wherein, the window includes that editor is defeated Control out, the editor export control include it is following at least one: copy control, word depghi save control, share control.
Optionally, submodule is printed, is also used to edit target webpage element in the selected column, the editor Comprise at least one of the following operation: modification operation, insertion operation, delete operation.
The embodiment of the invention also discloses a kind of readable storage medium storing program for executing, which is characterized in that the finger in the storage medium When enabling the processor execution by electronic equipment, so that electronic equipment is able to carry out as described in one or more in the embodiment of the present invention Web page processing method.
The embodiment of the invention also discloses a kind of electronic equipment, which is characterized in that include memory and one or More than one program, perhaps more than one program is stored in memory and is configured to by one or one for one of them It includes the instruction for performing the following operation that a above processor, which executes the one or more programs: to Webpage Structural analysis is carried out, the corresponding web page element of each column in the Webpage is positioned;According to specified operation, selected column is extracted Middle target webpage element;Target webpage element executive editor in the selected column is exported.
Optionally, described that target webpage element executive editor in the selected column is exported, it comprises at least one of the following: Target webpage element in the selected column is copied in shear plate;Print target webpage element in the selected column;It protects Target webpage element is deposited in the selected column to specified address;Target webpage element in the selected column is shared to specified Using.
Optionally, described that structural analysis is carried out to Webpage, position the corresponding webpage of each column in the Webpage Element, comprising: obtain the corresponding web page code of Webpage;Based on the web page code, the corresponding code block of each column is obtained; According to the corresponding web page element of column each in the code block locating web-pages page.
Optionally, described according to the corresponding web page element of column each in the code block locating web-pages page, comprising: foundation The code block determines corresponding node, wherein the node includes: father node and/or child node;It is corresponding according to each node Nodal information determines the corresponding column of each code block;According to keyword locating web-pages element in the column, and record corresponding Location information;The specified operation of the foundation, extracts target webpage element in selected column, comprising: according to the specified operation Select column, and the selection target web page element in the selected column;According to the location information of the target webpage element, Extract target webpage element in the selected column.
Optionally, the web page element comprises at least one of the following: text, picture, audio, animation, video.
Optionally, also comprising the instruction for performing the following operation: each column is shown by window in a browser, And the corresponding web page element of the column, to select the target webpage element for needing to edit output;Wherein, the window packet Include editor's output control, the editor export control include it is following at least one: copy control, word depghi save control, divide Enjoy control.
Optionally, it prints in the selected column before target webpage element, further includes: to target in the selected column Web page element is edited, and the editor comprises at least one of the following operation: modification operation, insertion operation, delete operation.
The embodiment of the present invention includes following advantages:
The embodiment of the present invention can carry out structural analysis to Webpage, position the corresponding net of each column in the Webpage Page element, then according to specified operation, extracts mesh in selected column to obtain each web page element in webpage by structural analysis Mark web page element, can convenient, accurately selection target web page element, then target webpage element in the selected column is executed Editor's output, reduces the operation of selection course, improves the efficiency of target selection and the accuracy of target output.
Detailed description of the invention
Fig. 1 is a kind of step flow chart of web page processing method embodiment of the invention;
Fig. 2 is the step flow chart of another web page processing method embodiment of the invention;
Fig. 3 is a kind of structural block diagram of page processor embodiment of the invention;
Fig. 4 is the structural block diagram of another page processor embodiment of the invention;
Fig. 5 is a kind of structural block diagram of electronic equipment for Web Page Processing shown according to an exemplary embodiment;
Fig. 6 is a kind of structural representation of the electronic equipment for Web Page Processing that the present invention is shown according to another exemplary embodiment Figure.
Specific embodiment
In order to make the foregoing objectives, features and advantages of the present invention clearer and more comprehensible, with reference to the accompanying drawing and specific real Applying mode, the present invention is described in further detail.
Referring to Fig.1, a kind of step flow chart of web page processing method embodiment of the invention is shown, can specifically include Following steps:
Step 102, structural analysis is carried out to Webpage, positions the corresponding web page element of each column in the page.
After inputting web page address by various modes in a browser, request can be issued based on web page address, foundation should Request receives web data, can be parsed based on the web data, render webpage, then open the webpage in a browser.Wherein, The web data includes the web page elements such as web page code and required picture, so as to parse to obtain webpage.The present invention is real The structure of Webpage can be analyzed during web analysis, rendering by applying example, or for the webpage having already turned on The page carries out structural analysis, and structure determination each column therein and each column by analyzing Webpage include Web page element.
Wherein, column is the modules in Webpage, and disparate modules carry different contents, such as theme, text Deng, wherein the column comprises at least one of the following: header, footer, theme, text, advertisement.Header is located at Webpage Top area, such as each component part of recordable website;Corresponding footer is located at the bottom section of Webpage, such as recordable The recommended information of website, such as copyright information, relief regulations, privacy provision;Theme is the theme of website, be usually located at header it Under Webpage top, such as display web site name;Text is the region of Webpage main contents, generally takes up webpage page Most of region in face, such as the video playing area and user comment area of the body of news web page, video web-pages;Advertisement is The region of displayed web page, alternatively referred to as advertisement position in Webpage, advertisement position can be located at Webpage any region, specifically according to It is arranged according to advertisement position and determines.May also include the contents such as header, link in webpage, these contents can be located in corresponding plate, Can be individually for a kind of plate, the embodiment of the present invention to this with no restriction.
Web page element is the basic element for constituting webpage, and the web page element comprises at least one of the following: text, picture, Audio, animation, video.Each version can carry one or more web page elements, such as by text, picture, audio, animation, view The content of each column of one or more compositions of frequency.
To pass through the structure of analysis Webpage, it may be determined that contain in the column and each column that the webpage contains Web page element, and each web page element in each column can be oriented, be convenient for subsequent acquisition.Wherein, web page element can be direct Editor, can also be by forms editors such as links, so as to obtain web page element by link etc. in code.
Step 104, according to specified operation, target webpage element in selected column is extracted.
During user browses each webpage in a browser, for interested content, duplication, printing etc. can be performed and compile Output operation is collected, therefore user can indicate the specified operation to be executed, is then indicated according to the specified operation interested interior Hold and required editor exports operation, can accordingly select column to determine selected column in Webpage, according to the version One or more web page elements, determine target webpage element, then extract target webpage element in selected column in block.
Step 106, target webpage element executive editor in the selected column is exported.
For target webpage element in selected column, corresponding editor's output can be executed according to specified operation, such as replicates phase The content answered, or the corresponding content of printing etc., so as to specify required web page element to carry out editor's output from the page, The operation for reducing selection course improves the efficiency of target selection and the accuracy of target output.
In conclusion structural analysis can be carried out to Webpage, the corresponding webpage of each column in the Webpage is positioned Element, then according to specified operation, extracts target in selected column to obtain each web page element in webpage by structural analysis Web page element, can convenient, accurately selection target web page element, then volume is executed to target webpage element in the selected column Output is collected, the operation of selection course is reduced, improves the efficiency of target selection and the accuracy of target output.
It is described that target webpage element executive editor in the selected column is exported including following in the embodiment of the present invention It is at least one: target webpage element in the selected column is copied in shear plate;Print target network in the selected column Page element;Target webpage element is saved in the selected column to specified address;By target webpage element in the selected column Share to specified application.
Target webpage element in selected column can be copied in shear plate, that is, pass through calling interface for mesh in selected column Mark web page element, such as text, picture copy in shear plate, the process in browser by being replicated after mouse selection content Process it is similar, but the embodiment of the present invention can accurate named web page element be replicated, improve treatment effeciency.
Also can print target webpage element in the selected column can select that is, by calling the relevant interface of printer Target webpage element in column, such as text, picture are transferred to printer, are then printed, and beat in the process and browser The process of printed network page is similar, but the embodiment of the present invention can accurately named web page element be printed, and treatment effeciency is improved.
Also can be reserved for target webpage element in the selected column will be selected to specified address by the interface of calling storage Determining target webpage element in column, such as text, picture, audio, animation, video storage are into specified memory space, thus fastly Interested information in the storage webpage of speed.
Target webpage element in the selected column can also be shared to specified application, i.e., by calling the interface of storage to obtain Take the link of target webpage element such as text, picture, audio, animation, video etc. or the target webpage element in selected column Equal sharing informations, such as the chained address of picture, audio, animation, video, then by target webpage element or target webpage element Sharing information be sent in specified application, which includes various types of applications, such as instant messaging application, social Using, Video Applications etc..
The editor that target webpage element in column is selected in webpage is exported to realize.
Referring to Fig. 2, the step flow chart of another web page processing method embodiment of the invention is shown, specifically can wrap Include following steps:
Step 202, the corresponding web page code of Webpage is obtained.
Step 204, it is based on the web page code, obtains the corresponding code block of each column.
For the Webpage in browser, the corresponding web page code of the Webpage can be obtained, then according to the webpage Code analysis structure of web page, wherein the corresponding code of Webpage is usually one piece one piece, therefore can obtain corresponding code Block, such as the corresponding code block of head, the corresponding code block of body etc..Can also be according to the annotation information in web page code, such as infuse Begin, end in releasing determine the starting and ending of one section of code, to obtain corresponding code block.In the embodiment of the present invention, One column can correspond to a code block, so that the corresponding code block of each column can be obtained by structural analysis.Certainly, certain In the case of, it is also possible to there is the case where multiple plates corresponding big code block, such situation in combination with annotation information etc. from The corresponding filial generation code block of each plate is distinguished in the code block.
Step 206, according to the corresponding web page element of column each in the code block locating web-pages page.
Then corresponding column can be determined respectively according to each code block, the corresponding webpage of the column is positioned in code block Element determines the position where the web page elements such as text, picture.Wherein, described according in the code block locating web-pages page The corresponding web page element of each column, comprising: determine corresponding node according to the code block, wherein the node includes: father's section Point and/or child node;According to the corresponding nodal information of each node, the corresponding column of each code block is determined;In the column according to According to keyword locating web-pages element, and record corresponding location information.
Wherein, it usually not unites completely for the content in each column and column since the design of each webpage is different One mark, but would generally indicate specified content using some general keywords, such as title by " title " or The headline of news web page is " news_title " etc., so as to pass through the corresponding node relationships of code block, and key Word analysis etc. determines the web page element in column and column.It can determine the corresponding node of the code block and corresponding section Point information, specifically can determine the interdependent nodes such as the corresponding node of the code block and father node, the child node of the node, thus According to the corresponding nodal information of each node, the corresponding column of each code block, such as theme, text are determined.Then it is being based on the section The contents such as point information, annotation information, obtain the keyword of web page code corresponding position, determine corresponding webpage according to the keyword Element, and the location information of web page element is recorded, realize the positioning to web page element.It may also be combined with webpage in the embodiment of the present invention The page corresponds to the data such as the label in hypertext markup language (Hyper Text Markup Language, HTML) to analyze version Web page element in block and column.
After orienting column and its web page element, required webpage member is may be selected in user during browsing webpage Element is exported, wherein above-mentioned the step of orienting column and its web page element can parse, the process of rendering in Webpage Middle execution can also execute after the triggering instruction for receiving user.The triggering indicates selection of user's triggering to editor's output, by There are many column and object element in webpage, therefore a settable window shows column and object element, selects convenient for user It is exported after selecting.
In an alternative embodiment of the invention, each column and the column are shown by window in a browser Corresponding web page element, to select the target webpage element for needing to edit output;Wherein, the window includes editor's output control Part, it includes: copy control, word depghi and/or preservation control that the editor, which exports control,.
One window can be set in a browser, each column that Webpage contains is shown in the window, and every The web page element contained in a column, a kind of corresponding Webpage of window can be determined according to triggering instruction, can also be browser Webpage where middle current focus.To which user can select column and the corresponding webpage of the column in the window Element, the selection based on user produce corresponding specified operation.Also, user is before selection column and target webpage element Or later, the output to be executed operation also may be selected, therefore editor's output control can be set in the window, which exports control Editor for triggering target webpage element exports, editor output control include it is following at least one: copy control, printing Control saves control, shares control.For replicating to target webpage element, word depghi is used for target copy control Web page element is printed, and is saved control for saving to target webpage element, is shared control and be used for target webpage member Element is shared to specified application.To indicate specified operation accordingly by the above-mentioned control in triggering window, and then extract mesh Mark the output of web page element postedit.
Step 208, column, and the selection target web page element in selected column are selected according to the specified operation.
Step 210, the location information according to the target webpage element extracts target webpage member in the selected column Element.
User selects column, web page element in the window, and trigger editor output control after, can indicate needed for execute finger Fixed operation can determine that the column of user's selection, i.e., selected column may further determine that the selected version according to the parameter in the specified operation Web page element in block is target webpage element, so as to the location information based on target webpage element, in web page code Extract the target webpage element in corresponding position.
Wherein, according to the difference of target webpage element, extracting mode is there is also certain difference, and text is logical in web page element Web page code often is write direct, therefore can replicate and obtain directly from web page code, and for picture, audio, animation, video etc. Web page element is usually stored in web page code in the form of a link, therefore can be obtained based on the link to storage location request Take the web page elements such as picture, audio, animation, video.Certainly it also can extract the sharing information as web page element such as the link, with Just subsequent execution sharing operation.
For the target webpage element of extraction, behaviour can be exported according to editor's output control determination editor to be executed of triggering Make.
Step 212, target webpage element in the selected column is copied in shear plate.
For the content that needs replicate, target webpage element in selected column can be copied in shear plate, that is, pass through tune Target webpage element in selected column, such as text, picture are copied in shear plate with interface.It is subsequent reproducible to other need The position wanted, such as copy to document, carry out further editing and processing in application of drawing.
Step 214, target webpage element in the selected column is printed.
For the content that needs print, target webpage element in the selected column can print, i.e., by calling printer Relevant interface, target webpage element in column can be selected, such as text, picture are transferred to printer, are then printed.? Corresponding print option can be shown before printing, select the print parameters such as paper, the quantity of printing for user.
Wherein, it prints in the selected column before target webpage element, further includes: to target network in the selected column Page element is edited, and the editor comprises at least one of the following operation: modification operation, insertion operation, delete operation.User exists In printing network page when content, it there may come a time when also to need to edit the content, therefore can also operate target webpage element executive editor, The editor comprises at least one of the following operation: modification operation, insertion operation, delete operation.The target that needs can be printed Web page element copies in an editable interface, is similar to the softwares such as WORD, drawing, required edit operation then can be performed, Such as text, segment word can be deleted, be inserted into other texts or picture in the literature etc., modify certain texts etc.;It is for another example right In picture, it can be inserted into text in picture, other pictures etc. in stickup, can also change lattice to text, picture etc. again typesetting Formula.Content needed for user being obtained by editor in a word, then executes printing again.
Step 216, target webpage element is saved in the selected column to specified address.
For the content that needs save, it can be reserved for target webpage element in the selected column and pass through to specified address Call storage interface by the storages such as target webpage element such as text, picture, audio, animation, video in selected column to specify Memory space in.Hereafter corresponding content can be obtained in the memory space, and executes required operation, such as broadcasting audio, Animation, video etc..
Step 218, target webpage element in the selected column is shared to specified application.
For the content to be shared, target webpage element can be shared to specified application, the i.e. interface by calling storage Obtain the chain of target webpage element such as text, picture, audio, animation, video etc. or the target webpage element in selected column Equal sharing informations, such as the chained address of picture, audio, animation, video are connect, then by target webpage element or target webpage member The sharing information of element is sent in specified application, which includes various types of applications, such as instant messaging application, society Hand over application, Video Applications etc..
In one example, user is interested in the article shown in webpage when browsing webpage, it is desirable to copy to In document, then impression window can be indicated by triggering, such as window is triggered by right-click, double-click left button mode, in the window Show that the column and its web page element of the webpage, these columns and its web page element are by carrying out structure to web page code in mouthful Parsing determination, the word segment in text then may be selected, and trigger copy control, so that browser can be automatically positioned just Word segment i.e. this article in text, the text then extracted in article copy in shear plate.And for the top in webpage Portion's navigation, advertisement etc. will not be replicated due to user and non-selected as selected column and its object element.
The embodiment of the present invention can be according to structure elucidation webpage, thus the net in locating web-pages in each column and each column Page element greatly promotes user information collection, letter to provide the user with the channel of the selection to corresponding contents, output operation Breath saves and the efficiency of the regular jobs such as information printing.And can be realized user in webpage side it is quick, it is convenient, accurately select It selects, thereby executing the operation such as duplication, printing, preservation.
It should be noted that for simple description, therefore, it is stated as a series of action groups for embodiment of the method It closes, but those skilled in the art should understand that, embodiment of that present invention are not limited by the describe sequence of actions, because according to According to the embodiment of the present invention, some steps may be performed in other sequences or simultaneously.Secondly, those skilled in the art also should Know, the embodiments described in the specification are all preferred embodiments, and the related movement not necessarily present invention is implemented Necessary to example.
On the basis of the above embodiments, the embodiment of the invention also provides a kind of page processors, are applied to terminal In the electronic equipments such as equipment.
Referring to Fig. 3, show a kind of structural block diagram of page processor embodiment of the invention, can specifically include as Lower module:
Analyzing and positioning module 302 positions each column pair in the Webpage for carrying out structural analysis to Webpage The web page element answered.
Element extraction module 304, for extracting target webpage element in selected column according to specified operation.
Output module 306 is edited, for exporting to target webpage element executive editor in the selected column.
To sum up, structural analysis can be carried out to Webpage, positions the corresponding web page element of each column in the Webpage, To obtain each web page element in webpage by structural analysis, then according to specified operation, target webpage in selected column is extracted Element, can convenient, accurately selection target web page element, then it is defeated to target webpage element executive editor in the selected column Out, the operation for reducing selection course improves the efficiency of target selection and the accuracy of target output.
Referring to Fig. 4, show a kind of structural block diagram of page processor embodiment of the invention, can specifically include as Lower module:
Analyzing and positioning module 302 positions each column pair in the Webpage for carrying out structural analysis to Webpage The web page element answered.
Display module 308, for showing each column and the corresponding net of the column by window in a browser Page element, to select the target webpage element for needing to edit output;Wherein, the window includes editor's output control, described Editor output control include it is following at least one: copy control, word depghi, save control, share control.
Element extraction module 304, for extracting target webpage element in selected column according to specified operation.
Output module 306 is edited, for exporting to target webpage element executive editor in the selected column.
Wherein, editor's output module 306, comprising: duplication submodule 3062, saves submodule at printing submodule 3064 Block 3066 with share submodule 3068, in which:
Submodule 3062 is replicated, for copying to target webpage element in the selected column in shear plate.
Submodule 3064 is printed, for printing target webpage element in the selected column.
Submodule 3066 is saved, for saving in the selected column target webpage element to specified address.
Share submodule 3068, for sharing target webpage element in the selected column to specified application.
The analyzing and positioning module 302, comprising: acquisition submodule 3022, analysis submodule 3024 and element position submodule Block 3026, in which:
Acquisition submodule 3022, for obtaining the corresponding web page code of Webpage.
Submodule 3024 is analyzed, for being based on the web page code, obtains the corresponding code block of each column.
Element positioning submodule 3026, for according to the corresponding webpage member of column each in the code block locating web-pages page Element.
The element positioning submodule 3026, for determining corresponding node according to the code block, wherein the node It include: father node and/or child node;According to the corresponding nodal information of each node, the corresponding column of each code block is determined;Described According to keyword locating web-pages element in column, and record corresponding location information.
The element extraction module 304, comprising: selection submodule 3042 and extracting sub-module 3044, in which:
Submodule 3042 is selected, for selecting according to the specified operation selection column, and in the selected column Target webpage element.
Extracting sub-module 3044 is extracted in the selected column for the location information according to the target webpage element Target webpage element.
Wherein, the column comprises at least one of the following: header, footer, theme, text, advertisement;The web page element packet Include following at least one: text, picture, audio, animation, video.
The printing submodule 3064 is also used to edit target webpage element in the selected column, the volume It collects and comprises at least one of the following operation: modification operation, insertion operation, delete operation.
The embodiment of the present invention can be according to structure elucidation webpage, thus the net in locating web-pages in each column and each column Page element greatly promotes user information collection, letter to provide the user with the channel of the selection to corresponding contents, output operation Breath saves and the efficiency of the regular jobs such as information printing.And can be realized user in webpage side it is quick, it is convenient, accurately select It selects, thereby executing the operation such as duplication, printing, preservation.
For device embodiment, since it is basically similar to the method embodiment, related so being described relatively simple Place illustrates referring to the part of embodiment of the method.
Fig. 5 is a kind of structural block diagram of electronic equipment 500 for Web Page Processing shown according to an exemplary embodiment. For example, electronic equipment 500 can be mobile phone, computer, digital broadcasting terminal, messaging device, game console put down Panel device, Medical Devices, body-building equipment, personal digital assistant etc.;It is also possible to server device, such as server.
Referring to Fig. 5, electronic equipment 500 may include following one or more components: processing component 502, memory 504, Power supply module 506, multimedia component 508, audio component 510, the interface 512 of input/output (I/O), sensor module 514, And communication component 516.
The integrated operation of the usual controlling electronic devices 500 of processing component 502, such as with display, call, data are logical Letter, camera operation and record operate associated operation.Processing element 502 may include one or more processors 520 to hold Row instruction, to perform all or part of the steps of the methods described above.In addition, processing component 502 may include one or more moulds Block, convenient for the interaction between processing component 502 and other assemblies.For example, processing component 502 may include multi-media module, with Facilitate the interaction between multimedia component 508 and processing component 502.
Memory 504 is configured as storing various types of data to support the operation in equipment 500.These data are shown Example includes the instruction of any application or method for operating on electronic equipment 500, contact data, telephone directory number According to, message, picture, video etc..Memory 504 can by any kind of volatibility or non-volatile memory device or they Combination realize, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM) is erasable Programmable read only memory (EPROM), programmable read only memory (PROM), read-only memory (ROM), magnetic memory, quick flashing Memory, disk or CD.
Electric power assembly 504 provides electric power for the various assemblies of electronic equipment 500.Electric power assembly 504 may include power supply pipe Reason system, one or more power supplys and other with for electronic equipment 500 generate, manage, and distribute the associated component of electric power.
Multimedia component 508 includes the screen of one output interface of offer between the electronic equipment 500 and user. In some embodiments, screen may include liquid crystal display (LCD) and touch panel (TP).If screen includes touch surface Plate, screen may be implemented as touch screen, to receive input signal from the user.Touch panel includes one or more touches Sensor is to sense the gesture on touch, slide, and touch panel.The touch sensor can not only sense touch or sliding The boundary of movement, but also detect duration and pressure associated with the touch or slide operation.In some embodiments, Multimedia component 508 includes a front camera and/or rear camera.When electronic equipment 500 is in operation mode, as clapped When taking the photograph mode or video mode, front camera and/or rear camera can receive external multi-medium data.It is each preposition Camera and rear camera can be a fixed optical lens system or have focusing and optical zoom capabilities.
Audio component 510 is configured as output and/or input audio signal.For example, audio component 510 includes a Mike Wind (MIC), when electronic equipment 500 is in operation mode, when such as call mode, recording mode, and voice recognition mode, microphone It is configured as receiving external audio signal.The received audio signal can be further stored in memory 504 or via logical Believe that component 516 is sent.In some embodiments, audio component 510 further includes a loudspeaker, is used for output audio signal.
I/O interface 512 provides interface between processing component 502 and peripheral interface module, and above-mentioned peripheral interface module can To be keyboard, click wheel, button etc..These buttons may include, but are not limited to: home button, volume button, start button and lock Determine button.
Sensor module 514 includes one or more sensors, for providing the state of various aspects for electronic equipment 500 Assessment.For example, sensor module 514 can detecte the state that opens/closes of equipment 500, the relative positioning of component, such as institute The display and keypad that component is electronic equipment 500 are stated, sensor module 514 can also detect electronic equipment 500 or electronics The position change of 500 1 components of equipment, the existence or non-existence that user contacts with electronic equipment 500,500 orientation of electronic equipment Or the temperature change of acceleration/deceleration and electronic equipment 500.Sensor module 514 may include proximity sensor, be configured to It detects the presence of nearby objects without any physical contact.Sensor module 514 can also include optical sensor, such as CMOS or ccd image sensor, for being used in imaging applications.In some embodiments, which can be with Including acceleration transducer, gyro sensor, Magnetic Sensor, pressure sensor or temperature sensor.
Communication component 516 is configured to facilitate the communication of wired or wireless way between electronic equipment 500 and other equipment. Electronic equipment 400 can access the wireless network based on communication standard, such as WiFi, 2G or 3G or their combination.Show at one In example property embodiment, communication component 514 receives broadcast singal or broadcast from external broadcasting management system via broadcast channel Relevant information.In one exemplary embodiment, the communication component 514 further includes near-field communication (NFC) module, short to promote Cheng Tongxin.For example, radio frequency identification (RFID) technology, Infrared Data Association (IrDA) technology, ultra wide band can be based in NFC module (UWB) technology, bluetooth (BT) technology and other technologies are realized.
In the exemplary embodiment, electronic equipment 500 can be by one or more application specific integrated circuit (ASIC), number Word signal processor (DSP), digital signal processing appts (DSPD), programmable logic device (PLD), field programmable gate array (FPGA), controller, microcontroller, microprocessor or other electronic components are realized, for executing the above method.
In the exemplary embodiment, a kind of non-transitorycomputer readable storage medium including instruction, example are additionally provided It such as include the memory 504 of instruction, above-metioned instruction can be executed by the processor 520 of electronic equipment 500 to complete the above method.Example Such as, the non-transitorycomputer readable storage medium can be ROM, random access memory (RAM), CD-ROM, tape, soft Disk and optical data storage devices etc..
A kind of non-transitorycomputer readable storage medium, when the instruction in the storage medium is by the processing of electronic equipment When device executes, so that electronic equipment is able to carry out a kind of web page processing method, which comprises carry out structure to Webpage Analysis positions the corresponding web page element of each column in the Webpage;According to specified operation, target network in selected column is extracted Page element;Target webpage element executive editor in the selected column is exported.
Optionally, described that target webpage element executive editor in the selected column is exported, it comprises at least one of the following: Target webpage element in the selected column is copied in shear plate;Print target webpage element in the selected column;It protects Target webpage element is deposited in the selected column to specified address;Target webpage element in the selected column is shared to specified Using.
Optionally, described that structural analysis is carried out to Webpage, position the corresponding webpage of each column in the Webpage Element, comprising: obtain the corresponding web page code of Webpage;Based on the web page code, the corresponding code block of each column is obtained; According to the corresponding web page element of column each in the code block locating web-pages page.
Optionally, described according to the corresponding web page element of column each in the code block locating web-pages page, comprising: foundation The code block determines corresponding node, wherein the node includes: father node and/or child node;It is corresponding according to each node Nodal information determines the corresponding column of each code block;According to keyword locating web-pages element in the column, and record corresponding Location information;The specified operation of the foundation, extracts target webpage element in selected column, comprising: according to the specified operation Select column, and the selection target web page element in the selected column;According to the location information of the target webpage element, Extract target webpage element in the selected column.
Optionally, the web page element comprises at least one of the following: text, picture, audio, animation, video.
Optionally, further includes: each column and the corresponding webpage of the column are shown by window in a browser Element, to select the target webpage element for needing to edit output;Wherein, the window includes editor's output control, the volume Volume output control include it is following at least one: copy control, word depghi, save control, share control.
Optionally, it prints in the selected column before target webpage element, further includes: to target in the selected column Web page element is edited, and the editor comprises at least one of the following operation: modification operation, insertion operation, delete operation.
Fig. 6 is a kind of electronic equipment 600 for Web Page Processing that the present invention is shown according to another exemplary embodiment Structural schematic diagram.The electronic equipment 600 can be server, which can generate bigger because of configuration or performance difference Difference, may include one or more central processing units (central processing units, CPU) 622 (for example, One or more processors) and memory 632, the storage of one or more storage application programs 642 or data 644 Medium 630 (such as one or more mass memory units).Wherein, memory 632 and storage medium 630 can be of short duration Storage or persistent storage.The program for being stored in storage medium 630 may include one or more modules (diagram does not mark), Each module may include to the series of instructions operation in server.Further, central processing unit 622 can be set to It is communicated with storage medium 630, executes the series of instructions operation in storage medium 630 on the server.
Server can also include one or more power supplys 626, one or more wired or wireless networks connect Mouthfuls 650, one or more input/output interfaces 658, one or more keyboards 656, and/or, one or one with Upper operating system 641, such as Windows ServerTM, Mac OS XTM, UnixTM, LinuxTM, FreeBSDTM etc..
In the exemplary embodiment, electronic equipment is configured to execute one by one or more than one central processing unit Or more than one program includes the instruction for performing the following operation: carrying out structural analysis to Webpage, positions the net The corresponding web page element of each column in the page page;According to specified operation, target webpage element in selected column is extracted;To the choosing Determine target webpage element executive editor in column to export.
Optionally, described that target webpage element executive editor in the selected column is exported, it comprises at least one of the following: Target webpage element in the selected column is copied in shear plate;Print target webpage element in the selected column;It protects Target webpage element is deposited in the selected column to specified address;Target webpage element in the selected column is shared to specified Using.
Optionally, described that structural analysis is carried out to Webpage, position the corresponding webpage of each column in the Webpage Element, comprising: obtain the corresponding web page code of Webpage;Based on the web page code, the corresponding code block of each column is obtained; According to the corresponding web page element of column each in the code block locating web-pages page.
Optionally, described according to the corresponding web page element of column each in the code block locating web-pages page, comprising: foundation The code block determines corresponding node, wherein the node includes: father node and/or child node;It is corresponding according to each node Nodal information determines the corresponding column of each code block;According to keyword locating web-pages element in the column, and record corresponding Location information;The specified operation of the foundation, extracts target webpage element in selected column, comprising: according to the specified operation Select column, and the selection target web page element in the selected column;According to the location information of the target webpage element, Extract target webpage element in the selected column.
Optionally, the web page element comprises at least one of the following: text, picture, audio, animation, video.
Optionally, also comprising the instruction for performing the following operation: each column is shown by window in a browser, And the corresponding web page element of the column, to select the target webpage element for needing to edit output;Wherein, the window packet Include editor's output control, the editor export control include it is following at least one: copy control, word depghi save control, divide Enjoy control.
Optionally, it prints in the selected column before target webpage element, further includes: to target in the selected column Web page element is edited, and the editor comprises at least one of the following operation: modification operation, insertion operation, delete operation.
All the embodiments in this specification are described in a progressive manner, the highlights of each of the examples are with The difference of other embodiments, the same or similar parts between the embodiments can be referred to each other.
The embodiment of the present invention be referring to according to the method for the embodiment of the present invention, terminal device (system) and computer program The flowchart and/or the block diagram of product describes.It should be understood that flowchart and/or the block diagram can be realized by computer program instructions In each flow and/or block and flowchart and/or the block diagram in process and/or box combination.It can provide these Computer program instructions are set to general purpose computer, special purpose computer, Embedded Processor or other programmable data processing terminals Standby processor is to generate a machine, so that being held by the processor of computer or other programmable data processing terminal devices Capable instruction generates for realizing in one or more flows of the flowchart and/or one or more blocks of the block diagram The device of specified function.
These computer program instructions, which may also be stored in, is able to guide computer or other programmable data processing terminal devices In computer-readable memory operate in a specific manner, so that instruction stored in the computer readable memory generates packet The manufacture of command device is included, which realizes in one side of one or more flows of the flowchart and/or block diagram The function of being specified in frame or multiple boxes.
These computer program instructions can also be loaded into computer or other programmable data processing terminal devices, so that Series of operation steps are executed on computer or other programmable terminal equipments to generate computer implemented processing, thus The instruction executed on computer or other programmable terminal equipments is provided for realizing in one or more flows of the flowchart And/or in one or more blocks of the block diagram specify function the step of.
Although the preferred embodiment of the embodiment of the present invention has been described, once a person skilled in the art knows bases This creative concept, then additional changes and modifications can be made to these embodiments.So the following claims are intended to be interpreted as Including preferred embodiment and fall into all change and modification of range of embodiment of the invention.
Finally, it is to be noted that, herein, relational terms such as first and second and the like be used merely to by One entity or operation are distinguished with another entity or operation, without necessarily requiring or implying these entities or operation Between there are any actual relationship or orders.Moreover, the terms "include", "comprise" or its any other variant meaning Covering non-exclusive inclusion, so that process, method, article or terminal device including a series of elements not only wrap Those elements are included, but also including other elements that are not explicitly listed, or further includes for this process, method, article Or the element that terminal device is intrinsic.In the absence of more restrictions, being wanted by what sentence "including a ..." limited Element, it is not excluded that there is also other identical elements in process, method, article or the terminal device for including the element.
Above to a kind of web page processing method provided by the present invention, a kind of page processor, a kind of storage medium and A kind of electronic equipment is described in detail, and specific case used herein carries out the principle of the present invention and embodiment It illustrates, the above description of the embodiment is only used to help understand the method for the present invention and its core ideas;Meanwhile for ability The those skilled in the art in domain, according to the thought of the present invention, there will be changes in the specific implementation manner and application range, comprehensive Upper described, the contents of this specification are not to be construed as limiting the invention.

Claims (10)

1. a kind of web page processing method characterized by comprising
Structural analysis is carried out to Webpage, positions the corresponding web page element of each column in the Webpage;
According to specified operation, target webpage element in selected column is extracted;
Target webpage element executive editor in the selected column is exported.
2. the method according to claim 1, wherein described execute target webpage element in the selected column Editor's output, comprises at least one of the following:
Target webpage element in the selected column is copied in shear plate;
Print target webpage element in the selected column;
Target webpage element is saved in the selected column to specified address;
Target webpage element in the selected column is shared to specified application.
3. the method according to claim 1, wherein described carry out structural analysis to Webpage, described in positioning The corresponding web page element of each column in Webpage, comprising:
Obtain the corresponding web page code of Webpage;
Based on the web page code, the corresponding code block of each column is obtained;
According to the corresponding web page element of column each in the code block locating web-pages page.
4. according to the method described in claim 3, it is characterized in that, described according to each version in the code block locating web-pages page The corresponding web page element of block, comprising:
Corresponding node is determined according to the code block, wherein the node includes: father node and/or child node;
According to the corresponding nodal information of each node, the corresponding column of each code block is determined;
According to keyword locating web-pages element in the column, and record corresponding location information;
The specified operation of the foundation, extracts target webpage element in selected column, comprising:
Column, and the selection target web page element in the selected column are selected according to the specified operation;
According to the location information of the target webpage element, target webpage element in the selected column is extracted.
5. method according to claim 1 to 4, which is characterized in that the web page element comprises at least one of the following: Text, picture, audio, animation, video.
6. according to the method described in claim 5, it is characterized by further comprising:
Each column and the corresponding web page element of the column are shown by window in a browser, to select to need Edit the target webpage element of output;Wherein, the window includes editor's output control, and it includes following that the editor, which exports control, At least one: copy control, word depghi save control, share control.
7. according to the method described in claim 2, it is characterized in that, in the printing selected column before target webpage element, Further include:
Target webpage element in the selected column is edited, the editor comprises at least one of the following operation: modification behaviour Work, insertion operation, delete operation.
8. a kind of page processor characterized by comprising
Analyzing and positioning module positions the corresponding net of each column in the Webpage for carrying out structural analysis to Webpage Page element;
Element extraction module, for extracting target webpage element in selected column according to specified operation;
Output module is edited, for exporting to target webpage element executive editor in the selected column.
9. a kind of readable storage medium storing program for executing, which is characterized in that when the instruction in the storage medium is held by the processor of electronic equipment When row, so that electronic equipment is able to carry out the web page processing method as described in one or more in claim to a method 1-7.
10. a kind of electronic equipment, which is characterized in that include memory and one or more than one program, wherein one A perhaps more than one program is stored in memory and is configured to execute described one by one or more than one processor A or more than one program includes the instruction for performing the following operation:
Structural analysis is carried out to Webpage, positions the corresponding web page element of each column in the Webpage;
According to specified operation, target webpage element in selected column is extracted;
Target webpage element executive editor in the selected column is exported.
CN201711100114.3A 2017-11-09 2017-11-09 A kind of web page processing method, device, storage medium and electronic equipment Pending CN110020361A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711100114.3A CN110020361A (en) 2017-11-09 2017-11-09 A kind of web page processing method, device, storage medium and electronic equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711100114.3A CN110020361A (en) 2017-11-09 2017-11-09 A kind of web page processing method, device, storage medium and electronic equipment

Publications (1)

Publication Number Publication Date
CN110020361A true CN110020361A (en) 2019-07-16

Family

ID=67186768

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711100114.3A Pending CN110020361A (en) 2017-11-09 2017-11-09 A kind of web page processing method, device, storage medium and electronic equipment

Country Status (1)

Country Link
CN (1) CN110020361A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110795050A (en) * 2019-10-29 2020-02-14 北京推想科技有限公司 Webpage printing method and device
CN112015410A (en) * 2020-07-16 2020-12-01 深圳市大富网络技术有限公司 Webpage editing method, device and system and computer storage medium
CN113407168A (en) * 2021-06-07 2021-09-17 远光软件股份有限公司 Editing method and device of page elements, storage medium and terminal
CN113835603A (en) * 2021-08-31 2021-12-24 五八有限公司 Page element selection method and device, electronic equipment and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104965901A (en) * 2015-06-30 2015-10-07 北京奇虎科技有限公司 Method and apparatus for grabbing content of target page
CN104965881A (en) * 2015-06-12 2015-10-07 北京奇虎科技有限公司 Method and device for extracting selected area from page
CN106446139A (en) * 2016-09-20 2017-02-22 微梦创科网络科技(中国)有限公司 Webpage content extracting method and device
CN106489129A (en) * 2016-09-29 2017-03-08 北京小米移动软件有限公司 The method and device that a kind of content is shared
CN106776634A (en) * 2015-11-23 2017-05-31 北京搜狗科技发展有限公司 A kind of method for network access, device and terminal device
CN106844635A (en) * 2017-01-19 2017-06-13 腾讯科技(深圳)有限公司 The edit methods and device of the element in webpage

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104965881A (en) * 2015-06-12 2015-10-07 北京奇虎科技有限公司 Method and device for extracting selected area from page
CN104965901A (en) * 2015-06-30 2015-10-07 北京奇虎科技有限公司 Method and apparatus for grabbing content of target page
CN106776634A (en) * 2015-11-23 2017-05-31 北京搜狗科技发展有限公司 A kind of method for network access, device and terminal device
CN106446139A (en) * 2016-09-20 2017-02-22 微梦创科网络科技(中国)有限公司 Webpage content extracting method and device
CN106489129A (en) * 2016-09-29 2017-03-08 北京小米移动软件有限公司 The method and device that a kind of content is shared
CN106844635A (en) * 2017-01-19 2017-06-13 腾讯科技(深圳)有限公司 The edit methods and device of the element in webpage

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110795050A (en) * 2019-10-29 2020-02-14 北京推想科技有限公司 Webpage printing method and device
CN112015410A (en) * 2020-07-16 2020-12-01 深圳市大富网络技术有限公司 Webpage editing method, device and system and computer storage medium
CN113407168A (en) * 2021-06-07 2021-09-17 远光软件股份有限公司 Editing method and device of page elements, storage medium and terminal
CN113835603A (en) * 2021-08-31 2021-12-24 五八有限公司 Page element selection method and device, electronic equipment and storage medium

Similar Documents

Publication Publication Date Title
US10880098B2 (en) Collaborative document editing
JP7414842B2 (en) How to add comments and electronic devices
JP6051338B2 (en) Page rollback control method, page rollback control device, terminal, program, and recording medium
US9230356B2 (en) Document collaboration effects
CN107329743A (en) Methods of exhibiting, device and the storage medium of five application page
US9542366B2 (en) Smart text in document chat
US20130160142A1 (en) Track Changes Permissions
CN110020361A (en) A kind of web page processing method, device, storage medium and electronic equipment
CN106569800A (en) Front end interface generation method and apparatus
RU2643437C2 (en) Method and apparatus for selecting information
CN104636164B (en) Start page generation method and device
CN105786944A (en) Method and device for automatically turning pages of browser
CN110704053B (en) Style information processing method and device
RU2734780C1 (en) Method of presenting information, device and storage medium for information therefor
CN104951445B (en) Webpage processing method and device
CN105095163A (en) Web page editing method and device
CN107390974B (en) Code searching method, device, terminal and storage medium for webpage debugging
CN108874758A (en) Notes treating method and apparatus, the device for taking down notes processing
KR101519856B1 (en) apparatus and method for common of contents, communication service system
CN112286617A (en) Operation guidance method and device and electronic equipment
CN107168969A (en) A kind of page elements control method, device and electronic equipment
CN103927334B (en) Webpage acquisition methods and device
CN114020197B (en) Cross-application message processing method, electronic device and readable storage medium
CN106776634A (en) A kind of method for network access, device and terminal device
CN104239407B (en) A kind of method and apparatus for showing webpage

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20190716