WO2016045160A1 - Webpage data processing method and apparatus - Google Patents

Webpage data processing method and apparatus

Info

Publication number
WO2016045160A1
Authority
WO
WIPO (PCT)
Prior art keywords
webpage
content
display terminal
type
webpage content
Prior art date
Application number
PCT/CN2014/089244
Other languages
English (en)
French (fr)
Inventor
王志国
谭霞
梁权清
Original Assignee
深圳创维-Rgb电子有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳创维-Rgb电子有限公司
Priority to US15/125,965 (US20170083487A1)
Priority to AU2014406868A (AU2014406868A1)
Publication of WO2016045160A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/12 Use of codes for handling textual entities
    • G06F40/14 Tree-structured documents
    • G06F40/143 Markup, e.g. Standard Generalized Markup Language [SGML] or Document Type Definition [DTD]
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/955 Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/958 Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/14 Digital output to display device; Cooperation and interconnection of the display device with other functional units
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/14 Digital output to display device; Cooperation and interconnection of the display device with other functional units
    • G06F3/1454 Digital output to display device; Cooperation and interconnection of the display device with other functional units involving copying of the display data of a local workstation or window to a remote workstation or window so that an actual copy of the data is displayed simultaneously on two or more displays, e.g. teledisplay
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/103 Formatting, i.e. changing of presentation of documents
    • G06F40/117 Tagging; Marking up; Designating a block; Setting of attributes
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/12 Use of codes for handling textual entities
    • G06F40/134 Hyperlinking
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/197 Version control
    • G PHYSICS
    • G09 EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09G ARRANGEMENTS OR CIRCUITS FOR CONTROL OF INDICATING DEVICES USING STATIC MEANS TO PRESENT VARIABLE INFORMATION
    • G09G2370/00 Aspects of data communication
    • G09G2370/02 Networking aspects
    • G09G2370/027 Arrangements and methods specific for the display of internet documents

Definitions

  • the present invention relates to the field of communications, and in particular, to a webpage data processing method and apparatus.
  • in existing schemes, the webpage being browsed on a mobile phone or PDA is transmitted to a television for display. However, these schemes transmit the text of the webpage together with its multimedia content, so the television shows a picture that mixes text and multimedia content, which easily leads to an uncoordinated television picture.
  • the main object of the present invention is to solve the technical problem of uncoordinated pictures when browsing a web page in which multimedia content such as pictures, videos or audio is embedded using a display terminal.
  • a webpage data processing method includes the following steps: a first display terminal sends a webpage link request to a server to obtain the webpage content included in the webpage link; the first display terminal classifies the obtained webpage content by type to form webpage content of a first preset type and webpage content of a second preset type, where the first preset type includes a picture type, a video type, and an audio type, and the second preset type includes a text type; the first display terminal outputs the webpage content of the first preset type to a second display terminal for display; and the first display terminal displays the webpage content of the second preset type.
  • the step in which the first display terminal sends a webpage link request to the server to obtain the webpage content included in the webpage link specifically includes: the first display terminal sends the webpage link request to the server; the first display terminal receives the HTML script file fed back by the server according to the webpage link request; and the first display terminal acquires all webpage content in the HTML script file.
  • the step in which the first display terminal classifies the obtained webpage content by type to form the webpage content of the first preset type and the webpage content of the second preset type specifically includes: the first display terminal sequentially reads all webpage content and the corresponding labels in the HTML script file, where each label is a webpage content type identification label; the first display terminal determines, according to the label corresponding to the currently read webpage content, whether the currently read webpage content is webpage content of the first preset type; if yes, the first display terminal stores the URL link address in the currently read webpage content into a preset list and deletes the currently read webpage content and the corresponding label from the HTML script file; and after all webpage content has been read, the first display terminal reorganizes the HTML script file to obtain a new HTML script file.
  • the step in which the first display terminal outputs the webpage content of the first preset type to the second display terminal for display is specifically that the first display terminal outputs the preset list to the second display terminal for display; the step in which the first display terminal displays the webpage content of the second preset type is specifically that the first display terminal invokes a browser to load the new HTML script file.
  • the label includes IMG, AUDIO, VIDEO, EMB, and OBJECT; when the label corresponding to the currently read webpage content is EMB or OBJECT, the first display terminal determines, according to the embedded object format in the webpage content, whether the currently read webpage content is webpage content of the first preset type.
  • the step in which the first display terminal outputs the preset list to the second display terminal for display is specifically that the first display terminal outputs the preset list to the second display terminal so that the second display terminal loads the URL link addresses in the preset list.
  • the second display terminal decodes and plays the webpage content of the first preset type obtained by loading the URL link addresses in the preset list.
  • the present invention further provides a webpage data processing apparatus, including a webpage obtaining module, a webpage processing module, an output module, and a display module, wherein: the webpage obtaining module is configured to send a webpage link request to a server to obtain the webpage content included in the webpage link; the webpage processing module is configured to classify the obtained webpage content by type to form webpage content of a first preset type and webpage content of a second preset type, where the first preset type includes a picture type, a video type, and an audio type, and the second preset type includes a text type; the output module is configured to output the webpage content of the first preset type to a second display terminal for display; and the display module is configured to display the webpage content of the second preset type.
  • the webpage obtaining module includes a sending unit, a receiving unit, and an obtaining unit, wherein: the sending unit is configured to send the webpage link request to the server; the receiving unit is configured to receive the HTML script file fed back by the server according to the webpage link request; and the obtaining unit is configured to obtain all webpage content in the HTML script file.
  • the webpage processing module includes a reading unit, a first determining unit, a URL address extraction unit, and a script file reorganizing unit, wherein: the reading unit is configured to sequentially read all webpage content and the corresponding labels in the HTML script file, where each label is a webpage content type identification label; the first determining unit is configured to determine, according to the label corresponding to the currently read webpage content, whether the currently read webpage content is webpage content of the first preset type; the URL address extraction unit is configured to, when the currently read webpage content is webpage content of the first preset type, store the URL link address in the currently read webpage content into the preset list and delete the currently read webpage content and the corresponding label from the HTML script file; and the script file reorganizing unit is configured to reorganize the HTML script file to obtain a new HTML script file after all webpage content has been read.
  • the label includes IMG, AUDIO, VIDEO, EMB, and OBJECT; the webpage processing module further includes a second determining unit, configured to determine, when the label corresponding to the currently read webpage content is EMB or OBJECT, whether the currently read webpage content is webpage content of the first preset type according to the embedded object format in the webpage content.
  • the display module is specifically configured to invoke a browser to load the new HTML script file; the output module is specifically configured to output the preset list to the second display terminal so that the second display terminal loads the URL link addresses in the preset list.
  • the second display terminal decodes and plays the webpage content of the first preset type obtained by loading the URL link addresses in the preset list.
  • the webpage data processing method provided by the present invention divides the content of the webpage into webpage content of a first preset type, which includes content of the picture type, the video type, and the audio type, and webpage content of a second preset type, which includes content of the text type. The webpage content of the second preset type is browsed on the first display terminal, and the webpage content of the first preset type is output to the second display terminal for display. That is, only text-type content is browsed on the first display terminal, and only picture-type, video-type, and audio-type content is displayed on the second display terminal, which avoids the uncoordinated picture that occurs when the first display terminal or the second display terminal is used to browse a webpage with embedded multimedia content such as pictures, video, or audio.
  • when the first display terminal is a handheld smart terminal, the method also avoids the lack of visual impact that occurs when video, pictures, and other multimedia content are viewed on the small screen of the handheld smart terminal, which enhances the user's experience.
  • FIG. 1 is a schematic flowchart of a first embodiment of a webpage data processing method according to the present invention
  • FIG. 2 is a schematic flowchart of a second embodiment of a webpage data processing method according to the present invention.
  • FIG. 3 is a schematic flowchart of a third embodiment of a webpage data processing method according to the present invention.
  • FIG. 4 is a schematic diagram of functional modules of an embodiment of a webpage data processing apparatus according to the present invention.
  • FIG. 5 is a schematic diagram of a refinement function module of the webpage obtaining module in FIG. 4;
  • FIG. 6 is a schematic diagram of a refinement function module of the webpage processing module of FIG. 4.
  • the invention provides a webpage data processing method.
  • FIG. 1 is a schematic flowchart of a first embodiment of a webpage data processing method according to the present invention.
  • the webpage data processing method includes:
  • Step S10 The first display terminal sends a webpage link request to the server to obtain webpage content included in the webpage link.
  • the first display terminal first receives the webpage link request input by the user, then sends a request to a server on the Internet according to the webpage link input by the user, obtains the webpage content from the Internet, and temporarily stores the obtained webpage content in memory or local storage.
  • the first display terminal generally refers to a handheld smart terminal on which a web browser application is installed; the handheld smart terminal can be, but is not limited to, a mobile phone, a PDA, a tablet computer, or a handheld game console.
  • step S20 the first display terminal performs classification processing according to the type of the obtained webpage content, and forms webpage content of the first preset type and webpage content of the second preset type.
  • the first display terminal first reads the types of the acquired webpage content from memory or local storage; the types of webpage content include a picture type, a video type, an audio type, a text type, and the like. It then classifies the webpage content by type: the multimedia content of the picture type, the video type, or the audio type is extracted to form the webpage content of the first preset type, and the remaining text-type content is reorganized to obtain the webpage content of the second preset type. The entire webpage content is thus divided into webpage content of a first preset type, which includes content of the picture type, the video type, and the audio type, and webpage content of a second preset type, which includes content of the text type.
  • Step S30 the first display terminal outputs the webpage content of the first preset type to the second display terminal for display;
  • Step S40 The first display terminal displays the webpage content of the second preset type.
  • the second display terminal generally refers to a display terminal with a larger screen, such as a television set. The first display terminal can communicate with the second display terminal in real time; the specific communication manner can be, but is not limited to, wired LAN, WiFi LAN, wired Internet, WiFi Internet, WiFi hotspot, 2G/3G/4G wireless communication network, Bluetooth, NFC (Near Field Communication), infrared, etc.
  • after the first display terminal divides the entire webpage content into the webpage content of the first preset type and the webpage content of the second preset type, it outputs the webpage content of the first preset type to the second display terminal, and the user can use the first display terminal to display, according to the webpage content of the second preset type, a webpage without multimedia content such as pictures, video, and audio.
  • in this way, the text-type content of the webpage is displayed on the first display terminal (such as a mobile phone), while the second display terminal (such as a television) displays, according to the webpage content of the first preset type, multimedia content such as pictures, video, and/or audio without any text mixed in, so that the screen of the second display terminal stays coordinated when playing the multimedia content and does not look messy.
  • the webpage data processing method provided in this embodiment divides the content of the webpage into webpage content of a first preset type, which includes content of the picture type, the video type, and the audio type, and webpage content of a second preset type, which includes content of the text type. The webpage content of the second preset type is browsed on the first display terminal, and the webpage content of the first preset type is output to the second display terminal for display. That is, only text-type content is browsed on the first display terminal, and only picture-type, video-type, and audio-type content is displayed on the second display terminal, which avoids the uncoordinated screen that occurs when the first display terminal or the second display terminal is used to browse a webpage with embedded multimedia content such as pictures, video, or audio.
  • when the first display terminal is a handheld smart terminal, the method also avoids the lack of visual impact that occurs when video, pictures, and other multimedia content are viewed on the smaller screen of the handheld smart terminal, which enhances the user's experience.
  • FIG. 2 is a schematic flowchart diagram of a second embodiment of a webpage data processing method according to the present invention.
  • step S10 specifically includes:
  • Step S101 the first display terminal sends a webpage link request to the server
  • Step S102 the first display terminal receives an HTML script file that is fed back by the server according to the webpage link request;
  • Step S103 The first display terminal acquires all webpage content in the HTML script file.
  • before the user browses the webpage using the first display terminal, the first display terminal first receives a webpage link request input by the user, such as a webpage address or a hyperlink between webpages; then, the first display terminal sends a request to a server on the Internet according to the webpage link input by the user, and the server feeds back the corresponding HTML script file to the first display terminal through the HTTP protocol; finally, the first display terminal acquires the corresponding webpage content from the fed back HTML script file.
  • the HTML script file is composed of a plurality of tags and the content corresponding to each tag.
  • the content that the server feeds back according to the webpage link request may also be, but is not limited to, a webpage file in the form of PHP (Hypertext Preprocessor), JSP (Java Server Pages), ASP (Active Server Pages), or CGI (Common Gateway Interface).
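Steps S101 to S103 can be sketched as follows. This is a minimal illustration, not the patent's implementation: the function names `http_get` and `obtain_webpage_content` are hypothetical, and the `fetch` parameter is an assumption added so that the network call can be swapped out. Only the Python standard library is used.

```python
from typing import Callable
from urllib.request import urlopen


def http_get(url: str, timeout: float = 10.0) -> str:
    """Request a URL over HTTP and return the response body as text."""
    with urlopen(url, timeout=timeout) as response:
        # Fall back to UTF-8 when the server does not declare a charset.
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


def obtain_webpage_content(link: str,
                           fetch: Callable[[str], str] = http_get) -> str:
    """Hypothetical counterpart of steps S101-S103 on the first terminal."""
    # S101: send the webpage link request; S102: receive the HTML file.
    html_text = fetch(link)
    # S103: the whole webpage content is now held in memory for later steps.
    return html_text
```

Passing a stub as `fetch` makes the step easy to exercise without a network connection.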
  • FIG. 3 is a schematic flowchart diagram of a third embodiment of a webpage data processing method according to the present invention.
  • step S20 specifically includes:
  • step S201 the first display terminal sequentially reads all webpage contents and corresponding tags in the HTML script file.
  • when the first display terminal classifies the obtained webpage content by type, it first reads all webpage content and the corresponding labels in the HTML script file fed back by the Internet server. Each label is a webpage content type identification label, that is, different labels represent different types of webpage content.
  • step S202 the first display terminal determines whether the currently read webpage content is the webpage content of the first preset type according to the label corresponding to the currently read webpage content; if yes, step S203 is performed.
  • the first display terminal determines, according to the name of the label corresponding to the currently read webpage content, whether the type of the currently read webpage content is a picture type, a video type, or an audio type.
  • in this embodiment, the tags in the HTML script file include IMG, AUDIO, VIDEO, EMB, and OBJECT: the tag indicating picture-type content is IMG, the tag indicating audio-type content is AUDIO, and the tag indicating video-type content is VIDEO. It should be noted that the tag representing video-type content differs between versions of the HTML script file, and the content of an EMB or OBJECT tag may also be a video.
  • for the EMB or OBJECT tag, whether the corresponding content is of the video type can be determined from the format of the embedded object in the content: if the format of the embedded object is a video format such as AVI, SWF, FLV, or 3GP, the content type of the EMB or OBJECT tag is determined to be the video type.
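The tag check of step S202 can be sketched as below. Two points are assumptions rather than statements from the text: the embedded object's format is read from the file extension of its URL (the text does not say how the format is determined), and the standard HTML spelling `embed` is accepted alongside the `EMB` name used above. The function names are hypothetical.

```python
import os
from urllib.parse import urlparse

# Video formats named in the text for EMB/OBJECT content.
VIDEO_FORMATS = {"avi", "swf", "flv", "3gp"}
# Tags whose content is always picture, audio, or video.
MEDIA_TAGS = {"img", "audio", "video"}
# Tags that need the extra format check ("embed" is the standard HTML name).
EMBED_TAGS = {"emb", "embed", "object"}


def embedded_format(url: str) -> str:
    """Return the lowercase file extension of a URL path, without the dot."""
    path = urlparse(url).path
    return os.path.splitext(path)[1].lstrip(".").lower()


def is_first_preset_type(tag: str, source_url: str = "") -> bool:
    """Decide whether content under `tag` is multimedia (first preset type)."""
    tag = tag.lower()
    if tag in MEDIA_TAGS:
        return True
    if tag in EMBED_TAGS:
        # Assumption: the format is inferred from the URL's extension.
        return embedded_format(source_url) in VIDEO_FORMATS
    return False
```

A real page may serve video from URLs without a telling extension, in which case the MIME type declared in the tag or returned by the server would be a more reliable signal.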
  • Step S203 The first display terminal stores the URL link address in the currently read webpage content into the preset list, and deletes the currently read webpage content and the corresponding label in the HTML script.
  • when the currently read webpage content is webpage content of the first preset type, the URL link address in the currently read webpage content is extracted and stored in the preset list, and the currently read webpage content and the corresponding label are deleted from the HTML script file. That is, the currently read webpage content is removed from the HTML script file, so that the remaining HTML script file no longer contains the webpage content of the first preset type or the corresponding label.
  • Step S204 after all the webpage content reading is completed, the first display terminal reorganizes the HTML script file to obtain a new HTML script file.
  • the first display terminal sequentially reads all webpage content and corresponding tags in the HTML script file until all webpage content has been read, and then re-formats the HTML script file that remains after the webpage content of the first preset type and the corresponding tags have been deleted, obtaining a new HTML script file that contains only text-type content, that is, only webpage content of the second preset type.
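Steps S201 to S204 can be sketched with the standard library's `html.parser`. The sketch rests on several assumptions that are not in the text: URLs are taken from `src` and `data` attributes, every EMB/OBJECT element is treated as multimedia (the format check is omitted for brevity), and comments and the doctype are dropped from the rebuilt file. `MediaSplitter` and `split_webpage` are hypothetical names.

```python
from html.parser import HTMLParser

# "emb" is the name used in the text; "embed" is the standard HTML element.
MEDIA_TAGS = {"img", "audio", "video", "emb", "embed", "object"}
VOID_MEDIA_TAGS = {"img", "emb", "embed"}  # written without a closing tag
URL_ATTRS = {"src", "data"}                # attributes assumed to hold the URL


class MediaSplitter(HTMLParser):
    """Reads tags in order, moves media URLs to a list, keeps the rest."""

    def __init__(self):
        super().__init__(convert_charrefs=False)
        self.preset_list = []   # URL link addresses of multimedia content
        self.kept = []          # pieces of the new, text-only HTML file
        self._skipping = None   # name of the media element being removed
        self._depth = 0         # nesting depth of that element

    def _collect_urls(self, attrs):
        for name, value in attrs:
            if name in URL_ATTRS and value:
                self.preset_list.append(value)

    def handle_starttag(self, tag, attrs):
        if self._skipping is not None:
            # Inside a removed element, e.g. <source src=...> within <video>.
            self._collect_urls(attrs)
            if tag == self._skipping:
                self._depth += 1
        elif tag in MEDIA_TAGS:
            self._collect_urls(attrs)
            if tag not in VOID_MEDIA_TAGS:
                self._skipping, self._depth = tag, 1
        else:
            self.kept.append(self.get_starttag_text())

    def handle_startendtag(self, tag, attrs):
        if self._skipping is not None or tag in MEDIA_TAGS:
            self._collect_urls(attrs)
        else:
            self.kept.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        if self._skipping is not None:
            if tag == self._skipping:
                self._depth -= 1
                if self._depth == 0:
                    self._skipping = None
        elif tag not in MEDIA_TAGS:
            self.kept.append(f"</{tag}>")

    def handle_data(self, data):
        if self._skipping is None:
            self.kept.append(data)

    def handle_entityref(self, name):
        if self._skipping is None:
            self.kept.append(f"&{name};")

    def handle_charref(self, name):
        if self._skipping is None:
            self.kept.append(f"&#{name};")


def split_webpage(html_text):
    """Return (preset_list, new_html) for one HTML document."""
    splitter = MediaSplitter()
    splitter.feed(html_text)
    splitter.close()
    return splitter.preset_list, "".join(splitter.kept)
```

The returned list corresponds to the preset list sent to the second display terminal, and the returned string corresponds to the new HTML script file that the first display terminal loads in its browser.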
  • step S30 is specifically that the first display terminal outputs the preset list to the second display terminal for display.
  • the second display terminal is able to access the Internet. The first display terminal outputs the preset list, which contains the URL link addresses of the webpage content of the first preset type, to the second display terminal (such as a television); the second display terminal then loads the corresponding URL link addresses according to the preset list and displays the corresponding webpage content of the first preset type. In this embodiment, the second display terminal can directly decode and play the picture-type, video-type, or audio-type content behind each URL link address.
  • multimedia content of the picture type, the video type, or the audio type has a large amount of data after decoding. If the first display terminal decoded the multimedia content and then sent it to the second display terminal for playback, the transfer could easily congest the network and delay the picture on the second display terminal.
  • in this embodiment, the second display terminal loads the corresponding URL link addresses according to the received preset list and obtains the corresponding multimedia content directly for decoding and playing.
  • in addition, the second display terminal (such as a television) has a larger, landscape-oriented screen, so video-type content can be better adapted to and played on the second display terminal, giving the user a better viewing experience.
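The hand-off of the preset list can be sketched as follows. The text does not specify a wire format, so the JSON payload, the `preset_list` key, and the function names here are all assumptions made for illustration; `load_and_play` stands in for whatever loader and decoder the second display terminal provides.

```python
import json
from typing import Callable, Iterable, List


def encode_preset_list(urls: Iterable[str]) -> bytes:
    """First terminal: serialize only the URL link addresses, not the media."""
    return json.dumps({"preset_list": list(urls)}).encode("utf-8")


def decode_preset_list(payload: bytes) -> List[str]:
    """Second terminal: recover the URL link addresses from the payload."""
    return json.loads(payload.decode("utf-8"))["preset_list"]


def play_all(payload: bytes, load_and_play: Callable[[str], None]) -> None:
    """Second terminal: load each URL itself, then decode and play it."""
    for url in decode_preset_list(payload):
        load_and_play(url)
```

Because only short URL strings cross the link between the two terminals, the payload stays small regardless of how large the decoded pictures or videos are.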
  • step S40 is specifically that the first display terminal invokes the browser to load the new HTML script file. Since the new HTML script file only includes the webpage content of the second preset type, in this embodiment the first display terminal displays only text-type webpage content when the browser loads the new HTML script file. In this way, the text-type content of the webpage is displayed cleanly on the first display terminal (such as a mobile phone), and when the first display terminal is a handheld smart terminal, this avoids a small screen on which text-type content is mixed with multimedia content such as pictures, video, or audio, so the page does not look messy.
  • in this embodiment, the URL link addresses of the webpage content of the first preset type are extracted to form the preset list, which is sent to the second display terminal. On the one hand, this saves the traffic exchanged between the first display terminal and the second display terminal. On the other hand, the first display terminal does not need to decode multimedia webpage content of the picture type, the video type, or the audio type before sending it to the second display terminal, which avoids the first display terminal (such as a mobile phone) consuming a large amount of battery power through long decoding operations.
  • the invention further provides a webpage data processing apparatus.
  • FIG. 4 is a schematic diagram of functional modules of an embodiment of a webpage data processing apparatus according to the present invention.
  • the webpage data processing apparatus includes a webpage obtaining module 10, a webpage processing module 20, an output module 30, and a display module 40, wherein:
  • the webpage obtaining module 10 is configured to send a webpage link request to the server to obtain the webpage content included in the webpage link;
  • the webpage processing module 20 is configured to perform classification processing according to the type of the acquired webpage content, to form a webpage content of a first preset type including a picture, a video or an audio, and a webpage content of a second preset type including the text;
  • the output module 30 is configured to output the webpage content of the first preset type to the second display terminal for display;
  • the display module 40 is configured to display web content of the second preset type.
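The cooperation of the four modules can be sketched as a small composition. This is only an illustration under assumptions: the class name, the `browse` method, and the choice to model each module as a callable are hypothetical, and the reference numerals 10 to 40 appear in comments only to tie the fields back to the modules listed above.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class WebpageDataProcessingApparatus:
    obtain: Callable[[str], str]                     # webpage obtaining module 10
    process: Callable[[str], Tuple[List[str], str]]  # webpage processing module 20
    output: Callable[[List[str]], None]              # output module 30
    display: Callable[[str], None]                   # display module 40

    def browse(self, link: str) -> None:
        """Run one webpage through the four modules in order."""
        html_text = self.obtain(link)
        # First preset type (media URLs) and second preset type (text HTML).
        preset_list, text_only_html = self.process(html_text)
        self.output(preset_list)      # goes to the second display terminal
        self.display(text_only_html)  # stays on the first display terminal
```

Keeping each module behind a plain callable lets the fetching, splitting, sending, and rendering parts be replaced or stubbed independently.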
  • the webpage obtaining module 10 of the webpage data processing apparatus first receives the webpage link request input by the user, then sends a request to a server on the Internet according to the webpage link input by the user, obtains the webpage content from the Internet, and temporarily stores the obtained webpage content in memory or local storage.
  • the webpage processing module 20 of the webpage data processing apparatus reads the types of the obtained webpage content from memory or local storage; the types of webpage content include the picture type, the video type, the audio type, and the text type. It then classifies the webpage content by type: the multimedia content of the picture type, the video type, or the audio type is extracted to form the webpage content of the first preset type, and the remaining text-type content is reorganized to obtain the webpage content of the second preset type, so that the entire webpage content is divided into webpage content of the first preset type, which includes the picture type, the video type, and the audio type, and webpage content of the second preset type, which includes the text type.
  • then, the output module 30 of the webpage data processing apparatus outputs the webpage content of the first preset type to the second display terminal, and the user can use the display module 40 of the webpage data processing apparatus to display, according to the webpage content of the second preset type, a webpage without multimedia content such as pictures, video, and audio.
  • the webpage data processing apparatus is installed on the first display terminal. The first display terminal generally refers to a handheld smart terminal (such as a mobile phone, a PDA, a tablet computer, or a handheld game console), and the second display terminal generally refers to a display terminal with a large screen, such as a television set; the first display terminal can communicate with the second display terminal in real time.
  • through the webpage processing module 20, the webpage data processing apparatus divides the content of the webpage into webpage content of the first preset type, which includes content of the picture type, the video type, and the audio type, and webpage content of the second preset type, which includes content of the text type. The webpage content of the second preset type is browsed using the display module 40 of the webpage data processing apparatus, and the webpage content of the first preset type is output to the second display terminal for display. That is, only text-type content is browsed on the display module 40, and only picture-type, video-type, and audio-type content is displayed on the second display terminal. Because the webpage data processing apparatus is installed on the first display terminal, this avoids the uncoordinated screen that occurs when the first display terminal or the second display terminal is used to browse a webpage with multimedia content such as pictures, video, or audio; when the first display terminal is a handheld smart terminal, it can also avoid the appearance
  • the webpage obtaining module 10 includes:
  • the sending unit 11 is configured to send a webpage link request to the server;
  • the receiving unit 12 is configured to receive an HTML script file that is fed back by the server according to the webpage link request;
  • the obtaining unit 13 is configured to acquire all webpage content in the HTML script file.
  • In this embodiment, before the user browses a webpage using the first display terminal equipped with the webpage data processing apparatus, the sending unit 11 first receives a webpage link request input by the user, such as a webpage address or a hyperlink jump between webpages, and sends the webpage link request to a server on the Internet; then, the receiving unit 12 receives the HTML script file fed back by the server over HTTP; finally, the obtaining unit 13 acquires the corresponding webpage content from the fed-back HTML script file.
  • In this embodiment, the HTML script file is composed of a plurality of tags and the content corresponding to each tag.
  • It should be emphasized that, in other embodiments of the present invention, the content that the server feeds back according to the webpage link request may also be, but is not limited to, a webpage file in a format such as PHP (Hypertext Preprocessor), JSP (Java Server Pages), ASP (Active Server Pages) or CGI (Common Gateway Interface).
  • Further, referring to FIG. 6, the webpage processing module 20 includes:
  • the reading unit 21, configured to sequentially read all the webpage content and the corresponding tags in the HTML script file, the tags being webpage content type identification tags;
  • the first determining unit 22, configured to determine, according to the tag corresponding to the currently read webpage content, whether the currently read webpage content is webpage content of the first preset type;
  • the URL address extraction unit 23, configured to, when the currently read webpage content is webpage content of the first preset type, store the URL link address in the currently read webpage content into the preset list and delete the currently read webpage content and the corresponding tag from the HTML script;
  • the script file reorganization unit 24, configured to reorganize the HTML script file to obtain a new HTML script file after all the webpage content has been read;
  • the second determining unit 25, configured to determine, when the tag corresponding to the currently read webpage content is EMB or OBJECT, whether the currently read webpage content is webpage content of the first preset type according to the format of the object embedded in the webpage content.
  • In this embodiment, when the webpage processing module 20 classifies the acquired webpage content by type, the reading unit 21 first sequentially reads all the webpage content and the corresponding tags in the HTML script file fed back by the Internet server. The tags are webpage content type identification tags, that is, different tags indicate different types of webpage content. The first determining unit 22 then determines, from the name of the tag corresponding to each currently read piece of webpage content, whether that content is of the picture type, the video type or the audio type; in other words, the first determining unit 22 analyzes and compares the name of each tag to decide whether the content corresponding to the tag is multimedia content or text content. When the currently read webpage content is webpage content of the first preset type, the URL address extraction unit 23 extracts the URL link address in the currently read webpage content, stores it in the preset list, and deletes the currently read webpage content and the corresponding tag from the HTML script; that is, the URL address extraction unit 23 removes the currently read webpage content from the HTML script file so that the remaining HTML script file no longer contains webpage content of the first preset type or the corresponding tags. After all the webpage content has been read, the script file reorganization unit 24 re-typesets the remaining HTML script file, from which the webpage content of the first preset type and the corresponding tags have been deleted, to obtain a new HTML script file containing only text-type content; that is, the new HTML script file contains only webpage content of the second preset type.
  • Specifically, in this embodiment, the tags in the HTML script file include IMG, AUDIO, VIDEO, EMB and OBJECT. The tag indicating picture-type content is IMG. Note that the tag indicating video-type content differs between versions of the HTML script file: in an HTML5 script file, the tag indicating audio-type content is AUDIO and the tag indicating video-type content is VIDEO, whereas in older versions of the HTML script file the content in an EMB or OBJECT tag may be a video. In that case, the second determining unit 25 determines whether the content type of the tag is the video type by examining the format of the embedded object in the content corresponding to the EMB or OBJECT tag: when the second determining unit 25 determines that the format of the embedded object is a video format such as AVI, SWF, FLV or 3GP, the content type of the EMB or OBJECT tag can be determined to be the video type.
  • In this embodiment, the output module 30 of the webpage data processing apparatus is specifically configured to output the preset list to the second display terminal for display. More specifically, in this embodiment, the second display terminal is able to access the Internet. The output module 30 of the webpage data processing apparatus outputs the preset list, which contains the URL link addresses of the webpage content of the first preset type, to the second display terminal (such as a television set); the second display terminal then loads the corresponding URL link addresses according to the preset list and displays the corresponding webpage content of the first preset type. In this embodiment, the second display terminal can directly decode and play the content of the picture type, the video type or the audio type that each URL link address points to.
  • In this embodiment, the display module 40 of the webpage data processing apparatus is specifically configured to invoke a browser to load the new HTML script file. Because the new HTML script file contains only webpage content of the second preset type, the first display terminal equipped with the webpage data processing apparatus displays only text-type webpage content when it uses the display module 40 to invoke the browser to load the new HTML script file, so that the text-type content of the webpage is displayed neatly and attractively on the first display terminal (such as a mobile phone).
  • In this embodiment, the URL address extraction unit 23 extracts the URL link addresses of the webpage content of the first preset type and assembles them into the preset list, which is then sent to the second display terminal. On the one hand, because the data amount of the preset list is small, the traffic exchanged between the first display terminal on which the webpage data processing apparatus is installed and the second display terminal is reduced. On the other hand, the first display terminal equipped with the webpage data processing apparatus does not need to decode multimedia webpage content of the picture type, the video type or the audio type before sending it to the second display terminal, which avoids the heavy battery drain that long decoding operations would cause on the first display terminal (such as a mobile phone).

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • Human Computer Interaction (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

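The present invention discloses a webpage data processing method, comprising the steps of: a first display terminal sends a webpage link request to a server to acquire the webpage content included in the webpage link; the first display terminal classifies the acquired webpage content by type to form webpage content of a first preset type and webpage content of a second preset type; the first display terminal outputs the webpage content of the first preset type to a second display terminal for display; and the first display terminal displays the webpage content of the second preset type. The present invention also discloses a webpage data processing apparatus. With the webpage data processing method provided by the present invention, the text-type content of a webpage can be browsed on the first display terminal (such as a mobile phone) while the multimedia content of the webpage is output to the second display terminal for display, thereby avoiding the uncoordinated picture that arises when the first display terminal or the second display terminal is used to browse a webpage embedded with multimedia content such as pictures, video or audio.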

Description

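Webpage data processing method and device
Technical Field
The present invention relates to the field of communications, and in particular to a webpage data processing method and apparatus.
Background
At present, almost all handheld smart devices, such as mobile phones, PDAs (Personal Digital Assistants), tablet computers and handheld game consoles, come with a web browser application. Current webpages usually embed multimedia content such as pictures, video or audio; when such a webpage is browsed on a handheld smart device, the multimedia content is displayed or played directly within the webpage. However, because the speaker of a handheld smart device is relatively quiet and its sound quality is poor, it is difficult to obtain the sensory enjoyment of viewing multimedia content on a large-screen device. Solutions have therefore appeared on the market that let a handheld smart device interact with a large-screen device (such as a television set), for example by casting the webpage being browsed on a mobile phone or PDA to the television set for display. When casting the webpage to the television set, however, these solutions transmit the text and the multimedia content of the webpage together, so the television set displays a picture in which text and multimedia content are mixed, which easily makes the whole television picture look uncoordinated.
The above content is only intended to assist in understanding the technical solution of the present invention and does not constitute an admission that it is prior art.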
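Summary of the Invention
The main object of the present invention is to solve the technical problem that the picture is uncoordinated when a display terminal is used to browse a webpage embedded with multimedia content such as pictures, video or audio.
To achieve the above object, the present invention provides a webpage data processing method, comprising the following steps: a first display terminal sends a webpage link request to a server to acquire the webpage content included in the webpage link; the first display terminal classifies the acquired webpage content by type to form webpage content of a first preset type and webpage content of a second preset type, the first preset type including a picture type, a video type and an audio type, and the second preset type including a text type; the first display terminal outputs the webpage content of the first preset type to a second display terminal for display; and the first display terminal displays the webpage content of the second preset type.
Preferably, the first display terminal sending a webpage link request to a server to acquire the webpage content included in the webpage link comprises: the first display terminal sends the webpage link request to the server; the first display terminal receives an HTML script file fed back by the server according to the webpage link request; and the first display terminal acquires all the webpage content in the HTML script file.
Preferably, the first display terminal classifying the acquired webpage content by type to form the webpage content of the first preset type and the webpage content of the second preset type specifically comprises: the first display terminal sequentially reads all the webpage content and the corresponding tags in the HTML script file, the tags being webpage content type identification tags; the first display terminal determines, according to the tag corresponding to the currently read webpage content, whether the currently read webpage content is webpage content of the first preset type; if so, the first display terminal stores the URL link address in the currently read webpage content into a preset list and deletes the currently read webpage content and the corresponding tag from the HTML script; after all the webpage content has been read, the first display terminal reorganizes the HTML script file to obtain a new HTML script file. The first display terminal outputting the webpage content of the first preset type to the second display terminal for display is specifically that the first display terminal outputs the preset list to the second display terminal for display; and the first display terminal displaying the webpage content of the second preset type is specifically that the first display terminal invokes a browser to load the new HTML script file.
Preferably, the tags include IMG, AUDIO, VIDEO, EMB and OBJECT; when the tag corresponding to the currently read webpage content is EMB or OBJECT, the first display terminal determines, according to the format of the object embedded in the webpage content, whether the currently read webpage content is webpage content of the first preset type.
Preferably, the first display terminal outputting the preset list to the second display terminal for display is specifically that the first display terminal outputs the preset list to the second display terminal so that the second display terminal loads the URL link addresses in the preset list.
Preferably, the second display terminal decodes and plays the webpage content of the first preset type obtained by loading the URL link addresses in the preset list.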
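In addition, to achieve the above object, the present invention further provides a webpage data processing apparatus, comprising a webpage obtaining module, a webpage processing module, an output module and a display module, wherein: the webpage obtaining module is configured to send a webpage link request to a server to acquire the webpage content included in the webpage link; the webpage processing module is configured to classify the acquired webpage content by type to form webpage content of a first preset type and webpage content of a second preset type, the first preset type including a picture type, a video type and an audio type, and the second preset type including a text type; the output module is configured to output the webpage content of the first preset type to a second display terminal for display; and the display module is configured to display the webpage content of the second preset type.
Preferably, the webpage obtaining module comprises a sending unit, a receiving unit and an obtaining unit, wherein: the sending unit is configured to send the webpage link request to the server; the receiving unit is configured to receive the HTML script file fed back by the server according to the webpage link request; and the obtaining unit is configured to acquire all the webpage content in the HTML script file.
Preferably, the webpage processing module comprises a reading unit, a first determining unit, a URL address extraction unit and a script file reorganization unit, wherein: the reading unit is configured to sequentially read all the webpage content and the corresponding tags in the HTML script file, the tags being webpage content type identification tags; the first determining unit is configured to determine, according to the tag corresponding to the currently read webpage content, whether the currently read webpage content is webpage content of the first preset type; the URL address extraction unit is configured to, when the currently read webpage content is webpage content of the first preset type, store the URL link address in the currently read webpage content into a preset list and delete the currently read webpage content and the corresponding tag from the HTML script; and the script file reorganization unit is configured to reorganize the HTML script file to obtain a new HTML script file after all the webpage content has been read.
Preferably, the tags include IMG, AUDIO, VIDEO, EMB and OBJECT; the webpage processing module further comprises a second determining unit configured to determine, when the tag corresponding to the currently read webpage content is EMB or OBJECT, whether the currently read webpage content is webpage content of the first preset type according to the format of the object embedded in the webpage content.
Preferably, the display module is specifically configured to invoke a browser to load the new HTML script file; the output module is specifically configured to output the preset list to the second display terminal so that the second display terminal loads the URL link addresses in the preset list.
Preferably, the second display terminal decodes and plays the webpage content of the first preset type obtained by loading the URL link addresses in the preset list.
The webpage data processing method provided by the present invention divides the content of a webpage into webpage content of a first preset type, which includes content of the picture type, the video type and the audio type, and webpage content of a second preset type, which includes content of the text type. The webpage content of the second preset type is browsed on the first display terminal, and the webpage content of the first preset type is output to the second display terminal for display; that is, only text-type content is browsed on the first display terminal, and only content of the picture type, the video type and the audio type is displayed on the second display terminal. This avoids the uncoordinated picture that arises when the first display terminal or the second display terminal is used to browse a webpage embedded with multimedia content such as pictures, video or audio. When the first display terminal is a handheld smart terminal, it also avoids the lack of visual impact that the small screen of a handheld smart terminal causes when viewing multimedia content such as videos and pictures, which improves the user experience.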
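Brief Description of the Drawings
FIG. 1 is a schematic flowchart of a first embodiment of the webpage data processing method of the present invention;
FIG. 2 is a schematic flowchart of a second embodiment of the webpage data processing method of the present invention;
FIG. 3 is a schematic flowchart of a third embodiment of the webpage data processing method of the present invention;
FIG. 4 is a schematic diagram of the functional modules of an embodiment of the webpage data processing apparatus of the present invention;
FIG. 5 is a schematic diagram of the detailed functional modules of the webpage obtaining module in FIG. 4;
FIG. 6 is a schematic diagram of the detailed functional modules of the webpage processing module in FIG. 4.
The realization of the object, the functional features and the advantages of the present invention will be further described with reference to the accompanying drawings in conjunction with the embodiments.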
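Detailed Description
It should be understood that the specific embodiments described herein are merely intended to explain the present invention and are not intended to limit it.
The present invention provides a webpage data processing method.
Referring to FIG. 1, FIG. 1 is a schematic flowchart of a first embodiment of the webpage data processing method of the present invention.
In the first embodiment, the webpage data processing method includes:
Step S10: the first display terminal sends a webpage link request to a server to acquire the webpage content included in the webpage link.
In this embodiment, the first display terminal first receives a webpage link request input by the user, then accesses a server on the Internet according to that request, acquires the webpage content from the Internet, and temporarily stores the acquired webpage content in memory or in local storage. In this embodiment, the first display terminal generally refers to a handheld smart terminal, which may be, but is not limited to, a mobile phone, a PDA, a tablet computer or a handheld game console; any such terminal with a web browser application will do.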
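Step S20: the first display terminal classifies the acquired webpage content by type to form webpage content of a first preset type and webpage content of a second preset type.
In this embodiment, the first display terminal first reads the acquired webpage content of various types from memory or local storage, the types of webpage content including the picture type, the video type, the audio type and the text type, and then classifies the webpage content by type: the multimedia content of the picture type, the video type or the audio type is extracted to form the webpage content of the first preset type, and the remaining text-type content is reorganized to obtain the webpage content of the second preset type. The entire webpage content is thus divided into webpage content of the first preset type, which includes the picture type, the video type and the audio type, and webpage content of the second preset type, which includes content of the text type.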
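Step S30: the first display terminal outputs the webpage content of the first preset type to a second display terminal for display;
Step S40: the first display terminal displays the webpage content of the second preset type.
In this embodiment, the second display terminal generally refers to a display terminal with a larger screen, such as a television set. The first display terminal can communicate instantly with the second display terminal; the specific communication mode may be, but is not limited to, a wired local area network, a WiFi local area network, the wired Internet, the WiFi Internet, a WiFi hotspot, a 2G/3G/4G wireless communication network, Bluetooth, NFC (Near Field Communication) or infrared. After dividing the entire webpage content into webpage content of the first preset type, which includes content of the picture type, the video type and the audio type, and webpage content of the second preset type, which includes content of the text type, the first display terminal outputs the webpage content of the first preset type to the second display terminal for display, and the user can also use the first display terminal to display, according to the webpage content of the second preset type, a webpage with no multimedia content such as pictures, video and audio. In this way, the text-type content of the webpage is displayed neatly and attractively on the first display terminal (such as a mobile phone), while the second display terminal (such as a television set) displays, according to the webpage content of the first preset type, multimedia content such as pictures, video and/or audio with no text mixed in, so that the picture on the second display terminal remains coordinated while the multimedia content is played and does not look cluttered.
The webpage data processing method provided by this embodiment divides the content of a webpage into webpage content of a first preset type, which includes content of the picture type, the video type and the audio type, and webpage content of a second preset type, which includes content of the text type. The webpage content of the second preset type is browsed on the first display terminal, and the webpage content of the first preset type is output to the second display terminal for display; that is, only text-type content is browsed on the first display terminal, and only content of the picture type, the video type and the audio type is displayed on the second display terminal. This avoids the uncoordinated picture that arises when the first display terminal or the second display terminal is used to browse a webpage embedded with multimedia content such as pictures, video or audio. When the first display terminal is a handheld smart terminal, it also avoids the lack of visual impact that the small screen of a handheld smart terminal causes when viewing multimedia content such as videos and pictures, which improves the user experience.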
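Referring to FIG. 2, FIG. 2 is a schematic flowchart of a second embodiment of the webpage data processing method of the present invention.
On the basis of the first embodiment of the present invention, step S10 further specifically includes:
Step S101: the first display terminal sends a webpage link request to the server;
Step S102: the first display terminal receives the HTML script file fed back by the server according to the webpage link request;
Step S103: the first display terminal acquires all the webpage content in the HTML script file.
In this embodiment, before the user browses a webpage on the first display terminal, the first display terminal first receives a webpage link request input by the user, such as a webpage address or a hyperlink jump between webpages; next, the first display terminal accesses a server on the Internet according to that request, and the server feeds back the corresponding HTML script file to the first display terminal over HTTP; finally, the first display terminal acquires the corresponding webpage content from the fed-back HTML script file. In this embodiment, the HTML script file is composed of a plurality of tags and the content corresponding to each tag.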
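Steps S101 to S103 can be sketched in Python as follows. This is only an illustrative sketch, not the implementation described in the patent: the function name `fetch_html`, the `opener` parameter (used so the request can be replaced by a stub) and the UTF-8 fallback are assumptions introduced here.

```python
import urllib.request


def fetch_html(url, opener=urllib.request.urlopen, timeout=10.0):
    """Send the webpage link request and return the HTML script file as text.

    `opener` is a hypothetical injection point; by default the standard
    library's urlopen performs the HTTP request.
    """
    # S101/S102: send the link request and receive the server's response.
    with opener(url, timeout=timeout) as response:
        raw = response.read()
        # Assumption: fall back to UTF-8 when the server declares no charset.
        charset = response.headers.get_content_charset() or "utf-8"
    # S103: the decoded text is the HTML script file holding all webpage content.
    return raw.decode(charset, errors="replace")
```

The returned string would then be handed to the classification step (step S20).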
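It should be emphasized that, in other embodiments of the present invention, the content that the server feeds back according to the webpage link request may also be, but is not limited to, a webpage file in a format such as PHP (Hypertext Preprocessor), JSP (Java Server Pages), ASP (Active Server Pages) or CGI (Common Gateway Interface).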
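Referring to FIG. 3, FIG. 3 is a schematic flowchart of a third embodiment of the webpage data processing method of the present invention.
On the basis of the second embodiment of the present invention, step S20 further specifically includes:
Step S201: the first display terminal sequentially reads all the webpage content and the corresponding tags in the HTML script file.
In this embodiment, when classifying the acquired webpage content by type, the first display terminal first sequentially reads all the webpage content and the corresponding tags in the HTML script file fed back by the Internet server. The tags are webpage content type identification tags, that is, different tags indicate different types of webpage content.
Step S202: the first display terminal determines, according to the tag corresponding to the currently read webpage content, whether the currently read webpage content is webpage content of the first preset type; if so, step S203 is performed.
In this embodiment, the first display terminal determines, from the name of the tag corresponding to each currently read piece of webpage content, whether that content is of the picture type, the video type or the audio type; in other words, this embodiment analyzes and compares the name of each tag to decide whether the content corresponding to the tag is multimedia content or text content. Specifically, the tags in the HTML script file include IMG, AUDIO, VIDEO, EMB and OBJECT. The tag indicating picture-type content is IMG. Note that the tag indicating video-type content differs between versions of the HTML script file: in an HTML5 script file, the tag indicating audio-type content is AUDIO and the tag indicating video-type content is VIDEO, whereas in older versions of the HTML script file the content in an EMB or OBJECT tag may be a video. In that case, whether the content type of the tag is the video type can be determined by examining the format of the embedded object in the content corresponding to the EMB or OBJECT tag: when the format of the embedded object is a video format such as AVI, SWF, FLV or 3GP, the content type of the EMB or OBJECT tag can be determined to be the video type.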
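The tag-name comparison of step S202 might look like the following Python sketch. The function name, the returned labels and the treatment of "EMB" as also covering the HTML `embed` element are assumptions made for illustration; the text above only names the tags themselves.

```python
# Tags whose name alone marks first-preset-type (multimedia) content.
MULTIMEDIA_TAGS = {"img", "audio", "video"}
# Tags that need a further look at the embedded object's format.
# Assumption: "EMB" in the text corresponds to the HTML <embed> element.
EMBEDDED_TAGS = {"emb", "embed", "object"}


def classify_tag(tag_name):
    """Return a hypothetical label for the content behind a tag name."""
    name = tag_name.lower()
    if name in MULTIMEDIA_TAGS:
        return "first_preset"
    if name in EMBEDDED_TAGS:
        # The tag name is not decisive; the embedded format must be checked.
        return "check_embedded_format"
    return "second_preset"
```

Any tag outside these two sets is treated as text-type content in this sketch, which is a simplification of what a real page would need.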
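Step S203: the first display terminal stores the URL link address in the currently read webpage content into a preset list and deletes the currently read webpage content and the corresponding tag from the HTML script.
In this embodiment, when the currently read webpage content is webpage content of the first preset type, the URL link address in the currently read webpage content is extracted and stored in the preset list, and the currently read webpage content and the corresponding tag are deleted from the HTML script; that is, the currently read webpage content is removed from the HTML script file so that the remaining HTML script file no longer contains webpage content of the first preset type or the corresponding tags.
Step S204: after all the webpage content has been read, the first display terminal reorganizes the HTML script file to obtain a new HTML script file.
In this embodiment, the first display terminal sequentially reads all the webpage content and the corresponding tags in the HTML script file; once all the webpage content has been read, it re-typesets the remaining HTML script file, from which the webpage content of the first preset type and the corresponding tags have been deleted, to obtain a new HTML script file containing only text-type content. That is, the new HTML script file contains only webpage content of the second preset type.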
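Steps S203 and S204 can be illustrated with Python's standard `html.parser` module. This is a minimal sketch under stated assumptions, not the patented implementation: the class and function names are hypothetical, only the `src` attribute is treated as the URL link address, comments and declarations are dropped, and EMB/OBJECT handling is omitted for brevity.

```python
from html.parser import HTMLParser

MEDIA_TAGS = {"img", "audio", "video"}
VOID_MEDIA_TAGS = {"img"}  # media elements that have no closing tag


class MediaSplitter(HTMLParser):
    """Collect media URLs into a preset list and keep everything else."""

    def __init__(self):
        super().__init__(convert_charrefs=False)
        self.preset_list = []  # URL link addresses of first-preset-type content
        self.kept = []         # pieces of the reorganized, text-only HTML
        self.skip_depth = 0    # > 0 while inside a removed <audio>/<video>

    def _collect(self, attrs):
        for name, value in attrs:
            if name == "src" and value:
                self.preset_list.append(value)

    def handle_starttag(self, tag, attrs):
        if tag in MEDIA_TAGS or self.skip_depth:
            self._collect(attrs)  # also picks up <source src=...> children
            if tag in MEDIA_TAGS and tag not in VOID_MEDIA_TAGS:
                self.skip_depth += 1
            return
        self.kept.append(self.get_starttag_text())

    def handle_startendtag(self, tag, attrs):
        if tag in MEDIA_TAGS or self.skip_depth:
            self._collect(attrs)
            return
        self.kept.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        if tag in MEDIA_TAGS:
            if tag not in VOID_MEDIA_TAGS and self.skip_depth:
                self.skip_depth -= 1
            return
        if not self.skip_depth:
            self.kept.append(f"</{tag}>")

    def handle_data(self, data):
        if not self.skip_depth:
            self.kept.append(data)

    def handle_entityref(self, name):
        if not self.skip_depth:
            self.kept.append(f"&{name};")

    def handle_charref(self, name):
        if not self.skip_depth:
            self.kept.append(f"&#{name};")


def split_page(html):
    """Return (preset_list, new_html) for one HTML script file."""
    parser = MediaSplitter()
    parser.feed(html)
    parser.close()
    return parser.preset_list, "".join(parser.kept)
```

Calling `split_page` on a page yields the preset list for the second display terminal and the new HTML script file for the first display terminal's browser.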
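In this embodiment, step S30 is specifically that the first display terminal outputs the preset list to the second display terminal for display. More specifically, in this embodiment, the second display terminal is able to access the Internet. The first display terminal outputs the preset list, which contains the URL link addresses of the webpage content of the first preset type, to the second display terminal (such as a television set); the second display terminal then loads the corresponding URL link addresses according to the preset list and displays the corresponding webpage content of the first preset type. In this embodiment, the second display terminal can directly decode and play the content of the picture type, the video type or the audio type that each URL link address points to.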
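The text does not specify how the preset list is encoded when it is sent to the second display terminal. Purely as an assumption for illustration, the sketch below serializes it as a small JSON message; the message layout (`type`, `urls`) and both function names are hypothetical.

```python
import json


def build_preset_list_message(urls):
    """Encode the preset list for transmission (hypothetical JSON layout)."""
    message = {"type": "preset_list", "urls": list(urls)}
    return json.dumps(message).encode("utf-8")


def parse_preset_list_message(payload):
    """Decode a message on the second display terminal's side."""
    message = json.loads(payload.decode("utf-8"))
    if message.get("type") != "preset_list":
        raise ValueError("unexpected message type")
    return message["urls"]
```

The second display terminal would then load each returned URL itself and decode the media locally, as described above.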
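Because multimedia content of the picture type, the video type or the audio type has a very large data volume once decoded, if the first display terminal first decoded this content and then sent it to the second display terminal for playback, network congestion could easily occur during transmission, causing the picture on the second display terminal to lag. In this embodiment, the second display terminal loads the URL link addresses in the received preset list and obtains the corresponding multimedia content directly from the Internet for decoding and playback, which, at normal network speeds, avoids picture lag on the second display terminal caused by network congestion. In addition, because video-type content is usually in landscape orientation and the second display terminal with its larger screen (such as a television set) is also in landscape orientation, video-type content is better suited to playback on the second display terminal, giving the user a better viewing experience.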
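A rough calculation shows why sending decoded media is so much heavier than sending a URL. The figures below (1920x1080 resolution, 3 bytes per pixel, 30 frames per second, and the example URL) are assumptions chosen for illustration and do not come from the text.

```python
def raw_video_bytes_per_second(width, height, bytes_per_pixel, fps):
    """Data rate of uncompressed (decoded) video frames."""
    return width * height * bytes_per_pixel * fps


# Assumed example: 1080p, 24-bit colour, 30 frames per second.
decoded_rate = raw_video_bytes_per_second(1920, 1080, 3, 30)

# Assumed example URL standing in for one preset-list entry.
url_bytes = len("http://example.com/media/clip.mp4".encode("utf-8"))

print(decoded_rate)  # 186624000 bytes per second, roughly 187 MB/s
print(url_bytes)
```

Under these assumptions, one second of decoded video is millions of times larger than the URL entry that replaces it in the preset list.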
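In this embodiment, step S40 is specifically that the first display terminal invokes a browser to load the new HTML script file. Because the new HTML script file contains only webpage content of the second preset type, the first display terminal displays only text-type webpage content when it invokes the browser to load the new HTML script file. The text-type content of the webpage is thus displayed neatly and attractively on the first display terminal (such as a mobile phone); when the first display terminal is a handheld smart terminal, this avoids mixing multimedia content such as pictures, video or audio with the text-type content on the small screen of the handheld smart terminal, so the display does not look cluttered.
In this embodiment, the URL link addresses of the webpage content of the first preset type are extracted and assembled into the preset list, which is then sent to the second display terminal. On the one hand, because the data amount of the preset list is small, the traffic exchanged between the first display terminal and the second display terminal is reduced. On the other hand, the first display terminal does not need to decode multimedia webpage content of the picture type, the video type or the audio type before sending it to the second display terminal, which avoids the heavy battery drain that long decoding operations would cause on the first display terminal (such as a mobile phone).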
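The present invention further provides a webpage data processing apparatus.
Referring to FIG. 4, FIG. 4 is a schematic diagram of the functional modules of an embodiment of the webpage data processing apparatus of the present invention.
In one embodiment, the webpage data processing apparatus includes a webpage obtaining module 10, a webpage processing module 20, an output module 30 and a display module 40, wherein:
the webpage obtaining module 10 is configured to send a webpage link request to a server to acquire the webpage content included in the webpage link;
the webpage processing module 20 is configured to classify the acquired webpage content by type to form webpage content of a first preset type, which includes pictures, video or audio, and webpage content of a second preset type, which includes text;
the output module 30 is configured to output the webpage content of the first preset type to a second display terminal for display;
the display module 40 is configured to display the webpage content of the second preset type.
In this embodiment, the webpage obtaining module 10 of the webpage data processing apparatus first receives a webpage link request input by the user, then accesses a server on the Internet according to that request, acquires the webpage content from the Internet, and temporarily stores the acquired webpage content in memory or in local storage. The webpage processing module 20 of the webpage data processing apparatus reads the acquired webpage content of various types from memory or local storage, the types of webpage content including the picture type, the video type, the audio type and the text type, and then classifies the webpage content by type: the multimedia content of the picture type, the video type or the audio type is extracted to form the webpage content of the first preset type, and the remaining text-type content is reorganized to obtain the webpage content of the second preset type. The entire webpage content is thus divided into webpage content of the first preset type, which includes the picture type, the video type and the audio type, and webpage content of the second preset type, which includes content of the text type. Then, the output module 30 of the webpage data processing apparatus outputs the webpage content of the first preset type to the second display terminal for display, and the user can also use the display module 40 of the webpage data processing apparatus to display, according to the webpage content of the second preset type, a webpage with no multimedia content such as pictures, video and audio. In this embodiment, the webpage data processing apparatus is installed on the first display terminal. The first display terminal generally refers to a handheld smart terminal (such as a mobile phone, a PDA, a tablet computer or a handheld game console), and the second display terminal generally refers to a display terminal with a larger screen, such as a television set; the first display terminal can communicate instantly with the second display terminal.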
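The cooperation of the four modules can be sketched as a small Python class. This is a hypothetical arrangement for illustration only: the class name, the field names and the `browse` method are assumptions, and each module is reduced to a plain callable so the flow from module 10 to module 40 is visible.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class WebpageDataProcessingApparatus:
    # Module 10: takes a link, returns the HTML script file.
    obtain: Callable[[str], str]
    # Module 20: returns (first-preset-type URLs, text-only HTML).
    process: Callable[[str], Tuple[List[str], str]]
    # Module 30: sends first-preset-type content to the second display terminal.
    output: Callable[[List[str]], None]
    # Module 40: shows second-preset-type content on the first display terminal.
    display: Callable[[str], None]

    def browse(self, link):
        html = self.obtain(link)
        first_preset, second_preset = self.process(html)
        self.output(first_preset)
        self.display(second_preset)
```

Keeping the modules as injected callables is a design choice of this sketch; it lets each one be replaced by a stub when the flow is exercised without a network or a second terminal.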
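Through the webpage processing module 20, the webpage data processing apparatus provided by this embodiment divides the content of a webpage into webpage content of the first preset type, which includes content of the picture type, the video type and the audio type, and webpage content of the second preset type, which includes content of the text type. The webpage content of the second preset type is browsed using the display module 40 of the webpage data processing apparatus, and the webpage content of the first preset type is output to the second display terminal for display; that is, only text-type content is browsed on the display module 40, and only content of the picture type, the video type and the audio type is displayed on the second display terminal. Because the webpage data processing apparatus is installed on the first display terminal, this avoids the uncoordinated picture that arises when the first display terminal or the second display terminal is used to browse a webpage embedded with multimedia content such as pictures, video or audio. When the first display terminal is a handheld smart terminal, it also avoids the lack of visual impact that the small screen of a handheld smart terminal (such as a mobile phone) causes when viewing multimedia content such as videos and pictures, which improves the user experience.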
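Referring to FIG. 5, further, the webpage obtaining module 10 includes:
a sending unit 11, configured to send the webpage link request to the server;
a receiving unit 12, configured to receive the HTML script file fed back by the server according to the webpage link request;
an obtaining unit 13, configured to acquire all the webpage content in the HTML script file.
In this embodiment, before the user browses a webpage using the first display terminal equipped with the webpage data processing apparatus, the sending unit 11 first receives a webpage link request input by the user, such as a webpage address or a hyperlink jump between webpages, and sends the webpage link request to a server on the Internet; then, the receiving unit 12 receives the HTML script file fed back by the server over HTTP; finally, the obtaining unit 13 acquires the corresponding webpage content from the fed-back HTML script file. In this embodiment, the HTML script file is composed of a plurality of tags and the content corresponding to each tag.
It should be emphasized that, in other embodiments of the present invention, the content that the server feeds back according to the webpage link request may also be, but is not limited to, a webpage file in a format such as PHP (Hypertext Preprocessor), JSP (Java Server Pages), ASP (Active Server Pages) or CGI (Common Gateway Interface).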
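Referring to FIG. 6, further, the webpage processing module 20 includes:
a reading unit 21, configured to sequentially read all the webpage content and the corresponding tags in the HTML script file, the tags being webpage content type identification tags;
a first determining unit 22, configured to determine, according to the tag corresponding to the currently read webpage content, whether the currently read webpage content is webpage content of the first preset type;
a URL address extraction unit 23, configured to, when the currently read webpage content is webpage content of the first preset type, store the URL link address in the currently read webpage content into a preset list and delete the currently read webpage content and the corresponding tag from the HTML script;
a script file reorganization unit 24, configured to reorganize the HTML script file to obtain a new HTML script file after all the webpage content has been read;
a second determining unit 25, configured to determine, when the tag corresponding to the currently read webpage content is EMB or OBJECT, whether the currently read webpage content is webpage content of the first preset type according to the format of the object embedded in the webpage content.
In this embodiment, when the webpage processing module 20 classifies the acquired webpage content by type, the reading unit 21 first sequentially reads all the webpage content and the corresponding tags in the HTML script file fed back by the Internet server. The tags are webpage content type identification tags, that is, different tags indicate different types of webpage content. The first determining unit 22 then determines, from the name of the tag corresponding to each currently read piece of webpage content, whether that content is of the picture type, the video type or the audio type; in other words, the first determining unit 22 analyzes and compares the name of each tag to decide whether the content corresponding to the tag is multimedia content or text content. Then, when the currently read webpage content is webpage content of the first preset type, the URL address extraction unit 23 extracts the URL link address in the currently read webpage content, stores it in the preset list, and deletes the currently read webpage content and the corresponding tag from the HTML script; that is, the URL address extraction unit 23 removes the currently read webpage content from the HTML script file so that the remaining HTML script file no longer contains webpage content of the first preset type or the corresponding tags. After all the webpage content has been read, the script file reorganization unit 24 re-typesets the remaining HTML script file, from which the webpage content of the first preset type and the corresponding tags have been deleted, to obtain a new HTML script file containing only text-type content; that is, the new HTML script file contains only webpage content of the second preset type.
Specifically, in this embodiment, the tags in the HTML script file include IMG, AUDIO, VIDEO, EMB and OBJECT. The tag indicating picture-type content is IMG. Note that the tag indicating video-type content differs between versions of the HTML script file: in an HTML5 script file, the tag indicating audio-type content is AUDIO and the tag indicating video-type content is VIDEO, whereas in older versions of the HTML script file the content in an EMB or OBJECT tag may be a video. In that case, the second determining unit 25 determines whether the content type of the tag is the video type by examining the format of the embedded object in the content corresponding to the EMB or OBJECT tag: when the second determining unit 25 determines that the format of the embedded object is a video format such as AVI, SWF, FLV or 3GP, the content type of the EMB or OBJECT tag can be determined to be the video type.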
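The format check performed for EMB and OBJECT tags could be approximated by looking at the file extension of the embedded object's address, as in the Python sketch below. Judging the format from the extension, reading the address from the `src` or `data` attribute, and the function name are all assumptions made for illustration; the text only lists AVI, SWF, FLV and 3GP as example video formats.

```python
import posixpath
from urllib.parse import urlparse

# Example video formats named in the text, as lowercase file extensions.
VIDEO_EXTENSIONS = {".avi", ".swf", ".flv", ".3gp"}
# Assumption: "EMB" also covers the HTML <embed> element.
EMBEDDED_TAGS = {"emb", "embed", "object"}


def is_video_embed(tag, attrs):
    """Guess whether an embedded object is a video from its file extension.

    `attrs` is a dict of the tag's attributes; <embed> normally uses `src`
    and <object> normally uses `data` for the resource address.
    """
    if tag.lower() not in EMBEDDED_TAGS:
        return False
    url = attrs.get("src") or attrs.get("data") or ""
    extension = posixpath.splitext(urlparse(url).path)[1].lower()
    return extension in VIDEO_EXTENSIONS
```

A check based on the extension alone can be fooled by misnamed files, so a fuller implementation might also inspect the `type` attribute or the server's Content-Type header.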
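In this embodiment, the output module 30 of the webpage data processing apparatus is specifically configured to output the preset list to the second display terminal for display. More specifically, in this embodiment, the second display terminal is able to access the Internet. The output module 30 of the webpage data processing apparatus outputs the preset list, which contains the URL link addresses of the webpage content of the first preset type, to the second display terminal (such as a television set); the second display terminal then loads the corresponding URL link addresses according to the preset list and displays the corresponding webpage content of the first preset type. In this embodiment, the second display terminal can directly decode and play the content of the picture type, the video type or the audio type that each URL link address points to.
In this embodiment, the display module 40 of the webpage data processing apparatus is specifically configured to invoke a browser to load the new HTML script file. Because the new HTML script file contains only webpage content of the second preset type, the first display terminal equipped with the webpage data processing apparatus displays only text-type webpage content when it uses the display module 40 to invoke the browser to load the new HTML script file, so that the text-type content of the webpage is displayed neatly and attractively on the first display terminal (such as a mobile phone).
In this embodiment, the URL address extraction unit 23 extracts the URL link addresses of the webpage content of the first preset type and assembles them into the preset list, which is then sent to the second display terminal. On the one hand, because the data amount of the preset list is small, the traffic exchanged between the first display terminal equipped with the webpage data processing apparatus and the second display terminal is reduced. On the other hand, the first display terminal equipped with the webpage data processing apparatus does not need to decode multimedia webpage content of the picture type, the video type or the audio type before sending it to the second display terminal, which avoids the heavy battery drain that long decoding operations would cause on the first display terminal (such as a mobile phone).
The above are merely preferred embodiments of the present invention and do not thereby limit the patent scope of the present invention. Any equivalent structural or equivalent process transformation made using the contents of the specification and drawings of the present invention, or any direct or indirect application in other related technical fields, is likewise included within the scope of patent protection of the present invention.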

Claims (14)

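  1. A webpage data processing method, characterized in that the webpage data processing method comprises the following steps:
    a first display terminal sends a webpage link request to a server to acquire the webpage content included in the webpage link;
    the first display terminal classifies the acquired webpage content by type to form webpage content of a first preset type and webpage content of a second preset type, the first preset type including a picture type, a video type and an audio type, and the second preset type including a text type;
    the first display terminal outputs the webpage content of the first preset type to a second display terminal for display;
    the first display terminal displays the webpage content of the second preset type.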
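  2. The webpage data processing method according to claim 1, characterized in that sending the webpage link request to the server to acquire the webpage content included in the webpage link comprises:
    the first display terminal sends the webpage link request to the server;
    the first display terminal receives an HTML script file fed back by the server according to the webpage link request;
    the first display terminal acquires all the webpage content in the HTML script file.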
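  3. The webpage data processing method according to claim 2, characterized in that classifying the acquired webpage content by type to form the webpage content of the first preset type and the webpage content of the second preset type specifically comprises:
    the first display terminal sequentially reads all the webpage content and the corresponding tags in the HTML script file, the tags being webpage content type identification tags;
    the first display terminal determines, according to the tag corresponding to the currently read webpage content, whether the currently read webpage content is webpage content of the first preset type;
    if so, the first display terminal stores the URL link address in the currently read webpage content into a preset list and deletes the currently read webpage content and the corresponding tag from the HTML script;
    after all the webpage content has been read, the first display terminal reorganizes the HTML script file to obtain a new HTML script file;
    the first display terminal outputting the webpage content of the first preset type to the second display terminal for display is specifically that the first display terminal outputs the preset list to the second display terminal for display;
    the first display terminal displaying the webpage content of the second preset type is specifically that the first display terminal invokes a browser to load the new HTML script file.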
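  4. The webpage data processing method according to claim 3, characterized in that the tags include IMG, AUDIO, VIDEO, EMB and OBJECT;
    when the tag corresponding to the currently read webpage content is EMB or OBJECT, the first display terminal determines, according to the format of the object embedded in the webpage content, whether the currently read webpage content is webpage content of the first preset type.
  5. The webpage data processing method according to claim 3, characterized in that the first display terminal outputting the preset list to the second display terminal for display is specifically that the first display terminal outputs the preset list to the second display terminal so that the second display terminal loads the URL link addresses in the preset list.
  6. The webpage data processing method according to claim 5, characterized in that the second display terminal decodes and plays the webpage content of the first preset type obtained by loading the URL link addresses in the preset list.
  7. The webpage data processing method according to claim 4, characterized in that the first display terminal outputting the preset list to the second display terminal for display is specifically that the first display terminal outputs the preset list to the second display terminal so that the second display terminal loads the URL link addresses in the preset list.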
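  8. A webpage data processing apparatus, characterized in that the webpage data processing apparatus comprises a webpage obtaining module, a webpage processing module, an output module and a display module, wherein:
    the webpage obtaining module is configured to send a webpage link request to a server to acquire the webpage content included in the webpage link;
    the webpage processing module is configured to classify the acquired webpage content by type to form webpage content of a first preset type and webpage content of a second preset type, the first preset type including a picture type, a video type and an audio type, and the second preset type including a text type;
    the output module is configured to output the webpage content of the first preset type to a second display terminal for display;
    the display module is configured to display the webpage content of the second preset type.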
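  9. The webpage data processing apparatus according to claim 8, characterized in that the webpage obtaining module comprises a sending unit, a receiving unit and an obtaining unit, wherein:
    the sending unit is configured to send the webpage link request to the server;
    the receiving unit is configured to receive the HTML script file fed back by the server according to the webpage link request;
    the obtaining unit is configured to acquire all the webpage content in the HTML script file.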
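  10. The webpage data processing apparatus according to claim 9, characterized in that the webpage processing module comprises a reading unit, a first determining unit, a URL address extraction unit and a script file reorganization unit, wherein:
    the reading unit is configured to sequentially read all the webpage content and the corresponding tags in the HTML script file, the tags being webpage content type identification tags;
    the first determining unit is configured to determine, according to the tag corresponding to the currently read webpage content, whether the currently read webpage content is webpage content of the first preset type;
    the URL address extraction unit is configured to, when the currently read webpage content is webpage content of the first preset type, store the URL link address in the currently read webpage content into a preset list and delete the currently read webpage content and the corresponding tag from the HTML script;
    the script file reorganization unit is configured to reorganize the HTML script file to obtain a new HTML script file after all the webpage content has been read.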
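  11. The webpage data processing apparatus according to claim 10, characterized in that the tags include IMG, AUDIO, VIDEO, EMB and OBJECT;
    the webpage processing module further comprises a second determining unit configured to determine, when the tag corresponding to the currently read webpage content is EMB or OBJECT, whether the currently read webpage content is webpage content of the first preset type according to the format of the object embedded in the webpage content.
  12. The webpage data processing apparatus according to claim 10, characterized in that the display module is specifically configured to invoke a browser to load the new HTML script file; the output module is specifically configured to output the preset list to the second display terminal so that the second display terminal loads the URL link addresses in the preset list.
  13. The webpage data processing apparatus according to claim 12, characterized in that the second display terminal decodes and plays the webpage content of the first preset type obtained by loading the URL link addresses in the preset list.
  14. The webpage data processing apparatus according to claim 11, characterized in that the display module is specifically configured to invoke a browser to load the new HTML script file; the output module is specifically configured to output the preset list to the second display terminal so that the second display terminal loads the URL link addresses in the preset list.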
PCT/CN2014/089244 2014-09-26 2014-10-23 网页数据处理方法和装置 WO2016045160A1 (zh)

Priority Applications (2)

Application Number Priority Date Filing Date Title
US15/125,965 US20170083487A1 (en) 2014-09-26 2014-10-23 Webpage data processing method and device
AU2014406868A AU2014406868A1 (en) 2014-09-26 2014-10-23 Webpage data processing method and device

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201410500228.7A CN104217024B (zh) 2014-09-26 2014-09-26 网页数据处理方法和装置
CN201410500228.7 2014-09-26

Publications (1)

Publication Number Publication Date
WO2016045160A1 true WO2016045160A1 (zh) 2016-03-31

Family

ID=52098514

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2014/089244 WO2016045160A1 (zh) 2014-09-26 2014-10-23 网页数据处理方法和装置

Country Status (4)

Country Link
US (1) US20170083487A1 (zh)
CN (1) CN104217024B (zh)
AU (1) AU2014406868A1 (zh)
WO (1) WO2016045160A1 (zh)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106331876A (zh) * 2016-09-09 2017-01-11 乐视控股(北京)有限公司 网页视频播放方法及装置
CN110659006A (zh) * 2019-08-20 2020-01-07 北京捷通华声科技股份有限公司 跨屏显示的方法、装置、电子设备及可读存储介质

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104951504B (zh) 2015-05-20 2018-08-14 深圳创维-Rgb电子有限公司 一种网页处理方法及系统
CN105183639A (zh) * 2015-07-23 2015-12-23 柳州易旺科技有限公司 一种用户侧网页数据测试方法
CN105183634A (zh) * 2015-07-23 2015-12-23 柳州一合科技有限公司 一种软件的多路同步检测方法
CN105183635A (zh) * 2015-07-23 2015-12-23 柳州一合科技有限公司 一种服务器端软件的安全测试方法
CN105138444A (zh) * 2015-07-23 2015-12-09 柳州一合科技有限公司 一种面向多用户的软件发布方法
CN105095437A (zh) * 2015-07-23 2015-11-25 柳州易旺科技有限公司 一种临时网页数据处理方法
CN105488145B (zh) * 2015-11-26 2019-07-09 小米科技有限责任公司 网页内容的显示方法、装置及终端
CN109271122B (zh) * 2018-09-03 2022-07-01 高新兴科技集团股份有限公司 一种基于双显示屏的文件显示方法、装置及设备
US10866704B1 (en) * 2019-06-04 2020-12-15 Microsoft Technology Licensing, Llc Intelligent web browser segregation
CN110532495A (zh) * 2019-08-30 2019-12-03 郑州阿帕斯科技有限公司 一种网页信息的展示方法及装置
US11899757B2 (en) * 2019-12-02 2024-02-13 Cox Automotive, Inc. Systems and methods for temporary digital content sharing
CN111273903B (zh) * 2020-01-10 2023-08-04 深圳市大富网络技术有限公司 网页制作方法、装置、计算机设备及计算机存储介质
JP2022012403A (ja) * 2020-07-01 2022-01-17 キヤノン株式会社 プログラム、情報処理装置及び制御方法

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103412896A (zh) * 2013-07-25 2013-11-27 深圳创维-Rgb电子有限公司 浏览器资源展示的方法和系统
CN103514202A (zh) * 2012-06-25 2014-01-15 北京新媒传信科技有限公司 一种网页显示的实现方法和装置
CN103942168A (zh) * 2014-01-08 2014-07-23 北京奇虎科技有限公司 一种浏览器进行信息传输的方法和系统

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7831992B2 (en) * 2002-09-18 2010-11-09 General Instrument Corporation Method and apparatus for forwarding television channel video image snapshots to an auxiliary display device
US20050039133A1 (en) * 2003-08-11 2005-02-17 Trevor Wells Controlling a presentation of digital content
US7460126B2 (en) * 2004-08-24 2008-12-02 Silicon Graphics, Inc. Scalable method and system for streaming high-resolution media
US20060203758A1 (en) * 2005-03-11 2006-09-14 Samsung Electronics Co., Ltd. Mobile terminal for relaying multimedia data to an external display device
US7853871B2 (en) * 2005-06-10 2010-12-14 Nokia Corporation System and method for identifying segments in a web resource
US8074248B2 (en) * 2005-07-26 2011-12-06 Activevideo Networks, Inc. System and method for providing video content associated with a source image to a television in a communication network
CN100539533C (zh) * 2007-08-31 2009-09-09 烽火通信科技股份有限公司 一种移动终端结合家庭网关控制多媒体播放的方法和系统
US8554938B2 (en) * 2010-08-31 2013-10-08 Millind Mittal Web browser proxy-client video system and method
US8774955B2 (en) * 2011-04-13 2014-07-08 Google Inc. Audio control of multimedia objects
US20130013583A1 (en) * 2011-05-30 2013-01-10 Lei Yu Online video tracking and identifying method and system
US9510055B2 (en) * 2013-01-23 2016-11-29 Sonos, Inc. System and method for a media experience social interface
US9679054B2 (en) * 2014-03-05 2017-06-13 Sonos, Inc. Webpage media playback

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106331876A (zh) * 2016-09-09 2017-01-11 乐视控股(北京)有限公司 网页视频播放方法及装置
CN110659006A (zh) * 2019-08-20 2020-01-07 北京捷通华声科技股份有限公司 跨屏显示的方法、装置、电子设备及可读存储介质
CN110659006B (zh) * 2019-08-20 2023-08-22 北京捷通华声科技股份有限公司 跨屏显示的方法、装置、电子设备及可读存储介质

Also Published As

Publication number Publication date
CN104217024A (zh) 2014-12-17
AU2014406868A1 (en) 2016-10-20
US20170083487A1 (en) 2017-03-23
CN104217024B (zh) 2018-02-16

Similar Documents

Publication Publication Date Title
WO2016045160A1 (zh) 网页数据处理方法和装置
US8495495B2 (en) Information processing apparatus, bookmark setting method, and program
WO2018186592A1 (en) Electronic device and operating method thereof
WO2015093637A1 (ko) 컨텐츠 공유를 위한 서버 장치, 클라이언트 장치 및 그 컨텐츠 공유 방법
WO2012026750A2 (ko) 컨텐츠 공유 제어 방법, 그를 이용한 휴대용 단말기 및 컨텐츠 공유 시스템
WO2014058250A1 (en) User terminal device, sns providing server, and contents providing method thereof
WO2014157899A1 (en) Method for controlling dynamically changing contents of web page and electronic device thereof
WO2019139270A1 (ko) 디스플레이 장치 및 이의 컨텐츠 제공 방법
CN107257510B (zh) 视频统一播放方法、终端及计算机可读存储介质
WO2016137134A1 (en) Broadcast receiving apparatus and method for controlling the same
WO2020045795A1 (ko) 전자 장치 및 이의 외부 장치를 제어하는 방법
WO2018093138A1 (en) Electronic apparatus and method of operating the same
WO2015069082A1 (en) Display apparatus and method of controlling the same
WO2017135776A1 (en) Display apparatus, user terminal apparatus, system, and controlling method thereof
WO2016206223A1 (zh) 截图的方法及系统
WO2017190451A1 (zh) 图片推送方法和装置
WO2016024824A1 (en) Display apparatus and method of controlling the same
WO2013015471A1 (en) Electronic device
WO2015182844A1 (ko) 디스플레이 장치, 사용자 단말 장치, 서버 및 그 제어 방법
WO2016080585A1 (en) System and method for providing cloud based user interfaces
WO2015190780A1 (ko) 사용자 단말 및 이의 제어 방법
CN109063079B (zh) 网页标注方法及电子设备
WO2022080866A1 (ko) 전자 장치 및 그 동작 방법
WO2016080586A1 (en) System and method for providing cloud based user interfaces
WO2018016760A1 (ko) 전자 장치 및 그의 제어 방법

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 14902620

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 15125965

Country of ref document: US

ENP Entry into the national phase

Ref document number: 2014406868

Country of ref document: AU

Date of ref document: 20141023

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 30.08.2017)

122 Ep: pct application non-entry in european phase

Ref document number: 14902620

Country of ref document: EP

Kind code of ref document: A1