WO2014044154A1 - Method and apparatus for obtaining information - Google Patents

Method and apparatus for obtaining information Download PDF

Info

Publication number
WO2014044154A1
WO2014044154A1 PCT/CN2013/083508 CN2013083508W WO2014044154A1 WO 2014044154 A1 WO2014044154 A1 WO 2014044154A1 CN 2013083508 W CN2013083508 W CN 2013083508W WO 2014044154 A1 WO2014044154 A1 WO 2014044154A1
Authority
WO
WIPO (PCT)
Prior art keywords
preset
webpages
client
pages
information obtaining
Prior art date
Application number
PCT/CN2013/083508
Other languages
French (fr)
Inventor
Zixin HAN
Guoqiang Wang
Zhan Chen
Shuicheng HUANG
Peng Sun
Original Assignee
Tencent Technology (Shenzhen) Company Limited
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology (Shenzhen) Company Limited filed Critical Tencent Technology (Shenzhen) Company Limited
Priority to EP13839230.3A priority Critical patent/EP2898433A4/en
Priority to US14/082,510 priority patent/US20140082484A1/en
Publication of WO2014044154A1 publication Critical patent/WO2014044154A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/957Browsing optimisation, e.g. caching or content distillation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/957Browsing optimisation, e.g. caching or content distillation
    • G06F16/9577Optimising the visualization of content, e.g. distillation of HTML documents

Definitions

  • the present disclosure generally relates to computer network technologies and, more particularly, to an information obtaining method and apparatus.
  • One aspect of the present disclosure includes a method for obtaining information on the Internet.
  • the method includes an information obtaining apparatus changing from a paging mode to a reading mode of a client.
  • the method also includes the information obtaining apparatus downloading at least two pages of preset webpages when receiving a request for accessing the preset webpages sent from the client.
  • the method includes the information obtaining apparatus extracting body content of the at least two pages of the preset webpages.
  • the method includes the information obtaining apparatus splicing and outputting the body content of the preset webpages in a predetermined sequence.
  • Another aspect of the present disclosure includes an information obtaining apparatus.
  • the information obtaining apparatus includes a downloading module, an extraction module, and an output module.
  • the downloading module is configured to download at least two pages of preset webpages when receiving a request for accessing the preset webpages sent from a client.
  • the extraction module is configured to extract body content of at least two pages of the preset webpages.
  • the output module is configured to splice and output the body content of the preset webpages in a predetermined sequence.
  • Figure 1 illustrates a flow diagram of an exemplary information obtaining method consistent with the disclosed embodiments
  • Figure 2 illustrates a flow diagram of another exemplary information obtaining method consistent with the disclosed embodiments
  • Figure 3 illustrates a structure diagram of an exemplary information obtaining apparatus consistent with the disclosed embodiments
  • Figure 4 illustrates a structure diagram of another exemplary information obtaining apparatus consistent with the disclosed embodiments
  • Figure 5 illustrates an exemplary operating environment incorporating certain disclosed embodiments.
  • Figure 6 illustrates a block diagram of an exemplary computer system consistent with the disclosed embodiments.
  • FIG. 5 illustrates an exemplary operating environment 500 incorporating certain disclosed embodiments.
  • environment 500 may include a terminal 504, the Internet 503, and a server 502.
  • the Internet 503 may include any appropriate type of communication network for providing network connections to the terminal 504 and the server 502 or among multiple terminals and servers.
  • Internet 503 may include the Internet or other types of computer networks or telecommunication networks, either wired or wireless.
  • a server may refer to one or more server computers configured to provide certain web server functionalities to provide certain personalized services, which may require any user accessing the services to authenticate to the server before the access.
  • a web server may also include one or more processors to execute computer programs in parallel.
  • the server 502 may include any appropriate server computers configured to provide certain server functionalities, such as a file server functionality for responding a user's request for obtaining information operations or other application server. Although only one server is shown, any number of servers can be included.
  • the server 502 may be operated in a cloud or non-cloud computing environment.
  • Terminal 504 may include any appropriate type of mobile computing devices, such as mobile phones, smart phones, tablets, notebook computers, or any type of computing platform.
  • a terminal e.g., terminal 504 may include one or more clients 501.
  • the client 501 may include any appropriate mobile application software, hardware, or a combination of application software and hardware to achieve certain client functionalities.
  • client 501 may include a browser, etc.
  • a mobile client may be a browser installed on the terminal for browsing, including various types of existing and future browser installed on terminals.
  • Terminal 504, client 501, and/or server 502 may be implemented on any appropriate computing platform.
  • Figure 6 illustrates a block diagram of an exemplary computer system 600 capable of implementing terminal 504, client 501, and/or server 502.
  • computer system 600 may include a processor 602, a storage medium 604, a monitor 606, a communication module 608, a database 610, and peripherals 612. Certain devices may be omitted and other devices may be included.
  • Processor 602 may include any appropriate processor or processors. Further, processor 602 can include multiple cores for multi-thread or parallel processing.
  • Storage medium 604 may include memory modules, such as Read-only memory (ROM), Random Access Memory (RAM), flash memory modules, and erasable and rewritable memory, and mass storages, such as CD-ROM, U-disk, and hard disk, etc.
  • Storage medium 604 may store computer programs for implementing various processes, when executed by processor 602.
  • peripherals 612 may include I/O devices such as keyboard and mouse, and communication module 608 may include network devices for establishing connections through the communication network.
  • Database 610 may include one or more databases for storing certain data and for performing certain operations on the stored data, such as database searching.
  • FIG. 1 illustrates a flow diagram of an exemplary information obtaining process consistent with the disclosed embodiments.
  • the information obtaining process includes the following steps:
  • Step 101 an information obtaining apparatus downloads at least two pages of preset webpages when receiving a request for accessing the preset webpages sent from a client.
  • Step 102 the information obtaining apparatus extracts body content of the at least two pages of the preset webpages.
  • Step 103 the information obtaining apparatus splices and outputs the body content of the preset webpages in a predetermined sequence.
  • the information obtaining apparatus may also determine the total number of the preset webpages to be downloaded.
  • the information obtaining apparatus may obtain access point information of the client and, based on the access point information of the client, the information obtaining apparatus judges whether network access of the client is charged according to traffic amount. If the network access of the client is not charged according to traffic, the information obtaining apparatus determines to download the first number of preset pages of the preset webpages; if the network access of the client is charged according to traffic, the information obtaining apparatus determines to download the second number of preset pages of the preset webpages.
  • the information obtaining process further includes that, when receiving a request for displaying the next page from the client, the information obtaining apparatus downloads the webpages after the first preset pages.
  • the information obtaining process further includes:
  • the information obtaining apparatus obtains the total number of spliced pages of the current page cached on the client and judges whether the number of the spliced pages exceeds a threshold value. If the number of the spliced pages exceeds the threshold value, the information obtaining apparatus discards certain webpages (e.g., designated webpages) of the current page and downloads webpages after the second number of preset pages.
  • certain webpages e.g., designated webpages
  • the information obtaining apparatus may trims non-body content information of the downloaded webpages and reformats the trimmed body content as pure document or pure text to obtain the body content of the preset webpages. [0035] Therefore, the information obtaining apparatus downloads at least two pages of the preset webpages when receiving a request for accessing the preset webpages sent from the client. Then, the information obtaining apparatus extracts body content of at least two pages of the preset webpages. The information obtaining apparatus splices and outputs the body content of the preset webpages in the predetermined sequence.
  • the information obtaining apparatus downloads body content of at least two pages of the preset webpages. Then, the information obtaining apparatus splices and outputs the downloaded content as pure text in a clean, clutter-free format. Therefore, the user may browse webpages more conveniently when using the mobile terminal without interference from non-body content information, improving the user's reading experience.
  • Figure 2 illustrates a flow diagram of another exemplary information obtaining process consistent with the disclosed embodiments.
  • a preset browser i.e., a client
  • a preset browser e.g., a client
  • a reading mode is provided for the user.
  • the information obtaining apparatus automatically downloads webpages that the user may read, via intelligent judgment, and splices the previous page and the next page together in a layout similar to reading layout, allowing the user to enter an immersive reading state.
  • the terminal may include any appropriate type of mobile computing devices, such as mobile phones, smart phones, tablets, notebook computers, or any type of computing platform.
  • the client as used herein, may include any appropriate mobile application software, hardware, or a combination of application software and hardware to achieve certain client functionalities. There are no specific limitations on the client, and the information obtaining apparatus may refer to either or both of the terminal and the client.
  • a fast-reading mode and a traffic- saving reading mode are provided in the browser for the users. If the network access of the client is not charged according to actual traffic, the fast- reading mode may be selected. Under the fast-reading mode, because the network environment is relatively good, when receiving an access request from the client, the information obtaining apparatus may download more network contents. [0040] For example, the information obtaining apparatus may download and parse the first number of preset webpages. After the first number of preset webpages are parsed, the parsed webpages are stored in the cache and put on a display list to wait for being displayed. For example, if the first number of preset webpages is N, N pages of webpages are downloaded successively, and the downloaded webpages are parsed and cached in a display list.
  • the source code of (N+l)th page may be stored in the (N+l)th space in the cache.
  • the content of the (N+l)th page is downloaded, and the
  • next page content may be displayed by local parsing operation, thereby avoiding the time spent in waiting for requesting the network to receive data again.
  • the network environment that is not charged according to actual traffic may include, but not limited to, WiFi, LAN, etc.
  • the first number of the preset webpages may be 2, 3, 5, etc.
  • the information obtaining apparatus may determine the first number based on user configuration or based on particular applications. Further, the information obtaining apparatus may adjust the first number based on the network environment. For example, the first number of the preset pages may be set to 5 in a desired network environment, or the first number of the preset pages may be set to 3 in a less-desired network environment.
  • the traffic-saving reading mode is selected.
  • the information obtaining apparatus may download the second number of preset webpages.
  • the second number may be 2, 3, etc., and the information obtaining apparatus may adjust the second number of the preset pages based on traffic charge of the client.
  • the network environment that is charged according to actual traffic may include General Packet Radio Service (GPRS) or other wireless networks, etc.
  • GPRS General Packet Radio Service
  • the discard condition may be based on the total number of spliced pages, i.e., the total pages reformatted by removing page spacing and other non-body content, which may be set to a threshold value and may be adjusted dynamically based on the available cache and/or the network access condition. If the spliced page number reaches the threshold value, the oldest page (e.g., the most front page) may be discarded and the new page can be downloaded, parsed, and displayed.
  • the information obtaining process includes the following steps:
  • Step 201 the information obtaining apparatus determines the number of preset webpages to be downloaded when receiving a request for accessing preset webpages sent from a client.
  • the information obtaining apparatus Before downloading the preset webpages, based on the current network access type of the client, the information obtaining apparatus determines the number of the preset webpages to be downloaded. More specifically, to determine the number of the preset webpages to be downloaded, the information obtaining apparatus may first obtain access point information of the client.
  • the information obtaining apparatus judges whether the network access of the client is charged according to traffic. If the network access of the client is not charged according to traffic, the information obtaining apparatus determines to download the first number of preset webpages from the preset webpages. On the other hand, if the network access of the client is charged according to traffic, the information obtaining apparatus determines to download the second number of preset pages from the preset webpages.
  • the webpages are opened according to a current operating mode, i.e., the paging mode.
  • a 'reading mode' option button may be provided on the displayed pages under the paging mode for the user to change the paging mode into the 'reading mode.' If a user selects the 'reading mode' button, the 'reading mode' is used in the preset browser of the client. If the user does not select the 'reading mode' button, the default paging mode is used by the user, that is, the next page content is obtained by clicking 'next page' every time. Of course, the reading mode may be selected by other methods.
  • Step 202 the information obtaining apparatus downloads the preset webpages based on the determined number of the preset webpages to be downloaded.
  • the information obtaining apparatus determines the number of the webpages to be downloaded as the first preset page number and downloads the preset webpages based on the first preset page number.
  • the information obtaining apparatus determines the number of the webpages to be downloaded as the second preset page number and downloads the preset webpages based on the second preset page number.
  • the information obtaining apparatus downloads in order and parses a first page content of the webpages to be downloaded. Further, the information obtaining apparatus judges whether the number of pages of the downloaded webpages matches the number of the webpages that are determined to be downloaded.
  • keywords of the first page are searched, and then the information obtaining apparatus downloads and parses a second page based on the keywords. Such matching/downloading is repeated until all webpages to be downloaded are downloaded.
  • the information obtaining apparatus automatically searches the keywords of the webpages and automatically downloads the linked content corresponding to the keywords.
  • the keywords may include 'Next Page', page number, or similar words or phrases, etc. For instance, if the number of webpages to be downloaded is 5, the first page is downloaded and parsed first. Then the information obtaining apparatus searches the keywords in the first page. If the keyword in the first page is 'Next Page', the information obtaining apparatus automatically downloads and parses the linked content corresponding to 'next page,' which is the second page. The downloading process can be repeated until the fifth page content is downloaded.
  • Step 203 the information obtaining apparatus extracts body contents of at least two pages of the preset webpages, and splices and outputs the body contents of the preset webpages in a predetermined sequence.
  • the information obtaining apparatus extracts body content of at least two pages of the preset webpages, and splices and outputs the body content of the preset webpages in a predetermined sequence. Therefore, the user may browse webpages more conveniently without interference from non-body content information, enjoying an immersive reading status.
  • the body content includes, but not limited to, images, text, or videos.
  • the information obtaining apparatus trims non-body content information of the downloaded webpages and reformats the trimmed body content as pure contents to obtain body content of the preset webpages.
  • the non-body content information includes, but not limited to, page header, footer, advertising information, etc.
  • the body content is reformatted as plain text which is similar to book text style, or as other content formats, as long as the non-body contents of the pages can be removed and the remaining contents are reformatted or republished such that the effects of the non-body contents are no longer visible.
  • spacing among pages may also be removed or adjusted.
  • the information obtaining apparatus may remove the spacing between the pages such that the user can read the reformatted contents without any page separation for continuous content reading.
  • the spacing between the pages may be adjusted to fit the terminal screen used by the user to view the contents. Thus, pure text contents can be displayed for the user, improving the user's reading experience.
  • the information obtaining apparatus may determine the network access type of the client so that the reading mode can be further adjusted to fit the user's needs, requirements, or configurations. For example, based on the access point information of the client, the information obtaining apparatus judges whether the network access of the client is charged according to traffic amount.
  • Step 204 when the network access of the client is not charged according to traffic, and after receiving an access request for displaying next page from the client, the information obtaining apparatus downloads webpages after the first number of preset pages.
  • the client receives a request for displaying next page content from the user or for displaying more pages from the user, the information obtaining apparatus
  • Step 205 when the network access of the client is charged according to traffic amount, the information obtaining apparatus obtains the total number of spliced pages cached on the client and judges whether the splicing number of the current page exceeds a threshold value. If the splicing number of the current page exceeds the threshold value, the information obtaining apparatus discards assigned webpages of the current page based on the discard condition and downloads the webpages after the second preset pages.
  • the information obtaining apparatus obtains the splicing number of the current page cached in the client.
  • the information obtaining apparatus discards the content that meets the discard condition, and downloads and parses the content that has not been downloaded previously from the network request to display the next page.
  • the discard condition may be based on a preset threshold value.
  • a threshold value When a threshold value is exceeded, the information obtaining apparatus discards the assigned webpages of the current page.
  • the threshold value may be a fixed value.
  • the threshold value may also be dynamically adjusted based on the current remaining memory and/or network condition.
  • the assigned webpages may be the first one or more pages of the current webpage.
  • the information obtaining apparatus downloads at least two pages of the preset webpages when receiving a request for accessing the preset webpages sent from the client. Then, the information obtaining apparatus extracts body content of at least two pages of the preset webpages. The information obtaining apparatus splices and outputs the body content of the preset webpages in a predetermined sequence. That is, when the client receives an access request from a user, the information obtaining apparatus downloads body content of at least two pages of the preset webpages. Then, the information obtaining apparatus splices and outputs the downloaded content in a clean, clutter-free format. Therefore, the user may browse webpages more conveniently without interference from non-body content information, improving the user's reading experience. Further, the next page is obtained without having to click next page link every time by the user, reducing the user's operation and time waiting for the Internet response after each clicking of next page, and further improving the user's reading experience.
  • Figure 3 illustrates a structure diagram of an exemplary information obtaining apparatus consistent with the disclosed embodiments.
  • the information obtaining apparatus includes a downloading module 301, an extraction module 302, and an output module 303.
  • the downloading module 301 is configured to download at least two pages of preset webpages when receiving a request for accessing the preset webpages sent from a client.
  • the extraction module 302 is configured to extract body content of at least two pages of the preset webpages.
  • the output module 303 is configured to splice and output the body content of the preset webpages in a predetermined sequence.
  • Figure 4 illustrates a structure diagram of another exemplary information obtaining apparatus consistent with the disclosed embodiments.
  • the information obtaining apparatus also includes a determination module 304, in addition to downloading module 301, extraction module 302, and output module 303.
  • the determination module 304 is configured to determine the number of preset webpages to be downloaded before downloading at least two pages of preset webpages.
  • the determination module 304 may further include an obtaining unit 304a and a determination unit 304b.
  • the obtaining unit 304a is configured to obtain access point information of the client.
  • the determination unit 304b is configured to judge whether the network access of the client is charged according to traffic amount, based on the access point information of the client. If the network access of the client is not charged according to traffic amount, the determination module determines to download the first number of preset pages from the preset webpages; if the network access of the client is charged according to traffic amount, the determination module determines to download the second number of preset pages from the preset webpages.
  • the downloading module 301 is also configured to download the webpages after the first number of preset pages when receiving a request for displaying the next page from the client.
  • the downloading module 301 is also configured to obtain the splicing number of the current page cached on the client and judges whether the splicing number of the current page exceeds a threshold value. If the splicing number of the current page exceeds the threshold value, the downloading module 301 discards the assigned webpages of the current page and downloads the webpages after the second number of preset pages.
  • the extraction module 302 is further configured to trim non-body content information of the downloaded webpages and reformat or republish the trimmed body content to obtain body content of the preset webpages.
  • each functional module is listed only for illustrative purposes. In practical applications, the above functions are implemented by different functional modules according to the needs. That is, the internal structure of the device for obtaining information is divided into different functional modules to complete all or part of the functions described above.
  • the information obtaining apparatus downloads at least two pages of the preset webpages when receiving a request for accessing the preset webpages sent from the client. Then, the information obtaining apparatus extracts body content of at least two pages of the preset webpages. The information obtaining apparatus splices and outputs the body content of the preset webpages in a predetermined sequence. That is, when the client receives an access request from a user, the information obtaining apparatus downloads body content of at least two pages of the preset webpages. Then, the information obtaining apparatus splices and outputs the downloaded content in a clean, clutter-free format.
  • the user may browse webpages more conveniently without interference from non-body content information, improving the user's reading experience. Further, the next page is obtained without having to click next page link every time by the user, reducing the user's operation and time waiting for the Internet response after each clicking of next page, and further improving the user's reading experience.

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

A method is provided for obtaining information on the Internet. The method includes an information obtaining apparatus changing from a paging mode to a reading mode of a client. The method also includes the information obtaining apparatus downloading at least two pages of preset webpages when receiving a request for accessing the preset webpages sent from the client. Further, the method includes the information obtaining apparatus extracting body content of the at least two pages of the preset webpages. The method includes the information obtaining apparatus splicing and outputting the body content of the preset webpages in a predetermined sequence.

Description

METHOD AND APPARATUS FOR OBTAINING INFORMATION
CROSS-REFERENCES TO RELATED APPLICATIONS
[0001] This application claims priority of Chinese Patent Application No.
201210350647.8, filed on September 20, 2012, the entire contents of which are incorporated by reference herein.
FIELD OF THE INVENTION
[0002] The present disclosure generally relates to computer network technologies and, more particularly, to an information obtaining method and apparatus.
BACKGROUND
[0003] With the rapid development of mobile terminals, browsers are becoming one important entry of mobile Internet. More and more users use mobile browser to read novels or view pictures. However, webpages for continuous reading in the browser are stored in separate pages, i.e., a paging mode, and spacing between adjacent pages is relatively large. A user may need to drag the current webpage over a long distance when the user wants to read the next page. In addition, there is a lot of information that the user does not need to read in many webpages, e.g., advertising, repeated titles, etc. Such information that the user does not need to read further interferes with the user's reading of the content in the webpage body.
[0004] The disclosed methods and apparatus are directed to solve one or more problems set forth above and other problems. BRIEF SUMMARY OF THE DISCLOSURE
[0005] One aspect of the present disclosure includes a method for obtaining information on the Internet. The method includes an information obtaining apparatus changing from a paging mode to a reading mode of a client. The method also includes the information obtaining apparatus downloading at least two pages of preset webpages when receiving a request for accessing the preset webpages sent from the client. Further, the method includes the information obtaining apparatus extracting body content of the at least two pages of the preset webpages. The method includes the information obtaining apparatus splicing and outputting the body content of the preset webpages in a predetermined sequence. [0006] Another aspect of the present disclosure includes an information obtaining apparatus. The information obtaining apparatus includes a downloading module, an extraction module, and an output module. The downloading module is configured to download at least two pages of preset webpages when receiving a request for accessing the preset webpages sent from a client. The extraction module is configured to extract body content of at least two pages of the preset webpages. Further, the output module is configured to splice and output the body content of the preset webpages in a predetermined sequence.
[0007] Other aspects of the present disclosure can be understood by those skilled in the art in light of the description, the claims, and the drawings of the present disclosure.
BRIEF DESCRIPTION OF THE DRAWINGS
[0008] In order to more clearly illustrate technical solutions of the present disclosure or the existing technology, the figures which are needed to be used in the description of the present disclosure or the existing technology are briefly described in the following. Obviously, the figures in the following description are only some embodiments of the present disclosure, and it is easily for those skilled in the art to obtain other figures based on the following figures without creative work.
[0009] Figure 1 illustrates a flow diagram of an exemplary information obtaining method consistent with the disclosed embodiments;
[0010] Figure 2 illustrates a flow diagram of another exemplary information obtaining method consistent with the disclosed embodiments;
[0011] Figure 3 illustrates a structure diagram of an exemplary information obtaining apparatus consistent with the disclosed embodiments; [0012] Figure 4 illustrates a structure diagram of another exemplary information obtaining apparatus consistent with the disclosed embodiments;
[0013] Figure 5 illustrates an exemplary operating environment incorporating certain disclosed embodiments; and
[0014] Figure 6 illustrates a block diagram of an exemplary computer system consistent with the disclosed embodiments.
DETAILED DESCRIPTION
[0015] Reference will now be made in detail to exemplary embodiments of the invention, which are illustrated in the accompanying drawings.
[0016] Figure 5 illustrates an exemplary operating environment 500 incorporating certain disclosed embodiments. As shown in Figure 5, environment 500 may include a terminal 504, the Internet 503, and a server 502. The Internet 503 may include any appropriate type of communication network for providing network connections to the terminal 504 and the server 502 or among multiple terminals and servers. For example, Internet 503 may include the Internet or other types of computer networks or telecommunication networks, either wired or wireless.
[0017] A server, as used herein, may refer to one or more server computers configured to provide certain web server functionalities to provide certain personalized services, which may require any user accessing the services to authenticate to the server before the access. A web server may also include one or more processors to execute computer programs in parallel.
[0018] The server 502 may include any appropriate server computers configured to provide certain server functionalities, such as a file server functionality for responding a user's request for obtaining information operations or other application server. Although only one server is shown, any number of servers can be included. The server 502 may be operated in a cloud or non-cloud computing environment.
[0019] Terminal 504 may include any appropriate type of mobile computing devices, such as mobile phones, smart phones, tablets, notebook computers, or any type of computing platform. A terminal (e.g., terminal 504) may include one or more clients 501. The client 501, as used herein, may include any appropriate mobile application software, hardware, or a combination of application software and hardware to achieve certain client functionalities. For example, client 501 may include a browser, etc. According to actual needs in different terminals, a mobile client may be a browser installed on the terminal for browsing, including various types of existing and future browser installed on terminals. Although only one client 501 is shown in the environment 500, any number of clients 501 may be included. [0020] Terminal 504, client 501, and/or server 502 may be implemented on any appropriate computing platform. Figure 6 illustrates a block diagram of an exemplary computer system 600 capable of implementing terminal 504, client 501, and/or server 502.
[0021] As shown in Figure 6, computer system 600 may include a processor 602, a storage medium 604, a monitor 606, a communication module 608, a database 610, and peripherals 612. Certain devices may be omitted and other devices may be included.
[0022] Processor 602 may include any appropriate processor or processors. Further, processor 602 can include multiple cores for multi-thread or parallel processing. Storage medium 604 may include memory modules, such as Read-only memory (ROM), Random Access Memory (RAM), flash memory modules, and erasable and rewritable memory, and mass storages, such as CD-ROM, U-disk, and hard disk, etc. Storage medium 604 may store computer programs for implementing various processes, when executed by processor 602.
[0023] Further, peripherals 612 may include I/O devices such as keyboard and mouse, and communication module 608 may include network devices for establishing connections through the communication network. Database 610 may include one or more databases for storing certain data and for performing certain operations on the stored data, such as database searching.
[0024] In operation, terminals/clients and servers 502 may interact with each other to provide an information obtaining service to the user(s) of the terminals. Figure 1 illustrates a flow diagram of an exemplary information obtaining process consistent with the disclosed embodiments. [0025] As shown in Figure 1, the information obtaining process includes the following steps:
[0026] Step 101: an information obtaining apparatus downloads at least two pages of preset webpages when receiving a request for accessing the preset webpages sent from a client.
[0027] Step 102: the information obtaining apparatus extracts body content of the at least two pages of the preset webpages.
[0028] Step 103: the information obtaining apparatus splices and outputs the body content of the preset webpages in a predetermined sequence.
[0029] Before the information obtaining apparatus downloads at least two pages of preset webpages, the information obtaining apparatus may also determine the total number of the preset webpages to be downloaded.
[0030] More specifically, when determining the number of the preset webpages to be downloaded, the information obtaining apparatus may obtain access point information of the client and, based on the access point information of the client, the information obtaining apparatus judges whether network access of the client is charged according to traffic amount. If the network access of the client is not charged according to traffic, the information obtaining apparatus determines to download the first number of preset pages of the preset webpages; if the network access of the client is charged according to traffic, the information obtaining apparatus determines to download the second number of preset pages of the preset webpages.
[0031] Optionally, when the network access of the client is not charged according to traffic, and after the information obtaining apparatus splices and outputs the body content of the preset webpages in a predetermined sequence, the information obtaining process further includes that, when receiving a request for displaying the next page from the client, the information obtaining apparatus downloads the webpages after the first preset pages.
[0032] Further, when the access type of the client is charged according to traffic, after the information obtaining apparatus splices and outputs the body content of the preset webpages in a predetermined sequence, the information obtaining process further includes:
[0033] The information obtaining apparatus obtains the total number of spliced pages of the current page cached on the client and judges whether the number of the spliced pages exceeds a threshold value. If the number of the spliced pages exceeds the threshold value, the information obtaining apparatus discards certain webpages (e.g., designated webpages) of the current page and downloads webpages after the second number of preset pages.
[0034] In addition, when extracting body content of at least two pages of the preset webpages, the information obtaining apparatus may trims non-body content information of the downloaded webpages and reformats the trimmed body content as pure document or pure text to obtain the body content of the preset webpages. [0035] Therefore, the information obtaining apparatus downloads at least two pages of the preset webpages when receiving a request for accessing the preset webpages sent from the client. Then, the information obtaining apparatus extracts body content of at least two pages of the preset webpages. The information obtaining apparatus splices and outputs the body content of the preset webpages in the predetermined sequence. That is, when the client receives an access request from a user, the information obtaining apparatus downloads body content of at least two pages of the preset webpages. Then, the information obtaining apparatus splices and outputs the downloaded content as pure text in a clean, clutter-free format. Therefore, the user may browse webpages more conveniently when using the mobile terminal without interference from non-body content information, improving the user's reading experience.
[0036] Figure 2 illustrates a flow diagram of another exemplary information obtaining process consistent with the disclosed embodiments.
[0037] A preset browser (i.e., a client) is provided for the user of the terminal. When a user uses the preset browser (e.g., a mobile browser) to read novels or view pictures on a terminal screen, a reading mode is provided for the user. Under the provided reading mode, when the user uses the preset browser to read lengthy graphic and text information, such as novels, the information obtaining apparatus automatically downloads webpages that the user may read, via intelligent judgment, and splices the previous page and the next page together in a layout similar to reading layout, allowing the user to enter an immersive reading state.
[0038] As described, the terminal may include any appropriate type of mobile computing devices, such as mobile phones, smart phones, tablets, notebook computers, or any type of computing platform. The client, as used herein, may include any appropriate mobile application software, hardware, or a combination of application software and hardware to achieve certain client functionalities. There are no specific limitations on the client, and the information obtaining apparatus may refer to either or both of the terminal and the client.
[0039] In practical applications, according to different types of network access of the client, a fast-reading mode and a traffic- saving reading mode are provided in the browser for the users. If the network access of the client is not charged according to actual traffic, the fast- reading mode may be selected. Under the fast-reading mode, because the network environment is relatively good, when receiving an access request from the client, the information obtaining apparatus may download more network contents. [0040] For example, the information obtaining apparatus may download and parse the first number of preset webpages. After the first number of preset webpages are parsed, the parsed webpages are stored in the cache and put on a display list to wait for being displayed. For example, if the first number of preset webpages is N, N pages of webpages are downloaded successively, and the downloaded webpages are parsed and cached in a display list.
[0041] Further, although the (N+l)th page is not parsed, the source code of (N+l)th page may be stored in the (N+l)th space in the cache. When receiving a request for displaying the next page from the client/user, the content of the (N+l)th page is downloaded, and the
downloaded webpage content is parsed and put on the display list. Thus, the next page content may be displayed by local parsing operation, thereby avoiding the time spent in waiting for requesting the network to receive data again.
[0042] The network environment that is not charged according to actual traffic may include, but not limited to, WiFi, LAN, etc. The first number of the preset webpages may be 2, 3, 5, etc., and the information obtaining apparatus may determine the first number based on user configuration or based on particular applications. Further, the information obtaining apparatus may adjust the first number based on the network environment. For example, the first number of the preset pages may be set to 5 in a desired network environment, or the first number of the preset pages may be set to 3 in a less-desired network environment.
[0043] If the network access of the client is charged according to actual traffic, the traffic-saving reading mode is selected. Under the traffic-saving reading mode, in order to save the traffic generated by the client, when receiving an access request from the client, the information obtaining apparatus may download the second number of preset webpages. The second number may be 2, 3, etc., and the information obtaining apparatus may adjust the second number of the preset pages based on traffic charge of the client. The network environment that is charged according to actual traffic may include General Packet Radio Service (GPRS) or other wireless networks, etc.
[0044] Further, under the traffic-saving reading mode, only page information currently displayed is cached, and there is no (N+l)th unparsed page downloaded and stored in the (N+l)th space. When a discard condition is satisfied, the information obtaining apparatus discards an old page and downloads and parses the (N+l)th page to be displayed.
[0045] The discard condition may be based on the total number of spliced pages, i.e., the total pages reformatted by removing page spacing and other non-body content, which may be set to a threshold value and may be adjusted dynamically based on the available cache and/or the network access condition. If the spliced page number reaches the threshold value, the oldest page (e.g., the most front page) may be discarded and the new page can be downloaded, parsed, and displayed.
[0046] More particularly, as shown in Figure 2, the information obtaining process includes the following steps:
[0047] Step 201: the information obtaining apparatus determines the number of preset webpages to be downloaded when receiving a request for accessing preset webpages sent from a client.
[0048] Before downloading the preset webpages, based on the current network access type of the client, the information obtaining apparatus determines the number of the preset webpages to be downloaded. More specifically, to determine the number of the preset webpages to be downloaded, the information obtaining apparatus may first obtain access point information of the client.
[0049] Based on the access point information of the client, the information obtaining apparatus judges whether the network access of the client is charged according to traffic. If the network access of the client is not charged according to traffic, the information obtaining apparatus determines to download the first number of preset webpages from the preset webpages. On the other hand, if the network access of the client is charged according to traffic, the information obtaining apparatus determines to download the second number of preset pages from the preset webpages.
[0050] Specifically, when the client uses the preset browser, the webpages are opened according to a current operating mode, i.e., the paging mode. A 'reading mode' option button may be provided on the displayed pages under the paging mode for the user to change the paging mode into the 'reading mode.' If a user selects the 'reading mode' button, the 'reading mode' is used in the preset browser of the client. If the user does not select the 'reading mode' button, the default paging mode is used by the user, that is, the next page content is obtained by clicking 'next page' every time. Of course, the reading mode may be selected by other methods.
[0051] Step 202: the information obtaining apparatus downloads the preset webpages based on the determined number of the preset webpages to be downloaded.
[0052] For example, when the network access type of the client is WiFi access, the information obtaining apparatus determines the number of the webpages to be downloaded as the first preset page number and downloads the preset webpages based on the first preset page number. When the network access type of the client is GPRS access, the information obtaining apparatus determines the number of the webpages to be downloaded as the second preset page number and downloads the preset webpages based on the second preset page number.
[0053] More specifically, when downloading the preset webpages based on the determined number of the preset webpages to be downloaded, the information obtaining apparatus downloads in order and parses a first page content of the webpages to be downloaded. Further, the information obtaining apparatus judges whether the number of pages of the downloaded webpages matches the number of the webpages that are determined to be downloaded.
[0054] If there is a match, the step of downloading the preset webpages is paused.
Otherwise, keywords of the first page are searched, and then the information obtaining apparatus downloads and parses a second page based on the keywords. Such matching/downloading is repeated until all webpages to be downloaded are downloaded.
[0055] For example, after determining the number of the webpages to be downloaded, the information obtaining apparatus automatically searches the keywords of the webpages and automatically downloads the linked content corresponding to the keywords. The keywords may include 'Next Page', page number, or similar words or phrases, etc. For instance, if the number of webpages to be downloaded is 5, the first page is downloaded and parsed first. Then the information obtaining apparatus searches the keywords in the first page. If the keyword in the first page is 'Next Page', the information obtaining apparatus automatically downloads and parses the linked content corresponding to 'next page,' which is the second page. The downloading process can be repeated until the fifth page content is downloaded. [0056] Step 203: the information obtaining apparatus extracts body contents of at least two pages of the preset webpages, and splices and outputs the body contents of the preset webpages in a predetermined sequence.
[0057] To improve the user's reading experience, the information obtaining apparatus extracts body content of at least two pages of the preset webpages, and splices and outputs the body content of the preset webpages in a predetermined sequence. Therefore, the user may browse webpages more conveniently without interference from non-body content information, enjoying an immersive reading status. The body content includes, but not limited to, images, text, or videos.
[0058] Specifically, when extracting body content of at least two pages of the preset webpages, the information obtaining apparatus trims non-body content information of the downloaded webpages and reformats the trimmed body content as pure contents to obtain body content of the preset webpages. The non-body content information includes, but not limited to, page header, footer, advertising information, etc. The body content is reformatted as plain text which is similar to book text style, or as other content formats, as long as the non-body contents of the pages can be removed and the remaining contents are reformatted or republished such that the effects of the non-body contents are no longer visible.
[0059] Further, spacing among pages may also be removed or adjusted. For example, the information obtaining apparatus may remove the spacing between the pages such that the user can read the reformatted contents without any page separation for continuous content reading. Or the spacing between the pages may be adjusted to fit the terminal screen used by the user to view the contents. Thus, pure text contents can be displayed for the user, improving the user's reading experience. [0060] In addition, the information obtaining apparatus may determine the network access type of the client so that the reading mode can be further adjusted to fit the user's needs, requirements, or configurations. For example, based on the access point information of the client, the information obtaining apparatus judges whether the network access of the client is charged according to traffic amount.
[0061] Step 204: when the network access of the client is not charged according to traffic, and after receiving an access request for displaying next page from the client, the information obtaining apparatus downloads webpages after the first number of preset pages.
[0062] For example, when the network access type of the client is WIFI access, after the current page is displayed, the client receives a request for displaying next page content from the user or for displaying more pages from the user, the information obtaining apparatus
automatically downloads the content that is not yet downloaded in the preset webpages. The request of displaying a new webpage is triggered automatically after the previous webpage is displayed. Therefore, the user may smoothly browse the webpages by using this method when the network speed is relatively slow.
[0063] Step 205: when the network access of the client is charged according to traffic amount, the information obtaining apparatus obtains the total number of spliced pages cached on the client and judges whether the splicing number of the current page exceeds a threshold value. If the splicing number of the current page exceeds the threshold value, the information obtaining apparatus discards assigned webpages of the current page based on the discard condition and downloads the webpages after the second preset pages.
[0064] The information obtaining apparatus obtains the splicing number of the current page cached in the client. When the content cached in the client meets the discard condition, the information obtaining apparatus discards the content that meets the discard condition, and downloads and parses the content that has not been downloaded previously from the network request to display the next page.
[0065] The discard condition may be based on a preset threshold value. When a threshold value is exceeded, the information obtaining apparatus discards the assigned webpages of the current page. The threshold value may be a fixed value. The threshold value may also be dynamically adjusted based on the current remaining memory and/or network condition. The assigned webpages may be the first one or more pages of the current webpage.
[0066] Thus, the information obtaining apparatus downloads at least two pages of the preset webpages when receiving a request for accessing the preset webpages sent from the client. Then, the information obtaining apparatus extracts body content of at least two pages of the preset webpages. The information obtaining apparatus splices and outputs the body content of the preset webpages in a predetermined sequence. That is, when the client receives an access request from a user, the information obtaining apparatus downloads body content of at least two pages of the preset webpages. Then, the information obtaining apparatus splices and outputs the downloaded content in a clean, clutter-free format. Therefore, the user may browse webpages more conveniently without interference from non-body content information, improving the user's reading experience. Further, the next page is obtained without having to click next page link every time by the user, reducing the user's operation and time waiting for the Internet response after each clicking of next page, and further improving the user's reading experience.
[0067] Figure 3 illustrates a structure diagram of an exemplary information obtaining apparatus consistent with the disclosed embodiments. As shown in Figure 3, the information obtaining apparatus includes a downloading module 301, an extraction module 302, and an output module 303.
[0068] The downloading module 301 is configured to download at least two pages of preset webpages when receiving a request for accessing the preset webpages sent from a client. The extraction module 302 is configured to extract body content of at least two pages of the preset webpages. The output module 303 is configured to splice and output the body content of the preset webpages in a predetermined sequence.
[0069] Figure 4 illustrates a structure diagram of another exemplary information obtaining apparatus consistent with the disclosed embodiments. As shown in Figure 4, the information obtaining apparatus also includes a determination module 304, in addition to downloading module 301, extraction module 302, and output module 303.
[0070] The determination module 304 is configured to determine the number of preset webpages to be downloaded before downloading at least two pages of preset webpages. The determination module 304 may further include an obtaining unit 304a and a determination unit 304b.
[0071] The obtaining unit 304a is configured to obtain access point information of the client. The determination unit 304b is configured to judge whether the network access of the client is charged according to traffic amount, based on the access point information of the client. If the network access of the client is not charged according to traffic amount, the determination module determines to download the first number of preset pages from the preset webpages; if the network access of the client is charged according to traffic amount, the determination module determines to download the second number of preset pages from the preset webpages. [0072] In addition, when the network access of the client is not charged according to traffic amount, after the output module 303 splices and outputs the body content of the preset webpages in a predetermined sequence, the downloading module 301 is also configured to download the webpages after the first number of preset pages when receiving a request for displaying the next page from the client.
[0073] When the network access of the client is charged according to traffic amount, after the output module 303 splices and outputs the body content of the preset webpages in a predetermined sequence, the downloading module 301 is also configured to obtain the splicing number of the current page cached on the client and judges whether the splicing number of the current page exceeds a threshold value. If the splicing number of the current page exceeds the threshold value, the downloading module 301 discards the assigned webpages of the current page and downloads the webpages after the second number of preset pages.
[0074] The extraction module 302 is further configured to trim non-body content information of the downloaded webpages and reformat or republish the trimmed body content to obtain body content of the preset webpages.
[0075] It should be noted that, in the above server and terminal device for obtaining information, each functional module is listed only for illustrative purposes. In practical applications, the above functions are implemented by different functional modules according to the needs. That is, the internal structure of the device for obtaining information is divided into different functional modules to complete all or part of the functions described above.
[0076] Those skilled in the art should understand that all or part of the steps in the above method may be executed by relevant hardware instructed by a program, and the program may be stored in a computer-readable storage medium such as a read only memory, a magnetic disk, a Compact Disc (CD), and so on.
[0077] The embodiments disclosed herein are exemplary only and not limiting the scope of this disclosure. Without departing from the spirit and scope of this invention, other modifications, equivalents, or improvements to the disclosed embodiments are obvious to those skilled in the art and are intended to be encompassed within the scope of the present disclosure.
INDUSTRIAL APPLICABILITY AND ADVANTAGEOUS EFFECTS
[0078] Without limiting the scope of any claim and/or the specification, examples of industrial applicability and certain advantageous effects of the disclosed embodiments are listed for illustrative purposes. Various alternations, modifications, or equivalents to the technical solutions of the disclosed embodiments can be obvious to those skilled in the art and can be included in this disclosure.
[0079] By using the disclosed methods and apparatus for obtaining information, thus, the information obtaining apparatus downloads at least two pages of the preset webpages when receiving a request for accessing the preset webpages sent from the client. Then, the information obtaining apparatus extracts body content of at least two pages of the preset webpages. The information obtaining apparatus splices and outputs the body content of the preset webpages in a predetermined sequence. That is, when the client receives an access request from a user, the information obtaining apparatus downloads body content of at least two pages of the preset webpages. Then, the information obtaining apparatus splices and outputs the downloaded content in a clean, clutter-free format. Therefore, the user may browse webpages more conveniently without interference from non-body content information, improving the user's reading experience. Further, the next page is obtained without having to click next page link every time by the user, reducing the user's operation and time waiting for the Internet response after each clicking of next page, and further improving the user's reading experience.

Claims

What is claimed is:
1. A method for obtaining information, comprising:
changing from a paging mode to a reading mode of a client; downloading, by an information obtaining apparatus, at least two pages of preset webpages when receiving a request for accessing the preset webpages sent from the client; extracting, by the information obtaining apparatus, body content of the at least two pages of the preset webpages; and splicing and outputting, by the information obtaining apparatus, the body content of the preset webpages in a predetermined sequence.
2. The method according to claim 1, before downloading at least two pages of preset webpages, further comprising:
determining, by the information obtaining apparatus, a number of the preset webpages to be downloaded.
3. The method according to claim 2, wherein determining the number of the preset webpages to be downloaded further comprises:
obtaining, by the information obtaining apparatus, access point information of the client; and determining, by the information obtaining apparatus and based on the access point information of the client, whether network access of the client is charged according to traffic amount, when it is determined that the network access of the client is not charged according to traffic amount, using a fast-reading mode to download the preset webpages; and when it is determined that the network access of the client is charged according to traffic amount, using a traffic-saving reading mode to download the preset webpages.
4. The method according to claim 3, wherein using the fast-reading mode and the traffic- saving reading mode further comprises:
when the network access of the client is not charged according to traffic amount, the information obtaining apparatus determines to download a first number of preset pages from the preset webpages; and when the network access of the client is charged according to traffic amount, the information obtaining apparatus determines to download a second number of preset pages from the preset webpages.
5. The method according to claim 3, wherein, under the fast-reading mode and provided that the first number of preset pages is N, the method further comprises:
parsing and storing the N number of downloaded pages in a cache; putting the N number of downloaded pages on a display list; and downloading, without parsing, a (N+l)th webpage in a (N+l) space in the cache without putting the (N+l)th page on the display list.
6. The method according to claim 4, when the network access of the client is not charged according to traffic amount, after splicing and outputting the body content of the preset webpages in a predetermined sequence, further comprising:
downloading, by the information obtaining apparatus, webpages after the first number of preset pages when receiving a request for displaying a next page from the client.
7. The method according to claim 4, when the network access of the client is charged according to traffic amount, after splicing and outputting the body content of the preset webpages in a predetermined sequence, further comprising:
obtaining, by the information obtaining apparatus, a number of spliced pages of the current page cached on the client; and judging, by the information obtaining apparatus, whether the number of spliced pages of the current page exceeds a threshold value, wherein: when the number of spliced pages of the current page exceeds the threshold value, the information obtaining apparatus discards assigned webpages of the current page and downloads a webpage after the second number of preset pages.
8. The method according to claim 1, wherein extracting body content of at least two pages of the preset webpages further comprises:
trimming non-body content information of the downloaded preset webpages; and republishing the trimmed content to create body content of the preset webpages.
9. The method according to claim 8, wherein extracting body content of at least two pages of the preset webpages further comprises:
removing at least page header, footer, and advertising information from the downloaded preset webpages to obtain the trimmed content; and removing page spacing from the downloaded preset webpages such that contents of the downloaded preset webpages are displayed continuously.
10. The method according to claim 1 , wherein changing from a paging mode to a reading mode of a client further comprises: receiving a user selection from a reading mode button on a webpage displayed; and changing the paging mode to the reading mode based on the user selection.
11. A apparatus for obtaining information, comprising:
a downloading module configured to download at least two pages of preset webpages when receiving a request for accessing the preset webpages sent from a client; an extraction module configured to extract body content of at least two pages of the preset webpages; and an output module configured to splice and output the body content of the preset webpages in a predetermined sequence.
12. The apparatus according to claim 11, further comprising:
a determination module configured to determine a number of the preset webpages to be downloaded before downloading at least two pages of the preset webpages.
13. The apparatus according to claim 12, wherein the determination module further comprises:
an obtaining unit configured to obtain access point information of the client; and a determination unit configured to determine whether the network access of the client is charged according to traffic amount, based on the access point information of the client, when it is determined that the network access of the client is not charged according to traffic amount, to use a fast-reading mode to download the preset webpages; and when it is determined that the network access of the client is charged according to traffic amount, to use a traffic-saving reading mode to download the preset webpages.
14. The apparatus according to claim 13, wherein:
when the network access of the client is not charged according to traffic amount, the determination unit determines to download a first number of preset pages from the preset webpages; and when the network access of the client is charged according to traffic amount, the determination unit determines to download a second number of preset pages from the preset webpages.
15. The apparatus according to claim 13, wherein, under the fast-reading mode and provided that the first number of preset pages is N, the information obtaining apparatus is further configured to:
parse and store the N number of downloaded pages in a cache; put the N number of downloaded pages on a display list; and download, without parsing, a (N+l)th webpage in a (N+l) space in the cache without putting the (N+l)th page on the display list.
16. The apparatus according to claim 14, wherein, when the network access of the client is not charged according to traffic amount, after the output module splices and outputs the body content of the preset webpages in a predetermined sequence, the downloading module is configured to:
download webpages after the first number of preset pages when receiving a request for displaying a next page from the client.
17. The apparatus according to claim 14, wherein, when the network access of the client is charged according to traffic amount, after the output module splices and outputs the body content of the preset webpages in a predetermined sequence, the downloading module is also configured to:
obtain a number of spliced pages of the current page cached on the client; and judge whether the number of spliced pages of the current page exceeds a threshold value, wherein: when the number of spliced pages of the current page exceeds the threshold value, the downloading module discards assigned webpages of the current page and downloads a webpage after the second number of preset pages.
18. The apparatus according to claim 11, wherein the extraction module is further configured to:
trim non-body content information of the downloaded preset webpages; and republish the trimmed content to create body content of the preset webpages.
19. The apparatus according to claim 18, wherein the extraction module is further configured to:
remove at least page header, footer, and advertising information from the downloaded preset webpages to obtain the trimmed content; and remove page spacing from the downloaded preset webpages such that contents of the downloaded preset webpages are displayed continuously.
20. The apparatus according to claim 11 , wherein the information obtaining apparatus is further configured to:
receive a user selection from a reading mode button on a webpage displayed; and change a paging mode to a reading mode based on the user selection.
PCT/CN2013/083508 2012-09-20 2013-09-13 Method and apparatus for obtaining information WO2014044154A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP13839230.3A EP2898433A4 (en) 2012-09-20 2013-09-13 Method and apparatus for obtaining information
US14/082,510 US20140082484A1 (en) 2012-09-20 2013-11-18 Method and apparatus for obtaining information

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201210350647.8A CN103678393B (en) 2012-09-20 2012-09-20 The method and apparatus for obtaining information
CN201210350647.8 2012-09-20

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US14/082,510 Continuation US20140082484A1 (en) 2012-09-20 2013-11-18 Method and apparatus for obtaining information

Publications (1)

Publication Number Publication Date
WO2014044154A1 true WO2014044154A1 (en) 2014-03-27

Family

ID=50315974

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2013/083508 WO2014044154A1 (en) 2012-09-20 2013-09-13 Method and apparatus for obtaining information

Country Status (3)

Country Link
EP (1) EP2898433A4 (en)
CN (1) CN103678393B (en)
WO (1) WO2014044154A1 (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104156233B (en) * 2014-07-21 2016-05-18 小米科技有限责任公司 Light application off-line update method, device and terminal
US9672026B2 (en) 2014-07-21 2017-06-06 Xiaomi Inc. Light app offline updating method, device and terminal
CN105468629B (en) * 2014-09-04 2019-06-14 北大方正集团有限公司 The implementation method of mobile device number reporting system, apparatus and system
CN107145441B (en) * 2017-04-12 2020-12-29 阿里巴巴(中国)有限公司 Page display method and device
CN112800043A (en) * 2021-02-05 2021-05-14 凯通科技股份有限公司 Internet of things terminal information extraction method, device, equipment and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6222634B1 (en) 1997-07-11 2001-04-24 International Business Machines Corporation Apparatus and method for printing related web pages
EP1324226A1 (en) 2001-12-27 2003-07-02 Samsung Electronics Co., Ltd. Apparatus and method for rendering web page html data into a format suitable for display on the screen of a wireless mobile station
CN101325602A (en) * 2008-07-30 2008-12-17 广州市动景计算机科技有限公司 Method and system for pre-reading web page by micro-browser intelligently
US20110138267A1 (en) * 2009-12-09 2011-06-09 Lg Electronics Inc. Mobile terminal and method of controlling the operation of the mobile terminal
CN102123168A (en) * 2011-01-14 2011-07-13 广州市动景计算机科技有限公司 Web page pre-reading and integration method and system based on relay server
CN102214211A (en) * 2011-05-20 2011-10-12 广州市动景计算机科技有限公司 Webpage content browsing method and device as well as mobile terminal

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7441047B2 (en) * 2002-06-17 2008-10-21 Microsoft Corporation Device specific pagination of dynamically rendered data

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6222634B1 (en) 1997-07-11 2001-04-24 International Business Machines Corporation Apparatus and method for printing related web pages
EP1324226A1 (en) 2001-12-27 2003-07-02 Samsung Electronics Co., Ltd. Apparatus and method for rendering web page html data into a format suitable for display on the screen of a wireless mobile station
CN101325602A (en) * 2008-07-30 2008-12-17 广州市动景计算机科技有限公司 Method and system for pre-reading web page by micro-browser intelligently
US20110138267A1 (en) * 2009-12-09 2011-06-09 Lg Electronics Inc. Mobile terminal and method of controlling the operation of the mobile terminal
CN102123168A (en) * 2011-01-14 2011-07-13 广州市动景计算机科技有限公司 Web page pre-reading and integration method and system based on relay server
CN102214211A (en) * 2011-05-20 2011-10-12 广州市动景计算机科技有限公司 Webpage content browsing method and device as well as mobile terminal

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP2898433A4 *

Also Published As

Publication number Publication date
EP2898433A1 (en) 2015-07-29
CN103678393B (en) 2018-06-15
EP2898433A4 (en) 2016-06-08
CN103678393A (en) 2014-03-26

Similar Documents

Publication Publication Date Title
CA2865187C (en) Method and system relating to salient content extraction for electronic content
RU2618910C2 (en) Method and device for displaying information
US9910932B2 (en) System and method for completing a user query and for providing a query response
RU2522103C2 (en) Update notification method and browser
JP5133984B2 (en) Input candidate providing device, input candidate providing system, input candidate providing method, and input candidate providing program
WO2013178094A1 (en) Page display method and device
EP2898433A1 (en) Method and apparatus for obtaining information
US20130305131A1 (en) Method, system and computer storage medium for pre-reading network data
US11157576B2 (en) Method, system and terminal for performing search in a browser
US20180239834A1 (en) Data transmission method and device
US11308275B2 (en) Management of content tailoring by services
CN106708885A (en) Method and device for achieving searching
CN102523296B (en) Method, device and system for optimizing wireless webpage browsing resources
US20140082484A1 (en) Method and apparatus for obtaining information
EP3080722A1 (en) Web page rendering on wireless devices
CN102033926B (en) Page content processing method and device
WO2008132706A1 (en) A web browsing method and system
US10621260B2 (en) Method and device for restoring pages of browser
US9485330B2 (en) Web browser operation method and system
US11307897B2 (en) Resource pre-fetch using age threshold
WO2014019467A1 (en) A web browser operation method and system
CN105589870B (en) Method and system for filtering webpage advertisements
CN111915348A (en) Method, system and storage medium for inserting advertisement in electronic book
CN112016017A (en) Method and device for determining characteristic data
KR20120016335A (en) Web page pre-caching system and method for offline-executing

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 13839230

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 2013839230

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: DE