WO2008141426A1 - Method and system for an aggregate web site search database - Google Patents
Method and system for an aggregate web site search database Download PDFInfo
- Publication number
- WO2008141426A1 WO2008141426A1 PCT/CA2008/000908 CA2008000908W WO2008141426A1 WO 2008141426 A1 WO2008141426 A1 WO 2008141426A1 CA 2008000908 W CA2008000908 W CA 2008000908W WO 2008141426 A1 WO2008141426 A1 WO 2008141426A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- data
- web
- web page
- page
- database
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/957—Browsing optimisation, e.g. caching or content distillation
- G06F16/9577—Optimising the visualization of content, e.g. distillation of HTML documents
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/06—Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/10—Office automation; Time management
Definitions
- the present application relates generally to telecommunications and more particularly to a system and method for an aggregate web site search database.
- Web sites host and provide information using web pages that are communicated electronically via a telecommunications network. Accessing this information by some client computing devices can be challenging. Computing devices are becoming smaller and increasingly utilize wireless connectivity. Examples of such computing devices include portable computing devices that include wireless network browsing capability as well as telephony and personal information management capabilities.
- Figure 1 is schematic representation of a system for content navigation.
- Figure 2 is a schematic representation of a wireless communication device from Figure 1.
- Figure 3 illustrates a flow of interactions among components of the system of Figure 1.
- Figure 4 is a schematic representation of an aggregate web site search database for e-commerce.
- Figure 5 is a schematic representation of tables in an aggregate web site search database for e-commerce.
- Figure 6 illustrates a method of creating the aggregate web site search database.
- Figure 7 illustrates a method of querying the aggregate web site search database.
- Figures 8A-8D and 9A-9D respectively illustrate representative web pages rendered on a first browser window and portions of said representative web pages transcoded and rendered on a second browser window in accordance with an embodiment.
- signature schema instructions identify a web page family for the web page and extract desired data from the web page in accordance with its web page family.
- the instructions use signatures previously identified within web pages of the same family to distinguish the web page family (e.g. in accordance with a shared template for each family) from others of the web site and to distinguish the desired data from other data for the web page family.
- a server may make one or more requests to obtain web pages from various web sites and apply respective signature schemas maintained in a repository coupled to the engine. Extracted desired data may be stored to a database coupled to the engine to facilitate querying of the data and enable aggregate results to be presented to a client machine (e.g. a wireless communication device) or enable regeneration of original web pages based upon the signature schema.
- a method of aggregating web site data from one or more web sites comprising: sending a page request to a web site selected from the one or more web sites; receiving the requested web page from the selected web site; retrieving signature schema associated with the requested web page wherein the signature scheme identifies data fields within the requested web page; applying signature schema to the requested web page to extract data from the requested web page; and storing extracted data to an aggregate database, wherein the aggregate database comprises data extracted from the one or more web sites.
- a system for aggregating web site data from one or more web sites comprising: at least one computing device comprising a processor and a memory coupled thereto, said memory storing instructions and data for configuring the processor to: send a page request to a web site selected from the one or more web sites; receive a web page from the selected web site based upon the sent page request; retrieve signature schema associated with the requested web page; apply signature schema to the requested web page data to extract data identified by the signature schema; and store extracted data to an aggregate database comprising data extracted from the one ore more web sites.
- a computer program product storing computer readable instructions which when executed by a computer processor configure the processor for: sending a page request to a web site selected from the one or more web sites; receiving the requested web page from the selected web site; retrieving signature schema associated with the requested web page wherein the signature scheme identifies data fields within the requested web page; applying signature schema to the requested web page to extract data from the requested web page; and storing extracted data to an aggregate database, wherein the aggregate database comprises data extracted from the one or more web sites.
- a method of aggregating web site data from one or more web sites comprising: sending a page request to a web site selected from the one or more web sites; receiving the requested web page from the selected web site; retrieving signature schema associated with the requested web page wherein the signature scheme identifies data fields within the web and wherein the signature schema are extensible Markup Language (XML) documents comprising query language for extracting data from the requested web page; applying signature schema to the received web page to extract data from the requested web page; storing extracted data to an aggregate database, wherein the aggregate database comprises data extracted from the one or more web sites; receiving a search query from a client machine for data stored in the aggregate database; generating a database query based upon the received search query; and retrieving data from the aggregate database defined by the query.
- the client machine may be a wireless device.
- system 100 comprises one or more client computing devices in the form of client machines 102A and 102B (collectively 102), web site servers 106, 107 and 109 respectively host web sites 104, 103 and 105 and a gateway and schema server 120.
- Machines 102 are respectively coupled to communicate with gateway and schema server 120 to obtain web pages (e.g. 110) transcoded from web sites 103, 104 and 105 and to access aggregate data from the web sites through web server 125 hosting web site 150.
- web pages e.g. 110
- web sites 103, 104 and 105 host web sites which contain data that is to be aggregated into database 126.
- web site 104 comprises a web server 106 serving web pages (e.g. 110) defined from one or more web page family templates 108A-108D (collectively 108) and web page content (described further herein below) from data store 112.
- gateway and schema server 120 is coupled to a schema repository 124 from which to obtain a signature schema 122 for a particular web site.
- Signature schema documents (e.g.122) provide instructions and data with which an engine 140 of server 120 can extract data from web pages (e.g.
- Gateway and schema server 120 may also be coupled to a database 126 for retrieving/storing data extracted from web sites in accordance with its operations.
- the database 126 may be a relational database for storing extracted data objects and elements and their relationships from web sites in relation to the defined signature schema.
- the stored data can be accessed by a Structured Query Language (SQL) to retrieve desired data from database 126.
- SQL Structured Query Language
- Signature schemas for respective web sites may be defined (e.g. coded) using a computing device 128 as described herein below.
- a web server 125 is coupled to the aggregate web site database 126 enables access to the aggregated web site database 126 data by a web site 150.
- the web server 125 can also provide a data collection engine 152, or web crawler, for sending requests to web sites 103, 104 and 105 for desired page and provide content to schema engine 140 for processing.
- Representative client machines 102 include any type of computing or electronic device that can be used to communicate and interact with content available via web sites. Each of the client machines 102 may be operated by a respective user U (not shown). Interaction with a particular user includes presenting information on a client machine (e.g. by rendering on a display screen) as well as receiving input at a client machine (e.g. such as via a keyboard for transmitting to a web site).
- client machine 102A comprises a mobile or wireless electronic device with the combined functionality of a personal digital assistant, cell phone, email paging device, and a web-browser.
- Such a mobile electronic device may comprise a keyboard (or other input device(s)), a display screen, a speaker, (and other output device(s) (e.g. LEDs)) and a chassis for housing such components.
- the chassis may further house one or more central processing units, volatile memory (e.g. random access memory), persistent memory (e.g. Flash read only memory) and network interfaces to allow client machine 102A to communicate over the telecommunication network.
- volatile memory e.g. random access memory
- persistent memory e.g. Flash read only memory
- FIG. 2 a schematic block diagram shows an exemplary client machine 102A in greater detail. It should be emphasized that the structure in Figure 2 is purely exemplary, and contemplates a device that may be used for both wireless voice (e.g. telephony) and wireless data (e.g. email, web browsing, text) communications.
- Client machine 102A includes one or more input devices which in a present embodiment includes a keyboard and, typically, additional input buttons, collectively 200, an optional pointing device 202 (e.g. a trackball or trackwheel) and a microphone 204. Other input devices, such as a touch screen, and camera lens are also contemplated.
- Processor 208 may be further operatively coupled with a volatile storage unit 212 (e.g. read only memory (“ROM”), Erasable Electronic Programmable Read Only Memory (“EEPROM”), or Flash Memory) and a volatile storage unit 216 (e.g. random access memory (“RAM”) speaker 220, display screen 224 and one or more lights (LEDs 222).
- ROM read only memory
- EEPROM Erasable Electronic Programmable Read Only Memory
- RAM random access memory
- Processor 208 may be operatively coupled for network communications via a subsystem 226.
- Wireless communications are effective via at least one radio (e.g. 228) such as for Wi-Fi or cellular wireless communications.
- Client machine 102A also may be configured for wired communications such as via a USB or other port and for short range wireless communications such as via a Bluetooth® radio (all not shown).
- Programming instructions that implement the functional teachings of client machine 102A as described herein are typically maintained, persistently, in non-volatile storage unit 212 and used by processor 208 which makes appropriate utilization of volatile storage 216 during the execution of such programming instructions.
- non-volatile storage unit 212 persistently maintains a web browser application 86 and, in the present embodiment, a native menu application 82, each of which can be executed on processor 208 making use of non-volatile storage 216 as appropriate.
- non-volatile storage unit 212 An operating system and various other applications (not shown) are maintained in non-volatile storage unit 212 according to the desired configuration and functioning of client machine 102A, one specific non-limiting example of which is a contact manager application (also known as an address book, not shown) which stores a list of contacts, addresses and phone numbers of interest to user U and allows user U to view, update, and delete those contacts, as well as providing user U an option to initiate telecommunications (e.g. telephone, email, instant message (IM), short message service (SMS)) directly from that contact manager application.
- telecommunications e.g. telephone, email, instant message (IM), short message service (SMS)
- Native menu application 82 may be configured to provide menu choices to user U according to the particular application (or other context) that is being accessed.
- user U can activate menu application 82 to access one or more menu choices available that are respective to contact manger application 90.
- menu choices may include options to invoke other applications (e.g. a mapping application to map a contact's address) or communication functions (e.g. call, SMS, IM, email, etc.) on the client machine 102A for a particular contact.
- Menu application 82 may be associated to a particular input button (e.g.
- buttons 200 and invoked to provide a contextual menu comprised of one or more menu choices that are reflective of the context in which the button 200 was selected.
- options in a contextual menu are stored within non-volatile storage 212 as being specifically associated with a respective application.
- Menu application 82 may be therefore configured to generate one or more different contextual menus that are reflective of the particular context in which the menu application 82 is invoked. For example, in an email application where an email is being composed, invoking menu application 82 would generate a contextual menu that included the options of sending the email, cancelling the email, adding addresses to the email, adding attachments, and the like. The contents for such a contextual menu would also be maintained in non-volatile storage 212. Other examples of contextual menus will occur to those of ordinary skill in the art.
- Figures 8A-8D and 9A-9D respectively illustrate representative web pages rendered on a first browser window and portions of a subset of data from said representative web pages transcoded and rendered on a second browser window in accordance with an embodiment.
- Figure 8A illustrates a representative home web page 860A of an e-commerce web site (e.g. 104) in a browser window 850.
- Window 850 is illustrative of a rendering to a large size display device (e.g. desktop monitor).
- Web page 860A comprises, among other things, a menu portion 852 and a primary content display portion 854, in the example, showing various advertisements 855 for products.
- Figure 9A illustrates the menu portion 852 extracted and transcoded and rendered as a web page on a second browser window 950.
- Window 950 is illustrative of a rendering to a small size display device (e.g. of a wireless mobile device).
- menu portion 852 may be transcoded for menu application 82 e.g. for invocation when browsing the site 104 as referenced further herein.
- Figure 8B illustrates an exemplary product web page 860B in window 850 showing various product data (collectively 866) including image 866A, price 866, title 866C and description 866D data that is transcoded and shown in window 950 of Figure 9B. Also transcoded is the web page hierarchy list 868 showing where the page is on the web site.
- Figure 8C illustrates an exemplary product list web page 860C in window 850 showing a list of products (collectively 870). A subset of the product data such as image 870A, price 870B, and title 870C is transcoded and shown in window 950 of Figure 9C. Note that multiple pages 872 may be provided for the list 870.
- Figure 8D illustrates an exemplary account checkout web page 860D in window 850 showing a login form 880 for receiving account login and password, which form is transcoded and shown in window 950 of Figure 9D. Though not shown, other checkout pages (e.g. for payment or order confirmation, etc.), search pages, product and information pages may be similarly transcoded.
- web servers 106, 107 and 109 and gateway and schema server 120 can be based on any commonly available server environments or platforms including a module that houses one or more central processing units, volatile memory (e.g. random access memory), persistent memory (e.g. hard disk devices) and network interfaces to allow servers 106, 107, 109 and 120 to communicate over the telecommunications network.
- Web servers hosts software applications comprising instructions and data for generating and serving web pages dynamically from the template families 108 and current informational content therefore from data store 112. Load balancing, security/firewall, billing, account and other applications may also be present as is well-known in the art.
- Gateway and schema server 120 hosts software applications comprising instructions and data for proxying requests and responses between the client machines 102 and web sites 103, 104 and 105.
- engine 140 may be implemented in software to apply the signature schemas to web pages from web sites.
- an interpreter that interprets the signature schema document and applies the actions against the web page code (as an ASCII (plain text) to extract desired data to produce a result set.
- a renderer may be provided to express the desired data result set (i.e. transcode to a target format such as cHTML (Compact HTML) for a mobile device browser) for transmitting to the client machines also in accordance with the signature schema.
- the web server 125 provides web pages to requesting client machine through a browser or application on the client for rendering.
- the web data may be directly pushed to client machines 102A by e-mail or by other push based applications, or the data may be accessed by queries to web site 150 directly.
- the web site 150 may also extract content from the aggregate database 126 and apply signature schema 122 to the extracted database data, which schema may be configured to transcode the data in accordance with the target client machine 102A to tailor the output result.
- Machines 102 schema server 120, and web sites 103, 104, 105 and 125 are coupled via a telecommunication network (not shown) typically comprising one or more interconnected networks that may include wired and (at least for machine 102A) wireless networks.
- a telecommunication network typically comprising one or more interconnected networks that may include wired and (at least for machine 102A) wireless networks.
- the nature of the network is not particularly limited and is, in general, based on any combination of architectures that will support interactions between client machines 102 and servers 106, 107, 109, 125 and 120.
- the network includes the Internet as well as appropriate gateways and backhauls.
- a wireless network for client machine 102A may be based on core mobile network infrastructure (e.g. Global System for Mobile communications ("GSM”); Code Division Multiple Access (“CDMA”), Enhanced Data rates for GSM Evolution (“EDGE”), Evolution Data-Optimized (“EV- DO”), High Speed Downlink Packet Access (“HSPDA”), Universal Mobile Telecommunications System (“UMTS”), etc.) or on wireless local area network (“WLAN”) infrastructures such as the Institute for Electrical and Electronic Engineers (“IEEE”) 802.11 Standard (and its variants) or Bluetooth or the like or hybrids thereof.
- GSM Global System for Mobile communications
- CDMA Code Division Multiple Access
- EDGE Enhanced Data rates for GSM Evolution
- EV- DO Evolution Data-Optimized
- HPDA High Speed Downlink Packet Access
- UMTS Universal Mobile Telecommunications System
- IEEE Institute for Electrical and Electronic Engineers
- client machine 102B may be another type of client machine such as a PC (desktop or laptop) configured to include a full desktop computer or as a "thin-client". Typically such have larger display monitors/screens than portable machines like 102A.
- a wired network for system 100 and machine 102B can be based on a T1 , T3 or any other suitable wired connection.
- each of the client machines 102 is configured to interact with content available over the network, including web pages on web site 104.
- client machines 102A and 102B may navigate for content using a browser application (e.g. 86).
- browser application 86 may be a mini-browser in the sense that it may be configured to render web pages on the relatively small display 224 of client machine 102A. Often, during such rendering, those pages are presented in a format that may be different from how those pages are rendered on a traditional desktop browser application (e.g. browser 86 of client machine 102B).
- Mini-browsers typically attempt to convey substantially the same information as if the web pages had been rendered on a full browser such as Internet Explorer®, Safari® or Firefox® on a traditional desktop or laptop computer like client machine 102B.
- FIG. 3 is a flowchart illustrating operations/interactions for generating and maintaining an aggregate database from web sites 103-105 for populating and updating database 126.
- the flowchart provides an example of the interaction among the gateway and schema server 120/140 and data collection engine 152, with web servers 106, 107 and 109 hosting web sites 103, 104 and 105 to generate and maintain the aggregate database 126.
- the data collection engine 152 (DCE), makes a request 302 to the web site's web server (for example web server 106) for the specified pages based upon the type of data to be aggregated.
- the web page code (e.g. 110) is generated by server 106 and sent 306 to DCE 152.
- the web page code received is a text file.
- the schema server engine 140 (SSE) (for example, in parallel or without waiting for a response from server 106) makes a request 304 to the signature repository 124 for the signature schema document 122 for the web site, which request may use the domain in the URL as an identifier for obtaining the document 122.
- the schema server engine 140 receives 308 the schema and does not render the web page 110 per se but instead uses the instructions in the signature schema document 122 to extract the desired data from the web page 110.
- the signature schema 122 is configured to extract data from web page 110 in accordance with the specific desired content characteristics for the database 126 which is based upon the target client machines 102A, having knowledge of display 224 capabilities - such as screen size, resolution, and other parameters - useful in determining the way in which the transcoded data is to be displayed on the machine 102A.
- the web page 110 or extracted data is stored 310 in database 126 in a relational database structure, where only data related to the defined signature schema from the web site is stored.
- Client machine 102A can then make a request 312 to web site 150 on server 125 for a query to the database 126 regarding desired web sites having a specific domain (URL).
- Web site 150 requests 314 relevant data from the database 126.
- the results are extracted 316 and then sent 318 to the client machine as aggregate results, or as a proxy as if the query was made directly to the source web site, and transcoded in accordance with the schema 122, to the requesting client machine 102A processed by the signature schema engine 140 before presentation by web server 125.
- the data may be pushed to a client machine to a push based application.
- transcoded data 130 may comprise transcoded navigational data for menu application 82 and informational content data (e.g. a list of products and related information from a web page) for displaying by browser application 86. The process can then be repeated for each identified web site such as web sites 103 and 105.
- Signature schemas are pre-defined documents, and may be extensible Markup Language (XML) documents utilizing an SQL-like query language, to incorporate instructions and data with which to intelligently extract the data from web pages (which web pages are typically coded in HTML, DHTML, XHTML, XML, RSS, Javascript, etc). This extracted data may be transcoded and provided to client machines 102, used to dynamically generate a relational database (e.g. 126) or both.
- Each signature schema incorporates an understanding of a particular web site's data including relationships among the various data (e.g. among its primary informational content found in the body of its web pages as well as among such content and associated navigational data (e.g. web page links) that govern the data in the page.
- prior knowledge of the web page code including specific identifiers, tags and text (i.e. strings) used within the code may be used to define instructions to identify portions of the code of interest and to extract specific desired data.
- a signature schema document may be defined for all the pages of a particular web site.
- Large data-driven web sites e.g. 104 don't maintain thousands of individual web pages per se.
- the sites typically adopt a few page family templates 108 and dynamically populate these with pertinent content from database 112 comprising information (e.g. weather, stock data, news, shopping/product data, patent data, trade-mark data etc.) as applicable when a client requests a particular page.
- Each template represents a family of pages having objects and attributes. Below are representative example page family templates and their objects and attributes for a web site offering news and an e-commerce web site offering products for sale electronically:
- Objects lists a selection of news stories
- Objects lists a single news story (and optionally other related stories)
- Each family of pages can be identified by a "signature" or unique set of one or more features that automatically identifies a given page on a web site as part of the family and differentiates that family from another family of pages.
- each object and attribute field of interest can be identified with its respective unique signature within a family of pages.
- a signature schema document typically comprise numerous pieces of information (commands), for example, information that instructs the engine 140 for: identifying all page families; identifying and extracting data (i.e. desired objects and attributes) for each page family; capturing the (implicit or explicit) relationships between the objects and attributes; and transcoding the data.
- a signature schema document may also be configured to enable special functionality for the target web site including searching, logging in a user, purchasing items, etc.
- Engine 140 may be configured to receive web page code comprising text data and search through the text in accordance with the schema document instructions that provide SQL-query like language instructions. Engine 140 maintains a pointer within the text as it moves through the web page code performing various actions, as described below, in accordance with the schema instructions.
- Table 1 illustrates a snippet of a representative signature schema:
- instructions at line 4 are for verifying that the web page under consideration and the signature schema relate to the same web site/domain - eshop.ca.
- Instructions at lines 9-15 are for determining the particular page family to which the web page under consideration belongs. A respective signature that defines the particular page family has been previously identified for use to distinguish the page.
- id assigns an id to the page family that is used across the signature schema document.
- the instructions identify a web page using the alternative signatures "Compare products" or "Sort Products”. Web pages with these strings are of the same family type.
- the instructions at line 10 provide a reference tag to further instructions for this family, providing a link to instructions for the list_elements page family with and ID of mylist_1 (see lines 16-17).
- the other lookup instructions provide references to the specific instructions within the signature schema document for handling a web page of each web page family. Representative instructions for some of the web page families are provided in Table 1 , for example, at lines 16-17 and 18-29 with others omitted for brevity.
- the instruction at line 20 advances the scan pointer within the text file of the web page code to a beginning limit of a region of interest indicated by a signature reference. This establishes an upper limit for review within the text file. Though not shown in this table, an end limit may be defined as well (See Table 4). Further such instructions at lines 22-28 may comprise commands to locate desired data using "signatures” such as string identifiers that uniquely identify the data within the region of interest. In the present example the instructions locate and extract one or more elements, namely, product image, title, price, sale price and description for a product of the item web page family.
- the string returned is the path (relative URL at web site eshop.ca) to the product image.
- Table 1 shows at least some of the instructions (e.g. lines 23 -27) including one or more directional references relative to the signatures to locate and extract the desired data.
- such instructions may further include at least one of a start reference or an end reference further pinpointing the location of the desired data in accordance with that direction. Additional directional reference information is discussed herein with reference to code snippets in other Tables and the discussion of an embodiment of signature transcoding engine syntax presented below.
- the example within Table 1 demonstrates the extraction of data and the establishment of relationships between objects and elements within a same page of a web site.
- signature schema documents may further capture relevant attributes of an object across pages.
- a user of client machine 102A may click through a number of web pages in eshop.ca to get to a specific product page (e.g. Department -> Product Category -> Product Sub-Category -> Specific Product, such as TV & Video > 19"-21" TVs > LCD TVs > BrandX Product.
- the navigational hierarchy representing a categorization may be captured and associated to the extracted objects and there elements.
- Tables 2-4 provide representative instructions for further web page families for e-shop.ca that may be read with Table 1.
- Table 2 below provides representative instructions, e.g. for lines 16 and 17 of Table 1 , including instructions for a web page family related to a list of items/products for sale. Whereas instructions at lines 22-28 provided product data extraction instructions for a web page family showing a single item (i.e. product), the instructions of Table 2 provide additional instructions that repeat product data extractions for each product in the list.
- the engine 140 determines the location in the signature schema document that contains the signature for the objects and elements of that family and applies the instructions therefor.
- a product list at e-shop.ca may span multiple web pages. Instructions at lines 2-6 of Table 2 find the number of pages and generate the links for each of the pages. Instructions at lines 7-9 (action tag) advance the search scan pointer to the region of web page code that may be of interest (i.e. in this case, the start of the list). In this way, a local signature reference can be used and any earlier ambiguous references skipped. Skipping to the local region of interest may also make the specification of the signature reference less complicated.
- instructions at lines 10-16 (elements tag) of Table 2 provide product data extraction instructions that may be repeated for each product in the list.
- the instruction at line 15 moves the scan pointer to the end of the object (in this example a product in a list of products) to ready the instructions for application against the next object (product) in the list.
- search_elements id "mysearch_1 ">
- search_path value http://www.eshop.ca/search/search.asp" />
- the engine 140 If the engine 140 has identified that it is looking for a menu on a page that contains the menu style of the "mymenu_1" family, the engine applies the portion of the signature schema document that contains the signature for the objects and elements of that family, shown above in Table 4.
- ⁇ element>... ⁇ /element> Contains lookup instructions for each object element as previously described.
- lines 10 and 11 of Table 4 an element in 'mymenu_1 ' (each individual menu entry of web page) contains link and title as its properties.
- Line 12 instructs the engine to move the pointer to "</li>" to get ready to loop through and extract the next menu item with the same elements, taking advantage of the repeated patterns within the text of the web page code.
- Signature schema document 122 may further comprise transcoding instructions (not shown) for use by engine 140 to express the extracted desired data (which may be retrieved from database 126) in a target format (e.g. a format of HTML, XML, script etc.) for use by the requesting client machine 102.
- the transcoding instructions may define a web page for displaying the extracted data in browser application 86 that is suitable for display on the client machine 102.
- the formatting rules can be system and/or user defined and can include parameters such as but not limited to: object positioning, object colour, object size, object shape, object font/image characteristics, background style, and navigational item display (e.g. in a menu as described above) or for display with the content in the generated page on the client screen.
- Browser application 86 may be configured for using a markup language (e.g. cHTML) or other code format that is not identical to the code provided by web page 110.
- transcoding instructions may be defined to express the extracted desired data in XML or another code format such as for use by a different client application or plug-in to a client application such as menu application 82 or another application (not shown) on client machine 102.
- Signature schema documents may be prepared (i.e. coded) using a computing device such as computing device 128.
- Computing device 128 may be any suitable desktop or laptop device capable of coding documents (which may be but need not be XML-type documents) and may be configured to automate or semi-automate coding of such documents.
- Computing device 128 may be coupled to web site 104 to retrieve web pages from the site for reviewing to prepare the custom signature schema document for the site.
- Computing device 128 may be configured to automatically review the web page code and apply heuristics or other techniques (e.g. spatial analysis) to determine probable content of interest (i.e. desired data) and generate code to extract the desired data. For example, primary content of interest tends to be located toward the centre of the web page.
- the computing device may facilitate a user coding signature schema to manually assist with the analysis of the web page and identification of desired data and the generation of the instructions.
- Computing device 128 may be further coupled to repository 124 to provide (e.g. up-load or publish) coded signature schema documents for use by server 120.
- the lookup tag instructs the engine 140 to perform an insert, delete or query the document contents.
- Type Defines the data type of the lookup. Type may be "pex" for a string expression. Type may also support more advanced options such as regular expressions, API calls, and SQL queries.
- Action "replace_string”: Replace a string within the data with the "ref” identifier.
- Action "move_ptr”: Remove all characters in the data that exist before the location of the "ref” identifier.
- Action "end_ptr”: Remove all characters in the data that exist after the location of the "ref” identifier.
- Action "get_string” Extract a string based on the location of the "ref”, “start”, and “end” identifiers.
- ID is an identifier of another section within the signature. It allows the result of a query to trigger another set of actions within the signature. This is primarily used when identifying page types. Once a match has been made, specific instructions are executed that are marked with this ID. Recursive data structures (e.g. lists within lists) may also be supported.
- Max_lndex_Use_Ref is a Boolean value set to O or 1. It is used with Maxjndex. When set to O 1 the "maxjndex” will begin querying at the beginning of the data. When set to 1 , the "maxjndex” will begin querying from the "ref" identifier index.
- Gbl_append_[identifier] GbLappend appends a string passed via the url to the identifiers query value
- Tolerance is a Boolean value set to 0 or 1. It is used to return an empty string. By default tolerance is set to 0 which enforces that a property be found on a page, otherwise the page will be marked as "invalid" and an appropriate error message returned. When set to one, an empty value is returned for properties that can not be located.
- lnclude_sz is a Boolean value set to 0 or 1 and used with get_string. It is by default set to 0. When set to 1 it includes the "start" value and the "end” value as part of the result.
- lnclude_start is a Boolean value set to 0 or 1 and used with get_string. It is by default set to 0. When set to 1 it includes the "start" value as part of the result.
- lnclude_end is a Boolean value set to 0 or 1 and used with get_string. It is by default set to 0. When set to 1 it includes the "end" value as part of the result.
- Strip_tags 2": remove all br and script tags.
- Strip_tags "tag1 ,tag2...tagN”:remove all tag1 , tag2,...tagN leaving any tag not listed.
- the page syntax extracts the paging information from the data. This allows the end user the ability to change pages just as on the desktop.
- Page_variable Defines unique key that defines a family's paging feature.
- Page_start Defines value of first page in a family's paging feature.
- Page_post Path where paging variable(s) must be transmitted to.
- Page_start Defines value of first page in a family's paging feature.
- Pagejncrement Defines value that paging increases by for each page in a family's paging feature.
- Page_block Defines unique key that defines a family's paging block feature.
- Page_block_size Defines the size of the family's page block, (i.e. 10 items per page)
- UrLappend Append the unique key that defines a family's paging feature and the page number.
- Search_path Search path where search variable must be transmitted to
- Search_variable Name of search variable which a web site's search feature is looking to read, request, post, etc.
- UrLreplace Remove a portion of the url that is specific to posting search parameters
- Name is the name to display when browsing using the gateway 120
- Location defines the fully qualified address of the site.
- the advanced tag defines global properties for the site. This at a minimum includes the path to the initial page of the site.
- Indexjink specifies the path to the initial page of the site. This is usually the same page as the location property from the URL syntax. This field is always required.
- Appendjink Appends a string value to every URL requested for this site.
- No_purchase is a Boolean value 0 or 1. The default value is 0 which implies that an item should contain a purchase link. When true, the purchase link is removed.
- No_item is a Boolean value 0 or 1. The default value is 0 which implies that Item pages should show up in the breadcrumb. When true, the item is not added to the breadcrumb.
- Check_out is a Boolean value 0 or 1. The default value is 0 which implies that Item purchase link sends the request and control away from the gateway server 120. When true, then a checkout process has been created for use with gateway server 120.
- Product_img_width defines the width of all item images.
- Use_cookies Use_cookies a Boolean value 0 or 1. By default it is set to 0, and cookies are not passed to the site. When true, gateway 120 passes all cookies from client machine 102 to the site 104, and from the site 104 to the client machine. Page Type Syntax
- the page type is a collection of lookup queries that have an id associated with them. Lookup queries may be processed in a top down fashion. The first successful lookup will trigger another section in the signature schema document. For example, if the following evaluates to true:
- Elements include list_elements, menu_elements, item_elements, search_elements, form_elements.
- Each element has an ID. For example a menu element:
- the element may contain the following sub containers (settings, actions, elements, paging) which scope resides only within the element.
- Each element is associated with a specific rendering function.
- Settings syntax varies based on the type of element it resides in. Settings allow customizations that only apply to a specific page family.
- Blackjist - menu_elements Blackjist removes menu items with names that reside in the black list. Each entry is separated delimited (e.g. using two pound characters
- Pass_image - list_elements, search_elements Passjmage adds the image path to the url when requesting an item. The image added to the url will be used as the item image.
- Price[n] - item_elements Price[n] where n is an integer renames the rendered item with name price[n].
- Action - form_elements Overrides the action of a form displayed to the end user.
- Handle "display" - display the form to the end user.
- Cookie - form_elements Send additional cookies when posting this form.
- Input tag adds/modifies a form value with name [identifier] setting its value.
- Rename_[identifier] - form_elements Rename tag renames a form value with name [identifier]. Actions Syntax
- the actions tag primary function is data manipulation. It contains lookup queries that modify data with actions of "move_ptr" or "end_ptr”.
- System 100 may be implemented so that one or more web sites are coupled to the telecommunication network (either alone by a server 106 or by one or more web servers like web-server 106), and that a corresponding one or more schemas for each of those web sites (or each of the web pages therein, or both) can be maintained by gateway and schema server 120 and repository 124.
- Client machines 102 can be configured for proxied connection through different servers 120 and for accessing aggregated web site data from database 126.
- server 120 and web server 125 can be hosted by a variety of different parties, including, for example but without limitation: a manufacturer of client machine 102, a service provider that provides access to the telecommunication network on behalf of user U of a client machine 102; the entity that hosts web-site 104 or a third party intermediary. In web site host example it can even be desired to simply combine the web server 106 and schema server engine 120 on a single server to thereby obviate the need for separate servers. Alternatively the functionality of server 120 and web server 125 may be locally resident on the client machine providing.
- Figure 4 is a schematic representation of an aggregate web site search database 126.
- the database 126 contains one or more tables for storing aggregate web page data extracted from target web sites.
- the database may be a relational database enabling structured database queries and provides temporary/persistent storage of structured data as a whole or partially may be indexed for fast performance.
- the database 126 may contain tables defining a web site identification and category index 402 and category data 404 containing specific item details, as will be discussed in connection with Fig. 5.
- the category data 404 may be further divided into product data 406 sub-tables to store additional product related information.
- Indices 408 can be created to reference aggregate web site data to improve query responsiveness and results.
- Metadata 410 associated with the original web pages may be stored such as images, web page formatting or navigation data.
- the appropriate tables would be configured to store the extracted data based upon the signature schema.
- the database 126 may alternatively reside in client machine 102A memory. The client machine would generate the request to the desired websites and store extracted data locally.
- Figure 5 is a schematic representation of tables in an aggregate web site search database for e-commerce.
- the extracted data from selected e-commerce web sites can be formatted into category index 402 which indicates identification 502 of the web site where the data was extracted from and the category 504 of interest.
- an individual category 404 table can be created for each entry in the category index 402, an individual category 404 table can be created.
- the category table provides location identifiers 510 for each product retrieved from the web site based upon the defined signature schema and populated at step 310 during the retrieval of data from the web site.
- the vendor 512, title 514 of the product, price 516 and description 518 extracted can then be stored.
- the database then provides a means for querying the aggregate data from the web site to present meaningful information to the client machine 102A. By storing the aggregate data queries can be created to meet desired requirements and transcoded for presentation on the client machine 102A.
- Figure 6 illustrates a method of creating the aggregate web site search database.
- the web sites 103, 104 and 105 that are to be indexed are identified, and page request(s) 602 are sent to the web site to fetch pages by data collection engine 150 hosted on web server 125.
- the data collection engine 150 is a web crawler to independently collect data from target web sites for storage in aggregate database 126. Alternatively, the data collection engine 150 may be included in the engine 140 of gateway 120.
- the signature schema for the web site is retrieved 604 from repository 124 by engine 140.
- the schema is applied 606 to the received web site data to extract relevant data from the web page content.
- the extracted data is stored 608 to the aggregate database 126 in the appropriate tables.
- the process is repeated for each relevant web site to generate the aggregate web site search database.
- the data collection engine 150 can then periodically access web sites 103, 104 and 105 to ensure stored data is accurate and up to date.
- the schema is defined to extract elements with which objects and their attributes on the web page can be defined or described and the schema incorporates knowledge of what these objects and attributes represent, an intelligent and indexed database 126 can be defined.
- Figure 7 illustrates a method of querying the aggregate web site search database 126.
- a search query or request is generated by a user via an interface on client machine 102A and received 702 at gateway 120 or may be directed to the aggregate web site 150.
- the web site 150 processes the request and generates 704 the relevant database query such as SQL (Structured Query Language) to database 126.
- the relevant data is then retrieved 706 from the database 126.
- the retrieved data or search results can then be formatted and provided 708 accommodate client machine 102A characteristics in response to request 702 to the client machine 102A.
- the search result data may be formatted to accommodate the client machine browser. For example as shown in Fig 7C.
- the data may be pushed to a push application on the client machine 102A.
- the web site data may be retrieved periodically by data collection engine 152 which may be triggered manually, scheduled or on-going.
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Theoretical Computer Science (AREA)
- Strategic Management (AREA)
- Human Resources & Organizations (AREA)
- Entrepreneurship & Innovation (AREA)
- General Physics & Mathematics (AREA)
- Economics (AREA)
- Physics & Mathematics (AREA)
- Operations Research (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Marketing (AREA)
- Quality & Reliability (AREA)
- Tourism & Hospitality (AREA)
- General Business, Economics & Management (AREA)
- General Engineering & Computer Science (AREA)
- Development Economics (AREA)
- Educational Administration (AREA)
- Game Theory and Decision Science (AREA)
- Information Transfer Between Computers (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
A method and system for aggregating web site data from one or more web sites are provided. The method makes a request for a web page to a web site selected from one or more web sites and applies an associated signature schema that is maintained in a repository. The method uses the associated signature schema document that is predefined using a query language to provide instructions for applications to extract identified data from data fields within the requested web page. The extracted data is then stored to an aggregate database, wherein the aggregate database comprises data extracted from the one or more web sites.
Description
METHOD AND SYSTEM FOR AN AGGREGATE WEB SITE SEARCH DATABASE
CROSS REFERENCE
[0001] This application claims the benefit of the prior filing of U.S. Provisional Patent Application Serial No. 60/924503 filed May 17, 2007, the disclosure of which is incorporated herein by reference.
COPYRIGHT
[0002] A portion of the disclosure of this patent document contains material which is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or patent disclosure, as it appears in the Patent and Trademark Office patent file or records, but otherwise reserves all copyright rights.
FIELD
[0003] The present application relates generally to telecommunications and more particularly to a system and method for an aggregate web site search database.
BACKGROUND
[0004] Web sites host and provide information using web pages that are communicated electronically via a telecommunications network. Accessing this information by some client computing devices can be challenging. Computing devices are becoming smaller and increasingly utilize wireless connectivity. Examples of such computing devices include portable computing devices that include wireless network browsing capability as well as telephony and personal information management capabilities.
BRIEF DESCRIPTION OF THE DRAWINGS
[0005] Figure 1 is schematic representation of a system for content navigation.
[0006] Figure 2 is a schematic representation of a wireless communication device
from Figure 1.
[0007] Figure 3 illustrates a flow of interactions among components of the system of Figure 1.
[0008] Figure 4 is a schematic representation of an aggregate web site search database for e-commerce.
[0009] Figure 5 is a schematic representation of tables in an aggregate web site search database for e-commerce.
[0010] Figure 6 illustrates a method of creating the aggregate web site search database.
[0011] Figure 7 illustrates a method of querying the aggregate web site search database.
[0012] Figures 8A-8D and 9A-9D respectively illustrate representative web pages rendered on a first browser window and portions of said representative web pages transcoded and rendered on a second browser window in accordance with an embodiment.
DETAILED DESCRIPTION OF THE EMBODIMENTS
[0013] The smaller size of such client devices necessarily limits their display capabilities. Furthermore the wireless connections to such devices typically have less or more expensive bandwidth than corresponding wired connections. The Wireless Application Protocol ("WAP") was designed to address such issues, but WAP can still provide a very unsatisfactory experience or even completely ineffective experience, particularly where the small client device needs to effect a connection with web sites that host web pages that are directed to traditional full desktop browsers. In addition, the ability to access data from multiple web sites concurrently and extract relevant data can be difficult and time consuming.
[0014] Signature schema documents may be pre-defined using a query language, to provide instructions for application by an engine to extract data from web pages of respective web sites for storage to an aggregate database. For a particular web page, signature schema instructions identify a web page family for the web page and extract desired data from the web page in accordance with its web page family. The instructions use signatures previously identified within web pages of the same family to distinguish the web page family (e.g. in accordance with a shared template for each family) from others of the web site and to distinguish the desired data from other data for the web page family. A server may make one or more requests to obtain web pages from various web sites and apply respective signature schemas maintained in a repository coupled to the engine. Extracted desired data may be stored to a database coupled to the engine to facilitate querying of the data and enable aggregate results to be presented to a client machine (e.g. a wireless communication device) or enable regeneration of original web pages based upon the signature schema.
[0015] In the present disclosure there is provided a method of aggregating web site data from one or more web sites, the method comprising: sending a page request to a web site selected from the one or more web sites; receiving the requested web page from the selected web site; retrieving signature schema associated with the requested web page wherein the signature scheme identifies data fields within the requested web page; applying signature schema to the requested web page to extract data from the requested web page; and storing extracted data to an aggregate database, wherein the aggregate database comprises data extracted from the one or more web sites.
[0016] In the present disclosure there is provided a system for aggregating web site data from one or more web sites, the system comprising: at least one computing device comprising a processor and a memory coupled thereto, said memory storing instructions and data for configuring the processor to: send a page request to a web site selected from the one or more web sites; receive a web page from the selected web site based upon the sent page request; retrieve signature schema associated with the requested web page; apply signature schema to the requested web page data to extract
data identified by the signature schema; and store extracted data to an aggregate database comprising data extracted from the one ore more web sites.
[0017] In the present disclosure there is provided a computer program product storing computer readable instructions which when executed by a computer processor configure the processor for: sending a page request to a web site selected from the one or more web sites; receiving the requested web page from the selected web site; retrieving signature schema associated with the requested web page wherein the signature scheme identifies data fields within the requested web page; applying signature schema to the requested web page to extract data from the requested web page; and storing extracted data to an aggregate database, wherein the aggregate database comprises data extracted from the one or more web sites.
[0018] In the present disclosure there is provided a method of aggregating web site data from one or more web sites, the method comprising: sending a page request to a web site selected from the one or more web sites; receiving the requested web page from the selected web site; retrieving signature schema associated with the requested web page wherein the signature scheme identifies data fields within the web and wherein the signature schema are extensible Markup Language (XML) documents comprising query language for extracting data from the requested web page; applying signature schema to the received web page to extract data from the requested web page; storing extracted data to an aggregate database, wherein the aggregate database comprises data extracted from the one or more web sites; receiving a search query from a client machine for data stored in the aggregate database; generating a database query based upon the received search query; and retrieving data from the aggregate database defined by the query. The client machine may be a wireless device.
[0019] Referring now to Figure 1 , there is illustrated a system 100 for content navigation via a telecommunications network. In a present embodiment system 100 comprises one or more client computing devices in the form of client machines 102A and 102B (collectively 102), web site servers 106, 107 and 109 respectively host web
sites 104, 103 and 105 and a gateway and schema server 120. Machines 102 are respectively coupled to communicate with gateway and schema server 120 to obtain web pages (e.g. 110) transcoded from web sites 103, 104 and 105 and to access aggregate data from the web sites through web server 125 hosting web site 150.
[0020] In the present embodiment, web sites 103, 104 and 105 host web sites which contain data that is to be aggregated into database 126. For example, web site 104 comprises a web server 106 serving web pages (e.g. 110) defined from one or more web page family templates 108A-108D (collectively 108) and web page content (described further herein below) from data store 112. In the present embodiment of system 100, gateway and schema server 120 is coupled to a schema repository 124 from which to obtain a signature schema 122 for a particular web site. Signature schema documents (e.g.122) provide instructions and data with which an engine 140 of server 120 can extract data from web pages (e.g. 110) and transcode same to a target format to provide transcoded web page data (e.g. 130 and 132) to the respective requesting client machines 102A and 102B as described more fully below. Gateway and schema server 120 may also be coupled to a database 126 for retrieving/storing data extracted from web sites in accordance with its operations. The database 126 may be a relational database for storing extracted data objects and elements and their relationships from web sites in relation to the defined signature schema. The stored data can be accessed by a Structured Query Language (SQL) to retrieve desired data from database 126. Signature schemas for respective web sites may be defined (e.g. coded) using a computing device 128 as described herein below. A web server 125 is coupled to the aggregate web site database 126 enables access to the aggregated web site database 126 data by a web site 150. The web server 125 can also provide a data collection engine 152, or web crawler, for sending requests to web sites 103, 104 and 105 for desired page and provide content to schema engine 140 for processing.
[0021] Representative client machines 102 include any type of computing or electronic device that can be used to communicate and interact with content available via web sites. Each of the client machines 102 may be operated by a respective user U (not
shown). Interaction with a particular user includes presenting information on a client machine (e.g. by rendering on a display screen) as well as receiving input at a client machine (e.g. such as via a keyboard for transmitting to a web site). In the present embodiment, client machine 102A comprises a mobile or wireless electronic device with the combined functionality of a personal digital assistant, cell phone, email paging device, and a web-browser. Such a mobile electronic device may comprise a keyboard (or other input device(s)), a display screen, a speaker, (and other output device(s) (e.g. LEDs)) and a chassis for housing such components. The chassis may further house one or more central processing units, volatile memory (e.g. random access memory), persistent memory (e.g. Flash read only memory) and network interfaces to allow client machine 102A to communicate over the telecommunication network.
[0022] Referring now to Figure 2, a schematic block diagram shows an exemplary client machine 102A in greater detail. It should be emphasized that the structure in Figure 2 is purely exemplary, and contemplates a device that may be used for both wireless voice (e.g. telephony) and wireless data (e.g. email, web browsing, text) communications. Client machine 102A includes one or more input devices which in a present embodiment includes a keyboard and, typically, additional input buttons, collectively 200, an optional pointing device 202 (e.g. a trackball or trackwheel) and a microphone 204. Other input devices, such as a touch screen, and camera lens are also contemplated. Input from keyboard/buttons 200, pointing device 202 and microphone 204 may be received at a processor 208. Processor 208 may be further operatively coupled with a volatile storage unit 212 (e.g. read only memory ("ROM"), Erasable Electronic Programmable Read Only Memory ("EEPROM"), or Flash Memory) and a volatile storage unit 216 (e.g. random access memory ("RAM") speaker 220, display screen 224 and one or more lights (LEDs 222). Processor 208 may be operatively coupled for network communications via a subsystem 226. Wireless communications are effective via at least one radio (e.g. 228) such as for Wi-Fi or cellular wireless communications. Client machine 102A also may be configured for wired communications such as via a USB or other port and for short range wireless communications such as via a Bluetooth® radio (all not shown).
[0023] Programming instructions that implement the functional teachings of client machine 102A as described herein are typically maintained, persistently, in non-volatile storage unit 212 and used by processor 208 which makes appropriate utilization of volatile storage 216 during the execution of such programming instructions. Of particular note is that non-volatile storage unit 212 persistently maintains a web browser application 86 and, in the present embodiment, a native menu application 82, each of which can be executed on processor 208 making use of non-volatile storage 216 as appropriate. An operating system and various other applications (not shown) are maintained in non-volatile storage unit 212 according to the desired configuration and functioning of client machine 102A, one specific non-limiting example of which is a contact manager application (also known as an address book, not shown) which stores a list of contacts, addresses and phone numbers of interest to user U and allows user U to view, update, and delete those contacts, as well as providing user U an option to initiate telecommunications (e.g. telephone, email, instant message (IM), short message service (SMS)) directly from that contact manager application.
[0024] Native menu application 82 may be configured to provide menu choices to user U according to the particular application (or other context) that is being accessed. By way of example, while user U is activating the contact manager application, user U can activate menu application 82 to access one or more menu choices available that are respective to contact manger application 90. For example, menu choices may include options to invoke other applications (e.g. a mapping application to map a contact's address) or communication functions (e.g. call, SMS, IM, email, etc.) on the client machine 102A for a particular contact. Menu application 82 may be associated to a particular input button (e.g. one of buttons 200) and invoked to provide a contextual menu comprised of one or more menu choices that are reflective of the context in which the button 200 was selected. Note that the options in a contextual menu are stored within non-volatile storage 212 as being specifically associated with a respective application. Menu application 82 may be therefore configured to generate one or more different contextual menus that are reflective of the particular context in which the menu application 82 is invoked. For example, in an email application where an email is being
composed, invoking menu application 82 would generate a contextual menu that included the options of sending the email, cancelling the email, adding addresses to the email, adding attachments, and the like. The contents for such a contextual menu would also be maintained in non-volatile storage 212. Other examples of contextual menus will occur to those of ordinary skill in the art.
[0025] Figures 8A-8D and 9A-9D respectively illustrate representative web pages rendered on a first browser window and portions of a subset of data from said representative web pages transcoded and rendered on a second browser window in accordance with an embodiment. Figure 8A illustrates a representative home web page 860A of an e-commerce web site (e.g. 104) in a browser window 850. Window 850 is illustrative of a rendering to a large size display device (e.g. desktop monitor). Web page 860A comprises, among other things, a menu portion 852 and a primary content display portion 854, in the example, showing various advertisements 855 for products. Figure 9A illustrates the menu portion 852 extracted and transcoded and rendered as a web page on a second browser window 950. Window 950 is illustrative of a rendering to a small size display device (e.g. of a wireless mobile device). In addition to transcoding as a web page, menu portion 852 may be transcoded for menu application 82 e.g. for invocation when browsing the site 104 as referenced further herein.
[0026] Figure 8B illustrates an exemplary product web page 860B in window 850 showing various product data (collectively 866) including image 866A, price 866, title 866C and description 866D data that is transcoded and shown in window 950 of Figure 9B. Also transcoded is the web page hierarchy list 868 showing where the page is on the web site.
[0027] Figure 8C illustrates an exemplary product list web page 860C in window 850 showing a list of products (collectively 870). A subset of the product data such as image 870A, price 870B, and title 870C is transcoded and shown in window 950 of Figure 9C. Note that multiple pages 872 may be provided for the list 870.
[0028] Figure 8D illustrates an exemplary account checkout web page 860D in window 850 showing a login form 880 for receiving account login and password, which form is transcoded and shown in window 950 of Figure 9D. Though not shown, other checkout pages (e.g. for payment or order confirmation, etc.), search pages, product and information pages may be similarly transcoded.
[0029] Returning now to Figure 1 , web servers 106, 107 and 109 and gateway and schema server 120 (which can, if desired, be implemented on a single server) can be based on any commonly available server environments or platforms including a module that houses one or more central processing units, volatile memory (e.g. random access memory), persistent memory (e.g. hard disk devices) and network interfaces to allow servers 106, 107, 109 and 120 to communicate over the telecommunications network. Web servers hosts software applications comprising instructions and data for generating and serving web pages dynamically from the template families 108 and current informational content therefore from data store 112. Load balancing, security/firewall, billing, account and other applications may also be present as is well-known in the art.
[0030] Gateway and schema server 120 hosts software applications comprising instructions and data for proxying requests and responses between the client machines 102 and web sites 103, 104 and 105. In addition to software for maintaining HTTP communications, performing requests, maintaining sessions, handling cookies, etc., engine 140 may be implemented in software to apply the signature schemas to web pages from web sites. There may be provided an interpreter that interprets the signature schema document and applies the actions against the web page code (as an ASCII (plain text) to extract desired data to produce a result set. A renderer may be provided to express the desired data result set (i.e. transcode to a target format such as cHTML (Compact HTML) for a mobile device browser) for transmitting to the client machines also in accordance with the signature schema.
[0031] The web server 125 provides web pages to requesting client machine through a browser or application on the client for rendering. The web data may be directly pushed
to client machines 102A by e-mail or by other push based applications, or the data may be accessed by queries to web site 150 directly. The web site 150 may also extract content from the aggregate database 126 and apply signature schema 122 to the extracted database data, which schema may be configured to transcode the data in accordance with the target client machine 102A to tailor the output result.
[0032] Machines 102 schema server 120, and web sites 103, 104, 105 and 125 are coupled via a telecommunication network (not shown) typically comprising one or more interconnected networks that may include wired and (at least for machine 102A) wireless networks. It should now be understood that the nature of the network is not particularly limited and is, in general, based on any combination of architectures that will support interactions between client machines 102 and servers 106, 107, 109, 125 and 120. In a present embodiment the network includes the Internet as well as appropriate gateways and backhauls.
[0033] More specifically, in the present embodiment, a wireless network for client machine 102A may be based on core mobile network infrastructure (e.g. Global System for Mobile communications ("GSM"); Code Division Multiple Access ("CDMA"), Enhanced Data rates for GSM Evolution ("EDGE"), Evolution Data-Optimized ("EV- DO"), High Speed Downlink Packet Access ("HSPDA"), Universal Mobile Telecommunications System ("UMTS"), etc.) or on wireless local area network ("WLAN") infrastructures such as the Institute for Electrical and Electronic Engineers ("IEEE") 802.11 Standard (and its variants) or Bluetooth or the like or hybrids thereof. In the present embodiment of system 100 it is contemplated that client machine 102B may be another type of client machine such as a PC (desktop or laptop) configured to include a full desktop computer or as a "thin-client". Typically such have larger display monitors/screens than portable machines like 102A. A wired network for system 100 and machine 102B can be based on a T1 , T3 or any other suitable wired connection.
[0034] As previously stated in relation to Figures 1 and 2, each of the client machines 102 is configured to interact with content available over the network, including web pages on web site 104. In a present embodiment, client machines 102A and 102B may
navigate for content using a browser application (e.g. 86). As will be explained further below, on client machine 102A, browser application 86 may be a mini-browser in the sense that it may be configured to render web pages on the relatively small display 224 of client machine 102A. Often, during such rendering, those pages are presented in a format that may be different from how those pages are rendered on a traditional desktop browser application (e.g. browser 86 of client machine 102B). Mini-browsers typically attempt to convey substantially the same information as if the web pages had been rendered on a full browser such as Internet Explorer®, Safari® or Firefox® on a traditional desktop or laptop computer like client machine 102B.
[0035] Figure 3 is a flowchart illustrating operations/interactions for generating and maintaining an aggregate database from web sites 103-105 for populating and updating database 126. The flowchart provides an example of the interaction among the gateway and schema server 120/140 and data collection engine 152, with web servers 106, 107 and 109 hosting web sites 103, 104 and 105 to generate and maintain the aggregate database 126. The data collection engine 152 (DCE), makes a request 302 to the web site's web server (for example web server 106) for the specified pages based upon the type of data to be aggregated. The web page code (e.g. 110) is generated by server 106 and sent 306 to DCE 152. The web page code received is a text file. It typically does not include objects referenced by the code such as images, video, audio, further web pages, etc. that are typically subsequently retrieved and inserted at the time of rendering a web page by a browser. The schema server engine 140 (SSE) (for example, in parallel or without waiting for a response from server 106) makes a request 304 to the signature repository 124 for the signature schema document 122 for the web site, which request may use the domain in the URL as an identifier for obtaining the document 122. The schema server engine 140 receives 308 the schema and does not render the web page 110 per se but instead uses the instructions in the signature schema document 122 to extract the desired data from the web page 110. The signature schema 122 is configured to extract data from web page 110 in accordance with the specific desired content characteristics for the database 126 which is based upon the target client machines 102A, having knowledge of display 224 capabilities -
such as screen size, resolution, and other parameters - useful in determining the way in which the transcoded data is to be displayed on the machine 102A. The web page 110 or extracted data is stored 310 in database 126 in a relational database structure, where only data related to the defined signature schema from the web site is stored.
[0036] Client machine 102A can then make a request 312 to web site 150 on server 125 for a query to the database 126 regarding desired web sites having a specific domain (URL). Web site 150 requests 314 relevant data from the database 126. The results are extracted 316 and then sent 318 to the client machine as aggregate results, or as a proxy as if the query was made directly to the source web site, and transcoded in accordance with the schema 122, to the requesting client machine 102A processed by the signature schema engine 140 before presentation by web server 125. Alternatively, the data may be pushed to a client machine to a push based application. As noted above, transcoded data 130 may comprise transcoded navigational data for menu application 82 and informational content data (e.g. a list of products and related information from a web page) for displaying by browser application 86. The process can then be repeated for each identified web site such as web sites 103 and 105.
[0037] Signature schemas are pre-defined documents, and may be extensible Markup Language (XML) documents utilizing an SQL-like query language, to incorporate instructions and data with which to intelligently extract the data from web pages (which web pages are typically coded in HTML, DHTML, XHTML, XML, RSS, Javascript, etc). This extracted data may be transcoded and provided to client machines 102, used to dynamically generate a relational database (e.g. 126) or both. Each signature schema incorporates an understanding of a particular web site's data including relationships among the various data (e.g. among its primary informational content found in the body of its web pages as well as among such content and associated navigational data (e.g. web page links) that govern the data in the page. As described further herein below, prior knowledge of the web page code including specific identifiers, tags and text (i.e. strings) used within the code (sometimes referred to as "signatures" herein), may be used to define instructions to identify portions of the code of interest and to extract
specific desired data.
[0038] In accordance with the present embodiment, a signature schema document may be defined for all the pages of a particular web site. Large data-driven web sites (e.g. 104) don't maintain thousands of individual web pages per se. The sites typically adopt a few page family templates 108 and dynamically populate these with pertinent content from database 112 comprising information (e.g. weather, stock data, news, shopping/product data, patent data, trade-mark data etc.) as applicable when a client requests a particular page. Each template represents a family of pages having objects and attributes. Below are representative example page family templates and their objects and attributes for a web site offering news and an e-commerce web site offering products for sale electronically:
Example 1 : News site
Family: List Page
Objects: lists a selection of news stories
Attributes: Title, abstract and date
Family: Detail page
Objects: lists a single news story (and optionally other related stories)
Attributes: Journalist, City, Date, Title, Full Story, Image
Example 2: E-commerce site
Family: List Page
Objects: lists a selection of products
Attributes: Image, Item Name, Price, Sale Price
Family: Search Page (a specific kind of list page)
Objects: Similar to a list page
Attributes: Similar to a list page
[0039] Each family of pages (the family template) can be identified by a "signature" or unique set of one or more features that automatically identifies a given page on a web site as part of the family and differentiates that family from another family of pages.
Similarly each object and attribute field of interest can be identified with its respective unique signature within a family of pages. A signature schema document typically comprise numerous pieces of information (commands), for example, information that instructs the engine 140 for: identifying all page families; identifying and extracting data (i.e. desired objects and attributes) for each page family; capturing the (implicit or explicit) relationships between the objects and attributes; and transcoding the data.
[0040] A signature schema document may also be configured to enable special functionality for the target web site including searching, logging in a user, purchasing items, etc.
[0041] In accordance with a present embodiment, the structure and syntax of a representative signature schema document for a representative e-commerce site eshop.ca is shown and described. Engine 140 may be configured to receive web page code comprising text data and search through the text in accordance with the schema document instructions that provide SQL-query like language instructions. Engine 140 maintains a pointer within the text as it moves through the web page code performing various actions, as described below, in accordance with the schema instructions. Table 1 illustrates a snippet of a representative signature schema:
1 <?xml version="1.0" encoding="ISO-8859-1 " ?>
2 <site>
3 <version major="1 " minor="2"/>
4 <url location="http://www.eshop.ca" key="eshop.ca" name="E-Shop" />
5 <advanced> 6
7 <index_link value="http://www.eshop.ca/home.asp" />
8 </advanced>
9 <page_type>
10 <lookup type="pex" action="locate_string" name="list_elements" id="mylist_1 "
ref="Compare products" alt1="Sort products" />
11 <lookup type="pex" action="locate_string" name="item_elements" id="myitem_1 " ref=""product-details"" />
12 <lookup type="pex" action="locate_string" name="menu_elements" id=llmymenu_2" ref="anc-lhsnav-subltem" />
13 <lookup type="pex" action="locate_string" name="menu_elements" id="mymenu_1 " ref="product-table" />
14 <lookup type="pex" action="locate_string" name="item_elements" id="myitem_1" rβf ="*" />
15 </page_type>
16 <list_elements id="mylist_1 ">
17 </list_elements>
18 <item_θlements id="myitem_1 ">
19 <actions>
20 <lookup type="pex" action="move_ptr" ref="</head>" />
21 </actions>
22 <element>
23 <lookup type="pex" action="get_string" name="image" ref="largeimageref" location="after" start="<img src="" end=""" />
24 <lookup type="pex" action="get_string" name="title" ref="product- details-prd-title" location="after" start="<span" end="</span>" include_sz="1 " strip_tags="1 " />
25 <lookup type="pex" action="get_string" name="price" ref="our price:" location="after" start="<td" end="</td>" include_sz="1 " strip_tags="1 " />
26 <lookup type="pex" action="get_string" name="sale_price" ref="sale price:" location="after" start="<td" end="</td>" include_sz="1 " strip_tags="1 " tolerance="1 " />
27 <lookup type="pex" action="get_string" name=" description" ref="detailbox-text" location="middle" start="<p" end="</p>" include_sz="1 " strip_tags="1 " />
28 </element>
29 </item_elements>
Table 1 - XML Signature Schema Snippet for E-Shop.ca
[0042] In the XML code snippet of Table 1 , instructions at line 4 are for verifying that the web page under consideration and the signature schema relate to the same web site/domain - eshop.ca. Instructions at lines 9-15 are for determining the particular page
family to which the web page under consideration belongs. A respective signature that defines the particular page family has been previously identified for use to distinguish the page. The engine 140 processes the <page type> tag by registering the identification strings for each page family. When a web page is obtained by the engine as input, the engine may be able to identify the page family by its unique string ref=" and the command provides the related tag within the signature schema document where further instructions for the particular web pages are found: action="locate_string": command to check for the existence of a string. name=": identifies the type of page family for each identified family.
id=": assigns an id to the page family that is used across the signature schema document.
[0043] For example, at line 10, the instructions identify a web page using the alternative signatures "Compare products" or "Sort Products". Web pages with these strings are of the same family type. The instructions at line 10 provide a reference tag to further instructions for this family, providing a link to instructions for the list_elements page family with and ID of mylist_1 (see lines 16-17). Similarly the other lookup instructions provide references to the specific instructions within the signature schema document for handling a web page of each web page family. Representative instructions for some of the web page families are provided in Table 1 , for example, at lines 16-17 and 18-29 with others omitted for brevity.
[0044] With reference to the extraction instructions for one of the web page families (e.g. item_elements id="myitem_1") at lines 18-29, the instruction at line 20 advances the scan pointer within the text file of the web page code to a beginning limit of a region of interest indicated by a signature reference. This establishes an upper limit for review within the text file. Though not shown in this table, an end limit may be defined as well (See Table 4). Further such instructions at lines 22-28 may comprise commands to locate desired data using "signatures" such as string identifiers that uniquely identify the
data within the region of interest. In the present example the instructions locate and extract one or more elements, namely, product image, title, price, sale price and description for a product of the item web page family. For example, instructions at line 23 extract a string in between the first "<img src="" and """ that appears after next appearance of "largeimageref". The string returned is the path (relative URL at web site eshop.ca) to the product image. By advancing a search scan pointer within the web code to a desired location, references before that location can be skipped when searching. Any prior instances of a signature string such as "largeimageref" may be ignored. In this way, otherwise ambiguous signature references can be avoided.
[0045] The example in Table 1 shows at least some of the instructions (e.g. lines 23 -27) including one or more directional references relative to the signatures to locate and extract the desired data. For example, directional references such as "before" or "after" command the engine to extract desired data that is in a relative position in the web page before or after the signature string (i.e. ref=). Moreover, such instructions may further include at least one of a start reference or an end reference further pinpointing the location of the desired data in accordance with that direction. Additional directional reference information is discussed herein with reference to code snippets in other Tables and the discussion of an embodiment of signature transcoding engine syntax presented below.
[0046] The example within Table 1 demonstrates the extraction of data and the establishment of relationships between objects and elements within a same page of a web site. However, signature schema documents may further capture relevant attributes of an object across pages. For example, a user of client machine 102A may click through a number of web pages in eshop.ca to get to a specific product page (e.g. Department -> Product Category -> Product Sub-Category -> Specific Product, such as TV & Video > 19"-21" TVs > LCD TVs > BrandX Product. The navigational hierarchy representing a categorization may be captured and associated to the extracted objects and there elements.
[0047] For brevity, certain instructions were omitted from Table 1. Tables 2-4 provide
representative instructions for further web page families for e-shop.ca that may be read with Table 1. Table 2 below provides representative instructions, e.g. for lines 16 and 17 of Table 1 , including instructions for a web page family related to a list of items/products for sale. Whereas instructions at lines 22-28 provided product data extraction instructions for a web page family showing a single item (i.e. product), the instructions of Table 2 provide additional instructions that repeat product data extractions for each product in the list.
1 <list_elements id="mylist_1 ">
2 <paging>
3 <page_variable value="page" />
4 <page_start value="0" />
5 <lookup type="pex" action="get_string" name="link" ref="Next&nbsp" location="before" start="<a class=" end="</a>" include_sz=" 1 " strip_tags=" 1 " />
6 </paging>
7 <actions>
8 <lookup type="pex" action="move_ptr" ref="Sort or compare products" ref_alt_1 ="Sort products" />
9 </actions>
10 <element>
11 <lookup type="pex" action="get_string" name="link" ref="thumbnail" location="before" start="<ahref="" end="">" />
12 <lookup type="pex" action="get_string" name="image" ref=11thumbnail" location="middle" start=""" end=""" />
13 <lookup type="pex" action="get_string" name="title" ref="class="tx-strong-dgrey&quot;" location="after" start="<a href=" end="</a>" include_sz="1" strip_tags="1" />
14 <lookup type="pex" action="get_string" name="price" TOf=11PnCePiII/11 location="after" start=7" repeat_start=" 1 " end=".gif" tolerance=T7>
15 <lookup type="pex" action="move_ptr" ref="pricepill/" />
16 </element>
17 </list_elements> __
Table 2 - XML Signature Schema Snippet for Product List Web Page Family of E- Shop.ca
[0048] If the engine 140 identifies that the page is of the "mylisM" family, the engine
determines the location in the signature schema document that contains the signature for the objects and elements of that family and applies the instructions therefor. A product list at e-shop.ca may span multiple web pages. Instructions at lines 2-6 of Table 2 find the number of pages and generate the links for each of the pages. Instructions at lines 7-9 (action tag) advance the search scan pointer to the region of web page code that may be of interest (i.e. in this case, the start of the list). In this way, a local signature reference can be used and any earlier ambiguous references skipped. Skipping to the local region of interest may also make the specification of the signature reference less complicated.
[0049] Taking advantage of inherent repeated patterns in the web page code, instructions at lines 10-16 (elements tag) of Table 2 provide product data extraction instructions that may be repeated for each product in the list. The engine 140 may be provided with commands to scan for each data element of interest using a signature reference e.g. ref=", an action, one or more positional instruction(s) to further identify the data within the text of the web page code, and any additional text data manipulation instructions to extract the desired data (e.g. to remove HTML formatting characters or add characters). The instruction at line 15 moves the scan pointer to the end of the object (in this example a product in a list of products) to ready the instructions for application against the next object (product) in the list.
[0050] More particularly: lookup type="pex": string lookup action ="get_string": returns a value back that is the desired element of the object. name="link": the object element, in this case the link to the product page ref="thumbnail11: the reference string that identifies where to find the value of the link location="before": the value of the link is before the ref string start="<a href="": look for the ref string after this value end="">": look for the ref string before this value.
1 <search_elements id="mysearch_1 ">
2 <settings>
3 <search_path value="http://www.eshop.ca/search/search.asp" />
4 <search_variable value="keyword" />
5 </settings>
6 <paging>
7 <page_variable value="page" />
8 <page_start value="0" />
9 <lookup type="pex" action="get_string" name="link" ref="Next&nbsp" location="before" start="<a href=" repeat_start="1" end="</a>" include_sz="1 " strip_tags="1 " />
10 </paging>
11 <actions>
12 <lookup type="pex" action="move_ptr" ref="bg-compare-hero" />
13 </actions>
14 <element>
15 <lookup type="pex" action="get_string" name="link" ref=">" location="after" start="<a href="" end=M">" />
16 <lookup type="pex" action="get_string" name="image" ref="<a href" location="after" start="<img src="" end=""" />
17 <lookup type="pex" action="get_string" name="title" ref="class="tx-strong-dgrey&quot;" location="after" start="<a href=" end="</a>" include_sz="1 " strip_tags="1 " />
18 <lookup type="pex" action="move_ptr" ref="bg-compare-hero" />
19 </element>
20 </search elements>
Table 3 - E-Shop Search Family Signature Schema Snippet
[0051] If the engine 140 has identified that the page is of the "mysearch_1" family the engine applies the portion of the signature schema document that contains the signature for the objects and elements of that family, shown above in Table 3.
<settings>...</settings>: Contains any web page specific manual overrides such as excluding certain menu items, customization, modification of a menu that may be desired. In this example, as per line 3 a value of form variable "keyword" will be posted
to "http://www.eshop.ca/search/search.asp". <paging>...</paging>: Manages paging for the search pages. <actions>...</actions>: Instruct the engine to move the scan pointer to the string "bg- compare-hero" (line 12 of Table 3) and start looking for elements from there. <element>...</element>: Contains lookup instructions for each object element as previously described.
1 <menu_elements id="mymenu_1 ">
2 <settings>
3 <black_list value="Site lndex##Extemal Link" />
4 </settings>
5 <actions>
6 <lookup type="pex" action="move_ptr" ref="bg-lhsnav-title" />
7 <lookup type="pex" action="end_ptr" ref="</table>" />
8 </actions>
9 <element>
10 <lookup type="pex" action="get_string" name="link" ref="<li>" location="after" start="<a href="" end=""" />
11 <lookup type="pex" action="get_string" name="title" ref="<li>" location="after" start="<a href="" end="</a>" include_sz="1 " strip_tags="1" />
12 <lookup type="pex" action="move_ptr" ref="</li>7>
13 </element>
14 </menu_elements>
Table 4 - E-shop Menu Family Signature Schema Snippet
[0052] If the engine 140 has identified that it is looking for a menu on a page that contains the menu style of the "mymenu_1" family, the engine applies the portion of the signature schema document that contains the signature for the objects and elements of that family, shown above in Table 4.
<settings>...</settings>: Contains any page specific manual overrides such as exclude list, customization, modification, personalization, etc. In this example, as per line 3, any result that matches "Site Index", "External Link" are excluded but partial matches are also possible by using wild card strings.
<action>...</action>: Lines 6 - 7 of Table 4 sets the start and end limits to instruct the engine 140 where to look for menu items.
<element>...</element>: Contains lookup instructions for each object element as previously described. In this example, lines 10 and 11 of Table 4, an element in 'mymenu_1 ' (each individual menu entry of web page) contains link and title as its properties. Line 12 instructs the engine to move the pointer to "</li>" to get ready to loop through and extract the next menu item with the same elements, taking advantage of the repeated patterns within the text of the web page code.
[0053] Though the example described relates to extracting informational content for an e-commerce oriented site, no limitation should be applied. Similar instructions may be defined for other types of sites, for pages which permit a user to input information and for navigational data extraction.
[0054] Signature schema document 122 may further comprise transcoding instructions (not shown) for use by engine 140 to express the extracted desired data (which may be retrieved from database 126) in a target format (e.g. a format of HTML, XML, script etc.) for use by the requesting client machine 102. For example, the transcoding instructions may define a web page for displaying the extracted data in browser application 86 that is suitable for display on the client machine 102. The formatting rules can be system and/or user defined and can include parameters such as but not limited to: object positioning, object colour, object size, object shape, object font/image characteristics, background style, and navigational item display (e.g. in a menu as described above) or for display with the content in the generated page on the client screen. Browser application 86 (e.g. of machine 102A) may be configured for using a markup language (e.g. cHTML) or other code format that is not identical to the code provided by web page 110. Alternatively, transcoding instructions may be defined to express the extracted desired data in XML or another code format such as for use by a different client application or plug-in to a client application such as menu application 82 or another application (not shown) on client machine 102.
[0055] Signature schema documents may be prepared (i.e. coded) using a
computing device such as computing device 128. Computing device 128 may be any suitable desktop or laptop device capable of coding documents (which may be but need not be XML-type documents) and may be configured to automate or semi-automate coding of such documents.
[0056] Computing device 128 may be coupled to web site 104 to retrieve web pages from the site for reviewing to prepare the custom signature schema document for the site. Computing device 128 may be configured to automatically review the web page code and apply heuristics or other techniques (e.g. spatial analysis) to determine probable content of interest (i.e. desired data) and generate code to extract the desired data. For example, primary content of interest tends to be located toward the centre of the web page. In another embodiment, the computing device may facilitate a user coding signature schema to manually assist with the analysis of the web page and identification of desired data and the generation of the instructions. Computing device 128 may be further coupled to repository 124 to provide (e.g. up-load or publish) coded signature schema documents for use by server 120.
[0057] It will be apparent to a person of ordinary skill in the art that as a web site may be re-designed or otherwise changed such that the code of one or more web page families may be changed or a family added, an existing signature schema may require re-coding to account for the change/addition, as applicable.
Signature (Transcoding) Engine Syntax
[0058] In accordance with a present embodiment, further details concerning the syntax of schema instructions are described.
Lookup Syntax
The lookup tag instructs the engine 140 to perform an insert, delete or query the document contents.
Type: Defines the data type of the lookup. Type may be "pex" for a string expression. Type may also support more advanced options such as regular
expressions, API calls, and SQL queries.
Action:
Action = "locate_string": Look for a string ("ref identifier") value within the data. Return true iff the string exists in the data (i.e. the "ref" identifier index >= 0).
Action = "replace_string": Replace a string within the data with the "ref" identifier.
Action = "move_ptr": Remove all characters in the data that exist before the location of the "ref" identifier.
Action = "end_ptr": Remove all characters in the data that exist after the location of the "ref" identifier.
Action = "get_string" Extract a string based on the location of the "ref", "start", and "end" identifiers.
ID: ID is an identifier of another section within the signature. It allows the result of a query to trigger another set of actions within the signature. This is primarily used when identifying page types. Once a match has been made, specific instructions are executed that are marked with this ID. Recursive data structures (e.g. lists within lists) may also be supported.
Ref: Ref defines the initial identifier that the lookup searches for. If an AND case is required multiple ref identifiers can be used (i.e. ref="string1" ref1="string2"). If an OR case is required ref_[ref identifier] _alt_1 can be used (i.e. ref="string1" ref_alt_1="string2"). To demonstrate (X=T || Y="2") && (A="8" || B="9") would translate to ref=T ref_alt_1="2" ref1="8" ref1_alt_1="9".
Repeat_[identifier]: Repeat executes the identifier query additional times. For example, if ref="hello" to set the identifier index at the second occurrence of hello the following tag would be added: repeat_ref="1".
Location:
Location = "before": Search the data in a reverse direction, starting from the "ref" identifier. This implies that both the "start" and "end" identifier indexes must be less than the "ref" index.
Location = "middle": Search the data in two directions, starting from the "ref" identifier. This implies that the "ref" identifier index is greater than the "start" identifier index and less than the "end" identifier index.
Location = "after": Search the data in a forward direction, starting from the "ref" identifier. This implies that both the "start" and "end" identifier indexes must be greater than the "ref" index.
Start: Start is primarily used when action="get_string" and may also be used for replace/remove instructions. . The start identifier index will be the start index of the string to extract. If an AND case is required multiple "start" identifiers can be used (i.e. start="string1" start"! ="string2"). If an OR case is required startjstart identifier] _alt_1 can be used (i.e. start="string1" start_alt_1="string2"). To demonstrate (X=1T || Y="2") && (A="8" Il B="9") would translate to start="1" start_alt_1 ="2" start1="8" start1_alt_1="9". To find the nth match see the repeat syntax.
End: End is primarily used when action="get_string" and may also be used for replace/remove instructions. If an AND case is required multiple "end" identifiers can be used (i.e. end="string1" end1="string2"). If an OR case is required endjend identifier] _alt_1 can be used (i.e. end="string1" end_alt_1="string2"). To demonstrate (X=1T || Y="2") && (A="8" H B="9") would translate to end="1" end_alt_1 ="2" end1="8" end1_alt_1="9". To find the nth match see the repeat syntax
Maxjndex: Maxjndex is used to limit the scope of a query by ensuring that no other identifier index is greater than the "maxjndex". . If an AND case is required multiple "maxjndex" identifiers can be used (i.e. max_index="string1" max_index1="string2"). If an OR case is required max_index_[ maxjndex identifier] _alt_1 can be used (i.e. maxjndex="string1" maxjndex_alt_1="string2"). To demonstrate (X=1T || Y="2") &&
(A="8" H B="9") would translate to max_index="1" maxjndex alt_1="2" maxjndex ="8" max_index _alt_1 ="9". To find the nth match see the repeat syntax.
Max_lndex_Use_Ref : Max_lndex_Use_Ref is a Boolean value set to O or 1. It is used with Maxjndex. When set to O1 the "maxjndex" will begin querying at the beginning of the data. When set to 1 , the "maxjndex" will begin querying from the "ref" identifier index.
Gbl_append_[identifier]: GbLappend appends a string passed via the url to the identifiers query value
Gbl_Repeat_[identifier]: GbLRepeat executes the identifier query additional times. For example, if ref="hello" to set the identifier index at the second occurrence of hello the following tag would be added: gbl_repeat_ref="var" where var would be passed in the URL i.e. http://www.eshop.ca/mobile/fatf ree.asp?site=...&url=...&var=1.
Tolerance: Tolerance is a Boolean value set to 0 or 1. It is used to return an empty string. By default tolerance is set to 0 which enforces that a property be found on a page, otherwise the page will be marked as "invalid" and an appropriate error message returned. When set to one, an empty value is returned for properties that can not be located.
lnclude_sz: lnclude_sz is a Boolean value set to 0 or 1 and used with get_string. It is by default set to 0. When set to 1 it includes the "start" value and the "end" value as part of the result.
lnclude_start: lnclude_start is a Boolean value set to 0 or 1 and used with get_string. It is by default set to 0. When set to 1 it includes the "start" value as part of the result.
lnclude_end: lnclude_end is a Boolean value set to 0 or 1 and used with get_string. It is by default set to 0. When set to 1 it includes the "end" value as part of the result.
Closetag: Closetag is a Boolean value set to 0 or 1 and used when action="get_string". It appends /> to the extracted value.
Strip_Tags: Strip_Tags removes HTML tags from the value and used when action="get_string".
Strip_tags="1": remove all tags.
Strip_tags="2": remove all br and script tags.
Strip_tags="3": remove all tags except replace </p> </li> with <br>.
Strip_tags="4": remove all tags except replace </div> <br> with <br>.
Strip_tags="tag1 ,tag2...tagN":remove all tag1 , tag2,...tagN leaving any tag not listed.
Notrim: Notrim is a Boolean value set to O or 1 and used when action="get_string". By default all value have white spaced trimmed. When this property is set to 1 , white space is not trimmed.
Append: Append is a string value and used when action="get_string". It appends a string to the extracted value.
Prepend: Prepend is a string value and used when action="get_string". It prepends a string to the extracted value.
Upper: Upper is a Boolean value set to 0 or 1 and used when action="get_string". It converts all characters to upper case.
Lower: Lower is a Boolean value set to 0 or 1 and used when action="get_string". It converts all characters to lower case.
Page Syntax
The page syntax extracts the paging information from the data. This allows the end user the ability to change pages just as on the desktop.
Page_variable: Defines unique key that defines a family's paging feature.
Page_start: Defines value of first page in a family's paging feature.
Page_post: Path where paging variable(s) must be transmitted to.
Page_start :Defines value of first page in a family's paging feature.
Pagejncrement: Defines value that paging increases by for each page in a family's paging feature.
Page_block: Defines unique key that defines a family's paging block feature.
Page_block_size: Defines the size of the family's page block, (i.e. 10 items per page)
UrLappend: Append the unique key that defines a family's paging feature and the page number.
Search Syntax
Make a web site family's search feature functional by specifying details such as what variable to post.
Search_path: Search path where search variable must be transmitted to
Search_variable: Name of search variable which a web site's search feature is looking to read, request, post, etc.
UrLreplace: Remove a portion of the url that is specific to posting search parameters
URL Syntax
The url tag defines global properties for a site, including the url, and name: <url location="http://www.eshop.ca" key="eshop.ca" name="E-Shop" />
Name: Name is the name to display when browsing using the gateway 120
Location: Location defines the fully qualified address of the site.
Key: Key is the site.
Advanced Syntax
The advanced tag defines global properties for the site. This at a minimum includes the path to the initial page of the site.
<advanced>
<index_link value="http://www.eshop.ca" />
<check_out value="1" /> </advanced>
Indexjink: Indexjink specifies the path to the initial page of the site. This is usually the same page as the location property from the URL syntax. This field is always required.
Appendjink: Appends a string value to every URL requested for this site.
No_purchase: No_purchase is a Boolean value 0 or 1. The default value is 0 which implies that an item should contain a purchase link. When true, the purchase link is removed.
No_item: No_item is a Boolean value 0 or 1. The default value is 0 which implies that Item pages should show up in the breadcrumb. When true, the item is not added to the breadcrumb.
Check_out: Check_out is a Boolean value 0 or 1. The default value is 0 which implies that Item purchase link sends the request and control away from the gateway server 120. When true, then a checkout process has been created for use with gateway server 120.
Product_img_width: Product_img_width defines the width of all item images.
Use_cookies: Use_cookies a Boolean value 0 or 1. By default it is set to 0, and cookies are not passed to the site. When true, gateway 120 passes all cookies from client machine 102 to the site 104, and from the site 104 to the client machine.
Page Type Syntax
The page type is a collection of lookup queries that have an id associated with them. Lookup queries may be processed in a top down fashion. The first successful lookup will trigger another section in the signature schema document. For example, if the following evaluates to true:
<page_type>
<lookup type="pex" action="locate_string" name="list_elements" id="mylist_1" ref="<!-
</page_type>
Then the tag element <list_elements id="mylist_1 "> would be executed next. General Element Syntax
Elements include list_elements, menu_elements, item_elements, search_elements, form_elements. Each element has an ID. For example a menu element:
<menu_element id="menu_id"/>
The element may contain the following sub containers (settings, actions, elements, paging) which scope resides only within the element. Each element is associated with a specific rendering function.
<menu_element id="menu_id"/>
<settings> </settings>
<paging> </ paging >
<elements> </ elements >
<actions> </ actions > </menu element>
Settings Syntax
Settings syntax varies based on the type of element it resides in. Settings allow customizations that only apply to a specific page family.
Blackjist - menu_elements: Blackjist removes menu items with names that reside in the black list. Each entry is separated delimited (e.g. using two pound characters
(##).
Pass_image - list_elements, search_elements: Passjmage adds the image path to the url when requesting an item. The image added to the url will be used as the item image.
Price[n] - item_elements: Price[n] where n is an integer renames the rendered item with name price[n].
Action - form_elements: Overrides the action of a form displayed to the end user.
Handle - form_elements
Handle = "display" - display the form to the end user.
Handle = "post" - post the form.
Handle = "get" - get the form.
Cookie - form_elements: Send additional cookies when posting this form.
lnput_[identifier] - form_elements: Input tag adds/modifies a form value with name [identifier] setting its value.
Rename_[identifier] - form_elements: Rename tag renames a form value with name [identifier].
Actions Syntax
The actions tag primary function is data manipulation. It contains lookup queries that modify data with actions of "move_ptr" or "end_ptr".
<actions>
<lookup type="pex" action="move_ptr" ref="</head>" /> </actions>
[0059] Persons of ordinary skill in the art will appreciate that alternative embodiments are contemplated. System 100 may be implemented so that one or more web sites are coupled to the telecommunication network (either alone by a server 106 or by one or more web servers like web-server 106), and that a corresponding one or more schemas for each of those web sites (or each of the web pages therein, or both) can be maintained by gateway and schema server 120 and repository 124. Client machines 102 can be configured for proxied connection through different servers 120 and for accessing aggregated web site data from database 126. Those skilled in the art will now further recognize that server 120 and web server 125 can be hosted by a variety of different parties, including, for example but without limitation: a manufacturer of client machine 102, a service provider that provides access to the telecommunication network on behalf of user U of a client machine 102; the entity that hosts web-site 104 or a third party intermediary. In web site host example it can even be desired to simply combine the web server 106 and schema server engine 120 on a single server to thereby obviate the need for separate servers. Alternatively the functionality of server 120 and web server 125 may be locally resident on the client machine providing.
[0060] Figure 4 is a schematic representation of an aggregate web site search database 126. The database 126 contains one or more tables for storing aggregate web page data extracted from target web sites. The database may be a relational database enabling structured database queries and provides temporary/persistent storage of structured data as a whole or partially may be indexed for fast performance. For an e- commerce web site, the database 126 may contain tables defining a web site
identification and category index 402 and category data 404 containing specific item details, as will be discussed in connection with Fig. 5. The category data 404 may be further divided into product data 406 sub-tables to store additional product related information. Indices 408 can be created to reference aggregate web site data to improve query responsiveness and results. Metadata 410 associated with the original web pages may be stored such as images, web page formatting or navigation data. In other web site applications such as news, weather, stock data, patent data, trade-mark data etc., the appropriate tables would be configured to store the extracted data based upon the signature schema. The database 126 may alternatively reside in client machine 102A memory. The client machine would generate the request to the desired websites and store extracted data locally.
[0061] Figure 5 is a schematic representation of tables in an aggregate web site search database for e-commerce. The extracted data from selected e-commerce web sites can be formatted into category index 402 which indicates identification 502 of the web site where the data was extracted from and the category 504 of interest. In this example entry 506 identifies http://www.eshop2.ca/pg=3 as the reference for where digital cameras may be indexed. For each entry in the category index 402, an individual category 404 table can be created. The category table provides location identifiers 510 for each product retrieved from the web site based upon the defined signature schema and populated at step 310 during the retrieval of data from the web site. The vendor 512, title 514 of the product, price 516 and description 518 extracted can then be stored. It should be understood that the categories identified can be tailored to the application. The database then provides a means for querying the aggregate data from the web site to present meaningful information to the client machine 102A. By storing the aggregate data queries can be created to meet desired requirements and transcoded for presentation on the client machine 102A.
[0062] Figure 6 illustrates a method of creating the aggregate web site search database. The web sites 103, 104 and 105 that are to be indexed are identified, and page request(s) 602 are sent to the web site to fetch pages by data collection engine 150 hosted on web server 125. The data collection engine 150 is a web crawler to
independently collect data from target web sites for storage in aggregate database 126. Alternatively, the data collection engine 150 may be included in the engine 140 of gateway 120. The signature schema for the web site is retrieved 604 from repository 124 by engine 140. The schema is applied 606 to the received web site data to extract relevant data from the web page content. The extracted data is stored 608 to the aggregate database 126 in the appropriate tables. The process is repeated for each relevant web site to generate the aggregate web site search database. The data collection engine 150 can then periodically access web sites 103, 104 and 105 to ensure stored data is accurate and up to date. As the schema is defined to extract elements with which objects and their attributes on the web page can be defined or described and the schema incorporates knowledge of what these objects and attributes represent, an intelligent and indexed database 126 can be defined.
[0063] Figure 7 illustrates a method of querying the aggregate web site search database 126. A search query or request is generated by a user via an interface on client machine 102A and received 702 at gateway 120 or may be directed to the aggregate web site 150. The web site 150 processes the request and generates 704 the relevant database query such as SQL (Structured Query Language) to database 126. The relevant data is then retrieved 706 from the database 126. The retrieved data or search results, can then be formatted and provided 708 accommodate client machine 102A characteristics in response to request 702 to the client machine 102A. If the request is made through a web application the search result data may be formatted to accommodate the client machine browser. For example as shown in Fig 7C. Alternatively the data may be pushed to a push application on the client machine 102A. The web site data may be retrieved periodically by data collection engine 152 which may be triggered manually, scheduled or on-going.
Claims
1. A method of aggregating web site data from one or more web sites, the method comprising: sending a page request to a web site selected from the one or more web sites; receiving the requested web page from the selected web site; retrieving signature schema associated with the requested web page wherein the signature scheme identifies data fields within the requested web page; applying signature schema to the requested web page to extract data from the requested web page; and storing extracted data to an aggregate database, wherein the aggregate database comprises data extracted from the one or more web sites.
2. The method of claim 1 further comprising: receiving a search query from a client machine for data stored in the aggregate database; generating a database query based upon the received search query; and retrieving data from the aggregate database defined by the database query.
3. The method of claim 2 further comprising: generating a retrieved data web page for the client machine using the retrieved data; and providing the retrieved data web page to the client machine.
4. The method of claim 2 wherein the search query is generated by a push application on the client machine and wherein the retrieved data is sent to the push application.
5. The method of claim 1 wherein the data is stored in a relational database.
6. The method of claim 5 wherein the database query is a Structured Query Language (SQL) query.
7. The method of claim 1 wherein the signature schema is retrieved from a repository of one ore more signature schemas, wherein each schema is defined for each of the one or more web sites.
8. The method of claim 1 wherein the signature schema comprises extensible Markup Language (XML) documents comprising query language for extracting data from the requested web page.
9. The method of claim 1 wherein the aggregate database comprises a category table identifying a hypertext transport protocol (HTTP) link and a product category and wherein the aggregate database further comprises a product table identifying a product specific HTTP link, product information and pricing for each identified product category.
10. The method of claim 1 wherein the aggregated database is stored on the client machine.
11.The method of claim 1 where in the client machine is a wireless device.
12. A system for aggregating web site data from one or more web sites, the system comprising: at least one computing device comprising a processor and a memory coupled thereto, said memory storing instructions and data for configuring the processor to: send a page request to a web site selected from the one or more web sites; receive a web page from the selected web site based upon the sent page request; retrieve signature schema associated with the requested web page; apply signature schema to the requested web page data to extract data identified by the signature schema; and store extracted data to an aggregate database comprising data extracted from the one ore more web sites.
13. The system of claim 12 wherein the instructions for configuring the processor are further configured to: receive a search request for data from the aggregate database from a client machine; generate a database query based upon the received search request; and retrieve data from the database defined by the database query.
14. The system of claim 12 wherein the signature schema is retrieved from a repository of one ore more signature schemas wherein each schema is defined for each of the one or more web sites.
15. The system of claim 12 wherein the signature schema are extensible Markup Language (XML) documents comprises query language for extracting data the requested web page.
16. The system of claim 12 wherein the aggregate database comprises a category table identifying a hypertext transport protocol (HTTP) link and a product category and wherein the aggregate database further comprises a product table identifying a product specific HTTP link, product information and pricing for each identified product category.
17. The system of claim 12 wherein the data is stored in a relational database.
18. The system of claim 17 wherein the query is a Structured Query Language (SQL) query.
19. The system of claim 12 wherein the processor is further configured to: generate a retrieved data web page for the client machine using the retrieved data; and provide the retrieved data web page to the client machine.
20. The system of claim 12 where in the search query is generated by a push application on the client machine and the retrieved data is sent to the push application.
21.The system of claim 12 wherein the aggregated data base is stored on the client machine.
22. The system of claim 21 where in the client machine is a wireless device.
23. A computer program product storing computer readable instructions which when executed by a computer processor configure the processor for: sending a page request to a web site selected from the one or more web sites; receiving the requested web page from the selected web site; retrieving signature schema associated with the requested web page wherein the signature scheme identifies data fields within the requested web page; applying signature schema to the requested web page to extract data from the requested web page; and storing extracted data to an aggregate database, wherein the aggregate database comprises data extracted from the one or more web sites.
24.A method of aggregating web site data from one or more web sites, the method comprising: sending a page request to a web site selected from the one or more web sites; receiving the requested web page from the selected web site; retrieving signature schema associated with the requested web page wherein the signature scheme identifies data fields within the web and wherein the signature schema are extensible Markup Language (XML) documents comprising query language for extracting data from the requested web page; applying signature schema to the received web page to extract data from the requested web page; storing extracted data to an aggregate database, wherein the aggregate database comprises data extracted from the one or more web sites; receiving a search query from a client machine for data stored in the aggregate database; generating a database query based upon the received search query; and retrieving data from the aggregate database defined by the query.
25. The method of claim 25 where in the client machine is a wireless device.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP08748305A EP2151088A4 (en) | 2007-05-17 | 2008-05-12 | Method and system for an aggregate web site search database |
CA002687478A CA2687478A1 (en) | 2007-05-17 | 2008-05-12 | Method and system for an aggregate web site search database |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US92450307P | 2007-05-17 | 2007-05-17 | |
US60/924,503 | 2007-05-17 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2008141426A1 true WO2008141426A1 (en) | 2008-11-27 |
Family
ID=40028558
Family Applications (10)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CA2008/000907 WO2008141425A1 (en) | 2007-05-17 | 2008-05-12 | Method and system for aggregate web site database price watch feature |
PCT/CA2008/000917 WO2008141432A1 (en) | 2007-05-17 | 2008-05-12 | Web page transcoding method and system applying queries to plain text |
PCT/CA2008/000909 WO2008141427A1 (en) | 2007-05-17 | 2008-05-12 | Method and system for automatically generating web page transcoding instructions |
PCT/CA2008/000914 WO2008141429A1 (en) | 2007-05-17 | 2008-05-12 | Method and system for generating an aggregate website search database using smart indexes for searching |
PCT/CA2008/000908 WO2008141426A1 (en) | 2007-05-17 | 2008-05-12 | Method and system for an aggregate web site search database |
PCT/CA2008/000903 WO2008141424A1 (en) | 2007-05-17 | 2008-05-12 | System and method for content navigation |
PCT/CA2008/000918 WO2008141433A1 (en) | 2007-05-17 | 2008-05-12 | Method and system for transcoding web pages |
PCT/CA2008/000915 WO2008141430A1 (en) | 2007-05-17 | 2008-05-12 | Method and system for continuation of browsing sessions between devices |
PCT/CA2008/000919 WO2008141434A1 (en) | 2007-05-17 | 2008-05-12 | Method and system for transcoding web pages by limiting selection through direction |
PCT/CA2008/000916 WO2008141431A1 (en) | 2007-05-17 | 2008-05-12 | Method and system for desktop tagging of a web page |
Family Applications Before (4)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CA2008/000907 WO2008141425A1 (en) | 2007-05-17 | 2008-05-12 | Method and system for aggregate web site database price watch feature |
PCT/CA2008/000917 WO2008141432A1 (en) | 2007-05-17 | 2008-05-12 | Web page transcoding method and system applying queries to plain text |
PCT/CA2008/000909 WO2008141427A1 (en) | 2007-05-17 | 2008-05-12 | Method and system for automatically generating web page transcoding instructions |
PCT/CA2008/000914 WO2008141429A1 (en) | 2007-05-17 | 2008-05-12 | Method and system for generating an aggregate website search database using smart indexes for searching |
Family Applications After (5)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CA2008/000903 WO2008141424A1 (en) | 2007-05-17 | 2008-05-12 | System and method for content navigation |
PCT/CA2008/000918 WO2008141433A1 (en) | 2007-05-17 | 2008-05-12 | Method and system for transcoding web pages |
PCT/CA2008/000915 WO2008141430A1 (en) | 2007-05-17 | 2008-05-12 | Method and system for continuation of browsing sessions between devices |
PCT/CA2008/000919 WO2008141434A1 (en) | 2007-05-17 | 2008-05-12 | Method and system for transcoding web pages by limiting selection through direction |
PCT/CA2008/000916 WO2008141431A1 (en) | 2007-05-17 | 2008-05-12 | Method and system for desktop tagging of a web page |
Country Status (4)
Country | Link |
---|---|
US (10) | US8396881B2 (en) |
EP (10) | EP2158723A4 (en) |
CA (10) | CA2687483C (en) |
WO (10) | WO2008141425A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8782281B2 (en) | 2004-03-23 | 2014-07-15 | Cisco Technology Inc. | Optimally adapting multimedia content for mobile subscriber device playback |
Families Citing this family (338)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7913312B2 (en) | 2002-09-13 | 2011-03-22 | Oracle America, Inc. | Embedded content requests in a rights locker system for digital content access control |
US7240365B2 (en) * | 2002-09-13 | 2007-07-03 | Sun Microsystems, Inc. | Repositing for digital content access control |
US7512972B2 (en) * | 2002-09-13 | 2009-03-31 | Sun Microsystems, Inc. | Synchronizing for digital content access control |
US8782654B2 (en) | 2004-03-13 | 2014-07-15 | Adaptive Computing Enterprises, Inc. | Co-allocating a reservation spanning different compute resources types |
WO2005089241A2 (en) | 2004-03-13 | 2005-09-29 | Cluster Resources, Inc. | System and method for providing object triggers |
US20070266388A1 (en) | 2004-06-18 | 2007-11-15 | Cluster Resources, Inc. | System and method for providing advanced reservations in a compute environment |
US8176490B1 (en) | 2004-08-20 | 2012-05-08 | Adaptive Computing Enterprises, Inc. | System and method of interfacing a workload manager and scheduler with an identity manager |
US8856108B2 (en) | 2006-07-31 | 2014-10-07 | Ricoh Co., Ltd. | Combining results of image retrieval processes |
US8144921B2 (en) | 2007-07-11 | 2012-03-27 | Ricoh Co., Ltd. | Information retrieval using invisible junctions and geometric constraints |
US8276088B2 (en) | 2007-07-11 | 2012-09-25 | Ricoh Co., Ltd. | User interface for three-dimensional navigation |
US8385589B2 (en) * | 2008-05-15 | 2013-02-26 | Berna Erol | Web-based content detection in images, extraction and recognition |
US8086038B2 (en) | 2007-07-11 | 2011-12-27 | Ricoh Co., Ltd. | Invisible junction features for patch recognition |
US9530050B1 (en) | 2007-07-11 | 2016-12-27 | Ricoh Co., Ltd. | Document annotation sharing |
US8868555B2 (en) | 2006-07-31 | 2014-10-21 | Ricoh Co., Ltd. | Computation of a recongnizability score (quality predictor) for image retrieval |
US8510283B2 (en) | 2006-07-31 | 2013-08-13 | Ricoh Co., Ltd. | Automatic adaption of an image recognition system to image capture devices |
US8600989B2 (en) * | 2004-10-01 | 2013-12-03 | Ricoh Co., Ltd. | Method and system for image matching in a mixed media environment |
US8989431B1 (en) | 2007-07-11 | 2015-03-24 | Ricoh Co., Ltd. | Ad hoc paper-based networking with mixed media reality |
US8156427B2 (en) * | 2005-08-23 | 2012-04-10 | Ricoh Co. Ltd. | User interface for mixed media reality |
US9405751B2 (en) | 2005-08-23 | 2016-08-02 | Ricoh Co., Ltd. | Database for mixed media document system |
US8335789B2 (en) * | 2004-10-01 | 2012-12-18 | Ricoh Co., Ltd. | Method and system for document fingerprint matching in a mixed media environment |
US8825682B2 (en) * | 2006-07-31 | 2014-09-02 | Ricoh Co., Ltd. | Architecture for mixed media reality retrieval of locations and registration of images |
US8176054B2 (en) | 2007-07-12 | 2012-05-08 | Ricoh Co. Ltd | Retrieving electronic documents by converting them to synthetic text |
US7812986B2 (en) * | 2005-08-23 | 2010-10-12 | Ricoh Co. Ltd. | System and methods for use of voice mail and email in a mixed media environment |
US7991778B2 (en) * | 2005-08-23 | 2011-08-02 | Ricoh Co., Ltd. | Triggering actions with captured input in a mixed media environment |
US8838591B2 (en) | 2005-08-23 | 2014-09-16 | Ricoh Co., Ltd. | Embedding hot spots in electronic documents |
US8521737B2 (en) * | 2004-10-01 | 2013-08-27 | Ricoh Co., Ltd. | Method and system for multi-tier image matching in a mixed media environment |
US9384619B2 (en) | 2006-07-31 | 2016-07-05 | Ricoh Co., Ltd. | Searching media content for objects specified using identifiers |
US8195659B2 (en) * | 2005-08-23 | 2012-06-05 | Ricoh Co. Ltd. | Integration and use of mixed media documents |
US7885955B2 (en) * | 2005-08-23 | 2011-02-08 | Ricoh Co. Ltd. | Shared document annotation |
US9171202B2 (en) | 2005-08-23 | 2015-10-27 | Ricoh Co., Ltd. | Data organization and access for mixed media document system |
US8005831B2 (en) * | 2005-08-23 | 2011-08-23 | Ricoh Co., Ltd. | System and methods for creation and use of a mixed media environment with geographic location information |
US7669148B2 (en) * | 2005-08-23 | 2010-02-23 | Ricoh Co., Ltd. | System and methods for portable device for mixed media system |
US7702673B2 (en) * | 2004-10-01 | 2010-04-20 | Ricoh Co., Ltd. | System and methods for creation and use of a mixed media environment |
US7917554B2 (en) | 2005-08-23 | 2011-03-29 | Ricoh Co. Ltd. | Visibly-perceptible hot spots in documents |
US9373029B2 (en) | 2007-07-11 | 2016-06-21 | Ricoh Co., Ltd. | Invisible junction feature recognition for document security or annotation |
US7970171B2 (en) | 2007-01-18 | 2011-06-28 | Ricoh Co., Ltd. | Synthetic image and video generation from ground truth data |
US8184155B2 (en) | 2007-07-11 | 2012-05-22 | Ricoh Co. Ltd. | Recognition and tracking using invisible junctions |
US8949287B2 (en) | 2005-08-23 | 2015-02-03 | Ricoh Co., Ltd. | Embedding hot spots in imaged documents |
US8369655B2 (en) | 2006-07-31 | 2013-02-05 | Ricoh Co., Ltd. | Mixed media reality recognition using multiple specialized indexes |
US8156116B2 (en) * | 2006-07-31 | 2012-04-10 | Ricoh Co., Ltd | Dynamic presentation of targeted information in a mixed media reality recognition system |
US7920759B2 (en) * | 2005-08-23 | 2011-04-05 | Ricoh Co. Ltd. | Triggering applications for distributed action execution and use of mixed media recognition as a control input |
US8332401B2 (en) * | 2004-10-01 | 2012-12-11 | Ricoh Co., Ltd | Method and system for position-based image matching in a mixed media environment |
US8145908B1 (en) * | 2004-10-29 | 2012-03-27 | Akamai Technologies, Inc. | Web content defacement protection system |
US8271980B2 (en) | 2004-11-08 | 2012-09-18 | Adaptive Computing Enterprises, Inc. | System and method of providing system jobs within a compute environment |
US10755313B2 (en) | 2004-12-27 | 2020-08-25 | Andrew Levi | System and method for distribution of targeted content between mobile communication devices |
US10354280B2 (en) | 2004-12-27 | 2019-07-16 | Blue Calypso, Llc | System and method for distribution of targeted advertising between mobile communication devices |
US9314697B2 (en) | 2013-07-26 | 2016-04-19 | Blue Calypso, Llc | System and method for advertising distribution through mobile social gaming |
US8631130B2 (en) | 2005-03-16 | 2014-01-14 | Adaptive Computing Enterprises, Inc. | Reserving resources in an on-demand compute environment from a local compute environment |
US8863143B2 (en) | 2006-03-16 | 2014-10-14 | Adaptive Computing Enterprises, Inc. | System and method for managing a hybrid compute environment |
US9231886B2 (en) | 2005-03-16 | 2016-01-05 | Adaptive Computing Enterprises, Inc. | Simple integration of an on-demand compute environment |
CA2603577A1 (en) | 2005-04-07 | 2006-10-12 | Cluster Resources, Inc. | On-demand access to compute resources |
US7769772B2 (en) * | 2005-08-23 | 2010-08-03 | Ricoh Co., Ltd. | Mixed media reality brokerage network with layout-independent recognition |
US8073263B2 (en) | 2006-07-31 | 2011-12-06 | Ricoh Co., Ltd. | Multi-classifier selection and monitoring for MMR-based image recognition |
US8676810B2 (en) | 2006-07-31 | 2014-03-18 | Ricoh Co., Ltd. | Multiple index mixed media reality recognition using unequal priority indexes |
US8201076B2 (en) | 2006-07-31 | 2012-06-12 | Ricoh Co., Ltd. | Capturing symbolic information from documents upon printing |
US9020966B2 (en) | 2006-07-31 | 2015-04-28 | Ricoh Co., Ltd. | Client device for interacting with a mixed media reality recognition system |
US9176984B2 (en) | 2006-07-31 | 2015-11-03 | Ricoh Co., Ltd | Mixed media reality retrieval of differentially-weighted links |
US8489987B2 (en) | 2006-07-31 | 2013-07-16 | Ricoh Co., Ltd. | Monitoring and analyzing creation and usage of visual content using image and hotspot interaction |
US9063952B2 (en) | 2006-07-31 | 2015-06-23 | Ricoh Co., Ltd. | Mixed media reality recognition with image tracking |
EP2095313A4 (en) | 2006-10-27 | 2011-11-02 | Cvon Innovations Ltd | Method and device for managing subscriber connection |
US8515912B2 (en) | 2010-07-15 | 2013-08-20 | Palantir Technologies, Inc. | Sharing and deconflicting data changes in a multimaster database system |
IL180020A (en) * | 2006-12-12 | 2013-03-24 | Waterfall Security Solutions Ltd | Encryption -and decryption-enabled interfaces |
GB2445630B (en) | 2007-03-12 | 2008-11-12 | Cvon Innovations Ltd | Dynamic message allocation system and method |
US10679198B2 (en) * | 2007-03-26 | 2020-06-09 | Transform Sr Brands Llc | System and method for handling an offer to purchase a followed item |
US8396881B2 (en) * | 2007-05-17 | 2013-03-12 | Research In Motion Limited | Method and system for automatically generating web page transcoding instructions |
GB2443760B (en) * | 2007-05-18 | 2008-07-30 | Cvon Innovations Ltd | Characterisation system and method |
US8935718B2 (en) | 2007-05-22 | 2015-01-13 | Apple Inc. | Advertising management method and system |
GB2449944B (en) | 2007-06-09 | 2012-08-08 | Wecomm Ltd | Supplying applications to mobile devices |
GB2452789A (en) | 2007-09-05 | 2009-03-18 | Cvon Innovations Ltd | Selecting information content for transmission by identifying a keyword in a previous message |
IL187492A0 (en) * | 2007-09-06 | 2008-02-09 | Human Interface Security Ltd | Information protection device |
US8041773B2 (en) | 2007-09-24 | 2011-10-18 | The Research Foundation Of State University Of New York | Automatic clustering for self-organizing grids |
GB2453810A (en) | 2007-10-15 | 2009-04-22 | Cvon Innovations Ltd | System, Method and Computer Program for Modifying Communications by Insertion of a Targeted Media Content or Advertisement |
US20090119678A1 (en) * | 2007-11-02 | 2009-05-07 | Jimmy Shih | Systems and methods for supporting downloadable applications on a portable client device |
US8595634B2 (en) * | 2007-11-30 | 2013-11-26 | Red Hat, Inc. | Distributed hosting of web application styles |
US8533588B2 (en) * | 2007-11-30 | 2013-09-10 | Red Hat, Inc. | Overriding images in user interfaces |
US9117219B2 (en) * | 2007-12-31 | 2015-08-25 | Peer 39 Inc. | Method and a system for selecting advertising spots |
US8260765B2 (en) * | 2008-01-14 | 2012-09-04 | International Business Machines Corporation | System and method for a tagging service |
US11157975B2 (en) * | 2008-01-18 | 2021-10-26 | Blackberry Limited | System and method for network interaction between computing devices |
GB0802585D0 (en) | 2008-02-12 | 2008-03-19 | Mtld Top Level Domain Ltd | Determining a property of communication device |
NZ566291A (en) * | 2008-02-27 | 2008-12-24 | Actionthis Ltd | Methods and devices for post processing rendered web pages and handling requests of post processed web pages |
WO2009111047A2 (en) | 2008-03-05 | 2009-09-11 | Ebay Inc. | Method and apparatus for image recognition services |
US9495386B2 (en) * | 2008-03-05 | 2016-11-15 | Ebay Inc. | Identification of items depicted in images |
US20090241032A1 (en) * | 2008-03-18 | 2009-09-24 | David Carroll Challener | Apparatus, system, and method for uniform resource locator sharing |
US8584029B1 (en) * | 2008-05-23 | 2013-11-12 | Intuit Inc. | Surface computer system and method for integrating display of user interface with physical objects |
US7865573B2 (en) * | 2008-05-29 | 2011-01-04 | Research In Motion Limited | Method, system and devices for communicating between an internet browser and an electronic device |
US8041346B2 (en) | 2008-05-29 | 2011-10-18 | Research In Motion Limited | Method and system for establishing a service relationship between a mobile communication device and a mobile data server for connecting to a wireless network |
US8418168B2 (en) * | 2008-05-29 | 2013-04-09 | Research In Motion Limited | Method and system for performing a software upgrade on an electronic device connected to a computer |
US8726164B2 (en) * | 2008-05-30 | 2014-05-13 | Microsoft Corporation | Mark-up extensions for semantically more relevant thumbnails of content |
JP5539974B2 (en) * | 2008-06-18 | 2014-07-02 | トムソン ライセンシング | A method for pre-processing a digital document for display and navigation within the document |
US20090319481A1 (en) * | 2008-06-18 | 2009-12-24 | Yahoo! Inc. | Framework for aggregating information of web pages from a website |
US9200901B2 (en) | 2008-06-19 | 2015-12-01 | Microsoft Technology Licensing, Llc | Predictive services for devices supporting dynamic direction information |
US8700301B2 (en) * | 2008-06-19 | 2014-04-15 | Microsoft Corporation | Mobile computing devices, architecture and user interfaces based on dynamic direction information |
US8467991B2 (en) | 2008-06-20 | 2013-06-18 | Microsoft Corporation | Data services based on gesture and location information of device |
US9600459B2 (en) * | 2008-07-16 | 2017-03-21 | International Business Machines Corporation | Visual macro showing how some icon or object or text was constructed |
US9251281B2 (en) * | 2008-07-29 | 2016-02-02 | International Business Machines Corporation | Web browsing using placemarks and contextual relationships in a data processing system |
US8462394B2 (en) * | 2008-08-05 | 2013-06-11 | Xerox Corporation | Document type classification for scanned bitmaps |
US20100043011A1 (en) * | 2008-08-18 | 2010-02-18 | International Business Machines Corporation | Method for tracking local storage of web pages |
US20100057586A1 (en) * | 2008-09-04 | 2010-03-04 | China Software Venture | Offer Reporting Apparatus and Method |
US20100070848A1 (en) * | 2008-09-15 | 2010-03-18 | Apple Inc. | Preserving the structure of a list in a document while displaying an outline view of the document |
US8984390B2 (en) | 2008-09-15 | 2015-03-17 | Palantir Technologies, Inc. | One-click sharing for screenshots and related documents |
KR101035560B1 (en) * | 2008-09-23 | 2011-05-19 | 한국전자통신연구원 | Service offering system and its method |
US8117306B1 (en) | 2008-09-29 | 2012-02-14 | Amazon Technologies, Inc. | Optimizing content management |
US8316124B1 (en) | 2008-09-29 | 2012-11-20 | Amazon Technologies, Inc. | Managing network data display |
US7930393B1 (en) | 2008-09-29 | 2011-04-19 | Amazon Technologies, Inc. | Monitoring domain allocation performance |
US8122124B1 (en) | 2008-09-29 | 2012-02-21 | Amazon Technologies, Inc. | Monitoring performance and operation of data exchanges |
US8095569B2 (en) * | 2008-10-06 | 2012-01-10 | International Business Machines Corporation | Customized context menu for files based on their content |
GB2464313A (en) * | 2008-10-10 | 2010-04-14 | Mtld Top Level Domain Ltd | Trancoding a web page |
IL194943A0 (en) * | 2008-10-27 | 2009-09-22 | Human Interface Security Ltd | Verification of data transmitted by computer |
US8832319B2 (en) * | 2008-11-18 | 2014-09-09 | Amazon Technologies, Inc. | Synchronization of digital content |
US20100125523A1 (en) * | 2008-11-18 | 2010-05-20 | Peer 39 Inc. | Method and a system for certifying a document for advertisement appropriateness |
US20100131529A1 (en) * | 2008-11-26 | 2010-05-27 | Vishal Kasera | Open entity extraction system |
US20100146399A1 (en) * | 2008-12-09 | 2010-06-10 | Charles Laurence Stinson | Method, apparatus and system for modifying website flow stack to manage site-wide configuration |
US9715557B2 (en) * | 2008-12-09 | 2017-07-25 | Blackberry Limited | System, device and method for providing context sensitive content on a computing device |
KR20100089339A (en) * | 2009-02-03 | 2010-08-12 | 삼성전자주식회사 | Method and apparatus for generating and displaying image |
US9406042B2 (en) | 2009-02-24 | 2016-08-02 | Ebay Inc. | System and method for supplementing an image gallery with status indicators |
US7917618B1 (en) | 2009-03-24 | 2011-03-29 | Amazon Technologies, Inc. | Monitoring web site content |
EP2411923A4 (en) * | 2009-03-25 | 2013-10-30 | Hewlett Packard Development Co | Capturing an application state |
US8700982B2 (en) | 2009-03-30 | 2014-04-15 | Blackberry Limited | System, device and method for providing interactive content on an computing device |
US8385660B2 (en) * | 2009-06-24 | 2013-02-26 | Ricoh Co., Ltd. | Mixed media reality indexing and retrieval for repeated content |
KR20110003213A (en) * | 2009-07-03 | 2011-01-11 | 주식회사 아인스아이앤에스 | Method and system for providing contents |
US8872767B2 (en) | 2009-07-07 | 2014-10-28 | Microsoft Corporation | System and method for converting gestures into digital graffiti |
US20110041073A1 (en) * | 2009-08-17 | 2011-02-17 | Hoff Aaron C | Key-Based Storage and Retrieval of Information |
US8725794B2 (en) * | 2009-09-30 | 2014-05-13 | Tracking. Net | Enhanced website tracking system and method |
US20110078580A1 (en) * | 2009-09-30 | 2011-03-31 | Logicplant | Method and system for providing interface of web page |
WO2011050368A1 (en) | 2009-10-23 | 2011-04-28 | Moov Corporation | Configurable and dynamic transformation of web content |
US11720290B2 (en) | 2009-10-30 | 2023-08-08 | Iii Holdings 2, Llc | Memcached server functionality in a cluster of data processing nodes |
US10877695B2 (en) | 2009-10-30 | 2020-12-29 | Iii Holdings 2, Llc | Memcached server functionality in a cluster of data processing nodes |
EP2499558A4 (en) * | 2009-11-09 | 2013-07-17 | Research In Motion Ltd | Directional navigation of page content |
US8843849B2 (en) * | 2009-11-09 | 2014-09-23 | Blackberry Limited | Directional navigation of page content |
US8751952B2 (en) * | 2009-11-18 | 2014-06-10 | Sap Ag | Dataflow-driven service composition at the presentation layer |
US8495081B2 (en) * | 2009-12-14 | 2013-07-23 | International Business Machines Corporation | Method, system and computer program product for federating tags across multiple systems |
US9164577B2 (en) | 2009-12-22 | 2015-10-20 | Ebay Inc. | Augmented reality system, method, and apparatus for displaying an item image in a contextual environment |
US20110208840A1 (en) * | 2010-02-22 | 2011-08-25 | Lee Blackman | Cookie alert |
US8775488B2 (en) * | 2010-04-14 | 2014-07-08 | Siemens Product Lifecycle Management Software Inc. | System and method for data caching |
US9141724B2 (en) | 2010-04-19 | 2015-09-22 | Afilias Technologies Limited | Transcoder hinting |
CN102238563A (en) * | 2010-04-21 | 2011-11-09 | 华为终端有限公司 | Wireless connection method and equipment |
US8898217B2 (en) | 2010-05-06 | 2014-11-25 | Apple Inc. | Content delivery based on user terminal events |
US9471700B2 (en) * | 2010-05-18 | 2016-10-18 | Tksn Holdings, Llc | System and method for monitoring changes in databases and websites |
US20110289155A1 (en) * | 2010-05-20 | 2011-11-24 | Kambiz David Pirnazar | Method and Apparatus for the Implementation of a Real-Time, Sharable Browsing Experience |
US9171087B2 (en) | 2010-05-20 | 2015-10-27 | Samesurf, Inc. | Method and apparatus for the implementation of a real-time, sharable browsing experience on a host device |
US8527591B2 (en) | 2010-05-20 | 2013-09-03 | Actual Works, Inc. | Method and apparatus for the implementation of a real-time, sharable browsing experience on a guest device |
US8504419B2 (en) | 2010-05-28 | 2013-08-06 | Apple Inc. | Network-based targeted content delivery based on queue adjustment factors calculated using the weighted combination of overall rank, context, and covariance scores for an invitational content item |
AU2010354077A1 (en) * | 2010-05-28 | 2012-12-20 | Unisys Corporation | System and method for continuation of a web session |
US20110320395A1 (en) * | 2010-06-29 | 2011-12-29 | Uzair Dada | Optimization of Multi-channel Commerce |
US8224823B1 (en) * | 2010-06-30 | 2012-07-17 | A9.Com, Inc. | Browsing history restoration |
JP5221601B2 (en) * | 2010-06-30 | 2013-06-26 | 楽天株式会社 | Order receiving device, order receiving method, program, and information recording medium |
GB2481843A (en) | 2010-07-08 | 2012-01-11 | Mtld Top Level Domain Ltd | Web based method of generating user interfaces |
US8510658B2 (en) | 2010-08-11 | 2013-08-13 | Apple Inc. | Population segmentation |
US8510309B2 (en) | 2010-08-31 | 2013-08-13 | Apple Inc. | Selection and delivery of invitational content based on prediction of user interest |
US8640032B2 (en) | 2010-08-31 | 2014-01-28 | Apple Inc. | Selection and delivery of invitational content based on prediction of user intent |
US8983978B2 (en) | 2010-08-31 | 2015-03-17 | Apple Inc. | Location-intention context for content delivery |
US8751513B2 (en) | 2010-08-31 | 2014-06-10 | Apple Inc. | Indexing and tag generation of content for optimal delivery of invitational content |
US8655881B2 (en) * | 2010-09-16 | 2014-02-18 | Alcatel Lucent | Method and apparatus for automatically tagging content |
GB2484268A (en) * | 2010-09-16 | 2012-04-11 | Uniloc Usa Inc | Psychographic profiling of users of computing devices |
CN103229126B (en) * | 2010-09-17 | 2016-04-13 | 谷歌公司 | Mobile message between computing devices |
US8966446B1 (en) * | 2010-09-29 | 2015-02-24 | A9.Com, Inc. | Systems and methods of live experimentation on content provided by a web site |
US10423672B2 (en) * | 2010-10-04 | 2019-09-24 | Excalibur Ip, Llc | Network resource-specific search assistance |
US8650249B2 (en) * | 2010-10-13 | 2014-02-11 | International Business Machines Corporation | Selectively processing cookies in a proxy |
US10127606B2 (en) | 2010-10-13 | 2018-11-13 | Ebay Inc. | Augmented reality system and method for visualizing an item |
US8595640B2 (en) * | 2010-12-02 | 2013-11-26 | Microsoft Corporation | Render transform based scrolling and panning for smooth effects |
GB2500537A (en) * | 2010-12-03 | 2013-09-25 | Titus Inc | Method and system of hierarchical metadata management and application |
CN102567358B (en) * | 2010-12-16 | 2014-07-02 | 腾讯科技(深圳)有限公司 | Navigation content display method and navigation content display device |
US8645491B2 (en) | 2010-12-18 | 2014-02-04 | Qualcomm Incorporated | Methods and apparatus for enabling a hybrid web and native application |
US10102242B2 (en) * | 2010-12-21 | 2018-10-16 | Sybase, Inc. | Bulk initial download of mobile databases |
US9124957B1 (en) | 2011-02-07 | 2015-09-01 | Christopher Fogel | System and method for dynamic multifactor routing |
US9348939B2 (en) | 2011-03-18 | 2016-05-24 | International Business Machines Corporation | Web site sectioning for mobile web browser usability |
US8660673B2 (en) * | 2011-03-23 | 2014-02-25 | Electronic Entertainment Design And Research | In-game contextual telemetry systems and methods |
US9754045B2 (en) * | 2011-04-01 | 2017-09-05 | Harman International (China) Holdings Co., Ltd. | System and method for web text content aggregation and presentation |
US9912718B1 (en) * | 2011-04-11 | 2018-03-06 | Viasat, Inc. | Progressive prefetching |
US10096046B2 (en) * | 2011-04-14 | 2018-10-09 | Garrett Wade Mathews | Web-based user support system and method |
US8706762B1 (en) * | 2011-05-16 | 2014-04-22 | Intuit Inc. | System and method for automated web site information retrieval scripting using untrained users |
US20120310914A1 (en) * | 2011-05-31 | 2012-12-06 | NetSol Technologies, Inc. | Unified Crawling, Scraping and Indexing of Web-Pages and Catalog Interface |
US20130254649A1 (en) * | 2011-06-07 | 2013-09-26 | Michael O'Neill | Establishing user consent to cookie storage on user terminal equipment |
US9280273B2 (en) * | 2011-06-30 | 2016-03-08 | Nokia Technologies Oy | Method, apparatus, and computer program for displaying content items in display regions |
US9875486B2 (en) | 2014-10-21 | 2018-01-23 | Slice Technologies, Inc. | Extracting product purchase information from electronic messages |
US9563904B2 (en) | 2014-10-21 | 2017-02-07 | Slice Technologies, Inc. | Extracting product purchase information from electronic messages |
US8844010B2 (en) | 2011-07-19 | 2014-09-23 | Project Slice | Aggregation of emailed product order and shipping information |
US9846902B2 (en) * | 2011-07-19 | 2017-12-19 | Slice Technologies, Inc. | Augmented aggregation of emailed product order and shipping information |
US9058331B2 (en) | 2011-07-27 | 2015-06-16 | Ricoh Co., Ltd. | Generating a conversation in a social network based on visual search results |
US8850515B2 (en) * | 2011-08-15 | 2014-09-30 | Bank Of America Corporation | Method and apparatus for subject recognition session validation |
US8458174B1 (en) | 2011-09-02 | 2013-06-04 | Google Inc. | Semantic image label synthesis |
US9329851B2 (en) * | 2011-09-09 | 2016-05-03 | Microsoft Technology Licensing, Llc | Browser-based discovery and application switching |
KR20130028375A (en) * | 2011-09-09 | 2013-03-19 | 주식회사 팬택 | Mobile apparatus for synchronizing working environment of web browser based on html5 and method thereof |
USD667449S1 (en) * | 2011-09-12 | 2012-09-18 | Microsoft Corporation | Display screen with icon |
US8381282B1 (en) * | 2011-09-30 | 2013-02-19 | Kaspersky Lab Zao | Portable security device and methods for maintenance of authentication information |
US9069743B2 (en) | 2011-10-13 | 2015-06-30 | Microsoft Technology Licensing, Llc | Application of comments in multiple application functionality content |
US9176933B2 (en) | 2011-10-13 | 2015-11-03 | Microsoft Technology Licensing, Llc | Application of multiple content items and functionality to an electronic content item |
US9449342B2 (en) | 2011-10-27 | 2016-09-20 | Ebay Inc. | System and method for visualization of items in an environment using augmented reality |
TW201322247A (en) * | 2011-11-23 | 2013-06-01 | Inst Information Industry | Device, method and computer readable storage medium for storing the method for displaying multiple language characters |
US9330188B1 (en) | 2011-12-22 | 2016-05-03 | Amazon Technologies, Inc. | Shared browsing sessions |
CN102594796B (en) * | 2011-12-27 | 2015-05-20 | 中兴通讯股份有限公司 | Terminal device and user information synchronization method |
US20130173593A1 (en) * | 2011-12-28 | 2013-07-04 | Target Brands, Inc. | Breadcrumb filtering |
US10015134B2 (en) | 2011-12-29 | 2018-07-03 | Verisign, Inc. | Methods and systems for creating new domains |
US9223534B1 (en) | 2011-12-30 | 2015-12-29 | hopTo Inc. | Client side detection of motion vectors for cross-platform display |
US9367931B1 (en) | 2011-12-30 | 2016-06-14 | hopTo Inc. | Motion vectors for cross-platform display |
US8856262B1 (en) | 2011-12-30 | 2014-10-07 | hopTo Inc. | Cloud-based image hosting |
US9218107B1 (en) | 2011-12-30 | 2015-12-22 | hopTo Inc. | Cloud-based text management for cross-platform display |
US9454617B1 (en) | 2011-12-30 | 2016-09-27 | hopTo Inc. | Client rendering |
US10108737B2 (en) * | 2012-01-25 | 2018-10-23 | Microsoft Technology Licensing, Llc | Presenting data driven forms |
US10296558B1 (en) * | 2012-02-27 | 2019-05-21 | Amazon Technologies, Inc. | Remote generation of composite content pages |
US20130230248A1 (en) * | 2012-03-02 | 2013-09-05 | International Business Machines Corporation | Ensuring validity of the bookmark reference in a collaborative bookmarking system |
US20130246218A1 (en) * | 2012-03-15 | 2013-09-19 | Balaji Gopalan | Remote third party payment of in-store items |
US10474728B2 (en) * | 2012-03-21 | 2019-11-12 | Oath Inc. | Seamless browsing between devices |
US9535888B2 (en) | 2012-03-30 | 2017-01-03 | Bmenu As | System, method, software arrangement and computer-accessible medium for a generator that automatically identifies regions of interest in electronic documents for transcoding |
CN102708192B (en) * | 2012-05-15 | 2017-11-28 | 华为技术有限公司 | A kind of method and system of document sharing, equipment |
US9106612B1 (en) * | 2012-05-18 | 2015-08-11 | hopTo Inc. | Decomposition and recomposition for cross-platform display |
US8990363B1 (en) * | 2012-05-18 | 2015-03-24 | hopTo, Inc. | Decomposition and recomposition for cross-platform display |
US9124562B1 (en) | 2012-05-18 | 2015-09-01 | hopTo Inc. | Cloud-based decomposition and recomposition for cross-platform display |
KR101935357B1 (en) * | 2012-05-21 | 2019-01-04 | 엘지전자 주식회사 | A Method for Providing a Web Page Using Home Device Web Browser and A Home device thereof |
US9141504B2 (en) | 2012-06-28 | 2015-09-22 | Apple Inc. | Presenting status data received from multiple devices |
JP6050625B2 (en) * | 2012-06-28 | 2016-12-21 | サターン ライセンシング エルエルシーSaturn Licensing LLC | Information processing apparatus and information processing method, computer program, and information communication system |
US10846766B2 (en) | 2012-06-29 | 2020-11-24 | Ebay Inc. | Contextual menus based on image recognition |
US9146981B2 (en) * | 2012-07-06 | 2015-09-29 | International Business Machines Corporation | Automated electronic discovery collections and preservations |
WO2014008468A2 (en) | 2012-07-06 | 2014-01-09 | Blekko, Inc. | Searching and aggregating web pages |
US9442687B2 (en) * | 2012-07-23 | 2016-09-13 | Korea Advanced Institute Of Science And Technology | Method and apparatus for moving web object based on intent |
IL227831B (en) | 2012-08-06 | 2019-03-31 | Wix Com Ltd | A system for supporting flexible color assignment in complex documents |
US9898445B2 (en) | 2012-08-16 | 2018-02-20 | Qualcomm Incorporated | Resource prefetching via sandboxed execution |
US20140053064A1 (en) | 2012-08-16 | 2014-02-20 | Qualcomm Incorporated | Predicting the usage of document resources |
US9215269B2 (en) | 2012-08-23 | 2015-12-15 | Amazon Technologies, Inc. | Predictive caching for content |
US9305084B1 (en) | 2012-08-30 | 2016-04-05 | deviantArt, Inc. | Tag selection, clustering, and recommendation for content hosting services |
CN103678347B (en) * | 2012-09-07 | 2018-05-04 | 百度在线网络技术(北京)有限公司 | Method and apparatus for web site to be converted to target web app websites |
US10162492B2 (en) | 2012-09-11 | 2018-12-25 | Microsoft Technology Licensing, Llc | Tap-to-open link selection areas |
US9348490B2 (en) * | 2012-09-14 | 2016-05-24 | Ca, Inc. | User interface with configuration, registration, and runtime selection of views |
US20140089139A1 (en) * | 2012-09-26 | 2014-03-27 | Auction.com, LLC. | System and method for provisioning assets for online transactions |
US20140089772A1 (en) * | 2012-09-27 | 2014-03-27 | Gururaj B. Shetty | Automatically Creating Tables of Content for Web Pages |
EP2904526A4 (en) | 2012-10-01 | 2016-06-15 | Yandex Europe Ag | Method and system for navigating to a sub-resource of an internet resource |
US8965880B2 (en) | 2012-10-05 | 2015-02-24 | Google Inc. | Transcoding and serving resources |
US9081975B2 (en) | 2012-10-22 | 2015-07-14 | Palantir Technologies, Inc. | Sharing information between nexuses that use different classification schemes for information access control |
US9501761B2 (en) | 2012-11-05 | 2016-11-22 | Palantir Technologies, Inc. | System and method for sharing investigation results |
US9378456B2 (en) | 2012-11-19 | 2016-06-28 | Microsoft Technology Licensing, Llc | Task completion |
US10311014B2 (en) * | 2012-12-28 | 2019-06-04 | Iii Holdings 2, Llc | System, method and computer readable medium for offloaded computation of distributed application protocols within a cluster of data processing nodes |
US20140189538A1 (en) * | 2012-12-31 | 2014-07-03 | Motorola Mobility Llc | Recommendations for Applications Based on Device Context |
US20140189519A1 (en) * | 2012-12-31 | 2014-07-03 | W.W. Grainger, Inc. | Systems and methods for providing website browsing history to repeat users of a website |
US20140258816A1 (en) * | 2013-03-08 | 2014-09-11 | True Xiong | Methodology to dynamically rearrange web content for consumer devices |
US9430134B1 (en) | 2013-03-15 | 2016-08-30 | hopTo Inc. | Using split windows for cross-platform document views |
US8917274B2 (en) | 2013-03-15 | 2014-12-23 | Palantir Technologies Inc. | Event matrix based on integrated data |
US20140283002A1 (en) * | 2013-03-15 | 2014-09-18 | Stephen Frechette | Method and system for anonymous circumvention of internet filter firewalls without detection or identification |
US9292157B1 (en) | 2013-03-15 | 2016-03-22 | hopTo Inc. | Cloud-based usage of split windows for cross-platform document views |
US8788405B1 (en) | 2013-03-15 | 2014-07-22 | Palantir Technologies, Inc. | Generating data clusters with customizable analysis strategies |
US9965937B2 (en) | 2013-03-15 | 2018-05-08 | Palantir Technologies Inc. | External malware data item clustering and analysis |
US8937619B2 (en) | 2013-03-15 | 2015-01-20 | Palantir Technologies Inc. | Generating an object time series from data objects |
DE102013006351A1 (en) * | 2013-04-12 | 2014-10-30 | Unify Gmbh & Co. Kg | Method and device for managing a call to a call center |
US9875494B2 (en) * | 2013-04-16 | 2018-01-23 | Sri International | Using intents to analyze and personalize a user's dialog experience with a virtual personal assistant |
US20140316946A1 (en) * | 2013-04-23 | 2014-10-23 | Christine M. Sanders | Event-driven sales platform and notification system and method |
US9912720B2 (en) * | 2013-05-13 | 2018-03-06 | Appsense Us Llc | Context aware browser policy |
CN104182408B (en) * | 2013-05-23 | 2019-01-29 | 腾讯科技(深圳)有限公司 | A kind of webpage offline access method and device |
US9559902B2 (en) * | 2013-06-02 | 2017-01-31 | Microsoft Technology Licensing, Llc | Distributed state model for system configuration synchronization |
US20160162602A1 (en) * | 2013-06-21 | 2016-06-09 | Stillman Bradish | Methods and apparatus for proximally informed database searches |
KR102112605B1 (en) * | 2013-07-01 | 2020-05-19 | 삼성전자 주식회사 | Mobile device and Method for controlling network transfer of the mobile device |
US9594811B2 (en) | 2013-07-02 | 2017-03-14 | Sap Se | Method and system for limiting database access results |
JP5447722B1 (en) * | 2013-07-17 | 2014-03-19 | 富士ゼロックス株式会社 | Information processing system and program |
US9607039B2 (en) * | 2013-07-18 | 2017-03-28 | International Business Machines Corporation | Subject-matter analysis of tabular data |
US10373431B2 (en) | 2013-07-26 | 2019-08-06 | Blue Calypso, Llc | System and method for advertising distribution through mobile social gaming |
US9814985B2 (en) | 2013-07-26 | 2017-11-14 | Blue Calypso, Llc | System and method for advertising distribution through mobile social gaming |
US9565152B2 (en) | 2013-08-08 | 2017-02-07 | Palantir Technologies Inc. | Cable reader labeling |
CN103414734A (en) * | 2013-09-05 | 2013-11-27 | 广东动易网络科技有限公司 | Multi-screen optimization technology for supporting different types of terminal equipment to access same website |
US8938686B1 (en) | 2013-10-03 | 2015-01-20 | Palantir Technologies Inc. | Systems and methods for analyzing performance of an entity |
JP5963815B2 (en) * | 2013-11-08 | 2016-08-03 | 株式会社ソニー・インタラクティブエンタテインメント | Information processing apparatus and information processing method |
US8924850B1 (en) | 2013-11-21 | 2014-12-30 | Google Inc. | Speeding up document loading |
WO2015078231A1 (en) * | 2013-11-26 | 2015-06-04 | 优视科技有限公司 | Method for generating webpage template and server |
US10275765B2 (en) * | 2013-12-11 | 2019-04-30 | Ebay Inc. | Omni-channel state preservation |
US20160259641A1 (en) * | 2013-12-16 | 2016-09-08 | Hewlett Packard Enterprise Development Lp | Tagging a program code portion |
US10579647B1 (en) | 2013-12-16 | 2020-03-03 | Palantir Technologies Inc. | Methods and systems for analyzing entity performance |
US9043696B1 (en) | 2014-01-03 | 2015-05-26 | Palantir Technologies Inc. | Systems and methods for visual definition of data associations |
US9483162B2 (en) | 2014-02-20 | 2016-11-01 | Palantir Technologies Inc. | Relationship visualizations |
US20150242377A1 (en) * | 2014-02-24 | 2015-08-27 | Autodesk, Inc. | Logical structure-based document navigation |
WO2015143416A1 (en) * | 2014-03-21 | 2015-09-24 | Ptc Inc. | Systems and methods for developing and using real-time data applications |
CN104978358B (en) * | 2014-04-11 | 2019-11-15 | 阿里巴巴集团控股有限公司 | The method and intercepting page segment of desktop presentation web page fragments are to desktop presentation system |
US10356071B2 (en) * | 2014-04-14 | 2019-07-16 | Mcafee, Llc | Automatic log-in and log-out of a session with session sharing |
US9857958B2 (en) | 2014-04-28 | 2018-01-02 | Palantir Technologies Inc. | Systems and user interfaces for dynamic and interactive access of, investigation of, and analysis of data objects stored in one or more databases |
US9544388B1 (en) * | 2014-05-09 | 2017-01-10 | Amazon Technologies, Inc. | Client-side predictive caching for content |
US10162473B2 (en) * | 2014-06-17 | 2018-12-25 | Lenovo (Singapore) Pte. Ltd. | Integrating browser navigation between multiple browsing contexts |
US9202249B1 (en) | 2014-07-03 | 2015-12-01 | Palantir Technologies Inc. | Data item clustering and analysis |
US9454281B2 (en) | 2014-09-03 | 2016-09-27 | Palantir Technologies Inc. | System for providing dynamic linked panels in user interface |
US9767172B2 (en) | 2014-10-03 | 2017-09-19 | Palantir Technologies Inc. | Data aggregation and analysis system |
US9501851B2 (en) | 2014-10-03 | 2016-11-22 | Palantir Technologies Inc. | Time-series analysis system |
US9984133B2 (en) | 2014-10-16 | 2018-05-29 | Palantir Technologies Inc. | Schematic and database linking system |
US9912758B2 (en) * | 2014-12-16 | 2018-03-06 | Yahoo Holdings, Inc. | Continuing an application session on a different device |
US10027739B1 (en) * | 2014-12-16 | 2018-07-17 | Amazon Technologies, Inc. | Performance-based content delivery |
US9769248B1 (en) | 2014-12-16 | 2017-09-19 | Amazon Technologies, Inc. | Performance-based content delivery |
US10225365B1 (en) | 2014-12-19 | 2019-03-05 | Amazon Technologies, Inc. | Machine learning based content delivery |
US10311372B1 (en) | 2014-12-19 | 2019-06-04 | Amazon Technologies, Inc. | Machine learning based content delivery |
US10311371B1 (en) | 2014-12-19 | 2019-06-04 | Amazon Technologies, Inc. | Machine learning based content delivery |
US9367872B1 (en) | 2014-12-22 | 2016-06-14 | Palantir Technologies Inc. | Systems and user interfaces for dynamic and interactive investigation of bad actor behavior based on automatic clustering of related data in various data structures |
US9335911B1 (en) | 2014-12-29 | 2016-05-10 | Palantir Technologies Inc. | Interactive user interface for dynamic data analysis exploration and query processing |
US9326046B1 (en) | 2015-03-19 | 2016-04-26 | Amazon Technologies, Inc. | Uninterrupted playback of video streams using lower quality cached files |
US20160321285A1 (en) * | 2015-05-02 | 2016-11-03 | Mohammad Faraz RASHID | Method for organizing and distributing data |
US10366431B1 (en) * | 2015-05-15 | 2019-07-30 | Amazon Technologies, Inc. | Computer-readable medium, system, and method for resuming sessions across devices |
US10380227B2 (en) * | 2015-06-07 | 2019-08-13 | Apple Inc. | Generating layout for content presentation structures |
JP2017016446A (en) * | 2015-07-02 | 2017-01-19 | 富士ゼロックス株式会社 | Information processor and information processing program |
CN107864677B (en) * | 2015-07-22 | 2022-05-27 | 爱维士软件有限责任公司 | Content access authentication system and method |
US10489391B1 (en) | 2015-08-17 | 2019-11-26 | Palantir Technologies Inc. | Systems and methods for grouping and enriching data items accessed from one or more databases for presentation in a user interface |
US10482167B2 (en) * | 2015-09-24 | 2019-11-19 | Mcafee, Llc | Crowd-source as a backup to asynchronous identification of a type of form and relevant fields in a credential-seeking web page |
US9760556B1 (en) | 2015-12-11 | 2017-09-12 | Palantir Technologies Inc. | Systems and methods for annotating and linking electronic documents |
US10089289B2 (en) | 2015-12-29 | 2018-10-02 | Palantir Technologies Inc. | Real-time document annotation |
US9823818B1 (en) | 2015-12-29 | 2017-11-21 | Palantir Technologies Inc. | Systems and interactive user interfaces for automatic generation of temporal representation of data objects |
US9996236B1 (en) | 2015-12-29 | 2018-06-12 | Palantir Technologies Inc. | Simplified frontend processing and visualization of large datasets |
US10229214B2 (en) * | 2015-12-31 | 2019-03-12 | Ca, Inc. | Dynamic web page navigation |
US10602332B2 (en) * | 2016-06-20 | 2020-03-24 | Microsoft Technology Licensing, Llc | Programming organizational links that propagate to mobile applications |
US10324609B2 (en) | 2016-07-21 | 2019-06-18 | Palantir Technologies Inc. | System for providing dynamic linked panels in user interface |
US10719188B2 (en) | 2016-07-21 | 2020-07-21 | Palantir Technologies Inc. | Cached database and synchronization system for providing dynamic linked panels in user interface |
CN106897346A (en) | 2016-08-04 | 2017-06-27 | 阿里巴巴集团控股有限公司 | The method and device of data processing |
US11232252B2 (en) * | 2016-09-01 | 2022-01-25 | Verint Americas Inc. | System and computer-implemented method for in-page reporting of user feedback on a website or mobile app |
US10445462B2 (en) * | 2016-10-12 | 2019-10-15 | Terarecon, Inc. | System and method for medical image interpretation |
US10133588B1 (en) | 2016-10-20 | 2018-11-20 | Palantir Technologies Inc. | Transforming instructions for collaborative updates |
CN106547915B (en) * | 2016-11-29 | 2019-10-29 | 上海轻维软件有限公司 | Intelligent data extracting method based on model library |
EP3337132A1 (en) * | 2016-12-15 | 2018-06-20 | Awingu Nv | Intermediate broker with multi-session recording |
US10044836B2 (en) | 2016-12-19 | 2018-08-07 | Palantir Technologies Inc. | Conducting investigations under limited connectivity |
CN108268488B (en) * | 2016-12-30 | 2022-04-19 | 百度在线网络技术(北京)有限公司 | Webpage main graph identification method and device |
US10216811B1 (en) | 2017-01-05 | 2019-02-26 | Palantir Technologies Inc. | Collaborating using different object models |
EP3586250B1 (en) * | 2017-02-22 | 2023-09-20 | OpSec Online Limited | Systems and methods for direct in-browser markup of elements in internet content |
JP6895795B2 (en) * | 2017-04-27 | 2021-06-30 | 株式会社日立製作所 | Data processing systems, data processing methods, and data processing programs |
US11074277B1 (en) | 2017-05-01 | 2021-07-27 | Palantir Technologies Inc. | Secure resolution of canonical entities |
US10574644B2 (en) | 2017-05-03 | 2020-02-25 | International Business Machines Corporation | Stateful session manager |
US10447635B2 (en) | 2017-05-17 | 2019-10-15 | Slice Technologies, Inc. | Filtering electronic messages |
US11416817B2 (en) * | 2017-06-02 | 2022-08-16 | Apple Inc. | Event extraction systems and methods |
US10614030B2 (en) | 2017-06-02 | 2020-04-07 | Microsoft Technology Licensing Llc | Task creation and completion with bi-directional user interactions |
US10942947B2 (en) | 2017-07-17 | 2021-03-09 | Palantir Technologies Inc. | Systems and methods for determining relationships between datasets |
US10956508B2 (en) | 2017-11-10 | 2021-03-23 | Palantir Technologies Inc. | Systems and methods for creating and managing a data integration workspace containing automatically updated data models |
US11061874B1 (en) | 2017-12-14 | 2021-07-13 | Palantir Technologies Inc. | Systems and methods for resolving entity data across various data structures |
US10853352B1 (en) | 2017-12-21 | 2020-12-01 | Palantir Technologies Inc. | Structured data collection, presentation, validation and workflow management |
GB201800595D0 (en) | 2018-01-15 | 2018-02-28 | Palantir Technologies Inc | Management of software bugs in a data processing system |
US11803883B2 (en) | 2018-01-29 | 2023-10-31 | Nielsen Consumer Llc | Quality assurance for labeled training data |
US11599369B1 (en) | 2018-03-08 | 2023-03-07 | Palantir Technologies Inc. | Graphical user interface configuration system |
CN110324377B (en) * | 2018-03-30 | 2022-03-25 | 武汉斗鱼网络科技有限公司 | Push interaction method and device, electronic equipment and medium |
US10885021B1 (en) | 2018-05-02 | 2021-01-05 | Palantir Technologies Inc. | Interactive interpreter and graphical user interface |
US11061542B1 (en) | 2018-06-01 | 2021-07-13 | Palantir Technologies Inc. | Systems and methods for determining and displaying optimal associations of data items |
US10719573B2 (en) | 2018-10-31 | 2020-07-21 | Flinks Technology Inc. | Systems and methods for retrieving web data |
CN109948020A (en) * | 2019-01-14 | 2019-06-28 | 北京三快在线科技有限公司 | Data capture method, device, system and readable storage medium storing program for executing |
US11080358B2 (en) | 2019-05-03 | 2021-08-03 | Microsoft Technology Licensing, Llc | Collaboration and sharing of curated web data from an integrated browser experience |
CN118740489A (en) * | 2019-05-06 | 2024-10-01 | 谷歌有限责任公司 | Secure communication in mobile digital pages |
CN112667529B (en) * | 2019-10-16 | 2024-02-13 | 戴尔产品有限公司 | Network fabric storage system |
CN110825742B (en) * | 2019-10-31 | 2022-09-06 | 北京东方通科技股份有限公司 | Data paging retrieval method and device, electronic equipment and storage medium |
US11044279B1 (en) | 2019-12-19 | 2021-06-22 | Capital One Services, Llc | Systems and methods for secure browser session transfer |
US11847181B2 (en) * | 2020-02-14 | 2023-12-19 | Microsoft Technology Licensing, Llc | Updating a search page upon return of user focus |
US11805207B2 (en) | 2020-08-28 | 2023-10-31 | Intermetro Communications, Inc. | Systems and methods for configuring and dynamically applying call route groups |
US20220327129A1 (en) * | 2021-04-09 | 2022-10-13 | Tekion Corp | Enabling Real-Time Integration of Up-To-Date Siloed Data |
US20230350967A1 (en) * | 2022-04-30 | 2023-11-02 | Microsoft Technology Licensing, Llc | Assistance user interface for computer accessibility |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US924503A (en) | 1908-09-01 | 1909-06-08 | Ernst Otto Schneider | Cylindrical positive electrode. |
US20020120714A1 (en) * | 2001-02-26 | 2002-08-29 | Borislav Agapiev | Distributed-code, custom-generated dynamic internet inclusion agent |
US20020133484A1 (en) * | 1999-12-02 | 2002-09-19 | International Business Machines Corporation | Storing fragmented XML data into a relational database by decomposing XML documents with application specific mappings |
US20020184266A1 (en) * | 2001-05-31 | 2002-12-05 | Blessin Stephen W. | Universal file format for products that allows both parametric and textual searching |
US6601100B2 (en) * | 1999-01-27 | 2003-07-29 | International Business Machines Corporation | System and method for collecting and analyzing information about content requested in a network (world wide web) environment |
US20040249824A1 (en) * | 2003-06-05 | 2004-12-09 | International Business Machines Corporation | Semantics-bases indexing in a distributed data processing system |
US20060173985A1 (en) * | 2005-02-01 | 2006-08-03 | Moore James F | Enhanced syndication |
US20070038643A1 (en) * | 2005-08-09 | 2007-02-15 | Epstein Samuel S | Methods and apparatuses to assemble, extract and deploy content from electronic documents |
CA2622625A1 (en) * | 2005-09-14 | 2007-03-22 | O-Ya!, Inc. | Networked information indexing and search apparatus and method |
Family Cites Families (118)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US103036A (en) * | 1870-05-17 | Thomas b | ||
US712072A (en) * | 1901-07-09 | 1902-10-28 | Colonial Scale Company | Lifting-machine. |
US811939A (en) * | 1905-03-22 | 1906-02-06 | William Loudon | Feed-water mechanism. |
US2433375A (en) * | 1944-08-22 | 1947-12-30 | Raytheon Mfg Co | Lead-in connection for electron discharge tubes |
US2622625A (en) * | 1951-03-16 | 1952-12-23 | Crompton & Knowles Loom Works | Method and means for cleaning nose boards on axminster looms |
CA2047871C (en) * | 1990-07-25 | 1996-05-14 | Ikio Yoshida | Portable transceiver and esn transfer system therefor |
US5517618A (en) * | 1992-02-10 | 1996-05-14 | Matsushita Electric Industrial Co., Ltd. | Mobile migration communications control device |
US5508695A (en) * | 1994-07-28 | 1996-04-16 | Motorola, Inc. | Method and apparatus for identifying messages associated with a sports team in a communication system |
US5583983A (en) * | 1994-11-17 | 1996-12-10 | Objectware, Inc. | Multi-platform object-oriented software development and deployment system |
US5918013A (en) | 1996-06-03 | 1999-06-29 | Webtv Networks, Inc. | Method of transcoding documents in a network environment using a proxy server |
US5995102A (en) * | 1997-06-25 | 1999-11-30 | Comet Systems, Inc. | Server system and method for modifying a cursor image |
US6173316B1 (en) * | 1998-04-08 | 2001-01-09 | Geoworks Corporation | Wireless communication device with markup language based man-machine interface |
US6925595B1 (en) * | 1998-08-05 | 2005-08-02 | Spyglass, Inc. | Method and system for content conversion of hypertext data using data mining |
US7032183B2 (en) | 1998-08-14 | 2006-04-18 | Microsoft Corporation | Dynamic site browser |
US6377949B1 (en) * | 1998-09-18 | 2002-04-23 | Tacit Knowledge Systems, Inc. | Method and apparatus for assigning a confidence level to a term within a user knowledge profile |
EP1018689A3 (en) * | 1999-01-08 | 2001-01-24 | Lucent Technologies Inc. | Methods and apparatus for enabling shared web-based interaction in stateful servers |
JP2000242392A (en) * | 1999-02-12 | 2000-09-08 | Dbm Korea:Kk | Computer system for dynamic information display and method thereof |
US6405204B1 (en) * | 1999-03-02 | 2002-06-11 | Sector Data, Llc | Alerts by sector/news alerts |
US6401085B1 (en) * | 1999-03-05 | 2002-06-04 | Accenture Llp | Mobile communication and computing system and method |
US6263503B1 (en) * | 1999-05-26 | 2001-07-17 | Neal Margulis | Method for effectively implementing a wireless television system |
US6539494B1 (en) * | 1999-06-17 | 2003-03-25 | Art Technology Group, Inc. | Internet server session backup apparatus |
AU5904600A (en) * | 1999-07-02 | 2001-01-22 | Quickdog, Inc. | Method and apparatus for comparing, ranking and selecting data items including web pages |
US20040225865A1 (en) * | 1999-09-03 | 2004-11-11 | Cox Richard D. | Integrated database indexing system |
US6715129B1 (en) * | 1999-10-13 | 2004-03-30 | International Business Machines Corporation | Achieving application-specific document content by transcoding using Java Server Pages |
US7590644B2 (en) * | 1999-12-21 | 2009-09-15 | International Business Machine Corporation | Method and apparatus of streaming data transformation using code generator and translator |
US7188176B1 (en) * | 2000-01-20 | 2007-03-06 | Priceline.Com Incorporated | Apparatus, system, and method for maintaining a persistent data state on a communications network |
WO2001067286A2 (en) * | 2000-03-03 | 2001-09-13 | Merinta, Inc. | Framework for a customizable graphics user interface |
US6311180B1 (en) * | 2000-03-28 | 2001-10-30 | Linx Data Terminals, Inc. | Method for mapping and formatting information for a display device |
US7702995B2 (en) * | 2000-04-24 | 2010-04-20 | TVWorks, LLC. | Method and system for transforming content for execution on multiple platforms |
US20040049737A1 (en) * | 2000-04-26 | 2004-03-11 | Novarra, Inc. | System and method for displaying information content with selective horizontal scrolling |
US6484187B1 (en) * | 2000-04-28 | 2002-11-19 | International Business Machines Corporation | Coordinating remote copy status changes across multiple logical sessions to maintain consistency |
AU2001259590A1 (en) | 2000-05-08 | 2001-11-20 | Leap Wireless International, Inc. | Method of converting html/xml to hdml/wml in real-time for display on mobile devices |
US20020007379A1 (en) * | 2000-05-19 | 2002-01-17 | Zhi Wang | System and method for transcoding information for an audio or limited display user interface |
US7210099B2 (en) * | 2000-06-12 | 2007-04-24 | Softview Llc | Resolution independent vector display of internet content |
US7134073B1 (en) * | 2000-06-15 | 2006-11-07 | International Business Machines Corporation | Apparatus and method for enabling composite style sheet application to multi-part electronic documents |
KR20040041082A (en) * | 2000-07-24 | 2004-05-13 | 비브콤 인코포레이티드 | System and method for indexing, searching, identifying, and editing portions of electronic multimedia files |
US6455421B1 (en) * | 2000-07-31 | 2002-09-24 | Applied Materials, Inc. | Plasma treatment of tantalum nitride compound films formed by chemical vapor deposition |
AU2001283389A1 (en) * | 2000-08-18 | 2002-03-04 | Anderson Merchandisers Lp | System and method for an interactive shopping news and price information service |
US20020054090A1 (en) | 2000-09-01 | 2002-05-09 | Silva Juliana Freire | Method and apparatus for creating and providing personalized access to web content and services from terminals having diverse capabilities |
EP1346290A2 (en) * | 2000-09-29 | 2003-09-24 | Victor Hsieh | Online intelligent information comparison agent of multilingual electronic data sources over inter-connected computer networks |
US6871214B2 (en) | 2000-10-30 | 2005-03-22 | Nortel Networks Limited | Generating and providing alert messages in a communications network |
US7320107B2 (en) * | 2001-02-10 | 2008-01-15 | Samsung Electronics Co., Ltd. | Bookmark frame and method of launching browsers using the bookmark in an internet terminal |
US6828988B2 (en) * | 2001-02-27 | 2004-12-07 | Microsoft Corporation | Interactive tooltip |
US7120702B2 (en) * | 2001-03-03 | 2006-10-10 | International Business Machines Corporation | System and method for transcoding web content for display by alternative client devices |
US6934736B2 (en) * | 2001-03-15 | 2005-08-23 | Microsoft Corporation | Systems and methods for automatically generating cookies |
US7134075B2 (en) * | 2001-04-26 | 2006-11-07 | International Business Machines Corporation | Conversion of documents between XML and processor efficient MXML in content based routing networks |
US7219309B2 (en) * | 2001-05-02 | 2007-05-15 | Bitstream Inc. | Innovations for the display of web pages |
US6925481B2 (en) * | 2001-05-03 | 2005-08-02 | Symantec Corp. | Technique for enabling remote data access and manipulation from a pervasive device |
WO2002097667A2 (en) | 2001-05-31 | 2002-12-05 | Lixto Software Gmbh | Visual and interactive wrapper generation, automated information extraction from web pages, and translation into xml |
EP1271877B1 (en) * | 2001-06-22 | 2008-11-05 | Microsoft Corporation | Wireless browser |
US7051119B2 (en) * | 2001-07-12 | 2006-05-23 | Yahoo! Inc. | Method and system for enabling a script on a first computer to communicate and exchange data with a script on a second computer over a network |
US20030018668A1 (en) * | 2001-07-20 | 2003-01-23 | International Business Machines Corporation | Enhanced transcoding of structured documents through use of annotation techniques |
US6904569B1 (en) * | 2001-07-26 | 2005-06-07 | Gateway, Inc. | Link-level browser instance control |
US20030020747A1 (en) * | 2001-07-26 | 2003-01-30 | Aravinda Korala | User input peripheral device substitution for web commerce delivery |
EP1423805A2 (en) * | 2001-09-05 | 2004-06-02 | Danger, Inc. | Transcoding of telephone numbers to links in received web pages |
US6938077B2 (en) * | 2001-11-07 | 2005-08-30 | Microsoft Corporation | Client version advertisement service for overriding default client version properties |
US20030115365A1 (en) * | 2001-12-19 | 2003-06-19 | Teddy Lindsey | Transcoding information in a first markup language into a second markup language |
US8799464B2 (en) * | 2001-12-28 | 2014-08-05 | Motorola Mobility Llc | Multi-modal communication using a session specific proxy server |
EP1330098A1 (en) * | 2002-01-21 | 2003-07-23 | BRITISH TELECOMMUNICATIONS public limited company | Method and communication system for data web session transfer |
US20030154398A1 (en) * | 2002-02-08 | 2003-08-14 | Eaton Eric Thomas | System for providing continuity between session clients and method therefor |
US20030172186A1 (en) * | 2002-03-07 | 2003-09-11 | International Business Machines Coporation | Method, system and program product for transcoding content |
US20050066037A1 (en) | 2002-04-10 | 2005-03-24 | Yu Song | Browser session mobility system for multi-platform applications |
US6931405B2 (en) * | 2002-04-15 | 2005-08-16 | Microsoft Corporation | Flexible subscription-based event notification |
AU2003239385A1 (en) * | 2002-05-10 | 2003-11-11 | Richard R. Reisman | Method and apparatus for browsing using multiple coordinated device |
US6687485B2 (en) * | 2002-05-21 | 2004-02-03 | Thinksmark Performance Systems Llc | System and method for providing help/training content for a web-based application |
US7146400B2 (en) * | 2002-05-29 | 2006-12-05 | International Business Machines Corporation | Web and lotus notes adapter layers |
WO2003107174A1 (en) * | 2002-06-13 | 2003-12-24 | Cerisent Corporation | Xml database mixed structural-textual classification system |
US7266553B1 (en) * | 2002-07-01 | 2007-09-04 | Microsoft Corporation | Content data indexing |
KR100463835B1 (en) * | 2002-10-17 | 2004-12-29 | 한국전자통신연구원 | Index extraction method of web contents transcoding system for small display devices |
KR100461019B1 (en) | 2002-11-01 | 2004-12-09 | 한국전자통신연구원 | web contents transcoding system and method for small display devices |
US20040142683A1 (en) * | 2002-11-08 | 2004-07-22 | Matt Clark | Programming interface layer of a service provider for data service delivery |
US7949765B2 (en) * | 2002-12-02 | 2011-05-24 | Sap Aktiengesellschaft | Data structure for analyzing user sessions |
US7376913B1 (en) * | 2003-01-07 | 2008-05-20 | Vulcan Portals Inc. | Navigation and selection control for a hand-held portable computer |
GB2418509A (en) * | 2003-06-03 | 2006-03-29 | Forbes Holton Norris Iii | Flexible, dynamic menu-based web-page architecture |
US20050021851A1 (en) * | 2003-06-09 | 2005-01-27 | Kimmo Hamynen | System, apparatus, and method for directional control input browsing in smart phones |
CA2433375A1 (en) | 2003-06-25 | 2004-12-25 | Ibm Canada Limited - Ibm Canada Limitee | System and method for presenting personalized content on electronic commerce web pages |
US20070276911A1 (en) * | 2003-07-11 | 2007-11-29 | Soujanya Bhumkar | Method and System for Transferring Contact Information and Calendar Events to a Wireless Device Via E-Mail |
US7325204B2 (en) * | 2003-08-29 | 2008-01-29 | Yahoo! Inc. | Slideout windows |
US7558776B2 (en) * | 2004-01-27 | 2009-07-07 | United Services Automobile Association (Usaa) | System and method of managing internet browser navigation |
US7533144B2 (en) * | 2004-05-14 | 2009-05-12 | Hisham Kassab | Method of providing a web page with additional content inserted in an intermediate network entity (INE) platform |
US20060026216A1 (en) * | 2004-07-30 | 2006-02-02 | Mirra, Inc. | Server-assited communication among clients |
WO2006034038A2 (en) * | 2004-09-17 | 2006-03-30 | Become, Inc. | Systems and methods of retrieving topic specific information |
WO2006036781A2 (en) * | 2004-09-22 | 2006-04-06 | Perfect Market Technologies, Inc. | Search engine using user intent |
US7543069B2 (en) * | 2004-10-18 | 2009-06-02 | International Business Machines Corporation | Dynamically updating session state affinity |
US8145908B1 (en) * | 2004-10-29 | 2012-03-27 | Akamai Technologies, Inc. | Web content defacement protection system |
US20060143568A1 (en) * | 2004-11-10 | 2006-06-29 | Scott Milener | Method and apparatus for enhanced browsing |
US20060116893A1 (en) * | 2004-11-24 | 2006-06-01 | Carnes Joseph L | Apparatus and method of collecting and monitoring shipment data |
WO2006066411A1 (en) * | 2004-12-22 | 2006-06-29 | Research In Motion Limited | Handling attachment content on a mobile device |
US20060168547A1 (en) * | 2005-01-07 | 2006-07-27 | International Business Machines Corporation | Method and system for synchronizing content in multiple web browser windows |
US7805495B2 (en) * | 2005-03-31 | 2010-09-28 | Google Inc. | Method and system for transferring web browser data between web browsers |
US7895651B2 (en) * | 2005-07-29 | 2011-02-22 | Bit 9, Inc. | Content tracking in a network security system |
US7433888B2 (en) * | 2005-08-25 | 2008-10-07 | Microsoft Corporation | Schema packaging, distribution and availability |
GB2430507A (en) * | 2005-09-21 | 2007-03-28 | Stephen Robert Ives | System for managing the display of sponsored links together with search results on a mobile/wireless device |
US7912755B2 (en) * | 2005-09-23 | 2011-03-22 | Pronto, Inc. | Method and system for identifying product-related information on a web page |
US20070073593A1 (en) | 2005-09-28 | 2007-03-29 | Redcarpet, Inc. | Method and system for consumer price alerts |
US20080288600A1 (en) * | 2005-10-18 | 2008-11-20 | Clark Christopher M | Apparatus and method for providing access to associated data related to primary media data via email |
US7895604B2 (en) * | 2005-11-17 | 2011-02-22 | Opera Software Asa | Method and device for event communication between documents |
US20070208751A1 (en) * | 2005-11-22 | 2007-09-06 | David Cowan | Personalized content control |
NO325961B1 (en) * | 2005-12-05 | 2008-08-25 | Holte Bjoern | System, process and software arrangement to assist in navigation on the Internet |
US20070157304A1 (en) * | 2006-01-05 | 2007-07-05 | International Business Machines Corporation | Method, apparatus and computer program product for automatic cookie synchronization between distinct web browsers |
US20070180148A1 (en) * | 2006-02-02 | 2007-08-02 | Multimedia Abacus Corporation | Method and apparatus for creating scalable hi-fidelity HTML forms |
US7937458B2 (en) * | 2006-02-14 | 2011-05-03 | Nanamura Roberto N | On-demand software service system and method |
WO2007101182A2 (en) * | 2006-02-28 | 2007-09-07 | Maven Networks, Inc. | Systems and methods for delivering and managing media content downloaded to a network connected device |
US8739027B2 (en) * | 2006-03-01 | 2014-05-27 | Infogin, Ltd. | Methods and apparatus for enabling use of web content on various types of devices |
US8086698B2 (en) * | 2006-06-02 | 2011-12-27 | Google Inc. | Synchronizing configuration information among multiple clients |
US9195372B2 (en) * | 2006-06-28 | 2015-11-24 | Scenera Technologies, Llc | Methods, systems, and computer program products for grouping tabbed portion of a display object based on content relationships and user interaction levels |
US20080052381A1 (en) * | 2006-08-22 | 2008-02-28 | Koon Wah Yu | Method and system for selecting a transcoder to convert between content types |
NO325628B1 (en) * | 2006-09-20 | 2008-06-30 | Opera Software Asa | Procedure, computer program, transcoding server and computer system to modify a digital document |
US9128596B2 (en) * | 2006-09-22 | 2015-09-08 | Opera Software Asa | Method and device for selecting and displaying a region of interest in an electronic document |
US8001013B2 (en) * | 2006-12-18 | 2011-08-16 | Razz Serbanescu | System and method for electronic commerce and other uses |
US7739410B2 (en) * | 2007-01-07 | 2010-06-15 | Apple Inc. | Synchronization methods and systems |
KR20080078291A (en) * | 2007-02-23 | 2008-08-27 | 엘지전자 주식회사 | Method for displaying browser and terminal capable of implementing the same |
US20080256485A1 (en) * | 2007-04-12 | 2008-10-16 | Jason Gary Krikorian | User Interface for Controlling Video Programs on Mobile Computing Devices |
US8396881B2 (en) * | 2007-05-17 | 2013-03-12 | Research In Motion Limited | Method and system for automatically generating web page transcoding instructions |
US20090013085A1 (en) * | 2007-06-18 | 2009-01-08 | Hadas Liberman Ben-Ami | Interaction-management methods and platform for client-agent interaction-related environments |
US20090063690A1 (en) * | 2007-09-05 | 2009-03-05 | Motorola, Inc. | Continuing an application session using a different device from one that originally initiated the application session while preserving session while preserving session state and data |
US7873748B2 (en) * | 2007-11-30 | 2011-01-18 | International Business Machines Corporation | Synchronization of locally and remotely stored browser data |
US20110077032A1 (en) * | 2009-09-29 | 2011-03-31 | Harry Benjamin Correale | Restriction Method and Apparatus for Texting Based on Speed |
-
2008
- 2008-05-12 US US12/119,257 patent/US8396881B2/en not_active Expired - Fee Related
- 2008-05-12 EP EP08748301A patent/EP2158723A4/en not_active Withdrawn
- 2008-05-12 WO PCT/CA2008/000907 patent/WO2008141425A1/en active Application Filing
- 2008-05-12 EP EP08757083A patent/EP2156393A4/en not_active Withdrawn
- 2008-05-12 WO PCT/CA2008/000917 patent/WO2008141432A1/en active Application Filing
- 2008-05-12 CA CA2687483A patent/CA2687483C/en not_active Expired - Fee Related
- 2008-05-12 CA CA002687480A patent/CA2687480A1/en not_active Abandoned
- 2008-05-12 CA CA002687479A patent/CA2687479A1/en not_active Abandoned
- 2008-05-12 CA CA002687475A patent/CA2687475A1/en not_active Abandoned
- 2008-05-12 WO PCT/CA2008/000909 patent/WO2008141427A1/en active Application Filing
- 2008-05-12 WO PCT/CA2008/000914 patent/WO2008141429A1/en active Application Filing
- 2008-05-12 EP EP08748312A patent/EP2151090A4/en not_active Withdrawn
- 2008-05-12 US US12/119,317 patent/US20080289029A1/en not_active Abandoned
- 2008-05-12 WO PCT/CA2008/000908 patent/WO2008141426A1/en active Application Filing
- 2008-05-12 WO PCT/CA2008/000903 patent/WO2008141424A1/en active Application Filing
- 2008-05-12 EP EP08748305A patent/EP2151088A4/en not_active Withdrawn
- 2008-05-12 CA CA002687478A patent/CA2687478A1/en not_active Abandoned
- 2008-05-12 US US12/119,300 patent/US8037084B2/en not_active Expired - Fee Related
- 2008-05-12 US US12/119,251 patent/US20080288459A1/en not_active Abandoned
- 2008-05-12 CA CA2687530A patent/CA2687530C/en not_active Expired - Fee Related
- 2008-05-12 US US12/119,328 patent/US8572105B2/en not_active Expired - Fee Related
- 2008-05-12 WO PCT/CA2008/000918 patent/WO2008141433A1/en active Application Filing
- 2008-05-12 WO PCT/CA2008/000915 patent/WO2008141430A1/en active Application Filing
- 2008-05-12 CA CA2687473A patent/CA2687473C/en not_active Expired - Fee Related
- 2008-05-12 US US12/119,309 patent/US20080288486A1/en not_active Abandoned
- 2008-05-12 EP EP08748306A patent/EP2151089A4/en not_active Ceased
- 2008-05-12 EP EP08748311A patent/EP2156328A4/en not_active Withdrawn
- 2008-05-12 CA CA002687484A patent/CA2687484A1/en not_active Abandoned
- 2008-05-12 US US12/119,269 patent/US20080288449A1/en not_active Abandoned
- 2008-05-12 WO PCT/CA2008/000919 patent/WO2008141434A1/en active Application Filing
- 2008-05-12 CA CA2686292A patent/CA2686292A1/en not_active Abandoned
- 2008-05-12 US US12/119,097 patent/US20090235187A1/en not_active Abandoned
- 2008-05-12 EP EP08748313A patent/EP2151091A4/en not_active Ceased
- 2008-05-12 EP EP08757086A patent/EP2151093A4/en not_active Withdrawn
- 2008-05-12 CA CA002687497A patent/CA2687497A1/en not_active Abandoned
- 2008-05-12 WO PCT/CA2008/000916 patent/WO2008141431A1/en active Application Filing
- 2008-05-12 EP EP08757085A patent/EP2156606A4/en not_active Withdrawn
- 2008-05-12 US US12/119,221 patent/US20080288515A1/en not_active Abandoned
- 2008-05-12 US US12/119,338 patent/US20080288477A1/en not_active Abandoned
- 2008-05-12 EP EP08757084A patent/EP2151092A4/en not_active Ceased
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US924503A (en) | 1908-09-01 | 1909-06-08 | Ernst Otto Schneider | Cylindrical positive electrode. |
US6601100B2 (en) * | 1999-01-27 | 2003-07-29 | International Business Machines Corporation | System and method for collecting and analyzing information about content requested in a network (world wide web) environment |
US20020133484A1 (en) * | 1999-12-02 | 2002-09-19 | International Business Machines Corporation | Storing fragmented XML data into a relational database by decomposing XML documents with application specific mappings |
US20020120714A1 (en) * | 2001-02-26 | 2002-08-29 | Borislav Agapiev | Distributed-code, custom-generated dynamic internet inclusion agent |
US20020184266A1 (en) * | 2001-05-31 | 2002-12-05 | Blessin Stephen W. | Universal file format for products that allows both parametric and textual searching |
US20040249824A1 (en) * | 2003-06-05 | 2004-12-09 | International Business Machines Corporation | Semantics-bases indexing in a distributed data processing system |
US20060173985A1 (en) * | 2005-02-01 | 2006-08-03 | Moore James F | Enhanced syndication |
US20070038643A1 (en) * | 2005-08-09 | 2007-02-15 | Epstein Samuel S | Methods and apparatuses to assemble, extract and deploy content from electronic documents |
CA2622625A1 (en) * | 2005-09-14 | 2007-03-22 | O-Ya!, Inc. | Networked information indexing and search apparatus and method |
Non-Patent Citations (3)
Title |
---|
AGRAWAL N. ET AL.: "EShopMonitor: a web content monitoring tool", DATA ENGINEERING, 2004. PROCEEDINGS. 20TH INTERNATIONAL CONFERENCE ON, 30 March 2004 (2004-03-30), pages 817 - 820, XP010713566, DOI: doi:10.1109/ICDE.2004.1320055 |
See also references of EP2151088A4 * |
XIAO-LING W. ET AL.: "Enhancive index for structured document retrieval", PROCEEDINGS OF THE 12TH INTERNATIONAL WORKSHOP ON RESEARCH ISSUES IN DATA ENGINEERING: ENGINEERING E-COMMERCE/E-BUSINESS SYSTEMS, 24 February 2002 (2002-02-24) - 25 February 2002 (2002-02-25), pages 34 - 38, XP010588295 * |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8782281B2 (en) | 2004-03-23 | 2014-07-15 | Cisco Technology Inc. | Optimally adapting multimedia content for mobile subscriber device playback |
Also Published As
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20080288449A1 (en) | Method and system for an aggregate web site search database | |
US7620891B2 (en) | Method and system for improving presentation of HTML pages in web devices | |
US9311373B2 (en) | Taxonomy driven site navigation | |
US9436362B2 (en) | OData service provisioning on top of GenIL layer | |
US8943086B2 (en) | Model-based backend service adaptation of business objects | |
US20110119298A1 (en) | Method and apparatus for searching information | |
US20070061486A1 (en) | Systems and methods for creating customized applications | |
EP2657858A2 (en) | Odata service provisioning on top of genil layer | |
KR20130098589A (en) | Search engine selection system of homepage input bar and method thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 08748305 Country of ref document: EP Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2687478 Country of ref document: CA |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2008748305 Country of ref document: EP |