CN106033468A - Webpage content extracting method, device and system - Google Patents

Webpage content extracting method, device and system

Info

Publication number
CN106033468A
Authority
CN
China
Prior art keywords
extraction
agreement
webpage
page
content
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510124714.8A
Other languages
Chinese (zh)
Inventor
罗鑫骥
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd
Priority to CN201510124714.8A
Publication of CN106033468A
Legal status: Pending

Landscapes

  • Information Transfer Between Computers (AREA)

Abstract

The invention relates to a webpage content extraction method, device and system. The method comprises the following steps: an extraction system receives a webpage URL extraction request sent by a business layer, wherein the request carries a template protocol file of webpage extraction fields configured by the business layer; according to the request, the extraction system invokes a web crawler system to crawl the original page content specified by the URL; and, using the template protocol file as the matching standard, the extraction system extracts from the original page content and returns the extracted content to the business layer. The method makes full use of the back end's ability to crawl webpages and, by parsing both the original webpage and the extraction template, extracts the content of specified tags from the original webpage. The method, device and system are suitable for extracting specified tag content from webpages of all Web page formats, thereby improving the ability to extract original webpages and the flexibility of webpage content extraction.

Description

Webpage content extraction method, apparatus and system
Technical field
The present invention relates to the field of network technology, and in particular to a webpage content extraction method, apparatus and system.
Background
With the continuing growth in consumer demand for browsers on smart devices, user demand for mobile terminal browser products is also gradually increasing. While the search performance and user experience of the browser itself are being improved, there is also an urgent need to develop the browser's content aggregation capability. Users place ever more requirements on the aggregation of product content in the browser; for products such as novel bookshelves and popular videos, the content aggregation capability has become a key factor in determining whether the product content is of high quality. The aggregation of content for vertical products generally depends on crawling and extracting the Web site content of the related products, so the ability to extract original webpages is the foundation of the content aggregation capability. At present, however, extraction of the content of specified webpage tags from an original webpage generally depends on the format of the webpage object being extracted, which reduces the flexibility of webpage content extraction.
Summary of the invention
Embodiments of the present invention provide a webpage content extraction method, apparatus and system, with the aim of improving the ability to extract original webpages and the flexibility of webpage content extraction.
An embodiment of the present invention proposes a webpage content extraction method, comprising:
an extraction system receiving a webpage URL extraction request sent by a business layer, the request carrying a template protocol file of webpage extraction fields configured by the business layer;
according to the webpage URL extraction request, invoking a web crawler system to crawl the original page content specified by the URL;
using the template protocol file as the matching standard, extracting from the original page content, and returning the extracted content to the business layer.
An embodiment of the present invention also proposes a webpage content extraction apparatus, comprising:
a receiving module, configured to receive a webpage URL extraction request sent by a business layer, the request carrying a template protocol file of webpage extraction fields configured by the business layer;
a calling module, configured to invoke, according to the webpage URL extraction request, a web crawler system to crawl the original page content specified by the URL;
an extraction module, configured to extract from the original page content using the template protocol file as the matching standard, and to return the extracted content to the business layer.
An embodiment of the present invention also proposes a webpage content extraction system, comprising a business layer, a crawler system and an extraction system, wherein:
the business layer is configured to send a webpage URL extraction request to the extraction system, the request carrying a template protocol file of webpage extraction fields configured by the business layer;
the extraction system comprises the apparatus described above;
the crawler system is configured to crawl, according to a call instruction from the extraction system, the original page content specified by the URL from a third-party website, and to return it to the extraction system.
In the webpage content extraction method, apparatus and system proposed by the embodiments of the present invention, the business layer sends a webpage URL extraction request to the extraction system; according to the request, the extraction system invokes the web crawler system to crawl the original page content specified by the URL; and, using the template protocol file as the matching standard, the extraction system extracts from the original page content and returns the extracted content to the business layer. The present invention makes full use of the back end's ability to crawl webpages and, by parsing the original webpage and the extraction template, achieves the ability to extract the content of specified tags from the original webpage. The scheme is suitable for extracting specified webpage tag content from all Web page formats, improving the ability to extract original webpages and the flexibility of webpage content extraction.
Brief description of the drawings
Fig. 1 is a schematic diagram of the system architecture involved in the scheme of the embodiments of the present invention;
Fig. 2 is a schematic diagram of the specific fields of an extraction template according to an embodiment of the present invention;
Fig. 3 is a schematic diagram of the hardware structure of the mobile terminal involved in the scheme of the embodiments of the present invention;
Fig. 4 is a schematic diagram of a wireless communication system for the mobile terminal shown in Fig. 3;
Fig. 5 is a schematic flowchart of a first embodiment of the webpage content extraction method of the present invention;
Fig. 6 is a schematic flowchart of a second embodiment of the webpage content extraction method of the present invention;
Fig. 7 is a functional block diagram of a preferred embodiment of the webpage content extraction apparatus of the present invention.
To make the technical solution of the present invention clearer, it is described in further detail below with reference to the accompanying drawings.
Detailed description
It should be understood that the specific embodiments described herein are intended only to explain the present invention and not to limit it.
The core idea of the scheme of the embodiments of the present invention is as follows: the business layer sends a webpage URL extraction request to the extraction system, the request carrying a template protocol file of webpage extraction fields configured by the business layer; according to the request, the extraction system invokes the web crawler system to crawl the original page content specified by the URL; and, using the template protocol file as the matching standard, the extraction system extracts from the original page content and returns the extracted content to the business layer. The present invention makes full use of the back end's ability to crawl webpages and, by parsing the original webpage and the extraction template, achieves the ability to extract the content of specified tags from the original webpage. The scheme is suitable for extracting specified webpage tag content from all Web page formats, improving the ability to extract original webpages and the flexibility of webpage content extraction.
The embodiments of the present invention take into account that, at present, extraction of the content of specified webpage tags from an original webpage generally depends on the format of the webpage object being extracted, which reduces the flexibility of webpage content extraction.
The scheme of the embodiments of the present invention does not depend on that format: the template protocol file configured by the business layer serves as the matching standard, so the same extraction flow applies to webpages of any Web page format.
Specifically, as shown in Fig. 1, the system architecture involved in the scheme of the embodiments of the present invention may include a business layer, an extraction system, a crawler system and a third-party website, wherein:
the business layer determines the page fields to be extracted, generates a template protocol file of webpage extraction fields, and sends a webpage URL extraction request to the extraction system, the request carrying the template protocol file configured by the business layer;
the extraction system is configured to invoke, according to the webpage URL extraction request, the web crawler system to crawl the original page content specified by the URL;
the crawler system is configured to crawl, according to the call instruction from the extraction system, the original page content specified by the URL from the third-party website, and to return it to the extraction system;
the extraction system, using the template protocol file as the matching standard, extracts from the original page content and returns the extracted content to the business layer.
In this way the back end's ability to crawl webpages is fully used, while parsing the original webpage and the extraction template achieves the ability to extract the content of specified tags from the original webpage. The scheme is suitable for extracting specified webpage tag content from all Web page formats, improving the ability to extract original webpages and the flexibility of webpage content extraction.
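The division of roles described above can be sketched in a few lines of Python. This is only an illustrative sketch under stated assumptions, not the patented implementation: the names `ExtractionRequest`, `ExtractionSystem` and `http_crawl` are hypothetical, the crawler is reduced to a single HTTP fetch, and the extraction step is passed in as a function so that the sketch stays independent of any particular template format.

```python
import urllib.request
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass
class ExtractionRequest:
    """Request sent by the business layer (hypothetical field names)."""
    url: str        # URL of the page to extract from
    template: str   # template protocol file configured by the business layer


def http_crawl(url: str, timeout: float = 10.0) -> str:
    """Stand-in for the crawler system: fetch the original page over HTTP."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        charset = resp.headers.get_content_charset() or "utf-8"
        return resp.read().decode(charset, errors="replace")


class ExtractionSystem:
    """Receives a request, calls the crawler, then extracts and returns."""

    def __init__(self,
                 crawl: Callable[[str], str],
                 extract: Callable[[str, str], Dict[str, str]]):
        self.crawl = crawl      # crawler system (injected)
        self.extract = extract  # template-driven extraction (injected)

    def handle(self, request: ExtractionRequest) -> Dict[str, str]:
        page = self.crawl(request.url)                # crawl the original page content
        return self.extract(page, request.template)  # template is the matching standard
```

In practice `ExtractionSystem(http_crawl, some_extract_function)` would wire in a real fetch; injecting both callables also makes it easy to exercise the flow with canned pages instead of a live third-party website.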
The specific processing flow is as follows:
1.1 According to its requirements, the business side determines the page tag fields to be extracted, generates the template protocol file of webpage extraction fields, and verifies the correctness of the template protocol file, where the template protocol is a document protocol that defines the identifiers of the fields to be extracted. The business side then sends a webpage URL extraction request, and the flow for extracting the specified content begins.
1.2 After the extraction system receives the parameters of the URL extraction request, it invokes the web crawler system to crawl the page data for the requested URL from the third-party website; only after the page data for the specified URL has been obtained can extraction of the content of the fields configured in the template begin.
1.3 After the crawler system receives the URL parameter specifying what to crawl, it crawls the page data from the third-party website over the HTTP protocol.
1.4 The crawler system crawls the original page data.
1.5 The crawler system returns the page data to the extraction system. Having obtained the original page information returned by the crawler system, the extraction system parses the original page and the template protocol that defines the extraction fields, and extracts the content of the fields configured in the specified template by matching.
Refer to Fig. 2, which is a schematic diagram of the specific fields of an extraction template according to an embodiment of the present invention; the left-hand part shows the original page and the right-hand part shows the extraction template protocol file.
As shown in Fig. 2, the extraction system first restores the page data returned by the crawl into a standard DOM tree (a DOM tree is the Document Object Model; in the present invention it denotes the object tree obtained after the HTML is formatted, which is convenient for a computer to traverse and access). It then likewise restores the configured template protocol file into a standard DOM tree, to make comparison with the elements of the original page's DOM tree convenient. Once both steps have completed successfully, the DOM tree generated from the template protocol configuration is traversed recursively and compared in turn against the DOM tree of the original page; the template configuration protocol file must be the matching standard here, since only then is greater compatibility possible. When a special extraction identifier is encountered (represented in the present invention by the " " symbol), the content at the same level of the original page is saved. This continues until a match fails or the traversal finishes and exits.
1.6 Finally, the extraction system returns the identifiers and content that have been extracted to the calling business layer, completing the whole extraction flow.
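The matching procedure described for Fig. 2 can be illustrated with a short Python sketch. It is a simplified illustration under several assumptions and not the implementation from the patent: the marker symbol is not legible in this text, so a hypothetical `$name` marker is used; the tree builder assumes well-nested markup without unclosed void tags; and template children are compared with page children strictly by position.

```python
from html.parser import HTMLParser

MARKER = "$"  # hypothetical extraction marker; the source's own symbol is not legible


class Node:
    def __init__(self, tag, parent=None):
        self.tag = tag
        self.parent = parent
        self.children = []  # child element nodes, in document order
        self.text = ""      # direct text content of this element


class TreeBuilder(HTMLParser):
    """Build a minimal element tree (assumes well-nested markup)."""

    def __init__(self):
        super().__init__()
        self.root = Node("#root")
        self.current = self.root

    def handle_starttag(self, tag, attrs):
        node = Node(tag, self.current)
        self.current.children.append(node)
        self.current = node

    def handle_endtag(self, tag):
        if self.current.parent is not None:
            self.current = self.current.parent

    def handle_data(self, data):
        self.current.text += data.strip()


def parse(markup):
    builder = TreeBuilder()
    builder.feed(markup)
    builder.close()
    return builder.root


def match(template, page, out):
    """Walk the template tree recursively, comparing it with the page tree."""
    if template.tag != page.tag:
        return False                                  # match error: stop
    if template.text.startswith(MARKER):
        # Marker found: save the page content at the same level.
        out[template.text[len(MARKER):]] = page.text
    if len(template.children) > len(page.children):
        return False
    return all(match(t, p, out)
               for t, p in zip(template.children, page.children))


def extract(page_markup, template_markup):
    """Return the extracted fields, or None if the template does not match."""
    fields = {}
    ok = match(parse(template_markup), parse(page_markup), fields)
    return fields if ok else None
```

Because the template tree drives the traversal, page elements that the template does not mention (for example trailing siblings) are simply ignored, which is one way to read the source's remark that using the template as the matching standard gives better compatibility.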
As can be seen from the above flow, the scheme of this embodiment builds on the webpage crawling capability of an existing crawler system. At the same time, the flow separates the business module from the development module, and the extraction system is completely transparent to the business layer: the business side only needs to pay attention to the fields it requires and builds the extraction task by configuring the extraction template protocol file, after which the business layer can retrieve the extracted fields directly, which greatly improves overall efficiency.
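Step 1.1 has the business side verify the template protocol file before sending a request, but the source does not say what that check involves. The sketch below shows one plausible set of checks, purely as an assumption: tags are balanced, at least one extraction field is defined, and field names are unique. The `$name` marker and the function name `validate_template` are hypothetical.

```python
from html.parser import HTMLParser

MARKER = "$"  # hypothetical extraction marker, not taken from the source


class TemplateChecker(HTMLParser):
    """Collect field names and tag-nesting errors while parsing a template."""

    def __init__(self):
        super().__init__()
        self.stack = []   # currently open tags
        self.fields = []  # extraction field names found so far
        self.errors = []

    def handle_starttag(self, tag, attrs):
        self.stack.append(tag)

    def handle_endtag(self, tag):
        if not self.stack or self.stack.pop() != tag:
            self.errors.append(f"unexpected closing tag </{tag}>")

    def handle_data(self, data):
        text = data.strip()
        if text.startswith(MARKER):
            self.fields.append(text[len(MARKER):])


def validate_template(template):
    """Return a list of problems; an empty list means the template passed."""
    checker = TemplateChecker()
    checker.feed(template)
    checker.close()
    errors = list(checker.errors)
    errors += [f"unclosed tag <{tag}>" for tag in checker.stack]
    if not checker.fields:
        errors.append("no extraction field defined")
    if len(set(checker.fields)) != len(checker.fields):
        errors.append("duplicate field name")
    return errors
```

Running such a check on the business side keeps malformed templates from reaching the extraction system, which fits the separation of business and development concerns described above.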
Compared with the prior art, the present invention makes full use of the back end's ability to crawl webpages and, by parsing the original webpage and the extraction template, achieves the ability to extract the content of specified tags from the original webpage. The scheme is suitable for extracting specified webpage tag content from all Web page formats, improving the ability to extract original webpages and the flexibility of webpage content extraction. The present invention can meet business extraction needs quickly and conveniently, and because it is highly flexible the cost of change is very small: only the template configuration protocol file needs to be changed.
It should be noted that the functions of the above extraction system may be integrated, in the form of client software, into a webpage content extraction apparatus. The apparatus may be hosted on a PC or on various mobile terminals such as mobile phones, tablet computers and portable handheld devices. The embodiments of the present invention are illustrated with a mobile terminal: the mobile terminal provides the user with an application operation interface through the client software and, according to the user's corresponding operations, performs webpage content extraction, thereby achieving the ability to extract specified tag content from original webpages and improving the flexibility of webpage content extraction.
First, a mobile terminal implementing the embodiments of the present invention will be described with reference to the accompanying drawings. In the following description, suffixes such as "module", "component" or "unit" used to denote elements are used only to facilitate the description of the present invention and have no specific meaning in themselves. Therefore, "module" and "component" may be used interchangeably.
The mobile terminal may be implemented in various forms. For example, the terminal described in the present invention may include mobile terminals such as mobile phones, smart phones, notebook computers, digital broadcast receivers, PDAs (personal digital assistants), PADs (tablet computers), PMPs (portable media players) and navigation devices, as well as fixed terminals such as digital TVs and desktop computers.
In the following, it is assumed that the terminal is a mobile terminal. However, those skilled in the art will understand that, apart from elements used specifically for mobile purposes, the configuration according to the embodiments of the present invention can also be applied to fixed-type terminals.
Fig. 3 is a schematic diagram of the hardware structure of a mobile terminal implementing the embodiments of the present invention.
The mobile terminal 100 may include a wireless communication unit 110, an A/V (audio/video) input unit 120, a user input unit 130, a sensing unit 140, an output unit 150, a memory 160, an interface unit 170, a controller 180, a power supply unit 190, and so on.
Fig. 3 shows a mobile terminal with various components, but it should be understood that not all of the illustrated components are required. More or fewer components may be implemented instead. The elements of the mobile terminal are described in detail below.
The wireless communication unit 110 generally includes one or more components that allow radio communication between the mobile terminal 100 and a wireless communication system or network. For example, the wireless communication unit may include at least one of a broadcast receiving module 111, a mobile communication module 112, a wireless Internet module 113, a short-range communication module 114 and a position information module 115.
The broadcast receiving module 111 receives broadcast signals and/or broadcast-related information from an external broadcast management server via a broadcast channel. The broadcast channel may include a satellite channel and/or a terrestrial channel. The broadcast management server may be a server that generates and transmits broadcast signals and/or broadcast-related information, or a server that receives previously generated broadcast signals and/or broadcast-related information and transmits them to the terminal. The broadcast signals may include TV broadcast signals, radio broadcast signals, data broadcast signals and the like, and may further include broadcast signals combined with TV or radio broadcast signals. The broadcast-related information may also be provided via a mobile communication network, in which case it may be received by the mobile communication module 112. The broadcast-related information may exist in various forms; for example, it may take the form of an electronic program guide (EPG) of Digital Multimedia Broadcasting (DMB) or an electronic service guide (ESG) of Digital Video Broadcasting-Handheld (DVB-H). The broadcast receiving module 111 may receive broadcast signals by using various types of broadcast systems. In particular, it may receive digital broadcasts by using digital broadcast systems such as Digital Multimedia Broadcasting-Terrestrial (DMB-T), Digital Multimedia Broadcasting-Satellite (DMB-S), Digital Video Broadcasting-Handheld (DVB-H), the MediaFLO® forward-link-only data broadcast system and Integrated Services Digital Broadcasting-Terrestrial (ISDB-T). The broadcast receiving module 111 may be configured to suit the various broadcast systems that provide broadcast signals as well as the digital broadcast systems mentioned above. Broadcast signals and/or broadcast-related information received via the broadcast receiving module 111 may be stored in the memory 160 (or another type of storage medium).
The mobile communication module 112 transmits radio signals to, and/or receives radio signals from, at least one of a base station (for example, an access point or Node B), an external terminal and a server. Such radio signals may include voice call signals, video call signals, or various types of data transmitted and/or received in connection with text and/or multimedia messages.
The wireless Internet module 113 supports wireless Internet access for the mobile terminal. The module may be internally or externally coupled to the terminal. The wireless Internet access technologies involved may include WLAN (wireless LAN, Wi-Fi), WiBro (wireless broadband), WiMAX (Worldwide Interoperability for Microwave Access), HSDPA (High-Speed Downlink Packet Access) and the like.
The short-range communication module 114 is a module for supporting short-range communication. Some examples of short-range communication technology include Bluetooth™, radio-frequency identification (RFID), Infrared Data Association (IrDA), ultra-wideband (UWB), ZigBee™ and the like.
The position information module 115 is a module for checking or obtaining the position information of the mobile terminal. A typical example of the position information module is GPS (Global Positioning System). According to current technology, the GPS module 115 calculates distance information from three or more satellites together with accurate time information and applies triangulation to the calculated information, thereby accurately calculating three-dimensional current position information in terms of longitude, latitude and altitude. Currently, the method for calculating position and time information uses three satellites and corrects the error of the calculated position and time information by using one additional satellite. In addition, the GPS module 115 can calculate speed information by continuously calculating the current position information in real time.
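The last point, deriving speed from successive position fixes, can be illustrated with a small calculation. This is a generic sketch rather than anything specified in the source: it assumes each fix is a hypothetical `(latitude, longitude, timestamp_seconds)` tuple, ignores altitude, and uses the haversine great-circle distance with a mean Earth radius.

```python
import math

EARTH_RADIUS_M = 6_371_000.0  # mean Earth radius in metres (approximation)


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in metres between two points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def speed_mps(fix_a, fix_b):
    """Average speed in m/s between two fixes (lat, lon, timestamp_seconds)."""
    dt = fix_b[2] - fix_a[2]
    if dt <= 0:
        raise ValueError("fixes must be in increasing time order")
    return haversine_m(fix_a[0], fix_a[1], fix_b[0], fix_b[1]) / dt
```

Applying `speed_mps` to each consecutive pair of fixes gives a running speed estimate; a real receiver would typically also smooth the result and account for altitude.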
The A/V input unit 120 is used to receive audio or video signals. The A/V input unit 120 may include a camera 121 and a microphone 1220. The camera 121 processes image data of still pictures or video obtained by an image capture device in a video capture mode or an image capture mode. The processed image frames may be displayed on the display unit 151. The image frames processed by the camera 121 may be stored in the memory 160 (or another storage medium) or transmitted via the wireless communication unit 110, and two or more cameras 1210 may be provided depending on the structure of the mobile terminal. The microphone 122 can receive sound (audio data) in operating modes such as a phone call mode, a recording mode and a voice recognition mode, and can process such sound into audio data. In the phone call mode, the processed audio (voice) data may be converted into a format that can be transmitted to a mobile communication base station via the mobile communication module 112 and output. The microphone 122 may implement various types of noise cancellation (or suppression) algorithms to cancel (or suppress) noise or interference generated while receiving and transmitting audio signals.
The user input unit 130 may generate key input data according to commands entered by the user in order to control various operations of the mobile terminal. The user input unit 130 allows the user to enter various types of information and may include a keypad, a dome switch, a touch pad (for example, a touch-sensitive component that detects changes in resistance, pressure, capacitance and so on caused by being touched), a jog wheel, a jog switch and the like. In particular, when the touch pad is superimposed on the display unit 151 as a layer, a touch screen may be formed.
The sensing unit 140 detects the current state of the mobile terminal 100 (for example, whether the mobile terminal 100 is open or closed), the position of the mobile terminal 100, the presence or absence of user contact with the mobile terminal 100 (that is, touch input), the orientation of the mobile terminal 100, the acceleration or deceleration movement and direction of the mobile terminal 100, and so on, and generates commands or signals for controlling the operation of the mobile terminal 100. For example, when the mobile terminal 100 is implemented as a slide-type mobile phone, the sensing unit 140 may sense whether the slide-type phone is open or closed. In addition, the sensing unit 140 can detect whether the power supply unit 190 is supplying power or whether the interface unit 170 is coupled to an external device. The sensing unit 140 may include a proximity sensor 1410, which will be described below in connection with the touch screen.
The interface unit 170 serves as an interface through which at least one external device can connect to the mobile terminal 100. For example, the external devices may include wired or wireless headset ports, external power supply (or battery charger) ports, wired or wireless data ports, memory card ports, ports for connecting devices having an identification module, audio input/output (I/O) ports, video I/O ports, earphone ports and the like. The identification module may store various information for verifying the user's authority to use the mobile terminal 100 and may include a user identity module (UIM), a subscriber identity module (SIM), a universal subscriber identity module (USIM) and the like. In addition, a device having an identification module (hereinafter referred to as an "identification device") may take the form of a smart card; the identification device can therefore be connected to the mobile terminal 100 via a port or other connection means. The interface unit 170 may be used to receive input (for example, data, power and so on) from an external device and transfer the received input to one or more elements within the mobile terminal 100, or may be used to transfer data between the mobile terminal and an external device.
In addition, when the mobile terminal 100 is connected to an external cradle, the interface unit 170 may serve as a path through which power is supplied from the cradle to the mobile terminal 100, or as a path through which various command signals input from the cradle are transferred to the mobile terminal. The various command signals or the power input from the cradle may serve as signals for recognizing whether the mobile terminal is accurately mounted on the cradle. The output unit 150 is configured to provide output signals (for example, audio signals, video signals, alarm signals, vibration signals and so on) in a visual, audible and/or tactile manner. The output unit 150 may include a display unit 151, an audio output module 152, an alarm unit 153 and the like.
The display unit 151 may display information processed in the mobile terminal 100. For example, when the mobile terminal 100 is in a phone call mode, the display unit 151 may display a user interface (UI) or graphical user interface (GUI) related to the call or to other communication (for example, text messaging, multimedia file download and so on). When the mobile terminal 100 is in a video call mode or an image capture mode, the display unit 151 may display captured and/or received images, or a UI or GUI showing the video or images and related functions.
Meanwhile, when the display unit 151 and the touch pad are superimposed on each other as layers to form a touch screen, the display unit 151 can serve as both an input device and an output device. The display unit 151 may include at least one of a liquid crystal display (LCD), a thin-film-transistor LCD (TFT-LCD), an organic light-emitting diode (OLED) display, a flexible display, a three-dimensional (3D) display and the like. Some of these displays may be constructed to be transparent so that the user can see through them from the outside; these may be called transparent displays, a typical example being a TOLED (transparent organic light-emitting diode) display. Depending on the particular embodiment desired, the mobile terminal 100 may include two or more display units (or other display devices); for example, the mobile terminal may include an external display unit (not shown) and an internal display unit (not shown). The touch screen can be used to detect touch input pressure as well as touch input position and touch input area.
The audio output module 152 may, when the mobile terminal is in a call signal reception mode, a call mode, a recording mode, a voice recognition mode, a broadcast reception mode or the like, convert audio data received by the wireless communication unit 110 or stored in the memory 160 into an audio signal and output it as sound. The audio output module 152 may also provide audio output related to a specific function performed by the mobile terminal 100 (for example, a call signal reception sound, a message reception sound and so on). The audio output module 152 may include a speaker, a buzzer and the like.
The alarm unit 153 may provide output to notify the mobile terminal 100 of the occurrence of an event. Typical events include call reception, message reception, key signal input, touch input and so on. In addition to audio or video output, the alarm unit 153 may provide output in other ways to notify of the occurrence of an event. For example, the alarm unit 153 may provide output in the form of vibration: when a call, a message or some other incoming communication is received, the alarm unit 153 may provide tactile output (that is, vibration) to notify the user. With such tactile output, the user can recognize the occurrence of various events even when the mobile phone is in the user's pocket. The alarm unit 153 may also provide output notifying of the occurrence of an event via the display unit 151 or the audio output module 152.
The memory 160 may store software programs for the processing and control operations performed by the controller 180, or may temporarily store data that has been output or is to be output (for example, a phone book, messages, still images, video and so on). The memory 160 may also store data about the various patterns of vibration and audio signals that are output when a touch is applied to the touch screen.
The memory 160 may include at least one type of storage medium, including flash memory, a hard disk, a multimedia card, card-type memory (for example, SD or DX memory), random access memory (RAM), static random access memory (SRAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), programmable read-only memory (PROM), magnetic memory, a magnetic disk, an optical disc and the like. The mobile terminal 100 may also cooperate with a network storage device that performs the storage function of the memory 160 over a network connection.
The controller 180 generally controls the overall operation of the mobile terminal. For example, the controller 180 performs the control and processing related to voice calls, data communication, video calls and the like. In addition, the controller 180 may include a multimedia module 1810 for reproducing (or playing back) multimedia data; the multimedia module 1810 may be built into the controller 180 or may be constructed separately from the controller 180. The controller 180 may perform pattern recognition processing to recognize handwriting input or picture-drawing input performed on the touch screen as characters or images.
The power supply unit 190 receives external or internal power under the control of the controller 180 and provides the appropriate power required to operate each element and component.
Various embodiment described herein can with use such as computer software, hardware or its Any combination of computer-readable medium is implemented.Hardware is implemented, enforcement described herein Mode can by use application-specific IC (ASIC), digital signal processor (DSP), Digital signal processing device (DSPD), programmable logic device (PLD), field programmable gate Array (FPGA), processor, controller, microcontroller, microprocessor, it is designed to hold At least one in the electronic unit of row function described herein is implemented, in some cases, Such embodiment can be implemented in controller 180.Software is implemented, such as process Or the embodiment of function can with allow to perform the softest of at least one function or operation Part module is implemented.Software code can be answered by the software write with any suitable programming language Implementing by program (or program), software code can be stored in memorizer 160 and by controlling Device 180 performs.
So far, oneself is through describing mobile terminal according to its function.
Below, for the sake of brevity, such as folded form, board-type, oscillating-type, cunning will be described Slide type mobile terminal conduct in various types of mobile terminals of ejector half mobile terminal etc. Example.Therefore, the present invention can be applied to any kind of mobile terminal, and is not limited to sliding Ejector half mobile terminal.
The mobile terminal 100 shown in Fig. 3 can be configured to operate with wired and wireless communication systems, as well as satellite-based communication systems, that transmit data via frames or packets.
A communication system in which the mobile terminal according to the present invention can operate is now described with reference to Fig. 4.
Such communication systems can use different air interfaces and/or physical layers. For example, the air interfaces used by communication systems include frequency division multiple access (FDMA), time division multiple access (TDMA), code division multiple access (CDMA), universal mobile telecommunications system (UMTS) (in particular, long term evolution (LTE)), global system for mobile communications (GSM), and so on. As a non-limiting example, the following description relates to a CDMA communication system, but the teachings apply equally to other types of systems.
With reference to Fig. 4, a CDMA wireless communication system can include multiple mobile terminals 100, multiple base stations (BS) 270, base station controllers (BSC) 275, and a mobile switching center (MSC) 280. The MSC 280 is configured to interface with a public switched telephone network (PSTN) 290. The MSC 280 is also configured to interface with the BSCs 275, which can be coupled to the base stations 270 via backhaul links. The backhaul links can be constructed according to any of several known interfaces, including, for example, E1/T1, ATM, IP, PPP, frame relay, HDSL, ADSL, or xDSL. It will be understood that the system shown in Fig. 4 can include multiple BSCs 275.
Each BS 270 can serve one or more sectors (or regions), each sector being covered by an omnidirectional antenna or by an antenna pointed in a particular direction radially away from the BS 270. Alternatively, each sector can be covered by two or more antennas for diversity reception. Each BS 270 can be configured to support multiple frequency assignments, each frequency assignment having a particular spectrum (for example, 1.25 MHz, 5 MHz, and so on).
The intersection of a sector and a frequency assignment can be referred to as a CDMA channel. The BS 270 can also be referred to as a base transceiver subsystem (BTS) or by other equivalent terms. In that case, the term "base station" can be used to refer collectively to a single BSC 275 and at least one BS 270. A base station can also be referred to as a "cell site". Alternatively, the individual sectors of a particular BS 270 can be referred to as multiple cell sites.
As shown in Fig. 4, a broadcast transmitter (BT) 295 sends broadcast signals to the mobile terminals 100 operating within the system. The broadcast reception module 111 shown in Fig. 3 is provided at the mobile terminal 100 to receive the broadcast signals sent by the BT 295. Fig. 4 also shows several global positioning system (GPS) satellites 300. The satellites 300 help locate at least one of the multiple mobile terminals 100.
Although Fig. 4 depicts multiple satellites 300, it will be understood that useful positioning information can be obtained with any number of satellites. The GPS module 115 shown in Fig. 3 is typically configured to cooperate with the satellites 300 to obtain the desired positioning information. Instead of or in addition to GPS tracking technology, other technologies capable of tracking the position of the mobile terminal can be used. In addition, at least one GPS satellite 300 can optionally or additionally handle satellite DMB transmissions.
In a typical operation of the wireless communication system, the BS 270 receives reverse-link signals from the various mobile terminals 100. The mobile terminals 100 typically engage in calls, messaging, and other types of communication. Each reverse-link signal received by a particular base station 270 is processed within that BS 270. The resulting data is forwarded to the associated BSC 275. The BSC provides call resource allocation and mobility management functions, including coordination of soft handoff procedures between BSs 270. The BSC 275 also routes the received data to the MSC 280, which provides additional routing services for interfacing with the PSTN 290. Similarly, the PSTN 290 interfaces with the MSC 280, the MSC interfaces with the BSCs 275, and the BSCs 275 in turn control the BSs 270 to send forward-link signals to the mobile terminals 100.
Based on the above system architecture, mobile terminal hardware structure, and communication system, the embodiments of the webpage content extraction method of the present invention are proposed.
As shown in Fig. 5, the first embodiment of the present invention proposes a webpage content extraction method, including:
Step S101: the business layer sends a webpage extraction URL request to the extraction system, the request carrying a template protocol file, configured by the business layer, for the webpage fields to be extracted;
Step S102: according to the webpage extraction URL request, the extraction system invokes the web crawler system to crawl the original page content specified by the URL;
Step S103: using the template protocol file as the matching standard, the extraction system extracts from the original page content and returns the extracted content to the business layer.
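To make step S101 concrete, the request sent by the business layer can be pictured as a small payload that carries the URL together with the template protocol file. The sketch below is illustrative only: the class name `ExtractionRequest`, the field names `url` and `template`, and the use of JSON as the wire format are all assumptions, since the text does not specify how the request is encoded.

```python
import json
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class ExtractionRequest:
    """Hypothetical payload for a webpage extraction URL request."""

    url: str       # page that the crawler system should fetch
    template: str  # content of the template protocol file configured by the business layer

    def to_json(self) -> str:
        # Serialize for sending from the business layer to the extraction system.
        return json.dumps(asdict(self), ensure_ascii=False)

    @classmethod
    def from_json(cls, payload: str) -> "ExtractionRequest":
        # Parse on the extraction system side and reject incomplete requests.
        data = json.loads(payload)
        url = data.get("url")
        template = data.get("template")
        if not isinstance(url, str) or not url:
            raise ValueError("request is missing a URL")
        if not isinstance(template, str) or not template:
            raise ValueError("request is missing a template protocol file")
        return cls(url=url, template=template)
```

Carrying the template inside the request, rather than storing it in the extraction system, matches the flow described above in which the business layer alone decides which fields are extracted.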
Specifically, the business side first determines, according to its requirements, the page tag fields to be extracted, generates the template protocol file for the webpage extraction fields, and verifies the correctness of the template protocol file. Here, the template protocol is a document protocol that defines the extraction field identifiers. The business side then sends the webpage extraction URL request, and the flow of extracting the specified content begins.
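The text does not say how the business side verifies the template protocol file. One plausible reading is a structural check before the request is sent. The sketch below assumes the template is an HTML fragment and that extraction fields are written with a hypothetical `{{name}}` marker (the actual identifier symbol is not legible in this text); `validate_template` is an illustrative name. It reports unbalanced tags, a template with no markers, and duplicate marker names.

```python
import re
from html.parser import HTMLParser
from typing import List

MARKER = re.compile(r"\{\{(\w+)\}\}")  # hypothetical marker syntax, not from the patent
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class _BalanceChecker(HTMLParser):
    """Tracks open tags and records end tags that do not match."""

    def __init__(self):
        super().__init__()
        self.open_tags: List[str] = []
        self.errors: List[str] = []

    def handle_starttag(self, tag, attrs):
        if tag not in VOID_TAGS:
            self.open_tags.append(tag)

    def handle_endtag(self, tag):
        if tag in VOID_TAGS:
            return
        if self.open_tags and self.open_tags[-1] == tag:
            self.open_tags.pop()
        else:
            self.errors.append(f"unexpected </{tag}>")


def validate_template(template: str) -> List[str]:
    """Return a list of problems; an empty list means the template passed."""
    checker = _BalanceChecker()
    checker.feed(template)
    checker.close()
    errors = list(checker.errors)
    errors.extend(f"unclosed <{tag}>" for tag in checker.open_tags)
    names = MARKER.findall(template)
    if not names:
        errors.append("no extraction markers found")
    for name in sorted({n for n in names if names.count(n) > 1}):
        errors.append(f"duplicate marker name: {name}")
    return errors
```

Returning a list of problems instead of raising on the first one lets the business side fix a template in a single pass.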
Next, after receiving the parameters of the extraction URL request, the extraction system invokes the web crawler system to crawl the page data of the requested URL from the third-party website. Extraction of the field content configured in the template can start only after the page data of the specified URL has been obtained.
The crawler system, after receiving the URL parameter that specifies what to crawl, then crawls the page data from the third-party website over the HTTP protocol.
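The crawl over HTTP can be sketched in a few lines using Python's standard `urllib.request`. This stands in for the crawler system only for illustration: a production crawler would add scheduling, retries, and politeness rules that the text does not describe. The function name `fetch_page`, the timeout default, and the injectable `opener` parameter are assumptions introduced here.

```python
import urllib.request
from urllib.parse import urlsplit


def fetch_page(url: str, timeout: float = 10.0, opener=urllib.request.urlopen) -> str:
    """Fetch the original page content for one URL over HTTP or HTTPS."""
    scheme = urlsplit(url).scheme.lower()
    if scheme not in ("http", "https"):
        raise ValueError(f"unsupported URL scheme: {scheme!r}")
    with opener(url, timeout=timeout) as response:
        raw = response.read()
        # Use the charset declared in the Content-Type header, if any.
        charset = response.headers.get_content_charset() or "utf-8"
    # Undecodable bytes are replaced rather than aborting the crawl.
    return raw.decode(charset, errors="replace")
```

The `opener` parameter exists so that the function can be exercised without network access; by default it is the standard `urlopen`.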
After crawling the original page data, the crawler system returns the page data to the extraction system. The extraction system obtains the original page data returned by the crawler system, then begins parsing the original page and the template protocol that defines the extraction fields, and matches and extracts the content of the fields configured in the template.
Refer to Fig. 2, a schematic diagram of extracting the fields specified by the template in an embodiment of the present invention, in which the left-hand diagram represents the original page and the right-hand diagram represents the extraction template protocol file.
As shown in Fig. 2, the extraction system first needs to restore the page data returned by the crawl into a standard DOM tree (a DOM tree is a Document Object Model tree; in the present invention it denotes the object tree obtained after the HTML is formatted, which makes traversal and access convenient for a computer). Once that is done, the configured template protocol file is likewise restored into a standard DOM tree, which makes element-by-element comparison with the DOM tree of the original page convenient. When both steps have completed successfully, the extraction system recursively traverses the DOM tree generated from the template protocol configuration (the template protocol file must be the matching standard, since only then is broader compatibility possible), comparing it against the DOM tree of the original page in turn. On encountering the special extraction identifier (represented in the present invention by the symbol " "), it saves the content at the same level of the original page, and continues until matching fails or the traversal completes and exits.
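The DOM tree comparison described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the template protocol file is assumed to be an HTML fragment, and because the special extraction identifier is not legible in this text, a hypothetical `{{name}}` marker is used in its place. The names `Node`, `parse_dom`, and `extract_fields` are illustrative. Trees are built with Python's standard `html.parser`, and a template element is taken to match a page element when the tag is the same and every attribute in the template is present with the same value on the page.

```python
import re
from html.parser import HTMLParser
from typing import Dict, List, Optional

MARKER = re.compile(r"\{\{(\w+)\}\}")  # hypothetical extraction identifier
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class Node:
    """One element of a simplified DOM tree."""

    def __init__(self, tag: str, attrs=()):
        self.tag = tag
        self.attrs = dict(attrs)
        self.children: List["Node"] = []
        self.text: List[str] = []  # direct text fragments, whitespace-trimmed


class _TreeBuilder(HTMLParser):
    def __init__(self):
        super().__init__()
        self.root = Node("#document")
        self._stack = [self.root]

    def handle_starttag(self, tag, attrs):
        node = Node(tag, attrs)
        self._stack[-1].children.append(node)
        if tag not in VOID_TAGS:
            self._stack.append(node)

    def handle_endtag(self, tag):
        # Close the nearest open element with this tag; ignore stray end tags.
        for i in range(len(self._stack) - 1, 0, -1):
            if self._stack[i].tag == tag:
                del self._stack[i:]
                break

    def handle_data(self, data):
        if data.strip():
            self._stack[-1].text.append(data.strip())


def parse_dom(markup: str) -> Node:
    builder = _TreeBuilder()
    builder.feed(markup)
    builder.close()
    return builder.root


def _text_of(node: Node) -> str:
    # Direct text first, then descendants' text (interleaving order is not kept).
    parts = node.text + [_text_of(child) for child in node.children]
    return " ".join(p for p in parts if p)


def _same_element(t: Node, p: Node) -> bool:
    return t.tag == p.tag and all(p.attrs.get(k) == v for k, v in t.attrs.items())


def _match(t: Node, p: Node, out: Dict[str, str]) -> bool:
    """Recursively walk the template node t against the page node p."""
    marker = MARKER.fullmatch("".join(t.text))
    if marker:
        out[marker.group(1)] = _text_of(p)  # save the same-level page content
    next_index = 0  # page children are consumed in document order
    for t_child in t.children:
        for i in range(next_index, len(p.children)):
            found: Dict[str, str] = {}
            if _same_element(t_child, p.children[i]) and _match(t_child, p.children[i], found):
                out.update(found)
                next_index = i + 1
                break
        else:
            return False  # no page element matches this template element
    return True


def extract_fields(template: str, page: str) -> Optional[Dict[str, str]]:
    """Return {identifier: content}, or None if the page does not fit the template."""
    fields: Dict[str, str] = {}
    ok = _match(parse_dom(template), parse_dom(page), fields)
    return fields if ok else None
```

Because the traversal is driven by the template tree, page elements that the template does not mention (navigation bars, advertisements, and so on) are simply skipped. Returning `None` on a failed match is a design choice of this sketch; the text only says that traversal exits when matching fails.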
Finally, the extraction system returns the identifiers and content whose extraction has completed to the calling business layer, completing the whole extraction flow.
As the above flow shows, the scheme of this embodiment builds on the webpage-crawling capability of the existing crawler system. At the same time, the flow separates the business module from the development module, and the extraction system is completely transparent to the business layer: the business side only needs to attend to the fields it needs extracted and can set up an extraction task by configuring an extraction template protocol file, after which the business layer can retrieve the extracted fields directly, greatly improving overall efficiency.
Compared with the prior art, the present invention makes full use of the back end's webpage-crawling capability and, by parsing the original webpage and the extraction template, extracts the content of specified tags in the original webpage. The scheme is suitable for extracting specified tag content from webpages of all Web page formats, improving both the capability of extracting from original webpages and the flexibility of webpage content extraction. The present invention can meet business extraction requirements quickly and conveniently; moreover, because of its great flexibility, the cost of change is very small: only the template protocol configuration file needs to be modified.
As shown in Fig. 6, the second embodiment of the present invention proposes a webpage content extraction method, including:
Step S201: the extraction system receives the webpage extraction URL request sent by the business layer, the request carrying a template protocol file, configured by the business layer, for the webpage fields to be extracted;
Step S202: according to the webpage extraction URL request, the web crawler system is invoked to crawl the original page content specified by the URL;
Step S203: using the template protocol file as the matching standard, the original page content is extracted from, and the extracted content is returned to the business layer.
Correspondingly, embodiments of the webpage content extraction device of the present invention are proposed.
As shown in Fig. 7, a preferred embodiment of the present invention proposes a webpage content extraction device, including a receiving module 301, a calling module 302, and an extraction module 303, wherein:
The receiving module 301 is configured to receive the webpage extraction URL request sent by the business layer, the request carrying a template protocol file, configured by the business layer, for the webpage fields to be extracted;
The calling module 302 is configured to invoke, according to the webpage extraction URL request, the web crawler system to crawl the original page content specified by the URL;
The extraction module 303 is configured to extract from the original page content, using the template protocol file as the matching standard, and to return the extracted content to the business layer.
Further, the extraction module 303 is also configured to: restore the original page content and the template protocol file into documents in DOM tree format, obtaining the DOM tree of the original page content and the DOM tree of the template protocol file; with the DOM tree of the template protocol file as the matching standard, traverse the DOM tree of the template protocol file, comparing it in turn against the DOM tree of the original page content, and extract the specified field identifiers; obtain and save the content in the original page content that corresponds to the specified field identifiers; and, after the traversal completes, obtain the extracted specified webpage content and return the extracted content to the business layer.
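The division into modules 301, 302, and 303 can be pictured as three small classes wired together by the device. The sketch below is only one possible arrangement: the class names, the dictionary-shaped request with `url` and `template` keys, and the injected `crawler` and `matcher` callables are assumptions for illustration, not details taken from the text.

```python
from typing import Callable, Dict, Mapping, Tuple

Crawler = Callable[[str], str]                  # URL -> original page content
Matcher = Callable[[str, str], Dict[str, str]]  # (template, page) -> extracted fields


class ReceivingModule:
    """Corresponds to module 301: accepts the request from the business layer."""

    def receive(self, request: Mapping[str, str]) -> Tuple[str, str]:
        url = request.get("url", "")
        template = request.get("template", "")
        if not url or not template:
            raise ValueError("request must carry both a URL and a template protocol file")
        return url, template


class CallingModule:
    """Corresponds to module 302: invokes the crawler system."""

    def __init__(self, crawler: Crawler):
        self._crawler = crawler

    def crawl(self, url: str) -> str:
        return self._crawler(url)


class ExtractionModule:
    """Corresponds to module 303: matches the page against the template."""

    def __init__(self, matcher: Matcher):
        self._matcher = matcher

    def extract(self, template: str, page: str) -> Dict[str, str]:
        return self._matcher(template, page)


class ExtractionDevice:
    """Chains the three modules in the order receive, crawl, extract."""

    def __init__(self, crawler: Crawler, matcher: Matcher):
        self.receiving = ReceivingModule()
        self.calling = CallingModule(crawler)
        self.extraction = ExtractionModule(matcher)

    def handle(self, request: Mapping[str, str]) -> Dict[str, str]:
        url, template = self.receiving.receive(request)
        page = self.calling.crawl(url)
        return self.extraction.extract(template, page)
```

Injecting the crawler and the matcher keeps the device independent of any particular crawler system, which is in the spirit of the separation between the business layer and the extraction system described in the embodiments.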
Correspondingly, embodiments of the webpage content extraction system of the present invention are proposed.
As shown in Fig. 1, the webpage content extraction system can include a business layer, a crawler system, and an extraction system, wherein:
The business layer is configured to send a webpage extraction URL request to the extraction system, the request carrying a template protocol file, configured by the business layer, for the webpage fields to be extracted;
The extraction system can include the device described in the above embodiment;
The crawler system is configured to crawl, according to the call instruction of the extraction system, the original page content specified by the URL from the third-party website and return it to the extraction system.
The business layer is further configured to determine the page fields to be extracted, generate the template protocol file for the webpage extraction fields, and verify the correctness of the template protocol file.
For the principle by which the webpage content extraction system of the embodiments of the present invention extracts webpage content, refer to the embodiments above; details are not repeated here.
It should also be noted that, in this document, the terms "include" and "comprise", and any variants of them, are intended to cover non-exclusive inclusion, so that a process, method, article, or device that includes a series of elements includes not only those elements but also other elements not expressly listed, or elements inherent to the process, method, article, or device. Without further limitation, an element qualified by the phrase "including a ..." does not exclude the presence of additional identical elements in the process, method, article, or device that includes that element.
The serial numbers of the above embodiments of the present invention are for description only and do not indicate the relative merits of the embodiments.
From the description of the embodiments above, those skilled in the art can clearly understand that the methods of the above embodiments can be implemented by software plus the necessary general-purpose hardware platform, or of course by hardware, although in many cases the former is the better implementation. Based on this understanding, the technical solution of the present invention in essence, or the part of it that contributes over the prior art, can be embodied in the form of a software product. The computer software product is stored in a storage medium (such as ROM/RAM, a magnetic disk, or an optical disc) and includes instructions that cause a terminal device (which can be a mobile phone, a computer, a server, a network device, and so on) to perform the methods described in the embodiments of the present invention.
The foregoing describes only preferred embodiments of the present invention and does not thereby limit the patent scope of the present invention. Any equivalent structure or equivalent flow transformation made using the content of the description and drawings of the present invention, or any direct or indirect application in other related technical fields, is likewise included within the scope of patent protection of the present invention.

Claims (8)

1. A webpage content extraction method, characterized by comprising:
an extraction system receiving a webpage extraction URL request sent by a business layer, the webpage extraction URL request carrying a template protocol file, configured by the business layer, for webpage extraction fields;
according to the webpage extraction URL request, invoking a web crawler system to crawl the original page content specified by the URL;
using the template protocol file as the matching standard, extracting from the original page content, and returning the extracted content to the business layer.
2. The method according to claim 1, characterized in that the step of using the template protocol file as the matching standard, extracting from the original page content, and returning the extracted content to the business layer comprises:
the extraction system restoring the original page content and the template protocol file into documents in DOM tree format, obtaining the DOM tree of the original page content and the DOM tree of the template protocol file;
with the DOM tree of the template protocol file as the matching standard, traversing the DOM tree of the template protocol file, comparing it in turn against the DOM tree of the original page content, and extracting specified field identifiers;
obtaining and saving the content in the original page content that corresponds to the specified field identifiers;
after the traversal completes, obtaining the extracted specified webpage content, and returning the extracted content to the business layer.
3. A webpage content extraction device, characterized by comprising:
a receiving module, configured to receive a webpage extraction URL request sent by a business layer, the webpage extraction URL request carrying a template protocol file, configured by the business layer, for webpage extraction fields;
a calling module, configured to invoke, according to the webpage extraction URL request, a web crawler system to crawl the original page content specified by the URL;
an extraction module, configured to extract from the original page content, using the template protocol file as the matching standard, and to return the extracted content to the business layer.
4. The device according to claim 3, characterized in that
the extraction module is further configured to: restore the original page content and the template protocol file into documents in DOM tree format, obtaining the DOM tree of the original page content and the DOM tree of the template protocol file; with the DOM tree of the template protocol file as the matching standard, traverse the DOM tree of the template protocol file, comparing it in turn against the DOM tree of the original page content, and extract specified field identifiers; obtain and save the content in the original page content that corresponds to the specified field identifiers; and, after the traversal completes, obtain the extracted specified webpage content and return the extracted content to the business layer.
5. A webpage content extraction system, characterized by comprising a business layer, a crawler system, and an extraction system, wherein:
the business layer is configured to send a webpage extraction URL request to the extraction system, the webpage extraction URL request carrying a template protocol file, configured by the business layer, for webpage extraction fields;
the extraction system is configured to: receive the webpage extraction URL request sent by the business layer, the webpage extraction URL request carrying the template protocol file, configured by the business layer, for the webpage extraction fields; according to the webpage extraction URL request, invoke the web crawler system to crawl the original page content specified by the URL; and, using the template protocol file as the matching standard, extract from the original page content and return the extracted content to the business layer;
the crawler system is configured to crawl, according to the call instruction of the extraction system, the original page content specified by the URL from a third-party website and return it to the extraction system.
6. The system according to claim 5, characterized in that
the extraction system is further configured to: restore the original page content and the template protocol file into documents in DOM tree format, obtaining the DOM tree of the original page content and the DOM tree of the template protocol file; with the DOM tree of the template protocol file as the matching standard, traverse the DOM tree of the template protocol file, comparing it in turn against the DOM tree of the original page content, and extract specified field identifiers; obtain and save the content in the original page content that corresponds to the specified field identifiers; and, after the traversal completes, obtain the extracted specified webpage content and return the extracted content to the business layer.
7. The system according to claim 5, characterized in that
the business layer is further configured to, before sending the webpage extraction URL request to the extraction system, determine the page fields to be extracted and generate the template protocol file for the webpage extraction fields.
8. The system according to claim 7, characterized in that
the business layer is further configured to verify the correctness of the template protocol file.
CN201510124714.8A 2015-03-20 2015-03-20 Webpage content extracting method, device and system Pending CN106033468A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510124714.8A CN106033468A (en) 2015-03-20 2015-03-20 Webpage content extracting method, device and system

Publications (1)

Publication Number Publication Date
CN106033468A true CN106033468A (en) 2016-10-19

Family

ID=57148894

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510124714.8A Pending CN106033468A (en) 2015-03-20 2015-03-20 Webpage content extracting method, device and system

Country Status (1)

Country Link
CN (1) CN106033468A (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101192234A (en) * 2007-06-07 2008-06-04 腾讯科技(深圳)有限公司 Searching system and method based on web page extraction
CN103345532A (en) * 2013-07-26 2013-10-09 人民搜索网络股份公司 Method and device for extracting webpage information

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106534145A (en) * 2016-11-28 2017-03-22 北京天行网安信息技术有限责任公司 Application identification method and equipment
CN106534146A (en) * 2016-11-28 2017-03-22 北京天行网安信息技术有限责任公司 Safety monitoring system and method
CN106534146B (en) * 2016-11-28 2019-11-15 拓尔思天行网安信息技术有限责任公司 A kind of safety monitoring system and method
CN106534145B (en) * 2016-11-28 2019-11-15 拓尔思天行网安信息技术有限责任公司 A kind of application and identification method and equipment
CN107861974A (en) * 2017-09-19 2018-03-30 北京金堤科技有限公司 A kind of adaptive network crawler system and its data capture method
CN107861974B (en) * 2017-09-19 2018-12-25 北京金堤科技有限公司 A kind of adaptive network crawler system and its data capture method
CN108961045A (en) * 2018-07-09 2018-12-07 中国建设银行股份有限公司 A kind of method, system, audit terminal and the transaction terminal of page reduction
CN109871505A (en) * 2019-01-28 2019-06-11 维沃移动通信有限公司 Obtain method, Cloud Server and the terminal device of web page contents
CN113391804A (en) * 2021-06-15 2021-09-14 北京达佳互联信息技术有限公司 Page generation method and device, electronic equipment and storage medium
CN114528811A (en) * 2022-01-21 2022-05-24 北京麦克斯泰科技有限公司 Article content extraction method, device, equipment and storage medium
CN114528811B (en) * 2022-01-21 2022-09-02 北京麦克斯泰科技有限公司 Article content extraction method, device, equipment and storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20161019