CN108334634A - A kind of method, apparatus, equipment and the storage medium of extraction data information - Google Patents

A kind of method, apparatus, equipment and the storage medium of extraction data information Download PDF

Info

Publication number
CN108334634A
CN108334634A CN201810163956.1A CN201810163956A CN108334634A CN 108334634 A CN108334634 A CN 108334634A CN 201810163956 A CN201810163956 A CN 201810163956A CN 108334634 A CN108334634 A CN 108334634A
Authority
CN
China
Prior art keywords
extraction
data
information
template
requirement
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810163956.1A
Other languages
Chinese (zh)
Inventor
陈晓泰
朱国壮
王能
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Zhongguancun Kejin Technology Co Ltd
Original Assignee
Beijing Zhongguancun Kejin Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Zhongguancun Kejin Technology Co Ltd filed Critical Beijing Zhongguancun Kejin Technology Co Ltd
Priority to CN201810163956.1A priority Critical patent/CN108334634A/en
Publication of CN108334634A publication Critical patent/CN108334634A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2455Query execution
    • G06F16/24552Database cache management
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/242Query formulation
    • G06F16/2433Query languages
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2455Query execution
    • G06F16/24564Applying rules; Deductive queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/955Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

This application discloses a kind of methods of extraction data information, including obtain the extraction requirement of data;Extraction is parsed further according to pre-set extraction logic to require, and obtains extraction template;Then pending data is obtained, and utilizes the information of extraction template extraction pending data.As it can be seen that when extraction requires change, this method can be parsed the extraction template after being changed again according to extraction logic, then obtained pending data and utilized the information of the extraction template extraction pending data after change.It requires the cumbersome of corresponding code so as to avoid extraction is write again and writes risk caused by code, improve the efficiency of extraction data information, while improving the stability of system.Disclosed herein as well is a kind of device, equipment and the computer readable storage mediums of extraction data information, all have above-mentioned advantageous effect.

Description

A kind of method, apparatus, equipment and the storage medium of extraction data information
Technical field
The present invention relates to data processing field, more particularly to a kind of method, apparatus, equipment and the calculating of extraction data information Machine readable storage medium storing program for executing.
Background technology
With the fast development of Internet technology, various data are constantly precipitated and are accumulated, but in these numbers In, it is understood that there may be valuable information.Therefore, it is necessary to carry out secondary use to these data, valuable information is extracted.
In the prior art, the generally pre-defined extraction of mode for extracting the valuable information in mass data requires, Then it is required to develop corresponding code according to extraction, the information of corresponding code extraction pending data is recycled, to obtain Extraction requires corresponding information in pending data.But the iteratively faster in face of internet and various demands complicated and changeable, Extraction requires usually change, and corresponding after extraction requires to change, the mode of the extraction data information of the prior art must It according to the extraction after change must require that code is written over or is changed, then retest, reach the standard grade, then could recycle change The information of code extraction pending data afterwards.So, a large amount of manpower will be expended, and writes, change the mistake of code Journey is not only cumbersome, can also bring risk, therefore the stability of system will also be affected.
Therefore, how to improve the efficiency of extraction data information is that those skilled in the art need the technology solved to ask at present Topic.
Invention content
In view of this, the purpose of the present invention is to provide a kind of method of extraction data information, extraction data can be improved The efficiency of information;It is a further object of the present invention to provide a kind of device, equipment and the computer-readable storages of extraction data information Medium all has above-mentioned advantageous effect.
In order to solve the above technical problems, the present invention provides a kind of method of extraction data information, including:
Obtain the extraction requirement of data;
The extraction requirement is parsed according to pre-set extraction logic, obtains extraction template;
Pending data is obtained, and utilizes the information of pending data described in the extraction template extraction.
Preferably, described to parse the extraction requirement according to pre-set extraction logic, obtaining extraction template is specially:
The extraction is parsed according to pre-set extraction logic to require, obtain pair using SQL engines and/or http engines The extraction template answered.
Preferably, the information using pending data described in the extraction template extraction is specially:
The information of pending data described in the extraction template extraction is utilized by the way of multithreading.
Preferably, further comprise after the extraction for obtaining data requires:
The extraction of acquisition is required to cache.
Preferably, further comprise after the extraction by acquisition requires to be cached:
When receiving new extraction and requiring, judge whether the new extraction is required identical as the extraction of caching requirement;
If it is not, then requiring the extraction requirement of the update caching using the new extraction.
Preferably, further comprise:
Prompt inputs the extraction requirement.
In order to solve the above technical problems, the present invention also provides a kind of devices of extraction data information, including:
Acquisition module, the extraction requirement for obtaining data;
Parsing module obtains extraction template for parsing the extraction requirement according to pre-set extraction logic;
Extraction module for obtaining pending data, and utilizes the letter of pending data described in the extraction template extraction Breath.
In order to solve the above technical problems, the present invention also provides a kind of equipment of extraction data information, including:
Memory, for storing computer program;
Processor realizes the step of the method for any of the above-described kind of extraction data information when for executing the computer program Suddenly.
In order to solve the above technical problems, the present invention also provides a kind of computer readable storage medium, it is described computer-readable Computer program is stored on storage medium, the computer program realizes any of the above-described kind of extraction data when being executed by processor The step of method of information.
The method of extraction data information provided by the invention, including obtain the extraction requirement of data;Further according to pre-setting Extraction logic parsing extraction require, obtain extraction template;Then pending data is obtained, and waits locating using extraction template extraction Manage the information of data.As it can be seen that when extraction requires change, this method can again be parsed according to extraction logic and be changed Then extraction template afterwards obtains pending data and utilizes the information of the extraction template extraction pending data after change. It is required that is parsing extraction by pre-set extraction logic, it, can be according to setting in advance when extraction requires to change The extraction logic set dynamically requires extraction to be parsed and obtained extraction template, is required so as to avoid extraction is write again Corresponding code cumbersome and risk caused by code is write, improves the efficiency of extraction data information, improve simultaneously The stability of system.
In order to solve the above technical problems, the present invention also provides a kind of device, equipment and the computers of extraction data information Readable storage medium storing program for executing all has above-mentioned advantageous effect.
Description of the drawings
It in order to illustrate the embodiments of the present invention more clearly or the technical solution of the prior art, below will be to embodiment or existing Attached drawing is briefly described needed in technology description, it should be apparent that, the accompanying drawings in the following description is only this hair Some bright embodiments for those of ordinary skill in the art without creative efforts, can be with root Other attached drawings are obtained according to the attached drawing of offer.
Fig. 1 is a kind of flow chart of the method for extraction data information provided in an embodiment of the present invention;
Fig. 2 is the flow chart of another method for extracting data information provided in an embodiment of the present invention;
Fig. 3 is a kind of structure chart of the device of extraction data information provided in an embodiment of the present invention;
Fig. 4 is a kind of structure chart of the equipment of extraction data information provided in an embodiment of the present invention.
Specific implementation mode
Following will be combined with the drawings in the embodiments of the present invention, and technical solution in the embodiment of the present invention carries out clear, complete Site preparation describes, it is clear that described embodiments are only a part of the embodiments of the present invention, instead of all the embodiments.It is based on Embodiment in the present invention, it is obtained by those of ordinary skill in the art without making creative efforts every other Embodiment shall fall within the protection scope of the present invention.
The core of the embodiment of the present invention is to provide a kind of method of extraction data information, can improve extraction data information Efficiency;Another core of the present invention is to provide a kind of device, equipment and the computer readable storage medium of extraction data information, With above-mentioned advantageous effect.
It is right with reference to the accompanying drawings and detailed description in order to make those skilled in the art more fully understand the present invention program The present invention is described in further detail.
Fig. 1 is a kind of flow chart of the method for extraction data information provided in an embodiment of the present invention, as shown in Figure 1, extraction The method of data information specifically includes:
S10:Obtain the extraction requirement of data.
It is understood that with the fast development of Internet technology, various data are constantly accumulated and are precipitated. But it in these data may include some useful data, it is therefore desirable to handle data, such as extract in data Valuable information so that these data are secondary use.By obtaining the extraction requirement of data, need to extract number to learn According to which information.Therefore, it is necessary to according to actual demand, the extraction requirement of data is arranged.Extraction requires generally to pass through operator Input, there are many modes of input, such as keyboard input, handwriting input or voice input etc., and the extraction to obtain data is wanted It asks.It should be noted that the concrete mode that the present embodiment requires the extraction for obtaining data does not limit.
S20:Extraction is parsed according to pre-set extraction logic to require, and obtains extraction template.
It is understood that the extraction of operator's input requires to be generally the information such as word or picture, in order to logarithm According to information is extracted, the extraction requirement by word or graphic form is needed to parse, obtains extraction template.Extraction logic is It is carried for the extraction template for obtaining different with the resolution rules used when obtaining extraction template using that will extract to require to parse Logic is taken generally to be different.The analytics engine used when in addition, being parsed to extraction requirement is different, and extraction logic also has Institute is different.
S30:Pending data is obtained, and utilizes the information of extraction template extraction pending data.
After obtaining extraction template, to the information using extraction template extraction pending data, then also need first to obtain Pending data needs the data for carrying out information extraction.Specifically, the concrete mode for obtaining pending data can be from number Can also be to be obtained from webpage, the present embodiment does not limit the concrete mode for obtaining pending data according to being obtained in library.Specifically , pending data is carried into row information using the information of extraction template extraction pending data, that is, using extraction template It takes, corresponding information is required to obtain extraction in pending data.
As preferred embodiment, when executing extraction template, if it is SQL rules, then according to extraction template generation The sentence of executable SQL sends sql command to corresponding database, and obtains implementing result, that is to say, that pending data It is stored in database,
The method of extraction data information provided in this embodiment, including obtain the extraction requirement of data;Further according to setting in advance The extraction logic parsing extraction set requires, and obtains extraction template;Then pending data is obtained, and is waited for using extraction template extraction Handle the information of data.As it can be seen that when extraction requires change, this method can again be parsed according to extraction logic to be become Then extraction template after more obtains pending data and utilizes the information of the extraction template extraction pending data after change. It is required that is, parsing extraction by pre-set extraction logic, it, can be according to advance when extraction requires to change The extraction logic of setting dynamically requires extraction to be parsed and obtained extraction template, is wanted so as to avoid extraction is write again It asks the cumbersome of corresponding code and writes risk caused by code, improve the efficiency of extraction data information, improve simultaneously The stability of system.
On the basis of the above embodiments, the present embodiment has made further instruction and optimization to technical solution, specifically, Extraction is parsed according to pre-set extraction logic to require, obtaining extraction template is specially:
Extraction is parsed according to pre-set extraction logic to require, obtain corresponding using SQL engines and/or http engines Extract template.
It requires, obtains it should be noted that SQL engines can be utilized to parse extraction according to pre-set extraction logic The extraction template of SQL engines;It can also be to parse extraction according to pre-set extraction logic using http engines to require, obtain The extraction template of http engines;Can certainly utilize SQL engines and http engines to obtain corresponding extraction mould respectively simultaneously Plate.The present embodiment does not limit this.
It should be noted that SQL engines, that is, database engine SQL, for converting to database the extraction template of SQL to Operation, such as input SQL extraction template, then SQL engines just convert the extraction template of SQL to the behaviour to database Make, is specially the operation that the data in database are carried out with information extraction in the present embodiment;Http engines are used for http's Extraction template is converted into the operation to browsing device net page, such as the extraction template of input http, http engines are carried using http's Modulus plate operates browsing device net page, is specially to carry out information extraction to the data in browsing device net page in the present embodiment Operation.
More specifically, SQL engines recycle executable first by the extraction form assembly of SQL at efficient executable module Module extracts the information of data in database;The extraction template assembly of http is gone out effective url (unified resources by http engines in advance Finger URL), recycle the information of data in url extraction browsing device net pages.
In order to improve working efficiency, as preferred embodiment, it can be arranged and grasp SQL engines and http engines parallel Make, multiple SQL engines and multiple http engines can also be distinguished concurrent working, the present embodiment does not limit this.
On the basis of the above embodiments, the present embodiment has made further instruction and optimization to technical solution, specifically, Using extraction template extraction pending data information be specially:
The information of extraction template extraction pending data is utilized by the way of multithreading.
It is understood that using the information of extraction template processing pending data by the way of multithreading, specifically, The information for pre-setting multithreading extraction pending data, after obtaining multiple pending datas, by pending data according to Pre-set rule carries out shunting processing, and multiple pending datas is made to be respectively processed on different threads.
As it can be seen that using the information of extraction template extraction pending data by way of multithreading, that is, identical The information of more pending datas can be extracted in time, to improve the speed of extraction data information, raising carries Take the efficiency of data information.
On the basis of the above embodiments, the present embodiment has made further instruction and optimization to technical solution, specifically, Further comprise:
Prompt input extraction requires.
It is understood that when needing to extract the operation of data information, prompts operator to input extraction and require.It should Prompt can be text information, pictorial information, can also be acoustic information, the present embodiment does not do the specific implementation form of prompt It limits.By prompt input extraction require, can person easy to operation operate, improve user experience.
Fig. 2 is the flow chart of another method for extracting data information provided in an embodiment of the present invention, referring to FIG. 2, On the basis of above-described embodiment, the present embodiment has made further instruction and optimization to technical solution, specifically, obtaining data Extraction require after further comprise:
S40:The extraction of acquisition is required to cache.
Specifically, after getting the extraction requirement of data, the extraction of acquisition is required to cache, it is every to avoid It is secondary all to go to obtain extraction requirement, so that next time can use identical extraction to require extract information to data, carried in next time When taking, if extraction require it is identical, then acquisition extraction the step of requiring, can be reduced, to make the process of extraction data information It is more convenient.It should be noted that the present embodiment requires the specific location cached not limit extraction, correspondingly, root It is also differed according to the caching of different cache locations, progress, therefore, the tool that the present embodiment is cached to that will extract information Body mode of operation does not also limit.
Further comprise after requiring to be cached by the extraction of acquisition:
S50:When receiving new extraction and requiring, judge whether new extraction is required identical as the extraction requirement that caches;
S60:If it is not, then requiring the extraction requirement of update caching using new extraction.
It is understood that when extraction requires to change, the extraction requirement to caching is needed also to be updated.Specifically , when receiving new extraction and requiring, judge whether new extraction is required identical as the extraction requirement that caches.Concrete implementation Mode is to require to require to be compared with the extraction of caching using new extraction, judges that extraction requires whether to change with this. If it is not, i.e. new extraction requirement is different from the extraction requirement of caching, then the extraction requirement to caching is needed to be updated, specifically , the extraction requirement of update caching is required using new extraction.
As it can be seen that by when extraction requires to change, the extraction requirement of update caching is required using new extraction so that The extraction of caching requires unanimously to keep newest state.
It is described in detail above for a kind of embodiment of method of extraction data information provided by the invention, this Invention additionally provides a kind of device, equipment and the computer readable storage medium of extraction data information corresponding with this method, by Mutually correlate in the embodiment of device, equipment and computer readable storage medium part and the embodiment of method part, therefore fills Set, the embodiment of equipment and computer readable storage medium part refer to method part embodiment description, here wouldn't It repeats.
Fig. 3 is a kind of structure chart of the device of extraction data information provided in an embodiment of the present invention, as shown in figure 3, extraction The device of data information includes:
Acquisition module 31, the extraction requirement for obtaining data;
Parsing module 32 is required for parsing extraction according to pre-set extraction logic, obtains extraction template;
Extraction module 33 for obtaining pending data, and utilizes the information of extraction template extraction pending data.
The device of extraction data information provided in this embodiment, the beneficial effect of the method with said extracted data information Fruit.
Fig. 4 is a kind of structure chart of the equipment of extraction data information provided in an embodiment of the present invention, as shown in figure 4, extraction The equipment of data information specifically includes:
Memory 41, for storing computer program;
Processor 42, realizes following steps when for executing computer program:
Obtain the extraction requirement of data;
Extraction is parsed according to pre-set extraction logic to require, and obtains extraction template;
Pending data is obtained, and utilizes the information of extraction template extraction pending data.
The equipment of extraction data information provided in this embodiment, the beneficial effect of the method with said extracted data information Fruit.
In order to solve the above technical problems, the present invention also provides a kind of computer readable storage medium, computer-readable storage It is stored with computer program on medium, lower step is realized when computer program is executed by processor:
Obtain the extraction requirement of data;
Extraction is parsed according to pre-set extraction logic to require, and obtains extraction template;
Pending data is obtained, and utilizes the information of extraction template extraction pending data.
Computer readable storage medium provided in this embodiment, the beneficial effect of the method with said extracted data information Fruit.
Above to method, apparatus, equipment and the computer readable storage medium of extraction data information provided by the present invention It is described in detail.Principle and implementation of the present invention are described for specific embodiment used herein, above The explanation of embodiment is merely used to help understand the method and its core concept of the present invention.It should be pointed out that for the art Those of ordinary skill for, without departing from the principle of the present invention, can also to the present invention carry out it is several improvement and repair Decorations, these improvement and modification are also fallen within the protection scope of the claims of the present invention.
Each embodiment is described by the way of progressive in specification, the highlights of each of the examples are with other realities Apply the difference of example, just to refer each other for identical similar portion between each embodiment.For device disclosed in embodiment Speech, since it is corresponded to the methods disclosed in the examples, so description is fairly simple, related place is referring to method part illustration .
Professional further appreciates that, unit described in conjunction with the examples disclosed in the embodiments of the present disclosure And algorithm steps, can be realized with electronic hardware, computer software, or a combination of the two, in order to clearly demonstrate hardware and The interchangeability of software generally describes each exemplary composition and step according to function in the above description.These Function is implemented in hardware or software actually, depends on the specific application and design constraint of technical solution.Profession Technical staff can use different methods to achieve the described function each specific application, but this realization is not answered Think beyond the scope of this invention.
The step of method described in conjunction with the examples disclosed in this document or algorithm, can directly be held with hardware, processor The combination of capable software module or the two is implemented.Software module can be placed in random access memory (RAM), memory, read-only deposit Reservoir (ROM), electrically programmable ROM, electrically erasable ROM, register, hard disk, moveable magnetic disc, CD-ROM or technology In any other form of storage medium well known in field.

Claims (9)

1. a kind of method of extraction data information, which is characterized in that including:
Obtain the extraction requirement of data;
The extraction requirement is parsed according to pre-set extraction logic, obtains extraction template;
Pending data is obtained, and utilizes the information of pending data described in the extraction template extraction.
2. according to the method described in claim 1, it is characterized in that, described carry according to described in the parsing of pre-set extraction logic Requirement is taken, obtaining extraction template is specially:
The extraction requirement is parsed according to pre-set extraction logic using SQL engines and/or http engines, is obtained corresponding Extract template.
3. method according to claim 1 or 2, which is characterized in that described utilize waits locating described in the extraction template extraction Reason data information be specially:
The information of pending data described in the extraction template extraction is utilized by the way of multithreading.
4. according to the method described in claim 1, it is characterized in that, further being wrapped after the extraction for obtaining data requires It includes:
The extraction of acquisition is required to cache.
5. according to the method described in claim 4, it is characterized in that, requiring to carry out caching it in the extraction by acquisition After further comprise:
When receiving new extraction and requiring, judge whether the new extraction is required identical as the extraction of caching requirement;
If it is not, then requiring the extraction requirement of the update caching using the new extraction.
6. according to the method described in claim 1, it is characterized in that, further comprising:
Prompt inputs the extraction requirement.
7. a kind of device of extraction data information, which is characterized in that including:
Acquisition module, the extraction requirement for obtaining data;
Parsing module obtains extraction template for parsing the extraction requirement according to pre-set extraction logic;
Extraction module for obtaining pending data, and utilizes the information of pending data described in the extraction template extraction.
8. a kind of equipment of extraction data information, which is characterized in that including:
Memory, for storing computer program;
Processor realizes that claim 1 to 6 any one of them such as extracts data information when for executing the computer program Method the step of.
9. a kind of computer readable storage medium, which is characterized in that be stored with computer on the computer readable storage medium Program is realized when the computer program is executed by processor as claim 1 to 6 any one of them extracts data information The step of method.
CN201810163956.1A 2018-02-27 2018-02-27 A kind of method, apparatus, equipment and the storage medium of extraction data information Pending CN108334634A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810163956.1A CN108334634A (en) 2018-02-27 2018-02-27 A kind of method, apparatus, equipment and the storage medium of extraction data information

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810163956.1A CN108334634A (en) 2018-02-27 2018-02-27 A kind of method, apparatus, equipment and the storage medium of extraction data information

Publications (1)

Publication Number Publication Date
CN108334634A true CN108334634A (en) 2018-07-27

Family

ID=62929894

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810163956.1A Pending CN108334634A (en) 2018-02-27 2018-02-27 A kind of method, apparatus, equipment and the storage medium of extraction data information

Country Status (1)

Country Link
CN (1) CN108334634A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111125589A (en) * 2018-10-31 2020-05-08 北大方正集团有限公司 Data acquisition method and device and computer readable storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102360368A (en) * 2011-10-09 2012-02-22 山东大学 Web data extraction method based on visual customization of extraction template
WO2013097597A1 (en) * 2011-12-30 2013-07-04 华为技术有限公司 Content filtering method, device and system
CN104516979A (en) * 2014-12-31 2015-04-15 北京锐安科技有限公司 Data query method and data query system based on quadratic search
CN104573040A (en) * 2015-01-19 2015-04-29 百度在线网络技术(北京)有限公司 Method and system for capturing webpage data
CN106886535A (en) * 2015-12-16 2017-06-23 大唐软件技术股份有限公司 A kind of data pick-up method and apparatus for being adapted to multiple data sources

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102360368A (en) * 2011-10-09 2012-02-22 山东大学 Web data extraction method based on visual customization of extraction template
WO2013097597A1 (en) * 2011-12-30 2013-07-04 华为技术有限公司 Content filtering method, device and system
CN104516979A (en) * 2014-12-31 2015-04-15 北京锐安科技有限公司 Data query method and data query system based on quadratic search
CN104573040A (en) * 2015-01-19 2015-04-29 百度在线网络技术(北京)有限公司 Method and system for capturing webpage data
CN106886535A (en) * 2015-12-16 2017-06-23 大唐软件技术股份有限公司 A kind of data pick-up method and apparatus for being adapted to multiple data sources

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111125589A (en) * 2018-10-31 2020-05-08 北大方正集团有限公司 Data acquisition method and device and computer readable storage medium
CN111125589B (en) * 2018-10-31 2023-09-05 新方正控股发展有限责任公司 Data acquisition method and device and computer readable storage medium

Similar Documents

Publication Publication Date Title
EP3688672B1 (en) Attention-based decoder-only sequence transduction neural networks
CN105550206B (en) The edition control method and device of structured query sentence
US10162812B2 (en) Natural language processing system to analyze mobile application feedback
CN107766532A (en) A kind of front end Node.js automates positive static method
US9754083B2 (en) Automatic creation of clinical study reports
CN110941427B (en) Code generation method and code generator
EP3828751A2 (en) Method, apparatus and electronic device for annotating information of structured document
CN109669632A (en) Metadata wiring method, device and medium based on distributed objects storage system
US9990353B2 (en) Natural language interface for software customization
CN109241037A (en) A kind of method, apparatus that realizing database processing, equipment and storage medium
CN113468204A (en) Data query method, device, equipment and medium
US11688393B2 (en) Machine learning to propose actions in response to natural language questions
CN108334634A (en) A kind of method, apparatus, equipment and the storage medium of extraction data information
KR20220002814A (en) Method, electronic device, and storage medium for processing visual data of deep model
CN110209780A (en) A kind of question template generation method, device, server and storage medium
DE112018005844T5 (en) Phonetic patterns for fuzzy matching in natural language processing
Sailaja et al. An overview of pre-processing text clustering methods
US9146918B2 (en) Compressing data for natural language processing
CN112966031A (en) Data processing method and device, electronic equipment and computer readable storage medium
CN111368091A (en) Document translation method and device
CN111125280A (en) Map cache processing method, system and equipment based on IndexDB
Buyuktanir et al. Enhancing Accessibility to Data in Data-Intensive Web Applications by Using Intelligent Web Prefetching Methodologies.
CN103208093A (en) Method and device for eliminating over-high load of large picture during rendering of graphic processing unit (GPU)
CN109359021A (en) A kind of detection method of system mode, device, equipment and storage medium
JP5941345B2 (en) Character information analysis method, information analysis apparatus, and program

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20180727

RJ01 Rejection of invention patent application after publication