CN108334634A - A kind of method, apparatus, equipment and the storage medium of extraction data information - Google Patents
A kind of method, apparatus, equipment and the storage medium of extraction data information Download PDFInfo
- Publication number
- CN108334634A CN108334634A CN201810163956.1A CN201810163956A CN108334634A CN 108334634 A CN108334634 A CN 108334634A CN 201810163956 A CN201810163956 A CN 201810163956A CN 108334634 A CN108334634 A CN 108334634A
- Authority
- CN
- China
- Prior art keywords
- extraction
- data
- information
- template
- requirement
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2455—Query execution
- G06F16/24552—Database cache management
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/242—Query formulation
- G06F16/2433—Query languages
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2455—Query execution
- G06F16/24564—Applying rules; Deductive queries
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/955—Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Mathematical Physics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
This application discloses a kind of methods of extraction data information, including obtain the extraction requirement of data;Extraction is parsed further according to pre-set extraction logic to require, and obtains extraction template;Then pending data is obtained, and utilizes the information of extraction template extraction pending data.As it can be seen that when extraction requires change, this method can be parsed the extraction template after being changed again according to extraction logic, then obtained pending data and utilized the information of the extraction template extraction pending data after change.It requires the cumbersome of corresponding code so as to avoid extraction is write again and writes risk caused by code, improve the efficiency of extraction data information, while improving the stability of system.Disclosed herein as well is a kind of device, equipment and the computer readable storage mediums of extraction data information, all have above-mentioned advantageous effect.
Description
Technical field
The present invention relates to data processing field, more particularly to a kind of method, apparatus, equipment and the calculating of extraction data information
Machine readable storage medium storing program for executing.
Background technology
With the fast development of Internet technology, various data are constantly precipitated and are accumulated, but in these numbers
In, it is understood that there may be valuable information.Therefore, it is necessary to carry out secondary use to these data, valuable information is extracted.
In the prior art, the generally pre-defined extraction of mode for extracting the valuable information in mass data requires,
Then it is required to develop corresponding code according to extraction, the information of corresponding code extraction pending data is recycled, to obtain
Extraction requires corresponding information in pending data.But the iteratively faster in face of internet and various demands complicated and changeable,
Extraction requires usually change, and corresponding after extraction requires to change, the mode of the extraction data information of the prior art must
It according to the extraction after change must require that code is written over or is changed, then retest, reach the standard grade, then could recycle change
The information of code extraction pending data afterwards.So, a large amount of manpower will be expended, and writes, change the mistake of code
Journey is not only cumbersome, can also bring risk, therefore the stability of system will also be affected.
Therefore, how to improve the efficiency of extraction data information is that those skilled in the art need the technology solved to ask at present
Topic.
Invention content
In view of this, the purpose of the present invention is to provide a kind of method of extraction data information, extraction data can be improved
The efficiency of information;It is a further object of the present invention to provide a kind of device, equipment and the computer-readable storages of extraction data information
Medium all has above-mentioned advantageous effect.
In order to solve the above technical problems, the present invention provides a kind of method of extraction data information, including:
Obtain the extraction requirement of data;
The extraction requirement is parsed according to pre-set extraction logic, obtains extraction template;
Pending data is obtained, and utilizes the information of pending data described in the extraction template extraction.
Preferably, described to parse the extraction requirement according to pre-set extraction logic, obtaining extraction template is specially:
The extraction is parsed according to pre-set extraction logic to require, obtain pair using SQL engines and/or http engines
The extraction template answered.
Preferably, the information using pending data described in the extraction template extraction is specially:
The information of pending data described in the extraction template extraction is utilized by the way of multithreading.
Preferably, further comprise after the extraction for obtaining data requires:
The extraction of acquisition is required to cache.
Preferably, further comprise after the extraction by acquisition requires to be cached:
When receiving new extraction and requiring, judge whether the new extraction is required identical as the extraction of caching requirement;
If it is not, then requiring the extraction requirement of the update caching using the new extraction.
Preferably, further comprise:
Prompt inputs the extraction requirement.
In order to solve the above technical problems, the present invention also provides a kind of devices of extraction data information, including:
Acquisition module, the extraction requirement for obtaining data;
Parsing module obtains extraction template for parsing the extraction requirement according to pre-set extraction logic;
Extraction module for obtaining pending data, and utilizes the letter of pending data described in the extraction template extraction
Breath.
In order to solve the above technical problems, the present invention also provides a kind of equipment of extraction data information, including:
Memory, for storing computer program;
Processor realizes the step of the method for any of the above-described kind of extraction data information when for executing the computer program
Suddenly.
In order to solve the above technical problems, the present invention also provides a kind of computer readable storage medium, it is described computer-readable
Computer program is stored on storage medium, the computer program realizes any of the above-described kind of extraction data when being executed by processor
The step of method of information.
The method of extraction data information provided by the invention, including obtain the extraction requirement of data;Further according to pre-setting
Extraction logic parsing extraction require, obtain extraction template;Then pending data is obtained, and waits locating using extraction template extraction
Manage the information of data.As it can be seen that when extraction requires change, this method can again be parsed according to extraction logic and be changed
Then extraction template afterwards obtains pending data and utilizes the information of the extraction template extraction pending data after change.
It is required that is parsing extraction by pre-set extraction logic, it, can be according to setting in advance when extraction requires to change
The extraction logic set dynamically requires extraction to be parsed and obtained extraction template, is required so as to avoid extraction is write again
Corresponding code cumbersome and risk caused by code is write, improves the efficiency of extraction data information, improve simultaneously
The stability of system.
In order to solve the above technical problems, the present invention also provides a kind of device, equipment and the computers of extraction data information
Readable storage medium storing program for executing all has above-mentioned advantageous effect.
Description of the drawings
It in order to illustrate the embodiments of the present invention more clearly or the technical solution of the prior art, below will be to embodiment or existing
Attached drawing is briefly described needed in technology description, it should be apparent that, the accompanying drawings in the following description is only this hair
Some bright embodiments for those of ordinary skill in the art without creative efforts, can be with root
Other attached drawings are obtained according to the attached drawing of offer.
Fig. 1 is a kind of flow chart of the method for extraction data information provided in an embodiment of the present invention;
Fig. 2 is the flow chart of another method for extracting data information provided in an embodiment of the present invention;
Fig. 3 is a kind of structure chart of the device of extraction data information provided in an embodiment of the present invention;
Fig. 4 is a kind of structure chart of the equipment of extraction data information provided in an embodiment of the present invention.
Specific implementation mode
Following will be combined with the drawings in the embodiments of the present invention, and technical solution in the embodiment of the present invention carries out clear, complete
Site preparation describes, it is clear that described embodiments are only a part of the embodiments of the present invention, instead of all the embodiments.It is based on
Embodiment in the present invention, it is obtained by those of ordinary skill in the art without making creative efforts every other
Embodiment shall fall within the protection scope of the present invention.
The core of the embodiment of the present invention is to provide a kind of method of extraction data information, can improve extraction data information
Efficiency;Another core of the present invention is to provide a kind of device, equipment and the computer readable storage medium of extraction data information,
With above-mentioned advantageous effect.
It is right with reference to the accompanying drawings and detailed description in order to make those skilled in the art more fully understand the present invention program
The present invention is described in further detail.
Fig. 1 is a kind of flow chart of the method for extraction data information provided in an embodiment of the present invention, as shown in Figure 1, extraction
The method of data information specifically includes:
S10:Obtain the extraction requirement of data.
It is understood that with the fast development of Internet technology, various data are constantly accumulated and are precipitated.
But it in these data may include some useful data, it is therefore desirable to handle data, such as extract in data
Valuable information so that these data are secondary use.By obtaining the extraction requirement of data, need to extract number to learn
According to which information.Therefore, it is necessary to according to actual demand, the extraction requirement of data is arranged.Extraction requires generally to pass through operator
Input, there are many modes of input, such as keyboard input, handwriting input or voice input etc., and the extraction to obtain data is wanted
It asks.It should be noted that the concrete mode that the present embodiment requires the extraction for obtaining data does not limit.
S20:Extraction is parsed according to pre-set extraction logic to require, and obtains extraction template.
It is understood that the extraction of operator's input requires to be generally the information such as word or picture, in order to logarithm
According to information is extracted, the extraction requirement by word or graphic form is needed to parse, obtains extraction template.Extraction logic is
It is carried for the extraction template for obtaining different with the resolution rules used when obtaining extraction template using that will extract to require to parse
Logic is taken generally to be different.The analytics engine used when in addition, being parsed to extraction requirement is different, and extraction logic also has
Institute is different.
S30:Pending data is obtained, and utilizes the information of extraction template extraction pending data.
After obtaining extraction template, to the information using extraction template extraction pending data, then also need first to obtain
Pending data needs the data for carrying out information extraction.Specifically, the concrete mode for obtaining pending data can be from number
Can also be to be obtained from webpage, the present embodiment does not limit the concrete mode for obtaining pending data according to being obtained in library.Specifically
, pending data is carried into row information using the information of extraction template extraction pending data, that is, using extraction template
It takes, corresponding information is required to obtain extraction in pending data.
As preferred embodiment, when executing extraction template, if it is SQL rules, then according to extraction template generation
The sentence of executable SQL sends sql command to corresponding database, and obtains implementing result, that is to say, that pending data
It is stored in database,
The method of extraction data information provided in this embodiment, including obtain the extraction requirement of data;Further according to setting in advance
The extraction logic parsing extraction set requires, and obtains extraction template;Then pending data is obtained, and is waited for using extraction template extraction
Handle the information of data.As it can be seen that when extraction requires change, this method can again be parsed according to extraction logic to be become
Then extraction template after more obtains pending data and utilizes the information of the extraction template extraction pending data after change.
It is required that is, parsing extraction by pre-set extraction logic, it, can be according to advance when extraction requires to change
The extraction logic of setting dynamically requires extraction to be parsed and obtained extraction template, is wanted so as to avoid extraction is write again
It asks the cumbersome of corresponding code and writes risk caused by code, improve the efficiency of extraction data information, improve simultaneously
The stability of system.
On the basis of the above embodiments, the present embodiment has made further instruction and optimization to technical solution, specifically,
Extraction is parsed according to pre-set extraction logic to require, obtaining extraction template is specially:
Extraction is parsed according to pre-set extraction logic to require, obtain corresponding using SQL engines and/or http engines
Extract template.
It requires, obtains it should be noted that SQL engines can be utilized to parse extraction according to pre-set extraction logic
The extraction template of SQL engines;It can also be to parse extraction according to pre-set extraction logic using http engines to require, obtain
The extraction template of http engines;Can certainly utilize SQL engines and http engines to obtain corresponding extraction mould respectively simultaneously
Plate.The present embodiment does not limit this.
It should be noted that SQL engines, that is, database engine SQL, for converting to database the extraction template of SQL to
Operation, such as input SQL extraction template, then SQL engines just convert the extraction template of SQL to the behaviour to database
Make, is specially the operation that the data in database are carried out with information extraction in the present embodiment;Http engines are used for http's
Extraction template is converted into the operation to browsing device net page, such as the extraction template of input http, http engines are carried using http's
Modulus plate operates browsing device net page, is specially to carry out information extraction to the data in browsing device net page in the present embodiment
Operation.
More specifically, SQL engines recycle executable first by the extraction form assembly of SQL at efficient executable module
Module extracts the information of data in database;The extraction template assembly of http is gone out effective url (unified resources by http engines in advance
Finger URL), recycle the information of data in url extraction browsing device net pages.
In order to improve working efficiency, as preferred embodiment, it can be arranged and grasp SQL engines and http engines parallel
Make, multiple SQL engines and multiple http engines can also be distinguished concurrent working, the present embodiment does not limit this.
On the basis of the above embodiments, the present embodiment has made further instruction and optimization to technical solution, specifically,
Using extraction template extraction pending data information be specially:
The information of extraction template extraction pending data is utilized by the way of multithreading.
It is understood that using the information of extraction template processing pending data by the way of multithreading, specifically,
The information for pre-setting multithreading extraction pending data, after obtaining multiple pending datas, by pending data according to
Pre-set rule carries out shunting processing, and multiple pending datas is made to be respectively processed on different threads.
As it can be seen that using the information of extraction template extraction pending data by way of multithreading, that is, identical
The information of more pending datas can be extracted in time, to improve the speed of extraction data information, raising carries
Take the efficiency of data information.
On the basis of the above embodiments, the present embodiment has made further instruction and optimization to technical solution, specifically,
Further comprise:
Prompt input extraction requires.
It is understood that when needing to extract the operation of data information, prompts operator to input extraction and require.It should
Prompt can be text information, pictorial information, can also be acoustic information, the present embodiment does not do the specific implementation form of prompt
It limits.By prompt input extraction require, can person easy to operation operate, improve user experience.
Fig. 2 is the flow chart of another method for extracting data information provided in an embodiment of the present invention, referring to FIG. 2,
On the basis of above-described embodiment, the present embodiment has made further instruction and optimization to technical solution, specifically, obtaining data
Extraction require after further comprise:
S40:The extraction of acquisition is required to cache.
Specifically, after getting the extraction requirement of data, the extraction of acquisition is required to cache, it is every to avoid
It is secondary all to go to obtain extraction requirement, so that next time can use identical extraction to require extract information to data, carried in next time
When taking, if extraction require it is identical, then acquisition extraction the step of requiring, can be reduced, to make the process of extraction data information
It is more convenient.It should be noted that the present embodiment requires the specific location cached not limit extraction, correspondingly, root
It is also differed according to the caching of different cache locations, progress, therefore, the tool that the present embodiment is cached to that will extract information
Body mode of operation does not also limit.
Further comprise after requiring to be cached by the extraction of acquisition:
S50:When receiving new extraction and requiring, judge whether new extraction is required identical as the extraction requirement that caches;
S60:If it is not, then requiring the extraction requirement of update caching using new extraction.
It is understood that when extraction requires to change, the extraction requirement to caching is needed also to be updated.Specifically
, when receiving new extraction and requiring, judge whether new extraction is required identical as the extraction requirement that caches.Concrete implementation
Mode is to require to require to be compared with the extraction of caching using new extraction, judges that extraction requires whether to change with this.
If it is not, i.e. new extraction requirement is different from the extraction requirement of caching, then the extraction requirement to caching is needed to be updated, specifically
, the extraction requirement of update caching is required using new extraction.
As it can be seen that by when extraction requires to change, the extraction requirement of update caching is required using new extraction so that
The extraction of caching requires unanimously to keep newest state.
It is described in detail above for a kind of embodiment of method of extraction data information provided by the invention, this
Invention additionally provides a kind of device, equipment and the computer readable storage medium of extraction data information corresponding with this method, by
Mutually correlate in the embodiment of device, equipment and computer readable storage medium part and the embodiment of method part, therefore fills
Set, the embodiment of equipment and computer readable storage medium part refer to method part embodiment description, here wouldn't
It repeats.
Fig. 3 is a kind of structure chart of the device of extraction data information provided in an embodiment of the present invention, as shown in figure 3, extraction
The device of data information includes:
Acquisition module 31, the extraction requirement for obtaining data;
Parsing module 32 is required for parsing extraction according to pre-set extraction logic, obtains extraction template;
Extraction module 33 for obtaining pending data, and utilizes the information of extraction template extraction pending data.
The device of extraction data information provided in this embodiment, the beneficial effect of the method with said extracted data information
Fruit.
Fig. 4 is a kind of structure chart of the equipment of extraction data information provided in an embodiment of the present invention, as shown in figure 4, extraction
The equipment of data information specifically includes:
Memory 41, for storing computer program;
Processor 42, realizes following steps when for executing computer program:
Obtain the extraction requirement of data;
Extraction is parsed according to pre-set extraction logic to require, and obtains extraction template;
Pending data is obtained, and utilizes the information of extraction template extraction pending data.
The equipment of extraction data information provided in this embodiment, the beneficial effect of the method with said extracted data information
Fruit.
In order to solve the above technical problems, the present invention also provides a kind of computer readable storage medium, computer-readable storage
It is stored with computer program on medium, lower step is realized when computer program is executed by processor:
Obtain the extraction requirement of data;
Extraction is parsed according to pre-set extraction logic to require, and obtains extraction template;
Pending data is obtained, and utilizes the information of extraction template extraction pending data.
Computer readable storage medium provided in this embodiment, the beneficial effect of the method with said extracted data information
Fruit.
Above to method, apparatus, equipment and the computer readable storage medium of extraction data information provided by the present invention
It is described in detail.Principle and implementation of the present invention are described for specific embodiment used herein, above
The explanation of embodiment is merely used to help understand the method and its core concept of the present invention.It should be pointed out that for the art
Those of ordinary skill for, without departing from the principle of the present invention, can also to the present invention carry out it is several improvement and repair
Decorations, these improvement and modification are also fallen within the protection scope of the claims of the present invention.
Each embodiment is described by the way of progressive in specification, the highlights of each of the examples are with other realities
Apply the difference of example, just to refer each other for identical similar portion between each embodiment.For device disclosed in embodiment
Speech, since it is corresponded to the methods disclosed in the examples, so description is fairly simple, related place is referring to method part illustration
.
Professional further appreciates that, unit described in conjunction with the examples disclosed in the embodiments of the present disclosure
And algorithm steps, can be realized with electronic hardware, computer software, or a combination of the two, in order to clearly demonstrate hardware and
The interchangeability of software generally describes each exemplary composition and step according to function in the above description.These
Function is implemented in hardware or software actually, depends on the specific application and design constraint of technical solution.Profession
Technical staff can use different methods to achieve the described function each specific application, but this realization is not answered
Think beyond the scope of this invention.
The step of method described in conjunction with the examples disclosed in this document or algorithm, can directly be held with hardware, processor
The combination of capable software module or the two is implemented.Software module can be placed in random access memory (RAM), memory, read-only deposit
Reservoir (ROM), electrically programmable ROM, electrically erasable ROM, register, hard disk, moveable magnetic disc, CD-ROM or technology
In any other form of storage medium well known in field.
Claims (9)
1. a kind of method of extraction data information, which is characterized in that including:
Obtain the extraction requirement of data;
The extraction requirement is parsed according to pre-set extraction logic, obtains extraction template;
Pending data is obtained, and utilizes the information of pending data described in the extraction template extraction.
2. according to the method described in claim 1, it is characterized in that, described carry according to described in the parsing of pre-set extraction logic
Requirement is taken, obtaining extraction template is specially:
The extraction requirement is parsed according to pre-set extraction logic using SQL engines and/or http engines, is obtained corresponding
Extract template.
3. method according to claim 1 or 2, which is characterized in that described utilize waits locating described in the extraction template extraction
Reason data information be specially:
The information of pending data described in the extraction template extraction is utilized by the way of multithreading.
4. according to the method described in claim 1, it is characterized in that, further being wrapped after the extraction for obtaining data requires
It includes:
The extraction of acquisition is required to cache.
5. according to the method described in claim 4, it is characterized in that, requiring to carry out caching it in the extraction by acquisition
After further comprise:
When receiving new extraction and requiring, judge whether the new extraction is required identical as the extraction of caching requirement;
If it is not, then requiring the extraction requirement of the update caching using the new extraction.
6. according to the method described in claim 1, it is characterized in that, further comprising:
Prompt inputs the extraction requirement.
7. a kind of device of extraction data information, which is characterized in that including:
Acquisition module, the extraction requirement for obtaining data;
Parsing module obtains extraction template for parsing the extraction requirement according to pre-set extraction logic;
Extraction module for obtaining pending data, and utilizes the information of pending data described in the extraction template extraction.
8. a kind of equipment of extraction data information, which is characterized in that including:
Memory, for storing computer program;
Processor realizes that claim 1 to 6 any one of them such as extracts data information when for executing the computer program
Method the step of.
9. a kind of computer readable storage medium, which is characterized in that be stored with computer on the computer readable storage medium
Program is realized when the computer program is executed by processor as claim 1 to 6 any one of them extracts data information
The step of method.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810163956.1A CN108334634A (en) | 2018-02-27 | 2018-02-27 | A kind of method, apparatus, equipment and the storage medium of extraction data information |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810163956.1A CN108334634A (en) | 2018-02-27 | 2018-02-27 | A kind of method, apparatus, equipment and the storage medium of extraction data information |
Publications (1)
Publication Number | Publication Date |
---|---|
CN108334634A true CN108334634A (en) | 2018-07-27 |
Family
ID=62929894
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810163956.1A Pending CN108334634A (en) | 2018-02-27 | 2018-02-27 | A kind of method, apparatus, equipment and the storage medium of extraction data information |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108334634A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111125589A (en) * | 2018-10-31 | 2020-05-08 | 北大方正集团有限公司 | Data acquisition method and device and computer readable storage medium |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102360368A (en) * | 2011-10-09 | 2012-02-22 | 山东大学 | Web data extraction method based on visual customization of extraction template |
WO2013097597A1 (en) * | 2011-12-30 | 2013-07-04 | 华为技术有限公司 | Content filtering method, device and system |
CN104516979A (en) * | 2014-12-31 | 2015-04-15 | 北京锐安科技有限公司 | Data query method and data query system based on quadratic search |
CN104573040A (en) * | 2015-01-19 | 2015-04-29 | 百度在线网络技术(北京)有限公司 | Method and system for capturing webpage data |
CN106886535A (en) * | 2015-12-16 | 2017-06-23 | 大唐软件技术股份有限公司 | A kind of data pick-up method and apparatus for being adapted to multiple data sources |
-
2018
- 2018-02-27 CN CN201810163956.1A patent/CN108334634A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102360368A (en) * | 2011-10-09 | 2012-02-22 | 山东大学 | Web data extraction method based on visual customization of extraction template |
WO2013097597A1 (en) * | 2011-12-30 | 2013-07-04 | 华为技术有限公司 | Content filtering method, device and system |
CN104516979A (en) * | 2014-12-31 | 2015-04-15 | 北京锐安科技有限公司 | Data query method and data query system based on quadratic search |
CN104573040A (en) * | 2015-01-19 | 2015-04-29 | 百度在线网络技术(北京)有限公司 | Method and system for capturing webpage data |
CN106886535A (en) * | 2015-12-16 | 2017-06-23 | 大唐软件技术股份有限公司 | A kind of data pick-up method and apparatus for being adapted to multiple data sources |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111125589A (en) * | 2018-10-31 | 2020-05-08 | 北大方正集团有限公司 | Data acquisition method and device and computer readable storage medium |
CN111125589B (en) * | 2018-10-31 | 2023-09-05 | 新方正控股发展有限责任公司 | Data acquisition method and device and computer readable storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP3688672B1 (en) | Attention-based decoder-only sequence transduction neural networks | |
CN105550206B (en) | The edition control method and device of structured query sentence | |
US10162812B2 (en) | Natural language processing system to analyze mobile application feedback | |
CN107766532A (en) | A kind of front end Node.js automates positive static method | |
US9754083B2 (en) | Automatic creation of clinical study reports | |
CN110941427B (en) | Code generation method and code generator | |
EP3828751A2 (en) | Method, apparatus and electronic device for annotating information of structured document | |
CN109669632A (en) | Metadata wiring method, device and medium based on distributed objects storage system | |
US9990353B2 (en) | Natural language interface for software customization | |
CN109241037A (en) | A kind of method, apparatus that realizing database processing, equipment and storage medium | |
CN113468204A (en) | Data query method, device, equipment and medium | |
US11688393B2 (en) | Machine learning to propose actions in response to natural language questions | |
CN108334634A (en) | A kind of method, apparatus, equipment and the storage medium of extraction data information | |
KR20220002814A (en) | Method, electronic device, and storage medium for processing visual data of deep model | |
CN110209780A (en) | A kind of question template generation method, device, server and storage medium | |
DE112018005844T5 (en) | Phonetic patterns for fuzzy matching in natural language processing | |
Sailaja et al. | An overview of pre-processing text clustering methods | |
US9146918B2 (en) | Compressing data for natural language processing | |
CN112966031A (en) | Data processing method and device, electronic equipment and computer readable storage medium | |
CN111368091A (en) | Document translation method and device | |
CN111125280A (en) | Map cache processing method, system and equipment based on IndexDB | |
Buyuktanir et al. | Enhancing Accessibility to Data in Data-Intensive Web Applications by Using Intelligent Web Prefetching Methodologies. | |
CN103208093A (en) | Method and device for eliminating over-high load of large picture during rendering of graphic processing unit (GPU) | |
CN109359021A (en) | A kind of detection method of system mode, device, equipment and storage medium | |
JP5941345B2 (en) | Character information analysis method, information analysis apparatus, and program |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20180727 |
|
RJ01 | Rejection of invention patent application after publication |