CN102117303A - Patent data analysis method and system - Google Patents
- Publication number
- CN102117303A
- Authority
- CN
- China
- Prior art keywords
- data
- local database
- establishing
- extracting
- module
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention discloses a patent data analysis method and system, belonging to the field of computer applications. The patent data analysis method comprises: establishing a theme corresponding to an analysis purpose in a local database; establishing a data mart consistent with the theme; establishing a data view corresponding to the theme in a data warehouse; extracting the patent data in the local database; storing the extracted patent data in the data warehouse; determining the corresponding data mart according to a request of a user; and analyzing the patent data according to the determined data mart and returning the analysis result to the user in the form of a view. The method applies theme-specific ETL (extract-transform-load) processing to the patent data and then analyzes it, which improves the efficiency and quality of patent data analysis and makes the data more convenient for users.
Description
Technical Field
The invention relates to the field of computer application, in particular to a patent data analysis method and system.
Background
With the acceleration of globalization and the arrival of the knowledge economy, intellectual property has become an important factor in the future competitiveness of countries and enterprises, and its status and role are increasingly important. Patents in particular, as technology protected by exclusive rights, play a leading role in industry and can often determine the competitive position and market scope of an enterprise.
At present, the national patent offices of China, the United States, Europe, Japan, and others have issued more than 60 million patents, and by 2009 the State Intellectual Property Office of China had accepted more than 5 million patent applications in total. Patent information is a source of competitive and technical intelligence, and because it is comprehensive, quickly updated, and clear about rights, it has become an important object of data analysis. Patent analysis results are valuable to knowledge creators such as inventors, small and medium-sized enterprises, laboratories and universities, and are also useful to large enterprises and government agencies.
Patent information analysis searches published patent applications and patent documents, cleans and screens the search results, selects suitable analysis items according to the analysis purpose, and uses information processing technologies such as quantitative analysis, qualitative analysis and text mining to extract, at both the micro level and the macro level, the rights information, technical information, business information, development trends and other content contained in the patents.
The rapid growth of patent information and the complexity of queries make patent information analysis a process that handles a large volume of data. The diversity of user requirements further increases the complexity of patent analysis. When existing data analysis systems and methods are used for patent analysis, processing is relatively slow; in severe cases they produce erroneous analysis results, which in turn affect strategic decisions based on the patent analysis.
No effective solution has yet been proposed for the problem that data analysis systems and methods in the related art cannot carry out patent analysis reasonably and quickly.
Disclosure of Invention
The invention aims to provide a patent data analysis method and system capable of improving the accuracy of patent data analysis.
According to an aspect of the present invention, there is provided a patent data analysis method, including:
establishing a theme corresponding to an analysis purpose in a local database;
establishing a data mart consistent with the theme, and establishing a data view corresponding to the theme in a data warehouse;
extracting patent data in the local database;
storing the extracted patent data into the data warehouse, wherein the patent data is stored in a form based on the data view;
determining a corresponding data mart according to a request of a user, and analyzing patent data according to the determined data mart;
and returning the analysis result to the user in a view form.
According to another aspect of the present invention, there is provided a patent data analysis system including:
the local database is used for storing patent data, and the patent data corresponds to a pre-established theme;
the establishing module is used for establishing a data mart consistent with the theme and establishing, in a data warehouse, a data view corresponding to the theme in the local database;
the data extraction module is used for extracting the patent data in the local database;
the data warehouse is used for storing the patent data extracted by the data extraction module, wherein the patent data is stored in a form based on the data view;
the data analysis module is used for determining the data mart established by the establishing module according to the request of the user and analyzing the patent data according to the determined data mart;
and the display module is used for returning the result analyzed by the data analysis module to the user in a view form.
According to another aspect of the present invention, there is provided a patent data analysis system, the system including:
the first establishing module is used for establishing a theme corresponding to an analysis purpose in a local database;
the second establishing module is used for establishing a data mart consistent with the theme and establishing, in a data warehouse, a data view corresponding to the theme established by the first establishing module;
the data extraction module is used for extracting the patent data in the local database;
the storage module is used for storing the patent data extracted by the data extraction module to the data warehouse, wherein the patent data is stored in a form based on the data view;
the data analysis module is used for determining the data mart established by the second establishing module according to the request of the user and analyzing the patent data according to the determined data mart;
and the display module is used for returning the result analyzed by the data analysis module to the user in a view form.
With this method, the patent data is extracted and processed per theme and then analyzed, and the analysis result is returned to the user in the visual form of a view, so that the efficiency and quality of patent analysis are improved and the patent data is more convenient for the user.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this application, illustrate embodiment(s) of the invention and together with the description serve to explain the invention without limiting the invention. In the drawings:
FIG. 1 is a block diagram showing the construction of a patent data analysis system provided in Example 1;
FIG. 2 is a block diagram showing the construction of another patent data analysis system provided in Example 1;
FIG. 3 is a block diagram showing the construction of another patent data analysis system provided in Example 1;
FIG. 4 is a flow chart showing a patent data analysis method provided in Example 2;
FIG. 5 is a schematic structural diagram of a data view in a data warehouse provided in Example 2.
Detailed Description
The invention will be described in detail hereinafter with reference to the accompanying drawings in conjunction with embodiments. It should be noted that the embodiments and features of the embodiments in the present application may be combined with each other without conflict.
The embodiments of the invention address the rapid growth of patent information and the complexity of queries. Because a traditional OLTP (On-Line Transaction Processing) system cannot meet the need for deep multidimensional analysis of data, the embodiments carry out patent analysis by combining a data warehouse with OLAP (On-Line Analytical Processing).
Example 1
Referring to fig. 1, the present embodiment provides a patent data analysis system, which includes: a local database 102, an establishing module 104, a data extracting module 106, a data warehouse 107, a data analyzing module 108 and a display module 110; wherein,
a local database 102 for storing patent data corresponding to a pre-established topic;
the pre-established theme in this embodiment refers to a set of patents, which may be established according to the requirement of a user (i.e., the analysis purpose), for example: to examine the technical development status of 'Zhongxing company', a competitor of the Hua-Shi company, a theme "the applicant is Zhongxing company" is established in the local database, and only the patents under this theme are considered when the patent analysis is carried out; or, if an inventor needs to understand the current development of a research topic, a theme consistent with that research topic can be established, for example a theme in which the invention name contains "single chip microcomputer";
the patents in the local database 102 may be obtained by downloading from the internet, or may be obtained by other means.
The establishing module 104 is configured to establish a data mart consistent with the above theme, and to establish, in the data warehouse, a data view corresponding to the theme in the local database 102; data marts are typically built on analysis servers;
a data extraction module 106, configured to extract patent data in the local database 102;
a data warehouse 107, configured to store the patent data extracted by the data extraction module 106, where the patent data in this embodiment is stored in a form based on the data view;
the data analysis module 108 is used for determining the data mart established by the establishment module 104 according to the request of the user and performing patent data analysis according to the determined data mart;
and the display module 110 is used for returning the result analyzed by the data analysis module 108 to the user in a view form.
In order to better analyze patent data, the topics in the local database, the data views in the data warehouse and the data marts on the analysis server need to be kept consistent. The establishing module 104 may periodically check whether each topic has a corresponding data view in the data warehouse and a corresponding data mart, and if not, the establishing module uses a script to establish the data view; when a topic in the local database is deleted, the corresponding data view in the data warehouse and the corresponding data mart are also deleted;
the embodiment presents the patent analysis result to the user in the form of a view, so that the user can obtain the required information more intuitively and use the system more easily.
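The consistency check between topics, data views and data marts described above can be sketched as follows. This is only a minimal illustration: the function name sync_topics and the use of plain sets of IDs to stand in for the data warehouse and the analysis server are assumptions, not part of the disclosed system. For simplicity the sketch creates both the data view and the data mart when either is missing.

```python
def sync_topics(topic_ids, data_views, data_marts):
    """Bring views and marts in line with the topics of the local database.

    data_views and data_marts are sets of IDs (hypothetical stand-ins for the
    data warehouse and the analysis server); they are modified in place.
    Returns (created, deleted) lists of topic IDs.
    """
    topics = set(topic_ids)
    created = []
    for tid in sorted(topics):
        # A topic without a view or without a mart gets both established.
        if tid not in data_views or tid not in data_marts:
            data_views.add(tid)
            data_marts.add(tid)
            created.append(tid)
    # Views or marts whose topic was deleted are removed as well.
    deleted = sorted((data_views | data_marts) - topics)
    for tid in deleted:
        data_views.discard(tid)
        data_marts.discard(tid)
    return created, deleted


views = {"t1", "t9"}
marts = {"t1"}
created, deleted = sync_topics(["t1", "t2"], views, marts)
print(created, deleted)
```

Running such a check on a timer would correspond to the periodic monitoring mentioned in the text.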
Preferably, the local database 102 comprises:
a plurality of original information bases for storing original data of various patent information; the original information base can be stored in a centralized way or in a distributed storage way;
the establishing module 104, the data extraction module 106, the data warehouse 107, the data analysis module 108 and the display module 110 may be integrated into one device. Referring to fig. 2, this embodiment provides another patent data analysis system in which the establishing module 104, the data extraction module 106, the data warehouse 107, the data analysis module 108 and the display module 110 are integrated into the server 10.
Extracting patents in the local database in this embodiment refers to performing ETL (Extract-Transform-Load) processing;
the data extraction comprises full-library extraction and incremental extraction, wherein full-library extraction copies all data in the local database to the data warehouse, and incremental extraction extracts only the data added since the last extraction finished; the period of incremental extraction may be, for example, one day, one month, or a fixed time each week;
when the data extraction module 106 extracts data, it determines whether each topic of the local database has a corresponding data cube (i.e., a data mart) in the analysis server, and if not, dynamically creates the corresponding data cube in the analysis server, so that the analysis server stays consistent with the topics of the local database; at the same time, the data cube ID may be set according to some rule, for example: the data cube and the data view in the data warehouse are both named after the topic ID, so that the three (the topic in the local database, the data view and the data cube) are consistent.
The data extraction of the embodiment can be completed by combining full-library extraction and incremental extraction, as follows: the initial data is extracted by full-library extraction; then a longer period (for example, one month) is set for full-library extraction, and several shorter periods (for example, one week) are set within each full-library extraction period for incremental extraction;
when incremental extraction is carried out, a timestamp is added to a local database, and only data added after the last extraction is finished is extracted in each incremental extraction;
over a long time, patents in the original database that have already been extracted to the data warehouse may also change, for example: the legal state of some patents changes after a period of time (for example, from the published state to the actual state), or a user modifies and indexes patent data; therefore, after a long time, full-library extraction is needed;
furthermore, the above extraction methods may be combined with update extraction: patents that were extracted from the local patent database and stored in the data warehouse may later change in the local patent database, and update extraction picks up those changes; the specific update extraction methods are:
1) an update timestamp is added to the local patent database; after a local patent is changed, the service system updates its timestamp, and update extraction automatically extracts the patent data updated since the last update extraction;
2) an update trigger is established in the local patent database; when data in the patent table is changed, the update trigger writes the changed data into a temporary table, and update extraction reads the data from the temporary table and then marks or deletes the extracted data in that table;
3) an update timestamp is added to the local patent database and an update trigger is established; each time data in the patent table is changed, the update trigger automatically records the update timestamp of the changed patent, and the update extraction service automatically extracts the patent data updated since the last update extraction;
update extraction and incremental extraction can be carried out simultaneously or asynchronously;
preferably, when the data extraction module 106 is configured, a period for full-library extraction, a period for incremental extraction and a period for update extraction may be set; after each extraction finishes, the time of the full-library extraction, incremental extraction or update extraction is recorded, and at the next start the module decides whether to perform a new extraction according to the last extraction time and the set period.
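The decision "is a new extraction due?" described above can be sketched as follows. This is a minimal sketch under stated assumptions: the name due_extractions is hypothetical, the one-month and one-week periods follow the examples in the text, and the one-day period for update extraction is an assumption chosen only for illustration.

```python
from datetime import datetime, timedelta

# Configured periods; the "update" value is an assumed example.
PERIODS = {
    "full": timedelta(days=30),
    "incremental": timedelta(days=7),
    "update": timedelta(days=1),
}


def due_extractions(last_run, now, periods=PERIODS):
    """Return the extraction kinds whose period has elapsed since their last run."""
    due = []
    for kind, period in periods.items():
        previous = last_run.get(kind)
        # An extraction that has never run is always due.
        if previous is None or now - previous >= period:
            due.append(kind)
    return due


now = datetime(2009, 12, 31)
last_run = {
    "full": datetime(2009, 12, 10),
    "incremental": datetime(2009, 12, 20),
    "update": datetime(2009, 12, 31),
}
due = due_extractions(last_run, now)
print(due)
```

After each extraction completes, the caller would store the completion time back into last_run, which is the recorded extraction time the text refers to.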
Preferably, the system may further include:
the legal state acquisition module is used for acquiring legal state information of the specified patent from the legal state retrieval website;
and/or the survival time calculating module is used for analyzing the collected legal state information and calculating the survival time of the specified patent according to the analyzed information.
Here, the survival time is the length of time from the application date of the patent to the current time. It is a calculated item rather than physically stored data: it changes as time passes even though the underlying data does not, for example from 4 to 5 (the difference between the current time and the application time). Such a change in non-physical data cannot be detected by other means, so a full-library extraction strategy is adopted and the survival time of each patent is recalculated during extraction; this strategy has some lag, but the lag is acceptable;
an optimized scheme does not perform full-library extraction but starts a thread that periodically recalculates all survival-time data in the data warehouse; the recalculation period can be days, weeks, months, and so on; this reduces the system load but cannot handle the case where a patent is deleted from the local database;
further, full-library extraction can be combined with the survival-time recalculation strategy for data warehouse data, so that the full-library extraction period can be set appropriately longer;
the specified patent can be searched for on the legal status search website (for example, that of the intellectual property office of the People's Republic of China) according to its patent number and its patent name.
By inquiring about the legal state of a given patent and analyzing the result of the inquiry, the method can provide a basis for calculating the survival time of the patent, so that a user can further know the patent.
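As an illustration of the survival time as a calculated item, the following minimal sketch computes it in whole years from the application date to a given current date. The function name survival_years and the choice of whole years as the unit are assumptions; the text only states that the value is the difference between the current time and the application time.

```python
from datetime import date


def survival_years(application_date, today):
    """Whole years elapsed from the application date to `today`."""
    years = today.year - application_date.year
    # Subtract one year if this year's anniversary has not been reached yet.
    if (today.month, today.day) < (application_date.month, application_date.day):
        years -= 1
    return years


print(survival_years(date(2005, 6, 15), date(2009, 12, 31)))
print(survival_years(date(2005, 6, 15), date(2010, 6, 15)))
```

The stored application date is the same in both calls; only the current date differs, which is why the value has to be recalculated periodically rather than extracted.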
Preferably, the system may further include:
a patent attribution analysis module, used for automatically judging the attribution of a specified patent (for example, its country, province or city) according to the applicant address information of the patent and a preset region code table.
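A lookup against a region code table might be sketched as follows. The table contents, the codes shown and the simple substring matching are all illustrative assumptions; the text does not specify the format of the region code table or the matching rule.

```python
# Hypothetical region code table: place name -> code (illustrative values only).
REGION_CODES = {
    "Guangdong": "44",
    "Beijing": "11",
    "Shanghai": "31",
}


def find_region(address, table=REGION_CODES):
    """Return (place, code) for the first table entry found in the address, else None."""
    lowered = address.lower()
    for place, code in table.items():
        if place.lower() in lowered:
            return place, code
    return None


print(find_region("Nanshan District, Shenzhen, Guangdong, China"))
print(find_region("Tokyo, Japan"))
```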
The data extraction in this embodiment may be started automatically or triggered by a user. The automatic start time can be set to a time when the server is idle, such as late at night when no one is accessing the server, so that peaks of system load are avoided; alternatively, the system automatically detects the load on the server and adjusts the start time accordingly. This is called idle-time extraction. When a user wants to analyze the latest patent data immediately, the extraction process can be started at once by manual triggering.
Preferably, when data extraction is performed, a specific topic can also be extracted on its own, for example: if only the patent data of a certain topic in the local database has changed, only that changed topic needs to be extracted;
preferably, for newly added patent data, such as newly published patents, it is necessary to determine which topic each patent belongs to, set an update identifier on that topic, and perform ETL processing only on topics carrying the update identifier; for patents where only the state has changed, such as a transfer of patent right or a bibliographic change, the corresponding patents in the data warehouse can be written directly, which reduces the waste of system resources.
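Setting update identifiers on topics for newly added patents can be sketched as follows. This is a minimal sketch: representing each topic as a predicate function and each patent as a dictionary is an assumption made for illustration, and the topic names are hypothetical.

```python
def mark_updated_topics(new_patents, topic_rules):
    """Return the set of topic IDs that received at least one new patent."""
    flagged = set()
    for patent in new_patents:
        for topic_id, belongs in topic_rules.items():
            if belongs(patent):
                flagged.add(topic_id)
    return flagged


# Hypothetical topic definitions, modelled on the examples in the text.
topic_rules = {
    "applicant_zhongxing": lambda p: p["applicant"] == "Zhongxing",
    "title_single_chip": lambda p: "single chip microcomputer" in p["title"].lower(),
}
new_patents = [{"applicant": "Zhongxing", "title": "Antenna"}]
flagged = mark_updated_topics(new_patents, topic_rules)
print(flagged)
```

Only the topics in the returned set would then go through ETL processing; the other topics are left untouched.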
Referring to fig. 3, the present embodiment further provides a patent data analysis system, which includes:
a first establishing module 1002, configured to establish a topic corresponding to an analysis purpose in a local database;
a second establishing module 1004, configured to establish a data mart consistent with the theme, and establish a data view in the data warehouse corresponding to the theme established by the first establishing module 1002;
a data extraction module 1006, configured to extract patent data in the local database;
the storage module 1008 is used for storing the patent data extracted by the data extraction module 1006 into a data warehouse, wherein the patent data is stored in a form based on the data view;
a data analysis module 1010, configured to determine, according to a request of a user, a data mart established by the second establishing module 1004, and perform patent data analysis according to the determined data mart;
and a display module 1012 for returning the result analyzed by the data analysis module 1010 to the user in the form of a view.
The implementation of the data extraction module 1006 may be the same as the implementation of the data extraction module 106 in the system shown in fig. 1, and is not described herein again.
Preferably, the system may further include:
the legal state acquisition module is used for acquiring legal state information of the specified patent from the legal state retrieval website;
and/or the survival time calculating module is used for analyzing the collected legal state information and calculating the survival time of the specified patent according to the analyzed information.
Or the system also comprises a patent attribution analyzing module which is used for automatically judging the attribution of the patent according to the applicant address information of the specified patent and a preset region code table. For example: country, province, city, etc.
In this embodiment, by combining a data warehouse with OLAP, the patent data undergoes ETL processing, the data stored in the data warehouse is analyzed multidimensionally, and the analysis result is returned to the user in the intuitive form of a multidimensional view, so that the efficiency and quality of patent analysis are improved and the system is convenient for the user.
Example 2
Referring to fig. 4, the present embodiment provides a patent data analysis method, including:
step S302: establishing a theme corresponding to an analysis purpose in a local database;
in order to facilitate the management of patents, in this embodiment corresponding topics are established in the local database according to the requirements of users, and patents are organized by topic; users can perform operations such as indexing and modification on the patent data under each topic;
step S304: establishing a data mart consistent with the theme, and establishing a data view corresponding to the theme in a data warehouse;
in this embodiment, the data view in the data warehouse, the data mart on the analysis server and the theme in the local database preferably maintain a correspondence: the same identification number (ID) may be assigned to the theme, the data view and the data mart according to a certain rule, and the ID is used to determine whether the three (the theme in the local database, the data view and the data cube) are consistent; if not, the data view and the data mart corresponding to the theme need to be established according to the theme in the local database;
step S306: extracting patent data in the local database, and storing the extracted patent data into the data warehouse, wherein the patent data in this embodiment is stored in a form based on the data view;
the data extraction mentioned in this embodiment refers to ETL processing, that is, extraction, conversion, cleaning, filtering, loading and the like; the processed patents are stored in the data warehouse, which in this embodiment is located in a server.
The embodiment can periodically update the data in the data warehouse, that is, perform the above extraction operation. The data extraction includes full-library extraction, incremental extraction and update extraction: full-library extraction copies all the data in the local database to the data warehouse of the server; incremental extraction extracts only the data added since the last extraction finished, and its period may be, for example, one day, one month, or a fixed time each week; update extraction extracts a patent when that patent changes in the local database. Full-library extraction, incremental extraction and update extraction are implemented in the same way as in Example 1 and are not described in detail here.
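Incremental extraction driven by a timestamp can be sketched with Python's built-in sqlite3 module as follows. This is a minimal sketch under stated assumptions: the table name patents, the column added_at and the function incremental_extract are hypothetical, and two in-memory SQLite databases stand in for the local database and the data warehouse.

```python
import sqlite3


def incremental_extract(source, warehouse, last_extracted):
    """Copy rows added after `last_extracted`; return the new high-water mark."""
    rows = source.execute(
        "SELECT id, title, added_at FROM patents "
        "WHERE added_at > ? ORDER BY added_at",
        (last_extracted,),
    ).fetchall()
    warehouse.executemany("INSERT OR REPLACE INTO patents VALUES (?, ?, ?)", rows)
    warehouse.commit()
    # ISO-formatted dates compare correctly as strings.
    return rows[-1][2] if rows else last_extracted


source = sqlite3.connect(":memory:")
warehouse = sqlite3.connect(":memory:")
for db in (source, warehouse):
    db.execute("CREATE TABLE patents (id TEXT PRIMARY KEY, title TEXT, added_at TEXT)")
source.executemany(
    "INSERT INTO patents VALUES (?, ?, ?)",
    [
        ("P1", "Antenna", "2009-11-01"),
        ("P2", "Router", "2009-12-05"),
        ("P3", "Codec", "2009-12-20"),
    ],
)

watermark = incremental_extract(source, warehouse, "2009-12-01")
copied = [row[0] for row in warehouse.execute("SELECT id FROM patents ORDER BY id")]
print(copied, watermark)
```

The returned value is stored and passed in as last_extracted on the next run, so each run picks up only the rows added since the previous one.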
The data extraction can be performed regularly, for example, all patent data in the local database are extracted at intervals of a first preset time (for example, a month); or extracting patent data of a specific subject from the local database at intervals of a second preset time (for example: one week); or extracting patent data of a specific state in the local database at intervals of a third preset time (for example, one month).
When performing update extraction, the method may further include:
setting an update timestamp in the local database; correspondingly, the extraction of the patent data in the local database comprises: when patent data in the local database changes, extracting the patent data whose timestamp has been updated; or,
establishing an update trigger in the local database, the update trigger writing the changed patent data into a temporary table when patent data in the local database changes; correspondingly, the extraction of the patent data in the local database comprises: extracting patent data from the temporary table, and deleting or marking the extracted patent data in the temporary table; or,
setting an update timestamp and establishing an update trigger in the local database at the same time; when patent data in the local database changes, the update trigger records the update timestamp of the changed patent data; correspondingly, the extraction of the patent data in the local database comprises: extracting the patent data changed since the last extraction according to the update timestamps recorded by the update trigger.
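The trigger-and-temporary-table variant above can be sketched with Python's built-in sqlite3 module as follows. This is only an illustration: the table names patents and patent_changes, the trigger name and the function update_extract are hypothetical, and an in-memory SQLite database stands in for the local database. The sketch deletes extracted rows rather than marking them.

```python
import sqlite3

db = sqlite3.connect(":memory:")
db.executescript(
    """
    CREATE TABLE patents (id TEXT PRIMARY KEY, legal_status TEXT);
    -- Plays the role of the temporary table that collects changed rows.
    CREATE TABLE patent_changes (id TEXT, legal_status TEXT);
    CREATE TRIGGER patents_after_update AFTER UPDATE ON patents
    BEGIN
        INSERT INTO patent_changes VALUES (NEW.id, NEW.legal_status);
    END;
    """
)


def update_extract(conn):
    """Read the changed rows, then delete them from the change table."""
    rows = conn.execute("SELECT id, legal_status FROM patent_changes").fetchall()
    conn.execute("DELETE FROM patent_changes")
    conn.commit()
    return rows


db.execute("INSERT INTO patents VALUES ('P1', 'published')")
db.execute("UPDATE patents SET legal_status = 'granted' WHERE id = 'P1'")
changed = update_extract(db)
print(changed)
```

Because the change table is emptied after each extraction, a second call with no intervening updates returns an empty list.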
When extraction is carried out, all patent data in the local database can be extracted at preset time intervals; or patent data of a specific subject in the local database can be extracted at preset time intervals; or the extraction of patent data can be triggered automatically by a preset system clock, and so on; the extracted data then undergoes conversion and other operations so that its storage form conforms to the form of the data view, which makes the analysis result more accurate and reliable;
step S308: determining a corresponding data mart according to a request of a user, and analyzing the patent data according to the determined data mart;
step S310: and returning the analysis result to the user in a view form.
During data extraction, it is determined whether each theme of the local database has a corresponding data view in the data warehouse and a corresponding data cube (i.e., a data mart) in the analysis server; if not, the data view is established according to the theme and the corresponding data cube is dynamically created in the analysis server, so that the data view, the data cube and the theme of the local database are consistent;
the data extraction of the embodiment can be completed by combining full-library extraction and incremental extraction, as follows: the initial data is extracted by full-library extraction; then a longer period (for example, one month) is set for full-library extraction, and several shorter periods (for example, one week) are set within each full-library extraction period for incremental extraction; update extraction and incremental extraction may be carried out simultaneously or asynchronously.
When incremental extraction is carried out, a timestamp is added to the local database, and each incremental extraction extracts only the data added since the last extraction finished;
over a long time, patents in the original database that have already been extracted to the data warehouse may also change, for example: the legal state of some patents changes after a period of time (for example, from the published state to the actual state), or a user modifies and indexes patent data; therefore, after a long time, full-library extraction is needed;
the data extraction in this embodiment may be started automatically or triggered by a user. The automatic start time can generally be set to a time when the server is idle, such as late at night when almost no one is accessing the server, so that peaks of system load are avoided; alternatively, the system automatically detects the load on the server and adjusts the start time accordingly. This is called idle-time extraction. When a user wants to analyze the latest patent data immediately, the extraction process can be started at once by manual triggering.
Preferably, data extraction can also be limited to a specific theme. For example, if only the patent data of a certain theme has changed in the local database, only that theme needs to be extracted.
Preferably, the method further comprises: when a user wants to query the survival period of a specified patent, collecting the legal status information of the specified patent from a legal status retrieval website; parsing the legal status information; and calculating the survival period of the specified patent from the parsed information.
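The parse-then-calculate steps can be sketched in Python as follows. The source does not define the format of the legal status information or the exact survival formula, so this sketch assumes `date;status` text lines, a hypothetical set of terminal status labels, and a survival period measured in days from the filing date to the first terminal event (or to today if there is none).

```python
from datetime import date

# Assumed labels for statuses that end a patent's life (hypothetical).
TERMINAL_STATUSES = {"expired", "withdrawn", "rejected", "terminated"}


def parse_legal_status(raw):
    """Parse lines like '2011-07-06;published' into sorted (date, status) pairs."""
    events = []
    for line in raw.strip().splitlines():
        day, status = line.split(";", 1)
        events.append((date.fromisoformat(day.strip()), status.strip().lower()))
    return sorted(events)


def survival_days(filing_date, events, today):
    """Days from filing until the first terminal event, or until today."""
    end = today
    for day, status in events:
        if status in TERMINAL_STATUSES:
            end = day
            break
    return (end - filing_date).days


# Hypothetical status record, for illustration only.
raw_status = "2011-01-10;published\n2013-01-01;withdrawn"
events = parse_legal_status(raw_status)
days_alive = survival_days(date(2010, 1, 1), events, today=date(2020, 1, 1))
```

In a real system the raw text would come from the legal status retrieval website, and the parsing rules would depend on that site's format.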
Preferably, the method further comprises: when the user wants to query the place to which a specified patent belongs, determining that place according to the applicant address information in the specified patent and a preset region code table. The region code table represents places (for example, countries, provinces, and cities) by codes, with each code corresponding one-to-one to a place.
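A region code lookup of this kind might look like the following Python sketch. The table entries, the substring-matching rule, and the function name are illustrative assumptions; the source does not give the contents of the region code table or the matching method.

```python
# Illustrative region code table: one code per place (entries are assumptions).
REGION_CODES = {
    "Beijing": "11",
    "Shanghai": "31",
    "Guangdong": "44",
}


def region_code_for(applicant_address, table=REGION_CODES):
    """Return the code of the first place name found in the address, else None."""
    address = applicant_address.lower()
    for place, code in table.items():
        if place.lower() in address:
            return code
    return None


code = region_code_for("No. 1 Example Road, Haidian District, Beijing")
unknown = region_code_for("Address not on file")
```

Because each code maps to exactly one place, the returned code can be used directly as a dimension key when analyzing patents by region.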
Fig. 5 is a schematic structural diagram of a data view in the data warehouse provided in this embodiment. The data view organizes the information in patent data in a star schema; this embodiment is described using only part of that information as an example.
Preferably, for newly added patent data, such as newly published patents, it is necessary to determine which theme the data belongs to, set an update identifier on that theme, and perform ETL processing on the themes carrying the update identifier. For patents in which only the state has changed, such as a transfer of patent rights or a bibliographic change, the corresponding patents in the data warehouse can be written directly, which reduces the waste of system resources.
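The routing of changes described here can be sketched in Python. The change records, the `classify_theme` callback, and the dictionary standing in for the data warehouse are hypothetical; the sketch only shows that new patents flag their theme for ETL, while status-only changes are written straight to the warehouse.

```python
def process_changes(changes, classify_theme, warehouse):
    """Route changes: new patents flag their theme, status changes are written directly.

    Returns the set of themes that carry an update identifier and therefore
    need ETL processing.
    """
    flagged_themes = set()
    for change in changes:
        patent = change["patent"]
        if change["kind"] == "new":
            flagged_themes.add(classify_theme(patent))
        elif change["kind"] == "status":
            # Status-only change: update the warehouse record in place, no ETL.
            warehouse[patent["id"]]["status"] = patent["status"]
    return flagged_themes


# Hypothetical data for illustration.
warehouse = {"P1": {"status": "published"}}
changes = [
    {"kind": "new", "patent": {"id": "P2", "keywords": ["battery"]}},
    {"kind": "status", "patent": {"id": "P1", "status": "rights transferred"}},
]
themes_to_reload = process_changes(changes, lambda p: p["keywords"][0], warehouse)
```

Only the flagged themes then go through the full ETL path, which is where the saving in system resources comes from.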
By combining data warehouse and OLAP technology, this embodiment performs ETL processing on patent data, performs multidimensional analysis on the data stored in the data warehouse, and returns the analysis result to the user in the intuitive form of a multidimensional view. This improves the efficiency and quality of patent analysis and makes patent analysis convenient for the user.
It will be apparent to those skilled in the art that the modules or steps of the present invention described above may be implemented by a general-purpose computing device, and may be centralized on a single computing device or distributed across a network of multiple computing devices. Alternatively, they may be implemented by program code executable by a computing device, so that they can be stored in a storage device and executed by the computing device. In some cases, the steps shown or described may be performed in an order different from that described herein; the modules or steps may also be fabricated separately as individual integrated circuit modules, or several of them may be fabricated as a single integrated circuit module. Thus, the present invention is not limited to any specific combination of hardware and software.
The above description is only a preferred embodiment of the present invention and is not intended to limit the present invention, and various modifications and changes may be made by those skilled in the art. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.
Claims (12)
1. A patent data analysis method, characterized in that the method comprises:
establishing a theme corresponding to an analysis purpose in a local database;
establishing a data mart consistent with the theme, and establishing a data view corresponding to the theme in a data warehouse;
extracting patent data in the local database;
storing the extracted patent data into the data warehouse, wherein the patent data is stored in a form based on the data view;
determining a corresponding data mart according to a request of a user, and analyzing patent data according to the determined data mart;
and returning the analysis result to the user in a view form.
2. The method of claim 1, wherein extracting patent data from the local database comprises:
periodically extracting the patent data in the local database.
3. The method of claim 2, wherein periodically extracting patent data from the local database comprises at least one of:
extracting all patent data in the local database at intervals of a first preset time;
extracting patent data of a specific theme in the local database at intervals of a second preset time; and
extracting patent data in a specific state in the local database at intervals of a third preset time.
4. The method of claim 2, wherein the periodic extraction of patent data from the local database is a system-timed extraction or an idle-time extraction.
5. The method of claim 1, further comprising:
setting an update timestamp in the local database;
wherein, correspondingly, extracting the patent data in the local database comprises:
when the patent data in the local database changes, extracting the patent data after the update timestamp.
6. The method of claim 1, further comprising:
establishing an update trigger in the local database;
when the patent data in the local database changes, writing, by the update trigger, the changed patent data into a temporary table;
wherein, correspondingly, extracting the patent data in the local database comprises:
extracting the patent data from the temporary table, and deleting or specially marking the extracted patent data in the temporary table.
7. The method of claim 1, further comprising:
setting an update timestamp and establishing an update trigger in the local database;
when the patent data in the local database changes, recording, by the update trigger, the update timestamp of the changed patent data;
wherein, correspondingly, extracting the patent data in the local database comprises:
extracting the patent data changed since the last extraction according to the update timestamp recorded by the update trigger.
8. The method of claim 1, further comprising:
collecting legal status information of a specified patent from a legal status retrieval website.
9. The method of claim 1, further comprising:
acquiring the survival period of a specified patent, comprising:
collecting legal status information of the specified patent from a legal status retrieval website;
parsing the legal status information;
and calculating the survival period of the specified patent according to the parsed information.
10. The method of claim 1, further comprising:
determining the place to which a specified patent belongs, comprising:
determining the place to which the specified patent belongs according to the applicant address information in the specified patent and a preset region code table.
11. A patent data analysis system, the system comprising:
a local database, configured to store patent data, the patent data corresponding to a pre-established theme;
an establishing module, configured to establish a data mart consistent with the theme and to establish, in a data warehouse, a data view corresponding to the theme in the local database;
a data extraction module, configured to extract the patent data in the local database;
the data warehouse, configured to store the patent data extracted by the data extraction module in a form based on the data view;
a data analysis module, configured to determine, according to a request of a user, the data mart established by the establishing module, and to analyze the patent data according to the determined data mart;
and a display module, configured to return the result analyzed by the data analysis module to the user in the form of a view.
12. A patent data analysis system, the system comprising:
a first establishing module, configured to establish a theme corresponding to an analysis purpose in a local database;
a second establishing module, configured to establish a data mart consistent with the theme and to establish, in a data warehouse, a data view corresponding to the theme established by the first establishing module;
a data extraction module, configured to extract the patent data in the local database;
a storage module, configured to store the patent data extracted by the data extraction module in the data warehouse in a form based on the data view;
a data analysis module, configured to determine, according to a request of a user, the data mart established by the second establishing module, and to analyze the patent data according to the determined data mart;
and a display module, configured to return the result analyzed by the data analysis module to the user in the form of a view.
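The trigger-and-temporary-table extraction recited in claim 6 can be illustrated with Python's built-in `sqlite3` module. This is a minimal sketch under stated assumptions: SQLite stands in for the local database, and the table names `patents` and `patent_changes`, the column names, and the choice to delete (rather than mark) extracted rows are all hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE patents (patent_id TEXT PRIMARY KEY, status TEXT);
    -- Stands in for the temporary table that collects changed rows.
    CREATE TABLE patent_changes (patent_id TEXT, status TEXT);

    CREATE TRIGGER patents_insert AFTER INSERT ON patents
    BEGIN
        INSERT INTO patent_changes VALUES (NEW.patent_id, NEW.status);
    END;

    CREATE TRIGGER patents_update AFTER UPDATE ON patents
    BEGIN
        INSERT INTO patent_changes VALUES (NEW.patent_id, NEW.status);
    END;
    """
)


def extract_changes(conn):
    """Read the captured changes, then delete the rows that were extracted."""
    rows = conn.execute(
        "SELECT rowid, patent_id, status FROM patent_changes ORDER BY rowid"
    ).fetchall()
    conn.executemany(
        "DELETE FROM patent_changes WHERE rowid = ?", [(r[0],) for r in rows]
    )
    return [(r[1], r[2]) for r in rows]


conn.execute("INSERT INTO patents VALUES ('P1', 'published')")
conn.execute("UPDATE patents SET status = 'examination' WHERE patent_id = 'P1'")
first_batch = extract_changes(conn)   # both captured changes
second_batch = extract_changes(conn)  # nothing left to extract
```

Deleting by `rowid` removes only the rows that were actually read, so changes captured while an extraction is running are kept for the next run.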
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2009102657657A CN102117303A (en) | 2009-12-31 | 2009-12-31 | Patent data analysis method and system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2009102657657A CN102117303A (en) | 2009-12-31 | 2009-12-31 | Patent data analysis method and system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN102117303A (en) | 2011-07-06 |
Family
ID=44216077
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2009102657657A Pending CN102117303A (en) | 2009-12-31 | 2009-12-31 | Patent data analysis method and system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN102117303A (en) |
Cited By (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102508908A (en) * | 2011-11-11 | 2012-06-20 | 北京用友政务软件有限公司 | Method for acquiring subordinate financial business data and system for acquiring subordinate financial business data |
CN102508908B (en) * | 2011-11-11 | 2015-04-08 | 北京用友政务软件有限公司 | Method for acquiring subordinate financial business data and system for acquiring subordinate financial business data |
CN103177010A (en) * | 2011-12-22 | 2013-06-26 | 苏州威世博知识产权服务有限公司 | Patent analytical method and system |
CN102682109A (en) * | 2012-05-09 | 2012-09-19 | 北京彼速信息技术有限公司 | Patent information analysis method and device |
CN103455500A (en) * | 2012-05-30 | 2013-12-18 | 航天信息股份有限公司 | Method and device for extracting and issuing data |
CN102915336A (en) * | 2012-09-18 | 2013-02-06 | 北京金和软件股份有限公司 | Incremental data capturing and extraction method based on timestamps and logs |
CN102915336B (en) * | 2012-09-18 | 2015-07-15 | 北京金和软件股份有限公司 | Incremental data capturing and extraction method based on timestamps and logs |
CN105320683A (en) * | 2014-07-24 | 2016-02-10 | 贾新志 | Graphical display method of literature theme content analysis |
CN104615778A (en) * | 2015-02-27 | 2015-05-13 | 浪潮集团有限公司 | Method, device and system for avoiding re-extracting data |
CN105139308A (en) * | 2015-08-31 | 2015-12-09 | 佛山市恒南微科技有限公司 | Regional enterprise patent information thorough investigation and management system |
CN105069585A (en) * | 2015-08-31 | 2015-11-18 | 佛山市恒南微科技有限公司 | Enterprise patent announcement information grabbing and management system |
CN105138651A (en) * | 2015-08-31 | 2015-12-09 | 佛山市恒南微科技有限公司 | Method for grabbing and managing enterprise trademark notice information |
CN105160471A (en) * | 2015-08-31 | 2015-12-16 | 佛山市恒南微科技有限公司 | Method for investigating and managing regional enterprise patent information |
CN105160472A (en) * | 2015-08-31 | 2015-12-16 | 佛山市恒南微科技有限公司 | Enterprise software copyright announcement information grasping and managing system |
CN105183821A (en) * | 2015-08-31 | 2015-12-23 | 佛山市恒南微科技有限公司 | Method for implementing regional enterprise software copyright bulletin fundamental investigation and management |
CN105183822A (en) * | 2015-08-31 | 2015-12-23 | 佛山市恒南微科技有限公司 | Enterprise trademark bulletin information capture and management system |
CN105184704A (en) * | 2015-08-31 | 2015-12-23 | 佛山市恒南微科技有限公司 | System for realizing investigation and management of area enterprise trademark information |
CN105005881A (en) * | 2015-08-31 | 2015-10-28 | 佛山市恒南微科技有限公司 | System for implementing intellectual property investigation and management for regional enterprises |
CN106095899A (en) * | 2016-06-07 | 2016-11-09 | 安庆市扬智信息科技有限公司 | A kind of intellectual property intelligence system with patent analytic function |
CN111159154A (en) * | 2019-12-31 | 2020-05-15 | 新奥数能科技有限公司 | Energy data warehouse system |
CN112667691A (en) * | 2021-03-16 | 2021-04-16 | 中汽数据有限公司 | Database-based patent indexing method, device, equipment and storage medium |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
CN102117303A (en) | Patent data analysis method and system | |
US8140495B2 (en) | Asynchronous database index maintenance | |
CN111459985B (en) | Identification information processing method and device | |
US6408312B1 (en) | Method and system for supporting multiple, historical, and future designs in a relational database | |
US7672966B2 (en) | Adding extrinsic data columns to an existing database schema using a temporary column pool | |
CN103154935B (en) | For inquiring about the system and method for data stream | |
CN100596353C (en) | Method and system for providing log service | |
CN105989195A (en) | Approach and system for processing data in database | |
US8862588B1 (en) | Generating an empirically-determined schema for a schemaless database | |
US20030037114A1 (en) | System, method and apparatus for updating electronic mail recipient lists | |
CN110928903B (en) | Data extraction method and device, equipment and storage medium | |
US20080250073A1 (en) | Sql change tracking layer | |
US7493323B2 (en) | Document group analyzing apparatus, a document group analyzing method, a document group analyzing system, a program, and a recording medium | |
CN106709851B (en) | Big data retrieval method and device | |
KR20040054471A (en) | Contact user interface | |
CN103460208A (en) | Methods and systems for loading data into a temporal data warehouse | |
CN101136027B (en) | System and method for database indexing, searching and data retrieval | |
US8620946B2 (en) | Storage and searching of temporal entity information | |
CN106503158A (en) | Method of data synchronization and device | |
CN104781793A (en) | Systems and methods for integrating storage usage information | |
CN111125213A (en) | Data acquisition method, device and system | |
CN106503186A (en) | A kind of data managing method, client and system | |
CN107291951B (en) | Data processing method, device, storage medium and processor | |
RU2635886C2 (en) | Systems and methods for managing files through mobile computer devices | |
CN110502529B (en) | Data processing method, device, server and storage medium |
Legal Events
Code | Title | Description |
---|---|---|
C06 | Publication | |
PB01 | Publication | |
C10 | Entry into substantive examination | |
SE01 | Entry into force of request for substantive examination | |
C53 | Correction of patent for invention or patent application | |
CB03 | Change of inventor or designer information | Inventor after: Zhen Chunjie; Inventor before: Pan Xiaomei |
COR | Change of bibliographic data | Free format text: CORRECT: INVENTOR; FROM: PAN XIAOMEI TO: ZHEN CHUNJIE |
C12 | Rejection of a patent application after its publication | |
RJ01 | Rejection of invention patent application after publication | Application publication date: 20110706 |