WO2022188820A1 - 文件的分类处理方法、装置、服务器、系统及计算机程序产品 - Google Patents

文件的分类处理方法、装置、服务器、系统及计算机程序产品 Download PDF

Info

Publication number
WO2022188820A1
WO2022188820A1 PCT/CN2022/080005 CN2022080005W WO2022188820A1 WO 2022188820 A1 WO2022188820 A1 WO 2022188820A1 CN 2022080005 W CN2022080005 W CN 2022080005W WO 2022188820 A1 WO2022188820 A1 WO 2022188820A1
Authority
WO
WIPO (PCT)
Prior art keywords
field
classification
account
converted
file
Prior art date
Application number
PCT/CN2022/080005
Other languages
English (en)
French (fr)
Inventor
刘旭阳
杨林林
张鑫
项晓露
周志翔
Original Assignee
智慧芽信息科技(苏州)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 智慧芽信息科技(苏州)有限公司 filed Critical 智慧芽信息科技(苏州)有限公司
Publication of WO2022188820A1 publication Critical patent/WO2022188820A1/zh

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • G06F16/353Clustering; Classification into predefined classes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • G06F16/358Browsing; Visualisation therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/38Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/30Authentication, i.e. establishing the identity or authorisation of security principals
    • G06F21/31User authentication
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06F21/604Tools and structures for managing or administering access control systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06F21/62Protecting access to data via a platform, e.g. using keys or access control rules
    • G06F21/6218Protecting access to data via a platform, e.g. using keys or access control rules to a system of files or objects, e.g. local or distributed file system or database
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2216/00Indexing scheme relating to additional aspects of information retrieval not explicitly covered by G06F16/00 and subgroups
    • G06F2216/11Patent retrieval

Definitions

  • the present disclosure relates to the technical field of patent data processing, and in particular, to a method, device, server, system and computer program product for classifying and processing files.
  • patent agency companies or patent related parties such as patent application, management, and operation maintain patent data, they usually need to classify patents.
  • some patent classification rules can be set on the user side, and different R&D or patent managers within the user can index patents based on the understanding of the patent content, so as to realize the classification of patents.
  • different R&D or patent management personnel lack a unified review node for indexing patents. It is difficult for different indexers to ensure the accuracy of patent indexing, and there are problems of difficult management and low classification accuracy.
  • the index is converted to the classification field corresponding to the classification rule, it can only be converted one by one at present, and the classification efficiency is low.
  • the present disclosure provides a file classification processing method, device, server, system and computer program product to at least solve the technical problem of low file classification efficiency in the related art.
  • the technical solutions of the present disclosure are as follows:
  • a method for classifying and processing files comprising:
  • Identify the operating authority of the operating account where the operating authority includes pre-set business contents that can be handled by different account types;
  • the operation account has the conversion operation authority, obtain a corresponding file data set including the second field according to the business field allocated to the operation account, wherein the second file data set in the file data set including the second field is obtained. fields, including data information obtained by processing file data by operating accounts that match different or the same operating authority;
  • the second field in the file data set is converted into a classification field of the target field type
  • the method further includes:
  • the second field in the file data set is modified to obtain modified field data.
  • the operating authority of the operating account includes the account and corresponding authority set in the following manner:
  • the first account type which has the permission to index files and has no permission to convert
  • the second account type which modifies the data information generated by the processing of the file by the indexing account in the first account type, and has the authority to convert the field to be converted into a business field of a specified type;
  • the third account type has conversion operation authority, and assigns the conversion account in the second account type according to a preset matching rule, a business field that allows the conversion account to convert the field to be converted into.
  • the matching rules include:
  • the method further includes:
  • a notification message is sent to the allocated account of the third account type
  • the file corresponding to the exception field is reassigned to the converted account number in the second account type that matches the exception field.
  • the abnormal field is determined in the following manner:
  • the reference field is compared with the converted classification field, and if the difference between the reference field and the converted classification field is greater than a preset condition, it is determined that the classification field with the difference greater than the preset condition is an abnormal field.
  • the second field includes:
  • the second field further includes:
  • the approval information included in the extracted file wherein the approval information includes at least one of the following:
  • the document includes a patent document.
  • the method further includes: in response to the second conversion operation of the field to be converted, converting the field to be converted corresponding to the file into a classification field in the target field type, wherein,
  • the target field type is a hierarchical field and/or a text field
  • the target field type is an option field and/or a text field
  • the target field type is an option field and/or a hierarchical field.
  • the hierarchical fields are tree-structured data with classification fields as nodes;
  • the displaying the classification result after the conversion operation of the second field includes: displaying the classification fields of the leaf node, intermediate node, and root node corresponding to the field to be converted in the tree structure, and displaying the classification field in a preset symbol and/or format. Shows the hierarchical relationship between the classification fields to which they belong.
  • the method when the target field type is a hierarchical field or an option field, in the conversion process, the method further includes:
  • the classification field to be converted to is determined based on the selection operation instruction of the classification field.
  • the classification field in the target field type includes a classification field of user-defined classification settings.
  • the method further includes:
  • Matching result information is displayed, where the matching result information includes the type of the field to be converted, the type of the target field, the number of files to be converted this time, and the number of successful and/or failed file conversions.
  • the conversion process is performed by creating an asynchronous task.
  • a file classification processing device comprising:
  • an authority identification module used for identifying the operation authority of the operation account, the operation authority including pre-set business contents that can be handled by different account types;
  • a data acquisition module configured to acquire a corresponding file data set containing the second field according to the business field allocated for the operating account when the operating account has the conversion operation authority, wherein the file containing the second field
  • the second field in the data set includes data information obtained by processing file data by operating accounts that match different or the same operating authority;
  • a first conversion module configured to respond to the first conversion operation of the second field, and convert the second field in the file data set into a classification field of the target field type
  • the display module is configured to display the classification result after the conversion operation of the second field.
  • the apparatus further comprises:
  • the modification module is configured to modify the second field in the file data set in response to the modification operation of the second field to obtain the modified field data.
  • the operation authority of the operation account includes the account and corresponding authority set in the following manner:
  • the first account type which has the permission to index files and has no permission to convert
  • the second account type which modifies the data information generated by the processing of the file by the indexing account in the first account type, and has the authority to convert the field to be converted into a business field of a specified type;
  • the third account type has conversion operation authority, and assigns the conversion account in the second account type according to a preset matching rule, a business field that allows the conversion account to convert the field to be converted into.
  • the matching rule includes:
  • the device also includes:
  • An exception notification module used for assigning to the third account type when the operating account is of the second account type and the classification result after the conversion of the field to be converted includes an abnormal field that does not belong to the business field allocated for the converted account
  • the account sends a notification message
  • the reassignment module is configured to, in response to the reassignment operation of the assigned account, reassign the file corresponding to the exception field to the converted account in the second account type that matches the exception field.
  • the abnormal field is determined in the following manner:
  • the reference field is compared with the converted classification field, and if the difference between the reference field and the converted classification field is greater than a preset condition, it is determined that the classification field with the difference greater than the preset condition is an abnormal field.
  • the second field includes:
  • the second field further includes:
  • the approval information included in the extracted file wherein the approval information includes at least one of the following:
  • the document includes a patent document.
  • the device also includes:
  • the second conversion module is configured to respond to the second conversion operation of the field to be converted, and convert the field to be converted corresponding to the file into a classification field in the target field type, wherein,
  • the target field type is a hierarchical field and/or a text field
  • the target field type is an option field and/or a text field
  • the target field type is an option field and/or a hierarchical field.
  • the hierarchical field is tree-structured data with a classification field as a node
  • the displaying the classification result after the conversion operation of the second field includes: displaying the classification fields of the leaf node, intermediate node, and root node corresponding to the field to be converted in the tree structure, and displaying the classification field in a preset symbol and/or format. Shows the hierarchical relationship between the classification fields to which they belong.
  • the apparatus further comprises:
  • Repeated classification display module for when the target field type is a hierarchical field or an option field, if there are multiple classification fields matching the description information in the target field type, then display the multiple matching classifications field;
  • the classification selection module is used to receive the selection operation instruction of the classification field, and determine the target field to be converted into.
  • the target field type includes a category field of a user-defined category setting.
  • the apparatus further comprises:
  • the matching result display module is used to display the matching result information after being converted into the target field type, and the matching result information includes the type of the field to be converted, the target field type, the number of files converted this time, and the files converted successfully and/or failed. quantity.
  • the first conversion module or the second conversion module converts the to-be-processed field by creating an asynchronous task.
  • Another aspect of the embodiments of the present disclosure further provides a server, including:
  • a memory for storing the processor-executable instructions
  • the processor is configured to execute the instructions to implement the method described in any embodiment of the present disclosure.
  • Another aspect of the embodiments of the present disclosure further provides a computer-readable storage medium.
  • the server can execute any one of the methods in the present disclosure. method described.
  • Another aspect of the embodiments of the present disclosure further provides a computer program product, including a computer program/instructions, wherein, when the computer program is executed by a processor, the method described in any one of the embodiments of the present disclosure is implemented.
  • Another aspect of the embodiments of the present disclosure further provides a patent management system, including the apparatus described in any one of the embodiments of the present disclosure, or, when the processor of the patent management system executes the executable instructions stored in the memory, any one of the embodiments of the present disclosure can be implemented.
  • a method for classifying and processing documents, or the patent management system includes the computer program product.
  • the operation authority of the operation account can be determined first, and then the operation account with the conversion operation authority can uniformly perform the classification field data in the file.
  • the conversion of audit and target field types not only greatly improves the conversion processing efficiency of different classification field types of files, saves the classification processing time of files, but also enables managers to centrally and uniformly audit the classification field data to ensure the classification field data audit. Consistency and accuracy of quality, optimize the process of document classification management.
  • the indexing results of each indexer who specifically implements patent indexing can be reviewed and data traceable.
  • the integrated conversion of fields ensures the accuracy and efficiency of collaborative work and patent classification.
  • the converted classification fields can be displayed in the work space, so that users can more clearly and comprehensively view the classification fields to which the patent text belongs or the hierarchical relationship of the classification fields, etc., and improve the user experience of the patent document management service.
  • FIG. 1 is a schematic diagram of an application scenario of a method for classifying and processing files according to an exemplary embodiment.
  • Fig. 2 is a flow chart of a method for classifying and processing files according to an exemplary embodiment.
  • FIG. 3 is a schematic diagram of a result after indexing data provided by the present disclosure is converted into an option field.
  • FIG. 4 is a schematic diagram of a result of converting index data provided by the present disclosure into hierarchical fields.
  • FIG. 5 is a schematic diagram of a hierarchical relationship of hierarchical fields of a pre-classified design provided by the present disclosure.
  • Fig. 6 is a flowchart of a method for classifying and processing files according to another exemplary embodiment.
  • Fig. 7 is a flowchart of a method for classifying files according to another exemplary embodiment.
  • Fig. 8 is a flowchart of a method for classifying files according to another exemplary embodiment.
  • FIG. 9 is a schematic diagram of a scenario of indexing data including batch reply information in an embodiment provided by the present disclosure.
  • FIG. 10 is a schematic diagram of a scenario of approval information including reply information in an embodiment provided by the present disclosure.
  • Fig. 11 is a flowchart of a method for classifying files according to another exemplary embodiment.
  • FIG. 12 is a schematic diagram of a scenario provided by the present disclosure for determining a target field for a user when there are repeated hierarchical nodes.
  • Fig. 13 is a schematic diagram of a scenario of a method for classifying and processing files according to an exemplary embodiment.
  • Fig. 14 is a schematic structural diagram of an apparatus for classifying and processing a file according to an exemplary embodiment.
  • Fig. 15 is a schematic structural diagram of an apparatus for classifying and processing a file according to another exemplary embodiment.
  • Fig. 16 is a schematic structural diagram of an apparatus for classifying and processing a file according to another exemplary embodiment.
  • Fig. 17 is a schematic structural diagram of an apparatus for classifying and processing a file according to another exemplary embodiment.
  • FIG. 18 is a schematic structural diagram of a file classification processing device S00 according to an exemplary embodiment.
  • a method for classifying and processing files provided by the present disclosure can be applied to the application scenario shown in FIG. 1 .
  • the patent management system 110 provided to the user, the patent management system 110 can build a patent database, provide a patent management interface, and realize the classification, storage, novelty search, analysis, and update of patent data.
  • users usually can only index patent documents one by one, such as adding text type description information to a patent, or directly indexing patents into a certain category, such as category "A".
  • category "A" category "A"
  • the method for classifying and processing documents provided by the present disclosure can be applied to the patent management system 110, and can provide, including but not limited to, classifying and processing patent documents, and not only optimizes the patent classification processing and management process, but also realizes different rights, Different levels of management can also achieve more efficient and accurate classification, improve the efficiency of patent management operations and the accuracy of classification results.
  • the fields to be converted described in some embodiments of the present disclosure can be understood as the data information of the files to be converted, such as the indexing data of the patent text classification records online or offline by the indexers themselves, and may also include the data provided by the present disclosure.
  • the target field type can be understood as the field type to which the classification field to be converted belongs.
  • the field types provided by the present disclosure may include, but are not limited to, text fields, option fields, and hierarchical fields.
  • the text field may include information content entered by the user to describe the content of the file, key technologies, and the like. The user is usually free to enter categorical descriptive words or other information in the text type (except for restricting illegal fields).
  • the option field usually includes a user-defined classification field, and the classification field in the option field may be a parallel classification relationship or a hierarchical relationship.
  • a file can belong to the classification fields in multiple option fields.
  • a patent file belongs to A and B of the four option fields of A, B, C, and D, or it can belong to the option fields A and B1, of which A, B, Both C and D are primary classifications, and B1 is a sub-category (secondary classification) of primary classification B.
  • the hierarchical field may include the subordination or hierarchical relationship of different classification fields.
  • the index data of a certain patent text includes keywords such as "sweeping robot" and "cleaning device", then according to the index data, the patent can be identified.
  • the patent management system 110 described in the embodiments of the present disclosure may include, but is not limited to, local or remote servers, server clusters, distributed subsystems, cloud processing platforms, servers including blockchain nodes, and devices of combinations thereof. Including various personal computers, laptops, smartphones, tablets, wearable devices, in-vehicle devices, medical devices, etc.
  • Fig. 2 is a flowchart of a method for classifying and processing files according to an exemplary embodiment. As shown in Fig. 2 , the method can be used in the aforementioned patent management system 110, and can include the following steps.
  • S200 Identify the operation authority of the operating account, where the operation authority includes preset business contents that can be handled by different account types.
  • different account types may be preset, and different account types have different operation rights in the patent management system or work space.
  • the first type of account can only index files and cannot convert the indexing data generated by the indexing into option fields or hierarchical fields.
  • the second type of account has the conversion operation authority, which can convert the indexing data into the target field type, and can also review the indexing data of the third type of account, such as modifying it.
  • the third type of account can have the highest authority, for example, it can have all the authority in the work space or the patent management system, and it can have the authority to assign some operation authority to different second type accounts according to certain rules, or to the second type of account. The conversion result of the account is reviewed, modified, etc.
  • the third type of account can also have the conversion operation authority and the authority to index files.
  • the corresponding account type can be set according to the requirements for file classification management.
  • the present disclosure also provides an implementation method for setting accounts with different permissions.
  • the operation authority of the operation account includes the account and corresponding authority set in the following manner:
  • the first account type which has the permission to index files and has no permission to convert
  • the second account type which modifies the data information generated by the processing of the file by the indexing account in the first account type, and has the authority to convert the field to be converted into a business field of a specified type;
  • the third account type has conversion operation authority, and assigns the conversion account in the second account type according to a preset matching rule, a business field that allows the conversion account to convert the field to be converted into.
  • the first account type can be the same as the aforementioned first type of account, and can be provided to a specific person who performs initial classification processing of files, such as an operator who indexes files.
  • the account used by the indexing personnel may be called an indexing account, which belongs to the first account type.
  • the account in the second account type (may be referred to as a conversion account), with reference to the aforementioned second type of account, the data information generated by the processing of the file by the index account in the first account type can be modified, and Has the permission to convert the field to be converted into a business field of the specified type.
  • the third account type refer to the aforementioned third type of account, and a business field to which the to-be-converted field can be converted can be set for different conversion accounts.
  • a distribution account belonging to the third type of account type can set or assign a conversion account user33 to handle electronic patent document classification, that is, the classification after conversion account user33 converts the index data should be the electronic classification field.
  • the business field may generally refer to a field allocated to the operation account according to the business field to which the operation account belongs or the professional scope to be handled.
  • the business field is usually the target field (category field in the target field type) to which the operation account needs to convert the to-be-processed field.
  • the target field category field in the target field type
  • the business field allocated to the operation account may be a target field belonging to the machinery field.
  • the document may include one or more patents that are pre-selected or screened.
  • solr a search application server
  • the operation account with the conversion operation authority may be the conversion account in the second account type as described above.
  • the corresponding business field can be determined, and the file data set assigned to the conversion account for field conversion can be obtained.
  • the file data set may include the second field.
  • the second field may include various data information.
  • the second field may be indexing data.
  • the embodiment of the present disclosure does not limit that the second field must be index data, which mainly refers to the operation account in the first account type processing the file to obtain data information. Therefore, the second field in the file data set containing the second field may include data information obtained by processing the file data by operating accounts that match different or the same operating authority. For example, data information obtained by indexing patent documents by multiple operators who index patent documents.
  • the indexing personnel are usually one or more R&D or patent classification personnel, and may also include review and management personnel for patent management. Therefore, it may be data information obtained by processing file data by operating accounts that match different or the same operating authority.
  • the file data set may include data information of one or more files, usually including data information obtained by processing file data, such as index data.
  • the indexing data may include the overall classification description of the patent by the indexer, and may include the text language description of the patent classification input by the R&D or patent administrator. For example, "This patent is a cleaning device applied to a sweeping robot, which can automatically identify obstacles and automatically charge.”
  • indexing data can be used as a pending field for initial classification of patents using the patent management system.
  • the fields to be processed may also be option fields and/or hierarchical fields.
  • a patent publication (publication) number may be used to uniquely identify a patent, and all or part of the description information, text field, option field, and level field are represented by letters.
  • the text field may be A, B, C, etc.
  • the second field may include data information obtained by indexing one or more files.
  • the file data set may include one or more second fields.
  • the operation account has the conversion operation permission and can perform field classification conversion between different field types.
  • the aforementioned second field may be an implementation scenario of the field to be converted, for example, the second field may be index data.
  • the field type to which the field to be converted is converted can be called the target field type, such as an option field or a hierarchy field. Under each field type, multiple classification fields to be finally determined by the file are usually pre-defined by the user.
  • the operation account performs the conversion operation of the second field (here, it may be the first conversion operation)
  • the second field in the file data set is converted into a classification field of the target field type.
  • a preset algorithm can be used for conversion processing.
  • the option field or level field may be a preset classification field, and then the second field may be matched with the classification field in the target field type. When the content of the second field matches the classification field, the matched classification field is used as the classification field converted with the second field.
  • the content of the field to be processed and the classification field usually need to meet certain matching requirements. If the field to be processed is A, and a certain classification field A1 in the target field type contains A, then A can be used as the field to be processed. Convert to categorical fields of A1.
  • the matching process may also include other processing, such as word segmentation processing, semantic analysis, similarity calculation, and so on.
  • the classification results after the conversion operation can be displayed for users to view.
  • the classification result after the conversion operation on the second field may be the classification fields corresponding to each of the multiple files. Displaying the classification result after the second field conversion operation is to display the corresponding classification fields of the multiple files.
  • FIG. 3 is a schematic diagram of a result after indexing data provided by the present disclosure is converted into an option field.
  • FIG. 4 is a schematic diagram of a result of converting index data provided by the present disclosure into hierarchical fields.
  • the indexing data in the text field can be the descriptive language entered by the indexers to summarize and summarize. Different indexers may have different description information for the same category, such as "cleaning device", "Cleaning Device”.
  • the option field can include various categories such as A, B, C, D.
  • the grading field can include A, B, C, and D of the first-level classification, and can also include the second-level classification A1 under the A classification, the second-level classification B1 under the B-classification, and the third-level classification A11 under the second-level classification A1. etc., and so on.
  • the options category may also include options of sub-categories such as secondary or tertiary.
  • the specific option classification or hierarchical classification is set in advance, and during the conversion process, the patent can be re-indexed according to the pre-classified method (determine the target field of the patent), as shown in Figure 5.
  • FIG. 5 is a schematic diagram of a hierarchical relationship of hierarchical fields of a pre-classified design provided by the present disclosure.
  • the above embodiments provide a new classification processing solution for files. If the conversion operator converts the file, he can first determine the operation authority he has, and only those who have the conversion operation authority can review the classification field data in the file and convert the target field type, which not only greatly improves the different classification of the file.
  • the conversion processing efficiency of field types saves the classification processing time of files, and can also realize centralized and unified review of classification field data by managers, ensure the consistency and accuracy of classification field data review quality, and optimize the process of file classification management. .
  • FIG. 6 it may further include:
  • S602 In response to the modification operation of the second field, modify the second field in the file data set to obtain modified field data.
  • the file data set may include the modified field data of the second field, and during conversion, the modified field data is also converted.
  • Managers can review the indexing data made by each indexer in the patent management system. Generally, administrators need to obtain certain operation authority to facilitate the review and conversion of patent indexing data in a safe and centralized manner.
  • the administrator can modify it in the work space of the patent management system. For example, the text-type index data "cleaning device" is changed to "disinfecting device".
  • the patent management system can modify the index data to be adjusted in response to the modification operation of the index data (a kind of data information in the second field) to obtain the modified index data.
  • the reviewer can uniformly convert it through the patent management system to obtain the classification field of the target field type.
  • the type of the field to be converted may be index data of text type. Indexed data in a text type can be referred to as a text field.
  • the type of the field to be converted described in other implementations can also be a text field or an option field.
  • the option field obtained by the conversion processing of the description information of the patent document as before.
  • the target field type may represent the type into which the description information is to be converted, for example, a text field is converted into a hierarchical field.
  • the third account type may allocate, according to a preset matching rule, to the converted account in the second account type, a business field that allows the converted account to convert the field to be converted into.
  • Specific matching rules can be set in advance according to different business scenarios and requirements.
  • the matching rules may include:
  • the attribute information of the business field may generally include the professional field corresponding to the converted account, and the corresponding business field is assigned to the professional field according to the professional field.
  • the assigned business fields may include one or more business fields related to the mechanical field, such as lawn mowers, sweeping robots, and so on.
  • a low-level field type can be converted to a higher-level field type.
  • a low-level text field can be converted to a higher-level option field, and an option field can be converted to a higher-level hierarchical field.
  • it can be set that the level field cannot be converted to the option field.
  • field types of different levels can be converted to each other, which better meets the viewing and display requirements of different classifications and different classification results.
  • the method may further include:
  • the target field type is a hierarchical field and/or a text field; when the type of the field to be converted is a hierarchical field, the target field type is an option field and/or a text field; When the type of the field to be converted is a text field, the target field type is an option field and/or a hierarchical field.
  • the field to be converted can be converted into one target field type in one conversion process, or can be converted into multiple target field types.
  • index data can be converted into option fields and level fields at the same time.
  • This embodiment provides the target field types of the option field and the hierarchical field, and the description information can be displayed in a variety of ways to more clearly display the classification structure and hierarchical relationship, so as to achieve a more refined, diversified, and more accurate classification of patents.
  • the method may further include:
  • the conversion account in the preset second account type needs to convert the fields to be converted into business fields, it does not rule out that in some application scenarios, the converted classification results and assignments of some files may still appear.
  • different business fields For example, the set business field to be converted into an assigned account number is an electronic business field, but during the conversion process, the processor or personnel mark certain files as mechanical business fields, and convert these files into If the classification fields of the mechanical type are set, at this time, the converted classification fields that do not belong to the electronic type assigned to them are abnormal fields.
  • the embodiment of the present disclosure provides an error correction mechanism, and if the above situation occurs, a notification message can be sent to the assigned account of the third account type with higher authority. These, the assigned account number can verify whether these exception fields are really wrong.
  • the abnormal field If it is determined that the abnormal field is indeed abnormal, it can be reassigned, and the files corresponding to these abnormal fields are reassigned to the corresponding conversion account for processing. As in the above scenario, reassign the file of the abnormal field pair to the conversion account of the field conversion of a certain mechanical type.
  • the solution of this embodiment can greatly improve the accuracy of document classification.
  • whether the abnormal field is really an allocation error or has other abnormal conditions may be determined after subjective review based on the second account type or the third account type.
  • an implementation scheme that can automatically assist audit is provided, which can be automatically calculated and identified by a processor, and an output result of whether the converted classification field is an abnormal field is provided for the user to carry out Assisted viewing.
  • the abnormal field may be determined in the following manner:
  • the reference field is compared with the converted classification field, and if the difference between the reference field and the converted classification field is greater than a preset condition, it is determined that the classification field with the difference greater than the preset condition is an abnormal field.
  • the indexing data may be description information generated by indexing the patent to be converted online.
  • the online may include operations performed in a management platform such as a patent management system, such as description information generated by indexing the patent to be converted in a work space for converting patent text description information.
  • offline may refer to operations performed outside the patent management system, etc.
  • the description information of the patent may be the text information entered in excel in advance by indexing before uploading to the patent management system.
  • the method of the embodiments of the present disclosure not only supports the conversion of the description information entered by the online indexing, but also supports the conversion of the description information entered by the patent indexing offline. Indexed patent description information conversion requirements.
  • the second field may include:
  • the second field may further include:
  • the approval information included in the extracted file wherein the approval information includes at least one of the following:
  • the second field may be index data.
  • the indexing data may also include comments, remarks, memos and other approval information for the specific content of the patent in the patent to be converted. These approval information may be information added by one or more different users, as shown in the figure 9 shown.
  • the indexing data may also include reply information to these comments, comments, memos and other approval information, as shown in FIG. 10 , the reply content of User1 to User2's comment. This embodiment of the present disclosure may also include these reply information in the indexing data for patent classification.
  • the reply information may further include these reply information, which expands the data source describing the fields to be processed in the information, and can further include the reply information. It matches the target field of the description information more precisely, and realizes a more accurate classification of the patent in the target field type.
  • the hierarchical field is tree structure data with a classification field as a node
  • the displaying the classification result after the conversion operation of the second field includes: displaying the classification fields of the leaf nodes, intermediate nodes, and root nodes corresponding to the fields to be processed in the tree structure, and using preset symbols and/ Or the format shows the hierarchical relationship between the classification fields to which they belong.
  • the indexing data may correspond to multiple hierarchical classification fields in the hierarchical field (the depth of the tree is greater than or equal to 2).
  • the depth of the tree is greater than or equal to 2.
  • the index data is converted to the intermediate node A1 in the corresponding tree data structure in the hierarchical field (its leaf nodes are A11, 21), and A1 belongs to the root node A, then the corresponding hierarchical field position in Figure 4 is represented by
  • the inverted triangle symbol and the dislocation method show the classification fields A1 and A, as well as the hierarchical relationship between A1 and A.
  • the user can clearly see the hierarchical relationship between A1 and A with the conventional image recognition ability, which is convenient for users to check the converted target field. Relationship information in the Hierarchy field.
  • Fig. 11 is a flowchart of a method for classifying files according to another exemplary embodiment.
  • the method may further include:
  • S1104 Determine the classification field to be converted into based on the selection operation instruction of the classification field.
  • FIG. 12 is a schematic diagram of a scenario provided by the present disclosure for determining a target field for a user when there are repeated hierarchical nodes.
  • Figure 12 if it is found that there are two classification fields with the same (one of the matching cases) names in the hierarchical field after field matching according to the indexing data, the repeated classification fields can be displayed, and the user can choose the required classification fields. In which field the patent is divided. In this way, users can make personalized selections according to their own classification needs, and match the classification fields selected by the users as the target fields to be converted into.
  • the target field type may include a category field set by a user-defined category.
  • the classification field in the target field type can be set by the user, which greatly facilitates the user to define the type of the classification field to be converted.
  • the name of the root node, middle node, leaf node and other node classification fields in the hierarchical field can be customized by the user and set according to the classification requirements, and can not use or partially use the system's own classification rules, which is convenient for users to flexibly define classification types .
  • Fig. 13 is a schematic diagram of a scenario of a method for classifying and processing files according to an exemplary embodiment.
  • the method may further include:
  • Matching result information is displayed, where the matching result information includes the type of the field to be converted, the type of the target field, the number of files to be converted this time, and the number of successful and/or failed file conversions.
  • the matching result information of this conversion can also be displayed in the display interface.
  • the matching result information can usually include the type of fields to be processed, the type of target fields, the number of files to be converted this time, and the number of successful and/or failed file conversions, so that users can timely check whether all conversions are successful and the target of conversion. Information such as whether the field type is correct, as shown in Figure 13. Of course, in some embodiments, the numbers of successful conversions and failed conversions may also be displayed at the same time.
  • the conversion process is performed by creating an asynchronous task.
  • the conversion can be performed by creating an asynchronous task.
  • the processor of the patent management system can create corresponding asynchronous tasks for the description information of each patent, and each asynchronous task processes its own conversion task. No matter the successful processing and identification of multiple tasks, the processing of other tasks will not be affected. . In this way, the conversion processing speed can be further accelerated, the conversion response time of the target field type can be reduced, and the user experience of patent management can be improved.
  • the document classification processing method provided by the embodiment of the present disclosure provides a new document classification processing scheme, and the administrator with the authority can uniformly review the indexing data of the patent and other documents by the indexing personnel and convert the target field type. It not only greatly improves the conversion and processing efficiency of indexing data, but also saves the classification and processing time of documents. It can also realize centralized and unified auditing of indexing data by management personnel, ensure the consistency and accuracy of indexing data auditing quality, and optimize the quality of indexing data.
  • the process of document classification management The solution of the embodiment of the present disclosure can review the indexing results and data traceability of each indexer who specifically implements patent indexing.
  • the management personnel can review and process the indexing data, they can centrally and uniformly integrate and convert the fields to ensure that It improves the accuracy and efficiency of collaborative work and patent classification.
  • the converted classification fields can be displayed in the work space, so that users can more clearly and comprehensively view the classification fields to which the target patent text belongs or the hierarchical relationship of classification fields, etc., and improve the user experience of patent management services.
  • the present disclosure further provides a device for classifying and processing files.
  • the apparatuses may include systems (including distributed systems), software (applications), modules, components, servers, clients, etc., which use the methods described in the embodiments of this specification, in combination with necessary implementation hardware apparatuses.
  • the apparatuses in one or more embodiments provided by the embodiments of the present disclosure are described in the following embodiments. Since the implementation solution of the device to solve the problem is similar to the method, the implementation of the specific device in the embodiment of the present specification can refer to the implementation of the foregoing method, and repeated details will not be repeated.
  • unit or “module” may be a combination of software and/or hardware that implements a predetermined function.
  • the apparatus described in the following embodiments is preferably implemented in software, implementations in hardware, or a combination of software and hardware, are also possible and contemplated.
  • Fig. 14 is a schematic structural diagram of an apparatus for classifying and processing a file according to an exemplary embodiment.
  • the device may be the aforementioned patent management system 110, or may be an individual server or a server cluster or the like.
  • the apparatus 100 for classifying and processing files may include the following modules.
  • the authority identification module 1402 can be used to identify the operation authority of the operation account, and the operation authority includes pre-set business contents that can be handled by different account types;
  • the data acquisition module 1404 can be used to acquire a corresponding file data set containing the second field according to the business field allocated for the operating account when the operating account has the conversion operation authority, wherein the operating account contains the second field
  • the second field in the set of file data includes data information obtained by processing the file data by operating accounts that match different or the same operating authority;
  • the first conversion module 1406 can be configured to respond to the first conversion operation of the second field, and convert the second field in the file data set into a classification field of the target field type;
  • the display module 1408 may be configured to display the classification result after the conversion operation of the second field.
  • FIG. 15 is a schematic structural diagram of an apparatus for classifying and processing a file according to another exemplary embodiment.
  • the apparatus 100 for classifying and processing files may further include the following modules.
  • the modification module 1502 may be configured to modify the second field in the file data set in response to the modification operation of the second field to obtain modified field data.
  • the operation authority of the operation account includes the account and corresponding authority set in the following manner:
  • the first account type which has the permission to index files and has no permission to convert
  • the second account type which modifies the data information generated by the processing of the file by the indexing account in the first account type, and has the authority to convert the field to be converted into a business field of a specified type;
  • the third account type has conversion operation authority, and assigns the conversion account in the second account type according to a preset matching rule, a business field that allows the conversion account to convert the field to be converted into.
  • the matching rule includes:
  • FIG. 16 is a schematic structural diagram of an apparatus for classifying and processing a file according to an exemplary embodiment.
  • another embodiment of the apparatus further includes:
  • the abnormality notification module 1602 can be configured to notify the third account type to the third account type when the operation account is of the second account type and the classification result after the conversion of the field to be converted includes an abnormal field that does not belong to the business field allocated for the converted account. the assigned account to send a notification message;
  • the reassignment module 1604 may be configured to, in response to the reassignment operation of the assigned account, reassign the file corresponding to the exception field to the converted account in the second account type that matches the exception field.
  • the abnormal field is determined in the following manner:
  • the reference field is compared with the converted classification field, and if the difference between the reference field and the converted classification field is greater than a preset condition, it is determined that the classification field with the difference greater than the preset condition is an abnormal field.
  • the second field includes:
  • the second field further includes:
  • the approval information included in the extracted file wherein the approval information includes at least one of the following:
  • the documents include patent documents.
  • another embodiment of the apparatus further includes:
  • the second conversion module can be configured to respond to the second conversion operation of the field to be converted, and convert the field to be converted corresponding to the file into a classification field in the target field type, wherein,
  • the target field type is a hierarchical field and/or a text field
  • the target field type is an option field and/or a text field
  • the target field type is an option field and/or a hierarchical field.
  • the hierarchical field is tree-structured data with a classification field as a node
  • the displaying the classification result after the conversion operation of the second field includes: displaying the classification fields of the leaf nodes, intermediate nodes, and root nodes corresponding to the data to be processed in the tree structure, and displaying the classification fields of the leaf nodes, intermediate nodes, and root nodes in the tree structure with preset symbols and/ Or the format shows the hierarchical relationship between the classification fields to which they belong.
  • FIG. 17 is a schematic structural diagram of an apparatus for classifying and processing a file according to an exemplary embodiment. 17 , in another embodiment of the apparatus provided by the present disclosure, the apparatus may further include:
  • Repeated classification display module 1702 which can be configured to, when the target field type is a hierarchical field or an option field, if there are multiple classification fields matching the description information in the target field type, display the multiple matching fields The classification field of ;
  • the classification selection module 1704 can be configured to receive a selection operation instruction of the classification field, and determine the target field to be converted into.
  • the target field type includes a category field set by a user-defined category.
  • FIG. 18 is a schematic structural diagram of an apparatus for classifying and processing a file according to an exemplary embodiment. 18, the apparatus may further include:
  • the matching result display module 1802 can be used to display matching result information after conversion to the target field type, where the matching result information includes the type of the field to be converted, the target field type, the number of files to be converted this time, the file conversion success and/or number of failures.
  • the first conversion module 1406 or the second conversion module may convert the to-be-processed field by creating an asynchronous task.
  • a computer program product including a computer program, which, when executed by a processor, implements the method for classifying and processing files described in any one of this specification.
  • FIG. 18 is a schematic structural diagram of a file classification processing device S00 according to an exemplary embodiment.
  • the device S00 can be the patent management system described above, and specifically can be a server, a server cluster, a distributed processing server, a blockchain Servers, cloud computing platforms, etc., and combinations thereof.
  • device S00 may be a combination of one or more servers.
  • apparatus S00 includes a processing component S20, which further includes one or more processors, and a memory resource, represented by memory S22, for storing instructions, such as application programs, executable by the processing component S20.
  • the application program stored in the memory S22 may include one or more modules, each corresponding to a set of instructions.
  • the processing component S20 is configured to execute the instruction to execute the above-mentioned method that can be implemented on the side of the proxy server.
  • Device S00 may also include a power supply assembly S24 configured to perform power management of device S00, a wired or wireless network interface S26 configured to connect device S00 to a network, and an input output (I/O) interface S28.
  • Device S00 can operate based on an operating system stored in memory S22, such as Window12 12erver, Mac O12 X, Unix, Linux, FreeB12D or the like.
  • the above-mentioned device S00 may be an exemplary description of a data processing device, such as a patent management platform. In some data processing devices, it may not be necessary to include all the above components or all functional units under a certain component.
  • a computer-readable storage medium including instructions, such as a memory S04 including instructions, which are executable by the processor component S20 of the apparatus S00 to accomplish the above method.
  • the storage medium may be a computer-readable storage medium such as ROM, random access memory (RAM), CD-ROM, magnetic tape, floppy disk and optical data storage device, graphene, and the like.
  • the present disclosure further provides a patent management system, and the patent management system may include the apparatus described in any one of the embodiments of the present disclosure; or, the processor of the patent management system executes the When the executable instructions are stored in the memory, the method for classifying and processing files described in any one of the embodiments of the present disclosure is implemented; or, the above-mentioned computer program product of the patent management system.
  • the processing method for custom field indexing of files may also have corresponding processing devices, servers, equipment, storage media, computer program products, etc.
  • the module, the second module, the third module, the fourth module and so on implement the processing function corresponding to the method in the apparatus by analogous modules.
  • each module can be implemented in the same one or more software and/or hardware, and the modules that implement the same function can also be implemented by a combination of multiple sub-modules or sub-units, etc. .
  • the apparatus embodiments described above are only illustrative.
  • the division of modules or units is only a logical function division. In actual implementation, there may be other division methods. For example, multiple units or components may be combined or integrated. to another system, or some features can be ignored, or not implemented.
  • the coupling, communication connection, etc. between the shown or described devices or units can be realized in the form of direct and/or indirect coupling/connection, which can be achieved through some standard or self-defined interfaces, protocols, etc. Sexually, mechanically or otherwise.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Security & Cryptography (AREA)
  • Databases & Information Systems (AREA)
  • Computer Hardware Design (AREA)
  • Data Mining & Analysis (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Bioethics (AREA)
  • General Health & Medical Sciences (AREA)
  • Automation & Control Theory (AREA)
  • Library & Information Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

一种文件的分类处理方法、装置、服务器及系统,以及计算机程序产品。一个方法实施例中,本公开提供了新的文件的分类处理方案。若对文件进行转换,则可以先确定其具有的操作权限,之后才可以对文件中的分类字段数据统一进行审核和目标字段类型的转换,大大提高了文件不同分类字段类型的转换处理效率,节省文件的分类处理时间,实现由管理人员集中、统一对分类字段数据进行审核,保障分类字段数据审核质量的一致性和准确性,优化了文件分类管理的流程。利用实施例方案可以集中、统一进行字段的整合转换,保证了协同工作时和专利分类的准确性与高效性,提升用户对专利文件的管理服务的使用体验。

Description

文件的分类处理方法、装置、服务器、系统及计算机程序产品 技术领域
本公开涉及专利数据处理的技术领域,尤其涉及一种文件的分类处理方法、装置、服务器、系统及计算机程序产品。
发明背景
目前,随着科技的不断创新与进步,专利申请的数量也越来越多。而专利资料的维护对业内专利申请方向、专利发展趋势、专利布局等具有重要的参考价值。
专利代理公司或专利申请、管理、运营等专利关联方(用户)维护专利数据时,通常需要对专利进行分类。目前一些相关技术中,用户一侧可以设定一些专利分类规则,由用户内部不同的研发或专利管理人员基于专利内容的理解对专利进行标引,实现对专利进行类别划分。而不同的研发或专利管理人员等对专利进行标引的方式缺乏统一的审核节点,不同标引人员对专利标引的准确性难以保障,存在管理难度大、分类准确性较低的问题。同时,若将标引转换到分类规则所对应的分类字段时,目前也只能逐条进行转换处理,分类效率较低。
发明内容
本公开提供一种文件的分类处理方法、装置、服务器、系统及计算机程序产品,以至少解决相关技术中文件分类效率低的技术问题。本公开的技术方案如下:
一种文件的分类处理方法,包括:
识别操作账号的操作权限,所述操作权限包括预先设置的不同账号类型可处理的业务内容;
若所述操作账号具有转换操作权限,则根据为所述操作账号分配的业务字段获取对应的包含第二字段的文件数据集合,其中,所述的包含第二字段的文件数据集合中的第二字段,包括由匹配不同或相同操作权限的操作账号对文件数据进行处理获得的数据信息;
响应第二字段的第一转换操作,将所述文件数据集合中的第二字段转换为目标字段类型的分类字段;
展示所述第二字段转换操作后的分类结果。
所述方法的另一个实施例中,所述方法还包括:
响应第二字段的修改操作,对所述文件数据集合中的第二字段进行修改,得到修改后的字段数据。
所述操作账号的操作权限包括采用下述方式设置的账号以及对应的权限:
第一账号类型,具有对文件进行标引的权限、无转换操作的权限;
第二账号类型,对所述第一账号类型中的标引账号对文件进行处理产生的数据信息进行修改,以及具有将待转换字段转换为指定类型的业务字段的权限;
第三账号类型,具有转换操作权限,以及根据预设的匹配规则为所述第二账号类型中的转换账号分配允许所述转换账号将待转换字段转换为的业务字段。
所述方法的另一个实施例中,所述匹配规则包括:
基于所述转换账号的业务领域属性信息为其分配匹配将待转换字段转换为的业务字段。
所述方法的另一个实施例中,所述方法还包括:
若所述操作账号为第二账号类型,待转换字段转换后的分类结果中包括不属于为转换账号分配的业务字段的异常字段,则向所述第三账号类型的分配账号发送通知消息;
响应所述分配账号的重分配操作,将所述异常字段所对应的文件重新分配给的第二账号类型中与所述异常字段匹配的转换账号。
所述方法的另一个实施例中,采用下述方式确定异常字段:
根据所述文件的内容信息和/或标引数据计算所述文件的参考字段;
将所述参考字段与转换后的分类字段进行比较,若所述参考字段与所述转换后的分类字段的差异大于预设条件,则确定差异大于预设条件的分类字段为异常字段。
所述方法的另一个实施例中,所述第二字段包括:
线上对文件进行标引产生的描述信息;
和/或,接收的标引操作对象线下对所述文件进行标引并上传的描述信息。
所述方法的另一个实施例中,所述第二字段还包括:
提取的所述文件中所包括的批复信息,其中,所述批复信息包括下述中的至少一种:
文件中内容的注释信息;
文件中内容的批注信息;
文件中内容的备忘信息;
以及与所述注释信息、批注信息、备忘信息相对应的回复信息。
所述方法的另一个实施例中,所述文件包括专利文件。
所述方法的另一个实施例中,所述方法还包括:响应待转换字段的第二转换操作,将文件对应的待转换字段转换为目标字段类型中的分类字段,其中,
当待转换字段的类型为选项字段时,所述目标字段类型为层级字段和/或文本字段;
当待转换字段的类型为层级字段时,所述目标字段类型为选项字段和/或文本字段;
当待转换字段的类型为文本字段时,所述目标字段类型为选项字段和/或层级字段。
所述方法的另一个实施例中,所述层级字段为以分类字段为节点的树结构数据;
所述展示所述第二字段转换操作后的分类结果包括:展示待转换字段对应在所述树结构中的叶子节点、中间节点、根节点的分类字段,并以预设的符号和/或格式展现出所属分类字段之间的层级关系。
所述方法的另一个实施例中,当所述目标字段类型为层级字段或选项字段时,在转换的过程中,所述方法还包括:
若所述目标字段类型中存在多个与描述信息相匹配的分类字段,则展示所述多个相匹配的分类字段;
基于分类字段的选择操作指令确定所需转换为的分类字段。
所述方法的另一个实施例中,所述目标字段类型中的分类字段包括用户自定义分类设置的分类字段。
所述方法的另一个实施例中,所述方法还包括:
展示匹配结果信息,所述匹配结果信息包括待转换字段的类型、目标字段类型、本次转换的文件的数量、文件转换成功和/或失败的数量。
所述方法的另一个实施例中,采用创建异步任务的方式执行转换处理。
还提供一种文件的分类处理装置,包括:
权限识别模块,用于识别操作账号的操作权限,所述操作权限包括预先设置的不同账号类型可处理的业务内容;
数据获取模块,用于在所述操作账号具有转换操作权限时,根据为所述操作账号分配的业务字段获取对应的包含第二字段的文件数据集合,其中,所述的包含第二字段的文件数据集合中的第二字段,包括由匹配不同或相同操作权限的操作账号对文件数据进行处理获得的数据信息;
第一转换模块,用于响应第二字段的第一转换操作,将所述文件数据集合中的第二字段转换为目标字段类型的分类字段;
展示模块,用于展示所述第二字段转换操作后的分类结果。
所述装置的另一个实施例中,所述装置还包括:
修改模块,用于响应第二字段的修改操作,对所述文件数据集合中的第二字段进行修改,得到修改后的字段数据。
所述装置的另一个实施例中,所述操作账号的操作权限包括采用下述方式设置的账号以及对应的权限:
第一账号类型,具有对文件进行标引的权限、无转换操作的权限;
第二账号类型,对所述第一账号类型中的标引账号对文件进行处理产生的数据信息进行修改,以及具有将待转换字段转换为指定类型的业务字段的权限;
第三账号类型,具有转换操作权限,以及根据预设的匹配规则为所述第二账号类型中的转换账号分配允许所述转换账号将待转换字段转换为的业务字段。
所述装置的另一个实施例中,所述匹配规则包括:
基于所述转换账号的业务领域属性信息为其分配匹配将待转换字段转换为的业务字段。
所述装置的另一个实施例中,还包括:
异常通知模块,用于在所述操作账号为第二账号类型,待转换字段转换后的分类结果中包括不属于为转换账号分配的业务字段的异常字段时,向所述第三账号类型的分配账号发送通知消息;
重分配模块,用于响应所述分配账号的重分配操作,将所述异常字段所对应的文件重新分配给的第二账号类型中与所述异常字段匹配的转换账号。
所述装置的另一个实施例中,采用下述方式确定异常字段:
根据文件的内容信息和/或标引数据计算所述文件的参考字段;
将所述参考字段与转换后的分类字段进行比较,若所述参考字段与所述转换后的分类字段的差异大于预设条件,则确定差异大于预设条件的分类字段为异常字段。
所述装置的另一个实施例中,所述第二字段包括:
线上对文件进行标引产生的描述信息;
和/或,接收的标引操作对象线下对所述文件进行标引并上传的描述信息。
所述装置的另一个实施例中,所述第二字段还包括:
提取的所述文件中所包括的批复信息,其中,所述批复信息包括下述中的至少一种:
文件中内容的注释信息;
文件中内容的批注信息;
文件中内容的备忘信息;
以及与所述注释信息、批注信息、备忘信息相对应的回复信息。
所述装置的另一个实施例中,所述文件包括专利文件。
所述装置的另一个实施例中,还包括:
第二转换模块,用于响应待转换字段的第二转换操作,将文件对应的待转换字段转换为目标字段类型中的分类字段,其中,
当待转换字段的类型为选项字段时,所述目标字段类型为层级字段和/或文本字段;
当待转换字段的类型为层级字段时,所述目标字段类型为选项字段和/或文本字段;
当待转换字段的类型为文本字段时,所述目标字段类型为选项字段和/或层级字段。
所述装置的另一个实施例中,所述层级字段为以分类字段为节点的树结构数据;
所述展示所述第二字段转换操作后的分类结果包括:展示待转换字段对应在所述树结构中的叶子节点、中间节点、根节点的分类字段,并以预设的符号和/或格式展现出所属分类字段之间的层级关系。
所述装置的另一个实施例中,所述装置还包括:
重复分类展示模块,用于当所述目标字段类型为层级字段或选项字段时,若所述目标字段类型中存在多个与描述信息相匹配的分类字段,则展示所述多个相匹配的分类字段;
分类选择模块,用于接收分类字段的选择操作指令,确定所需转换为的目标字段。
所述装置的另一个实施例中,所述目标字段类型包括用户自定义分类设置的分类字段。
所述装置的另一个实施例中,所述装置还包括:
匹配结果展示模块,用于转换为目标字段类型之后展示匹配结果信息,所述匹配结果信息包括待转换字段的类型、目标字段类型、本次转换的文件的数量、文件转换成功和/或失败的数 量。
所述装置的另一个实施例中,所述第一转换模块或第二转换模块采用创建异步任务的方式对所述待处理字段进行转换。
本公开实施例的另一方面,还提供一种服务器,包括:
至少一个处理器;
用于存储所述处理器可执行指令的存储器;
其中,所述处理器被配置为执行所述指令,以实现本公开任一项实施例所述的方法。
本公开实施例的另一方面,还提供一种计算机可读存储介质,当所述计算机可读存储介质中的指令被服务器的处理器执行时,使得所述服务器能够执行本公开任一项所述的方法。
本公开实施例的另一方面,还提供一种计算机程序产品,包括计算机程序/指令,其中,所述计算机程序被处理器执行时实现本公开任一项实施例所述的方法。
本公开实施例的另一方面,还提供一种专利管理系统,包括本公开任意一个实施例所述的装置,或者,专利管理系统的处理器执行存储器存储的可执行指令时,实现本公开任意一个文件的分类处理方法,或者,所述专利管理系统包括所述的计算机程序产品。
根据本公开实施例提供的文件的分类处理方法,若作业人员对文件进行转换,则可以先确定操作账号具有的操作权限,具有转换操作权限的操作账号才可以对文件中的分类字段数据统一进行审核和目标字段类型的转换,不仅大大提高了文件不同分类字段类型的转换处理效率,节省文件的分类处理时间,还可以实现由管理人员集中、统一对分类字段数据进行审核,保障分类字段数据审核质量的一致性和准确性,优化了文件分类管理的流程。利用本公开提供的文件的分类处理方法,可以对每个具体实施专利标引的标引人员的标引结果进行审核和数据溯源,管理人员可以对标引数据审核处理之后,再集中、统一进行字段的整合转换,保证了协同工作时和专利分类的准确性与高效性。转换后的分类字段可以在作业空间进行展示,使得用户可以更加清晰、全面的查看专利文本所属的分类字段或分类字段的层级关系等情况,提升用户对专利文件的管理服务的使用体验。
应当理解的是,以上的一般描述和后文的细节描述仅是示例性和解释性的,并不能限制本公开。
附图简要说明
此处的附图被并入说明书中并构成本说明书的一部分,示出了符合本公开的实施例,并与说明书一起用于解释本公开的原理,并不构成对本公开的不当限定。
图1是根据一示例性实施例示出的一种文件的分类处理方法的应用场景示意图。
图2是根据一示例性实施例示出的一种文件的分类处理方法的流程图。
图3是本公开提供的一个标引数据转换为选项字段后的结果示意图。
图4是本公开提供的一个标引数据转换为层级字段后的结果示意图。
图5是本公开的提供的一个预先分类设计的层级字段的层级关系示意图。
图6是根据另一示例性实施例示出的一种文件的分类处理方法的流程图。
图7是根据另一示例性实施例示出的一种文件的分类处理方法的流程图。
图8是根据另一示例性实施例示出的一种文件的分类处理方法的流程图。
图9是本公开提供的一个实施例中包含批复信息的标引数据的场景示意图。
图10是本公开提供的一个实施例中包含回复信息的批复信息的场景示意图。
图11是根据另一示例性实施例示出的一种文件的分类处理方法的流程图。
图12是本公开提供的一个存在重复层级节点时提供给用户进行确定目标字段的场景示意图。
图13是根据一示例性实施例示出的一种文件的分类处理方法的场景示意图。
图14是根据一示例性实施例示出的一个文件的分类处理装置结构示意图。
图15是根据另一示例性实施例示出的一个文件的分类处理装置结构示意图。
图16是根据另一示例性实施例示出的一个文件的分类处理装置结构示意图。
图17是根据另一示例性实施例示出的一个文件的分类处理装置结构示意图。
图18是根据一示例性实施例示出的一个文件的分类处理设备S00的结构示意图。
实施本发明的方式
为了使本领域普通人员更好地理解本公开的技术方案,下面将结合附图,对本公开实施例中的技术方案进行清楚、完整地描述。
需要说明的是,本公开的说明书和权利要求书及上述附图中的术语“第一”、“第二”等是用于区别类似的对象,而不必用于描述特定的顺序或先后次序。术语的顺序在适当情况下可以互换,以便这里描述的本公开的实施例能够以除了在这里图示或描述的那些以外的顺序实施。以下示例性实施例中所描述的实施方式并不代表与本公开相一致的所有实施方式。相反,它们仅是与如所附权利要求书中所详述的、本公开的一些方面相一致的装置和方法的例子。术语“包括”、“包含”或者其任何其它变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、产品或者设备不仅包括那些要素,而且还包括没有明确列出的其它要素,或者是还包括为这种过程、方法、产品或者设备所固有的要素。在没有更多限制的情况下,并不排除在包括所述要素的过程、方法、产品或者设备中还存在另外的相同或等同要素。例如若使用到第一,第二等词表示名称,而并不表示任何特定的顺序。
本公开提供的一种文件的分类处理方法,可以应用于如图1所示的应用场景。例如提供给用户的专利管理系统110,专利管理系统110可以构建专利数据库,提供专利管理界面,实现对专利数据的分类、存储、查新、分析、更新。目前的一些专利管理系统中,用户通常只能一个一个对专利文件进行标引,如对某个专利添加文本类型描述信息,或者直接将专利标引为某个分类,如分类“A”。当需要大量专利进行分类时,如对专利管理系统搜索出来的5000条专利进行分类,则目前的处理方式和流程效率低下,无法满足客户需求。而本公开提供的文件的分类处理方法可以应用于所述专利管理系统110,可以提供包括但不限于对专利文件的分类处理,不仅在专利分类处理和管理流程上进行了优化,实现不同权限、不同级别的管理,还可以更加高效、准确的实现分类,提高专利管理作业的效率和分类结果的准确性。
本公开一些实施例中所述的待转换字段可以理解为待转换的文件的数据信息,如标引人员自己线上或线下对专利文本分类记录的标引数据,也可以包括本公开提供的选项字段或层级字段。所述的目标字段类型可以理解为需要转换为的分类字段所属的字段类型。本公开提供的字段类型可以包括但不限于文本字段、选项字段、层级字段。所述文本字段可以包括用户输入的对文件的内容、关键技术等进行描述的信息内容。用户通常可以在文本类型中自由输入分类的描述词汇或其他信息(限定非法字段的除外)。所述的选项字段通常包括用户自定义的分类字段,选项字段中的分类字段可以是并列的分类关系,也可以是层级关系。一个文件可以归属于多个选项字段中的分类字段,如某专利文件属于A、B、C、D四个选项字段中的A和B,也可以属于选项字段A和B1,其中A、B、C、D均为一级分类,B1为一级分类B的子分类(二级分类)。所述的层级字段中可以包括不同分类字段的从属或分级关系,例如某个专利文本的标引数据中包括“扫地机器人”、“清洁装置”等关键词,则根据标引数据可以将该专利匹配到层级字段中“扫地机器人”节点下的子节点“清洁装置”的分类字段,那么该专利在本次转换处理中可以被划分到属于“扫地机器人”节点下的“清洁装置”分类中。字段转换后可以更新专利管理系统的专利数据库。本公开实施例中所述的专利管理系统110可以包括但不限于本地或远程的服务器、服务器集群、分布式分系统、云处理平台、包含区块链节点的服务器以及其组合的设备,也可以包括各种个人计算机、笔记本电脑、智能手机、平板电脑、可穿戴设备、车载设备、医疗设备等。
下面以专利文件在专利管理系统的一个作业空间中将对专利文件的标引数据转换为目标字段类型的实施场景对本公开实施例方案进行说明。需要说明的,本公开实施例方案并不限于对专利文件的分类处理,基于本公开的创新思想,本公开的实施例方案还可以用于其他文件类型 的分类处理,如对论文、报刊、图书文件资料等。本公开的一些实施例中,所述文件可以是专利文件。相应的,根据专利文件的实施例描述,下述实施例中所描述的专利词语也可以适应性的调整,如论文管理系统等。图2是根据一示例性实施例示出的一种文件的分类处理方法的流程图,如图2所示,所述方法可以用于前述专利管理系统110中,可以包括以下步骤。
S200:识别操作账号的操作权限,所述操作权限包括预先设置的不同账号类型可处理的业务内容。
本公开实施例中可以预先设置不同账号类型,不同账号类型在专利管理系统或作业空间中有不同的操作权限。例如第一类账号只能对文件进行标引而不能将标引产生的标引数据转换为选项字段或层级字段。第二类账号具有转换操作权限,可以将标引数据转换为目标字段类型,也可以对第三类账号的标引数据进行审核,如进行修改。第三类账号可以具有最高的权限,如可以具有在作业空间或专利管理系统中的所有权限,可以具有按照一定的规则为不同的第二类账号分配一些操作权限的权限,或者对第二类账号的转换结果进行审核、修改等,当然,第三类账号也可以具有转换操作权限及对文件进行标引的权限等待。
具体的可以根据对文件分类管理的需求设置相应的账号类型。本公开还提供一种设置不同权限的账号的实施方法。具体的,所述方法的另一个实施例中,所述操作账号的操作权限包括采用下述方式设置的账号以及对应的权限:
第一账号类型,具有对文件进行标引的权限、无转换操作的权限;
第二账号类型,对所述第一账号类型中的标引账号对文件进行处理产生的数据信息进行修改,以及具有将待转换字段转换为指定类型的业务字段的权限;
第三账号类型,具有转换操作权限,以及根据预设的匹配规则为所述第二账号类型中的转换账号分配允许所述转换账号将待转换字段转换为的业务字段。
所述的第一账号类型可以如同前述的第一类账号,可以提供给具体的对文件进行初始分类处理的人员,如对文件进行标引的作业人员。标引人员使用的账号可以称为标引账号,属于第一账号类型。所述的第二账号类型中的账号(可以称为转换账号),参考前述第二类账号,可以对所述第一账号类型中的标引账号对文件进行处理产生的数据信息进行修改,以及具有将待转换字段转换为指定类型的业务字段的权限。第三账号类型参考前述第三类账号,可以为不同的转换账号设置其可以将待转换字段转换为的业务字段。例如属于第三类账号类型的某个分配账号可以设置或分配某个转换账号user33处理电子类的专利文件分类,即转换账号user33将标引数据转换后的分类应该是电子类的分类字段。
当然,本公开不限于还有其他的设置不同账号的不同权限的实施方案。
S202:若所述操作账号具有转换操作权限,则根据为所述操作账号分配的业务字段获取对应的包含第二字段的文件数据集合,其中,所述的包含第二字段的文件数据集合中的第二字段,包括由匹配不同或相同操作权限的操作账号对文件数据进行处理获得的数据信息。
所述的业务字段通常可以指根据操作账号所属的业务领域或者所处理的专业范围,为操作账号分配的字段。该业务字段通常是操作账号需要将待处理字段转为的目标字段(目标字段类型中的分类字段)。例如,某个操作账号的业务领域是机械,则为该操作账号分配的业务字段可以是属于机械领域的目标字段。
本公开实施例在专利文件分类的应用场景中,所述文件可以包括预先选择或筛选出的一条或多个专利。如可以根据查询条件、过滤条件、空间配置、排序规则等,调用solr(一种搜索应用服务器)查找出此次需要转换的一批专利,可以以专利公开(公告)号列表的方式展示在作业空间界面中。
具有转换操作权限的操作账号,可以是如前述所述的第二账号类型中的转换账号。转换账号在经过权限确定后,可以确定其对应的业务字段,并可以获取分配给转换账号进行字段转换的文件数据集合。其中,文件数据集合中可以包含第二字段。所述的第二字段可以包括多种数据信息,一种在专利文件分类的应用场景中,所述的第二字段可以为标引数据。当然,本公开实施例并不限定第二字段一定是标引数据,其中主要是指第一账号类型中的操作账号对文件进行处理得到数据信息。因此,所述的包含第二字段的文件数据集合中的第二字段,可以包括由 匹配不同或相同操作权限的操作账号对文件数据进行处理获得的数据信息。如由多个对专利文件进行标引的作业人员对专利文件进行标引得到的数据信息。
标引的人员通常为一个或多个研发或专利分类处理的人员,也可以包括对专利管理的审核、管理人员等。因此,可以是由匹配不同或相同操作权限的操作账号对文件数据进行处理获得的数据信息。
所述文件数据集合中可以包括一个或多个文件的数据信息,通常包括对文件数据进行处理获得的数据信息,如标引数据。所述的标引数据可以包括标引人员对专利整体的分类描述,可以包括研发或专利管理人员输入的对专利分类的文本语言描述。如“该专利是应用在扫地机器人上的清洁装置,可以自动识别障碍物并自动充电”。一些应用场景中,标引数据可以作为使用专利管理系统对专利初次分类的待处理字段。另一些实施场景中,待处理字段也可以为选项字段和/或层级字段。为便于描述,本公开的一些实施例中,可以使用专利公开(公告)号来唯一标识专利,描述信息、文本字段、选项字段、层级字段中的全部或部分使用字母表示,如文本字段可以为A、B、C等。
在一实施例中,第二字段可以包括对一个或多个文件进行标引得到的数据信息。文件数据集合可以包括一个或多个第二字段。
S204:响应第二字段的第一转换操作,将所述文件数据集合中的第二字段转换为目标字段类型的分类字段。
操作账号具有转换操作权限,可以执行不同字段类型之间的字段分类转换。前述所述的第二字段可以是待转换字段的一种实施场景,如第二字段可以是标引数据。待转换字段要转换成的字段类型可以称为目标字段类型,如选项字段或层级字段。各个字段类型下通常由用户预先自定义设置了文件最终要确定下来的多个分类字段。操作账号执行第二字段的转换操作(这里可以成为第一转换操作)时,将所述文件数据集合中的第二字段转换为目标字段类型的分类字段。
确定待处理字段和需要转换成的目标字段类型后,可以采用预设的算法进行转换处理。本实施中,所述的选项字段或层级字段可以是预先设置的分类字段,然后可以将第二字段与目标字段类型中分类字段进行匹配。当第二字段的内容与所述分类字段相匹配时,将所述相匹配的分类字段作为与所述第二字段转换的分类字段。一般的,在转换处理中通常需要待处理字段的内容与分类字段达到一定的匹配要求,如待处理字段为A,目标字段类型中的某个分类字段A1包A,则A可以作为待处理字段转换为A1的分类字段。当然,匹配的过程中还可以包括其他处理,如分词处理、语义分析、相似度计算等。
S206:展示所述第二字段转换操作后的分类结果。
不同类型的字段转换后,可以展示转换操作后的分类结果,以供用户查看。
在一实施例中,对所述第二字段转换操作后的分类结果,可以是多个文件各自对应的分类字段。展示所述第二字段转换操作后的分类结果,即为展示多个文件各自对应的分类字段。
将本次转换的第二字段转换为相应的目标字段类型的分类字段后,可以在专利文本信息转换的作业空间界面进行展示,当然也可以在专门的展示界面进行展示,以便用户查看转换后的结果。图3是本公开提供的一个标引数据转换为选项字段后的结果示意图。图4是本公开提供的一个标引数据转换为层级字段后的结果示意图。在图3、图4中,文本字段中的标引数据可以为标引人员输入的自己归纳总结的描述语言,不同标引人员对同一个分类可能有不同的描述信息,如“清洁装置”、“清扫装置”。选项字段可以包括多种分类,如A、B、C、D。分级字段可以包括一级分类的A、B、C、D,也可以包括属于A分类下的二级分类A1、属于B分类下的二级分类B1,属于二级分类A1下的三级分类A11等,以此类推。当然,选项分类中也可以包括二级或三级等子级分类的选项。具体的选项分类或层级分类预先分类设置好,在转换过程中可以按照预先分类好的方式对专利重新标引(确定专利的目标字段),如图5所示。图5是本公开的提供的一个预先分类设计的层级字段的层级关系示意图。
上述实施例提供了新的文件的分类处理方案。若转换操作人员对文件进行转换,则可以先确定其具有的操作权限,具有转换操作权限的才可以对文件中的分类字段数据统一进行审核和 目标字段类型的转换,不仅大大提高了文件不同分类字段类型的转换处理效率,节省文件的分类处理时间,还可以实现由管理人员集中、统一对分类字段数据进行审核,保障分类字段数据审核质量的一致性和准确性,优化了文件分类管理的流程。同时,还需要为不同的具有转换操作权限的账号分配其可以转换为的业务字段,细分了转换操作人员各自文件分类的字段,大大提高了转换操作人员各自分类的专业性,提高了分类效率和准确性。
进一步的,本公开所述方法的另一个实施例中,如图6所示,还可以包括:
S602:响应第二字段的修改操作,对所述文件数据集合中的第二字段进行修改,得到修改后的字段数据。相应的,若对第二字段进行了修改,则所述文件数据集合中可以包括所述第二字段修改后的字段数据,那么在转换时,转换的也是修改后的字段数据。
管理人员可以在专利管理系统中对各个标引人员做出的标引数据进行审核。一般的,管理人员需要获取一定的操作权限,以便于安全、集中的对专利标引数据的审核和转换处理。本实施例应用场景中,若发现需要对标引数据进行修改,则管理人员可以在专利管理系统的作业空间中进行修改。如将文本类型的标引数据“清洁装置”修改为“消毒装置”。专利管理系统可以响应标引数据(第二字段的一种数据信息)的修改操作,对需要调整的标引数据进行修改,得到修改后标引数据。
审核人员对标引数据审核或校准之后,可以通过专利管理系统统一进行转换,得到目标字段类型的分类字段。当然,也可以直接进行转换。本实施例中,所述的待转换字段的类型可以是文本类型的标引数据。文本类型中的标引数据可以称为文本字段。其他的实施中所述的待转换字段的类型也可以是文本字段或选项字段。如之前对专利文件的描述信息的转换处理得到的选项字段。所述的目标字段类型可以表示描述信息所要转换成的类型,如文本字段转换成层级字段。
前述中,第三账号类型可以根据预设的匹配规则为所述第二账号类型中的转换账号分配允许所述转换账号将待转换字段转换为的业务字段。具体的匹配规则可以设置预先根据不同的业务场景和需求进行设置。本公开提供的一种实施例中,所述匹配规则可以包括:
基于所述转换账号的业务领域属性信息为其分配匹配将待转换字段转换为的业务字段。
业务领域属性信息通常可以包括转换账号对应的专业领域,根据其专业领域为其分配对口的业务字段。如机械领域的作业人员,为其分配的业务字段可以包括一个或多个机械领域相关的业务字段,如割草机、扫地机器人等。
一般的,低级别的字段类型可以向更高级的字段类型转换,如级别低的文本字段可以向高一级的选项字段转换,选项字段可以向更高级别的层级字段转换。一些实施例中可以设置层级字段无法向选项字段转换。本公开的所述方法的另一个实施例中,不同级别的字段类型可以相互转换,更加满足对不同分类以及不同分类结果的查看、展示需求。具体的,本公开的另一些实施例中,如图7所述,所述方法还可以包括:
S702:响应待转换字段的第二转换操作,将文件对应的待转换字段转换为目标字段类型中的分类字段,其中,
当待转换字段的类型为选项字段时,所述目标字段类型为层级字段和/或文本字段;当待转换字段的类型为层级字段时,所述目标字段类型为选项字段和/或文本字段;当待转换字段的类型为文本字段时,所述目标字段类型为选项字段和/或层级字段。
待转换字段可以在一次转换处理中转换成一个目标字段类型,也可以转换成多个目标字段类型。如标引数据可以同时转换成选项字段和层级字段。本实施例提供了选项字段和层级字段的目标字段类型,可以将描述信息以多种方式更加清晰的展示其分类架构和层级关系,实现专利更加精细化、多元化、更准确的分类。
另一些实施例中,如图8所示,所述方法还可以包括:
S802:若所述操作账号为第二账号类型,待转换字段转换后的分类结果中包括不属于为转换账号分配的业务字段的异常字段,则向所述第三账号类型的分配账号发送通知消息;
S804:响应所述分配账号的重分配操作,将所述异常字段所对应的文件重新分配给的第二账号类型中与所述异常字段匹配的转换账号。
本实施例中,虽然已经预先设置的第二账号类型中的转换账号需要将待转换字段转换为的业务字段,但不排除在一些应用场景中仍然会出现一些文件的转换后的分类结果与分配的业务字段不同的情况。如设置的某个分配账号要转换为的业务字段为电子类型的业务字段,但其在转换过程中由处理器或人员标记出某些文件应属于机械类型的业务字段,并将这些文件转换成了机械类型的分类字段,此时,这些不属于为其分配的电子类型的分转换后的分类字段为异常字段。本公开实施例提供了纠错机制,若出现上述情况,则可以向更高权限的第三账号类型的分配账号发送通知消息。这些,所述分配账号可以核实这些异常字段是否真的出现错误。若确定异常字段确实是出现异常,则可以进行重新分配,将这些异常字段对应的文件重新分配给相应的转换账号进行处理。如上述场景中,将异常字段对的文件重新分配给某个机械类型的字段转换的转换账号。本实施例的方案,可以大大提高文件分类的准确性。
本公开的另一些实施例中,异常字段是否真的是分配错误或存在其他异常情况,可以基于第二账号类型或第三账号类型的主观审核之后确定。本公开提供的另一些实施例中,提供了一种可以自动辅助审核的实施方案,可以由处理器自动计算和识别,给出转换后的分类字段是否是异常字段的输出结果,以供用户进行辅助查看。具体的,所述方法的另一种实施例中,可以采用下述方式确定异常字段:
根据文件的内容信息和/或标引数据计算所述文件的参考字段;
将所述参考字段与转换后的分类字段进行比较,若所述参考字段与所述转换后的分类字段的差异大于预设条件,则确定差异大于预设条件的分类字段为异常字段。
本实施例应用场景中,所述标引数据可以为线上对所述待转换专利进行标引产生的描述信息。所述的线上可以包括在专利管理系统等管理平台中进行的操作,如专利文本描述信息转换的作业空间中对所述待转换专利进行标引产生的描述信息。相对应的,线下可以指在所述专利管理系统等外部进行的操作,如专利的描述信息可以为标引在上传专利管理系统之前提前在excel中录入的文本信息。本公开的实施例方法,不仅支持线上标引录入的描述信息进行转换,也可以支持线下对专利标引录入的描述信息进行转换,适应多种不同的实施场景,满足部分用户线下已经标引好的专利描述信息转换需求。具体的,本公开的提供的另一个实施例中,所述第二字段可以包括:
线上对文件进行标引产生的描述信息;
和/或,接收的标引操作对象线下对所述文件进行标引并上传的描述信息。
本公开提供的另一个实施例中,所述第二字段还可以包括:
提取的所述文件中所包括的批复信息,其中,所述批复信息包括下述中的至少一种:
文件中内容的注释信息;
文件中内容的批注信息;
文件中内容的备忘信息;
以及与所述注释信息、批注信息、备忘信息相对应的回复信息。
在本实施例中应用场景中,第二字段可以是标引数据,当然,其他的实施方式中也可以是其他的数据信息或者包括其他的数据信息。具体的,所述的标引数据还可以包括待转换专利中针对专利具体内容所做的注释、批注、备忘等批复信息,这些批复信息可以为一个或多个不同用户添加的信息,如图9所示。不仅如此,标引数据还可以包括对这些注释、批注、备忘等批复信息所做的答复信息,如图10所示中User1对User2批注的回复内容。本公开实施例在对专利分类的标引数据中还可以包括这些批复信息,如有批复信息对应的回复信息还可以进一步包括这些回复信息,扩充了描述信息待处理字段的数据源,可以进一步的更加精确匹配到描述信息的目标字段,实现对专利在目标字段类型中更加准确的分类。
本公开提供的所述方法的另一个实施例中,所述层级字段为以分类字段为节点的树结构数据;
所述展示所述第二字段转换操作后的分类结果包括:展示所述待处理字段对应在所述树结构中的叶子节点、中间节点、根节点的分类字段,并以预设的符号和/或格式展现出所属分类字段之间的层级关系。
如图4所示,当目标字段类型为层级字段时,标引数据可以对应层级字段中的多个层级分类字段(树的深度≥2)。这样,在层级字段的目标字段类型展示中不仅可以展示描述信息对应的一个或多个目标字段,还可以采用预设的一些展示方式展现出所属层级的层级关系。如图4中,标引数据转换到层级字段中对应树形数据结构中的中间节点A1(其叶子节点为A11、21),A1属于根节点A,则在图4中相应的层级字段位置处以倒三角符号和错位的方式展现出分类字段A1和A,以及A1和A的层级关系,用户以常规的识图能力可以清晰的看出A1与A的层级关系,便于用户查阅转换后的目标字段在层级字段中的关系信息。
图11是根据另一示例性实施例示出的一种文件的分类处理方法的流程图。如图11所示,本公开提供的所述方法的另一个实施例中,所当所述目标字段类型为层级字段或选项字段时,在转换的过程中,所述方法还可以包括:
S1102:若所述目标字段类型中存在多个与描述信息相匹配的分类字段,则展示所述多个相匹配的分类字段;
S1104:基于分类字段的选择操作指令确定所需转换为的分类字段。
图12是本公开提供的一个存在重复层级节点时提供给用户进行确定目标字段的场景示意图。在图12中,若根据标引数据进行字段匹配后发现层级字段中有两个相同(相匹配的其中一种情况)名称的分类字段,则可以展示重复的分类字段,由用户自行选择需要将专利划分为哪个字段中。这样,用户可以根据自己的分类需要进行个性化的选择,匹配用户选择的分类字段作为要转换为的目标字段。
另一些实施例中,所述目标字段类型可以包括用户自定义分类设置的分类字段。本公开的另一个创新之处在于,目标字段类型中的分类字段可以是由用户自定义设置的,极大的方便了用户自定义所需转换的分类字段的类型。如,层级字段中根节点、中节点、叶子节点等各个节点分类字段的名称可以由用户自定义根据分类需求进行设置,可以不使用或者部分使用系统自带的分类规则,便于用户灵活的定义分类类型。
图13是根据一示例性实施例示出的一种文件的分类处理方法的场景示意图。如图13所示另一些实施例中,将待转换字段转换为目标字段类型的分类字段之后,所述方法还可以包括:
展示匹配结果信息,所述匹配结果信息包括待转换字段的类型、目标字段类型、本次转换的文件的数量、文件转换成功和/或失败的数量。
将对标引数据或其他类型待处理字段进行转换之后,还可以在展示界面中展示本次转换的匹配结果信息。匹配结果信息中通常可以包括待处理字段类型、目标字段类型、待本次转换的文件的数量,还可以文件转换成功和/或失败的数量,以便用户及时查看到是否全部转换成功、转换的目标字段类型是否正确等信息,如图13所示。当然,一些实施例中,转换成功和转换失败的数量也可以同时进行展示。
本公开提供的所述方法的另一个实施例中,采用创建异步任务的方式执行转换处理。在对标引数据或其他类型的待转换字段进行转换处理时,可以采用创建异步任务的方式进行转换。如专利管理系统的处理器可以分别对每条专利的描述信息创建相应的异步任务,每个异步任务对自己的转换任务进行处理,无论多个任务处理成功和识别可以均不影响其他任务的处理。这样,可以进一步加快转换的处理速度,减少目标字段类型的转换响应时间,提高用户专利管理使用体验。
本公开实施例提供的文件的分类处理方法,提供了新的文件的分类处理方案,具有权限的管理人员可以对标引人员对专利等文件的标引数据统一进行审核和目标字段类型的转换,不仅大大提高了标引数据的转换处理效率,节省文件的分类处理时间,还可以实现由管理人员集中、统一对标引数据进行审核,保障标引数据审核质量的一致性和准确性,优化了文件分类管理的流程。本公开实施例方案可以对每个具体实施专利标引的标引人员的标引结果进行审核和数据溯源,管理人员可以对标引数据审核处理之后,再集中、统一进行字段的整合转换,保证了协同工作时和专利分类的准确性与高效性。转换后的分类字段可以在作业空间进行展示,使得用户可以更加清晰、全面的查看目标专利文本所属的分类字段或分类字段的层级关系等情况,提升用户对专利管理服务的使用体验。
可以理解的是,本说明书中上述方法的各个实施例均采用递进的方式描述,各个实施例之间相同/相似的部分互相参见即可,每个实施例重点说明的都是与其它实施例的不同之处。相关之处参见其他方法实施例的描述说明即可。
应该理解的是,虽然附图中涉及的流程图中的各个步骤按照箭头的指示依次显示,但是这些步骤并不是必然按照箭头指示的顺序依次执行。除非本文中有明确的说明,这些步骤的执行并没有严格的顺序限制,这些步骤可以以其它的顺序执行。而且,附图2中的至少一部分步骤可以包括多个步骤或者多个阶段,这些步骤或者阶段并不必然是在同一时刻执行完成,而是可以在不同的时刻执行,这些步骤或者阶段的执行顺序也不必然是依次进行,而是可以与其它步骤或者其它步骤的步骤或者阶段的至少一部分轮流或者交替地执行。
基于上述所述的文件的分类处理方法实施例的描述,本公开还提供一种文件的分类处理装置。所述装置可以包括使用了本说明书实施例所述方法的系统(包括分布式系统)、软件(应用)、模块、组件、服务器、客户端等并结合必要的实施硬件的装置。基于同一创新构思,本公开实施例提供的一个或多个实施例中的装置如下面的实施例所述。由于装置解决问题的实现方案与方法相似,因此本说明书实施例具体的装置的实施可以参见前述方法的实施,重复之处不再赘述。以下所使用的,术语“单元”或者“模块”可以实现预定功能的软件和/或硬件的组合。尽管以下实施例所描述的装置较佳地以软件来实现,但是硬件,或者软件和硬件的组合的实现也是可能并被构想的。
图14是根据一示例性实施例示出的一个文件的分类处理装置结构示意图。所述装置可以为前述所述专利管理系统110,也可以为单独的服务器或服务器集群等。具体的可以参照图14,文件的分类处理装置100可以包括如下模块。
权限识别模块1402,可以用于识别操作账号的操作权限,所述操作权限包括预先设置的不同账号类型可处理的业务内容;
数据获取模块1404,可以用于在所述操作账号具有转换操作权限时,根据为所述操作账号分配的业务字段获取对应的包含第二字段的文件数据集合,其中,所述的包含第二字段的文件数据集合中的第二字段,包括由匹配不同或相同操作权限的操作账号对文件数据进行处理获得的数据信息;
第一转换模块1406,可以用于响应第二字段的第一转换操作,将所述文件数据集合中的第二字段转换为目标字段类型的分类字段;
展示模块1408,可以用于展示所述第二字段转换操作后的分类结果。
一示例性实施例如图15所示,图15是根据另一示例性实施例示出的一个文件的分类处理装置结构示意图。参照图15,本公开提供的所述装置的另一个实施例中,文件的分类处理装置100还可以包括如下模块。
修改模块1502,可以用于响应第二字段的修改操作,对所述文件数据集合中的第二字段进行修改,得到修改后的字段数据。
参照前述方法实施例所述,所述装置的另一实施例中,所述操作账号的操作权限包括采用下述方式设置的账号以及对应的权限:
第一账号类型,具有对文件进行标引的权限、无转换操作的权限;
第二账号类型,对所述第一账号类型中的标引账号对文件进行处理产生的数据信息进行修改,以及具有将待转换字段转换为指定类型的业务字段的权限;
第三账号类型,具有转换操作权限,以及根据预设的匹配规则为所述第二账号类型中的转换账号分配允许所述转换账号将待转换字段转换为的业务字段。
参照前述方法实施例所述,所述装置的另一实施例中,所述匹配规则包括:
基于所述转换账号的业务领域属性信息为其分配匹配将待转换字段转换为的业务字段。
一示例性实施例如图16所示,图16是根据一示例性实施例示出的一个文件的分类处理装置结构示意图。参照前述方法实施例所述,所述装置的另一实施例中,还包括:
异常通知模块1602,可以用于在所述操作账号为第二账号类型,待转换字段转换后的分类结果中包括不属于为转换账号分配的业务字段的异常字段时,向所述第三账号类型的分配账号 发送通知消息;
重分配模块1604,可以用于响应所述分配账号的重分配操作,将所述异常字段所对应的文件重新分配给的第二账号类型中与所述异常字段匹配的转换账号。
参照前述方法实施例所述,所述装置的另一实施例中,采用下述方式确定异常字段:
根据所述文件的内容信息和/或标引数据计算所述文件的参考字段;
将所述参考字段与转换后的分类字段进行比较,若所述参考字段与所述转换后的分类字段的差异大于预设条件,则确定差异大于预设条件的分类字段为异常字段。
参照前述方法实施例所述,所述装置的另一实施例中,所述第二字段包括:
线上对文件进行标引产生的描述信息;
和/或,接收的标引操作对象线下对所述文件进行标引并上传的描述信息。
参照前述方法实施例所述,所述装置的另一实施例中,所述第二字段还包括:
提取的所述文件中所包括的批复信息,其中,所述批复信息包括下述中的至少一种:
文件中内容的注释信息;
文件中内容的批注信息;
文件中内容的备忘信息;
以及与所述注释信息、批注信息、备忘信息相对应的回复信息。
参照前述方法实施例所述,所述装置的另一实施例中,所述文件包括专利文件。
参照前述方法实施例所述,所述装置的另一实施例中,还包括:
第二转换模块,可以用于响应待转换字段的第二转换操作,将文件对应的待转换字段转换为目标字段类型中的分类字段,其中,
当待转换字段的类型为选项字段时,所述目标字段类型为层级字段和/或文本字段;
当待转换字段的类型为层级字段时,所述目标字段类型为选项字段和/或文本字段;
当待转换字段的类型为文本字段时,所述目标字段类型为选项字段和/或层级字段。
参照前述方法实施例所述,所述装置的另一实施例中,所述层级字段为以分类字段为节点的树结构数据;
所述展示所述第二字段转换操作后的分类结果包括:展示所述待处理数据对应在所述树结构中的叶子节点、中间节点、根节点的分类字段,并以预设的符号和/或格式展现出所属分类字段之间的层级关系。
一示例性实施例如图17所示,图17是根据一示例性实施例示出的一个文件的分类处理装置结构示意图。参照图17,本公开提供的所述装置的另一个实施例中,所述装置还可以包括:
重复分类展示模块1702,可以用于当所述目标字段类型为层级字段或选项字段时,若所述目标字段类型中存在多个与描述信息相匹配的分类字段,则展示所述多个相匹配的分类字段;
分类选择模块1704,可以用于接收分类字段的选择操作指令,确定所需转换为的目标字段。
本公开提供的所述装置的另一个实施例中,所述目标字段类型包括用户自定义分类设置的分类字段。
一示例性实施例如图18所示,图18是根据一示例性实施例示出的一个文件的分类处理装置结构示意图。参照图18,所述装置还可以包括:
匹配结果展示模块1802,可以用于转换为目标字段类型之后展示匹配结果信息,所述匹配结果信息包括待转换字段的类型、目标字段类型、本次转换的文件的数量、文件转换成功和/或失败的数量。
本公开提供的所述装置的另一个实施例中,所述第一转换模块1406或第二转换模块可以采用创建异步任务的方式对所述待处理字段进行转换。
关于上述实施例中的装置,其中各个模块执行操作的具体方式已经在有关该方法的实施例中进行了详细描述,此处将不做详细阐述说明。
在示例性实施例中,还提供一种计算机程序产品,包括计算机程序,所述计算机程序被处理器执行时实现本说明书中任一项所述的文件的分类处理方法。
图18是根据一示例性实施例示出的一个文件的分类处理设备S00的结构示意图,设备S00 可以如前述所述专利管理系统,具体的可以是服务器、服务器集群、分布式处理服务器、区块链服务器、云计算平台等以及其组合。例如,设备S00可以为一个或多个服务器的组合。参照图18,设备S00包括处理组件S20,其进一步包括一个或多个处理器,以及由存储器S22所代表的存储器资源,用于存储可由处理组件S20的执行的指令,例如应用程序。存储器S22中存储的应用程序可以包括一个或一个以上的每一个对应于一组指令的模块。此外,处理组件S20被配置为执行指令,以执行上述可以实施于代理服务端一侧的方法。
设备S00还可以包括一个电源组件S24被配置为执行设备S00的电源管理,一个有线或无线网络接口S26被配置为将设备S00连接到网络,和一个输入输出(I/O)接口S28。设备S00可以操作基于存储在存储器S22的操作系统,例如Window12 12erver,Mac O12 X,Unix,Linux,FreeB12D或类似。
需要说明的是,上述设备S00可以是数据处理设备的示例性描述,如专利管理平台。在一些数据处理设备中,可以不必包含上述全部组件或某个组件下的全部功能单元。
在示例性实施例中,还提供了一种包括指令的计算机可读存储介质,例如包括指令的存储器S04,上述指令可由设备S00的处理器组件S20执行以完成上述方法。存储介质可以是计算机可读存储介质,例如,所述计算机可读存储介质可以是ROM、随机存取存储器(RAM)、CD-ROM、磁带、软盘和光数据存储设备、石墨烯等。
基于前述方法、装置、计算机程序产品的实施例描述,本公开还一种专利管理系统,所述专利管理系统可以包括本公开任意一个实施例所述的装置;或者,专利管理系统的处理器执行存储器存储的可执行指令时,实现本公开任意一个实施例所述的文件的分类处理方法;或者,所述专利管理系统上述所述的计算机程序产品。
本说明书中的各个实施例均采用递进的方式描述,各个实施例之间相同相似的部分互相参见即可,每个实施例重点说明的都是与其它实施例的不同之处。尤其,对于硬件+程序类实施例而言,由于其基本相似于方法实施例,所以描述的比较简单,相关之处参见方法实施例的部分说明即可。
对应前述文本的分类处方法,本公开提供的对文件进行自定义字段标引的处理方法也可以有对应的处理装置、服务器、设备、存储介质、计算机程序产品等,如装置中可以使用第一模块、第二模块、第三模块、第四模块等以此类推的模块来实现装置中与方法对应的处理功能。具体可以参照与文件的分类处理方法的装置实施例描述,在此不做逐一赘述。
需要说明的,上述所述的装置、设备、服务器等根据方法实施例的描述还可以包括其它的实施方式,具体的实现方式可以参照相关方法实施例的描述。同时各个方法以及装置、设备、服务器实施例之间特征的相互组合组成的新的实施例仍然属于本公开所涵盖的实施范围之内,在此不作一一赘述。
为了描述的方便,描述以上装置时以功能分为各种模块分别描述。当然,在实施本说明书一个或多个时可以把各模块的功能在同一个或多个软件和/或硬件中实现,也可以将实现同一功能的模块由多个子模块或子单元的组合实现等。以上所描述的装置实施例仅仅是示意性的,例如,模块或单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或描述的装置或单元相互之间的耦合、通信连接等可以是直接和/或间接耦合/连接的方式实现,可以是通过一些标准或自定义的接口、协议等,是电性,机械或其它的形式实现。
本领域技术人员在考虑说明书及实践这里公开的发明后,将容易想到本公开的其它实施方案。本公开旨在涵盖本公开的任何变型、用途或者适应性变化,这些变型、用途或者适应性变化遵循本公开的一般性原理并包括本公开未公开的本技术领域中的公知常识或惯用技术手段。说明书和实施例仅被视为示例性的,本公开的真正范围和精神由下面的权利要求指出。
应当理解的是,本公开并不局限于上面已经描述并在附图中示出的精确结构,并且可以在不脱离其范围进行各种修改和改变。

Claims (34)

  1. 一种文件的分类处理方法,包括:
    识别操作账号的操作权限,所述操作权限包括预先设置的不同账号类型可处理的业务内容;
    若所述操作账号具有转换操作权限,则根据为所述操作账号分配的业务字段获取对应的包含第二字段的文件数据集合,其中,所述的包含第二字段的文件数据集合中的第二字段,包括由匹配不同或相同操作权限的操作账号对文件数据进行处理获得的数据信息;
    响应第二字段的第一转换操作,将所述文件数据集合中的第二字段转换为目标字段类型的分类字段;
    展示所述第二字段转换操作后的分类结果。
  2. 根据权利要求1所述的文件的分类处理方法,其中,所述方法还包括:
    响应第二字段的修改操作,对所述文件数据集合中的第二字段进行修改,得到修改后的字段数据。
  3. 根据权利要求1或2所述的文件的分类处理方法,其中,所述操作账号的操作权限包括采用下述方式设置的账号以及对应的权限:
    第一账号类型,具有对文件进行标引的权限、无转换操作的权限;
    第二账号类型,对所述第一账号类型中的标引账号对文件进行处理产生的数据信息进行修改,以及具有将待转换字段转换为指定类型的业务字段的权限;
    第三账号类型,具有转换操作权限,以及根据预设的匹配规则为所述第二账号类型中的转换账号分配允许所述转换账号将待转换字段转换为的业务字段。
  4. 根据权利要求3所述的文件的分类处理方法,其中,所述匹配规则包括:
    基于所述转换账号的业务领域属性信息为其分配匹配将待转换字段转换为的业务字段。
  5. 根据权利要求3所述的文件的分类处理方法,其中,所述方法还包括:
    若所述操作账号为第二账号类型,待转换字段转换后的分类结果中包括不属于为转换账号分配的业务字段的异常字段,则向所述第三账号类型的分配账号发送通知消息;
    响应所述分配账号的重分配操作,将所述异常字段所对应的文件重新分配给的第二账号类型中与所述异常字段匹配的转换账号。
  6. 根据权利要求5所述的文件的分类处理方法,其中,采用下述方式确定异常字段:
    根据所述文件的内容信息和/或标引数据计算所述文件的参考字段;
    将所述参考字段与转换后的分类字段进行比较,若所述参考字段与所述转换后的分类字段的差异大于预设条件,则确定差异大于预设条件的分类字段为异常字段。
  7. 根据权利要求1至6任一项所述的文件的分类处理方法,其中,所述第二字段包括:
    线上对文件进行标引产生的描述信息;
    和/或,接收的标引操作对象线下对所述文件进行标引并上传的描述信息。
  8. 根据权利要求7所述的文件的分类处理方法,其中,所述第二字段还包括:
    提取的所述文件中所包括的批复信息,其中,所述批复信息包括下述中的至少一种:
    文件中内容的注释信息;
    文件中内容的批注信息;
    文件中内容的备忘信息;
    以及与所述注释信息、批注信息、备忘信息相对应的回复信息。
  9. 根据权利要求1至8任一项所述的文件的分类处理方法,其中,所述文件包括专利文件。
  10. 根据权利要求1至9任一项所述的文件的分类处理方法,其中,所述方法还包括:响应待转换字段的第二转换操作,将文件对应的待转换字段转换为目标字段类型中的分类字段,其中,
    当待转换字段的类型为选项字段时,所述目标字段类型为层级字段和/或文本字段;
    当待转换字段的类型为层级字段时,所述目标字段类型为选项字段和/或文本字段;
    当待转换字段的类型为文本字段时,所述目标字段类型为选项字段和/或层级字段。
  11. 根据权利要求10所述的文件的分类处理方法,其中,所述层级字段为以分类字段为节点的树结构数据;
    所述展示所述第二字段转换操作后的分类结果包括:展示待转换字段对应在所述树结构中的叶子节点、中间节点、根节点的分类字段,并以预设的符号和/或格式展现出所属分类字段之间的层级关系。
  12. 根据权利要求10所述的文件的分类处理方法,其中,当所述目标字段类型为层级字段或选项字段时,在转换的过程中,所述方法还包括:
    若所述目标字段类型中存在多个与描述信息相匹配的分类字段,则展示多个相匹配的分类字段;
    基于分类字段的选择操作指令确定所需转换为的分类字段。
  13. 根据权利要求10所述的文件的分类处理方法,其中,所述目标字段类型中的分类字段包括用户自定义分类设置的分类字段。
  14. 根据权利要求1或10所述的文件的分类处理方法,其中,所述方法还包括:
    展示匹配结果信息,所述匹配结果信息包括待转换字段的类型、目标字段类型、本次转换的文件的数量、文件转换成功和/或失败的数量。
  15. 根据权利要求1或10所述的文件的分类处理方法,其中,采用创建异步任务的方式执行转换处理。
  16. 一种文件的分类处理装置,包括:
    权限识别模块,用于识别操作账号的操作权限,所述操作权限包括预先设置的不同账号类型可处理的业务内容;
    数据获取模块,用于在所述操作账号具有转换操作权限时,根据为所述操作账号分配的业务字段获取对应的包含第二字段的文件数据集合,其中,所述的包含第二字段的文件数据集合中的第二字段,包括由匹配不同或相同操作权限的操作账号对文件数据进行处理获得的数据信息;
    第一转换模块,用于响应第二字段的第一转换操作,将所述文件数据集合中的第二字段转换为目标字段类型的分类字段;
    展示模块,用于展示所述第二字段转换操作后的分类结果。
  17. 根据权利要求16所述的文件的分类处理装置,其中,所述装置还包括:
    修改模块,用于响应第二字段的修改操作,对所述文件数据集合中的第二字段进行修改,得到修改后的字段数据。
  18. 根据权利要求16所述的文件的分类处理装置,其中,所述操作账号的操作权限包括采用下述方式设置的账号以及对应的权限:
    第一账号类型,具有对文件进行标引的权限、无转换操作的权限;
    第二账号类型,对所述第一账号类型中的标引账号对文件进行处理产生的数据信息进行修改,以及具有将待转换字段转换为指定类型的业务字段的权限;
    第三账号类型,具有转换操作权限,以及根据预设的匹配规则为所述第二账号类型中的转换账号分配允许所述转换账号将待转换字段转换为的业务字段。
  19. 根据权利要求18所述的文件的分类处理装置,其中,所述匹配规则包括:
    基于所述转换账号的业务领域属性信息为其分配匹配将待转换字段转换为的业务字段。
  20. 根据权利要求18所述的文件的分类处理装置,其中,还包括:
    异常通知模块,用于在所述操作账号为第二账号类型,待转换字段转换后的分类结果中包括不属于为转换账号分配的业务字段的异常字段时,向所述第三账号类型的分配账号发送通知消息;
    重分配模块,用于响应所述分配账号的重分配操作,将所述异常字段所对应的文件重新分配给的第二账号类型中与所述异常字段匹配的转换账号。
  21. 根据权利要求20所述的文件的分类处理装置,其中,采用下述方式确定异常字段:
    根据文件的内容信息和/或标引数据计算所述文件的参考字段;
    将所述参考字段与转换后的分类字段进行比较,若所述参考字段与所述转换后的分类字段的差异大于预设条件,则确定差异大于预设条件的分类字段为异常字段。
  22. 根据权利要求16至21任一项所述的文件的分类处理装置,其中,所述第二字段包括:
    线上对文件进行标引产生的描述信息;
    和/或,接收的标引操作对象线下对所述文件进行标引并上传的描述信息。
  23. 根据权利要求22所述的文件的分类处理装置,其中,所述第二字段还包括:
    提取的所述文件中所包括的批复信息,其中,所述批复信息包括下述中的至少一种:
    文件中内容的注释信息;
    文件中内容的批注信息;
    文件中内容的备忘信息;
    以及与所述注释信息、批注信息、备忘信息相对应的回复信息。
  24. 根据权利要求16至21任一项所述的文件的分类处理装置,其中,所述文件包括专利文件。
  25. 根据权利要求16至21任一项所述的文件的分类处理装置,其中,还包括:
    第二转换模块,用于响应待转换字段的第二转换操作,将文件对应的待转换字段转换为目标字段类型中的分类字段,其中,
    当待转换字段的类型为选项字段时,所述目标字段类型为层级字段和/或文本字段;
    当待转换字段的类型为层级字段时,所述目标字段类型为选项字段和/或文本字段;
    当待转换字段的类型为文本字段时,所述目标字段类型为选项字段和/或层级字段。
  26. 根据权利要求25所述的文件的分类处理装置,其中,所述层级字段为以分类字段为节点的树结构数据;
    所述展示所述第二字段转换操作后的分类结果包括:展示待转换字段对应在所述树结构中的叶子节点、中间节点、根节点的分类字段,并以预设的符号和/或格式展现出所属分类字段之间的层级关系。
  27. 根据权利要求25所述的文件的分类处理装置,其中,所述装置还包括:
    重复分类展示模块,用于当所述目标字段类型为层级字段或选项字段时,若所述目标字段类型中存在多个与描述信息相匹配的分类字段,则展示多个相匹配的分类字段;
    分类选择模块,用于接收分类字段的选择操作指令,确定所需转换为的目标字段。
  28. 根据权利要求25所述的文件的分类处理装置,其中,所述目标字段类型包括用户自定义分类设置的分类字段。
  29. 根据权利要求16或25所述的文件的分类处理装置,其中,所述装置还包括:
    匹配结果展示模块,用于转换为目标字段类型之后展示匹配结果信息,所述匹配结果信息包括待转换字段的类型、目标字段类型、本次转换的文件的数量、文件转换成功和/或失败的数量。
  30. 根据权利要求16或25所述的文件的分类处理装置,其中,所述第一转换模块或第二转换模块采用创建异步任务的方式执行转换处理。
  31. 一种服务器,包括:
    至少一个处理器;
    用于存储所述处理器可执行指令的存储器;
    其中,所述处理器被配置为执行所述指令,以实现如权利要求1至15中任一项所述的文件的分类处理方法。
  32. 一种计算机可读存储介质,当所述计算机可读存储介质中的指令被服务器的处理器执行时,使得所述服务器能够执行如权利要求1至15中任一项所述的文件的分类处理方法。
  33. 一种计算机程序产品,包括计算机程序,所述计算机程序被处理器执行时实现权利要求1至15任一项所述的文件的分类处理方法。
  34. 一种专利管理系统,包括权利要求16-30中任意一项所述的装置;
    或者,专利管理系统的处理器执行存储器存储的可执行指令时,实现如权利要求1至15中任一项所述的文件的分类处理方法;
    或者,所述专利管理系统包括权利要求33所述的计算机程序产品。
PCT/CN2022/080005 2021-03-09 2022-03-09 文件的分类处理方法、装置、服务器、系统及计算机程序产品 WO2022188820A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110254319.7A CN113111179A (zh) 2021-03-09 2021-03-09 文件的分类处理方法、装置、服务器及系统
CN202110254319.7 2021-03-09

Publications (1)

Publication Number Publication Date
WO2022188820A1 true WO2022188820A1 (zh) 2022-09-15

Family

ID=76710756

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/080005 WO2022188820A1 (zh) 2021-03-09 2022-03-09 文件的分类处理方法、装置、服务器、系统及计算机程序产品

Country Status (2)

Country Link
CN (1) CN113111179A (zh)
WO (1) WO2022188820A1 (zh)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113111179A (zh) * 2021-03-09 2021-07-13 智慧芽信息科技(苏州)有限公司 文件的分类处理方法、装置、服务器及系统
CN114048230B (zh) * 2021-11-29 2024-05-07 平安科技(深圳)有限公司 业务数据处理方法、装置、设备及存储介质
CN114219443A (zh) * 2021-12-16 2022-03-22 中国建设银行股份有限公司 单据数据处理方法、装置及设备

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070073689A1 (en) * 2005-09-29 2007-03-29 Arunesh Chandra Automated intelligent discovery engine for classifying computer data files
CN101763386A (zh) * 2008-12-25 2010-06-30 新奥特硅谷视频技术有限责任公司 一种基于检索范围权限的检索方法、装置和用户设备
CN101853358A (zh) * 2010-05-11 2010-10-06 南京赛孚科技有限公司 一种文件对象权限管理的实现方法
CN108549623A (zh) * 2018-04-12 2018-09-18 北京三快在线科技有限公司 协作文档编辑控制方法、装置、电子设备及存储介质
CN113111179A (zh) * 2021-03-09 2021-07-13 智慧芽信息科技(苏州)有限公司 文件的分类处理方法、装置、服务器及系统

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104301198A (zh) * 2013-07-17 2015-01-21 北大方正集团有限公司 基于即时通讯软件的远程信息传送方法及设备
CN111259627A (zh) * 2020-01-08 2020-06-09 深圳市采薇科技咨询有限公司 文档分析方法、装置、计算机存储介质及设备
CN111949845B (zh) * 2020-07-02 2024-04-12 广州仓实信息科技有限公司 处理测绘信息的方法、装置、计算机设备和存储介质

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070073689A1 (en) * 2005-09-29 2007-03-29 Arunesh Chandra Automated intelligent discovery engine for classifying computer data files
CN101763386A (zh) * 2008-12-25 2010-06-30 新奥特硅谷视频技术有限责任公司 一种基于检索范围权限的检索方法、装置和用户设备
CN101853358A (zh) * 2010-05-11 2010-10-06 南京赛孚科技有限公司 一种文件对象权限管理的实现方法
CN108549623A (zh) * 2018-04-12 2018-09-18 北京三快在线科技有限公司 协作文档编辑控制方法、装置、电子设备及存储介质
CN113111179A (zh) * 2021-03-09 2021-07-13 智慧芽信息科技(苏州)有限公司 文件的分类处理方法、装置、服务器及系统

Also Published As

Publication number Publication date
CN113111179A (zh) 2021-07-13

Similar Documents

Publication Publication Date Title
WO2022188820A1 (zh) 文件的分类处理方法、装置、服务器、系统及计算机程序产品
US11651032B2 (en) Determining semantic content of textual clusters
US11182707B2 (en) Method and system for providing a multi-dimensional human resource allocation adviser
US10073837B2 (en) Method and system for implementing alerts in semantic analysis technology
CN110088749B (zh) 自动本体生成的方法、系统和介质
US8266138B1 (en) On-demand database service system, method and computer program product for generating a custom report
US8209407B2 (en) System and method for web service discovery and access
DE112018005462T5 (de) Anomalie-erkennung unter verwendung von cognitive-computing
CN105989523B (zh) 用于分析的基于策略的数据收集处理及协商的方法与系统
US8141160B2 (en) Mitigating and managing privacy risks using planning
US11619761B2 (en) Dynamic representation of exploration and/or production entity relationships
US9461890B1 (en) Delegation of data management policy in an information management system
US20160217427A1 (en) Systems, methods, and devices for implementing a referral processing engine
CN112732466A (zh) 一种服务调用方法、装置和系统
EP1449123A2 (en) Method and system for identifying purchasing cost savings
Lin et al. Consumer-centric QoS-aware selection of web services
WO2022188821A1 (zh) 对文件进行自定义字段标引的处理方法、装置、服务器及系统
Di Martino et al. Towards AI-powered multiple cloud management
CN114510575A (zh) 关系发现和量化
CN108073617A (zh) 一种基于Solr的分布式检索方法
Kortum et al. Analyzing Smart Services from a (Data-) Ecosystem Perspective: Utilizing Network Theory for a graph-based Software Tool in the Domain Smart Living
Todoran et al. Quest for requirements: Scrutinizing advanced search queries for cloud services with fuzzy Galois lattices
US20030070141A1 (en) Data conversion system and method
Birzniece et al. Predictive modeling of hr dynamics using machine learning
US11711226B2 (en) Visualizing web conference participants in subgroups

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22766342

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 22766342

Country of ref document: EP

Kind code of ref document: A1