CN110083839B - Text importing method, device and equipment - Google Patents

Text importing method, device and equipment

Info

Publication number
CN110083839B
Authority
CN
China
Prior art keywords
text
information
interface
provider
identification
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910359179.2A
Other languages
Chinese (zh)
Other versions
CN110083839A (en
Inventor
胡建
周振华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhuhai Baohaowan Technology Co Ltd
Original Assignee
Zhuhai Baohaowan Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhuhai Baohaowan Technology Co Ltd filed Critical Zhuhai Baohaowan Technology Co Ltd
Priority to CN201910359179.2A priority Critical patent/CN110083839B/en
Publication of CN110083839A publication Critical patent/CN110083839A/en
Application granted granted Critical
Publication of CN110083839B publication Critical patent/CN110083839B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10 File systems; File servers
    • G06F16/16 File or folder operations, e.g. details of user interfaces specifically adapted to file systems
    • G06F16/168 Details of user interfaces specifically adapted to file systems, e.g. browsing and visualisation, 2d or 3d GUIs
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/279 Recognition of textual entities

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Human Computer Interaction (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Computer And Data Communications (AREA)
  • Machine Translation (AREA)

Abstract

The invention provides a text importing method, a text importing device and text importing equipment, wherein the method comprises the following steps: acquiring first interface information of a first text; identifying the first interface information through a target interface identification model to obtain text information of the first text, wherein the text information comprises a plurality of first field types and field information corresponding to each of the plurality of first field types, and the target interface identification model comprises a plurality of field types and identification parameters of each of the plurality of field types; and importing the text information of the first text into a text resource system, wherein the text resource system is used for displaying the text information of the first text to a user. The technical scheme can improve the efficiency of text importing into the text resource system.

Description

Text importing method, device and equipment
Technical Field
The present invention relates to the field of computer technologies, and in particular, to a text importing method, apparatus, and device.
Background
Currently, when a user views text information, the user views corresponding text information in a text resource system, and a text information developer needs to import text information of a plurality of text information providers into the text resource system in advance.
At present, when a text information developer imports the text information of a plurality of text information providers into a text resource system, the field types contained in each provider's interface, and the parameters of those field types, differ from provider to provider. The developer therefore has to adapt to the interface of each text information provider individually and develop corresponding adaptation code. Because there are many text information providers, and adaptation code has to be developed each time a provider's interface is to be identified, the operation is complex and the efficiency of importing text into the text resource system is reduced.
Disclosure of Invention
The embodiments of the invention provide a text importing method, device and equipment, which address the problems of low text recognition efficiency and low efficiency in importing text into a text resource system.
In a first aspect, a text import method is provided, including:
acquiring first interface information of a first text;
identifying the first interface information through a target interface identification model to obtain text information of the first text, wherein the text information comprises a plurality of first field types and field information corresponding to each of the plurality of first field types, and the target interface identification model comprises a plurality of field types and identification parameters of each of the plurality of field types;
and importing the text information of the first text into a text resource system, wherein the text resource system is used for displaying the text information of the first text to a user.
With reference to the first aspect, in one possible implementation manner, the target interface identification model is a basic interface identification model, and the basic interface identification model can be used for identifying interface information of texts of different providers; the identifying the first interface information through the target interface identification model to obtain text information of the first text includes: acquiring an identification of a first provider of the first text; if the identification of the first provider is not contained in the identified provider identification set, identifying the first interface information through a basic interface identification model to obtain text information of the first text; wherein the identity of any one provider in the set of identified provider identities is used to indicate that identification has been performed on interface information of text of the provider.
With reference to the first aspect, in a possible implementation manner, the method further includes: generating a first interface identification model according to the plurality of first field types and identification parameters of the plurality of first field types; and establishing a corresponding relation between the first provider and the first interface identification model.
With reference to the first aspect, in one possible implementation manner, before the identifying, by the target interface identifying model, the first interface information, obtaining text information of the first text, the method further includes: acquiring an identification of a first provider of the first text; if the identification of the first provider is contained in the identified provider identification set, acquiring a first interface identification model with a corresponding relation with the first provider; and determining the first interface identification model as the target interface identification model.
With reference to the first aspect, in a possible implementation manner, the plurality of field types in the target interface identification model include a plurality of necessary field types; the method further includes: outputting identification abnormal information in a case where field information corresponding to at least one necessary field type is not identified from the first interface information through the target interface identification model, wherein the identification abnormal information includes the at least one unidentified necessary field type.
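The necessary-field check described above can be sketched as follows. This is only an illustration under assumptions: the list of necessary field types, the function names, and the shape of the anomaly record are hypothetical and are not specified in the text.

```python
from typing import Dict, List

# Hypothetical set of necessary field types; the text does not enumerate them.
REQUIRED_FIELD_TYPES = ["text name", "text author", "text content"]


def find_missing_required(recognized: Dict[str, str],
                          required: List[str]) -> List[str]:
    """Return the necessary field types for which no field information was identified."""
    return [ft for ft in required if not recognized.get(ft)]


def build_anomaly_info(recognized: Dict[str, str]) -> Dict[str, List[str]]:
    """Build identification abnormal information; empty dict when nothing is missing."""
    missing = find_missing_required(recognized, REQUIRED_FIELD_TYPES)
    return {"unrecognized_required_field_types": missing} if missing else {}


# "text author" is present but empty and "text content" is absent, so both are reported.
recognized = {"text name": "edge city", "text author": ""}
anomaly = build_anomaly_info(recognized)
print(anomaly)
```

An empty result means every necessary field type was identified, so no abnormal information needs to be output.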
In a second aspect, there is provided a text importation apparatus comprising:
the interface information acquisition module is used for acquiring first interface information of the first text;
the interface information identification module is used for identifying the first interface information through a target interface identification model to obtain text information of the first text, wherein the text information comprises a plurality of first field types and field information corresponding to each of the plurality of first field types, and the target interface identification model comprises a plurality of field types and identification parameters of each of the plurality of field types;
the text information importing module is used for importing the text information of the first text into a text resource system, and the text resource system is used for displaying the text information of the first text to a user.
With reference to the second aspect, in one possible implementation manner, the target interface identification model is a basic interface identification model, and the basic interface identification model may be used to identify interface information of texts of different providers; the interface information identification module is specifically configured to obtain an identifier of a first provider of the first text; the interface information identification module is specifically configured to identify, if the identifier of the first provider is not included in the identified provider identifier set, the first interface information through a basic interface identification model, so as to obtain text information of the first text; wherein the identity of any one provider in the set of identified provider identities is used to indicate that identification has been performed on interface information of text of the provider.
With reference to the second aspect, in a possible implementation manner, the apparatus further includes: the first model generation module is used for generating a first interface identification model according to the plurality of first field types and the identification parameters of the plurality of first field types; the first model generating module is further configured to establish a correspondence between the first provider and the first interface identification model.
With reference to the second aspect, in a possible implementation manner, the apparatus further includes: the target model determining module is used for acquiring the identification of a first provider of the first text; the target model determining module is further configured to obtain a first interface identification model having a corresponding relationship with the first provider if the identifier of the first provider is included in the identified provider identifier set; the target model determining module is further configured to determine the first interface identification model as the target interface identification model.
With reference to the second aspect, in a possible implementation manner, the plurality of field types in the target interface identification model include a plurality of necessary field types; the apparatus further comprises: and the abnormal information output module is used for outputting identification abnormal information when the first interface information is not identified to the field information corresponding to the at least one necessary field type through the target interface identification model, wherein the identification abnormal information comprises the at least one unrecognized necessary field type.
In a third aspect, a text importation apparatus is provided, comprising a processor, a memory, and an input-output interface, the processor, memory, and input-output interface being interconnected, wherein the input-output interface is for inputting or outputting data, the memory is for storing application program code for the text importation apparatus to perform the above method, and the processor is configured for performing the above method of the first aspect.
In a fourth aspect, there is provided a computer storage medium storing a computer program comprising program instructions which, when executed by a processor, cause the processor to perform the method of the first aspect described above.
In the embodiment of the invention, the acquired first interface information of the first text is identified according to the target interface identification model to obtain the text information of the first text, and the text information of the first text is imported into a text resource system. Because the first interface information is identified automatically through the target interface identification model and the resulting text information is imported into the text resource system automatically, adaptation code no longer has to be developed manually each time first interface information of a first text is identified, which improves the efficiency of importing text into the text resource system and improves the user experience.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings that are needed in the embodiments will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 is a schematic flow chart of a text importing method according to an embodiment of the present invention;
FIG. 2 is a flow chart of another text import method according to an embodiment of the present invention;
FIG. 3 is a schematic diagram of an identification anomaly information display interface according to an embodiment of the present invention;
FIG. 4 is an exemplary diagram of a text import method according to an embodiment of the present invention;
FIG. 5 is an exemplary diagram of another text import method provided by an embodiment of the present invention;
FIG. 6 is a schematic structural diagram of a text importing apparatus according to an embodiment of the present invention;
FIG. 7 is a schematic structural diagram of a text importing apparatus according to an embodiment of the present invention.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
The scheme of the embodiment of the invention is suitable for scenarios in which interface information of a text provider is identified and the identified text information is imported into a text resource system. By automatically identifying the interface information of the text provider, the corresponding text information is obtained and imported into the text resource system, which can improve both the efficiency of text identification and the efficiency of importing text into the text resource system.
Referring to fig. 1, fig. 1 is a flow chart of a text importing method according to an embodiment of the present invention, as shown in the drawing, the method includes:
S101, acquiring first interface information of a first text.
In some possible scenarios, the first text may be an electronic document, such as an electronic novel, provided by a first provider. The same provider may provide different first texts, and the interface information of each of these first texts shares common features. For example, among the N first texts provided by a first provider A, the field name for the text name often contains the characters bookname, the field name for the text classification often contains the characters category, and the field name for the text word count often contains the characters words; among the M first texts provided by a first provider B, the field name for the text name often contains the characters book_name, the field name for the text classification often contains the characters cat, the field name for the text word count often contains the characters count, and so on.
In a specific implementation, the interface information of the texts provided by the providers may be added to the interface to-be-identified list, so that the interface to-be-identified list may include the interface information of the texts provided by the multiple providers, so that the text importing device obtains the first interface information of the first text from the to-be-identified list.
S102, identifying first interface information of a first text through a target interface identification model to obtain text information of the first text, wherein the text information of the first text comprises a plurality of first field types and field information corresponding to each of the plurality of first field types, and the target interface identification model comprises a plurality of field types and identification parameters of each of the plurality of field types.
Here, a field type in the target interface identification model may be a feature of the text to be identified; for example, the field type may be a text ID, a text name, a text author, a text classification, a text chapter name, a text word count, text content, or the like. The identification parameters of a field type may include the data type of the field information corresponding to the field type, the length of that field information, the data range of that field information, the characters commonly contained in the field name of the field type, and the like.
The data type of the field information corresponding to a field type is the specific type of the data of that field information. For example, the field information corresponding to the text name (for example, the text name is "edge city") is of character type, and the field information corresponding to the text word count (for example, the text word count is 2000000) is of numerical type.
The length of the field information corresponding to a field type is the information length of that field information. For example, if the text name is "edge city", the field information corresponding to the field type is "edge city", whose field length is 4 (counted in characters of the original-language title); if the text word count is 2000000, the field information corresponding to the field type is "2000000", whose field length is 7.
The data range of the field information corresponding to a field type is the range within which that field information must fall. For example, the data range of a text name (such as "edge city") may be 1-50, that is, field information falling within this range may correspond to the text name; the data range of the text word count may be 1-2147483647, that is, field information falling within this range may correspond to the text word count.
The characters commonly contained in a field type are the characters that may be used to represent that field type. For example, if the field type is the text name, the commonly contained characters may be name, bookname, book_name, and the like; if the field type is the text classification, the commonly contained characters may be cate, category, class, and the like; if the field type is the text word count, the commonly contained characters may be words, count, cnt, and the like.
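The four kinds of identification parameters described above (data type, length, data range, and commonly contained characters) could be held in a small record per field type. The following is a minimal sketch only; the class name FieldSpec, its attribute names, and the value_matches helper are assumptions made for illustration and do not come from the text.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class FieldSpec:
    """Identification parameters of one field type (all names are illustrative)."""
    field_type: str                          # e.g. "text name"
    data_type: str                           # "character" or "numerical"
    length_range: Tuple[int, int]            # permitted length of the field information
    value_range: Optional[Tuple[int, int]]   # permitted numeric range, if numerical
    name_hints: Tuple[str, ...]              # characters the field name commonly contains


def value_matches(spec: FieldSpec, value: str) -> bool:
    """Check a raw value against the data type, length, and data range of a spec."""
    lo, hi = spec.length_range
    if not lo <= len(value) <= hi:
        return False
    if spec.data_type == "numerical":
        if not (value.isascii() and value.isdigit()):
            return False
        if spec.value_range is not None:
            vlo, vhi = spec.value_range
            return vlo <= int(value) <= vhi
    return True


# Parameter values follow the examples given in the description.
TEXT_NAME = FieldSpec("text name", "character", (1, 50), None,
                      ("name", "bookname", "book_name"))
WORD_COUNT = FieldSpec("text word count", "numerical", (1, 10), (1, 2147483647),
                       ("words", "count", "cnt"))

print(value_matches(TEXT_NAME, "edge city"), value_matches(WORD_COUNT, "2000000"))
```

Here the length is measured with Python's len, which counts code points; a real system would need to decide how length is counted for its own character set.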
The following example illustrates the process of identifying the first interface information of the first text through the target interface identification model to obtain the text information of the first text. The field types contained in the target interface identification model include the text name, the text classification, the text word count, and the like. The identification parameters of the text name are: character type, 1-50 characters, characters in the utf8 range, and name/bookname/book_name; the identification parameters of the text classification are: character type, 2-4 characters, characters in the utf8 range, and cate/category/class; the identification parameters of the text word count are: numerical type, 1-10 characters, 1-2147483647, and words/count/cnt; and so on. The text information of the first text obtained after identification may be:
"bookname": "salted fish turn-over record",
"author": "Oriental non-failure",
"category": "city",
"words": "1964556",
"chapter count": "998",
"cover": "http://xxx.com/yyyy.jpg",
"Zhang San, a university graduate, is framed by his own friend at a critical point in his promotion, loses his reputation, and is forced to take up the work he likes least ……"
……
The plurality of first field types includes: "bookname", "author", "category", "words", "chapter count", "cover", and the like; the field information corresponding to the plurality of first field types includes: "salted fish turn-over record", "Oriental non-failure", "city", "1964556", "998", "http://xxx.com/yyyy.jpg", "Zhang San, a university graduate, is framed by his own friend at a critical point in his promotion, loses his reputation, and is forced to take up the work he likes least ……", and the like.
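The identification step illustrated above can be sketched as a lookup that matches provider field names against the commonly contained characters of each field type and then checks the data type. This is a simplified illustration, not the patented implementation: the dictionary layout, the function names, and the rule that earlier hints take priority are all assumptions, and the length and range checks are omitted for brevity.

```python
from typing import Dict, Optional, Tuple

# Illustrative base model: field type -> commonly contained characters and
# whether the field information must be numerical (hints follow the text).
BASE_MODEL = {
    "text name": {"hints": ("bookname", "book_name", "name"), "numerical": False},
    "text classification": {"hints": ("category", "cate", "class"), "numerical": False},
    "text word count": {"hints": ("words", "count", "cnt"), "numerical": True},
}


def _match(params: dict, interface_info: Dict[str, str]) -> Optional[Tuple[str, str]]:
    """Find the first provider field whose name contains a hint and whose value fits."""
    for hint in params["hints"]:              # earlier hints take priority (assumption)
        for name, value in interface_info.items():
            if hint in name.lower() and (value.isdigit() or not params["numerical"]):
                return name, value
    return None


def recognize(interface_info: Dict[str, str]) -> Dict[str, Dict[str, str]]:
    """Map each field type to the provider field name and field information matched."""
    result = {}
    for field_type, params in BASE_MODEL.items():
        hit = _match(params, interface_info)
        if hit is not None:
            result[field_type] = {"field_name": hit[0], "field_info": hit[1]}
    return result


payload = {
    "bookname": "salted fish turn-over record",
    "author": "Oriental non-failure",
    "category": "city",
    "words": "1964556",
    "chapter count": "998",
    "cover": "http://xxx.com/yyyy.jpg",
}
recognized = recognize(payload)
print(recognized)
```

Giving earlier hints priority keeps "chapter count" from being mistaken for the word count when a "words" field is also present; other tie-breaking rules would be equally plausible.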
S103, importing the text information of the first text into a text resource system, wherein the text resource system is used for displaying the text information of the first text to a user.
Here, the text resource system may store the text information. The user can view the text information of the first text through the text resource system, specifically the text ID, text name, text author, text classification, text chapter name, text word count, text content, and the like of the first text. In one possible scenario, the text resource system may be a novel resource system, for example, a novel resource library corresponding to XX novel software or a novel resource library corresponding to an applet.
In a specific implementation, a corresponding relationship between the target interface recognition model and the text resource system can be pre-established. The target interface identification model comprises a plurality of field types, and the text resource system comprises a plurality of service field types, namely, the corresponding relation between the field types and the service field types in the text resource system is established.
Here, the correspondence may be a mapping relationship, and according to a field type in the target interface identification model and a mapping relationship between the target interface identification model and the text resource system, a service field type in the text resource system having a mapping relationship with the field type in the target interface identification model is determined, and field information corresponding to the identified first field type is mapped to field information corresponding to the service field type corresponding to the field type in the target interface identification model, so that text information of the first text identified by the target interface identification model is imported into the text resource system.
For example, the first field type text name in the target interface identification model corresponds to the service field type text name in the text resource system; the first field type text classification corresponds to the service field type text classification; and the first field type text word count corresponds to the service field type text word count. If the first field types and corresponding field information identified by the target interface identification model are "bookname": "salted fish turn-over record", "category": "city", and "book words": "1964556", then the field information corresponding to the "text name" in the text resource system is "salted fish turn-over record", the field information corresponding to the "text classification" is "city", and the field information corresponding to the "text word count" is "1964556".
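The mapping relationship between field types in the model and service field types in the text resource system can be sketched as a simple re-keying step. The service field names below are hypothetical placeholders, since the text does not name the actual fields of the text resource system.

```python
from typing import Dict

# Hypothetical mapping from model field types to service field names in the
# text resource system; the real service field names are not given in the text.
FIELD_TO_SERVICE = {
    "text name": "service_text_name",
    "text classification": "service_text_classification",
    "text word count": "service_word_count",
}


def to_service_record(text_info: Dict[str, str]) -> Dict[str, str]:
    """Re-key identified field information by the mapped service field type."""
    return {FIELD_TO_SERVICE[ft]: info
            for ft, info in text_info.items() if ft in FIELD_TO_SERVICE}


record = to_service_record({
    "text name": "salted fish turn-over record",
    "text classification": "city",
    "text word count": "1964556",
    "unmapped field": "ignored",   # no mapping, so it is not imported
})
print(record)
```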
Optionally, a type conversion relationship between the target interface identification model and the text resource system may be further established, that is, a type conversion relationship between a plurality of field types in the target interface identification model and a plurality of service field types in the text resource system may be established, including:
For example, the text information of the first text obtained after identification is "book words": "1964556", where the identified "1964556" is of character type. The service field in the text resource system corresponding to the text word count in the target interface identification model is "text word count", and the field information corresponding to "text word count" in the text resource system should be numerical. If the type conversion relationship is character type to numerical type, the field information corresponding to "text word count" is the numerical value 1964556 instead of the character string "1964556".
For another example, the text information of the first text obtained after identification is "date": "2019/1/14". The service field in the text resource system corresponding to the text date in the target interface identification model is "date", and the field information corresponding to "date" in the text resource system should be of date type. If the type conversion relationship is character type to date type, the field information corresponding to "date" is the date January 14, 2019 instead of the character string "2019/1/14". Optionally, the date may be the release date of the text, the date on which the writing of the text was completed, or the like.
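The two type conversion relationships described above (character type to numerical type, and character type to date type) can be sketched with a table of converter functions. The table name, the helper names, and the choice of the "%Y/%m/%d" format are assumptions made for this illustration.

```python
from datetime import date, datetime
from typing import Any, Callable, Dict


def parse_slash_date(s: str) -> date:
    """Convert a character-type date such as "2019/1/14" to a date value."""
    return datetime.strptime(s, "%Y/%m/%d").date()


# Illustrative conversion table: field type -> converter to the service field's type.
CONVERTERS: Dict[str, Callable[[str], Any]] = {
    "text word count": int,       # character type -> numerical type
    "date": parse_slash_date,     # character type -> date type
}


def convert(text_info: Dict[str, str]) -> Dict[str, Any]:
    """Apply the type conversion for each field type; unlisted types stay as strings."""
    return {ft: CONVERTERS.get(ft, str)(value) for ft, value in text_info.items()}


converted = convert({
    "text name": "edge city",
    "text word count": "1964556",
    "date": "2019/1/14",
})
print(converted)
```

A production converter would also need to handle values that fail to parse, for example by reporting them as identification anomalies.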
In one possible case, the text information of the first text obtained by identification is:
"To know whether the green should be fat and the red thin",
"author": "care is then disorder",
"emotion",
"words": "1964556",
……
If the field information corresponding to two different first field types, "To know whether the green should be fat and the red thin" and "emotion", is in both cases of character type with a length within 1-50 characters, the field information with the greater character length can be determined as the field information corresponding to the text name, and the field information with the smaller character length can be determined as the field information corresponding to the text classification; that is, "To know whether the green should be fat and the red thin" is determined as the text name and "emotion" is determined as the text classification. Optionally, the text importer may manually adjust the field information corresponding to the service field types in the text resource system to ensure that this field information is imported accurately.
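The length-based tie-break described above can be sketched as follows. The function name and the sample values are placeholders chosen for this illustration.

```python
from typing import Dict, List


def assign_by_length(candidates: List[str]) -> Dict[str, str]:
    """Given two ambiguous character-type values, treat the longer as the text name."""
    if len(candidates) != 2:
        raise ValueError("expected exactly two ambiguous values")
    shorter, longer = sorted(candidates, key=len)
    return {"text name": longer, "text classification": shorter}


# Placeholder values: both are character type and within 1-50 characters.
assigned = assign_by_length(["genre", "a fairly long sample title"])
print(assigned)
```

When both values have the same length the heuristic cannot separate them, which is one reason the manual adjustment mentioned above remains useful.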
In the embodiment of the invention, the acquired first interface information of the first text is identified according to the target interface identification model to obtain the text information of the first text, and the text information of the first text is imported into a text resource system. Because the first interface information is identified automatically through the target interface identification model and the resulting text information is imported into the text resource system automatically, adaptation code no longer has to be developed manually each time first interface information of a first text is identified, which improves the efficiency of importing text into the text resource system and improves the user experience.
In one possible implementation manner, the target interface identification model used to identify the first interface information of the first text can be determined by judging whether interface information of text of the provider of the first text is being identified for the first time, so that the efficiency and the accuracy of text identification can be improved. Referring to fig. 2, fig. 2 is a schematic flow chart of another text importing method according to an embodiment of the present invention, as shown in the drawing, the method includes:
S201, first interface information of a first text is acquired.
The specific method for acquiring the first interface information of the first text is described in step S101, and is not repeated here.
S202, acquiring an identification of a first provider of the first text.
Here, the identification of the first provider of the first text is used to uniquely indicate the first provider of the first text. Specifically, the identification of the first provider of the first text may be one or more of a name of the first provider of the first text, a name abbreviation of the first provider of the first text, an icon of the first provider of the first text, and the like.
S203, judging whether the identification of the first provider is contained in the identified provider identification set.
S204, if not, identifying the first interface information through the basic interface identification model to obtain text information of the first text.
Here, the identifier of any provider in the identified provider identifier set is used to indicate that the identification has been performed on the interface information of the text of the provider, and the basic interface identification model may be used to identify the interface information of the text of different providers, where the basic interface identification model includes features that may be included in the text of all the providers.
Specifically, if the identifier of the first provider is not included in the identified provider identifier set, this indicates that interface information of text of the first provider is being identified for the first time, and the first interface information is identified through the basic interface identification model. After the first interface information is identified for the first time through the basic interface identification model, the identifier of the first provider is added into the identified provider identifier set. Here, the basic interface identification model may serve as the target interface identification model in step S102; the method for identifying the first interface information by using the target interface identification model is described in step S102 and is not repeated here.
When interface information of text of the first provider is identified for the first time, the first interface information is identified through the basic interface identification model, and after the text information of the first text is obtained, a first interface identification model can be generated according to the plurality of first field types in the text information of the first text and the identification parameters of the plurality of first field types. In other words, the first interface identification model contains the interface information common to all texts of the text provider corresponding to that model. The field types in the first interface identification models corresponding to different providers may differ. For example, in the first interface identification model A of provider A, the characters commonly contained for the field type text name may be bookname, those for the field type text classification may be cat, and those for the field type text word count may be words. In the first interface identification model B of provider B, the characters commonly contained for the field type text name may be name, those for the field type text classification may be category, and those for the field type text word count may be count.
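One way to picture the generation of a provider-specific first interface identification model is to narrow the basic model to the field names a provider was actually observed to use. The text does not specify how the model is generated, so the narrowing rule, the function name, and the data layout below are assumptions made purely for illustration.

```python
from typing import Dict, Tuple

# Illustrative basic model: field type -> characters commonly contained in field names.
BASE_HINTS: Dict[str, Tuple[str, ...]] = {
    "text name": ("bookname", "book_name", "name"),
    "text classification": ("category", "cate", "cat", "class"),
    "text word count": ("words", "count", "cnt"),
}


def build_provider_model(matched_names: Dict[str, str]) -> Dict[str, Tuple[str, ...]]:
    """Keep, per field type, only the field name this provider was seen to use."""
    return {ft: (name,) for ft, name in matched_names.items() if ft in BASE_HINTS}


# Field names follow the provider A and provider B examples in the text.
model_a = build_provider_model(
    {"text name": "bookname", "text classification": "cat", "text word count": "words"})
model_b = build_provider_model(
    {"text name": "name", "text classification": "category", "text word count": "count"})
print(model_a)
print(model_b)
```

A narrowed model has fewer candidates to test per field type, which is consistent with the stated aim of improving identification efficiency and accuracy for later texts from the same provider.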
Optionally, after the first interface recognition model is generated on first recognition, a correspondence between the first provider and the first interface recognition model may also be established. Specifically, the identifier of the first provider is associated with the first interface recognition model, so that when interface information of other texts of the first provider needs to be recognized, the first interface recognition model corresponding to the first provider can be looked up by the identifier of the first provider.
Here, the identifier of each first provider corresponds to one first interface recognition model. Once the identifier of the first provider is determined, the corresponding first interface recognition model can be determined, and the interface information of that provider's text can be recognized through this model to obtain the text information of the text.
The interface information of the text of each provider can be identified through the basic interface identification model by the method of steps S201-S204, so that the text information of the first text of each provider is obtained, and further, the first interface identification model corresponding to each provider is generated.
For example, for providers A, B, and C, the interface information of the text provided by each provider is recognized by the basic interface recognition model. This yields first field types A1 to A8 and recognition parameters A1 to A8 of those first field types for provider A, first field types B1 to B7 and recognition parameters B1 to B7 for provider B, and first field types C1 to C9 and recognition parameters C1 to C9 for provider C. A first interface recognition model A is generated from the first field types A1 to A8 and the recognition parameters A1 to A8 corresponding to provider A, a first interface recognition model B is generated from the first field types B1 to B7 and the recognition parameters B1 to B7 corresponding to provider B, and a first interface recognition model C is generated from the first field types C1 to C9 and the recognition parameters C1 to C9 corresponding to provider C. The correspondence between providers and first interface recognition models is then: provider A corresponds to the first interface recognition model A, provider B corresponds to the first interface recognition model B, and provider C corresponds to the first interface recognition model C. In this way, the first interface recognition model corresponding to each provider can be obtained.
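The first-time path (recognize with the basic model, generate a provider-specific model, record the correspondence) can be sketched as follows. This is only an illustration under stated assumptions: `BASE_MODEL`, its candidate markers, `recognize_with_base`, and `first_time_import` are hypothetical names, and storing the matched interface key as the recognition parameter is one possible reading of the description, not a confirmed detail.

```python
from typing import Any, Dict, List, Set, Tuple

# Hypothetical basic model: each field type lists several candidate markers,
# so that interface information from different providers can be recognized.
BASE_MODEL: Dict[str, List[str]] = {
    "text name": ["bookname", "name", "title"],
    "text category": ["category", "cat"],
    "text word count": ["words", "count"],
}

identified_providers: Set[str] = set()           # identified provider identifier set
provider_models: Dict[str, Dict[str, str]] = {}  # provider identifier -> first model


def recognize_with_base(interface_info: Dict[str, Any]) -> Tuple[Dict[str, Any], Dict[str, str]]:
    """Return the text information and, per field type, the interface key that matched."""
    text_info: Dict[str, Any] = {}
    params: Dict[str, str] = {}
    for field_type, markers in BASE_MODEL.items():
        for key, value in interface_info.items():
            if any(marker in key.lower() for marker in markers):
                text_info[field_type] = value
                params[field_type] = key  # assumed recognition parameter
                break
    return text_info, params


def first_time_import(provider_id: str, interface_info: Dict[str, Any]) -> Dict[str, Any]:
    """Recognize with the basic model, then generate and register the provider's model."""
    text_info, params = recognize_with_base(interface_info)
    provider_models[provider_id] = params   # correspondence: provider -> first model
    identified_providers.add(provider_id)   # mark the provider as identified
    return text_info


text_info_a = first_time_import("A", {"bookName": "T", "cat": "fantasy", "words": 1000})
```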
S205, if yes, a first interface identification model with a corresponding relation with the first provider is obtained.
Here, if the identifier of the first provider is included in the identified provider identifier set, the interface information of the first provider's text is not being identified for the first time, and the first interface identification model corresponding to the first provider is obtained. For example, if the first provider is provider B from step S204, the first interface identification model corresponding to provider B is the first interface identification model B.
S206, determining the first interface identification model as a target interface identification model.
For example, the first interface recognition model B exemplified in step S205 is determined as the target interface recognition model.
S207, identifying the first interface information of the first text through the target interface identification model to obtain text information of the first text.
Here, for the method of identifying the first interface information of the first text through the target interface identification model, refer to the description in step S102, which is not repeated here.
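The branch between the basic model and a provider-specific model in the steps above can be sketched in a few lines. The function name `select_target_model` and the representation of a model as a plain dictionary are assumptions for illustration only.

```python
from typing import Dict, Set

Model = Dict[str, str]  # hypothetical: field type -> recognition parameter


def select_target_model(
    provider_id: str,
    identified: Set[str],
    provider_models: Dict[str, Model],
    base_model: Model,
) -> Model:
    """Pick the model used to identify the first interface information."""
    if provider_id in identified:
        # Not the first identification: use the provider's own first model.
        return provider_models[provider_id]
    # First identification for this provider: fall back to the basic model.
    return base_model


base = {"text name": "name"}
model_b = {"text name": "name", "text category": "category"}
chosen_known = select_target_model("B", {"B"}, {"B": model_b}, base)
chosen_new = select_target_model("D", {"B"}, {"B": model_b}, base)
```

Keeping the selection separate from recognition means the same recognition routine can run regardless of which model was chosen.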
In one possible implementation of this embodiment, if field information corresponding to at least one mandatory field type is not recognized from the first interface information by the target interface recognition model, recognition anomaly information is output, where the recognition anomaly information includes the at least one mandatory field type that was not recognized. In this case, optionally, when the target interface recognition model is the basic interface recognition model, the basic interface recognition model may include a plurality of mandatory fields and a plurality of mandatory field types, and when the target interface recognition model is the first interface recognition model, the first interface recognition model may likewise include a plurality of mandatory fields and a plurality of mandatory field types.
Here, a mandatory field type may be a text ID, a text name, a text author, a text category, a text chapter name, a text word count, a text content, or the like, that is, a feature that is necessary for describing the text. The recognition anomaly information may be "XX field type recognition anomaly" shown on a display interface, or the same message delivered by voice prompt. For example, when the field information corresponding to the mandatory field type text content is not recognized from the first interface information by the target interface recognition model, the recognition anomaly information "text content recognition anomaly" is output, as shown in fig. 3. Fig. 3 is a schematic diagram of a recognition anomaly information display interface provided by an embodiment of the present invention.
Optionally, the plurality of field types in the target interface recognition model may further include a plurality of optional field types, that is, features that are not necessary for describing the text. Examples include text reader information, such as whether the text reader is a member, or the age of the text reader, such as a requirement that the reader's age fall within a certain interval, for example [8,60]. When field information corresponding to at least one optional field type is not recognized from the first interface information by the target interface recognition model, whether to output recognition anomaly information may be decided case by case, for example according to the importance of the optional field type.
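The mandatory and optional field checks described above can be sketched as follows. The function name `recognition_anomalies`, the `important_optional` parameter, and the rule that an empty string counts as unrecognized are assumptions for illustration; the message text follows the "text content recognition anomaly" example in the description.

```python
from typing import Any, Dict, Iterable, List

# Mandatory field types listed in the description.
MANDATORY_FIELD_TYPES = (
    "text ID",
    "text name",
    "text author",
    "text category",
    "text chapter name",
    "text word count",
    "text content",
)


def recognition_anomalies(
    text_info: Dict[str, Any],
    mandatory: Iterable[str] = MANDATORY_FIELD_TYPES,
    important_optional: Iterable[str] = (),
) -> List[str]:
    """Return one anomaly message per field type that was not recognized.

    Optional field types are reported only if the caller lists them as
    important (an assumed way of expressing "importance").
    """
    messages: List[str] = []
    for field_type in list(mandatory) + list(important_optional):
        if text_info.get(field_type) in (None, ""):
            messages.append(f"{field_type} recognition anomaly")
    return messages


recognized = {
    "text ID": "1001",
    "text name": "Example Title",
    "text author": "Example Author",
    "text category": "fantasy",
    "text chapter name": "Chapter 1",
    "text word count": 120000,
}
anomalies = recognition_anomalies(recognized)
```

Returning a list of messages rather than raising on the first miss lets the text importer see every unrecognized field type in one pass.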
S208, importing the text information of the first text into a text resource system, wherein the text resource system is used for displaying the text information of the first text to a user.
Here, for the method of importing the text information of the first text into the text resource system, refer to the description of step S103, which is not repeated here.
In the embodiment of the invention, whether the interface information of the first provider's text is being identified for the first time is determined by judging whether the identifier of the first provider of the first text is contained in the identified provider identifier set. If it is the first identification, a first interface identification model corresponding to the first provider is generated from the identified text information of the first text. Because the first interface identification model carries the characteristics of the text information of the first provider's texts, using it to identify the text information of the first provider's texts improves identification efficiency and makes the identification result more accurate. In addition, when the interface information of the first provider's text is identified through the first interface identification model and the field information corresponding to the mandatory fields is not fully identified, identification anomaly information for the unidentified mandatory fields is output, so that the text importer can learn the cause of the anomaly and make the corresponding adjustment. This improves both the efficiency of text identification and the efficiency of importing text into the text resource system.
In one possible implementation, the text importing method may be applied in the following system structures. The first system structure may be as shown in fig. 4, which is an exemplary diagram of a text importing method according to an embodiment of the present invention. The text importing system includes a first provider 401 and a text resource system 402, and the text resource system 402 includes a text importing device 4021. First, the first provider 401 provides first interface information of a first text. Next, the text importing device 4021 obtains the first interface information of the first text and identifies it through a target interface identification model in the text importing device 4021 to obtain text information of the first text. Finally, according to the correspondence between the text information of the first text and the text resource system 402, the text importing device 4021 imports the text information of the first text into the field information corresponding to each service field type in the text resource system 402, thereby importing the text into the text resource system 402.
A second system architecture may be shown in fig. 5, where fig. 5 is an exemplary diagram of another text import method according to an embodiment of the present invention, and the text import system includes a first provider 501, a text import device 502, and a text resource system 503. First, the first provider 501 provides first interface information of a first text; next, the text importing device 502 obtains first interface information of the first text, and recognizes the first interface information of the first text through a target interface recognition model in the text importing device 502 to obtain text information of the first text; finally, according to the corresponding relation between the target interface identification model in the text importing device 502 and the text resource system 503, importing the text information of the first text into the field information corresponding to each service field type in the text resource system 503 corresponding to the target interface identification model, so as to realize text importing into the text resource system 503.
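The final import step in both system structures maps recognized field types onto the service field types of the text resource system. A minimal sketch follows, assuming a hypothetical mapping table `FIELD_TO_SERVICE_FIELD`, hypothetical service field names such as `title`, and an in-memory dictionary standing in for the text resource system; none of these are specified in the description.

```python
from typing import Any, Dict

# Hypothetical correspondence between recognized field types and the service
# field types of the text resource system.
FIELD_TO_SERVICE_FIELD: Dict[str, str] = {
    "text name": "title",
    "text category": "genre",
    "text word count": "word_count",
}


def import_text(
    text_id: str,
    text_info: Dict[str, Any],
    resource_system: Dict[str, Dict[str, Any]],
) -> Dict[str, Any]:
    """Write recognized field information into the matching service fields."""
    record = {
        FIELD_TO_SERVICE_FIELD[field_type]: value
        for field_type, value in text_info.items()
        if field_type in FIELD_TO_SERVICE_FIELD  # unmapped field types are skipped
    }
    resource_system[text_id] = record
    return record


store: Dict[str, Dict[str, Any]] = {}
imported = import_text(
    "t1",
    {"text name": "Example Title", "text category": "fantasy", "unmapped": 1},
    store,
)
```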
The text importing device according to the embodiment of the present invention may be a device with processing capability, for example: tablet computers, cell phones, electronic readers, personal computers (Personal Computer, PCs), notebook computers, servers and other devices; or may be a text importation module embedded in a text resource system. The embodiment of the present invention is not limited thereto.
The method of the embodiment of the invention is described above, and the apparatus of the embodiment of the invention is described below.
Referring to fig. 6, fig. 6 is a schematic structural diagram of a text importing apparatus according to an embodiment of the present invention, where the apparatus includes:
an interface information obtaining module 601, configured to obtain first interface information of a first text;
the interface information identifying module 602 is configured to identify the first interface information through a target interface identifying model, so as to obtain text information of the first text, where the text information includes a plurality of first field types and field information corresponding to each of the plurality of first field types, and the target interface identifying model includes a plurality of field types and identifying parameters of each of the plurality of field types;
a text information importing module 603, configured to import the text information of the first text into a text resource system, where the text resource system is configured to display the text information of the first text to a user.
In one possible design, the target interface recognition model is a basic interface recognition model that can be used to recognize interface information of texts of different providers;
the interface information identifying module 602 is specifically configured to obtain an identifier of a first provider of the first text;
the interface information identifying module 602 is specifically configured to identify, if the identifier of the first provider is not included in the identified provider identifier set, the first interface information through a basic interface identifying model, so as to obtain text information of the first text;
wherein the identity of any one provider in the set of identified provider identities is used to indicate that identification has been performed on interface information of text of the provider.
In one possible design, the apparatus further comprises:
a first model generating module 604, configured to generate a first interface identification model according to the plurality of first field types and identification parameters of the plurality of first field types;
the first model generating module 604 is further configured to establish a correspondence between the first provider and the first interface identification model.
In one possible design, the apparatus 60 further comprises:
a target model determining module 605, configured to obtain an identification of a first provider of the first text;
the target model determining module 605 is further configured to obtain a first interface identification model that has a correspondence with the first provider if the identifier of the first provider is included in the identified provider identifier set;
the target model determining module 605 is further configured to determine the first interface identification model as the target interface identification model.
In one possible design, the plurality of field types in the target interface recognition model includes a plurality of mandatory field types;
the apparatus 60 further comprises:
an anomaly information output module 606, configured to output identification anomaly information when field information corresponding to at least one mandatory field type is not identified from the first interface information by the target interface identification model, where the identification anomaly information includes the at least one mandatory field type that is not identified.
It should be noted that, for content not mentioned in the embodiment corresponding to fig. 6, reference may be made to the description of the method embodiment, which is not repeated here.
In the embodiment of the invention, the acquired first interface information of the first text is identified according to the target interface identification model to obtain the text information of the first text, and the text information of the first text is imported into a text resource system. Because the first interface information of the first text is identified automatically through the target interface identification model and the resulting text information is imported automatically into the text resource system, there is no need to manually develop adaptation code each time first interface information of a first text is identified, which improves the efficiency of importing text into the text resource system. Whether the interface information of the first provider's text is being identified for the first time is determined by judging whether the identifier of the first provider of the first text is contained in the identified provider identifier set. If it is the first identification, a first interface identification model corresponding to the first provider is generated. Because the first interface identification model carries the characteristics of the text information of the first provider's texts, using it to identify the text information of the first provider's texts improves identification efficiency and makes the identification result more accurate. When the interface information of the first provider's text is identified through the first interface identification model and the field information corresponding to the mandatory fields is not fully identified, identification anomaly information for the unidentified mandatory fields is output, so that the text importer can learn the cause of the anomaly and adjust accordingly. This improves both the efficiency of text identification and the efficiency of importing text into the text resource system.
Referring to fig. 7, fig. 7 is a schematic structural diagram of a text import apparatus according to an embodiment of the present invention, and the apparatus 70 includes a processor 701, a memory 702, and an input/output interface 703. The processor 701 is connected to a memory 702 and an input-output interface 703, for example, the processor 701 may be connected to the memory 702 and the input-output interface 703 through a bus.
The processor 701 is configured to support the text importing apparatus in performing the corresponding functions of the text importing method described in fig. 1-2. The processor 701 may be a central processing unit (CPU), a network processor (NP), a hardware chip, or any combination thereof. The hardware chip may be an application-specific integrated circuit (ASIC), a programmable logic device (PLD), or a combination thereof. The PLD may be a complex programmable logic device (CPLD), a field-programmable gate array (FPGA), generic array logic (GAL), or any combination thereof.
The memory 702 stores program code and the like. The memory 702 may include volatile memory (VM), such as random access memory (RAM); the memory 702 may also include non-volatile memory (NVM), such as read-only memory (ROM), flash memory, a hard disk drive (HDD), or a solid-state drive (SSD); the memory 702 may also include a combination of the above types of memory.
The input/output interface 703 is used for inputting or outputting data.
The processor 701 may call the program code to:
acquiring first interface information of a first text;
identifying the first interface information through a target interface identification model to obtain text information of the first text, wherein the text information comprises a plurality of first field types and field information corresponding to each of the plurality of first field types, and the target interface identification model comprises a plurality of field types and identification parameters of each of the plurality of field types;
and importing the text information of the first text into a text resource system, wherein the text resource system is used for displaying the text information of the first text to a user.
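The three operations performed by the program code can be strung together in a short end-to-end sketch. The function `run_import`, the `fetch` callable standing in for acquisition over the input/output interface, the exact-key matching rule, and the dictionary used as the text resource system are all assumptions for illustration.

```python
from typing import Any, Callable, Dict


def run_import(
    fetch: Callable[[], Dict[str, Any]],
    target_model: Dict[str, str],
    resource_system: Dict[str, Dict[str, Any]],
    text_id: str,
) -> Dict[str, Any]:
    """Acquire interface information, identify it, and import the result."""
    # Step 1: acquire the first interface information of the first text.
    interface_info = fetch()
    # Step 2: identify it through the target model (field type -> interface key).
    text_info = {
        field_type: interface_info[key]
        for field_type, key in target_model.items()
        if key in interface_info
    }
    # Step 3: import the text information into the text resource system.
    resource_system[text_id] = text_info
    return text_info


system: Dict[str, Dict[str, Any]] = {}
model = {"text name": "name", "text word count": "count"}
pipeline_result = run_import(lambda: {"name": "T", "count": 10}, model, system, "t1")
```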
It should be noted that, for the implementation of each operation, reference may also be made to the corresponding description in the above method embodiment; the processor 701 may also cooperate with the input/output interface 703 to perform other operations in the above method embodiments.
Embodiments of the present invention also provide a computer storage medium storing a computer program comprising program instructions which, when executed by a computer, cause the computer to perform the method described in the previous embodiments. The computer may be part of the text importing apparatus mentioned above, such as the processor 701 described above.
Those skilled in the art will appreciate that implementing all or part of the above-described methods in the embodiments may be accomplished by computer programs stored in a computer-readable storage medium, which when executed, may include the steps of the embodiments of the methods described above. Wherein the storage medium can be a magnetic disk, an optical disk, a ROM or a RAM, etc.
The foregoing disclosure is illustrative of the present invention and is not to be construed as limiting the scope of the invention, which is defined by the appended claims.

Claims (10)

1. A text importing method, comprising:
acquiring first interface information of a first text;
identifying the first interface information through a target interface identification model to obtain text information of the first text, wherein the text information comprises a plurality of first field types and field information corresponding to each of the plurality of first field types, and the target interface identification model comprises a plurality of field types and identification parameters of each of the plurality of field types;
importing the text information of the first text into a text resource system, wherein the text resource system is used for displaying the text information of the first text to a user;
the target interface identification model is a basic interface identification model, and the basic interface identification model can be used for identifying interface information of texts of different providers;
the identifying the first interface information through the target interface identification model to obtain text information of the first text includes:
acquiring an identification of a first provider of the first text;
if the identification of the first provider is not contained in the identified provider identification set, identifying the first interface information through a basic interface identification model to obtain text information of the first text;
wherein the identity of any one provider in the set of identified provider identities is used to indicate that identification has been performed on interface information of text of the provider.
2. The method according to claim 1, wherein the method further comprises:
generating a first interface identification model according to the plurality of first field types and identification parameters of the plurality of first field types;
and establishing a correspondence between the first provider and the first interface identification model.
3. The method according to claim 1, wherein before the identifying the first interface information through the target interface identification model to obtain the text information of the first text, the method further comprises:
acquiring an identification of a first provider of the first text;
if the identification of the first provider is contained in the identified provider identification set, acquiring a first interface identification model with a corresponding relation with the first provider;
and determining the first interface identification model as the target interface identification model.
4. The method of claim 3, wherein the plurality of field types in the target interface identification model includes a plurality of mandatory field types;
the method further comprises:
outputting identification anomaly information in a case where field information corresponding to at least one mandatory field type is not identified from the first interface information through the target interface identification model, wherein the identification anomaly information comprises the at least one mandatory field type that is not identified.
5. A text importation apparatus, comprising:
the interface information acquisition module is used for acquiring first interface information of the first text;
the interface information identification module is used for identifying the first interface information through a target interface identification model to obtain text information of the first text, wherein the text information comprises a plurality of first field types and field information corresponding to each of the plurality of first field types, and the target interface identification model comprises a plurality of field types and identification parameters of each of the plurality of field types;
the text information importing module is used for importing the text information of the first text into a text resource system, and the text resource system is used for displaying the text information of the first text to a user, wherein the target interface recognition model is a basic interface recognition model which can be used for recognizing the interface information of the texts of different providers;
the interface information identification module is specifically configured to obtain an identifier of a first provider of the first text;
the interface information identification module is specifically configured to identify, if the identifier of the first provider is not included in the identified provider identifier set, the first interface information through a basic interface identification model, so as to obtain text information of the first text;
wherein the identity of any one provider in the set of identified provider identities is used to indicate that identification has been performed on interface information of text of the provider.
6. The apparatus of claim 5, wherein the apparatus further comprises:
the first model generation module is used for generating a first interface identification model according to the plurality of first field types and the identification parameters of the plurality of first field types;
the first model generating module is further configured to establish a correspondence between the first provider and the first interface identification model.
7. The apparatus of claim 5, wherein the apparatus further comprises:
the target model determining module is used for acquiring the identification of a first provider of the first text;
the target model determining module is further configured to obtain a first interface identification model having a corresponding relationship with the first provider if the identifier of the first provider is included in the identified provider identifier set;
the target model determining module is further configured to determine the first interface identification model as the target interface identification model.
8. The apparatus of claim 7, wherein the plurality of field types in the target interface identification model comprises a plurality of mandatory field types;
the apparatus further comprises:
an anomaly information output module, configured to output identification anomaly information when field information corresponding to at least one mandatory field type is not identified from the first interface information through the target interface identification model, wherein the identification anomaly information comprises the at least one mandatory field type that is not identified.
9. A text importation apparatus comprising a processor, a memory and an input output interface, said processor, memory and input output interface being interconnected, wherein said input output interface is for inputting or outputting data, said memory is for storing program code, said processor is for invoking said program code to perform the method of any of claims 1-4.
10. A computer storage medium storing a computer program comprising program instructions which, when executed by a processor, cause the processor to perform the method of any of claims 1-4.
CN201910359179.2A 2019-04-29 2019-04-29 Text importing method, device and equipment Active CN110083839B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910359179.2A CN110083839B (en) 2019-04-29 2019-04-29 Text importing method, device and equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910359179.2A CN110083839B (en) 2019-04-29 2019-04-29 Text importing method, device and equipment

Publications (2)

Publication Number Publication Date
CN110083839A CN110083839A (en) 2019-08-02
CN110083839B true CN110083839B (en) 2023-08-22

Family

ID=67417937

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910359179.2A Active CN110083839B (en) 2019-04-29 2019-04-29 Text importing method, device and equipment

Country Status (1)

Country Link
CN (1) CN110083839B (en)

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102982011A (en) * 2011-09-07 2013-03-20 百度在线网络技术(北京)有限公司 Method and device for identifying out-of-sequence texts
CN104462716A (en) * 2014-12-23 2015-03-25 北京理工大学 Method for designing brain-computer interface parameters and kinetic parameters of brain controlled vehicle based on human-vehicle-road model
CN105607938A (en) * 2015-12-30 2016-05-25 中国银联股份有限公司 Method for allocating interface elements of security applications
CN106293727A (en) * 2016-08-04 2017-01-04 深圳市微我科技有限公司 A kind of method of shared wisdom based on tables of data
CN107251030A (en) * 2015-02-09 2017-10-13 皇家飞利浦有限公司 It is used as the wearable device of service
CN108279885A (en) * 2017-01-03 2018-07-13 中国航发商用航空发动机有限责任公司 A kind of method and device that multiple model codes are carried out with Integrated Simulation
CN108304368A (en) * 2017-04-20 2018-07-20 腾讯科技(深圳)有限公司 The kind identification method and device and storage medium and processor of text message
CN108829882A (en) * 2018-06-27 2018-11-16 深圳乐信软件技术有限公司 Formation gathering method, device, terminal and medium
CN109189666A (en) * 2018-08-02 2019-01-11 腾讯科技(北京)有限公司 Interface test method, device and computer equipment
CN109388675A (en) * 2018-10-12 2019-02-26 平安科技(深圳)有限公司 Data analysing method, device, computer equipment and storage medium

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8751963B1 (en) * 2013-01-23 2014-06-10 Splunk Inc. Real time indication of previously extracted data fields for regular expressions
US10354257B2 (en) * 2017-05-05 2019-07-16 Servicenow, Inc. Identifying clusters for service management operations

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102982011A (en) * 2011-09-07 2013-03-20 百度在线网络技术(北京)有限公司 Method and device for identifying out-of-sequence texts
CN104462716A (en) * 2014-12-23 2015-03-25 北京理工大学 Method for designing brain-computer interface parameters and kinetic parameters of brain controlled vehicle based on human-vehicle-road model
CN107251030A (en) * 2015-02-09 2017-10-13 皇家飞利浦有限公司 It is used as the wearable device of service
CN105607938A (en) * 2015-12-30 2016-05-25 中国银联股份有限公司 Method for allocating interface elements of security applications
CN106293727A (en) * 2016-08-04 2017-01-04 深圳市微我科技有限公司 A kind of method of shared wisdom based on tables of data
CN108279885A (en) * 2017-01-03 2018-07-13 中国航发商用航空发动机有限责任公司 A kind of method and device that multiple model codes are carried out with Integrated Simulation
CN108304368A (en) * 2017-04-20 2018-07-20 腾讯科技(深圳)有限公司 The kind identification method and device and storage medium and processor of text message
CN108829882A (en) * 2018-06-27 2018-11-16 深圳乐信软件技术有限公司 Formation gathering method, device, terminal and medium
CN109189666A (en) * 2018-08-02 2019-01-11 腾讯科技(北京)有限公司 Interface test method, device and computer equipment
CN109388675A (en) * 2018-10-12 2019-02-26 平安科技(深圳)有限公司 Data analysing method, device, computer equipment and storage medium

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
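Research and Design of a Dynamically Configurable Data Acquisition System in the Big Data Era; Ding Jun et al.; 《计算机应用与软件》 (Computer Applications and Software); 2018-03-15 (No. 03); pp. 81-85 *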

Also Published As

Publication number Publication date
CN110083839A (en) 2019-08-02

Similar Documents

Publication Publication Date Title
CN110765770A (en) Automatic contract generation method and device
CN109388675B (en) Data analysis method, device, computer equipment and storage medium
CN109582772B (en) Contract information extraction method, contract information extraction device, computer equipment and storage medium
WO2019074574A1 (en) Automated orchestration of incident triage workflows
CN112613917A (en) Information pushing method, device and equipment based on user portrait and storage medium
CN111240688B (en) Excel file analysis method and device, computer equipment and storage medium
CN111400126B (en) Network service abnormal data detection method, device, equipment and medium
CN110688315A (en) Interface code detection report generation method, electronic device, and storage medium
CN117851575A (en) Large language model question-answer optimization method and device, electronic equipment and storage medium
CN110362630B (en) Data management method, device, equipment and computer readable storage medium
CN113126955A (en) Random data generation method and device, intelligent terminal and storage medium
CN113360300B (en) Interface call link generation method, device, equipment and readable storage medium
CN105094562A (en) Information processing method and terminal
CN108595685B (en) Data processing method and device
KR102280490B1 (en) Training data construction method for automatically generating training data for artificial intelligence model for counseling intention classification
US11500840B2 (en) Contrasting document-embedded structured data and generating summaries thereof
CN113515703A (en) Information recommendation method and device, electronic equipment and readable storage medium
CN110083839B (en) Text importing method, device and equipment
US10936801B2 (en) Automated electronic form generation with context cues
CN117314139A (en) Modeling method and device for business process, terminal equipment and storage medium
CN113672497B (en) Method, device and equipment for generating non-buried point event and storage medium
CN111400245B (en) Art resource migration method and device
CN111291178B (en) Dialogue classification method and device, electronic equipment and storage medium
CN108460159B (en) Information reply method, terminal equipment and computer readable storage medium
JP2020101898A (en) Design drawing creation support method, design drawing creation support device, and design drawing creation support program

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant