CN110083839A - Text introduction method, device and equipment - Google Patents

Text introduction method, device and equipment Download PDF

Info

Publication number
CN110083839A
CN110083839A CN201910359179.2A CN201910359179A CN110083839A CN 110083839 A CN110083839 A CN 110083839A CN 201910359179 A CN201910359179 A CN 201910359179A CN 110083839 A CN110083839 A CN 110083839A
Authority
CN
China
Prior art keywords
text
interface
information
provider
identification model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910359179.2A
Other languages
Chinese (zh)
Other versions
CN110083839B (en
Inventor
胡建
周振华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhuhai Seal Fun Technology Co Ltd
Original Assignee
Zhuhai Seal Fun Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhuhai Seal Fun Technology Co Ltd filed Critical Zhuhai Seal Fun Technology Co Ltd
Priority to CN201910359179.2A priority Critical patent/CN110083839B/en
Publication of CN110083839A publication Critical patent/CN110083839A/en
Application granted granted Critical
Publication of CN110083839B publication Critical patent/CN110083839B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/16File or folder operations, e.g. details of user interfaces specifically adapted to file systems
    • G06F16/168Details of user interfaces specifically adapted to file systems, e.g. browsing and visualisation, 2d or 3d GUIs
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Human Computer Interaction (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Computer And Data Communications (AREA)
  • Machine Translation (AREA)

Abstract

The present invention provides text introduction method, device and equipment, wherein method includes: to obtain the first interface information of the first text;The first interface information is identified by target interface identification model, obtain the text information of first text, the text information include in multiple first field types and the multiple first field type the corresponding field information of each first field type, the target interface identification model contain the identification parameter of each field type in multiple field types and the multiple field type;The text information of first text is imported into textual resources system, the textual resources system is used to show the text information of first text to user.The efficiency that text imports textual resources system can be improved in the technical solution.

Description

Text introduction method, device and equipment
Technical field
The present invention relates to field of computer technology more particularly to text introduction methods, device and equipment.
Background technique
At present when user's viewing textual information, corresponding text information, text are usually checked in textual resources system Information development quotient needs that the text information of multiple text information providers is imported textual resources system in advance.
When the text information of multiple text information providers is imported textual resources system by current text information developer, The parameter of each field type and each field type that include due to the interface of each text provider has differences, text Information development quotient needs the interface for each text information provider to do targetedly to be adapted to, need to develop corresponding adaptation generation Code.Since the quantity of text information provider is more, and identify that each text information provider interface requires exploitation phase every time The adaptation code answered, it is complicated for operation, reduce the efficiency that text imports textual resources system.
Summary of the invention
The embodiment of the present invention provides text introduction method, device and equipment, solves text identification low efficiency and text is led Enter the low problem of textual resources system effectiveness.
In a first aspect, providing text introduction method, comprising:
Obtain the first interface information of the first text;
The first interface information is identified by target interface identification model, obtains the text of first text Information, the text information include each first field type in multiple first field types and the multiple first field type Corresponding field information, the target interface identification model contain each in multiple field types and the multiple field type The identification parameter of field type;
The text information of first text is imported into textual resources system, the textual resources system is used for user's exhibition Show the text information of first text.
With reference to first aspect, in one possible implementation, the target interface identification model is basic interface knowledge Other model, the basic interface identification model can be used for identifying the interface message of the text of different providers;It is described logical It crosses target interface identification model to identify the first interface information, obtains the text information of first text, comprising: Obtain the mark of the first provider of first text;If the mark of first provider is not included in identified offer In square logo collection, then the first interface information is identified by basic interface identification model, obtains first text This text information;Wherein, the mark of any one provider in identified provider's logo collection is used to indicate Identification is executed to the interface message of the text of the provider.
With reference to first aspect, in one possible implementation, the method also includes: according to the multiple first word The identification parameter of segment type and the multiple first field type generates first interface identification model;Described first is established to provide The square corresponding relationship with the first interface identification model.
With reference to first aspect, in one possible implementation, it is described by target interface identification model to described One interface message identified, before obtaining the text information of first text, further includes: obtains the of first text The mark of one provider;If the mark of first provider is contained in identified provider's logo collection, acquisition and institute State the first interface identification model that the first provider has corresponding relationship;The first interface identification model is determined as the mesh Tag splice mouth identification model.
With reference to first aspect, in one possible implementation, multiple fields in the target interface identification model Type includes multiple mandatory field types;The method also includes: by the target interface identification model to described first Interface message is unidentified in the case where at least one corresponding field information of mandatory field type, output identification exception information, The identification exception information includes unidentified at least one mandatory field type arrived.
Second aspect provides text gatherer, comprising:
Interface message obtains module, for obtaining the first interface information of the first text;
Interface message identification module, for being identified by target interface identification model to the first interface information, The text information of first text is obtained, the text information includes multiple first field types and the multiple first field The corresponding field information of each first field type in type, the target interface identification model contain multiple field types and The identification parameter of each field type in the multiple field type;
Text information import modul, for the text information of first text to be imported textual resources system, the text This resource system is used to show the text information of first text to user.
In conjunction with second aspect, in one possible implementation, the target interface identification model is basic interface knowledge Other model, the basic interface identification model can be used for identifying the interface message of the text of different providers;It is described to connect Mouth information identification module, the mark of the first provider specifically for obtaining first text;The interface message identifies mould Block, if the mark specifically for first provider is not included in identified provider's logo collection, by basic Interface identification model identifies the first interface information, obtains the text information of first text;Wherein, it is described The mark of any one provider in provider's logo collection of identification is used to indicate connecing for the text to the provider Message breath executes identification.
In conjunction with second aspect, in one possible implementation, described device further include: the first model generation module, For the identification parameter according to the multiple first field type and the multiple first field type, first interface identification is generated Model;The first model generation module is also used to establish pair of first provider Yu the first interface identification model It should be related to.
In conjunction with second aspect, in one possible implementation, described device further include: object module determining module, For obtaining the mark of the first provider of first text;The object module determining module, if being also used to described first The mark of provider is contained in identified provider's logo collection, and obtaining has corresponding relationship with first provider First interface identification model;The object module determining module is also used to for the first interface identification model being determined as described Target interface identification model.
In conjunction with second aspect, in one possible implementation, multiple fields in the target interface identification model Type includes multiple mandatory field types;Described device further include: exception information output module, for being connect by the target The case where mouthful identification model unidentified to first interface information field information corresponding at least one mandatory field type Under, output identification exception information, the identification exception information includes unidentified at least one mandatory field type arrived.
The third aspect provides text and imports equipment, including processor, memory and input/output interface, the processing Device, memory and input/output interface are connected with each other, wherein the input/output interface is described for input or output data Memory is used to store text and imports the application code that equipment executes the above method, and the processor is configured for executing The method of above-mentioned first aspect.
Fourth aspect provides a kind of computer storage medium, and the computer storage medium is stored with computer program, institute Stating computer program includes program instruction, and described program instruction makes the processor execute above-mentioned first when being executed by a processor The method of aspect.
In the embodiment of the present invention, according to target interface identification model to the first interface information of the first text got into Row identification obtains the text information of the first text, and the text information of the first text is imported textual resources system.Pass through target Interface identification model automatically identifies the first interface information of the first text, obtains the text information of the first text and automatic The text information of first text is imported into textual resources system, eliminates the first interface information in the first text of identification every time When manual operation develop corresponding adaptation code, improve the efficiency that text imports textual resources system, the user experience is improved.
Detailed description of the invention
It to describe the technical solutions in the embodiments of the present invention more clearly, below will be to needed in the embodiment Attached drawing is briefly described, it should be apparent that, drawings in the following description are only some embodiments of the invention, for ability For the those of ordinary skill of domain, without creative efforts, it can also be obtained according to these attached drawings other attached Figure.
Fig. 1 is a kind of flow diagram for text introduction method that inventive embodiments provide;
Fig. 2 is the flow diagram for another text introduction method that inventive embodiments provide;
Fig. 3 is a kind of identification exception information display interface schematic diagram provided in an embodiment of the present invention;
Fig. 4 is a kind of exemplary diagram of text introduction method provided in an embodiment of the present invention;
Fig. 5 is the exemplary diagram of another text introduction method provided in an embodiment of the present invention;
Fig. 6 is a kind of structural schematic diagram of text gatherer provided in an embodiment of the present invention;
Fig. 7 is the structural schematic diagram that a kind of text provided in an embodiment of the present invention imports equipment.
Specific embodiment
Following will be combined with the drawings in the embodiments of the present invention, and technical solution in the embodiment of the present invention carries out clear, complete Site preparation description, it is clear that the described embodiment is only a part of the embodiment of the present invention, instead of all the embodiments.Based on this Embodiment in invention, every other reality obtained by those of ordinary skill in the art without making creative efforts Example is applied, shall fall within the protection scope of the present invention.
The scheme of the embodiment of the present invention is suitable for the text information that identifies the interface message of text provider and will recognize It imports in the scene of textual resources system, is identified by the interface message automatically to text provider, obtain corresponding text This information is simultaneously directed into textual resources system, and text identification efficiency can be improved and text imports the effect of textual resources system Rate.
It is a kind of flow diagram of text introduction method provided in an embodiment of the present invention referring to Fig. 1, Fig. 1, as shown, This method comprises:
S101 obtains the first interface information of the first text.
In some possible scenes, the first text can be electronic document, such as e-novel, and the first text is by the What one provider provided, the same provider can provide the first different texts, wherein the interface message of each first text It all include common feature.Such as first the field name of text title of N number of first text that provides of provider A often include character All be bookname, text classification field name often include character be all category, text number of words field name often include word Fu Douwei words;It is all book_ that the field name of the text title for M the first texts that first provider B is provided, which often includes character, Name, text classification field name often include character be all cate, text number of words field name often include character be all count, Deng.
In the specific implementation, interface list to be identified is added in the interface message for the text that each provider can be provided, this Sample may include the interface message of each text provided by multiple providers in interface list to be identified, so that text imports Device obtains the first interface information of the first text from list to be identified.
S102 is identified by first interface information of the target interface identification model to the first text, obtains the first text This text information, the text information of the first text include each the in multiple first field types and multiple first field types The corresponding field information of one field type, target interface identification model contain every in multiple field types and multiple field types The identification parameter of a field type.
Here, the field type in target interface identification model can be feature possessed by text to be identified, such as Field type can be text ID, text title, text author, text classification, text chapters and sections name, text number of words, content of text Deng;The identification parameter of field type may include that the data type of the corresponding field information of field type, field type are corresponding The corresponding field information data area of the length of field information, field type, field type often include character etc..
Wherein, the data type of the corresponding field information of field type is the concrete kind of the data of corresponding field information Type.Such as the data type of the corresponding field information of text title (the entitled border town of such as text) is character type, text number of words pair The data type for the field information (such as text number of words is 2000000) answered is numeric type.
The length of the corresponding field information of field type, that is, specific field type corresponding field information message length, example Such as the entitled border town of text, then the corresponding field information of field type is " border town ", and the field length of " border town " is 4;Text word Number is 2000000, then the corresponding field information of field type is " 2000000 ", and the field length of " 2000000 " is 7.
The corresponding field information data area of field type, that is, specific text title data area, such as specific text The data area of title (such as border town) can belong to the corresponding field of field type of the data area in 1~50 range Information is text title, the data area of text number of words can be to belong to the data area in 1~2147483647 range The corresponding field information of field type is text number of words.
Field type often includes that character i.e. which character can be used to indicate that field type, for example, field type is text Title, the then character for often including can be name, bookname, book_name etc.;Field type is text classification, then often wraps It can be cate, category, class etc. containing character;Field type be text number of words, then often comprising character can be words, Count, cnt etc..
For example, identifying to by first interface information of the target interface identification model to the first text, first is obtained The process of the text information of text is illustrated.The field type for including in target interface identification model have text title, Text classification, text number of words etc., the identification parameter of text title are character type, 1~50 character, utf8 range character, name/ bookname/book_name;The identification parameter of text classification is character type, 2~4 characters, utf8 range character, cate/ category/class;The identification parameter of text number of words is numeric type, 1~10 character, 1~2147483647, cate/ Category/class, etc., the then text information of the first text obtained after identifying can be with are as follows:
" bookname ": " cured fish turns over note ",
" author ": " the east is unbeaten ",
" category ": " city ",
" words ": " 1964556 ",
" chapter count ": " 998 ",
" cover ": " http://xxx.com/yyyy.GIF ",
" introduction ": " Zhang San to graduate from university is frame, fame is swept in the key promoted by the friend of oneself Ground is forced to be engaged in the work ... oneself least liked "
……
Then multiple first field types include: " bookname ", " author ", " category ", " words ", " Chapter count ", " cover ", " introduction ", the corresponding field information of multiple first field types includes: " salty Fish turns over note ", " the east is unbeaten ", " city ", " 1964556 ", " 998 ", " http://xxx.com/yyyy.GIF ", " university and finishes The Zhang San of industry is frame by the friend of oneself, falls into disrepute in the key promoted, and is forced to be engaged in the work oneself least liked Make ... "
The text information of first text is imported textual resources system by S103, and textual resources system is used to show to user The text information of first text.
Here, textual resources system can store text information.User can view by textual resources system The text information of one text can specifically view text ID, text title, the text author, text classification, text of the first text This chapter section name, text number of words, content of text etc..In a kind of possible situation, textual resources system can be novel resource system System, such as can be the corresponding novel resources bank of XX novel software, the corresponding novel resources bank of small routine.
In the specific implementation, the corresponding relationship of target interface identification model Yu textual resources system can be pre-established.Wherein, Target interface identification model contains multiple field types, and textual resources system contains multiple service fields types, that is, establishes The corresponding relationship of multiple service fields types in multiple field types and textual resources system.
Here, corresponding relationship can be mapping relations, according to the field type and target in target interface identification model The mapping relations of interface identification model and textual resources system, determination has with the field type in target interface identification model reflects The service fields type in the textual resources system of relationship is penetrated, and the corresponding field of the first field type that identification is obtained is believed Breath, is mapped to field information corresponding to service fields type corresponding with field type in target interface identification model, thus The text information for being accomplished by the first text that target interface identification model identifies imports textual resources system.
For example, the first field type in target interface identification model is that text title corresponds in textual resources system Service fields type is text title;The first field type in target interface identification model is that text classification is provided corresponding to text Service fields type in the system of source is text classification;The first field type in target interface identification model is text number of words pair It should be text number of words in the service fields type in textual resources system.First identified by target interface identification model Field type and the corresponding field information of the first field type are respectively " bookname ": " cured fish turns over note ", " Category ": " city ", " book words ": " 1964556 ", then " text title " corresponding field in textual resources system Information is " cured fish turns over note ", " text classification " corresponding field information is " cured fish turns over note ", text in textual resources system " text number of words " corresponding field information is " 1964556 " in resource system.
Optionally, the type transformational relation that can also establish target interface identification model Yu textual resources system, that is, establish The type transformational relation of multiple field types and multiple service fields types in textual resources system in target interface identification model, Include:
Such as the first text information of the first text obtained after identification is " book words ": " 1964556 " are identified " 1964556 " come are character type, business word in textual resources system corresponding with text number of words in target interface identification model Section is " text number of words ", and " text number of words " corresponding field information should be numeric type in textual resources system, then corresponding relationship Turn numeric type for character type, then " text number of words " corresponding field information is numeric type 982278 rather than character type " 1964556 ".
For another example, the first text information of the first text obtained after identification is " date ": " 2019/1/14 " connects with target Service fields are " date " in text date corresponding textual resources system in mouth identification model, in textual resources system " date " Corresponding field information should be numeric type, then corresponding relationship is that character type turns date type, then " date " corresponding field information For date type on January 14th, 2019 rather than character type " 2019/1/14 ".Optionally, the date can be listing date, the text of text This writing target date etc..
In a kind of possible situation, the text information of the first text identified are as follows:
" know no know that no should be flourishing leaves and withering flowers ",
" author ": " being concerned about then disorderly ",
" describing love affairs ",
" words ": " 1964556 ",
……
Wherein, " know no know that no should be flourishing leaves and withering flowers " is the different corresponding fields of two the first field types from " describing love affairs " Information, the type of the two are all that character type, length are all in 1~50 character, then can be by the field information more than character length It is determined as the corresponding field information of text title, the field information that character length is lacked is determined as the corresponding field letter of text classification " know no know that no should be flourishing leaves and withering flowers " is determined as text title, " describing love affairs " is determined as text classification by breath.Optionally, literary This importing personnel can manually adjust the corresponding field information of service fields type in textual resources system, with determination It is accurate that the corresponding field information of service fields type in textual resources system imports.
In the embodiment of the present invention, according to target interface identification model to the first interface information of the first text got into Row identification obtains the text information of the first text, and the text information of the first text is imported textual resources system.Pass through target Interface identification model automatically identifies the first interface information of the first text, obtains the text information of the first text and automatic The text information of first text is imported into textual resources system, eliminates the first interface information in the first text of identification every time When manual operation develop corresponding adaptation code, improve the efficiency that text imports textual resources system, the user experience is improved.
It in one possible implementation, whether can be to know for the first time by judging the first interface information of the first text Not, so that it is determined that the target interface identification model of the first interface information of the first text of identification, can be improved the effect of text identification Rate and accuracy rate.Referring to fig. 2, Fig. 2 is the flow diagram for another text introduction method that inventive embodiments provide, and is such as schemed It is shown, this method comprises:
S201 obtains the first interface information of the first text.
The method of the specific first interface information for obtaining the first text is no longer superfluous herein referring to the description in step S101 It states.
S202 obtains the mark of the first provider of the first text.
Here, the mark of the first provider of the first text is for uniquely indicating the first provider of first text. Specifically, the mark of the first provider of first text can be the title of the first provider of the first text, first text The title abbreviation of this first provider, first text one of the mark such as the icon of the first provider or a variety of.
S203 judges that the first provider identifies whether to be contained in identified provider's logo collection.
S204 obtains the first text if it is not, then identifying by basic interface identification model to first interface information Text information.
Here, the mark of any one provider in identified provider's logo collection is used to indicate to provider The interface message of text execute identification, basic interface identification model can be used for the interface message of the text of different providers into Row identifies, includes the feature that the text of all providers may include in basic interface identification model.
Specifically, if the mark of the first provider is not included in identified provider's logo collection, then it represents that first The interface message of the text of provider is to identify for the first time, then is known by basic interface identification model to first interface information Not.After carrying out first time identification to first interface information by basic interface identification model, by the mark of first provider It is added in identified provider's logo collection.Here, basic interface identification model can be the target interface in step S102 Identification model, especially by the method for target interface identification model identification first interface information referring to the description in step S102, Details are not described herein again.
In the case where the interface message of the text of the first provider is to identify for the first time, pass through basic interface identification model pair First interface information is identified, can also be in the text information according to the first text after obtaining the text information of the first text Multiple first field types and multiple first field types identification parameter, generate first interface identification model.In other words, First interface identification model includes common to all texts with text provider corresponding to the first interface identification model Interface message.Field type in the corresponding first interface identification model of different providers often includes that character may be different, such as Field type is text title in the first interface identification model A of provider A, then text title often includes that character may be Bookname, field type are text classification, then it may be cate, field type be text word that text classification, which often includes character, Number, then it may be words that text number of words, which often includes character,.Field type is text in the first interface identification model B of provider B Title, then it may be name, field type be text classification that text title, which often includes character, then text classification often include character can Energy be category, field type is text number of words, then it may be count that text number of words, which often includes character,.
Optionally, after identification generates first interface identification model for the first time, the first provider can also be established and connect with first The mark of first provider, is specifically mapped by the corresponding relationship of mouth identification model with first interface identification model, so as to In the case where needing to identify the interface message of other texts of first provider, it can be looked into according to the mark of the first provider Look for corresponding first interface identification model.
Here, the corresponding first interface identification model of the mark of each first provider, by determining the first provider Mark can determine first interface identification model corresponding with the mark, pass through first interface identification model identification first provide The interface message of the text of side, the text information of the available text.
The text that each provider is identified by basic interface identification model may be implemented by the method for step S201-S204 This interface message obtains the text information of the first text of each provider, and then generates each provider corresponding first Interface identification model.
Citing is to be illustrated, such as the side of being provided with A, B, C, by basic interface identification model identify respectively provider A, B, the interface message for the text that C is provided respectively, the corresponding first field type A1~A8 of the side of being provided A and the first field class The corresponding first field type B1~B7 of identification parameter a1~a8, provider B of type and the identification parameter of the first field type Identification parameter c1~c7 of the corresponding first field type C1~C9 of b1~b7, provider C and the first field type.By mentioning Identification parameter a1~a8 of the corresponding first field type A1~A8 of supplier A and the first field type generates first interface identification Model A, it is generated by identification parameter b1~b7 of the corresponding first field type B1~B7 of provider B and the first field type First interface identification model B, joined by the identification of the corresponding first field type C1~C9 of provider C and the first field type Number c1~c9 generates first interface identification model C;The then corresponding relationship of the first provider and first interface identification model are as follows: provide Square A corresponds to first interface identification model A, provider B corresponds to first interface identification model B, provider C corresponds to first interface identification The corresponding first interface identification model of each provider can be obtained in MODEL C as a result,.
S205, if so, obtaining the first interface identification model that there is corresponding relationship with the first provider.
Here, if the mark of the first provider is contained in identified provider's logo collection, then it represents that first provides The interface message of the text of side identifies that the first interface that then obtaining has corresponding relationship with first provider identifies to be non-for the first time Model is the provider B in step S204 as the first provider B corresponds to first interface identification model B, then has with provider B The first interface identification model for having corresponding relationship is first interface identification model B.
First interface identification model is determined as target interface identification model by S206.
For example, the first interface identification model B to illustrate in step S205 is determined as target interface identification model.
S207 is identified by first interface information of the target interface identification model to the first text, obtains the first text This text information.
Here, identify the method for the first interface information of the first text referring to step especially by target interface identification model Description in S102, details are not described herein again.
For a kind of possible implementation of the present embodiment, passing through target interface identification model to first interface information It is unidentified in the case where at least one corresponding field information of mandatory field type, output identification exception information, identification is abnormal Information includes unidentified at least one mandatory field type arrived.Under this case, optionally, target interface identification model is base When this interface identification model, it may include multiple mandatory fields and multiple optionally field types in basic interface identification model, When target interface identification model is first interface identification model, it also may include multiple mandatory fields in first interface identification model With multiple optionally field types.
Here, mandatory field type can for text ID, text title, text author, text classification, text chapters and sections name, Text number of words, content of text etc. are used to describe the essential feature of text;Identification exception information can be to show in the display interface " identification of XX field type is abnormal ", identification exception information may be by voice prompting " identification of XX field type is abnormal ".Example Such as, pass through the target interface identification model corresponding field information of content of text into mandatory field unidentified to first interface information In the case where, output identification exception information " content of text identification is abnormal ", as shown in figure 3, Fig. 3 is provided in an embodiment of the present invention A kind of identification exception information display interface schematic diagram.
Optionally, multiple field types in target interface identification model can also include multiple optionally field types, Such as the inessential feature of text, such as text reading person's information, i.e. whether text reading user is member or text reading person Age etc., i.e. text reading user must be in some age ranges, such as age range can be [8,60] etc..In such case Under, at least one optionally corresponding field of field type is arrived by the way that target interface identification model is unidentified to first interface information In the case where information, output identification exception information can be determined whether as the case may be.Such as it can be according to the optionally word The importance of segment type determines whether output identification exception information.
The text information of first text is imported textual resources system by S208, and textual resources system is used to show to user The text information of first text.
Here, the text information of the first text is specifically imported to method the retouching referring to step S103 of textual resources system It states, details are not described herein again.
In the embodiment of the present invention, by judging that the first provider of the first text identifies whether to be contained in identified mention In supplier's logo collection, so that it is determined that whether the interface message of the text of the first provider is to identify for the first time, in the first provider The interface message of text be in the case where identifying for the first time, to be generated according to the text information of the first text recognized and mentioned with first The corresponding first interface identification model of supplier, since the first interface identification model has the text of the first text of the first provider The feature of this information, so identifying that the text information of the first text of the first provider can mention using first interface identification model High recognition efficiency is identifying connecing for the text of the first provider by first interface identification model so that recognition result is more acurrate When message ceases, it is unidentified go out the corresponding field information of all mandatory fields in the case where, output for it is unidentified go out essential word The identification exception information of section can make text import personnel and understand current identification abnormal cause to be adjusted correspondingly, mention High text identification efficiency and text import the efficiency in text information library.
In one possible implementation, text introduction method can be applied in the structure chart of following system.Wherein, The first system construction drawing can with as shown in figure 4, Fig. 4 be a kind of text introduction method provided in an embodiment of the present invention example Figure includes the first provider 401, text gatherer 402 in text import system, wherein textual resources system 402 includes Text gatherer 4021.Firstly, the first provider 401 provides the first interface information of the first text;Then, text imports Device 4021 obtains the first interface information of the first text, passes through the target interface identification model pair in text gatherer 4021 The first interface information of first text is identified, the text information of the first text is obtained;Finally, text gatherer 4021 The text information of first text is imported into text according to the text information of the first text and the corresponding relationship of textual resources system 402 The corresponding field information of each service fields type in resource system 402 realizes that text imports textual resources system 402.
Second of system construction drawing can be as shown in figure 5, Fig. 5 be another text importing side provided in an embodiment of the present invention The exemplary diagram of method includes the first provider 501, text gatherer 502 and textual resources system in text import system 503.Firstly, the first provider 501 provides the first interface information of the first text;Then, text gatherer 502 obtains the The first interface information of one text, by the target interface identification model in text gatherer 502 to the first of the first text Interface message is identified, the text information of the first text is obtained;Finally, according to the target interface in text gatherer 502 The corresponding relationship of identification model and textual resources system 503 imports the text information of the first text and target interface identification model The corresponding field information of each service fields type in corresponding textual resources system 503 realizes that text imports textual resources System 503.
The present embodiments relate to text gatherer can be the equipment for having processing capacity, such as: tablet computer, The equipment such as mobile phone, electronic reader, personal computer (Personal Computer, PC), laptop, server;Or It can be the text import modul being embedded in textual resources system.It is not limited in the embodiment of the present invention.
The method of inventive embodiments is described above, the device of inventive embodiments is described below.
It is a kind of structural schematic diagram of text gatherer provided in an embodiment of the present invention, the device packet referring to Fig. 6, Fig. 6 It includes:
Interface message obtains module 601, for obtaining the first interface information of the first text;
Interface message identification module 602, for being known by target interface identification model to the first interface information Not, the text information of first text is obtained, the text information includes multiple first field types and the multiple first The corresponding field information of each first field type, the target interface identification model contain multiple field classes in field type The identification parameter of each field type in type and the multiple field type;
Text information import modul 603, it is described for the text information of first text to be imported textual resources system Textual resources system is used to show the text information of first text to user.
In a kind of possible design, the target interface identification model is basic interface identification model, described to connect substantially Mouth identification model can be used for identifying the interface message of the text of different providers;
The interface message identification module 602, the mark of the first provider specifically for obtaining first text;
The interface message identification module 602, if the mark specifically for first provider is not included in and has identified Provider's logo collection in, then the first interface information is identified by basic interface identification model, is obtained described The text information of first text;
Wherein, the mark of any one provider in identified provider's logo collection is used to indicate to institute The interface message for stating the text of provider executes identification.
In a kind of possible design, described device further include:
First model generation module 604, for according to the multiple first field type and the multiple first field class The identification parameter of type generates first interface identification model;
The first model generation module 604, is also used to establish first provider and the first interface identifies mould The corresponding relationship of type.
In a kind of possible design, described device 60 further include:
Object module determining module 605, the mark of the first provider for obtaining first text;
The object module determining module 605, if the mark for being also used to first provider is contained in identified mention In supplier's logo collection, the first interface identification model that there is corresponding relationship with first provider is obtained;
The object module determining module 605 is also used to the first interface identification model being determined as the target to connect Mouth identification model.
In a kind of possible design, multiple field types in the target interface identification model include multiple essential words Segment type;
Described device 60 further include:
Exception information output module 606, for by the target interface identification model to the first interface information It is unidentified in the case where at least one corresponding field information of mandatory field type, output identification exception information, the identification Exception information includes unidentified at least one mandatory field type arrived.
It should be noted that unmentioned content can be found in the description of embodiment of the method in the corresponding embodiment of Fig. 6, here It repeats no more.
In the embodiment of the present invention, according to target interface identification model to the first interface information of the first text got into Row identification obtains the text information of the first text, and the text information of the first text is imported textual resources system.Pass through target Interface identification model automatically identifies the first interface information of the first text, obtains the text information of the first text and automatic The text information of first text is imported into textual resources system, eliminates the first interface information in the first text of identification every time When manual operation develop corresponding adaptation code, improve the efficiency that text imports textual resources system;By judging the first text This first provider's identifies whether to be contained in identified provider's logo collection, so that it is determined that the text of the first provider Whether this interface message is to identify for the first time, raw in the case where the interface message of the text of the first provider is to identify for the first time At first interface identification model corresponding with the first provider, since the first interface identification model has the of the first provider The feature of the text information of one text, so identifying the text of the first text of the first provider using first interface identification model Recognition efficiency can be improved in information, so that recognition result is more acurrate;The first provider is being identified by first interface identification model Text interface message when, it is unidentified go out the corresponding field information of all mandatory fields in the case where, output is for unidentified The identification exception information of mandatory field out, so that it may so that text imports personnel and understands current identification abnormal cause to carry out phase The adjustment answered, improves text identification efficiency and text imports the efficiency of textual resources system.
It is the structural schematic diagram that a kind of text provided in an embodiment of the present invention imports equipment, the equipment 70 referring to Fig. 7, Fig. 7 Including processor 701, memory 702 and input/output interface 703.Processor 701 is connected to memory 702 and input and output Interface 703, such as processor 701 can be connected to memory 702 and input/output interface 703 by bus.
Processor 701 is configured as that the text is supported to import equipment and execute in text introduction method described in Fig. 1-Fig. 2 Corresponding function.The processor 701 can be central processing unit (central processing unit, CPU), network processes Device (network processor, NP), hardware chip or any combination thereof.Above-mentioned hardware chip can be dedicated integrated electricity Road (application specific integrated circuit, ASIC), programmable logic device (programmable Logic device, PLD) or combinations thereof.Above-mentioned PLD can be Complex Programmable Logic Devices (complex Programmable logic device, CPLD), field programmable gate array (field-programmable gate Array, FPGA), Universal Array Logic (generic array logic, GAL) or any combination thereof.
702 memory of memory is for storing program code etc..Memory 702 may include volatile memory (volatile memory, VM), such as random access memory (random access memory, RAM);Memory 702 It may include nonvolatile memory (non-volatile memory, NVM), such as read-only memory (read-only Memory, ROM), flash memory (flash memory), hard disk (hard disk drive, HDD) or solid state hard disk (solid-state drive, SSD);Memory 702 can also include the combination of the memory of mentioned kind.
The input/output interface 703 is for input or output data.
Processor 701 can call said program code to execute following operation:
Obtain the first interface information of the first text;
The first interface information is identified by target interface identification model, obtains the text of first text Information, the text information include each first field type in multiple first field types and the multiple first field type Corresponding field information, the target interface identification model contain each in multiple field types and the multiple field type The identification parameter of field type;
The text information of first text is imported into textual resources system, the textual resources system is used for user's exhibition Show the text information of first text.
It should be noted that realizing for each operation can also corresponding description to should refer to above method embodiment;Institute Other operations executed in above method embodiment can also be cooperated with input/output interface 703 by stating processor 701.
The embodiment of the present invention also provides a kind of computer storage medium, and the computer storage medium is stored with computer journey Sequence, the computer program include program instruction, and described program instruction executes the computer such as Method described in previous embodiment, the computer can import a part of equipment for text mentioned above.On for example, The processor 701 stated.
Those of ordinary skill in the art will appreciate that realizing all or part of the process in above-described embodiment method, being can be with Relevant hardware is instructed to complete by computer program, the program can be stored in computer-readable storage medium In, the program is when being executed, it may include such as the process of the embodiment of above-mentioned each method.Wherein, the storage medium can be magnetic Dish, CD, ROM or RAM etc..
The above disclosure is only the preferred embodiments of the present invention, cannot limit the right model of the present invention with this certainly It encloses, therefore equivalent changes made in accordance with the claims of the present invention, is still within the scope of the present invention.

Claims (10)

1. a kind of text introduction method characterized by comprising
Obtain the first interface information of the first text;
The first interface information is identified by target interface identification model, obtains the text envelope of first text Breath, the text information includes each first field type pair in multiple first field types and the multiple first field type The field information answered, the target interface identification model contain each word in multiple field types and the multiple field type The identification parameter of segment type;
The text information of first text is imported into textual resources system, the textual resources system is used to show institute to user State the text information of the first text.
2. the method according to claim 1, wherein the target interface identification model is that basic interface identifies mould Type, the basic interface identification model can be used for identifying the interface message of the text of different providers;
It is described that the first interface information is identified by target interface identification model, obtain the text of first text Information, comprising:
Obtain the mark of the first provider of first text;
If the mark of first provider is not included in identified provider's logo collection, identified by basic interface Model identifies the first interface information, obtains the text information of first text;
Wherein, the mark of any one provider in identified provider's logo collection is used to indicate mentions to described The interface message of the text of supplier executes identification.
3. according to the method described in claim 2, it is characterized in that, the method also includes:
According to the identification parameter of the multiple first field type and the multiple first field type, first interface identification is generated Model;
Establish the corresponding relationship of first provider Yu the first interface identification model.
4. the method according to claim 1, wherein described connect by target interface identification model to described first Message breath is identified, before obtaining the text information of first text, further includes:
Obtain the mark of the first provider of first text;
If the mark of first provider is contained in identified provider's logo collection, obtain and first provider First interface identification model with corresponding relationship;
The first interface identification model is determined as the target interface identification model.
5. according to the method described in claim 4, it is characterized in that, multiple field types in the target interface identification model Including multiple mandatory field types;
The method also includes:
At least one mandatory field type is arrived by the way that the target interface identification model is unidentified to the first interface information In the case where corresponding field information, output identification exception information, the identification exception information includes unidentified at least one arrived A mandatory field type.
6. a kind of text gatherer characterized by comprising
Interface message obtains module, for obtaining the first interface information of the first text;
Interface message identification module is obtained for being identified by target interface identification model to the first interface information The text information of first text, the text information include multiple first field types and the multiple first field type In the corresponding field information of each first field type, the target interface identification model contains multiple field types and described The identification parameter of each field type in multiple field types;
Text information import modul, for the text information of first text to be imported textual resources system, the text money Source system is used to show the text information of first text to user.
7. device according to claim 6, which is characterized in that the target interface identification model is that basic interface identifies mould Type, the basic interface identification model can be used for identifying the interface message of the text of different providers;
The interface message identification module, the mark of the first provider specifically for obtaining first text;
The interface message identification module, if being not included in identified provider specifically for the mark of first provider In logo collection, then the first interface information is identified by basic interface identification model, obtain first text Text information;
Wherein, the mark of any one provider in identified provider's logo collection is used to indicate mentions to described The interface message of the text of supplier executes identification.
8. device according to claim 7, which is characterized in that described device further include:
First model generation module, for the identification according to the multiple first field type and the multiple first field type Parameter generates first interface identification model;
It is corresponding with the first interface identification model to be also used to establish first provider for the first model generation module Relationship.
9. a kind of text imports equipment, including processor, memory and input/output interface, the processor, memory and Input/output interface is connected with each other, wherein the input/output interface is for input or output data, and the memory is for depositing Program code is stored up, the processor executes the method according to claim 1 to 5 for calling said program code.
10. a kind of computer storage medium, which is characterized in that the computer storage medium is stored with computer program, described Computer program includes program instruction, and described program instruction makes the processor execute such as claim when being executed by a processor The described in any item methods of 1-5.
CN201910359179.2A 2019-04-29 2019-04-29 Text importing method, device and equipment Active CN110083839B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910359179.2A CN110083839B (en) 2019-04-29 2019-04-29 Text importing method, device and equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910359179.2A CN110083839B (en) 2019-04-29 2019-04-29 Text importing method, device and equipment

Publications (2)

Publication Number Publication Date
CN110083839A true CN110083839A (en) 2019-08-02
CN110083839B CN110083839B (en) 2023-08-22

Family

ID=67417937

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910359179.2A Active CN110083839B (en) 2019-04-29 2019-04-29 Text importing method, device and equipment

Country Status (1)

Country Link
CN (1) CN110083839B (en)

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102982011A (en) * 2011-09-07 2013-03-20 百度在线网络技术(北京)有限公司 Method and device for identifying out-of-sequence texts
US20140236971A1 (en) * 2013-01-23 2014-08-21 Splunk Inc. Real Time Indication Of Previously Extracted Data Fields For Regular Expressions
CN104462716A (en) * 2014-12-23 2015-03-25 北京理工大学 Method for designing brain-computer interface parameters and kinetic parameters of brain controlled vehicle based on human-vehicle-road model
CN105607938A (en) * 2015-12-30 2016-05-25 中国银联股份有限公司 Method for allocating interface elements of security applications
CN106293727A (en) * 2016-08-04 2017-01-04 深圳市微我科技有限公司 A kind of method of shared wisdom based on tables of data
CN107251030A (en) * 2015-02-09 2017-10-13 皇家飞利浦有限公司 It is used as the wearable device of service
CN108279885A (en) * 2017-01-03 2018-07-13 中国航发商用航空发动机有限责任公司 A kind of method and device that multiple model codes are carried out with Integrated Simulation
CN108304368A (en) * 2017-04-20 2018-07-20 腾讯科技(深圳)有限公司 The kind identification method and device and storage medium and processor of text message
US20180322509A1 (en) * 2017-05-05 2018-11-08 Servicenow, Inc. Identifying clusters for service management operations
CN108829882A (en) * 2018-06-27 2018-11-16 深圳乐信软件技术有限公司 Formation gathering method, device, terminal and medium
CN109189666A (en) * 2018-08-02 2019-01-11 腾讯科技(北京)有限公司 Interface test method, device and computer equipment
CN109388675A (en) * 2018-10-12 2019-02-26 平安科技(深圳)有限公司 Data analysing method, device, computer equipment and storage medium

Patent Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102982011A (en) * 2011-09-07 2013-03-20 百度在线网络技术(北京)有限公司 Method and device for identifying out-of-sequence texts
US20140236971A1 (en) * 2013-01-23 2014-08-21 Splunk Inc. Real Time Indication Of Previously Extracted Data Fields For Regular Expressions
CN104462716A (en) * 2014-12-23 2015-03-25 北京理工大学 Method for designing brain-computer interface parameters and kinetic parameters of brain controlled vehicle based on human-vehicle-road model
CN107251030A (en) * 2015-02-09 2017-10-13 皇家飞利浦有限公司 It is used as the wearable device of service
CN105607938A (en) * 2015-12-30 2016-05-25 中国银联股份有限公司 Method for allocating interface elements of security applications
CN106293727A (en) * 2016-08-04 2017-01-04 深圳市微我科技有限公司 A kind of method of shared wisdom based on tables of data
CN108279885A (en) * 2017-01-03 2018-07-13 中国航发商用航空发动机有限责任公司 A kind of method and device that multiple model codes are carried out with Integrated Simulation
CN108304368A (en) * 2017-04-20 2018-07-20 腾讯科技(深圳)有限公司 The kind identification method and device and storage medium and processor of text message
US20180322509A1 (en) * 2017-05-05 2018-11-08 Servicenow, Inc. Identifying clusters for service management operations
CN108829882A (en) * 2018-06-27 2018-11-16 深圳乐信软件技术有限公司 Formation gathering method, device, terminal and medium
CN109189666A (en) * 2018-08-02 2019-01-11 腾讯科技(北京)有限公司 Interface test method, device and computer equipment
CN109388675A (en) * 2018-10-12 2019-02-26 平安科技(深圳)有限公司 Data analysing method, device, computer equipment and storage medium

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
C.J.HUTTO 等: ""A Parsimonious Rule-based Model for Sentiment Analysis of Social Media Text"", 《PROCEEDINGS OF THE EIGHTH INTERNATIONAL AAAI CONFERENCE ON WEBLOGS AND SOCIAL MEDIA》 *
丁俊等: "大数据时代下的动态可配置数据采集系统的研究与设计", 《计算机应用与软件》 *
朱会峰: ""Deep Web查询接口模式抽取研究"", 《中国优秀硕士学位论文全文数据库 (信息科技辑)》 *
郭艳: ""医院排队管理系统软件的设计和开发"", 《中国优秀硕士学位论文全文数据库 (信息科技辑)》 *

Also Published As

Publication number Publication date
CN110083839B (en) 2023-08-22

Similar Documents

Publication Publication Date Title
CN110147402A (en) Excel file introduction method and equipment, deriving method and equipment
CN103546446B (en) Phishing website detection method, device and terminal
CN106528791B (en) A kind of method and device of sending out notice message
US8145716B2 (en) Method and apparatus for assigning cost metrics to electronic messages
CN108268323A (en) User Defined Resource in resource stack
US10489748B2 (en) Managing the generation of text messages
CN106708912B (en) Junk file identification and management method, identification device, management device and terminal
CN103065625A (en) Method and device for adding digital voice tag
CN102546668A (en) Method, device and system for counting unique visitors
CN110232156B (en) Information recommendation method and device based on long text
CN109598526A (en) The analysis method and device of media contribution
CN109669678A (en) Template engine integration method, device, electronic equipment and storage medium
CN109032693A (en) Method and device for loading display information, electronic equipment and readable storage medium
CN110083839A (en) Text introduction method, device and equipment
CN114840634B (en) Information storage method and device, electronic equipment and computer readable medium
US20170324800A1 (en) Adding contextual clarity to shared links
CN110413279A (en) Data load method and device
US9159044B2 (en) Notification system based on intelligent mail barcodes
CN110647568B (en) Method and device for converting graph database data into programming language data
Darvas et al. Will European Union recovery spending be enough to fill digital investment gaps?
JPWO2011052025A1 (en) Data processing apparatus, data processing method, and program
CN104361094A (en) Storage method and device for file in search result, and browser client
CN104699765B (en) A kind of date storage method and mobile terminal
CN108460159B (en) Information reply method, terminal equipment and computer readable storage medium
CN115280298A (en) Preventing disclosure of sensitive information

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant