CN109726266A - Note signature processing method, equipment and computer readable storage medium - Google Patents

Note signature processing method, equipment and computer readable storage medium Download PDF

Info

Publication number
CN109726266A
CN109726266A CN201811574935.5A CN201811574935A CN109726266A CN 109726266 A CN109726266 A CN 109726266A CN 201811574935 A CN201811574935 A CN 201811574935A CN 109726266 A CN109726266 A CN 109726266A
Authority
CN
China
Prior art keywords
enterprise
name
information
prediction result
black
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811574935.5A
Other languages
Chinese (zh)
Inventor
段雨洛
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
ZHUHAI XIAOYUAN TECHNOLOGY Co Ltd
Original Assignee
ZHUHAI XIAOYUAN TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ZHUHAI XIAOYUAN TECHNOLOGY Co Ltd filed Critical ZHUHAI XIAOYUAN TECHNOLOGY Co Ltd
Priority to CN201811574935.5A priority Critical patent/CN109726266A/en
Publication of CN109726266A publication Critical patent/CN109726266A/en
Pending legal-status Critical Current

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses the signing messages that a kind of note signature processing method, equipment and computer readable storage medium, the method pass through acquisition short message;The corresponding black and white lists prediction result of the signing messages is obtained by the black and white lists network model pre-established according to the signing messages, it is tentatively whether effective to signing messages is distinguished;When the black and white lists prediction result is white list, the corresponding enterprise's name prediction result of the signing messages is obtained by the enterprise's name network model pre-established according to the signing messages;Enterprise name prediction result is successively matched with the enterprise information for prestoring in the database, to obtain from the database and the matched enterprise's name information of enterprise's name prediction result and its corresponding trade classification information, go out enterprise's alias, trade classification by note signature information excavating, improves the utilization rate of note signature information.

Description

Note signature processing method, equipment and computer readable storage medium
Technical field
The present invention relates to short breath process field more particularly to a kind of note signature processing methods, equipment and computer-readable Storage medium.
Background technique
Short message erects effective channel as the carrier for transmitting information between magnanimity client, for communication to each other.Along with Short message is widely used, and how more efficiently to utilize, excavates short breath data, becomes the research neck popular for a comparison Domain.
Currently, for enterprise-class short message, it generally can be in the beginning of short message content or ending often with some enterprise's names The identification information about the enterprise such as title, i.e., so-called note signature information.But since the note signature data of enterprise-class are deposited The features such as information content is fewer, signature contents itself are more chaotic, cause signing messages utilization rate not high.
Summary of the invention
It can in view of the above-mentioned problems, the purpose of the present invention is to provide a kind of note signature processing method, equipment and computers Storage medium is read, enterprise's alias, trade classification can be identified by note signature information, improves the benefit of note signature information With rate.
In a first aspect, the embodiment of the invention provides a kind of note signature processing methods, comprising the following steps:
Obtain the signing messages of short message;
It is corresponding to be obtained by the black and white lists network model pre-established for the signing messages according to the signing messages Black and white lists prediction result;
When the black and white lists prediction result is white list, according to the signing messages, pass through the enterprise pre-established Name network model obtains the corresponding enterprise's name prediction result of the signing messages;
Enterprise name prediction result is successively matched with the enterprise information for prestoring in the database, with from described It is obtained in database and the matched enterprise's name information of enterprise's name prediction result and its corresponding trade classification information.
Preferably, described successively to carry out enterprise name prediction result with the enterprise's name information for prestoring in the database Match, to obtain from the database and the matched enterprise's name information of enterprise's name prediction result and its corresponding trade classification Information specifically includes:
Judge whether enterprise name prediction result matches with the enterprise information for prestoring in the database;
If so, obtaining from the database and the matched enterprise information of enterprise name prediction result and its corresponding Trade classification information;
If it is not, enterprise's name prediction result and the enterprise's name information prestored in the database are successively carried out similarity meter It calculates, from the database acquisition and the highest enterprise's name information of enterprise's name prediction result similarity, and according to the label Name data obtain trade classification information by the trade classification network model pre-established.
Preferably, the method also includes:
When enterprise's name prediction result is less than preset threshold with the similarity for prestoring enterprise's name information in the database When value, the signing messages is ended processing.
Preferably, the method also includes following black and white lists network model construction steps:
Dictionary is constructed to signature sample gathered in advance;Wherein, the signature sample include original signature information, with it is described The corresponding black and white lists classification information of original signature information, enterprise's name information, trade classification coding;The dictionary includes the original Mapping relations in beginning signing messages between each word and number;
Black and white lists classified dictionary is constructed to signature sample gathered in advance;Wherein, the black and white lists classified dictionary packet Include the mapping relations between the corresponding black and white lists classification information of the original signature information and number;
According to the dictionary, the black and white lists classified dictionary, by the original signature information and its corresponding black and white name Single classification information is converted to Serial No., as the first Serial No.;
First Serial No. is trained by convolutional neural networks model, establishes the black and white lists network mould Type.
Preferably, described that the label are obtained by the black and white lists network model pre-established according to the signing messages The corresponding black and white lists prediction result of name information, specifically includes:
According to the dictionary, the signing messages is converted into Serial No., as the second Serial No.;
Using second Serial No. as the input value of the black and white lists network model, to obtain the signing messages Corresponding black and white lists prediction result.
Preferably, the method also includes following enterprise's name network model construction steps:
To signature sample construction enterprise name dictionary;Wherein, enterprise's name dictionary includes the original signature information Mapping relations between corresponding enterprise's name information and number;
According to the dictionary, enterprise name dictionary, the original signature information and its corresponding enterprise name information are turned It is changed to Serial No., as third Serial No.;
The third Serial No. is trained by neural translation model, establishes enterprise's name network model.
Preferably, described that enterprise name prediction result is successively subjected to phase with the enterprise information for prestoring in the database It is calculated like degree, from the database acquisition and the highest enterprise's name information of enterprise's name prediction result similarity, and according to The signed data is obtained trade classification information, is specifically included by the trade classification network model pre-established:
Each word in enterprise name prediction result, enterprise name information is converted into word vector respectively, and according to The frequency of each word calculates enterprise name prediction result, the corresponding word weight of each word in enterprise name information;
According to the word vector of enterprise's name prediction result and its word vector of corresponding word weight, enterprise name information And its corresponding word weight, sentence vector, the enterprise name information of enterprise's name prediction result are calculated separately using SIF algorithm Sentence vector;
The sentence vector of sentence vector and enterprise name information to enterprise's name prediction result carries out cosine similarity meter It calculates, from the database acquisition and the highest enterprise's name information of enterprise's name prediction result cosine similarity, and according to institute Signed data is stated, by the trade classification network model pre-established, obtains trade classification information.
Preferably, the method also includes following trade classification network model construction steps:
Trade classification dictionary is constructed to signature sample gathered in advance;The trade classification dictionary includes the trade classification Mapping relations between coding and number;
According to the dictionary, the trade classification dictionary, the original signature information and its corresponding trade classification are compiled Code is converted to Serial No., as the 4th Serial No.;
The 4th Serial No. is trained by convolutional neural networks model, establishes the trade classification network mould Type.
Second aspect, the embodiment of the invention provides a kind of note signature processing equipment, including processor, memory and The computer program executed by the processor is stored in the memory and is configured as, the processor executes the meter The note signature processing method as described in any one of first aspect is realized when calculation machine program.
The third aspect, the embodiment of the invention provides a kind of computer readable storage medium, the computer-readable storage Medium includes the computer program of storage, wherein controls the computer-readable storage medium in computer program operation Equipment executes the note signature processing method as described in any one of first aspect where matter.
Above embodiments have the following beneficial effects:
By the signing messages for obtaining short message;According to the signing messages, pass through the black and white lists network mould pre-established Whether type obtains the corresponding black and white lists prediction result of the signing messages, tentatively effective to signing messages is distinguished;When described When black and white lists prediction result is white list, obtained according to the signing messages by the enterprise's name network model pre-established The corresponding enterprise's name prediction result of the signing messages;By enterprise name prediction result with prestore enterprise's name in the database Information is successively matched, with obtain from the database with the matched enterprise's name information of enterprise name prediction result and its Corresponding trade classification information identifies enterprise's alias, trade classification by note signature information, improves note signature information Utilization rate.
Detailed description of the invention
In order to illustrate more clearly of technical solution of the present invention, attached drawing needed in embodiment will be made below Simply introduce, it should be apparent that, the accompanying drawings in the following description is only some embodiments of the present invention, general for this field For logical technical staff, without creative efforts, it is also possible to obtain other drawings based on these drawings.
Fig. 1 is the flow diagram for the note signature processing method that first embodiment of the invention provides.
Fig. 2 is the structural schematic diagram for the note signature processing equipment that second embodiment of the invention provides.
Specific embodiment
Following will be combined with the drawings in the embodiments of the present invention, and technical solution in the embodiment of the present invention carries out clear, complete Site preparation description, it is clear that described embodiments are only a part of the embodiments of the present invention, instead of all the embodiments.It is based on Embodiment in the present invention, it is obtained by those of ordinary skill in the art without making creative efforts every other Embodiment shall fall within the protection scope of the present invention.
Fig. 1 is please referred to, first embodiment of the invention provides a kind of note signature processing method, can be by note signature Equipment is managed to execute, and the following steps are included:
S11 obtains the signing messages of short message;
In embodiments of the present invention, the note signature processing equipment can be computer, mobile phone, tablet computer, notebook electricity Brain or server etc. calculate equipment, the note signature processing method can be used as one of functional module it is integrated with it is described short In letter signature processing equipment, executed by the note signature processing equipment.
S12 obtains the signing messages by the black and white lists network model pre-established according to the signing messages Corresponding black and white lists prediction result;
Since the signing messages of short message is disorderly and unsystematic, by the black and white lists network model that pre-establishes to the signature of short message Information carries out black and white lists classification prediction, distinguishes effective signing messages and invalid signature information, effective signing messages is pre- It surveys and is predicted to be blacklist for white list, invalid signing messages.When the black and white lists prediction result is blacklist, terminate Handle signing messages.
S13, according to the signing messages, passes through what is pre-established when the black and white lists prediction result is white list Enterprise's name network model obtains the corresponding enterprise's name prediction result of the signing messages;
S14 successively matches enterprise name prediction result with the enterprise information for prestoring in the database, with from It is obtained in the database and the matched enterprise's name information of enterprise's name prediction result and its corresponding trade classification information.
The embodiment of the present invention is few to information content, chaotic note signature information carries out depth excavation, identify enterprise's alias, Trade classification information improves the utilization rate of note signature information.
In an alternative embodiment, it is described by enterprise name prediction result with prestore enterprise's name in the database Information is successively matched, with obtain from the database with the matched enterprise's name information of enterprise name prediction result and its Corresponding trade classification information, specifically includes:
Judge whether enterprise name prediction result matches with the enterprise information for prestoring in the database;
If so, obtaining from the database and the matched enterprise information of enterprise name prediction result and its corresponding Trade classification information;
If it is not, enterprise's name prediction result and the enterprise's name information prestored in the database are successively carried out similarity meter It calculates, from the database acquisition and the highest enterprise's name information of enterprise's name prediction result similarity, and according to the label Name data obtain trade classification information by the trade classification network model pre-established.
In embodiments of the present invention, it for enterprise's name prediction result, needs and enterprise's name information pair existing in database Than;If identical as enterprise's name information in database, enterprise's name information and its associated industry point in output database Category information;If different from enterprise's name information in database, similarity calculation is carried out with enterprise's name information in database, it is defeated Out with enterprise's name prediction result similarity highest enterprise name information and according to the signed data in database, by preparatory The trade classification network model of foundation exports trade classification information.The embodiment of the present invention can simplify the data mining of signing messages Process improves the treatment effeciency of signing messages.
In an alternative embodiment, the method also includes:
When enterprise's name prediction result is less than preset threshold with the similarity for prestoring enterprise's name information in the database When value, the signing messages is ended processing.
In an alternative embodiment, the method also includes following black and white lists network model construction steps:
Dictionary is constructed to signature sample gathered in advance;Wherein, the signature sample include original signature information, with it is described The corresponding black and white lists classification information of original signature information, enterprise's name information, trade classification coding;The dictionary includes the original Mapping relations in beginning signing messages between each word and number;
Black and white lists classified dictionary is constructed to signature sample gathered in advance;Wherein, the black and white lists classified dictionary packet Include the mapping relations between the corresponding black and white lists classification information of the original signature information and number;
According to the dictionary, the black and white lists classified dictionary, by the original signature information and its corresponding black and white name Single classification information is converted to Serial No., as the first Serial No.;
First Serial No. is trained by convolutional neural networks model, establishes the black and white lists network mould Type.
In embodiments of the present invention, for example, signature sample for acquisition, " you " is mapped as 101 in the dictionary, " good " is mapped as 205;White list is mapped as number 0 by the black and white lists classified dictionary, and blacklist is mapped as number 1, according to The dictionary, the black and white lists classified dictionary turn the original signature information and its corresponding black and white lists classification information It is changed to Serial No., and feeds convolutional neural networks (CNN) model and carries out model training, later to unknown signing messages, together Sample is converted to Serial No. according to the dictionary, the black and white lists classified dictionary, and feeds the black and white lists network after training Model obtains the prediction result that it is blacklist or white list, carries out black and white lists prediction by black and white lists network model The identification accuracy that black and white lists can be greatlyd improve, avoids effective signing messages misjudged.
In an alternative embodiment, described according to the signing messages, pass through the black and white lists network pre-established Model obtains the corresponding black and white lists prediction result of the signing messages, specifically includes:
According to the dictionary, the signing messages is converted into Serial No., as the second Serial No.;
Using second Serial No. as the input value of the black and white lists network model, to obtain the signing messages Corresponding black and white lists prediction result.
In an alternative embodiment, the method also includes following enterprise's name network model construction steps:
To signature sample construction enterprise name dictionary;Wherein, enterprise's name dictionary includes the original signature information Mapping relations between corresponding enterprise's name information and number;
According to the dictionary, enterprise name dictionary, the original signature information and its corresponding enterprise name information are turned It is changed to Serial No., as third Serial No.;
The third Serial No. is trained by neural translation model, establishes enterprise's name network model.
In embodiments of the present invention, for example, signature sample for acquisition, " length " is mapped as in enterprise's name dictionary 0, " river " is mapped as 1, according to the dictionary, enterprise name dictionary, by the original signature information and its corresponding enterprise name Information is converted to Serial No., and feeds nerve translation (NMT) model and carry out model training, later to unknown signing messages, Also according to the dictionary, enterprise name dictionary, Serial No. is converted to, and feeds enterprise's name network model after training, Its enterprise name prediction result is obtained, the corresponding enterprise's alias of signing messages can be recognized accurately by enterprise's name network model, The content of depth excavation signing messages.
In an alternative embodiment, it is described by enterprise name prediction result with prestore enterprise's name in the database Information successively carries out similarity calculation, obtains from the database and the highest enterprise of enterprise's name prediction result similarity Name information, and trade classification information, tool are obtained by the trade classification network model pre-established according to the signed data Body includes:
Each word in enterprise name prediction result, enterprise name information is converted into word vector respectively, and according to The frequency of each word calculates enterprise name prediction result, the corresponding word weight of each word in enterprise name information;
According to the word vector of enterprise's name prediction result and its word vector of corresponding word weight, enterprise name information And its corresponding word weight, sentence vector, the enterprise name information of enterprise's name prediction result are calculated separately using SIF algorithm Sentence vector;
The sentence vector of sentence vector and enterprise name information to enterprise's name prediction result carries out cosine similarity meter It calculates, from the database acquisition and the highest enterprise's name information of enterprise's name prediction result cosine similarity, and according to institute Signed data is stated, by the trade classification network model pre-established, obtains trade classification information.
In embodiments of the present invention, first enterprise's name prediction result, enterprise name information are segmented, Zhi Houtong Cross the CBOW algorithm in word2vec to after segmenting the enterprise name prediction result, the enterprise name information in each word turn Be changed to word vector, and calculate word weight according to the frequency that each word occurs, by SIF algorithm to enterprise's name prediction result, Each word vector does weighted average to obtain enterprise's name prediction result, enterprise name information in enterprise's name information The sentence vector of entire sentence calculates the similarity between sentence vector by cosine similarity, and in output database with enterprise's name Prediction result similarity highest enterprise name information and according to the signed data, passes through the trade classification network pre-established Model exports trade classification information.
Calculating for sentence vector, is exemplified below:
(1) it is based on enterprise's name database library, the word frequency is counted by formula a/ (a+v/N);Wherein a is fixed constant, example If a takes 1-3 any number, v is word frequency, and N is total word frequency of all words;Reflecting for each word and its corresponding word weight is calculated Firing table.
(2) enterprise's name information in enterprise's name prediction result and database is added together and is put into mapping table, pass through word Enterprise's name prediction result, enterprise name information are converted to sequence vector respectively by vector;Meanwhile it being reflected by word and weight Enterprise's name prediction result, enterprise name information are converted to weight sequence respectively by firing table.At this point, each enterprise's name is deposited In two sequences.Such as: the sequence vector of " XY " this enterprise isWord weight sequence (0.1 0.2); So word weight matrix has just obtained one (0.05 0.04 0.05) multiplied by vector matrix, then divided by enterprise name length, Obtain " XY " this enterprise weighted average sequence (0.025 0.02 0.025).
(3) after calculating the sequence of all enterprise's name information, the matrix A of a N*3 can be obtained by putting together;Then Singular value matrix B is calculated, final Matrix C is A-B;The sentence vector of each enterprise is just certain a line in Matrix C.
In an alternative embodiment, the method also includes following trade classification network model construction steps:
Trade classification dictionary is constructed to signature sample gathered in advance;The trade classification dictionary includes the trade classification Mapping relations between coding and number;
According to the dictionary, the trade classification dictionary, the original signature information and its corresponding trade classification are compiled Code is converted to Serial No., as the 4th Serial No.;
The 4th Serial No. is trained by convolutional neural networks model, establishes the trade classification network mould Type.
Compared with the existing technology, the beneficial effect that the present invention is implemented is:
1, relative to it is traditional based on the rule pre-established (such as in signing messages include certain keywords it is determined that Blacklist) black and white lists classification is carried out, the embodiment of the present invention is based on CNN model and carries out black and white lists classification, greatly improves The accuracy rate and coverage rate of black and white lists identification;
2, traditional to must manually perform and handle one by one by signature extraction enterprise's name and the corresponding trade classification of association , and the embodiment of the present invention is based on CNN model and NMT model and carries out automatic processing, artificial only needs pair to signing messages The enterprise's name information finally predicted and its corresponding trade classification information carry out review operations, substantially increase the place of signing messages Efficiency is managed, while note signature information few to information content, chaotic carries out depth excavation, identifies enterprise's alias, trade classification Information improves the utilization rate of note signature information.
It referring to fig. 2, is the schematic diagram for the note signature processing equipment that second embodiment of the invention provides.As shown in Fig. 2, should Note signature processing equipment includes: at least one processor 11, such as CPU, at least one network interface 14 or other users Interface 13, memory 15, at least one communication bus 12, communication bus 12 is for realizing the connection communication between these components. Wherein, user interface 13 optionally may include USB interface and other standards interface, wireline interface.Network interface 14 is optional May include Wi-Fi interface and other wireless interfaces.Memory 15 may include high speed RAM memory, it is also possible to also wrap It includes non-labile memory (non-volatilememory), for example, at least a magnetic disk storage.Memory 15 is optional It may include at least one storage device for being located remotely from aforementioned processor 11.
In some embodiments, memory 15 stores following element, executable modules or data structures, or Their subset or their superset:
Operating system 151 includes various system programs, for realizing various basic businesses and hardware based of processing Business;
Program 152.
Specifically, processor 11 executes short described in above-described embodiment for calling the program 152 stored in memory 15 Letter signature processing method, such as step S11 shown in FIG. 1.Alternatively, the processor is realized when executing the computer program State the function of each module/unit in each Installation practice.
Illustratively, the computer program can be divided into one or more module/units, one or more A module/unit is stored in the memory, and is executed by the processor, to complete the present invention.It is one or more A module/unit can be the series of computation machine program instruction section that can complete specific function, and the instruction segment is for describing institute State implementation procedure of the computer program in the note signature processing equipment.
The note signature processing equipment can be desktop PC, notebook, palm PC and cloud server etc. Calculate equipment.The note signature processing equipment may include, but be not limited only to, processor, memory.Those skilled in the art can To understand, the schematic diagram is only the example of note signature processing equipment, does not constitute the limit to note signature processing equipment It is fixed, it may include perhaps combining certain components or different components than illustrating more or fewer components.
Alleged processor 11 can be central processing unit (Central Processing Unit, CPU), can also be Other general processors, digital signal processor (Digital Signal Processor, DSP), specific integrated circuit (Application Specific Integrated Circuit, ASIC), ready-made programmable gate array (Field- Programmable Gate Array, FPGA) either other programmable logic device, discrete gate or transistor logic, Discrete hardware components etc..General processor can be microprocessor or the processor is also possible to any conventional processor Deng the processor 11 is the control centre of the note signature processing equipment, entirely short using various interfaces and connection The various pieces of letter signature processing equipment.
The memory 15 can be used for storing the computer program and/or module, the processor 11 by operation or Computer program and/or the module stored in the memory is executed, and calls the data being stored in memory, is realized The various functions of the note signature processing equipment.The memory 15 can mainly include storing program area and storage data area, Wherein, storing program area can application program needed for storage program area, at least one function (such as sound-playing function, figure As playing function etc.) etc.;Storage data area, which can be stored, uses created data (such as audio data, phone according to mobile phone This etc.) etc..In addition, memory 15 may include high-speed random access memory, it can also include nonvolatile memory, such as Hard disk, memory, plug-in type hard disk, intelligent memory card (Smart Media Card, SMC), secure digital (Secure Digital, SD) card, flash card (Flash Card), at least one disk memory, flush memory device or other volatibility are solid State memory device.
Wherein, if module/unit that the note signature processing equipment integrates is realized in the form of SFU software functional unit And when sold or used as an independent product, it can store in a computer readable storage medium.Based on such Understand, the present invention realizes all or part of the process in above-described embodiment method, can also instruct phase by computer program The hardware of pass is completed, and the computer program can be stored in a computer readable storage medium, which exists When being executed by processor, it can be achieved that the step of above-mentioned each embodiment of the method.Wherein, the computer program includes computer journey Sequence code, the computer program code can be source code form, object identification code form, executable file or certain intermediate shapes Formula etc..The computer-readable medium may include: any entity or device, note that can carry the computer program code Recording medium, USB flash disk, mobile hard disk, magnetic disk, CD, computer storage, read-only memory (ROM, Read-Only Memory), Random access memory (RAM, Random Access Memory), electric carrier signal, telecommunication signal and software distribution medium Deng.It should be noted that the content that the computer-readable medium includes can be real according to legislation in jurisdiction and patent The requirement trampled carries out increase and decrease appropriate, such as in certain jurisdictions, according to legislation and patent practice, computer-readable medium It does not include electric carrier signal and telecommunication signal.
Third embodiment of the invention provides a kind of computer readable storage medium, the computer readable storage medium packet Include the computer program of storage, wherein where controlling the computer readable storage medium in computer program operation Equipment executes note signature processing method as in the first embodiment.
The above is a preferred embodiment of the present invention, it is noted that for those skilled in the art For, various improvements and modifications may be made without departing from the principle of the present invention, these improvements and modifications are also considered as Protection scope of the present invention.

Claims (10)

1. a kind of note signature processing method characterized by comprising
Obtain the signing messages of short message;
It is corresponding black to be obtained by the black and white lists network model pre-established for the signing messages according to the signing messages White list prediction result;
When the black and white lists prediction result is white list, according to the signing messages, pass through the enterprise's name net pre-established Network model obtains the corresponding enterprise's name prediction result of the signing messages;
Enterprise name prediction result is successively matched with the enterprise information for prestoring in the database, with from the data It is obtained in library and the matched enterprise's name information of enterprise's name prediction result and its corresponding trade classification information.
2. note signature processing method as described in claim 1, which is characterized in that it is described by enterprise name prediction result with The enterprise's name information prestored in the database is successively matched, and is tied with obtaining from the database with enterprise's name prediction The matched enterprise's name information of fruit and its corresponding trade classification information, specifically include:
Judge whether enterprise name prediction result matches with the enterprise information for prestoring in the database;
If so, obtaining from the database and the matched enterprise's name information of enterprise's name prediction result and its corresponding industry Classification information;
If it is not, enterprise's name prediction result and the enterprise's name information prestored in the database are successively subjected to similarity calculation, Acquisition and the highest enterprise's name information of enterprise's name prediction result similarity from the database, and according to the number of signature According to, pass through the trade classification network model pre-established, obtain trade classification information.
3. note signature processing method as claimed in claim 2, which is characterized in that the method also includes:
When enterprise's name prediction result is less than preset threshold value with the similarity for prestoring enterprise's name information in the database, End processing the signing messages.
4. note signature processing method as described in claim 1, which is characterized in that the method also includes following black and white lists Network model construction step:
Dictionary is constructed to signature sample gathered in advance;Wherein, the signature sample include original signature information, with it is described original The corresponding black and white lists classification information of signing messages, enterprise's name information, trade classification coding;The dictionary includes the original label Mapping relations in name information between each word and number;
Black and white lists classified dictionary is constructed to signature sample gathered in advance;Wherein, the black and white lists classified dictionary includes institute State the mapping relations between the corresponding black and white lists classification information of original signature information and number;
According to the dictionary, the black and white lists classified dictionary, by the original signature information and its corresponding black and white lists point Category information is converted to Serial No., as the first Serial No.;
First Serial No. is trained by convolutional neural networks model, establishes the black and white lists network model.
5. note signature processing method as claimed in claim 4, which is characterized in that it is described according to the signing messages, pass through The black and white lists network model pre-established obtains the corresponding black and white lists prediction result of the signing messages, specifically includes:
According to the dictionary, the signing messages is converted into Serial No., as the second Serial No.;
It is corresponding to obtain the signing messages using second Serial No. as the input value of the black and white lists network model Black and white lists prediction result.
6. note signature processing method as claimed in claim 4, which is characterized in that the method also includes following enterprise's name nets Network model construction step:
To signature sample construction enterprise name dictionary;Wherein, enterprise's name dictionary includes that the original signature information is corresponding Enterprise name information and number between mapping relations;
According to the dictionary, enterprise name dictionary, the original signature information and its corresponding enterprise name information are converted to Serial No., as third Serial No.;
The third Serial No. is trained by neural translation model, establishes enterprise's name network model.
7. note signature processing method as claimed in claim 2, which is characterized in that it is described by enterprise name prediction result with The enterprise's name information prestored in the database successively carries out similarity calculation, obtains from the database pre- with enterprise's name The highest enterprise's name information of result similarity is surveyed, and according to the signed data, passes through the trade classification network mould pre-established Type obtains trade classification information, specifically includes:
Each word in enterprise's name prediction result, enterprise name information is converted into word vector respectively, and according to each The frequency of word calculates enterprise name prediction result, the corresponding word weight of each word in enterprise name information;
According to the word vector of enterprise name prediction result and its corresponding word weight, enterprise name information word vector and its Corresponding word weight calculates separately the sentence vector of enterprise's name prediction result, the sentence of enterprise name information using SIF algorithm Vector;
The sentence vector of sentence vector and enterprise name information to enterprise's name prediction result carries out cosine similarity calculating, from Acquisition and the highest enterprise's name information of enterprise's name prediction result cosine similarity in the database, and according to the signature Data obtain trade classification information by the trade classification network model pre-established.
8. note signature processing method as claimed in claim 4, which is characterized in that the method also includes following trade classifications Network model construction step:
Trade classification dictionary is constructed to signature sample gathered in advance;The trade classification dictionary includes the trade classification coding Mapping relations between number;
According to the dictionary, the trade classification dictionary, the original signature information and its corresponding trade classification coding are turned It is changed to Serial No., as the 4th Serial No.;
The 4th Serial No. is trained by convolutional neural networks model, establishes the trade classification network model.
9. a kind of note signature processing equipment, including processor, memory and storage in the memory and are configured as The computer program executed by the processor, the processor realize such as claim 1 to 8 when executing the computer program Any one of described in note signature processing method.
10. a kind of computer readable storage medium, which is characterized in that the computer readable storage medium includes the calculating of storage Machine program, wherein equipment where controlling the computer readable storage medium in computer program operation is executed as weighed Benefit require any one of 1 to 8 described in note signature processing method.
CN201811574935.5A 2018-12-21 2018-12-21 Note signature processing method, equipment and computer readable storage medium Pending CN109726266A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811574935.5A CN109726266A (en) 2018-12-21 2018-12-21 Note signature processing method, equipment and computer readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811574935.5A CN109726266A (en) 2018-12-21 2018-12-21 Note signature processing method, equipment and computer readable storage medium

Publications (1)

Publication Number Publication Date
CN109726266A true CN109726266A (en) 2019-05-07

Family

ID=66297105

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811574935.5A Pending CN109726266A (en) 2018-12-21 2018-12-21 Note signature processing method, equipment and computer readable storage medium

Country Status (1)

Country Link
CN (1) CN109726266A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111966806A (en) * 2020-08-17 2020-11-20 深圳市活力天汇科技股份有限公司 Information extraction and repair method for hotel reservation short message

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101030856A (en) * 2006-07-19 2007-09-05 王李琰 Method for verifying SMS and transmitting reliability classification based on cipher technology mark
US20090234925A1 (en) * 2008-03-14 2009-09-17 International Business Machines Corporation Dyanmic Domain Based Electronic Mail Signature Lines
CN104301532A (en) * 2014-09-30 2015-01-21 小米科技有限责任公司 Communication message identification method and device
CN106779467A (en) * 2016-12-31 2017-05-31 成都数联铭品科技有限公司 Enterprises ' industry categorizing system based on automatic information screening
CN106792589A (en) * 2016-11-17 2017-05-31 中卓信(北京)科技有限公司 Presentation of information updating menu method and apparatus
CN107797989A (en) * 2017-10-16 2018-03-13 平安科技(深圳)有限公司 Enterprise name recognition methods, electronic equipment and computer-readable recording medium
CN108763205A (en) * 2018-05-21 2018-11-06 阿里巴巴集团控股有限公司 A kind of brand alias recognition methods, device and electronic equipment

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101030856A (en) * 2006-07-19 2007-09-05 王李琰 Method for verifying SMS and transmitting reliability classification based on cipher technology mark
US20090234925A1 (en) * 2008-03-14 2009-09-17 International Business Machines Corporation Dyanmic Domain Based Electronic Mail Signature Lines
CN104301532A (en) * 2014-09-30 2015-01-21 小米科技有限责任公司 Communication message identification method and device
CN106792589A (en) * 2016-11-17 2017-05-31 中卓信(北京)科技有限公司 Presentation of information updating menu method and apparatus
CN106779467A (en) * 2016-12-31 2017-05-31 成都数联铭品科技有限公司 Enterprises ' industry categorizing system based on automatic information screening
CN107797989A (en) * 2017-10-16 2018-03-13 平安科技(深圳)有限公司 Enterprise name recognition methods, electronic equipment and computer-readable recording medium
CN108763205A (en) * 2018-05-21 2018-11-06 阿里巴巴集团控股有限公司 A kind of brand alias recognition methods, device and electronic equipment

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111966806A (en) * 2020-08-17 2020-11-20 深圳市活力天汇科技股份有限公司 Information extraction and repair method for hotel reservation short message
CN111966806B (en) * 2020-08-17 2023-10-27 深圳市活力天汇科技股份有限公司 Information extraction and repair method for hotel reservation short message

Similar Documents

Publication Publication Date Title
US11062089B2 (en) Method and apparatus for generating information
CN109271631A (en) Segmenting method, device, equipment and storage medium
CN109299458A (en) Entity recognition method, device, equipment and storage medium
CN107679144A (en) News sentence clustering method, device and storage medium based on semantic similarity
WO2022134421A1 (en) Multi-knowledge graph based intelligent reply method and apparatus, computer device and storage medium
US20160321254A1 (en) Unsolicited bulk email detection using url tree hashes
CN107368856A (en) Clustering method and device, the computer installation and readable storage medium storing program for executing of Malware
CN110347790A (en) Text duplicate checking method, apparatus, equipment and storage medium based on attention mechanism
CN105630767A (en) Text similarity comparison method and device
CN107644106B (en) Method, terminal device and storage medium for automatically mining service middleman
CN107908796A (en) E-Government duplicate checking method, apparatus and computer-readable recording medium
CN115941322B (en) Attack detection method, device, equipment and storage medium based on artificial intelligence
CN106648557A (en) Application programming interface API sharing method and device
CN110300191A (en) Service system and data processing method
CN114496083A (en) Cell type determination method, device, equipment and storage medium
CN110309293A (en) Text recommended method and device
CN114995791A (en) API (application program interface) polymerization method and device, electronic equipment and storage medium
CN109726266A (en) Note signature processing method, equipment and computer readable storage medium
CN114726823A (en) Domain name generation method, device and equipment based on generation countermeasure network
CN112905753A (en) Method and device for distinguishing text information
CN110147223A (en) Generation method, device and the equipment of Component Gallery
CN110503117A (en) The method and apparatus of data clusters
CN109754273A (en) The method and apparatus for promoting any active ues quantity
WO2021000411A1 (en) Neural network-based document classification method and apparatus, and device and storage medium
CN110895655A (en) Method and device for extracting text core phrase

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20190507