Specific embodiment
With reference to the accompanying drawing, the scheme provided this specification is described.
Fig. 1 is the information transmission system schematic diagram that this specification provides.As shown in Figure 1, the system may include data work
Journey layer 102, model training layer 104, algorithm layer 106 and gas producing formation 108.Data engineering layer 102: majority is according to depth mode
(schema) change support, the filtering of feature extraction OP, repeated exposure configurationization.Model training layer 104: feature selecting reconstruct, just amount
Variation, optimizer tune ginseng and Topological expansion, hour grade model-aided tuning.Algorithm layer 106:DMP is carried out based on Ray frame
Deep learning (Deep learning) in intention assessment and enhances natural language sequence to sequence (Sequence to
Sequence, seq2seq) mode carries out multiprecision arithmetic output.Gas producing formation 108: for the mould of the Deep learning of DMP
Type is got through with gas producing formation, is formed the official documents and correspondence based on intention assessment in conjunction with special scenes (such as Spring Festival) and is intelligently split combined scene
Change the intelligent meaning of one's words and combine operation tool, and got through with the pending class product of project in intelligent reminding, forms entirety and user is closed
Note degree clicks wish and clicks the operation tool of the frequency (viscosity).
In addition, above system can also include: infrastructure layer 110, model validation layer 112 and model service layer 114.Its
In, infrastructure layer 110: system bubble reminder application programming interface (Application Programming
Interface, API) call association and the primary interaction page of H5 to conduct in real time.Model validation layer 112: prolong after load off-line model
Slow export, horse racing mechanism, real time monitoring.Model service layer 114: sufficiently pressure, which is surveyed, guarantees service stability, balance effect and pressure
Power monitoring.
In Fig. 1, deep learning algorithm of the gas producing formation 110 based on Ray can preferably utilize higher order relationship, especially with
Under family behavioral data missing and the insufficient situation of corpus, still operation scene can be combined to carry out by high-precision intention assessment
High compactness and accuracy user push.
It should be noted that the information-pushing method that this specification provides is realized based on label phrase mapping table,
Therefore, before the scheme that description this specification provides, first the establishment process of label phrase mapping table is illustrated.
Fig. 2 is the method for building up flow chart for the label phrase mapping table that this specification provides.As shown in Fig. 2, the foundation
Journey may include steps of:
Step 202, a plurality of business corpus relevant to multiple users is collected.
Such as, it can be and collect a plurality of business corpus relevant to multiple users from business corpus.Wherein, every business
Corpus can have corresponding initial labels.
Specifically, it can be based on specific behavior scene (e.g., the new year) and want keyword (e.g., five happinesses red packet outstanding
Deng) carry out business corpus collection and arrangement.In one example, collected business corpus for example can be with are as follows: " exquisite shirt
Dedicate exquisite you to ", " in short give you to strive: never stopping believing oneself good morning " etc..Wherein, with it is " exquisite
Shirt dedicates exquisite you to " corresponding initial labels can be with are as follows: and " the beautiful younger sister of fashion " and " in short gives you to strive: never
Stopping believing oneself good morning " corresponding initial labels can be with are as follows: " workplace office ".
It should be noted that after being collected into business corpus dependency rule can be based on, there are End-user relevances for filtering
The business corpus of difference, to realize recalling for related corpus.
Step 204, for business corpus relevant to each user, corresponding keyword is extracted from the business corpus.
The semanteme of above-mentioned keyword reflects behavioural habits of the user under specifies behavior scene.
In one implementation, corresponding pass can be extracted from business corpus based on context_service algorithm
Keyword.By business corpus: for for " exquisite shirt dedicate to exquisite you ", the keyword extracted for example can be with are as follows: " exquisite
You ".Again by business corpus: for for " in short give you to strive: never stopping believing oneself good morning ", being taken out
The keyword taken for example can be with are as follows: " strive you ".Here, the semanteme of " exquisite you " and " strive you " reflects user
It is the behavioural habits under scene of doing shopping.
It should be noted that can be carried out based on the rule such as part of speech word frequency, correlation for the keyword that said extracted arrives
Screening.
Step 206, correlation word is determined using deep learning algorithm based on keyword.
Here, used deep learning algorithm can be the DMP algorithm based on Ray frame.
Determine that the process of correlation word is referred to Fig. 3 using the DMP algorithm based on Ray frame.Specifically, based on figure
When DMP algorithm shown in 3 determines correlation word, keyword can be inputted in the position that top side is finally distributed.It later, can be right
Keyword is split, to obtain the first sub- word and the second sub- word.Wherein, the semanteme of the first sub- word reflects user's
Lifestyle.The personage that the semanteme of second sub- word reflects user claims in generation.By keyword: for for " exquisite you ", institute
The sub- word of first obtained can be with are as follows: " exquisiteness ", the second sub- word can be with are as follows: " you ".Again by keyword: for " strive you "
For, obtained first sub- word can be with are as follows: " striving ", the second sub- word can be with are as follows: " you ".For the son after above-mentioned fractionation
Word can screen it using convolutional neural networks algorithm (e.g., Text_CNN), to remove unreasonable fractionation word
Language.
In Fig. 3, determining correlation word includes at least the first correlation word.The output position of first correlation word can be with
For the position of bottom right side section abstract.The semanteme of first correlation word reflects other rows of the user under specifies behavior scene
For habit and/or user's portrait.For by taking user draws a portrait as an example, such as can be with are as follows: " gender ", " occupation ", " region ", " work
Make field " and " hobby " etc..In addition, identified correlation word can also include the second correlation word.Second related term
The output position of language is the position that lower left side carrys out source text.The semanteme of second correlation word reflects user in other behavior fields
Behavioural habits and user's portrait under scape.For by taking aforementioned " exquisite you " and " strive you " as an example, second here is related
Word can be user's portrait under professional scene: " white collar " and " student " etc..
From figure 3, it can be seen that determining that the process of correlation word experienced following process: decoding hidden state -> vocabulary point
Cloth -> final distribution, and the relationship between the three as shown in figure 3, do not repeat again herein.
In another example, pointer occupier model can also be used, determines correlation word.The model is in base
The mechanism and coverage mechanism that k-means has been incorporated on plinth DMP model, protect user in the precision of semantics recognition
Card.
This specification is above-mentioned when determining correlation word, be adjusted on BERT distributed strategy multiple branches jointly into
Row search is finally carried out preferably and is exported to the result of generation, so as to greatly promote the flexibility of determined word.This
Outside, above-mentioned determining method is to derive to obtain correlation word based on the intention of user, it is possible thereby to realize the intelligence of keyword
It splits.
It returns in Fig. 2, Fig. 2 can also include the following steps:
Step 208, it is based on keyword and correlation word, generates the corresponding user's phrase of user under specifies behavior scene
And user tag.
Herein, before executing above-mentioned generation step, first above-mentioned keyword and correlation word can be normalized
Mapping relations matching.Specifically, the identical keyword of meaning and/or correlation word can be normalized to identical word, and,
Keyword and/or correlation word are mapped as to embody the word (being referred to as intended to word) of user's intention.
It is understood that after executing above-mentioned normalized mapping relationship match, so that it may generate user's phrase and
User tag.Specifically, initial labels corresponding with business corpus, Lai Shengcheng user tag can be based on.Can also based on from
Correlation word determined by the keyword extracted in the corresponding business corpus of the initial labels, Lai Shengcheng user tag.Citing comes
It says, can be by initial labels above-mentioned: " the beautiful younger sister of fashion " be used as user tag." white collar " can also be used as user tag.
It, can be based on the keyword and the first correlation under behavior scene corresponding to relative users label for user's phrase
Word generates.For by taking " the beautiful younger sister of fashion " as an example, corresponding user's phrase can be with are as follows: " exquisite you ".I.e. to fractionation after
Keyword is reconfigured.
When above-mentioned correlation word further includes the second correlation word, user tag generated can be with are as follows: and " white collar ", and it is right
That answers can be with for phrase are as follows: " cause is more and more prosperous ".In this example embodiment, user's phrase can be based on dependency rule to the second phase
It closes word and carries out automatic combination producing.That is, can also be generated other when correlation word further includes the second correlation word
The corresponding user tag of user and user's phrase under behavior scene.
In the present specification, for the user's phrase and user tag of generation, convolutional neural networks algorithm can be used
(e.g., Text_CNN) screens it, to screen the higher user's phrase of mass.
Step 210, mark is at least established based on the corresponding user's phrase of user and user tag under specifies behavior scene
Sign phrase mapping table.
It is understood that in practical applications, above-mentioned steps 204 and step 208, which can be, to be repeated, until raw
At the corresponding user tag of each user and user's phrase.
When generating user tag corresponding with multiple users and user's phrase, step 210 be can be based on multiple use
The user tag and user's phrase at family, Lai Jianli label phrase mapping table.
It should be noted that the user tag and user's phrase in the label phrase mapping table finally established can be used as newly
The completion of business corpus into business corpus, it is possible thereby to achieve the purpose that information flow back.By the reflux of information, can make
Business corpus in business corpus is more and more abundant.
To sum up, this specification embodiment can be come based on behavioural habits data, intention word and the completion information with user
Establish above-mentioned label phrase mapping table.
Fig. 4 is the establishment process schematic diagram for the label phrase mapping table that this specification provides.In Fig. 4, firstly, from corpus
It is middle to collect a plurality of business corpus relevant to multiple users, and corresponding keyword is extracted from each corpus.Here
A plurality of business corpus is corresponding with specific behavior scene.Later, it is learned based on the keyword extracted using deep learning algorithm
Practise other words relevant to above-mentioned specific behavior scene;And study other words relevant to other behavior scenes.For
The other words learnt can be filtered it using convolutional neural networks algorithm.Finally, being based on keyword and filtering
Other words afterwards generate the corresponding relationship under different behavior scenes between user's phrase and user tag, and right based on this
It should be related to, establish label phrase mapping table.
It is understood that being the explanation to the establishment process of label phrase mapping table above, below to based on the mapping
The information-pushing method of table is illustrated.
Fig. 5 is the information-pushing method flow chart that this specification one embodiment provides.As shown in figure 5, the method can
To include the following steps:
Step 502, the user behavior data of user is obtained.
Here user behavior data can refer to user in the upper execution browsing behavior of application, click behavior or consumption row
Generated data when to wait business conducts.
Step 504, it is based on user behavior data, determines the behavior scene of user.
What can be here pre-defined, it can include but is not limited to the new year scene, shopping scene and occupation
Scene etc..
Step 506, the user tag of user is identified.
Here it is possible to be to identify the user tag of user based on user behavior data.The user tag is for describing user
User portrait.
Under scene of doing shopping, the user tag of user for example can be with are as follows: " the beautiful younger sister of fashion ", " housewife " and " workplace
Office " etc..Under professional scene, the user tag of user for example can be with are as follows: " white collar ", " university student " etc..
Step 508, the label phrase firing table pre-established is inquired, is obtained under behavior scene, user tag is corresponding
User's phrase.
User's phrase is used to describe the behavioural habits of user.
For by taking label phrase mapping table shown in Fig. 4 as an example, when the behavior scene for determining user are as follows: shopping scene, and
The user tag of user are as follows: when " the beautiful younger sister of fashion ", user's phrase of acquisition are as follows: " exquisite you ".
Step 510, the target information to match with user's phrase is searched.
Step 512, target information is pushed to user.
Here it is possible to be based on preset matching relationship, to search the target information to match.As an example it is assumed that having
Following matching relationship:So can to active user push digital product or with cook
It prepares food relevant article.
To sum up, the information-pushing method that this specification embodiment provides can first be known after capturing the behavior of user
The not user tag of the user is based on user's phrase corresponding with user tag later, may be interested to search user
Information simultaneously pushes, thus, it is possible to realize the accurate push of information.Further, it is also possible to greatly promote the flexibility of institute's pushed information.
Finally, relying on Ray as Distributed Computing Platform in this specification scheme, being accurate, the efficient hair of deep learning
It waves and plays important function.Realize that deep learning operation is efficient to final transformation model from data source header acquisition user behavior data
Energy intention assessment, End2End is within a frame always.Realize data engineering configurationization, model training real time implementation, model
Verify onlineization, the key properties such as minute grade model modification.
Accordingly with above- mentioned information method for pushing, a kind of information push-delivery apparatus that this specification one embodiment also provides,
As shown in fig. 6, the apparatus may include:
Acquiring unit 602, for obtaining the user behavior data of user.
Determination unit 604, the user behavior data for being obtained based on acquiring unit 602, determines the behavior scene of user.
Recognition unit 606, the user tag of user, the user tag are used to describe user's portrait of user for identification.
Acquiring unit 602 is also used to inquire the label phrase firing table pre-established, obtain under current behavior scene, uses
The corresponding user's phrase of family label.User's phrase is used to describe the behavioural habits of user.Here label phrase mapping table is used for
It is recorded in the corresponding relationship under different behavior scenes between user tag and user's phrase.Wherein, the use under a kind of behavior scene
Family label and user's phrase are behavioural habits and depth based on the user under behavior scene or other corelation behaviour scenes
Spend what learning algorithm determined.
Here deep learning algorithm for example can be the DMP algorithm based on Ray frame.
Searching unit 608, the target information to match for searching the user's phrase obtained with acquiring unit 602.
Push unit 610, for pushing the target information to user.
Optionally, which can also include:
Unit (not shown) is established, for collecting a plurality of business corpus relevant to multiple users.
For business corpus relevant to arbitrary first user, corresponding keyword is extracted from business corpus, the pass
The semanteme of keyword reflects behavioural habits of first user under specifies behavior scene.
Correlation word is determined using deep learning algorithm based on keyword.The correlation word includes at least the first related term
Language, the semanteme of the first correlation word reflect other behavioural habits and/or user of first user under specifies behavior scene and draw
Picture.
Based on keyword and correlation word, generate under specifies behavior scene the corresponding user's phrase of the first user and
User tag.
At least based on the corresponding user's phrase of the first user and user tag under specifies behavior scene, it is short to establish label
Language mapping table.
Optionally, above-mentioned correlation word can also include the second correlation word, and the semanteme of the second correlation word reflect the
Behavioural habits and user portrait of one user under other behavior scenes.
Determination unit 604 is also used to determine that the first user is corresponding under other behavior scenes based on the second correlation word
User's phrase and user tag.
Establishing unit specifically can be used for:
At least based on the corresponding user's phrase of the first user and user tag under specifies behavior scene, and other
The corresponding user's phrase of first user and user tag, establish label phrase mapping table under behavior scene.
Establishing unit also specifically can be used for:
Keyword is split, to obtain the first sub- word and the second sub- word.The semanteme of first sub- word reflects
The lifestyle of first user, the personage that the semanteme of the second sub- word reflects the first user claim in generation.
Correlation word is determined using deep learning algorithm based on the first sub- word and the second sub- word.
Optionally, which can also include:
Screening unit (not shown), for being based on convolutional neural networks algorithm, to the first sub- word and the second sub- word
Language is screened.
Establishing unit also specifically can be used for:
Based on after screening the first sub- word and the second sub- word correlation word determined using deep learning algorithm.
The function of each functional module of this specification above-described embodiment device can pass through each step of above method embodiment
Rapid to realize, therefore, the specific work process for the device that this specification one embodiment provides does not repeat again herein.
The information push-delivery apparatus that this specification one embodiment provides, acquiring unit 602 obtain the user behavior number of user
According to.User behavior data of the determination unit 604 based on acquisition determines the behavior scene of user.Recognition unit 606 identifies user's
User tag, the user tag are used to describe user's portrait of user.Acquiring unit 602 is inquired the label phrase pre-established and is penetrated
Table obtains the corresponding user's phrase of user tag under current behavior scene.The behavior that user's phrase is used to describe user is practised
It is used.Here label phrase mapping table is used to be recorded under different behavior scenes the corresponding pass between user tag and user's phrase
System.Wherein, the user tag under a kind of behavior scene and user's phrase are based in behavior scene or other corelation behaviours
The behavioural habits of user and deep learning algorithm determine under scene.Searching unit 608 searches user's phrase phase with acquisition
Matched target information.Push unit 610 pushes target information to user.Thus, it is possible to realize the accurate push of information.
Accordingly with above- mentioned information method for pushing, this specification embodiment additionally provides a kind of information pushing equipment, such as Fig. 7
Shown, which may include: memory 702, one or more processors 704 and one or more programs.Wherein, this one
A or multiple programs are stored in memory 702, and are configured to be executed by one or more processors 704, the program quilt
Processor 704 performs the steps of when executing
Obtain the user behavior data of user.
Based on user behavior data, the behavior scene of user is determined.
Identify that the user tag of user, the user tag are used to describe user's portrait of user.
The label phrase firing table pre-established is inquired, the corresponding user of user tag under determining behavior scene is obtained
Phrase.User's phrase is used to describe the behavioural habits of user.Label phrase mapping table is for being recorded under different behavior scenes
Corresponding relationship between user tag and user's phrase.Wherein, the user tag under a kind of behavior scene and user's phrase are bases
It is determined in the behavioural habits and deep learning algorithm of the user under behavior scene or other corelation behaviour scenes.
Search the target information to match with user's phrase.
Target information is pushed to user.
The information pushing equipment that this specification one embodiment provides, may be implemented the accurate push of information.
All the embodiments in this specification are described in a progressive manner, same and similar portion between each embodiment
Dividing may refer to each other, and each embodiment focuses on the differences from other embodiments.Especially for equipment reality
For applying example, since it is substantially similar to the method embodiment, so being described relatively simple, related place is referring to embodiment of the method
Part explanation.
The step of method in conjunction with described in this disclosure content or algorithm can realize in a manner of hardware,
It can be and the mode of software instruction is executed by processor to realize.Software instruction can be made of corresponding software module, software
Module can be stored on RAM memory, flash memory, ROM memory, eprom memory, eeprom memory, register, hard
Disk, mobile hard disk, CD-ROM or any other form well known in the art storage medium in.A kind of illustrative storage Jie
Matter is coupled to processor, to enable a processor to from the read information, and information can be written to the storage medium.
Certainly, storage medium is also possible to the component part of processor.Pocessor and storage media can be located in ASIC.In addition, should
ASIC can be located in server.Certainly, pocessor and storage media can also be used as discrete assembly and be present in server.
Those skilled in the art are it will be appreciated that in said one or multiple examples, function described in the invention
It can be realized with hardware, software, firmware or their any combination.It when implemented in software, can be by these functions
Storage in computer-readable medium or as on computer-readable medium one or more instructions or code transmitted.
Computer-readable medium includes computer storage media and communication media, and wherein communication media includes convenient for from a place to another
Any medium of one place transmission computer program.Storage medium can be general or specialized computer can access it is any
Usable medium.
It is above-mentioned that this specification specific embodiment is described.Other embodiments are in the scope of the appended claims
It is interior.In some cases, the movement recorded in detail in the claims or step can be come according to the sequence being different from embodiment
It executes and desired result still may be implemented.In addition, process depicted in the drawing not necessarily require show it is specific suitable
Sequence or consecutive order are just able to achieve desired result.In some embodiments, multitasking and parallel processing be also can
With or may be advantageous.
Above-described specific embodiment has carried out into one the purpose of this specification, technical scheme and beneficial effects
Step is described in detail, it should be understood that being not used to limit this foregoing is merely the specific embodiment of this specification
The protection scope of specification, all any modifications on the basis of the technical solution of this specification, made, change equivalent replacement
Into etc., it should all include within the protection scope of this specification.