CN108271041A - Mess code treating method and apparatus - Google Patents
Mess code treating method and apparatus Download PDFInfo
- Publication number
- CN108271041A CN108271041A CN201611264769.XA CN201611264769A CN108271041A CN 108271041 A CN108271041 A CN 108271041A CN 201611264769 A CN201611264769 A CN 201611264769A CN 108271041 A CN108271041 A CN 108271041A
- Authority
- CN
- China
- Prior art keywords
- correspondence
- mess code
- text data
- mess
- code
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/235—Processing of additional data, e.g. scrambling of additional data or processing content descriptors
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/435—Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/60—Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client
- H04N21/61—Network physical structure; Signal processing
- H04N21/6156—Network physical structure; Signal processing specially adapted to the upstream path of the transmission network
- H04N21/6175—Network physical structure; Signal processing specially adapted to the upstream path of the transmission network involving transmission via Internet
Abstract
The embodiment of the invention discloses a kind of mess code treating method and apparatus, for facilitating the mess code for eliminating text data.Present invention method includes:Obtain text data;Judge whether the text data includes the mess code in the correspondence pre-established, the correspondence includes mess code and the correspondence of operation;If the text data includes the mess code in the correspondence, according to the correspondence, the text data is handled using operation corresponding with the mess code, to eliminate the mess code in the text data.In this way, for including the text data of mess code, its mess code with correspondence is compared, if text data includes the mess code of the correspondence, the operation that the correspondence may be used eliminates the mess code from this article notebook data, such mess code removing method implements convenient and efficient, and the intervention without developer and operation maintenance personnel can be realized.
Description
Technical field
The present invention relates to data processing field more particularly to a kind of mess code treating method and apparatus.
Background technology
It is collected the problem of because of gatherer process or the reason of equipment in the text data collected by equipment
Data often will appear mess code.
For example, in IPTV Data processings, obtained since data source may be by equipment acquisition, obtained data can
Mess code can be come out, as shown in following table one:
Table one:
Channel | Viewing number | Watch duration |
Satellite TV of China~~ | 1000 | 2300 |
Central ## channels A | 2000 | 4000 |
%@satellite TVs of China | 3000 | 5300 |
xx4486e | 100 | 5000 |
User often wants developer or operation maintenance personnel to intervene, logarithm in the mess code in these text datas
It is optimized according to the equipment of acquisition or algorithm etc., such settling mode often spends the more time, and cumbersome.
Invention content
An embodiment of the present invention provides a kind of mess code treating method and apparatus, for facilitating the mess code for eliminating text data.
In order to solve the above-mentioned technical problem, an embodiment of the present invention provides following technical schemes:
A kind of mess code processing method, including:
Obtain text data;
Judge whether the text data includes the mess code in the correspondence pre-established, the correspondence is included disorderly
Code and the correspondence of operation;
If the text data includes the mess code in the correspondence, according to the correspondence, using with it is described
The corresponding operation of mess code in text data handles the text data, to eliminate the mess code in the text data.
In order to solve the above-mentioned technical problem, the embodiment of the present invention additionally provides following technical scheme:
A kind of mess code processing unit, including:
Acquiring unit, for obtaining text data;
Judging unit, it is described for judging whether the text data includes the mess code in the correspondence pre-established
Correspondence includes mess code and the correspondence of operation;
Processing unit, if including the mess code in the correspondence for the text data, according to the corresponding pass
System is handled the text data using operation corresponding with the mess code, to eliminate the mess code in the text data.
As can be seen from the above technical solutions, the embodiment of the present invention has the following advantages:
After obtaining text data, judge whether text data includes the mess code in the correspondence pre-established, the correspondence
Relationship includes mess code and the correspondence of operation.If text data includes the mess code in correspondence, according to correspondence, make
Text data is handled with operation corresponding with the mess code, so as to which the mess code in text data can be eliminated.In this way, for
Its mess code with correspondence is compared by the text data including mess code, if text data includes the unrest of the correspondence
Code, the operation that the correspondence may be used eliminate the mess code from this article notebook data, and such mess code removing method is realized
Get up convenient and efficient, the intervention without developer and operation maintenance personnel can be realized.
Description of the drawings
Fig. 1 is the method flow diagram of a kind of mess code processing method that one embodiment of the invention provides;
Fig. 2 is the method flow diagram of a kind of mess code processing method that another embodiment of the present invention provides;
Fig. 3 is the structure diagram of a kind of mess code processing unit that another embodiment of the present invention provides.
Specific embodiment
An embodiment of the present invention provides a kind of mess code treating method and apparatus, for facilitating the mess code for eliminating text data.
Fig. 1 is a kind of method flow diagram of mess code processing method provided in an embodiment of the present invention.Refering to Fig. 1, the present invention is real
The method for applying example includes:
Step 101:Obtain text data;
Step 102:Judge whether text data includes the mess code in the correspondence pre-established, which includes
Mess code and the correspondence of operation;If text data includes the mess code in correspondence, step 103 is performed.
Step 103:According to correspondence, text data is handled using operation corresponding with the mess code, to eliminate
Mess code in text data.
Optionally,
Correspondence includes at least two types, and different types of correspondence corresponds to different types of operation and difference
The priority of grade;
Judge whether text data includes the mess code in the correspondence pre-established, including:
According to the hierarchal order of priority, by judging whether text data wraps using different types of correspondence after arriving first
Include the mess code in correspondence.
Optionally,
Correspondence includes the first correspondence, the second correspondence, third correspondence and the 4th correspondence,
First correspondence includes the first mess code and the correspondence of the first operation, and the first operation is will include the first mess code
Pending text data replace with the first text, the first mess code includes all characters in pending text data;
Second correspondence includes the second mess code and the correspondence of the second operation, and the second operation is will include the second mess code
Pending text data replace with the second text, the second mess code is the partial character in pending text data;
Third correspondence includes the correspondence that third mess code and third operate, and third operation is from treating by third mess code
It is deleted in processing text data;
4th correspondence includes the 4th mess code and the correspondence of the 4th operation, and the 4th operation is will include the 4th mess code
Pending text data be hidden.
Optionally,
The priority level of correspondence is followed successively by from high to low:First correspondence, the second correspondence, third correspond to
Relationship and the 4th correspondence.
Optionally,
The number of correspondence is including multiple, and each correspondence further includes the first pre-set level, and text data further includes
Second pre-set level,
Before judging whether text data includes the mess code in the correspondence pre-established, the method for the embodiment of the present invention
It further includes:
The first pre-set level target correspondence corresponding with the second pre-set level is determined from multiple correspondence;
Judge whether text data includes the mess code in the correspondence pre-established, including:
Judge whether text data includes the mess code in target correspondence.
Optionally,
First pre-set level is the first settling time of correspondence, and the second pre-set level is established for the second of text data
Time.
Optionally,
For the number of correspondence including multiple, each correspondence further includes user name,
Before judging whether text data includes the mess code in the correspondence pre-established, the method for the embodiment of the present invention
It further includes:
Obtain the user name of current operation user;
The user name correspondence identical with the user name of current operation user is determined from multiple correspondence;
Judge whether text data includes the mess code in the correspondence pre-established, including:
Judge whether text data includes the mess code in the correspondence determined.
In conclusion after obtaining text data, judge whether text data includes the unrest in the correspondence pre-established
Code, the correspondence include mess code and the correspondence of operation.If text data includes the mess code in correspondence, according to right
It should be related to, text data is handled using operation corresponding with the mess code, so as to which the mess code in text data can be eliminated.
In this way, the text data for including mess code, its mess code with correspondence is compared, if text data includes the correspondence
The mess code of relationship, the operation that the correspondence may be used eliminate the mess code from this article notebook data, and such mess code is eliminated
Method implements convenient and efficient, and the intervention without developer and operation maintenance personnel can be realized.
Fig. 2 is a kind of mess code processing method provided in an embodiment of the present invention.With reference to the above and referring to Fig.2, below
Embodiment shown in Fig. 2 is illustrated.
Before the flow to the method for embodiment shown in Fig. 2 is described, first the method for the embodiment of the present invention is used
To correspondence illustrate, to make place mat.
In order to eliminate the mess code in the text data got in the method for the embodiment of the present invention, need to use corresponding pass
System, the correspondence include multiple dimensions, which mainly includes the correspondence of mess code and operation.The correspondence
For mess code for being matched with the character in text data, whether matching is identical, if matching is identical, performs the correspondence
In operation corresponding with the mess code, the operation include but not limited to replace text data for preset text, delete mess code, hidden
Tibetan includes text data of the mess code etc..In order to which text data is replaced with preset text, the correspondence further include with
The corresponding pre-set text dimension of mess code.
It is appreciated that the dimension of the correspondence can also include much information, such as settling time, user name etc..
In the embodiment having in the present invention, priority level can also be distributed to different correspondences, according to different
Priority determines to use sequence to different correspondence.
That is, correspondence include at least two types, different types of correspondence correspond to it is different types of operation and
Different grades of priority.According to the hierarchal order of these priority, these different types of correspondences are selected one by one
It selects, judges whether text data includes the mess code in the correspondence pre-established so that the correspondence execution selected is subsequent
The step of.
About the specific situation of correspondence, such as can be as follows:
Correspondence includes the first correspondence, the second correspondence, third correspondence and the 4th correspondence.
1.1 first correspondences include the first mess code and the correspondence of the first operation.First operation is will include first
The pending text data of mess code replaces with the first text, and the first mess code includes all characters in pending text data.That is,
First correspondence includes the correspondence of the first mess code, the first text and the first operation three, if text data includes first
Mess code, and all characters of text data be first mess code, i.e., text data be the first mess code, then according to this first operation,
Text data is replaced using the first text.
1.2 second correspondences include the second mess code and the correspondence of the second operation, and the second operation is will include second
The pending text data of mess code replaces with the second text, and the second mess code is the partial character in pending text data.That is, the
One correspondence includes the correspondence of the second mess code, the second text and the second operation three, if text data includes second disorderly
Code, as long as second mess code belongs to the character of the part in this article notebook data, you can according to second operation, use the second text
Replace text data.The difference of first correspondence and the second correspondence is to be, the first mess code and pending text
Data are equivalent, and the second mess code belongs to the partial data in pending text data.
The correspondence that 1.3 third correspondences include third mess code and third operates, third operation are by third mess code
It is deleted from pending text data.If i.e. text data includes third mess code, operated according to third, by the third mess code from
It is deleted in text data.
1.4 the 4th correspondences include the 4th mess code and the correspondence of the 4th operation, and the 4th operation is will include the 4th
The pending text data of mess code is hidden.If i.e. text data includes the 4th mess code, according to the 4th operation, by the text
The entire data of data are all hidden, i.e., the text data for not including the 4th mess code to this in display is shown.
In the embodiment of setting priority, the priority level of above-mentioned correspondence is followed successively by from high to low:First pair
It should be related to, the second correspondence, third correspondence and the 4th correspondence.
In an embodiment of the present invention, which pre-establishes, can be user oneself on a processing device
It establishes or user downloads to obtain from server-side, to the acquisition source of correspondence, the embodiment of the present invention does not do specific limit
It is fixed.
In order to more intuitively be illustrated to the above, below i.e. in IPTV fields, user establish correspondence into
Row description.Wherein, processing equipment is the equipment for the method for performing the embodiment of the present invention, can be the equipment such as computer.
The relation table that user Zhang San logs in processing equipment using user name " Zhang San " establishes interface.In the embodiment having,
User Zhang San can also use " public " option to log in the relation table and establish interface.If being logged in using particular user name, establish
The upper user name of correspondence mark, subsequently may be such that the correspondence only has the user of the user name just usable;If make
With the option of the public, then the mark of the upper public of correspondence mark established represents that the correspondence can be by the institute user of the machine
It uses.
Then Zhang San operates the processing equipment and performs data query operation, and query result is as shown in the table, mid band dimension
The data of degree are the row in table, which is text data above.In order to describe simplicity, it is with above-mentioned table one now
The query result, i.e.,
Table one
Channel | Viewing number | Watch duration |
Satellite TV of China~~ | 1000 | 2300 |
Central ## channels A | 2000 | 4000 |
%@satellite TVs of China | 3000 | 5300 |
xx4486e | 100 | 5000 |
The data of the query result are equipment acquisitions, it sometimes appear that mess code, such as the channel data of table two occur
Mess code, thus user perform operations described below, to eliminate these mess codes and establish follow-up correspondence to be used.
2.1 establish accurate replacement correspondence --- the first correspondence
User Zhang San is by processing equipment, the total data of selection " satellite TV of China~~", by above-mentioned table two " China defends
Depending on~~" replace with " satellite TV of China ", and current processing time is 11 days 13 December in 2016:00, in this way, in processing equipment
It is upper to establish a correspondence, a data is obtained, shown in following Table A 1.
Table A 1.1:
First row | Secondary series | Third arranges | 4th row |
" satellite TV of China~~" | " satellite TV of China " | 2016121113:00 | Zhang San |
If Zhang San was at second day, after inquiring new data, found again in query result " satellite TV of China~~", so as to
The total data of selection " satellite TV of China~~" replaces with " satellite TV of China~~" " satellite TV of China A ", and current processing
Time is 12 days 10 December in 2016:00, in this way, establishing a correspondence on a processing device, obtain another number
According to shown in following Table A 2.
Table A 1.2:
First row | Secondary series | Third arranges | 4th row |
Satellite TV of China~~ | Satellite TV of China A | 2016121113:00-2016121210:00 | Zhang San |
Two datas in above-mentioned 2.1, i.e. the two correspondences, should corresponding to the first correspondence in above-mentioned 1.1
Table can be used to represent in the concrete numerical value of correspondence, Table A 1.1 as escribed above, has four row in table, and first row is mess code, the
Two row are the first texts of mapping, and third row are the periods, represent the period that this record comes into force, the 4th row are user names
Claim.
Wherein first operation can be recorded in the form of code or on the correspondence plus the first operation
Identification information, as long as equipment subsequently read the identification information can perform first operation or preserve first relationship pass
When being, which is stored in the storage unit for the correspondence for preserving the first class of operation, passes through above-mentioned specific side
Formula can establish the first operation in the first correspondence.
2.2 establish fuzzy replacement correspondence --- the second correspondence
User Zhang San, select above-mentioned table two channel data " central ## channels " in " center ", and control process is set
Alternative obscures replacement option, which is replaced with " central satellite TV ", and current time is on December 11st, 2016
13:40, then correspondence is established on a processing device, obtains following data:
Table A 2:
First row | Secondary series | Third arranges | 4th row |
Center | Central satellite TV | 2016121113:40 | Zhang San |
Data in above-mentioned 2.2, i.e. this correspondence, corresponding to the second correspondence in above-mentioned 1.2, which closes
Table can be used to represent in the concrete numerical value of system, Table A 2 as escribed above, there is four row in table, and first row is mess code, and secondary series is to reflect
The second text penetrated, third row are the periods, represent the period that this record comes into force, the 4th row are user's names.
The foundation of the second operation in wherein the second correspondence can refer to described in above-mentioned 1.1.
2.3 establish deletion correspondence --- third correspondence
User Zhang San selects "@" character of the channel data of above-mentioned table two, selects after determining option, reselection channel number
According to " % " character, then control process equipment selection delete option, current time be 11 days 13 December in 2016:46, then exist
Correspondence is established in processing equipment, obtains following data.
Table A 3:
First row | Secondary series | Third arranges |
" % " | 2016121113:46 | Zhang San |
“@” | 2016121113:46 | Zhang San |
Data in above-mentioned 23, i.e. this correspondence, corresponding to the third correspondence in above-mentioned 1.3, which closes
Table can be used to represent in the concrete numerical value of system, Table A 3 as escribed above, there is three row in table, and first row stores mess code character, such as
" % ,~" etc., secondary series is the period, represents the period that this record comes into force, third row are user's names.
The foundation of third operation wherein in third correspondence can refer to described in above-mentioned 1.1.
2.4 establish hiding correspondence --- the 4th correspondence
User Zhang San selects the channel data " xx4486e " in the channel data of above-mentioned table two, and control process equipment is selected
It selects and hides Options, current time is 11 days 13 December in 2016:50, then establish correspondence on a processing device, obtain as
Lower data:
Table A 4:
First row | Third arranges | 4th row |
xx4486e | 2016121113:50 | Zhang San |
Data in above-mentioned 2.4, i.e. this correspondence, corresponding to the 4th correspondence in above-mentioned 1.4, which closes
Table can be used to represent in the concrete numerical value of system, Table A 4 as escribed above, there is three row in table, first row storage mess code character, and second
Row are the periods, represent the period that this record comes into force, third row are user's names.
The foundation of the 4th operation in wherein the 4th correspondence can refer to described in above-mentioned 1.4.
Wherein, the foundation of priority can be that equipment is pre-set, such as according to the operations of different correspondences
Classification sets priority to different correspondence respectively, for example, during above-mentioned correspondence is established, equipment preset including
Operation for the correspondence accurately replaced be the first priority, including operation for the fuzzy correspondence replaced be second excellent
First grade, including operation be the correspondence deleted be third priority, including operation for hiding correspondence be the 4th
Priority.
The settling time of wherein above-mentioned each table and other dimensional informations of the entitled each correspondence of user, according to these dimensions
Degree information may be such that these correspondences, and there are many occupation modes, meet the diversified demand of user.Wherein, settling time
One pre-set level, further included on the text data subsequently obtained with matched second pre-set level of first pre-set level, i.e.,
First pre-set level is matched available for index corresponding on text data.Certainly, these first, second pre-set levels
It can also be other types of information.
It is appreciated that above-mentioned each type correspondence can include it is multiple, such as the first correspondence i.e. wrap
Two have been included, more can also have been included, such as 5,12.
It establishes after completing these correspondences, you can store these correspondences on a processing device, storage mode has
It is a variety of, such as stored etc. using above-mentioned form form storage or using character string forms.
After the completion of the foundation of above-mentioned correspondence, you can the operation of subsequent processing text data is performed, it is as described below to retouch
It states, in order to describe more intuitive, is described below using IPTV fields, text data as channel data.
Step 201:Obtain inquiry data set.
Wherein inquiry data set includes channel data, which is the text data of the embodiment of the present invention, at this
It is possible that mess code in channel data.Wherein mess code refers to character to be processed, these mess codes are often that user does not need to use
Or it is wrong.
For example, after user Zhang San logs in the data query page of IPTV processing equipments using user name " Zhang San ", selection is looked into
Operation is ask, sends out inquiry instruction, processing equipment is to obtain the inquiry data set for including channel data, as shown in following table two.
Table is Y.0:
Channel | Viewing number | Watch duration |
Satellite TV of China~~ | 1000 | 2300 |
Central ## channels A | 2000 | 4000 |
%@satellite TVs of China | 3000 | 5300 |
xx4486e | 100 | 5000 |
Processing equipment can obtain the correspondence of above-mentioned foundation after inquiry data set is got from storage unit, with
Inquire and handle the mess code of the channel data in inquiry data set.
After the correspondence pre-established is obtained, first these correspondences can be screened, met with selecting
It is required that correspondence to carry out subsequent mess code processing.
For example, optionally, for the number of correspondence including multiple, each correspondence further includes the first pre-set level, text
Notebook data further includes the second pre-set level.So as to judge whether text data includes the mess code in the correspondence pre-established
Before, the method for the embodiment of the present invention further includes:The first pre-set level and the second default finger are determined from multiple correspondence
Mark corresponding target correspondence.When subsequently judging whether text data includes the mess code in the correspondence pre-established, i.e.,
It can be realized by judging whether text data includes the mess code in the target correspondence.
First, second advance index include diversified forms, such as the first pre-set level for correspondence first establish when
Between, the second pre-set level is the second settling time of text data.
Alternatively, the screening to correspondence can also be realized by following modes:
For the number of correspondence including multiple, each correspondence further includes user name,
Before judging whether text data includes the mess code in the correspondence pre-established, method further includes:It obtains and works as
The user name of preceding operation user;User name pair identical with the user name of current operation user is determined from multiple correspondence
It should be related to.It, can be by sentencing so as to which the step of whether text data includes the mess code in the correspondence pre-established subsequently judged
Whether disconnected text data is realized including the mess code in the correspondence determined.
For example, processing equipment is carried out at screening the correspondence pre-established according to the type and Query Dates of user
Reason, is filtered, mistake according to the user name on first, second, third, fourth correspondence to be prestored and settling time respectively
The correspondence filtered out meets the two requirements simultaneously:The user name of correspondence and the user name " Zhang San " of current operation user
The acquisition time of identical, inquiry data set settling time, that is, channel data belonged in the range of the settling time of correspondence.
Wherein, the user in correspondence is entitled " public ", then the correspondence is suitable for all user's uses, i.e.,
User name " public " and the user name of any other current operation users in correspondence regard identical as.
The correspondence obtained in this way is respectively the subset of first, second, third, fourth correspondence.Then, using sieve
The correspondence selected performs following flows.
Step 202:Judge whether channel data includes the mess code in the correspondence pre-established.If channel data includes
Mess code in correspondence, then perform step 203.
Step 203:According to correspondence, operated to channel data using corresponding with the mess code in channel data
Reason, to eliminate the mess code in channel data.
If channel data includes the mess code in correspondence, according to correspondence, using with the mess code in channel data
Corresponding operation handles channel data, to eliminate the mess code in channel data.If channel data does not include correspondence
In mess code then without processing.
If correspondence include at least two types, and different types of correspondence correspond to it is different types of operation and
Different grades of priority;
Then judging the specific implementation procedure whether text data includes the mess code in the correspondence pre-established is:According to
The hierarchal order of priority, by judging whether text data includes in correspondence using different types of correspondence after arriving first
Mess code.
That is, before the mess code in judging whether text data includes the correspondence pre-established, according to priority
Hierarchal order, selects correspondence one by one, and above-mentioned step 202 and step are performed in turn with the correspondence selected
203, until these correspondences all used or the channel data of the inquiry data set of step 201 was all judged as extremely.
For example, correspondence, which is the accurate of above-mentioned foundation completion, replaces correspondence, fuzzy replacement correspondence, deletion pair
Should be related to, hide correspondence when, after being screened according to settling time and user name to these correspondences, obtain this four
Then the subset of a correspondence performs following FOUR EASY STEPSs according to priority using these subsets.
3.1 first steps accurately match inquiry data set
Obtain accurately replacing the subset of correspondence after the user of the 4th row and tertial temporal filtering is used, so
Afterwards step 202 and step 203 are performed using the subset.That is, the channel data row of inquiry data set are the dimension row for occurring mess code,
The system of processing equipment can do accurate matching to the first row of the channel column of the table of the inquiry data set Y.0 and Table A 1.1.
Why select Table A 1.1 without select Table A 1.2, be because according to accurately replacement correspondence settling time into
Result after row screening.Such as the settling time of settling time, that is, channel data of inquiry data set is on December 11st, 2016,
And the settling time of Table A 1.1 is 2016121113:00, A1.2 settling time is 2016121113:00-2016121210:
00, the settling time for inquiring data set belongs to the settling time of Table A 1.1, so as to the correspondence of processing equipment selection Table A 1.1
Accurately matched.
After matching, the channel data of the first row of the inquiry data set " satellite TV of China~~" replaces correspondence to be accurate
In one of correspondence in mess code, according to the first of the correspondence the operation, will include the mess code " satellite TV of China~
~" channel data replace with the first text " satellite TV of China "
It is as follows so as to obtain table Y.1:
Channel | Viewing number | Watch duration |
Satellite TV of China | 1000 | 2300 |
Central ## channels A | 2000 | 4000 |
%@satellite TVs of China | 3000 | 5300 |
xx4486e | 100 | 5000 |
3.2nd, second step carries out fuzzy matching to inquiry data set
Due to accurately replacing all correspondences impossible to exhaust in correspondence, so this step needs do fuzzy
Match.
The fuzzy subset for replacing correspondence is obtained after the user of the 4th row and tertial temporal filtering is used, so
Afterwards step 202 and step 203 are performed using the subset.That is, after the first step is handled, the system of processing equipment can be to the inquiry
The first row of the table of data set channel column Y.1 and Table A 2 does fuzzy matching.
After matching, the channel data " central ## channels A " of the second row of the inquiry data set includes fuzzy replace and corresponds to
Mess code " center " in one of correspondence in relationship according to the second of the correspondence the operation, will include the mess code
The channel data in " center " replaces with the second text " central satellite TV ".
It is as follows so as to obtain table Y.2:
Channel | Viewing number | Watch duration |
Satellite TV of China | 1000 | 2300 |
Central satellite TV | 2000 | 4000 |
%@satellite TVs of China | 3000 | 5300 |
xx4486e | 100 | 5000 |
3.3rd, third walks, and fuzzy matching is carried out to inquiry data set
Third is performed using the table 7.2 that second step obtains to walk.
It obtains deleting the subset of correspondence after the temporal filtering of tertial user and secondary series is used, then makes
Step 202 and step 203 are performed with the subset.That is, after second step is handled, the system of processing equipment can be to the inquiry data
The first row of the table of collection channel column Y.2 and Table A 3 matches.
After matching, the channel data " %@satellite TVs of China " of the third line of the inquiry data set includes deleting correspondence
In wherein two correspondence in mess code " % " and "@", operated according to the third of the correspondence, by mess code " % " and
"@" is deleted from channel data.
It is as follows so as to obtain table Y3:
3.4th, the 4th step is hidden matching to inquiry data set
It obtains hiding the subset of correspondence after the temporal filtering of tertial user and secondary series is used, then makes
Step 202 and step 203 are performed with the subset.That is, after three step process, the system of processing equipment can be to the inquiry data
The first row of the table of collection channel column Y.3 and Table A 4 matches.
After matching, the channel data " xx4486e " of the fourth line of the inquiry data set includes hiding in correspondence
Mess code " xx4486e " in one of correspondence will include mess code " xx4486e " according to the 4th of the correspondence the operation
Channel data be hidden.
It is as follows so as to obtain table Y.4:
Channel | Viewing number | Watch duration |
Satellite TV of China | 1000 | 2300 |
Central satellite TV | 2000 | 4000 |
Satellite TV of China | 3000 | 5300 |
In this way, into after crossing the processing of above-mentioned steps, the mess code for inquiring the channel data of data set has closely been eliminated, from
And solve the Confused-code in text data, in the dimension during this article notebook data is multi-dimensional data, for example, above-mentioned
Channel data is the dimension for inquiring data set, solves dimension data Confused-code by the above method.
It by the above method, solves the problems, such as occur mess code in IPTV data, considers various situations comprehensively, mess code
Problem can solve by the operation of user, avoid the intervention of developer and operation maintenance personnel.
In conclusion after obtaining text data, judge whether text data includes the unrest in the correspondence pre-established
Code, the correspondence include mess code and the correspondence of operation.If text data includes the mess code in correspondence, according to right
It should be related to, text data is handled using operation corresponding with the mess code, so as to which the mess code in text data can be eliminated.
In this way, the text data for including mess code, its mess code with correspondence is compared, if text data includes the correspondence
The mess code of relationship, the operation that the correspondence may be used eliminate the mess code from this article notebook data, and such mess code is eliminated
Method implements convenient and efficient, and the intervention without developer and operation maintenance personnel can be realized.
Fig. 3 is a kind of structure diagram of mess code processing unit provided in an embodiment of the present invention.The device can be integrated in place
It manages in equipment, to perform above-mentioned Fig. 1 and method shown in Fig. 2, refering to Fig. 3, the device of the embodiment of the present invention includes:
Acquiring unit 301, for obtaining text data;
Judging unit 302, it is corresponding to close for judging whether text data includes the mess code in the correspondence pre-established
System includes mess code and the correspondence of operation;
Processing unit 303, if including the mess code in correspondence for text data, according to correspondence, using with
The corresponding operation of the mess code handles text data, to eliminate the mess code in text data.
Optionally,
Correspondence includes at least two types, and different types of correspondence corresponds to different types of operation and difference
The priority of grade;
Judging unit 302 is additionally operable to the hierarchal order according to priority, by using different types of correspondence after arriving first
Judge whether text data includes the mess code in correspondence.
Optionally,
Correspondence includes the first correspondence, the second correspondence, third correspondence and the 4th correspondence,
First correspondence includes the first mess code and the correspondence of the first operation, and the first operation is will include the first mess code
Pending text data replace with the first text, the first mess code includes all characters in pending text data;
Second correspondence includes the second mess code and the correspondence of the second operation, and the second operation is will include the second mess code
Pending text data replace with the second text, the second mess code is the partial character in pending text data;
Third correspondence includes the correspondence that third mess code and third operate, and third operation is from treating by third mess code
It is deleted in processing text data;
4th correspondence includes the 4th mess code and the correspondence of the 4th operation, and the 4th operation is will include the 4th mess code
Pending text data be hidden.
Optionally,
The priority level of correspondence is followed successively by from high to low:First correspondence, the second correspondence, third correspond to
Relationship and the 4th correspondence.
Optionally,
The number of correspondence is including multiple, and each correspondence further includes the first pre-set level, and text data further includes
Second pre-set level,
The device of the embodiment of the present invention further includes:
Index determination unit 305, for determining the first pre-set level and the second pre-set level pair from multiple correspondences
The target correspondence answered;
Judging unit 302 is additionally operable to judge the mess code whether text data is included in target correspondence.
Optionally,
First pre-set level is the first settling time of correspondence, and the second pre-set level is established for the second of text data
Time.
Optionally,
For the number of correspondence including multiple, each correspondence further includes user name,
The device of the embodiment of the present invention further includes:
Name acquiring unit 306, for obtaining the user name of current operation user;
Title determination unit 304, for determining user name and the user name of current operation user from multiple correspondences
Identical correspondence;
Judging unit 302 is additionally operable to judge the mess code whether text data is included in the correspondence determined.
In conclusion after acquiring unit 301 obtains text data, it is pre- that judging unit 302 judges whether text data includes
Mess code in the correspondence first established, the correspondence include mess code and the correspondence of operation.If text data include pair
Mess code in should being related to, then processing unit 303 is according to correspondence, using operation corresponding with the mess code to text data progress
Processing, so as to which the mess code in text data can be eliminated.In this way, the text data for including mess code, by itself and correspondence
Mess code be compared, if text data includes the mess code of the correspondence, the operation of the correspondence may be used by the unrest
Code eliminated from this article notebook data, such mess code removing method implement it is convenient and efficient, without developer and O&M people
The intervention of member can be realized.
It is apparent to those skilled in the art that for convenience and simplicity of description, the system of foregoing description,
The specific work process of device and unit can refer to the corresponding process in preceding method embodiment, and details are not described herein.
In several embodiments provided herein, it should be understood that disclosed system, device and method can be with
It realizes by another way.For example, the apparatus embodiments described above are merely exemplary, for example, the unit
It divides, only a kind of division of logic function can have other dividing mode, such as multiple units or component in actual implementation
It may be combined or can be integrated into another system or some features can be ignored or does not perform.Another point, it is shown or
The mutual coupling, direct-coupling or communication connection discussed can be the indirect coupling by some interfaces, device or unit
It closes or communicates to connect, can be electrical, machinery or other forms.
The unit illustrated as separating component may or may not be physically separate, be shown as unit
The component shown may or may not be physical unit, you can be located at a place or can also be distributed to multiple
In network element.Some or all of unit therein can be selected according to the actual needs to realize the mesh of this embodiment scheme
's.
In addition, each functional unit in each embodiment of the present invention can be integrated in a processing unit, it can also
That each unit is individually physically present, can also two or more units integrate in a unit.Above-mentioned integrated list
The form that hardware had both may be used in member is realized, can also be realized in the form of SFU software functional unit.
If the integrated unit is realized in the form of SFU software functional unit and is independent product sale or uses
When, it can be stored in a computer read/write memory medium.Based on such understanding, technical scheme of the present invention is substantially
The part to contribute in other words to the prior art or all or part of the technical solution can be in the form of software products
It embodies, which is stored in a storage medium, is used including some instructions so that a computer
Equipment (can be personal computer, server or the network equipment etc.) performs the complete of each embodiment the method for the present invention
Portion or part steps.And aforementioned storage medium includes:USB flash disk, mobile hard disk, read-only memory (ROM, Read-Only
Memory), random access memory (RAM, Random Access Memory), magnetic disc or CD etc. are various can store journey
The medium of sequence code.
The above, the above embodiments are merely illustrative of the technical solutions of the present invention, rather than its limitations;Although with reference to before
Embodiment is stated the present invention is described in detail, it will be understood by those of ordinary skill in the art that:It still can be to preceding
The technical solution recorded in each embodiment is stated to modify or carry out equivalent replacement to which part technical characteristic;And these
Modification is replaced, the spirit and scope for various embodiments of the present invention technical solution that it does not separate the essence of the corresponding technical solution.
Claims (10)
1. a kind of mess code processing method, which is characterized in that including:
Obtain text data;
Judge whether the text data includes the mess code in the correspondence that pre-establishes, the correspondence include mess code and
The correspondence of operation;
If the text data includes the mess code in the correspondence, according to the correspondence, using with the mess code
Corresponding operation handles the text data, to eliminate the mess code in the text data.
2. according to the method described in claim 1, it is characterized in that,
The correspondence includes at least two types, and different types of correspondence corresponds to different types of operation and difference
The priority of grade;
It is described to judge whether the text data includes the mess code in the correspondence pre-established, including:
According to the hierarchal order of the priority, by judging that the text data is using different types of correspondence after arriving first
The no mess code including in correspondence.
3. according to the method described in claim 1, it is characterized in that,
The correspondence includes the first correspondence, the second correspondence, third correspondence and the 4th correspondence,
First correspondence includes the first mess code and the correspondence of the first operation, and first operation is described for that will include
The pending text data of first mess code replaces with the first text, and first mess code is included in the pending text data
All characters;
Second correspondence includes the second mess code and the correspondence of the second operation, and second operation is described for that will include
The pending text data of second mess code replaces with the second text, and second mess code is the portion in the pending text data
Divide character;
The correspondence that the third correspondence includes third mess code and third operates, the third operation is by the third
Mess code is deleted from pending text data;
4th correspondence includes the 4th mess code and the correspondence of the 4th operation, and the 4th operation is described for that will include
The pending text data of 4th mess code is hidden.
4. according to the method described in claim 3, it is characterized in that,
The priority level of the correspondence is followed successively by from high to low:First correspondence, second correspondence,
The third correspondence and the 4th correspondence.
5. method according to any one of claims 1 to 4, which is characterized in that
The number of the correspondence is including multiple, and each correspondence further includes the first pre-set level, and the text data is also
Including the second pre-set level,
It is described judge whether the text data includes the mess code in the correspondence that pre-establishes before, the method is also wrapped
It includes:
Determine that first pre-set level target corresponding with second pre-set level is corresponding from the multiple correspondence
Relationship;
It is described to judge whether the text data includes the mess code in the correspondence pre-established, including:
Judge whether the text data includes the mess code in the target correspondence.
6. according to the method described in claim 5, it is characterized in that,
First pre-set level is the first settling time of the correspondence, and second pre-set level is the textual data
According to the second settling time.
7. method according to any one of claims 1 to 4, which is characterized in that
For the number of the correspondence including multiple, each correspondence further includes user name,
It is described judge whether the text data includes the mess code in the correspondence that pre-establishes before, the method is also wrapped
It includes:
Obtain the user name of current operation user;
The user name correspondence identical with the user name of the current operation user is determined from the multiple correspondence;
It is described to judge whether the text data includes the mess code in the correspondence pre-established, including:
Judge whether the text data includes the mess code in the correspondence determined.
8. a kind of mess code processing unit, which is characterized in that including:
Acquiring unit, for obtaining text data;
Judging unit, for judging whether the text data includes the mess code in the correspondence pre-established, the correspondence
Relationship includes mess code and the correspondence of operation;
Processing unit if including the mess code in the correspondence for the text data, according to the correspondence, makes
The text data is handled with operation corresponding with the mess code, to eliminate the mess code in the text data.
9. device according to claim 8, which is characterized in that
The correspondence includes at least two types, and different types of correspondence corresponds to different types of operation and difference
The priority of grade;
The judging unit is additionally operable to the hierarchal order according to the priority, by using different types of corresponding pass after arriving first
System judges whether the text data includes the mess code in correspondence.
10. device according to claim 8, which is characterized in that
The correspondence includes the first correspondence, the second correspondence, third correspondence and the 4th correspondence,
First correspondence includes the first mess code and the correspondence of the first operation, and first operation is described for that will include
The pending text data of first mess code replaces with the first text, and first mess code is included in the pending text data
All characters;
Second correspondence includes the second mess code and the correspondence of the second operation, and second operation is described for that will include
The pending text data of second mess code replaces with the second text, and second mess code is the portion in the pending text data
Divide character;
The correspondence that the third correspondence includes third mess code and third operates, the third operation is by the third
Mess code is deleted from pending text data;
4th correspondence includes the 4th mess code and the correspondence of the 4th operation, and the 4th operation is described for that will include
The pending text data of 4th mess code is hidden.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611264769.XA CN108271041B (en) | 2016-12-30 | 2016-12-30 | Method and device for processing messy codes |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611264769.XA CN108271041B (en) | 2016-12-30 | 2016-12-30 | Method and device for processing messy codes |
Publications (2)
Publication Number | Publication Date |
---|---|
CN108271041A true CN108271041A (en) | 2018-07-10 |
CN108271041B CN108271041B (en) | 2021-01-22 |
Family
ID=62770158
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201611264769.XA Active CN108271041B (en) | 2016-12-30 | 2016-12-30 | Method and device for processing messy codes |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108271041B (en) |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0728810A (en) * | 1993-07-14 | 1995-01-31 | Matsushita Electric Ind Co Ltd | Character processing method and device therefor |
CN102479174A (en) * | 2010-11-23 | 2012-05-30 | 盛乐信息技术(上海)有限公司 | Chinese character automatic checking and error-correcting system aiming at GBK (Chinese Internal Code Specification) encoding and method thereof |
CN104424010A (en) * | 2013-09-06 | 2015-03-18 | 北大方正集团有限公司 | Method and system for detecting and repairing text document messy codes |
CN104516862A (en) * | 2013-09-29 | 2015-04-15 | 北大方正集团有限公司 | Method and system for selecting and reading coded format of target document |
CN104750663A (en) * | 2013-12-27 | 2015-07-01 | 阿里巴巴集团控股有限公司 | Identification method and device for text messy codes in page |
CN105426390A (en) * | 2015-10-23 | 2016-03-23 | 广东小天才科技有限公司 | Image recognition-based question search method and system |
-
2016
- 2016-12-30 CN CN201611264769.XA patent/CN108271041B/en active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0728810A (en) * | 1993-07-14 | 1995-01-31 | Matsushita Electric Ind Co Ltd | Character processing method and device therefor |
CN102479174A (en) * | 2010-11-23 | 2012-05-30 | 盛乐信息技术(上海)有限公司 | Chinese character automatic checking and error-correcting system aiming at GBK (Chinese Internal Code Specification) encoding and method thereof |
CN104424010A (en) * | 2013-09-06 | 2015-03-18 | 北大方正集团有限公司 | Method and system for detecting and repairing text document messy codes |
CN104516862A (en) * | 2013-09-29 | 2015-04-15 | 北大方正集团有限公司 | Method and system for selecting and reading coded format of target document |
CN104750663A (en) * | 2013-12-27 | 2015-07-01 | 阿里巴巴集团控股有限公司 | Identification method and device for text messy codes in page |
CN105426390A (en) * | 2015-10-23 | 2016-03-23 | 广东小天才科技有限公司 | Image recognition-based question search method and system |
Also Published As
Publication number | Publication date |
---|---|
CN108271041B (en) | 2021-01-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN100501732C (en) | Displaying events by category based on a logarithmic timescale | |
CN102483731B (en) | Have according to search load by the medium of the fingerprint database of equilibrium | |
CN104317806B (en) | Financial data inquiry method and financial data system | |
CN108920611B (en) | Article generation method, device, equipment and storage medium | |
CN108153719A (en) | Merge the method and apparatus of electrical form | |
EP3800559A1 (en) | Accessing datasets | |
KR101500294B1 (en) | Patent Analysis System and Method therefor and Computer Readable Recording Medium whereon Program therefor is Recorded | |
US9355227B2 (en) | Dynamic document display personalization implemented in a digital rights management system | |
CN107015987A (en) | A kind of method and apparatus for updating and searching for database | |
CN104915426A (en) | Information sorting method, method for generating information ordering models and device | |
CN109857661B (en) | Method and system for intelligently generating test cases based on big data analysis | |
CN106294785A (en) | Content Selection method and system | |
CN106599291B (en) | Data grouping method and device | |
CN106611031A (en) | Data query method and device | |
CN103823614A (en) | Information processing method, device and electronic equipment | |
CN101770474A (en) | History searching record-based searching method and device | |
CN112214557B (en) | Data matching classification method and device | |
CN106569683A (en) | Method and equipment for performing batch processing of applications on mobile terminal | |
CN108271041A (en) | Mess code treating method and apparatus | |
CN110825947B (en) | URL deduplication method, device, equipment and computer readable storage medium | |
CN109977977A (en) | A kind of method and corresponding intrument identifying potential user | |
CN107492036B (en) | Insurance policy escrow system | |
KR20130126012A (en) | Method and apparatusfor providing report of business intelligence | |
CN106779909A (en) | Material matching process and device | |
CN111782684B (en) | Distribution network electronic handover information matching method and device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
CB02 | Change of applicant information |
Address after: 100080 No. 401, 4th Floor, Haitai Building, 229 North Fourth Ring Road, Haidian District, Beijing Applicant after: Beijing Guoshuang Technology Co.,Ltd. Address before: 100086 Cuigong Hotel, 76 Zhichun Road, Shuangyushu District, Haidian District, Beijing Applicant before: Beijing Guoshuang Technology Co.,Ltd. |
|
CB02 | Change of applicant information | ||
GR01 | Patent grant | ||
GR01 | Patent grant |