CN108271041A - Mess code treating method and apparatus - Google Patents

Mess code treating method and apparatus Download PDF

Info

Publication number
CN108271041A
CN108271041A CN201611264769.XA CN201611264769A CN108271041A CN 108271041 A CN108271041 A CN 108271041A CN 201611264769 A CN201611264769 A CN 201611264769A CN 108271041 A CN108271041 A CN 108271041A
Authority
CN
China
Prior art keywords
correspondence
mess code
text data
mess
code
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201611264769.XA
Other languages
Chinese (zh)
Other versions
CN108271041B (en
Inventor
焦张波
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Gridsum Technology Co Ltd
Original Assignee
Beijing Gridsum Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Gridsum Technology Co Ltd filed Critical Beijing Gridsum Technology Co Ltd
Priority to CN201611264769.XA priority Critical patent/CN108271041B/en
Publication of CN108271041A publication Critical patent/CN108271041A/en
Application granted granted Critical
Publication of CN108271041B publication Critical patent/CN108271041B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/235Processing of additional data, e.g. scrambling of additional data or processing content descriptors
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/435Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/61Network physical structure; Signal processing
    • H04N21/6156Network physical structure; Signal processing specially adapted to the upstream path of the transmission network
    • H04N21/6175Network physical structure; Signal processing specially adapted to the upstream path of the transmission network involving transmission via Internet

Abstract

The embodiment of the invention discloses a kind of mess code treating method and apparatus, for facilitating the mess code for eliminating text data.Present invention method includes:Obtain text data;Judge whether the text data includes the mess code in the correspondence pre-established, the correspondence includes mess code and the correspondence of operation;If the text data includes the mess code in the correspondence, according to the correspondence, the text data is handled using operation corresponding with the mess code, to eliminate the mess code in the text data.In this way, for including the text data of mess code, its mess code with correspondence is compared, if text data includes the mess code of the correspondence, the operation that the correspondence may be used eliminates the mess code from this article notebook data, such mess code removing method implements convenient and efficient, and the intervention without developer and operation maintenance personnel can be realized.

Description

Mess code treating method and apparatus
Technical field
The present invention relates to data processing field more particularly to a kind of mess code treating method and apparatus.
Background technology
It is collected the problem of because of gatherer process or the reason of equipment in the text data collected by equipment Data often will appear mess code.
For example, in IPTV Data processings, obtained since data source may be by equipment acquisition, obtained data can Mess code can be come out, as shown in following table one:
Table one:
Channel Viewing number Watch duration
Satellite TV of China~~ 1000 2300
Central ## channels A 2000 4000
%@satellite TVs of China 3000 5300
xx4486e 100 5000
User often wants developer or operation maintenance personnel to intervene, logarithm in the mess code in these text datas It is optimized according to the equipment of acquisition or algorithm etc., such settling mode often spends the more time, and cumbersome.
Invention content
An embodiment of the present invention provides a kind of mess code treating method and apparatus, for facilitating the mess code for eliminating text data.
In order to solve the above-mentioned technical problem, an embodiment of the present invention provides following technical schemes:
A kind of mess code processing method, including:
Obtain text data;
Judge whether the text data includes the mess code in the correspondence pre-established, the correspondence is included disorderly Code and the correspondence of operation;
If the text data includes the mess code in the correspondence, according to the correspondence, using with it is described The corresponding operation of mess code in text data handles the text data, to eliminate the mess code in the text data.
In order to solve the above-mentioned technical problem, the embodiment of the present invention additionally provides following technical scheme:
A kind of mess code processing unit, including:
Acquiring unit, for obtaining text data;
Judging unit, it is described for judging whether the text data includes the mess code in the correspondence pre-established Correspondence includes mess code and the correspondence of operation;
Processing unit, if including the mess code in the correspondence for the text data, according to the corresponding pass System is handled the text data using operation corresponding with the mess code, to eliminate the mess code in the text data.
As can be seen from the above technical solutions, the embodiment of the present invention has the following advantages:
After obtaining text data, judge whether text data includes the mess code in the correspondence pre-established, the correspondence Relationship includes mess code and the correspondence of operation.If text data includes the mess code in correspondence, according to correspondence, make Text data is handled with operation corresponding with the mess code, so as to which the mess code in text data can be eliminated.In this way, for Its mess code with correspondence is compared by the text data including mess code, if text data includes the unrest of the correspondence Code, the operation that the correspondence may be used eliminate the mess code from this article notebook data, and such mess code removing method is realized Get up convenient and efficient, the intervention without developer and operation maintenance personnel can be realized.
Description of the drawings
Fig. 1 is the method flow diagram of a kind of mess code processing method that one embodiment of the invention provides;
Fig. 2 is the method flow diagram of a kind of mess code processing method that another embodiment of the present invention provides;
Fig. 3 is the structure diagram of a kind of mess code processing unit that another embodiment of the present invention provides.
Specific embodiment
An embodiment of the present invention provides a kind of mess code treating method and apparatus, for facilitating the mess code for eliminating text data.
Fig. 1 is a kind of method flow diagram of mess code processing method provided in an embodiment of the present invention.Refering to Fig. 1, the present invention is real The method for applying example includes:
Step 101:Obtain text data;
Step 102:Judge whether text data includes the mess code in the correspondence pre-established, which includes Mess code and the correspondence of operation;If text data includes the mess code in correspondence, step 103 is performed.
Step 103:According to correspondence, text data is handled using operation corresponding with the mess code, to eliminate Mess code in text data.
Optionally,
Correspondence includes at least two types, and different types of correspondence corresponds to different types of operation and difference The priority of grade;
Judge whether text data includes the mess code in the correspondence pre-established, including:
According to the hierarchal order of priority, by judging whether text data wraps using different types of correspondence after arriving first Include the mess code in correspondence.
Optionally,
Correspondence includes the first correspondence, the second correspondence, third correspondence and the 4th correspondence,
First correspondence includes the first mess code and the correspondence of the first operation, and the first operation is will include the first mess code Pending text data replace with the first text, the first mess code includes all characters in pending text data;
Second correspondence includes the second mess code and the correspondence of the second operation, and the second operation is will include the second mess code Pending text data replace with the second text, the second mess code is the partial character in pending text data;
Third correspondence includes the correspondence that third mess code and third operate, and third operation is from treating by third mess code It is deleted in processing text data;
4th correspondence includes the 4th mess code and the correspondence of the 4th operation, and the 4th operation is will include the 4th mess code Pending text data be hidden.
Optionally,
The priority level of correspondence is followed successively by from high to low:First correspondence, the second correspondence, third correspond to Relationship and the 4th correspondence.
Optionally,
The number of correspondence is including multiple, and each correspondence further includes the first pre-set level, and text data further includes Second pre-set level,
Before judging whether text data includes the mess code in the correspondence pre-established, the method for the embodiment of the present invention It further includes:
The first pre-set level target correspondence corresponding with the second pre-set level is determined from multiple correspondence;
Judge whether text data includes the mess code in the correspondence pre-established, including:
Judge whether text data includes the mess code in target correspondence.
Optionally,
First pre-set level is the first settling time of correspondence, and the second pre-set level is established for the second of text data Time.
Optionally,
For the number of correspondence including multiple, each correspondence further includes user name,
Before judging whether text data includes the mess code in the correspondence pre-established, the method for the embodiment of the present invention It further includes:
Obtain the user name of current operation user;
The user name correspondence identical with the user name of current operation user is determined from multiple correspondence;
Judge whether text data includes the mess code in the correspondence pre-established, including:
Judge whether text data includes the mess code in the correspondence determined.
In conclusion after obtaining text data, judge whether text data includes the unrest in the correspondence pre-established Code, the correspondence include mess code and the correspondence of operation.If text data includes the mess code in correspondence, according to right It should be related to, text data is handled using operation corresponding with the mess code, so as to which the mess code in text data can be eliminated. In this way, the text data for including mess code, its mess code with correspondence is compared, if text data includes the correspondence The mess code of relationship, the operation that the correspondence may be used eliminate the mess code from this article notebook data, and such mess code is eliminated Method implements convenient and efficient, and the intervention without developer and operation maintenance personnel can be realized.
Fig. 2 is a kind of mess code processing method provided in an embodiment of the present invention.With reference to the above and referring to Fig.2, below Embodiment shown in Fig. 2 is illustrated.
Before the flow to the method for embodiment shown in Fig. 2 is described, first the method for the embodiment of the present invention is used To correspondence illustrate, to make place mat.
In order to eliminate the mess code in the text data got in the method for the embodiment of the present invention, need to use corresponding pass System, the correspondence include multiple dimensions, which mainly includes the correspondence of mess code and operation.The correspondence For mess code for being matched with the character in text data, whether matching is identical, if matching is identical, performs the correspondence In operation corresponding with the mess code, the operation include but not limited to replace text data for preset text, delete mess code, hidden Tibetan includes text data of the mess code etc..In order to which text data is replaced with preset text, the correspondence further include with The corresponding pre-set text dimension of mess code.
It is appreciated that the dimension of the correspondence can also include much information, such as settling time, user name etc..
In the embodiment having in the present invention, priority level can also be distributed to different correspondences, according to different Priority determines to use sequence to different correspondence.
That is, correspondence include at least two types, different types of correspondence correspond to it is different types of operation and Different grades of priority.According to the hierarchal order of these priority, these different types of correspondences are selected one by one It selects, judges whether text data includes the mess code in the correspondence pre-established so that the correspondence execution selected is subsequent The step of.
About the specific situation of correspondence, such as can be as follows:
Correspondence includes the first correspondence, the second correspondence, third correspondence and the 4th correspondence.
1.1 first correspondences include the first mess code and the correspondence of the first operation.First operation is will include first The pending text data of mess code replaces with the first text, and the first mess code includes all characters in pending text data.That is, First correspondence includes the correspondence of the first mess code, the first text and the first operation three, if text data includes first Mess code, and all characters of text data be first mess code, i.e., text data be the first mess code, then according to this first operation, Text data is replaced using the first text.
1.2 second correspondences include the second mess code and the correspondence of the second operation, and the second operation is will include second The pending text data of mess code replaces with the second text, and the second mess code is the partial character in pending text data.That is, the One correspondence includes the correspondence of the second mess code, the second text and the second operation three, if text data includes second disorderly Code, as long as second mess code belongs to the character of the part in this article notebook data, you can according to second operation, use the second text Replace text data.The difference of first correspondence and the second correspondence is to be, the first mess code and pending text Data are equivalent, and the second mess code belongs to the partial data in pending text data.
The correspondence that 1.3 third correspondences include third mess code and third operates, third operation are by third mess code It is deleted from pending text data.If i.e. text data includes third mess code, operated according to third, by the third mess code from It is deleted in text data.
1.4 the 4th correspondences include the 4th mess code and the correspondence of the 4th operation, and the 4th operation is will include the 4th The pending text data of mess code is hidden.If i.e. text data includes the 4th mess code, according to the 4th operation, by the text The entire data of data are all hidden, i.e., the text data for not including the 4th mess code to this in display is shown.
In the embodiment of setting priority, the priority level of above-mentioned correspondence is followed successively by from high to low:First pair It should be related to, the second correspondence, third correspondence and the 4th correspondence.
In an embodiment of the present invention, which pre-establishes, can be user oneself on a processing device It establishes or user downloads to obtain from server-side, to the acquisition source of correspondence, the embodiment of the present invention does not do specific limit It is fixed.
In order to more intuitively be illustrated to the above, below i.e. in IPTV fields, user establish correspondence into Row description.Wherein, processing equipment is the equipment for the method for performing the embodiment of the present invention, can be the equipment such as computer.
The relation table that user Zhang San logs in processing equipment using user name " Zhang San " establishes interface.In the embodiment having, User Zhang San can also use " public " option to log in the relation table and establish interface.If being logged in using particular user name, establish The upper user name of correspondence mark, subsequently may be such that the correspondence only has the user of the user name just usable;If make With the option of the public, then the mark of the upper public of correspondence mark established represents that the correspondence can be by the institute user of the machine It uses.
Then Zhang San operates the processing equipment and performs data query operation, and query result is as shown in the table, mid band dimension The data of degree are the row in table, which is text data above.In order to describe simplicity, it is with above-mentioned table one now The query result, i.e.,
Table one
Channel Viewing number Watch duration
Satellite TV of China~~ 1000 2300
Central ## channels A 2000 4000
%@satellite TVs of China 3000 5300
xx4486e 100 5000
The data of the query result are equipment acquisitions, it sometimes appear that mess code, such as the channel data of table two occur Mess code, thus user perform operations described below, to eliminate these mess codes and establish follow-up correspondence to be used.
2.1 establish accurate replacement correspondence --- the first correspondence
User Zhang San is by processing equipment, the total data of selection " satellite TV of China~~", by above-mentioned table two " China defends Depending on~~" replace with " satellite TV of China ", and current processing time is 11 days 13 December in 2016:00, in this way, in processing equipment It is upper to establish a correspondence, a data is obtained, shown in following Table A 1.
Table A 1.1:
First row Secondary series Third arranges 4th row
" satellite TV of China~~" " satellite TV of China " 2016121113:00 Zhang San
If Zhang San was at second day, after inquiring new data, found again in query result " satellite TV of China~~", so as to The total data of selection " satellite TV of China~~" replaces with " satellite TV of China~~" " satellite TV of China A ", and current processing Time is 12 days 10 December in 2016:00, in this way, establishing a correspondence on a processing device, obtain another number According to shown in following Table A 2.
Table A 1.2:
First row Secondary series Third arranges 4th row
Satellite TV of China~~ Satellite TV of China A 2016121113:00-2016121210:00 Zhang San
Two datas in above-mentioned 2.1, i.e. the two correspondences, should corresponding to the first correspondence in above-mentioned 1.1 Table can be used to represent in the concrete numerical value of correspondence, Table A 1.1 as escribed above, has four row in table, and first row is mess code, the Two row are the first texts of mapping, and third row are the periods, represent the period that this record comes into force, the 4th row are user names Claim.
Wherein first operation can be recorded in the form of code or on the correspondence plus the first operation Identification information, as long as equipment subsequently read the identification information can perform first operation or preserve first relationship pass When being, which is stored in the storage unit for the correspondence for preserving the first class of operation, passes through above-mentioned specific side Formula can establish the first operation in the first correspondence.
2.2 establish fuzzy replacement correspondence --- the second correspondence
User Zhang San, select above-mentioned table two channel data " central ## channels " in " center ", and control process is set Alternative obscures replacement option, which is replaced with " central satellite TV ", and current time is on December 11st, 2016 13:40, then correspondence is established on a processing device, obtains following data:
Table A 2:
First row Secondary series Third arranges 4th row
Center Central satellite TV 2016121113:40 Zhang San
Data in above-mentioned 2.2, i.e. this correspondence, corresponding to the second correspondence in above-mentioned 1.2, which closes Table can be used to represent in the concrete numerical value of system, Table A 2 as escribed above, there is four row in table, and first row is mess code, and secondary series is to reflect The second text penetrated, third row are the periods, represent the period that this record comes into force, the 4th row are user's names.
The foundation of the second operation in wherein the second correspondence can refer to described in above-mentioned 1.1.
2.3 establish deletion correspondence --- third correspondence
User Zhang San selects "@" character of the channel data of above-mentioned table two, selects after determining option, reselection channel number According to " % " character, then control process equipment selection delete option, current time be 11 days 13 December in 2016:46, then exist Correspondence is established in processing equipment, obtains following data.
Table A 3:
First row Secondary series Third arranges
" % " 2016121113:46 Zhang San
“@” 2016121113:46 Zhang San
Data in above-mentioned 23, i.e. this correspondence, corresponding to the third correspondence in above-mentioned 1.3, which closes Table can be used to represent in the concrete numerical value of system, Table A 3 as escribed above, there is three row in table, and first row stores mess code character, such as " % ,~" etc., secondary series is the period, represents the period that this record comes into force, third row are user's names.
The foundation of third operation wherein in third correspondence can refer to described in above-mentioned 1.1.
2.4 establish hiding correspondence --- the 4th correspondence
User Zhang San selects the channel data " xx4486e " in the channel data of above-mentioned table two, and control process equipment is selected It selects and hides Options, current time is 11 days 13 December in 2016:50, then establish correspondence on a processing device, obtain as Lower data:
Table A 4:
First row Third arranges 4th row
xx4486e 2016121113:50 Zhang San
Data in above-mentioned 2.4, i.e. this correspondence, corresponding to the 4th correspondence in above-mentioned 1.4, which closes Table can be used to represent in the concrete numerical value of system, Table A 4 as escribed above, there is three row in table, first row storage mess code character, and second Row are the periods, represent the period that this record comes into force, third row are user's names.
The foundation of the 4th operation in wherein the 4th correspondence can refer to described in above-mentioned 1.4.
Wherein, the foundation of priority can be that equipment is pre-set, such as according to the operations of different correspondences Classification sets priority to different correspondence respectively, for example, during above-mentioned correspondence is established, equipment preset including Operation for the correspondence accurately replaced be the first priority, including operation for the fuzzy correspondence replaced be second excellent First grade, including operation be the correspondence deleted be third priority, including operation for hiding correspondence be the 4th Priority.
The settling time of wherein above-mentioned each table and other dimensional informations of the entitled each correspondence of user, according to these dimensions Degree information may be such that these correspondences, and there are many occupation modes, meet the diversified demand of user.Wherein, settling time One pre-set level, further included on the text data subsequently obtained with matched second pre-set level of first pre-set level, i.e., First pre-set level is matched available for index corresponding on text data.Certainly, these first, second pre-set levels It can also be other types of information.
It is appreciated that above-mentioned each type correspondence can include it is multiple, such as the first correspondence i.e. wrap Two have been included, more can also have been included, such as 5,12.
It establishes after completing these correspondences, you can store these correspondences on a processing device, storage mode has It is a variety of, such as stored etc. using above-mentioned form form storage or using character string forms.
After the completion of the foundation of above-mentioned correspondence, you can the operation of subsequent processing text data is performed, it is as described below to retouch It states, in order to describe more intuitive, is described below using IPTV fields, text data as channel data.
Step 201:Obtain inquiry data set.
Wherein inquiry data set includes channel data, which is the text data of the embodiment of the present invention, at this It is possible that mess code in channel data.Wherein mess code refers to character to be processed, these mess codes are often that user does not need to use Or it is wrong.
For example, after user Zhang San logs in the data query page of IPTV processing equipments using user name " Zhang San ", selection is looked into Operation is ask, sends out inquiry instruction, processing equipment is to obtain the inquiry data set for including channel data, as shown in following table two.
Table is Y.0:
Channel Viewing number Watch duration
Satellite TV of China~~ 1000 2300
Central ## channels A 2000 4000
%@satellite TVs of China 3000 5300
xx4486e 100 5000
Processing equipment can obtain the correspondence of above-mentioned foundation after inquiry data set is got from storage unit, with Inquire and handle the mess code of the channel data in inquiry data set.
After the correspondence pre-established is obtained, first these correspondences can be screened, met with selecting It is required that correspondence to carry out subsequent mess code processing.
For example, optionally, for the number of correspondence including multiple, each correspondence further includes the first pre-set level, text Notebook data further includes the second pre-set level.So as to judge whether text data includes the mess code in the correspondence pre-established Before, the method for the embodiment of the present invention further includes:The first pre-set level and the second default finger are determined from multiple correspondence Mark corresponding target correspondence.When subsequently judging whether text data includes the mess code in the correspondence pre-established, i.e., It can be realized by judging whether text data includes the mess code in the target correspondence.
First, second advance index include diversified forms, such as the first pre-set level for correspondence first establish when Between, the second pre-set level is the second settling time of text data.
Alternatively, the screening to correspondence can also be realized by following modes:
For the number of correspondence including multiple, each correspondence further includes user name,
Before judging whether text data includes the mess code in the correspondence pre-established, method further includes:It obtains and works as The user name of preceding operation user;User name pair identical with the user name of current operation user is determined from multiple correspondence It should be related to.It, can be by sentencing so as to which the step of whether text data includes the mess code in the correspondence pre-established subsequently judged Whether disconnected text data is realized including the mess code in the correspondence determined.
For example, processing equipment is carried out at screening the correspondence pre-established according to the type and Query Dates of user Reason, is filtered, mistake according to the user name on first, second, third, fourth correspondence to be prestored and settling time respectively The correspondence filtered out meets the two requirements simultaneously:The user name of correspondence and the user name " Zhang San " of current operation user The acquisition time of identical, inquiry data set settling time, that is, channel data belonged in the range of the settling time of correspondence.
Wherein, the user in correspondence is entitled " public ", then the correspondence is suitable for all user's uses, i.e., User name " public " and the user name of any other current operation users in correspondence regard identical as.
The correspondence obtained in this way is respectively the subset of first, second, third, fourth correspondence.Then, using sieve The correspondence selected performs following flows.
Step 202:Judge whether channel data includes the mess code in the correspondence pre-established.If channel data includes Mess code in correspondence, then perform step 203.
Step 203:According to correspondence, operated to channel data using corresponding with the mess code in channel data Reason, to eliminate the mess code in channel data.
If channel data includes the mess code in correspondence, according to correspondence, using with the mess code in channel data Corresponding operation handles channel data, to eliminate the mess code in channel data.If channel data does not include correspondence In mess code then without processing.
If correspondence include at least two types, and different types of correspondence correspond to it is different types of operation and Different grades of priority;
Then judging the specific implementation procedure whether text data includes the mess code in the correspondence pre-established is:According to The hierarchal order of priority, by judging whether text data includes in correspondence using different types of correspondence after arriving first Mess code.
That is, before the mess code in judging whether text data includes the correspondence pre-established, according to priority Hierarchal order, selects correspondence one by one, and above-mentioned step 202 and step are performed in turn with the correspondence selected 203, until these correspondences all used or the channel data of the inquiry data set of step 201 was all judged as extremely.
For example, correspondence, which is the accurate of above-mentioned foundation completion, replaces correspondence, fuzzy replacement correspondence, deletion pair Should be related to, hide correspondence when, after being screened according to settling time and user name to these correspondences, obtain this four Then the subset of a correspondence performs following FOUR EASY STEPSs according to priority using these subsets.
3.1 first steps accurately match inquiry data set
Obtain accurately replacing the subset of correspondence after the user of the 4th row and tertial temporal filtering is used, so Afterwards step 202 and step 203 are performed using the subset.That is, the channel data row of inquiry data set are the dimension row for occurring mess code, The system of processing equipment can do accurate matching to the first row of the channel column of the table of the inquiry data set Y.0 and Table A 1.1.
Why select Table A 1.1 without select Table A 1.2, be because according to accurately replacement correspondence settling time into Result after row screening.Such as the settling time of settling time, that is, channel data of inquiry data set is on December 11st, 2016, And the settling time of Table A 1.1 is 2016121113:00, A1.2 settling time is 2016121113:00-2016121210: 00, the settling time for inquiring data set belongs to the settling time of Table A 1.1, so as to the correspondence of processing equipment selection Table A 1.1 Accurately matched.
After matching, the channel data of the first row of the inquiry data set " satellite TV of China~~" replaces correspondence to be accurate In one of correspondence in mess code, according to the first of the correspondence the operation, will include the mess code " satellite TV of China~ ~" channel data replace with the first text " satellite TV of China "
It is as follows so as to obtain table Y.1:
Channel Viewing number Watch duration
Satellite TV of China 1000 2300
Central ## channels A 2000 4000
%@satellite TVs of China 3000 5300
xx4486e 100 5000
3.2nd, second step carries out fuzzy matching to inquiry data set
Due to accurately replacing all correspondences impossible to exhaust in correspondence, so this step needs do fuzzy Match.
The fuzzy subset for replacing correspondence is obtained after the user of the 4th row and tertial temporal filtering is used, so Afterwards step 202 and step 203 are performed using the subset.That is, after the first step is handled, the system of processing equipment can be to the inquiry The first row of the table of data set channel column Y.1 and Table A 2 does fuzzy matching.
After matching, the channel data " central ## channels A " of the second row of the inquiry data set includes fuzzy replace and corresponds to Mess code " center " in one of correspondence in relationship according to the second of the correspondence the operation, will include the mess code The channel data in " center " replaces with the second text " central satellite TV ".
It is as follows so as to obtain table Y.2:
Channel Viewing number Watch duration
Satellite TV of China 1000 2300
Central satellite TV 2000 4000
%@satellite TVs of China 3000 5300
xx4486e 100 5000
3.3rd, third walks, and fuzzy matching is carried out to inquiry data set
Third is performed using the table 7.2 that second step obtains to walk.
It obtains deleting the subset of correspondence after the temporal filtering of tertial user and secondary series is used, then makes Step 202 and step 203 are performed with the subset.That is, after second step is handled, the system of processing equipment can be to the inquiry data The first row of the table of collection channel column Y.2 and Table A 3 matches.
After matching, the channel data " %@satellite TVs of China " of the third line of the inquiry data set includes deleting correspondence In wherein two correspondence in mess code " % " and "@", operated according to the third of the correspondence, by mess code " % " and "@" is deleted from channel data.
It is as follows so as to obtain table Y3:
3.4th, the 4th step is hidden matching to inquiry data set
It obtains hiding the subset of correspondence after the temporal filtering of tertial user and secondary series is used, then makes Step 202 and step 203 are performed with the subset.That is, after three step process, the system of processing equipment can be to the inquiry data The first row of the table of collection channel column Y.3 and Table A 4 matches.
After matching, the channel data " xx4486e " of the fourth line of the inquiry data set includes hiding in correspondence Mess code " xx4486e " in one of correspondence will include mess code " xx4486e " according to the 4th of the correspondence the operation Channel data be hidden.
It is as follows so as to obtain table Y.4:
Channel Viewing number Watch duration
Satellite TV of China 1000 2300
Central satellite TV 2000 4000
Satellite TV of China 3000 5300
In this way, into after crossing the processing of above-mentioned steps, the mess code for inquiring the channel data of data set has closely been eliminated, from And solve the Confused-code in text data, in the dimension during this article notebook data is multi-dimensional data, for example, above-mentioned Channel data is the dimension for inquiring data set, solves dimension data Confused-code by the above method.
It by the above method, solves the problems, such as occur mess code in IPTV data, considers various situations comprehensively, mess code Problem can solve by the operation of user, avoid the intervention of developer and operation maintenance personnel.
In conclusion after obtaining text data, judge whether text data includes the unrest in the correspondence pre-established Code, the correspondence include mess code and the correspondence of operation.If text data includes the mess code in correspondence, according to right It should be related to, text data is handled using operation corresponding with the mess code, so as to which the mess code in text data can be eliminated. In this way, the text data for including mess code, its mess code with correspondence is compared, if text data includes the correspondence The mess code of relationship, the operation that the correspondence may be used eliminate the mess code from this article notebook data, and such mess code is eliminated Method implements convenient and efficient, and the intervention without developer and operation maintenance personnel can be realized.
Fig. 3 is a kind of structure diagram of mess code processing unit provided in an embodiment of the present invention.The device can be integrated in place It manages in equipment, to perform above-mentioned Fig. 1 and method shown in Fig. 2, refering to Fig. 3, the device of the embodiment of the present invention includes:
Acquiring unit 301, for obtaining text data;
Judging unit 302, it is corresponding to close for judging whether text data includes the mess code in the correspondence pre-established System includes mess code and the correspondence of operation;
Processing unit 303, if including the mess code in correspondence for text data, according to correspondence, using with The corresponding operation of the mess code handles text data, to eliminate the mess code in text data.
Optionally,
Correspondence includes at least two types, and different types of correspondence corresponds to different types of operation and difference The priority of grade;
Judging unit 302 is additionally operable to the hierarchal order according to priority, by using different types of correspondence after arriving first Judge whether text data includes the mess code in correspondence.
Optionally,
Correspondence includes the first correspondence, the second correspondence, third correspondence and the 4th correspondence,
First correspondence includes the first mess code and the correspondence of the first operation, and the first operation is will include the first mess code Pending text data replace with the first text, the first mess code includes all characters in pending text data;
Second correspondence includes the second mess code and the correspondence of the second operation, and the second operation is will include the second mess code Pending text data replace with the second text, the second mess code is the partial character in pending text data;
Third correspondence includes the correspondence that third mess code and third operate, and third operation is from treating by third mess code It is deleted in processing text data;
4th correspondence includes the 4th mess code and the correspondence of the 4th operation, and the 4th operation is will include the 4th mess code Pending text data be hidden.
Optionally,
The priority level of correspondence is followed successively by from high to low:First correspondence, the second correspondence, third correspond to Relationship and the 4th correspondence.
Optionally,
The number of correspondence is including multiple, and each correspondence further includes the first pre-set level, and text data further includes Second pre-set level,
The device of the embodiment of the present invention further includes:
Index determination unit 305, for determining the first pre-set level and the second pre-set level pair from multiple correspondences The target correspondence answered;
Judging unit 302 is additionally operable to judge the mess code whether text data is included in target correspondence.
Optionally,
First pre-set level is the first settling time of correspondence, and the second pre-set level is established for the second of text data Time.
Optionally,
For the number of correspondence including multiple, each correspondence further includes user name,
The device of the embodiment of the present invention further includes:
Name acquiring unit 306, for obtaining the user name of current operation user;
Title determination unit 304, for determining user name and the user name of current operation user from multiple correspondences Identical correspondence;
Judging unit 302 is additionally operable to judge the mess code whether text data is included in the correspondence determined.
In conclusion after acquiring unit 301 obtains text data, it is pre- that judging unit 302 judges whether text data includes Mess code in the correspondence first established, the correspondence include mess code and the correspondence of operation.If text data include pair Mess code in should being related to, then processing unit 303 is according to correspondence, using operation corresponding with the mess code to text data progress Processing, so as to which the mess code in text data can be eliminated.In this way, the text data for including mess code, by itself and correspondence Mess code be compared, if text data includes the mess code of the correspondence, the operation of the correspondence may be used by the unrest Code eliminated from this article notebook data, such mess code removing method implement it is convenient and efficient, without developer and O&M people The intervention of member can be realized.
It is apparent to those skilled in the art that for convenience and simplicity of description, the system of foregoing description, The specific work process of device and unit can refer to the corresponding process in preceding method embodiment, and details are not described herein.
In several embodiments provided herein, it should be understood that disclosed system, device and method can be with It realizes by another way.For example, the apparatus embodiments described above are merely exemplary, for example, the unit It divides, only a kind of division of logic function can have other dividing mode, such as multiple units or component in actual implementation It may be combined or can be integrated into another system or some features can be ignored or does not perform.Another point, it is shown or The mutual coupling, direct-coupling or communication connection discussed can be the indirect coupling by some interfaces, device or unit It closes or communicates to connect, can be electrical, machinery or other forms.
The unit illustrated as separating component may or may not be physically separate, be shown as unit The component shown may or may not be physical unit, you can be located at a place or can also be distributed to multiple In network element.Some or all of unit therein can be selected according to the actual needs to realize the mesh of this embodiment scheme 's.
In addition, each functional unit in each embodiment of the present invention can be integrated in a processing unit, it can also That each unit is individually physically present, can also two or more units integrate in a unit.Above-mentioned integrated list The form that hardware had both may be used in member is realized, can also be realized in the form of SFU software functional unit.
If the integrated unit is realized in the form of SFU software functional unit and is independent product sale or uses When, it can be stored in a computer read/write memory medium.Based on such understanding, technical scheme of the present invention is substantially The part to contribute in other words to the prior art or all or part of the technical solution can be in the form of software products It embodies, which is stored in a storage medium, is used including some instructions so that a computer Equipment (can be personal computer, server or the network equipment etc.) performs the complete of each embodiment the method for the present invention Portion or part steps.And aforementioned storage medium includes:USB flash disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic disc or CD etc. are various can store journey The medium of sequence code.
The above, the above embodiments are merely illustrative of the technical solutions of the present invention, rather than its limitations;Although with reference to before Embodiment is stated the present invention is described in detail, it will be understood by those of ordinary skill in the art that:It still can be to preceding The technical solution recorded in each embodiment is stated to modify or carry out equivalent replacement to which part technical characteristic;And these Modification is replaced, the spirit and scope for various embodiments of the present invention technical solution that it does not separate the essence of the corresponding technical solution.

Claims (10)

1. a kind of mess code processing method, which is characterized in that including:
Obtain text data;
Judge whether the text data includes the mess code in the correspondence that pre-establishes, the correspondence include mess code and The correspondence of operation;
If the text data includes the mess code in the correspondence, according to the correspondence, using with the mess code Corresponding operation handles the text data, to eliminate the mess code in the text data.
2. according to the method described in claim 1, it is characterized in that,
The correspondence includes at least two types, and different types of correspondence corresponds to different types of operation and difference The priority of grade;
It is described to judge whether the text data includes the mess code in the correspondence pre-established, including:
According to the hierarchal order of the priority, by judging that the text data is using different types of correspondence after arriving first The no mess code including in correspondence.
3. according to the method described in claim 1, it is characterized in that,
The correspondence includes the first correspondence, the second correspondence, third correspondence and the 4th correspondence,
First correspondence includes the first mess code and the correspondence of the first operation, and first operation is described for that will include The pending text data of first mess code replaces with the first text, and first mess code is included in the pending text data All characters;
Second correspondence includes the second mess code and the correspondence of the second operation, and second operation is described for that will include The pending text data of second mess code replaces with the second text, and second mess code is the portion in the pending text data Divide character;
The correspondence that the third correspondence includes third mess code and third operates, the third operation is by the third Mess code is deleted from pending text data;
4th correspondence includes the 4th mess code and the correspondence of the 4th operation, and the 4th operation is described for that will include The pending text data of 4th mess code is hidden.
4. according to the method described in claim 3, it is characterized in that,
The priority level of the correspondence is followed successively by from high to low:First correspondence, second correspondence, The third correspondence and the 4th correspondence.
5. method according to any one of claims 1 to 4, which is characterized in that
The number of the correspondence is including multiple, and each correspondence further includes the first pre-set level, and the text data is also Including the second pre-set level,
It is described judge whether the text data includes the mess code in the correspondence that pre-establishes before, the method is also wrapped It includes:
Determine that first pre-set level target corresponding with second pre-set level is corresponding from the multiple correspondence Relationship;
It is described to judge whether the text data includes the mess code in the correspondence pre-established, including:
Judge whether the text data includes the mess code in the target correspondence.
6. according to the method described in claim 5, it is characterized in that,
First pre-set level is the first settling time of the correspondence, and second pre-set level is the textual data According to the second settling time.
7. method according to any one of claims 1 to 4, which is characterized in that
For the number of the correspondence including multiple, each correspondence further includes user name,
It is described judge whether the text data includes the mess code in the correspondence that pre-establishes before, the method is also wrapped It includes:
Obtain the user name of current operation user;
The user name correspondence identical with the user name of the current operation user is determined from the multiple correspondence;
It is described to judge whether the text data includes the mess code in the correspondence pre-established, including:
Judge whether the text data includes the mess code in the correspondence determined.
8. a kind of mess code processing unit, which is characterized in that including:
Acquiring unit, for obtaining text data;
Judging unit, for judging whether the text data includes the mess code in the correspondence pre-established, the correspondence Relationship includes mess code and the correspondence of operation;
Processing unit if including the mess code in the correspondence for the text data, according to the correspondence, makes The text data is handled with operation corresponding with the mess code, to eliminate the mess code in the text data.
9. device according to claim 8, which is characterized in that
The correspondence includes at least two types, and different types of correspondence corresponds to different types of operation and difference The priority of grade;
The judging unit is additionally operable to the hierarchal order according to the priority, by using different types of corresponding pass after arriving first System judges whether the text data includes the mess code in correspondence.
10. device according to claim 8, which is characterized in that
The correspondence includes the first correspondence, the second correspondence, third correspondence and the 4th correspondence,
First correspondence includes the first mess code and the correspondence of the first operation, and first operation is described for that will include The pending text data of first mess code replaces with the first text, and first mess code is included in the pending text data All characters;
Second correspondence includes the second mess code and the correspondence of the second operation, and second operation is described for that will include The pending text data of second mess code replaces with the second text, and second mess code is the portion in the pending text data Divide character;
The correspondence that the third correspondence includes third mess code and third operates, the third operation is by the third Mess code is deleted from pending text data;
4th correspondence includes the 4th mess code and the correspondence of the 4th operation, and the 4th operation is described for that will include The pending text data of 4th mess code is hidden.
CN201611264769.XA 2016-12-30 2016-12-30 Method and device for processing messy codes Active CN108271041B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201611264769.XA CN108271041B (en) 2016-12-30 2016-12-30 Method and device for processing messy codes

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201611264769.XA CN108271041B (en) 2016-12-30 2016-12-30 Method and device for processing messy codes

Publications (2)

Publication Number Publication Date
CN108271041A true CN108271041A (en) 2018-07-10
CN108271041B CN108271041B (en) 2021-01-22

Family

ID=62770158

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611264769.XA Active CN108271041B (en) 2016-12-30 2016-12-30 Method and device for processing messy codes

Country Status (1)

Country Link
CN (1) CN108271041B (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0728810A (en) * 1993-07-14 1995-01-31 Matsushita Electric Ind Co Ltd Character processing method and device therefor
CN102479174A (en) * 2010-11-23 2012-05-30 盛乐信息技术(上海)有限公司 Chinese character automatic checking and error-correcting system aiming at GBK (Chinese Internal Code Specification) encoding and method thereof
CN104424010A (en) * 2013-09-06 2015-03-18 北大方正集团有限公司 Method and system for detecting and repairing text document messy codes
CN104516862A (en) * 2013-09-29 2015-04-15 北大方正集团有限公司 Method and system for selecting and reading coded format of target document
CN104750663A (en) * 2013-12-27 2015-07-01 阿里巴巴集团控股有限公司 Identification method and device for text messy codes in page
CN105426390A (en) * 2015-10-23 2016-03-23 广东小天才科技有限公司 Image recognition-based question search method and system

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0728810A (en) * 1993-07-14 1995-01-31 Matsushita Electric Ind Co Ltd Character processing method and device therefor
CN102479174A (en) * 2010-11-23 2012-05-30 盛乐信息技术(上海)有限公司 Chinese character automatic checking and error-correcting system aiming at GBK (Chinese Internal Code Specification) encoding and method thereof
CN104424010A (en) * 2013-09-06 2015-03-18 北大方正集团有限公司 Method and system for detecting and repairing text document messy codes
CN104516862A (en) * 2013-09-29 2015-04-15 北大方正集团有限公司 Method and system for selecting and reading coded format of target document
CN104750663A (en) * 2013-12-27 2015-07-01 阿里巴巴集团控股有限公司 Identification method and device for text messy codes in page
CN105426390A (en) * 2015-10-23 2016-03-23 广东小天才科技有限公司 Image recognition-based question search method and system

Also Published As

Publication number Publication date
CN108271041B (en) 2021-01-22

Similar Documents

Publication Publication Date Title
CN100501732C (en) Displaying events by category based on a logarithmic timescale
CN102483731B (en) Have according to search load by the medium of the fingerprint database of equilibrium
CN104317806B (en) Financial data inquiry method and financial data system
CN108920611B (en) Article generation method, device, equipment and storage medium
CN108153719A (en) Merge the method and apparatus of electrical form
EP3800559A1 (en) Accessing datasets
KR101500294B1 (en) Patent Analysis System and Method therefor and Computer Readable Recording Medium whereon Program therefor is Recorded
US9355227B2 (en) Dynamic document display personalization implemented in a digital rights management system
CN107015987A (en) A kind of method and apparatus for updating and searching for database
CN104915426A (en) Information sorting method, method for generating information ordering models and device
CN109857661B (en) Method and system for intelligently generating test cases based on big data analysis
CN106294785A (en) Content Selection method and system
CN106599291B (en) Data grouping method and device
CN106611031A (en) Data query method and device
CN103823614A (en) Information processing method, device and electronic equipment
CN101770474A (en) History searching record-based searching method and device
CN112214557B (en) Data matching classification method and device
CN106569683A (en) Method and equipment for performing batch processing of applications on mobile terminal
CN108271041A (en) Mess code treating method and apparatus
CN110825947B (en) URL deduplication method, device, equipment and computer readable storage medium
CN109977977A (en) A kind of method and corresponding intrument identifying potential user
CN107492036B (en) Insurance policy escrow system
KR20130126012A (en) Method and apparatusfor providing report of business intelligence
CN106779909A (en) Material matching process and device
CN111782684B (en) Distribution network electronic handover information matching method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: 100080 No. 401, 4th Floor, Haitai Building, 229 North Fourth Ring Road, Haidian District, Beijing

Applicant after: Beijing Guoshuang Technology Co.,Ltd.

Address before: 100086 Cuigong Hotel, 76 Zhichun Road, Shuangyushu District, Haidian District, Beijing

Applicant before: Beijing Guoshuang Technology Co.,Ltd.

CB02 Change of applicant information
GR01 Patent grant
GR01 Patent grant