CN104867019A - Electronic commerce evaluation identification system based on ID similarity identification - Google Patents

Electronic commerce evaluation identification system based on ID similarity identification Download PDF

Info

Publication number
CN104867019A
CN104867019A CN201510250996.6A CN201510250996A CN104867019A CN 104867019 A CN104867019 A CN 104867019A CN 201510250996 A CN201510250996 A CN 201510250996A CN 104867019 A CN104867019 A CN 104867019A
Authority
CN
China
Prior art keywords
evaluation
similarity
judge module
identification
false
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510250996.6A
Other languages
Chinese (zh)
Inventor
吴雨浓
何宏靖
刘世林
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chengdu Business Big Data Technology Co Ltd
Original Assignee
Chengdu Business Big Data Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chengdu Business Big Data Technology Co Ltd filed Critical Chengdu Business Big Data Technology Co Ltd
Priority to CN201510250996.6A priority Critical patent/CN104867019A/en
Publication of CN104867019A publication Critical patent/CN104867019A/en
Pending legal-status Critical Current

Links

Abstract

The invention relates to the internet field, and especially relates to an electronic commerce evaluation identification system based on ID similarity identification. The system comprises a client, a network connection device, an ID similarity judgment module, an ID cheat factor judgment module and a false evaluation marking module. The client acquires relevant evaluation data information of a target commodity through the network connection device, and outputs the information to the ID similarity judgment module, the ID cheat factor judgment module and the false evaluation marking module which are sequentially connected. Based on an ID similarity analysis of target commodity evaluations, the system determines the cheat possibility of IDs giving the same or similar evaluations through the ID cheat factor judgment module; if the frequency of the IDs giving the evaluations is obviously higher than the normal frequency, the system determines that the IDs are the false evaluation IDs; and the system timely marks the false evaluation IDs and the corresponding evaluations through the false evaluation marking module. The system can automatically indentify the false evaluations from the target commodity evaluations.

Description

Electric business based on the identification of ID similarity evaluates identification system
Technical field
The present invention relates to internet arena, the electric business particularly based on the identification of ID similarity evaluates identification system.
Background technology
In the present age, along with popularizing of internet, ecommerce has become a kind of commerce and trade mode be widely used.Both parties mainly carry out transaction by the webpage of electric business or software.Because ecommerce does not have traditional entity StoreFront, not high to the quantitative requirement of sales force yet, so compare conventional transaction pattern more can control operation cost, thus there is larger price advantage.But, have a lot of illegal businessman to improve the sales volume of oneself thus employing specialty brush to evaluate team also to manufacture a large amount of false evaluation and carry out false publication to the commodity of oneself, thus deception consumer improves the true sales volume of oneself.
The development of current ecommerce is swift and violent, the scale of construction is huge, Seller Number in electricity quotient ring border is numerous, user is difficult to when carrying out purchase decision the authenticity judging descriptive labelling, the dependency degree evaluated commodity is very high, and the situation of buyer's interests loss that the situation of the performance favorable comment degree virtual height of commodity caused because seller evaluates cheating causes is serious.Under these circumstances, how the evaluation cheating of businessman in ecommerce identified and judge into problem demanding prompt solution in e-commerce development process; Judge the accuracy how improving judgement in false evaluation process, avoid the generation of erroneous judgement situation to be also very important considerations; The judgement that relevant device accurately and effectively realizes being correlated with also is lacked in currently available technology.
Summary of the invention
In order to solve problems of the prior art, the electric business that the invention provides based on the identification of ID similarity evaluates identification system, is identified the identical and similar evaluation ID in end article evaluating data by ID similarity judge module; And by ID cheating factor judge module on the basis judging identical and similar ID, identify the possibility that these identical or similar evaluation ID are cheating ID, and then judge a large amount of false evaluation given by professional brush evaluation personnel; And this electric business based on the identification of ID similarity is evaluated identification system and also the false ID judged by ID cheating factor judge module and corresponding evaluation content is marked by false evaluation mark module, achieve the automatic identification of false evaluation in end article evaluation like this, for electric business environment administrator and commodity consumption person provide simple and reliable evaluation identification instrument.
In order to realize foregoing invention object, the invention provides following technical scheme:
Electric business based on the identification of ID similarity evaluates identification system; Comprise client computer, network connection device, ID similarity judge module, ID cheating factor judge module and false evaluation mark module; The relevant evaluation data message that wherein said client computer one end obtains end article by network connection device (can get the relevant information in target web at present very easily by crawler technology, the speed extracted is fast, the total amount can analyzing data is huge, to extract the analytical approach of data ripe, with low cost); The other end of described client computer is connected with the input end of described ID similarity judge module, and the practise fraud input end of factor judge module of output terminal and the described ID of described ID similarity judge module is connected.The end article evaluation information got outputs in described ID similarity judge module by described client computer, whether by text similarity, described ID similarity judge module judges that these evaluate ID identical or similar, and will judge that result (identical or similar ID) is input to ID and practises fraud in factor judge module; If these ID send the frequency of evaluation higher than threshold value, these ID are then judged as false evaluation ID by described ID factor judge module of practising fraud.
If businessman wants by wash sale and evaluates the sales volume and the favorable comment situation that improve system display of commodity at present, the quantity of required false evaluation is comparatively large, under these circumstances; Occupation brush evaluation team manually or can utilize automatic register machine, and to register a lot of trumpet, (so-called trumpet refers to, same person registration and different No. ID of using), the small size ID that these vocational evaluation team register and use has certain regularity; Generally vocational evaluation teacher register a series of No. ID also according to system recommendation or automatically generate, such mode No. ID of producing can have larger relevance and similarity, such as ABC1, ABC2, ABC3, ABC4, ABC5.....ABCn.By relatively just judging that whether the evaluation ID corresponding to identical or similar evaluation content is identical or similar to the text similarity evaluating ID; If ID is identical or similar, so these ID are that the possibility of false ID is very high.
In order to improve the accuracy that false evaluation judges further, make the result of judgement more strict, judged result is input to described ID and practises fraud in factor judge module by described ID similarity judge module; Described ID practises fraud factor judge module on the basis of the identical or similar ID judged, analyze frequency and time that corresponding ID sends evaluation, the average ratings frequency of the frequency and end article evaluation that corresponding ID are sent evaluation compares, if its ratio is higher than the threshold value of setting, then these are evaluated ID and be judged as false evaluation ID, by native system judge that the process of false evaluation is strict, judged result accuracy is high.
Preferred as one, described ID similarity judge module is that similar evaluation ID judges server; Described ID cheating factor judge module is that the cheating factor judges server.Described similar evaluation content judges server, similar evaluation ID judges server and the cheating factor judges that server is connected successively by data connecting line.Server is exhibits excellent in processing power, stability, reliability, security, extensibility, manageability etc., relevant content similarities is completed by server, the correlated judgment of ID similarity, can the related data of a large amount of electric business's end article of fast processing, processing speed is fast, and efficiency is high.
Further, described ID cheating factor judge module is also connected with false evaluation mark module by data connecting line.Described false evaluation mark module is false evaluation mark server, and the false evaluation judged is marked according to the Output rusults of described ID cheating factor judge module by described false evaluation mark module.The present invention carries out scientific analysis to the authenticity of the evaluation of end article and reasonably judges, the false evaluation identified in end article evaluation is (high to the discriminating accuracy rate of a large amount of false evaluation given by occupation brush evaluation team, there is stronger specific aim), and by the mark to false evaluation, intuitively the non-honest behavior that the evaluation of electric business is practised fraud is shown in face of commodity buyer and electric business supvr; Be conducive to the purification of e-commerce environment, maintain the rational interests of commodity purchaser and sincere seller, improve the confidence level of businessman's prestige; Contribute to the sound development of electric firm industry.
Compared with prior art, beneficial effect of the present invention: the electric business that the invention provides based on the identification of ID similarity evaluates identification system.By the network address of client access end article, crawl the evaluating data of corresponding goods webpage; And by server, the evaluating data crawled is judged, evaluation content in assay data, described similar ID judges server, evaluation ID is analyzed, counted the quantity of identical ID by text similarity algorithm, and judge the likelihood probability of other ID, evaluation ID similar threshold value likelihood probability and machine learning drawn compares, determine similar evaluation ID, and add up the judged result of similar ID; On the basis judging identical and similar ID, by described ID cheating factor judge module, judged by the frequency and time Target id being sent to evaluation, determine that these identical or similar ID are the possibility of false ID; Eventually through false evaluation mark module, the false evaluation related content judged and ID are marked, electric business's customer evaluation identification system of the present invention, accurately, the false evaluation of end article is comprehensively analyzed, similar ID identification has targetedly been carried out to the trumpet of vocational evaluation teacher registration, the identification capability of vocational evaluation teacher evaluation cheating serious is like this engaged to significantly improve to end article, contribute to the confidence level improving electric quotient ring border, be conducive to the formation of normal management and control order.Buyer is helped to evade the transaction risk brought because seller evaluates cheating.
Accompanying drawing illustrates:
Fig. 1 is that this electric business based on the identification of ID similarity evaluates identification system annexation figure.
Fig. 2 is the preferred annexation figure that this electric business based on the identification of ID similarity evaluates identification system.
Embodiment
Below in conjunction with test example and embodiment, the present invention is described in further detail.But this should be interpreted as that the scope of the above-mentioned theme of the present invention is only limitted to following embodiment, all technology realized based on content of the present invention all belong to scope of the present invention.
The electric business that the invention provides based on the identification of ID similarity evaluates identification system, is identified the identical and similar evaluation ID in end article evaluating data by ID similarity judge module; And by ID cheating factor judge module on the basis judging identical and similar ID, identify the possibility that these identical or similar evaluation ID are cheating ID, and then judge a large amount of false evaluation given by professional brush evaluation personnel; And this electric business based on the identification of ID similarity is evaluated identification system and also the false ID judged by ID cheating factor judge module and corresponding evaluation content is marked by false evaluation mark module, achieve the automatic identification of false evaluation in end article evaluation like this, for electric business environment administrator and commodity consumption person provide simple and reliable evaluation identification instrument.
In order to realize foregoing invention object, the invention provides following technical scheme:
Electric business based on the identification of ID similarity evaluates identification system, comprises client computer, network connection device, ID similarity judge module, (evaluating ID) ID cheating factor judge module and false evaluation mark module as shown in Figure 1; The relevant evaluation data message that wherein said client computer one end obtains end article by network connection device (can get the relevant information in target web at present very easily by crawler technology, the speed extracted is fast, the total amount can analyzing data is huge, to extract the analytical approach of data ripe, with low cost); The other end of described client computer is connected with the input end of described ID similarity judge module, and the practise fraud input end of factor judge module of output terminal and the described ID of described ID similarity judge module is connected.The end article evaluation information got outputs in described ID similarity judge module by described client computer, whether by text similarity, described ID similarity judge module judges that these evaluate ID identical or similar, and will judge that result (identical and similar ID) is input to ID and practises fraud in factor judge module; If these ID send the frequency of evaluation higher than threshold value, these ID are then judged as false evaluation ID by described ID factor judge module of practising fraud.
If businessman wants by wash sale and evaluates the sales volume and the favorable comment situation that improve system display of commodity at present, the quantity of required false evaluation is comparatively large, under these circumstances; Occupation brush evaluation team manually or can utilize automatic register machine, and to register a lot of trumpet, (so-called trumpet refers to, same person registration and different No. ID of using), the small size ID that these vocational evaluation team register and use has certain regularity; Generally vocational evaluation teacher register a series of No. ID also according to system recommendation or automatically generate, such mode No. ID of producing can have larger relevance and similarity, such as ABC1, ABC2, ABC3, ABC4, ABC5.....ABCn.By comparing the text similarity evaluating ID, (method that current text similarity judges is ripe, such as adopt cosine similarity to judge, and detailed process does not repeat them here) just can judge that whether the evaluation ID corresponding to identical or similar evaluation content is identical or similar; If ID is identical or similar, so these ID are that the possibility of false ID is very high.In order to improve the accuracy that false evaluation judges further, make the result of judgement more strict, judged result is input to described ID and practises fraud in factor judge module by described ID similarity judge module; Described ID practises fraud factor judge module on the basis of the identical or similar ID judged, analyze frequency and time that corresponding ID sends evaluation, the average ratings frequency of the frequency and end article evaluation that corresponding ID are sent evaluation compares, if its ratio is higher than the threshold value of setting, then these are evaluated ID and be judged as false evaluation ID, about the cheating factor, make following definition, the cheating factor is a value between [0 ~ ∞], be worth larger, represent that the possibility of cheating is higher, on the contrary lower.Detailed computing method are as follows: in the average ratings time interval calculating i-th ID, computing formula is as follows:
t i ‾ = t n - t 1 n - 1
Wherein t nsend out the time point evaluated, t n-th time 1send out the time point evaluated the 1st time; Calculate the overall average evaluation intervals of all ID of these commodity, computing formula is as follows:
t ‾ = Σ i = 1 N t i ‾ N = Σ i = 1 N t ni - t 1 n i - 1 N
Calculate the cheating factor, computing formula is as follows:
η = t ‾ t i ‾
Wherein η is the cheating factor; The predicting relation of cheating ID is: (namely during η>=2), (the evaluation time frequency of false ID is 2 times of the evaluation frequency of all evaluations of end article, and wherein the factor of 2 times is through experimental verification, is one and more preferably selects; Namely the interval sending out comment as this ID is less than equispaced time, namely think that this ID is the ID providing false evaluation).By native system judge that the process of false evaluation is strict, judged result accuracy is high.
Preferred as one, as shown in Figure 2, described ID similarity judge module is that similar evaluation ID judges server; Described ID cheating factor judge module is that the cheating factor judges server.Described similar evaluation content judges server, similar evaluation ID judges server and the cheating factor judges that server is connected successively by data connecting line.Server is exhibits excellent in processing power, stability, reliability, security, extensibility, manageability etc., relevant content similarities is completed by server, the correlated judgment of ID similarity, can the related data of a large amount of electric business's end article of fast processing, processing speed is fast, and efficiency is high.
Further, described ID cheating factor judge module is also connected with false evaluation mark module by data connecting line.Described false evaluation mark module is false evaluation mark server, and the false evaluation judged is marked according to the Output rusults of described ID cheating factor judge module by described false evaluation mark module.The present invention carries out scientific analysis to the authenticity of the evaluation of end article and reasonably judges, the false evaluation identified in end article evaluation is (high to the discriminating accuracy rate of a large amount of false evaluation given by occupation brush evaluation team, there is stronger specific aim), and by the mark to false evaluation, intuitively the non-honest behavior that the evaluation of electric business is practised fraud is shown in face of commodity buyer and electric business supvr; Be conducive to the purification of e-commerce environment, maintain the rational interests of commodity purchaser and sincere seller, improve the confidence level of businessman's prestige; Contribute to the sound development of electric firm industry.

Claims (6)

1. the electric business based on the identification of ID similarity evaluates identification system, it is characterized in that, comprises client computer, network connection device, ID similarity judge module, ID cheating factor judge module and false evaluation mark module; Wherein said client computer one end obtains the relevant evaluation data message of end article by network connection device, the other end of described client computer is connected with the input end of described ID similarity judge module, the practise fraud input end of factor judge module of output terminal and the ID of described ID similarity judge module is connected, and the practise fraud output terminal of factor judge module of described ID is connected with the input end of described false evaluation judge module.
2. evaluate identification system based on the electric business of ID similarity identification as claimed in claim 1, it is characterized in that, the end article evaluation information got outputs in ID similarity judge module by described client computer; Through text similarity identification, described ID similarity judge module judges that whether these ID are identical or similar, and judged result be input in ID cheating factor judge module; If wait to judge ID send the frequency of evaluation higher than threshold value, then these ID are judged as false evaluation ID by described ID factor judge module of practising fraud.
3. evaluate identification system based on the electric business of ID similarity identification as claimed in claim 2, it is characterized in that, described ID similarity judge module is that similar evaluation ID judges server; Described ID cheating factor judge module is that the cheating factor judges server; Described false evaluation mark module is false evaluation mark server.
4. evaluate identification system based on the electric business of ID similarity identification as claimed in claim 3, it is characterized in that, similar evaluation ID judges server, the cheating factor judges server and false evaluation mark server is connected successively by data connecting line.
5. evaluate identification system based on the electric business of ID similarity identification as claimed in claim 4, it is characterized in that, the false evaluation judged is marked according to the Output rusults of described ID cheating factor judge module by described false evaluation mark module.
6. evaluate identification system based on the electric business of ID similarity identification as claimed in claim 5, it is characterized in that, described ID similarity judge module, by carrying out text identification to the evaluation ID in evaluating data, counting identical and similar evaluation ID respectively.
CN201510250996.6A 2015-05-16 2015-05-16 Electronic commerce evaluation identification system based on ID similarity identification Pending CN104867019A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510250996.6A CN104867019A (en) 2015-05-16 2015-05-16 Electronic commerce evaluation identification system based on ID similarity identification

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510250996.6A CN104867019A (en) 2015-05-16 2015-05-16 Electronic commerce evaluation identification system based on ID similarity identification

Publications (1)

Publication Number Publication Date
CN104867019A true CN104867019A (en) 2015-08-26

Family

ID=53912837

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510250996.6A Pending CN104867019A (en) 2015-05-16 2015-05-16 Electronic commerce evaluation identification system based on ID similarity identification

Country Status (1)

Country Link
CN (1) CN104867019A (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103020482A (en) * 2013-01-05 2013-04-03 南京邮电大学 Relation-based spam comment detection method
CN103778186A (en) * 2013-12-31 2014-05-07 南京财经大学 Method for detecting sockpuppet
CN103984673A (en) * 2013-02-11 2014-08-13 谷歌股份有限公司 Automatic detection of fraudulent ratings/comments related to an application store

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103020482A (en) * 2013-01-05 2013-04-03 南京邮电大学 Relation-based spam comment detection method
CN103984673A (en) * 2013-02-11 2014-08-13 谷歌股份有限公司 Automatic detection of fraudulent ratings/comments related to an application store
CN103778186A (en) * 2013-12-31 2014-05-07 南京财经大学 Method for detecting sockpuppet

Similar Documents

Publication Publication Date Title
CN104867017A (en) Electronic commerce client false evaluation identification system
CN104881795A (en) E-commerce false comment judging and recognizing method
CN104881796A (en) False comment judgment system based on comment content and ID recognition
TWI706422B (en) Risk control method, device, server and storage medium
Algur et al. Conceptual level similarity measure based review spam detection
TW201812689A (en) System, method, and device for identifying malicious address/malicious purchase order
WO2019061994A1 (en) Electronic device, insurance product recommendation method and system, and computer readable storage medium
US20170140464A1 (en) Method and apparatus for evaluating relevance of keyword to asset price
US20170053213A1 (en) Method and system for filtering goods evaluation information
CN105335496A (en) Customer service repeated call treatment method based on cosine similarity text mining algorithm
CN103678659A (en) E-commerce website cheat user identification method and system based on random forest algorithm
CN108764705A (en) A kind of data quality accessment platform and method
CN103544436A (en) System and method for distinguishing phishing websites
Chauhan et al. Research on product review analysis and spam review detection
CN104867032A (en) Electronic commerce client evaluation identification system
CN115391669B (en) Intelligent recommendation method and device and electronic equipment
WO2019072098A1 (en) Method and system for identifying core product terms
CN112258303A (en) Surrounding string mark early warning analysis method and device, electronic equipment and storage medium
CN104867018A (en) Electronic commerce evaluation judgment system based on evaluation content and ID similarity identification
CN103294686B (en) A kind of webpage cheating user, the recognition methods of cheating webpages and system
CN106920124A (en) A kind of Data acquisition and issuance method and device
CN113672787B (en) Stock market trading behavior monitoring method and device and storage medium
KR20210029006A (en) Product Evolution Mining Method And Apparatus Thereof
CN104867020A (en) False evaluation ID judgment and identification system
CN104867033A (en) Electronic commerce client evaluation judging and marking system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20150826

WD01 Invention patent application deemed withdrawn after publication