CN112800291B - Similar account determination method and device, electronic equipment and storage medium - Google Patents

Similar account determination method and device, electronic equipment and storage medium Download PDF

Info

Publication number
CN112800291B
CN112800291B CN202110406247.3A CN202110406247A CN112800291B CN 112800291 B CN112800291 B CN 112800291B CN 202110406247 A CN202110406247 A CN 202110406247A CN 112800291 B CN112800291 B CN 112800291B
Authority
CN
China
Prior art keywords
accounts
distance
determining
character string
account
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110406247.3A
Other languages
Chinese (zh)
Other versions
CN112800291A (en
Inventor
陈程
王贺
李涵
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuhan Zhuoer Digital Media Technology Co ltd
Original Assignee
Wuhan Zhuoer Digital Media Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuhan Zhuoer Digital Media Technology Co ltd filed Critical Wuhan Zhuoer Digital Media Technology Co ltd
Priority to CN202110406247.3A priority Critical patent/CN112800291B/en
Publication of CN112800291A publication Critical patent/CN112800291A/en
Application granted granted Critical
Publication of CN112800291B publication Critical patent/CN112800291B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/903Querying
    • G06F16/90335Query processing
    • G06F16/90344Query processing by using string matching techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
    • G06Q50/01Social networking

Abstract

The embodiment of the invention is applicable to the technical field of computers, and provides a similar account determination method, a similar account determination device, electronic equipment and a storage medium, wherein the similar account determination method comprises the following steps: determining a character string corresponding to each of at least two first accounts; each character in the character string corresponds to an access object; the frequency of the access object accessed by the first account is greater than a first set value; determining a first distance between each two of the at least two first accounts; the first distance represents the editing distance of the character strings corresponding to the two first accounts; based on a first distance between each two of the at least two first accounts, determining a similar account corresponding to each of the at least two first accounts.

Description

Similar account determination method and device, electronic equipment and storage medium
Technical Field
The invention belongs to the technical field of computers, and particularly relates to a method and a device for determining a similar account, electronic equipment and a storage medium.
Background
Pushing various information, products or services to the account based on the similarity between the accounts is a common internet marketing behavior in network applications. The related art divides accounts with similar attributes into similar accounts by acquiring the attributes of the accounts. However, for the accounts of the social software, the determination of the similarity between the accounts based on the account attributes is not completely applicable, and the accuracy of the determined account similarity is not high.
Disclosure of Invention
In view of this, embodiments of the present invention provide a method, an apparatus, an electronic device, and a storage medium for determining a similar account, so as to at least solve the problem in the related art that the accuracy of the account similarity determined based on the account attribute is not high.
The technical scheme of the invention is realized as follows:
in a first aspect, an embodiment of the present invention provides a method for determining a similar account, where the method includes:
determining a character string corresponding to each of at least two first accounts; each character in the character string corresponds to an access object; the frequency of the access object accessed by the first account is greater than a first set value;
determining a first distance between each two of the at least two first accounts; the first distance represents the editing distance of the character strings corresponding to the two first accounts;
based on a first distance between each two of the at least two first accounts, determining a similar account corresponding to each of the at least two first accounts.
In the above solution, the access object at least satisfies any one of the following:
the ratio of the number of times of accessing the access object to the total number of times of accessing all the access objects in a set time period by the first account is greater than or equal to a second set value;
the ratio of the time length of the first account for accessing the access object in the set time period to the total time length of all the access objects is larger than or equal to a third set value.
In the above solution, the determining, based on a first distance between each two of the at least two first accounts, a similar account corresponding to each of the at least two first accounts includes:
determining similarity between the corresponding two first accounts based on the first distance;
and under the condition that the similarity is smaller than a fourth set value, determining that the two corresponding first accounts are similar accounts.
In the above scheme, the similarity represents a first ratio of a first distance between the two first accounts to a first length; the first length represents the length of the character string with the maximum character string length in the character strings corresponding to the two first accounts.
In the foregoing solution, the determining the similarity between the two corresponding first accounts based on the first distance includes:
determining the difference value of the character string lengths of the character strings corresponding to the two first accounts;
and determining the similarity between the two corresponding first accounts based on the first distance when the difference value is smaller than a fifth set value.
In the foregoing solution, the determining a first distance between each two of the at least two first accounts includes:
determining the minimum number of editing operations for converting the first character string into the second character string, wherein the minimum number of editing operations represents the first distance; the first character string and the second character string respectively represent character strings corresponding to any two first accounts in the at least two first accounts.
In the above solution, the access object at least includes any one of the following items:
social application software;
a social group.
In a second aspect, an embodiment of the present invention provides a similar account determination apparatus, where the apparatus includes:
the first determining module is used for determining a character string corresponding to each of at least two first accounts; each character in the character string corresponds to an access object; the frequency of the access object accessed by the first account is greater than a first set value;
a second determining module, configured to determine a first distance between each two of the at least two first accounts; the first distance represents the editing distance of the character strings corresponding to the two first accounts;
a third determining module, configured to determine, based on a first distance between each two of the at least two first accounts, a similar account corresponding to each of the at least two first accounts.
In a third aspect, an embodiment of the present invention provides an electronic device, including a processor and a memory, where the processor and the memory are connected to each other, where the memory is used to store a computer program, and the computer program includes program instructions, and the processor is configured to call the program instructions to execute the steps of the similar account determination method provided in the first aspect of the embodiment of the present invention.
In a fourth aspect, an embodiment of the present invention provides a computer-readable storage medium, including: the computer-readable storage medium stores a computer program. The computer program when executed by a processor performs the steps of the similar account determination method as provided in the first aspect of the embodiment of the present invention.
The method and the device for determining the distance between the first accounts determine a first distance between every two of the at least two first accounts by determining a character string corresponding to each of the at least two first accounts, and determine a similar account corresponding to each of the at least two first accounts based on the first distance between every two of the at least two first accounts. Each character in the character strings corresponds to an access object, the frequency of the access object accessed by the first account is greater than a first set value, and the first distance represents the editing distance of the character strings corresponding to the two first accounts. According to the embodiment of the invention, the similarity of the two accounts is determined through the editing distance, so that the accuracy of similar account identification can be improved, and the method and the device have a good effect on identifying the similarity of the accounts using the social software.
Drawings
Fig. 1 is a schematic flow chart illustrating an implementation of a similar account determination method according to an embodiment of the present invention;
fig. 2 is a schematic flow chart illustrating an implementation of another similar account determination method according to an embodiment of the present invention;
fig. 3 is a schematic flow chart illustrating an implementation of another similar account determination method according to an embodiment of the present invention;
FIG. 4 is a schematic diagram of a similar account determination process provided by an embodiment of the present invention;
FIG. 5 is a schematic diagram of a similar account determination apparatus provided by an embodiment of the present invention;
fig. 6 is a schematic diagram of an electronic device according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
In the related art, the similarity of accounts is mainly mined through account attributes, for example, accounts of the same age or the same gender are determined as similar accounts, the similarity determination method is too simple, the accuracy of the determined account similarity is low, and the method is not particularly suitable for similar accounts in social scenes.
In view of the above disadvantages of the related art, the embodiment of the present invention provides a method for determining a similar account, which can at least improve the accuracy of account similarity identification. In order to explain the technical means of the present invention, the following description will be given by way of specific examples.
Fig. 1 is a schematic view of an implementation flow of a similar account determination method according to an embodiment of the present invention, where an execution subject of the similar account determination method is an electronic device, and the electronic device includes a desktop computer, a notebook computer, a server, and the like. Referring to fig. 1, the similar account determination method includes:
s101, determining a character string corresponding to each of at least two first accounts; each character in the character string corresponds to an access object; the frequency of the access object accessed by the first account is greater than a first set value.
Here, the access object includes at least any one of:
social application software;
a social group.
When the access object is social application software, the social application software can be communication software such as WeChat, QQ and nail, and can also be shopping software such as Taobao and Jingdong, and the software has social functions and can interact with accounts.
The access object may also be a social group in the social application software, for example, the social group may be a WeChat group in WeChat, a QQ group in QQ, a staple group in staple, a shopping group in shopping software, and so forth.
The access target may be an electronic shop in shopping software such as Taobao and Jingdong.
It is understood that in one embodiment, with multiple access objects, all access objects can only be social applications or all access objects can only be social groups, and cannot include both social applications and social groups.
Here, the at least two first accounts refer to an account cluster for determining the similarity, for example, the at least two first accounts may be accounts in the same social group, and the at least two first accounts may also be usage accounts of the same social application software.
In an embodiment of the present invention, each character in the character string corresponds to an access object, for example, the character string abc, where a corresponds to WeChat, b corresponds to QQ, and c corresponds to nailing. The frequency of the access object corresponding to the character string being accessed by the first account is greater than the first set value, that is, the access object corresponding to each character in the character string is the access object with higher access frequency of the first account.
In one embodiment, the access object satisfies at least any one of:
the ratio of the number of times of accessing the access object to the total number of times of accessing all the access objects in a set time period by the first account is greater than or equal to a second set value;
the ratio of the time length of the first account for accessing the access object in the set time period to the total time length of all the access objects is larger than or equal to a third set value.
In practical application, it is assumed that the first account has a total of 4 access objects, namely, WeChat, QQ, Hosta and nail, the number of times of accessing the WeChat in a set time period by the first account is 10, the number of times of accessing the QQ is 10, the number of times of accessing the Hosta is 5, and the number of times of accessing the nail is 5. The ratio of the number of times of accessing the WeChat to the total number of times of accessing all the access objects is 10/30, and assuming that the second setting value is 0.2, the ratio of the number of times of accessing the WeChat to the total number of times of accessing all the access objects is greater than the second setting value.
Assuming that the time length of accessing the WeChat by the first account in the set time period is 4 hours, the time length of accessing the QQ is 4 hours, the time length known for accessing is 2 hours, the time length of accessing the nail is 2 hours, the ratio of the time length of accessing the WeChat to the total time length of accessing all the access objects is 4/12, and assuming that the third set value is 0.2, the ratio of the time length of accessing the WeChat to the total time length of accessing all the access objects is greater than the third set value.
Here, the access object corresponding to each character in the character string may only satisfy any one of the above conditions, or the access object corresponding to the character may only be written into the character string when both of the above conditions are satisfied. For example, in the above embodiment, the WeChat satisfies two conditions at the same time, which indicates that the WeChat is an access object with a higher access frequency of the first account, and writes the character corresponding to the WeChat into the character string corresponding to the first account.
And counting the character strings corresponding to each of the at least two first accounts according to the method, for example, the character string corresponding to account 1 is abc, and the character string corresponding to account 2 is abcd. Wherein. a corresponds to WeChat, b corresponds to QQ, c corresponds to Tama, and d corresponds to the nail. Alternatively, a corresponds to WeChat group 1, b corresponds to WeChat group 2, c corresponds to QQ group 3, and d corresponds to QQ group 4.
S102, determining a first distance between every two first accounts in the at least two first accounts; the first distance represents the editing distance of the character strings corresponding to the two first accounts.
Here, the first distance is an edit distance of the character strings corresponding to the two first accounts, the edit distance is a quantitative measure of a difference degree between the two character strings, and the measure is to determine how many times a process is required to change one character string into another character string. The processing here refers to single character editing operations, of which there are and only three: insertion (Insertion), Deletion (Deletion), and replacement (replacement).
In an embodiment, the determining the first distance between each two of the at least two first accounts comprises:
determining the minimum number of editing operations for converting the first character string into the second character string, wherein the minimum number of editing operations represents the first distance; the first character string and the second character string respectively represent character strings corresponding to any two first accounts in the at least two first accounts.
For example, if the first character string is abcd, the second character string is abe, and abcd is converted into abe, c in the first character string needs to be replaced by e, and d in the first character string needs to be deleted, so that the first character string is converted into the second character string, where the minimum number of editing operations is 2, that is, the first distance is 2.
S103, determining a similar account corresponding to each of the at least two first accounts based on a first distance between each of the at least two first accounts.
Here, whether the two first accounts are similar accounts may be determined by using a first distance between the two first accounts, for example, when the first distance between the two first accounts is smaller than a set value, it is determined that the two first accounts are similar accounts.
Referring to fig. 2, in an embodiment of the present invention, the determining, based on a first distance between each two of the at least two first accounts, a similar account corresponding to each of the at least two first accounts includes:
s201, determining the similarity between the two corresponding first accounts based on the first distance.
In an embodiment, the similarity characterizes a first ratio of a first distance to a first length between two first accounts; the first length represents the length of the character string with the maximum character string length in the character strings corresponding to the two first accounts.
Here, the first length refers to the length of a string having the largest string length among strings corresponding to two first accounts, for example, the first string is abcd, the second string is abe, where the first string length is 4, the second string length is 3, where the first string length is the longer string, and therefore the first length is 4.
The similarity calculation formula is as follows:
Figure 655543DEST_PATH_IMAGE001
wherein the content of the first and second substances,
Figure 480542DEST_PATH_IMAGE002
is account u andva first distance between the first and second electrodes,
Figure 805213DEST_PATH_IMAGE003
is a first length.
Referring to fig. 3, in an embodiment, the determining the similarity between the two corresponding first accounts based on the first distance includes:
s301, determining the difference value of the character string lengths of the character strings corresponding to the two first accounts.
And subtracting the length of the second character string from the length of the first character string, and then taking an absolute value to obtain the difference value of the character string lengths of the character strings corresponding to the two first accounts.
And S302, determining the similarity between the two corresponding first accounts based on the first distance under the condition that the difference value is smaller than a fifth set value.
The difference value is smaller than the fifth setting value, which indicates that the lengths of the character strings corresponding to the two first accounts are not greatly different, and the probability that the two first accounts are similar accounts is higher. And if the difference value is larger than the fifth set value, directly judging that the two first accounts are not similar accounts. According to the embodiment of the invention, a part of non-similar accounts can be eliminated in advance through the difference value of the character string lengths of the character strings corresponding to the two first accounts, so that the accounts are prevented from entering the next similarity calculation, and the time for determining the account similarity is saved.
And S202, determining that the two corresponding first accounts are similar accounts under the condition that the similarity is smaller than a fourth set value.
In the embodiment of the invention, the similarityUserSimThe smaller the value, the more similar the account is, inUserSimAnd if the second account is smaller than the fourth set value, the two first accounts are similar accounts.
Based on the above embodiment, a similar account for each of the at least two first accounts may be calculated, each first account may obtain a similar account cluster, and accounts in the similar account cluster are all similar accounts of the first account. When information, products or services need to be pushed to the accounts based on the similarity between the accounts, the related information can be directly pushed to the accounts in the similar account cluster.
The method and the device for determining the distance between the first accounts determine a first distance between every two of the at least two first accounts by determining a character string corresponding to each of the at least two first accounts, and determine a similar account corresponding to each of the at least two first accounts based on the first distance between every two of the at least two first accounts. Each character in the character strings corresponds to an access object, the frequency of the access object accessed by the first account is greater than a first set value, and the first distance represents the editing distance of the character strings corresponding to the two first accounts. According to the method and the device for identifying the accounts, the similarity of the accounts is determined through the editing distance, the accuracy of similar account identification can be improved, and particularly, the method and the device have a good effect on identifying the similarity of the accounts using social software.
Referring to fig. 4, fig. 4 is a schematic diagram of a similar account determination process provided in an application embodiment of the present invention, where the similar account determination process includes:
the database stores access data of each first account to the access object, the access data comprise the times and duration of the first account accessing the access object, a character string corresponding to each first account is determined according to the access data, each character in the character string corresponds to one access object, and the frequency of the access object corresponding to the character accessed by the first account is greater than a first set value. Specifically, the ratio of the number of times that the first account accesses the access object within the set time period to the total number of times that all the access objects are accessed is greater than or equal to a second set value, and the ratio of the duration that the first account accesses the access object within the set time period to the total duration that all the access objects are accessed is greater than or equal to a third set value. And then, calculating the editing distance between every two first accounts according to the character strings, and determining a similar account corresponding to each first account according to the editing distance. Here, the similarity between the two first accounts represents a first ratio of the edit distance between the two first accounts to a first length, and the first length represents a length of a character string having a maximum character string length among character strings corresponding to the two first accounts. According to the application embodiment of the invention, the similarity between the two first accounts is determined through the editing distance, the time dimension information of the access object accessed by the account is integrated, and the accuracy of determining the similarity of the account is effectively improved.
It should be understood that, the sequence numbers of the steps in the foregoing embodiments do not imply an execution sequence, and the execution sequence of each process should be determined by its function and inherent logic, and should not constitute any limitation to the implementation process of the embodiments of the present invention.
It will be understood that the terms "comprises" and/or "comprising," when used in this specification and the appended claims, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof.
The technical means described in the embodiments of the present invention may be arbitrarily combined without conflict.
In addition, in the embodiments of the present invention, "first", "second", and the like are used for distinguishing similar objects, and are not necessarily used for describing a specific order or a sequential order.
Referring to fig. 5, fig. 5 is a schematic diagram of a similar account determining apparatus according to an embodiment of the present invention, as shown in fig. 5, the apparatus includes: a first determination module, a second determination module, and a third determination module.
The first determining module is used for determining a character string corresponding to each of at least two first accounts; each character in the character string corresponds to an access object; the frequency of the access object accessed by the first account is greater than a first set value;
a second determining module, configured to determine a first distance between each two of the at least two first accounts; the first distance represents the editing distance of the character strings corresponding to the two first accounts;
a third determining module, configured to determine, based on a first distance between each two of the at least two first accounts, a similar account corresponding to each of the at least two first accounts.
The access object satisfies at least any one of the following:
the ratio of the number of times of accessing the access object to the total number of times of accessing all the access objects in a set time period by the first account is greater than or equal to a second set value;
the ratio of the time length of the first account for accessing the access object in the set time period to the total time length of all the access objects is larger than or equal to a third set value.
The third determining module is specifically configured to:
determining similarity between the corresponding two first accounts based on the first distance;
and under the condition that the similarity is smaller than a fourth set value, determining that the two corresponding first accounts are similar accounts.
The similarity characterizes a first ratio of a first distance to a first length between the two first accounts; the first length represents the length of the character string with the maximum character string length in the character strings corresponding to the two first accounts.
The third determining module is specifically configured to:
determining the difference value of the character string lengths of the character strings corresponding to the two first accounts;
and determining the similarity between the two corresponding first accounts based on the first distance when the difference value is smaller than a fifth set value.
The second determining module is specifically configured to:
determining the minimum number of editing operations for converting the first character string into the second character string, wherein the minimum number of editing operations represents the first distance; the first character string and the second character string respectively represent character strings corresponding to any two first accounts in the at least two first accounts.
The access object comprises at least any one of the following:
social application software;
a social group.
In practical applications, the first determining module, the second determining module, and the third determining module may be implemented by a Processor in an electronic device, such as a Central Processing Unit (CPU), a Digital Signal Processor (DSP), a Micro Control Unit (MCU), or a Programmable Gate Array (FPGA).
It should be noted that: in the similar account determination apparatus provided in the above embodiment, when performing similar account determination, only the division of the above modules is exemplified, and in practical applications, the above processing may be distributed to different modules according to needs, that is, the internal structure of the apparatus may be divided into different modules to complete all or part of the above-described processing. In addition, the similar account determining apparatus provided in the above embodiments and the similar account determining method embodiments belong to the same concept, and specific implementation processes thereof are detailed in the method embodiments and are not described herein again.
Based on the hardware implementation of the program module, in order to implement the method of the embodiment of the present application, an embodiment of the present application further provides an electronic device. Fig. 6 is a schematic diagram of a hardware structure of an electronic device according to an embodiment of the present application, and as shown in fig. 6, the electronic device includes:
the communication interface can carry out information interaction with other equipment such as network equipment and the like;
and the processor is connected with the communication interface to realize information interaction with other equipment, and is used for executing the method provided by one or more technical schemes on the electronic equipment side when running a computer program. And the computer program is stored on the memory.
Of course, in practice, the various components in an electronic device are coupled together by a bus system. It will be appreciated that a bus system is used to enable communications among the components. The bus system includes a power bus, a control bus, and a status signal bus in addition to a data bus. But for the sake of clarity the various buses are labeled as a bus system in figure 6.
The memory in the embodiments of the present application is used to store various types of data to support the operation of the electronic device. Examples of such data include: any computer program for operating on an electronic device.
It will be appreciated that the memory can be either volatile memory or nonvolatile memory, and can include both volatile and nonvolatile memory. Among them, the nonvolatile Memory may be a Read Only Memory (ROM), a Programmable Read Only Memory (PROM), an Erasable Programmable Read-Only Memory (EPROM), an Electrically Erasable Programmable Read-Only Memory (EEPROM), a magnetic random access Memory (FRAM), a Flash Memory (Flash Memory), a magnetic surface Memory, an optical disk, or a Compact Disc Read-Only Memory (CD-ROM); the magnetic surface storage may be disk storage or tape storage. Volatile Memory can be Random Access Memory (RAM), which acts as external cache Memory. By way of illustration and not limitation, many forms of RAM are available, such as Static Random Access Memory (SRAM), Synchronous Static Random Access Memory (SSRAM), Dynamic Random Access Memory (DRAM), Synchronous Dynamic Random Access Memory (SDRAM), Double Data Rate Synchronous Dynamic Random Access Memory (DDRSDRAM), Enhanced Synchronous Dynamic Random Access Memory (ESDRAM), Enhanced Synchronous Dynamic Random Access Memory (Enhanced DRAM), Synchronous Dynamic Random Access Memory (SLDRAM), Direct Memory (DRmb Access), and Random Access Memory (DRAM). The memories described in the embodiments of the present application are intended to comprise, without being limited to, these and any other suitable types of memory.
The method disclosed in the embodiments of the present application may be applied to a processor, or may be implemented by a processor. The processor may be an integrated circuit chip having signal processing capabilities. In implementation, the steps of the above method may be performed by integrated logic circuits of hardware in a processor or instructions in the form of software. The processor described above may be a general purpose processor, a DSP, or other programmable logic device, discrete gate or transistor logic device, discrete hardware components, or the like. The processor may implement or perform the methods, steps, and logic blocks disclosed in the embodiments of the present application. A general purpose processor may be a microprocessor or any conventional processor or the like. The steps of the method disclosed in the embodiments of the present application may be directly implemented by a hardware decoding processor, or implemented by a combination of hardware and software modules in the decoding processor. The software modules may be located in a storage medium located in a memory where a processor reads the programs in the memory and in combination with its hardware performs the steps of the method as previously described.
Optionally, when the processor executes the program, the corresponding process implemented by the electronic device in each method of the embodiment of the present application is implemented, and for brevity, no further description is given here.
In an exemplary embodiment, the present application further provides a storage medium, specifically a computer storage medium, for example, a first memory storing a computer program, where the computer program is executable by a processor of an electronic device to perform the steps of the foregoing method. The computer readable storage medium may be Memory such as FRAM, ROM, PROM, EPROM, EEPROM, Flash Memory, magnetic surface Memory, optical disk, or CD-ROM.
In the several embodiments provided in the present application, it should be understood that the disclosed apparatus, electronic device and method may be implemented in other ways. The above-described device embodiments are merely illustrative, for example, the division of the unit is only a logical functional division, and there may be other division ways in actual implementation, such as: multiple units or components may be combined, or may be integrated into another system, or some features may be omitted, or not implemented. In addition, the coupling, direct coupling or communication connection between the components shown or discussed may be through some interfaces, and the indirect coupling or communication connection between the devices or units may be electrical, mechanical or other forms.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed on a plurality of network units; some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, all functional units in the embodiments of the present application may be integrated into one processing unit, or each unit may be separately regarded as one unit, or two or more units may be integrated into one unit; the integrated unit can be realized in a form of hardware, or in a form of hardware plus a software functional unit.
Those of ordinary skill in the art will understand that: all or part of the steps for implementing the method embodiments may be implemented by hardware related to program instructions, and the program may be stored in a computer readable storage medium, and when executed, the program performs the steps including the method embodiments; and the aforementioned storage medium includes: a removable storage device, a ROM, a RAM, a magnetic or optical disk, or various other media that can store program code.
Alternatively, the integrated units described above in the present application may be stored in a computer-readable storage medium if they are implemented in the form of software functional modules and sold or used as independent products. Based on such understanding, the technical solutions of the embodiments of the present application may be essentially implemented or portions thereof contributing to the prior art may be embodied in the form of a software product stored in a storage medium, and including several instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the methods described in the embodiments of the present application. And the aforementioned storage medium includes: a removable storage device, a ROM, a RAM, a magnetic or optical disk, or various other media that can store program code.
The technical means described in the embodiments of the present application may be arbitrarily combined without conflict.
In addition, in the examples of the present application, "first", "second", and the like are used for distinguishing similar objects, and are not necessarily used for describing a specific order or a sequential order.
The above description is only for the specific embodiments of the present application, but the scope of the present application is not limited thereto, and any person skilled in the art can easily conceive of the changes or substitutions within the technical scope of the present application, and shall be covered by the scope of the present application. Therefore, the protection scope of the present application shall be subject to the protection scope of the claims.

Claims (8)

1. A method for similar account determination, the method comprising:
determining a character string corresponding to each of at least two first accounts; each character in the character string corresponds to an access object; the frequency of the access object accessed by the first account is greater than a first set value;
determining a first distance between each two of the at least two first accounts; the first distance represents the editing distance of the character strings corresponding to the two first accounts;
determining similarity between the corresponding two first accounts based on the first distance; the similarity characterizes a first ratio of a first distance to a first length between the two first accounts; the first length represents the length of the character string with the maximum character string length in the character strings corresponding to the two first accounts;
and under the condition that the similarity is smaller than a fourth set value, determining that the two corresponding first accounts are similar accounts.
2. The method of claim 1, wherein the access object satisfies at least any one of:
the ratio of the number of times of accessing the access object to the total number of times of accessing all the access objects in a set time period by the first account is greater than or equal to a second set value;
the ratio of the time length of the first account for accessing the access object in the set time period to the total time length of all the access objects is larger than or equal to a third set value.
3. The method of claim 1, wherein determining the similarity between the corresponding two first accounts based on the first distance comprises:
determining the difference value of the character string lengths of the character strings corresponding to the two first accounts;
and determining the similarity between the two corresponding first accounts based on the first distance when the difference value is smaller than a fifth set value.
4. The method of claim 1, wherein the determining a first distance between each of the at least two first accounts comprises:
determining the minimum number of editing operations for converting the first character string into the second character string, wherein the minimum number of editing operations represents the first distance; the first character string and the second character string respectively represent character strings corresponding to any two first accounts in the at least two first accounts.
5. The method of claim 1, wherein the access object comprises at least any one of:
social application software;
a social group.
6. A similar account determination apparatus, comprising:
the first determining module is used for determining a character string corresponding to each of at least two first accounts; each character in the character string corresponds to an access object; the frequency of the access object accessed by the first account is greater than a first set value;
a second determining module, configured to determine a first distance between each two of the at least two first accounts; the first distance represents the editing distance of the character strings corresponding to the two first accounts;
a third determining module, configured to determine a similarity between the two corresponding first accounts based on the first distance; the similarity characterizes a first ratio of a first distance to a first length between the two first accounts; the first length represents the length of the character string with the maximum character string length in the character strings corresponding to the two first accounts;
and under the condition that the similarity is smaller than a fourth set value, determining that the two corresponding first accounts are similar accounts.
7. An electronic device comprising a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the processor implements the similar account determination method according to any one of claims 1 to 5 when executing the computer program.
8. A computer-readable storage medium, characterized in that the computer-readable storage medium stores a computer program comprising program instructions that, when executed by a processor, cause the processor to perform the similar account determination method according to any one of claims 1 to 5.
CN202110406247.3A 2021-04-15 2021-04-15 Similar account determination method and device, electronic equipment and storage medium Active CN112800291B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110406247.3A CN112800291B (en) 2021-04-15 2021-04-15 Similar account determination method and device, electronic equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110406247.3A CN112800291B (en) 2021-04-15 2021-04-15 Similar account determination method and device, electronic equipment and storage medium

Publications (2)

Publication Number Publication Date
CN112800291A CN112800291A (en) 2021-05-14
CN112800291B true CN112800291B (en) 2021-07-16

Family

ID=75811457

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110406247.3A Active CN112800291B (en) 2021-04-15 2021-04-15 Similar account determination method and device, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN112800291B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113538108A (en) * 2021-07-27 2021-10-22 北京沃东天骏信息技术有限公司 Resource information determination method and device, electronic equipment and storage medium
CN114742479B (en) * 2022-06-10 2022-09-06 深圳竹云科技股份有限公司 Account identification method, account identification device, server and storage medium

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103198418A (en) * 2013-03-15 2013-07-10 北京亿赞普网络技术有限公司 Application recommendation method and application recommendation system
CN105511869A (en) * 2015-12-01 2016-04-20 福建工程学院 Demand tracking system and method based on user feedback
CN107305559A (en) * 2016-04-21 2017-10-31 中国移动通信集团广东有限公司 Method and apparatus are recommended in one kind application
CN107111651A (en) * 2016-07-29 2017-08-29 深圳越界创新科技有限公司 A kind of matching degree computational methods, device and user equipment
KR20180099073A (en) * 2017-02-28 2018-09-05 주식회사 인카코커뮤니케이션즈 Apparatus and Method for Recommending User Oriented Application

Also Published As

Publication number Publication date
CN112800291A (en) 2021-05-14

Similar Documents

Publication Publication Date Title
CN111090628B (en) Data processing method and device, storage medium and electronic equipment
CN112800291B (en) Similar account determination method and device, electronic equipment and storage medium
WO2020211236A1 (en) Read-write conflict resolution method and apparatus employing b+ tree and storage medium
CN109597979B (en) List table generation method and device, computer equipment and storage medium
EP4174677A1 (en) Data reading and writing method and device
WO2018040270A1 (en) Method and device for loading linux-system elf file in windows system
KR20220125198A (en) Data additional writing method, apparatus, electronic device, storage medium and computer programs
CN110569218B (en) Offline modification method and device for EXT file system and storage medium
US10698608B2 (en) Method, apparatus and computer storage medium for data input and output
CN115470156A (en) RDMA-based memory use method, system, electronic device and storage medium
CN115114232A (en) Method, device and medium for enumerating historical version objects
CN109783321B (en) Monitoring data management method and device and terminal equipment
CN113010542B (en) Service data processing method, device, computer equipment and storage medium
WO2022089235A1 (en) Product demonstration method and apparatus, computer device, and storage medium
US20230409235A1 (en) File system improvements for zoned storage device operations
CN111858590B (en) Storage system metadata organization method, system, terminal and storage medium
CN113806803A (en) Data storage method, system, terminal equipment and storage medium
CN114374392A (en) Data compression storage method and device, terminal equipment and readable storage medium
CN114297236A (en) Data blood relationship analysis method, terminal equipment and storage medium
CN112231232A (en) Method, device and equipment for determining test data model and generating test data
CN107741992B (en) Network storage method and device for conference records, intelligent tablet and storage medium
CN112527745B (en) Embedded file system multi-partition analysis method, terminal device and storage medium
CN112506651A (en) Method and equipment for data operation in large-data-volume environment
CN113411395B (en) Access request routing method, device, computer equipment and storage medium
CN112380174B (en) XFS file system analysis method containing deleted files, terminal device and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant