CN117370680A - Information determination method, device, equipment and computer storage medium
- Publication number
- CN117370680A, CN202210739615.0A
- Authority
- CN
- China
- Prior art keywords
- information
- position information
- broadband
- installation position
- address
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9537—Spatial or temporal dependent retrieval, e.g. spatiotemporal queries
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
An embodiment of the application provides an information determining method, apparatus, device, and computer storage medium. The method comprises: acquiring resident position information of a first object, and resident position information and broadband installation position information of a plurality of second objects, wherein the first object is an object for which broadband is to be installed and each second object is an object for which broadband has already been installed; calculating the distance between the resident position information of each second object and the resident position information of the first object; arranging the second objects in order of distance from nearest to farthest to obtain a target sequence; acquiring the broadband installation position information of the second objects corresponding to the first k distances in the target sequence, wherein k is a preset value; and determining the broadband installation position information that occurs most often among the broadband installation position information of the second objects corresponding to the first k distances as the target installation position information of the first object. By matching a broadband installation address to the object for which broadband is to be installed in this way, the embodiment improves the accuracy of the matching result.
Description
Technical Field
The present application belongs to the technical field of the Internet, and in particular relates to an information determining method, apparatus, device, and computer storage medium.
Background
With the rapid development of internet technology, the market of broadband installation services is becoming larger, and in order to rapidly complete broadband installation services, it is necessary to determine broadband coverage addresses corresponding to resident addresses of potential customers.
In the prior art, the user's mobile phone number is generally used: the user's home resident area is determined from the base stations used for calls and mobile data, and that area is matched against the broadband coverage addresses stored in a back-end broadband resource management system. This yields the broadband coverage address corresponding to the user's home resident area, so that the broadband installation service can be completed. However, base station addresses are much coarser than broadband addresses, and this granularity mismatch lowers the accuracy of the matching result.
Disclosure of Invention
The embodiment of the application provides an information determining method, an information determining device, information determining equipment and a computer storage medium, which can solve the problem that in the prior art, matching with a background broadband resource management system through a home resident area can cause lower accuracy of a matching result.
In a first aspect, an embodiment of the present application provides an information determining method, including:
acquiring resident position information of a first object, and resident position information and broadband installation position information of a plurality of second objects, wherein the first object is an object for which broadband is to be installed, and each second object is an object for which broadband has already been installed;
calculating the distance between the resident location information of each second object and the resident location information of the first object;
arranging the second objects in order of distance from nearest to farthest to obtain a target sequence;
acquiring the broadband installation position information of the second objects corresponding to the first k distances in the target sequence, wherein k is a preset value;
and determining the broadband installation position information that occurs most often among the broadband installation position information of the second objects corresponding to the first k distances as the target installation position information of the first object.
In one embodiment, the acquiring the resident location information of the first object includes:
acquiring a plurality of pieces of position information obtained by positioning the first object at a preset frequency, together with the stay time at each of the plurality of positions;
and determining, among the plurality of positions, the position information with the longest stay time and/or the largest number of stays as the resident position information of the first object.
In one embodiment, acquiring the broadband installation position information of the plurality of second objects includes:
acquiring the broadband installation address information of the plurality of second objects, wherein the address information comprises information of a first address sequence and information of a second address sequence;
filtering out the information of the second address sequence from the address information through a regular expression;
and determining the information of the first address sequence in the address information as the broadband installation position information of the plurality of second objects.
In one embodiment, the information of the first address sequence includes information of a sequence of provinces, cities, regions, streets, administrative villages, cells; the information of the second address sequence includes information of a sequence of blocks, units and floors.
In one embodiment, after said calculating the distance of the resident location information of each second object from the resident location information of the first object, the method further comprises:
and determining the broadband installation position information that occurs most often among the broadband installation position information of the second objects whose distance is smaller than a preset distance threshold as the target installation position information.
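The distance-threshold variant can be sketched as follows. This is only an illustration under stated assumptions: the function name `vote_within`, the sample distances, and the placeholder addresses "a" and "b" are hypothetical and do not come from the application.

```python
from collections import Counter

def vote_within(distances_m, install_address, max_distance_m):
    """Majority vote over second objects closer than max_distance_m.

    distances_m: object id -> distance to the first object (metres, assumed unit).
    install_address: object id -> broadband installation address.
    Returns None when no second object lies within the threshold.
    """
    near = [install_address[obj] for obj, d in distances_m.items() if d < max_distance_m]
    if not near:
        return None
    return Counter(near).most_common(1)[0][0]

# Hypothetical sample data for illustration only.
distances_m = {"A": 1000, "B": 500, "C": 800, "D": 600}
install_address = {"A": "b", "B": "a", "C": "a", "D": "b"}
print(vote_within(distances_m, install_address, max_distance_m=900))
```

Unlike the top-k rule, the number of voters here varies with local density, so the sketch returns `None` when nobody is within range.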
In one embodiment, the method further comprises:
acquiring a document of the first object related to the address;
performing word segmentation on the target installation position information to obtain a plurality of words after word segmentation;
acquiring the number of times of occurrence of the plurality of words in the document according to the mapping relation between the plurality of words and the positions of the words in the document;
and outputting the target installation position information when the number of occurrences exceeds a preset threshold.
In a second aspect, an embodiment of the present application provides an information determining apparatus, including:
an acquisition module, configured to acquire resident position information of a first object, and resident position information and broadband installation position information of a plurality of second objects, wherein the first object is an object for which broadband is to be installed, and each second object is an object for which broadband has already been installed;
a calculating module, configured to calculate the distance between the resident position information of each second object and the resident position information of the first object;
an arrangement module, configured to arrange the second objects in order of distance from nearest to farthest to obtain a target sequence;
the acquisition module being further configured to acquire the broadband installation position information of the second objects corresponding to the first k distances in the target sequence, where k is a preset value;
and a determining module, configured to determine the broadband installation position information that occurs most often among the broadband installation position information of the second objects corresponding to the first k distances as the target installation position information of the first object.
In one embodiment, the acquiring module is further configured to acquire a plurality of location information obtained by locating the first object at a preset frequency, and stay times at a plurality of the locations;
the determining module is further configured to determine, among the plurality of positions, the position information with the longest stay time and/or the largest number of stays as the resident location information of the first object.
In one embodiment, the information determining apparatus further comprises a filtering module;
the acquisition module is further used for acquiring the address information of the broadband installation of the plurality of second objects, wherein the address information comprises the information of the first address sequence and the information of the second address sequence;
the filtering module is used for filtering the information of the second address sequence in the address information through a regular expression;
the determining module is further configured to determine that the information of the first address sequence in the address information is broadband installation location information of a plurality of second objects.
In one embodiment, the information of the first address sequence includes information of a sequence of provinces, cities, regions, streets, administrative villages, cells; the information of the second address sequence includes information of a sequence of blocks, units and floors.
In one embodiment, the determining module is further configured to, after the distance between the resident location information of each second object and the resident location information of the first object has been calculated, determine the broadband installation location information that occurs most often among the broadband installation location information of the second objects whose distance is smaller than the preset distance threshold as the target installation location information.
In one embodiment, the information determining apparatus further comprises a processing module and an output module;
the acquisition module is further used for acquiring the document of the first object related to the address;
the processing module is used for carrying out word segmentation on the target installation position information to obtain a plurality of words after word segmentation;
the acquisition module is further used for acquiring the number of times of occurrence of the plurality of words in the document according to the mapping relation between the plurality of words and the positions of the words in the document;
the output module is used for outputting the target installation position information under the condition that the times exceed a preset threshold value.
In a third aspect, an embodiment of the present application provides an electronic device, including: a processor and a memory storing computer program instructions;
the processor when executing the computer program instructions implements the information determination method as described in any embodiment of the first aspect.
In a fourth aspect, embodiments of the present application provide a computer storage medium having stored thereon computer program instructions which, when executed by a processor, implement the information determining method as described in any of the embodiments of the first aspect.
In a fifth aspect, embodiments of the present application provide a computer program product, instructions in which, when executed by a processor of an electronic device, cause the electronic device to perform the information determining method as described in any one of the embodiments of the first aspect.
According to the information determining method, apparatus, device, and computer storage medium of the embodiments, the resident position information of a first object and the resident position information and broadband installation position information of a plurality of second objects are acquired, where the first object is an object for which broadband is to be installed and each second object is an object for which broadband has already been installed. This provides the data needed to predict the broadband installation address of the first object. Next, the distance between the resident position information of each second object and the resident position information of the first object is calculated, and the second objects are arranged in order of distance from nearest to farthest to obtain a target sequence. The broadband installation position information of the second objects corresponding to the first k distances in the target sequence is then acquired, where k is a preset value, and the broadband installation position information that occurs most often among them is determined as the target installation position information of the first object. In this way, a k-nearest-neighbor classification algorithm can use the known correspondence between the resident positions and broadband installation addresses of objects with broadband installed, together with the resident position of the object for which broadband is to be installed, to match a broadband installation address to that object, which improves the accuracy of the matching result.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present application, the drawings that are needed in the embodiments of the present application will be briefly described, and it is possible for a person skilled in the art to obtain other drawings according to these drawings without inventive effort.
FIG. 1 is a flow chart of a method for determining information provided in one embodiment of the present application;
FIG. 2 is a schematic structural diagram of an information determining apparatus provided in an embodiment of the present application;
FIG. 3 is a schematic structural diagram of an electronic device according to an embodiment of the present application.
Detailed Description
Features and exemplary embodiments of various aspects of the present application are described in detail below to make the objects, technical solutions and advantages of the present application more apparent, and to further describe the present application in conjunction with the accompanying drawings and the detailed embodiments. It should be understood that the specific embodiments described herein are intended to be illustrative of the application and are not intended to be limiting. It will be apparent to one skilled in the art that the present application may be practiced without some of these specific details. The following description of the embodiments is merely intended to provide a better understanding of the present application by showing examples of the present application.
It is noted that relational terms such as first and second, and the like are used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Moreover, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising … …" does not exclude the presence of other like elements in a process, method, article or apparatus that comprises the element.
The prior art includes three schemes for acquiring a user's broadband coverage address, the third being the one described in the background. Scheme one: the customer's address is obtained by a questionnaire survey sent to the user's mobile phone number, and the surveyed address is matched against the broadband resource management system to obtain the correspondence between the user address and the broadband coverage address. Scheme two: the address information registered when the user's mobile phone was activated on the network is matched against the back-end broadband resource management system to obtain the correspondence between the mobile phone number and the broadband coverage address. Scheme three: the user's home resident area is determined from the base stations used for calls and mobile data, and that area is matched against the broadband coverage addresses stored in the back-end broadband resource management system to obtain the broadband coverage address corresponding to the home resident area, so that the broadband installation service can be completed.
However, each scheme has drawbacks. In the first scheme, the address is sensitive personal information, and it often cannot be obtained because users are uncooperative or unwilling to provide it. Individuals also describe the same geographic location differently, so expressions such as "department store", "XX street XX in XX city" and "new hundred XX street" may all refer to one place. In addition, users protecting their personal information may provide a wrong address, so the acquired information can be inaccurate. In the second scheme, the addresses registered at network activation are mostly ID-document addresses, workplace addresses, temporary addresses and the like. They match the actual residence address at a low rate, are essentially never updated, and differ considerably from the actual place of residence, so the recognition results are of poor accuracy. In the third scheme, the granularity of base station addresses is too coarse: the coverage radius of some base stations in village and town areas reaches thousands or even tens of thousands of meters, which does not match the fine granularity of broadband addresses. For example, the area address located by the base station may be "X area-Y property building" while the broadband address is "X area-Z street-Q cell", so the recognition result is not accurate enough. Moreover, the naming rule for base station addresses (essentially the installation position of the base station or a nearby landmark) differs greatly from the naming rule for broadband addresses, so a correspondence to the broadband address cannot be established through the base station address.
To solve the above problems, the embodiments of the present application provide an information determining method, apparatus, device, and computer storage medium. The method acquires the resident location information of a first object, and the resident location information and broadband installation location information of a plurality of second objects, where the first object is an object for which broadband is to be installed and each second object is an object for which broadband has already been installed. This provides the data needed to predict the broadband installation address of the first object. Next, the distance between the resident location information of each second object and the resident location information of the first object is calculated, and the second objects are arranged in order of distance from nearest to farthest to obtain a target sequence. The broadband installation location information of the second objects corresponding to the first k distances in the target sequence is then acquired, where k is a preset value, and the broadband installation location information that occurs most often among them is determined as the target installation location information of the first object. In this way, a k-nearest-neighbor classification algorithm can use the known correspondence between the resident locations and broadband installation addresses of objects with broadband installed, together with the resident location of the object for which broadband is to be installed, to match a broadband installation address to that object, which improves the accuracy of the matching result. The information determining architecture provided in the embodiments of the present application is described first.
The information determination architecture proposed by the embodiment of the present application may include a resident location calculation module, a known broadband coverage calculation module, a slice resident location and broadband coverage analysis module, a customer group broadband coverage calibration module, and a potential customer broadband standard address output module.
The resident position calculation module calculates the resident positions of objects for which broadband is to be installed and of objects for which broadband has been installed. Positions can be obtained through mobile communication base station positioning, and the resident position of the object being measured can be calculated from geometric triangle relationships based on distance and angle measurements.
The known broadband coverage calculation module uses regular expressions to determine the broadband installation locations of objects for which broadband has been installed. The broadband standard address format is: provinces, cities, regions, streets (towns, villages, offices), administrative villages (streets, roads, lanes), cells, blocks (buildings, building numbers, village groups), units, floors (room numbers). The regular expression truncates the standard address of each such object at the cell level, because overly detailed addresses act as noise in the model calculation and hinder the model's subsequent predictions.
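A minimal sketch of the cell-level truncation follows. The application does not disclose its regular expression, so the pattern below is an assumption: it treats the first run of digits followed by a building, unit, floor, or room marker (号楼, 栋, 幢, 座, 单元, 层, 楼, 室) as the start of the overly detailed part. The function name `to_cell_level` and the sample address are hypothetical.

```python
import re

# Assumed pattern: the first "<digits><building/unit/floor/room marker>" token
# and everything after it. Not the regular expression used in the application.
DETAIL_SUFFIX = re.compile(r"\d+\s*(?:号楼|栋|幢|座|单元|层|楼|室).*$")

def to_cell_level(address: str) -> str:
    """Drop the block/unit/floor part, keeping the province-to-cell prefix."""
    return DETAIL_SUFFIX.sub("", address).strip()

# Hypothetical address for illustration only.
full = "广东省广州市天河区石牌街道龙口西路阳光小区3栋2单元5层"
print(to_cell_level(full))
```

Addresses that already stop at the cell level pass through unchanged, which keeps the filter safe to apply to mixed-quality records.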
The slice resident location and broadband coverage analysis module is used for predicting the broadband installation location of the object to be installed with the broadband, and the prediction can be performed through the resident location and the broadband installation location of the object with the broadband installed by using a k-Nearest Neighbor (KNN) classification algorithm in the model.
The customer group broadband coverage calibration module calibrates the model result against documents that relate the object to an address. Using word segmentation and full-text retrieval, it matches the predicted broadband installation position against each document; if the match succeeds, the predicted broadband installation position is considered accurate.
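The calibration step can be sketched with a small inverted index that maps each word to its positions in the document. Several simplifications here are assumptions rather than details from the application: whitespace splitting stands in for real word segmentation, the occurrence counts of all address words are summed into one total, and the names `build_index` and `confirm_address` are hypothetical.

```python
from collections import defaultdict

def build_index(doc_tokens):
    """Map each word to the list of positions where it occurs in the document."""
    index = defaultdict(list)
    for pos, word in enumerate(doc_tokens):
        index[word].append(pos)
    return index

def confirm_address(address_tokens, doc_tokens, min_hits):
    """Return True when the address words occur in the document more than min_hits times."""
    index = build_index(doc_tokens)
    hits = sum(len(index.get(word, [])) for word in address_tokens)
    return hits > min_hits

# Hypothetical document and predicted address; whitespace splitting is a
# stand-in for a real word segmenter.
doc = "deliver to Sunshine Garden Tianhe District ; invoice Sunshine Garden".split()
target = "Tianhe District Sunshine Garden".split()
print(confirm_address(target, doc, min_hits=3))
```

Whether the threshold applies to the total count or to each word separately is not specified in the text; the total is used here only to keep the sketch short.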
The potential customer broadband standard address output module is run automatically on a timed schedule and outputs the broadband installation positions to each service subsystem through a file interface.
According to the above architecture, the information determining method provided in the embodiment of the present application is described in detail below with reference to fig. 1.
Fig. 1 is a flow chart of an information determining method according to an embodiment of the present application.
As shown in fig. 1, the information determining method specifically may include the following steps:
S110, acquiring the resident location information of the first object, and the resident location information and broadband installation location information of a plurality of second objects.
The first object may be an object for which broadband is to be installed, and each second object may be an object for which broadband has already been installed. The resident location information may be the location at which the object stays longest, or the location at which the object is positioned most often. The broadband installation location information may be the address of the broadband installation, for example in the broadband standard address format: provinces, cities, regions, streets (towns, villages, offices), administrative villages (streets, roads, lanes), cells, blocks (buildings, building numbers, village groups), units, floors (room numbers).
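Both definitions of the resident location can be sketched in a few lines. The input layout (a list of coordinate and stay-time pairs), the function name `resident_location`, and the sample fixes are assumptions made for illustration.

```python
from collections import Counter, defaultdict

def resident_location(fixes, by="duration"):
    """Pick the resident location from positioning samples.

    fixes: iterable of ((lat, lon), stay_minutes) pairs (assumed layout).
    by="duration" picks the location with the longest total stay;
    any other value picks the location that was observed most often.
    """
    if by == "duration":
        total = defaultdict(float)
        for loc, minutes in fixes:
            total[loc] += minutes
        return max(total, key=total.get)
    counts = Counter(loc for loc, _ in fixes)
    return counts.most_common(1)[0][0]

# Hypothetical positioning samples.
fixes = [
    ((23.13, 113.32), 30),
    ((23.10, 113.30), 400),
    ((23.13, 113.32), 45),
    ((23.13, 113.32), 20),
]
print(resident_location(fixes, by="duration"))
print(resident_location(fixes, by="count"))
```

The two criteria can disagree, as they do for this sample, which is why the text allows either one or both to be used.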
S120, calculating the distance between the resident location information of each second object and the resident location information of the first object.
The resident location information may be represented by longitude and latitude coordinates, and the distance between the resident location information of the first object and that of each second object may be calculated as a Euclidean distance.
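A minimal sketch of the distance step, assuming each resident location is a (latitude, longitude) pair in degrees. The function name `euclidean` and the coordinates are hypothetical.

```python
import math

def euclidean(p, q):
    """Straight-line distance between two (latitude, longitude) pairs, in degrees."""
    return math.hypot(p[0] - q[0], p[1] - q[1])

# Hypothetical coordinates: one first object and two second objects.
first = (23.130, 113.320)
seconds = {"A": (23.139, 113.320), "B": (23.134, 113.323)}

distances = {name: euclidean(first, loc) for name, loc in seconds.items()}
print(distances)
```

The result is in degrees rather than meters. For ranking nearby neighbors this is often adequate, but because a degree of longitude shortens with latitude, an implementation might instead project the coordinates or use a great-circle distance; that choice is not addressed in the text.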
S130, arranging the second objects in the sequence from near to far according to the distance to obtain a target sequence.
As one example, the calculated distance from installed broadband user A to the location of the user for whom broadband is to be installed is 1 km; for installed user B it is 500 meters, for installed user C it is 800 meters, and for installed user D it is 600 meters. Arranging the installed broadband users from nearest to farthest gives the sequence: user B, user D, user C, user A.
S140, broadband installation position information of a second object corresponding to the first k distances in the target sequence is obtained, wherein k is a preset value.
Screening out second objects corresponding to the first k distances in the target sequence, and further obtaining broadband installation position information of the second objects corresponding to the first k distances, wherein the k value can be a value preset by a user according to needs and experiences, and the k value can be changed according to the number change of the second objects. For example, if the selected k value is 3 and the order of the second object in the target sequence is object B, object D, object C, and object a, then object B, object D, and object C corresponding to the first 3 distances in the target sequence are obtained, and further broadband installation location information of object B, object D, and object C is obtained.
S150, determining the broadband installation position information that occurs most often among the broadband installation position information of the second objects corresponding to the first k distances as the target installation position information of the first object.
The k pieces of broadband installation position information of the second objects corresponding to the first k distances in the target sequence are acquired, and the target installation position information may be the one that occurs most often among these k pieces. For example, with k equal to 3, object B, object D, and object C correspond to the first 3 distances in the target sequence. If their broadband installation positions are a, b, and a respectively, the target installation position information of the first object is a, which occurs most often.
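Steps S130 to S150 can be traced with the numbers from the examples in this section. The installation address assigned to object A is an assumed placeholder, since the text does not give one.

```python
from collections import Counter

# Distances in meters from the S130 example.
distances_m = {"A": 1000, "B": 500, "C": 800, "D": 600}
# Addresses of B, D and C follow the S150 example; A's value is an assumption.
install_address = {"A": "b", "B": "a", "C": "a", "D": "b"}

k = 3
target_sequence = sorted(distances_m, key=distances_m.get)      # nearest first
top_k = [install_address[obj] for obj in target_sequence[:k]]   # addresses of the k nearest
target_address = Counter(top_k).most_common(1)[0][0]            # majority vote

print(target_sequence, top_k, target_address)
```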
In some embodiments, the target installation location information of the first object is determined from the resident location information of the first object, the resident location information of the plurality of second objects, and the broadband installation location information based on a k nearest neighbor classification algorithm in the preset model.
The implementation process of the k nearest neighbor classification algorithm is as follows. First, the object that is to have broadband installed is defined as the data sample x_test to be marked, and the objects that already have broadband installed form the marked data set x_train. The resident position (longitude and latitude) of an object is selected as the feature x, and the broadband installation position is selected as the label y. Then, the samples in the marked data set x_train are traversed, the distance between each sample and the point to be marked is calculated, and the distances are stored in an array d_array. The array d_array is sorted, and the k nearest points are recorded as x_knn. Next, the number of samples of each category (broadband installation position) in x_knn is counted. The category of the sample to be marked is the category with the largest number of samples in x_knn.
The k value can be selected using k-fold cross-validation and a learning curve with cross-validation. Note that the number of folds in k-fold cross-validation is a separate quantity from the number of neighbors k being selected. For example, the raw data (the resident location information and the broadband installation location information of the second objects) is randomly divided into k parts; each time, k-1 parts are selected as the training set and the remaining 1 part is used as the test set. The cross-validation is repeated k times, and the average of the k accuracies is taken as the evaluation index of the final preset model. Because the partitioning into training set and test set affects the result of the model, the average of the k cross-validation results is a better measure of the effectiveness of the model. However, the score is calculated on the cross-validation test sets, so the data in the test sets also affects the choice of parameters, which may cause the model to overfit both the training data and the test data. The learning curve with cross-validation can detect whether the model is overfitted. The learning curve can be used to observe the influence of the data set size on the performance of the model, and the point on the learning curve with high accuracy and relatively small variance is selected, since the generalization capability at that point is the strongest. With cross-validation and the learning curve together, the selected hyperparameter (k value) can provide better generalization capability.
In the embodiment of the application, the resident position information of the first object, and the resident position information and the broadband installation position information of the plurality of second objects are obtained, where the first object is an object that is to have broadband installed and the second object is an object that already has broadband installed, which facilitates the subsequent prediction of the broadband installation address of the object that is to have broadband installed. Then, the distance between the resident position information of each second object and the resident position information of the first object is calculated, and the second objects are arranged from near to far by distance to obtain a target sequence. The broadband installation position information of the second objects corresponding to the first k distances in the target sequence is obtained, where k is a preset value, and the broadband installation position information that occurs most often among them is determined as the target installation position information. In this way, the k nearest neighbor classification algorithm can be used to match a broadband installation address to the object that is to have broadband installed, based on the correspondence between the resident positions and the broadband installation addresses of the objects that already have broadband installed and on the resident position of the object that is to have broadband installed, which improves the accuracy of the matching result.
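The procedure above can be sketched as follows. The text does not say which distance measure is applied to the longitude and latitude, so the haversine great-circle distance used here is an assumption, as are the function names and the sample coordinates and labels.

```python
import math
from collections import Counter

EARTH_RADIUS_M = 6371000.0  # mean Earth radius, used by the haversine formula


def haversine_m(p, q):
    """Great-circle distance in meters between two (lat, lon) points in degrees."""
    lat1, lon1, lat2, lon2 = map(math.radians, (*p, *q))
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(h))


def knn_predict(x_test, x_train, y_train, k):
    # Traverse the marked samples and store (distance, label) pairs in d_array.
    d_array = [(haversine_m(x_test, x), y) for x, y in zip(x_train, y_train)]
    # Sort by distance and keep the k nearest points as x_knn.
    d_array.sort(key=lambda pair: pair[0])
    x_knn = d_array[:k]
    # Count each category (installation location) and return the largest one.
    counts = Counter(label for _, label in x_knn)
    return counts.most_common(1)[0][0]


# Hypothetical resident positions (lat, lon) and installation labels.
x_train = [(30.000, 120.000), (30.001, 120.000), (30.002, 120.001), (30.050, 120.050)]
y_train = ["cell_1", "cell_1", "cell_2", "cell_3"]
x_test = (30.0005, 120.0002)

print(knn_predict(x_test, x_train, y_train, k=3))  # cell_1
```

When two categories are tied, `Counter.most_common` returns the one encountered first, which in this sketch is the category of the nearer neighbor; the text does not specify a tie-breaking rule.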
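A minimal sketch of selecting the number of neighbors by cross-validation is given below. To keep the two quantities apart, the number of folds is called `n_folds` and the number of neighbors `k_neighbors`; these names, the planar squared distance, and the synthetic two-cluster data are assumptions made for illustration, and the learning-curve step is not shown.

```python
import random
from collections import Counter


def knn_predict(x_test, x_train, y_train, k):
    # Planar squared distance is enough for ranking neighbors in this sketch.
    nearest = sorted(
        zip(x_train, y_train),
        key=lambda s: (s[0][0] - x_test[0]) ** 2 + (s[0][1] - x_test[1]) ** 2,
    )[:k]
    return Counter(y for _, y in nearest).most_common(1)[0][0]


def cross_val_accuracy(xs, ys, k_neighbors, n_folds=5, seed=0):
    # Randomly split the sample indices into n_folds parts.
    idx = list(range(len(xs)))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::n_folds] for i in range(n_folds)]
    scores = []
    for fold in folds:
        held_out = set(fold)
        train = [i for i in idx if i not in held_out]
        train_x = [xs[j] for j in train]
        train_y = [ys[j] for j in train]
        hits = sum(knn_predict(xs[i], train_x, train_y, k_neighbors) == ys[i]
                   for i in fold)
        scores.append(hits / len(fold))
    # The mean accuracy over all folds is the evaluation index.
    return sum(scores) / len(scores)


def select_k(xs, ys, candidates, n_folds=5):
    # Return the candidate with the highest mean accuracy (first one on ties).
    return max(candidates, key=lambda k: cross_val_accuracy(xs, ys, k, n_folds))


# Synthetic data: two well-separated clusters with labels "a" and "b".
xs = [(i * 0.01, 0.0) for i in range(10)] + [(5 + i * 0.01, 5.0) for i in range(10)]
ys = ["a"] * 10 + ["b"] * 10
best_k = select_k(xs, ys, candidates=[1, 3, 5])
```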
In some embodiments, S110: the obtaining the resident location information of the first object may specifically include:
acquiring a plurality of pieces of position information obtained by positioning a first object at a preset frequency and stay time at a plurality of positions;
and determining, among the plurality of pieces of position information, the position information with the longest stay time and/or the largest number of stays as the resident position information of the first object.
The first object is positioned at a preset frequency to obtain a plurality of pieces of position information, and the stay time at each of the plurality of positions is obtained. The preset frequency may be preset by a user; for example, the first object is positioned once per hour. The resident location information of the first object may be the position information that occurs the largest number of times among the plurality of pieces of position information, or the position information with the longest stay time, or the position information that both occurs the largest number of times and has the longest stay time. The resident location information of the second object is determined in the same manner as the resident location information of the first object.
In some embodiments, the position information with the longest stay time and/or the largest number of stays within a preset time period is determined as the resident location information of the first object.
The preset time period may be a night time period preset by a user.
As one example, users who have installed broadband and users who are to have broadband installed are positioned once per hour over a period of time to obtain a plurality of pieces of position information. For each user, the resident location with the largest number of stays and the longest stay time in the daytime period and the resident location with the largest number of stays and the longest stay time in the nighttime period are selected. The resident location in the daytime period is regarded as the workplace, the resident location in the nighttime period is regarded as the residence, and the resident location in the nighttime period (the residence) is taken as the final resident location of the user.
In the embodiment of the present application, a plurality of pieces of position information obtained by positioning the first object at a preset frequency, and the stay time at a plurality of positions, are acquired, and the position information with the longest stay time and/or the largest number of stays among them is determined as the resident location information of the first object. In this way, accurate resident location information of the first object can be obtained, which facilitates the subsequent matching of broadband installation positions.
In some embodiments, S110: the obtaining broadband installation position information of the plurality of second objects may specifically include:
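The selection of the nighttime resident location can be sketched as follows. The night window of 20:00 to 06:00, the function name, and the sample data are assumptions; the text only says that the preset time period may be a night period. Because the fixes are taken at a fixed frequency, the number of fixes at a location is used here as a stand-in for both the number of stays and the stay time.

```python
from collections import Counter
from datetime import datetime


def resident_location(fixes, night_start=20, night_end=6):
    """fixes: list of (timestamp, location_id) pairs sampled at a fixed frequency."""
    # Keep only the fixes that fall inside the (assumed) night window.
    night = [loc for ts, loc in fixes
             if ts.hour >= night_start or ts.hour < night_end]
    if not night:
        return None
    # The location seen most often at night is taken as the residence.
    return Counter(night).most_common(1)[0][0]


# One hypothetical day of hourly fixes: office from 08:00 to 18:59, home otherwise.
fixes = [(datetime(2022, 6, 1, h), "home" if h < 8 or h >= 19 else "office")
         for h in range(24)]
print(resident_location(fixes))  # home
```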
Acquiring the address information of the broadband installation of the plurality of second objects, wherein the address information can comprise the information of the first address sequence and the information of the second address sequence;
filtering information of a second address sequence in the address information through a regular expression;
the information of the first address sequence in the address information is determined as broadband installation position information of a plurality of second objects.
The first address sequence and the second address sequence may be any sequences within the broadband installation address of the second object. For example, the broadband installation address information of the second object may include information of provinces, cities, regions, streets (towns, villages, offices), administrative villages (streets, roads, roadways), cells, blocks (buildings, village groups), units, and floors (room numbers). In that case, the information of the first address sequence may include information of provinces, cities, regions, streets (towns, villages, offices), administrative villages (streets, roads, roadways), and cells, and the information of the second address sequence may include information of blocks (buildings, village groups), units, and floors (room numbers). The information of the first address sequence is extracted from the character string of the broadband installation address information of the second object through the regular expression, and the information of the second address sequence is filtered out; that is, the broadband installation address is retained down to the cell level.
In some embodiments, the information of the first address sequence may include information of a sequence of provinces, cities, regions, streets, administrative villages, cells; the information of the second address series may include information of a series of blocks, units, and floors.
As an example, the installation address of a certain installed broadband is "XX city XX region XX street XX administrative village XX cell XX building X unit X floor". This character string is filtered by a regular expression to obtain the filtered position "XX city XX region XX street XX administrative village XX cell", and matching with the resident position is then performed based on this position.
In this embodiment of the present application, the broadband installation address information of the plurality of second objects is obtained, where the address information may include information of the first address sequence and information of the second address sequence; the information of the second address sequence in the address information is filtered out by a regular expression, and the information of the first address sequence in the address information is determined as the broadband installation position information of the plurality of second objects. In this way, interfering data such as overly detailed floor information can be filtered out, which improves the accuracy of matching broadband installation positions with resident positions.
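One way to express such a filter is shown below as a sketch. The pattern and the function name are hypothetical, and the English keyword "cell" is used only because the example address is written in English here; a production pattern would use the level keywords of the actual address language.

```python
import re

# Keep everything up to and including the first cell-level keyword;
# whatever follows (building, unit, floor) is the second address sequence.
FIRST_SEQUENCE = re.compile(r"^(.*?\bcell\b)", re.IGNORECASE)


def to_cell_level(address):
    match = FIRST_SEQUENCE.match(address)
    # Addresses without a cell-level keyword are returned unchanged.
    return match.group(1) if match else address


full = "XX city XX region XX street XX administrative village XX cell 3 building 2 unit 5 floor"
print(to_cell_level(full))
# XX city XX region XX street XX administrative village XX cell
```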
In some embodiments, at S120: after calculating the distance between the resident location information of each second object and the resident location information of the first object, the information determining method may further include:
and determining, as the target installation position information, the broadband installation position information that occurs most often among the broadband installation position information of the second objects whose distances are smaller than a preset distance threshold.
The preset distance threshold may be a distance preset by a user. In addition to determining the target installation position information of the first object from the second objects corresponding to the first k distances, the broadband installation position information that occurs most often among the second objects whose distances are smaller than the preset distance threshold may be determined as the target installation position information.
As an example, consider the case where the broadband installation locations of the users with installed broadband are unevenly distributed. Suppose the distances between the resident locations of installed broadband users A, B, C, D, and E and the resident location of the user who is to have broadband installed are 500 meters, 600 meters, 800 meters, 5 kilometers, and 8 kilometers, respectively. If the k nearest neighbor classification algorithm is used with k equal to 4, user D is included among the first k distances even though the resident location of user D is much farther away than those of users A, B, and C. In this case, the broadband installation locations of users A, B, and C, whose distances are less than the preset distance threshold of 1 kilometer, are obtained, and the location that occurs most often among them is determined as the broadband installation position information of the user who is to have broadband installed, which may give a better result.
In the embodiment of the application, the broadband installation position information that occurs most often among the second objects whose distances are smaller than the preset distance threshold is determined as the target installation position information, so that the accuracy of determining the target installation position information can be improved when the broadband installation positions of the users with installed broadband are unevenly distributed.
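The threshold-based variant can be sketched as below. The example assumes that users D and E lie well beyond the 1 kilometer threshold (taken here as 5 km and 8 km), and the installation labels and function name are hypothetical.

```python
from collections import Counter


def predict_within_threshold(neighbors, max_distance_m):
    """neighbors: list of (distance_in_meters, install_location) pairs."""
    # Keep only installed users closer than the preset distance threshold.
    labels = [loc for dist, loc in neighbors if dist < max_distance_m]
    if not labels:
        return None  # no installed user is close enough to vote
    # The installation location that occurs most often wins.
    return Counter(labels).most_common(1)[0][0]


# Users A, B, C, D, E with hypothetical installation labels.
neighbors = [(500, "a"), (600, "a"), (800, "b"), (5000, "c"), (8000, "c")]
print(predict_within_threshold(neighbors, max_distance_m=1000))  # a
```

Unlike a fixed k, the number of voters here varies with local density, so a caller has to handle the case where no installed user falls inside the threshold.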
In some embodiments, the information determining method may further include:
acquiring a document of a first object related to an address;
performing word segmentation on the target installation position information to obtain a plurality of words after word segmentation;
acquiring the number of times of occurrence of a plurality of words in a document according to the mapping relation between the plurality of words and the positions of the words in the document;
and outputting the target installation position information when the number of times exceeds a preset threshold value.
The address-related document of the first object may be, for example, a document containing information related to the certificate address of the first object, group and family cluster information, group relationships, family relationships, or the like. The target installation position information is segmented by a word segmenter to obtain a plurality of words. The word segmenter may be an IK word segmenter, which uses a forward-iteration, finest-granularity segmentation algorithm and a multi-sub-processor analysis mode, and supports segmentation of English letters, numbers, Chinese words (personal names, place names), and the like. The word segmentation flow comprises dictionary loading, preprocessing, word segmentation, ambiguity processing, and sorted output.
The mapping relation between the words and the positions of the words in the document can be established and stored using an inverted index, so as to support full-text retrieval and obtain the number of times each word occurs in the document. Full-text retrieval is a retrieval mode in which a computer indexing program scans each word in a document and establishes an index for each word that records the number of occurrences and the positions of the word in the document; when a user issues a query, the retrieval program searches the previously established index and returns the results to the user. The target installation position information is output when the number of times exceeds a preset threshold, where the preset threshold may be preset by a user.
As an example, after the installation position of the user who is to have broadband installed has been predicted, the predicted broadband installation position needs to be calibrated, because the prediction may differ when the hyperparameter k takes different values. First, word segmentation is performed on the predicted installation position of the user who is to have broadband installed to obtain a plurality of words, and the mapping between the words and the positions of the words in the document is stored as an inverted index using the full-text retrieval engine toolkit Lucene. Through the inverted index, the list of documents containing a word can be obtained quickly from the word, and thus the number of times the word appears in the documents. If the number of occurrences of each word in the documents exceeds the preset threshold, the predicted broadband installation position of the user who is to have broadband installed is considered accurate, and the installation position is output to each service subsystem for use.
In the embodiment of the application, a document of the first object related to an address is acquired, word segmentation is performed on the target installation position information to obtain a plurality of words, the number of times the plurality of words occur in the document is obtained according to the mapping relation between the plurality of words and the positions of the words in the document, and the target installation position information is output when the number of times exceeds a preset threshold. In this way, other data of the first object can be used to calibrate the matching result, which improves the accuracy of the output result.
Fig. 2 is a schematic diagram showing a structure of an information determining apparatus 200 according to an exemplary embodiment.
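The calibration step can be illustrated with a small in-memory inverted index. This sketch does not use Lucene or the IK word segmenter: splitting on whitespace stands in for word segmentation, and the document contents, names, and threshold are hypothetical.

```python
from collections import defaultdict


def build_inverted_index(documents):
    """Map each word to {doc_id: [positions of the word in that document]}."""
    index = defaultdict(lambda: defaultdict(list))
    for doc_id, text in documents.items():
        for position, word in enumerate(text.split()):
            index[word][doc_id].append(position)
    return index


def is_confirmed(target_location, index, threshold):
    # Whitespace splitting is a stand-in for a real word segmenter.
    words = target_location.split()
    # Total occurrences of each word across all indexed documents.
    counts = {w: sum(len(pos) for pos in index.get(w, {}).values()) for w in words}
    # Output the location only if every word occurs more than the threshold.
    return all(c > threshold for c in counts.values()), counts


# Hypothetical address-related documents of the user to be installed.
documents = {
    "certificate": "riverside district maple community",
    "family": "riverside district maple community unit 2",
}
index = build_inverted_index(documents)
ok, counts = is_confirmed("riverside district maple community", index, threshold=1)
print(ok)  # True
```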
As shown in fig. 2, the information determining apparatus 200 may include:
an obtaining module 201, configured to obtain resident location information of a first object, and resident location information and broadband installation location information of a plurality of second objects, where the first object is an object to which a broadband is to be installed, and the second object is an object to which a broadband is installed;
a calculating module 202, configured to calculate a distance between the resident location information of each second object and the resident location information of the first object;
an arrangement module 203, configured to arrange the second objects in order from near to far according to the distance, so as to obtain a target sequence;
the obtaining module 201 is further configured to obtain broadband installation position information of the second objects corresponding to the first k distances in the target sequence, where k is a preset value;
the determining module 204 is configured to determine, as target installation location information, the broadband installation position information that occurs most often among the broadband installation position information of the second objects corresponding to the first k distances.
In one embodiment, the obtaining module 201 is further configured to obtain a plurality of location information obtained by locating the first object at a preset frequency, and stay time at a plurality of locations;
the determining module 204 is further configured to determine, from the plurality of pieces of location information, the location information with the longest stay time and/or the largest number of stays as the resident location information of the first object.
In one embodiment, the information determining apparatus 200 may further include a filtering module;
the obtaining module 201 is further configured to obtain address information of broadband installation of a plurality of second objects, where the address information includes information of a first address sequence and information of a second address sequence;
the filtering module is used for filtering information of a second address sequence in the address information through a regular expression;
the determining module 204 is further configured to determine that the information of the first address sequence in the address information is broadband installation location information of the plurality of second objects.
In one embodiment, the information of the first address sequence includes information of a sequence of provinces, cities, regions, streets, administrative villages, cells; the information of the second address sequence includes information of a sequence of blocks, units and floors.
In one embodiment, the determining module 204 is further configured to determine, after the distance between the resident location information of each second object and the resident location information of the first object is calculated, the broadband installation position information that occurs most often among the broadband installation position information of the second objects whose distances are less than the preset distance threshold as the target installation location information.
In one embodiment, the information determining apparatus 200 may further include a processing module and an output module;
the obtaining module 201 is further configured to obtain a document of the first object related to an address;
the processing module is used for performing word segmentation on the target installation position information to obtain a plurality of words after word segmentation;
the obtaining module 201 is further configured to obtain the number of times that the plurality of words appear in the document according to the mapping relationship between the plurality of words and the positions of the words in the document;
and the output module is used for outputting the target installation position information under the condition that the times exceed a preset threshold value.
Therefore, the resident position information of the first object, and the resident position information and the broadband installation position information of the plurality of second objects are obtained, where the first object is an object that is to have broadband installed and the second object is an object that already has broadband installed, which facilitates the subsequent prediction of the broadband installation address of the object that is to have broadband installed. Then, the distance between the resident position information of each second object and the resident position information of the first object is calculated, and the second objects are arranged from near to far by distance to obtain a target sequence. The broadband installation position information of the second objects corresponding to the first k distances in the target sequence is obtained, where k is a preset value, and the broadband installation position information that occurs most often among them is determined as the target installation position information. In this way, the k nearest neighbor classification algorithm can be used to match a broadband installation address to the object that is to have broadband installed, based on the correspondence between the resident positions and the broadband installation addresses of the objects that already have broadband installed and on the resident position of the object that is to have broadband installed, which improves the accuracy of the matching result.
Fig. 3 shows a schematic diagram of a hardware structure of an electronic device according to an embodiment of the present application.
A processor 301 and a memory 302 storing computer program instructions may be included in an electronic device.
In particular, the processor 301 may include a Central Processing Unit (CPU), or an application specific integrated circuit (Application Specific Integrated Circuit, ASIC), or may be configured to implement one or more integrated circuits of embodiments of the present application.
Memory 302 may include mass storage for data or instructions. By way of example, and not limitation, memory 302 may comprise a Hard Disk Drive (HDD), floppy disk drive, flash memory, optical disk, magneto-optical disk, magnetic tape, or Universal Serial Bus (USB) drive, or a combination of two or more of the foregoing. Memory 302 may include removable or non-removable (or fixed) media, where appropriate. Memory 302 may be internal or external to the electronic device, where appropriate. In a particular embodiment, the memory 302 is a non-volatile solid-state memory.
The memory may include Read Only Memory (ROM), Random Access Memory (RAM), magnetic disk storage media devices, optical storage media devices, flash memory devices, electrical, optical, or other physical/tangible memory storage devices. Thus, in general, the memory includes one or more tangible (non-transitory) computer-readable storage media (e.g., memory devices) encoded with software comprising computer-executable instructions, and when the software is executed (e.g., by one or more processors) it is operable to perform the operations described with reference to methods in accordance with aspects of the present disclosure.
The processor 301 implements any of the information determination methods of the above embodiments by reading and executing computer program instructions stored in the memory 302.
In one example, the electronic device may also include a communication interface 303 and a bus 310. As shown in fig. 3, the processor 301, the memory 302, and the communication interface 303 are connected to each other by a bus 310 and perform communication with each other.
The communication interface 303 is mainly used to implement communication between each module, device, unit and/or apparatus in the embodiments of the present application.
Bus 310 includes hardware, software, or both that couple the components of the electronic device to one another. By way of example, and not limitation, the buses may include an Accelerated Graphics Port (AGP) or other graphics bus, an Enhanced Industry Standard Architecture (EISA) bus, a Front Side Bus (FSB), a HyperTransport (HT) interconnect, an Industry Standard Architecture (ISA) bus, an InfiniBand interconnect, a Low Pin Count (LPC) bus, a memory bus, a Micro Channel Architecture (MCA) bus, a Peripheral Component Interconnect (PCI) bus, a PCI-Express (PCI-X) bus, a Serial Advanced Technology Attachment (SATA) bus, a Video Electronics Standards Association local (VLB) bus, or other suitable bus, or a combination of two or more of the above. Bus 310 may include one or more buses, where appropriate. Although embodiments of the present application describe and illustrate a particular bus, the present application contemplates any suitable bus or interconnect.
The electronic device may perform the information determining method in the embodiment of the present application based on acquiring the resident location information of the first object, and the resident location information and the broadband installation location information of the plurality of second objects, thereby implementing the information determining method described in connection with fig. 1.
In addition, in combination with the information determining method in the above embodiment, the embodiment of the application may be implemented by providing a computer storage medium. The computer storage medium has stored thereon computer program instructions; the computer program instructions, when executed by a processor, implement any of the information determination methods of the above embodiments.
It should be clear that the present application is not limited to the particular arrangements and processes described above and illustrated in the drawings. For the sake of brevity, a detailed description of known methods is omitted here. In the above embodiments, several specific steps are described and shown as examples. However, the method processes of the present application are not limited to the specific steps described and illustrated, and those skilled in the art can make various changes, modifications, and additions, or change the order between steps, after appreciating the spirit of the present application.
The functional blocks shown in the above-described structural block diagrams may be implemented in hardware, software, firmware, or a combination thereof. When implemented in hardware, it may be, for example, an electronic circuit, an Application Specific Integrated Circuit (ASIC), suitable firmware, a plug-in, a function card, or the like. When implemented in software, the elements of the present application are the programs or code segments used to perform the required tasks. The program or code segments may be stored in a machine readable medium or transmitted over transmission media or communication links by a data signal carried in a carrier wave. A "machine-readable medium" may include any medium that can store or transfer information. Examples of machine-readable media include electronic circuitry, semiconductor memory devices, ROM, flash memory, erasable ROM (EROM), floppy disks, CD-ROMs, optical disks, hard disks, fiber optic media, Radio Frequency (RF) links, and the like. The code segments may be downloaded via computer networks such as the internet, intranets, etc.
It should also be noted that the exemplary embodiments mentioned in this application describe some methods or systems based on a series of steps or devices. However, the present application is not limited to the order of the above-described steps, that is, the steps may be performed in the order mentioned in the embodiments, may be different from the order in the embodiments, or several steps may be performed simultaneously.
Aspects of the present disclosure are described above with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the disclosure. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, enable the implementation of the functions/acts specified in the flowchart and/or block diagram block or blocks. Such a processor may be, but is not limited to being, a general purpose processor, a special purpose processor, an application specific processor, or a field programmable logic circuit. It will also be understood that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware which performs the specified functions or acts, or combinations of special purpose hardware and computer instructions.
In the foregoing, only the specific embodiments of the present application are described, and it will be clearly understood by those skilled in the art that, for convenience and brevity of description, the specific working processes of the systems, modules and units described above may refer to the corresponding processes in the foregoing method embodiments, which are not repeated herein. It should be understood that the scope of the present application is not limited thereto, and any person skilled in the art can easily conceive various equivalent modifications or substitutions within the technical scope of the present application, which are intended to be included in the scope of the present application.
Claims (10)
1. An information determination method, comprising:
acquiring resident position information of a first object, and resident position information and broadband installation position information of a plurality of second objects, wherein the first object is an object for which broadband is to be installed, and each second object is an object for which broadband has been installed;
calculating the distance between the resident position information of each second object and the resident position information of the first object;
arranging the second objects in order of increasing distance to obtain a target sequence;
acquiring the broadband installation position information of the second objects corresponding to the first k distances in the target sequence, wherein k is a preset value;
and determining, as target installation position information of the first object, the broadband installation position information that occurs most frequently among the broadband installation position information of the second objects corresponding to the first k distances.
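The steps of claim 1 amount to a k-nearest-neighbour majority vote. The following is a minimal sketch only, not the applicant's implementation: the claim does not fix a distance metric, so great-circle (haversine) distance over latitude/longitude pairs is an assumption, and the names `haversine_km` and `target_installation_position` are hypothetical.

```python
from collections import Counter
from math import asin, cos, radians, sin, sqrt


def haversine_km(a, b):
    """Great-circle distance in km between two (lat, lon) pairs in degrees.

    Assumption: the claim does not name a metric; haversine is one option.
    """
    lat1, lon1 = map(radians, a)
    lat2, lon2 = map(radians, b)
    h = sin((lat2 - lat1) / 2) ** 2 + cos(lat1) * cos(lat2) * sin((lon2 - lon1) / 2) ** 2
    return 2 * 6371.0 * asin(sqrt(h))


def target_installation_position(first_resident, second_objects, k):
    """Return the most frequent installation address among the k nearest second objects.

    second_objects: list of (resident (lat, lon), installation_address) tuples.
    Returns None when there are no second objects.
    """
    # Arrange second objects from near to far (the "target sequence").
    ranked = sorted(second_objects, key=lambda o: haversine_km(first_resident, o[0]))
    nearest = ranked[:k]
    if not nearest:
        return None
    # Majority vote; on equal counts Counter keeps first-seen order,
    # so a tie goes to the address of the nearer neighbour.
    counts = Counter(address for _, address in nearest)
    return counts.most_common(1)[0][0]


if __name__ == "__main__":
    neighbours = [
        ((30.000, 120.0), "Community A"),
        ((30.001, 120.0), "Community B"),
        ((30.002, 120.0), "Community B"),
        ((31.000, 121.0), "Community C"),
    ]
    print(target_installation_position((30.0, 120.0), neighbours, 3))
```

The choice of k controls how local the vote is: with k = 1 the result is simply the nearest neighbour's address, while larger k smooths over individual outliers.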
2. The method of claim 1, wherein acquiring the resident position information of the first object comprises:
acquiring a plurality of pieces of position information obtained by positioning the first object at a preset frequency, and the stay durations at a plurality of positions;
and determining the position information with the longest stay duration and/or the largest number of stays among the plurality of positions as the resident position information of the first object.
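One possible reading of claim 2 can be sketched as follows. This is an illustration under stated assumptions: the claim allows the longest stay duration "and/or" the largest number of stays, and here the total stay duration is used first with the number of stays as a tie-breaker. The function name `resident_position` and the `(position, duration)` input shape are hypothetical.

```python
from collections import defaultdict


def resident_position(visits):
    """Pick the resident position from (position, stay_duration) samples.

    Assumption: rank by total stay duration, then by number of stays.
    Returns None when no samples are given.
    """
    total_duration = defaultdict(float)
    stay_count = defaultdict(int)
    for position, duration in visits:
        total_duration[position] += duration
        stay_count[position] += 1
    if not total_duration:
        return None
    return max(total_duration, key=lambda p: (total_duration[p], stay_count[p]))


if __name__ == "__main__":
    samples = [("home", 8.0), ("office", 9.0), ("home", 7.5), ("cafe", 1.0)]
    print(resident_position(samples))
```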
3. The method of claim 1, wherein acquiring the broadband installation position information of the plurality of second objects comprises:
acquiring address information of the broadband installations of the plurality of second objects, wherein the address information comprises information of a first address sequence and information of a second address sequence;
filtering out the information of the second address sequence in the address information by means of a regular expression;
and determining the information of the first address sequence in the address information as the broadband installation position information of the plurality of second objects.
4. A method according to claim 3, characterized in that the information of the first address sequence comprises information of a sequence of provinces, cities, regions, streets, administrative villages, cells; the information of the second address sequence includes information of a sequence of blocks, units and floors.
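The regular-expression filtering of claims 3 and 4 might look like the sketch below. The claims do not give the expression itself, so the pattern, the English tokens `Block`, `Unit` and `Floor`, the comma-separated address layout and the function name `broadband_installation_position` are all assumptions for illustration; real Chinese addresses would need a pattern built around the corresponding Chinese suffixes.

```python
import re

# Hypothetical pattern: strips "Block N", "Unit N" and "Floor N" components
# (the second address sequence), together with a preceding comma if present.
SECOND_SEQUENCE = re.compile(r",?\s*\b(?:Block|Unit|Floor)\s+\d+\b", re.IGNORECASE)


def broadband_installation_position(address):
    """Keep only the first address sequence (province down to community)."""
    return SECOND_SEQUENCE.sub("", address).strip()


if __name__ == "__main__":
    full = ("Zhejiang Province, Hangzhou City, Xihu District, Wensan Road, "
            "Sunshine Community, Block 3, Unit 2, Floor 5")
    print(broadband_installation_position(full))
```

Stripping the block, unit and floor means that neighbours in the same community map to the same installation position, which is what makes a frequency count over addresses meaningful.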
5. The method of claim 1, wherein after calculating the distance between the resident position information of each second object and the resident position information of the first object, the method further comprises:
and determining, as the target installation position information, the broadband installation position information that occurs most frequently among the broadband installation position information of the second objects whose distances are smaller than a preset distance threshold.
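The threshold variant of claim 5 replaces the fixed k with a radius. A minimal sketch, assuming the distances have already been computed and are paired with addresses; the function name `target_within_threshold` and the input shape are hypothetical.

```python
from collections import Counter


def target_within_threshold(distance_and_address, max_distance):
    """Most frequent address among second objects closer than max_distance.

    distance_and_address: iterable of (distance, installation_address).
    Returns None when no second object lies within the threshold.
    """
    nearby = [address for distance, address in distance_and_address
              if distance < max_distance]
    if not nearby:
        return None
    return Counter(nearby).most_common(1)[0][0]


if __name__ == "__main__":
    pairs = [(0.2, "A"), (0.4, "B"), (0.5, "B"), (3.0, "C"), (3.1, "C"), (3.2, "C")]
    print(target_within_threshold(pairs, 1.0))
```

Unlike a fixed k, a radius can return no candidates at all in sparsely covered areas, so the caller needs to handle the `None` case.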
6. The method according to claim 1, wherein the method further comprises:
acquiring a document of the first object related to the address;
performing word segmentation on the target installation position information to obtain a plurality of words after word segmentation;
acquiring the number of occurrences of the plurality of words in the document according to the mapping relation between the plurality of words and the positions of the words in the document;
and outputting the target installation position information when the number of occurrences exceeds a preset threshold.
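The verification step of claim 6 can be sketched with a simple inverted index (a mapping from each word to its positions in the document). This is an illustration only: the regex tokenizer stands in for a real word segmenter, the claim does not say whether occurrences are counted per word or in total (the total is assumed here), and the names `tokenize`, `build_inverted_index` and `confirm_target` are hypothetical.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Stand-in word segmentation: lowercase alphanumeric runs."""
    return re.findall(r"\w+", text.lower())


def build_inverted_index(document):
    """Map each word to the list of positions at which it occurs."""
    index = defaultdict(list)
    for position, word in enumerate(tokenize(document)):
        index[word].append(position)
    return index


def confirm_target(target, document, min_count):
    """Return target if its words occur more than min_count times in total."""
    index = build_inverted_index(document)
    words = set(tokenize(target))
    occurrences = sum(len(index.get(word, [])) for word in words)
    return target if occurrences > min_count else None


if __name__ == "__main__":
    doc = ("Delivery to Sunshine Community, Wensan Road. "
           "Invoice address: Sunshine Community.")
    print(confirm_target("Wensan Road Sunshine Community", doc, 3))
```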
7. An information determining apparatus, characterized in that the apparatus comprises:
an acquisition module, configured to acquire resident position information of a first object, and resident position information and broadband installation position information of a plurality of second objects, wherein the first object is an object for which broadband is to be installed, and each second object is an object for which broadband has been installed;
a calculation module, configured to calculate the distance between the resident position information of each second object and the resident position information of the first object;
an arrangement module, configured to arrange the second objects in order of increasing distance to obtain a target sequence;
the acquisition module being further configured to acquire the broadband installation position information of the second objects corresponding to the first k distances in the target sequence, wherein k is a preset value;
and a determining module, configured to determine, as target installation position information of the first object, the broadband installation position information that occurs most frequently among the broadband installation position information of the second objects corresponding to the first k distances.
8. An electronic device, the device comprising: a processor and a memory storing computer program instructions; the processor reads and executes the computer program instructions to implement the information determination method according to any of claims 1-6.
9. A computer storage medium having stored thereon computer program instructions which, when executed by a processor, implement the information determining method according to any of claims 1-6.
10. A computer program product, characterized in that instructions in the computer program product, when executed by a processor of an electronic device, cause the electronic device to perform the information determination method according to any of claims 1-6.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210739615.0A CN117370680A (en) | 2022-06-28 | 2022-06-28 | Information determination method, device, equipment and computer storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210739615.0A CN117370680A (en) | 2022-06-28 | 2022-06-28 | Information determination method, device, equipment and computer storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN117370680A true CN117370680A (en) | 2024-01-09 |
Family
ID=89393312
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210739615.0A Pending CN117370680A (en) | 2022-06-28 | 2022-06-28 | Information determination method, device, equipment and computer storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN117370680A (en) |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108628811B (en) | Address text matching method and device | |
Eom et al. | Improving the prediction of annual average daily traffic for nonfreeway facilities by applying a spatial statistical method | |
CN111489008B (en) | Traffic accident influencing factor space effect analysis method and application thereof | |
CN109885643B (en) | Position prediction method based on semantic track and storage medium | |
CN112579718B (en) | Urban land function identification method and device and terminal equipment | |
CN110020224B (en) | Correlation method, device, equipment and medium for map interest point data | |
CN104599002B (en) | Method and equipment for predicting order value | |
CN110674208B (en) | Method and device for determining position information of user | |
US20150248436A1 (en) | Methods, Circuits, Devices, Systems and Associated Computer Executable Code for Assessing a Presence Likelihood of a Subject at One or More Venues | |
CN111190988B (en) | Address resolution method, device, equipment and computer readable storage medium | |
CN111782741A (en) | Interest point mining method and device, electronic equipment and storage medium | |
CN111931077A (en) | Data processing method and device, electronic equipment and storage medium | |
Cai et al. | Discovering regions of anomalous spatial co-locations | |
Núñez et al. | Resolving regional frequency analysis of precipitation at large and complex scales using a bottom-up approach: The Latin America and the Caribbean Drought Atlas | |
CN111460044B (en) | Geographic position data processing method and device | |
Mathew et al. | Comparative assessment of geospatial and statistical methods to estimate local road annual average daily traffic | |
CN111382138B (en) | POI data processing method, device, equipment and medium | |
CN105894323A (en) | User evaluation method, index model training method and apparatus | |
Rodrigues et al. | Extracting 3D maps from crowdsourced GNSS skyview data | |
CN104166659A (en) | Method and system for map data duplication judgment | |
CN108540926B (en) | Wireless signal fingerprint construction method and device | |
CN117370680A (en) | Information determination method, device, equipment and computer storage medium | |
CN114007186B (en) | Positioning method and related product | |
CN112990779A (en) | Method, device, equipment and storage medium for scoring candidate address | |
CN115310735A (en) | Method, device, equipment and storage medium for identifying business district boundary |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||