CN117370680A - Information determination method, device, equipment and computer storage medium - Google Patents

Information determination method, device, equipment and computer storage medium Download PDF

Info

Publication number
CN117370680A
CN117370680A CN202210739615.0A CN202210739615A CN117370680A CN 117370680 A CN117370680 A CN 117370680A CN 202210739615 A CN202210739615 A CN 202210739615A CN 117370680 A CN117370680 A CN 117370680A
Authority
CN
China
Prior art keywords
information
broadband
location information
address
resident
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210739615.0A
Other languages
Chinese (zh)
Inventor
张兴东
思永坤
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Mobile Communications Group Co Ltd
China Mobile Online Services Co Ltd
Original Assignee
China Mobile Communications Group Co Ltd
China Mobile Online Services Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Mobile Communications Group Co Ltd, China Mobile Online Services Co Ltd filed Critical China Mobile Communications Group Co Ltd
Priority to CN202210739615.0A priority Critical patent/CN117370680A/en
Publication of CN117370680A publication Critical patent/CN117370680A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9537Spatial or temporal dependent retrieval, e.g. spatiotemporal queries

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The embodiment of the application provides an information determining method, device, equipment and computer storage medium, wherein the method comprises the steps of acquiring resident position information of a first object, and resident position information and broadband installation position information of a plurality of second objects, wherein the first object is an object to be installed with a broadband, and the second object is an object installed with the broadband; calculating the distance between the resident location information of each second object and the resident location information of the first object; arranging the second objects in sequence from near to far according to the distance to obtain a target sequence; acquiring broadband installation position information of a second object corresponding to the first k distances in a target sequence, wherein k is a preset value; and determining the most wideband installation position information in the wideband installation position information of the second object corresponding to the first k distances as target installation position information of the first object. According to the embodiment of the application, the broadband installation address of the object to be installed with the broadband is matched, so that the accuracy of the matching result is improved.

Description

信息确定方法、装置、设备及计算机存储介质Information determination methods, devices, equipment and computer storage media

技术领域Technical field

本申请属于互联网技术领域,尤其涉及一种信息确定方法、装置、设备及计算机存储介质。This application belongs to the field of Internet technology, and in particular relates to an information determination method, device, equipment and computer storage medium.

背景技术Background technique

随着互联网技术的快速发展,宽带安装业务的市场越来越大,为了快捷的完成宽带安装业务,需要确定潜在客户的常驻地址对应的宽带覆盖地址。With the rapid development of Internet technology, the market for broadband installation services is getting larger and larger. In order to quickly complete the broadband installation services, it is necessary to determine the broadband coverage address corresponding to the permanent address of potential customers.

现有技术中,通常利用用户手机号码,通过通话基站和移动数据使用基站判定用户家庭常驻区域,通过家庭常驻区域与后台宽带资源管理系统中存储的宽带覆盖地址匹配,从而得出该用户常驻区域对应的宽带覆盖地址,以便完成宽带安装业务。然而,基站地址与宽带地址的细粒度不匹配,会导致匹配结果准确性较低。In the existing technology, the user's mobile phone number is usually used to determine the user's home area through the call base station and the mobile data usage base station. The home area is matched with the broadband coverage address stored in the background broadband resource management system to determine the user's home area. Broadband coverage address corresponding to the resident area in order to complete the broadband installation business. However, the fine-grained mismatch between the base station address and the broadband address will lead to less accurate matching results.

发明内容Contents of the invention

本申请实施例提供一种信息确定方法、装置、设备及计算机存储介质,能够解决现有技术中通过家庭常驻区域与后台宽带资源管理系统匹配,会导致匹配结果准确性较低的问题。Embodiments of the present application provide an information determination method, device, equipment and computer storage medium, which can solve the problem in the prior art that matching the home resident area with the background broadband resource management system will lead to low accuracy of the matching results.

第一方面,本申请实施例提供一种信息确定方法,该方法包括:In a first aspect, embodiments of the present application provide an information determination method, which method includes:

获取第一对象的常驻位置信息,以及多个第二对象的常驻位置信息和宽带安装位置信息,所述第一对象为待安装宽带的对象,所述第二对象为已安装宽带的对象;Obtain the resident location information of the first object, and the resident location information and broadband installation location information of a plurality of second objects. The first object is the object to be installed with broadband, and the second object is the object that has installed broadband. ;

计算每个第二对象的常驻位置信息与第一对象的常驻位置信息的距离;Calculate the distance between the resident position information of each second object and the resident position information of the first object;

按照所述距离的由近至远的顺序对所述第二对象进行排列,得到目标序列;Arrange the second objects in order from near to far to obtain a target sequence;

获取所述目标序列中前k个距离对应的第二对象的宽带安装位置信息,其中,k为预设数值;Obtain the broadband installation position information of the second object corresponding to the first k distances in the target sequence, where k is a preset value;

确定所述前k个距离对应的第二对象的宽带安装位置信息中,数量最多的宽带安装位置信息为第一对象的目标安装位置信息。It is determined that among the broadband installation location information of the second object corresponding to the first k distances, the largest number of broadband installation location information is the target installation location information of the first object.

在一种实施方式中,所述获取第一对象的常驻位置信息,包括:In one implementation, obtaining the resident location information of the first object includes:

获取对所述第一对象以预设频率定位得到的多个位置信息,以及在多个所述位置停留的时间;Obtain multiple location information obtained by locating the first object at a preset frequency, and the time spent at multiple locations;

确定所述多个位置信息中,在所述多个位置停留的时间最长和/或次数最多的位置信息为第一对象的常驻位置信息。It is determined that among the plurality of position information, the position information that stays at the plurality of positions for the longest time and/or the most times is the resident position information of the first object.

在一种实施方式中,获取多个第二对象的宽带安装位置信息,包括:In one implementation, obtaining broadband installation location information of multiple second objects includes:

获取多个第二对象的宽带安装的地址信息,所述地址信息包括第一地址序列的信息和第二地址序列的信息;Obtain address information of broadband installations of multiple second objects, where the address information includes information of a first address sequence and information of a second address sequence;

通过正则表达式过滤所述地址信息中所述第二地址序列的信息;Filter the information of the second address sequence in the address information through a regular expression;

确定所述地址信息中所述第一地址序列的信息为多个第二对象的宽带安装位置信息。It is determined that the information of the first address sequence in the address information is broadband installation location information of a plurality of second objects.

在一种实施方式中,所述第一地址序列的信息包括省、市、区、街道、行政村、小区的序列的信息;所述第二地址序列的信息包括栋、单元以及楼层的序列的信息。In one embodiment, the information on the first address sequence includes information on the sequence of provinces, cities, districts, streets, administrative villages, and communities; the information on the second address sequence includes the information on the sequence of buildings, units, and floors. information.

在一种实施方式中,在所述计算每个第二对象的常驻位置信息与第一对象的常驻位置信息的距离之后,所述方法还包括:In one implementation, after calculating the distance between the resident location information of each second object and the resident location information of the first object, the method further includes:

确定小于预设距离阈值的距离对应的第二对象的宽带安装位置信息中,数量最多的宽带安装位置信息为所述目标安装位置信息。It is determined that among the broadband installation location information of the second object corresponding to a distance smaller than the preset distance threshold, the largest number of broadband installation location information is the target installation location information.

在一种实施方式中,所述方法还包括:In one embodiment, the method further includes:

获取所述第一对象的与地址相关的文档;Obtain address-related documents of the first object;

对所述目标安装位置信息进行分词处理,得到分词后的多个单词;Perform word segmentation processing on the target installation location information to obtain multiple words after word segmentation;

根据所述多个单词与单词在所述文档中的位置之间的映射关系,获取所述多个单词在所述文档中出现的次数;According to the mapping relationship between the multiple words and the position of the word in the document, obtain the number of times the multiple words appear in the document;

在所述次数超过预设阈值的情况下,将所述目标安装位置信息输出。If the number of times exceeds the preset threshold, the target installation location information is output.

第二方面,本申请实施例提供了一种信息确定装置,该信息确定装置包括:In a second aspect, embodiments of the present application provide an information determination device, which includes:

获取模块,用于获取第一对象的常驻位置信息,以及多个第二对象的常驻位置信息和宽带安装位置信息,所述第一对象为待安装宽带的对象,所述第二对象为已安装宽带的对象;An acquisition module is used to obtain the resident location information of a first object, as well as the resident location information and broadband installation location information of a plurality of second objects. The first object is an object to be installed with broadband, and the second object is Objects with installed broadband;

计算模块,用于计算每个第二对象的常驻位置信息与第一对象的常驻位置信息的距离;a calculation module for calculating the distance between the resident position information of each second object and the resident position information of the first object;

排列模块,用于按照所述距离的由近至远的顺序对所述第二对象进行排列,得到目标序列;An arrangement module, configured to arrange the second objects in order from near to far according to the distance to obtain a target sequence;

所述获取模块,还用于获取所述目标序列中前k个距离对应的第二对象的宽带安装位置信息,其中,k为预设数值;The acquisition module is also used to acquire the broadband installation position information of the second object corresponding to the first k distances in the target sequence, where k is a preset value;

确定模块,用于确定所述前k个距离对应的第二对象的宽带安装位置信息中,数量最多的宽带安装位置信息为第一对象的目标安装位置信息。A determination module configured to determine that among the broadband installation location information of the second object corresponding to the first k distances, the broadband installation location information with the largest amount is the target installation location information of the first object.

在一种实施方式中,所述获取模块,还用于获取对所述第一对象以预设频率定位得到的多个位置信息,以及在多个所述位置停留的时间;In one embodiment, the acquisition module is also used to acquire multiple location information obtained by locating the first object at a preset frequency, and the time spent at multiple locations;

所述确定模块,还用于确定所述多个位置信息中,在所述多个位置停留的时间最长和/或次数最多的位置信息为第一对象的常驻位置信息。The determining module is further configured to determine that among the plurality of position information, the position information that has stayed at the plurality of positions for the longest time and/or the most times is the resident position information of the first object.

在一种实施方式中,该信息确定装置还包括过滤模块;In one embodiment, the information determining device further includes a filtering module;

所述获取模块,还用于获取多个第二对象的宽带安装的地址信息,所述地址信息包括第一地址序列的信息和第二地址序列的信息;The acquisition module is also used to acquire the address information of the broadband installation of multiple second objects, where the address information includes information of the first address sequence and information of the second address sequence;

过滤模块,用于通过正则表达式过滤所述地址信息中所述第二地址序列的信息;A filtering module, configured to filter the information of the second address sequence in the address information through regular expressions;

所述确定模块,还用于确定所述地址信息中所述第一地址序列的信息为多个第二对象的宽带安装位置信息。The determining module is further configured to determine that the information of the first address sequence in the address information is broadband installation location information of a plurality of second objects.

在一种实施方式中,所述第一地址序列的信息包括省、市、区、街道、行政村、小区的序列的信息;所述第二地址序列的信息包括栋、单元以及楼层的序列的信息。In one embodiment, the information on the first address sequence includes information on the sequence of provinces, cities, districts, streets, administrative villages, and communities; the information on the second address sequence includes the information on the sequence of buildings, units, and floors. information.

在一种实施方式中,所述确定模块,还用于在所述计算每个第二对象的常驻位置信息与第一对象的常驻位置信息的距离之后,确定小于预设距离阈值的距离对应的第二对象的宽带安装位置信息中,数量最多的宽带安装位置信息为所述目标安装位置信息。In one implementation, the determination module is further configured to determine a distance less than a preset distance threshold after calculating the distance between the resident location information of each second object and the resident location information of the first object. Among the broadband installation location information corresponding to the second object, the broadband installation location information with the largest amount is the target installation location information.

在一种实施方式中,该信息确定装置还包括处理模块和输出模块;In one implementation, the information determining device further includes a processing module and an output module;

所述获取模块,还用于获取所述第一对象的与地址相关的文档;The acquisition module is also used to acquire documents related to the address of the first object;

处理模块,用于对所述目标安装位置信息进行分词处理,得到分词后的多个单词;A processing module, used to perform word segmentation processing on the target installation location information to obtain multiple words after word segmentation;

所述获取模块,还用于根据所述多个单词与单词在所述文档中的位置之间的映射关系,获取所述多个单词在所述文档中出现的次数;The acquisition module is also configured to obtain the number of times the multiple words appear in the document based on the mapping relationship between the multiple words and the positions of the words in the document;

所述输出模块,用于在所述次数超过预设阈值的情况下,将所述目标安装位置信息输出。The output module is configured to output the target installation location information when the number of times exceeds a preset threshold.

第三方面,本申请实施例提供了一种电子设备,该电子设备包括:处理器以及存储有计算机程序指令的存储器;In a third aspect, embodiments of the present application provide an electronic device, which includes: a processor and a memory storing computer program instructions;

处理器执行计算机程序指令时实现如第一方面的任一项实施例中所述的信息确定方法。When the processor executes the computer program instructions, the information determining method as described in any embodiment of the first aspect is implemented.

第四方面,本申请实施例提供了一种计算机存储介质,计算机存储介质上存储有计算机程序指令,计算机程序指令被处理器执行时实现如第一方面的任一项实施例中所述的信息确定方法。In the fourth aspect, embodiments of the present application provide a computer storage medium. Computer program instructions are stored on the computer storage medium. When the computer program instructions are executed by a processor, the information described in any embodiment of the first aspect is realized. Determine the method.

第五方面,本申请实施例提供了一种计算机程序产品,所述计算机程序产品中的指令由电子设备的处理器执行时,使得所述电子设备执行如第一方面的任一项实施例中所述的信息确定方法。In a fifth aspect, embodiments of the present application provide a computer program product. When instructions in the computer program product are executed by a processor of an electronic device, the electronic device causes the electronic device to execute as in any embodiment of the first aspect. Described information determination method.

本申请实施例的信息确定方法、装置、设备及计算机存储介质,通过获取第一对象的常驻位置信息,以及多个第二对象的常驻位置信息和宽带安装位置信息,第一对象为待安装宽带的对象,第二对象为已安装宽带的对象,这样,便于后续对待安装宽带的对象的宽带安装地址的预测。接着,计算每个第二对象的常驻位置信息与第一对象的常驻位置信息的距离,并按照距离的由近至远的顺序对第二对象进行排列,得到目标序列,进而获取目标序列中前k个距离对应的第二对象的宽带安装位置信息,其中,k为预设数值,从而确定前k个距离对应的第二对象的宽带安装位置信息中,数量最多的宽带安装位置信息为第一对象的目标安装位置信息。这样,可以利用k最近邻分类算法,通过已安装宽带的对象的宽带安装地址与常驻位置的对应关系,以及待安装宽带的对象宽带安装地址,匹配待安装宽带的对象的宽带安装地址,提高了匹配结果的准确性。The information determination method, device, equipment and computer storage medium of the embodiments of the present application obtain the resident location information of the first object, as well as the resident location information and broadband installation location information of multiple second objects. The second object is the object that has installed broadband, so that the subsequent prediction of the broadband installation address of the object to be installed with broadband is facilitated. Then, calculate the distance between the resident position information of each second object and the resident position information of the first object, and arrange the second objects in order from near to far to obtain the target sequence, and then obtain the target sequence Among the broadband installation location information of the second object corresponding to the first k distances, where k is a preset value, it is determined that among the broadband installation location information of the second object corresponding to the first k distances, the largest number of broadband installation location information is Target installation location information of the first object. In this way, the k-nearest neighbor classification algorithm can be used to match the broadband installation address of the object to be installed through the corresponding relationship between the broadband installation address of the object with broadband installed and the resident location, and the broadband installation address of the object to be installed with broadband, thereby improving the performance of the object. The accuracy of the matching results.

附图说明Description of the drawings

为了更清楚地说明本申请实施例的技术方案,下面将对本申请实施例中所需要使用的附图作简单的介绍,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to explain the technical solutions of the embodiments of the present application more clearly, the drawings required to be used in the embodiments of the present application will be briefly introduced below. For those of ordinary skill in the art, without exerting creative efforts, they can also Additional drawings can be obtained from these drawings.

图1是本申请一个实施例提供的信息确定方法的流程示意图;Figure 1 is a schematic flowchart of an information determination method provided by an embodiment of the present application;

图2是本申请一个实施例提供的信息确定装置的结构示意图;Figure 2 is a schematic structural diagram of an information determination device provided by an embodiment of the present application;

图3是本申请一个实施例提供的电子设备的结构示意图。Figure 3 is a schematic structural diagram of an electronic device provided by an embodiment of the present application.

具体实施方式Detailed ways

下面将详细描述本申请的各个方面的特征和示例性实施例,为了使本申请的目的、技术方案及优点更加清楚明白,以下结合附图及具体实施例,对本申请进行进一步详细描述。应理解,此处所描述的具体实施例仅意在解释本申请,而不是限定本申请。对于本领域技术人员来说,本申请可以在不需要这些具体细节中的一些细节的情况下实施。下面对实施例的描述仅仅是为了通过示出本申请的示例来提供对本申请更好的理解。Features and exemplary embodiments of various aspects of the present application will be described in detail below. In order to make the purpose, technical solutions and advantages of the present application clearer, the present application will be described in further detail below with reference to the accompanying drawings and specific embodiments. It should be understood that the specific embodiments described here are only intended to explain the application, but not to limit the application. It will be apparent to one skilled in the art that the present application may be practiced without some of these specific details. The following description of embodiments is merely intended to provide a better understanding of the present application by illustrating examples thereof.

需要说明的是,在本文中,诸如第一和第二等之类的关系术语仅仅用来将一个实体或者操作与另一个实体或操作区分开来,而不一定要求或者暗示这些实体或操作之间存在任何这种实际的关系或者顺序。而且,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者设备所固有的要素。在没有更多限制的情况下,由语句“包括……”限定的要素,并不排除在包括所述要素的过程、方法、物品或者设备中还存在另外的相同要素。It should be noted that in this article, relational terms such as first and second are only used to distinguish one entity or operation from another entity or operation, and do not necessarily require or imply that these entities or operations are mutually exclusive. any such actual relationship or sequence exists between them. Furthermore, the terms "comprises," "comprises," or any other variations thereof are intended to cover a non-exclusive inclusion such that a process, method, article, or apparatus that includes a list of elements includes not only those elements, but also those not expressly listed other elements, or elements inherent to the process, method, article or equipment. Without further limitation, an element defined by the statement "comprising..." does not exclude the presence of additional identical elements in a process, method, article, or device that includes the stated element.

如背景技术所述,现有技术中通常有以下三个方案获取用户的宽带覆盖地址。方案一:利用用户手机号码,通过问卷调查得到客户地址,以问卷得出的地址与宽带资源管理系统匹配,从而得出用户地址与宽带覆盖地址的对应关系。方案二:根据用户手机入网登记的地址信息与后台宽带资源管理系统匹配,得出手机号与宽带覆盖地址对应关系。方案三:通过通话基站和移动数据使用基站判定用户家庭常驻区域,通过家庭常驻区域与后台宽带资源管理系统中存储的宽带覆盖地址匹配,从而得出该用户常驻区域对应的宽带覆盖地址,以便完成宽带安装业务。As mentioned in the background art, there are usually the following three solutions in the prior art to obtain the user's broadband coverage address. Option 1: Use the user's mobile phone number to obtain the customer's address through a questionnaire, and match the address obtained from the questionnaire with the broadband resource management system to obtain the corresponding relationship between the user's address and the broadband coverage address. Option 2: Match the address information registered by the user's mobile phone with the background broadband resource management system to obtain the corresponding relationship between the mobile phone number and the broadband coverage address. Option 3: Determine the user's home area through call base stations and mobile data usage base stations, and match the home area with the broadband coverage address stored in the background broadband resource management system to obtain the broadband coverage address corresponding to the user's home area. , in order to complete the broadband installation business.

然而,方案一中住址属于个人敏感信息,往往会因用户不配合、不愿意提供等原因无法获取,且对于同一地理位置的描述,常常存在个体地址描述差异,可能存在多种表述,例如“百货大楼”、“XX市XX路XX街XX号”以及“XX路新百大”等不同表达。另外,基于个人信息保护,也不排除用户提供错误的地址信息,因此,获取的信息不准确。方案二中入网登记的地址比较常见为证件地址、单位地址、临时地址等,与实际居住地址匹配率低,且基本不进行动态更新,与实际居住地差异较大,识别结果准确性较差。方案三中基站地址粒度过粗,部分乡镇区域基站覆盖半径达到几千米甚至几十千米,与宽带地址的细粒度不匹配,例如,通过基站进行定位的区域地址是X区-Y物业楼,而宽带地址是X区—Z街道—Q小区,导致识别结果准确性不足;基站地址命名规则(基本为基站安装位置或附近标志性位置)与宽带地址命名规则差异性大,无法通过基站地址建立宽带地址的对应关系。However, the address in Scheme 1 is personal sensitive information, which often cannot be obtained due to the user's uncooperation and unwillingness to provide it. Moreover, there are often differences in individual address descriptions for the same geographical location, and there may be multiple expressions, such as "department store" Building", "No. XX Street, XX Road, XX City" and "New Hundred Building on XX Road" and other different expressions. In addition, based on personal information protection, it is not ruled out that users provide incorrect address information, so the information obtained is inaccurate. In the second option, the addresses registered on the network are more commonly used as certificate addresses, work addresses, temporary addresses, etc. The matching rate with the actual residential address is low, and there is basically no dynamic update. It is quite different from the actual residential address, and the accuracy of the identification results is poor. In Solution 3, the base station address granularity is too coarse. The coverage radius of the base stations in some towns and villages reaches several kilometers or even tens of kilometers, which does not match the fine granularity of the broadband address. For example, the regional address for positioning through the base station is X District-Y Property Building. , and the broadband address is District Establish the corresponding relationship between broadband addresses.

为了解决上述问题,本申请实施例提供了一种信息确定方法、装置、设备及计算机存储介质,该信息确定方法可以通过获取第一对象的常驻位置信息,以及多个第二对象的常驻位置信息和宽带安装位置信息,第一对象为待安装宽带的对象,第二对象为已安装宽带的对象,这样,便于后续对待安装宽带的对象的宽带安装地址的预测。接着,计算每个第二对象的常驻位置信息与第一对象的常驻位置信息的距离,并按照距离的由近至远的顺序对第二对象进行排列,得到目标序列,进而获取目标序列中前k个距离对应的第二对象的宽带安装位置信息,其中,k为预设数值,从而确定前k个距离对应的第二对象的宽带安装位置信息中,数量最多的宽带安装位置信息为第一对象的目标安装位置信息。这样,可以利用k最近邻分类算法,通过已安装宽带的对象的宽带安装地址与常驻位置的对应关系,以及待安装宽带的对象宽带安装地址,匹配待安装宽带的对象的宽带安装地址,提高了匹配结果的准确性。下面首先对本申请实施例所提供的信息确定架构进行介绍。In order to solve the above problems, embodiments of the present application provide an information determination method, device, equipment and computer storage medium. The information determination method can obtain the resident location information of the first object and the resident location information of multiple second objects. For location information and broadband installation location information, the first object is the object to be installed with broadband, and the second object is the object that has installed broadband. This facilitates subsequent prediction of the broadband installation address of the object to be installed with broadband. Then, calculate the distance between the resident position information of each second object and the resident position information of the first object, and arrange the second objects in order from near to far to obtain the target sequence, and then obtain the target sequence Among the broadband installation location information of the second object corresponding to the first k distances, where k is a preset value, it is determined that among the broadband installation location information of the second object corresponding to the first k distances, the largest number of broadband installation location information is Target installation location information of the first object. In this way, the k-nearest neighbor classification algorithm can be used to match the broadband installation address of the object to be installed through the corresponding relationship between the broadband installation address of the object with broadband installed and the resident location, and the broadband installation address of the object to be installed with broadband, thereby improving the performance of the object. The accuracy of the matching results. The information determination architecture provided by the embodiment of this application is first introduced below.

本申请实施例提出的信息确定架构可以包括常驻位置计算模块、已知宽带覆盖计算模块、切片常驻位置与宽带覆盖分析模块、客户群宽带覆盖校准模块以及潜在客户宽带标准地址输出模块。The information determination architecture proposed in the embodiment of this application may include a resident location calculation module, a known broadband coverage calculation module, a slice resident location and broadband coverage analysis module, a customer group broadband coverage calibration module, and a potential customer broadband standard address output module.

常驻位置计算模块用于计算待安装宽带的对象和已安装宽带的对象的常驻位置,可以通过移动通信基站定位技术进行定位得到常驻位置,还可以基于距离测量和角度测量,利用几何三角关系计算被测对象的常驻位置。The resident position calculation module is used to calculate the resident position of objects to be installed with broadband and objects with installed broadband. The resident position can be obtained by positioning through mobile communication base station positioning technology. It can also use geometric triangulation based on distance measurement and angle measurement. The relationship calculates the resident position of the measured object.

已知宽带覆盖计算模块用于通过正则表达式确定已安装宽带的对象的宽带安装位置。宽带标准地址格式是:省、市、区、街道(镇、乡、办事处)、行政村(街道、路、巷)、小区、栋(幢、号楼、村民组)、单元、楼层(房间号)。通过正则表达式将已安装宽带的对象的宽带标准地址格式,保留至小区这一层级,过于详细的地址会成为模型计算的干扰数据,不利于后续模型对结果的预测。The known broadband coverage calculation module is used to determine the broadband installation location of objects that have broadband installed through regular expressions. The standard broadband address format is: province, city, district, street (town, township, office), administrative village (street, road, lane), community, building (building, building, village group), unit, floor (room) Number). The broadband standard address format of objects with installed broadband is retained to the community level through regular expressions. Overly detailed addresses will become interference data for model calculations, which is not conducive to subsequent model predictions.

切片常驻位置与宽带覆盖分析模块用于预测待安装宽带的对象的宽带安装位置,可以利用模型中的k最近邻(k-Nearest Neighbor,KNN)分类算法,通过已安装宽带的对象的常驻位置和宽带安装位置进行预测。The slice resident position and broadband coverage analysis module is used to predict the broadband installation position of objects to be installed with broadband. The k-Nearest Neighbor (KNN) classification algorithm in the model can be used to calculate the resident position of objects with installed broadband. location and broadband installation locations.

客户群宽带覆盖校准模块用于采用对象与地址相关资料文档对模型结果进行校准,利用分词、全文检索技术分别于上述预测得到的宽带安装位置进行匹配,如果匹配成功,则可认为该宽带安装位置是最准确的。The customer group broadband coverage calibration module is used to calibrate the model results using object and address related data documents, and uses word segmentation and full-text search technologies to match the broadband installation locations predicted above. If the match is successful, the broadband installation location can be considered is the most accurate.

潜在客户宽带标准地址输出模块用于程序定时自动调度,以文件接口的形式将宽带安装位置输出到各个业务子系统中使用。The potential customer broadband standard address output module is used for automatic program scheduling and outputs the broadband installation location in the form of a file interface to various business subsystems for use.

根据上述架构,下面结合图1对本申请实施例提供的信息确定方法进行详细说明。Based on the above architecture, the information determination method provided by the embodiment of the present application will be described in detail below with reference to Figure 1 .

图1示出了本申请一个实施例提供的信息确定方法的流程示意图。Figure 1 shows a schematic flowchart of an information determination method provided by an embodiment of the present application.

如图1所示,该信息确定方法具体可以包括如下步骤:As shown in Figure 1, the information determination method may specifically include the following steps:

S110,获取第一对象的常驻位置信息,以及多个第二对象的常驻位置信息和宽带安装位置信息。S110. Obtain the resident location information of the first object, as well as the resident location information and broadband installation location information of multiple second objects.

第一对象可以是待安装宽带的对象,第二对象可以是已安装宽带的对象。常驻位置信息可以是对象的停留时间最长的位置信息,也可以是对对象进行定位得到的次数最多的位置信息。宽带安装位置信息可以是宽带安装的地址,例如可以是宽带标准地址格式:省、市、区、街道(镇、乡、办事处)、行政村(街道、路、巷)、小区、栋(幢、号楼、村民组)、单元、楼层(房间号)。The first object may be an object for which broadband is to be installed, and the second object may be an object for which broadband has been installed. The resident location information may be the location information where the object stays for the longest time, or it may be the location information where the object is positioned the most times. The broadband installation location information may be the address of the broadband installation, for example, it may be in the broadband standard address format: province, city, district, street (town, township, office), administrative village (street, road, lane), community, building (building) , building number, villager group), unit, floor (room number).

S120,计算每个第二对象的常驻位置信息与第一对象的常驻位置信息的距离。S120: Calculate the distance between the resident location information of each second object and the resident location information of the first object.

常驻位置信息可以以经纬度的坐标表示,第一对象与第二对象的常驻位置信息的距离可以采用欧氏距离进行计算。The resident location information may be represented by coordinates of longitude and latitude, and the distance between the resident location information of the first object and the second object may be calculated using the Euclidean distance.

S130,按照距离的由近至远的顺序对第二对象进行排列,得到目标序列。S130: Arrange the second objects in order of distance from near to far to obtain a target sequence.

作为一个示例,计算得到已安装宽带用户A与待安装宽带用户的常驻位置的距离为1千米、已安装宽带用户B与待安装宽带用户的常驻位置的距离为500米、已安装宽带用户C与待安装宽带用户的常驻位置的距离为800米,以及已安装宽带用户D与待安装宽带用户的常驻位置的距离为600米。按照距离的由近至远的顺序对已安装宽带用户进行排列得到序列:用户B、用户D、用户C、用户A。As an example, it is calculated that the distance between the installed broadband user A and the resident location of the broadband user to be installed is 1 km, the distance between the installed broadband user B and the resident location of the broadband user to be installed is 500 meters, and the distance between the installed broadband user B and the resident location of the broadband user to be installed is 500 meters. The distance between user C and the permanent location of the broadband user to be installed is 800 meters, and the distance between the installed broadband user D and the permanent location of the broadband user to be installed is 600 meters. Arrange the installed broadband users in order of distance from near to far to obtain the sequence: user B, user D, user C, user A.

S140,获取目标序列中前k个距离对应的第二对象的宽带安装位置信息,其中,k为预设数值。S140: Obtain the broadband installation position information of the second object corresponding to the first k distances in the target sequence, where k is a preset value.

筛选出目标序列中前k个距离对应的第二对象,进而获取前k个距离对应的第二对象的宽带安装位置信息,其中,k值可以是用户根据需要和经验预先设置的数值,k值可以根据第二对象的数量变化进行变化。示例性地,选取的k值为3,目标序列中第二对象的顺序为对象B、对象D、对象C、对象A,则获取目标序列中前3个距离对应的对象B、对象D和对象C,进而获取对象B、对象D和对象C的宽带安装位置信息。Filter out the second objects corresponding to the first k distances in the target sequence, and then obtain the broadband installation location information of the second objects corresponding to the first k distances. The k value can be a value preset by the user based on needs and experience. The k value Can be changed according to the change in the number of second objects. For example, the selected k value is 3, and the order of the second object in the target sequence is object B, object D, object C, and object A. Then the object B, object D, and object corresponding to the first three distances in the target sequence are obtained. C, and then obtain the broadband installation location information of object B, object D and object C.

S150,确定前k个距离对应的第二对象的宽带安装位置信息中,数量最多的宽带安装位置信息为第一对象的目标安装位置信息。S150: Determine that among the broadband installation location information of the second object corresponding to the first k distances, the largest number of broadband installation location information is the target installation location information of the first object.

获取目标序列中前k个距离对应的第二对象的k个宽带安装位置信息,目标安装位置信息可以是k个宽带安装位置信息中数量最多的宽带安装位置信息。示例性地,选取的k值为3,获取目标序列中前3个距离对应的对象B、对象D和对象C,对象B、对象D和对象C的宽带安装位置信息分别为a、b、a,则第一对象的目标安装位置信息为数量较多的a。Obtain k broadband installation location information of the second object corresponding to the first k distances in the target sequence. The target installation location information may be the broadband installation location information with the largest number among the k broadband installation location information. For example, the selected k value is 3, and the object B, object D, and object C corresponding to the first three distances in the target sequence are obtained. The broadband installation location information of object B, object D, and object C are a, b, and a respectively. , then the target installation location information of the first object is a with a larger number.

在一些实施例中,基于预设模型中的k最近邻分类算法,根据第一对象的常驻位置信息,多个第二对象的常驻位置信息和宽带安装位置信息,确定第一对象的目标安装位置信息。In some embodiments, based on the k-nearest neighbor classification algorithm in the preset model, the target of the first object is determined according to the resident location information of the first object, the resident location information of a plurality of second objects and the broadband installation location information. Installation location information.

k最近邻分类算法的实现过程如下:首先定义待安装宽带的对象为待标记的数据样本x_test,已安装宽带的对象为已标记的数据集x_train。选取对象的常驻位置(经纬度)作为特征x,宽带安装位置作为标签y。然后,遍历已标记数据集x_train中的样本,计算每个样本与待标记点的距离,并把距离保存在数组d_array中。对d_array数组进行排序,取距离最近的k个点,记为x_knn。接着,在x_knn中统计每个类别(宽带安装位置)的个数。待标记样本的类别,即为在x_knn中样本个数最多的类别。The implementation process of the k-nearest neighbor classification algorithm is as follows: First, define the object to be installed with broadband as the data sample to be labeled x_test, and the object with installed broadband as the labeled data set x_train. Select the resident location (latitude and longitude) of the object as the feature x, and the broadband installation location as the label y. Then, traverse the samples in the labeled data set x_train, calculate the distance between each sample and the point to be labeled, and save the distance in the array d_array. Sort the d_array array and take the k closest points, recorded as x_knn. Next, count the number of each category (broadband installation location) in x_knn. The category of the sample to be labeled is the category with the largest number of samples in x_knn.

k值可以利用k折交叉验证和带交叉验证的学习曲线进行选取。例如,将原始数据(第二对象的常驻位置信息和宽带安装位置信息)随机分成k份,每次选择k-1份作为训练集,剩余的1份作为测试集。交叉验证重复k次,取k次准确率的平均值作为最终预设模型的评价指标。训练集和测试集的划分会干扰模型的结果,因此用交叉验证k次的结果求出的均值,是对模型效果的一个更好的度量。交叉验证用测试集计算得分score,测试集中的数据也影响了参数的选择,有可能会导致模型对训练数据和测试数据的过拟合。带交叉验证的学习曲线可以检测模型是否过拟合,学习曲线的作用可以是观察数据集的大小对于模型性能的影响,选取学习曲线准确率高且方差相对较小的点,这样的点泛化能力是最强的。在“交叉验证+学习曲线”的作用下,选出的超参数(k值)能够保证更好的泛化能力。The k value can be selected using k-fold cross-validation and learning curve with cross-validation. For example, the original data (the second object's resident location information and broadband installation location information) are randomly divided into k parts, k-1 parts are selected as the training set each time, and the remaining 1 part is used as the test set. Cross-validation is repeated k times, and the average of the k times' accuracy is taken as the evaluation index of the final preset model. The division of the training set and the test set will interfere with the results of the model, so the mean calculated from the results of k times of cross-validation is a better measure of the model's effect. Cross-validation uses the test set to calculate the score. The data in the test set also affects the selection of parameters, which may cause the model to overfit the training data and test data. The learning curve with cross-validation can detect whether the model is overfitting. The role of the learning curve can be to observe the impact of the size of the data set on the model performance, and select points with high accuracy and relatively small variance in the learning curve. Such points can generalize. Ability is the strongest. Under the influence of "cross-validation + learning curve", the selected hyperparameters (k values) can ensure better generalization ability.

本申请实施例中,通过获取第一对象的常驻位置信息,以及多个第二对象的常驻位置信息和宽带安装位置信息,第一对象为待安装宽带的对象,第二对象为已安装宽带的对象,这样,便于后续对待安装宽带的对象的宽带安装地址的预测。接着,计算每个第二对象的常驻位置信息与第一对象的常驻位置信息的距离,并按照距离的由近至远的顺序对第二对象进行排列,得到目标序列,进而获取目标序列中前k个距离对应的第二对象的宽带安装位置信息,其中,k为预设数值,从而确定前k个距离对应的第二对象的宽带安装位置信息中,数量最多的宽带安装位置信息为目标安装位置信息。这样,可以利用k最近邻分类算法,通过已安装宽带的对象的宽带安装地址与常驻位置的对应关系,以及待安装宽带的对象宽带安装地址,匹配待安装宽带的对象的宽带安装地址,提高了匹配结果的准确性。In the embodiment of the present application, by obtaining the resident location information of the first object, as well as the resident location information and broadband installation location information of multiple second objects, the first object is the object to be installed with broadband, and the second object is the installed broadband object. Broadband objects, in this way, facilitate subsequent prediction of the broadband installation address of the object to be installed with broadband. Then, calculate the distance between the resident position information of each second object and the resident position information of the first object, and arrange the second objects in order from near to far to obtain the target sequence, and then obtain the target sequence Among the broadband installation location information of the second object corresponding to the first k distances, where k is a preset value, it is determined that among the broadband installation location information of the second object corresponding to the first k distances, the largest number of broadband installation location information is Target installation location information. In this way, the k-nearest neighbor classification algorithm can be used to match the broadband installation address of the object to be installed through the corresponding relationship between the broadband installation address of the object with broadband installed and the resident location, and the broadband installation address of the object to be installed with broadband, thereby improving the performance of the object. The accuracy of the matching results.

在一些实施例中,S110:获取第一对象的常驻位置信息,具体可以包括:In some embodiments, S110: Obtain the resident location information of the first object, which may specifically include:

获取对第一对象以预设频率定位得到的多个位置信息,以及在多个位置停留的时间;Obtain multiple location information obtained by locating the first object at a preset frequency, and the time spent at multiple locations;

确定多个位置信息中,在多个位置停留的时间最长和/或次数最多的位置信息为第一对象的常驻位置信息。It is determined that among the plurality of position information, the position information that stays at the plurality of positions for the longest time and/or the most times is the resident position information of the first object.

对第一对象以预设频率进行定位,得到多个位置信息,其中,预设频率可以是用户预先设置的频率,例如以1小时1次的频率对第一对象进行定位,得到多个位置信息,并获取在多个位置停留的时间。第一对象的常驻位置信息可以是多个位置信息中次数最多的位置信息,或者多个位置信息中在多个位置停留的时间最长的位置信息,也可以是多个位置信息中次数最多且在多个位置停留的时间最长的位置信息。另外,第二对象的常驻位置信息与第一对象的常驻位置信息的确定方式相同。The first object is positioned at a preset frequency to obtain a plurality of location information. The preset frequency may be a frequency preset by the user. For example, the first object is positioned at a frequency of once an hour to obtain a plurality of location information. , and get the time spent at multiple locations. The permanent location information of the first object may be the location information with the largest number of times among the multiple location information, or the location information with the longest stay time at multiple locations among the multiple location information, or it may be the location information with the most times among the multiple location information. And the location information that stayed the longest in multiple locations. In addition, the resident location information of the second object is determined in the same manner as the resident location information of the first object.

在一些实施例中,确定多个位置信息中,预设时间段内在多个位置停留的时间最长和/或次数最多的位置信息为第一对象的常驻位置信息。In some embodiments, it is determined that among the plurality of location information, the location information that stays at multiple locations for the longest time and/or the most times within the preset time period is the resident location information of the first object.

预设时间段可以是用户预先设置的夜间时间段。The preset time period may be a night time period preset by the user.

作为一个示例,在一段时间内,以1小时1次的频率对已安装宽带和待安装宽带的用户进行定位,得到多个位置信息,选出白天时间段数量最多和时间最长的常驻位置,以及夜间时间段数量最多和时间最长的常驻位置。白天时间段的常驻位置认为是工作地,夜间时间段的常驻位置认为是居住地,将夜间时间段的常驻位置(居住地)作为用户最终的常驻位置。As an example, within a period of time, users with installed broadband and to-be-installed broadband are positioned once an hour, multiple location information is obtained, and the resident location with the largest number and longest duration during the day is selected. , as well as the most numerous and longest resident locations during the night time period. The permanent location during the day time period is considered to be the work place, the permanent location during the night time period is considered to be the residence, and the permanent location (residence) during the night time period is regarded as the user's final permanent location.

本申请实施例中,通过获取对第一对象以预设频率定位得到的多个位置信息,以及在多个位置停留的时间,并确定多个位置信息中,在多个位置停留的时间最长和/或次数最多的位置信息为第一对象的常驻位置信息。这样,可以获取准确的第一对象的常驻位置信息,便于后续匹配宽带安装位置。In the embodiment of the present application, multiple location information obtained by locating the first object at a preset frequency and the time spent at the multiple locations are obtained, and it is determined that among the multiple location information, the longest time spent at the multiple locations is And/or the location information with the largest number of times is the resident location information of the first object. In this way, accurate resident location information of the first object can be obtained to facilitate subsequent matching of the broadband installation location.

在一些实施例中,S110:获取多个第二对象的宽带安装位置信息,具体可以包括:In some embodiments, S110: Obtain broadband installation location information of multiple second objects, which may specifically include:

获取多个第二对象的宽带安装的地址信息,地址信息可以包括第一地址序列的信息和第二地址序列的信息;Obtain the address information of the broadband installation of multiple second objects. The address information may include information of the first address sequence and information of the second address sequence;

通过正则表达式过滤地址信息中第二地址序列的信息;Filter the information of the second address sequence in the address information through regular expressions;

确定地址信息中第一地址序列的信息为多个第二对象的宽带安装位置信息。The information of the first address sequence in the address information is determined to be broadband installation location information of a plurality of second objects.

第一地址序列和第二地址序列可以是第二对象的宽带安装的地址中的任意序列。例如,第二对象的宽带安装的地址信息例如可以包括省、市、区、街道(镇、乡、办事处)、行政村(街道、路、巷)、小区、栋(幢、号楼、村民组)、单元、楼层(房间号)的信息,第一地址序列的信息可以包括省、市、区、街道(镇、乡、办事处)、行政村(街道、路、巷)、小区的信息,第二地址序列的信息可以包括栋(幢、号楼、村民组)、单元、楼层(房间号)的信息。通过正则表达式从第二对象的宽带安装的地址信息的字符串中获取第一地址序列的信息,将第二地址序列的信息进行过滤,即将宽带安装的地址保留到小区这一层级。The first address sequence and the second address sequence may be any sequence of addresses of the broadband installation of the second object. For example, the address information of the broadband installation of the second object may include province, city, district, street (town, township, office), administrative village (street, road, lane), community, building (building, building number, villager Group), unit, floor (room number) information, the information of the first address sequence may include information of province, city, district, street (town, township, office), administrative village (street, road, lane), community , the information of the second address sequence may include information of the building (building, building number, villager group), unit, and floor (room number). The information of the first address sequence is obtained from the string of address information of the broadband installation of the second object through a regular expression, and the information of the second address sequence is filtered, that is, the address of the broadband installation is retained at the community level.

在一些实施例中,第一地址序列的信息可以包括省、市、区、街道、行政村、小区的序列的信息;第二地址序列的信息可以包括栋、单元以及楼层的序列的信息。In some embodiments, the information on the first address sequence may include information on the sequence of provinces, cities, districts, streets, administrative villages, and communities; the information on the second address sequence may include information on the sequence of buildings, units, and floors.

作为一个示例,某一已安装宽带的宽带安装的地址为“XX省XX市XX街道XX行政村XX小区XX栋X单元X楼层”,通过正则表达式将字符串“XX省XX市XX街道XX行政村XX小区XX栋X单元X楼层”中的“XX栋X单元X楼层”过滤,得到过滤后的位置“XX省XX市XX街道XX行政村XX小区”,后续基于该位置与常驻位置进行匹配。As an example, the address of a certain installed broadband is "XX administrative village XX administrative village XX community XX building XX unit X floor", and the string "XX province XX city XX street XX" is changed using a regular expression. Filter "XX Building X Unit X Floor" in "XX Building Make a match.

本申请实施例中,通过获取多个第二对象的宽带安装的地址信息,地址信息可以包括第一地址序列的信息和第二地址序列的信息,并通过正则表达式过滤地址信息中第二地址序列的信息,进而确定地址信息中第一地址序列的信息为多个第二对象的宽带安装位置信息。这样,可以过滤掉过于详细的楼层等信息的干扰数据,提高宽带安装位置与常驻位置进行匹配的准确性。In the embodiment of the present application, by obtaining the address information of the broadband installation of multiple second objects, the address information may include information of the first address sequence and information of the second address sequence, and the second address in the address information is filtered through a regular expression. sequence information, and further determines that the information of the first address sequence in the address information is the broadband installation location information of a plurality of second objects. In this way, interference data from overly detailed information such as floors can be filtered out, and the accuracy of matching the broadband installation location with the permanent location can be improved.

在一些实施例中,在S120:计算每个第二对象的常驻位置信息与第一对象的常驻位置信息的距离之后,该信息确定方法还可以包括:In some embodiments, after S120: calculating the distance between the resident location information of each second object and the resident location information of the first object, the information determining method may further include:

确定小于预设距离阈值的距离对应的第二对象的宽带安装位置信息中,数量最多的宽带安装位置信息为目标安装位置信息。It is determined that among the broadband installation location information of the second object corresponding to a distance smaller than the preset distance threshold, the largest number of broadband installation location information is the target installation location information.

预设距离阈值可以是用户预先设置的距离,除了确定前k个距离对应的第二对象的宽带安装位置信息中,数量最多的宽带安装位置信息为第一对象的目标安装位置信息,还可以确定小于预设距离阈值的距离对应的第二对象的宽带安装位置信息中,数量最多的宽带安装位置信息为目标安装位置信息。The preset distance threshold can be a distance preset by the user. In addition to determining that among the broadband installation location information of the second object corresponding to the first k distances, the largest number of broadband installation location information is the target installation location information of the first object, it can also be determined Among the broadband installation location information of the second object corresponding to a distance smaller than the preset distance threshold, the largest number of broadband installation location information is the target installation location information.

作为一个示例,当已安装宽带的用户的宽带安装位置的数据分布不均匀时,例如,已安装宽带用户A、B、C、D以及E与待安装宽带用户的常驻位置的距离分别为500米、600米、800米、5万米以及8万米,若利用k最近邻分类算法获取前k个距离,k为4,则用户D的常驻位置的距离与用户A、B、C相差较远,此时获取小于预设距离阈值1千米的距离对应的用户A、B以及C的宽带安装位置,进而确定用户A、B以及C的宽带安装位置中数量最多的位置为待安装宽带用户的宽带安装位置信息,可以取得更好的效果。As an example, when the data distribution of the broadband installation locations of users who have already installed broadband is uneven, for example, the distances between installed broadband users A, B, C, D, and E and the resident locations of the broadband users to be installed are 500 respectively. meters, 600 meters, 800 meters, 50,000 meters and 80,000 meters. If the k nearest neighbor classification algorithm is used to obtain the first k distances, and k is 4, then the distance between user D’s resident location and users A, B, and C At this time, the broadband installation locations of users A, B, and C corresponding to a distance less than the preset distance threshold of 1 km are obtained, and then the location with the largest number of broadband installation locations of users A, B, and C is determined to be the broadband to be installed User's broadband installation location information can achieve better results.

本申请实施例中,通过确定小于预设距离阈值的距离对应的第二对象的宽带安装位置信息中,数量最多的宽带安装位置信息为目标安装位置信息,可以在已安装宽带的用户的宽带安装位置的数据分布不均匀时,提高确定目标安装位置信息的准确性。In the embodiment of the present application, by determining the broadband installation location information of the second object corresponding to a distance smaller than the preset distance threshold, the broadband installation location information with the largest number is the target installation location information, and the broadband installation location of the user who has installed broadband can be determined. When the location data is unevenly distributed, the accuracy of determining the target installation location information is improved.

在一些实施例中,该信息确定方法还可以包括:In some embodiments, the information determining method may also include:

获取第一对象的与地址相关的文档;Obtain the address-related document of the first object;

对目标安装位置信息进行分词处理,得到分词后的多个单词;Perform word segmentation processing on the target installation location information to obtain multiple words after word segmentation;

根据多个单词与单词在文档中的位置之间的映射关系,获取多个单词在文档中出现的次数;According to the mapping relationship between multiple words and the word's position in the document, obtain the number of times multiple words appear in the document;

在次数超过预设阈值的情况下,将目标安装位置信息输出。When the number of times exceeds the preset threshold, the target installation location information is output.

第一对象的与地址相关的文档例如可以是第一对象的证件地址、集团&家庭集群网信息、集团关系、家庭关系等地址相关的资料文档。通过分词器对目标安装位置信息进行分词处理,得到多个单词,分词器可以是IK分词器,IK分词器采用特有的“正向迭代最细粒度切分算法”,采用多子处理器分析模式,支持英文字母、数字、中文词汇(姓名、地名)等分词处理。分词计算流程包括加载字典、预处理、分词、歧义处理以及整理输出。The address-related document of the first object may be, for example, the first object's ID address, group & family cluster network information, group relationship, family relationship and other address-related data documents. The target installation location information is segmented through a word segmenter to obtain multiple words. The word segmenter can be an IK word segmenter. The IK word segmenter uses a unique "forward iterative finest-grained segmentation algorithm" and adopts a multi-subprocessor analysis mode. , supports word segmentation processing of English letters, numbers, Chinese vocabulary (names, place names), etc. The word segmentation calculation process includes loading dictionary, preprocessing, word segmentation, ambiguity processing and sorting output.

多个单词与单词在文档中的位置之间的映射关系可以是采用倒排索引进行建立和存储,以便于进行全文检索,获取多个单词在文档中出现的次数。全文检索是指计算机索引程序通过扫描文档中的每一个词,对每一个词建立一个索引,指明该词在文章中出现的次数和位置,当用户查询时,检索程序就根据事先建立的索引进行查找,并将查找的结果反馈给用户的检索方式。在次数超过预设阈值的情况下,将目标安装位置信息输出,预设阈值可以是用户预先设置的阈值。The mapping relationship between multiple words and the word's position in the document can be established and stored using an inverted index to facilitate full-text retrieval and obtain the number of times multiple words appear in the document. Full-text retrieval means that the computer indexing program scans each word in the document and builds an index for each word, indicating the number and location of the word in the article. When the user queries, the retrieval program will proceed based on the pre-established index. Search method and feed the search results back to the user. When the number of times exceeds a preset threshold, the target installation location information is output. The preset threshold may be a threshold preset by the user.

作为一个示例,在预测得到待安装宽带的用户的安装位置之后,由于超参数k选取不同值时预测的结果可能不一样,因此,需要对预测得到的宽带安装位置进行校准。首先,对预测得到的待安装宽带的用户的安装位置进行分词处理,得到分词后的多个单词,并通过全文检索引擎工具包Lucene,采用倒排索引存储单词与单词在文档中所在位置之间的映射。通过倒排索引,可以根据单词快速获取包含这个单词的文档列表,从而获取单词在文档中出现的次数。若每个单词在文档中出现的次数都超过预设阈值,则认为预测得到的待安装宽带的用户的宽带安装位置是最准确的,将该安装位置输出到各个业务子系统中使用。As an example, after predicting the installation locations of users to be installed with broadband, the predicted results may be different when the hyperparameter k is selected with different values. Therefore, the predicted broadband installation locations need to be calibrated. First, perform word segmentation processing on the predicted installation locations of users who want to install broadband, and obtain multiple words after word segmentation. Through the full-text search engine toolkit Lucene, an inverted index is used to store the relationship between the word and its position in the document. of mapping. Through the inverted index, you can quickly obtain a list of documents containing a word based on the word, thereby obtaining the number of times the word appears in the document. If the number of occurrences of each word in the document exceeds the preset threshold, the predicted broadband installation location of the user to be installed is considered to be the most accurate, and the installation location is output to each business subsystem for use.

本申请实施例中,通过获取第一对象的与地址相关的文档,并对目标安装位置信息进行分词处理,得到分词后的多个单词,进而根据多个单词与单词在文档中的位置之间的映射关系,获取多个单词在文档中出现的次数,接着,在次数超过预设阈值的情况下,将目标安装位置信息输出。这样,可以采用第二对象的其他资料文档对匹配结果进行校准,提高输出结果的准确性,In the embodiment of the present application, by obtaining the address-related document of the first object and performing word segmentation processing on the target installation location information, multiple words after word segmentation are obtained, and then the relationship between the multiple words and the position of the word in the document is obtained. Mapping relationship, obtain the number of times multiple words appear in the document, and then, when the number exceeds the preset threshold, output the target installation location information. In this way, other data documents of the second object can be used to calibrate the matching results to improve the accuracy of the output results.

图2是根据一示例性实施例示出的一种信息确定装置200的结构示意图。FIG. 2 is a schematic structural diagram of an information determining device 200 according to an exemplary embodiment.

如图2所示,该信息确定装置200可以包括:As shown in Figure 2, the information determining device 200 may include:

获取模块201,用于获取第一对象的常驻位置信息,以及多个第二对象的常驻位置信息和宽带安装位置信息,第一对象为待安装宽带的对象,第二对象为已安装宽带的对象;The acquisition module 201 is used to obtain the resident location information of the first object, as well as the resident location information and broadband installation location information of a plurality of second objects. The first object is the object to be installed with broadband, and the second object is the installed broadband. Object;

计算模块202,用于计算每个第二对象的常驻位置信息与第一对象的常驻位置信息的距离;The calculation module 202 is used to calculate the distance between the resident position information of each second object and the resident position information of the first object;

排列模块203,用于按照距离的由近至远的顺序对第二对象进行排列,得到目标序列;The arrangement module 203 is used to arrange the second objects in the order of distance from near to far to obtain the target sequence;

获取模块201,还用于获取目标序列中前k个距离对应的第二对象的宽带安装位置信息,其中,k为预设数值;The acquisition module 201 is also used to acquire the broadband installation position information of the second object corresponding to the first k distances in the target sequence, where k is a preset value;

确定模块204,用于确定前k个距离对应的第二对象的宽带安装位置信息中,数量最多的宽带安装位置信息为目标安装位置信息。The determination module 204 is used to determine that among the broadband installation location information of the second object corresponding to the first k distances, the largest number of broadband installation location information is the target installation location information.

在一种实施方式中,获取模块201,还用于获取对第一对象以预设频率定位得到的多个位置信息,以及在多个位置停留的时间;In one implementation, the acquisition module 201 is also used to acquire multiple location information obtained by locating the first object at a preset frequency, as well as the time spent at multiple locations;

确定模块204,还用于确定多个位置信息中,在多个位置停留的时间最长和/或次数最多的位置信息为第一对象的常驻位置信息。The determination module 204 is also configured to determine, among the plurality of position information, the position information that has stayed at the plurality of positions for the longest time and/or the most times as the resident position information of the first object.

在一种实施方式中,该信息确定装置200还可以包括过滤模块;In one implementation, the information determining device 200 may also include a filtering module;

获取模块201,还用于获取多个第二对象的宽带安装的地址信息,地址信息包括第一地址序列的信息和第二地址序列的信息;The acquisition module 201 is also used to acquire the address information of the broadband installation of multiple second objects. The address information includes the information of the first address sequence and the information of the second address sequence;

过滤模块,用于通过正则表达式过滤地址信息中第二地址序列的信息;A filtering module used to filter the information of the second address sequence in the address information through regular expressions;

确定模块204,还用于确定地址信息中第一地址序列的信息为多个第二对象的宽带安装位置信息。The determining module 204 is also configured to determine that the information of the first address sequence in the address information is the broadband installation location information of the plurality of second objects.

在一种实施方式中,第一地址序列的信息包括省、市、区、街道、行政村、小区的序列的信息;第二地址序列的信息包括栋、单元以及楼层的序列的信息。In one implementation, the information on the first address sequence includes information on the sequence of provinces, cities, districts, streets, administrative villages, and communities; the information on the second address sequence includes information on the sequence of buildings, units, and floors.

在一种实施方式中,确定模块204,还用于在计算每个第二对象的常驻位置信息与第一对象的常驻位置信息的距离之后,确定小于预设距离阈值的距离对应的第二对象的宽带安装位置信息中,数量最多的宽带安装位置信息为目标安装位置信息。In one implementation, the determination module 204 is further configured to determine, after calculating the distance between the resident location information of each second object and the resident location information of the first object, the third location corresponding to the distance less than the preset distance threshold. Among the broadband installation location information of the two objects, the broadband installation location information with the largest amount is the target installation location information.

在一种实施方式中,该信息确定装置200还可以包括处理模块和输出模块;In one implementation, the information determining device 200 may also include a processing module and an output module;

获取模块201,还用于获取第一对象的与地址相关的文档;The acquisition module 201 is also used to acquire the address-related documents of the first object;

处理模块,用于对目标安装位置信息进行分词处理,得到分词后的多个单词;The processing module is used to segment the target installation location information and obtain multiple words after segmentation;

获取模块201,还用于根据多个单词与单词在文档中的位置之间的映射关系,获取多个单词在文档中出现的次数;The acquisition module 201 is also used to obtain the number of times multiple words appear in the document based on the mapping relationship between the multiple words and the position of the word in the document;

输出模块,用于在次数超过预设阈值的情况下,将目标安装位置信息输出。The output module is used to output the target installation location information when the number of times exceeds the preset threshold.

由此,通过获取第一对象的常驻位置信息,以及多个第二对象的常驻位置信息和宽带安装位置信息,第一对象为待安装宽带的对象,第二对象为已安装宽带的对象,这样,便于后续对待安装宽带的对象的宽带安装地址的预测。接着,计算每个第二对象的常驻位置信息与第一对象的常驻位置信息的距离,并按照距离的由近至远的顺序对第二对象进行排列,得到目标序列,进而获取目标序列中前k个距离对应的第二对象的宽带安装位置信息,其中,k为预设数值,从而确定前k个距离对应的第二对象的宽带安装位置信息中,数量最多的宽带安装位置信息为目标安装位置信息。这样,可以利用k最近邻分类算法,通过已安装宽带的对象的宽带安装地址与常驻位置的对应关系,以及待安装宽带的对象宽带安装地址,匹配待安装宽带的对象的宽带安装地址,提高了匹配结果的准确性。Thus, by obtaining the resident location information of the first object, as well as the resident location information and broadband installation location information of a plurality of second objects, the first object is the object to be installed with broadband, and the second object is the object with broadband installed. , in this way, it is convenient to predict the broadband installation address of the object to be installed with broadband. Then, calculate the distance between the resident position information of each second object and the resident position information of the first object, and arrange the second objects in order from near to far to obtain the target sequence, and then obtain the target sequence Among the broadband installation location information of the second object corresponding to the first k distances, where k is a preset value, it is determined that among the broadband installation location information of the second object corresponding to the first k distances, the largest number of broadband installation location information is Target installation location information. In this way, the k-nearest neighbor classification algorithm can be used to match the broadband installation address of the object to be installed through the corresponding relationship between the broadband installation address of the object with broadband installed and the resident location, and the broadband installation address of the object to be installed with broadband, thereby improving the performance of the object. The accuracy of the matching results.

图3示出了本申请实施例提供的电子的硬件结构示意图。Figure 3 shows a schematic diagram of the electronic hardware structure provided by the embodiment of the present application.

在电子设备可以包括处理器301以及存储有计算机程序指令的存储器302。The electronic device may include a processor 301 and a memory 302 storing computer program instructions.

具体地,上述处理器301可以包括中央处理器(CPU),或者特定集成电路(Application Specific Integrated Circuit,ASIC),或者可以被配置成实施本申请实施例的一个或多个集成电路。Specifically, the above-mentioned processor 301 may include a central processing unit (CPU), or an application specific integrated circuit (Application Specific Integrated Circuit, ASIC), or may be configured to implement one or more integrated circuits according to the embodiments of the present application.

存储器302可以包括用于数据或指令的大容量存储器。举例来说而非限制,存储器302可包括硬盘驱动器(Hard Disk Drive,HDD)、软盘驱动器、闪存、光盘、磁光盘、磁带或通用串行总线(Universal Serial Bus,USB)驱动器或者两个或更多个以上这些的组合。在合适的情况下,存储器302可包括可移除或不可移除(或固定)的介质。在合适的情况下,存储器302可在综合网关容灾设备的内部或外部。在特定实施例中,存储器302是非易失性固态存储器。Memory 302 may include bulk storage for data or instructions. By way of example and not limitation, the memory 302 may include a Hard Disk Drive (HDD), a floppy disk drive, flash memory, an optical disk, a magneto-optical disk, a magnetic tape, or a Universal Serial Bus (USB) drive or two or more A combination of many of the above. Memory 302 may include removable or non-removable (or fixed) media, where appropriate. Where appropriate, the memory 302 may be internal or external to the integrated gateway disaster recovery device. In certain embodiments, memory 302 is non-volatile solid-state memory.

存储器可包括只读存储器(ROM),随机存取存储器(RAM),磁盘存储介质设备,光存储介质设备,闪存设备,电气、光学或其他物理/有形的存储器存储设备。因此,通常,存储器包括一个或多个编码有包括计算机可执行指令的软件的有形(非暂态)计算机可读存储介质(例如,存储器设备),并且当该软件被执行(例如,由一个或多个处理器)时,其可操作来执行参考根据本公开的一方面的方法所描述的操作。Memory may include read only memory (ROM), random access memory (RAM), magnetic disk storage media devices, optical storage media devices, flash memory devices, electrical, optical or other physical/tangible memory storage devices. Thus, generally, memory includes one or more tangible (non-transitory) computer-readable storage media (e.g., memory devices) encoded with software including computer-executable instructions, and when the software is executed (e.g., by one or multiple processors) operable to perform the operations described with reference to a method according to an aspect of the present disclosure.

处理器301通过读取并执行存储器302中存储的计算机程序指令,以实现上述实施例中的任意一种信息确定方法。The processor 301 reads and executes the computer program instructions stored in the memory 302 to implement any of the information determination methods in the above embodiments.

在一个示例中,电子设备还可包括通信接口303和总线310。其中,如图3所示,处理器301、存储器302、通信接口303通过总线310连接并完成相互间的通信。In one example, the electronic device may also include a communication interface 303 and a bus 310. Among them, as shown in Figure 3, the processor 301, the memory 302, and the communication interface 303 are connected through the bus 310 and complete communication with each other.

通信接口303,主要用于实现本申请实施例中各模块、装置、单元和/或设备之间的通信。The communication interface 303 is mainly used to implement communication between modules, devices, units and/or equipment in the embodiments of this application.

总线310包括硬件、软件或两者,将信息确定设备的部件彼此耦接在一起。举例来说而非限制,总线可包括加速图形端口(AGP)或其他图形总线、增强工业标准架构(EISA)总线、前端总线(FSB)、超传输(HT)互连、工业标准架构(ISA)总线、无限带宽互连、低引脚数(LPC)总线、存储器总线、微信道架构(MCA)总线、外围组件互连(PCI)总线、PCI-Express(PCI-X)总线、串行高级技术附件(SATA)总线、视频电子标准协会局部(VLB)总线或其他合适的总线或者两个或更多个以上这些的组合。在合适的情况下,总线310可包括一个或多个总线。尽管本申请实施例描述和示出了特定的总线,但本申请考虑任何合适的总线或互连。Bus 310 includes hardware, software, or both, coupling the components of the information-determining device to one another. By way of example, and not limitation, the bus may include Accelerated Graphics Port (AGP) or other graphics bus, Enhanced Industry Standard Architecture (EISA) bus, Front Side Bus (FSB), HyperTransport (HT) interconnect, Industry Standard Architecture (ISA) Bus, Infinite Bandwidth Interconnect, Low Pin Count (LPC) Bus, Memory Bus, Micro Channel Architecture (MCA) Bus, Peripheral Component Interconnect (PCI) Bus, PCI-Express (PCI-X) Bus, Serial Advanced Technology Attachment (SATA) bus, Video Electronics Standards Association Local (VLB) bus or other suitable bus or a combination of two or more of these. Where appropriate, bus 310 may include one or more buses. Although the embodiments of this application describe and illustrate a specific bus, this application contemplates any suitable bus or interconnection.

该电子设备可以基于获取第一对象的常驻位置信息,以及多个第二对象的常驻位置信息和宽带安装位置信息执行本申请实施例中的信息确定方法,从而实现结合图1描述的信息确定方法。The electronic device can perform the information determination method in the embodiment of the present application based on obtaining the resident location information of the first object, as well as the resident location information and broadband installation location information of multiple second objects, thereby realizing the information described in conjunction with Figure 1 Determine the method.

另外,结合上述实施例中的信息确定方法,本申请实施例可提供一种计算机存储介质来实现。该计算机存储介质上存储有计算机程序指令;该计算机程序指令被处理器执行时实现上述实施例中的任意一种信息确定方法。In addition, combined with the information determination method in the above embodiment, the embodiment of the present application can be implemented by providing a computer storage medium. The computer storage medium stores computer program instructions; when the computer program instructions are executed by the processor, any one of the information determination methods in the above embodiments is implemented.

需要明确的是,本申请并不局限于上文所描述并在图中示出的特定配置和处理。为了简明起见,这里省略了对已知方法的详细描述。在上述实施例中,描述和示出了若干具体的步骤作为示例。但是,本申请的方法过程并不限于所描述和示出的具体步骤,本领域的技术人员可以在领会本申请的精神后,作出各种改变、修改和添加,或者改变步骤之间的顺序。To be clear, this application is not limited to the specific configurations and processes described above and illustrated in the figures. For the sake of brevity, detailed descriptions of known methods are omitted here. In the above embodiments, several specific steps are described and shown as examples. However, the method process of the present application is not limited to the specific steps described and shown. Those skilled in the art can make various changes, modifications and additions, or change the order between steps after understanding the spirit of the present application.

以上所述的结构框图中所示的功能块可以实现为硬件、软件、固件或者它们的组合。当以硬件方式实现时,其可以例如是电子电路、专用集成电路(ASIC)、适当的固件、插件、功能卡等等。当以软件方式实现时,本申请的元素是被用于执行所需任务的程序或者代码段。程序或者代码段可以存储在机器可读介质中,或者通过载波中携带的数据信号在传输介质或者通信链路上传送。“机器可读介质”可以包括能够存储或传输信息的任何介质。机器可读介质的例子包括电子电路、半导体存储器设备、ROM、闪存、可擦除ROM(EROM)、软盘、CD-ROM、光盘、硬盘、光纤介质、射频(RF)链路,等等。代码段可以经由诸如因特网、内联网等的计算机网络被下载。The functional blocks shown in the above structural block diagram can be implemented as hardware, software, firmware or a combination thereof. When implemented in hardware, it may be, for example, an electronic circuit, an application specific integrated circuit (ASIC), appropriate firmware, a plug-in, a function card, or the like. When implemented in software, elements of the application are programs or code segments that are used to perform the required tasks. The program or code segments may be stored in a machine-readable medium or transmitted over a transmission medium or communications link via a data signal carried in a carrier wave. "Machine-readable medium" may include any medium capable of storing or transmitting information. Examples of machine-readable media include electronic circuits, semiconductor memory devices, ROM, flash memory, erasable ROM (EROM), floppy disks, CD-ROMs, optical disks, hard disks, fiber optic media, radio frequency (RF) links, and the like. Code segments may be downloaded via computer networks such as the Internet, intranets, and the like.

还需要说明的是,本申请中提及的示例性实施例,基于一系列的步骤或者装置描述一些方法或系统。但是,本申请不局限于上述步骤的顺序,也就是说,可以按照实施例中提及的顺序执行步骤,也可以不同于实施例中的顺序,或者若干步骤同时执行。It should also be noted that the exemplary embodiments mentioned in this application describe some methods or systems based on a series of steps or devices. However, the present application is not limited to the order of the above steps. That is to say, the steps may be performed in the order mentioned in the embodiment, or may be different from the order in the embodiment, or several steps may be performed simultaneously.

上面参考根据本公开的实施例的方法、装置(系统)和计算机程序产品的流程图和/或框图描述了本公开的各方面。应当理解,流程图和/或框图中的每个方框以及流程图和/或框图中各方框的组合可以由计算机程序指令实现。这些计算机程序指令可被提供给通用计算机、专用计算机、或其它可编程数据处理装置的处理器,以产生一种机器,使得经由计算机或其它可编程数据处理装置的处理器执行的这些指令使能对流程图和/或框图的一个或多个方框中指定的功能/动作的实现。这种处理器可以是但不限于是通用处理器、专用处理器、特殊应用处理器或者现场可编程逻辑电路。还可理解,框图和/或流程图中的每个方框以及框图和/或流程图中的方框的组合,也可以由执行指定的功能或动作的专用硬件来实现,或可由专用硬件和计算机指令的组合来实现。Aspects of the present disclosure are described above with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the disclosure. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine such that execution of the instructions via the processor of the computer or other programmable data processing apparatus enables Implementation of the functions/actions specified in one or more blocks of a flowchart and/or block diagram. Such a processor may be, but is not limited to, a general-purpose processor, a special-purpose processor, a special application processor, or a field-programmable logic circuit. It will also be understood that each block in the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations, can also be implemented by special purpose hardware that performs the specified functions or actions, or can be implemented by special purpose hardware and A combination of computer instructions.

以上所述,仅为本申请的具体实施方式,所属领域的技术人员可以清楚地了解到,为了描述的方便和简洁,上述描述的系统、模块和单元的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。应理解,本申请的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本申请揭露的技术范围内,可轻易想到各种等效的修改或替换,这些修改或替换都应涵盖在本申请的保护范围之内。The above are only specific implementation modes of the present application. Those skilled in the art can clearly understand that for the convenience and simplicity of description, the specific working processes of the above-described systems, modules and units can be referred to the foregoing method embodiments. The corresponding process will not be described again here. It should be understood that the protection scope of the present application is not limited thereto. Any person familiar with the technical field can easily think of various equivalent modifications or substitutions within the technical scope disclosed in the present application, and these modifications or substitutions should be covered. within the protection scope of this application.

Claims (10)

1.一种信息确定方法,其特征在于,包括:1. An information determination method, characterized by including: 获取第一对象的常驻位置信息,以及多个第二对象的常驻位置信息和宽带安装位置信息,所述第一对象为待安装宽带的对象,所述第二对象为已安装宽带的对象;Obtain the resident location information of the first object, and the resident location information and broadband installation location information of a plurality of second objects. The first object is the object to be installed with broadband, and the second object is the object that has installed broadband. ; 计算每个第二对象的常驻位置信息与第一对象的常驻位置信息的距离;Calculate the distance between the resident position information of each second object and the resident position information of the first object; 按照所述距离的由近至远的顺序对所述第二对象进行排列,得到目标序列;Arrange the second objects in order from near to far to obtain a target sequence; 获取所述目标序列中前k个距离对应的第二对象的宽带安装位置信息,其中,k为预设数值;Obtain the broadband installation position information of the second object corresponding to the first k distances in the target sequence, where k is a preset value; 确定所述前k个距离对应的第二对象的宽带安装位置信息中,数量最多的宽带安装位置信息为第一对象的目标安装位置信息。It is determined that among the broadband installation location information of the second object corresponding to the first k distances, the largest number of broadband installation location information is the target installation location information of the first object. 2.根据权利要求1所述的方法,其特征在于,所述获取第一对象的常驻位置信息,包括:2. The method according to claim 1, characterized in that obtaining the resident location information of the first object includes: 获取对所述第一对象以预设频率定位得到的多个位置信息,以及在多个所述位置停留的时间;Obtain multiple location information obtained by locating the first object at a preset frequency, and the time spent at multiple locations; 确定所述多个位置信息中,在所述多个位置停留的时间最长和/或次数最多的位置信息为第一对象的常驻位置信息。It is determined that among the plurality of position information, the position information that stays at the plurality of positions for the longest time and/or the most times is the resident position information of the first object. 3.根据权利要求1所述的方法,其特征在于,获取多个第二对象的宽带安装位置信息,包括:3. The method according to claim 1, characterized in that obtaining broadband installation location information of multiple second objects includes: 获取多个第二对象的宽带安装的地址信息,所述地址信息包括第一地址序列的信息和第二地址序列的信息;Obtain address information of broadband installations of multiple second objects, where the address information includes information of a first address sequence and information of a second address sequence; 通过正则表达式过滤所述地址信息中所述第二地址序列的信息;Filter the information of the second address sequence in the address information through a regular expression; 确定所述地址信息中所述第一地址序列的信息为多个第二对象的宽带安装位置信息。It is determined that the information of the first address sequence in the address information is broadband installation location information of a plurality of second objects. 4.根据权利要求3所述的方法,其特征在于,所述第一地址序列的信息包括省、市、区、街道、行政村、小区的序列的信息;所述第二地址序列的信息包括栋、单元以及楼层的序列的信息。4. The method according to claim 3, characterized in that the information of the first address sequence includes information of the sequence of province, city, district, street, administrative village, and community; the information of the second address sequence includes Building, unit and floor sequence information. 5.根据权利要求1所述的方法,其特征在于,在所述计算每个第二对象的常驻位置信息与第一对象的常驻位置信息的距离之后,所述方法还包括:5. The method of claim 1, wherein after calculating the distance between the resident location information of each second object and the resident location information of the first object, the method further includes: 确定小于预设距离阈值的距离对应的第二对象的宽带安装位置信息中,数量最多的宽带安装位置信息为所述目标安装位置信息。It is determined that among the broadband installation location information of the second object corresponding to a distance smaller than the preset distance threshold, the largest number of broadband installation location information is the target installation location information. 6.根据权利要求1所述的方法,其特征在于,所述方法还包括:6. The method according to claim 1, characterized in that, the method further comprises: 获取所述第一对象的与地址相关的文档;Obtain address-related documents of the first object; 对所述目标安装位置信息进行分词处理,得到分词后的多个单词;Perform word segmentation processing on the target installation location information to obtain multiple words after word segmentation; 根据所述多个单词与单词在所述文档中的位置之间的映射关系,获取所述多个单词在所述文档中出现的次数;According to the mapping relationship between the multiple words and the position of the word in the document, obtain the number of times the multiple words appear in the document; 在所述次数超过预设阈值的情况下,将所述目标安装位置信息输出。If the number of times exceeds the preset threshold, the target installation location information is output. 7.一种信息确定装置,其特征在于,所述装置包括:7. An information determination device, characterized in that the device includes: 获取模块,用于获取第一对象的常驻位置信息,以及多个第二对象的常驻位置信息和宽带安装位置信息,所述第一对象为待安装宽带的对象,所述第二对象为已安装宽带的对象;An acquisition module is used to obtain the resident location information of a first object, as well as the resident location information and broadband installation location information of a plurality of second objects. The first object is an object to be installed with broadband, and the second object is Objects with installed broadband; 计算模块,用于计算每个第二对象的常驻位置信息与第一对象的常驻位置信息的距离;a calculation module for calculating the distance between the resident position information of each second object and the resident position information of the first object; 排列模块,用于按照所述距离的由近至远的顺序对所述第二对象进行排列,得到目标序列;An arrangement module, configured to arrange the second objects in order from near to far according to the distance to obtain a target sequence; 所述获取模块,还用于获取所述目标序列中前k个距离对应的第二对象的宽带安装位置信息,其中,k为预设数值;The acquisition module is also used to acquire the broadband installation position information of the second object corresponding to the first k distances in the target sequence, where k is a preset value; 确定模块,用于确定所述前k个距离对应的第二对象的宽带安装位置信息中,数量最多的宽带安装位置信息为第一对象的目标安装位置信息。A determination module configured to determine that among the broadband installation location information of the second object corresponding to the first k distances, the broadband installation location information with the largest amount is the target installation location information of the first object. 8.一种电子设备,其特征在于,所述设备包括:处理器,以及存储有计算机程序指令的存储器;所述处理器读取并执行所述计算机程序指令,以实现如权利要求1-6任意一项所述的信息确定方法。8. An electronic device, characterized in that the device includes: a processor, and a memory storing computer program instructions; the processor reads and executes the computer program instructions to implement claims 1-6 Any of the information determination methods described above. 9.一种计算机存储介质,其特征在于,所述计算机存储介质上存储有计算机程序指令,所述计算机程序指令被处理器执行时实现如权利要求1-6任意一项所述的信息确定方法。9. A computer storage medium, characterized in that computer program instructions are stored on the computer storage medium, and when the computer program instructions are executed by a processor, the information determination method according to any one of claims 1-6 is implemented. . 10.一种计算机程序产品,其特征在于,所述计算机程序产品中的指令由电子设备的处理器执行时,使得所述电子设备执行如权利要求1-6任意一项所述的信息确定方法。10. A computer program product, characterized in that, when the instructions in the computer program product are executed by a processor of an electronic device, the electronic device causes the electronic device to execute the information determination method according to any one of claims 1-6. .
CN202210739615.0A 2022-06-28 2022-06-28 Information determination method, device, equipment and computer storage medium Pending CN117370680A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210739615.0A CN117370680A (en) 2022-06-28 2022-06-28 Information determination method, device, equipment and computer storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210739615.0A CN117370680A (en) 2022-06-28 2022-06-28 Information determination method, device, equipment and computer storage medium

Publications (1)

Publication Number Publication Date
CN117370680A true CN117370680A (en) 2024-01-09

Family

ID=89393312

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210739615.0A Pending CN117370680A (en) 2022-06-28 2022-06-28 Information determination method, device, equipment and computer storage medium

Country Status (1)

Country Link
CN (1) CN117370680A (en)

Similar Documents

Publication Publication Date Title
KR101120728B1 (en) Methods for determining the approximate location of a device from ambient signals
KR101099204B1 (en) Systems for determining the approximate location of a device from ambient signals
KR101099160B1 (en) Utilization of the approximate location of a device determined from ambient signals
CN111028016A (en) Sales data prediction method and device and related equipment
CN109996278A (en) Road network method for evaluating quality, device, equipment and medium
CN111461410A (en) Air quality prediction method and device based on transfer learning
CN110674208B (en) Method and device for determining position information of user
CN112579718A (en) Urban land function identification method and device and terminal equipment
CN114374449A (en) Interference source determination method, device, equipment and medium
CN108540926B (en) Wireless signal fingerprint construction method and device
CN117370680A (en) Information determination method, device, equipment and computer storage medium
US9591454B2 (en) Computational complexity reduction of training wireless strength-based probabilistic models from big data
WO2024105640A1 (en) Frequency interference detection in cellular networks
US10996310B2 (en) Estimated user location from cellular telephony data
CN112566013B (en) Target equipment positioning method, device, equipment and computer storage medium
Dey et al. Identification of parking spaces from multi‐modal trajectory data
CN112990779A (en) Method, device, equipment and storage medium for scoring candidate address
CN109874170B (en) Geographic coordinate system blind detection method, device, equipment and medium
Paradeda et al. Bayesian estimation of passenger boardings at bus stops using Wi-Fi probe requests
CN115442758A (en) A method, device, terminal device, and storage medium for determining a travel mode of a user
CN111597277B (en) Site aggregation method, device, computer equipment and medium in electronic map
CN114782000B (en) Family relationship identification method, device, equipment, medium and product
CN110188937B (en) Business hall business scale forecasting method, device, equipment and storage medium
CN113194426B (en) Fingerprint database updating method, device, equipment and computer storage medium
CN111667026B (en) Debugging method and device for geographic position of multimedia equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination