CN107656913A - Map point of interest address extraction method, apparatus, server and storage medium - Google Patents



Publication number
CN107656913A
Authority
CN
China
Prior art keywords
address
interest
waybill
map
point
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710922733.4A
Other languages
Chinese (zh)
Other versions
CN107656913B (en)
Inventor
宋宽
王海南
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Baidu Online Network Technology Beijing Co Ltd
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201710922733.4A
Publication of CN107656913A
Application granted
Publication of CN107656913B
Legal status: Active
Anticipated expiration


Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/166 Editing, e.g. inserting or deleting
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/279 Recognition of textual entities

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Embodiments of the invention disclose a method, apparatus, server and storage medium for extracting map point of interest addresses. The method includes: selecting, from all waybill addresses of express waybills, the waybill addresses that contain a map point of interest name to obtain an address candidate set, where the address candidate set records the correspondence between waybill addresses and map points of interest; using natural language processing techniques to extract, from each waybill address in the address candidate set, an address fragment that meets a preset address standard; and combining each extracted address fragment with the city and administrative division of the map point of interest corresponding to the waybill address the fragment came from, to obtain the full address of the map point of interest. Embodiments of the invention address the high cost and poor results of manually checking and completing map point of interest addresses, allow those addresses to be checked and completed quickly and accurately, and can improve the accuracy of map information and the user experience.

Description

Map point of interest address extraction method, apparatus, server and storage medium
Technical field
Embodiments of the present invention relate to electronic map information mining technology, and in particular to a method, apparatus, server and storage medium for extracting map point of interest addresses.
Background
An electronic map contains a large number of located places, for example restaurants, hotels, scenic spots and toll stations marked on the map. These located places are points of interest that users may search for or want to reach. The address of a point of interest is one of the pieces of data users care about most, so an electronic map needs to show users addresses that are described in detail and regularly structured, which makes it easier for users to match the position of a point of interest to the real world from its address description. However, because the number of points of interest in an electronic map is huge, the addresses of some points of interest are often incomplete or incorrect, which can easily mislead users of the electronic map and harm the user experience.
In the prior art, incomplete point of interest addresses in an electronic map can only be checked and supplemented manually. This approach is costly and works poorly: completing the point of interest addresses on an electronic map takes a great deal of labor and time.
Summary of the invention
Embodiments of the present invention provide a method, apparatus, server and storage medium for extracting map point of interest addresses, so as to complete and correct map point of interest addresses efficiently, make those addresses more complete and accurate, improve the experience of map users, and reduce the cost of completing point of interest addresses.
In a first aspect, an embodiment of the present invention provides a method for extracting a map point of interest address, the method including:
selecting, from all waybill addresses of express waybills, the waybill addresses that contain a map point of interest name to obtain an address candidate set, where the address candidate set records the correspondence between waybill addresses and map points of interest;
using natural language processing techniques to extract, from each waybill address in the address candidate set, an address fragment that meets a preset address standard;
combining each extracted address fragment with the city and administrative division of the map point of interest corresponding to the waybill address the fragment came from, to obtain the full address of the map point of interest.
In a second aspect, an embodiment of the present invention further provides an apparatus for extracting a map point of interest address, the apparatus including:
an address candidate set establishing module, configured to select, from all waybill addresses of express waybills, the waybill addresses that contain a map point of interest name to obtain an address candidate set, where the address candidate set records the correspondence between waybill addresses and map points of interest;
an address fragment extraction module, configured to use natural language processing techniques to extract, from each waybill address in the address candidate set, an address fragment that meets a preset address standard;
a full address combination module, configured to combine each extracted address fragment with the city and administrative division of the map point of interest corresponding to the waybill address the fragment came from, to obtain the full address of the map point of interest.
In a third aspect, an embodiment of the present invention further provides a server, the server including:
one or more processors;
a storage device, configured to store one or more programs;
where the one or more programs, when executed by the one or more processors, cause the one or more processors to implement the method for extracting a map point of interest address described in any embodiment of the present invention.
In a fourth aspect, an embodiment of the present invention further provides a computer-readable storage medium storing a computer program, where the program, when executed by a processor, implements the method for extracting a map point of interest address described in any embodiment of the present invention.
Embodiments of the present invention extract address fragments from express waybill addresses that contain a map point of interest name and combine each fragment with the city and administrative division of the map point of interest corresponding to the waybill address, obtaining the full address of the map point of interest. This addresses the high cost and poor results of manually checking and completing map point of interest addresses, allows those addresses to be checked and completed quickly and accurately, and can improve the accuracy of map information and the user experience.
Brief description of the drawings
Fig. 1 is a flowchart of the method for extracting a map point of interest address in Embodiment one of the present invention;
Fig. 2 is a flowchart of the method for extracting a map point of interest address in Embodiment two of the present invention;
Fig. 3 is a schematic structural diagram of the apparatus for extracting a map point of interest address in Embodiment three of the present invention;
Fig. 4 is a schematic structural diagram of the server in Embodiment four of the present invention.
Detailed description
The present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described here only explain the present invention and do not limit it. It should also be noted that, for ease of description, the drawings show only the parts related to the present invention rather than the entire structure.
Embodiment one
Fig. 1 is a flowchart of the method for extracting a map point of interest address provided by Embodiment one of the present invention. This embodiment is applicable to completing electronic map information. The method can be performed by an apparatus for extracting map point of interest addresses, and the apparatus can be configured in, for example, a server. As shown in Fig. 1, the method specifically includes:
S110: select, from all waybill addresses of express waybills, the waybill addresses that contain a map point of interest name to obtain an address candidate set, where the address candidate set records the correspondence between waybill addresses and map points of interest.
Here, waybill addresses are all the addresses that come from express waybills. Express waybills contain rich address information and are the source from which embodiments of the invention extract addresses. A point of interest is a place marked on a map, such as a restaurant, hotel, shopping mall, scenic spot or toll station, that users of the electronic map may search for, locate on the map or want to reach. Point of interest data, including information such as the name and coordinate position of each point of interest, is stored in the server's database or in a cloud database. The electronic map needs to show users addresses that are as detailed and regularly structured as possible, so that users can more easily find their destination in the real world from the address description.
Specifically, when the address of a given point of interest needs to be completed, the name of that point of interest is compared against the waybill addresses. If a waybill address contains the name of the point of interest, that waybill address is added to the address candidate set as one of the candidate addresses, and the correspondence between the waybill address and the map point of interest is recorded. The correspondence indicates which point of interest a waybill address corresponds to; in embodiments of the invention, each address in the address candidate set corresponds to one point of interest.
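The name-matching step in S110 can be sketched as follows. This is a minimal illustration that assumes plain substring matching; the `Poi` record, its field names and the function name `build_candidate_set` are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Poi:
    """Hypothetical point-of-interest record: an identifier and a display name."""
    poi_id: str
    name: str


def build_candidate_set(waybill_addresses, pois):
    """Return (waybill_address, poi_id) pairs for addresses that contain a POI name.

    Each pair records the correspondence between a waybill address and the
    point of interest whose name it contains.
    """
    candidates = []
    for address in waybill_addresses:
        for poi in pois:
            # Plain substring containment; a real system would likely normalize text first.
            if poi.name and poi.name in address:
                candidates.append((address, poi.poi_id))
    return candidates
```

Storing the pair rather than the bare address keeps the correspondence available for the later combination step.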
Preferably, before the address candidate set is obtained, selecting from all waybill addresses of express waybills the waybill addresses that contain a map point of interest name further includes:
obtaining the coordinates, in the electronic map, of each waybill address that contains a map point of interest name and of the corresponding point of interest;
determining, from the coordinates, whether the distance in the electronic map between the waybill address that contains the map point of interest name and the corresponding point of interest exceeds a preset threshold;
taking the waybill addresses that do not exceed the preset threshold as the waybill addresses in the address candidate set.
Specifically, the distance between each waybill address in the candidate set and the point of interest needs to fall within a preset range. The preset range can be configured according to actual conditions, and an appropriate range improves the accuracy of the candidate waybill addresses obtained. For example, suppose the distance between a selected candidate waybill address and the coordinate position of the point of interest exceeds 50 meters. The address information extracted from that waybill address then cannot represent the address of the point of interest; its accuracy deviates to some degree, and it would cause users some confusion when locating the point of interest.
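The distance check can be sketched as below. The patent does not say how the distance is computed or how waybill addresses are geocoded, so this sketch assumes coordinates are already available as (latitude, longitude) pairs in degrees and uses the haversine great-circle formula; `haversine_m`, `filter_by_distance` and the 50-meter default (taken from the example above) are illustrative choices.

```python
import math

EARTH_RADIUS_M = 6_371_000.0  # mean Earth radius in meters


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in meters between two (lat, lon) points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def filter_by_distance(candidates, poi_coord, threshold_m=50.0):
    """Keep the addresses whose coordinates lie within threshold_m of the POI.

    candidates: list of (address, (lat, lon)) pairs, assumed to be geocoded already.
    poi_coord:  (lat, lon) of the point of interest.
    """
    poi_lat, poi_lon = poi_coord
    return [
        address
        for address, (lat, lon) in candidates
        if haversine_m(lat, lon, poi_lat, poi_lon) <= threshold_m
    ]
```

A tighter threshold trades recall for precision: fewer waybill addresses survive, but those that do are more likely to describe the point of interest itself.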
S120: use natural language processing techniques to extract, from each waybill address in the address candidate set, an address fragment that meets a preset address standard.
Formally, the text of a waybill address is a character string made up of Chinese characters (including punctuation marks and the like). Specifically, natural language processing methods such as word segmentation and semantic analysis can be applied to the address string, and the address fragment that meets the preset address standard is finally extracted from it. In embodiments of the invention, the preset address standard means that the address description is detailed down to the road, street and door number. The address standard can of course also be configured according to actual needs; this application places no limitation on it.
S130: combine each extracted address fragment with the city and administrative division of the map point of interest corresponding to the waybill address the fragment came from, to obtain the full address of the map point of interest.
Specifically, correcting and supplementing a point of interest address mainly means supplementing the detailed address description at the road level and below. Because the city or administrative division may be described incorrectly or non-standardly in a waybill address, the city and administrative division information stored in the database for the point of interest is combined directly with the extracted address fragment, which yields a complete and detailed point of interest address.
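The combination in S130 amounts to prefixing the extracted fragment with the city and administrative division stored for the point of interest. A minimal sketch, assuming the stored record is a dictionary with hypothetical keys `city` and `district`, and assuming Chinese-style addresses joined without separators (the sample strings are illustrative only):

```python
def compose_full_address(poi_record, fragment, separator=""):
    """Join the stored city and district of a POI with an extracted address fragment.

    poi_record: mapping with hypothetical keys "city" and "district", taken from the
                POI database rather than from the waybill text.
    fragment:   road-level-and-below address fragment extracted from a waybill address.
    """
    parts = [poi_record["city"], poi_record["district"], fragment]
    # Skip empty parts so a missing district does not leave a stray separator.
    return separator.join(part for part in parts if part)


full = compose_full_address({"city": "北京市", "district": "海淀区"}, "上地十街10号")
```

Taking the city and district from the database, not from the waybill, is what shields the result from misspelled or non-standard upper-level descriptions in the waybill text.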
Preferably, when multiple extracted address fragments correspond to the same target point of interest, combining each extracted address fragment with the city and administrative division of the map point of interest corresponding to the waybill address the fragment came from, to obtain the full address of the map point of interest, further includes:
taking, among the multiple address fragments, the address fragment that occurs the largest number of times as the target address fragment of the target point of interest;
combining the target address fragment with the city and administrative division of the target point of interest to obtain the full address of the target point of interest.
Specifically, because the number of waybill addresses is very large and users describe addresses in different ways, several waybill addresses with the same level of descriptive detail may exist for the same target point of interest. For example, after natural language processing (that is, word segmentation and word attribute labeling) of the waybill addresses in the address candidate set, several waybill addresses may all contain detailed information such as the street and door number and all correspond to the same target point of interest, so multiple address fragments can be extracted from those waybill addresses. From these fragments, the fragment that occurs the largest number of times is selected as the target address fragment of the target point of interest; this selection is treated as the most accurate address description fragment. The target address fragment is then combined with the city and administrative division of the target point of interest to obtain the full address of the target point of interest, which is the result of completing the target point of interest address.
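Selecting the most frequent fragment is a simple majority vote. The sketch below assumes fragments are compared by exact string equality; the function name `pick_target_fragment` is hypothetical, and the patent does not say how ties are broken.

```python
from collections import Counter


def pick_target_fragment(fragments):
    """Return the fragment that occurs most often, or None if there are no fragments.

    Fragments are compared by exact string equality. Tie-breaking is not specified
    by the source; this sketch simply takes whatever Counter.most_common returns first.
    """
    if not fragments:
        return None
    return Counter(fragments).most_common(1)[0][0]
```

In practice, fragments would probably need normalization (full-width digits, whitespace, synonymous suffixes) before counting, otherwise near-identical descriptions split the vote.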
The technical solution of this embodiment selects qualifying waybill addresses, uses natural language processing techniques to extract qualifying address fragments from them, and combines each fragment with the city and administrative division information of the corresponding point of interest to obtain complete and detailed address information. When multiple extracted address fragments correspond to the same target point of interest, the fragment that occurs the largest number of times is selected as the final address fragment. This addresses the high cost and poor results of manually checking and completing point of interest addresses, and can improve the accuracy of map information and the user experience.
Embodiment two
Fig. 2 is a flowchart of the method for extracting a map point of interest address provided by Embodiment two of the present invention. Embodiment two further optimizes Embodiment one. As shown in Fig. 2, the method includes:
S210: select, from all waybill addresses of express waybills, the waybill addresses that contain a map point of interest name to obtain an address candidate set, where the address candidate set records the correspondence between waybill addresses and map points of interest.
S220: perform word segmentation and word attribute labeling on each waybill address in the address candidate set to obtain the segmented words of each address and their address attributes.
Specifically, an address is usually a combination of country, province, city, district, township, road, street and door number fragments. Here the characters for country, province, city, district, township, road, street and door number are used as keywords to segment each waybill address, and semantic analysis is used to label the address attribute of each segmented word, that is, which of the country, province, city, district, township, road, street or door number fragments the word belongs to.
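Keyword-based segmentation and labeling might look like the sketch below. It is a deliberately naive approximation: it treats a handful of Chinese suffix characters as the keywords, cuts the string after each one, and labels the token by its suffix. The suffix table, attribute names and `segment_and_label` are assumptions for illustration; the patent's semantic analysis is not specified and is not reproduced here.

```python
import re

# Hypothetical suffix-to-attribute table; a real system would need a richer lexicon.
SUFFIX_TO_ATTRIBUTE = {
    "省": "province",
    "市": "city",
    "区": "district",
    "县": "district",
    "镇": "town",
    "乡": "town",
    "路": "road",
    "街": "street",
    "号": "door_number",
}

# Shortest run of characters that ends in one of the keyword suffixes.
_TOKEN = re.compile("(.+?)([" + "".join(SUFFIX_TO_ATTRIBUTE) + "])")


def segment_and_label(address):
    """Split an address after each keyword suffix and label every token by that suffix.

    Text after the last keyword (for example a building name) is left unlabeled
    and is dropped from the result.
    """
    return [
        (match.group(0), SUFFIX_TO_ATTRIBUTE[match.group(2)])
        for match in _TOKEN.finditer(address)
    ]


tokens = segment_and_label("北京市海淀区上地十街10号百度大厦")
```

This approach mislabels names that happen to contain a suffix character (a market name containing 市, for instance), which is one reason a production system would combine a dictionary or a trained sequence labeler with the keyword cues.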
S230: according to the segmented words of each address and their address attributes, select from the address candidate set the waybill addresses whose segmented-word address attributes meet the preset address standard, to obtain a target address candidate set.
Specifically, after the waybill addresses in the address candidate set have been segmented, the most detailed address descriptions are picked out according to the segmentation results and address attributes and stored in the target address candidate set. For example, suppose the address candidate set holds 100 waybill addresses that contain the point of interest name. After word segmentation and semantic analysis, it can be determined that some waybill addresses are detailed only to the district, some are detailed to the street, and some give a specific door number. Among all these addresses, the most detailed waybill addresses that meet the preset address standard are then stored in the target address candidate set as target addresses.
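One way to read this selection step is sketched below: rank the address attributes from coarse to fine, measure each address by its finest labeled attribute, discard addresses that do not reach the preset standard, and keep those at the greatest depth. The ranking table, the default standard of `road` and the function names are assumptions; the patent leaves the exact rule configurable.

```python
# Hypothetical coarse-to-fine ranking of address attributes.
DETAIL_RANK = {
    "province": 1,
    "city": 2,
    "district": 3,
    "town": 4,
    "road": 5,
    "street": 6,
    "door_number": 7,
}


def detail_depth(labeled_tokens):
    """Rank of the finest attribute among (token, attribute) pairs; 0 if none is known."""
    return max((DETAIL_RANK.get(attr, 0) for _, attr in labeled_tokens), default=0)


def select_target_candidates(labeled_addresses, min_attribute="road"):
    """Return the most detailed addresses that reach at least min_attribute.

    labeled_addresses: dict mapping each waybill address to its (token, attribute) list.
    """
    required = DETAIL_RANK[min_attribute]
    depths = {addr: detail_depth(toks) for addr, toks in labeled_addresses.items()}
    qualifying = {addr: depth for addr, depth in depths.items() if depth >= required}
    if not qualifying:
        return []
    best = max(qualifying.values())
    return [addr for addr, depth in qualifying.items() if depth == best]
```

With the 100-address example above, addresses detailed only to the district are dropped, and if any address reaches the door number, only door-number-level addresses are kept.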
S240: for each waybill address in the target address candidate set, extract the address fragment that meets the preset address standard according to the segmented words and their address attributes.
Specifically, this operation extracts address fragments from the waybill addresses that remain after screening, that is, the fragments covering the road level and everything more detailed below it. The corresponding address fragment is extracted according to the segmented words and their address attributes.
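Given labeled tokens, extracting the road-level-and-below fragment reduces to keeping the tokens whose attribute belongs to a fine-grained set and concatenating them in their original order. The attribute names and `extract_fragment` below are illustrative assumptions, not the patent's terminology.

```python
def extract_fragment(labeled_tokens, fine_attributes=("road", "street", "door_number")):
    """Concatenate, in original order, the tokens labeled at road level or finer.

    labeled_tokens: list of (token, attribute) pairs produced by an earlier
                    segmentation and labeling step.
    """
    return "".join(token for token, attr in labeled_tokens if attr in fine_attributes)


fragment = extract_fragment(
    [("北京市", "city"), ("海淀区", "district"), ("上地十街", "street"), ("10号", "door_number")]
)
```

Dropping the city and district tokens here is intentional, since those levels are later taken from the point of interest database rather than from the waybill text.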
S250: combine each extracted address fragment with the city and administrative division of the map point of interest corresponding to the waybill address the fragment came from, to obtain the full address of the map point of interest.
The technical solution of this embodiment performs word segmentation and word attribute labeling on waybill addresses, extracts qualifying address fragments from the qualifying waybill addresses, and combines each fragment with the city and administrative division information of the corresponding point of interest to obtain complete and detailed address information. This addresses the high cost and poor results of manually checking and completing point of interest addresses, and can improve the accuracy of map information and the user experience.
Embodiment three
Fig. 3 is a schematic structural diagram of the apparatus for extracting a map point of interest address in Embodiment three of the present invention. As shown in Fig. 3, the apparatus includes:
an address candidate set establishing module 310, configured to select, from all waybill addresses of express waybills, the waybill addresses that contain a map point of interest name to obtain an address candidate set, where the address candidate set records the correspondence between waybill addresses and map points of interest;
an address fragment extraction module 320, configured to use natural language processing techniques to extract, from each waybill address in the address candidate set, an address fragment that meets a preset address standard;
a full address combination module 330, configured to combine each extracted address fragment with the city and administrative division of the map point of interest corresponding to the waybill address the fragment came from, to obtain the full address of the map point of interest.
Further, the address candidate set establishing module 310 includes:
an address information acquiring unit, configured to obtain, before the address candidate set is obtained, the coordinates in the electronic map of each waybill address that contains a map point of interest name and of the corresponding point of interest;
a distance judging unit, configured to determine, from the coordinates, whether the distance in the electronic map between the waybill address that contains the map point of interest name and the corresponding point of interest exceeds a preset threshold;
a candidate address selecting unit, configured to take the waybill addresses that do not exceed the preset threshold as the waybill addresses in the address candidate set.
Further, the address fragment extraction module 320 is specifically configured to:
perform word segmentation and word attribute labeling on each waybill address in the address candidate set to obtain the segmented words of each address and their address attributes;
select from the address candidate set, according to the segmented words of each address and their address attributes, the waybill addresses whose segmented-word address attributes meet the preset address standard, to obtain a target address candidate set;
extract, for each waybill address in the target address candidate set, the address fragment that meets the preset address standard according to the segmented words and their address attributes.
Further, when multiple extracted address fragments correspond to the same target point of interest, the full address combination module 330 is further configured to:
take, among the multiple address fragments, the address fragment that occurs the largest number of times as the target address fragment of the target point of interest;
combine the target address fragment with the city and administrative division of the target point of interest to obtain the full address of the target point of interest.
The apparatus for extracting a map point of interest address provided by this embodiment of the present invention can perform the method for extracting a map point of interest address provided by any embodiment of the present invention, and has the functional modules and beneficial effects corresponding to the method performed.
Embodiment four
Fig. 4 is a schematic structural diagram of the server in Embodiment four of the present invention. Fig. 4 shows a block diagram of an exemplary server 412 suitable for implementing embodiments of the present invention. The server 412 shown in Fig. 4 is only an example and should not impose any limitation on the function or scope of use of embodiments of the present invention.
As shown in Fig. 4, the server 412 takes the form of a general-purpose computing device. The components of the server 412 may include, but are not limited to: one or more processors or processing units 416, a system memory 428, and a bus 418 that connects the different system components (including the system memory 428 and the processing unit 416).
The bus 418 represents one or more of several types of bus structures, including a memory bus or memory controller, a peripheral bus, an accelerated graphics port, and a processor or local bus using any of a variety of bus architectures. By way of example, these architectures include, but are not limited to, the Industry Standard Architecture (ISA) bus, the Micro Channel Architecture (MCA) bus, the Enhanced ISA (EISA) bus, the Video Electronics Standards Association (VESA) local bus, and the Peripheral Component Interconnect (PCI) bus.
The server 412 typically includes a variety of computer-system-readable media. These media can be any available media that can be accessed by the server 412, including volatile and non-volatile media and removable and non-removable media.
The system memory 428 may include computer-system-readable media in the form of volatile memory, such as random access memory (RAM) 430 and/or cache memory 432. The server 412 may further include other removable/non-removable, volatile/non-volatile computer system storage media. By way of example only, the storage system 434 can be used to read from and write to non-removable, non-volatile magnetic media (not shown in Fig. 4 and commonly called a "hard disk drive"). Although not shown in Fig. 4, a magnetic disk drive for reading from and writing to a removable non-volatile magnetic disk (such as a "floppy disk") and an optical disk drive for reading from and writing to a removable non-volatile optical disk (such as a CD-ROM, DVD-ROM or other optical media) can also be provided. In these cases, each drive can be connected to the bus 418 through one or more data media interfaces. The memory 428 may include at least one program product having a set of (for example, at least one) program modules configured to perform the functions of the embodiments of the present invention.
A program/utility 440 having a set of (at least one) program modules 442 may be stored in, for example, the memory 428. Such program modules 442 include, but are not limited to, an operating system, one or more application programs, other program modules and program data, and each or some combination of these examples may include an implementation of a network environment. The program modules 442 generally perform the functions and/or methods of the embodiments described in the present invention.
The server 412 may also communicate with one or more external devices 414 (such as a keyboard, a pointing device or a display 424), with one or more devices that enable a user to interact with the server 412, and/or with any device (such as a network card or a modem) that enables the server 412 to communicate with one or more other computing devices. Such communication can take place through an input/output (I/O) interface 422. The server 412 can also communicate with one or more networks (such as a local area network (LAN), a wide area network (WAN) and/or a public network such as the Internet) through a network adapter 420. As shown in the figure, the network adapter 420 communicates with the other modules of the server 412 through the bus 418. It should be understood that, although not shown in Fig. 4, other hardware and/or software modules can be used in combination with the server 412, including but not limited to: microcode, device drivers, redundant processing units, external disk drive arrays, RAID systems, tape drives and data backup storage systems.
The processing unit 416 runs the programs stored in the system memory 428 to execute various functional applications and data processing, for example to implement the method for extracting a map point of interest address provided by embodiments of the present invention, the method including:
selecting, from all waybill addresses of express waybills, the waybill addresses that contain a map point of interest name to obtain an address candidate set, where the address candidate set records the correspondence between waybill addresses and map points of interest;
using natural language processing techniques to extract, from each waybill address in the address candidate set, an address fragment that meets a preset address standard;
combining each extracted address fragment with the city and administrative division of the map point of interest corresponding to the waybill address the fragment came from, to obtain the full address of the map point of interest.
Embodiment five
The embodiment of the present invention five additionally provides a kind of computer-readable recording medium, is stored thereon with computer program, should The extracting method of the map interest dot address provided such as the embodiment of the present invention is realized when program is executed by processor, its feature exists In, including:
The waybill address for including map interest point name is chosen from all waybill addresses of express waybill, obtains address time Selected works, wherein, the address candidates centralized recording has the corresponding relation of waybill address and map point of interest;
Using natural language processing technique, extraction, which meets, from every waybill address that the address candidates are concentrated presets ground The address fragment of location standard;
By the city corresponding to map point of interest corresponding to the address fragment extracted and its affiliated waybill address and administration Zoning is combined, and obtains the full address of map point of interest.
The computer-readable storage medium of the embodiment of the present invention, any of one or more computer-readable media can be used Combination.Computer-readable medium can be computer-readable signal media or computer-readable recording medium.It is computer-readable Storage medium for example may be-but not limited to-the system of electricity, magnetic, optical, electromagnetic, infrared ray or semiconductor, device or Device, or any combination above.The more specifically example (non exhaustive list) of computer-readable recording medium includes:Tool There are the electrical connections of one or more wires, portable computer diskette, hard disk, random access memory (RAM), read-only storage (ROM), erasable programmable read only memory (EPROM or flash memory), optical fiber, portable compact disc read-only storage (CD- ROM), light storage device, magnetic memory device or above-mentioned any appropriate combination.In this document, computer-readable storage Medium can be any includes or the tangible medium of storage program, the program can be commanded execution system, device or device Using or it is in connection.
A computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave, in which computer-readable program code is carried. Such a propagated data signal may take many forms, including but not limited to an electromagnetic signal, an optical signal, or any suitable combination of the above. A computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium that can send, propagate, or transmit a program for use by or in connection with an instruction execution system, apparatus, or device.
The program code contained on a computer-readable medium may be transmitted over any suitable medium, including but not limited to wireless, wire, optical cable, RF, and the like, or any suitable combination of the above.
Computer program code for carrying out the operations of the present invention may be written in one or more programming languages or a combination thereof, including object-oriented programming languages such as Java, Smalltalk, and C++, as well as conventional procedural programming languages such as the "C" language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on a remote computer or server. Where a remote computer is involved, the remote computer may be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (for example, through the Internet using an Internet service provider).
Note that the above are merely preferred embodiments of the present invention and the technical principles applied. Those skilled in the art will understand that the present invention is not limited to the specific embodiments described here, and that various obvious changes, readjustments, and substitutions can be made without departing from the scope of protection of the present invention. Therefore, although the present invention has been described in further detail through the above embodiments, it is not limited to those embodiments and may include other equivalent embodiments without departing from the inventive concept; the scope of the present invention is determined by the scope of the appended claims.

Claims (10)

  1. A method for extracting the address of a map point of interest, characterized by comprising:
    selecting, from all waybill addresses of express waybills, the waybill addresses that contain a map point-of-interest name to obtain an address candidate set, wherein the address candidate set records the correspondence between waybill addresses and map points of interest;
    extracting, by using natural language processing technology, an address fragment that meets a preset address standard from each waybill address in the address candidate set;
    combining each extracted address fragment with the city and administrative division of the map point of interest corresponding to the waybill address to which the fragment belongs, to obtain the full address of the map point of interest.
  2. The method for extracting the address of a map point of interest according to claim 1, characterized in that, before the address candidate set is obtained, selecting the waybill addresses that contain a map point-of-interest name from all waybill addresses of express waybills further comprises:
    obtaining, respectively, the coordinates in an electronic map of each waybill address that contains a map point-of-interest name and of the corresponding point of interest;
    determining, according to the coordinates, whether the distance in the electronic map between the waybill address that contains the map point-of-interest name and the corresponding point of interest exceeds a preset threshold;
    taking the waybill addresses whose distance does not exceed the preset threshold as the waybill addresses in the address candidate set.
  3. The method for extracting the address of a map point of interest according to claim 1, characterized in that extracting, by using natural language processing technology, an address fragment that meets a preset address standard from each waybill address in the address candidate set comprises:
    performing word segmentation and word-attribute tagging on each waybill address in the address candidate set to obtain the segmented words of each address and their address attributes;
    selecting from the address candidate set, according to the segmented words of each address and their address attributes, the waybill addresses whose word address attributes meet the preset address standard, to obtain a target address candidate set;
    for each waybill address in the target address candidate set, extracting, according to the segmented words and their address attributes, the address fragment that meets the preset address standard.
  4. The method for extracting the address of a map point of interest according to claim 1, characterized in that, when multiple address fragments among the extracted address fragments correspond to the same target point of interest, combining each extracted address fragment with the city and administrative division of the map point of interest corresponding to the waybill address to which the fragment belongs, to obtain the full address of the map point of interest, further comprises:
    taking, among the multiple address fragments, the address fragment with the largest number of identical occurrences as the target address fragment of the target point of interest;
    combining the target address fragment with the city and administrative division corresponding to the target point of interest to obtain the full address of the target point of interest.
  5. An apparatus for extracting the address of a map point of interest, characterized by comprising:
    an address candidate set establishing module, configured to select, from all waybill addresses of express waybills, the waybill addresses that contain a map point-of-interest name to obtain an address candidate set, wherein the address candidate set records the correspondence between waybill addresses and map points of interest;
    an address fragment extraction module, configured to extract, by using natural language processing technology, an address fragment that meets a preset address standard from each waybill address in the address candidate set;
    a full address combination module, configured to combine each extracted address fragment with the city and administrative division of the map point of interest corresponding to the waybill address to which the fragment belongs, to obtain the full address of the map point of interest.
  6. The apparatus for extracting the address of a map point of interest according to claim 5, characterized in that the address candidate set establishing module comprises:
    an address information obtaining unit, configured to obtain, before the address candidate set is obtained, the coordinates in an electronic map of each waybill address that contains a map point-of-interest name and of the corresponding point of interest;
    a distance judging unit, configured to determine, according to the coordinates, whether the distance in the electronic map between the waybill address that contains the map point-of-interest name and the corresponding point of interest exceeds a preset threshold;
    a candidate address selecting unit, configured to take the waybill addresses whose distance does not exceed the preset threshold as the waybill addresses in the address candidate set.
  7. The apparatus for extracting the address of a map point of interest according to claim 5, characterized in that the address fragment extraction module is specifically configured to:
    perform word segmentation and word-attribute tagging on each waybill address in the address candidate set to obtain the segmented words of each address and their address attributes;
    select from the address candidate set, according to the segmented words of each address and their address attributes, the waybill addresses whose word address attributes meet the preset address standard, to obtain a target address candidate set;
    for each waybill address in the target address candidate set, extract, according to the segmented words and their address attributes, the address fragment that meets the preset address standard.
  8. The apparatus for extracting the address of a map point of interest according to claim 5, characterized in that, when multiple address fragments among the extracted address fragments correspond to the same target point of interest, the full address combination module is further configured to:
    take, among the multiple address fragments, the address fragment with the largest number of identical occurrences as the target address fragment of the target point of interest;
    combine the target address fragment with the city and administrative division corresponding to the target point of interest to obtain the full address of the target point of interest.
  9. A server, characterized in that the server comprises:
    one or more processors;
    a storage device, configured to store one or more programs;
    wherein, when the one or more programs are executed by the one or more processors, the one or more processors implement the method for extracting the address of a map point of interest according to any one of claims 1-4.
  10. A computer-readable storage medium on which a computer program is stored, characterized in that the program, when executed by a processor, implements the method for extracting the address of a map point of interest according to any one of claims 1-4.
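
The refinements in claims 2 and 4 (discarding waybill addresses that lie too far from the point of interest, and choosing the most frequent fragment when several are extracted for one point of interest) can be illustrated with the short sketch below. It rests on assumptions not found in the claims: the haversine great-circle formula as the distance measure, a 1 km threshold, and the function names and sample coordinates, all of which are hypothetical.

```python
import math
from collections import Counter


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two (lat, lon) points in degrees."""
    r = 6371.0  # mean Earth radius in km
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = math.radians(lat2 - lat1)
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * r * math.asin(math.sqrt(a))


def within_threshold(waybill_coord, poi_coord, threshold_km=1.0):
    """Claim 2 idea: keep a waybill address only if it is not farther than the threshold."""
    return haversine_km(*waybill_coord, *poi_coord) <= threshold_km


def most_common_fragment(fragments):
    """Claim 4 idea: the fragment with the most identical occurrences wins."""
    if not fragments:
        return None
    return Counter(fragments).most_common(1)[0][0]


# Hypothetical coordinates: one waybill geocoded next to the POI, one across town.
poi_coord = (40.0507, 116.3010)
near_waybill = (40.0510, 116.3015)
far_waybill = (39.9042, 116.4074)

fragments = [
    "No. 10 Shangdi 10th Street",
    "No. 10 Shangdi 10th Street",
    "No. 8 Shangdi 10th Street",
]
target = most_common_fragment(fragments)
```

The distance filter would run before the candidate set is finalized, and the frequency vote would run just before the fragment is combined with the city and administrative division. How ties between equally frequent fragments should be broken is not stated in the claims; this sketch simply returns the first one encountered.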
CN201710922733.4A 2017-09-30 2017-09-30 Map interest point address extraction method, map interest point address extraction device, server and storage medium Active CN107656913B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710922733.4A CN107656913B (en) 2017-09-30 2017-09-30 Map interest point address extraction method, map interest point address extraction device, server and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710922733.4A CN107656913B (en) 2017-09-30 2017-09-30 Map interest point address extraction method, map interest point address extraction device, server and storage medium

Publications (2)

Publication Number Publication Date
CN107656913A true CN107656913A (en) 2018-02-02
CN107656913B CN107656913B (en) 2021-03-23

Family

ID=61117611

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710922733.4A Active CN107656913B (en) 2017-09-30 2017-09-30 Map interest point address extraction method, map interest point address extraction device, server and storage medium

Country Status (1)

Country Link
CN (1) CN107656913B (en)

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080222256A1 (en) * 2007-03-08 2008-09-11 Rosenberg Greg A Autocomplete for intergrating diverse methods of electronic communication
CN103218375A (en) * 2012-01-20 2013-07-24 北京四维图新科技股份有限公司 POI (Point of Interest) information supplementing method and device
CN104216895A (en) * 2013-05-31 2014-12-17 高德软件有限公司 Method and device for generating POI data
CN104899243A (en) * 2015-03-31 2015-09-09 北京奇虎科技有限公司 Method and apparatus for detecting accuracy of POI (Point of Interest) data
KR101556743B1 (en) * 2014-04-07 2015-10-02 주식회사 케이티 Apparatus and method for generating poi information based on web collection
CN105160031A (en) * 2015-09-30 2015-12-16 北京奇虎科技有限公司 Mining method and device for map point of interest (POI) data
CN105760360A (en) * 2014-12-16 2016-07-13 高德软件有限公司 Address correction method and device
CN106156145A (en) * 2015-04-13 2016-11-23 阿里巴巴集团控股有限公司 The management method of a kind of address date and device
CN106682175A (en) * 2016-12-29 2017-05-17 华南师范大学 Method and system for matching address
CN106874384A (en) * 2017-01-10 2017-06-20 广东精规划信息科技股份有限公司 A kind of isomery address standard handovers and matching process
CN106919569A (en) * 2015-12-24 2017-07-04 北京四维图新科技股份有限公司 A kind of method and device of the administrative division information for obtaining point of interest POI
CN106919567A (en) * 2015-12-24 2017-07-04 北京四维图新科技股份有限公司 A kind of processing method and processing device of point of interest POI addresses

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
王勇 等: "顾及位置关系的网络POI地址信息标准化处理方法", 《测绘学报》 *

Cited By (31)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110556049A (en) * 2018-06-04 2019-12-10 百度在线网络技术(北京)有限公司 map data processing method, device, server and storage medium
CN110716992A (en) * 2018-06-27 2020-01-21 百度在线网络技术(北京)有限公司 Method and device for recommending name of point of interest
CN110716992B (en) * 2018-06-27 2022-05-27 百度在线网络技术(北京)有限公司 Method and device for recommending name of point of interest
CN110874442A (en) * 2018-08-31 2020-03-10 阿里巴巴集团控股有限公司 Method, apparatus, device and medium for processing information
CN110968654B (en) * 2018-09-29 2023-10-20 阿里巴巴集团控股有限公司 Address category determining method, equipment and system for text data
CN110968654A (en) * 2018-09-29 2020-04-07 阿里巴巴集团控股有限公司 Method, equipment and system for determining address category of text data
CN109389119A (en) * 2018-10-23 2019-02-26 百度在线网络技术(北京)有限公司 Point of interest area determination method, device, equipment and medium
CN109389119B (en) * 2018-10-23 2021-10-26 百度在线网络技术(北京)有限公司 Method, device, equipment and medium for determining interest point region
CN111325022B (en) * 2018-11-28 2023-11-03 北京京东振世信息技术有限公司 Method and device for identifying hierarchical address
CN111325022A (en) * 2018-11-28 2020-06-23 北京京东尚科信息技术有限公司 Method and device for identifying hierarchical address
CN111460054B (en) * 2019-01-21 2023-06-30 阿里巴巴集团控股有限公司 Address data processing method and device, equipment and storage medium
CN111460054A (en) * 2019-01-21 2020-07-28 阿里巴巴集团控股有限公司 Address data processing method and device, equipment and storage medium
CN111460057A (en) * 2019-01-22 2020-07-28 阿里巴巴集团控股有限公司 POI coordinate determination method, device and equipment
CN111460057B (en) * 2019-01-22 2023-06-27 阿里巴巴集团控股有限公司 POI (Point of interest) coordinate determining method, device and equipment
CN111723165A (en) * 2019-03-18 2020-09-29 阿里巴巴集团控股有限公司 Address interest point determining method, device and system
CN110175216B (en) * 2019-05-15 2021-05-11 腾讯科技(深圳)有限公司 Coordinate error correction method and device and computer equipment
CN110175216A (en) * 2019-05-15 2019-08-27 腾讯科技(深圳)有限公司 Coordinate error correction method, device and computer equipment
CN111984747A (en) * 2019-05-21 2020-11-24 丰图科技(深圳)有限公司 Method, device and equipment for acquiring geographic information data
CN110457706B (en) * 2019-08-15 2023-08-22 腾讯科技(深圳)有限公司 Point-of-interest name selection model training method, using method, device and storage medium
CN110457706A (en) * 2019-08-15 2019-11-15 腾讯科技(深圳)有限公司 Interest point name preference pattern training method, application method, device and storage medium
CN113706065A (en) * 2020-05-22 2021-11-26 百度在线网络技术(北京)有限公司 Goods classification method, device, equipment and storage medium
CN111782741A (en) * 2020-06-04 2020-10-16 汉海信息技术(上海)有限公司 Interest point mining method and device, electronic equipment and storage medium
CN111723172A (en) * 2020-06-10 2020-09-29 广东世纪高通科技有限公司 Data fusion method and device
CN112016326A (en) * 2020-09-25 2020-12-01 北京百度网讯科技有限公司 Map area word recognition method and device, electronic equipment and storage medium
CN112488194A (en) * 2020-11-30 2021-03-12 上海寻梦信息技术有限公司 Address abbreviation generation method, model training method and related equipment
CN112966192A (en) * 2021-02-09 2021-06-15 北京百度网讯科技有限公司 Region address naming method and device, electronic equipment and readable storage medium
CN112966192B (en) * 2021-02-09 2023-10-27 北京百度网讯科技有限公司 Regional address naming method, apparatus, electronic device and readable storage medium
CN113190640B (en) * 2021-05-20 2023-02-07 拉扎斯网络科技(上海)有限公司 Method and device for processing point of interest data
CN113190640A (en) * 2021-05-20 2021-07-30 拉扎斯网络科技(上海)有限公司 Method and device for processing point of interest data
CN113935293B (en) * 2021-12-16 2022-03-22 湖南四方天箭信息科技有限公司 Address splitting and complementing method and device, computer equipment and storage medium
CN113935293A (en) * 2021-12-16 2022-01-14 湖南四方天箭信息科技有限公司 Address splitting and complementing method and device, computer equipment and storage medium

Also Published As

Publication number Publication date
CN107656913B (en) 2021-03-23

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant