US20200349175A1 - Address Search Method and Device - Google Patents

Address Search Method and Device

Info

Publication number
US20200349175A1
Authority
US
United States
Prior art keywords
address information
address
information
sub
string
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US16/929,611
Inventor
Quan Qi
Jiulong Zhang
Hang Li
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd
Priority to US16/929,611
Assigned to HUAWEI TECHNOLOGIES CO., LTD. Assignment of assignors interest (see document for details). Assignors: ZHANG, Jiulong; LI, Hang; QI, Quan
Publication of US20200349175A1
Abandoned

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/29 Geographical information databases
    • G PHYSICS
    • G01 MEASURING; TESTING
    • G01C MEASURING DISTANCES, LEVELS OR BEARINGS; SURVEYING; NAVIGATION; GYROSCOPIC INSTRUMENTS; PHOTOGRAMMETRY OR VIDEOGRAMMETRY
    • G01C21/00 Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00
    • G01C21/26 Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00 specially adapted for navigation in a road network
    • G01C21/34 Route searching; Route guidance
    • G01C21/36 Input/output arrangements for on-board computers
    • G01C21/3605 Destination input or retrieval
    • G01C21/3611 Destination input or retrieval using character input or menus, e.g. menus of POIs
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/903 Querying
    • G06F16/9032 Query formulation
    • G06F16/90332 Natural language query formulation or dialogue systems
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/903 Querying
    • G06F16/9038 Presentation of query results
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/953 Querying, e.g. by the use of web search engines
    • G06F16/9535 Search customisation based on user profiles and personalisation

Definitions

  • the present application relates to the field of data processing technologies, and in particular, to an address search method and device.
  • a user can initiate a navigation request to a mobile terminal in a voice or text input manner, and a navigation application selects a navigation route according to the received navigation request, and pushes the selected navigation route to the user.
  • a manner in which the navigation application selects the navigation route according to the received navigation request includes but is not limited to the following.
  • when the navigation request is entered in a text form, a conditional random field (CRF) algorithm is used to extract an address name or a building name/organization name from the received navigation request, and a destination address of the navigation request is determined using the extracted address name or building name/organization name.
  • Address information included in the voice is used as a search basis to determine a destination address of the navigation request when the navigation request is entered in a voice form.
  • a disadvantage is that, when the destination address is being determined, only the address name (or the building name/organization name) extracted from the text or voice input is used to match address information in an address database. As a result, the determined destination address set includes many irrelevant addresses, which reduces the accuracy of address searching.
  • embodiments of the present application provide an address search method and device in order to solve a currently existing problem that address searching accuracy is relatively low because multiple irrelevant addresses are found in an address search process.
  • an address search method including acquiring address search request information, determining to-be-found destination address information included in the address search request information, splitting the destination address information into at least one piece of sub-address information, where the destination address information is formed by multiple pieces of sub-address information, and the multiple pieces of sub-address information respectively correspond to different address types, matching the at least one piece of sub-address information or the at least one piece of sub-address information and the destination address information with different address information included in an address database, where each piece of address information stored in the address database includes different sub-address information that forms the address information, and setting address information that is obtained by the matching and whose match degree is greater than a set threshold as found destination address information for output.
  • acquiring address search request information includes receiving entered voice data, where the voice data is used to initiate an address search, and performing recognition on the voice data to obtain the to-be-found destination address information included in the voice data.
  • the to-be-found destination address information is obtained in the following manner: determining at least one keyword included in the address search request information; for each determined keyword, finding, from preset text address dictionaries corresponding to different address types, a text address dictionary including the keyword, and replacing the keyword with a string used to denote the address type corresponding to the found text address dictionary; determining, using a regular expression used to represent address information, whether a string group formed after each keyword is replaced with a corresponding string represents address information; when the string group represents address information, determining, according to the string group, a quasi address string corresponding to the to-be-found destination address information; and setting the quasi address string as a CRF feature, and extracting the to-be-found destination address information from the address search request information based on a CRF algorithm.
  • determining, according to the string group, a quasi address string corresponding to the to-be-found destination address information includes determining that multiple strings are included in the string group and that the keywords respectively replaced with the multiple strings are continuous in position in the address search request information; if none of the multiple strings is a duplicate, combining the multiple strings into one string group based on continuity of positions of the replaced keywords, as the quasi address string corresponding to the to-be-found destination address information; and if any of the multiple strings is a duplicate, removing the duplicate string and combining the at least one string obtained after the duplicate string is removed into one string group based on continuity of positions of the replaced keywords, as the quasi address string corresponding to the to-be-found destination address information.
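  • the keyword replacement, regular-expression check, and duplicate removal described above can be sketched as follows. This is a minimal illustration only: the dictionary contents, the tag names (CITY, ROAD, POI, AUX), the regular expression, and the helper names are all assumptions made for the example, and the subsequent CRF extraction step is not shown.

```python
import re

# Hypothetical text address dictionaries, one per address type.
# Tag names and entries are illustrative, not taken from the application.
DICTIONARIES = {
    "CITY": {"Shanghai"},
    "ROAD": {"Caoyang Road"},
    "POI": {"Station"},
    "AUX": {"Exit No. 4"},
}

# Assumed pattern: a tag sequence represents an address if it contains
# at least one ROAD or POI tag among space-separated tags.
ADDRESS_PATTERN = re.compile(r"^(?:[A-Z]+ )*(?:ROAD|POI)(?: [A-Z]+)*$")


def tag_keyword(keyword):
    """Return the tag of the dictionary that contains the keyword, or None."""
    for tag, entries in DICTIONARIES.items():
        if keyword in entries:
            return tag
    return None


def quasi_address_string(keywords):
    """Build a quasi address string from keywords listed in request order."""
    tags = [tag_keyword(k) for k in keywords]

    # Keep the longest run of tags whose keywords are continuous in position.
    best, run = [], []
    for tag in tags:
        if tag is None:
            run = []
        else:
            run = run + [tag]
            if len(run) > len(best):
                best = run

    # The string group must look like address information.
    if not best or not ADDRESS_PATTERN.match(" ".join(best)):
        return None

    # Remove duplicate tags while preserving their order.
    return " ".join(dict.fromkeys(best))


print(quasi_address_string(
    ["go to", "Exit No. 4", "Caoyang Road", "Station", "Shanghai"]))
```

  • in this sketch, "go to" is not found in any dictionary, so it interrupts the run of tagged keywords and is excluded from the string group; a repeated tag such as ROAD ROAD POI collapses to ROAD POI.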
  • splitting the destination address information into at least one piece of sub-address information includes splitting, according to the address types represented by the preset text address dictionaries corresponding to different address types, the destination address information into sub-address information corresponding to the different address types.
  • the address type includes one or a combination of multiple of the following information: administrative region information, road name information, building/organization name information, auxiliary content of the road name, and auxiliary content of the building/organization name.
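  • as a rough illustration of splitting destination address information by address type, the sketch below looks up each type's dictionary in the destination text. The enum member names, the dictionary entries, the choice to treat "Caoyang Road Station" as a building/organization name, and the substring lookup are all assumptions for the example; the application does not prescribe this particular lookup.

```python
from enum import Enum


class AddressType(Enum):
    REGION = "administrative region"
    ROAD = "road name"
    ROAD_AUX = "auxiliary content of the road name"
    BUILDING = "building/organization name"
    BUILDING_AUX = "auxiliary content of the building/organization name"


# Hypothetical preset text address dictionaries, keyed by address type.
TYPE_DICTIONARIES = {
    AddressType.REGION: ["Shanghai"],
    AddressType.BUILDING: ["Caoyang Road Station"],
    AddressType.BUILDING_AUX: ["Exit No. 4"],
}


def split_address(destination):
    """Split destination text into sub-address information per address type."""
    parts = {}
    for addr_type, entries in TYPE_DICTIONARIES.items():
        # Prefer the longest dictionary entry found in the destination text.
        for entry in sorted(entries, key=len, reverse=True):
            if entry in destination:
                parts[addr_type] = entry
                break
    return parts


for addr_type, text in split_address(
        "Exit No. 4, Caoyang Road Station, Shanghai").items():
    print(addr_type.name, "->", text)
```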
  • matching the at least one piece of sub-address information or the at least one piece of sub-address information and the destination address information with different address information included in an address database includes correspondingly matching each of the at least one piece of sub-address information in the destination address information with sub-address information of a same address type in the address database.
  • matching the at least one piece of sub-address information or the at least one piece of sub-address information and the destination address information with different address information included in an address database includes selecting one piece of address information from the address database, determining sub-address information included in the selected address information, performing matching calculation between each of the at least one piece of sub-address information included in the to-be-found destination address and the sub-address information included in the selected address information to obtain a first match degree, where an address type of the sub-address information that is included in the to-be-found destination address and for which matching calculation is
  • obtaining a total match degree between the selected address information and the to-be-found destination address information according to the first match degree includes performing matching calculation between the selected address information and the to-be-found destination address information to obtain a second match degree, and obtaining the total match degree between the selected address information and the to-be-found destination address information according to the first match degree and the second match degree.
  • performing matching calculation between sub-address information included in the to-be-found destination address and the sub-address information included in the selected address information to obtain a first match degree includes: for each piece of sub-address information included in the to-be-found destination address information, finding, from the selected address information, sub-address information of the same address type; calculating an editing distance required for mutual conversion between the two pieces of sub-address information; and calculating the first match degree according to the editing distance obtained for each piece of sub-address information included in the to-be-found destination address information.
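  • a minimal sketch of a first match degree based on editing distance follows. The Levenshtein distance is a standard way to compute an editing distance; the normalization (one minus distance divided by the longer length) and the averaging over address types are assumptions for the example, since the passage above does not fix a formula. The function names and string keys are hypothetical.

```python
def edit_distance(a, b):
    """Levenshtein distance: insertions, deletions, substitutions cost 1."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(
                prev[j] + 1,               # delete ca
                cur[j - 1] + 1,            # insert cb
                prev[j - 1] + (ca != cb),  # substitute, free if equal
            ))
        prev = cur
    return prev[-1]


def first_match_degree(query_parts, candidate_parts):
    """Average per-type similarity between query and candidate sub-addresses.

    Both arguments map an address type to its sub-address text. Only
    sub-addresses of the same address type are compared; a type missing
    from the candidate contributes a similarity of 0 (an assumption).
    """
    scores = []
    for addr_type, query_text in query_parts.items():
        candidate_text = candidate_parts.get(addr_type)
        if candidate_text is None:
            scores.append(0.0)
            continue
        longest = max(len(query_text), len(candidate_text))
        if longest == 0:
            scores.append(1.0)
        else:
            distance = edit_distance(query_text, candidate_text)
            scores.append(1.0 - distance / longest)
    return sum(scores) / len(scores) if scores else 0.0


print(edit_distance("kitten", "sitting"))
print(first_match_degree({"road": "abcd"}, {"road": "abcf"}))
```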
  • setting address information that is obtained by matching and whose match degree is greater than a set threshold as found destination address information for output includes obtaining, by means of calculation, a total match degree between address information selected each time and the to-be-found destination address information, and determining in sequence a set quantity of total match degrees in descending order of the total match degrees, and setting address information selected corresponding to the determined total match degrees as the found destination address information for output.
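  • combining the first and second match degrees into a total match degree and outputting a set quantity of best candidates might look like the sketch below. The weighted sum, the default weight, and the function names are assumptions for illustration; the passage above states only that the total is obtained from the two degrees and that candidates are taken in descending order of total match degree.

```python
def total_match_degree(first, second, weight=0.5):
    """Assumed combination: weighted sum of the first and second degrees."""
    return weight * first + (1.0 - weight) * second


def top_matches(scored, quantity=3, threshold=0.0):
    """Return up to `quantity` addresses, best first, above the threshold.

    `scored` is a list of (address, total_match_degree) pairs.
    """
    ranked = sorted(scored, key=lambda item: item[1], reverse=True)
    kept = [(addr, total) for addr, total in ranked if total > threshold]
    return [addr for addr, _ in kept[:quantity]]


candidates = [
    ("A", total_match_degree(0.9, 0.7)),
    ("B", total_match_degree(0.4, 0.2)),
    ("C", total_match_degree(1.0, 1.0)),
]
print(top_matches(candidates, quantity=2, threshold=0.5))
```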
  • an address search device including an acquiring module configured to acquire address search request information, and determine to-be-found destination address information included in the address search request information, a splitting module configured to split the destination address information determined by the acquiring module into at least one piece of sub-address information, where the destination address information is formed by multiple pieces of sub-address information, and the multiple pieces of sub-address information respectively correspond to different address types, and a searching module configured to match the at least one piece of sub-address information obtained by the splitting module or the at least one piece of sub-address information and the destination address information with different address information included in an address database, where each piece of address information stored in the address database includes different sub-address information that forms the address information, and set address information that is obtained by the matching and whose match degree is greater than a set threshold as found destination address information for output.
  • the acquiring module is further configured to receive entered voice data, where the voice data is used to initiate an address search, and perform recognition on the voice data to obtain the to-be-found destination address information included in the voice data.
  • the acquiring module is further configured to obtain the to-be-found destination address information in the manner of determining at least one keyword included in the address search request information, finding, from preset text address dictionaries corresponding to different address types, a text address dictionary including the keyword for each determined keyword, and replacing the keyword with a string used to denote an address type corresponding to the found text address dictionary, determining, using a regular expression used to represent address information, whether a string group formed after each keyword is replaced with a corresponding string represents address information, determining, according to the string group, a quasi address string corresponding to the to-be-found destination address information when the string group represents address information, and setting the quasi address string as a CRF feature, and extracting the to-be-found destination address information from the address search request information based on a CRF algorithm.
  • when determining, according to the string group, a quasi address string corresponding to the to-be-found destination address information, the acquiring module is further configured to determine that multiple strings are included in the string group and that the keywords respectively replaced with the multiple strings are continuous in position in the address search request information; if none of the multiple strings is a duplicate, combine the multiple strings into one string group based on continuity of positions of the replaced keywords, as the quasi address string corresponding to the to-be-found destination address information; and if any of the multiple strings is a duplicate, remove the duplicate string and combine the at least one string obtained after the duplicate string is removed into one string group based on continuity of positions of the replaced keywords, as the quasi address string corresponding to the to-be-found destination address information.
  • the splitting module is further configured to split, according to the address types represented by the preset text address dictionaries corresponding to different address types, the destination address information into sub-address information corresponding to the different address types.
  • the address type includes one or a combination of multiple of the following information: administrative region information, road name information, building/organization name information, auxiliary content of the road name, and auxiliary content of the building/organization name.
  • the searching module is further configured to correspondingly match each of the at least one piece of sub-address information in the destination address information with sub-address information of a same address type in the address database.
  • the searching module is further configured to select one piece of address information from the address database, and determine sub-address information included in the selected address information, perform matching calculation between each of the at least one piece of sub-address information included in the to-be-found destination address and the sub-address information included in the selected address information to obtain a first match degree, where an address type of the sub-address information that is included in the to-be-found destination address and for which matching calculation is performed is the same as an address type of the sub-address information included in the selected address information, and obtain a total match degree between the selected address
  • when obtaining the total match degree between the selected address information and the to-be-found destination address information according to the first match degree, the searching module is further configured to perform matching calculation between the selected address information and the to-be-found destination address information to obtain a second match degree, and obtain the total match degree between the selected address information and the to-be-found destination address information according to the first match degree and the second match degree.
  • when performing matching calculation between sub-address information included in the to-be-found destination address and the sub-address information included in the selected address information to obtain a first match degree, the searching module is further configured to: for each piece of sub-address information included in the to-be-found destination address information, find, from the selected address information, sub-address information of the same address type; calculate an editing distance required for mutual conversion between the two pieces of sub-address information; and calculate the first match degree according to the editing distance obtained for each piece of sub-address information included in the to-be-found destination address information.
  • when setting address information that is obtained by the matching and whose match degree is greater than a set threshold as found destination address information for output, the searching module is further configured to obtain, by means of calculation, a total match degree between address information selected each time and the to-be-found destination address information, determine in sequence a set quantity of total match degrees in descending order of the total match degrees, and set the address information corresponding to the determined total match degrees as the found destination address information for output.
  • an address search device including a signal receiver configured to acquire address search request information, and determine to-be-found destination address information included in the address search request information, a processor configured to split the destination address information into at least one piece of sub-address information, where the destination address information is formed by multiple pieces of sub-address information, and the multiple pieces of sub-address information respectively correspond to different address types, match the at least one piece of sub-address information or the at least one piece of sub-address information and the destination address information with different address information included in an address database, where each piece of address information stored in the address database includes different sub-address information that forms the address information, and set address information that is obtained by matching and whose match degree is greater than a set threshold as found destination address information for output.
  • the processor further executes receiving entered voice data, where the voice data is used to initiate an address search, and performing recognition on the voice data to obtain the to-be-found destination address information included in the voice data.
  • the processor further executes obtaining to-be-found destination address information in the following manner:
  • determine at least one keyword included in the address search request information; for each determined keyword, find, from preset text address dictionaries corresponding to different address types, a text address dictionary including the keyword, and replace the keyword with a string used to denote the address type corresponding to the found text address dictionary; determine, using a regular expression used to represent address information, whether a string group formed after each keyword is replaced with a corresponding string represents address information; when the string group represents address information, determine, according to the string group, a quasi address string corresponding to the to-be-found destination address information; and set the quasi address string as a CRF feature, and extract the to-be-found destination address information from the address search request information based on a CRF algorithm.
  • the processor further executes the determining, according to the string group, a quasi address string corresponding to the to-be-found destination address information, which includes determining that multiple strings are included in the string group, and keywords that are respectively replaced with the multiple strings are continuous in position in the address search request information, combining the multiple strings into one string group based on continuity of positions of the replaced keywords, as the quasi address string corresponding to the to-be-found destination address information if none of the multiple strings is a duplicate, and removing the duplicate string if any of the multiple strings is a duplicate, and combining at least one string obtained after the duplicate string is removed into one string group based on continuity of positions of the replaced keywords, as the quasi address string corresponding to the to-be-found destination address information.
  • the processor further executes splitting the destination address information into at least one piece of sub-address information, which includes splitting, according to the address types represented by the preset text address dictionaries corresponding to different address types, the destination address information into sub-address information corresponding to the different address types.
  • the address type includes one or a combination of multiple of the following information: administrative region information, road name information, building/organization name information, auxiliary content of the road name, and auxiliary content of the building/organization name.
  • the processor further executes correspondingly matching each of the at least one piece of sub-address information in the destination address information with sub-address information of a same address type in the address database.
  • the processor further executes matching the at least one piece of sub-address information or the at least one piece of sub-address information and the destination address information with different address information included in an address database, which includes selecting one piece of address information from the address database, and determining sub-address information included in the selected address information, performing matching calculation between each of the at least one piece of sub-address information included in the to-be-found destination address and the sub-address information included in the selected address information to obtain a first match degree, where an address type of the sub-address information that is included in the to-be-found
  • the processor further executes obtaining a total match degree between the selected address information and the to-be-found destination address information according to the first match degree, which further includes performing matching calculation between the selected address information and the to-be-found destination address information to obtain a second match degree, and obtaining the total match degree between the selected address information and the to-be-found destination address information according to the first match degree and the second match degree.
  • the processor further executes performing matching calculation between sub-address information included in the to-be-found destination address and the sub-address information included in the selected address information to obtain a first match degree, which includes: for each piece of sub-address information included in the to-be-found destination address information, finding, from the selected address information, sub-address information of the same address type; calculating an editing distance required for mutual conversion between the two pieces of sub-address information; and calculating the first match degree according to the editing distance obtained for each piece of sub-address information included in the to-be-found destination address information.
  • the processor further executes setting address information that is obtained by matching and whose match degree is greater than a set threshold as found destination address information for output, which includes obtaining, by means of calculation, a total match degree between address information selected each time and the to-be-found destination address information, and determining in sequence a set quantity of total match degrees in descending order of the total match degrees, and setting address information selected corresponding to the determined total match degrees as the found destination address information for output.
  • the present application has the following beneficial effect.
  • address search request information is acquired, and to-be-found destination address information included in the address search request information is determined.
  • the destination address information is split into at least one piece of sub-address information, where the destination address information is formed by multiple pieces of sub-address information, and the multiple pieces of sub-address information respectively correspond to different address types.
  • the at least one piece of sub-address information or the at least one piece of sub-address information and the destination address information are matched with different address information included in an address database, where each piece of address information stored in the address database includes different sub-address information that forms the address information, and address information that is obtained by the matching and whose match degree is greater than a set threshold is used as found destination address information for output.
  • sub-address information corresponding to destination address information is extracted from the address search request information, which improves correctness of extraction of destination address information.
  • the extracted sub-address information and the destination address information are matched with different address information included in an address database, and then address information whose match degree is greater than a set threshold is used as found destination address information, which effectively increases correctness of address matching and accuracy of address searching.
  • FIG. 1 is a schematic structural diagram of an address search system according to Embodiment 1 of the present application.
  • FIG. 2 is a schematic structural diagram of an address search device according to Embodiment 2 of the present application.
  • FIG. 3 is a schematic structural diagram of an address search device according to Embodiment 3 of the present application.
  • FIG. 4 is a schematic flowchart of an address search method according to Embodiment 4 of the present application.
  • embodiments of the present application provide an address search method and device, including acquiring address search request information, determining to-be-found destination address information included in the address search request information, splitting the destination address information into at least one piece of sub-address information, where the destination address information is formed by multiple pieces of sub-address information, and the multiple pieces of sub-address information respectively correspond to different address types, matching the at least one piece of sub-address information or the at least one piece of sub-address information and the destination address information with different address information included in an address database, where each piece of address information stored in the address database includes different sub-address information that forms the address information, and setting address information that is obtained by the matching and whose match degree is greater than a set threshold as found destination address information for output.
  • sub-address information corresponding to destination address information is extracted from the address search request information when address search request information is acquired, which improves correctness of extraction of destination address information.
  • the extracted sub-address information and the destination address information are matched with different address information included in an address database, and then address information whose match degree is greater than a set threshold is used as found destination address information, which effectively increases correctness of address matching and accuracy of address searching.
  • FIG. 1 is a schematic structural diagram of an address search system according to Embodiment 1 of the present application.
  • the address search system includes a receiving device 11 , an address extracting device 12 , and an address matching device 13 .
  • the receiving device 11 is configured to acquire address search request information.
  • the address extracting device 12 is configured to determine to-be-found destination address information included in the address search request information, and split the destination address information into at least one piece of sub-address information, where the destination address information is formed by multiple pieces of sub-address information, and the multiple pieces of sub-address information respectively correspond to different address types.
  • the receiving device 11 is further configured to receive entered text data, where the text data is used to initiate an address search.
  • the address extracting device 12 is further configured to perform recognition on the text data to obtain the to-be-found destination address information included in the text data.
  • for example, the entered text data that is received is “Exit No. 4, Caoyang Road Station, Shanghai.” It can therefore be determined that the to-be-found destination address information included in the text data is Exit No. 4, Caoyang Road Station, Shanghai.
  • the receiving device 11 is further configured to receive entered voice data, where the voice data is used to initiate an address search.
  • the address extracting device 12 is further configured to perform recognition on the voice data to obtain the to-be-found destination address information included in the voice data.
  • for example, the entered voice data that is received is “go to Exit No. 4, Caoyang Road Station, Shanghai.” It can therefore be determined that the to-be-found destination address information included in the voice data is Exit No. 4, Caoyang Road Station, Shanghai.
  • the address search request information may further include other auxiliary information according to a language habit, for example, text information such as “go to.”
  • the address extracting device 12 is further configured to obtain the to-be-found destination address information by: determining at least one keyword included in the address search request information; for each determined keyword, finding, from preset text address dictionaries corresponding to different address types, a text address dictionary that includes the keyword, and replacing the keyword with a string that denotes the address type corresponding to the found text address dictionary; determining, using a regular expression that represents address information, whether the string group formed after each keyword is replaced with its corresponding string represents address information; when the string group represents address information, determining, according to the string group, a quasi address string corresponding to the to-be-found destination address information; and setting the quasi address string as a conditional random field (CRF) feature and extracting the to-be-found destination address information from the address search request information based on a CRF algorithm.
  • the address information is hierarchical. That is, there is address information that represents an address region, such as address information corresponding to an administrative region, for example, Beijing serves as a piece of address information, and then “Beijing” corresponds to an address region on a map. There is also address information that represents a specific location, for example, XX Building, No. XX, XX Road, XX District, XX City, where this piece of address information corresponds to a specific location of XX Building.
  • address information is classified to different address types according to different levels of address information.
  • the address type includes one or a combination of the following: administrative region information, road name information, building/organization name information, auxiliary content of the road name, and auxiliary content of the building/organization name.
  • an address type corresponding to “XX District, XX City” is administrative region information;
  • an address type corresponding to “XX Road” is road name information;
  • an address type corresponding to “No. XX” is auxiliary content of the road name;
  • an address type corresponding to “XX Building” is building/organization name information; and
  • an address type corresponding to “Room XX, Floor XX” is auxiliary content of the building/organization name.
  • auxiliary content of the road name indicates that it is meaningless without the road name in front of it, and a specific address cannot be located using the auxiliary content of the road name alone
  • auxiliary content of the building/organization name indicates that it is also meaningless without the building/organization name in front of it, and a specific address cannot be located using the auxiliary content of the building/organization name alone.
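The address types listed above, together with the rule that auxiliary content is meaningless without the name it qualifies, can be sketched as a small data structure. This is a minimal illustration only; the names `AddressType`, `REQUIRED_PARENT`, and `dangling_auxiliaries` are hypothetical and do not come from the application.

```python
from enum import Enum


class AddressType(Enum):
    ADMIN_REGION = "administrative region information"
    ROAD_NAME = "road name information"
    ROAD_AUX = "auxiliary content of the road name"
    BUILDING_NAME = "building/organization name information"
    BUILDING_AUX = "auxiliary content of the building/organization name"


# Auxiliary types only make sense together with the type they qualify.
REQUIRED_PARENT = {
    AddressType.ROAD_AUX: AddressType.ROAD_NAME,
    AddressType.BUILDING_AUX: AddressType.BUILDING_NAME,
}


def dangling_auxiliaries(parts):
    """Return the auxiliary types in `parts` whose parent type is missing.

    `parts` maps an AddressType to its text, e.g. {AddressType.ROAD_NAME: "XX Road"}.
    """
    return [
        t for t in parts
        if t in REQUIRED_PARENT and REQUIRED_PARENT[t] not in parts
    ]
```

An empty result means that every auxiliary piece is accompanied by the road or building/organization name it depends on.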
  • the preset text address dictionaries corresponding to different address types include at least: an administrative region dictionary, which includes administrative region information such as province, city, district, county, township, village, autonomous prefecture, league, and banner, for example, Beijing Municipality, Beijing, Shanghai Municipality, Shanghai, Shenzhen City, and Shenzhen; an organization/building end word dictionary, which includes end words of names of buildings/organizations, for example, police station, building, and center; and a street name dictionary, which includes street names, for example, XX Road, XX Street, and XX Station.
  • the following dictionaries are also included: a stop word dictionary, which includes words or phrases that convey the meaning of stop, for example, arrive; a symbol dictionary, which includes punctuation marks; and a number dictionary, which includes numbers and the like.
  • the at least one keyword included in the address search request information is “go to,” “Shanghai,” “Caoyang Road,” “Station,” and “Exit No. 4.”
  • Step 1: Find, from the preset text address dictionaries corresponding to different address types, a text address dictionary including the keyword, and replace the keyword with a string used to represent the address type corresponding to the found text address dictionary.
  • “go to” belongs to the stop word dictionary
  • “Shanghai” belongs to the administrative region dictionary
  • “Caoyang Road” and “Station” belong to the street name dictionary
  • “4” in “Exit No. 4” belongs to the number dictionary
  • “No.” and “Exit” in “Exit No. 4” belong to the end word dictionary.
  • a string of an address type corresponding to the stop word dictionary is SSS
  • a string of an address type corresponding to the administrative region dictionary is AAA
  • a string of an address type corresponding to the street name dictionary is RRR
  • a string of an address type corresponding to the number dictionary is DDD
  • a string of an address type corresponding to the end word dictionary is OOO.
  • a string group obtained after the replacement may be SSSAAARRRRRRDDDOOOOOO, or may be SSSAAA City RRR Road Station DDDOOOOOO. Whether a text used to represent an address type in the keyword is replaced is not limited herein.
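Step 1 can be sketched as a dictionary lookup followed by a replacement. The miniature dictionaries below contain only the keywords of the running example, and the function names are hypothetical. The order of the last three keywords ("4", "No.", "Exit") is an assumption chosen so that the output reproduces the string group SSSAAARRRRRRDDDOOOOOO given above.

```python
# Hypothetical miniature dictionaries; real ones would be far larger.
DICTIONARIES = {
    "SSS": {"go to"},                    # stop word dictionary
    "AAA": {"Shanghai"},                 # administrative region dictionary
    "RRR": {"Caoyang Road", "Station"},  # street name dictionary
    "DDD": {"4"},                        # number dictionary
    "OOO": {"No.", "Exit"},              # end word dictionary
}


def tag_keyword(keyword):
    """Return the type string of the first dictionary that contains keyword."""
    for tag, words in DICTIONARIES.items():
        if keyword in words:
            return tag
    return None


def tag_keywords(keywords):
    """Replace each keyword with its type string; unknown keywords are kept."""
    return [tag_keyword(k) or k for k in keywords]


keywords = ["go to", "Shanghai", "Caoyang Road", "Station", "4", "No.", "Exit"]
tags = tag_keywords(keywords)
string_group = "".join(tags)
```

Keeping unknown keywords unchanged mirrors the statement that whether every text fragment is replaced is not limited.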
  • Step 2: Determine, using a regular expression used to represent address information, whether a string group formed after each keyword is replaced with a corresponding string represents address information, and determine, according to the string group, a quasi address string corresponding to the to-be-found destination address information when the string group represents address information.
  • a regular expression describes and matches, using a single string, a string that meets a syntactical rule, and the regular expression involved in this embodiment of the present application describes address information.
  • whether the string group represents address information is determined using a regular expression.
  • “AAA City” and “RRR Road Station” may represent address information according to the determining using the regular expression;
  • SSS may not represent address information according to the determining using the regular expression.
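The regular-expression check of Step 2 might look like the following sketch. The application does not disclose the actual expression, so `ADDRESS_PATTERN` is purely an assumed pattern over the type strings: one or more administrative regions, then optional street names, an optional number, and optional end words.

```python
import re

# Assumed pattern; the expression actually used is not given in the text.
ADDRESS_PATTERN = re.compile(r"(?:AAA)+(?:RRR)*(?:DDD)?(?:OOO)*")


def represents_address(string_group):
    """True if the whole string group matches the assumed address pattern."""
    return ADDRESS_PATTERN.fullmatch(string_group) is not None


def find_address_span(string_group):
    """Return the first substring matching the address pattern, or None."""
    match = ADDRESS_PATTERN.search(string_group)
    return match.group(0) if match else None
```

With this pattern, a lone stop word string such as SSS is rejected, while the address-like part of a longer string group can still be located with `find_address_span`.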
  • Step 3: Set the quasi address string as a CRF feature, and extract the to-be-found destination address information from the address search request information based on a CRF algorithm.
  • the quasi address string is used as a CRF feature, but a CRF feature for extracting the destination address information based on the CRF algorithm is not limited to the quasi address string alone, and multiple strings may further be included.
  • the CRF feature for extracting the destination address information based on the CRF algorithm includes the CRF feature served by the quasi address string.
  • the extracted destination address information is Exit No. 4, Caoyang Road Station, Shanghai.
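To illustrate how the quasi address string could serve as one CRF feature among several, the sketch below builds one feature dictionary per keyword. It does not train or run a CRF; the feature names and the neighbouring-tag features are assumptions, since the text does not specify a feature template.

```python
def token_features(tokens, tags, quasi_tags):
    """Build one feature dict per token for a CRF-style sequence labeler.

    tokens: keywords from the request; tags: their type strings;
    quasi_tags: the set of type strings that make up the quasi address string.
    """
    features = []
    for i, (token, tag) in enumerate(zip(tokens, tags)):
        features.append({
            "token": token,
            "type_tag": tag,
            # Feature derived from the quasi address string.
            "in_quasi_address": tag in quasi_tags,
            # Assumed extra context features, marked BOS/EOS at the edges.
            "prev_type_tag": tags[i - 1] if i > 0 else "BOS",
            "next_type_tag": tags[i + 1] if i + 1 < len(tags) else "EOS",
        })
    return features
```

A sequence labeler trained on such features could then mark which keywords belong to the destination address, for example excluding "go to" from the extracted result.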
  • the determining, by the address extracting device 12, of the quasi address string corresponding to the to-be-found destination address information includes: determining that multiple strings are included in the string group and that the keywords respectively replaced with the multiple strings are continuous in position in the address search request information; if none of the multiple strings is a duplicate, combining the multiple strings into one string group, based on continuity of positions of the replaced keywords, as the quasi address string corresponding to the to-be-found destination address information; and if any of the multiple strings is a duplicate, removing the duplicate string and combining the at least one string that remains into one string group, based on continuity of positions of the replaced keywords, as the quasi address string corresponding to the to-be-found destination address information.
  • for example, none of AAA City, RRR Road Station, and DDD is a duplicate, so the quasi address string AAA City RRR Road Station DDD is obtained by means of combination.
  • for another example, among AAA, RRR, RRR, DDD, OOO, and OOO, the second RRR and the second OOO are duplicates; after they are removed, the quasi address string AAARRRDDDOOO is obtained by means of combination.
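The combination rule can be sketched in a few lines. The function name and the `sep` parameter are hypothetical, and the sketch assumes that "removing the duplicate string" means keeping the first occurrence of each string in position order.

```python
def quasi_address_string(strings, sep=""):
    """Combine position-contiguous type strings, keeping each distinct string once."""
    # dict.fromkeys preserves first-seen order while dropping repeats.
    return sep.join(dict.fromkeys(strings))
```

Called with the strings AAA, RRR, RRR, DDD, OOO, and OOO, this yields AAARRRDDDOOO; called with AAA City, RRR Road Station, and DDD and a space separator, nothing is removed and the three strings are simply joined.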
  • the address extracting device 12 is further configured to split the destination address information into at least one piece of sub-address information, where the destination address information is formed by multiple pieces of sub-address information, and the multiple pieces of sub-address information respectively correspond to different address types.
  • the address extracting device 12 is further configured to split, according to the address types represented by the preset text address dictionaries corresponding to different address types, the destination address information into sub-address information corresponding to the different address types.
  • for example, the sub-address information obtained by splitting “Exit No. 4, Caoyang Road Station, Shanghai” is: Shanghai (administrative region information), Caoyang Road Station (street name information), and Exit No. 4 (auxiliary information of the street name).
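Splitting by address type can be sketched as grouping tagged keywords. The mapping `TAG_TO_TYPE` is an assumption made for this example; in particular, treating both number and end word strings as auxiliary information of the street name is chosen only to reproduce the split of the Shanghai example.

```python
# Hypothetical mapping from type string to address type name.
TAG_TO_TYPE = {
    "AAA": "administrative region information",
    "RRR": "street name information",
    "DDD": "auxiliary information of the street name",
    "OOO": "auxiliary information of the street name",
}


def split_address(tagged_tokens):
    """Group (token, tag) pairs into sub-address information per address type."""
    parts = {}
    for token, tag in tagged_tokens:
        address_type = TAG_TO_TYPE.get(tag)
        if address_type is None:
            continue  # e.g. stop words such as "go to"
        parts.setdefault(address_type, []).append(token)
    return {t: " ".join(tokens) for t, tokens in parts.items()}


example = split_address([
    ("Shanghai", "AAA"),
    ("Caoyang Road", "RRR"),
    ("Station", "RRR"),
    ("Exit", "OOO"),
    ("No.", "OOO"),
    ("4", "DDD"),
])
```

The resulting dictionary keys the sub-address information by address type, which is the form needed for the per-type matching described next.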
  • the address matching device 13 is configured to match the at least one piece of sub-address information or the at least one piece of sub-address information and the destination address information with different address information included in an address database, where each piece of address information stored in the address database includes different sub-address information that forms the address information, and set address information that is obtained by the matching and whose match degree is greater than a set threshold as found destination address information for output.
  • the address matching device 13 is further configured to correspondingly match each of the at least one piece of sub-address information in the destination address information with sub-address information of a same address type in the address database.
  • the address matching device 13 is further configured to select one piece of address information from the address database, and determine sub-address information included in the selected address information, perform matching calculation between each of the at least one piece of sub-address information included in the to-be-found destination address and the sub-address information included in the selected address information to obtain a first match degree, where an address type of the sub-address information that is included in the to-be-found destination address and for which matching calculation is performed is the same as an address type of the sub-address information included in the selected address information, and obtain a total match degree between the selected address information and the to-be-found destination address information according to the first match degree.
  • each of the at least one piece of sub-address information in the destination address information is matched with sub-address information of a same address type in the address database.
  • the address matching device 13 performs, using an address type corresponding to the sub-address information as a granularity, matching calculation in sequence between the sub-address information that represents an address type in the to-be-found destination address information and the sub-address information that represents the same address type in the selected address information, and obtains, by means of calculation, the first match degree corresponding to the sub-address information.
  • a total match degree between the selected address information and the to-be-found destination address information is obtained according to the first match degree.
  • matching calculation is performed between the to-be-found destination address information and the selected address information to obtain a second match degree.
  • address information that includes only one or a few pieces of sub-address information of the to-be-found destination address information, for example, address information that includes only Exit No. 4, is effectively excluded from the address information obtained using the first match degree.
  • the total match degree between the selected address information and the to-be-found destination address information is obtained according to the first match degree and the second match degree.
  • the performing, by the address matching device 13, of matching calculation between the sub-address information included in the to-be-found destination address and the sub-address information included in the selected address information to obtain a first match degree further includes: for each piece of sub-address information included in the to-be-found destination address information, finding, from the selected address information, sub-address information of the same address type, and calculating an editing distance required for mutual conversion between the two pieces of sub-address information; and calculating the first match degree according to the editing distance obtained for each piece of sub-address information included in the to-be-found destination address information.
  • the editing distance between two strings refers to the minimum quantity of editing operations required to convert one string into the other.
  • an editing operation refers to replacing one character with another character, inserting one character, deleting one character, or the like.
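The editing distance defined above corresponds to what is commonly called the Levenshtein distance when the operations are limited to single-character replacement, insertion, and deletion. The sketch below assumes exactly those three operations with unit cost; the text leaves room for other operations ("or the like"), which are not modelled here.

```python
def edit_distance(a, b):
    """Minimum number of single-character insertions, deletions, or
    replacements needed to turn string a into string b."""
    prev = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # replace ca with cb, or keep it
            ))
        prev = curr
    return prev[-1]
```

Only two rows of the dynamic-programming table are kept at a time, so memory use grows with the length of one string rather than with the product of both lengths.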
  • the calculating, by the address matching device 13, of the first match degree according to the editing distance obtained for each piece of sub-address information included in the to-be-found destination address information further includes: calculating a sum of the editing distances obtained separately for the pieces of sub-address information included in the to-be-found destination address information, where the sum is the first match degree.
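Written as a formula, with symbols that are assumptions introduced here rather than notation from the application: let T be the set of address types present in the to-be-found destination address information, q_t the sub-address information of type t in the to-be-found destination address information, c_t the sub-address information of the same type in the selected address information, and d the editing distance. The first match degree M_1 is then:

```latex
M_1 = \sum_{t \in T} d(q_t, c_t)
```

Because M_1 is a raw sum of distances, a smaller value corresponds to sub-address strings that are closer to each other.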
  • the setting, by the address matching device 13, of address information that is obtained by the matching and whose match degree is greater than a set threshold as found destination address information for output further includes: obtaining, by means of calculation, a total match degree between the address information selected each time and the to-be-found destination address information; determining in sequence a set quantity of total match degrees in descending order of the total match degrees; and setting the address information selected corresponding to the determined total match degrees as the found destination address information for output.
  • the set similarity threshold may be defined by a system default parameter, or may be determined according to an actual requirement, which is not limited herein.
  • the set quantity may be defined by a system default parameter, or may be determined according to an actual requirement, which is not limited herein.
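The output selection can be sketched as a filter followed by a top-N pick. The sketch assumes that a higher total match degree means a better match and applies both the set threshold and the set quantity; the function name and the example scores are hypothetical.

```python
import heapq


def select_results(scored, threshold, quantity):
    """Return up to `quantity` addresses whose total match degree exceeds
    `threshold`, in descending order of total match degree.

    scored: iterable of (address, total_match_degree) pairs.
    """
    above = [(score, address) for address, score in scored if score > threshold]
    best = heapq.nlargest(quantity, above, key=lambda pair: pair[0])
    return [address for _, address in best]


candidates = [("A", 0.9), ("B", 0.4), ("C", 0.7), ("D", 0.8)]
top_two = select_results(candidates, threshold=0.5, quantity=2)
```

Both `threshold` and `quantity` are plain parameters here, matching the statement that they may come from a system default or from an actual requirement.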
  • sub-address information corresponding to destination address information is extracted from the address search request information when address search request information is acquired, which improves correctness of extraction of destination address information.
  • the extracted sub-address information and the destination address information are matched with different address information included in an address database, and then address information whose match degree is greater than a set threshold is used as found destination address information, which effectively increases correctness of address matching and accuracy of address searching.
  • FIG. 2 is a schematic structural diagram of an address search device according to Embodiment 2 of the present application.
  • the address search device includes an acquiring module 21, a splitting module 22, and a searching module 23, where the acquiring module 21 is configured to acquire address search request information, and determine to-be-found destination address information included in the address search request information.
  • the splitting module 22 is configured to split the destination address information determined by the acquiring module 21 into at least one piece of sub-address information, where the destination address information is formed by multiple pieces of sub-address information, and the multiple pieces of sub-address information respectively correspond to different address types.
  • the searching module 23 is configured to match the at least one piece of sub-address information obtained by the splitting module 22 or the at least one piece of sub-address information and the destination address information with different address information included in an address database, where each piece of address information stored in the address database includes different sub-address information that forms the address information, and set address information that is obtained by the matching and whose match degree is greater than a set threshold as found destination address information for output.
  • the acquiring module 21 is further configured to receive entered voice data, where the voice data is used to initiate an address search, and perform recognition on the voice data to obtain the to-be-found destination address information included in the voice data.
  • the acquiring module 21 is further configured to obtain the to-be-found destination address information in the manner of determining at least one keyword included in the address search request information, finding, from preset text address dictionaries corresponding to different address types, a text address dictionary including the keyword for each determined keyword, and replacing the keyword with a string used to denote an address type corresponding to the found text address dictionary, determining, using a regular expression used to represent address information, whether a string group formed after each keyword is replaced with a corresponding string represents address information, determining, according to the string group, a quasi address string corresponding to the to-be-found destination address information when the string group represents address information, and setting the quasi address string as a CRF feature, and extracting the to-be-found destination address information from the address search request information based on a CRF algorithm.
  • the determining, by the acquiring module 21, of the quasi address string corresponding to the to-be-found destination address information further includes: determining that multiple strings are included in the string group and that the keywords respectively replaced with the multiple strings are continuous in position in the address search request information; if none of the multiple strings is a duplicate, combining the multiple strings into one string group, based on continuity of positions of the replaced keywords, as the quasi address string corresponding to the to-be-found destination address information; and if any of the multiple strings is a duplicate, removing the duplicate string and combining the at least one string that remains into one string group, based on continuity of positions of the replaced keywords, as the quasi address string corresponding to the to-be-found destination address information.
  • the splitting module 22 is further configured to split, according to the address types represented by the preset text address dictionaries corresponding to different address types, the destination address information into sub-address information corresponding to the different address types.
  • the address type includes one or a combination of the following: administrative region information, road name information, building/organization name information, auxiliary content of the road name, and auxiliary content of the building/organization name.
  • the searching module 23 is further configured to correspondingly match each of the at least one piece of sub-address information in the destination address information with sub-address information of a same address type in the address database.
  • the searching module 23 is further configured to select one piece of address information from the address database, and determine sub-address information included in the selected address information, perform matching calculation between each of the at least one piece of sub-address information included in the to-be-found destination address and the sub-address information included in the selected address information to obtain a first match degree, where an address type of the sub-address information that is included in the to-be-found destination address and for which matching calculation is performed is the same as an address type of the sub-address information included in the selected address information, and obtain a total match degree between the selected address information and the to-be-found destination address information according to the first match degree.
  • the obtaining, by the searching module 23, of a total match degree between the selected address information and the to-be-found destination address information according to the first match degree further includes: performing matching calculation between the selected address information and the to-be-found destination address information to obtain a second match degree; and obtaining the total match degree between the selected address information and the to-be-found destination address information according to the first match degree and the second match degree.
  • the performing, by the searching module 23, of matching calculation between the sub-address information included in the to-be-found destination address and the sub-address information included in the selected address information to obtain a first match degree further includes: for each piece of sub-address information included in the to-be-found destination address information, finding, from the selected address information, sub-address information of the same address type, and calculating an editing distance required for mutual conversion between the two pieces of sub-address information; and calculating the first match degree according to the editing distance obtained for each piece of sub-address information included in the to-be-found destination address information.
  • the setting, by the searching module 23, of address information that is obtained by matching and whose match degree is greater than a set threshold as found destination address information for output further includes: obtaining, by means of calculation, a total match degree between the address information selected each time and the to-be-found destination address information; determining in sequence a set quantity of total match degrees in descending order of the total match degrees; and setting the address information selected corresponding to the determined total match degrees as the found destination address information for output.
  • the address search device described in this embodiment of the present application may be implemented by means of hardware, or may be implemented by means of software, and an implementation manner is not limited herein.
  • the address search device extracts sub-address information corresponding to destination address information from the address search request information when acquiring address search request information, which improves correctness of extraction of destination address information, and matches the extracted sub-address information and the destination address information with different address information included in an address database, and then sets address information whose match degree is greater than a set threshold as found destination address information, which effectively increases correctness of address matching and accuracy of address searching.
  • FIG. 3 is a schematic structural diagram of an address search device according to Embodiment 3 of the present application.
  • the address search device has functions described in Embodiment 4 of the present application.
  • the address search device may use a general computer system structure, where the computer system may further be a processor-based computer.
  • the address search device includes at least one processor 31 and a signal receiver 32.
  • the processor 31 and the signal receiver 32 are connected using a communications bus 33 .
  • the signal receiver 32 is configured to acquire address search request information, and determine to-be-found destination address information included in the address search request information.
  • the processor 31 is configured to split the destination address information into at least one piece of sub-address information, where the destination address information is formed by multiple pieces of sub-address information, and the multiple pieces of sub-address information respectively correspond to different address types, match the at least one piece of sub-address information or the at least one piece of sub-address information and the destination address information with different address information included in an address database, where each piece of address information stored in the address database includes different sub-address information that forms the address information, and set address information that is obtained by matching and whose match degree is greater than a set threshold as found destination address information for output.
  • the processor 31 further executes receiving entered voice data, where the voice data is used to initiate an address search, and performing recognition on the voice data to obtain the to-be-found destination address information included in the voice data.
  • the processor 31 further executes obtaining the to-be-found destination address information in the following manner: determining at least one keyword included in the address search request information; for each determined keyword, finding, from preset text address dictionaries corresponding to different address types, a text address dictionary that includes the keyword, and replacing the keyword with a string that denotes the address type corresponding to the found text address dictionary; determining, using a regular expression that represents address information, whether the string group formed after each keyword is replaced with its corresponding string represents address information; when the string group represents address information, determining, according to the string group, a quasi address string corresponding to the to-be-found destination address information; and setting the quasi address string as a CRF feature and extracting the to-be-found destination address information from the address search request information based on a CRF algorithm.
  • the processor 31 further executes determining that multiple strings are included in the string group, and keywords that are respectively replaced with the multiple strings are continuous in position in the address search request information, combining the multiple strings into one string group based on continuity of positions of the replaced keywords, as the quasi address string corresponding to the to-be-found destination address information if none of the multiple strings is a duplicate, and removing the duplicate string if any of the multiple strings is a duplicate, and combining at least one string obtained after the duplicate string is removed into one string group based on continuity of positions of the replaced keywords, as the quasi address string corresponding to the to-be-found destination address information.
  • the processor 31 further executes splitting the destination address information into at least one piece of sub-address information, which includes splitting, according to the address types represented by the preset text address dictionaries corresponding to different address types, the destination address information into sub-address information corresponding to the different address types.
  • the address type includes one or a combination of the following: administrative region information, road name information, building/organization name information, auxiliary content of the road name, and auxiliary content of the building/organization name.
  • the processor 31 further executes correspondingly matching each of the at least one piece of sub-address information in the destination address information with sub-address information of a same address type in the address database.
  • the processor 31 further executes matching the at least one piece of sub-address information or the at least one piece of sub-address information and the destination address information with different address information included in an address database, which includes selecting one piece of address information from the address database, and determining sub-address information included in the selected address information, performing matching calculation between each of the at least one piece of sub-address information included in the to-be-found destination address and the sub-address information included in the selected address information to obtain a first match degree, where an address type of the sub-address information that is included in the to-be-found destination address and for which matching calculation is performed is the same as an address type of the sub-address information included in the selected address information, and obtaining a total match degree between the selected address information and the to-be-found destination address information according to the first match degree.
  • the processor 31 further executes obtaining a total match degree between the selected address information and the to-be-found destination address information according to the first match degree, which further includes performing matching calculation between the selected address information and the to-be-found destination address information to obtain a second match degree, and obtaining the total match degree between the selected address information and the to-be-found destination address information according to the first match degree and the second match degree.
  • the processor 31 further executes performing matching calculation between sub-address information included in the to-be-found destination address and the sub-address information included in the selected address information to obtain a first match degree, which includes for each piece of sub-address information included in the to-be-found destination address information, finding, from the selected address information, sub-address information whose address type is the same as that of the sub-address information for the sub-address information, calculating an editing distance required for mutual conversion between the sub-address information and the found sub-address information, and calculating the first match degree according to an editing distance obtained corresponding to each piece of sub-address information included in the to-be-found destination address information.
  • the processor 31 further executes setting address information that is obtained by matching and whose match degree is greater than a set threshold as found destination address information for output, which includes obtaining, by means of calculation, a total match degree between address information selected each time and the to-be-found destination address information, and determining in sequence a set quantity of total match degrees in descending order of the total match degrees, and setting address information selected corresponding to the determined total match degrees as the found destination address information for output.
  • the processor 31 may be a general purpose central processing unit (CPU), a microprocessor, an application-specific integrated circuit (ASIC), or one or more integrated circuits used to control execution of the solution program of the present application.
  • the address search device when acquiring address search request information, extracts sub-address information corresponding to destination address information from the address search request information, which improves correctness of extraction of destination address information, and matches the extracted sub-address information and the destination address information with different address information included in an address database, and then sets address information whose match degree is greater than a set threshold as found destination address information, which effectively increases correctness of address matching and accuracy of address searching.
  • FIG. 4 is a schematic flowchart of an address search method according to Embodiment 4 of the present application. The method may be shown in the following steps.
  • Step 401: Acquire address search request information.
  • the address search request information includes to-be-found destination address information.
  • A manner for acquiring the address search request information includes but is not limited to the following: receiving entered text data, where the text data is used to initiate an address search, or receiving entered voice data, where the voice data is used to initiate an address search.
  • the method further includes performing recognition on the received voice data to obtain text data corresponding to the voice data.
  • Step 402: Determine to-be-found destination address information included in the address search request information.
  • the to-be-found destination address information is obtained in the following manner: determining at least one keyword included in the address search request information, finding, from preset text address dictionaries corresponding to different address types, a text address dictionary including the keyword for each determined keyword, and replacing the keyword with a string used to denote an address type corresponding to the found text address dictionary, determining, using a regular expression used to represent address information, whether a string group formed after each keyword is replaced with a corresponding string represents address information, determining, according to the string group, a quasi address string corresponding to the to-be-found destination address information when the string group represents address information, and setting the quasi address string as a CRF feature, and extracting the to-be-found destination address information from the address search request information based on a CRF algorithm.
  • the address information is hierarchical. That is, there is address information that represents an address region, such as address information corresponding to an administrative region, for example, Beijing serves as a piece of address information, and then “Beijing” corresponds to an address region on a map. There is also address information that represents a specific location, for example, XX Building, No. XX, XX Road, XX District, XX City, where this piece of address information corresponds to a specific location of XX Building.
  • address information is classified to different address types according to different levels of address information.
  • the address type includes one or a combination of multiple of the following information: administrative region information, road name information, building/organization name information, auxiliary content of the road name, and auxiliary content of the building/organization name.
  • an address type corresponding to “XX District, XX City” is administrative region information
  • an address type corresponding to “XX Road” is road name information
  • an address type corresponding to “No. XX” is auxiliary content of the road name
  • an address type corresponding to “XX Building” is building/organization name information
  • an address type corresponding to “Room XX, Floor XX” is auxiliary content of the building/organization name.
  • auxiliary content of the road name indicates that it is meaningless without the road name in front of it, and a specific address cannot be located using the auxiliary content of the road name alone.
  • auxiliary content of the building/organization name indicates that it is also meaningless without the building/organization name in front of it, and a specific address cannot be located using the auxiliary content of the building/organization name alone.
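  • The five address types and the dependency of the auxiliary types on their parent types can be sketched as follows. This is only an illustration: the names `AddressType`, `REQUIRES`, and `aux_has_parent` are hypothetical and do not appear in the application.

```python
from enum import Enum


class AddressType(Enum):
    ADMIN_REGION = "administrative region information"
    ROAD_NAME = "road name information"
    BUILDING_ORG_NAME = "building/organization name information"
    ROAD_AUX = "auxiliary content of the road name"
    BUILDING_ORG_AUX = "auxiliary content of the building/organization name"


# Hypothetical mapping: each auxiliary type is meaningful only together
# with the type it qualifies.
REQUIRES = {
    AddressType.ROAD_AUX: AddressType.ROAD_NAME,
    AddressType.BUILDING_ORG_AUX: AddressType.BUILDING_ORG_NAME,
}


def aux_has_parent(types):
    """Return True if every auxiliary type present comes with its parent type."""
    present = set(types)
    return all(parent in present
               for aux, parent in REQUIRES.items() if aux in present)
```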
  • the preset text address dictionaries corresponding to different address types include at least the following:
  • an administrative region dictionary, which includes administrative region information such as province, city, district, county, township, village, autonomous prefecture, league, and banner, for example, Beijing Municipality, Beijing, Shanghai Municipality, Shanghai, Shenzhen City, and Shenzhen;
  • an organization/building end word dictionary, which includes end words of names of buildings/organizations, for example, police station, building, and center;
  • a street name dictionary, which includes street names, for example, XX Road, XX Street, and XX Station;
  • a stop word dictionary, which includes words or phrases that convey the meaning of stop, for example, arrive;
  • a symbol dictionary, which includes punctuation marks; and
  • a number dictionary, which includes numbers and the like.
  • the at least one keyword included in the address search request information is “go to,” “Shanghai,” “Caoyang Road,” “Station,” and “Exit No. 4.”
  • Step 1: Find, from the preset text address dictionaries corresponding to different address types, a text address dictionary including the keyword, and replace the keyword using a string used to represent an address type corresponding to the found text address dictionary.
  • “go to” belongs to the stop word dictionary
  • “Shanghai” belongs to the administrative region dictionary
  • “Caoyang Road” and “Station” belong to the street name dictionary
  • “4” in “Exit No. 4” belongs to the number dictionary
  • “No.” and “Exit” in “Exit No. 4” belong to the end word dictionary.
  • a string of an address type corresponding to the stop word dictionary is SSS
  • a string of an address type corresponding to the administrative region dictionary is AAA
  • a string of an address type corresponding to the street name dictionary is RRR
  • a string of an address type corresponding to the number dictionary is DDD
  • a string of an address type corresponding to the end word dictionary is OOO.
  • a string group obtained after the replacement may be SSSAAARRRRRRDDDOOOOOO, or may be SSSAAA City RRR Road Station DDDOOOOOO. Whether a text used to represent an address type in the keyword is replaced is not limited herein.
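  • The replacement in Step 1 can be sketched as follows. The miniature dictionaries and the way "Exit No. 4" is broken into "4", "No.", and "Exit" are assumptions made for illustration; `DICTIONARIES`, `type_string`, and `to_string_group` are hypothetical names.

```python
# Hypothetical miniature dictionaries keyed by the type string from the example.
DICTIONARIES = {
    "SSS": {"go to"},                     # stop word dictionary
    "AAA": {"Shanghai"},                  # administrative region dictionary
    "RRR": {"Caoyang Road", "Station"},   # street name dictionary
    "DDD": {"4"},                         # number dictionary
    "OOO": {"No.", "Exit"},               # end word dictionary
}


def type_string(keyword):
    """Return the type string of the dictionary containing the keyword, or None."""
    for code, words in DICTIONARIES.items():
        if keyword in words:
            return code
    return None


def to_string_group(keywords):
    """Replace each keyword with its type string; unknown keywords are kept as-is."""
    return "".join(type_string(k) or k for k in keywords)


# Assumed tokenization of the example request.
keywords = ["go to", "Shanghai", "Caoyang Road", "Station", "4", "No.", "Exit"]
print(to_string_group(keywords))  # SSSAAARRRRRRDDDOOOOOO
```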
  • Step 2: Determine, using a regular expression used to represent address information, whether a string group formed after each keyword is replaced with a corresponding string represents address information, and determine, according to the string group, a quasi address string corresponding to the to-be-found destination address information when the string group represents address information.
  • a regular expression describes and matches, using a single string, a string that meets a syntactical rule, and the regular expression involved in this embodiment of the present application describes address information.
  • whether the string group represents address information is determined using a regular expression.
  • “AAA City” and “RRR Road Station” may represent address information according to the determining using the regular expression
  • SSS may not represent address information according to the determining using the regular expression
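  • The application does not give the regular expression itself. As a rough sketch, and assuming an address is one or more region strings, followed by one or more street strings, followed by optional number or end word strings, the check might look like this (`ADDRESS_PATTERN` and `find_address_span` are hypothetical):

```python
import re

# Assumed pattern, not taken from the application: region codes, then street
# codes, then any mix of number and end-word codes.
ADDRESS_PATTERN = re.compile(r"(?:AAA)+(?:RRR)+(?:DDD|OOO)*")


def find_address_span(string_group):
    """Return the part of the string group that looks like an address, or None."""
    match = ADDRESS_PATTERN.search(string_group)
    return match.group(0) if match else None
```

  • Under this assumed pattern, the stop word string SSS alone yields no match, while the remainder of the example string group is recognized as address information.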
  • Step 3: Set the quasi address string as a CRF feature, and extract the to-be-found destination address information from the address search request information based on a CRF algorithm.
  • the quasi address string is used as a CRF feature, but a CRF feature for extracting the destination address information based on the CRF algorithm is not limited to only the one CRF feature served by the quasi address string, and multiple CRF features may further be included.
  • the CRF feature for extracting the destination address information based on the CRF algorithm includes the CRF feature served by the quasi address string.
  • the extracted destination address information is Exit No. 4, Caoyang Road Station, Shanghai.
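  • One way to picture the quasi address string serving as one CRF feature among several is a per-token feature dictionary, as sketched below. The feature names, the token boundaries, and the `quasi_span` argument are all assumptions; the application does not specify the feature set or a CRF toolkit.

```python
def token_features(tokens, i, quasi_span):
    """Build features for token i.

    quasi_span is the (start, end) token range covered by the quasi address
    string, with end exclusive. All feature names here are hypothetical.
    """
    start, end = quasi_span
    return {
        "word": tokens[i],
        "is_first": i == 0,
        "is_last": i == len(tokens) - 1,
        # The feature derived from the quasi address string.
        "in_quasi_address": start <= i < end,
    }


tokens = ["go to", "Shanghai", "Caoyang Road", "Station", "Exit No. 4"]
features = [token_features(tokens, i, (1, 5)) for i in range(len(tokens))]
```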
  • determining, according to the string group, a quasi address string corresponding to the to-be-found destination address information further includes determining that multiple strings are included in the string group, and that keywords that are respectively replaced with the multiple strings are continuous in position in the address search request information, combining the multiple strings into one string group based on continuity of positions of the replaced keywords, as the quasi address string corresponding to the to-be-found destination address information if none of the multiple strings is a duplicate, and removing the duplicate string if any of the multiple strings is a duplicate, and combining at least one string obtained after the duplicate string is removed into one string group based on continuity of positions of the replaced keywords, as the quasi address string corresponding to the to-be-found destination address information.
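  • A minimal sketch of combining the strings into a quasi address string follows. It assumes the input is already a run of type strings whose keywords are continuous in position, and it interprets "removing the duplicate string" as keeping only the first occurrence of each string; `quasi_address_string` is a hypothetical name.

```python
def quasi_address_string(codes):
    """Join position-contiguous type strings, dropping repeated strings.

    Assumption: a duplicate is any string already seen earlier in the run,
    and the first occurrence is the one kept.
    """
    seen = set()
    kept = []
    for code in codes:
        if code not in seen:
            seen.add(code)
            kept.append(code)
    return "".join(kept)
```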
  • Step 403: Split the destination address information into at least one piece of sub-address information, where the destination address information is formed by multiple pieces of sub-address information, and the multiple pieces of sub-address information respectively correspond to different address types.
  • In step 403, the destination address information is split, according to the address types represented by the preset text address dictionaries corresponding to different address types, into sub-address information corresponding to the different address types.
  • sub-address information obtained by splitting “Exit No. 4, Caoyang Road Station, Shanghai” is Shanghai (administrative region information), Caoyang Road Station (street name information), and Exit No. 4 (auxiliary information of the street name).
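  • The split can be sketched as follows, under strong simplifying assumptions: the address is comma-separated, the region dictionary and street end words are tiny hypothetical samples, and anything unrecognized is treated as auxiliary information of the street name. A real implementation would consult the full preset dictionaries.

```python
# Hypothetical sample data standing in for the preset dictionaries.
ADMIN_REGIONS = {"Shanghai", "Beijing", "Shenzhen"}
STREET_END_WORDS = ("Road", "Street", "Station")


def classify(piece):
    """Assign an address type to one piece (simplified fallback rule)."""
    if piece in ADMIN_REGIONS:
        return "administrative region information"
    if piece.endswith(STREET_END_WORDS):
        return "street name information"
    return "auxiliary information of the street name"


def split_address(address):
    """Split a comma-separated address into {address type: sub-address}.

    If two pieces share a type, the later one overwrites the earlier one.
    """
    pieces = (s.strip() for s in address.split(","))
    return {classify(p): p for p in pieces}
```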
  • Step 404: Match the at least one piece of sub-address information or the at least one piece of sub-address information and the destination address information with different address information included in an address database, and set address information that is obtained by matching and whose match degree is greater than a set threshold as found destination address information for output.
  • Each piece of address information stored in the address database includes different sub-address information that forms the address information.
  • In step 404, first, one piece of address information is selected from the address database, and sub-address information included in the selected address information is determined.
  • Matching calculation is performed between each of the at least one piece of sub-address information included in the to-be-found destination address and the sub-address information included in the selected address information to obtain a first match degree, where an address type of the sub-address information that is included in the to-be-found destination address and for which matching calculation is performed is the same as an address type of the sub-address information included in the selected address information.
  • Performing matching calculation between sub-address information included in the to-be-found destination address and the sub-address information included in the selected address information to obtain a first match degree further includes for each piece of sub-address information included in the to-be-found destination address information, finding, from the selected address information, sub-address information whose address type is the same as that of the sub-address information for the sub-address information, calculating an editing distance required for mutual conversion between the sub-address information and the found sub-address information, and calculating the first match degree according to an editing distance obtained corresponding to each piece of sub-address information included in the to-be-found destination address information.
  • the editing distance between two strings refers to the minimum quantity of editing operations required to convert one string into the other.
  • the so-called editing operation refers to replacing one character with another character, or inserting one character, or deleting one character, or the like.
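  • Assuming the editing operations are limited to the three named above (replacement, insertion, deletion), the editing distance is the classic Levenshtein distance, which can be computed with dynamic programming as sketched below. The function name `edit_distance` is hypothetical.

```python
def edit_distance(a, b):
    """Levenshtein distance: minimum replacements, insertions, and deletions."""
    # prev[j] holds the distance between the processed prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # replace (or keep) ca
        prev = curr
    return prev[-1]
```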
  • one piece of address data selected from the address database is XXX Station, XXX District, XX City.
  • one piece of sub-address information “Shanghai” is selected from the to-be-found destination address information “Exit No. 4, Caoyang Road Station, Shanghai,” and the address type corresponding to “Shanghai” is determined as an administrative region.
  • sub-address information “XX City” that represents the administrative region is found from the “XXX Station, XXX District, XX City,” and an editing distance required for performing mutual conversion between “XX City” and “Shanghai” is calculated.
  • a first match degree between the sub-address information in the to-be-found destination address information and the sub-address information of the same address type in the selected address information is obtained according to the editing distance obtained by means of calculation.
  • After a first match degree corresponding to each piece of sub-address information included in the to-be-found destination address information is obtained, the first match degree between the sub-address information included in the to-be-found destination address information and the sub-address information included in the selected address information is obtained according to the obtained first match degrees.
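  • The application does not state how an editing distance is turned into a match degree. One possible sketch, assuming a normalized similarity of 1 minus the distance divided by the longer length, averaged over the sub-addresses of the query, with a score of 0 when the candidate lacks that address type (all names are hypothetical):

```python
def edit_distance(a, b):
    """Levenshtein distance via dynamic programming."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def similarity(a, b):
    """Assumed normalization: 1.0 for identical strings, 0.0 for fully different."""
    longest = max(len(a), len(b))
    return 1.0 if longest == 0 else 1.0 - edit_distance(a, b) / longest


def first_match_degree(query, candidate):
    """Average same-type similarity; query and candidate map type -> text."""
    if not query:
        return 0.0
    scores = [similarity(text, candidate[t]) if t in candidate else 0.0
              for t, text in query.items()]
    return sum(scores) / len(scores)
```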
  • One manner is setting the obtained first match degree as the total match degree between the selected address information and the to-be-found destination address information.
  • Another manner is continuing performing matching calculation between the selected address information and the to-be-found destination address information to obtain a second match degree.
  • the total match degree between the selected address information and the to-be-found destination address information is obtained according to the first match degree and the second match degree.
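  • Both manners can be captured in one small function, sketched below. The weighted sum and the default weight are assumptions; the application only says that the total match degree is obtained "according to" the first and second match degrees.

```python
def total_match_degree(first, second=None, weight_first=0.7):
    """Combine match degrees into a total match degree.

    With no second match degree, the first is used directly (first manner).
    Otherwise an assumed weighted sum is used (second manner); the weight
    0.7 is an arbitrary illustrative default.
    """
    if second is None:
        return first
    return weight_first * first + (1.0 - weight_first) * second
```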
  • the setting address information that is obtained by matching and whose match degree is greater than a set threshold as found destination address information for output further includes obtaining, by means of calculation, a total match degree between address information selected each time and the to-be-found destination address information, and determining in sequence a set quantity of total match degrees in descending order of the total match degrees, and setting address information selected corresponding to the determined total match degrees as the found destination address information for output.
  • set similarity threshold may be defined by a system default parameter, or may be determined according to an actual requirement, which is not limited herein.
  • set quantity may be defined by a system default parameter, or may be determined according to an actual requirement, which is not limited herein.
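  • The output selection can be sketched as a threshold filter followed by taking a set quantity of results in descending order of total match degree. Applying both criteria together, and the default values shown, are assumptions; `select_results` is a hypothetical name.

```python
def select_results(scored, threshold=0.6, quantity=3):
    """Pick addresses to output from (address, total_match_degree) pairs.

    Keeps pairs whose degree is greater than the threshold, sorts them in
    descending order of degree, and returns at most `quantity` addresses.
    """
    above = [(addr, score) for addr, score in scored if score > threshold]
    above.sort(key=lambda pair: pair[1], reverse=True)
    return [addr for addr, _ in above[:quantity]]
```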
  • address search request information is acquired, and to-be-found destination address information included in the address search request information is determined.
  • the destination address information is split into at least one piece of sub-address information, where the destination address information is formed by multiple pieces of sub-address information, and the multiple pieces of sub-address information respectively correspond to different address types.
  • the at least one piece of sub-address information or the at least one piece of sub-address information and the destination address information are matched with different address information included in an address database, where each piece of address information stored in the address database includes different sub-address information that forms the address information, and address information that is obtained by the matching and whose match degree is greater than a set threshold is used as found destination address information for output.
  • sub-address information corresponding to destination address information is extracted from the address search request information, which improves correctness of extraction of destination address information.
  • the extracted sub-address information and the destination address information are matched with different address information included in an address database, and then address information whose match degree is greater than a set threshold is used as found destination address information, which effectively increases correctness of address matching and accuracy of address searching.
  • the embodiments of the present application may be provided as a method, an apparatus (device), or a computer program product. Therefore, the present application may use a form of hardware only embodiments, software only embodiments, or embodiments with a combination of software and hardware. Moreover, the present application may use a form of a computer program product that is implemented on one or more computer-usable storage media (including but not limited to a disk memory, a compact-disc read-only memory (CD-ROM), an optical memory, and the like) that include computer-usable program code.
  • These computer program instructions may be provided for a general-purpose computer, a dedicated computer, an embedded processor, or a processor of any other programmable data processing device to generate a machine such that the instructions executed by a computer or a processor of any other programmable data processing device generate an apparatus for implementing a specific function in one or more processes in the flowcharts and/or in one or more blocks in the block diagrams.
  • These computer program instructions may also be stored in a computer readable memory that can instruct the computer or any other programmable data processing device to work in a specific manner such that the instructions stored in the computer readable memory generate an artifact that includes an instruction apparatus.
  • the instruction apparatus implements a specific function in one or more processes in the flowcharts and/or in one or more blocks in the block diagrams.
  • These computer program instructions may also be loaded onto a computer or another programmable data processing device such that a series of operations and steps are performed on the computer or the other programmable device, thereby generating computer-implemented processing. Therefore, the instructions executed on the computer or the other programmable device provide steps for implementing a specific function in one or more processes in the flowcharts and/or in one or more blocks in the block diagrams.

Abstract

An address search method and device, where the method includes acquiring address search request information, determining to-be-found destination address information included in the address search request information, splitting the destination address information into at least one piece of sub-address information, where the destination address information is formed by multiple pieces of sub-address information, and the multiple pieces of sub-address information respectively correspond to different address types, matching the at least one piece of sub-address information or the at least one piece of sub-address information and the destination address information with different address information included in an address database, where each piece of address information stored in the address database includes different sub-address information that forms the address information, and setting address information that is obtained by matching and whose match degree is greater than a set threshold as found destination address information for output.

Description

    CROSS-REFERENCE TO RELATED APPLICATIONS
  • This application is a continuation of U.S. patent application Ser. No. 15/398,260, filed on Jan. 4, 2017, which is a continuation of International Patent Application No. PCT/CN2015/079816, filed on May 26, 2015, which claims priority to Chinese Patent Application No. 201410525978.X filed on Sep. 30, 2014. All of the aforementioned patent applications are hereby incorporated by reference in their entireties.
  • TECHNICAL FIELD
  • The present application relates to the field of data processing technologies, and in particular, to an address search method and device.
  • BACKGROUND
  • With the development of communications technologies and terminal technologies, increasingly more applications depend on mobile terminals, particularly navigation applications. For example, a user can initiate a navigation request to a mobile terminal in a voice or text input manner, and a navigation application selects a navigation route according to the received navigation request, and pushes the selected navigation route to the user.
  • Further, a manner in which the navigation application selects the navigation route according to the received navigation request includes but is not limited to the following.
  • A conditional random field (CRF) algorithm is used to extract an address name or a building name/organization name from the received navigation request, and determine a destination address of the navigation request using the extracted address name or building name/organization name when the navigation request is entered in a text form.
  • Address information included in the voice is used as a search basis to determine a destination address of the navigation request when the navigation request is entered in a voice form.
  • A disadvantage is that, when the destination address is being determined, only the address name (or the building name/organization name) extracted from the text or voice input is used to match address information in an address database. As a result, a determined destination address set includes many irrelevant addresses, which reduces accuracy of address searching.
  • SUMMARY
  • In view of this, embodiments of the present application provide an address search method and device in order to solve a currently existing problem that address searching accuracy is relatively low because multiple irrelevant addresses are found in an address search process.
  • According to a first aspect of the present application, an address search method is provided, including acquiring address search request information, determining to-be-found destination address information included in the address search request information, splitting the destination address information into at least one piece of sub-address information, where the destination address information is formed by multiple pieces of sub-address information, and the multiple pieces of sub-address information respectively correspond to different address types, matching the at least one piece of sub-address information or the at least one piece of sub-address information and the destination address information with different address information included in an address database, where each piece of address information stored in the address database includes different sub-address information that forms the address information, and setting address information that is obtained by the matching and whose match degree is greater than a set threshold as found destination address information for output.
  • With reference to the first aspect of the present application, in a first possible manner, acquiring address search request information includes receiving entered voice data, where the voice data is used to initiate an address search, and performing recognition on the voice data to obtain the to-be-found destination address information included in the voice data.
  • With reference to the first aspect of the present application, or with reference to the first possible implementation manner of the first aspect of the present application, in a second possible manner, the to-be-found destination address information is obtained in the following manner: determining at least one keyword included in the address search request information, finding, from preset text address dictionaries corresponding to different address types, a text address dictionary including the keyword for each determined keyword, and replacing the keyword with a string used to denote an address type corresponding to the found text address dictionary, determining, using a regular expression used to represent address information, whether a string group formed after each keyword is replaced with a corresponding string represents address information, determining, according to the string group, a quasi address string corresponding to the to-be-found destination address information when the string group represents address information, and setting the quasi address string as a CRF feature, and extracting the to-be-found destination address information from the address search request information based on a CRF algorithm.
  • With reference to the second possible implementation manner of the first aspect of the present application, in a third possible implementation manner, determining, according to the string group, a quasi address string corresponding to the to-be-found destination address information includes determining that multiple strings are included in the string group, and that keywords that are respectively replaced with the multiple strings are continuous in position in the address search request information, combining the multiple strings into one string group based on continuity of positions of the replaced keywords, as the quasi address string corresponding to the to-be-found destination address information if none of the multiple strings is a duplicate, and removing the duplicate string, and combining at least one string obtained after the duplicate string is removed into one string group based on continuity of positions of the replaced keywords, as the quasi address string corresponding to the to-be-found destination address information if any of the multiple strings is a duplicate.
  • With reference to the first aspect of the present application, or with reference to the first possible implementation manner of the first aspect of the present application, or with reference to the second possible implementation manner of the first aspect of the present application, or with reference to the third possible implementation manner of the first aspect of the present application, in a fourth possible manner, splitting the destination address information into at least one piece of sub-address information includes splitting, according to the address types represented by the preset text address dictionaries corresponding to different address types, the destination address information into sub-address information corresponding to the different address types.
  • With reference to the first aspect of the present application, or with reference to the first possible implementation manner of the first aspect of the present application, or with reference to the second possible implementation manner of the first aspect of the present application, or with reference to the third possible implementation manner of the first aspect of the present application, or with reference to the fourth possible implementation manner of the first aspect of the present application, in a fifth possible manner, the address type includes one or a combination of multiple of the following information: administrative region information, road name information, building/organization name information, auxiliary content of the road name, and auxiliary content of the building/organization name.
  • With reference to the first aspect of the present application, or with reference to the first possible implementation manner of the first aspect of the present application, or with reference to the second possible implementation manner of the first aspect of the present application, or with reference to the third possible implementation manner of the first aspect of the present application, or with reference to the fourth possible implementation manner of the first aspect of the present application, or with reference to the fifth possible implementation manner of the first aspect of the present application, in a sixth possible manner, matching the at least one piece of sub-address information or the at least one piece of sub-address information and the destination address information with different address information included in an address database includes correspondingly matching each of the at least one piece of sub-address information in the destination address information with sub-address information of a same address type in the address database.
  • With reference to the first aspect of the present application, or with reference to the first possible implementation manner of the first aspect of the present application, or with reference to the second possible implementation manner of the first aspect of the present application, or with reference to the third possible implementation manner of the first aspect of the present application, or with reference to the fourth possible implementation manner of the first aspect of the present application, or with reference to the fifth possible implementation manner of the first aspect of the present application, in a seventh possible manner, matching the at least one piece of sub-address information or the at least one piece of sub-address information and the destination address information with different address information included in an address database includes selecting one piece of address information from the address database, determining sub-address information included in the selected address information, performing matching calculation between each of the at least one piece of sub-address information included in the to-be-found destination address and the sub-address information included in the selected address information to obtain a first match degree, where an address type of the sub-address information that is included in the to-be-found destination address and for which matching calculation is performed is the same as an address type of the sub-address information included in the selected address information, and obtaining a total match degree between the selected address information and the to-be-found destination address information according to the first match degree.
  • With reference to the seventh possible implementation manner of the first aspect of the present application, in an eighth possible manner, obtaining a total match degree between the selected address information and the to-be-found destination address information according to the first match degree includes performing matching calculation between the selected address information and the to-be-found destination address information to obtain a second match degree, and obtaining the total match degree between the selected address information and the to-be-found destination address information according to the first match degree and the second match degree.
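  • As an illustration of the eighth possible manner, the following sketch combines a first match degree (obtained per piece of sub-address information) with a second match degree (obtained on the whole address string). The application does not state how the second match degree is calculated or how the two degrees are combined; the weighted sum, the `weight` default, the function names, and the use of Python's `difflib.SequenceMatcher` for the whole-string comparison are assumptions made only for this example.

```python
from difflib import SequenceMatcher


def second_match_degree(query_address: str, candidate_address: str) -> float:
    """Whole-string similarity in [0, 1] (assumed measure, not from the application)."""
    return SequenceMatcher(None, query_address, candidate_address).ratio()


def total_match_degree(first: float, second: float, weight: float = 0.7) -> float:
    """Weighted sum of the two degrees; `weight` is a hypothetical tuning parameter."""
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must lie in [0, 1]")
    return weight * first + (1.0 - weight) * second
```

  • A larger `weight` favors the type-by-type comparison, whereas a smaller one favors the comparison of the complete address strings.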
  • With reference to the sixth possible implementation manner of the first aspect of the present application, or with reference to the seventh possible implementation manner of the first aspect of the present application, or with reference to the eighth possible implementation manner of the first aspect of the present application, in a ninth possible manner, performing matching calculation between sub-address information included in the to-be-found destination address and the sub-address information included in the selected address information to obtain a first match degree includes, for each piece of sub-address information included in the to-be-found destination address information, finding, from the selected address information, sub-address information of the same address type, and calculating an editing distance required for mutual conversion between the two pieces of sub-address information, and then calculating the first match degree according to the editing distances obtained for all pieces of sub-address information included in the to-be-found destination address information.
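  • The type-by-type calculation described above can be sketched as follows. The editing distance is implemented here as the standard Levenshtein distance. The application does not give a formula for turning editing distances into a first match degree, so the normalization `1 - distance / longest_length` and the averaging over address types are assumptions, as are the function names and the dictionary-based representation of sub-address information.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: fewest insertions, deletions, and substitutions."""
    prev = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        curr = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            curr.append(min(prev[j] + 1,          # delete ch_a
                            curr[j - 1] + 1,      # insert ch_b
                            prev[j - 1] + cost))  # substitute or keep
        prev = curr
    return prev[-1]


def first_match_degree(query: dict, candidate: dict) -> float:
    """Average per-type similarity.

    Both arguments map an address type to a piece of sub-address information.
    A type missing from the candidate is compared against the empty string.
    """
    if not query:
        return 0.0
    scores = []
    for address_type, query_sub in query.items():
        candidate_sub = candidate.get(address_type, "")
        longest = max(len(query_sub), len(candidate_sub))
        distance = edit_distance(query_sub, candidate_sub)
        # Assumed normalization: identical strings score 1, disjoint ones 0.
        scores.append(1.0 - distance / longest if longest else 1.0)
    return sum(scores) / len(scores)
```

  • Only sub-address information of the same address type is compared, so a road name in the query is never scored against a building name in the database entry.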
  • With reference to the sixth possible implementation manner of the first aspect of the present application, or with reference to the seventh possible implementation manner of the first aspect of the present application, or with reference to the eighth possible implementation manner of the first aspect of the present application, or with reference to the ninth possible implementation manner of the first aspect of the present application, in a tenth possible manner, setting address information that is obtained by matching and whose match degree is greater than a set threshold as found destination address information for output includes obtaining, by means of calculation, a total match degree between address information selected each time and the to-be-found destination address information, and determining in sequence a set quantity of total match degrees in descending order of the total match degrees, and setting address information selected corresponding to the determined total match degrees as the found destination address information for output.
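  • A minimal sketch of the output step follows, assuming that a total match degree has already been calculated for each piece of address information selected from the address database. The function name, the `(address, degree)` pair representation, and the default values of `quantity` and `threshold` are hypothetical.

```python
import heapq


def select_found_addresses(scored, quantity=3, threshold=0.0):
    """Return up to `quantity` (address, degree) pairs, highest degree first.

    `scored` is an iterable of (address, total_match_degree) pairs. Only
    pairs whose degree is greater than `threshold` are considered.
    """
    above = [(address, degree) for address, degree in scored if degree > threshold]
    return heapq.nlargest(quantity, above, key=lambda pair: pair[1])
```

  • For example, with the scores `[("A", 0.4), ("B", 0.9), ("C", 0.7), ("D", 0.2)]`, a quantity of 2, and a threshold of 0.3, the function returns the entries for "B" and "C" in that order.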
  • According to a second aspect of the present application, an address search device is provided, including an acquiring module configured to acquire address search request information, and determine to-be-found destination address information included in the address search request information, a splitting module configured to split the destination address information determined by the acquiring module into at least one piece of sub-address information, where the destination address information is formed by multiple pieces of sub-address information, and the multiple pieces of sub-address information respectively correspond to different address types, and a searching module configured to match the at least one piece of sub-address information obtained by the splitting module or the at least one piece of sub-address information and the destination address information with different address information included in an address database, where each piece of address information stored in the address database includes different sub-address information that forms the address information, and set address information that is obtained by the matching and whose match degree is greater than a set threshold as found destination address information for output.
  • With reference to the second aspect of the present application, in a first possible manner, the acquiring module is further configured to receive entered voice data, where the voice data is used to initiate an address search, and perform recognition on the voice data to obtain the to-be-found destination address information included in the voice data.
  • With reference to the second aspect of the present application, or with reference to the first possible implementation manner of the second aspect of the present application, in a second possible manner, the acquiring module is further configured to obtain the to-be-found destination address information in the manner of determining at least one keyword included in the address search request information, finding, from preset text address dictionaries corresponding to different address types, a text address dictionary including the keyword for each determined keyword, and replacing the keyword with a string used to denote an address type corresponding to the found text address dictionary, determining, using a regular expression used to represent address information, whether a string group formed after each keyword is replaced with a corresponding string represents address information, determining, according to the string group, a quasi address string corresponding to the to-be-found destination address information when the string group represents address information, and setting the quasi address string as a CRF feature, and extracting the to-be-found destination address information from the address search request information based on a CRF algorithm.
  • With reference to the second possible implementation manner of the second aspect of the present application, in a third possible implementation manner, when determining, according to the string group, a quasi address string corresponding to the to-be-found destination address information, the acquiring module is further configured to determine that multiple strings are included in the string group and that the keywords respectively replaced with the multiple strings are continuous in position in the address search request information; if none of the multiple strings is a duplicate, combine the multiple strings into one string group based on continuity of positions of the replaced keywords, as the quasi address string corresponding to the to-be-found destination address information; and if any of the multiple strings is a duplicate, remove the duplicate string and combine the at least one string obtained after the duplicate string is removed into one string group based on continuity of positions of the replaced keywords, as the quasi address string corresponding to the to-be-found destination address information.
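  • The combination of strings into a quasi address string can be sketched as follows. The input representation (a position-sorted list of `(position, string)` pairs for the replaced keywords), the tag spellings such as `<CITY>`, and the choice to keep the first occurrence of a duplicate string are assumptions for illustration; the application states only that duplicates are removed and that strings are combined based on continuity of keyword positions.

```python
def quasi_address_strings(tagged):
    """Group type strings whose keywords are adjacent, dropping duplicates.

    `tagged` is a list of (keyword position, type string) pairs sorted by
    position. Each run of consecutive positions yields one quasi address
    string.
    """
    groups, current, last_position = [], [], None
    for position, tag in tagged:
        # A gap in positions ends the current run of continuous keywords.
        if last_position is not None and position != last_position + 1:
            groups.append(current)
            current = []
        current.append(tag)
        last_position = position
    if current:
        groups.append(current)
    # dict.fromkeys removes repeated strings while keeping first-seen order.
    return ["".join(dict.fromkeys(group)) for group in groups]
```

  • Keywords that are not continuous in position end up in separate groups, so an isolated place name elsewhere in the request is not merged into the main quasi address string.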
  • With reference to the second aspect of the present application, or with reference to the first possible implementation manner of the second aspect of the present application, or with reference to the second possible implementation manner of the second aspect of the present application, or with reference to the third possible implementation manner of the second aspect of the present application, in a fourth possible manner, the splitting module is further configured to split, according to the address types represented by the preset text address dictionaries corresponding to different address types, the destination address information into sub-address information corresponding to the different address types.
  • With reference to the second aspect of the present application, or with reference to the first possible implementation manner of the second aspect of the present application, or with reference to the second possible implementation manner of the second aspect of the present application, or with reference to the third possible implementation manner of the second aspect of the present application, or with reference to the fourth possible implementation manner of the second aspect of the present application, in a fifth possible manner, the address type includes one or a combination of the following information: administrative region information, road name information, building/organization name information, auxiliary content of the road name, and auxiliary content of the building/organization name.
  • With reference to the second aspect of the present application, or with reference to the first possible implementation manner of the second aspect of the present application, or with reference to the second possible implementation manner of the second aspect of the present application, or with reference to the third possible implementation manner of the second aspect of the present application, or with reference to the fourth possible implementation manner of the second aspect of the present application, or with reference to the fifth possible implementation manner of the second aspect of the present application, in a sixth possible manner, the searching module is further configured to correspondingly match each of the at least one piece of sub-address information in the destination address information with sub-address information of a same address type in the address database.
  • With reference to the second aspect of the present application, or with reference to the first possible implementation manner of the second aspect of the present application, or with reference to the second possible implementation manner of the second aspect of the present application, or with reference to the third possible implementation manner of the second aspect of the present application, or with reference to the fourth possible implementation manner of the second aspect of the present application, or with reference to the fifth possible implementation manner of the second aspect of the present application, in a seventh possible manner, the searching module is further configured to select one piece of address information from the address database, and determine sub-address information included in the selected address information, perform matching calculation between each of the at least one piece of sub-address information included in the to-be-found destination address and the sub-address information included in the selected address information to obtain a first match degree, where an address type of the sub-address information that is included in the to-be-found destination address and for which matching calculation is performed is the same as an address type of the sub-address information included in the selected address information, and obtain a total match degree between the selected address information and the to-be-found destination address information according to the first match degree.
  • With reference to the seventh implementation manner of the second aspect of the present application, in an eighth possible manner, that the searching module is obtaining a total match degree between the selected address information and the to-be-found destination address information according to the first match degree further includes performing matching calculation between the selected address information and the to-be-found destination address information to obtain a second match degree, and obtaining the total match degree between the selected address information and the to-be-found destination address information according to the first match degree and the second match degree.
  • With reference to the sixth possible implementation manner of the second aspect of the present application, or with reference to the seventh possible implementation manner of the second aspect of the present application, or with reference to the eighth possible implementation manner of the second aspect of the present application, in a ninth possible manner, when performing matching calculation between sub-address information included in the to-be-found destination address and the sub-address information included in the selected address information to obtain a first match degree, the searching module is further configured to, for each piece of sub-address information included in the to-be-found destination address information, find, from the selected address information, sub-address information of the same address type, and calculate an editing distance required for mutual conversion between the two pieces of sub-address information, and then calculate the first match degree according to the editing distances obtained for all pieces of sub-address information included in the to-be-found destination address information.
  • With reference to the sixth possible implementation manner of the second aspect of the present application, or with reference to the seventh possible implementation manner of the second aspect of the present application, or with reference to the eighth possible implementation manner of the second aspect of the present application, or with reference to the ninth possible implementation manner of the second aspect of the present application, in a tenth possible manner, that the searching module is setting address information that is obtained by the matching and whose match degree is greater than a set threshold as found destination address information for output further includes obtaining, by means of calculation, a total match degree between address information selected each time and the to-be-found destination address information, and determining in sequence a set quantity of total match degrees in descending order of the total match degrees, and setting address information selected corresponding to the determined total match degrees as the found destination address information for output.
  • According to a third aspect of the present application, an address search device is provided, including a signal receiver configured to acquire address search request information, and determine to-be-found destination address information included in the address search request information, a processor configured to split the destination address information into at least one piece of sub-address information, where the destination address information is formed by multiple pieces of sub-address information, and the multiple pieces of sub-address information respectively correspond to different address types, match the at least one piece of sub-address information or the at least one piece of sub-address information and the destination address information with different address information included in an address database, where each piece of address information stored in the address database includes different sub-address information that forms the address information, and set address information that is obtained by matching and whose match degree is greater than a set threshold as found destination address information for output.
  • With reference to the third aspect of the present application, in a first possible manner, the processor further executes receiving entered voice data, where the voice data is used to initiate an address search, and performing recognition on the voice data to obtain the to-be-found destination address information included in the voice data.
  • With reference to the third aspect of the present application, or with reference to the first possible implementation manner of the third aspect of the present application, in a second possible manner, the processor further executes obtaining the to-be-found destination address information in the following manner: determining at least one keyword included in the address search request information, finding, from preset text address dictionaries corresponding to different address types, a text address dictionary including the keyword for each determined keyword, and replacing the keyword with a string used to denote an address type corresponding to the found text address dictionary, determining, using a regular expression used to represent address information, whether a string group formed after each keyword is replaced with a corresponding string represents address information, determining, according to the string group, a quasi address string corresponding to the to-be-found destination address information when the string group represents address information, and setting the quasi address string as a CRF feature, and extracting the to-be-found destination address information from the address search request information based on a CRF algorithm.
  • With reference to the second possible implementation manner of the third aspect of the present application, in a third possible manner, the processor further executes the determining, according to the string group, a quasi address string corresponding to the to-be-found destination address information, which includes determining that multiple strings are included in the string group, and keywords that are respectively replaced with the multiple strings are continuous in position in the address search request information, combining the multiple strings into one string group based on continuity of positions of the replaced keywords, as the quasi address string corresponding to the to-be-found destination address information if none of the multiple strings is a duplicate, and removing the duplicate string if any of the multiple strings is a duplicate, and combining at least one string obtained after the duplicate string is removed into one string group based on continuity of positions of the replaced keywords, as the quasi address string corresponding to the to-be-found destination address information.
  • With reference to the third aspect of the present application, or with reference to the first possible implementation manner of the third aspect of the present application, or with reference to the second possible implementation manner of the third aspect of the present application, or with reference to the third possible implementation manner of the third aspect of the present application, in a fourth possible manner, the processor further executes splitting the destination address information into at least one piece of sub-address information, which includes splitting, according to the address types represented by the preset text address dictionaries corresponding to different address types, the destination address information into sub-address information corresponding to the different address types.
  • With reference to the third aspect of the present application, or with reference to the first possible implementation manner of the third aspect of the present application, or with reference to the second possible implementation manner of the third aspect of the present application, or with reference to the third possible implementation manner of the third aspect of the present application, or with reference to the fourth possible implementation manner of the third aspect of the present application, in a fifth possible manner, the address type includes one or a combination of the following information: administrative region information, road name information, building/organization name information, auxiliary content of the road name, and auxiliary content of the building/organization name.
  • With reference to the third aspect of the present application, or with reference to the first possible implementation manner of the third aspect of the present application, or with reference to the second possible implementation manner of the third aspect of the present application, or with reference to the third possible implementation manner of the third aspect of the present application, or with reference to the fourth possible implementation manner of the third aspect of the present application, or with reference to the fifth possible implementation manner of the third aspect of the present application, in a sixth possible manner, the processor further executes correspondingly matching each of the at least one piece of sub-address information in the destination address information with sub-address information of a same address type in the address database.
  • With reference to the third aspect of the present application, or with reference to the first possible implementation manner of the third aspect of the present application, or with reference to the second possible implementation manner of the third aspect of the present application, or with reference to the third possible implementation manner of the third aspect of the present application, or with reference to the fourth possible implementation manner of the third aspect of the present application, or with reference to the fifth possible implementation manner of the third aspect of the present application, in a seventh possible manner, the processor further executes matching the at least one piece of sub-address information or the at least one piece of sub-address information and the destination address information with different address information included in an address database, which includes selecting one piece of address information from the address database, and determining sub-address information included in the selected address information, performing matching calculation between each of the at least one piece of sub-address information included in the to-be-found destination address and the sub-address information included in the selected address information to obtain a first match degree, where an address type of the sub-address information that is included in the to-be-found destination address and for which matching calculation is performed is the same as an address type of the sub-address information included in the selected address information, and obtaining a total match degree between the selected address information and the to-be-found destination address information according to the first match degree.
  • With reference to the seventh possible implementation manner of the third aspect of the present application, in an eighth possible manner, the processor further executes obtaining a total match degree between the selected address information and the to-be-found destination address information according to the first match degree, which further includes performing matching calculation between the selected address information and the to-be-found destination address information to obtain a second match degree, and obtaining the total match degree between the selected address information and the to-be-found destination address information according to the first match degree and the second match degree.
  • With reference to the sixth possible implementation manner of the third aspect of the present application, or with reference to the seventh possible implementation manner of the third aspect of the present application, or with reference to the eighth possible implementation manner of the third aspect of the present application, in a ninth possible manner, the processor further executes performing matching calculation between sub-address information included in the to-be-found destination address and the sub-address information included in the selected address information to obtain a first match degree, which includes, for each piece of sub-address information included in the to-be-found destination address information, finding, from the selected address information, sub-address information of the same address type, and calculating an editing distance required for mutual conversion between the two pieces of sub-address information, and then calculating the first match degree according to the editing distances obtained for all pieces of sub-address information included in the to-be-found destination address information.
  • With reference to the sixth possible implementation manner of the third aspect of the present application, or with reference to the seventh possible implementation manner of the third aspect of the present application, or with reference to the eighth possible implementation manner of the third aspect of the present application, or with reference to the ninth possible implementation manner of the third aspect of the present application, in a tenth possible manner, the processor further executes setting address information that is obtained by matching and whose match degree is greater than a set threshold as found destination address information for output, which includes obtaining, by means of calculation, a total match degree between address information selected each time and the to-be-found destination address information, and determining in sequence a set quantity of total match degrees in descending order of the total match degrees, and setting address information selected corresponding to the determined total match degrees as the found destination address information for output.
  • The present application has the following beneficial effects.
  • In the embodiments of the present application, address search request information is acquired, and to-be-found destination address information included in the address search request information is determined. The destination address information is split into at least one piece of sub-address information, where the destination address information is formed by multiple pieces of sub-address information, and the multiple pieces of sub-address information respectively correspond to different address types. The at least one piece of sub-address information or the at least one piece of sub-address information and the destination address information are matched with different address information included in an address database, where each piece of address information stored in the address database includes different sub-address information that forms the address information, and address information that is obtained by the matching and whose match degree is greater than a set threshold is used as found destination address information for output. In the embodiments of the present application, when address search request information is acquired, sub-address information corresponding to destination address information is extracted from the address search request information, which improves correctness of extraction of destination address information. The extracted sub-address information and the destination address information are matched with different address information included in an address database, and then address information whose match degree is greater than a set threshold is used as found destination address information, which effectively increases correctness of address matching and accuracy of address searching.
  • BRIEF DESCRIPTION OF DRAWINGS
  • To describe the technical solutions in the embodiments of the present application more clearly, the following briefly introduces the accompanying drawings required for describing the embodiments. The accompanying drawings in the following description show merely some embodiments of the present application, and a person of ordinary skill in the art may still derive other drawings from these accompanying drawings without creative efforts.
  • FIG. 1 is a schematic structural diagram of an address search system according to Embodiment 1 of the present application.
  • FIG. 2 is a schematic structural diagram of an address search device according to Embodiment 2 of the present application.
  • FIG. 3 is a schematic structural diagram of an address search device according to Embodiment 3 of the present application.
  • FIG. 4 is a schematic flowchart of an address search method according to Embodiment 4 of the present application.
  • DESCRIPTION OF EMBODIMENTS
  • To achieve an objective of the present application, embodiments of the present application provide an address search method and device, including acquiring address search request information, determining to-be-found destination address information included in the address search request information, splitting the destination address information into at least one piece of sub-address information, where the destination address information is formed by multiple pieces of sub-address information, and the multiple pieces of sub-address information respectively correspond to different address types, matching the at least one piece of sub-address information or the at least one piece of sub-address information and the destination address information with different address information included in an address database, where each piece of address information stored in the address database includes different sub-address information that forms the address information, and setting address information that is obtained by the matching and whose match degree is greater than a set threshold as found destination address information for output. In the embodiments of the present application, sub-address information corresponding to destination address information is extracted from the address search request information when address search request information is acquired, which improves correctness of extraction of destination address information. The extracted sub-address information and the destination address information are matched with different address information included in an address database, and then address information whose match degree is greater than a set threshold is used as found destination address information, which effectively increases correctness of address matching and accuracy of address searching.
  • The following further describes the embodiments of the present application in detail with reference to accompanying drawings in this specification. The described embodiments are merely some but not all of the embodiments of the present application. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present application without creative efforts shall fall within the protection scope of the present application.
  • Embodiment 1
  • FIG. 1 is a schematic structural diagram of an address search system according to Embodiment 1 of the present application. As shown in FIG. 1, the address search system includes a receiving device 11, an address extracting device 12, and an address matching device 13.
  • The receiving device 11 is configured to acquire address search request information.
  • The address extracting device 12 is configured to determine to-be-found destination address information included in the address search request information, and split the destination address information into at least one piece of sub-address information, where the destination address information is formed by multiple pieces of sub-address information, and the multiple pieces of sub-address information respectively correspond to different address types.
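  • One way to picture the splitting step is a dictionary lookup per address type, as in the sketch below. The dictionary contents, the address type names used as keys, the function name, and the longest-entry-first strategy are assumptions for illustration and are not taken from the application.

```python
def split_address(address, dictionaries):
    """Split an address into {address type: sub-address information}.

    `dictionaries` maps each address type to a set of known entries.
    Longer entries are tried first so that "Caoyang Road Station" is not
    mistaken for the shorter road name "Caoyang Road".
    """
    candidates = sorted(
        ((entry, address_type)
         for address_type, entries in dictionaries.items()
         for entry in entries),
        key=lambda pair: len(pair[0]),
        reverse=True,
    )
    remaining = address
    parts = {}
    for entry, address_type in candidates:
        if address_type not in parts and entry in remaining:
            parts[address_type] = entry
            # Blank out the matched text so shorter entries cannot reuse it.
            remaining = remaining.replace(entry, " ", 1)
    return parts
```

  • With hypothetical dictionaries in which "Shanghai" is an administrative region, "Caoyang Road" is a road name, and "Caoyang Road Station" is a building/organization name, the address "Exit No. 4, Caoyang Road Station, Shanghai" is split into a building/organization part and an administrative region part. Text that matches no dictionary, such as "Exit No. 4", is left unassigned in this sketch.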
  • The receiving device 11 is further configured to receive entered text data, where the text data is used to initiate an address search.
  • The address extracting device 12 is further configured to perform recognition on the text data to obtain the to-be-found destination address information included in the text data.
  • For example, the entered text data that is received is “Exit No. 4, Caoyang Road Station, Shanghai.” Therefore, it can be determined that the to-be-found destination address information included in the text data is Exit No. 4, Caoyang Road Station, Shanghai.
  • The receiving device 11 is further configured to receive entered voice data, where the voice data is used to initiate an address search.
  • The address extracting device 12 is further configured to perform recognition on the voice data to obtain the to-be-found destination address information included in the voice data.
  • For example, the entered voice data that is received is “go to Exit No. 4, Caoyang Road Station, Shanghai.” Therefore, it can be determined that the to-be-found destination address information included in the voice data is Exit No. 4, Caoyang Road Station, Shanghai.
  • That is, in addition to including the to-be-found destination address information, the address search request information may further include other auxiliary information according to a language habit, for example, text information such as “go to.”
  • The address extracting device 12 is further configured to obtain the to-be-found destination address information by: determining at least one keyword included in the address search request information; for each determined keyword, finding, from preset text address dictionaries corresponding to different address types, a text address dictionary including the keyword, and replacing the keyword with a string used to denote an address type corresponding to the found text address dictionary; determining, using a regular expression used to represent address information, whether a string group formed after each keyword is replaced with a corresponding string represents address information; when the string group represents address information, determining, according to the string group, a quasi address string corresponding to the to-be-found destination address information; and setting the quasi address string as a CRF feature, and extracting the to-be-found destination address information from the address search request information based on a CRF algorithm.
  • It should be noted that the address information is hierarchical. That is, there is address information that represents an address region, such as address information corresponding to an administrative region, for example, Beijing serves as a piece of address information, and then “Beijing” corresponds to an address region on a map. There is also address information that represents a specific location, for example, XX Building, No. XX, XX Road, XX District, XX City, where this piece of address information corresponds to a specific location of XX Building.
  • Therefore, address information is classified into different address types according to different levels of address information. The address type includes one or a combination of the following: administrative region information, road name information, building/organization name information, auxiliary content of the road name, and auxiliary content of the building/organization name.
  • For example, for the address information “Room XX, Floor XX, XX Building, No. XX, XX Road, XX District, XX City”, an address type corresponding to “XX District, XX City” is administrative region information, an address type corresponding to “XX Road” is road name information, an address type corresponding to “No. XX” is auxiliary content of the road name, an address type corresponding to “XX Building” is building/organization name information, and an address type corresponding to “Room XX, Floor XX” is auxiliary content of the building/organization name.
  • It should be noted that the auxiliary content of the road name is meaningless without the road name in front of it, and a specific address cannot be located using the auxiliary content of the road name alone. Likewise, the auxiliary content of the building/organization name is meaningless without the building/organization name in front of it, and a specific address cannot be located using the auxiliary content of the building/organization name alone.
  • The preset text address dictionaries corresponding to different address types include at least: an administrative region dictionary, which includes administrative region information such as province, city, district, county, township, village, autonomous prefecture, league, and banner, for example, Beijing Municipality, Beijing, Shanghai Municipality, Shanghai, Shenzhen City, and Shenzhen; an organization/building end word dictionary, which includes end words of names of buildings/organizations, for example, police station, building, and center; and a street name dictionary, which includes street names, for example, XX Road, XX Street, and XX Station. In addition, the following dictionaries are also included: a stop word dictionary, which includes words or phrases that convey the meaning of stop, for example, arrive; a symbol dictionary, which includes punctuation marks; and a number dictionary, which includes numbers and the like.
  • For example, the at least one keyword included in the address search request information is “go to,” “Shanghai,” “Caoyang Road,” “Station,” and “Exit No. 4.”
  • In this case, for each obtained keyword, execute the following steps.
  • Step 1: Find, from the preset text address dictionaries corresponding to different address types, a text address dictionary including the keyword, and replace the keyword using a string used to represent an address type corresponding to the found text address dictionary.
  • For example, “go to” belongs to the stop word dictionary, “Shanghai” belongs to the administrative region dictionary, “Caoyang Road” and “Station” belong to the street name dictionary, “4” in “Exit No. 4” belongs to the number dictionary, and “No.” and “Exit” in “Exit No. 4” belong to the end word dictionary.
  • A string of an address type corresponding to the stop word dictionary is SSS, a string of an address type corresponding to the administrative region dictionary is AAA, a string of an address type corresponding to the street name dictionary is RRR, a string of an address type corresponding to the number dictionary is DDD, and a string of an address type corresponding to the end word dictionary is OOO.
  • Then a string group obtained after the replacement may be SSSAAARRRRRRDDDOOOOOO, or may be SSSAAA City RRR Road Station DDDOOOOOO. Whether a text used to represent an address type in the keyword is replaced is not limited herein.
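  • The replacement in Step 1 can be sketched as follows. This is a minimal illustration only: the dictionary contents, the names TYPE_DICTIONARIES, type_string, and replace_keywords, and the treatment of numbers via a digit check are assumptions made for the example, not details given in this application.

```python
# Hypothetical dictionaries keyed by the type strings used in the example above.
# The entries are illustrative; a real system would load much larger dictionaries.
TYPE_DICTIONARIES = {
    "SSS": {"go to"},                    # stop word dictionary
    "AAA": {"Shanghai", "Beijing"},      # administrative region dictionary
    "RRR": {"Caoyang Road", "Station"},  # street name dictionary
    "OOO": {"No.", "Exit"},              # end word dictionary
}


def type_string(keyword):
    """Return the type string for a keyword, or None if no dictionary contains it."""
    if keyword.isdigit():
        return "DDD"  # number dictionary (assumed here to be a simple digit check)
    for code, entries in TYPE_DICTIONARIES.items():
        if keyword in entries:
            return code
    return None


def replace_keywords(keywords):
    """Replace each keyword with its type string; unknown keywords are kept as-is."""
    return [type_string(k) or k for k in keywords]


keywords = ["go to", "Shanghai", "Caoyang Road", "Station", "4", "No.", "Exit"]
string_group = "".join(replace_keywords(keywords))
print(string_group)
```

  • With these assumed dictionaries, the keywords of the example map to the string group SSSAAARRRRRRDDDOOOOOO given above.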
  • Step 2: Determine, using a regular expression used to represent address information, whether a string group formed after each keyword is replaced with a corresponding string represents address information, and determine, according to the string group, a quasi address string corresponding to the to-be-found destination address information when the string group represents address information.
  • It should be noted that a regular expression describes and matches, using a single string, a string that meets a syntactical rule, and the regular expression involved in this embodiment of the present application describes address information.
  • For example, for the string group formed after each keyword is replaced with the corresponding string, whether the string group represents address information is determined using a regular expression.
  • According to the determination using the regular expression, “AAA City” and “RRR Road Station” may represent address information, whereas “SSS” may not represent address information.
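  • The check in Step 2 might look like the following sketch. The application does not disclose the regular expression itself, so the pattern below (optional region codes, one or more street codes, an optional number, optional end words, or region codes alone) is purely an assumed example operating on the compact type strings.

```python
import re

# Assumed pattern over type strings; not the regular expression of the application.
ADDRESS_PATTERN = re.compile(r"(?:AAA)*(?:RRR)+(?:DDD)?(?:OOO)*|(?:AAA)+")


def represents_address(string_group):
    """Return True if the whole string group matches the assumed address pattern."""
    return ADDRESS_PATTERN.fullmatch(string_group) is not None


for candidate in ["AAA", "RRRRRR", "AAARRRRRRDDDOOOOOO", "SSS"]:
    print(candidate, represents_address(candidate))
```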
  • Step 3: Set the quasi address string as a CRF feature, and extract the to-be-found destination address information from the address search request information based on a CRF algorithm.
  • It should be noted that the quasi address string is used as a CRF feature, but the CRF features used for extracting the destination address information based on the CRF algorithm are not limited to the quasi address string alone, and multiple strings may further be included. However, the CRF features used for the extraction include at least the feature provided by the quasi address string.
  • In this case, the extracted destination address information is Exit No. 4, Caoyang Road Station, Shanghai.
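  • One way to expose the quasi address string to a CRF is as a per-keyword feature, as in the sketch below. The feature names, the BOS/EOS boundary markers, and the function token_features are assumptions for illustration; the application does not specify the feature layout, and no particular CRF toolkit is implied.

```python
def token_features(tokens, codes, in_quasi_address):
    """Build one feature dictionary per keyword for a sequence-labelling model.

    tokens: keywords of the request; codes: their type strings;
    in_quasi_address: whether each keyword falls inside the quasi address string.
    """
    features = []
    last = len(tokens) - 1
    for i, token in enumerate(tokens):
        features.append({
            "token": token,
            "type_code": codes[i],
            "in_quasi_address": in_quasi_address[i],  # feature derived from Step 2
            "prev_type_code": codes[i - 1] if i > 0 else "BOS",
            "next_type_code": codes[i + 1] if i < last else "EOS",
        })
    return features


tokens = ["go to", "Shanghai", "Caoyang Road", "Station", "4", "No.", "Exit"]
codes = ["SSS", "AAA", "RRR", "RRR", "DDD", "OOO", "OOO"]
flags = [False, True, True, True, True, True, True]
features = token_features(tokens, codes, flags)
```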
  • Determining, by the address extracting device 12, the quasi address string corresponding to the to-be-found destination address information includes: determining that multiple strings are included in the string group, and that keywords that are respectively replaced with the multiple strings are continuous in position in the address search request information; if none of the multiple strings is a duplicate, combining the multiple strings into one string group based on continuity of positions of the replaced keywords, as the quasi address string corresponding to the to-be-found destination address information; and if any of the multiple strings is a duplicate, removing the duplicate string, and combining at least one string obtained after the duplicate string is removed into one string group based on continuity of positions of the replaced keywords, as the quasi address string corresponding to the to-be-found destination address information.
  • For example, none of the obtained multiple strings, AAA City, RRR Road Station, and DDD, is a duplicate, and then a quasi address string, AAA City RRR Road Station DDD, is obtained by means of combination.
  • As another example, the obtained multiple strings AAA, RRR, RRR, DDD, OOO, and OOO include duplicates; after the duplicates are removed, a quasi address string, AAARRRDDDOOO, is obtained by means of combination.
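  • The combination with duplicate removal can be sketched as below. The function name quasi_address_string and the sep parameter are hypothetical, and removing every repeated string while keeping the first occurrence is an assumed reading of "removing the duplicate string"; the text does not say whether only adjacent repeats are meant.

```python
def quasi_address_string(strings, sep=""):
    """Drop repeated strings (keeping first occurrences, in order) and join the rest."""
    unique = list(dict.fromkeys(strings))  # dict preserves insertion order
    return sep.join(unique)


print(quasi_address_string(["AAA City", "RRR Road Station", "DDD"], sep=" "))
print(quasi_address_string(["AAA", "RRR", "RRR", "DDD", "OOO", "OOO"]))
```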
  • The address extracting device 12 is further configured to split the destination address information into at least one piece of sub-address information, where the destination address information is formed by multiple pieces of sub-address information, and the multiple pieces of sub-address information respectively correspond to different address types.
  • The address extracting device 12 is further configured to split, according to the address types represented by the preset text address dictionaries corresponding to different address types, the destination address information into sub-address information corresponding to the different address types.
  • For example, the sub-address information obtained by splitting “Exit No. 4, Caoyang Road Station, Shanghai” is: Shanghai (administrative region information), Caoyang Road Station (street name information), and Exit No. 4 (auxiliary information of the street name).
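  • The splitting by address type can be illustrated with the sketch below. The mapping CODE_TO_TYPE, including the choice to treat number and end-word strings as auxiliary content of the street name, and the function split_address are assumptions chosen to reproduce the example above; they are not prescribed by the application.

```python
# Assumed mapping from type strings to address types for this example only.
CODE_TO_TYPE = {
    "AAA": "administrative region",
    "RRR": "street name",
    "DDD": "street auxiliary",
    "OOO": "street auxiliary",
}


def split_address(tagged_keywords):
    """Group (keyword, type string) pairs into sub-address information per address type."""
    parts = {}
    for text, code in tagged_keywords:
        address_type = CODE_TO_TYPE.get(code)
        if address_type is None:
            continue  # e.g. stop words carry no address information
        parts.setdefault(address_type, []).append(text)
    return {address_type: " ".join(words) for address_type, words in parts.items()}


tagged = [("Exit", "OOO"), ("No.", "OOO"), ("4", "DDD"),
          ("Caoyang Road", "RRR"), ("Station", "RRR"), ("Shanghai", "AAA")]
sub_addresses = split_address(tagged)
print(sub_addresses)
```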
  • The address matching device 13 is configured to match the at least one piece of sub-address information or the at least one piece of sub-address information and the destination address information with different address information included in an address database, where each piece of address information stored in the address database includes different sub-address information that forms the address information, and set address information that is obtained by the matching and whose match degree is greater than a set threshold as found destination address information for output.
  • The address matching device 13 is further configured to correspondingly match each of the at least one piece of sub-address information in the destination address information with sub-address information of a same address type in the address database.
  • The address matching device 13 is further configured to select one piece of address information from the address database, and determine sub-address information included in the selected address information, perform matching calculation between each of the at least one piece of sub-address information included in the to-be-found destination address and the sub-address information included in the selected address information to obtain a first match degree, where an address type of the sub-address information that is included in the to-be-found destination address and for which matching calculation is performed is the same as an address type of the sub-address information included in the selected address information, and obtain a total match degree between the selected address information and the to-be-found destination address information according to the first match degree.
  • That is, each of the at least one piece of sub-address information in the destination address information is matched with sub-address information of a same address type in the address database. The address matching device 13 performs, using an address type corresponding to the sub-address information as a granularity, matching calculation in sequence between the sub-address information that represents an address type in the to-be-found destination address information and the sub-address information that represents the same address type in the selected address information, and obtains, by means of calculation, the first match degree corresponding to the sub-address information.
  • In this case, a total match degree between the selected address information and the to-be-found destination address information is obtained according to the first match degree.
  • Optionally, on this basis, matching calculation is performed between the to-be-found destination address information and the selected address information to obtain a second match degree.
  • By performing the two steps of match degree calculation, address information that includes only one or a few pieces of sub-address information of the to-be-found destination address information, for example, address information that includes only Exit No. 4, is effectively excluded from the address information obtained using the first match degree.
  • The total match degree between the selected address information and the to-be-found destination address information is obtained according to the first match degree and the second match degree.
  • Performing, by the address matching device 13, matching calculation between the sub-address information included in the to-be-found destination address and the sub-address information included in the selected address information to obtain a first match degree further includes: for each piece of sub-address information included in the to-be-found destination address information, finding, from the selected address information, sub-address information whose address type is the same as that of the piece of sub-address information, and calculating an editing distance required for mutual conversion between the piece of sub-address information and the found sub-address information; and calculating the first match degree according to the editing distance obtained for each piece of sub-address information included in the to-be-found destination address information.
  • It should be noted that the editing distance between two strings refers to the minimum quantity of editing operations required to convert one string into the other. An editing operation refers to replacing one character with another character, inserting one character, deleting one character, or the like.
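  • The editing distance described here corresponds to what is commonly called the Levenshtein distance. A minimal dynamic-programming sketch follows, assuming each replacement, insertion, and deletion costs one operation; the function name edit_distance is chosen for illustration.

```python
def edit_distance(a, b):
    """Minimum number of single-character replacements, insertions, and deletions
    needed to convert string a into string b."""
    previous = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, char_a in enumerate(a, start=1):
        current = [i]  # converting a[:i] into the empty string takes i deletions
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # replace char_a with char_b (or keep it)
            ))
        previous = current
    return previous[-1]


print(edit_distance("Exit No. 4", "Exit No. 3"))
```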
  • Calculating, by the address matching device 13, the first match degree according to the editing distance obtained for each piece of sub-address information included in the to-be-found destination address information further includes: calculating a sum of the editing distances obtained separately for the pieces of sub-address information included in the to-be-found destination address information, where the sum is the first match degree.
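  • The summation can be sketched as below. The dictionary layout (address type mapped to sub-address text), the function names, and the choice to compare against an empty string when the selected address information lacks a given address type are assumptions for the example. Under this definition a smaller sum indicates a closer match; how the sum relates to the set threshold is not specified in this passage.

```python
def edit_distance(a, b):
    """Levenshtein distance with unit cost per replacement, insertion, or deletion."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1, current[j - 1] + 1, previous[j - 1] + cost))
        previous = current
    return previous[-1]


def first_match_degree(query_parts, candidate_parts):
    """Sum the editing distances between sub-address information of the same type."""
    return sum(
        edit_distance(text, candidate_parts.get(address_type, ""))  # missing type: assumed ""
        for address_type, text in query_parts.items()
    )


query = {"region": "Shanghai", "street": "Caoyang Road Station", "auxiliary": "Exit No. 4"}
stored = {"region": "Shanghai", "street": "Caoyang Road Station", "auxiliary": "Exit No. 3"}
print(first_match_degree(query, stored))
```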
  • Setting, by the address matching device 13, address information that is obtained by the matching and whose match degree is greater than a set threshold as found destination address information for output further includes: obtaining, by means of calculation, a total match degree between the address information selected each time and the to-be-found destination address information; determining in sequence a set quantity of total match degrees in descending order of the total match degrees; and setting the address information corresponding to the determined total match degrees as the found destination address information for output.
  • It should be noted that the set similarity threshold may be defined by a system default parameter, or may be determined according to an actual requirement, which is not limited herein.
  • It should be noted that the set quantity may be defined by a system default parameter, or may be determined according to an actual requirement, which is not limited herein.
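  • Selecting a set quantity of results in descending order of total match degree, optionally after applying the set threshold, can be sketched as follows. The function select_results, its parameters, and the sample scores are hypothetical; the sketch assumes that a larger total match degree means a better match.

```python
import heapq


def select_results(scored_addresses, set_quantity, threshold=None):
    """Return up to set_quantity addresses in descending order of total match degree.

    scored_addresses: iterable of (address, total_match_degree) pairs.
    threshold: if given, only degrees strictly greater than it are kept.
    """
    if threshold is not None:
        scored_addresses = [(a, d) for a, d in scored_addresses if d > threshold]
    best = heapq.nlargest(set_quantity, scored_addresses, key=lambda pair: pair[1])
    return [address for address, _ in best]


candidates = [("Address A", 0.9), ("Address B", 0.4), ("Address C", 0.7)]
top_two = select_results(candidates, set_quantity=2)
print(top_two)
```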
  • According to the address search system provided in the embodiments of the present application, sub-address information corresponding to destination address information is extracted from the address search request information when address search request information is acquired, which improves correctness of extraction of destination address information. The extracted sub-address information and the destination address information are matched with different address information included in an address database, and then address information whose match degree is greater than a set threshold is used as found destination address information, which effectively increases correctness of address matching and accuracy of address searching.
  • Embodiment 2
  • As shown in FIG. 2, FIG. 2 is a schematic structural diagram of an address search device according to Embodiment 2 of the present application. The address search device includes an acquiring module 21, a splitting module 22, and a searching module 23, where the acquiring module 21 is configured to acquire address search request information, and determine to-be-found destination address information included in the address search request information. The splitting module 22 is configured to split the destination address information determined by the acquiring module 21 into at least one piece of sub-address information, where the destination address information is formed by multiple pieces of sub-address information, and the multiple pieces of sub-address information respectively correspond to different address types, and the searching module 23 is configured to match the at least one piece of sub-address information obtained by the splitting module 22 or the at least one piece of sub-address information and the destination address information with different address information included in an address database, where each piece of address information stored in the address database includes different sub-address information that forms the address information, and set address information that is obtained by the matching and whose match degree is greater than a set threshold as found destination address information for output.
  • The acquiring module 21 is further configured to receive entered voice data, where the voice data is used to initiate an address search, and perform recognition on the voice data to obtain the to-be-found destination address information included in the voice data.
  • The acquiring module 21 is further configured to obtain the to-be-found destination address information by: determining at least one keyword included in the address search request information; for each determined keyword, finding, from preset text address dictionaries corresponding to different address types, a text address dictionary including the keyword, and replacing the keyword with a string used to denote an address type corresponding to the found text address dictionary; determining, using a regular expression used to represent address information, whether a string group formed after each keyword is replaced with a corresponding string represents address information; when the string group represents address information, determining, according to the string group, a quasi address string corresponding to the to-be-found destination address information; and setting the quasi address string as a CRF feature, and extracting the to-be-found destination address information from the address search request information based on a CRF algorithm.
  • Determining, by the acquiring module 21, the quasi address string corresponding to the to-be-found destination address information further includes: determining that multiple strings are included in the string group, and that keywords that are respectively replaced with the multiple strings are continuous in position in the address search request information; if none of the multiple strings is a duplicate, combining the multiple strings into one string group based on continuity of positions of the replaced keywords, as the quasi address string corresponding to the to-be-found destination address information; and if any of the multiple strings is a duplicate, removing the duplicate string, and combining at least one string obtained after the duplicate string is removed into one string group based on continuity of positions of the replaced keywords, as the quasi address string corresponding to the to-be-found destination address information.
  • The splitting module 22 is further configured to split, according to the address types represented by the preset text address dictionaries corresponding to different address types, the destination address information into sub-address information corresponding to the different address types.
  • The address type includes one or a combination of the following: administrative region information, road name information, building/organization name information, auxiliary content of the road name, and auxiliary content of the building/organization name.
  • The searching module 23 is further configured to correspondingly match each of the at least one piece of sub-address information in the destination address information with sub-address information of a same address type in the address database.
  • The searching module 23 is further configured to select one piece of address information from the address database, and determine sub-address information included in the selected address information, perform matching calculation between each of the at least one piece of sub-address information included in the to-be-found destination address and the sub-address information included in the selected address information to obtain a first match degree, where an address type of the sub-address information that is included in the to-be-found destination address and for which matching calculation is performed is the same as an address type of the sub-address information included in the selected address information, and obtain a total match degree between the selected address information and the to-be-found destination address information according to the first match degree.
  • Obtaining, by the searching module 23, a total match degree between the selected address information and the to-be-found destination address information according to the first match degree further includes: performing matching calculation between the selected address information and the to-be-found destination address information to obtain a second match degree; and obtaining the total match degree between the selected address information and the to-be-found destination address information according to the first match degree and the second match degree.
  • Performing, by the searching module 23, matching calculation between the sub-address information included in the to-be-found destination address and the sub-address information included in the selected address information to obtain a first match degree further includes: for each piece of sub-address information included in the to-be-found destination address information, finding, from the selected address information, sub-address information whose address type is the same as that of the piece of sub-address information, and calculating an editing distance required for mutual conversion between the piece of sub-address information and the found sub-address information; and calculating the first match degree according to the editing distance obtained for each piece of sub-address information included in the to-be-found destination address information.
  • Setting, by the searching module 23, address information that is obtained by matching and whose match degree is greater than a set threshold as found destination address information for output further includes: obtaining, by means of calculation, a total match degree between the address information selected each time and the to-be-found destination address information; determining in sequence a set quantity of total match degrees in descending order of the total match degrees; and setting the address information corresponding to the determined total match degrees as the found destination address information for output.
  • It should be noted that the address search device described in this embodiment of the present application may be implemented by means of hardware, or may be implemented by means of software, and an implementation manner is not limited herein.
  • When acquiring address search request information, the address search device extracts sub-address information corresponding to destination address information from the address search request information, which improves correctness of extraction of destination address information, matches the extracted sub-address information and the destination address information with different address information included in an address database, and then sets address information whose match degree is greater than a set threshold as found destination address information, which effectively increases correctness of address matching and accuracy of address searching.
  • Embodiment 3
  • As shown in FIG. 3, FIG. 3 is a schematic structural diagram of an address search device according to Embodiment 3 of the present application. The address search device has functions described in Embodiment 4 of the present application. The address search device may use a general computer system structure, where the computer system may further be a processor-based computer. The address search device includes at least one processor 31 and a signal receiver 32. The processor 31 and the signal receiver 32 are connected using a communications bus 33.
  • The signal receiver 32 is configured to acquire address search request information, and determine to-be-found destination address information included in the address search request information.
  • The processor 31 is configured to split the destination address information into at least one piece of sub-address information, where the destination address information is formed by multiple pieces of sub-address information, and the multiple pieces of sub-address information respectively correspond to different address types, match the at least one piece of sub-address information or the at least one piece of sub-address information and the destination address information with different address information included in an address database, where each piece of address information stored in the address database includes different sub-address information that forms the address information, and set address information that is obtained by matching and whose match degree is greater than a set threshold as found destination address information for output.
  • In a possible implementation manner, the processor 31 further executes receiving entered voice data, where the voice data is used to initiate an address search, and performing recognition on the voice data to obtain the to-be-found destination address information included in the voice data.
  • In a possible implementation manner, the processor 31 further executes obtaining the to-be-found destination address information in the following manner: determining at least one keyword included in the address search request information; for each determined keyword, finding, from preset text address dictionaries corresponding to different address types, a text address dictionary including the keyword, and replacing the keyword with a string used to denote an address type corresponding to the found text address dictionary; determining, using a regular expression used to represent address information, whether a string group formed after each keyword is replaced with a corresponding string represents address information; when the string group represents address information, determining, according to the string group, a quasi address string corresponding to the to-be-found destination address information; and setting the quasi address string as a CRF feature, and extracting the to-be-found destination address information from the address search request information based on a CRF algorithm.
  • In a possible implementation manner, the processor 31 further executes determining that multiple strings are included in the string group, and keywords that are respectively replaced with the multiple strings are continuous in position in the address search request information, combining the multiple strings into one string group based on continuity of positions of the replaced keywords, as the quasi address string corresponding to the to-be-found destination address information if none of the multiple strings is a duplicate, and removing the duplicate string if any of the multiple strings is a duplicate, and combining at least one string obtained after the duplicate string is removed into one string group based on continuity of positions of the replaced keywords, as the quasi address string corresponding to the to-be-found destination address information.
  • In a possible implementation manner, the processor 31 further executes splitting the destination address information into at least one piece of sub-address information, which includes splitting, according to the address types represented by the preset text address dictionaries corresponding to different address types, the destination address information into sub-address information corresponding to the different address types.
  • In a possible implementation manner, the address type includes one or a combination of the following: administrative region information, road name information, building/organization name information, auxiliary content of the road name, and auxiliary content of the building/organization name.
  • In a possible implementation manner, the processor 31 further executes correspondingly matching each of the at least one piece of sub-address information in the destination address information with sub-address information of a same address type in the address database.
  • In a possible implementation manner, the processor 31 further executes matching the at least one piece of sub-address information or the at least one piece of sub-address information and the destination address information with different address information included in an address database, which includes selecting one piece of address information from the address database, and determining sub-address information included in the selected address information, performing matching calculation between each of the at least one piece of sub-address information included in the to-be-found destination address and the sub-address information included in the selected address information to obtain a first match degree, where an address type of the sub-address information that is included in the to-be-found destination address and for which matching calculation is performed is the same as an address type of the sub-address information included in the selected address information, and obtaining a total match degree between the selected address information and the to-be-found destination address information according to the first match degree.
  • In a possible implementation manner, the processor 31 further executes obtaining a total match degree between the selected address information and the to-be-found destination address information according to the first match degree, which further includes performing matching calculation between the selected address information and the to-be-found destination address information to obtain a second match degree, and obtaining the total match degree between the selected address information and the to-be-found destination address information according to the first match degree and the second match degree.
  • In a possible implementation manner, the processor 31 further executes performing matching calculation between sub-address information included in the to-be-found destination address and the sub-address information included in the selected address information to obtain a first match degree, which includes for each piece of sub-address information included in the to-be-found destination address information, finding, from the selected address information, sub-address information whose address type is the same as that of the sub-address information for the sub-address information, calculating an editing distance required for mutual conversion between the sub-address information and the found sub-address information, and calculating the first match degree according to an editing distance obtained corresponding to each piece of sub-address information included in the to-be-found destination address information.
  • In a possible implementation manner, the processor 31 further executes setting address information that is obtained by matching and whose match degree is greater than a set threshold as found destination address information for output, which includes obtaining, by means of calculation, a total match degree between address information selected each time and the to-be-found destination address information, and determining in sequence a set quantity of total match degrees in descending order of the total match degrees, and setting address information selected corresponding to the determined total match degrees as the found destination address information for output.
  • The processor 31 may be a general purpose central processing unit (CPU), a microprocessor, an application-specific integrated circuit (ASIC), or one or more integrated circuits used to control execution of the solution program of the present application.
  • The address search device provided in this embodiment of the present application, when acquiring address search request information, extracts sub-address information corresponding to destination address information from the address search request information, which improves correctness of extraction of destination address information, and matches the extracted sub-address information and the destination address information with different address information included in an address database, and then sets address information whose match degree is greater than a set threshold as found destination address information, which effectively increases correctness of address matching and accuracy of address searching.
  • Embodiment 4
  • FIG. 4 is a schematic flowchart of an address search method according to Embodiment 4 of the present application. The method may include the following steps.
  • Step 401: Acquire address search request information.
  • The address search request information includes to-be-found destination address information.
  • In step 401, manners for acquiring the address search request information include but are not limited to the following: receiving entered text data, where the text data is used to initiate an address search; or receiving entered voice data, where the voice data is used to initiate an address search.
  • It should be noted that, if the address search request information is voice data, then the method further includes performing recognition on the received voice data to obtain text data corresponding to the voice data.
  • Step 402: Determine to-be-found destination address information included in the address search request information.
  • In step 402, the to-be-found destination address information is obtained in the following manner: determining at least one keyword included in the address search request information; for each determined keyword, finding, from preset text address dictionaries corresponding to different address types, a text address dictionary including the keyword, and replacing the keyword with a string used to denote an address type corresponding to the found text address dictionary; determining, using a regular expression used to represent address information, whether a string group formed after each keyword is replaced with a corresponding string represents address information; when the string group represents address information, determining, according to the string group, a quasi address string corresponding to the to-be-found destination address information; and setting the quasi address string as a conditional random field (CRF) feature, and extracting the to-be-found destination address information from the address search request information based on a CRF algorithm.
  • Further, the address information is hierarchical. That is, there is address information that represents an address region, such as address information corresponding to an administrative region. For example, “Beijing” as a piece of address information corresponds to an address region on a map. There is also address information that represents a specific location, for example, XX Building, No. XX, XX Road, XX District, XX City, where this piece of address information corresponds to a specific location of XX Building.
  • Therefore, address information is classified into different address types according to different levels of address information. The address type includes one or a combination of the following: administrative region information, road name information, building/organization name information, auxiliary content of the road name, and auxiliary content of the building/organization name.
  • For example, for the address information “Room XX, Floor XX, XX Building, No. XX, XX Road, XX District, XX City”, an address type corresponding to “XX District, XX City” is administrative region information, an address type corresponding to “XX Road” is road name information, an address type corresponding to “No. XX” is auxiliary content of the road name, an address type corresponding to “XX Building” is building/organization name information, and an address type corresponding to “Room XX, Floor XX” is auxiliary content of the building/organization name.
  • It should be noted that the auxiliary content of the road name indicates that it is meaningless without the road name in front of it, and a specific address cannot be located using the auxiliary content of the road name alone. Likewise, the auxiliary content of the building/organization name indicates that it is also meaningless without the building/organization name in front of it, and a specific address cannot be located using the auxiliary content of the building/organization name alone.
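  • The five address types and the dependency of auxiliary content on its parent name can be sketched as a small data structure. This is only an illustration: the names `AddressType`, `REQUIRES`, and `is_locatable` are hypothetical and do not appear in the application.

```python
from enum import Enum


class AddressType(Enum):
    ADMIN_REGION = "administrative region information"
    ROAD_NAME = "road name information"
    ROAD_AUX = "auxiliary content of the road name"
    BUILDING_NAME = "building/organization name information"
    BUILDING_AUX = "auxiliary content of the building/organization name"


# Auxiliary content is meaningless without the name it qualifies.
REQUIRES = {
    AddressType.ROAD_AUX: AddressType.ROAD_NAME,
    AddressType.BUILDING_AUX: AddressType.BUILDING_NAME,
}

# The worked example from the text, one (text, type) pair per piece.
example = [
    ("XX District, XX City", AddressType.ADMIN_REGION),
    ("XX Road", AddressType.ROAD_NAME),
    ("No. XX", AddressType.ROAD_AUX),
    ("XX Building", AddressType.BUILDING_NAME),
    ("Room XX, Floor XX", AddressType.BUILDING_AUX),
]


def is_locatable(parts):
    """Return True if every auxiliary piece is accompanied by its parent type."""
    present = {address_type for _, address_type in parts}
    return all(REQUIRES[t] in present for t in present if t in REQUIRES)
```

  • Under this sketch, the full example address passes the check, whereas “No. XX” on its own does not, which mirrors the statement that auxiliary content alone cannot locate a specific address.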
  • The preset text address dictionaries corresponding to different address types include at least the following: an administrative region dictionary, which includes administrative region information such as province, city, district, county, township, village, autonomous prefecture, league, and banner, for example, Beijing Municipality, Beijing, Shanghai Municipality, Shanghai, Shenzhen City, and Shenzhen; an organization/building end word dictionary, which includes end words of names of buildings/organizations, for example, police station, building, and center; and a street name dictionary, which includes street names, for example, XX Road, XX Street, and XX Station. In addition, the following dictionaries are also included: a stop word dictionary, which includes words or phrases that convey the meaning of stop, for example, arrive; a symbol dictionary, which includes punctuation marks; and a number dictionary, which includes numbers and the like.
  • For example, the at least one keyword included in the address search request information is “go to,” “Shanghai,” “Caoyang Road,” “Station,” and “Exit No. 4.”
  • In this case, for each obtained keyword, execute the following steps.
  • Step 1: Find, from the preset text address dictionaries corresponding to different address types, a text address dictionary including the keyword, and replace the keyword using a string used to represent an address type corresponding to the found text address dictionary.
  • For example, “go to” belongs to the stop word dictionary, “Shanghai” belongs to the administrative region dictionary, “Caoyang Road” and “Station” belong to the street name dictionary, “4” in “Exit No. 4” belongs to the number dictionary, and “No.” and “Exit” in “Exit No. 4” belong to the end word dictionary.
  • A string of an address type corresponding to the stop word dictionary is SSS, a string of an address type corresponding to the administrative region dictionary is AAA, a string of an address type corresponding to the street name dictionary is RRR, a string of an address type corresponding to the number dictionary is DDD, and a string of an address type corresponding to the end word dictionary is OOO.
  • Then a string group obtained after the replacement may be SSSAAARRRRRRDDDOOOOOO, or may be SSSAAA City RRR Road Station DDDOOOOOO. Whether a text used to represent an address type in the keyword is replaced is not limited herein.
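  • Step 1 can be sketched as a dictionary lookup followed by replacement. The dictionaries below are tiny hypothetical stand-ins, and the way “Exit No. 4” is broken into “4,” “No.,” and “Exit” is an assumption chosen to reproduce the string group given above.

```python
# Hypothetical miniature dictionaries keyed by the type string they map to.
DICTIONARIES = {
    "SSS": {"go to", "arrive"},          # stop word dictionary
    "AAA": {"Shanghai", "Beijing"},      # administrative region dictionary
    "RRR": {"Caoyang Road", "Station"},  # street name dictionary
    "DDD": {"4"},                        # number dictionary
    "OOO": {"No.", "Exit"},              # end word dictionary
}


def type_string(keyword):
    """Return the type string of the first dictionary containing the keyword."""
    for code, words in DICTIONARIES.items():
        if keyword in words:
            return code
    return None


def replace_keywords(keywords):
    """Replace each keyword with its type string; keep unknown keywords as-is."""
    return [type_string(k) or k for k in keywords]


keywords = ["go to", "Shanghai", "Caoyang Road", "Station", "4", "No.", "Exit"]
type_codes = replace_keywords(keywords)
string_group = "".join(type_codes)
```

  • With these inputs, `string_group` equals the first form quoted above, SSSAAARRRRRRDDDOOOOOO.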
  • Step 2: Determine, using a regular expression used to represent address information, whether a string group formed after each keyword is replaced with a corresponding string represents address information, and determine, according to the string group, a quasi address string corresponding to the to-be-found destination address information when the string group represents address information.
  • It should be noted that a regular expression describes and matches, using a single string, a string that meets a syntactical rule, and the regular expression involved in this embodiment of the present application describes address information.
  • For example, for the string group formed after each keyword is replaced with the corresponding string, whether the string group represents address information is determined using a regular expression.
  • According to the determining using the regular expression, “AAA City” and “RRR Road Station” may represent address information, whereas “SSS” may not.
  • Step 3: Set the quasi address string as a CRF feature, and extract the to-be-found destination address information from the address search request information based on a CRF algorithm.
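  • The regular expression itself is not given in the text. As a minimal sketch, the hypothetical pattern below accepts one or more administrative regions, optionally followed by street names, a number, and end words, and is applied to a string group made of the three-letter type strings.

```python
import re

# Hypothetical pattern, not taken from the application: administrative
# region(s), then optional street names, an optional number, and end words.
ADDRESS_PATTERN = re.compile(r"(?:AAA)+(?:RRR)*(?:DDD)?(?:OOO)*")


def find_quasi_address(string_group):
    """Return the first substring that looks like address information, or None."""
    match = ADDRESS_PATTERN.search(string_group)
    return match.group(0) if match else None
```

  • For the string group SSSAAARRRRRRDDDOOOOOO, this sketch skips the leading SSS and returns AAARRRRRRDDDOOOOOO, while a group consisting only of SSS yields no match.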
  • It should be noted that the quasi address string is used as one CRF feature. The CRF features used for extracting the destination address information based on the CRF algorithm are not limited to this feature and may include multiple other CRF features, but they always include the CRF feature provided by the quasi address string.
  • In this case, the extracted destination address information is Exit No. 4, Caoyang Road Station, Shanghai.
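  • How the quasi address string enters the CRF is not detailed. One plausible arrangement, shown here purely as an assumption, is a per-token feature dictionary in which a flag marks the tokens covered by the quasi address string. The function name `token_features` and all feature keys are hypothetical, and training or decoding with an actual CRF library is outside this sketch.

```python
def token_features(tokens, type_codes, quasi_span):
    """Build one feature dict per token.

    quasi_span is a half-open (start, end) token range that the
    regular-expression step identified as a quasi address.
    """
    start, end = quasi_span
    features = []
    for i, (token, code) in enumerate(zip(tokens, type_codes)):
        features.append({
            "token": token,
            "type_code": code,
            # The feature contributed by the quasi address string.
            "in_quasi_address": start <= i < end,
            # An additional, ordinary context feature.
            "prev_type_code": type_codes[i - 1] if i > 0 else "BOS",
        })
    return features


tokens = ["go to", "Shanghai", "Caoyang Road", "Station", "4", "No.", "Exit"]
codes = ["SSS", "AAA", "RRR", "RRR", "DDD", "OOO", "OOO"]
features = token_features(tokens, codes, (1, 7))
```

  • A sequence labeler trained on such features would then mark which tokens belong to the destination address; the quasi address flag is one feature among several, consistent with the note above.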
  • Optionally, determining, according to the string group, a quasi address string corresponding to the to-be-found destination address information further includes: determining that multiple strings are included in the string group, and that the keywords respectively replaced with the multiple strings are continuous in position in the address search request information; if none of the multiple strings is a duplicate, combining the multiple strings into one string group, based on continuity of positions of the replaced keywords, as the quasi address string corresponding to the to-be-found destination address information; and if any of the multiple strings is a duplicate, removing the duplicate string, and combining the at least one string obtained after the duplicate string is removed into one string group, based on continuity of positions of the replaced keywords, as the quasi address string corresponding to the to-be-found destination address information.
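  • The optional combining step can be sketched as follows. The text does not say whether only adjacent duplicates are removed; this sketch assumes so, collapsing runs of the same type string while preserving order. The function name is hypothetical.

```python
from itertools import groupby


def quasi_address_string(type_codes):
    """Combine position-contiguous type strings, dropping adjacent duplicates."""
    # groupby yields one key per run of equal consecutive items.
    return "".join(code for code, _run in groupby(type_codes))
```

  • For example, the contiguous strings AAA, RRR, RRR, DDD, OOO, OOO combine into AAARRRDDDOOO, and a sequence without duplicates is simply concatenated.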
  • Step 403: Split the destination address information into at least one piece of sub-address information, where the destination address information is formed by multiple pieces of sub-address information, and the multiple pieces of sub-address information respectively correspond to different address types.
  • In step 403, the destination address information is split, according to the address types represented by the preset text address dictionaries corresponding to different address types, into sub-address information corresponding to the different address types.
  • For example, the sub-address information obtained by splitting “Exit No. 4, Caoyang Road Station, Shanghai” is: “Shanghai” (administrative region information), “Caoyang Road Station” (street name information), and “Exit No. 4” (auxiliary information of the street name).
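  • A minimal sketch of the splitting in step 403 is given below. It assumes the keywords have already been tagged with type strings, and the mapping from type strings to address types (`TYPE_OF_CODE`) is hypothetical. The tokens are listed in the order of the English rendering for readability.

```python
# Hypothetical mapping from dictionary type strings to address types.
TYPE_OF_CODE = {
    "AAA": "administrative region",
    "RRR": "street name",
    "DDD": "street name auxiliary",
    "OOO": "street name auxiliary",
}


def split_address(tagged):
    """Group (keyword, type string) pairs into sub-address text per address type."""
    parts = {}
    for keyword, code in tagged:
        address_type = TYPE_OF_CODE.get(code)
        if address_type is None:
            continue  # e.g. stop words carry no address information
        parts.setdefault(address_type, []).append(keyword)
    return {t: " ".join(words) for t, words in parts.items()}


tagged = [
    ("go to", "SSS"), ("Shanghai", "AAA"),
    ("Caoyang Road", "RRR"), ("Station", "RRR"),
    ("Exit", "OOO"), ("No.", "OOO"), ("4", "DDD"),
]
sub_addresses = split_address(tagged)
```

  • The result maps “administrative region” to Shanghai, “street name” to Caoyang Road Station, and “street name auxiliary” to Exit No. 4.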
  • Step 404: Match the at least one piece of sub-address information or the at least one piece of sub-address information and the destination address information with different address information included in an address database, and set address information that is obtained by matching and whose match degree is greater than a set threshold as found destination address information for output.
  • Each piece of address information stored in the address database includes different sub-address information that forms the address information.
  • In step 404, first, one piece of address information is selected from the address database, and sub-address information included in the selected address information is determined.
  • Second, matching calculation is performed between each of the at least one piece of sub-address information included in the to-be-found destination address and the sub-address information included in the selected address information to obtain a first match degree, where an address type of the sub-address information that is included in the to-be-found destination address and for which matching calculation is performed is the same as an address type of the sub-address information included in the selected address information.
  • Performing matching calculation between sub-address information included in the to-be-found destination address and the sub-address information included in the selected address information to obtain a first match degree further includes: for each piece of sub-address information included in the to-be-found destination address information, finding, from the selected address information, sub-address information whose address type is the same as that of the piece of sub-address information; calculating an editing distance required for mutual conversion between the piece of sub-address information and the found sub-address information; and calculating the first match degree according to the editing distance obtained for each piece of sub-address information included in the to-be-found destination address information.
  • It should be noted that the editing distance between two strings refers to the minimum quantity of editing operations required to convert one string into the other. An editing operation refers to replacing one character with another character, inserting one character, deleting one character, or the like.
  • Assume that one piece of address data selected from the address database is “XXX Station, XXX District, XX City.” In this case, one piece of sub-address information “Shanghai” is selected from the to-be-found destination address information “Exit No. 4, Caoyang Road Station, Shanghai,” and the address type corresponding to “Shanghai” is determined as an administrative region. Then, sub-address information “XX City” that represents the administrative region is found in “XXX Station, XXX District, XX City,” and an editing distance required for performing mutual conversion between “XX City” and “Shanghai” is calculated.
  • If “XX City” is Shanghai, then the editing distance required for performing mutual conversion between “XX City” and “Shanghai” is 0; if “XX City” is Beijing, then the editing distance required for performing mutual conversion between “XX City” and “Shanghai” is 2 (a value that is consistent with counting characters in the two-character Chinese names of the two cities).
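  • The editing distance described above corresponds to the standard Levenshtein distance, which can be sketched with dynamic programming. The application does not prescribe a particular algorithm, so this is one common way to compute it. The Shanghai/Beijing values are reproduced here on the assumption that the names are compared as two Chinese characters each.

```python
def edit_distance(a, b):
    """Minimum number of single-character replacements, insertions, and deletions."""
    # prev[j] holds the distance between the processed prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(
                prev[j] + 1,               # delete ca
                cur[j - 1] + 1,            # insert cb
                prev[j - 1] + (ca != cb),  # replace, or keep if equal
            ))
        prev = cur
    return prev[-1]
```

  • Under that assumption, `edit_distance("上海", "上海")` is 0 and `edit_distance("上海", "北京")` is 2, matching the values in the example.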
  • After the editing distance is determined, a first match degree between the sub-address information in the to-be-found destination address information and the sub-address information of the same address type in the selected address information is obtained according to the editing distance obtained by means of calculation.
  • For example, the first match degree = 1 − editing distance / MAX(a, b), or the first match degree = editing distance / MAX(a, b), where a is a quantity of strings corresponding to the sub-address information in the to-be-found destination address information, and b is a quantity of strings corresponding to the sub-address information of the same address type in the selected address information.
  • After a first match degree has been obtained for each piece of sub-address information included in the to-be-found destination address information, the first match degree between the sub-address information included in the to-be-found destination address information and the sub-address information included in the selected address information is obtained according to these per-piece first match degrees.
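  • The first of the two formulas above can be written as a short function. The parameter names are hypothetical, the lengths are assumed to be character counts, and the handling of two empty strings is an assumption added to avoid division by zero.

```python
def first_match_degree(distance, len_query, len_candidate):
    """Compute 1 - distance / max(lengths); 1.0 means identical."""
    longest = max(len_query, len_candidate)
    if longest == 0:
        return 1.0  # assumption: two empty pieces are treated as identical
    return 1 - distance / longest
```

  • With this form, an editing distance of 0 gives a match degree of 1 and a distance equal to the longer length gives 0. The second formula omits the subtraction and therefore grows as the strings diverge, so a threshold comparison would need to be interpreted accordingly.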
  • Finally, a total match degree between the selected address information and the to-be-found destination address information is obtained according to the first match degree.
  • One manner is setting the obtained first match degree as the total match degree between the selected address information and the to-be-found destination address information.
  • Another manner is continuing performing matching calculation between the selected address information and the to-be-found destination address information to obtain a second match degree.
  • The total match degree between the selected address information and the to-be-found destination address information is obtained according to the first match degree and the second match degree.
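  • The text leaves open how the per-piece match degrees are combined and how the first and second match degrees yield the total. The sketch below assumes a plain average for the former and a weighted sum for the latter; both choices, the function names, and the `weight` parameter are hypothetical.

```python
def aggregate_first_degree(per_piece_degrees):
    """Assumed aggregation: arithmetic mean of the per-piece match degrees."""
    if not per_piece_degrees:
        return 0.0
    return sum(per_piece_degrees) / len(per_piece_degrees)


def total_match_degree(first, second=None, weight=0.5):
    """Use the first degree alone, or blend it with a whole-address second degree."""
    if second is None:
        return first  # first manner: the first match degree is the total
    return weight * first + (1 - weight) * second  # second manner (assumed blend)
```

  • For instance, per-piece degrees of 1.0 and 0.5 average to 0.75, and blending 0.8 with 0.4 at equal weight gives approximately 0.6.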
  • Setting address information that is obtained by matching and whose match degree is greater than a set threshold as found destination address information for output further includes: obtaining, by means of calculation, a total match degree between the address information selected each time and the to-be-found destination address information; determining in sequence a set quantity of total match degrees in descending order of the total match degrees; and setting the address information corresponding to the determined total match degrees as the found destination address information for output.
  • It should be noted that the set similarity threshold may be defined by a system default parameter, or may be determined according to an actual requirement, which is not limited herein.
  • It should be noted that the set quantity may be defined by a system default parameter, or may be determined according to an actual requirement, which is not limited herein.
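  • The two output rules, a set threshold and a set quantity taken in descending order of total match degree, can be sketched together. The function and parameter names are hypothetical, and both limits are optional here so that either rule can be applied on its own.

```python
def select_results(scored, threshold=None, top_n=None):
    """Pick addresses from (address, total match degree) pairs.

    threshold keeps only degrees strictly greater than it;
    top_n keeps the first top_n entries in descending order of degree.
    """
    ranked = sorted(scored, key=lambda pair: pair[1], reverse=True)
    if threshold is not None:
        ranked = [pair for pair in ranked if pair[1] > threshold]
    if top_n is not None:
        ranked = ranked[:top_n]
    return [address for address, _degree in ranked]


scored = [("A", 0.9), ("B", 0.4), ("C", 0.7)]
```

  • With these sample scores, a threshold of 0.5 returns A and C, and a set quantity of 1 returns only A.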
  • According to the solution of Embodiment 4 of the present application, address search request information is acquired, and to-be-found destination address information included in the address search request information is determined. The destination address information is split into at least one piece of sub-address information, where the destination address information is formed by multiple pieces of sub-address information, and the multiple pieces of sub-address information respectively correspond to different address types. The at least one piece of sub-address information or the at least one piece of sub-address information and the destination address information are matched with different address information included in an address database, where each piece of address information stored in the address database includes different sub-address information that forms the address information, and address information that is obtained by the matching and whose match degree is greater than a set threshold is used as found destination address information for output. In the embodiments of the present application, when address search request information is acquired, sub-address information corresponding to destination address information is extracted from the address search request information, which improves correctness of extraction of destination address information. The extracted sub-address information and the destination address information are matched with different address information included in an address database, and then address information whose match degree is greater than a set threshold is used as found destination address information, which effectively increases correctness of address matching and accuracy of address searching.
  • A person skilled in the art should understand that the embodiments of the present application may be provided as a method, an apparatus (device), or a computer program product. Therefore, the present application may use a form of hardware only embodiments, software only embodiments, or embodiments with a combination of software and hardware. Moreover, the present application may use a form of a computer program product that is implemented on one or more computer-usable storage media (including but not limited to a disk memory, a compact-disc read-only memory (CD-ROM), an optical memory, and the like) that include computer-usable program code.
  • The present application is described with reference to the flowcharts and/or block diagrams of the method, the apparatus (device), and the computer program product according to the embodiments of the present application. It should be understood that computer program instructions may be used to implement each process and/or each block in the flowcharts and/or the block diagrams and a combination of a process and/or a block in the flowcharts and/or the block diagrams. These computer program instructions may be provided for a general-purpose computer, a dedicated computer, an embedded processor, or a processor of any other programmable data processing device to generate a machine such that the instructions executed by a computer or a processor of any other programmable data processing device generate an apparatus for implementing a specific function in one or more processes in the flowcharts and/or in one or more blocks in the block diagrams.
  • These computer program instructions may also be stored in a computer readable memory that can instruct the computer or any other programmable data processing device to work in a specific manner such that the instructions stored in the computer readable memory generate an artifact that includes an instruction apparatus. The instruction apparatus implements a specific function in one or more processes in the flowcharts and/or in one or more blocks in the block diagrams.
  • These computer program instructions may also be loaded onto a computer or another programmable data processing device such that a series of operations and steps are performed on the computer or the other programmable device, thereby generating computer-implemented processing. Therefore, the instructions executed on the computer or the other programmable device provide steps for implementing a specific function in one or more processes in the flowcharts and/or in one or more blocks in the block diagrams.
  • Although some exemplary embodiments of the present application have been described, a person skilled in the art can make changes and modifications to these embodiments once they learn the basic concept. Therefore, the following claims are intended to be construed as to cover the exemplary embodiments and all changes and modifications falling within the scope of the present application.
  • Obviously, a person skilled in the art can make various modifications and variations to the present application without departing from the scope of the present application. The present application is intended to cover these modifications and variations provided that they fall within the scope of protection defined by the following claims and their equivalent technologies.

Claims (20)

What is claimed is:
1. An address search method, comprising:
acquiring address search request information comprising to-be-found destination address information;
obtaining the to-be-found destination address information;
splitting the to-be-found destination address information into at least one piece of sub-address information;
matching the at least one piece of sub-address information with different address information in an address database to obtain a plurality of match degrees, wherein each piece of address information stored in the address database comprises different sub-address information that forms the address information, and wherein each piece of address information and the to-be-found destination address information has one match degree; and
setting first address information as found destination address information,
wherein a first match degree between the first address information and the to-be-found destination address information is greater than a preset threshold.
2. The address search method of claim 1, further comprising:
receiving voice data, wherein the voice data initiates an address search; and
performing recognition on the voice data to obtain the to-be-found destination address information, wherein the voice data comprises the to-be-found destination address information.
3. The address search method of claim 1, further comprising:
determining at least one keyword comprised in the address search request information;
finding, for each keyword from preset text address dictionaries corresponding to different address types, a text address dictionary comprising a respective keyword;
replacing the respective keyword with a string denoting an address type corresponding to the text address dictionary;
determining, using a regular expression representing the address information, whether a string group formed after each keyword is replaced with a corresponding string representing the address information;
determining, according to the string group, a quasi address string corresponding to the to-be-found destination address information, wherein the string group represents the address information;
setting the quasi address string as a conditional random field (CRF) feature; and
extracting the to-be-found destination address information from the address search request information based on a CRF algorithm.
4. The address search method of claim 3, wherein determining the quasi address string comprises:
determining that the string group comprises a plurality of strings;
determining that a plurality of replaced keywords that are respectively replaced with the strings are continuous in position in the address search request information;
combining the strings into one string group based on a continuity of positions of the replaced keywords to form the quasi address string in response to a duplicate string not existing in the strings;
removing the duplicate string in response to the duplicate string existing in the strings; and
combining at least one string obtained after the duplicate string is removed into the one string group based on the continuity of positions of the replaced keywords to form the quasi address string.
5. The address search method of claim 1, further comprising splitting, according to address types represented by preset text address dictionaries corresponding to different address types, the to-be-found destination address information into the sub-address information, wherein the sub-address information corresponds to the different address types.
6. The address search method of claim 1, wherein the address type comprises at least one of:
administrative region information;
road name information;
a building or an organization name information;
auxiliary content of the road name; or
auxiliary content of the building or the organization name.
7. The address search method of claim 1, further comprising correspondingly matching each of the at least one piece of sub-address information with sub-address information of a same address type in the address database.
8. The address search method of claim 1, wherein matching the at least one piece of sub-address information with the different address information to obtain the match degrees comprises:
selecting one piece of address information from the address database to obtain selected address information;
determining sub-address information comprised in the selected address information;
performing a matching calculation between each of the at least one piece of sub-address information of the to-be-found destination address information and the sub-address information of the selected address information to obtain a first match degree, wherein an address type of the at least one piece of sub-address information is the same as an address type of the sub-address information of the selected address information; and
obtaining a total match degree between the selected address information and the to-be-found destination address information according to the first match degree.
9. The address search method of claim 8, further comprising:
performing a matching calculation between the selected address information and the to-be-found destination address information to obtain a second match degree;
obtaining the total match degree between the selected address information and the to-be-found destination address information according to the first match degree and the second match degree; and
setting address information based on the total match degree being greater than a set threshold as the found destination address information.
10. The address search method of claim 8, further comprising:
finding, for a respective piece of sub-address information from the selected address information, sub-address information having an address type that is the same as that of the respective piece of sub-address information;
calculating an editing distance for mutual conversion between the respective piece of sub-address information and the sub-address information having the address type that is the same as that of the respective piece of sub-address; and
calculating the first match degree according to the editing distance obtained corresponding to each piece of sub-address information.
11. The address search method of claim 8, further comprising:
obtaining the total match degree between address information and the to-be-found destination address information;
determining in sequence a set quantity of total match degrees in descending order of the total match degrees; and
setting address information selected corresponding to the total match degrees as the found destination address information.
12. An address search device, comprising:
a memory comprising instructions; and
a processor coupled to the memory and configured to execute the instructions, which cause the processor to be configured to:
acquire address search request information comprising to-be-found destination address information;
obtain the to-be-found destination address information;
split the to-be-found destination address information into at least one piece of sub-address information;
match the at least one piece of sub-address information with different address information in an address database to obtain a plurality of match degrees, wherein each piece of address information stored in the address database comprises different sub-address information that forms the address information, and wherein each piece of address information and the to-be-found destination address information has one match degree; and
set first address information as found destination address information,
wherein a match degree between the first address information and the to-be-found destination address information is greater than a preset threshold.
13. The address search device of claim 12, wherein the instructions further cause the processor to be configured to:
determine at least one keyword comprised in the address search request information;
find, for each keyword, from preset text address dictionaries corresponding to different address types, a text address dictionary comprising a respective keyword;
replace the respective keyword with a string denoting an address type corresponding to the text address dictionary;
determine, using a regular expression representing address information, whether a string group formed after each keyword is replaced with a corresponding string represents the address information;
determine, according to the string group, a quasi address string corresponding to the to-be-found destination address information, wherein the string group represents the address information;
set the quasi address string as a conditional random field (CRF) feature; and
extract the to-be-found destination address information from the address search request information based on a CRF algorithm.
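The keyword replacement and regular-expression check of claim 13 might look like the sketch below. The one-letter type codes (`A` for administrative region, `R` for road, `B` for building, `O` for anything else), the dictionary contents, and the pattern itself are all hypothetical. The CRF step is not implemented: the returned quasi address string is only the feature that would be handed to a CRF model.

```python
import re

# Hypothetical dictionaries keyed by a one-letter address type code.
TYPE_DICTIONARIES = {
    "A": ("Shenzhen", "Longgang"),
    "R": ("Bantian Road",),
    "B": ("Huawei Base",),
}

# Assumed pattern: region(s), optional road, optional building, or a shorter tail of that.
ADDRESS_PATTERN = re.compile(r"A+R?B?|RB?|B")


def type_code(keyword):
    """Return the type code of the dictionary containing the keyword, or 'O' if none does."""
    for code, entries in TYPE_DICTIONARIES.items():
        if keyword in entries:
            return code
    return "O"


def quasi_address(keywords):
    """Return the longest run of keywords whose type codes match the address pattern."""
    codes = "".join(type_code(k) for k in keywords)  # one character per keyword
    best = max(ADDRESS_PATTERN.finditer(codes),
               key=lambda m: m.end() - m.start(), default=None)
    if best is None:
        return []
    return keywords[best.start():best.end()]
```

Because each keyword maps to exactly one character, match offsets in the code string index directly back into the keyword list.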
14. The address search device of claim 13, wherein the instructions further cause the processor to be configured to:
determine that the string group comprises a plurality of strings;
determine that a plurality of replaced keywords that are respectively replaced with the strings are continuous in position in the address search request information;
combine the strings into one string group based on continuity of positions of the replaced keywords to form the quasi address string when a duplicate string does not exist in the strings;
remove the duplicate string when the duplicate string exists in the strings; and
combine at least one string obtained after the duplicate string is removed into the one string group based on the continuity of positions of the replaced keywords to form the quasi address string.
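One possible reading of claim 14, sketched under stated assumptions: each replaced keyword is represented as a `(position, type_string)` pair, "continuous in position" means consecutive integer positions, and a duplicate string is dropped by keeping its first occurrence. The function name and the `None` return for non-continuous input are illustrative choices, not part of the claim.

```python
from typing import Optional


def combine_strings(replaced) -> Optional[str]:
    """Merge type strings of position-continuous keywords into one quasi address string."""
    ordered = sorted(replaced)
    positions = [position for position, _ in ordered]
    # The replaced keywords must be adjacent in the search request.
    if any(b - a != 1 for a, b in zip(positions, positions[1:])):
        return None
    seen = set()
    kept = []
    for _, type_string in ordered:
        if type_string not in seen:  # remove duplicate strings, keep first occurrence
            seen.add(type_string)
            kept.append(type_string)
    return "".join(kept)
```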
15. The address search device of claim 12, wherein the instructions further cause the processor to be configured to split, according to the address types represented by preset text address dictionaries corresponding to the different address types, the to-be-found destination address information into the sub-address information, wherein the sub-address information corresponds to the different address types.
16. The address search device of claim 12, wherein the address type comprises at least one of:
administrative region information;
road name information;
building or organization name information;
auxiliary content of the road name; and
auxiliary content of the building or the organization name.
17. The address search device of claim 12, wherein the instructions further cause the processor to be configured to correspondingly match each of the at least one piece of sub-address information with sub-address information of a same address type in the address database.
18. The address search device of claim 12, wherein the instructions further cause the processor to be configured to:
select one piece of address information from the address database to obtain selected address information;
determine sub-address information of the selected address information;
perform a matching calculation between each of the at least one piece of sub-address information of the to-be-found destination address and the sub-address information of the selected address information to obtain a first match degree, wherein an address type of the at least one piece of sub-address information is the same as an address type of the sub-address information of the selected address information; and
obtain a total match degree between the selected address information and the to-be-found destination address information according to the first match degree.
19. The address search device of claim 18, wherein the instructions further cause the processor to be configured to:
perform a matching calculation between the selected address information and the to-be-found destination address information to obtain a second match degree; and
obtain the total match degree between the selected address information and the to-be-found destination address information according to the first match degree and the second match degree.
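The claim does not state how the first and second match degrees are combined. The sketch below assumes a weighted sum and uses `difflib.SequenceMatcher` from the Python standard library as a stand-in for the whole-string matching calculation. The weight value and both function names are hypothetical.

```python
from difflib import SequenceMatcher


def second_match_degree(query, candidate):
    """Whole-string similarity in [0, 1] between the full query and a stored address."""
    return SequenceMatcher(None, query, candidate).ratio()


def total_match_degree(first, second, weight=0.7):
    """Assumed combination: weighted sum of the per-type and whole-string degrees."""
    return weight * first + (1.0 - weight) * second
```

A higher `weight` favors the per-type comparison, which is less sensitive to the order in which the sub-addresses appear in the query.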
20. A computer program product comprising computer-executable instructions for storage on a non-transitory computer-readable medium that, when executed by a processor, cause an address search device to:
acquire address search request information comprising to-be-found destination address information;
obtain the to-be-found destination address information;
split the to-be-found destination address information into at least one piece of sub-address information;
match the at least one piece of sub-address information with different address information in an address database to obtain a plurality of match degrees, wherein each piece of address information stored in the address database comprises different sub-address information that forms the address information, and wherein each piece of address information and the to-be-found destination address information has one match degree; and
set first address information as found destination address information,
wherein a match degree between the first address information and the to-be-found destination address information is greater than a preset threshold.
US16/929,611 2014-09-30 2020-07-15 Address Search Method and Device Abandoned US20200349175A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US16/929,611 US20200349175A1 (en) 2014-09-30 2020-07-15 Address Search Method and Device

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
CN201410525978.XA CN105528372B (en) 2014-09-30 2014-09-30 A kind of address search method and equipment
CN201410525978.X 2014-09-30
PCT/CN2015/079816 WO2016050088A1 (en) 2014-09-30 2015-05-26 Address search method and device
US15/398,260 US10783171B2 (en) 2014-09-30 2017-01-04 Address search method and device
US16/929,611 US20200349175A1 (en) 2014-09-30 2020-07-15 Address Search Method and Device

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US15/398,260 Continuation US10783171B2 (en) 2014-09-30 2017-01-04 Address search method and device

Publications (1)

Publication Number Publication Date
US20200349175A1 (en) 2020-11-05

Family

ID=55629404

Family Applications (2)

Application Number Title Priority Date Filing Date
US15/398,260 Active 2037-03-23 US10783171B2 (en) 2014-09-30 2017-01-04 Address search method and device
US16/929,611 Abandoned US20200349175A1 (en) 2014-09-30 2020-07-15 Address Search Method and Device

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US15/398,260 Active 2037-03-23 US10783171B2 (en) 2014-09-30 2017-01-04 Address search method and device

Country Status (4)

Country Link
US (2) US10783171B2 (en)
EP (1) EP3153978B1 (en)
CN (1) CN105528372B (en)
WO (1) WO2016050088A1 (en)

Families Citing this family (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107766383B (en) * 2016-08-22 2020-04-07 平安科技(深圳)有限公司 Address positioning method and device
CN108132956A (en) * 2016-12-01 2018-06-08 北京搜狗科技发展有限公司 A kind of searching method, device and electronic equipment
CN109255565B (en) * 2017-07-14 2022-12-16 菜鸟智能物流控股有限公司 Address attribution identification and logistics task distribution method and device
WO2019023911A1 (en) * 2017-07-31 2019-02-07 Beijing Didi Infinity Technology And Development Co., Ltd. System and method for segmenting text
CN107577744A (en) * 2017-08-28 2018-01-12 苏州科技大学 Nonstandard Address automatic matching model, matching process and method for establishing model
CN108197188B (en) * 2017-12-26 2020-06-30 北京星选科技有限公司 Address information processing method and device
CN108416062A (en) * 2018-03-26 2018-08-17 国家电网公司客户服务中心 A kind of electric network data correlating method based on address matching technology
US11531120B2 (en) 2018-04-17 2022-12-20 Huawei Technologies Co., Ltd. Picture processing method and related device
CN108960645B (en) * 2018-07-10 2020-11-13 创新先进技术有限公司 Risk prevention and control method, system and terminal equipment
JP7183600B2 (en) 2018-07-20 2022-12-06 株式会社リコー Information processing device, system, method and program
CN109388634B (en) * 2018-09-18 2024-05-03 平安科技(深圳)有限公司 Address information processing method, terminal device and computer readable storage medium
CN111488409A (en) * 2019-01-25 2020-08-04 阿里巴巴集团控股有限公司 City address library construction method, retrieval method and device
CN112115214B (en) * 2019-06-20 2024-04-02 中科聚信信息技术(北京)有限公司 Address standardization method, address standardization device and electronic equipment
CN112884390A (en) * 2019-11-29 2021-06-01 北京三快在线科技有限公司 Order processing method and device, readable storage medium and electronic equipment
CN113111230B (en) * 2020-02-13 2024-04-12 北京明亿科技有限公司 Regular expression-based alarm receiving text home address extraction method and device
CN113111229B (en) * 2020-02-13 2024-04-12 北京明亿科技有限公司 Regular expression-based alarm receiving text track address extraction method and device
CN111522901B (en) * 2020-03-18 2023-10-20 大箴(杭州)科技有限公司 Method and device for processing address information in text
US11523250B1 (en) * 2021-05-12 2022-12-06 Valassis Digital Corp. Computer system with features for determining reliable location data using messages with unreliable location data
CN113515677B (en) * 2021-07-22 2023-10-27 中移(杭州)信息技术有限公司 Address matching method, device and computer readable storage medium

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2267618A3 (en) 2000-06-02 2011-05-04 Navteq North America, LLC Method and system for forming a keyword database for referencing physical locations
US6487495B1 (en) 2000-06-02 2002-11-26 Navigation Technologies Corporation Navigation applications using related location-referenced keywords
US7376636B1 (en) * 2002-06-07 2008-05-20 Oracle International Corporation Geocoding using a relational database
US6934634B1 (en) * 2003-09-22 2005-08-23 Google Inc. Address geocoding
CN101350012B (en) * 2007-07-18 2013-01-16 北京灵图软件技术有限公司 Method and system for matching address
US8150848B2 (en) * 2008-01-04 2012-04-03 Google Inc. Geocoding multi-feature addresses
US8867999B2 (en) * 2009-01-26 2014-10-21 Qualcomm Incorporated Downlink interference cancellation methods
CN101719128B (en) * 2009-12-31 2012-05-23 浙江工业大学 Fuzzy matching-based Chinese geo-code determination method
US20110270815A1 (en) * 2010-04-30 2011-11-03 Microsoft Corporation Extracting structured data from web queries
CN101882163A (en) * 2010-06-30 2010-11-10 中国科学院地理科学与资源研究所 Fuzzy Chinese address geographic evaluation method based on matching rule
CN101996248B (en) * 2010-11-10 2012-10-10 百度在线网络技术(北京)有限公司 Address lookup method and device
WO2013155619A1 (en) * 2012-04-20 2013-10-24 Sam Pasupalak Conversational agent
US9544721B2 (en) * 2013-07-26 2017-01-10 Apple Inc. Address point data mining
CN103914544A (en) * 2014-04-03 2014-07-09 浙江大学 Method for quickly matching Chinese addresses in multi-level manner on basis of address feature words

Also Published As

Publication number Publication date
EP3153978A1 (en) 2017-04-12
EP3153978A4 (en) 2017-10-18
WO2016050088A1 (en) 2016-04-07
US20170116224A1 (en) 2017-04-27
CN105528372B (en) 2019-05-24
EP3153978B1 (en) 2020-04-22
CN105528372A (en) 2016-04-27
US10783171B2 (en) 2020-09-22

Legal Events

Date Code Title Description
AS Assignment

Owner name: HUAWEI TECHNOLOGIES CO., LTD., CHINA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:QI, QUAN;ZHANG, JIULONG;LI, HANG;SIGNING DATES FROM 20151009 TO 20151015;REEL/FRAME:053220/0390

STPP Information on status: patent application and granting procedure in general

Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION

STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER

STPP Information on status: patent application and granting procedure in general

Free format text: FINAL REJECTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: ADVISORY ACTION MAILED

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION