CN107463583A - Method and apparatus for determining the region of an application developer - Google Patents

Method and apparatus for determining the region of an application developer

Info

Publication number
CN107463583A
Authority
CN
China
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610397799.1A
Other languages
Chinese (zh)
Inventor
康明吉
路博
王跃
王洪岭
秦娇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangzhou Taier Zhixin Technology Co Ltd
Original Assignee
Guangzhou Taier Zhixin Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangzhou Taier Zhixin Technology Co Ltd
Priority to CN201610397799.1A
Publication of CN107463583A
Legal status: Pending

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00: Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90: Details of database functions independent of the retrieved data types
    • G06F16/95: Retrieval from the web
    • G06F16/953: Querying, e.g. by the use of web search engines
    • G06F16/9537: Spatial or temporal dependent retrieval, e.g. spatiotemporal queries

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The present invention provides a method for determining the region of an application developer. The method first obtains the name of the application developer, then extracts an administrative division field from that name, and finally determines the region of the application developer according to the administrative division field. The invention addresses the lack, in the prior art, of a method for determining the region of the developer of each application in each application store. Its steps are simple, and it can determine the regions of the developers of many applications quickly and in batches.

Description

Method and apparatus for determining the region of an application developer
Technical field
The present invention relates to the field of application information acquisition, and specifically to a method for determining the region of an application developer and an apparatus for determining the region of an application developer.
Background
With the rapid spread of intelligent terminals such as smartphones and tablet computers, application programs (abbreviated "applications" or "apps") built for the iOS, Android, and Windows operating systems have entered consumers' lives in fields such as social networking, shopping, transportation, services, medical care, and communications, and the total number of applications is growing explosively. At present there are more than 1,500,000 applications for iOS, and the number of applications for the open-source Android operating system is larger still. These applications are published in the major application stores on the internet for users to download and install.
Because application development is an important aspect of internet development, information such as an application's developer and the developer's region gives a macroscopic view of the regional distribution of internet technology enterprises, and in turn of the number, scale, and development of internet enterprises in each province, city, and region. This information is an important guide for government macro-level regulation and for enterprises' strategic planning and market analysis. It is therefore necessary to know the region of the developer of each application on the market.
At present there are dozens of application stores in China, each listing millions of applications. The download page of each application shows information such as the application's name, version, download count, and developer, but generally does not state the developer's region. The prior art offers no method for determining the region of the developer of each application in each application store.
Summary of the invention
In view of the above problem, there is an urgent need for a method that can determine the region of the developer of each application in each application store, and for a corresponding apparatus.
The technical solution adopted by the present invention is as follows.
The application provides a method for determining the region of an application developer, including:
Obtaining the name of the application developer;
Extracting an administrative division field from the name of the application developer;
Determining the region of the application developer according to the administrative division field.
Optionally, before the extracting of an administrative division field from the name of the application developer, the method further includes:
Judging whether the name of the application developer contains an administrative division field;
If not, extracting a trade-name field from the name of the application developer;
Crawling the network for geographical location information of the application developer according to the trade-name field, and determining the region of the application developer according to the geographical location information.
Optionally, the crawling of the network for geographical location information of the application developer according to the trade-name field, and the determining of the region of the application developer according to the geographical location information, include:
Crawling the corresponding encyclopedia entry on an encyclopedia website, using the trade-name field as the search term;
Retrieving the geographical location information of the application developer from the encyclopedia entry, and determining the region of the application developer according to the geographical location information.
Optionally, the crawling of the network for geographical location information of the application developer according to the trade-name field, and the determining of the region of the application developer according to the geographical location information, include:
Crawling the company web page of the application developer according to the trade-name field;
Obtaining the ICP information of the application developer from the company web page;
Determining the region of the application developer according to the first Chinese character of the ICP information.
Optionally, the extracting of a trade-name field from the name of the application developer includes:
Determining, according to a preset industry-characteristics database and a preset organizational-form database, the industry-characteristics field and the organizational-form field in the name of the application developer;
Deleting the industry-characteristics field and the organizational-form field from the name of the application developer to obtain a remaining field;
Adding the remaining field, according to its degree of similarity, to the corresponding group in a classification table, and taking the shortest field in that group as the trade-name field of the application developer.
Optionally, the extracting of an administrative division field from the name of the application developer includes:
Performing word segmentation on the name of the application developer using a predetermined administrative division dictionary to obtain the administrative division field.
Optionally, the obtaining of the name of the application developer includes:
Crawling the name of an application's developer from an application store using a network crawling method.
Correspondingly, the application also provides an apparatus for determining the region of an application developer, including:
A developer name acquisition module, for obtaining the name of the application developer;
An administrative division extraction module, for extracting an administrative division field from the name of the application developer;
An administrative division region determination module, for determining the region of the application developer according to the administrative division field.
Optionally, the apparatus for determining the region of an application developer further includes:
An administrative division judgment module, for judging whether the name of the application developer contains an administrative division field;
A trade-name field extraction module, for extracting a trade-name field from the name of the application developer if it does not;
A trade-name field region determination module, for crawling the network for geographical location information of the application developer according to the trade-name field, and determining the region of the application developer according to the geographical location information.
Optionally, the trade-name field region determination module includes:
An encyclopedia entry crawling unit, for crawling the corresponding encyclopedia entry on an encyclopedia website, using the trade-name field as the search term;
An encyclopedia entry region determination unit, for retrieving the geographical location information of the application developer from the encyclopedia entry, and determining the region of the application developer according to the geographical location information.
Optionally, the trade-name field region determination module includes:
A company web page crawling unit, for crawling the company web page of the application developer according to the trade-name field;
An ICP information acquisition unit, for obtaining the ICP information of the application developer from the company web page;
An ICP region determination unit, for determining the region of the application developer according to the first Chinese character of the ICP information.
Optionally, the trade-name field extraction module includes:
An organizational form determination unit, for determining, according to a preset industry-characteristics database and a preset organizational-form database, the industry-characteristics field and the organizational-form field in the name of the application developer;
An organizational form deletion unit, for deleting the industry-characteristics field and the organizational-form field from the name of the application developer to obtain a remaining field;
A trade-name field acquisition unit, for adding the remaining field, according to its degree of similarity, to the corresponding group in a classification table, and taking the shortest field in that group as the trade-name field of the application developer.
Optionally, the administrative division extraction module includes:
An administrative division word segmentation unit, for performing word segmentation on the name of the application developer using a predetermined administrative division dictionary to obtain the administrative division field.
Optionally, the developer name acquisition module includes:
A developer name crawling unit, for crawling the name of an application's developer from an application store using a network crawling method.
Compared with the prior art, the present invention has the following advantages:
The method provided by the invention first obtains the name of the application developer, then extracts an administrative division field from that name, and finally determines the region of the application developer according to the administrative division field. The invention addresses the lack, in the prior art, of a method for determining the region of the developer of each application in each application store. Its steps are simple, and it can determine the regions of the developers of many applications quickly and in batches.
Further, because some application developer names may not contain an administrative division field, the invention can also extract a trade-name field from the developer name and crawl the network for the developer's geographical location information according to that field, which likewise allows the developer's region to be determined. This handles the case in which the region cannot be determined because there is no administrative division field, broadens the applicability of the invention, and so raises the success rate of determining the region of an application developer.
Brief description of the drawings
To illustrate the technical solutions of the embodiments of the present invention more clearly, the drawings used in the embodiments are briefly described below. The following drawings show only certain embodiments of the invention and should not be regarded as limiting its scope. A person of ordinary skill in the art can derive other related drawings from them without creative effort.
Fig. 1 is a flow chart of an embodiment of the method for determining the region of an application developer provided by the invention;
Fig. 2 is a schematic diagram of an embodiment of the apparatus for determining the region of an application developer provided by the invention.
Detailed description
To make the purpose, technical solutions, and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the drawings. The described embodiments are only some of the embodiments of the invention, not all of them. The detailed description of the embodiments given with the drawings is therefore not intended to limit the claimed scope of the invention. The application can be implemented in many ways other than those described here, and those skilled in the art can generalize similarly without departing from the substance of the application, so the application is not limited by the specific implementations disclosed below.
Because the prior art lacks a method for determining the region of the developer of each application in each application store, the embodiments of the invention provide a method for determining the region of an application developer and a corresponding apparatus. The embodiments are described in detail below, in turn, with reference to the drawings.
Refer to Fig. 1, a flow chart of an embodiment of the method for determining the region of an application developer provided by the invention. The method includes the following steps:
Step S101: Obtain the name of the application developer.
Here, the name of the application developer is the name of the enterprise that developed a given application. For example, the developer of the QQ application is "Shenzhen Tencent Computer Systems Co., Ltd.", and the developer of the Didi Dache ride-hailing application is "Beijing Xiaoju Technology Co., Ltd.".
In this step, the name of the application developer can be crawled directly from the major application stores using a network crawling method. Because developer names obtained from some application stores may contain garbled text or other stray characters, the names can also be crawled in advance and then cleaned up and corrected before use.
The network crawling method, also called an internet data collection method or a web crawler, automatically discovers and fetches web pages from the internet and looks up target data in them. By crawling principle, web crawlers are generally divided into traditional crawlers and focused crawlers. A traditional crawler starts from the URLs of one or several initial pages, obtains the URLs on those pages, and, while fetching pages, keeps extracting new URLs from the current page and adding them to a queue until a stop condition of the system is met. Put simply, it obtains the desired content by parsing page source code. The workflow of a focused crawler is more complex: it uses a web page analysis algorithm to filter out links unrelated to the topic, keeps the useful links, and puts them in the queue of URLs waiting to be fetched. It then selects the next page URL to fetch from the queue according to a search strategy, and repeats this process until a stop condition of the system is reached. In addition, all pages fetched by the crawler are stored by the system, analyzed, filtered, and indexed for later query and retrieval. For a focused crawler, the results of this analysis may also provide feedback and guidance for later crawling.
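The queue-driven loop of a traditional crawler can be sketched as follows. This is a minimal illustration, not the implementation used by the invention: the names `crawl`, `LinkExtractor`, `fetch`, and `max_pages` are hypothetical, and the page-download function is passed in by the caller so that the sketch does not depend on any particular HTTP library.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkExtractor(HTMLParser):
    """Collects the href value of every <a> tag on a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl(seed_urls, fetch, max_pages=100):
    """Breadth-first crawl. `fetch(url)` is a caller-supplied (hypothetical)
    function that returns the page HTML as a string, or None on failure."""
    queue = deque(seed_urls)          # URLs waiting to be fetched
    seen = set(seed_urls)             # URLs already queued, to avoid repeats
    pages = {}
    while queue and len(pages) < max_pages:   # stop condition (an assumption)
        url = queue.popleft()
        html = fetch(url)
        if html is None:
            continue
        pages[url] = html
        parser = LinkExtractor()
        parser.feed(html)
        for href in parser.links:
            absolute = urljoin(url, href)     # resolve relative links
            if absolute not in seen:
                seen.add(absolute)
                queue.append(absolute)
    return pages
```

A focused crawler would add a relevance check before `queue.append`, keeping only links judged to be related to the topic.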
A typical network crawling tool is the Nutch crawler, which has two parts, a crawler and a searcher. The crawler fetches web pages from the network and builds an index for them; the searcher uses the index to look up the user's search keywords and produce the results, that is, the target data. Given the URL of an application store, the Nutch crawler can automatically open the application pages linked in the store and extract from them content such as the application name, application code, application version, application developer, download count, and application description.
The above describes web crawlers by way of example only. Depending on programming language and application environment, the prior art offers many kinds of web crawlers, such as Java, Python, C++, C#, PHP, Erlang, and Ruby crawlers. These are mature prior-art foundations and are not described further here; all of them fall within the protection scope of the application.
Step S102: Extract an administrative division field from the name of the application developer.
Step S101 yields the name of the application developer. Because the developer name is an enterprise name, it generally consists of four parts: administrative division, trade name, industry or business characteristics, and organizational form. For example, in the enterprise name "Shenzhen Tencent Computer Systems Co., Ltd.", "Shenzhen" is the administrative division, "Tencent" is the trade name, "Computer Systems" is the industry or business characteristic, and "Co., Ltd." is the organizational form. From the administrative division field "Shenzhen", the region of the application developer can be determined to be Shenzhen.
This step extracts the administrative division field from the name of the application developer, and the extraction can be implemented in several ways. One embodiment searches the developer name for keywords such as "province", "city", "district", and "county", and extracts the keyword together with the text preceding it as the administrative division field. Another embodiment first builds an administrative division database containing the names of all administrative divisions in the country, then traverses it, comparing each division name in turn against the developer name; when the developer name is found to contain a division name, the corresponding part of the developer name is extracted as the administrative division field. A further embodiment uses a Trie tree to build, in advance, an administrative division dictionary containing the names of all administrative divisions in the country (such as "Beijing" and "Shenzhen"), and then performs word segmentation on the developer name with this dictionary to obtain the administrative division field. For example, searching "Shenzhen Tencent Computer Systems Co., Ltd." against the administrative division dictionary identifies the administrative division field "Shenzhen" (a feature word stored in the dictionary), which is then extracted.
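The Trie-based dictionary lookup can be sketched as below. This is one possible reading of the embodiment, under stated assumptions: the function names `build_trie` and `find_division` are hypothetical, the Chinese division names are sample entries rather than a real dictionary, and a leftmost-longest match is used in place of a full word segmenter.

```python
def build_trie(words):
    """Build a character trie; the key "$end" marks the end of a stored word."""
    root = {}
    for word in words:
        node = root
        for ch in word:
            node = node.setdefault(ch, {})
        node["$end"] = word
    return root


def find_division(name, trie):
    """Return the leftmost, longest dictionary word found in `name`, or None."""
    for start in range(len(name)):
        node = trie
        match = None
        for ch in name[start:]:
            if ch not in node:
                break
            node = node[ch]
            if "$end" in node:
                match = node["$end"]   # keep extending to prefer the longest word
        if match:
            return match
    return None


# Sample dictionary entries (assumed); a real dictionary would list every division.
DIVISION_TRIE = build_trie(["北京", "北京市", "深圳", "深圳市", "杭州市"])
```

With these sample entries, `find_division("深圳市腾讯计算机系统有限公司", DIVISION_TRIE)` returns `"深圳市"`, and a name that contains no dictionary word returns `None`, which also serves as the check for whether a name contains an administrative division field.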
It is easy to see that the above describes, by way of example only, ways of extracting the administrative division field from the name of the application developer. The prior art offers many variations, which are not described further here; all of them fall within the protection scope of the application.
Step S103: Determine the region of the application developer according to the administrative division field.
Step S102 extracts the administrative division field from the name of the application developer, and the region of the application developer can be determined from that field. In a specific implementation, the administrative division field can be used directly as the region of the application developer. Alternatively, an application developer region database can be generated in advance and queried for the region name corresponding to the administrative division field, so that the region of the application developer is output in a unified format or at a unified administrative level.
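The lookup against a pre-built region database can be as simple as a mapping from division field to a unified region name. The table contents and the name `resolve_region` below are assumptions for illustration; the source does not specify the database schema.

```python
# Hypothetical region table: administrative division field -> province-level region.
REGION_TABLE = {
    "深圳市": "广东省",
    "杭州市": "浙江省",
    "北京市": "北京市",
}


def resolve_region(division_field, table=REGION_TABLE):
    """Return the unified region name; fall back to the field itself if unlisted."""
    return table.get(division_field, division_field)
```

Falling back to the raw field corresponds to the simpler option of using the administrative division field directly as the region.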
This completes, through steps S101 to S103, the flow for determining the region of an application developer.
The invention addresses the lack, in the prior art, of a method for determining the region of the developer of each application in each application store. Its steps are simple, and it can determine the regions of the developers of many applications quickly and in batches.
Because application stores may be lax when entering developer names, may not manage developer names strictly, or for other reasons, the name of the application developer may not contain an administrative division field. Therefore, in one embodiment provided by the application, before the extracting of an administrative division field from the name of the application developer, the method further includes:
Judging whether the name of the application developer contains an administrative division field;
If not, extracting a trade-name field from the name of the application developer;
Crawling the network for geographical location information of the application developer according to the trade-name field, and determining the region of the application developer according to the geographical location information.
One way to judge whether the name of the application developer contains an administrative division field is to traverse the administrative division dictionary and match each administrative division in it against the name of the application developer. If no match succeeds by the end of the traversal, the name of the application developer does not contain an administrative division field.
Because each enterprise chooses its trade name freely, the trade-name field can be extracted by deleting the other fields from the developer name. Since it has already been judged that the name of the application developer contains no administrative division field, in one embodiment provided by the application the extracting of a trade-name field from the name of the application developer includes:
Determining, according to a preset industry-characteristics database and a preset organizational-form database, the industry-characteristics field and the organizational-form field in the name of the application developer;
Deleting the industry-characteristics field and the organizational-form field from the name of the application developer to obtain a remaining field;
Adding the remaining field, according to its degree of similarity, to the corresponding group in a classification table, and taking the shortest field in that group as the trade-name field of the application developer.
The industry-characteristics database stores multiple industry or business-characteristics fields, such as "Technology", "Production", and "Optoelectronics". The organizational-form database stores multiple organizational-form fields, such as "Co., Ltd.", "Corporation Limited", and "Center". In this embodiment, the industry-characteristics field and the organizational-form field in the name of the application developer can be determined in the same way as in the embodiments for extracting the administrative division field described above. Once these two fields are deleted, the remaining field is where the trade-name field lies, and in general the remaining field can be extracted and used as the trade-name field.
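Deleting the known fields can be sketched as plain substring removal. This is a simplification under stated assumptions: the function name `strip_known_fields` and the sample terms are hypothetical, and the two databases are represented as short in-memory lists.

```python
def strip_known_fields(name, industry_terms, org_forms):
    """Delete every listed industry and organizational-form term from `name`."""
    remaining = name
    # Remove longer terms first, so that deleting "有限公司" does not leave
    # a stray "股份" behind when the name ends in "股份有限公司".
    terms = sorted(set(industry_terms) | set(org_forms), key=len, reverse=True)
    for term in terms:
        remaining = remaining.replace(term, "")
    return remaining


# Sample database contents (assumed for illustration).
INDUSTRY_TERMS = ["科技", "计算机系统", "光电子"]
ORG_FORMS = ["有限公司", "股份有限公司", "中心"]
```

For a name with no administrative division field, such as `"腾讯科技有限公司"`, the call `strip_known_fields("腾讯科技有限公司", INDUSTRY_TERMS, ORG_FORMS)` leaves `"腾讯"`.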
However, the name of the application developer may have picked up stray characters or input errors when it was entered, may contain an industry or business-characteristics field that is not listed in the industry-characteristics database, or may contain an organizational-form field that is not listed in the organizational-form database. For these and similar reasons, the remaining field may contain other characters besides the trade-name field. Therefore, in one embodiment provided by the application, the developer names of multiple applications are processed together. After the remaining fields are obtained, a developer name classification table is established. The table contains multiple groups, one per distinct developer, and each group holds multiple developer names that in essence belong to the same developer. For example, the "Tencent" group contains "Tencent", "Tencent Technology", "Tencent Group", "Tencent Information", and other names that all in essence denote Tencent. The developer names of the multiple applications are then traversed and compared against the classification table. If a developer name is similar to a name already in the table, it is added to the corresponding group; similarity can be judged by checking whether one name contains the other or whether the two share an identical common substring. After all the developer names have been compared, the shortest field in each group is taken as the trade-name field of the application developer. In the "Tencent" group above, the field "Tencent" is thus taken as the trade-name field of the application developer.
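The grouping and shortest-field selection can be sketched as follows. The function names `similar`, `group_names`, and `trade_name` are hypothetical, and the minimum common-substring length of two characters is an assumption; the source only says that the two names share an identical common substring.

```python
def similar(a, b, min_common=2):
    """True if one name contains the other, or if they share a common
    substring of at least `min_common` characters (threshold is assumed)."""
    if a in b or b in a:
        return True
    return any(a[i:i + min_common] in b for i in range(len(a) - min_common + 1))


def group_names(names):
    """Place each name into the first group holding a similar name,
    or start a new group when none is similar."""
    groups = []
    for name in names:
        for group in groups:
            if any(similar(name, member) for member in group):
                group.append(name)
                break
        else:
            groups.append([name])
    return groups


def trade_name(group):
    """Take the shortest field in a group as the trade-name field."""
    return min(group, key=len)
```

Applied to `["腾讯", "腾讯科技", "腾讯集团", "腾讯信息", "阿里巴巴", "阿里巴巴网络"]`, this yields two groups whose trade names are `"腾讯"` and `"阿里巴巴"`. Empty remaining fields should be filtered out beforehand, since an empty string is contained in every name.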
Crawling the network for geographical location information of the application developer according to the trade-name field, and determining the region of the application developer according to that information, can be implemented in several ways. For example, in one embodiment provided by the application, this includes:
Crawling the corresponding encyclopedia entry on an encyclopedia website, using the trade-name field as the search term;
Retrieving the geographical location information of the application developer from the encyclopedia entry, and determining the region of the application developer according to the geographical location information.
In this embodiment, the trade-name field is used to retrieve the corresponding entry on an encyclopedia website such as Baidu Baike, Wikipedia, or Hudong Baike. Because an enterprise's encyclopedia entry generally includes the enterprise's geographical location, the geographical location information of the application developer can be retrieved from the entry, and the region of the application developer determined from it. The geographical location information can be used directly as the region of the application developer. Alternatively, an application developer region database can be generated in advance and queried for the region name corresponding to the geographical location information, so that the region of the application developer is output in a unified format or at a unified administrative level. For example, the Baidu Baike entry for "Alibaba" records "Headquarters location: Hangzhou, China". Once this information is retrieved, the region of the application developer "Alibaba" can be determined to be "Hangzhou" from "Hangzhou, China". The predetermined application developer region database can also be queried for the region name "Zhejiang Province" corresponding to the geographical location "Hangzhou", so that the region of every application developer is determined uniformly at the provincial level.
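Once the entry text has been fetched, retrieving the location can be sketched with a regular expression. The field label "总部地点" (headquarters location), the function names, and the city table are assumptions for illustration; real encyclopedia pages vary in layout and would need a more tolerant parser.

```python
import re

# Matches a labelled headquarters field such as "总部地点：中国杭州" (label assumed).
HQ_PATTERN = re.compile(r"总部地点\s*[:：]\s*([^\s，,。；;]+)")

# Hypothetical table: city keyword -> province-level region.
CITY_TO_PROVINCE = {"杭州": "浙江省", "深圳": "广东省", "北京": "北京市"}


def headquarters_from_entry(entry_text):
    """Return the raw headquarters string from the entry text, or None."""
    match = HQ_PATTERN.search(entry_text)
    return match.group(1) if match else None


def region_from_location(location, table=CITY_TO_PROVINCE):
    """Return (city, province) for the first listed city found in `location`."""
    for city, province in table.items():
        if city in location:
            return city, province
    return None
```

For entry text containing "总部地点：中国杭州", `headquarters_from_entry` returns `"中国杭州"`, and `region_from_location` maps that to `("杭州", "浙江省")`.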
It is described that the application is crawled according to the font size field in a network in another embodiment that the application provides The geographical location information of developer, the region of the application developer is determined according to the geographical location information, including:
The company web page of the application developer is crawled according to the font size field;
The ICP information of the application developer is obtained from the company web page;
First Chinese character in the ICP information determines the region of the application developer.
ICP, full name Web content service provider, English full name are Internet Content Provider, because country is right Operational Internet Information Service carries out licensing system, and filing system is carried out to non-profit-making Internet Information Service, therefore, There should be ICP information i.e. ICP codings in the webpage of each company, ICP codings are sent out by unification of the motherland core, for example, capital ICP 000007, Anhui ICP is demonstrate,proved for No. 05001217 etc., wherein, first Chinese character of ICP information is the abbreviation of provinces and cities where enterprise, because This, first Chinese character in the ICP information determines the region of the application developer.
In this embodiment, the company website of the application developer is first searched for on the network according to the trade-name field. For example, the trade name is searched on Baidu, and each result is then judged by the ending of its domain name: only a result whose domain name ends in com, cn, or the like is regarded as the company homepage of the application developer. The company web page is then crawled, the ICP information of the application developer is obtained from it, and the first Chinese character of the ICP information is extracted. The corresponding full name is then looked up in a predetermined database that maps the full names of the country's provinces and municipalities to their abbreviations, which yields the region of the application developer. For example, if the ICP information is "京ICP证000007", the region of the application developer can be determined to be Beijing.
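The domain check and the ICP lookup can be illustrated with a short sketch. It assumes the search results and the page text are already available, so no crawling is performed. `PROVINCE_ABBREVIATIONS`, `ACCEPTED_SUFFIXES`, `ICP_PATTERN`, and both function names are hypothetical, the abbreviation table is only a small excerpt, and the regular expression is one plausible way to locate an ICP number rather than the one used in the application.

```python
import re
from typing import Optional
from urllib.parse import urlparse

# Hypothetical excerpt of the abbreviation -> full-name database.
PROVINCE_ABBREVIATIONS = {
    "京": "北京市", "沪": "上海市", "浙": "浙江省", "粤": "广东省", "皖": "安徽省",
}

# Domain endings accepted as a company homepage (the text names com and cn).
ACCEPTED_SUFFIXES = (".com", ".cn")

# One Chinese character directly before "ICP", followed by 证 (licence) or 备 (filing).
ICP_PATTERN = re.compile(r"([\u4e00-\u9fff])\s*ICP\s*[证备]")


def looks_like_company_homepage(url: str) -> bool:
    """Keep only search results whose host name ends in an accepted suffix."""
    host = urlparse(url).hostname or ""
    return host.endswith(ACCEPTED_SUFFIXES)


def region_from_icp(page_text: str) -> Optional[str]:
    """Map the first Chinese character of the ICP number to a full region name."""
    match = ICP_PATTERN.search(page_text)
    if match is None:
        return None
    return PROVINCE_ABBREVIATIONS.get(match.group(1))


print(looks_like_company_homepage("https://www.example.com/about"))
print(region_from_icp("Copyright 京ICP证000007号"))
```

A page with no recognizable ICP number yields `None`, which leaves room for another lookup method to be tried.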
It should be noted that the two embodiments above use the encyclopedia entry and the ICP information, respectively, to determine the region of the application developer. Either approach may be used alone, or the two may be used together. For example, the region is first determined from the encyclopedia entry, and when the entry contains no corresponding geographic location information, the ICP information is then used. Determining the region from several sources in this way improves the compatibility of the present invention and, in turn, its success rate in determining the region of an application developer.
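Using the two approaches together amounts to trying each source in order until one returns a region. The sketch below shows that ordering only. The two resolver functions are hypothetical stubs backed by small dictionaries, standing in for the real encyclopedia and ICP lookups.

```python
from typing import Callable, Iterable, Optional

# A resolver takes a trade name and returns a region, or None when it finds nothing.
Resolver = Callable[[str], Optional[str]]


def resolve_region(trade_name: str, resolvers: Iterable[Resolver]) -> Optional[str]:
    """Try each resolver in order and return the first region found."""
    for resolver in resolvers:
        region = resolver(trade_name)
        if region:
            return region
    return None


# Hypothetical stubs standing in for the two lookups described in the text.
def encyclopedia_resolver(trade_name: str) -> Optional[str]:
    return {"阿里巴巴": "浙江省"}.get(trade_name)


def icp_resolver(trade_name: str) -> Optional[str]:
    return {"示例科技": "北京市"}.get(trade_name)


chain = [encyclopedia_resolver, icp_resolver]
print(resolve_region("阿里巴巴", chain))  # answered by the encyclopedia stub
print(resolve_region("示例科技", chain))  # falls through to the ICP stub
```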
The above embodiments provide a method for determining the region of an application developer. Correspondingly, the present application also provides an apparatus for determining the region of an application developer. Refer to Fig. 2, which is a schematic diagram of an embodiment of the apparatus provided by the present invention. Because the apparatus embodiment is substantially similar to the method embodiment, it is described relatively briefly; for related details, refer to the corresponding description of the method embodiment. The apparatus embodiment described below is merely illustrative.
An apparatus for determining the region of an application developer provided by this embodiment includes:
A developer name acquisition module 101, configured to obtain the name of an application developer;
An administrative division extraction module 102, configured to extract an administrative division field from the name of the application developer;
An administrative division region determination module 103, configured to determine the region of the application developer according to the administrative division field.
In one embodiment provided by the present application, the apparatus for determining the region of an application developer further includes:
An administrative division judgment module, configured to judge whether the name of the application developer contains an administrative division field;
A trade-name field extraction module, configured to extract a trade-name field (字号) from the name of the application developer if it does not;
A trade-name region determination module, configured to crawl the geographic location information of the application developer from the network according to the trade-name field, and to determine the region of the application developer according to the geographic location information.
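The way these modules cooperate can be sketched as a single decision: use the administrative division when the name contains one, and otherwise fall back to the trade-name path. The code below is illustrative only. `ADMIN_DIVISIONS`, `find_admin_division`, and `determine_region` are hypothetical names, the company names are made up, and the trade-name extraction and online lookup are passed in as callables so that the sketch stays independent of how they are implemented.

```python
from typing import Callable, Optional

# Hypothetical excerpt of an administrative-division table: division -> region.
ADMIN_DIVISIONS = {"广州": "广东省", "北京": "北京市", "杭州": "浙江省"}


def find_admin_division(name: str) -> Optional[str]:
    """Judgment step: return the administrative division in the name, if any."""
    # Longer entries are checked first so the most specific division wins.
    for division in sorted(ADMIN_DIVISIONS, key=len, reverse=True):
        if division in name:
            return division
    return None


def determine_region(
    name: str,
    extract_trade_name: Callable[[str], str],
    lookup_online: Callable[[str], Optional[str]],
) -> Optional[str]:
    """Use the administrative division if present, else the trade-name path."""
    division = find_admin_division(name)
    if division is not None:
        return ADMIN_DIVISIONS[division]
    return lookup_online(extract_trade_name(name))


# Hypothetical stand-ins for the trade-name extraction and the online lookup.
def stub_extract(name: str) -> str:
    return "阿里巴巴"


stub_lookup = {"阿里巴巴": "浙江省"}.get

print(determine_region("广州示例科技有限公司", stub_extract, stub_lookup))
print(determine_region("阿里巴巴网络技术有限公司", stub_extract, stub_lookup))
```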
In one embodiment provided by the present application, the trade-name region determination module includes:
An encyclopedia entry crawling unit, configured to crawl the corresponding encyclopedia entry from an encyclopedia website using the trade-name field as the search term;
An encyclopedia entry region determination unit, configured to retrieve the geographic location information of the application developer from the encyclopedia entry, and to determine the region of the application developer according to the geographic location information.
In one embodiment provided by the present application, the trade-name region determination module includes:
A company web page crawling unit, configured to crawl the company web page of the application developer according to the trade-name field;
An ICP information acquisition unit, configured to obtain the ICP information of the application developer from the company web page;
An ICP region determination unit, configured to determine the region of the application developer according to the first Chinese character in the ICP information.
In one embodiment provided by the present application, the trade-name field extraction module includes:
An organizational form determination unit, configured to determine the industry characteristic field and the organizational form field in the name of the application developer according to a preset industry characteristic database and a preset organizational form database;
An organizational form deletion unit, configured to delete the industry characteristic field and the organizational form field from the name of the application developer to obtain a remaining field;
A trade-name field acquisition unit, configured to add the remaining field, according to its degree of similarity, to the corresponding group in a classification table, and to take the shortest field in that group as the trade-name field of the application developer.
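The three units above can be sketched as: strip the known terms, place what remains into a group of similar strings, and take the shortest member of that group. The application does not say how similarity is measured, so the substring test and the `difflib.SequenceMatcher` ratio with a 0.6 threshold are assumptions, as are the two term lists and every function name below.

```python
from difflib import SequenceMatcher
from typing import List

# Hypothetical excerpts of the industry-characteristic and organizational-form databases.
INDUSTRY_TERMS = ["网络技术", "信息技术", "科技", "软件"]
ORG_FORMS = ["股份有限公司", "有限责任公司", "有限公司", "集团"]


def strip_terms(name: str) -> str:
    """Delete industry and organizational-form terms, longest first."""
    remaining = name
    for term in sorted(INDUSTRY_TERMS + ORG_FORMS, key=len, reverse=True):
        remaining = remaining.replace(term, "")
    return remaining


def similar(a: str, b: str, threshold: float = 0.6) -> bool:
    """Assumed similarity test: containment, or a high SequenceMatcher ratio."""
    return a in b or b in a or SequenceMatcher(None, a, b).ratio() >= threshold


def add_to_groups(groups: List[List[str]], remaining: str) -> List[str]:
    """Add the remaining field to the first similar group, or start a new one."""
    for group in groups:
        if any(similar(remaining, member) for member in group):
            group.append(remaining)
            return group
    groups.append([remaining])
    return groups[-1]


def trade_name(group: List[str]) -> str:
    """The shortest field in a group is taken as the trade name."""
    return min(group, key=len)


groups: List[List[str]] = []
for company in ["阿里巴巴云计算有限公司", "阿里巴巴网络技术有限公司", "腾讯科技有限公司"]:
    add_to_groups(groups, strip_terms(company))
print([trade_name(g) for g in groups])
```

Taking the shortest member means a leftover fragment such as 云计算, which is missing from the term lists, does not end up in the trade name as long as a cleaner variant is in the same group.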
In one embodiment provided by the present application, the administrative division extraction module 102 includes:
An administrative division word segmentation unit, configured to perform word segmentation on the name of the application developer using a predetermined administrative division dictionary, to obtain the administrative division field.
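Dictionary-based segmentation of this kind can be illustrated with forward maximum matching, a common strategy for Chinese word segmentation. The application does not state which segmentation algorithm is used, so this choice is an assumption, and `ADMIN_DICTIONARY`, `segment`, and `admin_fields` are hypothetical names applied to made-up company names.

```python
from typing import List

# Hypothetical excerpt of the predetermined administrative-division dictionary.
ADMIN_DICTIONARY = {"广东", "广东省", "广州", "广州市", "北京", "北京市", "杭州", "浙江"}
MAX_WORD_LEN = max(len(word) for word in ADMIN_DICTIONARY)


def segment(name: str) -> List[str]:
    """Forward maximum matching: take the longest dictionary word at each position."""
    tokens = []
    i = 0
    while i < len(name):
        for size in range(min(MAX_WORD_LEN, len(name) - i), 0, -1):
            piece = name[i:i + size]
            # A single character is always accepted so the scan keeps moving.
            if size == 1 or piece in ADMIN_DICTIONARY:
                tokens.append(piece)
                i += size
                break
    return tokens


def admin_fields(name: str) -> List[str]:
    """Keep only the tokens that are administrative divisions."""
    return [token for token in segment(name) if token in ADMIN_DICTIONARY]


print(admin_fields("广东省广州市示例科技有限公司"))
```

An empty result means the name contains no administrative division field, which is the case in which the trade-name field would be extracted instead.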
In one embodiment provided by the present application, the developer name acquisition module 101 includes:
A developer name crawling unit, configured to crawl the name of an application's developer from an application store using a web crawling method.
The above is an embodiment of the apparatus for determining the region of an application developer provided by the present invention.
It should be noted that similar reference numerals and letters denote similar items in the following drawings; therefore, once an item is defined in one drawing, it need not be further defined or explained in subsequent drawings.
In the description of the present invention, it should also be noted that, unless otherwise expressly specified and limited, the terms "arranged", "mounted", "connected", and "coupled" are to be understood broadly. For example, a connection may be fixed, detachable, or integral; it may be mechanical or electrical; it may be direct, or indirect through an intermediary; and it may be an internal connection between two elements. A person of ordinary skill in the art can understand the specific meaning of these terms in the present invention according to the specific circumstances.
Finally, it should be noted that the embodiments described above are merely specific embodiments of the present invention, intended to illustrate its technical solutions rather than to limit them, and the protection scope of the present invention is not limited to them. Although the present invention has been described in detail with reference to the foregoing embodiments, a person of ordinary skill in the art should understand that anyone skilled in the art may, within the technical scope disclosed by the present invention, still modify the technical solutions described in the foregoing embodiments, readily conceive of variations, or make equivalent substitutions for some of the technical features. Such modifications, variations, or substitutions do not cause the essence of the corresponding technical solutions to depart from the spirit and scope of the technical solutions of the embodiments of the present invention, and shall all fall within the protection scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.
In a typical configuration, a computing device includes one or more processors (CPUs), an input/output interface, a network interface, and memory.
The memory may include non-persistent memory in a computer-readable medium, random access memory (RAM), and/or non-volatile memory such as read-only memory (ROM) or flash memory (flash RAM). Memory is an example of a computer-readable medium.
1. Computer-readable media include persistent and non-persistent, removable and non-removable media, and may store information by any method or technology. The information may be computer-readable instructions, data structures, program modules, or other data. Examples of computer storage media include, but are not limited to, phase-change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), flash memory or other memory technologies, compact disc read-only memory (CD-ROM), digital versatile disc (DVD) or other optical storage, magnetic cassettes, magnetic tape or magnetic disk storage or other magnetic storage devices, or any other non-transmission medium that can be used to store information accessible by a computing device. As defined herein, computer-readable media do not include transitory computer-readable media (transitory media), such as modulated data signals and carrier waves.
2. Those skilled in the art will understand that embodiments of the present application may be provided as a method, a system, or a computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Moreover, the present application may take the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to magnetic disk storage, CD-ROM, and optical storage) that contain computer-usable program code.

Claims (10)

1. A method for determining the region of an application developer, characterized by comprising:
Obtaining the name of an application developer;
Extracting an administrative division field from the name of the application developer;
Determining the region of the application developer according to the administrative division field.
2. The method for determining the region of an application developer according to claim 1, characterized in that, before the extracting of the administrative division field from the name of the application developer, the method further comprises:
Judging whether the name of the application developer contains an administrative division field;
If not, extracting a trade-name field (字号) from the name of the application developer;
Crawling the geographic location information of the application developer from the network according to the trade-name field, and determining the region of the application developer according to the geographic location information.
3. The method for determining the region of an application developer according to claim 2, characterized in that the crawling of the geographic location information of the application developer from the network according to the trade-name field, and the determining of the region of the application developer according to the geographic location information, comprise:
Crawling the corresponding encyclopedia entry from an encyclopedia website using the trade-name field as the search term;
Retrieving the geographic location information of the application developer from the encyclopedia entry, and determining the region of the application developer according to the geographic location information.
4. The method for determining the region of an application developer according to claim 2, characterized in that the crawling of the geographic location information of the application developer from the network according to the trade-name field, and the determining of the region of the application developer according to the geographic location information, comprise:
Crawling the company web page of the application developer according to the trade-name field;
Obtaining the ICP information of the application developer from the company web page;
Determining the region of the application developer according to the first Chinese character in the ICP information.
5. The method for determining the region of an application developer according to claim 2, characterized in that the extracting of the trade-name field from the name of the application developer comprises:
Determining the industry characteristic field and the organizational form field in the name of the application developer according to a preset industry characteristic database and a preset organizational form database;
Deleting the industry characteristic field and the organizational form field from the name of the application developer to obtain a remaining field;
Adding the remaining field, according to its degree of similarity, to the corresponding group in a classification table, and taking the shortest field in that group as the trade-name field of the application developer.
6. The method for determining the region of an application developer according to claim 1, characterized in that the extracting of the administrative division field from the name of the application developer comprises:
Performing word segmentation on the name of the application developer using a predetermined administrative division dictionary, to obtain the administrative division field.
7. The method for determining the region of an application developer according to claim 1, characterized in that the obtaining of the name of the application developer comprises:
Crawling the name of an application's developer from an application store using a web crawling method.
8. An apparatus for determining the region of an application developer, characterized by comprising:
A developer name acquisition module, configured to obtain the name of an application developer;
An administrative division extraction module, configured to extract an administrative division field from the name of the application developer;
An administrative division region determination module, configured to determine the region of the application developer according to the administrative division field.
9. The apparatus for determining the region of an application developer according to claim 8, characterized by further comprising:
An administrative division judgment module, configured to judge whether the name of the application developer contains an administrative division field;
A trade-name field extraction module, configured to extract a trade-name field (字号) from the name of the application developer if it does not;
A trade-name region determination module, configured to crawl the geographic location information of the application developer from the network according to the trade-name field, and to determine the region of the application developer according to the geographic location information.
10. The apparatus for determining the region of an application developer according to claim 8, characterized in that the administrative division extraction module comprises:
An administrative division word segmentation unit, configured to perform word segmentation on the name of the application developer using a predetermined administrative division dictionary, to obtain the administrative division field.
CN201610397799.1A 2016-06-06 2016-06-06 Application developer region determines method and apparatus Pending CN107463583A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610397799.1A CN107463583A (en) 2016-06-06 2016-06-06 Application developer region determines method and apparatus

Publications (1)

Publication Number Publication Date
CN107463583A true CN107463583A (en) 2017-12-12

Family

ID=60545001

Country Status (1)

Country Link
CN (1) CN107463583A (en)

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20171212
