CN107463583A - Application developer region determines method and apparatus - Google Patents
Application developer region determines method and apparatus Download PDFInfo
- Publication number
- CN107463583A CN107463583A CN201610397799.1A CN201610397799A CN107463583A CN 107463583 A CN107463583 A CN 107463583A CN 201610397799 A CN201610397799 A CN 201610397799A CN 107463583 A CN107463583 A CN 107463583A
- Authority
- CN
- China
- Prior art keywords
- application developer
- field
- title
- developer
- application
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9537—Spatial or temporal dependent retrieval, e.g. spatiotemporal queries
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The present invention provides a kind of application developer region and determines method, obtains the title of application developer first;Then administrative division field is extracted from the title of the application developer;The region of the application developer is finally determined according to the administrative division field.The method that the present invention solves the problems, such as the region for lacking the developer for determining each each application using in shop in the prior art, and step is simple, can determine the region of the developer of multiple applications quickly, mass.
Description
Technical field
The present invention relates to application information to obtain field, specifically a kind of application developer region determination side
Method and a kind of application developer region determining device.
Background technology
With the rapid popularization of the intelligent terminals such as smart mobile phone, tablet personal computer, operated based on IOS, android
The various application programs of system and windows operating systems are (referred to as:Using;English abbreviation:App;English full name:
Application, should) in the life for having goed deep into consumer from every field such as social activity, shopping, traffic, service, medical treatment, communications
It is in explosive growth with the total quantity of program, at present, the application sum based on IOS is based on opening more than 1,500,000
The number of applications of the android operating systems in source is huger, and these apply restocking in major application shop in internet,
Installed so that user downloads.
Because the development of application program is the importance of internet development, according to where the developer of application, developer
The information such as region can macroscopic view understand internet science-and-technology enterprise Regional Distribution situation, and then understand each province and city, each region it is mutual
Network number of the enterprise, scale and development, macro adjustments and controls to government, have to the strategic of enterprise and market analysis etc.
Important directive function, therefore, it is necessary to know the developer region of each application of in the market.
At present, the country has tens to apply shop, each applies equal restocking in shop to have millions of applications, and each
The information such as the title of application, version, download and developer are illustrated in the download page of application, but will not typically be corresponded to
Developer region illustrates, there is no in the prior art a kind of method can determine it is each using in shop it is each should
The region of developer.
The content of the invention
In view of the above problems, there is an urgent need to a kind of place for the developer that can determine each each application using in shop
The application developer region in region determines method, and a kind of corresponding application developer region determining device.
The technical solution adopted by the present invention is:
The application provides a kind of application developer region and determines method, including:
Obtain the title of application developer;
Administrative division field is extracted from the title of the application developer;
The region of the application developer is determined according to the administrative division field.
Optionally, it is described extract administrative division field from the title of the application developer before, in addition to:
Judge whether include administrative division field in the title of the application developer;
If it is not, font size field is then extracted from the title of the application developer;
The geographical location information of the application developer is crawled in a network according to the font size field, according to the geography
Positional information determines the region of the application developer.
Optionally, the geographical location information for crawling the application developer in a network according to the font size field,
The region of the application developer is determined according to the geographical location information, including:
Corresponding encyclopaedia entry is crawled in encyclopaedia class website by term of the font size field;
The geographical location information of the application developer is retrieved from the encyclopaedia entry, is believed according to the geographical position
Breath determines the region of the application developer.
Optionally, the geographical location information for crawling the application developer in a network according to the font size field,
The region of the application developer is determined according to the geographical location information, including:
The company web page of the application developer is crawled according to the font size field;
The ICP information of the application developer is obtained from the company web page;
First Chinese character in the ICP information determines the region of the application developer.
Optionally, it is described to extract font size field from the title of the application developer, including:
According to default operational characteristics database and organizational form database, in the title for determining the application developer
Operational characteristics field and organizational form field;
The operational characteristics field and the organizational form field are deleted from the title of the application developer, is remained
Remaining field;
The remaining field is added according to similarity degree and sorted out in table in corresponding group, will be most short in the group
Font size field of one field as the application developer.
Optionally, it is described to extract administrative division field from the title of the application developer, including:
Word segmentation processing is carried out to the title of the application developer using predetermined administrative division dictionary library, obtains administrative area
Draw field.
Optionally, the title for obtaining application developer, including:
The title of the application developer of application is crawled from application shop using network crawling method.
Accordingly, the application also provides a kind of application developer region determining device, including:
Developer's name acquiring module, for obtaining the title of application developer;
Administrative division extraction module, for extracting administrative division field from the title of the application developer;
Administrative division determines regions module, for determining the place of the application developer according to the administrative division field
Region.
Optionally, application developer region determining device, in addition to:
Administrative division judge module, whether administrative division field is included in the title for judging the application developer;
Font size field extraction module, for if it is not, then extracting font size field from the title of the application developer;
Font size field determines regions module, for crawling the application developer in a network according to the font size field
Geographical location information, the region of the application developer is determined according to the geographical location information.
Optionally, the font size field determines regions module, including:
Encyclopaedia entry crawls unit, in encyclopaedia class website crawling corresponding hundred using the font size field as term
Section's entry;
Encyclopaedia entry determines territory element, for retrieving the geographical position of the application developer from the encyclopaedia entry
Confidence is ceased, and the region of the application developer is determined according to the geographical location information.
Optionally, the font size field determines regions module, including:
Company web page crawls unit, for crawling the company web page of the application developer according to the font size field;
ICP information acquisition units, for obtaining the ICP information of the application developer from the company web page;
ICP determines territory element, and the application developer is determined for first Chinese character in the ICP information
Region.
Optionally, the font size field extraction module, including:
Organizational form determining unit, for according to default operational characteristics database and organizational form database, determining institute
State the operational characteristics field and organizational form field in the title of application developer;
Organizational form deletes unit, for deleting the operational characteristics field and institute from the title of the application developer
Organizational form field is stated, obtains remaining field;
Font size field acquiring unit, sorts out corresponding group in table for the remaining field being added according to similarity degree
It is interior, the font size field using a field most short in the group as the application developer.
Optionally, the administrative division extraction module, including:
Administrative division participle unit, for being entered using predetermined administrative division dictionary library to the title of the application developer
Row word segmentation processing, obtain administrative division field.
Optionally, developer's name acquiring module, including:
Developer's title crawls unit, for crawling the application and development of application from application shop using network crawling method
The title of person.
Compared with prior art, the present invention has advantages below:
A kind of application developer region provided by the invention determines method, obtains the title of application developer first;
Then administrative division field is extracted from the title of the application developer;Institute is finally determined according to the administrative division field
State the region of application developer.The present invention, which solves to lack in the prior art, determines each each application using in shop
The problem of method of the region of developer, and step is simple, can determine the exploitation of multiple applications quickly, mass
The region of person.
Further, it is contemplated that the situation of administrative division field, this hair may not be contained in certain applications developer's title
It is bright that font size field can also be extracted from the application developer title, should according to the font size field crawls in a network
With the geographical location information of developer, the region of the application developer is also can determine, solution does not have administrative division word
The problem of not can determine that the region of application developer during section, improve the compatibility of the present invention, so improve present invention determine that
The success rate of the region of application developer.
Brief description of the drawings
In order to illustrate the technical solution of the embodiments of the present invention more clearly, below by embodiment it is required use it is attached
Figure is briefly described, it will be appreciated that the following drawings illustrate only certain embodiments of the present invention, therefore be not construed as pair
The restriction of scope, for those of ordinary skill in the art, on the premise of not paying creative work, can also be according to this
A little accompanying drawings obtain other related accompanying drawings.
Fig. 1 is the flow chart that a kind of application developer region provided by the invention determines embodiment of the method;
Fig. 2 is a kind of schematic diagram of application developer region determining device embodiment provided by the invention.
Embodiment
To make the purpose, technical scheme and advantage of the embodiment of the present invention clearer, below in conjunction with the embodiment of the present invention
Middle accompanying drawing, the technical scheme in the embodiment of the present invention is clearly and completely described, it is clear that described embodiment is only
It is part of the embodiment of the present invention, rather than whole embodiments.Therefore, the implementation of the invention to providing in the accompanying drawings below
The detailed description of example is not intended to limit the scope of protection of present invention, and the application can be described here to be much different from
Other manner is implemented, and those skilled in the art can do similar popularization, therefore this in the case of without prejudice to the application intension
Application is not limited by following public specific implementation.
Region in view of lacking the developer for determining each each application using in shop in currently available technology
Method the problem of, to determine method the embodiments of the invention provide a kind of application developer region, and corresponding a kind of
Embodiments of the invention are described in detail by application developer region determining device with reference to accompanying drawing in turn below.
Fig. 1 is refer to, it is the flow that a kind of application developer region provided by the invention determines embodiment of the method
Figure, the application developer region determines that method comprises the following steps:
Step S101:Obtain the title of application developer.
Wherein, the title of the application developer refers to the title for developing the enterprise of a certain application program, for example, QQ is applied
Developer entitled " Shenzhen Tencent Computer System Co., Ltd ", drop drop call a taxi application developer it is entitled
" Beijing little Ju Science and Technology Ltd.s ".
In this step, the title of the application developer can directly apply shop from major using network crawling method
In crawl, due to mess code or other characters may be contained from title of some using the application developer obtained in shop
Deng, the application developer title can also crawl in advance after the completion of arranged again, change after obtain.
Wherein, the network crawling method is also referred to as internet data acquisition method, is that one kind is automatically sent out from internet
Now and webpage is captured, and the method for obtaining target data, also referred to as web crawlers are inquired about in webpage.It is next from principle is crawled
See, web crawlers is generally divided into traditional reptile and focused crawler, traditional reptile since the URL of one or several Initial pages,
The URL on Initial page is obtained, during webpage is captured, new URL is constantly extracted from current page and is put into queue, directly
To the certain stop condition for meeting system.Popular is said, that is, desired content is obtained by source code parsing.Focused crawler
Workflow it is complex, it is necessary to linked according to certain web page analysis algorithm filtering is unrelated with theme, remain with
Link and put it into the URL queues for waiting crawl.Then, it will be selected in next step according to certain search strategy from queue
The webpage URL to be captured, and said process is repeated, stop when reaching a certain condition of system.In addition, all grabbed by reptile
The webpage taken will be stored by system, carry out certain analysis, filtering, and establish index, so as to inquiry and retrieval afterwards;It is right
For focused crawler, the analysis result obtained by this process is also possible to provide later crawl process feedback and instructed.
A kind of typical network crawling method is nutch reptiles, nutch reptiles include crawler (reptile) and
Searcher (inquiry) two parts, wherein, Crawler is mainly used in capturing webpage from network and establishes rope for these webpages
Draw, Searcher mainly produces lookup result i.e. target data using the lookup keyword of these indexed search user.Utilize
Nutch reptiles can be according to the url in application shop, described using the five application page that link is automatically opened up in shop, and from institute
State in five application page inquiry obtain application Apply Names, using coding, application version, application developer, using download, answer
With contents such as descriptions.
More than it is merely exemplary web crawlers is illustrated, in the prior art, according to programming language, application environment etc.,
Also diversified web crawlers, such as Java reptiles, Python reptiles, C++ reptiles, C# reptiles, PHP reptiles, ErLang
Reptile and Ruby reptiles etc., this is ripe basis of the prior art, therefore is repeated no more herein, and it is in the guarantor of the application
Within the scope of shield.
Step S102:Administrative division field is extracted from the title of the application developer.
By step S101, obtained the title of application developer, due to the title of developer be enterprise title it is general
It is made up of administrative division, font size, industry or operational characteristics, the part of organizational form four, for example, enterprise name " is risen Shenzhen
Interrogate computer system Co., Ltd " in, " Shenzhen " be administrative division, and " Tengxun " is font size, " computer system " for industry or
Operational characteristics, " Co., Ltd " are organizational form, and the application is can determine that according to administrative division field " Shenzhen " therein
The region of developer is " Shenzhen ".
This step, i.e., extract administrative division field from the title of the application developer, and specific extracting mode has more
Kind embodiment, a kind of embodiment are that the keys such as " province ", " city ", " area ", " county " are retrieved in the title of application developer
Word, field by the keyword and before extract, as administrative division field;Another embodiment is pre- Mr.
Into the administrative division database for including each administrative division title in the whole nation, default administrative division database is then traveled through, by described in
Title of the administrative division title successively with the application developer in administrative division database is contrasted, and is answered when discovery is described
During with containing a certain administrative division title in the title of developer, respective field in the title of the application developer is extracted i.e.
For administrative division field;Another embodiment is to advance with Trie trees technology customization generation to include each administrative area in the whole nation
The administrative division dictionary library of title (such as " Beijing ", " Shenzhen ") is drawn, then using predetermined administrative division dictionary library to described
The title of application developer carries out word segmentation processing, administrative division field is obtained, for example, using administrative division dictionary library to " Shenzhen
Computer system Co., Ltd of Tengxun of city " scans for, you can identifies that (Shenzhen is stored in word to administrative division field " Shenzhen "
Feature Words in allusion quotation), extracted.
It is it is easily understood that merely exemplary to extracting administrative division field from the title of the application developer above
Embodiment illustrates, in the prior art the also embodiment of numerous variations, and here is omitted, and it is the application's
Within protection domain.
Step S103:The region of the application developer is determined according to the administrative division field.
By step S102, administrative division field is extracted from the title of the application developer, according to the row
It is that can determine that the region of the application developer that field is drawn in administrative division, when it is implemented, can be directly by the administrative division
Region of the field as the application developer, an application developer region database can also be previously generated,
Zone name corresponding to the administrative division field is inquired about in the database of the application developer region, you can according to unified
Form or administrative grade export the region of the application developer.
So far, by step S101 to step S103, complete application developer region and determine flow.
The present invention solves the location for lacking the developer for determining each each application using in shop in the prior art
The problem of method in domain, and step is simple, can determine the region of the developer of multiple applications quickly, mass.
It is considered that due to application shop in the title of typing application developer more arbitrarily, to the name of application developer
Claim management and control not wait a variety of causes strictly, may there is no administrative division field in the title of the application developer, therefore, at this
Apply provide one embodiment in, it is described extract administrative division field from the title of the application developer before, also
Including:
Judge whether include administrative division field in the title of the application developer;
If it is not, font size field is then extracted from the title of the application developer;
The geographical location information of the application developer is crawled in a network according to the font size field, according to the geography
Positional information determines the region of the application developer.
Wherein, judge whether to include the embodiment of administrative division field, Ke Yishi in the title of the application developer
Administrative division dictionary library is traveled through, the title of the administrative division in the administrative division dictionary library and the application developer is carried out
Matching, if illustrating not including administrative division field in the title of the application developer without the match is successful after the completion of traversal.
Because font size is set by each freedom of enterprise, it is therefore possible to use deleting other words in developer's title
The mode of section extracts font size field, due to through judging, not including administrative division field in the title of the application developer, because
This, it is described that font size field, bag are extracted from the title of the application developer in one embodiment that the application provides
Include:
According to default operational characteristics database and organizational form database, in the title for determining the application developer
Operational characteristics field and organizational form field;
The operational characteristics field and the organizational form field are deleted from the title of the application developer, is remained
Remaining field;
The remaining field is added according to similarity degree and sorted out in table in corresponding group, will be most short in the group
Font size field of one field as the application developer.
Wherein, multiple industries or operational characteristics field are stored with the operational characteristics database, such as " science and technology ", " life
Production ", " photoelectron " etc., are stored with multiple organizational form fields in the organizational form database, such as " Co., Ltd ", " share
Co., Ltd ", " " center " etc., in the present embodiment, it is referred to described in the embodiment determination of said extracted administrative division field
Operational characteristics field and organizational form field in the title of application developer, then by the operational characteristics field and described group
After knitting the deletion of form field, font size field is just deposited in remaining field, and generally, the remaining field is extracted
It can be used as font size field.
But because the title of application developer is possible to mix some other characters or input error in typing mistiming, or
Contain industry or operational characteristics field unlisted in the operational characteristics database, Huo Zheying in the title of person's application developer
With containing the reason such as unlisted organizational form field, the remaining word in the organizational form database in the title of developer
Font size field may be not only included in section, it is also possible to containing other characters, therefore, in one embodiment that the application provides
In, it is that developer's title of multiple applications is handled together, after remaining field is obtained, establishes developer's title
Sorting out table, (developer's title is sorted out in table is provided with multiple groups according to the difference of developer's title, has in each group more
Individual is in the nature multiple developer's titles of same developer, as in " Tengxun " group, there is " Tengxun ", " Tentent Science ", " Tengxun
Multiple essence such as group ", " Tengxun's information " are all developer's title of Tengxun), then to developer's name of the multiple application
Title is traveled through, and is contrasted with developer's title classification table, if sorting out existing developer's name in table with developer's title
Claim similar (can judge whether both are similar by judging whether to include each other or whether including identical public substring),
Then developer's title is added in the table in corresponding group, after the completion of all being contrasted to all multiple developer's titles, then
Font size field using a field most short in the group as the application developer, as described above in " Tengxun " group, most
Afterwards by font size field of " Tengxun " field as the application developer.
The geographical location information for crawling the application developer in a network according to the font size field, according to described
Geographical location information determines the region of the application developer, there is numerous embodiments, for example, provided in the application one
In individual embodiment, the geographical location information for crawling the application developer in a network according to the font size field, according to
The geographical location information determines the region of the application developer, including:
Corresponding encyclopaedia entry is crawled in encyclopaedia class website by term of the font size field;
The geographical location information of the application developer is retrieved from the encyclopaedia entry, is believed according to the geographical position
Breath determines the region of the application developer.
In the present embodiment, according to font size field in the encyclopaedia class such as such as Baidupedia, wikipedia, interactive encyclopaedia website
Corresponding encyclopaedia entry is retrieved, therefore, can be with due to the geographical location information of general Dou Huiyou enterprises in the encyclopaedia entry of enterprise
The geographical location information of the application developer is further retrieved from the encyclopaedia entry, then according to the geographical position
Information determines the region of the application developer, wherein it is possible to by the geographical location information directly as the application
The region of developer, an application developer region database can also be previously generated, in the application developer institute
Zone name corresponding to the geographical location information is inquired about in regional database, you can according to unified form or administrative grade
Export the region of the application developer.For example, in the encyclopaedia entry of the Baidupedia of " Alibaba ", record
" general headquarters place:The information of Hangzhou China ", after above- mentioned information is retrieved, you can determine that the application is opened according to " Hangzhou China "
The region " Hangzhou " of originator " Alibaba ", institute can also be inquired about in predetermined application developer region database
Zone name " Zhejiang Province " corresponding to geographical location information " Hangzhou " is stated, each application and development is determined according to the rank of province with unified
The region of person.
It is described that the application is crawled according to the font size field in a network in another embodiment that the application provides
The geographical location information of developer, the region of the application developer is determined according to the geographical location information, including:
The company web page of the application developer is crawled according to the font size field;
The ICP information of the application developer is obtained from the company web page;
First Chinese character in the ICP information determines the region of the application developer.
ICP, full name Web content service provider, English full name are Internet Content Provider, because country is right
Operational Internet Information Service carries out licensing system, and filing system is carried out to non-profit-making Internet Information Service, therefore,
There should be ICP information i.e. ICP codings in the webpage of each company, ICP codings are sent out by unification of the motherland core, for example, capital ICP
000007, Anhui ICP is demonstrate,proved for No. 05001217 etc., wherein, first Chinese character of ICP information is the abbreviation of provinces and cities where enterprise, because
This, first Chinese character in the ICP information determines the region of the application developer.
In the present embodiment, the Corporation web site of the application developer is searched for (for example, first according to font size field in a network
The Baidu search font size is first first removed, is then that com, cn etc. are judged according to domain name ending, only domain name is the knot such as com, cn
Tail, just it is considered company's homepage of the application developer), then crawl the company web page of the application developer, then from institute
The ICP information that the application developer is obtained in company web page is stated, intercepts first Chinese character in the ICP information, Ran Houcong
Predetermined national provinces and cities' full name is with searching corresponding full name in abbreviation database, you can determine the location of the application developer
Domain.Such as ICP information is " capital ICP cards 000007 ", then the region that can determine that the application developer is Beijing.
It should be noted that in the above two embodiments, it is utilized respectively encyclopaedia entry and ICP information determines the application
The region of developer, two ways can select a use, can also be used together, for example, true first with encyclopaedia entry
The region of the fixed application developer, when the geographical location information not responded in encyclopaedia entry, recycle ICP information
Determine the region of the application developer.So as to guarantee to determine the application developer from multi-angle, many-side
Region, the compatibility of the present invention is improved, and then improved present invention determine that the success rate of the region of application developer.
In the above-described embodiment, there is provided a kind of application developer region determines method, corresponding, this
Application also provides a kind of application developer region determining device.Fig. 2 is refer to, it is opened for a kind of application provided by the invention
The schematic diagram of originator region determining device embodiment.Because device embodiment is substantially similar to embodiment of the method, so retouching
State fairly simple, the relevent part can refer to the partial explaination of embodiments of method.Device embodiment described below is only
Schematically.
A kind of application developer region determining device that the present embodiment provides, including:
Developer's name acquiring module 101, for obtaining the title of application developer;
Administrative division extraction module 102, for extracting administrative division field from the title of the application developer;
Administrative division determines regions module 103, for determining the application developer according to the administrative division field
Region.
In one embodiment that the application provides, application developer region determining device, in addition to:
Administrative division judge module, whether administrative division field is included in the title for judging the application developer;
Font size field extraction module, for if it is not, then extracting font size field from the title of the application developer;
Font size field determines regions module, for crawling the application developer in a network according to the font size field
Geographical location information, the region of the application developer is determined according to the geographical location information.
In one embodiment that the application provides, the font size field determines regions module, including:
Encyclopaedia entry crawls unit, in encyclopaedia class website crawling corresponding hundred using the font size field as term
Section's entry;
Encyclopaedia entry determines territory element, for retrieving the geographical position of the application developer from the encyclopaedia entry
Confidence is ceased, and the region of the application developer is determined according to the geographical location information.
In one embodiment that the application provides, the font size field determines regions module, including:
Company web page crawls unit, for crawling the company web page of the application developer according to the font size field;
ICP information acquisition units, for obtaining the ICP information of the application developer from the company web page;
ICP determines territory element, and the application developer is determined for first Chinese character in the ICP information
Region.
In one embodiment that the application provides, the font size field extraction module, including:
Organizational form determining unit, for according to default operational characteristics database and organizational form database, determining institute
State the operational characteristics field and organizational form field in the title of application developer;
Organizational form deletes unit, for deleting the operational characteristics field and institute from the title of the application developer
Organizational form field is stated, obtains remaining field;
Font size field acquiring unit, sorts out corresponding group in table for the remaining field being added according to similarity degree
It is interior, the font size field using a field most short in the group as the application developer.
In one embodiment that the application provides, the administrative division extraction module 102, including:
Administrative division participle unit, for being entered using predetermined administrative division dictionary library to the title of the application developer
Row word segmentation processing, obtain administrative division field.
In one embodiment that the application provides, developer's name acquiring module 101, including:
Developer's title crawls unit, for crawling the application and development of application from application shop using network crawling method
The title of person.
More than, for a kind of embodiment of application developer region determining device provided by the invention.
It should be noted that:Similar label and letter represents similar terms in following accompanying drawing, therefore, once a certain Xiang Yi
It is defined, then it further need not be defined and explained in subsequent accompanying drawing in individual accompanying drawing.
In the description of the invention, it is also necessary to explanation, unless otherwise clearly defined and limited, term " setting ",
" installation ", " connected ", " connection " should be interpreted broadly, for example, it may be fixedly connected or be detachably connected, or one
Connect body;Can be mechanical connection or electrical connection;Can be joined directly together, can also be indirect by intermediary
It is connected, can is the connection of two element internals.For the ordinary skill in the art, on being understood with concrete condition
State the concrete meaning of term in the present invention.
Finally it should be noted that:Embodiment described above, it is only the embodiment of the present invention, to illustrate the present invention
Technical scheme, rather than its limitations, protection scope of the present invention is not limited thereto, although with reference to the foregoing embodiments to this hair
It is bright to be described in detail, it will be understood by those within the art that:Any one skilled in the art
The invention discloses technical scope in, it can still modify to the technical scheme described in previous embodiment or can be light
Change is readily conceivable that, or equivalent substitution is carried out to which part technical characteristic;And these modifications, change or replacement, do not make
The essence of appropriate technical solution departs from the spirit and scope of technical scheme of the embodiment of the present invention.The protection in the present invention should all be covered
Within the scope of.Therefore, protection scope of the present invention described should be defined by scope of the claims.
In a typical configuration, computing device includes one or more processors (CPU), input/output interface, net
Network interface and internal memory.
Internal memory may include computer-readable medium in volatile memory, random access memory (RAM) and/or
The forms such as Nonvolatile memory, such as read-only storage (ROM) or flash memory (flash RAM).Internal memory is computer-readable medium
Example.
1st, computer-readable medium can be by any side including permanent and non-permanent, removable and non-removable media
Method or technology realize that information stores.Information can be computer-readable instruction, data structure, the module of program or other numbers
According to.The example of the storage medium of computer includes, but are not limited to phase transition internal memory (PRAM), static RAM
(SRAM), dynamic random access memory (DRAM), other kinds of random access memory (RAM), read-only storage
(ROM), Electrically Erasable Read Only Memory (EEPROM), fast flash memory bank or other memory techniques, read-only optical disc are read-only
Memory (CD-ROM), digital versatile disc (DVD) or other optical storages, magnetic cassette tape, tape magnetic rigid disk storage or
Other magnetic storage apparatus or any other non-transmission medium, the information that can be accessed by a computing device available for storage.According to
Herein defines, and computer-readable medium does not include non-temporary computer readable media (transitory media), such as modulates
Data-signal and carrier wave.
2nd, it will be understood by those skilled in the art that embodiments herein can be provided as method, system or computer program production
Product.Therefore, the application can use the embodiment in terms of complete hardware embodiment, complete software embodiment or combination software and hardware
Form.Moreover, the application can use the computer for wherein including computer usable program code in one or more can use
The computer program product that storage medium is implemented on (including but is not limited to magnetic disk storage, CD-ROM, optical memory etc.)
Form.
Claims (10)
1. a kind of application developer region determines method, it is characterised in that including:
Obtain the title of application developer;
Administrative division field is extracted from the title of the application developer;
The region of the application developer is determined according to the administrative division field.
2. application developer region according to claim 1 determines method, it is characterised in that is answered described from described
Before administrative division field is extracted in the title of developer, in addition to:
Judge whether include administrative division field in the title of the application developer;
If it is not, font size field is then extracted from the title of the application developer;
The geographical location information of the application developer is crawled in a network according to the font size field, according to the geographical position
Information determines the region of the application developer.
3. application developer region according to claim 2 determines method, it is characterised in that described according to the word
Number field crawls the geographical location information of the application developer in a network, and described answer is determined according to the geographical location information
With the region of developer, including:
Corresponding encyclopaedia entry is crawled in encyclopaedia class website by term of the font size field;
The geographical location information of the application developer is retrieved from the encyclopaedia entry, it is true according to the geographical location information
The region of the fixed application developer.
4. application developer region according to claim 2 determines method, it is characterised in that described according to the word
Number field crawls the geographical location information of the application developer in a network, and described answer is determined according to the geographical location information
With the region of developer, including:
The company web page of the application developer is crawled according to the font size field;
The ICP information of the application developer is obtained from the company web page;
First Chinese character in the ICP information determines the region of the application developer.
5. application developer region according to claim 2 determines method, it is characterised in that described from the application
Font size field is extracted in the title of developer, including:
According to default operational characteristics database and organizational form database, the operation in the title of the application developer is determined
Feature field and organizational form field;
The operational characteristics field and the organizational form field are deleted from the title of the application developer, obtains remaining word
Section;
The remaining field is added according to similarity degree and sorted out in table in corresponding group, by one most short in the group
Font size field of the field as the application developer.
6. application developer region according to claim 1 determines method, it is characterised in that described from the application
Administrative division field is extracted in the title of developer, including:
Word segmentation processing is carried out to the title of the application developer using predetermined administrative division dictionary library, obtains administrative division word
Section.
7. application developer region according to claim 1 determines method, it is characterised in that the acquisition application is opened
The title of originator, including:
The title of the application developer of application is crawled from application shop using network crawling method.
A kind of 8. application developer region determining device, it is characterised in that including:
Developer's name acquiring module, for obtaining the title of application developer;
Administrative division extraction module, for extracting administrative division field from the title of the application developer;
Administrative division determines regions module, for determining the location of the application developer according to the administrative division field
Domain.
9. application developer region according to claim 8 determining device, it is characterised in that also include:
Administrative division judge module, whether administrative division field is included in the title for judging the application developer;
Font size field extraction module, for if it is not, then extracting font size field from the title of the application developer;
Font size field determines regions module, for crawling the geography of the application developer in a network according to the font size field
Positional information, the region of the application developer is determined according to the geographical location information.
10. application developer region according to claim 8 determining device, it is characterised in that the administrative division
Extraction module, including:
Administrative division participle unit, for being divided using predetermined administrative division dictionary library the title of the application developer
Word processing, obtains administrative division field.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610397799.1A CN107463583A (en) | 2016-06-06 | 2016-06-06 | Application developer region determines method and apparatus |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610397799.1A CN107463583A (en) | 2016-06-06 | 2016-06-06 | Application developer region determines method and apparatus |
Publications (1)
Publication Number | Publication Date |
---|---|
CN107463583A true CN107463583A (en) | 2017-12-12 |
Family
ID=60545001
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610397799.1A Pending CN107463583A (en) | 2016-06-06 | 2016-06-06 | Application developer region determines method and apparatus |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107463583A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110990427A (en) * | 2019-12-16 | 2020-04-10 | 北京智游网安科技有限公司 | Statistical method, system and storage medium for application program affiliated area |
CN111190937A (en) * | 2019-12-19 | 2020-05-22 | 北京旷视科技有限公司 | Native place information query method and device, electronic equipment and storage medium |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102651013A (en) * | 2012-03-23 | 2012-08-29 | 上海安捷力信息系统有限公司 | Method and system for extracting area information from enterprise name data |
CN102663000A (en) * | 2012-03-15 | 2012-09-12 | 北京百度网讯科技有限公司 | Establishment method for malicious website database, method and device for identifying malicious website |
CN102930059A (en) * | 2012-11-26 | 2013-02-13 | 电子科技大学 | Method for designing focused crawler |
CN103198250A (en) * | 2013-03-11 | 2013-07-10 | 青岛海信传媒网络技术有限公司 | Method for auditing applications of intelligent television |
CN104539634A (en) * | 2015-01-22 | 2015-04-22 | 北京成众志科技有限公司 | Security-enhanced authorizing and authenticating method of mobile application |
-
2016
- 2016-06-06 CN CN201610397799.1A patent/CN107463583A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102663000A (en) * | 2012-03-15 | 2012-09-12 | 北京百度网讯科技有限公司 | Establishment method for malicious website database, method and device for identifying malicious website |
CN102651013A (en) * | 2012-03-23 | 2012-08-29 | 上海安捷力信息系统有限公司 | Method and system for extracting area information from enterprise name data |
CN102930059A (en) * | 2012-11-26 | 2013-02-13 | 电子科技大学 | Method for designing focused crawler |
CN103198250A (en) * | 2013-03-11 | 2013-07-10 | 青岛海信传媒网络技术有限公司 | Method for auditing applications of intelligent television |
CN104539634A (en) * | 2015-01-22 | 2015-04-22 | 北京成众志科技有限公司 | Security-enhanced authorizing and authenticating method of mobile application |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110990427A (en) * | 2019-12-16 | 2020-04-10 | 北京智游网安科技有限公司 | Statistical method, system and storage medium for application program affiliated area |
CN110990427B (en) * | 2019-12-16 | 2024-05-10 | 北京智游网安科技有限公司 | Method, system and storage medium for counting application program affiliated area |
CN111190937A (en) * | 2019-12-19 | 2020-05-22 | 北京旷视科技有限公司 | Native place information query method and device, electronic equipment and storage medium |
CN111190937B (en) * | 2019-12-19 | 2024-02-23 | 北京旷视科技有限公司 | Method and device for inquiring native information, electronic equipment and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110147437B (en) | Knowledge graph-based searching method and device | |
US10817613B2 (en) | Access and management of entity-augmented content | |
US10255253B2 (en) | Augmenting and presenting captured data | |
CN110309393A (en) | Data processing method, device, equipment and readable storage medium storing program for executing | |
CN112749284B (en) | Knowledge graph construction method, device, equipment and storage medium | |
US9990428B2 (en) | Computerized identification of app search functionality for search engine access | |
CN108959244A (en) | The method and apparatus of address participle | |
CN110427614B (en) | Construction method and device of paragraph level, electronic equipment and storage medium | |
US20110295823A1 (en) | Method and apparatus for modeling relations among data items | |
EP2643772A1 (en) | Method and system for compiling a unique sample code for an existing digital sample | |
Zhu et al. | Cyber-physical-social-thinking modeling and computing for geological information service system | |
US20200401639A1 (en) | Personalizing a search query using social media | |
CN113011126B (en) | Text processing method, text processing device, electronic equipment and computer readable storage medium | |
US20170185608A1 (en) | App Onboarding System For Developer-Defined Creation Of Search Engine Results | |
US11836331B2 (en) | Mathematical models of graphical user interfaces | |
Rebele et al. | Adding missing words to regular expressions | |
CN107463583A (en) | Application developer region determines method and apparatus | |
CN111797297B (en) | Page data processing method and device, computer equipment and storage medium | |
Aranda-Corral et al. | Reconciling knowledge in social tagging web services | |
Luo et al. | Automated structural semantic annotation for RESTful services | |
CN107463581A (en) | Using download acquisition methods, device and terminal device | |
Liepina et al. | Explaining potentially unfair clauses to the consumer with the CLAUDETTE tool | |
CN104598482A (en) | Method for updating book information based on depth-first search strategy | |
CN112632981A (en) | New word discovery method and device | |
CN117633197B (en) | Search information generation method and device applied to paraphrasing document and electronic equipment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20171212 |
|
RJ01 | Rejection of invention patent application after publication |