CN110472120A - A kind of rent a house formation gathering method and system based on social networks - Google Patents
A kind of rent a house formation gathering method and system based on social networks Download PDFInfo
- Publication number
- CN110472120A CN110472120A CN201910676168.7A CN201910676168A CN110472120A CN 110472120 A CN110472120 A CN 110472120A CN 201910676168 A CN201910676168 A CN 201910676168A CN 110472120 A CN110472120 A CN 110472120A
- Authority
- CN
- China
- Prior art keywords
- information
- house
- renting
- user
- social networks
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 46
- 230000015572 biosynthetic process Effects 0.000 title claims abstract description 15
- 238000000605 extraction Methods 0.000 claims abstract description 9
- 239000000284 extract Substances 0.000 claims description 6
- 230000001747 exhibiting effect Effects 0.000 claims description 3
- 238000005086 pumping Methods 0.000 claims 1
- 230000009193 crawling Effects 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 230000008569 process Effects 0.000 description 3
- 244000018633 Prunus armeniaca Species 0.000 description 2
- 235000009827 Prunus armeniaca Nutrition 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000013467 fragmentation Methods 0.000 description 2
- 238000006062 fragmentation reaction Methods 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- 241000251468 Actinopterygii Species 0.000 description 1
- 241000239290 Araneae Species 0.000 description 1
- 244000046052 Phaseolus vulgaris Species 0.000 description 1
- 235000010627 Phaseolus vulgaris Nutrition 0.000 description 1
- 238000012550 audit Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000000739 chaotic effect Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 230000003749 cleanliness Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000005611 electricity Effects 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 230000000750 progressive effect Effects 0.000 description 1
- 230000003997 social interaction Effects 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/21—Design, administration or maintenance of databases
- G06F16/211—Schema design and management
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/23—Updating
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/31—Indexing; Data structures therefor; Storage structures
- G06F16/313—Selection or weighting of terms for indexing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/06—Buying, selling or leasing transactions
- G06Q30/0601—Electronic shopping [e-shopping]
- G06Q30/0609—Buyer or seller confidence or verification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/06—Buying, selling or leasing transactions
- G06Q30/0601—Electronic shopping [e-shopping]
- G06Q30/0639—Item locations
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/06—Buying, selling or leasing transactions
- G06Q30/0645—Rental transactions; Leasing transactions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/01—Social networking
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/10—Services
- G06Q50/16—Real estate
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Physics & Mathematics (AREA)
- Accounting & Taxation (AREA)
- Finance (AREA)
- Databases & Information Systems (AREA)
- Marketing (AREA)
- General Business, Economics & Management (AREA)
- Strategic Management (AREA)
- Economics (AREA)
- General Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Development Economics (AREA)
- Tourism & Hospitality (AREA)
- Human Resources & Organizations (AREA)
- Primary Health Care (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Computing Systems (AREA)
- Software Systems (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention discloses a kind of rent a house formation gathering method and system based on social networks crawl by web crawlers rent a house information and the corresponding website information of information of renting a house of preset social networks;The entity information in rental housing information is extracted by name entity recognition techniques, and building source of houses database is carried out according to the entity information of extraction;By the corresponding website information of information of renting a house described in web crawlers regular visit, whether information of renting a house described in judgement is outdated information, and is updated according to judging result to the source of houses database;The collection efficiency for information of renting a house can not only be improved, and guarantees the validity of information, avoids the interference of invalid information, user experience is more preferable.
Description
Technical field
The present invention relates to data acquisition and processing (DAP) technical field, especially a kind of information collection of renting a house based on social networks
The system of method and its application this method.
Background technique
Social networks (Social Network), be with the Internet such as E-mail, BBS, blog, microblogging application and
A kind of form for the reaction social interaction group that organic growth is got up, essence are to provide one and share interest, love in crowd
The information such as good, state and activity in line platform.Side of the social networks to get up with internet development to human social activity
Formula, efficiency etc. produce profound influence.
Social networks has been deep into daily life, and form is varied.The rise of social networks, at some
Really bringing to everybody life in level is convenient.The 'inertia' of the mankind is utilized in it, and social networks is become people
Life style, allow it is many indulge in social friends, dependence strongly is produced to present social medium.
It is flooded with the information of magnanimity in social networks, is largely useless information, but there is also part useful informations, only
It is the useful information is still scattered, nonstandard.Undoubtedly, we have come into the fragmentation information epoch.In order to
These fragmentation informations are efficiently used, need to be collected information and arrange.
The prior art is usually to pass through web crawlers to be collected for the massive information obtained in social networks, still,
Since information content is very huge, such as information of renting a house has a very strong timeliness again, and the publication of personal or house property medium is rented a house
The format disunity of information, typesetting are chaotic, not only require a great deal of time and are collected and arrange, but also may be most of
The information collected is expired invalid information.
Summary of the invention
The present invention to solve the above problems, provide a kind of rent a house formation gathering method and system based on social networks,
The collection efficiency for information of renting a house can not only be improved, and guarantees the validity of information, avoids the interference of invalid information, user's body
It tests more preferable.
To achieve the above object, the technical solution adopted by the present invention are as follows:
A kind of formation gathering method of renting a house based on social networks comprising following steps:
Crawl by web crawlers rent a house information and the corresponding network address of information of renting a house of preset social networks
Information;
By name entity recognition techniques extract rental housing information in entity information, and according to the entity information of extraction into
Row building source of houses database;
By the corresponding website information of information of renting a house described in web crawlers regular visit, information of renting a house described in judgement whether be
Outdated information, and the source of houses database is updated according to judging result.
Preferably, after crawling the information of renting a house, whether comprising address information in information of further renting a house described in judgement, if
It is that then invocation map API obtains the corresponding latitude and longitude information of the address information, and the latitude and longitude information is stored to the room
In source database.
It further, further include that the information of renting a house in the source of houses database is showed into user;Methods of exhibiting is to pass through
The point to be taken land on lease of user is obtained, and invocation map API acquisition is described wait a little corresponding latitude and longitude information of taking land on lease, then matching obtains
In the source of houses database with the information of renting a house of the close position wait latitude and longitude information a little of taking land on lease, by the close position
Information of renting a house shows user.
Preferably, the point to be taken land on lease for obtaining user is the place by obtaining user's input, or obtains user on ground
The place being arranged on figure, or carry out obtaining user by GPS positioning and be currently located place;The close position refer to
The region within the scope of pre-set interval centered on the point that waits taking land on lease.
Preferably, the source of houses database uses redis database, and the entity information that the information extraction of renting a house obtains is adopted
It is stored in the redis database with the form of table.
Preferably, whether information of renting a house described in judgement is outdated information, is by believing to accessing the network address for the last time
The acquired information of renting a house of breath carries out following any judgement:
A. judge that the corresponding page details of the website information then judge if it does not exist with the presence or absence of the information of renting a house
For outdated information;
B. information of renting a house described in judgement whether there is the comment information hired out, and if it exists, then be judged as outdated information;
Whether the title for information of c. renting a house described in judgement is revised as having hired out, if having modified, is judged as outdated information.
Preferably, the entity information includes: source of houses place, floor space, price, feature, traffic, restrictive condition, connection
It is mode, website information, data source, renewal time or last time renewal time, whether expired.
Preferably, further include information of renting a house that personal user directly uploads, the personal user includes landlord user and rent
Objective user;The landlord user uploads when renting a house information, further carries out landlord to landlord user by identity card and property ownership certificate
The verifying of identity.
Preferably, intermediary's identification further is carried out to the landlord user, intermediary's recognition methods includes following any
Kind:
A. judge whether the landlord user is associated with the information of renting a house more than preset quantity, if so, being judged as the landlord
User is house property medium;
B. it is carried out judging whether the landlord user is house property medium according to the report information of lessee or tourist.
Corresponding with the collection method, the present invention also provides a kind of, and the information of renting a house based on social networks collects system
System comprising:
Information crawler module, by web crawlers crawl preset social networks rent a house information and this rent a house
The corresponding website information of information;
Database sharing module extracts the entity information in rental housing information, and root by name entity recognition techniques
Building source of houses database is carried out according to the entity information of extraction;
Database update module passes through the corresponding website information of information of renting a house described in web crawlers regular visit, judgement
Whether the information of renting a house is outdated information, and is updated according to judging result to the source of houses database.
The beneficial effects of the present invention are:
(1) present invention carries out building source of houses database by extracting the entity information for information of renting a house, and passes through regular visit
Whether information of renting a house described in being judged is expired, to be updated database (for example, deleting outdated information or modification information
State), the collection efficiency for information of renting a house can not only be improved, and guarantee the validity of information, avoid the interference of invalid information,
User experience is more preferable;
(2) present invention also further extracts the address information rented a house in information, when user searches for the source of houses, can according to
Family wait the information of renting a house for being a little shown close position of taking land on lease, save the browsing time of user;
(3) present invention also further carries out judging whether website information also deposits by the rent a house network address of information of regular visit
, whether comment information updates, whether heading message updates, so that whether judgement information of renting a house expired, greatly ensure that letter
The timeliness and validity of breath;
(4) present invention also further carries out authentication to landlord user, is not only to rent to prevent house property medium from intervening
Visitor saves money, and can be avoided house property medium publication deceptive information.
Specific embodiment
In order to be clearer and more clear technical problems, technical solutions and advantages to be solved, tie below
Closing specific embodiment, the present invention will be described in further detail.It should be appreciated that specific embodiment described herein only to
It explains the present invention, is not intended to limit the present invention.
A kind of formation gathering method of renting a house based on social networks of the invention comprising following steps:
Crawl by web crawlers rent a house information and the corresponding network address of information of renting a house of preset social networks
Information;
By name entity recognition techniques extract rental housing information in entity information, and according to the entity information of extraction into
Row building source of houses database;
By the corresponding website information of information of renting a house described in web crawlers regular visit, information of renting a house described in judgement whether be
Outdated information, and the source of houses database is updated according to judging result.
Wherein, web crawlers also known as " Web Spider " are to find webpage by the chained address of webpage, a certain from website
A page starts, and reads the content of webpage, finds other chained addresses in webpage, is then found by these chained addresses
Next webpage so recycles, technology until webpage all on internet has all been grabbed according to certain strategy.Life
Name Entity recognition (Named Entity Recognition, abbreviation NER) also referred to as " proper name identification " refers in identification text
Entity with certain sense mainly includes name, place name, mechanism name, proper noun etc..
In the present embodiment, after crawling the information of renting a house, whether believe comprising address in information of further renting a house described in judgement
Breath, if so, automatic upload the address for crawling details page, and invocation map API (such as Baidu API) obtains the address information
Corresponding latitude and longitude information, and the latitude and longitude information is stored into the source of houses database.It also, further include by the source of houses
Information of renting a house in database shows user;Methods of exhibiting is the point to be taken land on lease by obtaining user, and invocation map API is obtained
Take it is described wait a little corresponding latitude and longitude information of taking land on lease, then match obtain in the source of houses database with the warp wait take land on lease a little
The information of renting a house of the close position is showed user by the information of renting a house of the close position of latitude information.Wherein, obtain user's
Point to be taken land on lease is the place by obtaining user's input, or obtains the place that user is arranged on map, or pass through
GPS positioning, which carries out obtaining user, is currently located place;The close position refers to the preset areas by this centered on waiting taking land on lease point
Between region in range.
Whether information of renting a house in the present embodiment, described in judgement is outdated information, is by accessing the net for the last time
Information of renting a house acquired in the information of location carries out following any judgement:
A. judge that the corresponding page details of the website information then judge if it does not exist with the presence or absence of the information of renting a house
For outdated information;
B. information of renting a house described in judgement whether there is the comment information hired out and (such as judge whether comprising having hired out,
Rent, the keywords such as subleted and turned), and if it exists, then it is judged as outdated information;
C. the title for information of renting a house described in judgement whether be revised as having hired out (such as judge whether comprising having hired out, having rented,
Sublet and turned to wait keywords), if having modified, it is judged as outdated information.
Rent a house the crawling of information except through web crawlers, further includes that personal user directly uploads in the present embodiment
It rents a house information, the personal user includes landlord user and lessee user;The landlord user uploads when renting a house information, further
The verifying of landlord's identity is carried out to landlord user by identity card and property ownership certificate, lessee user uploads and can not need to audit.
Preferably, intermediary's identification further is carried out to the landlord user, when being judged as house property medium, then deletes user publication
It rents a house information, and limits user speech, situation serious person carries out account and closes.Intermediary's recognition methods includes following any
Kind:
A. judge whether the landlord user is associated with the information of renting a house more than preset quantity, if so, being judged as the landlord
User be house property medium (the same multiple sources of houses of account are then judged as house property medium, if so, carry out label for labelling, and via
Manual examination and verification verifying);
B. according to the report information of lessee or tourist (preferred, report information includes relevant evidence, such as intermediary fee etc.)
It carries out judging whether the landlord user is house property medium.
In the present embodiment, the source of houses database uses redis database, the entity letter that the information extraction of renting a house obtains
Breath is stored in the redis database in the form of table.Wherein, redis is one memory-based high performance
Key-value database is supported numerous types of data and is supported distributed.Key is that speed is fastly and free.
The present embodiment by crawler traverse all platforms (including but not limited to microblogging, small routine, bean cotyledon rent a house, 58 same cities
Deng) information of renting a house, meanwhile, it is to be noted that the crawler want limit frequency and pay attention to platform anti-crawler, it is preferred that pass through maintenance one
Agent pool, the data that crawler is got are stored in redis database.Then, it is extracted and is hired out by name entity recognition techniques
Entity in room information, then the entity extracted is stored in redis database in table form (if there is address is believed
Breath then requests a Baidu API to store latitude and longitude information in the database, and no then longitude and latitude is sky);Finally by regular
Access the information of real estate being timed in more new database.In the present embodiment, the entity information includes: source of houses place, house
Area, price, feature, traffic, restrictive condition, contact method, website information, data source, renewal time or last time update
It is time, whether expired.Concrete example is as follows:
Embodiment one:
The information of renting a house crawled is as follows:
[subleting] sublets the garden the Ling Doujia building * * * at software centre west gate soft three because company moves to, no intermediary fee, and 1300/
Month, one pair one is given as security, 5 electricity 1 of water wraps broadband and property fees.Cell can be arrived directly inside software centre, and walk to work only needs 10 points
Clock.Neighbouring traffic convenience, there are many bus route, the distance for also there was only 5-6 minutes from brt.Furniture is complete, there is washing machine, empty
It adjusts, water heater.Brush access card is needed downstairs, there is an elevator, and safety and comfort, well lighted is with fresh air.Room is all solid wall, other
That side is lived is all small elder sister, will not be disturbed by making noise at night, and unmarried young inhabitation is suitble to.This house is rented by intermediary, because company removes
It moves and just haves no alternative but sublet, lived nearly half a year, think that condition is all well and good.Need to see the addition WeChat ID * * * * * * * in room
The entity information extracted in rental housing information by name entity recognition techniques is as follows:
Embodiment two:
The information of renting a house crawled is as follows:
By the subway station of the road Xing Jin, 45 square metres, one Room of a room, landlord is directly rented.900 yuan.Completely new apartment, landlord directly rent.Family
Have complete, handbag is moved in.Quiet comfortable, room area is big, and daylighting is good, air circulation.Traffic convenience closes on the road Xing Jin subway
It stands, light industry school station, it is very convenient to pass in and out island for Air China, station, city.The neighbouring garden You Xing primary school, the five edge experimental primary school branch schooles Yuan Bo, gently
Engineering school.By Jinyuan Garden Bo Yuan, the Xiamen library shop Ji Meixin also nearby, apricot woods gulf enterprise operation center, Citizen Square, weekend leisure
Good place to go!Neighbouring more supermarkets, side fish shop, pharmacy.Parking is convenient, have the friend of vehicle vehicle can be parked in bis- tunnel apricot Lin Bei or
The road Xing Jin subway station.Searching is liked cleanliness, not noisy, lessee steady in a long-term.
The entity information extracted in rental housing information by name entity recognition techniques is as follows:
In addition, the present invention also provides a kind of Information Collection System of renting a house based on social networks, the collection system with it is described
Collection method is corresponding, and the collection system includes:
Information crawler module, by web crawlers crawl preset social networks rent a house information and this rent a house
The corresponding website information of information;
Database sharing module extracts the entity information in rental housing information, and root by name entity recognition techniques
Building source of houses database is carried out according to the entity information of extraction;
Database update module passes through the corresponding website information of information of renting a house described in web crawlers regular visit, judgement
Whether the information of renting a house is outdated information, and is updated according to judging result to the source of houses database.
It should be noted that all the embodiments in this specification are described in a progressive manner, each embodiment weight
Point explanation is the difference from other embodiments, and the same or similar parts between the embodiments can be referred to each other.
For system embodiments, since it is basically similar to the method embodiment, so being described relatively simple, related place referring to
The part of embodiment of the method illustrates.
Also, herein, the terms "include", "comprise" or its any other variant are intended to the packet of nonexcludability
Contain, so that the process, method, article or equipment for including a series of elements not only includes those elements, but also including
Other elements that are not explicitly listed, or further include for elements inherent to such a process, method, article, or device.
In the absence of more restrictions, the element limited by sentence "including a ...", it is not excluded that including the element
Process, method, article or equipment in there is also other identical elements.In addition, those of ordinary skill in the art can manage
Solution realizes that all or part of the steps of above-described embodiment may be implemented by hardware, and can also be instructed by program relevant
Hardware is completed, and the program can store in a kind of computer readable storage medium, and storage medium mentioned above can be with
It is read-only memory, disk or CD etc..
The preferred embodiment of the present invention has shown and described in above description, it should be understood that the present invention is not limited to this paper institute
The form of disclosure, should not be regarded as an exclusion of other examples, and can be used for other combinations, modifications, and environments, and energy
Enough in this paper invented the scope of the idea, modifications can be made through the above teachings or related fields of technology or knowledge.And people from this field
The modifications and changes that member is carried out do not depart from the spirit and scope of the present invention, then all should be in the protection of appended claims of the present invention
In range.
Claims (10)
1. a kind of formation gathering method of renting a house based on social networks, which comprises the following steps:
Crawl by web crawlers rent a house information and the corresponding website information of information of renting a house of preset social networks;
The entity information in rental housing information is extracted by name entity recognition techniques, and structure is carried out according to the entity information of extraction
Build a house source database;
By the corresponding website information of information of renting a house described in web crawlers regular visit, whether information of renting a house described in judgement is expired
Information, and the source of houses database is updated according to judging result.
2. a kind of formation gathering method of renting a house based on social networks according to claim 1, it is characterised in that: crawl institute
It states after renting a house information, whether comprising address information in information of further renting a house described in judgement, if so, invocation map API is obtained
The corresponding latitude and longitude information of the address information, and the latitude and longitude information is stored into the source of houses database.
3. a kind of formation gathering method of renting a house based on social networks according to claim 2, it is characterised in that: further include
Information of renting a house in the source of houses database is showed into user;Methods of exhibiting is the point to be taken land on lease by obtaining user, and is adjusted
Obtained with map API it is described wait a little corresponding latitude and longitude information of taking land on lease, then match obtain in the source of houses database with it is described
The information of renting a house of the close position is showed user by the information of renting a house of the close position wait latitude and longitude information a little of taking land on lease.
4. a kind of formation gathering method of renting a house based on social networks according to claim 3, it is characterised in that: obtain and use
The point to be taken land on lease at family is by the place of acquisition user's input, or the place that acquisition user is arranged on map, either
It carries out obtaining user by GPS positioning and is currently located place;The close position refer to by this wait take land on lease point centered on it is pre-
If the region in interval range.
5. a kind of formation gathering method of renting a house based on social networks according to claim 1, it is characterised in that: the room
Source database uses redis database, and the entity information that the information extraction of renting a house obtains is stored in institute in the form of table
It states in redis database.
6. a kind of formation gathering method of renting a house based on social networks according to claim 1, it is characterised in that: judge institute
State whether information of renting a house is outdated information, be by access for the last time rent a house acquired in the website information information carry out
Any judgement below:
A. judge that the corresponding page details of the website information were then judged as if it does not exist with the presence or absence of the information of renting a house
Phase information;
B. information of renting a house described in judgement whether there is the comment information hired out, and if it exists, then be judged as outdated information;
Whether the title for information of c. renting a house described in judgement is revised as having hired out, if having modified, is judged as outdated information.
7. a kind of formation gathering method of renting a house based on social networks according to claim 1, it is characterised in that: the reality
Body information includes: source of houses place, floor space, price, feature, traffic, restrictive condition, contact method, website information, data
It is source, renewal time or last time renewal time, whether expired.
8. a kind of formation gathering method of renting a house based on social networks according to any one of claims 1 to 7, feature exist
In: it further include the information of renting a house that personal user directly uploads, the personal user includes landlord user and lessee user;The room
Eastern user uploads when renting a house information, further carries out the verifying of landlord's identity to landlord user by identity card and property ownership certificate.
9. a kind of formation gathering method of renting a house based on social networks according to claim 8, it is characterised in that: further
Intermediary's identification is carried out to the landlord user, intermediary's recognition methods includes following any:
A. judge whether the landlord user is associated with the information of renting a house more than preset quantity, if so, being judged as the landlord user
For house property medium;
B. it is carried out judging whether the landlord user is house property medium according to the report information of lessee or tourist.
10. a kind of Information Collection System of renting a house based on social networks characterized by comprising
Information crawler module crawl by web crawlers rent a house information and the information of renting a house of preset social networks
Corresponding website information;
Database sharing module extracts the entity information in rental housing information by name entity recognition techniques, and according to pumping
The entity information taken carries out building source of houses database;
Database update module, by the corresponding website information of information of renting a house described in web crawlers regular visit, described in judgement
Whether information of renting a house is outdated information, and is updated according to judging result to the source of houses database.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910676168.7A CN110472120A (en) | 2019-07-25 | 2019-07-25 | A kind of rent a house formation gathering method and system based on social networks |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910676168.7A CN110472120A (en) | 2019-07-25 | 2019-07-25 | A kind of rent a house formation gathering method and system based on social networks |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110472120A true CN110472120A (en) | 2019-11-19 |
Family
ID=68508958
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910676168.7A Pending CN110472120A (en) | 2019-07-25 | 2019-07-25 | A kind of rent a house formation gathering method and system based on social networks |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110472120A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112163137A (en) * | 2020-09-02 | 2021-01-01 | 北京神鹰城讯科技股份有限公司 | House renting information searching method based on data acquisition and information extraction |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104182466A (en) * | 2014-07-21 | 2014-12-03 | 安徽华贞信息科技有限公司 | House information base network system |
CN104317857A (en) * | 2014-10-15 | 2015-01-28 | 安徽华贞信息科技有限公司 | House information acquisition service system |
CN106528785A (en) * | 2016-11-03 | 2017-03-22 | 杜剑峰 | Question synthesis based user renting preference capturing method |
-
2019
- 2019-07-25 CN CN201910676168.7A patent/CN110472120A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104182466A (en) * | 2014-07-21 | 2014-12-03 | 安徽华贞信息科技有限公司 | House information base network system |
CN104317857A (en) * | 2014-10-15 | 2015-01-28 | 安徽华贞信息科技有限公司 | House information acquisition service system |
CN106528785A (en) * | 2016-11-03 | 2017-03-22 | 杜剑峰 | Question synthesis based user renting preference capturing method |
Non-Patent Citations (1)
Title |
---|
张浩: ""基于Scrapy的房屋租赁信息搜索系统的设计与实现"", 《中国优秀博硕士学位论文全文数据库(硕士) 信息科技辑》 * |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112163137A (en) * | 2020-09-02 | 2021-01-01 | 北京神鹰城讯科技股份有限公司 | House renting information searching method based on data acquisition and information extraction |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Bloch | Evicting heritage: spatial cleansing and cultural legacy at the Hampi UNESCO site in India | |
CN103154994A (en) | Dynamic place visibility in geo-social networking system | |
Talen | Design for diversity: evaluating the context of socially mixed neighbourhoods | |
Schroder et al. | Giving the ‘right’route directions: The requirements for pedestrian navigation systems | |
Gonick | Interrogating Madrid's “Slum of Shame”: Urban Expansion, Race, and Place‐Based Activisms in the Cañada Real Galiana | |
Zerbini | Human mobility in the Roman Near East: patterns and motives | |
Sletto et al. | The liminality of open space and rhythms of the everyday in Jallah Town, Monrovia, Liberia | |
Pradinie et al. | Who's Own the Public Space?: The Adaptation of Limited Space in Arabic Kampong | |
CN111488409A (en) | City address library construction method, retrieval method and device | |
CN110472120A (en) | A kind of rent a house formation gathering method and system based on social networks | |
KR101513347B1 (en) | Method and apparatus for providing spatial information | |
Hou et al. | [Retracted] Analyzing the Check‐In Behavior of Visitors through Machine Learning Model by Mining Social Network’s Big Data | |
CN108198117A (en) | A kind of wisdom rural area management and construction platform | |
Ikuomola et al. | A secured mobile cloud-based house rental management system | |
Yanagisawa et al. | How mohallas were formed: Typology of mohallas from the viewpoint of spatial formation and the urbanization process in Varanasi, India | |
Bełej | Analysis of spatial distribution of touristic accommodation in Poland with the kernel density estimation of POIs | |
Beer | The intensification of rural-urban networks in the Markham Valley, Papua New Guinea: from gold-prospecting to large-scale capitalist projects | |
Zhan et al. | Minority tourist information service and sustainable development of tourism under the background of smart city | |
Rego | New capital cities in the Global South. Post-modernist context, modernist layout in Nigeria and Brazil | |
Juul | Migration, transit and the informal: Homeless West-African migrants in Copenhagen | |
Ashery | A Ghetto within an Island? The Satmar community of Canvey Island | |
Hou et al. | Spatiotemporal analysis of residents in shanghai by utilizing Chinese microblog Weibo data | |
Bi et al. | Analysis of travel hot spots of taxi passengers based on community detection | |
Carman et al. | THE INTANGIBLE PRESENCE: Investigating battlefi elds | |
Majiet | Gated communities for the working class: A Cape Flats case study |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20191119 |