WO2018058609A1 - Data collecting method and apparatus based on crowdsourcing, and server - Google Patents

Data collecting method and apparatus based on crowdsourcing, and server Download PDF

Info

Publication number
WO2018058609A1
WO2018058609A1 PCT/CN2016/101274 CN2016101274W WO2018058609A1 WO 2018058609 A1 WO2018058609 A1 WO 2018058609A1 CN 2016101274 W CN2016101274 W CN 2016101274W WO 2018058609 A1 WO2018058609 A1 WO 2018058609A1
Authority
WO
WIPO (PCT)
Prior art keywords
task
data collection
data
crowdsourcing
requirement
Prior art date
Application number
PCT/CN2016/101274
Other languages
French (fr)
Chinese (zh)
Inventor
于文渊
许松
张旭
贾西贝
Original Assignee
深圳市华傲数据技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳市华傲数据技术有限公司 filed Critical 深圳市华傲数据技术有限公司
Priority to PCT/CN2016/101274 priority Critical patent/WO2018058609A1/en
Publication of WO2018058609A1 publication Critical patent/WO2018058609A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W64/00Locating users or terminals or network equipment for network management purposes, e.g. mobility management

Definitions

  • the present invention relates to the field of data collection technologies, and in particular, to a crowdsourcing-based data collection method, apparatus, and server.
  • the data can be divided into online data and offline data.
  • the online data is the data existing in the network, and the data can be collected through crawler crawling and other technical means;
  • the offline data refers to the data that needs to be collected in the field, such as The number of flower shops in the city, the release, the size of the store, the occupancy of the community, etc.
  • crowd-source collection is generally used to collect offline data.
  • the crowdsourcing acquisition divides the data collection task into multiple sub-items.
  • the task is arranged for multiple users and given a certain commission.
  • the way of completing the entire data collection task by many users can effectively improve the collection efficiency.
  • there is a lack of a system data collection scheme and there is a lack of connection between various links of data collection, and the user experience is poor.
  • the management of the crowdsourcing task is not perfect, and the task is not perfect. The situation in which the recipient falsifies the data often occurs, so that the authenticity of the data cannot be guaranteed.
  • the prior art lacks a system data collection scheme, and there is a lack of connection between various links of data collection, and the user experience is poor.
  • the present invention provides a crowdsourcing-based data collection method, apparatus, and server to provide a system data collection scheme, improve user experience, and solve existing crowdsourcing collection methods.
  • the management of crowdsourcing tasks is not perfect, and the situation in which the task recipients falsify data often occurs, resulting in the inability to guarantee the authenticity of the data.
  • the problem of the card is not perfect, and the situation in which the task recipients falsify data often occurs, resulting in the inability to guarantee the authenticity of the data. The problem of the card.
  • the present invention provides a crowdsourcing-based data collection method, including:
  • the crowdsourcing-based data collection method organically combines data acquisition requirements acquisition, data collection task generation and release, and acquisition and review of collected data, and provides a kind of data collection link.
  • the system's crowdsourcing-based data collection method has a good user experience. Among them, through the authenticity verification of the collected data, the forged data can be effectively identified, and the problem of forging data by the task recipient is reduced; All data is denoised and integrated, which can remove bad data in the data and improve the validity of the data.
  • the authenticating of the data is performed according to a preset auditing method, including:
  • the sensing information generated by the built-in sensor of the mobile terminal is objectively generated. Therefore, the reliability of the authenticity of the verification data is high, and the method can effectively determine the authenticity of the data.
  • the data collection requirement is a requirement for collecting data to a designated area
  • the acquiring, by the task recipient uploaded by the user, the sensing information generated by the built-in sensor of the mobile terminal used by the data collection includes:
  • Determining the authenticity of the data according to the sensing information including:
  • the location information generated by the GPS module and the location information corresponding to the designated area may be inclusively matched to determine whether the task recipient is data collected in the designated area, thereby determining the task recipient.
  • the authenticity of the uploaded data is easy to understand. If the matching fails, the task recipient can be considered not to be involved in the designated area, and the uploaded data may be forged, that is, the data is not true when judging the data.
  • the method is applicable to the situation that the data collection requirement is a requirement for collecting data in a designated area, and the data authenticity is relatively accurate.
  • the data is matched with the data collection requirement, and specifically, whether the data meets the foregoing indicators, and if yes, Match, otherwise the match fails.
  • the execution of the data collection task can be more comprehensively supervised, and the execution degree of the task can be improved to ensure the validity of the data.
  • the obtaining data collection requirements initiated by the task publisher include:
  • the data collection task dynamic form can function as a template, and by providing the task publisher with a data collection task dynamic form, the task publisher can input the data collection requirement more intuitively and quickly, and the form form is more Preface, easy to change, easy to change and adjust during the execution of the task.
  • the publishing the data collection task comprises:
  • the data collection task is pushed to the user end of the task recipient who meets the specified condition for the task recipient to receive.
  • the above provides two ways to publish data collection tasks.
  • the first one is published through the platform, and the task recipients
  • the second is to pre-select the task recipients according to the requirements of the data collection task according to the specified conditions (such as the distance from the collection location, the number of historical tasks completed, the quality of the historical mission completion, etc.), and then Sending a single order to it is more targeted.
  • the task recipients can be screened during the release task phase to improve the success rate and quality of the data collection task.
  • the generating a data collection task according to the data collection requirement includes:
  • Determining a task publishing mode of the data collection task where the task publishing mode includes a task subcontracting mode, a task allocation mode, and a basic task pricing;
  • the task publishing mode of the data collection task may be determined according to the task density, the number of users of the executable task, the historical score of each user, the spatial distribution of the user, and the like, thereby selecting a more reasonable task publishing mode, and further Make the generated tasks more reasonable to ensure that the data collection task is completed more effectively and reasonably.
  • the crowdsourcing-based data collection method further includes:
  • the data collection task is dynamically adjusted according to the results of denoising and integration.
  • the specific implementation manner is to modify the corresponding data collection requirement dynamic form according to the denoising and integrated abnormal data, and adjust the data collection task according to the content of the modified part in the dynamic collection form of the data collection requirement. In this way, it is possible to correct the obvious erroneous data and abnormal data by re-collecting the data, thereby improving the validity of the finally collected data.
  • the present invention provides a crowdsourcing-based data collection device, including:
  • the data collection requirement acquisition module is configured to acquire a data collection requirement initiated by the task publisher
  • a data collection task publishing module configured to generate a data collection task according to the data collection requirement and issue the data collection task
  • the collection data receiving module is configured to receive data collected by the task recipient uploaded by the user end for the data collection task
  • Collecting data review module for reviewing the authenticity of the data according to a preset audit method
  • the data integration module is used for denoising and integrating all data uploaded by multiple clients.
  • the collecting data review module includes:
  • the sensing information acquiring unit is configured to acquire sensing information generated by a built-in sensor of the mobile terminal used by the task recipient uploaded by the user terminal to collect the data;
  • the authenticity determining unit is configured to determine the authenticity of the data according to the sensing information.
  • the data collection requirement is a requirement for collecting data to a designated area
  • the sensing information acquiring unit includes:
  • a location information obtaining sub-unit configured to acquire location information generated by a built-in GPS module of the mobile terminal used by the task recipient uploaded by the user terminal to collect the data
  • the authenticity determining unit includes:
  • a location matching subunit configured to match the location information with location information corresponding to the specified area
  • the authenticity judging subunit is configured to judge that the data is untrue when the matching fails.
  • the crowdsourcing-based data collection device further includes:
  • the quality auditing module is configured to match the data with the data collection requirement, and determine a quality of completion of the data collection task according to the matching result.
  • the data collection requirement acquisition module includes:
  • a dynamic form providing unit for providing a data collection task dynamic form to a task publisher
  • the collection requirement acquisition unit is configured to obtain the task publisher data collection requirement according to the content input by the task publisher in the data collection task dynamic form.
  • the data collection task publishing module includes:
  • a platform publishing unit configured to publish the data collection task to a designated network crowdsourcing public platform for the task recipient to receive;
  • the push issuing unit is configured to push the data collection task to a user end of the task recipient that meets the specified condition for the task recipient to receive.
  • the data collection task publishing module includes:
  • a publishing mode determining unit configured to determine a task publishing mode of the data collection task, where the task publishing mode includes a task subcontracting mode, a task allocation mode, and a basic task pricing;
  • the collection task generating unit is configured to generate a data collection task according to the data collection requirement and the task publishing mode.
  • the crowdsourcing-based data collection device further includes:
  • the dynamic adjustment module is configured to dynamically adjust the data collection task according to the result of denoising and integration.
  • the present invention provides a crowdsourcing-based data collection server, including: a processor, a memory, a bus interface, a bus, and a transceiver;
  • the processor, the memory and the bus interface are connected by the bus, the transceiver is connected to the bus interface, and the antenna is connected to the transceiver;
  • the memory is used to store a program
  • the processor is configured to read a program in the memory, and execute the crowdsourcing-based data collection method according to any one of the present invention
  • the transceiver is configured to receive and transmit data under the control of the processor.
  • the crowdsourcing-based data collection server provided by the present invention and the crowdsourcing-based data collection method are based on the same inventive concept and have the same beneficial effects.
  • FIG. 1 is a flow chart showing a crowdsourcing-based data collection method according to a first embodiment of the present invention
  • FIG. 2 is a schematic diagram of a crowdsourcing-based data collection device according to a second embodiment of the present invention.
  • FIG. 3 is a schematic diagram of a crowdsourcing-based data collection server according to a third embodiment of the present invention.
  • the application provides a crowdsourcing-based data collection method, device and server. Embodiments of the present invention will be described below with reference to the accompanying drawings.
  • the data described in the embodiments of the present invention may be any form of data such as characters, images, sounds, audio and video, or the like.
  • an application scenario of an embodiment of the present invention is that if a real estate agent wants to cut into a market of a certain city, it is necessary to analyze the community situation of the city (such as occupancy rate, floor plan, infrastructure, etc.) of the city.
  • the data collection task can be issued by the method provided by the present invention, and the community resident or freelancer can receive the task, take a photo of the cell, fill in the description information, and the like, and return the data to the server through the mobile phone.
  • the photos and description information are the data that needs to be collected.
  • FIG. 1 is a flowchart of a crowdsourcing-based data collection method according to a first embodiment of the present invention. As shown in FIG. 1 , a crowdsourcing-based data collection method according to a first embodiment of the present invention is provided. Includes the following steps:
  • Step S101 Acquire a data collection requirement initiated by the task publisher.
  • the execution body of the embodiment of the present invention is a server, which is generally installed in the network background, and the data submission requirement of the user is implemented by the user end of the network front end. Therefore, when the step is implemented, the user can edit the user end. After the data collection requirement is sent by the client to the server, the server can obtain the data collection requirement.
  • the data collection requirements initiated by the task publisher are included, as the data collected by the user is different, and the data requirements are different. :
  • the data collection task dynamic form is an editable form, and the user can freely modify and edit the form to fully express his data collection requirements.
  • the data collection task dynamic form can also be in the collection task. Flexible modification during the execution process to instantly modify the data collection requirements, and then adjust and modify the data collection tasks.
  • the data collection task dynamic form can function as a template, and by providing the task publisher with a data collection task dynamic form, the task publisher can input the data collection requirement more intuitively and quickly, and at the same time, the form form It is more orderly and easy to change, so it is easy to change and adjust during the execution of the task.
  • Step S102 Generate a data collection task according to the data collection requirement and issue the data collection task.
  • the corresponding data collection task can be generated according to the data collection requirement.
  • a simple implementation manner is to directly input the data collection task dynamic form input in step S101 as the data collection task content; The data may be sub-packaged.
  • the data collection task dynamic form is to collect the task content of a cell, and then import a cell list of the city as seed data, according to the above data collection task.
  • the dynamic form and the cell list can generate a plurality of data collection tasks differentiated by cells (which can be regarded as subtasks of the overall data collection task of the city).
  • the generating a data collection task according to the data collection requirement includes:
  • Determining a task publishing mode of the data collection task where the task publishing mode includes a task subcontracting mode, a task allocation mode, and a basic task pricing;
  • the task subcontracting method includes subcontracting or non-subcontracting, and the task allocation manner includes task collection or task assignment.
  • the task distribution pattern may be determined, but not limited to, based on task density, number of users of the executable task, rating of each user, spatial distribution, and the like.
  • the task publishing mode of the data collection task may be determined according to the task density, the number of users of the executable task, the historical score of each user, the spatial distribution of the user, and the like, thereby selecting a more reasonable task publishing mode. In order to make the generated tasks more reasonable, to ensure that the data collection task is completed more effectively and reasonably.
  • the publishing the data collection task includes:
  • the data collection task is pushed to the user end of the task recipient who meets the specified condition for the task recipient to receive.
  • the above provides two methods for publishing data collection tasks.
  • the first is to be released through the platform, and the task recipients are robbed to receive the order.
  • the second is to pre-select the specified conditions according to the requirements of the data collection task (such as the location of the collection).
  • the task recipients who are close to the distance, the amount of historical tasks completed, the quality of the historical missions, etc., are then assigned to them, which is more targeted.
  • the task recipients can be screened during the release task phase to improve data collection. Set the success rate and quality of the task.
  • Step S103 Receive data collected by the task recipient uploaded by the user for the data collection task.
  • the user terminal described in the embodiment of the present invention may be any server device with Internet access functions, such as a mobile phone, a tablet computer, a personal digital assistant (PDA), a notebook computer, a desktop computer, etc., or may be installed on
  • the client software on the server device the client software can control the server device to perform functions such as data collection, reception, and transmission, which are all within the protection scope of the present invention.
  • Step S104 Review the authenticity of the data according to a preset auditing method.
  • the authenticity of the data is reviewed according to a preset auditing method, including:
  • the sensing information generated by the built-in sensor of the mobile terminal is objectively generated. Therefore, the reliability of the authenticity of the verification data is high, and the method can effectively determine the authenticity of the data.
  • the data collection requirement is a requirement for collecting data to a designated area
  • the acquiring, by the task recipient uploaded by the user, the sensing information generated by the built-in sensor of the mobile terminal used by the data collection includes:
  • Determining the authenticity of the data according to the sensing information including:
  • the location information generated by the GPS module and the location information corresponding to the designated area may be inclusively matched to determine whether the task recipient is data collected in the designated area, thereby determining the task recipient.
  • the authenticity of the uploaded data it is easy to understand that if the match fails, the task recipient can be considered as not If there is a specified area, the data uploaded by the user may be forged, that is, when the data is judged to be untrue, the method is applicable to the situation that the data collection requirement is to collect data in a designated area, The data authenticity judgment is more accurate.
  • the task recipient uploaded on the receiving client is for the data collection. After the steps of collecting data from the task, it also includes:
  • the data is matched with the data collection requirement, and specifically, whether the data meets the foregoing indicators, and if yes, Match, otherwise the match fails.
  • the execution of the data collection task can be more comprehensively supervised, and the execution degree of the task can be improved to ensure the validity of the data.
  • Step S105 Denoising and integrating all data uploaded by multiple clients.
  • the data can be summarized into multiple index conditions, and corresponding thresholds are set for each index condition, and then the data uploaded by each client is matched according to the above-mentioned index conditions and thresholds, and the data with failed matching is deleted. Is the process of denoising.
  • the crowdsourcing-based data collection method further includes:
  • the data collection task is dynamically adjusted according to the results of denoising and integration.
  • the specific embodiment is based on Denoising, integrating the abnormal data, modifying the corresponding data collection requirement dynamic form, and adjusting the data collection task according to the content of the modified part in the dynamic form of the data collection requirement.
  • the original data collection task is to collect a floor plan of 200 cells, wherein if the floor plan collected by the 100th cell has a problem, the floor plan data of the cell is re-acquired.
  • the bad data in the data can be removed and the data validity can be improved.
  • the crowdsourcing-based data collection method provided by the present invention organically acquires data acquisition requirements, generates and distributes data collection tasks, and acquires and audits data collected.
  • the combination of the ground provides a systematic crowdsourcing-based data collection method with a good user experience.
  • the forged data can be effectively identified, and the problem of forging data by the task recipient is reduced.
  • the bad data in the data can be removed and the data validity can be improved.
  • FIG. 2 is a schematic diagram of a crowdsourcing-based data collection device according to a second embodiment of the present invention. Since the device embodiment is basically similar to the method embodiment, the description is relatively simple, and the relevant parts can be referred to the description of the method embodiment.
  • the device embodiments described below are merely illustrative.
  • the data collection requirement acquisition module 101 is configured to acquire a data collection requirement initiated by the task publisher;
  • the data collection task issuing module 102 is configured to generate a data collection task according to the data collection requirement and issue the data collection task;
  • the collection data receiving module 103 is configured to receive data collected by the task recipient uploaded by the user end for the data collection task;
  • the data collection module 104 is configured to review the authenticity of the data according to a preset audit method
  • the data collection integration module 105 is configured to perform denoising and integration on all data uploaded by multiple clients.
  • the collecting data review module 104 includes:
  • the sensing information acquiring unit is configured to acquire sensing information generated by a built-in sensor of the mobile terminal used by the task recipient uploaded by the user terminal to collect the data;
  • the authenticity determining unit is configured to determine the authenticity of the data according to the sensing information.
  • the data collection requirement is a requirement for collecting data to a designated area
  • the sensing information acquiring unit includes:
  • a location information obtaining sub-unit configured to acquire location information generated by a built-in GPS module of the mobile terminal used by the task recipient uploaded by the user terminal to collect the data
  • the authenticity determining unit includes:
  • a location matching subunit configured to match the location information with location information corresponding to the specified area
  • the authenticity judging subunit is configured to judge that the data is untrue when the matching fails.
  • the crowdsourcing-based data collection device further includes:
  • the quality auditing module is configured to match the data with the data collection requirement, and determine a quality of completion of the data collection task according to the matching result.
  • the data collection requirement acquisition module 101 includes:
  • a dynamic form providing unit for providing a data collection task dynamic form to a task publisher
  • the collection requirement acquisition unit is configured to obtain the task publisher data collection requirement according to the content input by the task publisher in the data collection task dynamic form.
  • the data collection task issuing module 102 includes:
  • a platform publishing unit configured to publish the data collection task to a designated network crowdsourcing public platform for the task recipient to receive;
  • the push issuing unit is configured to push the data collection task to a user end of the task recipient that meets the specified condition for the task recipient to receive.
  • the data collection task issuing module 102 includes:
  • a publishing mode determining unit configured to determine a task publishing mode of the data collection task, where the task publishing mode includes a task subcontracting mode, a task allocation mode, and a basic task pricing;
  • the collection task generating unit is configured to generate a data collection task according to the data collection requirement and the task publishing mode.
  • the crowdsourcing-based data collection device further includes:
  • the dynamic adjustment module is configured to dynamically adjust the data collection task according to the result of denoising and integration.
  • a crowdsourcing-based data collection device provided by the present invention has the same beneficial effects as the above-described crowdsourcing-based data collection method, and will not be described herein.
  • FIG. 3 is a schematic diagram of a crowdsourcing-based data collection server according to a third embodiment of the present invention.
  • the invention provides a crowdsourcing-based data collection server, comprising: a processor 1, a memory 2, a bus interface 3, a bus 4, and a transceiver 5 and an antenna 6;
  • the processor 1, the memory 2 and the bus interface 3 are connected by the bus 4, the transceiver 5 is connected to the bus interface 3, and the antenna 6 is connected to the transceiver 5;
  • the memory 2 is used to store a program
  • the processor 1 is configured to read a program in the memory 2, and execute the crowdsourcing-based data collection method according to any one of the present inventions;
  • the transceiver 5 is configured to receive and transmit data under the control of the processor 1.
  • a bus architecture (represented by bus 4), which may include any number of interconnected buses and bridges, the bus 4 will include one or more processors represented by processor 1 and a memory represented by memory 2.
  • the various circuits are linked together.
  • the bus 4 can also link various other circuits such as peripherals, voltage regulators, and power management circuits, which are well known in the art and, therefore, will not be further described herein.
  • the bus interface 3 provides an interface between the bus 4 and the transceiver 5.
  • the transceiver 5 can be an element or a plurality of elements, such as a plurality of receivers and transmitters, providing means for communicating with various other devices on a transmission medium.
  • the data processed by the processor 1 is transmitted over the wireless medium via the antenna 6, and further, the antenna 6 also receives the data and transmits the data to the processor 1.
  • Processor 1 is responsible for managing bus 4 and the usual Processing can also provide a variety of functions, including timing, peripheral interfaces, voltage regulation, power management, and other control functions.
  • the memory 2 can be used to store data used by the processor 1 when performing operations.
  • the processor 1 may be a CPU (Central Embedded Device), an ASIC (Application Specific Integrated Circuit), an FPGA (Field-Programmable Gate Array), or a CPLD (Complex Programmable Logic Device). , complex programmable logic devices).
  • a crowdsourcing-based data collection server provided by the present invention has the same beneficial effects as the above-described crowdsourcing-based data collection method, and will not be described herein.
  • each block of the flowchart or block diagram can represent a module, a program segment, or a portion of code that includes one or more of the Executable instructions.
  • the functions noted in the blocks may also occur in a different order than that illustrated in the drawings. For example, two consecutive blocks may be executed substantially in parallel, and they may sometimes be executed in the reverse order, depending upon the functionality involved.
  • each block of the block diagrams and/or flowcharts, and combinations of blocks in the block diagrams and/or flowcharts can be implemented by a dedicated hardware-based server that performs the specified function or action. Or it can be implemented by a combination of dedicated hardware and computer instructions.
  • the crowdsourcing-based data collection device may be a computer program product, including a computer readable storage medium storing program code, the program code including instructions may be used to execute the foregoing method embodiments.
  • program code including instructions may be used to execute the foregoing method embodiments.
  • the disclosed server, apparatus, and method may be implemented in other manners.
  • the device embodiments described above are merely illustrative.
  • the division of the unit is only a logical function division.
  • multiple units or components may be combined or Can be integrated into another server, or some features can be ignored or not executed.
  • the mutual coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some communication interface, device or unit, and may be electrical, mechanical or otherwise.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
  • each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
  • the functions may be stored in a computer readable storage medium if implemented in the form of a software functional unit and sold or used as a standalone product.
  • the technical solution of the present invention which is essential or contributes to the prior art, or a part of the technical solution, may be embodied in the form of a software product, which is stored in a storage medium, including
  • the instructions are used to cause a computer device (which may be a personal computer, server, or network device, etc.) to perform all or part of the steps of the methods described in various embodiments of the present invention.
  • the foregoing storage medium includes: a U disk, a mobile hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disk, and the like. .

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

Provided is a data collecting method based on crowdsourcing. The method comprises: acquiring a data collection requirement initiated by a task publisher; generating a data collection task according to the data collection requirement, and publishing the data collection task; receiving data which is uploaded by a user side and is collected by a task receiver with respect to the data collection task; checking the authenticity of the data according to a pre-set checking method; and denoising and integrating all pieces of data uploaded by a plurality of user sides. The present invention organically combines various data collection links, such as acquisition of a data collection requirement, generation and publishing of a data collection task, and acquisition and checking of collected data, provides a systematic data collection method, and has a good user experience. By means of checking the authenticity of collected data, falsified data can be effectively recognised, and the problem that a task receiver falsifies data is reduced. By means of denoising and integrating data, the validity of the data can be improved.

Description

基于众包的数据采集方法、装置和服务器Crowdsourcing-based data collection method, device and server 技术领域Technical field
本发明涉及数据采集技术领域,具体涉及一种基于众包的数据采集方法、装置及服务器。The present invention relates to the field of data collection technologies, and in particular, to a crowdsourcing-based data collection method, apparatus, and server.
背景技术Background technique
随着互联网信息和大数据分析技术的发展,当今社会在商业、经济、政府及相关领域中,决策行为越来越取决于数据和分析,而不再是经验和直觉。例如企业在产品开发阶段,需要进行市场调研以研究用户需求;商店在选址时,需要对附近小区的入住率进行调研。With the development of Internet information and big data analytics technology, today's society in business, economics, government and related fields, decision-making behavior increasingly depends on data and analysis, and is no longer experience and intuition. For example, in the product development stage, enterprises need to conduct market research to study user needs; when the store is site selection, it is necessary to investigate the occupancy rate of nearby communities.
根据数据来源,数据可以分为线上数据和线下数据,线上数据即网络中存在的数据,可以通过爬虫爬取等技术手段进行数据采集;线下数据是指需要实地采集的数据,如城市中鲜花店的数量、发布、店面大小,小区的入住情况等。According to the data source, the data can be divided into online data and offline data. The online data is the data existing in the network, and the data can be collected through crawler crawling and other technical means; the offline data refers to the data that needs to be collected in the field, such as The number of flower shops in the city, the release, the size of the store, the occupancy of the community, etc.
由于线下数据往往具有采集时间长、区域跨度大等特点,安排专人采集效率低、时效性差,因此,一般采用众包采集的方式采集线下数据,众包采集是将数据采集任务分成多个子任务安排给多个用户并给与一定的佣金,由众多用户共同完成整个数据采集任务的方式,可以有效提高采集效率。但现有技术中缺乏一种系统的数据采集方案,数据采集的各个环节之间缺乏衔接,用户体验较差,此外,现有的众包采集方式中,对众包任务的管理不完善,任务领取者伪造数据的情况时常发生,致使数据真实性无法保证。此外,现有技术中缺乏一种系统的数据采集方案,数据采集的各个环节之间缺乏衔接,用户体验都较差。Because offline data often has the characteristics of long acquisition time and large regional span, the arrangement of special personnel is low in efficiency and time-sensitive. Therefore, crowd-source collection is generally used to collect offline data. The crowdsourcing acquisition divides the data collection task into multiple sub-items. The task is arranged for multiple users and given a certain commission. The way of completing the entire data collection task by many users can effectively improve the collection efficiency. However, in the prior art, there is a lack of a system data collection scheme, and there is a lack of connection between various links of data collection, and the user experience is poor. In addition, in the existing crowdsourcing collection method, the management of the crowdsourcing task is not perfect, and the task is not perfect. The situation in which the recipient falsifies the data often occurs, so that the authenticity of the data cannot be guaranteed. In addition, the prior art lacks a system data collection scheme, and there is a lack of connection between various links of data collection, and the user experience is poor.
发明内容Summary of the invention
针对现有技术中的缺陷,本发明提供一种基于众包的数据采集方法、装置及服务器,以提供一种系统的数据采集方案,提升用户体验,同时解决现有的众包采集方式中,对众包任务的管理不完善,任务领取者伪造数据的情况时常发生,致使数据真实性无法保 证的问题。In view of the deficiencies in the prior art, the present invention provides a crowdsourcing-based data collection method, apparatus, and server to provide a system data collection scheme, improve user experience, and solve existing crowdsourcing collection methods. The management of crowdsourcing tasks is not perfect, and the situation in which the task recipients falsify data often occurs, resulting in the inability to guarantee the authenticity of the data. The problem of the card.
第一方面,本发明提供的一种基于众包的数据采集方法,包括:In a first aspect, the present invention provides a crowdsourcing-based data collection method, including:
获取任务发布者发起的数据采集需求;Obtain data collection requirements initiated by the task publisher;
根据所述数据采集需求生成数据采集任务以及发布该数据采集任务;Generating a data collection task according to the data collection requirement and publishing the data collection task;
接收用户端上传的任务领取者针对所述数据采集任务采集的数据;Receiving data collected by the task recipient uploaded by the user for the data collection task;
按照预设的审核方法对所述数据的真实性进行审核;Review the authenticity of the data according to a preset audit method;
对多个用户端上传的所有数据进行去噪、整合。Denoise and integrate all data uploaded by multiple clients.
本发明提供的所述基于众包的数据采集方法,将数据采集需求的获取、数据采集任务的生成、发布以及采集的数据的获取、审核等各个数据采集环节有机地结合起来,提供了一种系统的基于众包的数据采集方法,具有良好的用户体验,其中,通过对采集的数据进行真实性审核,可有效识别伪造数据,减少任务领取者伪造数据的问题;通过对多个用户端上传的所有数据进行去噪、整合,可以去除所述数据中的不良数据,提高数据的有效性。The crowdsourcing-based data collection method provided by the invention organically combines data acquisition requirements acquisition, data collection task generation and release, and acquisition and review of collected data, and provides a kind of data collection link. The system's crowdsourcing-based data collection method has a good user experience. Among them, through the authenticity verification of the collected data, the forged data can be effectively identified, and the problem of forging data by the task recipient is reduced; All data is denoised and integrated, which can remove bad data in the data and improve the validity of the data.
可选的,所述按照预设的审核方法对所述数据的真实性进行审核,包括:Optionally, the authenticating of the data is performed according to a preset auditing method, including:
获取用户端上传的任务领取者采集所述数据使用的移动终端的内置传感器产生的感应信息;Obtaining sensing information generated by a built-in sensor of the mobile terminal used by the task recipient uploaded by the client to collect the data;
根据所述感应信息判断所述数据的真实性。Determining the authenticity of the data based on the sensing information.
其中,由于移动终端的内置传感器产生的感应信息是客观产生的,因此作为验证数据真实性的判断依据可靠性较高,本方法可以有效判断所述数据的真实性。The sensing information generated by the built-in sensor of the mobile terminal is objectively generated. Therefore, the reliability of the authenticity of the verification data is high, and the method can effectively determine the authenticity of the data.
可选的,所述数据采集需求为到指定区域采集数据的需求;Optionally, the data collection requirement is a requirement for collecting data to a designated area;
所述获取用户端上传的任务领取者采集所述数据使用的移动终端的内置传感器产生的感应信息,包括:The acquiring, by the task recipient uploaded by the user, the sensing information generated by the built-in sensor of the mobile terminal used by the data collection, includes:
获取用户端上传的任务领取者采集所述数据使用的移动终端的内置GPS模块产生的位置信息;Obtaining location information generated by a built-in GPS module of the mobile terminal used by the task recipient uploaded by the client to collect the data;
所述根据所述感应信息判断所述数据的真实性,包括:Determining the authenticity of the data according to the sensing information, including:
将所述位置信息与所述指定区域对应的位置信息进行匹配;Matching the location information with location information corresponding to the designated area;
在匹配失败时,判断所述数据不真实。 When the matching fails, it is judged that the data is not true.
上述方法中,可以将GPS模块产生的位置信息与所述指定区域对应的位置信息进行包容性匹配,以判断任务领取者是否是在所述指定区域内采集的数据,从而判断所述任务领取者上传的数据的真实性,容易理解的是,若匹配失败,可以认为任务领取者并没有涉足指定区域,那么其上传的数据有较大的可能是伪造的,即判断所述数据时不真实的,本方法适用于所述数据采集需求为到指定区域采集数据的需求的情形,对数据真实性判断较为准确。In the above method, the location information generated by the GPS module and the location information corresponding to the designated area may be inclusively matched to determine whether the task recipient is data collected in the designated area, thereby determining the task recipient. The authenticity of the uploaded data is easy to understand. If the matching fails, the task recipient can be considered not to be involved in the designated area, and the uploaded data may be forged, that is, the data is not true when judging the data. The method is applicable to the situation that the data collection requirement is a requirement for collecting data in a designated area, and the data authenticity is relatively accurate.
考虑到,对任务完成情况的审核应该是多方面的,不只是真实性一方面,因此,可选的,在所述接收用户端上传的任务领取者针对所述数据采集任务采集的数据的步骤后,还包括:It is considered that the review of the task completion situation should be multi-faceted, not only on the one hand, and therefore, optional, the steps of the data collected by the task recipient uploaded by the receiving user for the data collection task. After that, it also includes:
将所述数据与所述数据采集需求进行匹配,根据匹配结果确定所述数据采集任务的完成质量。Matching the data with the data collection requirement, and determining a quality of completion of the data collection task according to the matching result.
一般情况下,数据采集需求中会有多项采集指标,本方法中,所述将所述数据与所述数据采集需求进行匹配,具体可以是判断所述数据是否符合上述指标,若符合,则匹配,否则匹配失败。这样,可以更加全面的对数据采集任务的执行情况进行监督,提高任务的执行度,以保证数据的有效性。In general, there are multiple collection indicators in the data collection requirement. In the method, the data is matched with the data collection requirement, and specifically, whether the data meets the foregoing indicators, and if yes, Match, otherwise the match fails. In this way, the execution of the data collection task can be more comprehensively supervised, and the execution degree of the task can be improved to ensure the validity of the data.
可选的,所述获取任务发布者发起的数据采集需求,包括:Optionally, the obtaining data collection requirements initiated by the task publisher include:
向任务发布者提供数据采集任务动态表单;Provide a data collection task dynamic form to the task publisher;
根据所述任务发布者在所述数据采集任务动态表单中输入的内容获得所述任务发布者数据采集需求。Obtaining the task publisher data collection requirement according to the content input by the task publisher in the data collection task dynamic form.
本方法中,数据采集任务动态表单可以起到模板的作用,通过为任务发布者提供数据采集任务动态表单,可以使任务发布者更加直观、快捷的输入数据采集需求,同时,表单的形式更加有序、易于更改,便于后续在任务执行过程中变更、调整。In the method, the data collection task dynamic form can function as a template, and by providing the task publisher with a data collection task dynamic form, the task publisher can input the data collection requirement more intuitively and quickly, and the form form is more Preface, easy to change, easy to change and adjust during the execution of the task.
可选的,所述发布该数据采集任务,包括:Optionally, the publishing the data collection task comprises:
将所述数据采集任务发布至指定的网络众包公共平台,以供任务领取者领取;Distributing the data collection task to a designated network crowdsourcing public platform for the task recipient to receive;
或者or
将所述数据采集任务推送至符合指定条件的任务领取者的用户端,以供所述任务领取者领取。The data collection task is pushed to the user end of the task recipient who meets the specified condition for the task recipient to receive.
以上提供了两种数据采集任务的发布方式,第一种是通过平台发布,由任务领取者 抢单领取,第二种是预先根据数据采集任务的需求筛选出符合指定条件(如与采集位置的距离远近、历史任务完成量的多少、历史任务完成质量的高低等)的任务领取者,然后向其派单,更有针对性,可以在发布任务阶段对任务领取者进行筛选,以提高数据采集任务的成功率和完成质量。The above provides two ways to publish data collection tasks. The first one is published through the platform, and the task recipients The second is to pre-select the task recipients according to the requirements of the data collection task according to the specified conditions (such as the distance from the collection location, the number of historical tasks completed, the quality of the historical mission completion, etc.), and then Sending a single order to it is more targeted. The task recipients can be screened during the release task phase to improve the success rate and quality of the data collection task.
可选的,所述根据所述数据采集需求生成数据采集任务,包括:Optionally, the generating a data collection task according to the data collection requirement includes:
确定所述数据采集任务的任务发布模式,所述任务发布模式包括任务分包方式、任务分配方式和基本任务定价;Determining a task publishing mode of the data collection task, where the task publishing mode includes a task subcontracting mode, a task allocation mode, and a basic task pricing;
根据所述数据采集需求和所述任务发布模式,生成数据采集任务。Generating a data collection task according to the data collection requirement and the task publishing mode.
本部分,可以根据任务密度、可执行任务的用户数、每个用户的历史评分、用户的空间分布等等要素确定所述数据采集任务的任务发布模式,从而选择更合理的任务发布模式,进而使生成的任务更加合理,以保证数据采集任务更加有效、合理的完成。In this part, the task publishing mode of the data collection task may be determined according to the task density, the number of users of the executable task, the historical score of each user, the spatial distribution of the user, and the like, thereby selecting a more reasonable task publishing mode, and further Make the generated tasks more reasonable to ensure that the data collection task is completed more effectively and reasonably.
可选的,所述基于众包的数据采集方法,还包括:Optionally, the crowdsourcing-based data collection method further includes:
根据去噪、整合的结果对所述数据采集任务进行动态调整。The data collection task is dynamically adjusted according to the results of denoising and integration.
其具体实施方式为根据去噪、整合出来的异常数据,修改相应的数据采集需求动态表单,根据数据采集需求动态表单中修改部分的内容有针对性的对数据采集任务进行调整。这样,可以通过重新采集数据对明显的错误数据和异常数据进行修正,提高最终采集的数据的有效性。The specific implementation manner is to modify the corresponding data collection requirement dynamic form according to the denoising and integrated abnormal data, and adjust the data collection task according to the content of the modified part in the dynamic collection form of the data collection requirement. In this way, it is possible to correct the obvious erroneous data and abnormal data by re-collecting the data, thereby improving the validity of the finally collected data.
第二方面,本发明提供的一种基于众包的数据采集装置,包括:In a second aspect, the present invention provides a crowdsourcing-based data collection device, including:
数据采集需求获取模块,用于获取任务发布者发起的数据采集需求;The data collection requirement acquisition module is configured to acquire a data collection requirement initiated by the task publisher;
数据采集任务发布模块,用于根据所述数据采集需求生成数据采集任务以及发布该数据采集任务;a data collection task publishing module, configured to generate a data collection task according to the data collection requirement and issue the data collection task;
采集数据接收模块,用于接收用户端上传的任务领取者针对所述数据采集任务采集的数据;The collection data receiving module is configured to receive data collected by the task recipient uploaded by the user end for the data collection task;
采集数据审核模块,用于按照预设的审核方法对所述数据的真实性进行审核;Collecting data review module for reviewing the authenticity of the data according to a preset audit method;
采集数据整合模块,用于对多个用户端上传的所有数据进行去噪、整合。The data integration module is used for denoising and integrating all data uploaded by multiple clients.
可选的,所述采集数据审核模块,包括:Optionally, the collecting data review module includes:
感应信息获取单元,用于获取用户端上传的任务领取者采集所述数据使用的移动终端的内置传感器产生的感应信息; The sensing information acquiring unit is configured to acquire sensing information generated by a built-in sensor of the mobile terminal used by the task recipient uploaded by the user terminal to collect the data;
真实性判断单元,用于根据所述感应信息判断所述数据的真实性。The authenticity determining unit is configured to determine the authenticity of the data according to the sensing information.
可选的,所述数据采集需求为到指定区域采集数据的需求;Optionally, the data collection requirement is a requirement for collecting data to a designated area;
所述感应信息获取单元,包括:The sensing information acquiring unit includes:
定位信息获取子单元,用于获取用户端上传的任务领取者采集所述数据使用的移动终端的内置GPS模块产生的位置信息;a location information obtaining sub-unit, configured to acquire location information generated by a built-in GPS module of the mobile terminal used by the task recipient uploaded by the user terminal to collect the data;
所述真实性判断单元,包括:The authenticity determining unit includes:
位置匹配子单元,用于将所述位置信息与所述指定区域对应的位置信息进行匹配;a location matching subunit, configured to match the location information with location information corresponding to the specified area;
真实性判断子单元,用于在匹配失败时,判断所述数据不真实。The authenticity judging subunit is configured to judge that the data is untrue when the matching fails.
可选的,所述基于众包的数据采集装置,还包括:Optionally, the crowdsourcing-based data collection device further includes:
完成质量审核模块,用于将所述数据与所述数据采集需求进行匹配,根据匹配结果确定所述数据采集任务的完成质量。The quality auditing module is configured to match the data with the data collection requirement, and determine a quality of completion of the data collection task according to the matching result.
可选的,所述数据采集需求获取模块,包括:Optionally, the data collection requirement acquisition module includes:
动态表单提供单元,用于向任务发布者提供数据采集任务动态表单;A dynamic form providing unit for providing a data collection task dynamic form to a task publisher;
采集需求获取单元,用于根据所述任务发布者在所述数据采集任务动态表单中输入的内容获得所述任务发布者数据采集需求。The collection requirement acquisition unit is configured to obtain the task publisher data collection requirement according to the content input by the task publisher in the data collection task dynamic form.
可选的,所述数据采集任务发布模块,包括:Optionally, the data collection task publishing module includes:
平台发布单元,用于将所述数据采集任务发布至指定的网络众包公共平台,以供任务领取者领取;a platform publishing unit, configured to publish the data collection task to a designated network crowdsourcing public platform for the task recipient to receive;
或者or
推送发布单元,用于将所述数据采集任务推送至符合指定条件的任务领取者的用户端,以供所述任务领取者领取。The push issuing unit is configured to push the data collection task to a user end of the task recipient that meets the specified condition for the task recipient to receive.
可选的,所述数据采集任务发布模块,包括:Optionally, the data collection task publishing module includes:
发布模式确定单元,用于确定所述数据采集任务的任务发布模式,所述任务发布模式包括任务分包方式、任务分配方式和基本任务定价;a publishing mode determining unit, configured to determine a task publishing mode of the data collection task, where the task publishing mode includes a task subcontracting mode, a task allocation mode, and a basic task pricing;
采集任务生成单元,用于根据所述数据采集需求和所述任务发布模式,生成数据采集任务。The collection task generating unit is configured to generate a data collection task according to the data collection requirement and the task publishing mode.
可选的,所述基于众包的数据采集装置,还包括: Optionally, the crowdsourcing-based data collection device further includes:
动态调整模块,用于根据去噪、整合的结果对所述数据采集任务进行动态调整。The dynamic adjustment module is configured to dynamically adjust the data collection task according to the result of denoising and integration.
第三方面,本发明提供的一种基于众包的数据采集服务器,包括:处理器、存储器、总线接口、总线和收发机;In a third aspect, the present invention provides a crowdsourcing-based data collection server, including: a processor, a memory, a bus interface, a bus, and a transceiver;
所述处理器、所述存储器和所述总线接口通过所述总线连接,所述收发机与所述总线接口连接,所述天线与所述收发机连接;The processor, the memory and the bus interface are connected by the bus, the transceiver is connected to the bus interface, and the antenna is connected to the transceiver;
其中,所述存储器用于存储程序;Wherein the memory is used to store a program;
所述处理器,用于读取所述存储器中的程序,执行本发明提供的任一项所述的基于众包的数据采集方法;The processor is configured to read a program in the memory, and execute the crowdsourcing-based data collection method according to any one of the present invention;
所述收发机,用于在所述处理器的控制下接收和发送数据。The transceiver is configured to receive and transmit data under the control of the processor.
本发明提供的所述基于众包的数据采集服务器与所述基于众包的数据采集方法基于相同的发明构思,具有相同的有益效果。The crowdsourcing-based data collection server provided by the present invention and the crowdsourcing-based data collection method are based on the same inventive concept and have the same beneficial effects.
附图说明DRAWINGS
为了更清楚地说明本发明具体实施方式或现有技术中的技术方案,下面将对具体实施方式或现有技术描述中所需要使用的附图作简单地介绍。在所有附图中,类似的元件或部分一般由类似的附图标记标识。附图中,各元件或部分并不一定按照实际的比例绘制。In order to more clearly illustrate the specific embodiments of the present invention or the technical solutions in the prior art, the drawings to be used in the specific embodiments or the description of the prior art will be briefly described below. In all the figures, like elements or parts are generally identified by like reference numerals. In the figures, elements or parts are not necessarily drawn to scale.
图1示出了本发明第一实施例所提供的一种基于众包的数据采集方法的流程图;1 is a flow chart showing a crowdsourcing-based data collection method according to a first embodiment of the present invention;
图2示出了本发明第二实施例所提供的一种基于众包的数据采集装置的示意图;2 is a schematic diagram of a crowdsourcing-based data collection device according to a second embodiment of the present invention;
图3示出了本发明第三实施例所提供的一种基于众包的数据采集服务器的示意图。FIG. 3 is a schematic diagram of a crowdsourcing-based data collection server according to a third embodiment of the present invention.
具体实施方式detailed description
下面将结合附图对本发明技术方案的实施例进行详细的描述。以下实施例仅用于更加清楚地说明本发明的技术方案,因此只是作为示例,而不能以此来限制本发明的保护范围。The embodiments of the technical solution of the present invention will be described in detail below with reference to the accompanying drawings. The following embodiments are only used to more clearly illustrate the technical solutions of the present invention, and thus are merely exemplary and are not intended to limit the scope of the present invention.
需要注意的是,除非另有说明,本申请使用的技术术语或者科学术语应当为本发明所属领域技术人员所理解的通常意义。 It should be noted that the technical terms or scientific terms used herein should be used in the ordinary meaning as understood by those skilled in the art to which the invention belongs, unless otherwise stated.
本申请提供一种基于众包的数据采集方法、装置及服务器。下面结合附图对本发明的实施例进行说明。The application provides a crowdsourcing-based data collection method, device and server. Embodiments of the present invention will be described below with reference to the accompanying drawings.
需要说明的是,本发明实施例中所述的数据,可以是文字、图像、声音、影音等任意形式的数据或其变更。例如,本发明实施例的一种应用场景为,某房地产中介想要切入某城市的市场,需要对该城市的小区情况(如小区的入住率、户型图、基础设施等)进行摸底,因此,可以通过本发明提供的方法发布数据采集任务,由小区居民或自由职业者领取任务,到小区拍照、填写描述信息等通过手机回传到服务器,这些照片、描述信息即为需要采集的数据。It should be noted that the data described in the embodiments of the present invention may be any form of data such as characters, images, sounds, audio and video, or the like. For example, an application scenario of an embodiment of the present invention is that if a real estate agent wants to cut into a market of a certain city, it is necessary to analyze the community situation of the city (such as occupancy rate, floor plan, infrastructure, etc.) of the city. The data collection task can be issued by the method provided by the present invention, and the community resident or freelancer can receive the task, take a photo of the cell, fill in the description information, and the like, and return the data to the server through the mobile phone. The photos and description information are the data that needs to be collected.
图1示出了本发明第一实施例所提供的一种基于众包的数据采集方法的流程图,如图1所示,本发明第一实施例提供的一种基于众包的数据采集方法包括以下步骤:FIG. 1 is a flowchart of a crowdsourcing-based data collection method according to a first embodiment of the present invention. As shown in FIG. 1 , a crowdsourcing-based data collection method according to a first embodiment of the present invention is provided. Includes the following steps:
步骤S101:获取任务发布者发起的数据采集需求。Step S101: Acquire a data collection requirement initiated by the task publisher.
本发明实施例的执行主体为服务器,一般架设于网络后台,而用户提交数据采集需求是通过网络前端的用户端实现的,因此,本步骤在实施时,可以是由用户在用户端上编辑好数据采集需求后,由用户端发送至服务器,服务器即可获取该数据采集需求。The execution body of the embodiment of the present invention is a server, which is generally installed in the network background, and the data submission requirement of the user is implemented by the user end of the network front end. Therefore, when the step is implemented, the user can edit the user end. After the data collection requirement is sent by the client to the server, the server can obtain the data collection requirement.
由于不同的用户采集的数据是不同的,其数据要求也各不相同,为了便于用户输入数据采集需求,在本发明提供的一个实施例中,所述获取任务发布者发起的数据采集需求,包括:In the embodiment provided by the present invention, the data collection requirements initiated by the task publisher are included, as the data collected by the user is different, and the data requirements are different. :
向任务发布者提供数据采集任务动态表单;Provide a data collection task dynamic form to the task publisher;
根据所述任务发布者在所述数据采集任务动态表单中输入的内容获得所述任务发布者数据采集需求。Obtaining the task publisher data collection requirement according to the content input by the task publisher in the data collection task dynamic form.
该数据采集任务动态表单为可编辑表单,用户可以自由修改、编辑该表单,以充分表达自己的数据采集需求,为了实现任务变更、修改的灵活性,该数据采集任务动态表单还可以在采集任务执行过程中灵活修改,从而即时修改数据采集需求,进而对数据采集任务进行调整、修改。The data collection task dynamic form is an editable form, and the user can freely modify and edit the form to fully express his data collection requirements. In order to realize task change and modification flexibility, the data collection task dynamic form can also be in the collection task. Flexible modification during the execution process to instantly modify the data collection requirements, and then adjust and modify the data collection tasks.
本发明实施例中,数据采集任务动态表单可以起到模板的作用,通过为任务发布者提供数据采集任务动态表单,可以使任务发布者更加直观、快捷的输入数据采集需求,同时,表单的形式更加有序、易于更改,便于后续在任务执行过程中变更、调整。 In the embodiment of the present invention, the data collection task dynamic form can function as a template, and by providing the task publisher with a data collection task dynamic form, the task publisher can input the data collection requirement more intuitively and quickly, and at the same time, the form form It is more orderly and easy to change, so it is easy to change and adjust during the execution of the task.
步骤S102:根据所述数据采集需求生成数据采集任务以及发布该数据采集任务。Step S102: Generate a data collection task according to the data collection requirement and issue the data collection task.
在获得数据采集需求后,即可根据所述数据采集需求生成相应的数据采集任务,一种简单的实施方式,是直接将步骤S101中输入完成的数据采集任务动态表单作为数据采集任务内容;考虑到数据可能分包采集,如上述采集某城市小区信息的应用场景中,数据采集任务动态表单为采集一个小区的任务内容,另外再导入一个该城市的小区列表作为种子数据,根据上述数据采集任务动态表单和该小区列表即可生成多个以小区进行区分的多个数据采集任务(可以视为该城市整体数据采集任务的子任务)。After obtaining the data collection requirement, the corresponding data collection task can be generated according to the data collection requirement. A simple implementation manner is to directly input the data collection task dynamic form input in step S101 as the data collection task content; The data may be sub-packaged. In the application scenario of collecting the information of a certain city, the data collection task dynamic form is to collect the task content of a cell, and then import a cell list of the city as seed data, according to the above data collection task. The dynamic form and the cell list can generate a plurality of data collection tasks differentiated by cells (which can be regarded as subtasks of the overall data collection task of the city).
在本发明提供的一个实施例中,所述根据所述数据采集需求生成数据采集任务,包括:In an embodiment provided by the present invention, the generating a data collection task according to the data collection requirement includes:
确定所述数据采集任务的任务发布模式,所述任务发布模式包括任务分包方式、任务分配方式和基本任务定价;Determining a task publishing mode of the data collection task, where the task publishing mode includes a task subcontracting mode, a task allocation mode, and a basic task pricing;
根据所述数据采集需求和所述任务发布模式,生成数据采集任务。Generating a data collection task according to the data collection requirement and the task publishing mode.
其中,任务分包方式包括分包或不分包,任务分配方式包括任务领取或任务分配。任务分布模式可以但不仅限于根据任务密度、可执行任务的用户数、每个用户的评分、空间分布等等确定。The task subcontracting method includes subcontracting or non-subcontracting, and the task allocation manner includes task collection or task assignment. The task distribution pattern may be determined, but not limited to, based on task density, number of users of the executable task, rating of each user, spatial distribution, and the like.
本发明实施例,可以根据任务密度、可执行任务的用户数、每个用户的历史评分、用户的空间分布等等要素确定所述数据采集任务的任务发布模式,从而选择更合理的任务发布模式,进而使生成的任务更加合理,以保证数据采集任务更加有效、合理的完成。In the embodiment of the present invention, the task publishing mode of the data collection task may be determined according to the task density, the number of users of the executable task, the historical score of each user, the spatial distribution of the user, and the like, thereby selecting a more reasonable task publishing mode. In order to make the generated tasks more reasonable, to ensure that the data collection task is completed more effectively and reasonably.
另外,在本发明提供的一个实施例中,所述发布该数据采集任务,包括:In addition, in an embodiment provided by the present invention, the publishing the data collection task includes:
将所述数据采集任务发布至指定的网络众包公共平台,以供任务领取者领取;Distributing the data collection task to a designated network crowdsourcing public platform for the task recipient to receive;
或者or
将所述数据采集任务推送至符合指定条件的任务领取者的用户端,以供所述任务领取者领取。The data collection task is pushed to the user end of the task recipient who meets the specified condition for the task recipient to receive.
以上提供了两种数据采集任务的发布方式,第一种是通过平台发布,由任务领取者抢单领取,第二种是预先根据数据采集任务的需求筛选出符合指定条件(如与采集位置的距离远近、历史任务完成量的多少、历史任务完成质量的高低等)的任务领取者,然后向其派单,更有针对性,可以在发布任务阶段对任务领取者进行筛选,以提高数据采 集任务的成功率和完成质量。The above provides two methods for publishing data collection tasks. The first is to be released through the platform, and the task recipients are robbed to receive the order. The second is to pre-select the specified conditions according to the requirements of the data collection task (such as the location of the collection). The task recipients who are close to the distance, the amount of historical tasks completed, the quality of the historical missions, etc., are then assigned to them, which is more targeted. The task recipients can be screened during the release task phase to improve data collection. Set the success rate and quality of the task.
步骤S103:接收用户端上传的任务领取者针对所述数据采集任务采集的数据。Step S103: Receive data collected by the task recipient uploaded by the user for the data collection task.
本发明实施例中所述的用户端可以是任何具有上网功能的服务器设备,如手机、平板电脑、个人数字助理(Personal Digital Assistant,PDA)、笔记本电脑、台式机电脑等;也可以是安装于上述服务器设备上的客户端软件,该客户端软件可以控制所述服务器设备执行数据采集、接收和发送等功能,其均在本发明的保护范围之内。The user terminal described in the embodiment of the present invention may be any server device with Internet access functions, such as a mobile phone, a tablet computer, a personal digital assistant (PDA), a notebook computer, a desktop computer, etc., or may be installed on The client software on the server device, the client software can control the server device to perform functions such as data collection, reception, and transmission, which are all within the protection scope of the present invention.
步骤S104:按照预设的审核方法对所述数据的真实性进行审核。Step S104: Review the authenticity of the data according to a preset auditing method.
在本发明提供的一个实施例中,所述按照预设的审核方法对所述数据的真实性进行审核,包括:In an embodiment provided by the present invention, the authenticity of the data is reviewed according to a preset auditing method, including:
获取用户端上传的任务领取者采集所述数据使用的移动终端的内置传感器产生的感应信息;Obtaining sensing information generated by a built-in sensor of the mobile terminal used by the task recipient uploaded by the client to collect the data;
根据所述感应信息判断所述数据的真实性。Determining the authenticity of the data based on the sensing information.
其中,由于移动终端的内置传感器产生的感应信息是客观产生的,因此作为验证数据真实性的判断依据可靠性较高,本方法可以有效判断所述数据的真实性。The sensing information generated by the built-in sensor of the mobile terminal is objectively generated. Therefore, the reliability of the authenticity of the verification data is high, and the method can effectively determine the authenticity of the data.
在本发明提供的一个实施例中,所述数据采集需求为到指定区域采集数据的需求;In an embodiment provided by the present invention, the data collection requirement is a requirement for collecting data to a designated area;
所述获取用户端上传的任务领取者采集所述数据使用的移动终端的内置传感器产生的感应信息,包括:The acquiring, by the task recipient uploaded by the user, the sensing information generated by the built-in sensor of the mobile terminal used by the data collection, includes:
获取用户端上传的任务领取者采集所述数据使用的移动终端的内置GPS模块产生的位置信息;Obtaining location information generated by a built-in GPS module of the mobile terminal used by the task recipient uploaded by the client to collect the data;
所述根据所述感应信息判断所述数据的真实性,包括:Determining the authenticity of the data according to the sensing information, including:
将所述位置信息与所述指定区域对应的位置信息进行匹配;Matching the location information with location information corresponding to the designated area;
在匹配失败时,判断所述数据不真实。When the matching fails, it is judged that the data is not true.
上述方法中,可以将GPS模块产生的位置信息与所述指定区域对应的位置信息进行包容性匹配,以判断任务领取者是否是在所述指定区域内采集的数据,从而判断所述任务领取者上传的数据的真实性,容易理解的是,若匹配失败,可以认为任务领取者并没 有涉足指定区域,那么其上传的数据有较大的可能是伪造的,即判断所述数据时不真实的,本方法适用于所述数据采集需求为到指定区域采集数据的需求的情形,对数据真实性判断较为准确。In the above method, the location information generated by the GPS module and the location information corresponding to the designated area may be inclusively matched to determine whether the task recipient is data collected in the designated area, thereby determining the task recipient. The authenticity of the uploaded data, it is easy to understand that if the match fails, the task recipient can be considered as not If there is a specified area, the data uploaded by the user may be forged, that is, when the data is judged to be untrue, the method is applicable to the situation that the data collection requirement is to collect data in a designated area, The data authenticity judgment is more accurate.
考虑到,对任务完成情况的审核应该是多方面的,不只是真实性一方面,因此,在本发明提供的一个实施例中,在所述接收用户端上传的任务领取者针对所述数据采集任务采集的数据的步骤后,还包括:It is considered that the review of the task completion situation should be multi-faceted, not only on the one hand, and therefore, in an embodiment provided by the present invention, the task recipient uploaded on the receiving client is for the data collection. After the steps of collecting data from the task, it also includes:
将所述数据与所述数据采集需求进行匹配,根据匹配结果确定所述数据采集任务的完成质量。Matching the data with the data collection requirement, and determining a quality of completion of the data collection task according to the matching result.
一般情况下,数据采集需求中会有多项采集指标,本方法中,所述将所述数据与所述数据采集需求进行匹配,具体可以是判断所述数据是否符合上述指标,若符合,则匹配,否则匹配失败。这样,可以更加全面的对数据采集任务的执行情况进行监督,提高任务的执行度,以保证数据的有效性。In general, there are multiple collection indicators in the data collection requirement. In the method, the data is matched with the data collection requirement, and specifically, whether the data meets the foregoing indicators, and if yes, Match, otherwise the match fails. In this way, the execution of the data collection task can be more comprehensively supervised, and the execution degree of the task can be improved to ensure the validity of the data.
步骤S105:对多个用户端上传的所有数据进行去噪、整合。Step S105: Denoising and integrating all data uploaded by multiple clients.
在众包的数据采集中,同一个任务会有多个任务领取者去执行,每个任务领取者可能是执行其中的一部分,最后将各个部分的数据进行汇总、整合;也可能多个任务领取者执行相同的内容,最后也需要将这些任务领取者采集的数据进行整合。In crowdsourced data collection, multiple task recipients will be executed in the same task. Each task recipient may be part of the execution. Finally, the data of each part will be aggregated and integrated. The same content is executed, and finally the data collected by these task recipients needs to be integrated.
其中,由于任务领取者采集的数据未必都能够符合预期的标准,上传的数据也有可能会有明显的错误等,这些数据如果用来进行数据分析,可能对最终的分析结果产生不良影响,因此,还需要对这些数据进行去噪处理,删除其中明显的错误数据和异常数据。Among them, because the data collected by the task recipients may not meet the expected standards, there may be obvious errors in the uploaded data. If these data are used for data analysis, the final analysis results may be adversely affected. It is also necessary to denoise these data and delete the obvious erroneous data and abnormal data.
具体实施时,可以对数据归纳出多个指标条件,同时为各个指标条件设定相应的阈值,然后根据上述指标条件及其阈值对各个用户端上传的数据进行匹配,将匹配失败的数据进行删除,即为去噪的过程。In the specific implementation, the data can be summarized into multiple index conditions, and corresponding thresholds are set for each index condition, and then the data uploaded by each client is matched according to the above-mentioned index conditions and thresholds, and the data with failed matching is deleted. Is the process of denoising.
最后将各个用户端上传的数据进行汇总、整合。Finally, the data uploaded by each client is aggregated and integrated.
在本步骤中,若发现异常数据和错误数据,可以根据实际情况修改原数据采集任务,重新采集相应的数据,以保证最终数据采集结果的有效性。在本发明提供的一个实施例中,所述基于众包的数据采集方法,还包括:In this step, if abnormal data and error data are found, the original data collection task can be modified according to the actual situation, and the corresponding data is re-acquired to ensure the validity of the final data collection result. In an embodiment of the present invention, the crowdsourcing-based data collection method further includes:
根据去噪、整合的结果对所述数据采集任务进行动态调整。其具体实施方式为根据 去噪、整合出来的异常数据,修改相应的数据采集需求动态表单,根据数据采集需求动态表单中修改部分的内容有针对性的对数据采集任务进行调整。这样,可以通过重新采集数据对明显的错误数据和异常数据进行修正,提高最终采集的数据的有效性。例如,原数据采集任务为采集200个小区的户型图,其中第100个小区采集的户型图有问题,则重新采集该小区的户型图数据。The data collection task is dynamically adjusted according to the results of denoising and integration. The specific embodiment is based on Denoising, integrating the abnormal data, modifying the corresponding data collection requirement dynamic form, and adjusting the data collection task according to the content of the modified part in the dynamic form of the data collection requirement. In this way, it is possible to correct the obvious erroneous data and abnormal data by re-collecting the data, thereby improving the validity of the finally collected data. For example, the original data collection task is to collect a floor plan of 200 cells, wherein if the floor plan collected by the 100th cell has a problem, the floor plan data of the cell is re-acquired.
最后,通过对多个用户端上传的所有数据进行去噪、整合,可以去除所述数据中的不良数据,提高数据的有效性。Finally, by denoising and integrating all the data uploaded by multiple clients, the bad data in the data can be removed and the data validity can be improved.
至此,通过步骤S101至步骤S105,完成了本发明第一实施例所提供的一种基于众包的数据采集方法的流程。相较于现有技术中,本发明提供的所述基于众包的数据采集方法,将数据采集需求的获取、数据采集任务的生成、发布以及采集的数据的获取、审核等各个数据采集环节有机地结合起来,提供了一种系统的基于众包的数据采集方法,具有良好的用户体验,其中,通过对采集的数据进行真实性审核,可有效识别伪造数据,减少任务领取者伪造数据的问题;通过对多个用户端上传的所有数据进行去噪、整合,可以去除所述数据中的不良数据,提高数据的有效性。So far, through the steps S101 to S105, the flow of a crowdsourcing-based data collection method provided by the first embodiment of the present invention is completed. Compared with the prior art, the crowdsourcing-based data collection method provided by the present invention organically acquires data acquisition requirements, generates and distributes data collection tasks, and acquires and audits data collected. The combination of the ground provides a systematic crowdsourcing-based data collection method with a good user experience. Among them, through the authenticity verification of the collected data, the forged data can be effectively identified, and the problem of forging data by the task recipient is reduced. By denoising and integrating all data uploaded by multiple clients, the bad data in the data can be removed and the data validity can be improved.
在上述的第一实施例中,提供了一种基于众包的数据采集方法,与之相对应的,本申请还提供一种基于众包的数据采集装置。请参考图2,其为本发明第二实施例提供的一种基于众包的数据采集装置的示意图。由于装置实施例基本相似于方法实施例,所以描述得比较简单,相关之处参见方法实施例的部分说明即可。下述描述的装置实施例仅仅是示意性的。In the first embodiment described above, a crowdsourcing-based data collection method is provided. Correspondingly, the present application further provides a crowdsourcing-based data collection device. Please refer to FIG. 2 , which is a schematic diagram of a crowdsourcing-based data collection device according to a second embodiment of the present invention. Since the device embodiment is basically similar to the method embodiment, the description is relatively simple, and the relevant parts can be referred to the description of the method embodiment. The device embodiments described below are merely illustrative.
本发明第二实施例提供的一种基于众包的数据采集装置,包括:A crowdsourcing-based data collection device according to a second embodiment of the present invention includes:
数据采集需求获取模块101,用于获取任务发布者发起的数据采集需求;The data collection requirement acquisition module 101 is configured to acquire a data collection requirement initiated by the task publisher;
数据采集任务发布模块102,用于根据所述数据采集需求生成数据采集任务以及发布该数据采集任务;The data collection task issuing module 102 is configured to generate a data collection task according to the data collection requirement and issue the data collection task;
采集数据接收模块103,用于接收用户端上传的任务领取者针对所述数据采集任务采集的数据;The collection data receiving module 103 is configured to receive data collected by the task recipient uploaded by the user end for the data collection task;
采集数据审核模块104,用于按照预设的审核方法对所述数据的真实性进行审核; The data collection module 104 is configured to review the authenticity of the data according to a preset audit method;
采集数据整合模块105,用于对多个用户端上传的所有数据进行去噪、整合。The data collection integration module 105 is configured to perform denoising and integration on all data uploaded by multiple clients.
在本发明提供的一个实施例中,所述采集数据审核模块104,包括:In an embodiment provided by the present invention, the collecting data review module 104 includes:
感应信息获取单元,用于获取用户端上传的任务领取者采集所述数据使用的移动终端的内置传感器产生的感应信息;The sensing information acquiring unit is configured to acquire sensing information generated by a built-in sensor of the mobile terminal used by the task recipient uploaded by the user terminal to collect the data;
真实性判断单元,用于根据所述感应信息判断所述数据的真实性。The authenticity determining unit is configured to determine the authenticity of the data according to the sensing information.
在本发明提供的一个实施例中,所述数据采集需求为到指定区域采集数据的需求;In an embodiment provided by the present invention, the data collection requirement is a requirement for collecting data to a designated area;
所述感应信息获取单元,包括:The sensing information acquiring unit includes:
定位信息获取子单元,用于获取用户端上传的任务领取者采集所述数据使用的移动终端的内置GPS模块产生的位置信息;a location information obtaining sub-unit, configured to acquire location information generated by a built-in GPS module of the mobile terminal used by the task recipient uploaded by the user terminal to collect the data;
所述真实性判断单元,包括:The authenticity determining unit includes:
位置匹配子单元,用于将所述位置信息与所述指定区域对应的位置信息进行匹配;a location matching subunit, configured to match the location information with location information corresponding to the specified area;
真实性判断子单元,用于在匹配失败时,判断所述数据不真实。The authenticity judging subunit is configured to judge that the data is untrue when the matching fails.
在本发明提供的一个实施例中,所述基于众包的数据采集装置,还包括:In an embodiment of the present invention, the crowdsourcing-based data collection device further includes:
完成质量审核模块,用于将所述数据与所述数据采集需求进行匹配,根据匹配结果确定所述数据采集任务的完成质量。The quality auditing module is configured to match the data with the data collection requirement, and determine a quality of completion of the data collection task according to the matching result.
在本发明提供的一个实施例中,所述数据采集需求获取模块101,包括:In an embodiment of the present invention, the data collection requirement acquisition module 101 includes:
动态表单提供单元,用于向任务发布者提供数据采集任务动态表单;A dynamic form providing unit for providing a data collection task dynamic form to a task publisher;
采集需求获取单元,用于根据所述任务发布者在所述数据采集任务动态表单中输入的内容获得所述任务发布者数据采集需求。The collection requirement acquisition unit is configured to obtain the task publisher data collection requirement according to the content input by the task publisher in the data collection task dynamic form.
在本发明提供的一个实施例中,所述数据采集任务发布模块102,包括:In an embodiment of the present invention, the data collection task issuing module 102 includes:
平台发布单元,用于将所述数据采集任务发布至指定的网络众包公共平台,以供任务领取者领取;a platform publishing unit, configured to publish the data collection task to a designated network crowdsourcing public platform for the task recipient to receive;
或者or
推送发布单元,用于将所述数据采集任务推送至符合指定条件的任务领取者的用户端,以供所述任务领取者领取。The push issuing unit is configured to push the data collection task to a user end of the task recipient that meets the specified condition for the task recipient to receive.
在本发明提供的一个实施例中,所述数据采集任务发布模块102,包括: In an embodiment of the present invention, the data collection task issuing module 102 includes:
发布模式确定单元,用于确定所述数据采集任务的任务发布模式,所述任务发布模式包括任务分包方式、任务分配方式和基本任务定价;a publishing mode determining unit, configured to determine a task publishing mode of the data collection task, where the task publishing mode includes a task subcontracting mode, a task allocation mode, and a basic task pricing;
采集任务生成单元,用于根据所述数据采集需求和所述任务发布模式,生成数据采集任务。The collection task generating unit is configured to generate a data collection task according to the data collection requirement and the task publishing mode.
在本发明提供的一个实施例中,所述基于众包的数据采集装置,还包括:In an embodiment of the present invention, the crowdsourcing-based data collection device further includes:
动态调整模块,用于根据去噪、整合的结果对所述数据采集任务进行动态调整。The dynamic adjustment module is configured to dynamically adjust the data collection task according to the result of denoising and integration.
以上,为本发明第二实施例提供的一种基于众包的数据采集装置说明。The above is a description of a crowdsourcing-based data collection device according to a second embodiment of the present invention.
本发明提供的一种基于众包的数据采集装置与上述基于众包的数据采集方法出于相同的发明构思,具有相同的有益效果,此处不再赘述。A crowdsourcing-based data collection device provided by the present invention has the same beneficial effects as the above-described crowdsourcing-based data collection method, and will not be described herein.
请参考图3,其为本发明第三实施例所提供的一种基于众包的数据采集服务器的示意图。本发明提供的一种基于众包的数据采集服务器,包括:处理器1、存储器2、总线接口3、总线4和收发机5和天线6;Please refer to FIG. 3 , which is a schematic diagram of a crowdsourcing-based data collection server according to a third embodiment of the present invention. The invention provides a crowdsourcing-based data collection server, comprising: a processor 1, a memory 2, a bus interface 3, a bus 4, and a transceiver 5 and an antenna 6;
所述处理器1、所述存储器2和所述总线接口3通过所述总线4连接,所述收发机5与所述总线接口3连接,所述天线6与所述收发机5连接;The processor 1, the memory 2 and the bus interface 3 are connected by the bus 4, the transceiver 5 is connected to the bus interface 3, and the antenna 6 is connected to the transceiver 5;
其中,所述存储器2用于存储程序;Wherein the memory 2 is used to store a program;
所述处理器1,用于读取所述存储器2中的程序,执行本发明提供的任一项所述的基于众包的数据采集方法;The processor 1 is configured to read a program in the memory 2, and execute the crowdsourcing-based data collection method according to any one of the present inventions;
所述收发机5,用于在所述处理器1的控制下接收和发送数据。The transceiver 5 is configured to receive and transmit data under the control of the processor 1.
在图3中,总线架构(用总线4来代表),总线4可以包括任意数量的互联的总线和桥,总线4将包括由处理器1代表的一个或多个处理器和存储器2代表的存储器的各种电路链接在一起。总线4还可以将诸如外围设备、稳压器和功率管理电路等之类的各种其他电路链接在一起,这些都是本领域所公知的,因此,本文不再对其进行进一步描述。总线接口3在总线4和收发机5之间提供接口。收发机5可以是一个元件,也可以是多个元件,比如多个接收器和发送器,提供用于在传输介质上与各种其他装置通信的单元。经处理器1处理的数据通过天线6在无线介质上进行传输,进一步,天线6还接收数据并将数据传送给处理器1。处理器1负责管理总线4和通常的 处理,还可以提供各种功能,包括定时,外围接口,电压调节、电源管理以及其他控制功能。而存储器2可以被用于存储处理器1在执行操作时所使用的数据。可选的,处理器1可以是CPU(中央处埋器)、ASIC(Application Specific Integrated Circuit,专用集成电路)、FPGA(Field-Programmable Gate Array,现场可编程门阵列)或CPLD(Complex Programmable Logic Device,复杂可编程逻辑器件)。In FIG. 3, a bus architecture (represented by bus 4), which may include any number of interconnected buses and bridges, the bus 4 will include one or more processors represented by processor 1 and a memory represented by memory 2. The various circuits are linked together. The bus 4 can also link various other circuits such as peripherals, voltage regulators, and power management circuits, which are well known in the art and, therefore, will not be further described herein. The bus interface 3 provides an interface between the bus 4 and the transceiver 5. The transceiver 5 can be an element or a plurality of elements, such as a plurality of receivers and transmitters, providing means for communicating with various other devices on a transmission medium. The data processed by the processor 1 is transmitted over the wireless medium via the antenna 6, and further, the antenna 6 also receives the data and transmits the data to the processor 1. Processor 1 is responsible for managing bus 4 and the usual Processing can also provide a variety of functions, including timing, peripheral interfaces, voltage regulation, power management, and other control functions. The memory 2 can be used to store data used by the processor 1 when performing operations. Optionally, the processor 1 may be a CPU (Central Embedded Device), an ASIC (Application Specific Integrated Circuit), an FPGA (Field-Programmable Gate Array), or a CPLD (Complex Programmable Logic Device). , complex programmable logic devices).
本发明提供的一种基于众包的数据采集服务器与上述基于众包的数据采集方法出于相同的发明构思,具有相同的有益效果,此处不再赘述。A crowdsourcing-based data collection server provided by the present invention has the same beneficial effects as the above-described crowdsourcing-based data collection method, and will not be described herein.
在本说明书的描述中,参考术语“一个实施例”、“一些实施例”、“示例”、“具体示例”、或“一些示例”等的描述意指结合该实施例或示例描述的具体特征、结构、材料或者特点包含于本发明的至少一个实施例或示例中。在本说明书中,对上述术语的示意性表述不必须针对的是相同的实施例或示例。而且,描述的具体特征、结构、材料或者特点可以在任一个或多个实施例或示例中以合适的方式结合。此外,在不相互矛盾的情况下,本领域的技术人员可以将本说明书中描述的不同实施例或示例以及不同实施例或示例的特征进行结合和组合。In the description of the present specification, the description with reference to the terms "one embodiment", "some embodiments", "example", "specific example", or "some examples" and the like means a specific feature described in connection with the embodiment or example. A structure, material or feature is included in at least one embodiment or example of the invention. In the present specification, the schematic representation of the above terms is not necessarily directed to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in a suitable manner in any one or more embodiments or examples. In addition, various embodiments or examples described in the specification, as well as features of various embodiments or examples, may be combined and combined.
需要说明的是,附图中的流程图和框图显示了根据本发明的多个实施例的服务器、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序段或代码的一部分,所述模块、程序段或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意,在有些作为替换的实现中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个连续的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图和/或流程图中的每个方框、以及框图和/或流程图中的方框的组合,可以用执行规定的功能或动作的专用的基于硬件的服务器来实现,或者可以用专用硬件与计算机指令的组合来实现。It should be noted that the flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of servers, methods, and computer program products in accordance with various embodiments of the present invention. In this regard, each block of the flowchart or block diagram can represent a module, a program segment, or a portion of code that includes one or more of the Executable instructions. It should also be noted that in some alternative implementations, the functions noted in the blocks may also occur in a different order than that illustrated in the drawings. For example, two consecutive blocks may be executed substantially in parallel, and they may sometimes be executed in the reverse order, depending upon the functionality involved. It is also noted that each block of the block diagrams and/or flowcharts, and combinations of blocks in the block diagrams and/or flowcharts, can be implemented by a dedicated hardware-based server that performs the specified function or action. Or it can be implemented by a combination of dedicated hardware and computer instructions.
本发明实施例所提供的基于众包的数据采集装置可以是计算机程序产品,包括存储了程序代码的计算机可读存储介质,所述程序代码包括的指令可用于执行前面方法实施例中所述的方法,具体实现可参见方法实施例,在此不再赘述。The crowdsourcing-based data collection device provided by the embodiments of the present invention may be a computer program product, including a computer readable storage medium storing program code, the program code including instructions may be used to execute the foregoing method embodiments. For the specific implementation, refer to the method embodiment, and details are not described herein again.
所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,上述描述的服务器、装置和单元的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。 A person skilled in the art can clearly understand that, for the convenience and brevity of the description, the specific working process of the server, the device and the unit described above can refer to the corresponding process in the foregoing method embodiment, and details are not described herein again.
在本申请所提供的几个实施例中,应该理解到,所揭露的服务器、装置和方法,可以通过其它的方式实现。以上所描述的装置实施例仅仅是示意性的,例如,所述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,又例如,多个单元或组件可以结合或者可以集成到另一个服务器,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些通信接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。In the several embodiments provided by the present application, it should be understood that the disclosed server, apparatus, and method may be implemented in other manners. The device embodiments described above are merely illustrative. For example, the division of the unit is only a logical function division. In actual implementation, there may be another division manner. For example, multiple units or components may be combined or Can be integrated into another server, or some features can be ignored or not executed. In addition, the mutual coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some communication interface, device or unit, and may be electrical, mechanical or otherwise.
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以发布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
另外,在本发明各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。In addition, each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
所述功能如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本发明各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、磁碟或者光盘等各种可以存储程序代码的介质。The functions may be stored in a computer readable storage medium if implemented in the form of a software functional unit and sold or used as a standalone product. Based on such understanding, the technical solution of the present invention, which is essential or contributes to the prior art, or a part of the technical solution, may be embodied in the form of a software product, which is stored in a storage medium, including The instructions are used to cause a computer device (which may be a personal computer, server, or network device, etc.) to perform all or part of the steps of the methods described in various embodiments of the present invention. The foregoing storage medium includes: a U disk, a mobile hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disk, and the like. .
最后应说明的是:以上各实施例仅用以说明本发明的技术方案,而非对其限制;尽管参照前述各实施例对本发明进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分或者全部技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本发明各实施例技术方案的范围,其均应涵盖在本发明的权利要求和说明书的范围当中。 Finally, it should be noted that the above embodiments are merely illustrative of the technical solutions of the present invention, and are not intended to be limiting; although the present invention has been described in detail with reference to the foregoing embodiments, those skilled in the art will understand that The technical solutions described in the foregoing embodiments may be modified, or some or all of the technical features may be equivalently replaced; and the modifications or substitutions do not deviate from the technical solutions of the embodiments of the present invention. The scope is intended to be included within the scope of the claims and the description of the invention.

Claims (10)

  1. 一种基于众包的数据采集方法,其特征在于,包括:A crowdsourcing-based data collection method, comprising:
    获取任务发布者发起的数据采集需求;Obtain data collection requirements initiated by the task publisher;
    根据所述数据采集需求生成数据采集任务以及发布该数据采集任务;Generating a data collection task according to the data collection requirement and publishing the data collection task;
    接收用户端上传的任务领取者针对所述数据采集任务采集的数据;Receiving data collected by the task recipient uploaded by the user for the data collection task;
    按照预设的审核方法对所述数据的真实性进行审核;Review the authenticity of the data according to a preset audit method;
    对多个用户端上传的所有数据进行去噪、整合。Denoise and integrate all data uploaded by multiple clients.
  2. 根据权利要求1所述的基于众包的数据采集方法,其特征在于,所述按照预设的审核方法对所述数据的真实性进行审核,包括:The crowdsourcing-based data collection method according to claim 1, wherein the reviewing the authenticity of the data according to a preset auditing method comprises:
    获取用户端上传的任务领取者采集所述数据使用的移动终端的内置传感器产生的感应信息;Obtaining sensing information generated by a built-in sensor of the mobile terminal used by the task recipient uploaded by the client to collect the data;
    根据所述感应信息判断所述数据的真实性。Determining the authenticity of the data based on the sensing information.
  3. 根据权利要求2所述的基于众包的数据采集方法,其特征在于,所述数据采集需求为到指定区域采集数据的需求;The crowdsourcing-based data collection method according to claim 2, wherein the data collection requirement is a requirement for collecting data to a designated area;
    所述获取用户端上传的任务领取者采集所述数据使用的移动终端的内置传感器产生的感应信息,包括:The acquiring, by the task recipient uploaded by the user, the sensing information generated by the built-in sensor of the mobile terminal used by the data collection, includes:
    获取用户端上传的任务领取者采集所述数据使用的移动终端的内置GPS模块产生的位置信息;Obtaining location information generated by a built-in GPS module of the mobile terminal used by the task recipient uploaded by the client to collect the data;
    所述根据所述感应信息判断所述数据的真实性,包括:Determining the authenticity of the data according to the sensing information, including:
    将所述位置信息与所述指定区域对应的位置信息进行匹配;Matching the location information with location information corresponding to the designated area;
    在匹配失败时,判断所述数据不真实。When the matching fails, it is judged that the data is not true.
  4. 根据权利要求1所述的基于众包的数据采集方法,其特征在于,在所述接收用户端上传的任务领取者针对所述数据采集任务采集的数据的步骤后,还包括:The crowdsourcing-based data collection method according to claim 1, wherein after the step of receiving the data collected by the task recipient uploaded by the user for the data collection task, the method further includes:
    将所述数据与所述数据采集需求进行匹配,根据匹配结果确定所述数据采集任务的完成质量。Matching the data with the data collection requirement, and determining a quality of completion of the data collection task according to the matching result.
  5. 根据权利要求1所述的基于众包的数据采集方法,其特征在于,所述获取任务发布者发起的数据采集需求,包括: The crowdsourcing-based data collection method according to claim 1, wherein the acquiring data collection requirements initiated by the task publisher includes:
    向任务发布者提供数据采集任务动态表单;Provide a data collection task dynamic form to the task publisher;
    根据所述任务发布者在所述数据采集任务动态表单中输入的内容获得所述任务发布者数据采集需求。Obtaining the task publisher data collection requirement according to the content input by the task publisher in the data collection task dynamic form.
  6. 根据权利要求1所述的基于众包的数据采集方法,其特征在于,所述发布该数据采集任务,包括:The crowdsourcing-based data collection method according to claim 1, wherein the publishing the data collection task comprises:
    将所述数据采集任务发布至指定的网络众包公共平台,以供任务领取者领取;Distributing the data collection task to a designated network crowdsourcing public platform for the task recipient to receive;
    或者or
    将所述数据采集任务推送至符合指定条件的任务领取者的用户端,以供所述任务领取者领取。The data collection task is pushed to the user end of the task recipient who meets the specified condition for the task recipient to receive.
  7. 根据权利要求1所述的基于众包的数据采集方法,其特征在于,所述根据所述数据采集需求生成数据采集任务,包括:The crowdsourcing-based data collection method according to claim 1, wherein the generating a data collection task according to the data collection requirement comprises:
    确定所述数据采集任务的任务发布模式,所述任务发布模式包括任务分包方式、任务分配方式和基本任务定价;Determining a task publishing mode of the data collection task, where the task publishing mode includes a task subcontracting mode, a task allocation mode, and a basic task pricing;
    根据所述数据采集需求和所述任务发布模式,生成数据采集任务。Generating a data collection task according to the data collection requirement and the task publishing mode.
  8. 根据权利要求1所述的基于众包的数据采集方法,其特征在于,还包括:The crowdsourcing-based data collection method according to claim 1, further comprising:
    根据去噪、整合的结果对所述数据采集任务进行动态调整。The data collection task is dynamically adjusted according to the results of denoising and integration.
  9. 一种基于众包的数据采集装置,其特征在于,包括:A crowdsourcing-based data collection device, comprising:
    数据采集需求获取模块,用于获取任务发布者发起的数据采集需求;The data collection requirement acquisition module is configured to acquire a data collection requirement initiated by the task publisher;
    数据采集任务发布模块,用于根据所述数据采集需求生成数据采集任务以及发布该数据采集任务;a data collection task publishing module, configured to generate a data collection task according to the data collection requirement and issue the data collection task;
    采集数据接收模块,用于接收用户端上传的任务领取者针对所述数据采集任务采集的数据;The collection data receiving module is configured to receive data collected by the task recipient uploaded by the user end for the data collection task;
    采集数据审核模块,用于按照预设的审核方法对所述数据的真实性进行审核;Collecting data review module for reviewing the authenticity of the data according to a preset audit method;
    采集数据整合模块,用于对多个用户端上传的所有数据进行去噪、整合。The data integration module is used for denoising and integrating all data uploaded by multiple clients.
  10. 一种基于众包的数据采集服务器,其特征在于,包括:处理器、存储器、总线接口、总线、收发机和天线;A crowdsourcing-based data collection server, comprising: a processor, a memory, a bus interface, a bus, a transceiver, and an antenna;
    所述处理器、所述存储器和所述总线接口通过所述总线连接,所述收发机与所述总 线接口连接,所述天线与所述收发机连接;The processor, the memory, and the bus interface are connected by the bus, the transceiver and the total a line interface connection, the antenna being connected to the transceiver;
    其中,所述存储器用于存储程序;Wherein the memory is used to store a program;
    所述处理器,用于读取所述存储器中的程序,执行权利要求1至权利要求8任一项所述的基于众包的数据采集方法;The processor is configured to read a program in the memory, and execute the crowdsourcing-based data collection method according to any one of claims 1 to 8.
    所述收发机,用于在所述处理器的控制下接收和发送数据。 The transceiver is configured to receive and transmit data under the control of the processor.
PCT/CN2016/101274 2016-09-30 2016-09-30 Data collecting method and apparatus based on crowdsourcing, and server WO2018058609A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2016/101274 WO2018058609A1 (en) 2016-09-30 2016-09-30 Data collecting method and apparatus based on crowdsourcing, and server

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2016/101274 WO2018058609A1 (en) 2016-09-30 2016-09-30 Data collecting method and apparatus based on crowdsourcing, and server

Publications (1)

Publication Number Publication Date
WO2018058609A1 true WO2018058609A1 (en) 2018-04-05

Family

ID=61763633

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/101274 WO2018058609A1 (en) 2016-09-30 2016-09-30 Data collecting method and apparatus based on crowdsourcing, and server

Country Status (1)

Country Link
WO (1) WO2018058609A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112615744A (en) * 2020-12-18 2021-04-06 安徽中杰信息科技有限公司 Computer lab asset cloud safety management platform
CN112729319A (en) * 2020-12-17 2021-04-30 武汉中海庭数据技术有限公司 Automatic data acquisition and analysis system and method
CN116781550A (en) * 2023-08-23 2023-09-19 北京赢科天地电子有限公司 Method, system and equipment for realizing data acquisition

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103383263A (en) * 2013-05-24 2013-11-06 薛俊华 Interactive dynamic cloud navigation system
US20150051451A1 (en) * 2012-03-23 2015-02-19 National Institute Of Japan Science And Technology Agency Personal genome information environment providing device, personal genome information environment providing method, and computer program product
CN105808588A (en) * 2014-12-31 2016-07-27 北京瑞狮天智信息技术有限公司 Crowdsourcing model based distributed directional vertical information search system and method
CN106358289A (en) * 2016-09-30 2017-01-25 深圳市华傲数据技术有限公司 Data acquiring method and device based on crowdsourcing and server

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150051451A1 (en) * 2012-03-23 2015-02-19 National Institute Of Japan Science And Technology Agency Personal genome information environment providing device, personal genome information environment providing method, and computer program product
CN103383263A (en) * 2013-05-24 2013-11-06 薛俊华 Interactive dynamic cloud navigation system
CN105808588A (en) * 2014-12-31 2016-07-27 北京瑞狮天智信息技术有限公司 Crowdsourcing model based distributed directional vertical information search system and method
CN106358289A (en) * 2016-09-30 2017-01-25 深圳市华傲数据技术有限公司 Data acquiring method and device based on crowdsourcing and server

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112729319A (en) * 2020-12-17 2021-04-30 武汉中海庭数据技术有限公司 Automatic data acquisition and analysis system and method
CN112729319B (en) * 2020-12-17 2023-12-01 武汉中海庭数据技术有限公司 Automatic data acquisition and analysis system and method
CN112615744A (en) * 2020-12-18 2021-04-06 安徽中杰信息科技有限公司 Computer lab asset cloud safety management platform
CN116781550A (en) * 2023-08-23 2023-09-19 北京赢科天地电子有限公司 Method, system and equipment for realizing data acquisition

Similar Documents

Publication Publication Date Title
WO2018058610A1 (en) Data collecting method and apparatus based on crowdsourcing, and server
US11709819B2 (en) Validating test results using a blockchain network
CN110175913B (en) Data processing system, method, computing device and storage medium based on block chain
US20180331835A1 (en) Trusted agent blockchain oracle
US20180322303A1 (en) Systems and methods for digital content delivery
JP6975250B2 (en) Methods and equipment for providing transaction data to blockchain systems for processing
US20180095857A1 (en) Devices and Method for Detecting and Addressing Anomalies in Data Retrieval Requests
CN112804218B (en) Block chain-based data processing method, device, equipment and storage medium
CN107844698B (en) Method, device and equipment for setting authority of financial APP and storage medium
CN106358289A (en) Data acquiring method and device based on crowdsourcing and server
WO2018058609A1 (en) Data collecting method and apparatus based on crowdsourcing, and server
US20190089549A1 (en) Information processing system and charge calculation apparatus
WO2018126344A1 (en) Data processing method and related device
CN110020945B (en) Data reading method and system based on multiple block chain networks
CN106803815B (en) Flow control method and device
CN104182900A (en) Business data processing method, device and system
CN101996230B (en) Information processing apparatus, reference value determination method, and program
EP3942834B1 (en) System and method for proof of view via blockchain
US20150142700A1 (en) Dynamic risk evaluation for proposed information technology projects
DE102015206993A1 (en) System and method for performing synchronized trading activities on multiple exchanges
WO2017020716A1 (en) Method and device for data access control
CN114297277A (en) Carbon emission activity data acquisition method and device, electronic equipment and storage medium
CN111198763B (en) Method for detecting reuse of resources, terminal and computer-readable storage medium
CN112053058A (en) Index model generation method and device
CN105787791B (en) Service request processing method and device

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16917359

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 16917359

Country of ref document: EP

Kind code of ref document: A1